diff --git "a/sf_log.txt" "b/sf_log.txt"
new file mode 100644
--- /dev/null
+++ "b/sf_log.txt"
@@ -0,0 +1,8265 @@
+[2023-03-11 15:13:50,717][41256] Saving configuration to /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/config.json...
+[2023-03-11 15:13:50,730][41256] Rollout worker 0 uses device cpu
+[2023-03-11 15:13:50,731][41256] Rollout worker 1 uses device cpu
+[2023-03-11 15:13:50,731][41256] Rollout worker 2 uses device cpu
+[2023-03-11 15:13:50,731][41256] Rollout worker 3 uses device cpu
+[2023-03-11 15:13:50,731][41256] Rollout worker 4 uses device cpu
+[2023-03-11 15:13:50,731][41256] Rollout worker 5 uses device cpu
+[2023-03-11 15:13:50,732][41256] Rollout worker 6 uses device cpu
+[2023-03-11 15:13:50,732][41256] Rollout worker 7 uses device cpu
+[2023-03-11 15:13:50,732][41256] In synchronous mode, we only accumulate one batch. Setting num_batches_to_accumulate to 1
+[2023-03-11 15:13:50,742][41256] InferenceWorker_p0-w0: min num requests: 2
+[2023-03-11 15:13:50,758][41256] Starting all processes...
+[2023-03-11 15:13:50,758][41256] Starting process learner_proc0
+[2023-03-11 15:13:50,808][41256] Starting all processes...
+[2023-03-11 15:13:50,839][41256] Starting process inference_proc0-0
+[2023-03-11 15:13:50,846][41256] Starting process rollout_proc0
+[2023-03-11 15:13:50,847][41256] Starting process rollout_proc1
+[2023-03-11 15:13:50,847][41256] Starting process rollout_proc2
+[2023-03-11 15:13:50,847][41256] Starting process rollout_proc3
+[2023-03-11 15:13:50,847][41256] Starting process rollout_proc4
+[2023-03-11 15:13:50,847][41256] Starting process rollout_proc5
+[2023-03-11 15:13:50,848][41256] Starting process rollout_proc6
+[2023-03-11 15:13:50,848][41256] Starting process rollout_proc7
+[2023-03-11 15:13:52,236][41500] Starting seed is not provided
+[2023-03-11 15:13:52,237][41500] Initializing actor-critic model on device cpu
+[2023-03-11 15:13:52,237][41500] RunningMeanStd input shape: (39,)
+[2023-03-11 15:13:52,237][41500] RunningMeanStd input shape: (1,)
+[2023-03-11 15:13:52,296][41500] Created Actor Critic model with architecture:
+[2023-03-11 15:13:52,296][41500] ActorCriticSharedWeights(
+  (obs_normalizer): ObservationNormalizer(
+    (running_mean_std): RunningMeanStdDictInPlace(
+      (running_mean_std): ModuleDict(
+        (obs): RunningMeanStdInPlace()
+      )
+    )
+  )
+  (returns_normalizer): RecursiveScriptModule(original_name=RunningMeanStdInPlace)
+  (encoder): MultiInputEncoder(
+    (encoders): ModuleDict(
+      (obs): MlpEncoder(
+        (mlp_head): RecursiveScriptModule(
+          original_name=Sequential
+          (0): RecursiveScriptModule(original_name=Linear)
+          (1): RecursiveScriptModule(original_name=Tanh)
+          (2): RecursiveScriptModule(original_name=Linear)
+          (3): RecursiveScriptModule(original_name=Tanh)
+        )
+      )
+    )
+  )
+  (core): ModelCoreIdentity()
+  (decoder): MlpDecoder(
+    (mlp): Identity()
+  )
+  (critic_linear): Linear(in_features=64, out_features=1, bias=True)
+  (action_parameterization): ActionParameterizationContinuousNonAdaptiveStddev(
+    (distribution_linear): Linear(in_features=64, out_features=4, bias=True)
+  )
+)
+[2023-03-11 15:13:52,356][41546] Worker 2 uses CPU cores [8, 9, 10, 11]
+[2023-03-11 15:13:52,428][41583] Worker 5 uses CPU cores [20, 21, 22, 23]
+[2023-03-11 15:13:52,458][41545] Worker 1 uses CPU cores [4, 5, 6, 7]
+[2023-03-11 15:13:52,571][41548] Worker 4 uses CPU cores [16, 17, 18, 19]
+[2023-03-11 15:13:52,592][41550] Worker 6 uses CPU cores [24, 25, 26, 27]
+[2023-03-11 15:13:52,598][41500] Using optimizer
+[2023-03-11 15:13:52,599][41500] No checkpoints found
+[2023-03-11 15:13:52,599][41500] Did not load from checkpoint, starting from scratch!
+[2023-03-11 15:13:52,599][41500] Initialized policy 0 weights for model version 0
+[2023-03-11 15:13:52,600][41500] LearnerWorker_p0 finished initialization!
+[2023-03-11 15:13:52,744][41544] RunningMeanStd input shape: (39,)
+[2023-03-11 15:13:52,744][41544] RunningMeanStd input shape: (1,)
+[2023-03-11 15:13:52,776][41549] Worker 3 uses CPU cores [12, 13, 14, 15]
+[2023-03-11 15:13:52,800][41256] Inference worker 0-0 is ready!
+[2023-03-11 15:13:52,801][41256] All inference workers are ready! Signal rollout workers to start!
+[2023-03-11 15:13:52,877][41547] Worker 0 uses CPU cores [0, 1, 2, 3]
+[2023-03-11 15:13:52,969][41572] Worker 7 uses CPU cores [28, 29, 30, 31]
+[2023-03-11 15:13:53,386][41256] Fps is (10 sec: nan, 60 sec: nan, 300 sec: nan). Total num frames: 0. Throughput: 0: nan. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
+[2023-03-11 15:13:57,504][41546] Decorrelating experience for 0 frames...
+[2023-03-11 15:13:57,522][41546] Decorrelating experience for 64 frames...
+[2023-03-11 15:13:57,523][41545] Decorrelating experience for 0 frames...
+[2023-03-11 15:13:57,534][41583] Decorrelating experience for 0 frames...
+[2023-03-11 15:13:57,540][41550] Decorrelating experience for 0 frames...
+[2023-03-11 15:13:57,540][41548] Decorrelating experience for 0 frames...
+[2023-03-11 15:13:57,540][41545] Decorrelating experience for 64 frames...
+[2023-03-11 15:13:57,552][41583] Decorrelating experience for 64 frames...
+[2023-03-11 15:13:57,557][41550] Decorrelating experience for 64 frames...
+[2023-03-11 15:13:57,557][41548] Decorrelating experience for 64 frames...
+[2023-03-11 15:13:57,573][41546] Decorrelating experience for 128 frames...
+[2023-03-11 15:13:57,582][41549] Decorrelating experience for 0 frames...
+[2023-03-11 15:13:57,591][41545] Decorrelating experience for 128 frames...
+[2023-03-11 15:13:57,599][41549] Decorrelating experience for 64 frames...
+[2023-03-11 15:13:57,603][41583] Decorrelating experience for 128 frames...
+[2023-03-11 15:13:57,608][41550] Decorrelating experience for 128 frames...
+[2023-03-11 15:13:57,608][41548] Decorrelating experience for 128 frames...
+[2023-03-11 15:13:57,646][41547] Decorrelating experience for 0 frames...
+[2023-03-11 15:13:57,649][41549] Decorrelating experience for 128 frames...
+[2023-03-11 15:13:57,656][41546] Decorrelating experience for 192 frames...
+[2023-03-11 15:13:57,663][41547] Decorrelating experience for 64 frames...
+[2023-03-11 15:13:57,674][41545] Decorrelating experience for 192 frames...
+[2023-03-11 15:13:57,686][41583] Decorrelating experience for 192 frames...
+[2023-03-11 15:13:57,692][41550] Decorrelating experience for 192 frames...
+[2023-03-11 15:13:57,692][41548] Decorrelating experience for 192 frames...
+[2023-03-11 15:13:57,714][41547] Decorrelating experience for 128 frames...
+[2023-03-11 15:13:57,733][41549] Decorrelating experience for 192 frames...
+[2023-03-11 15:13:57,738][41572] Decorrelating experience for 0 frames...
+[2023-03-11 15:13:57,755][41572] Decorrelating experience for 64 frames...
+[2023-03-11 15:13:57,798][41547] Decorrelating experience for 192 frames...
+[2023-03-11 15:13:57,806][41572] Decorrelating experience for 128 frames...
+[2023-03-11 15:13:57,887][41572] Decorrelating experience for 192 frames...
+[2023-03-11 15:13:58,385][41256] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
+[2023-03-11 15:14:02,404][41546] Decorrelating experience for 256 frames...
+[2023-03-11 15:14:02,420][41545] Decorrelating experience for 256 frames...
+[2023-03-11 15:14:02,461][41583] Decorrelating experience for 256 frames...
+[2023-03-11 15:14:02,463][41548] Decorrelating experience for 256 frames...
+[2023-03-11 15:14:02,465][41550] Decorrelating experience for 256 frames...
+[2023-03-11 15:14:02,476][41549] Decorrelating experience for 256 frames...
+[2023-03-11 15:14:02,553][41546] Decorrelating experience for 320 frames...
+[2023-03-11 15:14:02,566][41547] Decorrelating experience for 256 frames...
+[2023-03-11 15:14:02,568][41545] Decorrelating experience for 320 frames...
+[2023-03-11 15:14:02,608][41583] Decorrelating experience for 320 frames...
+[2023-03-11 15:14:02,610][41548] Decorrelating experience for 320 frames...
+[2023-03-11 15:14:02,613][41550] Decorrelating experience for 320 frames...
+[2023-03-11 15:14:02,623][41549] Decorrelating experience for 320 frames...
+[2023-03-11 15:14:02,642][41572] Decorrelating experience for 256 frames...
+[2023-03-11 15:14:02,713][41547] Decorrelating experience for 320 frames...
+[2023-03-11 15:14:02,731][41546] Decorrelating experience for 384 frames...
+[2023-03-11 15:14:02,746][41545] Decorrelating experience for 384 frames...
+[2023-03-11 15:14:02,788][41572] Decorrelating experience for 320 frames...
+[2023-03-11 15:14:02,789][41548] Decorrelating experience for 384 frames...
+[2023-03-11 15:14:02,793][41583] Decorrelating experience for 384 frames...
+[2023-03-11 15:14:02,795][41550] Decorrelating experience for 384 frames...
+[2023-03-11 15:14:02,806][41549] Decorrelating experience for 384 frames...
+[2023-03-11 15:14:02,893][41547] Decorrelating experience for 384 frames...
+[2023-03-11 15:14:02,942][41546] Decorrelating experience for 448 frames...
+[2023-03-11 15:14:02,956][41545] Decorrelating experience for 448 frames...
+[2023-03-11 15:14:02,968][41572] Decorrelating experience for 384 frames...
+[2023-03-11 15:14:03,004][41583] Decorrelating experience for 448 frames...
+[2023-03-11 15:14:03,005][41548] Decorrelating experience for 448 frames...
+[2023-03-11 15:14:03,007][41550] Decorrelating experience for 448 frames...
+[2023-03-11 15:14:03,018][41549] Decorrelating experience for 448 frames...
+[2023-03-11 15:14:03,107][41547] Decorrelating experience for 448 frames...
+[2023-03-11 15:14:03,179][41572] Decorrelating experience for 448 frames...
+[2023-03-11 15:14:03,385][41256] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
+[2023-03-11 15:14:03,387][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000000000_0.pth...
+[2023-03-11 15:14:07,820][41544] Updated weights for policy 0, policy_version 80 (0.0005)
+[2023-03-11 15:14:08,385][41256] Fps is (10 sec: 4505.6, 60 sec: 3003.7, 300 sec: 3003.7). Total num frames: 45056. Throughput: 0: 2734.9. Samples: 41024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
+[2023-03-11 15:14:08,386][41256] Avg episode reward: [(0, '7.385')]
+[2023-03-11 15:14:10,738][41256] Heartbeat connected on Batcher_0
+[2023-03-11 15:14:10,740][41256] Heartbeat connected on LearnerWorker_p0
+[2023-03-11 15:14:10,743][41256] Heartbeat connected on InferenceWorker_p0-w0
+[2023-03-11 15:14:10,748][41256] Heartbeat connected on RolloutWorker_w0
+[2023-03-11 15:14:10,751][41256] Heartbeat connected on RolloutWorker_w1
+[2023-03-11 15:14:10,751][41256] Heartbeat connected on RolloutWorker_w2
+[2023-03-11 15:14:10,754][41256] Heartbeat connected on RolloutWorker_w4
+[2023-03-11 15:14:10,756][41256] Heartbeat connected on RolloutWorker_w5
+[2023-03-11 15:14:10,770][41256] Heartbeat connected on RolloutWorker_w3
+[2023-03-11 15:14:10,775][41256] Heartbeat connected on RolloutWorker_w7
+[2023-03-11 15:14:10,777][41256] Heartbeat connected on RolloutWorker_w6
+[2023-03-11 15:14:11,844][41544] Updated weights for policy 0, policy_version 160 (0.0005)
+[2023-03-11 15:14:13,385][41256] Fps is (10 sec: 9420.8, 60 sec: 4710.4, 300 sec: 4710.4). Total num frames: 94208. Throughput: 0: 3591.2. Samples: 71824. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0)
+[2023-03-11 15:14:13,386][41256] Avg episode reward: [(0, '20.901')]
+[2023-03-11 15:14:15,788][41544] Updated weights for policy 0, policy_version 240 (0.0005)
+[2023-03-11 15:14:18,386][41256] Fps is (10 sec: 10239.9, 60 sec: 5898.2, 300 sec: 5898.2). Total num frames: 147456. Throughput: 0: 5362.9. Samples: 134072. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0)
+[2023-03-11 15:14:18,386][41256] Avg episode reward: [(0, '22.021')]
+[2023-03-11 15:14:18,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000000288_147456.pth...
+[2023-03-11 15:14:18,393][41500] Saving new best policy, reward=22.021!
+[2023-03-11 15:14:19,829][41544] Updated weights for policy 0, policy_version 320 (0.0005)
+[2023-03-11 15:14:23,385][41256] Fps is (10 sec: 10240.1, 60 sec: 6553.6, 300 sec: 6553.6). Total num frames: 196608. Throughput: 0: 6504.0. Samples: 195120. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0)
+[2023-03-11 15:14:23,386][41256] Avg episode reward: [(0, '22.491')]
+[2023-03-11 15:14:23,430][41500] Saving new best policy, reward=22.491!
+[2023-03-11 15:14:23,831][41544] Updated weights for policy 0, policy_version 400 (0.0005)
+[2023-03-11 15:14:27,784][41544] Updated weights for policy 0, policy_version 480 (0.0005)
+[2023-03-11 15:14:28,385][41256] Fps is (10 sec: 10240.1, 60 sec: 7138.8, 300 sec: 7138.8). Total num frames: 249856. Throughput: 0: 6438.4. Samples: 225344. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
+[2023-03-11 15:14:28,386][41256] Avg episode reward: [(0, '25.498')]
+[2023-03-11 15:14:28,386][41500] Saving new best policy, reward=25.498!
+[2023-03-11 15:14:31,764][41544] Updated weights for policy 0, policy_version 560 (0.0005)
+[2023-03-11 15:14:33,385][41256] Fps is (10 sec: 10240.0, 60 sec: 7475.2, 300 sec: 7475.2). Total num frames: 299008. Throughput: 0: 7181.7. Samples: 287268. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0)
+[2023-03-11 15:14:33,386][41256] Avg episode reward: [(0, '25.919')]
+[2023-03-11 15:14:33,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000000592_303104.pth...
+[2023-03-11 15:14:33,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000000000_0.pth
+[2023-03-11 15:14:33,402][41500] Saving new best policy, reward=25.919!
+[2023-03-11 15:14:35,792][41544] Updated weights for policy 0, policy_version 640 (0.0005)
+[2023-03-11 15:14:38,385][41256] Fps is (10 sec: 10240.0, 60 sec: 7827.9, 300 sec: 7827.9). Total num frames: 352256. Throughput: 0: 7738.3. Samples: 348224. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0)
+[2023-03-11 15:14:38,386][41256] Avg episode reward: [(0, '27.770')]
+[2023-03-11 15:14:38,387][41500] Saving new best policy, reward=27.770!
+[2023-03-11 15:14:39,856][41544] Updated weights for policy 0, policy_version 720 (0.0005)
+[2023-03-11 15:14:43,385][41256] Fps is (10 sec: 10649.6, 60 sec: 8110.1, 300 sec: 8110.1). Total num frames: 405504. Throughput: 0: 8418.9. Samples: 378852. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0)
+[2023-03-11 15:14:43,386][41256] Avg episode reward: [(0, '28.544')]
+[2023-03-11 15:14:43,387][41500] Saving new best policy, reward=28.544!
+[2023-03-11 15:14:43,665][41544] Updated weights for policy 0, policy_version 800 (0.0005)
+[2023-03-11 15:14:47,460][41544] Updated weights for policy 0, policy_version 880 (0.0005)
+[2023-03-11 15:14:48,386][41256] Fps is (10 sec: 10649.5, 60 sec: 8340.9, 300 sec: 8340.9). Total num frames: 458752. Throughput: 0: 9865.0. Samples: 443928. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
+[2023-03-11 15:14:48,386][41256] Avg episode reward: [(0, '27.010')]
+[2023-03-11 15:14:48,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000000896_458752.pth...
+[2023-03-11 15:14:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000000288_147456.pth
+[2023-03-11 15:14:51,393][41544] Updated weights for policy 0, policy_version 960 (0.0006)
+[2023-03-11 15:14:53,385][41256] Fps is (10 sec: 10649.7, 60 sec: 8533.4, 300 sec: 8533.4). Total num frames: 512000. Throughput: 0: 10353.8. Samples: 506944. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0)
+[2023-03-11 15:14:53,386][41256] Avg episode reward: [(0, '29.608')]
+[2023-03-11 15:14:53,386][41500] Saving new best policy, reward=29.608!
+[2023-03-11 15:14:55,233][41544] Updated weights for policy 0, policy_version 1040 (0.0005)
+[2023-03-11 15:14:58,385][41256] Fps is (10 sec: 10649.7, 60 sec: 9420.8, 300 sec: 8696.1). Total num frames: 565248. Throughput: 0: 10384.1. Samples: 539108. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0)
+[2023-03-11 15:14:58,386][41256] Avg episode reward: [(0, '29.445')]
+[2023-03-11 15:14:59,113][41544] Updated weights for policy 0, policy_version 1120 (0.0005)
+[2023-03-11 15:15:03,031][41544] Updated weights for policy 0, policy_version 1200 (0.0005)
+[2023-03-11 15:15:03,386][41256] Fps is (10 sec: 10239.8, 60 sec: 10240.0, 300 sec: 8777.1). Total num frames: 614400. Throughput: 0: 10402.3. Samples: 602176. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0)
+[2023-03-11 15:15:03,386][41256] Avg episode reward: [(0, '29.055')]
+[2023-03-11 15:15:03,417][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000001208_618496.pth...
+[2023-03-11 15:15:03,418][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000000592_303104.pth
+[2023-03-11 15:15:06,988][41544] Updated weights for policy 0, policy_version 1280 (0.0005)
+[2023-03-11 15:15:08,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 8902.0). Total num frames: 667648. Throughput: 0: 10415.7. Samples: 663828. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
+[2023-03-11 15:15:08,386][41256] Avg episode reward: [(0, '30.090')]
+[2023-03-11 15:15:08,386][41500] Saving new best policy, reward=30.090!
+[2023-03-11 15:15:10,896][41544] Updated weights for policy 0, policy_version 1360 (0.0005)
+[2023-03-11 15:15:13,385][41256] Fps is (10 sec: 10649.7, 60 sec: 10444.8, 300 sec: 9011.2). Total num frames: 720896. Throughput: 0: 10464.1. Samples: 696228. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
+[2023-03-11 15:15:13,386][41256] Avg episode reward: [(0, '29.263')]
+[2023-03-11 15:15:14,915][41544] Updated weights for policy 0, policy_version 1440 (0.0005)
+[2023-03-11 15:15:18,386][41256] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 9059.4). Total num frames: 770048. Throughput: 0: 10446.9. Samples: 757380. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0)
+[2023-03-11 15:15:18,386][41256] Avg episode reward: [(0, '29.388')]
+[2023-03-11 15:15:18,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000001504_770048.pth...
+[2023-03-11 15:15:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000000896_458752.pth
+[2023-03-11 15:15:18,923][41544] Updated weights for policy 0, policy_version 1520 (0.0005)
+[2023-03-11 15:15:22,896][41544] Updated weights for policy 0, policy_version 1600 (0.0005)
+[2023-03-11 15:15:23,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10444.8, 300 sec: 9147.7). Total num frames: 823296. Throughput: 0: 10460.5. Samples: 818948. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0)
+[2023-03-11 15:15:23,386][41256] Avg episode reward: [(0, '30.028')]
+[2023-03-11 15:15:27,027][41544] Updated weights for policy 0, policy_version 1680 (0.0005)
+[2023-03-11 15:15:28,385][41256] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 9183.7). Total num frames: 872448. Throughput: 0: 10427.7. Samples: 848096. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0)
+[2023-03-11 15:15:28,386][41256] Avg episode reward: [(0, '29.960')]
+[2023-03-11 15:15:30,956][41544] Updated weights for policy 0, policy_version 1760 (0.0005)
+[2023-03-11 15:15:33,386][41256] Fps is (10 sec: 10239.9, 60 sec: 10444.8, 300 sec: 9257.0). Total num frames: 925696. Throughput: 0: 10356.5. Samples: 909972. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
+[2023-03-11 15:15:33,386][41256] Avg episode reward: [(0, '30.106')]
+[2023-03-11 15:15:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000001808_925696.pth...
+[2023-03-11 15:15:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000001208_618496.pth
+[2023-03-11 15:15:33,392][41500] Saving new best policy, reward=30.106!
+[2023-03-11 15:15:34,968][41544] Updated weights for policy 0, policy_version 1840 (0.0005)
+[2023-03-11 15:15:38,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 9284.3). Total num frames: 974848. Throughput: 0: 10308.4. Samples: 970824. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
+[2023-03-11 15:15:38,386][41256] Avg episode reward: [(0, '28.989')]
+[2023-03-11 15:15:39,021][41544] Updated weights for policy 0, policy_version 1920 (0.0005)
+[2023-03-11 15:15:42,984][41544] Updated weights for policy 0, policy_version 2000 (0.0005)
+[2023-03-11 15:15:43,385][41256] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 9346.3). Total num frames: 1028096. Throughput: 0: 10280.9. Samples: 1001748. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
+[2023-03-11 15:15:43,386][41256] Avg episode reward: [(0, '30.057')]
+[2023-03-11 15:15:46,970][41544] Updated weights for policy 0, policy_version 2080 (0.0005)
+[2023-03-11 15:15:48,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 9367.4). Total num frames: 1077248. Throughput: 0: 10253.3. Samples: 1063572. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
+[2023-03-11 15:15:48,386][41256] Avg episode reward: [(0, '31.080')]
+[2023-03-11 15:15:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000002104_1077248.pth...
+[2023-03-11 15:15:48,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000001504_770048.pth
+[2023-03-11 15:15:48,392][41500] Saving new best policy, reward=31.080!
+[2023-03-11 15:15:50,653][41544] Updated weights for policy 0, policy_version 2160 (0.0004)
+[2023-03-11 15:15:53,385][41256] Fps is (10 sec: 10649.7, 60 sec: 10376.5, 300 sec: 9454.9). Total num frames: 1134592. Throughput: 0: 10370.4. Samples: 1130496. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
+[2023-03-11 15:15:53,386][41256] Avg episode reward: [(0, '31.675')]
+[2023-03-11 15:15:53,386][41500] Saving new best policy, reward=31.675!
+[2023-03-11 15:15:54,513][41544] Updated weights for policy 0, policy_version 2240 (0.0004)
+[2023-03-11 15:15:58,385][41256] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 9470.0). Total num frames: 1183744. Throughput: 0: 10308.0. Samples: 1160088. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0)
+[2023-03-11 15:15:58,386][41256] Avg episode reward: [(0, '30.307')]
+[2023-03-11 15:15:58,525][41544] Updated weights for policy 0, policy_version 2320 (0.0005)
+[2023-03-11 15:16:02,173][41544] Updated weights for policy 0, policy_version 2400 (0.0004)
+[2023-03-11 15:16:03,385][41256] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 9546.8). Total num frames: 1241088. Throughput: 0: 10389.5. Samples: 1224908. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
+[2023-03-11 15:16:03,386][41256] Avg episode reward: [(0, '30.980')]
+[2023-03-11 15:16:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000002424_1241088.pth...
+[2023-03-11 15:16:03,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000001808_925696.pth
+[2023-03-11 15:16:06,005][41544] Updated weights for policy 0, policy_version 2480 (0.0005)
+[2023-03-11 15:16:08,385][41256] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 9557.3). Total num frames: 1290240. Throughput: 0: 10413.4. Samples: 1287552. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0)
+[2023-03-11 15:16:08,386][41256] Avg episode reward: [(0, '32.312')]
+[2023-03-11 15:16:08,387][41500] Saving new best policy, reward=32.312!
+[2023-03-11 15:16:10,108][41544] Updated weights for policy 0, policy_version 2560 (0.0005)
+[2023-03-11 15:16:13,385][41256] Fps is (10 sec: 10240.1, 60 sec: 10376.6, 300 sec: 9596.4). Total num frames: 1343488. Throughput: 0: 10457.5. Samples: 1318684. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
+[2023-03-11 15:16:13,386][41256] Avg episode reward: [(0, '31.169')]
+[2023-03-11 15:16:14,153][41544] Updated weights for policy 0, policy_version 2640 (0.0005)
+[2023-03-11 15:16:18,247][41544] Updated weights for policy 0, policy_version 2720 (0.0005)
+[2023-03-11 15:16:18,386][41256] Fps is (10 sec: 10239.9, 60 sec: 10376.5, 300 sec: 9604.4). Total num frames: 1392640. Throughput: 0: 10407.9. Samples: 1378328. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
+[2023-03-11 15:16:18,386][41256] Avg episode reward: [(0, '31.032')]
+[2023-03-11 15:16:18,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000002720_1392640.pth...
+[2023-03-11 15:16:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000002104_1077248.pth
+[2023-03-11 15:16:22,400][41544] Updated weights for policy 0, policy_version 2800 (0.0005)
+[2023-03-11 15:16:23,385][41256] Fps is (10 sec: 9830.3, 60 sec: 10308.3, 300 sec: 9611.9). Total num frames: 1441792. Throughput: 0: 10376.3. Samples: 1437760. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
+[2023-03-11 15:16:23,386][41256] Avg episode reward: [(0, '31.810')]
+[2023-03-11 15:16:26,524][41544] Updated weights for policy 0, policy_version 2880 (0.0005)
+[2023-03-11 15:16:28,385][41256] Fps is (10 sec: 9830.4, 60 sec: 10308.3, 300 sec: 9619.0). Total num frames: 1490944. Throughput: 0: 10356.1. Samples: 1467772. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
+[2023-03-11 15:16:28,386][41256] Avg episode reward: [(0, '31.596')]
+[2023-03-11 15:16:30,639][41544] Updated weights for policy 0, policy_version 2960 (0.0005)
+[2023-03-11 15:16:33,386][41256] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 9651.2). Total num frames: 1544192. Throughput: 0: 10317.8. Samples: 1527872. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
+[2023-03-11 15:16:33,386][41256] Avg episode reward: [(0, '31.299')]
+[2023-03-11 15:16:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000003016_1544192.pth...
+[2023-03-11 15:16:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000002424_1241088.pth
+[2023-03-11 15:16:34,507][41544] Updated weights for policy 0, policy_version 3040 (0.0005)
+[2023-03-11 15:16:38,197][41544] Updated weights for policy 0, policy_version 3120 (0.0004)
+[2023-03-11 15:16:38,385][41256] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 9681.5). Total num frames: 1597440. Throughput: 0: 10286.9. Samples: 1593408. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
+[2023-03-11 15:16:38,386][41256] Avg episode reward: [(0, '32.264')]
+[2023-03-11 15:16:42,010][41544] Updated weights for policy 0, policy_version 3200 (0.0004)
+[2023-03-11 15:16:43,385][41256] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 9709.9). Total num frames: 1650688. Throughput: 0: 10356.3. Samples: 1626120. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
+[2023-03-11 15:16:43,386][41256] Avg episode reward: [(0, '32.540')]
+[2023-03-11 15:16:43,387][41500] Saving new best policy, reward=32.540!
+[2023-03-11 15:16:45,827][41544] Updated weights for policy 0, policy_version 3280 (0.0004)
+[2023-03-11 15:16:48,386][41256] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 9736.8). Total num frames: 1703936. Throughput: 0: 10342.6. Samples: 1690324. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0)
+[2023-03-11 15:16:48,386][41256] Avg episode reward: [(0, '31.482')]
+[2023-03-11 15:16:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000003328_1703936.pth...
+[2023-03-11 15:16:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000002720_1392640.pth
+[2023-03-11 15:16:49,819][41544] Updated weights for policy 0, policy_version 3360 (0.0005)
+[2023-03-11 15:16:53,385][41256] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 9739.4). Total num frames: 1753088. Throughput: 0: 10297.3. Samples: 1750932. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
+[2023-03-11 15:16:53,386][41256] Avg episode reward: [(0, '32.085')]
+[2023-03-11 15:16:53,876][41544] Updated weights for policy 0, policy_version 3440 (0.0005)
+[2023-03-11 15:16:57,780][41544] Updated weights for policy 0, policy_version 3520 (0.0004)
+[2023-03-11 15:16:58,385][41256] Fps is (10 sec: 10240.1, 60 sec: 10376.5, 300 sec: 9764.0). Total num frames: 1806336. Throughput: 0: 10292.0. Samples: 1781824. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
+[2023-03-11 15:16:58,386][41256] Avg episode reward: [(0, '32.721')]
+[2023-03-11 15:16:58,386][41500] Saving new best policy, reward=32.721!
+[2023-03-11 15:17:01,599][41544] Updated weights for policy 0, policy_version 3600 (0.0004)
+[2023-03-11 15:17:03,386][41256] Fps is (10 sec: 10649.5, 60 sec: 10308.3, 300 sec: 9787.3). Total num frames: 1859584. Throughput: 0: 10386.0. Samples: 1845696. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
+[2023-03-11 15:17:03,386][41256] Avg episode reward: [(0, '32.885')]
+[2023-03-11 15:17:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000003632_1859584.pth...
+[2023-03-11 15:17:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000003016_1544192.pth
+[2023-03-11 15:17:03,392][41500] Saving new best policy, reward=32.885!
+[2023-03-11 15:17:05,433][41544] Updated weights for policy 0, policy_version 3680 (0.0005)
+[2023-03-11 15:17:08,385][41256] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 9809.4). Total num frames: 1912832. Throughput: 0: 10497.5. Samples: 1910148. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0)
+[2023-03-11 15:17:08,386][41256] Avg episode reward: [(0, '32.049')]
+[2023-03-11 15:17:09,226][41544] Updated weights for policy 0, policy_version 3760 (0.0004)
+[2023-03-11 15:17:13,145][41544] Updated weights for policy 0, policy_version 3840 (0.0004)
+[2023-03-11 15:17:13,385][41256] Fps is (10 sec: 10649.7, 60 sec: 10376.5, 300 sec: 9830.4). Total num frames: 1966080. Throughput: 0: 10530.2. Samples: 1941632. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0)
+[2023-03-11 15:17:13,386][41256] Avg episode reward: [(0, '31.496')]
+[2023-03-11 15:17:17,009][41544] Updated weights for policy 0, policy_version 3920 (0.0004)
+[2023-03-11 15:17:18,385][41256] Fps is (10 sec: 10649.5, 60 sec: 10444.8, 300 sec: 9850.4). Total num frames: 2019328. Throughput: 0: 10606.3. Samples: 2005156. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0)
+[2023-03-11 15:17:18,386][41256] Avg episode reward: [(0, '30.235')]
+[2023-03-11 15:17:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000003944_2019328.pth...
+[2023-03-11 15:17:18,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000003328_1703936.pth
+[2023-03-11 15:17:20,770][41544] Updated weights for policy 0, policy_version 4000 (0.0005)
+[2023-03-11 15:17:23,385][41256] Fps is (10 sec: 10649.6, 60 sec: 10513.1, 300 sec: 9869.4). Total num frames: 2072576. Throughput: 0: 10584.5. Samples: 2069712. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0)
+[2023-03-11 15:17:23,386][41256] Avg episode reward: [(0, '34.038')]
+[2023-03-11 15:17:23,386][41500] Saving new best policy, reward=34.038!
+[2023-03-11 15:17:24,628][41544] Updated weights for policy 0, policy_version 4080 (0.0005)
+[2023-03-11 15:17:28,385][41256] Fps is (10 sec: 10649.7, 60 sec: 10581.4, 300 sec: 9887.6). Total num frames: 2125824. Throughput: 0: 10567.0. Samples: 2101632. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0)
+[2023-03-11 15:17:28,386][41256] Avg episode reward: [(0, '34.247')]
+[2023-03-11 15:17:28,391][41500] Saving new best policy, reward=34.247!
+[2023-03-11 15:17:28,391][41544] Updated weights for policy 0, policy_version 4160 (0.0003)
+[2023-03-11 15:17:32,271][41544] Updated weights for policy 0, policy_version 4240 (0.0005)
+[2023-03-11 15:17:33,385][41256] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 9904.9). Total num frames: 2179072. Throughput: 0: 10587.1. Samples: 2166740. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0)
+[2023-03-11 15:17:33,386][41256] Avg episode reward: [(0, '34.104')]
+[2023-03-11 15:17:33,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000004264_2183168.pth...
+[2023-03-11 15:17:33,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000003632_1859584.pth +[2023-03-11 15:17:36,133][41544] Updated weights for policy 0, policy_version 4320 (0.0004) +[2023-03-11 15:17:38,385][41256] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 9921.4). Total num frames: 2232320. Throughput: 0: 10642.0. Samples: 2229820. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 15:17:38,386][41256] Avg episode reward: [(0, '33.546')] +[2023-03-11 15:17:40,024][41544] Updated weights for policy 0, policy_version 4400 (0.0005) +[2023-03-11 15:17:43,385][41256] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 9937.3). Total num frames: 2285568. Throughput: 0: 10658.3. Samples: 2261448. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:17:43,386][41256] Avg episode reward: [(0, '33.590')] +[2023-03-11 15:17:43,818][41544] Updated weights for policy 0, policy_version 4480 (0.0004) +[2023-03-11 15:17:47,574][41544] Updated weights for policy 0, policy_version 4560 (0.0004) +[2023-03-11 15:17:48,386][41256] Fps is (10 sec: 11059.1, 60 sec: 10649.6, 300 sec: 9969.8). Total num frames: 2342912. Throughput: 0: 10686.8. Samples: 2326600. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:17:48,386][41256] Avg episode reward: [(0, '33.594')] +[2023-03-11 15:17:48,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000004576_2342912.pth... +[2023-03-11 15:17:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000003944_2019328.pth +[2023-03-11 15:17:51,428][41544] Updated weights for policy 0, policy_version 4640 (0.0004) +[2023-03-11 15:17:53,385][41256] Fps is (10 sec: 11059.1, 60 sec: 10717.9, 300 sec: 9984.0). Total num frames: 2396160. Throughput: 0: 10684.6. Samples: 2390956. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:17:53,386][41256] Avg episode reward: [(0, '33.691')] +[2023-03-11 15:17:55,283][41544] Updated weights for policy 0, policy_version 4720 (0.0005) +[2023-03-11 15:17:58,385][41256] Fps is (10 sec: 10649.7, 60 sec: 10717.9, 300 sec: 9997.6). Total num frames: 2449408. Throughput: 0: 10688.9. Samples: 2422632. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:17:58,386][41256] Avg episode reward: [(0, '32.622')] +[2023-03-11 15:17:59,044][41544] Updated weights for policy 0, policy_version 4800 (0.0004) +[2023-03-11 15:18:02,868][41544] Updated weights for policy 0, policy_version 4880 (0.0004) +[2023-03-11 15:18:03,386][41256] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10010.6). Total num frames: 2502656. Throughput: 0: 10722.9. Samples: 2487688. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 15:18:03,386][41256] Avg episode reward: [(0, '30.940')] +[2023-03-11 15:18:03,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000004888_2502656.pth... +[2023-03-11 15:18:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000004264_2183168.pth +[2023-03-11 15:18:06,692][41544] Updated weights for policy 0, policy_version 4960 (0.0004) +[2023-03-11 15:18:08,385][41256] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10023.2). Total num frames: 2555904. Throughput: 0: 10714.7. Samples: 2551872. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 15:18:08,386][41256] Avg episode reward: [(0, '34.127')] +[2023-03-11 15:18:10,525][41544] Updated weights for policy 0, policy_version 5040 (0.0005) +[2023-03-11 15:18:13,385][41256] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10035.2). Total num frames: 2609152. Throughput: 0: 10717.8. Samples: 2583936. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:18:13,386][41256] Avg episode reward: [(0, '33.928')] +[2023-03-11 15:18:14,342][41544] Updated weights for policy 0, policy_version 5120 (0.0004) +[2023-03-11 15:18:18,142][41544] Updated weights for policy 0, policy_version 5200 (0.0004) +[2023-03-11 15:18:18,386][41256] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10046.8). Total num frames: 2662400. Throughput: 0: 10702.2. Samples: 2648340. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 15:18:18,386][41256] Avg episode reward: [(0, '34.084')] +[2023-03-11 15:18:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000005200_2662400.pth... +[2023-03-11 15:18:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000004576_2342912.pth +[2023-03-11 15:18:21,964][41544] Updated weights for policy 0, policy_version 5280 (0.0004) +[2023-03-11 15:18:23,385][41256] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10058.0). Total num frames: 2715648. Throughput: 0: 10732.9. Samples: 2712800. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 15:18:23,386][41256] Avg episode reward: [(0, '34.196')] +[2023-03-11 15:18:25,717][41544] Updated weights for policy 0, policy_version 5360 (0.0004) +[2023-03-11 15:18:28,385][41256] Fps is (10 sec: 11059.2, 60 sec: 10786.1, 300 sec: 10083.6). Total num frames: 2772992. Throughput: 0: 10755.9. Samples: 2745464. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:18:28,386][41256] Avg episode reward: [(0, '35.090')] +[2023-03-11 15:18:28,387][41500] Saving new best policy, reward=35.090! +[2023-03-11 15:18:29,462][41544] Updated weights for policy 0, policy_version 5440 (0.0004) +[2023-03-11 15:18:33,349][41544] Updated weights for policy 0, policy_version 5520 (0.0005) +[2023-03-11 15:18:33,386][41256] Fps is (10 sec: 11059.2, 60 sec: 10786.1, 300 sec: 10093.7). 
Total num frames: 2826240. Throughput: 0: 10747.0. Samples: 2810216. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:18:33,386][41256] Avg episode reward: [(0, '34.103')] +[2023-03-11 15:18:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000005520_2826240.pth... +[2023-03-11 15:18:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000004888_2502656.pth +[2023-03-11 15:18:37,145][41544] Updated weights for policy 0, policy_version 5600 (0.0004) +[2023-03-11 15:18:38,385][41256] Fps is (10 sec: 10649.6, 60 sec: 10786.1, 300 sec: 10103.5). Total num frames: 2879488. Throughput: 0: 10747.3. Samples: 2874584. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:18:38,386][41256] Avg episode reward: [(0, '34.422')] +[2023-03-11 15:18:40,949][41544] Updated weights for policy 0, policy_version 5680 (0.0003) +[2023-03-11 15:18:43,385][41256] Fps is (10 sec: 10649.6, 60 sec: 10786.1, 300 sec: 10112.9). Total num frames: 2932736. Throughput: 0: 10763.6. Samples: 2906996. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:18:43,386][41256] Avg episode reward: [(0, '33.313')] +[2023-03-11 15:18:44,860][41544] Updated weights for policy 0, policy_version 5760 (0.0005) +[2023-03-11 15:18:48,386][41256] Fps is (10 sec: 10649.5, 60 sec: 10717.9, 300 sec: 10122.0). Total num frames: 2985984. Throughput: 0: 10713.1. Samples: 2969776. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:18:48,386][41256] Avg episode reward: [(0, '34.297')] +[2023-03-11 15:18:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000005832_2985984.pth... 
+[2023-03-11 15:18:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000005200_2662400.pth +[2023-03-11 15:18:48,706][41544] Updated weights for policy 0, policy_version 5840 (0.0004) +[2023-03-11 15:18:52,454][41544] Updated weights for policy 0, policy_version 5920 (0.0004) +[2023-03-11 15:18:53,385][41256] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10302.5). Total num frames: 3039232. Throughput: 0: 10739.5. Samples: 3035148. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 15:18:53,386][41256] Avg episode reward: [(0, '33.817')] +[2023-03-11 15:18:56,322][41544] Updated weights for policy 0, policy_version 6000 (0.0005) +[2023-03-11 15:18:58,385][41256] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10483.0). Total num frames: 3092480. Throughput: 0: 10728.4. Samples: 3066716. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 15:18:58,386][41256] Avg episode reward: [(0, '35.217')] +[2023-03-11 15:18:58,387][41500] Saving new best policy, reward=35.217! +[2023-03-11 15:19:00,186][41544] Updated weights for policy 0, policy_version 6080 (0.0004) +[2023-03-11 15:19:03,386][41256] Fps is (10 sec: 10649.5, 60 sec: 10717.9, 300 sec: 10510.8). Total num frames: 3145728. Throughput: 0: 10707.4. Samples: 3130172. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:19:03,386][41256] Avg episode reward: [(0, '35.073')] +[2023-03-11 15:19:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000006144_3145728.pth... 
+[2023-03-11 15:19:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000005520_2826240.pth +[2023-03-11 15:19:03,987][41544] Updated weights for policy 0, policy_version 6160 (0.0004) +[2023-03-11 15:19:07,942][41544] Updated weights for policy 0, policy_version 6240 (0.0005) +[2023-03-11 15:19:08,385][41256] Fps is (10 sec: 10649.6, 60 sec: 10717.9, 300 sec: 10524.6). Total num frames: 3198976. Throughput: 0: 10694.2. Samples: 3194040. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:19:08,386][41256] Avg episode reward: [(0, '34.869')] +[2023-03-11 15:19:11,822][41544] Updated weights for policy 0, policy_version 6320 (0.0004) +[2023-03-11 15:19:13,385][41256] Fps is (10 sec: 10649.7, 60 sec: 10717.9, 300 sec: 10524.6). Total num frames: 3252224. Throughput: 0: 10678.1. Samples: 3225980. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:19:13,386][41256] Avg episode reward: [(0, '34.908')] +[2023-03-11 15:19:15,666][41544] Updated weights for policy 0, policy_version 6400 (0.0005) +[2023-03-11 15:19:18,386][41256] Fps is (10 sec: 10649.5, 60 sec: 10717.9, 300 sec: 10538.5). Total num frames: 3305472. Throughput: 0: 10643.0. Samples: 3289152. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:19:18,386][41256] Avg episode reward: [(0, '35.004')] +[2023-03-11 15:19:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000006456_3305472.pth... +[2023-03-11 15:19:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000005832_2985984.pth +[2023-03-11 15:19:19,525][41544] Updated weights for policy 0, policy_version 6480 (0.0004) +[2023-03-11 15:19:23,386][41256] Fps is (10 sec: 10240.0, 60 sec: 10649.6, 300 sec: 10524.6). Total num frames: 3354624. Throughput: 0: 10623.0. Samples: 3352620. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:19:23,386][41256] Avg episode reward: [(0, '34.526')] +[2023-03-11 15:19:23,390][41544] Updated weights for policy 0, policy_version 6560 (0.0004) +[2023-03-11 15:19:27,271][41544] Updated weights for policy 0, policy_version 6640 (0.0004) +[2023-03-11 15:19:28,386][41256] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10538.5). Total num frames: 3407872. Throughput: 0: 10610.8. Samples: 3384484. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 15:19:28,386][41256] Avg episode reward: [(0, '35.162')] +[2023-03-11 15:19:31,046][41544] Updated weights for policy 0, policy_version 6720 (0.0004) +[2023-03-11 15:19:33,385][41256] Fps is (10 sec: 11059.2, 60 sec: 10649.6, 300 sec: 10552.4). Total num frames: 3465216. Throughput: 0: 10649.4. Samples: 3449000. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 15:19:33,386][41256] Avg episode reward: [(0, '35.657')] +[2023-03-11 15:19:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000006768_3465216.pth... +[2023-03-11 15:19:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000006144_3145728.pth +[2023-03-11 15:19:33,392][41500] Saving new best policy, reward=35.657! +[2023-03-11 15:19:34,835][41544] Updated weights for policy 0, policy_version 6800 (0.0004) +[2023-03-11 15:19:38,386][41256] Fps is (10 sec: 11059.2, 60 sec: 10649.6, 300 sec: 10552.4). Total num frames: 3518464. Throughput: 0: 10647.3. Samples: 3514276. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 15:19:38,386][41256] Avg episode reward: [(0, '35.614')] +[2023-03-11 15:19:38,611][41544] Updated weights for policy 0, policy_version 6880 (0.0004) +[2023-03-11 15:19:42,448][41544] Updated weights for policy 0, policy_version 6960 (0.0004) +[2023-03-11 15:19:43,385][41256] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10552.4). 
Total num frames: 3571712. Throughput: 0: 10676.2. Samples: 3547144. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:19:43,386][41256] Avg episode reward: [(0, '36.398')] +[2023-03-11 15:19:43,387][41500] Saving new best policy, reward=36.398! +[2023-03-11 15:19:46,257][41544] Updated weights for policy 0, policy_version 7040 (0.0005) +[2023-03-11 15:19:48,386][41256] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10552.4). Total num frames: 3624960. Throughput: 0: 10678.7. Samples: 3610712. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:19:48,386][41256] Avg episode reward: [(0, '36.452')] +[2023-03-11 15:19:48,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000007080_3624960.pth... +[2023-03-11 15:19:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000006456_3305472.pth +[2023-03-11 15:19:48,392][41500] Saving new best policy, reward=36.452! +[2023-03-11 15:19:50,090][41544] Updated weights for policy 0, policy_version 7120 (0.0004) +[2023-03-11 15:19:53,385][41256] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10552.4). Total num frames: 3678208. Throughput: 0: 10675.7. Samples: 3674448. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:19:53,386][41256] Avg episode reward: [(0, '35.665')] +[2023-03-11 15:19:53,908][41544] Updated weights for policy 0, policy_version 7200 (0.0004) +[2023-03-11 15:19:57,762][41544] Updated weights for policy 0, policy_version 7280 (0.0004) +[2023-03-11 15:19:58,385][41256] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10566.3). Total num frames: 3731456. Throughput: 0: 10688.1. Samples: 3706944. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 15:19:58,386][41256] Avg episode reward: [(0, '36.634')] +[2023-03-11 15:19:58,387][41500] Saving new best policy, reward=36.634! 
+[2023-03-11 15:20:01,682][41544] Updated weights for policy 0, policy_version 7360 (0.0005) +[2023-03-11 15:20:03,386][41256] Fps is (10 sec: 10649.5, 60 sec: 10649.6, 300 sec: 10566.3). Total num frames: 3784704. Throughput: 0: 10683.6. Samples: 3769916. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 15:20:03,386][41256] Avg episode reward: [(0, '36.708')] +[2023-03-11 15:20:03,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000007392_3784704.pth... +[2023-03-11 15:20:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000006768_3465216.pth +[2023-03-11 15:20:03,393][41500] Saving new best policy, reward=36.708! +[2023-03-11 15:20:05,640][41544] Updated weights for policy 0, policy_version 7440 (0.0005) +[2023-03-11 15:20:08,385][41256] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10566.3). Total num frames: 3837952. Throughput: 0: 10680.3. Samples: 3833232. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 15:20:08,386][41256] Avg episode reward: [(0, '33.462')] +[2023-03-11 15:20:09,413][41544] Updated weights for policy 0, policy_version 7520 (0.0003) +[2023-03-11 15:20:13,293][41544] Updated weights for policy 0, policy_version 7600 (0.0004) +[2023-03-11 15:20:13,385][41256] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10580.2). Total num frames: 3891200. Throughput: 0: 10686.6. Samples: 3865380. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:20:13,386][41256] Avg episode reward: [(0, '35.560')] +[2023-03-11 15:20:17,220][41544] Updated weights for policy 0, policy_version 7680 (0.0005) +[2023-03-11 15:20:18,386][41256] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10566.3). Total num frames: 3940352. Throughput: 0: 10647.3. Samples: 3928128. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 15:20:18,386][41256] Avg episode reward: [(0, '36.574')] +[2023-03-11 15:20:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000007704_3944448.pth... +[2023-03-11 15:20:18,390][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000007080_3624960.pth +[2023-03-11 15:20:21,133][41544] Updated weights for policy 0, policy_version 7760 (0.0004) +[2023-03-11 15:20:23,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10649.6, 300 sec: 10580.2). Total num frames: 3993600. Throughput: 0: 10593.2. Samples: 3990968. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:20:23,386][41256] Avg episode reward: [(0, '36.511')] +[2023-03-11 15:20:24,964][41544] Updated weights for policy 0, policy_version 7840 (0.0004) +[2023-03-11 15:20:28,385][41256] Fps is (10 sec: 11059.2, 60 sec: 10717.9, 300 sec: 10594.1). Total num frames: 4050944. Throughput: 0: 10585.9. Samples: 4023508. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:20:28,386][41256] Avg episode reward: [(0, '34.912')] +[2023-03-11 15:20:28,761][41544] Updated weights for policy 0, policy_version 7920 (0.0004) +[2023-03-11 15:20:32,543][41544] Updated weights for policy 0, policy_version 8000 (0.0005) +[2023-03-11 15:20:33,386][41256] Fps is (10 sec: 11059.1, 60 sec: 10649.6, 300 sec: 10607.9). Total num frames: 4104192. Throughput: 0: 10603.7. Samples: 4087880. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:20:33,386][41256] Avg episode reward: [(0, '34.973')] +[2023-03-11 15:20:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000008016_4104192.pth... 
+[2023-03-11 15:20:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000007392_3784704.pth +[2023-03-11 15:20:36,429][41544] Updated weights for policy 0, policy_version 8080 (0.0004) +[2023-03-11 15:20:38,385][41256] Fps is (10 sec: 10649.6, 60 sec: 10649.6, 300 sec: 10607.9). Total num frames: 4157440. Throughput: 0: 10612.1. Samples: 4151992. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 15:20:38,386][41256] Avg episode reward: [(0, '35.386')] +[2023-03-11 15:20:40,275][41544] Updated weights for policy 0, policy_version 8160 (0.0004) +[2023-03-11 15:20:43,385][41256] Fps is (10 sec: 10649.7, 60 sec: 10649.6, 300 sec: 10621.8). Total num frames: 4210688. Throughput: 0: 10598.6. Samples: 4183880. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:20:43,386][41256] Avg episode reward: [(0, '36.115')] +[2023-03-11 15:20:44,093][41544] Updated weights for policy 0, policy_version 8240 (0.0004) +[2023-03-11 15:20:48,024][41544] Updated weights for policy 0, policy_version 8320 (0.0004) +[2023-03-11 15:20:48,386][41256] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10594.1). Total num frames: 4259840. Throughput: 0: 10615.6. Samples: 4247616. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:20:48,386][41256] Avg episode reward: [(0, '33.341')] +[2023-03-11 15:20:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000008320_4259840.pth... +[2023-03-11 15:20:48,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000007704_3944448.pth +[2023-03-11 15:20:52,003][41544] Updated weights for policy 0, policy_version 8400 (0.0005) +[2023-03-11 15:20:53,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10607.9). Total num frames: 4313088. Throughput: 0: 10574.0. Samples: 4309064. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:20:53,386][41256] Avg episode reward: [(0, '30.475')] +[2023-03-11 15:20:55,909][41544] Updated weights for policy 0, policy_version 8480 (0.0004) +[2023-03-11 15:20:58,385][41256] Fps is (10 sec: 10649.7, 60 sec: 10581.3, 300 sec: 10594.1). Total num frames: 4366336. Throughput: 0: 10569.9. Samples: 4341024. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 15:20:58,386][41256] Avg episode reward: [(0, '35.372')] +[2023-03-11 15:20:59,758][41544] Updated weights for policy 0, policy_version 8560 (0.0004) +[2023-03-11 15:21:03,386][41256] Fps is (10 sec: 10649.5, 60 sec: 10581.3, 300 sec: 10607.9). Total num frames: 4419584. Throughput: 0: 10584.5. Samples: 4404432. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 15:21:03,386][41256] Avg episode reward: [(0, '34.405')] +[2023-03-11 15:21:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000008632_4419584.pth... +[2023-03-11 15:21:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000008016_4104192.pth +[2023-03-11 15:21:03,602][41544] Updated weights for policy 0, policy_version 8640 (0.0004) +[2023-03-11 15:21:07,544][41544] Updated weights for policy 0, policy_version 8720 (0.0005) +[2023-03-11 15:21:08,385][41256] Fps is (10 sec: 10649.6, 60 sec: 10581.3, 300 sec: 10607.9). Total num frames: 4472832. Throughput: 0: 10596.6. Samples: 4467816. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:21:08,386][41256] Avg episode reward: [(0, '36.844')] +[2023-03-11 15:21:08,387][41500] Saving new best policy, reward=36.844! +[2023-03-11 15:21:11,481][41544] Updated weights for policy 0, policy_version 8800 (0.0005) +[2023-03-11 15:21:13,385][41256] Fps is (10 sec: 10240.1, 60 sec: 10513.1, 300 sec: 10607.9). Total num frames: 4521984. Throughput: 0: 10566.6. Samples: 4499004. 
Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 15:21:13,386][41256] Avg episode reward: [(0, '37.254')] +[2023-03-11 15:21:13,387][41500] Saving new best policy, reward=37.254! +[2023-03-11 15:21:15,456][41544] Updated weights for policy 0, policy_version 8880 (0.0004) +[2023-03-11 15:21:18,386][41256] Fps is (10 sec: 10240.0, 60 sec: 10581.3, 300 sec: 10621.8). Total num frames: 4575232. Throughput: 0: 10496.3. Samples: 4560212. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 15:21:18,386][41256] Avg episode reward: [(0, '35.924')] +[2023-03-11 15:21:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000008936_4575232.pth... +[2023-03-11 15:21:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000008320_4259840.pth +[2023-03-11 15:21:19,592][41544] Updated weights for policy 0, policy_version 8960 (0.0005) +[2023-03-11 15:21:23,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10513.1, 300 sec: 10621.8). Total num frames: 4624384. Throughput: 0: 10408.0. Samples: 4620352. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:21:23,386][41256] Avg episode reward: [(0, '37.241')] +[2023-03-11 15:21:23,619][41544] Updated weights for policy 0, policy_version 9040 (0.0005) +[2023-03-11 15:21:27,919][41544] Updated weights for policy 0, policy_version 9120 (0.0005) +[2023-03-11 15:21:28,385][41256] Fps is (10 sec: 9830.4, 60 sec: 10376.5, 300 sec: 10607.9). Total num frames: 4673536. Throughput: 0: 10339.6. Samples: 4649164. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:21:28,386][41256] Avg episode reward: [(0, '38.152')] +[2023-03-11 15:21:28,387][41500] Saving new best policy, reward=38.152! +[2023-03-11 15:21:32,111][41544] Updated weights for policy 0, policy_version 9200 (0.0005) +[2023-03-11 15:21:33,386][41256] Fps is (10 sec: 9830.4, 60 sec: 10308.3, 300 sec: 10594.1). Total num frames: 4722688. 
Throughput: 0: 10223.9. Samples: 4707692. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:21:33,386][41256] Avg episode reward: [(0, '37.342')] +[2023-03-11 15:21:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000009224_4722688.pth... +[2023-03-11 15:21:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000008632_4419584.pth +[2023-03-11 15:21:36,295][41544] Updated weights for policy 0, policy_version 9280 (0.0005) +[2023-03-11 15:21:38,385][41256] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 10580.2). Total num frames: 4771840. Throughput: 0: 10174.2. Samples: 4766904. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:21:38,386][41256] Avg episode reward: [(0, '38.056')] +[2023-03-11 15:21:40,431][41544] Updated weights for policy 0, policy_version 9360 (0.0005) +[2023-03-11 15:21:43,385][41256] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10566.3). Total num frames: 4820992. Throughput: 0: 10120.0. Samples: 4796424. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:21:43,386][41256] Avg episode reward: [(0, '38.691')] +[2023-03-11 15:21:43,386][41500] Saving new best policy, reward=38.691! +[2023-03-11 15:21:44,431][41544] Updated weights for policy 0, policy_version 9440 (0.0004) +[2023-03-11 15:21:48,338][41544] Updated weights for policy 0, policy_version 9520 (0.0004) +[2023-03-11 15:21:48,386][41256] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10580.2). Total num frames: 4874240. Throughput: 0: 10088.7. Samples: 4858424. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:21:48,386][41256] Avg episode reward: [(0, '39.103')] +[2023-03-11 15:21:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000009520_4874240.pth... 
+[2023-03-11 15:21:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000008936_4575232.pth +[2023-03-11 15:21:48,392][41500] Saving new best policy, reward=39.103! +[2023-03-11 15:21:52,437][41544] Updated weights for policy 0, policy_version 9600 (0.0005) +[2023-03-11 15:21:53,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10566.3). Total num frames: 4923392. Throughput: 0: 10032.9. Samples: 4919296. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 15:21:53,386][41256] Avg episode reward: [(0, '37.862')] +[2023-03-11 15:21:56,475][41544] Updated weights for policy 0, policy_version 9680 (0.0005) +[2023-03-11 15:21:58,385][41256] Fps is (10 sec: 9830.5, 60 sec: 10103.5, 300 sec: 10552.4). Total num frames: 4972544. Throughput: 0: 10004.9. Samples: 4949224. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 15:21:58,386][41256] Avg episode reward: [(0, '38.445')] +[2023-03-11 15:22:00,462][41544] Updated weights for policy 0, policy_version 9760 (0.0005) +[2023-03-11 15:22:03,386][41256] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 10552.4). Total num frames: 5025792. Throughput: 0: 10021.0. Samples: 5011156. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 15:22:03,386][41256] Avg episode reward: [(0, '37.811')] +[2023-03-11 15:22:03,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000009816_5025792.pth... +[2023-03-11 15:22:03,393][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000009224_4722688.pth +[2023-03-11 15:22:04,531][41544] Updated weights for policy 0, policy_version 9840 (0.0005) +[2023-03-11 15:22:08,312][41544] Updated weights for policy 0, policy_version 9920 (0.0004) +[2023-03-11 15:22:08,385][41256] Fps is (10 sec: 10649.6, 60 sec: 10103.5, 300 sec: 10552.4). Total num frames: 5079040. Throughput: 0: 10071.9. 
Samples: 5073588. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:22:08,386][41256] Avg episode reward: [(0, '36.935')] +[2023-03-11 15:22:12,140][41544] Updated weights for policy 0, policy_version 10000 (0.0005) +[2023-03-11 15:22:13,385][41256] Fps is (10 sec: 10649.7, 60 sec: 10171.7, 300 sec: 10552.4). Total num frames: 5132288. Throughput: 0: 10152.4. Samples: 5106024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:22:13,386][41256] Avg episode reward: [(0, '28.332')] +[2023-03-11 15:22:15,984][41544] Updated weights for policy 0, policy_version 10080 (0.0005) +[2023-03-11 15:22:18,386][41256] Fps is (10 sec: 10649.5, 60 sec: 10171.7, 300 sec: 10552.4). Total num frames: 5185536. Throughput: 0: 10260.1. Samples: 5169396. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:22:18,386][41256] Avg episode reward: [(0, '36.726')] +[2023-03-11 15:22:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000010128_5185536.pth... +[2023-03-11 15:22:18,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000009520_4874240.pth +[2023-03-11 15:22:19,868][41544] Updated weights for policy 0, policy_version 10160 (0.0004) +[2023-03-11 15:22:23,386][41256] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 10538.5). Total num frames: 5234688. Throughput: 0: 10351.6. Samples: 5232728. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 15:22:23,386][41256] Avg episode reward: [(0, '38.472')] +[2023-03-11 15:22:23,879][41544] Updated weights for policy 0, policy_version 10240 (0.0004) +[2023-03-11 15:22:28,100][41544] Updated weights for policy 0, policy_version 10320 (0.0005) +[2023-03-11 15:22:28,386][41256] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10524.6). Total num frames: 5283840. Throughput: 0: 10351.5. Samples: 5262244. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 15:22:28,386][41256] Avg episode reward: [(0, '40.084')] +[2023-03-11 15:22:28,387][41500] Saving new best policy, reward=40.084! +[2023-03-11 15:22:32,311][41544] Updated weights for policy 0, policy_version 10400 (0.0005) +[2023-03-11 15:22:33,386][41256] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10510.7). Total num frames: 5332992. Throughput: 0: 10264.7. Samples: 5320336. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:22:33,386][41256] Avg episode reward: [(0, '39.835')] +[2023-03-11 15:22:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000010416_5332992.pth... +[2023-03-11 15:22:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000009816_5025792.pth +[2023-03-11 15:22:36,476][41544] Updated weights for policy 0, policy_version 10480 (0.0005) +[2023-03-11 15:22:38,385][41256] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10496.9). Total num frames: 5382144. Throughput: 0: 10213.9. Samples: 5378924. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:22:38,386][41256] Avg episode reward: [(0, '40.137')] +[2023-03-11 15:22:38,387][41500] Saving new best policy, reward=40.137! +[2023-03-11 15:22:40,738][41544] Updated weights for policy 0, policy_version 10560 (0.0005) +[2023-03-11 15:22:43,385][41256] Fps is (10 sec: 9830.5, 60 sec: 10171.7, 300 sec: 10469.1). Total num frames: 5431296. Throughput: 0: 10185.2. Samples: 5407560. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 15:22:43,386][41256] Avg episode reward: [(0, '40.290')] +[2023-03-11 15:22:43,386][41500] Saving new best policy, reward=40.290! +[2023-03-11 15:22:44,992][41544] Updated weights for policy 0, policy_version 10640 (0.0005) +[2023-03-11 15:22:48,386][41256] Fps is (10 sec: 9420.8, 60 sec: 10035.2, 300 sec: 10441.3). Total num frames: 5476352. 
Throughput: 0: 10095.0. Samples: 5465432. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:22:48,386][41256] Avg episode reward: [(0, '40.857')] +[2023-03-11 15:22:48,405][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000010704_5480448.pth... +[2023-03-11 15:22:48,407][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000010128_5185536.pth +[2023-03-11 15:22:48,407][41500] Saving new best policy, reward=40.857! +[2023-03-11 15:22:49,224][41544] Updated weights for policy 0, policy_version 10720 (0.0005) +[2023-03-11 15:22:53,284][41544] Updated weights for policy 0, policy_version 10800 (0.0005) +[2023-03-11 15:22:53,385][41256] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10441.3). Total num frames: 5529600. Throughput: 0: 10042.7. Samples: 5525512. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:22:53,386][41256] Avg episode reward: [(0, '39.905')] +[2023-03-11 15:22:57,311][41544] Updated weights for policy 0, policy_version 10880 (0.0005) +[2023-03-11 15:22:58,385][41256] Fps is (10 sec: 10240.1, 60 sec: 10103.5, 300 sec: 10427.4). Total num frames: 5578752. Throughput: 0: 9992.2. Samples: 5555672. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 15:22:58,386][41256] Avg episode reward: [(0, '35.219')] +[2023-03-11 15:23:01,613][41544] Updated weights for policy 0, policy_version 10960 (0.0005) +[2023-03-11 15:23:03,386][41256] Fps is (10 sec: 9830.3, 60 sec: 10035.2, 300 sec: 10413.6). Total num frames: 5627904. Throughput: 0: 9879.2. Samples: 5613960. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 15:23:03,386][41256] Avg episode reward: [(0, '37.326')] +[2023-03-11 15:23:03,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000010992_5627904.pth... 
+[2023-03-11 15:23:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000010416_5332992.pth +[2023-03-11 15:23:05,864][41544] Updated weights for policy 0, policy_version 11040 (0.0005) +[2023-03-11 15:23:08,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9898.7, 300 sec: 10385.8). Total num frames: 5672960. Throughput: 0: 9748.5. Samples: 5671408. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 15:23:08,386][41256] Avg episode reward: [(0, '39.519')] +[2023-03-11 15:23:10,133][41544] Updated weights for policy 0, policy_version 11120 (0.0005) +[2023-03-11 15:23:13,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9830.4, 300 sec: 10371.9). Total num frames: 5722112. Throughput: 0: 9729.5. Samples: 5700072. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:23:13,386][41256] Avg episode reward: [(0, '40.058')] +[2023-03-11 15:23:14,499][41544] Updated weights for policy 0, policy_version 11200 (0.0005) +[2023-03-11 15:23:18,386][41256] Fps is (10 sec: 9830.3, 60 sec: 9762.1, 300 sec: 10358.0). Total num frames: 5771264. Throughput: 0: 9723.4. Samples: 5757888. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:23:18,386][41256] Avg episode reward: [(0, '40.322')] +[2023-03-11 15:23:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000011272_5771264.pth... +[2023-03-11 15:23:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000010704_5480448.pth +[2023-03-11 15:23:18,620][41544] Updated weights for policy 0, policy_version 11280 (0.0004) +[2023-03-11 15:23:22,653][41544] Updated weights for policy 0, policy_version 11360 (0.0004) +[2023-03-11 15:23:23,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 10330.3). Total num frames: 5820416. Throughput: 0: 9763.6. Samples: 5818288. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 15:23:23,386][41256] Avg episode reward: [(0, '41.584')] +[2023-03-11 15:23:23,389][41500] Saving new best policy, reward=41.584! +[2023-03-11 15:23:26,576][41544] Updated weights for policy 0, policy_version 11440 (0.0004) +[2023-03-11 15:23:28,385][41256] Fps is (10 sec: 10240.1, 60 sec: 9830.4, 300 sec: 10330.3). Total num frames: 5873664. Throughput: 0: 9820.7. Samples: 5849492. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 15:23:28,386][41256] Avg episode reward: [(0, '40.352')] +[2023-03-11 15:23:30,553][41544] Updated weights for policy 0, policy_version 11520 (0.0004) +[2023-03-11 15:23:33,386][41256] Fps is (10 sec: 10649.5, 60 sec: 9898.7, 300 sec: 10330.2). Total num frames: 5926912. Throughput: 0: 9913.4. Samples: 5911536. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:23:33,386][41256] Avg episode reward: [(0, '40.951')] +[2023-03-11 15:23:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000011576_5926912.pth... +[2023-03-11 15:23:33,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000010992_5627904.pth +[2023-03-11 15:23:34,558][41544] Updated weights for policy 0, policy_version 11600 (0.0004) +[2023-03-11 15:23:38,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 10316.4). Total num frames: 5976064. Throughput: 0: 9921.5. Samples: 5971980. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:23:38,386][41256] Avg episode reward: [(0, '40.855')] +[2023-03-11 15:23:38,702][41544] Updated weights for policy 0, policy_version 11680 (0.0004) +[2023-03-11 15:23:42,705][41544] Updated weights for policy 0, policy_version 11760 (0.0004) +[2023-03-11 15:23:43,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9898.7, 300 sec: 10302.5). Total num frames: 6025216. Throughput: 0: 9919.4. Samples: 6002044. 
Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 15:23:43,386][41256] Avg episode reward: [(0, '40.668')] +[2023-03-11 15:23:46,736][41544] Updated weights for policy 0, policy_version 11840 (0.0004) +[2023-03-11 15:23:48,390][41256] Fps is (10 sec: 10235.4, 60 sec: 10034.5, 300 sec: 10302.3). Total num frames: 6078464. Throughput: 0: 9977.1. Samples: 6062972. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 15:23:48,391][41256] Avg episode reward: [(0, '40.791')] +[2023-03-11 15:23:48,394][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000011872_6078464.pth... +[2023-03-11 15:23:48,396][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000011272_5771264.pth +[2023-03-11 15:23:50,808][41544] Updated weights for policy 0, policy_version 11920 (0.0003) +[2023-03-11 15:23:53,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 10288.6). Total num frames: 6127616. Throughput: 0: 10048.5. Samples: 6123592. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 15:23:53,386][41256] Avg episode reward: [(0, '37.851')] +[2023-03-11 15:23:54,774][41544] Updated weights for policy 0, policy_version 12000 (0.0003) +[2023-03-11 15:23:58,385][41256] Fps is (10 sec: 9834.8, 60 sec: 9966.9, 300 sec: 10274.7). Total num frames: 6176768. Throughput: 0: 10097.3. Samples: 6154452. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:23:58,386][41256] Avg episode reward: [(0, '38.501')] +[2023-03-11 15:23:59,063][41544] Updated weights for policy 0, policy_version 12080 (0.0005) +[2023-03-11 15:24:03,100][41544] Updated weights for policy 0, policy_version 12160 (0.0004) +[2023-03-11 15:24:03,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 10260.8). Total num frames: 6225920. Throughput: 0: 10127.4. Samples: 6213620. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:24:03,386][41256] Avg episode reward: [(0, '40.199')] +[2023-03-11 15:24:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000012160_6225920.pth... +[2023-03-11 15:24:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000011576_5926912.pth +[2023-03-11 15:24:07,086][41544] Updated weights for policy 0, policy_version 12240 (0.0003) +[2023-03-11 15:24:08,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10260.8). Total num frames: 6279168. Throughput: 0: 10150.6. Samples: 6275064. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 15:24:08,386][41256] Avg episode reward: [(0, '41.022')] +[2023-03-11 15:24:11,162][41544] Updated weights for policy 0, policy_version 12320 (0.0004) +[2023-03-11 15:24:13,385][41256] Fps is (10 sec: 10240.1, 60 sec: 10103.5, 300 sec: 10246.9). Total num frames: 6328320. Throughput: 0: 10110.0. Samples: 6304440. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 15:24:13,386][41256] Avg episode reward: [(0, '40.496')] +[2023-03-11 15:24:15,623][41544] Updated weights for policy 0, policy_version 12400 (0.0005) +[2023-03-11 15:24:18,386][41256] Fps is (10 sec: 9420.8, 60 sec: 10035.2, 300 sec: 10233.1). Total num frames: 6373376. Throughput: 0: 9990.3. Samples: 6361100. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:24:18,386][41256] Avg episode reward: [(0, '39.285')] +[2023-03-11 15:24:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000012448_6373376.pth... 
+[2023-03-11 15:24:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000011872_6078464.pth +[2023-03-11 15:24:19,815][41544] Updated weights for policy 0, policy_version 12480 (0.0005) +[2023-03-11 15:24:23,385][41256] Fps is (10 sec: 9420.8, 60 sec: 10035.2, 300 sec: 10219.2). Total num frames: 6422528. Throughput: 0: 9966.3. Samples: 6420464. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:24:23,386][41256] Avg episode reward: [(0, '44.000')] +[2023-03-11 15:24:23,387][41500] Saving new best policy, reward=44.000! +[2023-03-11 15:24:23,802][41544] Updated weights for policy 0, policy_version 12560 (0.0005) +[2023-03-11 15:24:27,797][41544] Updated weights for policy 0, policy_version 12640 (0.0005) +[2023-03-11 15:24:28,385][41256] Fps is (10 sec: 10240.1, 60 sec: 10035.2, 300 sec: 10205.3). Total num frames: 6475776. Throughput: 0: 9993.1. Samples: 6451732. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 15:24:28,386][41256] Avg episode reward: [(0, '44.229')] +[2023-03-11 15:24:28,387][41500] Saving new best policy, reward=44.229! +[2023-03-11 15:24:31,972][41544] Updated weights for policy 0, policy_version 12720 (0.0005) +[2023-03-11 15:24:33,386][41256] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 10191.4). Total num frames: 6524928. Throughput: 0: 9970.0. Samples: 6511576. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 15:24:33,386][41256] Avg episode reward: [(0, '43.360')] +[2023-03-11 15:24:33,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000012744_6524928.pth... 
+[2023-03-11 15:24:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000012160_6225920.pth +[2023-03-11 15:24:36,090][41544] Updated weights for policy 0, policy_version 12800 (0.0005) +[2023-03-11 15:24:38,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 10177.5). Total num frames: 6574080. Throughput: 0: 9925.7. Samples: 6570248. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:24:38,386][41256] Avg episode reward: [(0, '41.831')] +[2023-03-11 15:24:40,286][41544] Updated weights for policy 0, policy_version 12880 (0.0005) +[2023-03-11 15:24:43,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 10163.6). Total num frames: 6623232. Throughput: 0: 9895.9. Samples: 6599768. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:24:43,386][41256] Avg episode reward: [(0, '40.577')] +[2023-03-11 15:24:44,544][41544] Updated weights for policy 0, policy_version 12960 (0.0005) +[2023-03-11 15:24:48,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9899.4, 300 sec: 10149.7). Total num frames: 6672384. Throughput: 0: 9893.2. Samples: 6658816. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:24:48,386][41256] Avg episode reward: [(0, '41.245')] +[2023-03-11 15:24:48,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000013032_6672384.pth... +[2023-03-11 15:24:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000012448_6373376.pth +[2023-03-11 15:24:48,536][41544] Updated weights for policy 0, policy_version 13040 (0.0005) +[2023-03-11 15:24:52,467][41544] Updated weights for policy 0, policy_version 13120 (0.0004) +[2023-03-11 15:24:53,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 10149.7). Total num frames: 6725632. Throughput: 0: 9919.9. Samples: 6721460. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:24:53,386][41256] Avg episode reward: [(0, '42.070')] +[2023-03-11 15:24:56,483][41544] Updated weights for policy 0, policy_version 13200 (0.0005) +[2023-03-11 15:24:58,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 10135.9). Total num frames: 6774784. Throughput: 0: 9945.5. Samples: 6751988. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 15:24:58,386][41256] Avg episode reward: [(0, '41.972')] +[2023-03-11 15:25:00,450][41544] Updated weights for policy 0, policy_version 13280 (0.0004) +[2023-03-11 15:25:03,386][41256] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10135.9). Total num frames: 6828032. Throughput: 0: 10047.1. Samples: 6813220. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 15:25:03,386][41256] Avg episode reward: [(0, '42.503')] +[2023-03-11 15:25:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000013336_6828032.pth... +[2023-03-11 15:25:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000012744_6524928.pth +[2023-03-11 15:25:04,509][41544] Updated weights for policy 0, policy_version 13360 (0.0005) +[2023-03-11 15:25:08,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 10122.0). Total num frames: 6877184. Throughput: 0: 10077.0. Samples: 6873928. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 15:25:08,386][41256] Avg episode reward: [(0, '42.635')] +[2023-03-11 15:25:08,517][41544] Updated weights for policy 0, policy_version 13440 (0.0004) +[2023-03-11 15:25:12,386][41544] Updated weights for policy 0, policy_version 13520 (0.0004) +[2023-03-11 15:25:13,385][41256] Fps is (10 sec: 10240.1, 60 sec: 10035.2, 300 sec: 10135.9). Total num frames: 6930432. Throughput: 0: 10093.3. Samples: 6905928. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 15:25:13,386][41256] Avg episode reward: [(0, '42.309')] +[2023-03-11 15:25:16,295][41544] Updated weights for policy 0, policy_version 13600 (0.0004) +[2023-03-11 15:25:18,386][41256] Fps is (10 sec: 10649.5, 60 sec: 10171.7, 300 sec: 10135.9). Total num frames: 6983680. Throughput: 0: 10161.4. Samples: 6968840. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 15:25:18,386][41256] Avg episode reward: [(0, '41.381')] +[2023-03-11 15:25:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000013640_6983680.pth... +[2023-03-11 15:25:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000013032_6672384.pth +[2023-03-11 15:25:20,263][41544] Updated weights for policy 0, policy_version 13680 (0.0004) +[2023-03-11 15:25:23,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10108.1). Total num frames: 7032832. Throughput: 0: 10214.3. Samples: 7029892. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 15:25:23,390][41256] Avg episode reward: [(0, '40.879')] +[2023-03-11 15:25:24,321][41544] Updated weights for policy 0, policy_version 13760 (0.0005) +[2023-03-11 15:25:28,268][41544] Updated weights for policy 0, policy_version 13840 (0.0004) +[2023-03-11 15:25:28,385][41256] Fps is (10 sec: 10240.1, 60 sec: 10171.7, 300 sec: 10108.1). Total num frames: 7086080. Throughput: 0: 10260.7. Samples: 7061500. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:25:28,396][41256] Avg episode reward: [(0, '41.382')] +[2023-03-11 15:25:32,384][41544] Updated weights for policy 0, policy_version 13920 (0.0004) +[2023-03-11 15:25:33,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10094.2). Total num frames: 7135232. Throughput: 0: 10290.9. Samples: 7121908. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:25:33,396][41256] Avg episode reward: [(0, '40.843')] +[2023-03-11 15:25:33,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000013936_7135232.pth... +[2023-03-11 15:25:33,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000013336_6828032.pth +[2023-03-11 15:25:36,689][41544] Updated weights for policy 0, policy_version 14000 (0.0005) +[2023-03-11 15:25:38,385][41256] Fps is (10 sec: 9420.8, 60 sec: 10103.5, 300 sec: 10066.4). Total num frames: 7180288. Throughput: 0: 10179.6. Samples: 7179540. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:25:38,396][41256] Avg episode reward: [(0, '37.074')] +[2023-03-11 15:25:40,973][41544] Updated weights for policy 0, policy_version 14080 (0.0005) +[2023-03-11 15:25:43,385][41256] Fps is (10 sec: 9420.9, 60 sec: 10103.5, 300 sec: 10066.4). Total num frames: 7229440. Throughput: 0: 10135.3. Samples: 7208076. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:25:43,386][41256] Avg episode reward: [(0, '33.342')] +[2023-03-11 15:25:45,132][41544] Updated weights for policy 0, policy_version 14160 (0.0005) +[2023-03-11 15:25:48,386][41256] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 10066.4). Total num frames: 7282688. Throughput: 0: 10112.1. Samples: 7268264. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 15:25:48,396][41256] Avg episode reward: [(0, '37.619')] +[2023-03-11 15:25:48,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000014224_7282688.pth... 
+[2023-03-11 15:25:48,403][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000013640_6983680.pth +[2023-03-11 15:25:48,927][41544] Updated weights for policy 0, policy_version 14240 (0.0004) +[2023-03-11 15:25:53,113][41544] Updated weights for policy 0, policy_version 14320 (0.0005) +[2023-03-11 15:25:53,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10052.6). Total num frames: 7331840. Throughput: 0: 10113.1. Samples: 7329016. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 15:25:53,396][41256] Avg episode reward: [(0, '33.948')] +[2023-03-11 15:25:57,094][41544] Updated weights for policy 0, policy_version 14400 (0.0005) +[2023-03-11 15:25:58,385][41256] Fps is (10 sec: 10240.1, 60 sec: 10171.7, 300 sec: 10052.6). Total num frames: 7385088. Throughput: 0: 10086.0. Samples: 7359796. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:25:58,396][41256] Avg episode reward: [(0, '38.156')] +[2023-03-11 15:26:01,051][41544] Updated weights for policy 0, policy_version 14480 (0.0004) +[2023-03-11 15:26:03,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10038.7). Total num frames: 7434240. Throughput: 0: 10064.9. Samples: 7421760. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:26:03,396][41256] Avg episode reward: [(0, '35.568')] +[2023-03-11 15:26:03,399][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000014520_7434240.pth... +[2023-03-11 15:26:03,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000013936_7135232.pth +[2023-03-11 15:26:05,283][41544] Updated weights for policy 0, policy_version 14560 (0.0005) +[2023-03-11 15:26:08,385][41256] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10038.7). Total num frames: 7483392. Throughput: 0: 9986.8. Samples: 7479296. 
Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 15:26:08,396][41256] Avg episode reward: [(0, '38.896')] +[2023-03-11 15:26:09,622][41544] Updated weights for policy 0, policy_version 14640 (0.0005) +[2023-03-11 15:26:13,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9966.9, 300 sec: 10010.9). Total num frames: 7528448. Throughput: 0: 9918.1. Samples: 7507812. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 15:26:13,386][41256] Avg episode reward: [(0, '39.747')] +[2023-03-11 15:26:13,876][41544] Updated weights for policy 0, policy_version 14720 (0.0005) +[2023-03-11 15:26:18,084][41544] Updated weights for policy 0, policy_version 14800 (0.0005) +[2023-03-11 15:26:18,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9898.7, 300 sec: 10010.9). Total num frames: 7577600. Throughput: 0: 9855.3. Samples: 7565396. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 15:26:18,386][41256] Avg episode reward: [(0, '40.815')] +[2023-03-11 15:26:18,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000014800_7577600.pth... +[2023-03-11 15:26:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000014224_7282688.pth +[2023-03-11 15:26:22,175][41544] Updated weights for policy 0, policy_version 14880 (0.0005) +[2023-03-11 15:26:23,385][41256] Fps is (10 sec: 10239.9, 60 sec: 9966.9, 300 sec: 10024.8). Total num frames: 7630848. Throughput: 0: 9912.5. Samples: 7625604. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:26:23,386][41256] Avg episode reward: [(0, '41.731')] +[2023-03-11 15:26:26,163][41544] Updated weights for policy 0, policy_version 14960 (0.0005) +[2023-03-11 15:26:28,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 10024.8). Total num frames: 7680000. Throughput: 0: 9953.6. Samples: 7655988. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:26:28,386][41256] Avg episode reward: [(0, '42.158')] +[2023-03-11 15:26:30,176][41544] Updated weights for policy 0, policy_version 15040 (0.0005) +[2023-03-11 15:26:33,386][41256] Fps is (10 sec: 10239.9, 60 sec: 9966.9, 300 sec: 10038.7). Total num frames: 7733248. Throughput: 0: 9984.8. Samples: 7717580. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:26:33,386][41256] Avg episode reward: [(0, '40.853')] +[2023-03-11 15:26:33,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000015104_7733248.pth... +[2023-03-11 15:26:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000014520_7434240.pth +[2023-03-11 15:26:34,139][41544] Updated weights for policy 0, policy_version 15120 (0.0005) +[2023-03-11 15:26:38,087][41544] Updated weights for policy 0, policy_version 15200 (0.0004) +[2023-03-11 15:26:38,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10038.7). Total num frames: 7782400. Throughput: 0: 10008.5. Samples: 7779400. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:26:38,386][41256] Avg episode reward: [(0, '39.621')] +[2023-03-11 15:26:42,028][41544] Updated weights for policy 0, policy_version 15280 (0.0004) +[2023-03-11 15:26:43,385][41256] Fps is (10 sec: 10240.1, 60 sec: 10103.5, 300 sec: 10038.7). Total num frames: 7835648. Throughput: 0: 10016.7. Samples: 7810548. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:26:43,386][41256] Avg episode reward: [(0, '38.995')] +[2023-03-11 15:26:46,141][41544] Updated weights for policy 0, policy_version 15360 (0.0005) +[2023-03-11 15:26:48,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10038.7). Total num frames: 7884800. Throughput: 0: 9994.4. Samples: 7871508. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:26:48,386][41256] Avg episode reward: [(0, '39.533')] +[2023-03-11 15:26:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000015400_7884800.pth... +[2023-03-11 15:26:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000014800_7577600.pth +[2023-03-11 15:26:50,314][41544] Updated weights for policy 0, policy_version 15440 (0.0005) +[2023-03-11 15:26:53,385][41256] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10038.7). Total num frames: 7933952. Throughput: 0: 10013.9. Samples: 7929920. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:26:53,386][41256] Avg episode reward: [(0, '37.480')] +[2023-03-11 15:26:54,537][41544] Updated weights for policy 0, policy_version 15520 (0.0005) +[2023-03-11 15:26:58,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9966.9, 300 sec: 10024.8). Total num frames: 7983104. Throughput: 0: 10029.9. Samples: 7959160. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:26:58,386][41256] Avg episode reward: [(0, '39.247')] +[2023-03-11 15:26:58,474][41544] Updated weights for policy 0, policy_version 15600 (0.0004) +[2023-03-11 15:27:02,540][41544] Updated weights for policy 0, policy_version 15680 (0.0004) +[2023-03-11 15:27:03,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10024.8). Total num frames: 8036352. Throughput: 0: 10117.0. Samples: 8020660. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:27:03,386][41256] Avg episode reward: [(0, '39.517')] +[2023-03-11 15:27:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000015696_8036352.pth... 
+[2023-03-11 15:27:03,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000015104_7733248.pth +[2023-03-11 15:27:06,373][41544] Updated weights for policy 0, policy_version 15760 (0.0004) +[2023-03-11 15:27:08,385][41256] Fps is (10 sec: 10649.6, 60 sec: 10103.5, 300 sec: 10024.8). Total num frames: 8089600. Throughput: 0: 10197.0. Samples: 8084468. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:27:08,386][41256] Avg episode reward: [(0, '38.069')] +[2023-03-11 15:27:10,271][41544] Updated weights for policy 0, policy_version 15840 (0.0004) +[2023-03-11 15:27:13,385][41256] Fps is (10 sec: 10240.1, 60 sec: 10171.7, 300 sec: 10010.9). Total num frames: 8138752. Throughput: 0: 10213.3. Samples: 8115588. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:27:13,386][41256] Avg episode reward: [(0, '39.156')] +[2023-03-11 15:27:14,440][41544] Updated weights for policy 0, policy_version 15920 (0.0005) +[2023-03-11 15:27:18,385][41256] Fps is (10 sec: 9830.4, 60 sec: 10171.8, 300 sec: 10010.9). Total num frames: 8187904. Throughput: 0: 10178.5. Samples: 8175612. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 15:27:18,386][41256] Avg episode reward: [(0, '40.603')] +[2023-03-11 15:27:18,415][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000016000_8192000.pth... +[2023-03-11 15:27:18,416][41544] Updated weights for policy 0, policy_version 16000 (0.0005) +[2023-03-11 15:27:18,417][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000015400_7884800.pth +[2023-03-11 15:27:22,436][41544] Updated weights for policy 0, policy_version 16080 (0.0005) +[2023-03-11 15:27:23,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10024.8). Total num frames: 8241152. Throughput: 0: 10170.5. Samples: 8237072. 
Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 15:27:23,386][41256] Avg episode reward: [(0, '41.877')] +[2023-03-11 15:27:26,432][41544] Updated weights for policy 0, policy_version 16160 (0.0004) +[2023-03-11 15:27:28,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10024.8). Total num frames: 8290304. Throughput: 0: 10156.1. Samples: 8267572. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 15:27:28,386][41256] Avg episode reward: [(0, '40.863')] +[2023-03-11 15:27:30,656][41544] Updated weights for policy 0, policy_version 16240 (0.0005) +[2023-03-11 15:27:33,385][41256] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10024.8). Total num frames: 8339456. Throughput: 0: 10109.8. Samples: 8326448. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 15:27:33,386][41256] Avg episode reward: [(0, '40.459')] +[2023-03-11 15:27:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000016288_8339456.pth... +[2023-03-11 15:27:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000015696_8036352.pth +[2023-03-11 15:27:34,896][41544] Updated weights for policy 0, policy_version 16320 (0.0005) +[2023-03-11 15:27:38,385][41256] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10024.8). Total num frames: 8388608. Throughput: 0: 10153.6. Samples: 8386832. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 15:27:38,386][41256] Avg episode reward: [(0, '40.970')] +[2023-03-11 15:27:38,828][41544] Updated weights for policy 0, policy_version 16400 (0.0004) +[2023-03-11 15:27:43,087][41544] Updated weights for policy 0, policy_version 16480 (0.0005) +[2023-03-11 15:27:43,385][41256] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10038.7). Total num frames: 8437760. Throughput: 0: 10155.4. Samples: 8416152. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 15:27:43,386][41256] Avg episode reward: [(0, '40.019')] +[2023-03-11 15:27:47,345][41544] Updated weights for policy 0, policy_version 16560 (0.0005) +[2023-03-11 15:27:48,385][41256] Fps is (10 sec: 9830.3, 60 sec: 10035.2, 300 sec: 10024.8). Total num frames: 8486912. Throughput: 0: 10077.0. Samples: 8474124. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 15:27:48,386][41256] Avg episode reward: [(0, '40.246')] +[2023-03-11 15:27:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000016576_8486912.pth... +[2023-03-11 15:27:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000016000_8192000.pth +[2023-03-11 15:27:51,567][41544] Updated weights for policy 0, policy_version 16640 (0.0005) +[2023-03-11 15:27:53,385][41256] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10024.8). Total num frames: 8536064. Throughput: 0: 9946.0. Samples: 8532040. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:27:53,386][41256] Avg episode reward: [(0, '40.521')] +[2023-03-11 15:27:55,575][41544] Updated weights for policy 0, policy_version 16720 (0.0005) +[2023-03-11 15:27:58,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10038.7). Total num frames: 8589312. Throughput: 0: 9957.3. Samples: 8563668. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:27:58,386][41256] Avg episode reward: [(0, '39.335')] +[2023-03-11 15:27:59,400][41544] Updated weights for policy 0, policy_version 16800 (0.0004) +[2023-03-11 15:28:03,386][41256] Fps is (10 sec: 10239.9, 60 sec: 10035.2, 300 sec: 10052.6). Total num frames: 8638464. Throughput: 0: 10012.7. Samples: 8626184. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:28:03,386][41256] Avg episode reward: [(0, '38.095')] +[2023-03-11 15:28:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000016872_8638464.pth... +[2023-03-11 15:28:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000016288_8339456.pth +[2023-03-11 15:28:03,618][41544] Updated weights for policy 0, policy_version 16880 (0.0005) +[2023-03-11 15:28:08,041][41544] Updated weights for policy 0, policy_version 16960 (0.0005) +[2023-03-11 15:28:08,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9898.6, 300 sec: 10038.7). Total num frames: 8683520. Throughput: 0: 9887.3. Samples: 8682000. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:28:08,386][41256] Avg episode reward: [(0, '38.968')] +[2023-03-11 15:28:12,304][41544] Updated weights for policy 0, policy_version 17040 (0.0005) +[2023-03-11 15:28:13,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9898.7, 300 sec: 10038.7). Total num frames: 8732672. Throughput: 0: 9859.0. Samples: 8711228. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:28:13,386][41256] Avg episode reward: [(0, '38.760')] +[2023-03-11 15:28:16,535][41544] Updated weights for policy 0, policy_version 17120 (0.0005) +[2023-03-11 15:28:18,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9898.7, 300 sec: 10038.7). Total num frames: 8781824. Throughput: 0: 9825.8. Samples: 8768608. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 15:28:18,386][41256] Avg episode reward: [(0, '39.061')] +[2023-03-11 15:28:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000017152_8781824.pth... 
+[2023-03-11 15:28:18,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000016576_8486912.pth +[2023-03-11 15:28:20,868][41544] Updated weights for policy 0, policy_version 17200 (0.0005) +[2023-03-11 15:28:23,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 10010.9). Total num frames: 8826880. Throughput: 0: 9736.3. Samples: 8824964. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 15:28:23,386][41256] Avg episode reward: [(0, '39.160')] +[2023-03-11 15:28:25,305][41544] Updated weights for policy 0, policy_version 17280 (0.0005) +[2023-03-11 15:28:28,386][41256] Fps is (10 sec: 9011.1, 60 sec: 9693.9, 300 sec: 9983.1). Total num frames: 8871936. Throughput: 0: 9679.7. Samples: 8851740. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:28:28,386][41256] Avg episode reward: [(0, '38.840')] +[2023-03-11 15:28:29,737][41544] Updated weights for policy 0, policy_version 17360 (0.0005) +[2023-03-11 15:28:33,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9983.1). Total num frames: 8921088. Throughput: 0: 9659.7. Samples: 8908812. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:28:33,386][41256] Avg episode reward: [(0, '38.696')] +[2023-03-11 15:28:33,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000017424_8921088.pth... +[2023-03-11 15:28:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000016872_8638464.pth +[2023-03-11 15:28:33,985][41544] Updated weights for policy 0, policy_version 17440 (0.0005) +[2023-03-11 15:28:38,274][41544] Updated weights for policy 0, policy_version 17520 (0.0005) +[2023-03-11 15:28:38,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9983.1). Total num frames: 8970240. Throughput: 0: 9648.2. Samples: 8966208. 
Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 15:28:38,386][41256] Avg episode reward: [(0, '40.320')] +[2023-03-11 15:28:42,535][41544] Updated weights for policy 0, policy_version 17600 (0.0005) +[2023-03-11 15:28:43,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9625.6, 300 sec: 9955.5). Total num frames: 9015296. Throughput: 0: 9580.4. Samples: 8994784. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 15:28:43,386][41256] Avg episode reward: [(0, '39.487')] +[2023-03-11 15:28:46,795][41544] Updated weights for policy 0, policy_version 17680 (0.0005) +[2023-03-11 15:28:48,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9955.4). Total num frames: 9064448. Throughput: 0: 9468.4. Samples: 9052260. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 15:28:48,386][41256] Avg episode reward: [(0, '41.302')] +[2023-03-11 15:28:48,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000017704_9064448.pth... +[2023-03-11 15:28:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000017152_8781824.pth +[2023-03-11 15:28:51,125][41544] Updated weights for policy 0, policy_version 17760 (0.0005) +[2023-03-11 15:28:53,385][41256] Fps is (10 sec: 9830.3, 60 sec: 9625.6, 300 sec: 9955.4). Total num frames: 9113600. Throughput: 0: 9501.7. Samples: 9109576. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:28:53,386][41256] Avg episode reward: [(0, '41.027')] +[2023-03-11 15:28:55,337][41544] Updated weights for policy 0, policy_version 17840 (0.0005) +[2023-03-11 15:28:58,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9955.4). Total num frames: 9162752. Throughput: 0: 9491.4. Samples: 9138340. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:28:58,386][41256] Avg episode reward: [(0, '41.889')] +[2023-03-11 15:28:59,633][41544] Updated weights for policy 0, policy_version 17920 (0.0005) +[2023-03-11 15:29:03,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9927.6). Total num frames: 9207808. Throughput: 0: 9508.5. Samples: 9196492. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:29:03,386][41256] Avg episode reward: [(0, '41.792')] +[2023-03-11 15:29:03,415][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000017992_9211904.pth... +[2023-03-11 15:29:03,417][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000017424_8921088.pth +[2023-03-11 15:29:03,832][41544] Updated weights for policy 0, policy_version 18000 (0.0005) +[2023-03-11 15:29:07,921][41544] Updated weights for policy 0, policy_version 18080 (0.0005) +[2023-03-11 15:29:08,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9941.5). Total num frames: 9261056. Throughput: 0: 9579.7. Samples: 9256052. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 15:29:08,386][41256] Avg episode reward: [(0, '40.927')] +[2023-03-11 15:29:12,052][41544] Updated weights for policy 0, policy_version 18160 (0.0005) +[2023-03-11 15:29:13,385][41256] Fps is (10 sec: 10240.1, 60 sec: 9625.6, 300 sec: 9955.4). Total num frames: 9310208. Throughput: 0: 9648.6. Samples: 9285928. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 15:29:13,386][41256] Avg episode reward: [(0, '40.765')] +[2023-03-11 15:29:16,167][41544] Updated weights for policy 0, policy_version 18240 (0.0005) +[2023-03-11 15:29:18,386][41256] Fps is (10 sec: 9830.3, 60 sec: 9625.6, 300 sec: 9955.4). Total num frames: 9359360. Throughput: 0: 9712.8. Samples: 9345888. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 15:29:18,386][41256] Avg episode reward: [(0, '41.436')] +[2023-03-11 15:29:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000018280_9359360.pth... +[2023-03-11 15:29:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000017704_9064448.pth +[2023-03-11 15:29:20,198][41544] Updated weights for policy 0, policy_version 18320 (0.0005) +[2023-03-11 15:29:23,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9941.5). Total num frames: 9408512. Throughput: 0: 9792.5. Samples: 9406868. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 15:29:23,386][41256] Avg episode reward: [(0, '41.468')] +[2023-03-11 15:29:24,188][41544] Updated weights for policy 0, policy_version 18400 (0.0005) +[2023-03-11 15:29:28,228][41544] Updated weights for policy 0, policy_version 18480 (0.0005) +[2023-03-11 15:29:28,385][41256] Fps is (10 sec: 10240.1, 60 sec: 9830.4, 300 sec: 9955.4). Total num frames: 9461760. Throughput: 0: 9851.0. Samples: 9438080. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 15:29:28,386][41256] Avg episode reward: [(0, '38.627')] +[2023-03-11 15:29:30,298][41500] KL-divergence is very high: 101.4700 +[2023-03-11 15:29:30,306][41500] KL-divergence is very high: 129.5406 +[2023-03-11 15:29:32,376][41544] Updated weights for policy 0, policy_version 18560 (0.0005) +[2023-03-11 15:29:33,385][41256] Fps is (10 sec: 10239.9, 60 sec: 9830.4, 300 sec: 9955.4). Total num frames: 9510912. Throughput: 0: 9889.8. Samples: 9497300. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 15:29:33,396][41256] Avg episode reward: [(0, '32.797')] +[2023-03-11 15:29:33,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000018576_9510912.pth... 
+[2023-03-11 15:29:33,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000017992_9211904.pth +[2023-03-11 15:29:36,359][41544] Updated weights for policy 0, policy_version 18640 (0.0005) +[2023-03-11 15:29:38,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9969.2). Total num frames: 9564160. Throughput: 0: 10001.3. Samples: 9559632. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 15:29:38,396][41256] Avg episode reward: [(0, '38.550')] +[2023-03-11 15:29:40,299][41544] Updated weights for policy 0, policy_version 18720 (0.0005) +[2023-03-11 15:29:43,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 9969.2). Total num frames: 9613312. Throughput: 0: 10031.4. Samples: 9589752. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:29:43,396][41256] Avg episode reward: [(0, '37.943')] +[2023-03-11 15:29:44,285][41544] Updated weights for policy 0, policy_version 18800 (0.0005) +[2023-03-11 15:29:48,098][41544] Updated weights for policy 0, policy_version 18880 (0.0004) +[2023-03-11 15:29:48,386][41256] Fps is (10 sec: 10239.9, 60 sec: 10035.2, 300 sec: 9969.2). Total num frames: 9666560. Throughput: 0: 10137.4. Samples: 9652676. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:29:48,396][41256] Avg episode reward: [(0, '37.572')] +[2023-03-11 15:29:48,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000018880_9666560.pth... +[2023-03-11 15:29:48,403][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000018280_9359360.pth +[2023-03-11 15:29:52,187][41544] Updated weights for policy 0, policy_version 18960 (0.0005) +[2023-03-11 15:29:53,385][41256] Fps is (10 sec: 10649.6, 60 sec: 10103.5, 300 sec: 9983.1). Total num frames: 9719808. Throughput: 0: 10186.4. Samples: 9714440. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:29:53,396][41256] Avg episode reward: [(0, '38.733')] +[2023-03-11 15:29:56,188][41544] Updated weights for policy 0, policy_version 19040 (0.0005) +[2023-03-11 15:29:58,385][41256] Fps is (10 sec: 10240.1, 60 sec: 10103.5, 300 sec: 9969.2). Total num frames: 9768960. Throughput: 0: 10194.4. Samples: 9744676. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:29:58,386][41256] Avg episode reward: [(0, '34.631')] +[2023-03-11 15:30:00,360][41544] Updated weights for policy 0, policy_version 19120 (0.0005) +[2023-03-11 15:30:03,385][41256] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 9969.2). Total num frames: 9818112. Throughput: 0: 10191.0. Samples: 9804484. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:30:03,386][41256] Avg episode reward: [(0, '33.134')] +[2023-03-11 15:30:03,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000019176_9818112.pth... +[2023-03-11 15:30:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000018576_9510912.pth +[2023-03-11 15:30:04,500][41544] Updated weights for policy 0, policy_version 19200 (0.0005) +[2023-03-11 15:30:08,385][41256] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 9955.4). Total num frames: 9867264. Throughput: 0: 10172.2. Samples: 9864616. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:30:08,397][41256] Avg episode reward: [(0, '35.180')] +[2023-03-11 15:30:08,480][41544] Updated weights for policy 0, policy_version 19280 (0.0004) +[2023-03-11 15:30:12,744][41544] Updated weights for policy 0, policy_version 19360 (0.0004) +[2023-03-11 15:30:13,385][41256] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 9941.5). Total num frames: 9916416. Throughput: 0: 10154.7. Samples: 9895040. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 15:30:13,396][41256] Avg episode reward: [(0, '39.254')] +[2023-03-11 15:30:16,952][41544] Updated weights for policy 0, policy_version 19440 (0.0005) +[2023-03-11 15:30:18,386][41256] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 9941.5). Total num frames: 9965568. Throughput: 0: 10105.2. Samples: 9952036. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 15:30:18,386][41256] Avg episode reward: [(0, '35.775')] +[2023-03-11 15:30:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000019464_9965568.pth... +[2023-03-11 15:30:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000018880_9666560.pth +[2023-03-11 15:30:21,132][41544] Updated weights for policy 0, policy_version 19520 (0.0005) +[2023-03-11 15:30:23,385][41256] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 9927.6). Total num frames: 10014720. Throughput: 0: 10023.5. Samples: 10010688. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 15:30:23,386][41256] Avg episode reward: [(0, '36.995')] +[2023-03-11 15:30:25,455][41544] Updated weights for policy 0, policy_version 19600 (0.0005) +[2023-03-11 15:30:28,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9966.9, 300 sec: 9913.7). Total num frames: 10059776. Throughput: 0: 9987.4. Samples: 10039184. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 15:30:28,386][41256] Avg episode reward: [(0, '37.124')] +[2023-03-11 15:30:29,851][41544] Updated weights for policy 0, policy_version 19680 (0.0005) +[2023-03-11 15:30:33,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9966.9, 300 sec: 9927.6). Total num frames: 10108928. Throughput: 0: 9828.7. Samples: 10094968. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 15:30:33,386][41256] Avg episode reward: [(0, '37.548')] +[2023-03-11 15:30:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000019744_10108928.pth... +[2023-03-11 15:30:33,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000019176_9818112.pth +[2023-03-11 15:30:34,123][41544] Updated weights for policy 0, policy_version 19760 (0.0005) +[2023-03-11 15:30:38,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9830.4, 300 sec: 9913.7). Total num frames: 10153984. Throughput: 0: 9738.4. Samples: 10152668. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:30:38,386][41256] Avg episode reward: [(0, '39.008')] +[2023-03-11 15:30:38,421][41544] Updated weights for policy 0, policy_version 19840 (0.0005) +[2023-03-11 15:30:42,553][41544] Updated weights for policy 0, policy_version 19920 (0.0005) +[2023-03-11 15:30:43,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9830.4, 300 sec: 9899.8). Total num frames: 10203136. Throughput: 0: 9716.6. Samples: 10181924. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:30:43,386][41256] Avg episode reward: [(0, '39.096')] +[2023-03-11 15:30:46,684][41544] Updated weights for policy 0, policy_version 20000 (0.0005) +[2023-03-11 15:30:48,386][41256] Fps is (10 sec: 10239.9, 60 sec: 9830.4, 300 sec: 9913.7). Total num frames: 10256384. Throughput: 0: 9711.5. Samples: 10241500. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:30:48,386][41256] Avg episode reward: [(0, '39.615')] +[2023-03-11 15:30:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000020032_10256384.pth... 
+[2023-03-11 15:30:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000019464_9965568.pth +[2023-03-11 15:30:50,609][41544] Updated weights for policy 0, policy_version 20080 (0.0005) +[2023-03-11 15:30:53,385][41256] Fps is (10 sec: 10240.1, 60 sec: 9762.1, 300 sec: 9899.8). Total num frames: 10305536. Throughput: 0: 9743.3. Samples: 10303064. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 15:30:53,396][41256] Avg episode reward: [(0, '37.924')] +[2023-03-11 15:30:54,703][41544] Updated weights for policy 0, policy_version 20160 (0.0005) +[2023-03-11 15:30:58,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9762.1, 300 sec: 9899.8). Total num frames: 10354688. Throughput: 0: 9741.1. Samples: 10333388. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 15:30:58,396][41256] Avg episode reward: [(0, '38.825')] +[2023-03-11 15:30:58,817][41544] Updated weights for policy 0, policy_version 20240 (0.0005) +[2023-03-11 15:31:02,828][41544] Updated weights for policy 0, policy_version 20320 (0.0005) +[2023-03-11 15:31:03,386][41256] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 9913.7). Total num frames: 10407936. Throughput: 0: 9818.1. Samples: 10393852. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 15:31:03,386][41256] Avg episode reward: [(0, '38.290')] +[2023-03-11 15:31:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000020328_10407936.pth... +[2023-03-11 15:31:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000019744_10108928.pth +[2023-03-11 15:31:07,058][41544] Updated weights for policy 0, policy_version 20400 (0.0005) +[2023-03-11 15:31:08,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 9927.6). Total num frames: 10457088. Throughput: 0: 9825.8. Samples: 10452848. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:31:08,386][41256] Avg episode reward: [(0, '37.191')] +[2023-03-11 15:31:11,247][41544] Updated weights for policy 0, policy_version 20480 (0.0005) +[2023-03-11 15:31:13,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9830.4, 300 sec: 9927.6). Total num frames: 10506240. Throughput: 0: 9834.5. Samples: 10481736. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:31:13,386][41256] Avg episode reward: [(0, '39.115')] +[2023-03-11 15:31:15,428][41544] Updated weights for policy 0, policy_version 20560 (0.0004) +[2023-03-11 15:31:18,386][41256] Fps is (10 sec: 9830.3, 60 sec: 9830.4, 300 sec: 9913.7). Total num frames: 10555392. Throughput: 0: 9907.4. Samples: 10540800. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:31:18,386][41256] Avg episode reward: [(0, '40.227')] +[2023-03-11 15:31:18,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000020616_10555392.pth... +[2023-03-11 15:31:18,393][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000020032_10256384.pth +[2023-03-11 15:31:19,699][41544] Updated weights for policy 0, policy_version 20640 (0.0004) +[2023-03-11 15:31:23,385][41256] Fps is (10 sec: 9830.3, 60 sec: 9830.4, 300 sec: 9913.7). Total num frames: 10604544. Throughput: 0: 9943.8. Samples: 10600140. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:31:23,386][41256] Avg episode reward: [(0, '40.016')] +[2023-03-11 15:31:23,714][41544] Updated weights for policy 0, policy_version 20720 (0.0003) +[2023-03-11 15:31:27,920][41544] Updated weights for policy 0, policy_version 20800 (0.0003) +[2023-03-11 15:31:28,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9898.7, 300 sec: 9899.8). Total num frames: 10653696. Throughput: 0: 9937.7. Samples: 10629120. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:31:28,386][41256] Avg episode reward: [(0, '39.469')] +[2023-03-11 15:31:31,974][41544] Updated weights for policy 0, policy_version 20880 (0.0003) +[2023-03-11 15:31:33,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9899.8). Total num frames: 10702848. Throughput: 0: 9944.6. Samples: 10689008. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:31:33,386][41256] Avg episode reward: [(0, '39.977')] +[2023-03-11 15:31:33,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000020904_10702848.pth... +[2023-03-11 15:31:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000020328_10407936.pth +[2023-03-11 15:31:36,158][41544] Updated weights for policy 0, policy_version 20960 (0.0004) +[2023-03-11 15:31:38,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9885.9). Total num frames: 10752000. Throughput: 0: 9897.7. Samples: 10748460. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:31:38,386][41256] Avg episode reward: [(0, '40.242')] +[2023-03-11 15:31:40,217][41544] Updated weights for policy 0, policy_version 21040 (0.0003) +[2023-03-11 15:31:43,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9966.9, 300 sec: 9885.9). Total num frames: 10801152. Throughput: 0: 9890.0. Samples: 10778440. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:31:43,386][41256] Avg episode reward: [(0, '39.922')] +[2023-03-11 15:31:44,243][41544] Updated weights for policy 0, policy_version 21120 (0.0004) +[2023-03-11 15:31:48,366][41544] Updated weights for policy 0, policy_version 21200 (0.0004) +[2023-03-11 15:31:48,386][41256] Fps is (10 sec: 10239.9, 60 sec: 9966.9, 300 sec: 9899.8). Total num frames: 10854400. Throughput: 0: 9890.0. Samples: 10838904. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 15:31:48,386][41256] Avg episode reward: [(0, '37.732')] +[2023-03-11 15:31:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000021200_10854400.pth... +[2023-03-11 15:31:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000020616_10555392.pth +[2023-03-11 15:31:52,452][41544] Updated weights for policy 0, policy_version 21280 (0.0004) +[2023-03-11 15:31:53,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 9899.8). Total num frames: 10903552. Throughput: 0: 9924.6. Samples: 10899456. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 15:31:53,386][41256] Avg episode reward: [(0, '38.475')] +[2023-03-11 15:31:56,426][41544] Updated weights for policy 0, policy_version 21360 (0.0004) +[2023-03-11 15:31:58,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9966.9, 300 sec: 9885.9). Total num frames: 10952704. Throughput: 0: 9963.1. Samples: 10930076. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 15:31:58,386][41256] Avg episode reward: [(0, '39.414')] +[2023-03-11 15:32:00,356][41544] Updated weights for policy 0, policy_version 21440 (0.0005) +[2023-03-11 15:32:03,386][41256] Fps is (10 sec: 10239.9, 60 sec: 9966.9, 300 sec: 9885.9). Total num frames: 11005952. Throughput: 0: 10036.4. Samples: 10992436. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:32:03,386][41256] Avg episode reward: [(0, '39.804')] +[2023-03-11 15:32:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000021496_11005952.pth... 
+[2023-03-11 15:32:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000020904_10702848.pth +[2023-03-11 15:32:04,413][41544] Updated weights for policy 0, policy_version 21520 (0.0005) +[2023-03-11 15:32:08,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 9885.9). Total num frames: 11055104. Throughput: 0: 10020.7. Samples: 11051072. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:32:08,386][41256] Avg episode reward: [(0, '39.608')] +[2023-03-11 15:32:08,683][41544] Updated weights for policy 0, policy_version 21600 (0.0005) +[2023-03-11 15:32:13,103][41544] Updated weights for policy 0, policy_version 21680 (0.0005) +[2023-03-11 15:32:13,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9898.7, 300 sec: 9872.1). Total num frames: 11100160. Throughput: 0: 10005.9. Samples: 11079384. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:32:13,386][41256] Avg episode reward: [(0, '40.828')] +[2023-03-11 15:32:17,404][41544] Updated weights for policy 0, policy_version 21760 (0.0005) +[2023-03-11 15:32:18,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9898.7, 300 sec: 9858.2). Total num frames: 11149312. Throughput: 0: 9938.2. Samples: 11136228. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:32:18,386][41256] Avg episode reward: [(0, '40.566')] +[2023-03-11 15:32:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000021776_11149312.pth... +[2023-03-11 15:32:18,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000021200_10854400.pth +[2023-03-11 15:32:21,434][41544] Updated weights for policy 0, policy_version 21840 (0.0005) +[2023-03-11 15:32:23,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9898.7, 300 sec: 9858.2). Total num frames: 11198464. Throughput: 0: 9947.6. Samples: 11196100. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:32:23,386][41256] Avg episode reward: [(0, '40.578')] +[2023-03-11 15:32:25,548][41544] Updated weights for policy 0, policy_version 21920 (0.0005) +[2023-03-11 15:32:28,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9858.2). Total num frames: 11247616. Throughput: 0: 9947.3. Samples: 11226068. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:32:28,386][41256] Avg episode reward: [(0, '40.762')] +[2023-03-11 15:32:29,672][41544] Updated weights for policy 0, policy_version 22000 (0.0004) +[2023-03-11 15:32:33,385][41256] Fps is (10 sec: 10239.9, 60 sec: 9966.9, 300 sec: 9872.1). Total num frames: 11300864. Throughput: 0: 9946.0. Samples: 11286472. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:32:33,386][41256] Avg episode reward: [(0, '41.249')] +[2023-03-11 15:32:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000022072_11300864.pth... +[2023-03-11 15:32:33,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000021496_11005952.pth +[2023-03-11 15:32:33,792][41544] Updated weights for policy 0, policy_version 22080 (0.0004) +[2023-03-11 15:32:37,764][41544] Updated weights for policy 0, policy_version 22160 (0.0005) +[2023-03-11 15:32:38,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 9872.1). Total num frames: 11350016. Throughput: 0: 9932.8. Samples: 11346432. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:32:38,386][41256] Avg episode reward: [(0, '41.081')] +[2023-03-11 15:32:41,733][41544] Updated weights for policy 0, policy_version 22240 (0.0005) +[2023-03-11 15:32:43,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9885.9). Total num frames: 11403264. Throughput: 0: 9947.6. Samples: 11377716. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:32:43,386][41256] Avg episode reward: [(0, '41.267')] +[2023-03-11 15:32:45,770][41544] Updated weights for policy 0, policy_version 22320 (0.0005) +[2023-03-11 15:32:48,385][41256] Fps is (10 sec: 10239.9, 60 sec: 9966.9, 300 sec: 9885.9). Total num frames: 11452416. Throughput: 0: 9920.8. Samples: 11438872. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:32:48,386][41256] Avg episode reward: [(0, '41.299')] +[2023-03-11 15:32:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000022368_11452416.pth... +[2023-03-11 15:32:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000021776_11149312.pth +[2023-03-11 15:32:49,884][41544] Updated weights for policy 0, policy_version 22400 (0.0006) +[2023-03-11 15:32:53,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9872.1). Total num frames: 11501568. Throughput: 0: 9946.4. Samples: 11498660. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:32:53,386][41256] Avg episode reward: [(0, '41.705')] +[2023-03-11 15:32:53,999][41544] Updated weights for policy 0, policy_version 22480 (0.0005) +[2023-03-11 15:32:58,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9898.7, 300 sec: 9858.2). Total num frames: 11546624. Throughput: 0: 9933.2. Samples: 11526376. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:32:58,386][41256] Avg episode reward: [(0, '40.122')] +[2023-03-11 15:32:58,416][41544] Updated weights for policy 0, policy_version 22560 (0.0005) +[2023-03-11 15:33:02,581][41544] Updated weights for policy 0, policy_version 22640 (0.0005) +[2023-03-11 15:33:03,385][41256] Fps is (10 sec: 9830.3, 60 sec: 9898.7, 300 sec: 9885.9). Total num frames: 11599872. Throughput: 0: 9949.9. Samples: 11583972. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:33:03,386][41256] Avg episode reward: [(0, '40.871')] +[2023-03-11 15:33:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000022656_11599872.pth... +[2023-03-11 15:33:03,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000022072_11300864.pth +[2023-03-11 15:33:06,606][41544] Updated weights for policy 0, policy_version 22720 (0.0003) +[2023-03-11 15:33:08,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9885.9). Total num frames: 11649024. Throughput: 0: 9972.0. Samples: 11644840. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:33:08,386][41256] Avg episode reward: [(0, '40.338')] +[2023-03-11 15:33:10,804][41544] Updated weights for policy 0, policy_version 22800 (0.0004) +[2023-03-11 15:33:13,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9885.9). Total num frames: 11698176. Throughput: 0: 9946.8. Samples: 11673672. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:33:13,386][41256] Avg episode reward: [(0, '40.209')] +[2023-03-11 15:33:15,076][41544] Updated weights for policy 0, policy_version 22880 (0.0005) +[2023-03-11 15:33:18,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9899.8). Total num frames: 11747328. Throughput: 0: 9902.8. Samples: 11732096. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:33:18,386][41256] Avg episode reward: [(0, '37.971')] +[2023-03-11 15:33:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000022944_11747328.pth... 
+[2023-03-11 15:33:18,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000022368_11452416.pth +[2023-03-11 15:33:19,124][41544] Updated weights for policy 0, policy_version 22960 (0.0005) +[2023-03-11 15:33:23,013][41544] Updated weights for policy 0, policy_version 23040 (0.0005) +[2023-03-11 15:33:23,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9927.6). Total num frames: 11800576. Throughput: 0: 9965.0. Samples: 11794856. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 15:33:23,396][41256] Avg episode reward: [(0, '38.247')] +[2023-03-11 15:33:26,803][41544] Updated weights for policy 0, policy_version 23120 (0.0005) +[2023-03-11 15:33:28,385][41256] Fps is (10 sec: 10649.6, 60 sec: 10103.5, 300 sec: 9941.5). Total num frames: 11853824. Throughput: 0: 9990.4. Samples: 11827284. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 15:33:28,396][41256] Avg episode reward: [(0, '30.938')] +[2023-03-11 15:33:30,564][41544] Updated weights for policy 0, policy_version 23200 (0.0004) +[2023-03-11 15:33:33,385][41256] Fps is (10 sec: 10649.6, 60 sec: 10103.5, 300 sec: 9955.4). Total num frames: 11907072. Throughput: 0: 10063.6. Samples: 11891732. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 15:33:33,396][41256] Avg episode reward: [(0, '36.037')] +[2023-03-11 15:33:33,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000023256_11907072.pth... +[2023-03-11 15:33:33,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000022656_11599872.pth +[2023-03-11 15:33:34,460][41544] Updated weights for policy 0, policy_version 23280 (0.0005) +[2023-03-11 15:33:38,385][41256] Fps is (10 sec: 10240.1, 60 sec: 10103.5, 300 sec: 9969.2). Total num frames: 11956224. Throughput: 0: 10106.6. Samples: 11953456. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:33:38,396][41256] Avg episode reward: [(0, '36.232')] +[2023-03-11 15:33:38,476][41544] Updated weights for policy 0, policy_version 23360 (0.0005) +[2023-03-11 15:33:42,446][41544] Updated weights for policy 0, policy_version 23440 (0.0005) +[2023-03-11 15:33:43,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 9983.1). Total num frames: 12009472. Throughput: 0: 10187.5. Samples: 11984812. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:33:43,396][41256] Avg episode reward: [(0, '39.752')] +[2023-03-11 15:33:46,713][41544] Updated weights for policy 0, policy_version 23520 (0.0005) +[2023-03-11 15:33:48,385][41256] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 9969.2). Total num frames: 12054528. Throughput: 0: 10209.8. Samples: 12043412. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:33:48,396][41256] Avg episode reward: [(0, '40.583')] +[2023-03-11 15:33:48,452][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000023552_12058624.pth... +[2023-03-11 15:33:48,454][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000022944_11747328.pth +[2023-03-11 15:33:50,950][41544] Updated weights for policy 0, policy_version 23600 (0.0005) +[2023-03-11 15:33:53,385][41256] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 9983.1). Total num frames: 12107776. Throughput: 0: 10166.8. Samples: 12102344. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:33:53,396][41256] Avg episode reward: [(0, '40.929')] +[2023-03-11 15:33:55,014][41544] Updated weights for policy 0, policy_version 23680 (0.0005) +[2023-03-11 15:33:58,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 9997.0). Total num frames: 12156928. Throughput: 0: 10193.2. Samples: 12132364. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:33:58,386][41256] Avg episode reward: [(0, '40.235')] +[2023-03-11 15:33:59,224][41544] Updated weights for policy 0, policy_version 23760 (0.0005) +[2023-03-11 15:34:03,386][41256] Fps is (10 sec: 9420.8, 60 sec: 10035.2, 300 sec: 9969.2). Total num frames: 12201984. Throughput: 0: 10168.9. Samples: 12189696. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:34:03,386][41256] Avg episode reward: [(0, '40.892')] +[2023-03-11 15:34:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000023832_12201984.pth... +[2023-03-11 15:34:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000023256_11907072.pth +[2023-03-11 15:34:03,601][41544] Updated weights for policy 0, policy_version 23840 (0.0005) +[2023-03-11 15:34:07,848][41544] Updated weights for policy 0, policy_version 23920 (0.0005) +[2023-03-11 15:34:08,385][41256] Fps is (10 sec: 9420.8, 60 sec: 10035.2, 300 sec: 9969.2). Total num frames: 12251136. Throughput: 0: 10048.8. Samples: 12247052. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:34:08,386][41256] Avg episode reward: [(0, '40.657')] +[2023-03-11 15:34:11,893][41544] Updated weights for policy 0, policy_version 24000 (0.0005) +[2023-03-11 15:34:13,385][41256] Fps is (10 sec: 9830.5, 60 sec: 10035.2, 300 sec: 9969.3). Total num frames: 12300288. Throughput: 0: 9987.7. Samples: 12276732. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:34:13,386][41256] Avg episode reward: [(0, '40.644')] +[2023-03-11 15:34:16,108][41544] Updated weights for policy 0, policy_version 24080 (0.0005) +[2023-03-11 15:34:18,386][41256] Fps is (10 sec: 9830.3, 60 sec: 10035.2, 300 sec: 9969.2). Total num frames: 12349440. Throughput: 0: 9883.1. Samples: 12336472. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:34:18,386][41256] Avg episode reward: [(0, '41.243')] +[2023-03-11 15:34:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000024120_12349440.pth... +[2023-03-11 15:34:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000023552_12058624.pth +[2023-03-11 15:34:20,271][41544] Updated weights for policy 0, policy_version 24160 (0.0005) +[2023-03-11 15:34:23,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9969.2). Total num frames: 12402688. Throughput: 0: 9852.5. Samples: 12396820. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:34:23,396][41256] Avg episode reward: [(0, '40.516')] +[2023-03-11 15:34:24,122][41544] Updated weights for policy 0, policy_version 24240 (0.0004) +[2023-03-11 15:34:28,126][41544] Updated weights for policy 0, policy_version 24320 (0.0005) +[2023-03-11 15:34:28,385][41256] Fps is (10 sec: 10240.1, 60 sec: 9966.9, 300 sec: 9969.2). Total num frames: 12451840. Throughput: 0: 9852.6. Samples: 12428180. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:34:28,386][41256] Avg episode reward: [(0, '40.553')] +[2023-03-11 15:34:32,137][41544] Updated weights for policy 0, policy_version 24400 (0.0005) +[2023-03-11 15:34:33,386][41256] Fps is (10 sec: 10239.9, 60 sec: 9966.9, 300 sec: 9969.2). Total num frames: 12505088. Throughput: 0: 9919.8. Samples: 12489804. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:34:33,396][41256] Avg episode reward: [(0, '41.705')] +[2023-03-11 15:34:33,399][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000024424_12505088.pth... 
+[2023-03-11 15:34:33,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000023832_12201984.pth +[2023-03-11 15:34:36,136][41544] Updated weights for policy 0, policy_version 24480 (0.0005) +[2023-03-11 15:34:38,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 9969.2). Total num frames: 12554240. Throughput: 0: 9965.2. Samples: 12550776. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:34:38,397][41256] Avg episode reward: [(0, '41.406')] +[2023-03-11 15:34:40,243][41544] Updated weights for policy 0, policy_version 24560 (0.0005) +[2023-03-11 15:34:43,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9898.7, 300 sec: 9955.4). Total num frames: 12603392. Throughput: 0: 9957.2. Samples: 12580436. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:34:43,396][41256] Avg episode reward: [(0, '39.958')] +[2023-03-11 15:34:44,266][41544] Updated weights for policy 0, policy_version 24640 (0.0005) +[2023-03-11 15:34:48,251][41544] Updated weights for policy 0, policy_version 24720 (0.0005) +[2023-03-11 15:34:48,386][41256] Fps is (10 sec: 10239.9, 60 sec: 10035.2, 300 sec: 9955.4). Total num frames: 12656640. Throughput: 0: 10055.6. Samples: 12642200. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:34:48,396][41256] Avg episode reward: [(0, '40.583')] +[2023-03-11 15:34:48,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000024720_12656640.pth... +[2023-03-11 15:34:48,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000024120_12349440.pth +[2023-03-11 15:34:52,184][41544] Updated weights for policy 0, policy_version 24800 (0.0005) +[2023-03-11 15:34:53,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 9955.4). Total num frames: 12705792. Throughput: 0: 10161.3. Samples: 12704312. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:34:53,396][41256] Avg episode reward: [(0, '40.442')] +[2023-03-11 15:34:56,200][41544] Updated weights for policy 0, policy_version 24880 (0.0004) +[2023-03-11 15:34:58,385][41256] Fps is (10 sec: 10240.1, 60 sec: 10035.2, 300 sec: 9969.2). Total num frames: 12759040. Throughput: 0: 10173.4. Samples: 12734536. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 15:34:58,386][41256] Avg episode reward: [(0, '38.068')] +[2023-03-11 15:35:00,229][41544] Updated weights for policy 0, policy_version 24960 (0.0004) +[2023-03-11 15:35:03,386][41256] Fps is (10 sec: 10649.4, 60 sec: 10171.7, 300 sec: 9983.1). Total num frames: 12812288. Throughput: 0: 10216.7. Samples: 12796224. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 15:35:03,386][41256] Avg episode reward: [(0, '40.879')] +[2023-03-11 15:35:03,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000025024_12812288.pth... +[2023-03-11 15:35:03,393][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000024424_12505088.pth +[2023-03-11 15:35:04,198][41544] Updated weights for policy 0, policy_version 25040 (0.0005) +[2023-03-11 15:35:08,331][41544] Updated weights for policy 0, policy_version 25120 (0.0005) +[2023-03-11 15:35:08,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 9983.1). Total num frames: 12861440. Throughput: 0: 10217.3. Samples: 12856600. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 15:35:08,396][41256] Avg episode reward: [(0, '41.960')] +[2023-03-11 15:35:12,376][41544] Updated weights for policy 0, policy_version 25200 (0.0005) +[2023-03-11 15:35:13,385][41256] Fps is (10 sec: 9830.5, 60 sec: 10171.7, 300 sec: 9983.1). Total num frames: 12910592. Throughput: 0: 10194.6. Samples: 12886936. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 15:35:13,386][41256] Avg episode reward: [(0, '42.303')] +[2023-03-11 15:35:16,423][41544] Updated weights for policy 0, policy_version 25280 (0.0004) +[2023-03-11 15:35:18,386][41256] Fps is (10 sec: 9830.3, 60 sec: 10171.7, 300 sec: 9983.1). Total num frames: 12959744. Throughput: 0: 10170.0. Samples: 12947456. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 15:35:18,386][41256] Avg episode reward: [(0, '43.590')] +[2023-03-11 15:35:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000025312_12959744.pth... +[2023-03-11 15:35:18,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000024720_12656640.pth +[2023-03-11 15:35:20,468][41544] Updated weights for policy 0, policy_version 25360 (0.0005) +[2023-03-11 15:35:23,385][41256] Fps is (10 sec: 9830.5, 60 sec: 10103.5, 300 sec: 9997.0). Total num frames: 13008896. Throughput: 0: 10151.7. Samples: 13007600. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 15:35:23,386][41256] Avg episode reward: [(0, '42.239')] +[2023-03-11 15:35:24,669][41544] Updated weights for policy 0, policy_version 25440 (0.0005) +[2023-03-11 15:35:28,385][41256] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 9997.0). Total num frames: 13058048. Throughput: 0: 10146.1. Samples: 13037012. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 15:35:28,386][41256] Avg episode reward: [(0, '42.186')] +[2023-03-11 15:35:28,812][41544] Updated weights for policy 0, policy_version 25520 (0.0003) +[2023-03-11 15:35:33,163][41544] Updated weights for policy 0, policy_version 25600 (0.0005) +[2023-03-11 15:35:33,386][41256] Fps is (10 sec: 9830.3, 60 sec: 10035.2, 300 sec: 10010.9). Total num frames: 13107200. Throughput: 0: 10060.4. Samples: 13094920. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 15:35:33,386][41256] Avg episode reward: [(0, '35.889')] +[2023-03-11 15:35:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000025600_13107200.pth... +[2023-03-11 15:35:33,390][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000025024_12812288.pth +[2023-03-11 15:35:37,152][41544] Updated weights for policy 0, policy_version 25680 (0.0005) +[2023-03-11 15:35:38,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10024.8). Total num frames: 13160448. Throughput: 0: 10023.7. Samples: 13155380. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 15:35:38,386][41256] Avg episode reward: [(0, '35.099')] +[2023-03-11 15:35:41,069][41544] Updated weights for policy 0, policy_version 25760 (0.0005) +[2023-03-11 15:35:43,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10010.9). Total num frames: 13209600. Throughput: 0: 10051.4. Samples: 13186848. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 15:35:43,386][41256] Avg episode reward: [(0, '35.328')] +[2023-03-11 15:35:44,961][41544] Updated weights for policy 0, policy_version 25840 (0.0005) +[2023-03-11 15:35:48,386][41256] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10024.8). Total num frames: 13262848. Throughput: 0: 10082.0. Samples: 13249912. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 15:35:48,386][41256] Avg episode reward: [(0, '33.690')] +[2023-03-11 15:35:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000025904_13262848.pth... 
+[2023-03-11 15:35:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000025312_12959744.pth +[2023-03-11 15:35:48,853][41544] Updated weights for policy 0, policy_version 25920 (0.0005) +[2023-03-11 15:35:52,860][41544] Updated weights for policy 0, policy_version 26000 (0.0005) +[2023-03-11 15:35:53,385][41256] Fps is (10 sec: 10649.6, 60 sec: 10171.7, 300 sec: 10038.7). Total num frames: 13316096. Throughput: 0: 10120.0. Samples: 13312000. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 15:35:53,386][41256] Avg episode reward: [(0, '35.745')] +[2023-03-11 15:35:56,761][41544] Updated weights for policy 0, policy_version 26080 (0.0005) +[2023-03-11 15:35:58,385][41256] Fps is (10 sec: 10240.1, 60 sec: 10103.5, 300 sec: 10024.8). Total num frames: 13365248. Throughput: 0: 10142.5. Samples: 13343348. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 15:35:58,386][41256] Avg episode reward: [(0, '35.161')] +[2023-03-11 15:36:00,864][41544] Updated weights for policy 0, policy_version 26160 (0.0005) +[2023-03-11 15:36:03,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10038.7). Total num frames: 13418496. Throughput: 0: 10143.9. Samples: 13403932. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 15:36:03,386][41256] Avg episode reward: [(0, '32.651')] +[2023-03-11 15:36:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000026208_13418496.pth... +[2023-03-11 15:36:03,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000025600_13107200.pth +[2023-03-11 15:36:04,829][41544] Updated weights for policy 0, policy_version 26240 (0.0005) +[2023-03-11 15:36:08,385][41256] Fps is (10 sec: 10649.5, 60 sec: 10171.7, 300 sec: 10052.6). Total num frames: 13471744. Throughput: 0: 10197.8. Samples: 13466500. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:36:08,386][41256] Avg episode reward: [(0, '34.879')] +[2023-03-11 15:36:08,764][41544] Updated weights for policy 0, policy_version 26320 (0.0005) +[2023-03-11 15:36:12,750][41544] Updated weights for policy 0, policy_version 26400 (0.0005) +[2023-03-11 15:36:13,385][41256] Fps is (10 sec: 10240.1, 60 sec: 10171.7, 300 sec: 10052.6). Total num frames: 13520896. Throughput: 0: 10229.3. Samples: 13497332. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:36:13,386][41256] Avg episode reward: [(0, '35.008')] +[2023-03-11 15:36:16,858][41544] Updated weights for policy 0, policy_version 26480 (0.0004) +[2023-03-11 15:36:18,386][41256] Fps is (10 sec: 9830.3, 60 sec: 10171.7, 300 sec: 10052.6). Total num frames: 13570048. Throughput: 0: 10285.3. Samples: 13557760. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:36:18,386][41256] Avg episode reward: [(0, '37.566')] +[2023-03-11 15:36:18,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000026504_13570048.pth... +[2023-03-11 15:36:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000025904_13262848.pth +[2023-03-11 15:36:20,895][41544] Updated weights for policy 0, policy_version 26560 (0.0005) +[2023-03-11 15:36:23,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10066.4). Total num frames: 13623296. Throughput: 0: 10300.6. Samples: 13618908. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:36:23,386][41256] Avg episode reward: [(0, '35.110')] +[2023-03-11 15:36:24,886][41544] Updated weights for policy 0, policy_version 26640 (0.0005) +[2023-03-11 15:36:28,088][41500] Early stopping after 2 epochs (8 sgd steps), loss delta 0.0000009 +[2023-03-11 15:36:28,385][41256] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10066.4). Total num frames: 13672448. Throughput: 0: 10278.9. 
Samples: 13649400. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:36:28,386][41256] Avg episode reward: [(0, '38.828')] +[2023-03-11 15:36:28,918][41544] Updated weights for policy 0, policy_version 26720 (0.0005) +[2023-03-11 15:36:32,847][41544] Updated weights for policy 0, policy_version 26800 (0.0005) +[2023-03-11 15:36:33,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10080.3). Total num frames: 13725696. Throughput: 0: 10255.1. Samples: 13711392. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:36:33,386][41256] Avg episode reward: [(0, '38.468')] +[2023-03-11 15:36:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000026808_13725696.pth... +[2023-03-11 15:36:33,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000026208_13418496.pth +[2023-03-11 15:36:36,823][41544] Updated weights for policy 0, policy_version 26880 (0.0005) +[2023-03-11 15:36:38,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10080.3). Total num frames: 13774848. Throughput: 0: 10230.2. Samples: 13772360. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:36:38,386][41256] Avg episode reward: [(0, '39.573')] +[2023-03-11 15:36:40,863][41544] Updated weights for policy 0, policy_version 26960 (0.0005) +[2023-03-11 15:36:43,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10080.3). Total num frames: 13828096. Throughput: 0: 10226.0. Samples: 13803520. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:36:43,386][41256] Avg episode reward: [(0, '38.336')] +[2023-03-11 15:36:44,821][41544] Updated weights for policy 0, policy_version 27040 (0.0005) +[2023-03-11 15:36:48,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10080.3). Total num frames: 13877248. Throughput: 0: 10249.6. Samples: 13865164. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:36:48,386][41256] Avg episode reward: [(0, '37.925')] +[2023-03-11 15:36:48,407][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000027112_13881344.pth... +[2023-03-11 15:36:48,409][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000026504_13570048.pth +[2023-03-11 15:36:48,821][41544] Updated weights for policy 0, policy_version 27120 (0.0005) +[2023-03-11 15:36:52,822][41544] Updated weights for policy 0, policy_version 27200 (0.0005) +[2023-03-11 15:36:53,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10094.2). Total num frames: 13930496. Throughput: 0: 10221.4. Samples: 13926464. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:36:53,386][41256] Avg episode reward: [(0, '37.215')] +[2023-03-11 15:36:56,788][41544] Updated weights for policy 0, policy_version 27280 (0.0005) +[2023-03-11 15:36:58,385][41256] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10080.3). Total num frames: 13979648. Throughput: 0: 10223.6. Samples: 13957392. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:36:58,386][41256] Avg episode reward: [(0, '35.965')] +[2023-03-11 15:37:00,798][41544] Updated weights for policy 0, policy_version 27360 (0.0005) +[2023-03-11 15:37:03,386][41256] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10094.2). Total num frames: 14032896. Throughput: 0: 10249.2. Samples: 14018972. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 15:37:03,386][41256] Avg episode reward: [(0, '36.081')] +[2023-03-11 15:37:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000027408_14032896.pth... 
+[2023-03-11 15:37:03,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000026808_13725696.pth +[2023-03-11 15:37:04,784][41544] Updated weights for policy 0, policy_version 27440 (0.0005) +[2023-03-11 15:37:08,385][41256] Fps is (10 sec: 10649.6, 60 sec: 10240.0, 300 sec: 10122.0). Total num frames: 14086144. Throughput: 0: 10253.1. Samples: 14080300. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 15:37:08,386][41256] Avg episode reward: [(0, '36.783')] +[2023-03-11 15:37:08,765][41544] Updated weights for policy 0, policy_version 27520 (0.0005) +[2023-03-11 15:37:12,842][41544] Updated weights for policy 0, policy_version 27600 (0.0005) +[2023-03-11 15:37:13,385][41256] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10122.0). Total num frames: 14135296. Throughput: 0: 10253.0. Samples: 14110784. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 15:37:13,386][41256] Avg episode reward: [(0, '34.774')] +[2023-03-11 15:37:16,865][41544] Updated weights for policy 0, policy_version 27680 (0.0005) +[2023-03-11 15:37:18,386][41256] Fps is (10 sec: 9830.3, 60 sec: 10240.0, 300 sec: 10122.0). Total num frames: 14184448. Throughput: 0: 10236.4. Samples: 14172032. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 15:37:18,386][41256] Avg episode reward: [(0, '37.741')] +[2023-03-11 15:37:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000027704_14184448.pth... +[2023-03-11 15:37:18,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000027112_13881344.pth +[2023-03-11 15:37:20,886][41544] Updated weights for policy 0, policy_version 27760 (0.0005) +[2023-03-11 15:37:23,386][41256] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10135.9). Total num frames: 14237696. Throughput: 0: 10229.1. Samples: 14232672. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 15:37:23,386][41256] Avg episode reward: [(0, '38.402')] +[2023-03-11 15:37:24,945][41544] Updated weights for policy 0, policy_version 27840 (0.0005) +[2023-03-11 15:37:28,385][41256] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10122.0). Total num frames: 14286848. Throughput: 0: 10206.1. Samples: 14262792. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 15:37:28,386][41256] Avg episode reward: [(0, '36.671')] +[2023-03-11 15:37:28,995][41544] Updated weights for policy 0, policy_version 27920 (0.0005) +[2023-03-11 15:37:33,053][41544] Updated weights for policy 0, policy_version 28000 (0.0005) +[2023-03-11 15:37:33,386][41256] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10122.0). Total num frames: 14336000. Throughput: 0: 10190.3. Samples: 14323728. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 15:37:33,386][41256] Avg episode reward: [(0, '36.666')] +[2023-03-11 15:37:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000028000_14336000.pth... +[2023-03-11 15:37:33,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000027408_14032896.pth +[2023-03-11 15:37:37,062][41544] Updated weights for policy 0, policy_version 28080 (0.0005) +[2023-03-11 15:37:38,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10122.0). Total num frames: 14389248. Throughput: 0: 10194.5. Samples: 14385216. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:37:38,386][41256] Avg episode reward: [(0, '37.309')] +[2023-03-11 15:37:41,102][41544] Updated weights for policy 0, policy_version 28160 (0.0005) +[2023-03-11 15:37:43,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10122.0). Total num frames: 14438400. Throughput: 0: 10173.3. Samples: 14415192. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:37:43,386][41256] Avg episode reward: [(0, '39.000')] +[2023-03-11 15:37:45,160][41544] Updated weights for policy 0, policy_version 28240 (0.0005) +[2023-03-11 15:37:48,385][41256] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10122.0). Total num frames: 14487552. Throughput: 0: 10146.9. Samples: 14475584. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:37:48,386][41256] Avg episode reward: [(0, '34.219')] +[2023-03-11 15:37:48,388][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000028296_14487552.pth... +[2023-03-11 15:37:48,390][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000027704_14184448.pth +[2023-03-11 15:37:49,289][41544] Updated weights for policy 0, policy_version 28320 (0.0005) +[2023-03-11 15:37:53,230][41544] Updated weights for policy 0, policy_version 28400 (0.0005) +[2023-03-11 15:37:53,386][41256] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10149.7). Total num frames: 14540800. Throughput: 0: 10142.5. Samples: 14536712. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:37:53,386][41256] Avg episode reward: [(0, '35.149')] +[2023-03-11 15:37:57,227][41544] Updated weights for policy 0, policy_version 28480 (0.0005) +[2023-03-11 15:37:58,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10135.9). Total num frames: 14589952. Throughput: 0: 10143.4. Samples: 14567236. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 15:37:58,386][41256] Avg episode reward: [(0, '37.894')] +[2023-03-11 15:38:01,048][41544] Updated weights for policy 0, policy_version 28560 (0.0004) +[2023-03-11 15:38:03,385][41256] Fps is (10 sec: 10240.1, 60 sec: 10171.7, 300 sec: 10149.7). Total num frames: 14643200. Throughput: 0: 10199.0. Samples: 14630988. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 15:38:03,396][41256] Avg episode reward: [(0, '36.738')] +[2023-03-11 15:38:03,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000028608_14647296.pth... +[2023-03-11 15:38:03,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000028000_14336000.pth +[2023-03-11 15:38:05,078][41544] Updated weights for policy 0, policy_version 28640 (0.0003) +[2023-03-11 15:38:08,386][41256] Fps is (10 sec: 10649.5, 60 sec: 10171.7, 300 sec: 10163.6). Total num frames: 14696448. Throughput: 0: 10203.6. Samples: 14691836. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 15:38:08,397][41256] Avg episode reward: [(0, '35.870')] +[2023-03-11 15:38:09,106][41544] Updated weights for policy 0, policy_version 28720 (0.0005) +[2023-03-11 15:38:13,163][41544] Updated weights for policy 0, policy_version 28800 (0.0005) +[2023-03-11 15:38:13,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10163.6). Total num frames: 14745600. Throughput: 0: 10200.3. Samples: 14721808. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 15:38:13,386][41256] Avg episode reward: [(0, '35.687')] +[2023-03-11 15:38:17,188][41544] Updated weights for policy 0, policy_version 28880 (0.0005) +[2023-03-11 15:38:18,386][41256] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10163.6). Total num frames: 14798848. Throughput: 0: 10199.4. Samples: 14782700. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:38:18,386][41256] Avg episode reward: [(0, '38.007')] +[2023-03-11 15:38:18,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000028904_14798848.pth... 
+[2023-03-11 15:38:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000028296_14487552.pth +[2023-03-11 15:38:21,181][41544] Updated weights for policy 0, policy_version 28960 (0.0005) +[2023-03-11 15:38:23,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10171.8, 300 sec: 10149.8). Total num frames: 14848000. Throughput: 0: 10212.9. Samples: 14844796. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:38:23,386][41256] Avg episode reward: [(0, '35.788')] +[2023-03-11 15:38:25,013][41544] Updated weights for policy 0, policy_version 29040 (0.0005) +[2023-03-11 15:38:28,385][41256] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 10149.8). Total num frames: 14901248. Throughput: 0: 10256.7. Samples: 14876744. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:38:28,386][41256] Avg episode reward: [(0, '35.913')] +[2023-03-11 15:38:28,955][41544] Updated weights for policy 0, policy_version 29120 (0.0005) +[2023-03-11 15:38:32,999][41544] Updated weights for policy 0, policy_version 29200 (0.0006) +[2023-03-11 15:38:33,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10149.7). Total num frames: 14950400. Throughput: 0: 10281.8. Samples: 14938264. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 15:38:33,386][41256] Avg episode reward: [(0, '36.096')] +[2023-03-11 15:38:33,416][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000029208_14954496.pth... +[2023-03-11 15:38:33,417][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000028608_14647296.pth +[2023-03-11 15:38:37,087][41544] Updated weights for policy 0, policy_version 29280 (0.0005) +[2023-03-11 15:38:38,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10149.8). Total num frames: 15003648. Throughput: 0: 10273.7. Samples: 14999028. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 15:38:38,386][41256] Avg episode reward: [(0, '35.733')] +[2023-03-11 15:38:41,315][41544] Updated weights for policy 0, policy_version 29360 (0.0005) +[2023-03-11 15:38:43,385][41256] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 10149.7). Total num frames: 15048704. Throughput: 0: 10244.2. Samples: 15028224. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 15:38:43,386][41256] Avg episode reward: [(0, '34.310')] +[2023-03-11 15:38:45,513][41544] Updated weights for policy 0, policy_version 29440 (0.0005) +[2023-03-11 15:38:48,385][41256] Fps is (10 sec: 9420.8, 60 sec: 10171.7, 300 sec: 10135.9). Total num frames: 15097856. Throughput: 0: 10103.6. Samples: 15085648. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 15:38:48,386][41256] Avg episode reward: [(0, '33.179')] +[2023-03-11 15:38:48,443][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000029496_15101952.pth... +[2023-03-11 15:38:48,445][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000028904_14798848.pth +[2023-03-11 15:38:49,667][41544] Updated weights for policy 0, policy_version 29520 (0.0005) +[2023-03-11 15:38:53,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10149.8). Total num frames: 15151104. Throughput: 0: 10101.5. Samples: 15146404. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:38:53,386][41256] Avg episode reward: [(0, '33.831')] +[2023-03-11 15:38:53,704][41544] Updated weights for policy 0, policy_version 29600 (0.0006) +[2023-03-11 15:38:57,804][41544] Updated weights for policy 0, policy_version 29680 (0.0005) +[2023-03-11 15:38:58,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 10163.6). Total num frames: 15200256. Throughput: 0: 10098.0. Samples: 15176220. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:38:58,386][41256] Avg episode reward: [(0, '35.250')] +[2023-03-11 15:39:01,875][41544] Updated weights for policy 0, policy_version 29760 (0.0005) +[2023-03-11 15:39:03,386][41256] Fps is (10 sec: 9830.3, 60 sec: 10103.5, 300 sec: 10163.6). Total num frames: 15249408. Throughput: 0: 10098.2. Samples: 15237120. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:39:03,386][41256] Avg episode reward: [(0, '34.099')] +[2023-03-11 15:39:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000029784_15249408.pth... +[2023-03-11 15:39:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000029208_14954496.pth +[2023-03-11 15:39:05,782][41544] Updated weights for policy 0, policy_version 29840 (0.0005) +[2023-03-11 15:39:08,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10177.5). Total num frames: 15302656. Throughput: 0: 10104.3. Samples: 15299492. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:39:08,386][41256] Avg episode reward: [(0, '33.705')] +[2023-03-11 15:39:09,641][41544] Updated weights for policy 0, policy_version 29920 (0.0005) +[2023-03-11 15:39:13,385][41256] Fps is (10 sec: 10649.7, 60 sec: 10171.7, 300 sec: 10191.4). Total num frames: 15355904. Throughput: 0: 10101.9. Samples: 15331328. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:39:13,386][41256] Avg episode reward: [(0, '33.468')] +[2023-03-11 15:39:13,728][41544] Updated weights for policy 0, policy_version 30000 (0.0005) +[2023-03-11 15:39:17,759][41544] Updated weights for policy 0, policy_version 30080 (0.0005) +[2023-03-11 15:39:18,386][41256] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 10177.5). Total num frames: 15405056. Throughput: 0: 10077.3. Samples: 15391744. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:39:18,386][41256] Avg episode reward: [(0, '34.733')] +[2023-03-11 15:39:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000030088_15405056.pth... +[2023-03-11 15:39:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000029496_15101952.pth +[2023-03-11 15:39:21,764][41544] Updated weights for policy 0, policy_version 30160 (0.0005) +[2023-03-11 15:39:23,386][41256] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 10191.4). Total num frames: 15458304. Throughput: 0: 10083.8. Samples: 15452800. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:39:23,397][41256] Avg episode reward: [(0, '34.311')] +[2023-03-11 15:39:25,940][41544] Updated weights for policy 0, policy_version 30240 (0.0005) +[2023-03-11 15:39:28,385][41256] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10163.6). Total num frames: 15503360. Throughput: 0: 10090.1. Samples: 15482280. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:39:28,396][41256] Avg episode reward: [(0, '34.352')] +[2023-03-11 15:39:30,195][41544] Updated weights for policy 0, policy_version 30320 (0.0005) +[2023-03-11 15:39:33,385][41256] Fps is (10 sec: 9420.9, 60 sec: 10035.2, 300 sec: 10163.6). Total num frames: 15552512. Throughput: 0: 10101.7. Samples: 15540224. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:39:33,397][41256] Avg episode reward: [(0, '35.317')] +[2023-03-11 15:39:33,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000030376_15552512.pth... 
+[2023-03-11 15:39:33,403][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000029784_15249408.pth +[2023-03-11 15:39:34,410][41544] Updated weights for policy 0, policy_version 30400 (0.0005) +[2023-03-11 15:39:38,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 10163.6). Total num frames: 15601664. Throughput: 0: 10036.1. Samples: 15598028. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:39:38,396][41256] Avg episode reward: [(0, '36.316')] +[2023-03-11 15:39:38,615][41544] Updated weights for policy 0, policy_version 30480 (0.0005) +[2023-03-11 15:39:42,963][41544] Updated weights for policy 0, policy_version 30560 (0.0005) +[2023-03-11 15:39:43,386][41256] Fps is (10 sec: 9830.3, 60 sec: 10035.2, 300 sec: 10149.7). Total num frames: 15650816. Throughput: 0: 10001.8. Samples: 15626304. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:39:43,397][41256] Avg episode reward: [(0, '35.327')] +[2023-03-11 15:39:47,175][41544] Updated weights for policy 0, policy_version 30640 (0.0005) +[2023-03-11 15:39:48,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9966.9, 300 sec: 10135.9). Total num frames: 15695872. Throughput: 0: 9938.0. Samples: 15684332. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:39:48,396][41256] Avg episode reward: [(0, '36.641')] +[2023-03-11 15:39:48,433][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000030664_15699968.pth... +[2023-03-11 15:39:48,435][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000030088_15405056.pth +[2023-03-11 15:39:51,390][41544] Updated weights for policy 0, policy_version 30720 (0.0005) +[2023-03-11 15:39:53,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9966.9, 300 sec: 10135.9). Total num frames: 15749120. Throughput: 0: 9877.2. Samples: 15743968. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 15:39:53,396][41256] Avg episode reward: [(0, '36.403')] +[2023-03-11 15:39:55,299][41544] Updated weights for policy 0, policy_version 30800 (0.0004) +[2023-03-11 15:39:58,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 10122.0). Total num frames: 15798272. Throughput: 0: 9866.0. Samples: 15775296. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 15:39:58,386][41256] Avg episode reward: [(0, '37.547')] +[2023-03-11 15:39:59,261][41544] Updated weights for policy 0, policy_version 30880 (0.0004) +[2023-03-11 15:40:03,179][41544] Updated weights for policy 0, policy_version 30960 (0.0004) +[2023-03-11 15:40:03,386][41256] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10135.9). Total num frames: 15851520. Throughput: 0: 9923.4. Samples: 15838296. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 15:40:03,386][41256] Avg episode reward: [(0, '35.289')] +[2023-03-11 15:40:03,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000030960_15851520.pth... +[2023-03-11 15:40:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000030376_15552512.pth +[2023-03-11 15:40:07,104][41544] Updated weights for policy 0, policy_version 31040 (0.0005) +[2023-03-11 15:40:08,385][41256] Fps is (10 sec: 10649.6, 60 sec: 10035.2, 300 sec: 10149.7). Total num frames: 15904768. Throughput: 0: 9942.3. Samples: 15900204. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 15:40:08,386][41256] Avg episode reward: [(0, '39.162')] +[2023-03-11 15:40:11,162][41544] Updated weights for policy 0, policy_version 31120 (0.0005) +[2023-03-11 15:40:13,385][41256] Fps is (10 sec: 10240.1, 60 sec: 9966.9, 300 sec: 10149.8). Total num frames: 15953920. Throughput: 0: 9947.5. Samples: 15929916. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:40:13,386][41256] Avg episode reward: [(0, '38.276')] +[2023-03-11 15:40:15,036][41544] Updated weights for policy 0, policy_version 31200 (0.0004) +[2023-03-11 15:40:18,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10163.6). Total num frames: 16007168. Throughput: 0: 10057.6. Samples: 15992816. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:40:18,386][41256] Avg episode reward: [(0, '38.986')] +[2023-03-11 15:40:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000031264_16007168.pth... +[2023-03-11 15:40:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000030664_15699968.pth +[2023-03-11 15:40:19,091][41544] Updated weights for policy 0, policy_version 31280 (0.0004) +[2023-03-11 15:40:23,385][41256] Fps is (10 sec: 9830.3, 60 sec: 9898.7, 300 sec: 10149.7). Total num frames: 16052224. Throughput: 0: 10047.5. Samples: 16050168. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:40:23,386][41256] Avg episode reward: [(0, '40.150')] +[2023-03-11 15:40:23,540][41544] Updated weights for policy 0, policy_version 31360 (0.0004) +[2023-03-11 15:40:27,827][41544] Updated weights for policy 0, policy_version 31440 (0.0005) +[2023-03-11 15:40:28,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9966.9, 300 sec: 10149.8). Total num frames: 16101376. Throughput: 0: 10045.3. Samples: 16078340. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:40:28,386][41256] Avg episode reward: [(0, '39.426')] +[2023-03-11 15:40:32,215][41544] Updated weights for policy 0, policy_version 31520 (0.0005) +[2023-03-11 15:40:33,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9898.7, 300 sec: 10122.0). Total num frames: 16146432. Throughput: 0: 10007.1. Samples: 16134652. 
Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 15:40:33,386][41256] Avg episode reward: [(0, '39.792')] +[2023-03-11 15:40:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000031536_16146432.pth... +[2023-03-11 15:40:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000030960_15851520.pth +[2023-03-11 15:40:36,385][41544] Updated weights for policy 0, policy_version 31600 (0.0005) +[2023-03-11 15:40:38,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9898.7, 300 sec: 10122.0). Total num frames: 16195584. Throughput: 0: 9984.3. Samples: 16193260. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 15:40:38,386][41256] Avg episode reward: [(0, '37.624')] +[2023-03-11 15:40:40,701][41544] Updated weights for policy 0, policy_version 31680 (0.0005) +[2023-03-11 15:40:43,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10108.1). Total num frames: 16244736. Throughput: 0: 9915.3. Samples: 16221484. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 15:40:43,386][41256] Avg episode reward: [(0, '36.196')] +[2023-03-11 15:40:44,868][41544] Updated weights for policy 0, policy_version 31760 (0.0005) +[2023-03-11 15:40:48,386][41256] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10108.1). Total num frames: 16297984. Throughput: 0: 9854.9. Samples: 16281768. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 15:40:48,386][41256] Avg episode reward: [(0, '36.648')] +[2023-03-11 15:40:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000031832_16297984.pth... 
+[2023-03-11 15:40:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000031264_16007168.pth +[2023-03-11 15:40:48,731][41544] Updated weights for policy 0, policy_version 31840 (0.0004) +[2023-03-11 15:40:52,697][41544] Updated weights for policy 0, policy_version 31920 (0.0004) +[2023-03-11 15:40:53,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 10108.1). Total num frames: 16347136. Throughput: 0: 9865.2. Samples: 16344140. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 15:40:53,386][41256] Avg episode reward: [(0, '36.949')] +[2023-03-11 15:40:56,653][41544] Updated weights for policy 0, policy_version 32000 (0.0005) +[2023-03-11 15:40:58,385][41256] Fps is (10 sec: 10240.1, 60 sec: 10035.2, 300 sec: 10108.1). Total num frames: 16400384. Throughput: 0: 9899.5. Samples: 16375396. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 15:40:58,386][41256] Avg episode reward: [(0, '37.533')] +[2023-03-11 15:41:00,631][41544] Updated weights for policy 0, policy_version 32080 (0.0005) +[2023-03-11 15:41:03,386][41256] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 10094.2). Total num frames: 16449536. Throughput: 0: 9876.2. Samples: 16437244. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 15:41:03,386][41256] Avg episode reward: [(0, '39.042')] +[2023-03-11 15:41:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000032128_16449536.pth... +[2023-03-11 15:41:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000031536_16146432.pth +[2023-03-11 15:41:04,916][41544] Updated weights for policy 0, policy_version 32160 (0.0005) +[2023-03-11 15:41:08,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10094.2). Total num frames: 16498688. Throughput: 0: 9874.2. Samples: 16494508. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 15:41:08,386][41256] Avg episode reward: [(0, '39.498')] +[2023-03-11 15:41:09,132][41544] Updated weights for policy 0, policy_version 32240 (0.0005) +[2023-03-11 15:41:13,343][41544] Updated weights for policy 0, policy_version 32320 (0.0005) +[2023-03-11 15:41:13,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10094.2). Total num frames: 16547840. Throughput: 0: 9888.6. Samples: 16523328. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 15:41:13,395][41256] Avg episode reward: [(0, '38.837')] +[2023-03-11 15:41:17,668][41544] Updated weights for policy 0, policy_version 32400 (0.0005) +[2023-03-11 15:41:18,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 10066.4). Total num frames: 16592896. Throughput: 0: 9918.9. Samples: 16581004. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 15:41:18,396][41256] Avg episode reward: [(0, '37.866')] +[2023-03-11 15:41:18,399][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000032408_16592896.pth... +[2023-03-11 15:41:18,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000031832_16297984.pth +[2023-03-11 15:41:22,037][41544] Updated weights for policy 0, policy_version 32480 (0.0005) +[2023-03-11 15:41:23,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9830.4, 300 sec: 10066.4). Total num frames: 16642048. Throughput: 0: 9882.0. Samples: 16637952. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 15:41:23,386][41256] Avg episode reward: [(0, '36.331')] +[2023-03-11 15:41:26,332][41544] Updated weights for policy 0, policy_version 32560 (0.0005) +[2023-03-11 15:41:28,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 10038.7). Total num frames: 16687104. Throughput: 0: 9892.0. Samples: 16666624. 
Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 15:41:28,386][41256] Avg episode reward: [(0, '37.789')] +[2023-03-11 15:41:30,801][41544] Updated weights for policy 0, policy_version 32640 (0.0005) +[2023-03-11 15:41:33,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9830.4, 300 sec: 10038.7). Total num frames: 16736256. Throughput: 0: 9786.4. Samples: 16722156. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 15:41:33,386][41256] Avg episode reward: [(0, '36.866')] +[2023-03-11 15:41:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000032688_16736256.pth... +[2023-03-11 15:41:33,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000032128_16449536.pth +[2023-03-11 15:41:34,837][41544] Updated weights for policy 0, policy_version 32720 (0.0005) +[2023-03-11 15:41:38,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10024.8). Total num frames: 16785408. Throughput: 0: 9716.5. Samples: 16781384. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:41:38,386][41256] Avg episode reward: [(0, '38.532')] +[2023-03-11 15:41:39,136][41544] Updated weights for policy 0, policy_version 32800 (0.0006) +[2023-03-11 15:41:43,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 10010.9). Total num frames: 16830464. Throughput: 0: 9659.3. Samples: 16810064. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:41:43,386][41256] Avg episode reward: [(0, '38.381')] +[2023-03-11 15:41:43,398][41544] Updated weights for policy 0, policy_version 32880 (0.0005) +[2023-03-11 15:41:47,732][41544] Updated weights for policy 0, policy_version 32960 (0.0005) +[2023-03-11 15:41:48,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9997.0). Total num frames: 16879616. Throughput: 0: 9557.4. Samples: 16867328. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:41:48,386][41256] Avg episode reward: [(0, '39.747')] +[2023-03-11 15:41:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000032968_16879616.pth... +[2023-03-11 15:41:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000032408_16592896.pth +[2023-03-11 15:41:51,964][41544] Updated weights for policy 0, policy_version 33040 (0.0005) +[2023-03-11 15:41:53,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9693.9, 300 sec: 9997.0). Total num frames: 16928768. Throughput: 0: 9560.8. Samples: 16924744. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:41:53,386][41256] Avg episode reward: [(0, '40.412')] +[2023-03-11 15:41:56,301][41544] Updated weights for policy 0, policy_version 33120 (0.0005) +[2023-03-11 15:41:58,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9557.3, 300 sec: 9969.3). Total num frames: 16973824. Throughput: 0: 9555.9. Samples: 16953344. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:41:58,386][41256] Avg episode reward: [(0, '40.693')] +[2023-03-11 15:42:00,678][41544] Updated weights for policy 0, policy_version 33200 (0.0005) +[2023-03-11 15:42:03,386][41256] Fps is (10 sec: 9420.6, 60 sec: 9557.3, 300 sec: 9955.4). Total num frames: 17022976. Throughput: 0: 9517.2. Samples: 17009280. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:42:03,386][41256] Avg episode reward: [(0, '40.626')] +[2023-03-11 15:42:03,391][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000033248_17022976.pth... 
+[2023-03-11 15:42:03,394][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000032688_16736256.pth +[2023-03-11 15:42:05,043][41544] Updated weights for policy 0, policy_version 33280 (0.0005) +[2023-03-11 15:42:08,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9941.5). Total num frames: 17068032. Throughput: 0: 9517.7. Samples: 17066248. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:42:08,386][41256] Avg episode reward: [(0, '39.231')] +[2023-03-11 15:42:09,293][41544] Updated weights for policy 0, policy_version 33360 (0.0005) +[2023-03-11 15:42:13,385][41256] Fps is (10 sec: 9421.0, 60 sec: 9489.1, 300 sec: 9941.5). Total num frames: 17117184. Throughput: 0: 9527.8. Samples: 17095376. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:42:13,386][41256] Avg episode reward: [(0, '39.325')] +[2023-03-11 15:42:13,392][41544] Updated weights for policy 0, policy_version 33440 (0.0005) +[2023-03-11 15:42:17,536][41544] Updated weights for policy 0, policy_version 33520 (0.0005) +[2023-03-11 15:42:18,391][41256] Fps is (10 sec: 10234.0, 60 sec: 9624.7, 300 sec: 9941.3). Total num frames: 17170432. Throughput: 0: 9624.7. Samples: 17155324. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 15:42:18,392][41256] Avg episode reward: [(0, '38.597')] +[2023-03-11 15:42:18,396][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000033536_17170432.pth... +[2023-03-11 15:42:18,397][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000032968_16879616.pth +[2023-03-11 15:42:21,793][41544] Updated weights for policy 0, policy_version 33600 (0.0005) +[2023-03-11 15:42:23,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9927.6). Total num frames: 17215488. Throughput: 0: 9596.1. Samples: 17213208. 
Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 15:42:23,386][41256] Avg episode reward: [(0, '37.045')] +[2023-03-11 15:42:26,020][41544] Updated weights for policy 0, policy_version 33680 (0.0005) +[2023-03-11 15:42:28,385][41256] Fps is (10 sec: 9426.2, 60 sec: 9625.6, 300 sec: 9927.6). Total num frames: 17264640. Throughput: 0: 9610.1. Samples: 17242520. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 15:42:28,386][41256] Avg episode reward: [(0, '39.652')] +[2023-03-11 15:42:30,036][41544] Updated weights for policy 0, policy_version 33760 (0.0004) +[2023-03-11 15:42:33,386][41256] Fps is (10 sec: 10239.9, 60 sec: 9693.9, 300 sec: 9927.6). Total num frames: 17317888. Throughput: 0: 9688.5. Samples: 17303312. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 15:42:33,386][41256] Avg episode reward: [(0, '39.653')] +[2023-03-11 15:42:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000033824_17317888.pth... +[2023-03-11 15:42:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000033248_17022976.pth +[2023-03-11 15:42:34,195][41544] Updated weights for policy 0, policy_version 33840 (0.0005) +[2023-03-11 15:42:38,275][41544] Updated weights for policy 0, policy_version 33920 (0.0005) +[2023-03-11 15:42:38,386][41256] Fps is (10 sec: 10239.9, 60 sec: 9693.9, 300 sec: 9927.6). Total num frames: 17367040. Throughput: 0: 9737.9. Samples: 17362952. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 15:42:38,386][41256] Avg episode reward: [(0, '39.166')] +[2023-03-11 15:42:42,279][41544] Updated weights for policy 0, policy_version 34000 (0.0005) +[2023-03-11 15:42:43,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9927.6). Total num frames: 17416192. Throughput: 0: 9763.6. Samples: 17392708. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:42:43,386][41256] Avg episode reward: [(0, '37.382')] +[2023-03-11 15:42:46,207][41544] Updated weights for policy 0, policy_version 34080 (0.0004) +[2023-03-11 15:42:48,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 9927.6). Total num frames: 17469440. Throughput: 0: 9919.2. Samples: 17455644. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:42:48,386][41256] Avg episode reward: [(0, '34.266')] +[2023-03-11 15:42:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000034120_17469440.pth... +[2023-03-11 15:42:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000033536_17170432.pth +[2023-03-11 15:42:50,182][41544] Updated weights for policy 0, policy_version 34160 (0.0004) +[2023-03-11 15:42:53,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 9927.6). Total num frames: 17518592. Throughput: 0: 10003.1. Samples: 17516388. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:42:53,386][41256] Avg episode reward: [(0, '33.233')] +[2023-03-11 15:42:54,228][41544] Updated weights for policy 0, policy_version 34240 (0.0005) +[2023-03-11 15:42:58,185][41544] Updated weights for policy 0, policy_version 34320 (0.0004) +[2023-03-11 15:42:58,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 9927.6). Total num frames: 17571840. Throughput: 0: 10043.5. Samples: 17547336. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:42:58,386][41256] Avg episode reward: [(0, '43.563')] +[2023-03-11 15:43:02,062][41544] Updated weights for policy 0, policy_version 34400 (0.0004) +[2023-03-11 15:43:03,386][41256] Fps is (10 sec: 10649.5, 60 sec: 10035.2, 300 sec: 9927.6). Total num frames: 17625088. Throughput: 0: 10112.1. Samples: 17610312. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 15:43:03,386][41256] Avg episode reward: [(0, '32.435')] +[2023-03-11 15:43:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000034424_17625088.pth... +[2023-03-11 15:43:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000033824_17317888.pth +[2023-03-11 15:43:06,108][41544] Updated weights for policy 0, policy_version 34480 (0.0005) +[2023-03-11 15:43:08,385][41256] Fps is (10 sec: 10240.1, 60 sec: 10103.5, 300 sec: 9927.6). Total num frames: 17674240. Throughput: 0: 10159.5. Samples: 17670384. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 15:43:08,386][41256] Avg episode reward: [(0, '35.691')] +[2023-03-11 15:43:10,337][41544] Updated weights for policy 0, policy_version 34560 (0.0005) +[2023-03-11 15:43:13,385][41256] Fps is (10 sec: 9420.9, 60 sec: 10035.2, 300 sec: 9899.8). Total num frames: 17719296. Throughput: 0: 10144.9. Samples: 17699040. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 15:43:13,386][41256] Avg episode reward: [(0, '42.588')] +[2023-03-11 15:43:14,764][41544] Updated weights for policy 0, policy_version 34640 (0.0005) +[2023-03-11 15:43:18,385][41256] Fps is (10 sec: 9420.7, 60 sec: 9967.9, 300 sec: 9899.8). Total num frames: 17768448. Throughput: 0: 10052.4. Samples: 17755672. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 15:43:18,386][41256] Avg episode reward: [(0, '59.152')] +[2023-03-11 15:43:18,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000034704_17768448.pth... +[2023-03-11 15:43:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000034120_17469440.pth +[2023-03-11 15:43:18,393][41500] Saving new best policy, reward=59.152! 
+[2023-03-11 15:43:19,148][41544] Updated weights for policy 0, policy_version 34720 (0.0005) +[2023-03-11 15:43:23,335][41544] Updated weights for policy 0, policy_version 34800 (0.0005) +[2023-03-11 15:43:23,385][41256] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 9885.9). Total num frames: 17817600. Throughput: 0: 9993.5. Samples: 17812660. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 15:43:23,386][41256] Avg episode reward: [(0, '44.458')] +[2023-03-11 15:43:27,686][41544] Updated weights for policy 0, policy_version 34880 (0.0005) +[2023-03-11 15:43:28,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9966.9, 300 sec: 9872.1). Total num frames: 17862656. Throughput: 0: 9970.3. Samples: 17841372. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:43:28,386][41256] Avg episode reward: [(0, '34.147')] +[2023-03-11 15:43:31,922][41544] Updated weights for policy 0, policy_version 34960 (0.0005) +[2023-03-11 15:43:33,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9898.7, 300 sec: 9858.2). Total num frames: 17911808. Throughput: 0: 9863.5. Samples: 17899500. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:43:33,386][41256] Avg episode reward: [(0, '26.917')] +[2023-03-11 15:43:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000034984_17911808.pth... +[2023-03-11 15:43:33,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000034424_17625088.pth +[2023-03-11 15:43:36,172][41544] Updated weights for policy 0, policy_version 35040 (0.0005) +[2023-03-11 15:43:38,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9872.1). Total num frames: 17960960. Throughput: 0: 9788.3. Samples: 17956864. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:43:38,386][41256] Avg episode reward: [(0, '30.248')] +[2023-03-11 15:43:40,405][41544] Updated weights for policy 0, policy_version 35120 (0.0005) +[2023-03-11 15:43:43,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9830.4, 300 sec: 9858.2). Total num frames: 18006016. Throughput: 0: 9738.0. Samples: 17985544. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:43:43,386][41256] Avg episode reward: [(0, '46.006')] +[2023-03-11 15:43:44,783][41544] Updated weights for policy 0, policy_version 35200 (0.0005) +[2023-03-11 15:43:48,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9762.1, 300 sec: 9844.3). Total num frames: 18055168. Throughput: 0: 9606.6. Samples: 18042608. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 15:43:48,386][41256] Avg episode reward: [(0, '54.423')] +[2023-03-11 15:43:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000035264_18055168.pth... +[2023-03-11 15:43:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000034704_17768448.pth +[2023-03-11 15:43:49,055][41544] Updated weights for policy 0, policy_version 35280 (0.0005) +[2023-03-11 15:43:53,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9830.4). Total num frames: 18100224. Throughput: 0: 9517.0. Samples: 18098648. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 15:43:53,386][41256] Avg episode reward: [(0, '67.537')] +[2023-03-11 15:43:53,387][41500] Saving new best policy, reward=67.537! +[2023-03-11 15:43:53,490][41544] Updated weights for policy 0, policy_version 35360 (0.0005) +[2023-03-11 15:43:57,875][41544] Updated weights for policy 0, policy_version 35440 (0.0005) +[2023-03-11 15:43:58,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9625.6, 300 sec: 9830.4). Total num frames: 18149376. Throughput: 0: 9488.8. Samples: 18126036. 
Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 15:43:58,386][41256] Avg episode reward: [(0, '82.083')] +[2023-03-11 15:43:58,387][41500] Saving new best policy, reward=82.083! +[2023-03-11 15:44:02,316][41544] Updated weights for policy 0, policy_version 35520 (0.0005) +[2023-03-11 15:44:03,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9489.1, 300 sec: 9802.6). Total num frames: 18194432. Throughput: 0: 9477.2. Samples: 18182144. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 15:44:03,386][41256] Avg episode reward: [(0, '84.264')] +[2023-03-11 15:44:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000035536_18194432.pth... +[2023-03-11 15:44:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000034984_17911808.pth +[2023-03-11 15:44:03,393][41500] Saving new best policy, reward=84.264! +[2023-03-11 15:44:06,693][41544] Updated weights for policy 0, policy_version 35600 (0.0005) +[2023-03-11 15:44:08,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9420.8, 300 sec: 9774.9). Total num frames: 18239488. Throughput: 0: 9453.9. Samples: 18238084. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 15:44:08,386][41256] Avg episode reward: [(0, '93.843')] +[2023-03-11 15:44:08,386][41500] Saving new best policy, reward=93.843! +[2023-03-11 15:44:11,060][41544] Updated weights for policy 0, policy_version 35680 (0.0005) +[2023-03-11 15:44:13,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9774.9). Total num frames: 18288640. Throughput: 0: 9440.8. Samples: 18266208. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 15:44:13,386][41256] Avg episode reward: [(0, '73.702')] +[2023-03-11 15:44:15,505][41544] Updated weights for policy 0, policy_version 35760 (0.0005) +[2023-03-11 15:44:18,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9420.8, 300 sec: 9747.1). Total num frames: 18333696. 
Throughput: 0: 9375.7. Samples: 18321408. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 15:44:18,386][41256] Avg episode reward: [(0, '83.080')] +[2023-03-11 15:44:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000035808_18333696.pth... +[2023-03-11 15:44:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000035264_18055168.pth +[2023-03-11 15:44:19,907][41544] Updated weights for policy 0, policy_version 35840 (0.0005) +[2023-03-11 15:44:23,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9420.8, 300 sec: 9761.0). Total num frames: 18382848. Throughput: 0: 9373.1. Samples: 18378652. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 15:44:23,386][41256] Avg episode reward: [(0, '86.787')] +[2023-03-11 15:44:24,187][41544] Updated weights for policy 0, policy_version 35920 (0.0005) +[2023-03-11 15:44:28,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9420.8, 300 sec: 9747.1). Total num frames: 18427904. Throughput: 0: 9368.2. Samples: 18407112. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 15:44:28,386][41256] Avg episode reward: [(0, '96.670')] +[2023-03-11 15:44:28,387][41500] Saving new best policy, reward=96.670! +[2023-03-11 15:44:28,512][41544] Updated weights for policy 0, policy_version 36000 (0.0005) +[2023-03-11 15:44:32,818][41544] Updated weights for policy 0, policy_version 36080 (0.0005) +[2023-03-11 15:44:33,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9420.8, 300 sec: 9747.1). Total num frames: 18477056. Throughput: 0: 9365.1. Samples: 18464036. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 15:44:33,386][41256] Avg episode reward: [(0, '49.443')] +[2023-03-11 15:44:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000036088_18477056.pth... 
+[2023-03-11 15:44:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000035536_18194432.pth +[2023-03-11 15:44:37,117][41544] Updated weights for policy 0, policy_version 36160 (0.0005) +[2023-03-11 15:44:38,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9420.8, 300 sec: 9747.1). Total num frames: 18526208. Throughput: 0: 9390.8. Samples: 18521236. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:44:38,386][41256] Avg episode reward: [(0, '33.176')] +[2023-03-11 15:44:41,351][41544] Updated weights for policy 0, policy_version 36240 (0.0005) +[2023-03-11 15:44:43,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9747.1). Total num frames: 18571264. Throughput: 0: 9426.5. Samples: 18550228. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:44:43,386][41256] Avg episode reward: [(0, '58.306')] +[2023-03-11 15:44:45,540][41544] Updated weights for policy 0, policy_version 36320 (0.0005) +[2023-03-11 15:44:48,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9420.8, 300 sec: 9733.2). Total num frames: 18620416. Throughput: 0: 9467.7. Samples: 18608192. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:44:48,386][41256] Avg episode reward: [(0, '77.315')] +[2023-03-11 15:44:48,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000036368_18620416.pth... +[2023-03-11 15:44:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000035808_18333696.pth +[2023-03-11 15:44:49,821][41544] Updated weights for policy 0, policy_version 36400 (0.0004) +[2023-03-11 15:44:53,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9489.1, 300 sec: 9733.2). Total num frames: 18669568. Throughput: 0: 9497.8. Samples: 18665484. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:44:53,386][41256] Avg episode reward: [(0, '53.005')] +[2023-03-11 15:44:54,109][41544] Updated weights for policy 0, policy_version 36480 (0.0005) +[2023-03-11 15:44:58,287][41544] Updated weights for policy 0, policy_version 36560 (0.0005) +[2023-03-11 15:44:58,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9489.1, 300 sec: 9719.3). Total num frames: 18718720. Throughput: 0: 9519.4. Samples: 18694580. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:44:58,386][41256] Avg episode reward: [(0, '47.850')] +[2023-03-11 15:45:02,557][41544] Updated weights for policy 0, policy_version 36640 (0.0005) +[2023-03-11 15:45:03,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9489.1, 300 sec: 9691.6). Total num frames: 18763776. Throughput: 0: 9585.3. Samples: 18752744. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:45:03,386][41256] Avg episode reward: [(0, '73.868')] +[2023-03-11 15:45:03,419][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000036656_18767872.pth... +[2023-03-11 15:45:03,421][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000036088_18477056.pth +[2023-03-11 15:45:06,868][41544] Updated weights for policy 0, policy_version 36720 (0.0005) +[2023-03-11 15:45:08,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9691.6). Total num frames: 18812928. Throughput: 0: 9580.5. Samples: 18809776. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:45:08,386][41256] Avg episode reward: [(0, '83.699')] +[2023-03-11 15:45:11,115][41544] Updated weights for policy 0, policy_version 36800 (0.0005) +[2023-03-11 15:45:13,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9557.4, 300 sec: 9677.7). Total num frames: 18862080. Throughput: 0: 9587.2. Samples: 18838536. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:45:13,386][41256] Avg episode reward: [(0, '75.031')] +[2023-03-11 15:45:15,160][41544] Updated weights for policy 0, policy_version 36880 (0.0004) +[2023-03-11 15:45:18,386][41256] Fps is (10 sec: 10239.9, 60 sec: 9693.9, 300 sec: 9705.4). Total num frames: 18915328. Throughput: 0: 9691.5. Samples: 18900152. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:45:18,386][41256] Avg episode reward: [(0, '55.908')] +[2023-03-11 15:45:18,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000036944_18915328.pth... +[2023-03-11 15:45:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000036368_18620416.pth +[2023-03-11 15:45:19,088][41544] Updated weights for policy 0, policy_version 36960 (0.0004) +[2023-03-11 15:45:23,272][41544] Updated weights for policy 0, policy_version 37040 (0.0005) +[2023-03-11 15:45:23,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9693.9, 300 sec: 9705.4). Total num frames: 18964480. Throughput: 0: 9759.0. Samples: 18960392. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:45:23,386][41256] Avg episode reward: [(0, '46.392')] +[2023-03-11 15:45:27,573][41544] Updated weights for policy 0, policy_version 37120 (0.0005) +[2023-03-11 15:45:28,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9693.9, 300 sec: 9705.4). Total num frames: 19009536. Throughput: 0: 9753.1. Samples: 18989120. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:45:28,386][41256] Avg episode reward: [(0, '23.241')] +[2023-03-11 15:45:31,763][41544] Updated weights for policy 0, policy_version 37200 (0.0005) +[2023-03-11 15:45:33,386][41256] Fps is (10 sec: 9420.6, 60 sec: 9693.9, 300 sec: 9705.4). Total num frames: 19058688. Throughput: 0: 9749.5. Samples: 19046920. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:45:33,386][41256] Avg episode reward: [(0, '50.803')] +[2023-03-11 15:45:33,430][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000037232_19062784.pth... +[2023-03-11 15:45:33,431][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000036656_18767872.pth +[2023-03-11 15:45:35,875][41544] Updated weights for policy 0, policy_version 37280 (0.0005) +[2023-03-11 15:45:38,385][41256] Fps is (10 sec: 10240.1, 60 sec: 9762.1, 300 sec: 9719.3). Total num frames: 19111936. Throughput: 0: 9827.6. Samples: 19107724. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:45:38,386][41256] Avg episode reward: [(0, '20.354')] +[2023-03-11 15:45:39,843][41544] Updated weights for policy 0, policy_version 37360 (0.0005) +[2023-03-11 15:45:43,385][41256] Fps is (10 sec: 10649.7, 60 sec: 9898.7, 300 sec: 9719.3). Total num frames: 19165184. Throughput: 0: 9876.4. Samples: 19139020. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:45:43,386][41256] Avg episode reward: [(0, '13.040')] +[2023-03-11 15:45:43,693][41544] Updated weights for policy 0, policy_version 37440 (0.0004) +[2023-03-11 15:45:47,711][41544] Updated weights for policy 0, policy_version 37520 (0.0004) +[2023-03-11 15:45:48,386][41256] Fps is (10 sec: 10239.9, 60 sec: 9898.7, 300 sec: 9719.3). Total num frames: 19214336. Throughput: 0: 9984.7. Samples: 19202056. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 15:45:48,386][41256] Avg episode reward: [(0, '19.867')] +[2023-03-11 15:45:48,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000037528_19214336.pth... 
+[2023-03-11 15:45:48,393][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000036944_18915328.pth +[2023-03-11 15:45:51,974][41544] Updated weights for policy 0, policy_version 37600 (0.0005) +[2023-03-11 15:45:53,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9898.7, 300 sec: 9705.4). Total num frames: 19263488. Throughput: 0: 9993.1. Samples: 19259464. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 15:45:53,386][41256] Avg episode reward: [(0, '27.672')] +[2023-03-11 15:45:56,214][41544] Updated weights for policy 0, policy_version 37680 (0.0004) +[2023-03-11 15:45:58,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9705.4). Total num frames: 19312640. Throughput: 0: 9991.1. Samples: 19288136. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 15:45:58,386][41256] Avg episode reward: [(0, '35.315')] +[2023-03-11 15:46:00,431][41544] Updated weights for policy 0, policy_version 37760 (0.0005) +[2023-03-11 15:46:03,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9898.7, 300 sec: 9691.6). Total num frames: 19357696. Throughput: 0: 9921.2. Samples: 19346608. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 15:46:03,397][41256] Avg episode reward: [(0, '42.092')] +[2023-03-11 15:46:03,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000037816_19361792.pth... +[2023-03-11 15:46:03,401][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000037232_19062784.pth +[2023-03-11 15:46:04,685][41544] Updated weights for policy 0, policy_version 37840 (0.0005) +[2023-03-11 15:46:08,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9898.7, 300 sec: 9691.6). Total num frames: 19406848. Throughput: 0: 9852.5. Samples: 19403756. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 15:46:08,397][41256] Avg episode reward: [(0, '27.218')] +[2023-03-11 15:46:08,908][41544] Updated weights for policy 0, policy_version 37920 (0.0005) +[2023-03-11 15:46:12,948][41544] Updated weights for policy 0, policy_version 38000 (0.0005) +[2023-03-11 15:46:13,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 9719.3). Total num frames: 19460096. Throughput: 0: 9910.3. Samples: 19435084. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:46:13,396][41256] Avg episode reward: [(0, '31.399')] +[2023-03-11 15:46:17,050][41544] Updated weights for policy 0, policy_version 38080 (0.0005) +[2023-03-11 15:46:18,386][41256] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9719.3). Total num frames: 19509248. Throughput: 0: 9951.7. Samples: 19494744. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:46:18,397][41256] Avg episode reward: [(0, '33.517')] +[2023-03-11 15:46:18,401][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000038104_19509248.pth... +[2023-03-11 15:46:18,403][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000037528_19214336.pth +[2023-03-11 15:46:21,170][41544] Updated weights for policy 0, policy_version 38160 (0.0005) +[2023-03-11 15:46:23,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9733.2). Total num frames: 19558400. Throughput: 0: 9923.7. Samples: 19554292. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:46:23,397][41256] Avg episode reward: [(0, '34.640')] +[2023-03-11 15:46:25,292][41544] Updated weights for policy 0, policy_version 38240 (0.0005) +[2023-03-11 15:46:28,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9967.0, 300 sec: 9733.2). Total num frames: 19607552. Throughput: 0: 9889.0. Samples: 19584024. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:46:28,396][41256] Avg episode reward: [(0, '35.421')] +[2023-03-11 15:46:29,348][41544] Updated weights for policy 0, policy_version 38320 (0.0005) +[2023-03-11 15:46:33,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9747.1). Total num frames: 19660800. Throughput: 0: 9840.2. Samples: 19644864. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 15:46:33,386][41544] Updated weights for policy 0, policy_version 38400 (0.0004) +[2023-03-11 15:46:33,396][41256] Avg episode reward: [(0, '38.444')] +[2023-03-11 15:46:33,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000038400_19660800.pth... +[2023-03-11 15:46:33,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000037816_19361792.pth +[2023-03-11 15:46:37,683][41544] Updated weights for policy 0, policy_version 38480 (0.0005) +[2023-03-11 15:46:38,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9747.1). Total num frames: 19705856. Throughput: 0: 9862.5. Samples: 19703276. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 15:46:38,396][41256] Avg episode reward: [(0, '37.305')] +[2023-03-11 15:46:42,016][41544] Updated weights for policy 0, policy_version 38560 (0.0005) +[2023-03-11 15:46:43,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9830.4, 300 sec: 9747.1). Total num frames: 19755008. Throughput: 0: 9857.8. Samples: 19731736. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 15:46:43,396][41256] Avg episode reward: [(0, '36.400')] +[2023-03-11 15:46:46,271][41544] Updated weights for policy 0, policy_version 38640 (0.0005) +[2023-03-11 15:46:48,386][41256] Fps is (10 sec: 9830.3, 60 sec: 9830.4, 300 sec: 9747.1). Total num frames: 19804160. Throughput: 0: 9834.3. Samples: 19789152. 
Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 15:46:48,396][41256] Avg episode reward: [(0, '37.185')] +[2023-03-11 15:46:48,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000038680_19804160.pth... +[2023-03-11 15:46:48,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000038104_19509248.pth +[2023-03-11 15:46:50,473][41544] Updated weights for policy 0, policy_version 38720 (0.0005) +[2023-03-11 15:46:53,385][41256] Fps is (10 sec: 9420.7, 60 sec: 9762.1, 300 sec: 9747.1). Total num frames: 19849216. Throughput: 0: 9844.8. Samples: 19846772. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 15:46:53,386][41256] Avg episode reward: [(0, '38.654')] +[2023-03-11 15:46:54,890][41544] Updated weights for policy 0, policy_version 38800 (0.0005) +[2023-03-11 15:46:58,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9747.1). Total num frames: 19898368. Throughput: 0: 9771.7. Samples: 19874812. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:46:58,386][41256] Avg episode reward: [(0, '37.820')] +[2023-03-11 15:46:59,228][41544] Updated weights for policy 0, policy_version 38880 (0.0005) +[2023-03-11 15:47:03,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9747.1). Total num frames: 19943424. Throughput: 0: 9697.8. Samples: 19931144. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:47:03,386][41256] Avg episode reward: [(0, '37.300')] +[2023-03-11 15:47:03,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000038952_19943424.pth... 
+[2023-03-11 15:47:03,393][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000038400_19660800.pth +[2023-03-11 15:47:03,552][41544] Updated weights for policy 0, policy_version 38960 (0.0005) +[2023-03-11 15:47:07,688][41544] Updated weights for policy 0, policy_version 39040 (0.0005) +[2023-03-11 15:47:08,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9747.1). Total num frames: 19992576. Throughput: 0: 9681.9. Samples: 19989976. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:47:08,386][41256] Avg episode reward: [(0, '38.151')] +[2023-03-11 15:47:11,980][41544] Updated weights for policy 0, policy_version 39120 (0.0005) +[2023-03-11 15:47:13,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9733.4). Total num frames: 20041728. Throughput: 0: 9660.3. Samples: 20018736. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:47:13,386][41256] Avg episode reward: [(0, '38.753')] +[2023-03-11 15:47:16,291][41544] Updated weights for policy 0, policy_version 39200 (0.0005) +[2023-03-11 15:47:18,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9733.2). Total num frames: 20086784. Throughput: 0: 9566.3. Samples: 20075348. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:47:18,386][41256] Avg episode reward: [(0, '37.642')] +[2023-03-11 15:47:18,449][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000039240_20090880.pth... +[2023-03-11 15:47:18,450][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000038680_19804160.pth +[2023-03-11 15:47:20,535][41544] Updated weights for policy 0, policy_version 39280 (0.0005) +[2023-03-11 15:47:23,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9733.2). Total num frames: 20135936. Throughput: 0: 9574.0. Samples: 20134108. 
Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 15:47:23,386][41256] Avg episode reward: [(0, '37.468')] +[2023-03-11 15:47:24,765][41544] Updated weights for policy 0, policy_version 39360 (0.0005) +[2023-03-11 15:47:28,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9719.3). Total num frames: 20185088. Throughput: 0: 9587.5. Samples: 20163172. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 15:47:28,386][41256] Avg episode reward: [(0, '38.715')] +[2023-03-11 15:47:29,013][41544] Updated weights for policy 0, policy_version 39440 (0.0005) +[2023-03-11 15:47:33,296][41544] Updated weights for policy 0, policy_version 39520 (0.0005) +[2023-03-11 15:47:33,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9719.3). Total num frames: 20234240. Throughput: 0: 9590.2. Samples: 20220712. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 15:47:33,386][41256] Avg episode reward: [(0, '38.101')] +[2023-03-11 15:47:33,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000039520_20234240.pth... +[2023-03-11 15:47:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000038952_19943424.pth +[2023-03-11 15:47:37,444][41544] Updated weights for policy 0, policy_version 39600 (0.0004) +[2023-03-11 15:47:38,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9719.3). Total num frames: 20283392. Throughput: 0: 9611.8. Samples: 20279304. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 15:47:38,386][41256] Avg episode reward: [(0, '38.611')] +[2023-03-11 15:47:41,711][41544] Updated weights for policy 0, policy_version 39680 (0.0005) +[2023-03-11 15:47:43,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9705.4). Total num frames: 20332544. Throughput: 0: 9626.0. Samples: 20307984. 
Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 15:47:43,386][41256] Avg episode reward: [(0, '39.668')] +[2023-03-11 15:47:45,932][41544] Updated weights for policy 0, policy_version 39760 (0.0005) +[2023-03-11 15:47:48,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9691.6). Total num frames: 20377600. Throughput: 0: 9652.6. Samples: 20365512. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 15:47:48,386][41256] Avg episode reward: [(0, '36.882')] +[2023-03-11 15:47:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000039800_20377600.pth... +[2023-03-11 15:47:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000039240_20090880.pth +[2023-03-11 15:47:50,184][41544] Updated weights for policy 0, policy_version 39840 (0.0005) +[2023-03-11 15:47:53,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9677.7). Total num frames: 20426752. Throughput: 0: 9627.9. Samples: 20423232. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:47:53,386][41256] Avg episode reward: [(0, '36.964')] +[2023-03-11 15:47:54,464][41544] Updated weights for policy 0, policy_version 39920 (0.0005) +[2023-03-11 15:47:58,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9663.8). Total num frames: 20475904. Throughput: 0: 9617.6. Samples: 20451528. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:47:58,386][41256] Avg episode reward: [(0, '28.287')] +[2023-03-11 15:47:58,708][41544] Updated weights for policy 0, policy_version 40000 (0.0005) +[2023-03-11 15:48:02,979][41544] Updated weights for policy 0, policy_version 40080 (0.0005) +[2023-03-11 15:48:03,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9649.9). Total num frames: 20520960. Throughput: 0: 9662.6. Samples: 20510164. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:48:03,386][41256] Avg episode reward: [(0, '30.339')] +[2023-03-11 15:48:03,397][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000040088_20525056.pth... +[2023-03-11 15:48:03,398][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000039520_20234240.pth +[2023-03-11 15:48:07,350][41544] Updated weights for policy 0, policy_version 40160 (0.0005) +[2023-03-11 15:48:08,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9663.8). Total num frames: 20570112. Throughput: 0: 9599.6. Samples: 20566088. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:48:08,386][41256] Avg episode reward: [(0, '34.766')] +[2023-03-11 15:48:11,604][41544] Updated weights for policy 0, policy_version 40240 (0.0005) +[2023-03-11 15:48:13,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9663.8). Total num frames: 20619264. Throughput: 0: 9598.2. Samples: 20595092. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:48:13,386][41256] Avg episode reward: [(0, '37.816')] +[2023-03-11 15:48:15,752][41544] Updated weights for policy 0, policy_version 40320 (0.0005) +[2023-03-11 15:48:18,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9663.8). Total num frames: 20668416. Throughput: 0: 9650.4. Samples: 20654980. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:48:18,386][41256] Avg episode reward: [(0, '40.225')] +[2023-03-11 15:48:18,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000040368_20668416.pth... 
+[2023-03-11 15:48:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000039800_20377600.pth +[2023-03-11 15:48:20,060][41544] Updated weights for policy 0, policy_version 40400 (0.0005) +[2023-03-11 15:48:23,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9677.7). Total num frames: 20717568. Throughput: 0: 9624.2. Samples: 20712392. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:48:23,386][41256] Avg episode reward: [(0, '41.386')] +[2023-03-11 15:48:24,259][41544] Updated weights for policy 0, policy_version 40480 (0.0005) +[2023-03-11 15:48:28,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9663.8). Total num frames: 20762624. Throughput: 0: 9621.0. Samples: 20740928. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:48:28,386][41256] Avg episode reward: [(0, '41.370')] +[2023-03-11 15:48:28,601][41544] Updated weights for policy 0, policy_version 40560 (0.0005) +[2023-03-11 15:48:32,864][41544] Updated weights for policy 0, policy_version 40640 (0.0005) +[2023-03-11 15:48:33,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9625.6, 300 sec: 9663.8). Total num frames: 20811776. Throughput: 0: 9618.5. Samples: 20798344. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:48:33,386][41256] Avg episode reward: [(0, '40.678')] +[2023-03-11 15:48:33,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000040648_20811776.pth... +[2023-03-11 15:48:33,393][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000040088_20525056.pth +[2023-03-11 15:48:37,035][41544] Updated weights for policy 0, policy_version 40720 (0.0005) +[2023-03-11 15:48:38,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9677.7). Total num frames: 20860928. Throughput: 0: 9635.7. Samples: 20856840. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:48:38,396][41256] Avg episode reward: [(0, '41.532')] +[2023-03-11 15:48:41,174][41544] Updated weights for policy 0, policy_version 40800 (0.0005) +[2023-03-11 15:48:43,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9625.6, 300 sec: 9677.7). Total num frames: 20910080. Throughput: 0: 9655.7. Samples: 20886036. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 15:48:43,402][41256] Avg episode reward: [(0, '40.676')] +[2023-03-11 15:48:45,420][41544] Updated weights for policy 0, policy_version 40880 (0.0005) +[2023-03-11 15:48:48,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9677.7). Total num frames: 20955136. Throughput: 0: 9635.2. Samples: 20943748. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 15:48:48,396][41256] Avg episode reward: [(0, '40.439')] +[2023-03-11 15:48:48,428][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000040936_20959232.pth... +[2023-03-11 15:48:48,430][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000040368_20668416.pth +[2023-03-11 15:48:49,718][41544] Updated weights for policy 0, policy_version 40960 (0.0005) +[2023-03-11 15:48:53,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9677.7). Total num frames: 21004288. Throughput: 0: 9702.8. Samples: 21002716. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 15:48:53,396][41256] Avg episode reward: [(0, '41.075')] +[2023-03-11 15:48:53,845][41544] Updated weights for policy 0, policy_version 41040 (0.0005) +[2023-03-11 15:48:58,039][41544] Updated weights for policy 0, policy_version 41120 (0.0005) +[2023-03-11 15:48:58,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9625.6, 300 sec: 9691.6). Total num frames: 21053440. Throughput: 0: 9713.2. Samples: 21032184. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 15:48:58,386][41256] Avg episode reward: [(0, '39.494')] +[2023-03-11 15:49:02,174][41544] Updated weights for policy 0, policy_version 41200 (0.0005) +[2023-03-11 15:49:03,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9705.4). Total num frames: 21102592. Throughput: 0: 9685.4. Samples: 21090820. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 15:49:03,396][41256] Avg episode reward: [(0, '39.777')] +[2023-03-11 15:49:03,399][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000041224_21106688.pth... +[2023-03-11 15:49:03,401][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000040648_20811776.pth +[2023-03-11 15:49:06,217][41544] Updated weights for policy 0, policy_version 41280 (0.0005) +[2023-03-11 15:49:08,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9762.1, 300 sec: 9719.3). Total num frames: 21155840. Throughput: 0: 9763.7. Samples: 21151760. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 15:49:08,386][41256] Avg episode reward: [(0, '39.249')] +[2023-03-11 15:49:10,237][41544] Updated weights for policy 0, policy_version 41360 (0.0005) +[2023-03-11 15:49:13,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9762.1, 300 sec: 9733.2). Total num frames: 21204992. Throughput: 0: 9812.6. Samples: 21182496. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 15:49:13,386][41256] Avg episode reward: [(0, '39.708')] +[2023-03-11 15:49:14,187][41544] Updated weights for policy 0, policy_version 41440 (0.0004) +[2023-03-11 15:49:18,094][41544] Updated weights for policy 0, policy_version 41520 (0.0004) +[2023-03-11 15:49:18,386][41256] Fps is (10 sec: 10239.9, 60 sec: 9830.4, 300 sec: 9747.1). Total num frames: 21258240. Throughput: 0: 9933.3. Samples: 21245344. 
Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 15:49:18,386][41256] Avg episode reward: [(0, '39.944')] +[2023-03-11 15:49:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000041520_21258240.pth... +[2023-03-11 15:49:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000040936_20959232.pth +[2023-03-11 15:49:21,848][41544] Updated weights for policy 0, policy_version 41600 (0.0003) +[2023-03-11 15:49:23,385][41256] Fps is (10 sec: 10649.6, 60 sec: 9898.7, 300 sec: 9774.9). Total num frames: 21311488. Throughput: 0: 10062.1. Samples: 21309632. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 15:49:23,386][41256] Avg episode reward: [(0, '38.110')] +[2023-03-11 15:49:25,767][41544] Updated weights for policy 0, policy_version 41680 (0.0005) +[2023-03-11 15:49:28,385][41256] Fps is (10 sec: 10649.6, 60 sec: 10035.2, 300 sec: 9788.7). Total num frames: 21364736. Throughput: 0: 10101.4. Samples: 21340600. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 15:49:28,386][41256] Avg episode reward: [(0, '40.482')] +[2023-03-11 15:49:29,715][41544] Updated weights for policy 0, policy_version 41760 (0.0005) +[2023-03-11 15:49:33,386][41256] Fps is (10 sec: 10649.5, 60 sec: 10103.5, 300 sec: 9802.6). Total num frames: 21417984. Throughput: 0: 10189.4. Samples: 21402272. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:49:33,386][41256] Avg episode reward: [(0, '43.092')] +[2023-03-11 15:49:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000041832_21417984.pth... 
+[2023-03-11 15:49:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000041224_21106688.pth +[2023-03-11 15:49:33,766][41544] Updated weights for policy 0, policy_version 41840 (0.0005) +[2023-03-11 15:49:37,867][41544] Updated weights for policy 0, policy_version 41920 (0.0005) +[2023-03-11 15:49:38,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 9816.5). Total num frames: 21467136. Throughput: 0: 10229.4. Samples: 21463040. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:49:38,386][41256] Avg episode reward: [(0, '44.570')] +[2023-03-11 15:49:41,858][41544] Updated weights for policy 0, policy_version 42000 (0.0005) +[2023-03-11 15:49:43,385][41256] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 9816.5). Total num frames: 21516288. Throughput: 0: 10246.0. Samples: 21493252. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:49:43,386][41256] Avg episode reward: [(0, '44.006')] +[2023-03-11 15:49:45,890][41544] Updated weights for policy 0, policy_version 42080 (0.0004) +[2023-03-11 15:49:48,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 9830.4). Total num frames: 21569536. Throughput: 0: 10300.6. Samples: 21554348. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:49:48,386][41256] Avg episode reward: [(0, '43.409')] +[2023-03-11 15:49:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000042128_21569536.pth... +[2023-03-11 15:49:48,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000041520_21258240.pth +[2023-03-11 15:49:49,803][41544] Updated weights for policy 0, policy_version 42160 (0.0004) +[2023-03-11 15:49:53,385][41256] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 9830.4). Total num frames: 21618688. Throughput: 0: 10286.6. Samples: 21614656. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:49:53,386][41256] Avg episode reward: [(0, '46.596')] +[2023-03-11 15:49:54,167][41544] Updated weights for policy 0, policy_version 42240 (0.0005) +[2023-03-11 15:49:58,385][41256] Fps is (10 sec: 9420.8, 60 sec: 10171.7, 300 sec: 9830.4). Total num frames: 21663744. Throughput: 0: 10239.3. Samples: 21643264. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:49:58,386][41256] Avg episode reward: [(0, '48.642')] +[2023-03-11 15:49:58,465][41544] Updated weights for policy 0, policy_version 42320 (0.0005) +[2023-03-11 15:50:02,724][41544] Updated weights for policy 0, policy_version 42400 (0.0005) +[2023-03-11 15:50:03,385][41256] Fps is (10 sec: 9420.7, 60 sec: 10171.7, 300 sec: 9830.4). Total num frames: 21712896. Throughput: 0: 10118.4. Samples: 21700672. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:50:03,386][41256] Avg episode reward: [(0, '48.518')] +[2023-03-11 15:50:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000042408_21712896.pth... +[2023-03-11 15:50:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000041832_21417984.pth +[2023-03-11 15:50:07,073][41544] Updated weights for policy 0, policy_version 42480 (0.0005) +[2023-03-11 15:50:08,385][41256] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 9830.4). Total num frames: 21762048. Throughput: 0: 9948.6. Samples: 21757320. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:50:08,386][41256] Avg episode reward: [(0, '48.755')] +[2023-03-11 15:50:11,360][41544] Updated weights for policy 0, policy_version 42560 (0.0005) +[2023-03-11 15:50:13,386][41256] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 9816.5). Total num frames: 21811200. Throughput: 0: 9883.8. Samples: 21785372. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:50:13,386][41256] Avg episode reward: [(0, '47.692')] +[2023-03-11 15:50:15,405][41544] Updated weights for policy 0, policy_version 42640 (0.0004) +[2023-03-11 15:50:18,385][41256] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 9816.5). Total num frames: 21860352. Throughput: 0: 9858.5. Samples: 21845904. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:50:18,386][41256] Avg episode reward: [(0, '38.555')] +[2023-03-11 15:50:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000042696_21860352.pth... +[2023-03-11 15:50:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000042128_21569536.pth +[2023-03-11 15:50:19,516][41544] Updated weights for policy 0, policy_version 42720 (0.0005) +[2023-03-11 15:50:23,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9966.9, 300 sec: 9830.4). Total num frames: 21909504. Throughput: 0: 9863.4. Samples: 21906892. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:50:23,386][41256] Avg episode reward: [(0, '44.036')] +[2023-03-11 15:50:23,464][41544] Updated weights for policy 0, policy_version 42800 (0.0004) +[2023-03-11 15:50:27,456][41544] Updated weights for policy 0, policy_version 42880 (0.0004) +[2023-03-11 15:50:28,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 9844.3). Total num frames: 21962752. Throughput: 0: 9888.6. Samples: 21938240. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 15:50:28,386][41256] Avg episode reward: [(0, '42.878')] +[2023-03-11 15:50:31,501][41544] Updated weights for policy 0, policy_version 42960 (0.0005) +[2023-03-11 15:50:33,385][41256] Fps is (10 sec: 10239.9, 60 sec: 9898.7, 300 sec: 9830.4). Total num frames: 22011904. Throughput: 0: 9886.3. Samples: 21999232. 
Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 15:50:33,386][41256] Avg episode reward: [(0, '44.082')] +[2023-03-11 15:50:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000042992_22011904.pth... +[2023-03-11 15:50:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000042408_21712896.pth +[2023-03-11 15:50:35,628][41544] Updated weights for policy 0, policy_version 43040 (0.0005) +[2023-03-11 15:50:38,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9816.5). Total num frames: 22061056. Throughput: 0: 9830.6. Samples: 22057032. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 15:50:38,386][41256] Avg episode reward: [(0, '47.681')] +[2023-03-11 15:50:39,948][41544] Updated weights for policy 0, policy_version 43120 (0.0005) +[2023-03-11 15:50:43,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9830.4, 300 sec: 9802.6). Total num frames: 22106112. Throughput: 0: 9831.8. Samples: 22085696. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 15:50:43,386][41256] Avg episode reward: [(0, '46.471')] +[2023-03-11 15:50:44,252][41544] Updated weights for policy 0, policy_version 43200 (0.0005) +[2023-03-11 15:50:48,385][41256] Fps is (10 sec: 9420.7, 60 sec: 9762.1, 300 sec: 9802.6). Total num frames: 22155264. Throughput: 0: 9834.3. Samples: 22143216. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 15:50:48,396][41256] Avg episode reward: [(0, '41.610')] +[2023-03-11 15:50:48,399][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000043272_22155264.pth... 
+[2023-03-11 15:50:48,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000042696_21860352.pth +[2023-03-11 15:50:48,580][41544] Updated weights for policy 0, policy_version 43280 (0.0005) +[2023-03-11 15:50:52,988][41544] Updated weights for policy 0, policy_version 43360 (0.0004) +[2023-03-11 15:50:53,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9788.7). Total num frames: 22200320. Throughput: 0: 9826.4. Samples: 22199508. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 15:50:53,396][41256] Avg episode reward: [(0, '43.507')] +[2023-03-11 15:50:57,371][41544] Updated weights for policy 0, policy_version 43440 (0.0005) +[2023-03-11 15:50:58,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9802.6). Total num frames: 22249472. Throughput: 0: 9826.9. Samples: 22227580. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 15:50:58,396][41256] Avg episode reward: [(0, '42.300')] +[2023-03-11 15:51:01,741][41544] Updated weights for policy 0, policy_version 43520 (0.0005) +[2023-03-11 15:51:03,385][41256] Fps is (10 sec: 9420.7, 60 sec: 9693.9, 300 sec: 9788.7). Total num frames: 22294528. Throughput: 0: 9713.9. Samples: 22283028. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 15:51:03,396][41256] Avg episode reward: [(0, '46.292')] +[2023-03-11 15:51:03,399][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000043544_22294528.pth... +[2023-03-11 15:51:03,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000042992_22011904.pth +[2023-03-11 15:51:05,930][41544] Updated weights for policy 0, policy_version 43600 (0.0005) +[2023-03-11 15:51:08,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9693.9, 300 sec: 9774.9). Total num frames: 22343680. Throughput: 0: 9670.1. Samples: 22342044. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 15:51:08,396][41256] Avg episode reward: [(0, '51.354')] +[2023-03-11 15:51:10,240][41544] Updated weights for policy 0, policy_version 43680 (0.0005) +[2023-03-11 15:51:13,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9774.9). Total num frames: 22392832. Throughput: 0: 9583.6. Samples: 22369500. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 15:51:13,396][41256] Avg episode reward: [(0, '58.199')] +[2023-03-11 15:51:14,732][41544] Updated weights for policy 0, policy_version 43760 (0.0005) +[2023-03-11 15:51:18,385][41256] Fps is (10 sec: 9420.7, 60 sec: 9625.6, 300 sec: 9761.0). Total num frames: 22437888. Throughput: 0: 9450.2. Samples: 22424492. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 15:51:18,396][41256] Avg episode reward: [(0, '61.229')] +[2023-03-11 15:51:18,399][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000043824_22437888.pth... +[2023-03-11 15:51:18,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000043272_22155264.pth +[2023-03-11 15:51:19,164][41544] Updated weights for policy 0, policy_version 43840 (0.0005) +[2023-03-11 15:51:23,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9557.3, 300 sec: 9747.1). Total num frames: 22482944. Throughput: 0: 9398.2. Samples: 22479952. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 15:51:23,386][41256] Avg episode reward: [(0, '59.802')] +[2023-03-11 15:51:23,560][41544] Updated weights for policy 0, policy_version 43920 (0.0005) +[2023-03-11 15:51:27,747][41544] Updated weights for policy 0, policy_version 44000 (0.0005) +[2023-03-11 15:51:28,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9733.2). Total num frames: 22532096. Throughput: 0: 9392.4. Samples: 22508352. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 15:51:28,386][41256] Avg episode reward: [(0, '57.466')] +[2023-03-11 15:51:32,023][41544] Updated weights for policy 0, policy_version 44080 (0.0005) +[2023-03-11 15:51:33,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9489.1, 300 sec: 9747.1). Total num frames: 22581248. Throughput: 0: 9428.2. Samples: 22567484. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 15:51:33,386][41256] Avg episode reward: [(0, '48.134')] +[2023-03-11 15:51:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000044104_22581248.pth... +[2023-03-11 15:51:33,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000043544_22294528.pth +[2023-03-11 15:51:36,281][41544] Updated weights for policy 0, policy_version 44160 (0.0004) +[2023-03-11 15:51:38,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9733.2). Total num frames: 22626304. Throughput: 0: 9459.4. Samples: 22625180. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 15:51:38,386][41256] Avg episode reward: [(0, '45.099')] +[2023-03-11 15:51:40,584][41544] Updated weights for policy 0, policy_version 44240 (0.0005) +[2023-03-11 15:51:43,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9733.2). Total num frames: 22675456. Throughput: 0: 9461.8. Samples: 22653360. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 15:51:43,386][41256] Avg episode reward: [(0, '46.458')] +[2023-03-11 15:51:44,931][41544] Updated weights for policy 0, policy_version 44320 (0.0004) +[2023-03-11 15:51:48,386][41256] Fps is (10 sec: 9830.3, 60 sec: 9489.1, 300 sec: 9747.1). Total num frames: 22724608. Throughput: 0: 9498.3. Samples: 22710452. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 15:51:48,386][41256] Avg episode reward: [(0, '49.186')] +[2023-03-11 15:51:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000044384_22724608.pth... +[2023-03-11 15:51:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000043824_22437888.pth +[2023-03-11 15:51:49,213][41544] Updated weights for policy 0, policy_version 44400 (0.0005) +[2023-03-11 15:51:53,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9733.2). Total num frames: 22769664. Throughput: 0: 9446.9. Samples: 22767156. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 15:51:53,386][41256] Avg episode reward: [(0, '44.255')] +[2023-03-11 15:51:53,582][41544] Updated weights for policy 0, policy_version 44480 (0.0005) +[2023-03-11 15:51:57,983][41544] Updated weights for policy 0, policy_version 44560 (0.0005) +[2023-03-11 15:51:58,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9420.8, 300 sec: 9733.2). Total num frames: 22814720. Throughput: 0: 9450.1. Samples: 22794752. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 15:51:58,386][41256] Avg episode reward: [(0, '41.894')] +[2023-03-11 15:52:02,461][41544] Updated weights for policy 0, policy_version 44640 (0.0005) +[2023-03-11 15:52:03,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9733.2). Total num frames: 22863872. Throughput: 0: 9461.3. Samples: 22850252. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 15:52:03,386][41256] Avg episode reward: [(0, '44.177')] +[2023-03-11 15:52:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000044656_22863872.pth... 
+[2023-03-11 15:52:03,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000044104_22581248.pth +[2023-03-11 15:52:06,947][41544] Updated weights for policy 0, policy_version 44720 (0.0005) +[2023-03-11 15:52:08,385][41256] Fps is (10 sec: 9420.7, 60 sec: 9420.8, 300 sec: 9719.3). Total num frames: 22908928. Throughput: 0: 9443.4. Samples: 22904904. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 15:52:08,386][41256] Avg episode reward: [(0, '47.092')] +[2023-03-11 15:52:11,388][41544] Updated weights for policy 0, policy_version 44800 (0.0005) +[2023-03-11 15:52:13,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9719.3). Total num frames: 22953984. Throughput: 0: 9427.8. Samples: 22932604. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 15:52:13,386][41256] Avg episode reward: [(0, '55.593')] +[2023-03-11 15:52:15,793][41544] Updated weights for policy 0, policy_version 44880 (0.0005) +[2023-03-11 15:52:18,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9705.4). Total num frames: 22999040. Throughput: 0: 9336.6. Samples: 22987632. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:52:18,386][41256] Avg episode reward: [(0, '52.634')] +[2023-03-11 15:52:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000044920_22999040.pth... +[2023-03-11 15:52:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000044384_22724608.pth +[2023-03-11 15:52:20,313][41544] Updated weights for policy 0, policy_version 44960 (0.0005) +[2023-03-11 15:52:23,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9705.4). Total num frames: 23048192. Throughput: 0: 9297.6. Samples: 23043572. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:52:23,386][41256] Avg episode reward: [(0, '54.857')] +[2023-03-11 15:52:24,658][41544] Updated weights for policy 0, policy_version 45040 (0.0005) +[2023-03-11 15:52:28,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9420.8, 300 sec: 9705.4). Total num frames: 23097344. Throughput: 0: 9304.1. Samples: 23072044. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:52:28,386][41256] Avg episode reward: [(0, '57.540')] +[2023-03-11 15:52:28,808][41544] Updated weights for policy 0, policy_version 45120 (0.0004) +[2023-03-11 15:52:32,913][41544] Updated weights for policy 0, policy_version 45200 (0.0004) +[2023-03-11 15:52:33,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9420.8, 300 sec: 9705.4). Total num frames: 23146496. Throughput: 0: 9352.4. Samples: 23131308. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:52:33,386][41256] Avg episode reward: [(0, '54.976')] +[2023-03-11 15:52:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000045208_23146496.pth... +[2023-03-11 15:52:33,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000044656_22863872.pth +[2023-03-11 15:52:37,196][41544] Updated weights for policy 0, policy_version 45280 (0.0004) +[2023-03-11 15:52:38,385][41256] Fps is (10 sec: 9420.7, 60 sec: 9420.8, 300 sec: 9691.6). Total num frames: 23191552. Throughput: 0: 9379.2. Samples: 23189220. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:52:38,386][41256] Avg episode reward: [(0, '50.076')] +[2023-03-11 15:52:41,783][41544] Updated weights for policy 0, policy_version 45360 (0.0005) +[2023-03-11 15:52:43,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9691.6). Total num frames: 23236608. Throughput: 0: 9364.1. Samples: 23216136. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:52:43,386][41256] Avg episode reward: [(0, '54.158')] +[2023-03-11 15:52:46,279][41544] Updated weights for policy 0, policy_version 45440 (0.0005) +[2023-03-11 15:52:48,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9677.7). Total num frames: 23281664. Throughput: 0: 9336.1. Samples: 23270376. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:52:48,386][41256] Avg episode reward: [(0, '55.420')] +[2023-03-11 15:52:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000045472_23281664.pth... +[2023-03-11 15:52:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000044920_22999040.pth +[2023-03-11 15:52:50,795][41544] Updated weights for policy 0, policy_version 45520 (0.0005) +[2023-03-11 15:52:53,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9284.3, 300 sec: 9663.8). Total num frames: 23326720. Throughput: 0: 9334.1. Samples: 23324940. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:52:53,386][41256] Avg episode reward: [(0, '53.821')] +[2023-03-11 15:52:55,235][41544] Updated weights for policy 0, policy_version 45600 (0.0005) +[2023-03-11 15:52:58,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9677.7). Total num frames: 23375872. Throughput: 0: 9330.5. Samples: 23352476. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:52:58,386][41256] Avg episode reward: [(0, '51.166')] +[2023-03-11 15:52:59,641][41544] Updated weights for policy 0, policy_version 45680 (0.0005) +[2023-03-11 15:53:03,385][41256] Fps is (10 sec: 9420.7, 60 sec: 9284.3, 300 sec: 9663.8). Total num frames: 23420928. Throughput: 0: 9358.8. Samples: 23408776. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:53:03,386][41256] Avg episode reward: [(0, '51.587')] +[2023-03-11 15:53:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000045744_23420928.pth... +[2023-03-11 15:53:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000045208_23146496.pth +[2023-03-11 15:53:03,963][41544] Updated weights for policy 0, policy_version 45760 (0.0005) +[2023-03-11 15:53:08,378][41544] Updated weights for policy 0, policy_version 45840 (0.0005) +[2023-03-11 15:53:08,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9663.8). Total num frames: 23470080. Throughput: 0: 9375.9. Samples: 23465488. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:53:08,386][41256] Avg episode reward: [(0, '51.935')] +[2023-03-11 15:53:12,688][41544] Updated weights for policy 0, policy_version 45920 (0.0005) +[2023-03-11 15:53:13,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9649.9). Total num frames: 23515136. Throughput: 0: 9348.5. Samples: 23492728. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:53:13,386][41256] Avg episode reward: [(0, '51.916')] +[2023-03-11 15:53:17,015][41544] Updated weights for policy 0, policy_version 46000 (0.0005) +[2023-03-11 15:53:18,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9649.9). Total num frames: 23564288. Throughput: 0: 9317.1. Samples: 23550576. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:53:18,386][41256] Avg episode reward: [(0, '49.608')] +[2023-03-11 15:53:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000046024_23564288.pth... 
+[2023-03-11 15:53:18,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000045472_23281664.pth +[2023-03-11 15:53:21,250][41544] Updated weights for policy 0, policy_version 46080 (0.0005) +[2023-03-11 15:53:23,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9420.8, 300 sec: 9663.8). Total num frames: 23613440. Throughput: 0: 9330.5. Samples: 23609092. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:53:23,386][41256] Avg episode reward: [(0, '58.579')] +[2023-03-11 15:53:25,342][41544] Updated weights for policy 0, policy_version 46160 (0.0004) +[2023-03-11 15:53:28,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9420.8, 300 sec: 9663.8). Total num frames: 23662592. Throughput: 0: 9386.1. Samples: 23638512. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:53:28,386][41256] Avg episode reward: [(0, '56.213')] +[2023-03-11 15:53:29,564][41544] Updated weights for policy 0, policy_version 46240 (0.0005) +[2023-03-11 15:53:33,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9649.9). Total num frames: 23707648. Throughput: 0: 9469.0. Samples: 23696480. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:53:33,386][41256] Avg episode reward: [(0, '64.295')] +[2023-03-11 15:53:33,388][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000046304_23707648.pth... +[2023-03-11 15:53:33,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000045744_23420928.pth +[2023-03-11 15:53:33,912][41544] Updated weights for policy 0, policy_version 46320 (0.0005) +[2023-03-11 15:53:38,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9636.0). Total num frames: 23752704. Throughput: 0: 9479.8. Samples: 23751532. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:53:38,386][41256] Avg episode reward: [(0, '68.241')] +[2023-03-11 15:53:38,415][41544] Updated weights for policy 0, policy_version 46400 (0.0005) +[2023-03-11 15:53:42,562][41544] Updated weights for policy 0, policy_version 46480 (0.0005) +[2023-03-11 15:53:43,386][41256] Fps is (10 sec: 9830.3, 60 sec: 9489.1, 300 sec: 9663.8). Total num frames: 23805952. Throughput: 0: 9513.5. Samples: 23780584. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:53:43,386][41256] Avg episode reward: [(0, '66.699')] +[2023-03-11 15:53:46,669][41544] Updated weights for policy 0, policy_version 46560 (0.0005) +[2023-03-11 15:53:48,386][41256] Fps is (10 sec: 10239.8, 60 sec: 9557.3, 300 sec: 9663.8). Total num frames: 23855104. Throughput: 0: 9584.7. Samples: 23840088. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 15:53:48,386][41256] Avg episode reward: [(0, '61.532')] +[2023-03-11 15:53:48,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000046592_23855104.pth... +[2023-03-11 15:53:48,393][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000046024_23564288.pth +[2023-03-11 15:53:51,124][41544] Updated weights for policy 0, policy_version 46640 (0.0005) +[2023-03-11 15:53:53,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9557.3, 300 sec: 9649.9). Total num frames: 23900160. Throughput: 0: 9560.2. Samples: 23895696. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 15:53:53,386][41256] Avg episode reward: [(0, '65.533')] +[2023-03-11 15:53:55,654][41544] Updated weights for policy 0, policy_version 46720 (0.0005) +[2023-03-11 15:53:58,385][41256] Fps is (10 sec: 9011.4, 60 sec: 9489.1, 300 sec: 9636.0). Total num frames: 23945216. Throughput: 0: 9548.1. Samples: 23922392. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 15:53:58,386][41256] Avg episode reward: [(0, '63.001')] +[2023-03-11 15:54:00,187][41544] Updated weights for policy 0, policy_version 46800 (0.0005) +[2023-03-11 15:54:03,386][41256] Fps is (10 sec: 9011.2, 60 sec: 9489.1, 300 sec: 9608.2). Total num frames: 23990272. Throughput: 0: 9469.2. Samples: 23976692. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 15:54:03,386][41256] Avg episode reward: [(0, '67.110')] +[2023-03-11 15:54:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000046856_23990272.pth... +[2023-03-11 15:54:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000046304_23707648.pth +[2023-03-11 15:54:04,643][41544] Updated weights for policy 0, policy_version 46880 (0.0005) +[2023-03-11 15:54:08,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9420.8, 300 sec: 9594.4). Total num frames: 24035328. Throughput: 0: 9383.0. Samples: 24031328. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 15:54:08,386][41256] Avg episode reward: [(0, '69.984')] +[2023-03-11 15:54:09,144][41544] Updated weights for policy 0, policy_version 46960 (0.0006) +[2023-03-11 15:54:13,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9420.8, 300 sec: 9566.6). Total num frames: 24080384. Throughput: 0: 9362.3. Samples: 24059816. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 15:54:13,386][41256] Avg episode reward: [(0, '73.827')] +[2023-03-11 15:54:13,578][41544] Updated weights for policy 0, policy_version 47040 (0.0004) +[2023-03-11 15:54:17,869][41544] Updated weights for policy 0, policy_version 47120 (0.0003) +[2023-03-11 15:54:18,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9552.7). Total num frames: 24129536. Throughput: 0: 9320.8. Samples: 24115916. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 15:54:18,386][41256] Avg episode reward: [(0, '73.972')] +[2023-03-11 15:54:18,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000047128_24129536.pth... +[2023-03-11 15:54:18,393][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000046592_23855104.pth +[2023-03-11 15:54:22,335][41544] Updated weights for policy 0, policy_version 47200 (0.0005) +[2023-03-11 15:54:23,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9524.9). Total num frames: 24174592. Throughput: 0: 9312.9. Samples: 24170612. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 15:54:23,386][41256] Avg episode reward: [(0, '74.532')] +[2023-03-11 15:54:26,900][41544] Updated weights for policy 0, policy_version 47280 (0.0005) +[2023-03-11 15:54:28,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9497.2). Total num frames: 24219648. Throughput: 0: 9274.1. Samples: 24197916. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 15:54:28,386][41256] Avg episode reward: [(0, '70.843')] +[2023-03-11 15:54:31,420][41544] Updated weights for policy 0, policy_version 47360 (0.0005) +[2023-03-11 15:54:33,386][41256] Fps is (10 sec: 9011.1, 60 sec: 9284.3, 300 sec: 9483.3). Total num frames: 24264704. Throughput: 0: 9163.0. Samples: 24252424. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 15:54:33,396][41256] Avg episode reward: [(0, '70.057')] +[2023-03-11 15:54:33,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000047392_24264704.pth... 
+[2023-03-11 15:54:33,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000046856_23990272.pth +[2023-03-11 15:54:35,840][41544] Updated weights for policy 0, policy_version 47440 (0.0006) +[2023-03-11 15:54:38,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9469.4). Total num frames: 24309760. Throughput: 0: 9160.2. Samples: 24307904. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 15:54:38,397][41256] Avg episode reward: [(0, '73.585')] +[2023-03-11 15:54:40,200][41544] Updated weights for policy 0, policy_version 47520 (0.0006) +[2023-03-11 15:54:43,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9216.0, 300 sec: 9455.5). Total num frames: 24358912. Throughput: 0: 9198.5. Samples: 24336324. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 15:54:43,396][41256] Avg episode reward: [(0, '71.578')] +[2023-03-11 15:54:44,585][41544] Updated weights for policy 0, policy_version 47600 (0.0005) +[2023-03-11 15:54:48,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9147.8, 300 sec: 9441.6). Total num frames: 24403968. Throughput: 0: 9224.4. Samples: 24391788. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 15:54:48,386][41256] Avg episode reward: [(0, '75.665')] +[2023-03-11 15:54:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000047664_24403968.pth... +[2023-03-11 15:54:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000047128_24129536.pth +[2023-03-11 15:54:48,955][41544] Updated weights for policy 0, policy_version 47680 (0.0005) +[2023-03-11 15:54:53,367][41544] Updated weights for policy 0, policy_version 47760 (0.0006) +[2023-03-11 15:54:53,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9455.5). Total num frames: 24453120. Throughput: 0: 9274.0. Samples: 24448656. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 15:54:53,396][41256] Avg episode reward: [(0, '67.950')] +[2023-03-11 15:54:57,764][41544] Updated weights for policy 0, policy_version 47840 (0.0006) +[2023-03-11 15:54:58,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9216.0, 300 sec: 9441.6). Total num frames: 24498176. Throughput: 0: 9250.2. Samples: 24476076. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 15:54:58,386][41256] Avg episode reward: [(0, '69.023')] +[2023-03-11 15:55:02,085][41544] Updated weights for policy 0, policy_version 47920 (0.0005) +[2023-03-11 15:55:03,386][41256] Fps is (10 sec: 9011.1, 60 sec: 9216.0, 300 sec: 9427.7). Total num frames: 24543232. Throughput: 0: 9258.7. Samples: 24532560. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 15:55:03,386][41256] Avg episode reward: [(0, '61.419')] +[2023-03-11 15:55:03,415][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000047944_24547328.pth... +[2023-03-11 15:55:03,417][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000047392_24264704.pth +[2023-03-11 15:55:06,456][41544] Updated weights for policy 0, policy_version 48000 (0.0006) +[2023-03-11 15:55:08,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9427.7). Total num frames: 24592384. Throughput: 0: 9291.5. Samples: 24588732. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 15:55:08,386][41256] Avg episode reward: [(0, '66.239')] +[2023-03-11 15:55:10,945][41544] Updated weights for policy 0, policy_version 48080 (0.0005) +[2023-03-11 15:55:13,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9284.3, 300 sec: 9413.9). Total num frames: 24637440. Throughput: 0: 9298.4. Samples: 24616344. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 15:55:13,386][41256] Avg episode reward: [(0, '77.730')] +[2023-03-11 15:55:15,360][41544] Updated weights for policy 0, policy_version 48160 (0.0005) +[2023-03-11 15:55:18,386][41256] Fps is (10 sec: 9011.1, 60 sec: 9216.0, 300 sec: 9400.0). Total num frames: 24682496. Throughput: 0: 9318.8. Samples: 24671772. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:55:18,386][41256] Avg episode reward: [(0, '73.747')] +[2023-03-11 15:55:18,424][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000048216_24686592.pth... +[2023-03-11 15:55:18,426][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000047664_24403968.pth +[2023-03-11 15:55:19,696][41544] Updated weights for policy 0, policy_version 48240 (0.0005) +[2023-03-11 15:55:23,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9386.1). Total num frames: 24731648. Throughput: 0: 9342.0. Samples: 24728292. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:55:23,386][41256] Avg episode reward: [(0, '72.603')] +[2023-03-11 15:55:24,050][41544] Updated weights for policy 0, policy_version 48320 (0.0005) +[2023-03-11 15:55:28,336][41544] Updated weights for policy 0, policy_version 48400 (0.0005) +[2023-03-11 15:55:28,385][41256] Fps is (10 sec: 9830.6, 60 sec: 9352.5, 300 sec: 9386.1). Total num frames: 24780800. Throughput: 0: 9346.6. Samples: 24756920. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:55:28,386][41256] Avg episode reward: [(0, '71.390')] +[2023-03-11 15:55:32,747][41544] Updated weights for policy 0, policy_version 48480 (0.0006) +[2023-03-11 15:55:33,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9372.2). Total num frames: 24825856. Throughput: 0: 9373.1. Samples: 24813576. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:55:33,386][41256] Avg episode reward: [(0, '75.261')] +[2023-03-11 15:55:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000048488_24825856.pth... +[2023-03-11 15:55:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000047944_24547328.pth +[2023-03-11 15:55:37,187][41544] Updated weights for policy 0, policy_version 48560 (0.0006) +[2023-03-11 15:55:38,385][41256] Fps is (10 sec: 9011.1, 60 sec: 9352.5, 300 sec: 9372.2). Total num frames: 24870912. Throughput: 0: 9337.6. Samples: 24868848. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:55:38,386][41256] Avg episode reward: [(0, '67.048')] +[2023-03-11 15:55:41,624][41544] Updated weights for policy 0, policy_version 48640 (0.0006) +[2023-03-11 15:55:43,386][41256] Fps is (10 sec: 9011.1, 60 sec: 9284.2, 300 sec: 9358.3). Total num frames: 24915968. Throughput: 0: 9337.3. Samples: 24896256. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:55:43,386][41256] Avg episode reward: [(0, '74.625')] +[2023-03-11 15:55:46,050][41544] Updated weights for policy 0, policy_version 48720 (0.0005) +[2023-03-11 15:55:48,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9372.2). Total num frames: 24965120. Throughput: 0: 9327.4. Samples: 24952292. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:55:48,386][41256] Avg episode reward: [(0, '75.174')] +[2023-03-11 15:55:48,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000048760_24965120.pth... 
+[2023-03-11 15:55:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000048216_24686592.pth +[2023-03-11 15:55:50,524][41544] Updated weights for policy 0, policy_version 48800 (0.0005) +[2023-03-11 15:55:53,385][41256] Fps is (10 sec: 9421.0, 60 sec: 9284.3, 300 sec: 9358.3). Total num frames: 25010176. Throughput: 0: 9309.5. Samples: 25007660. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:55:53,386][41256] Avg episode reward: [(0, '70.563')] +[2023-03-11 15:55:54,813][41544] Updated weights for policy 0, policy_version 48880 (0.0005) +[2023-03-11 15:55:58,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9352.5, 300 sec: 9372.2). Total num frames: 25059328. Throughput: 0: 9334.2. Samples: 25036380. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:55:58,386][41256] Avg episode reward: [(0, '63.728')] +[2023-03-11 15:55:59,231][41544] Updated weights for policy 0, policy_version 48960 (0.0005) +[2023-03-11 15:56:03,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.6, 300 sec: 9358.3). Total num frames: 25104384. Throughput: 0: 9323.6. Samples: 25091332. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:56:03,386][41256] Avg episode reward: [(0, '69.832')] +[2023-03-11 15:56:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000049032_25104384.pth... +[2023-03-11 15:56:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000048488_24825856.pth +[2023-03-11 15:56:03,814][41544] Updated weights for policy 0, policy_version 49040 (0.0005) +[2023-03-11 15:56:08,088][41544] Updated weights for policy 0, policy_version 49120 (0.0005) +[2023-03-11 15:56:08,385][41256] Fps is (10 sec: 9011.1, 60 sec: 9284.3, 300 sec: 9344.4). Total num frames: 25149440. Throughput: 0: 9309.5. Samples: 25147220. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:56:08,386][41256] Avg episode reward: [(0, '69.459')] +[2023-03-11 15:56:12,425][41544] Updated weights for policy 0, policy_version 49200 (0.0005) +[2023-03-11 15:56:13,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9358.3). Total num frames: 25198592. Throughput: 0: 9308.4. Samples: 25175800. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:56:13,386][41256] Avg episode reward: [(0, '71.163')] +[2023-03-11 15:56:16,969][41544] Updated weights for policy 0, policy_version 49280 (0.0005) +[2023-03-11 15:56:18,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.6, 300 sec: 9358.3). Total num frames: 25243648. Throughput: 0: 9266.5. Samples: 25230568. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 15:56:18,386][41256] Avg episode reward: [(0, '81.975')] +[2023-03-11 15:56:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000049304_25243648.pth... +[2023-03-11 15:56:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000048760_24965120.pth +[2023-03-11 15:56:21,389][41544] Updated weights for policy 0, policy_version 49360 (0.0005) +[2023-03-11 15:56:23,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9344.4). Total num frames: 25288704. Throughput: 0: 9256.9. Samples: 25285408. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 15:56:23,386][41256] Avg episode reward: [(0, '79.129')] +[2023-03-11 15:56:25,929][41544] Updated weights for policy 0, policy_version 49440 (0.0006) +[2023-03-11 15:56:28,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9216.0, 300 sec: 9330.5). Total num frames: 25333760. Throughput: 0: 9257.3. Samples: 25312832. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 15:56:28,386][41256] Avg episode reward: [(0, '73.481')] +[2023-03-11 15:56:30,509][41544] Updated weights for policy 0, policy_version 49520 (0.0005) +[2023-03-11 15:56:33,386][41256] Fps is (10 sec: 9011.2, 60 sec: 9216.0, 300 sec: 9330.5). Total num frames: 25378816. Throughput: 0: 9206.7. Samples: 25366592. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 15:56:33,386][41256] Avg episode reward: [(0, '77.968')] +[2023-03-11 15:56:33,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000049568_25378816.pth... +[2023-03-11 15:56:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000049032_25104384.pth +[2023-03-11 15:56:34,977][41544] Updated weights for policy 0, policy_version 49600 (0.0005) +[2023-03-11 15:56:38,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9216.0, 300 sec: 9316.7). Total num frames: 25423872. Throughput: 0: 9190.7. Samples: 25421240. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 15:56:38,386][41256] Avg episode reward: [(0, '69.886')] +[2023-03-11 15:56:39,486][41544] Updated weights for policy 0, policy_version 49680 (0.0005) +[2023-03-11 15:56:43,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9216.0, 300 sec: 9302.8). Total num frames: 25468928. Throughput: 0: 9158.5. Samples: 25448512. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 15:56:43,386][41256] Avg episode reward: [(0, '79.009')] +[2023-03-11 15:56:43,971][41544] Updated weights for policy 0, policy_version 49760 (0.0005) +[2023-03-11 15:56:48,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9147.7, 300 sec: 9302.8). Total num frames: 25513984. Throughput: 0: 9145.3. Samples: 25502872. 
Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 15:56:48,386][41256] Avg episode reward: [(0, '76.824')] +[2023-03-11 15:56:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000049832_25513984.pth... +[2023-03-11 15:56:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000049304_25243648.pth +[2023-03-11 15:56:48,553][41544] Updated weights for policy 0, policy_version 49840 (0.0005) +[2023-03-11 15:56:52,967][41544] Updated weights for policy 0, policy_version 49920 (0.0005) +[2023-03-11 15:56:53,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9316.7). Total num frames: 25563136. Throughput: 0: 9126.9. Samples: 25557932. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 15:56:53,386][41256] Avg episode reward: [(0, '80.333')] +[2023-03-11 15:56:57,447][41544] Updated weights for policy 0, policy_version 50000 (0.0005) +[2023-03-11 15:56:58,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9147.7, 300 sec: 9302.8). Total num frames: 25608192. Throughput: 0: 9097.4. Samples: 25585184. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 15:56:58,386][41256] Avg episode reward: [(0, '84.402')] +[2023-03-11 15:57:01,992][41544] Updated weights for policy 0, policy_version 50080 (0.0005) +[2023-03-11 15:57:03,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9147.7, 300 sec: 9302.8). Total num frames: 25653248. Throughput: 0: 9096.4. Samples: 25639904. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 15:57:03,386][41256] Avg episode reward: [(0, '82.662')] +[2023-03-11 15:57:03,388][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000050104_25653248.pth... 
+[2023-03-11 15:57:03,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000049568_25378816.pth +[2023-03-11 15:57:06,510][41544] Updated weights for policy 0, policy_version 50160 (0.0005) +[2023-03-11 15:57:08,385][41256] Fps is (10 sec: 9011.1, 60 sec: 9147.7, 300 sec: 9302.8). Total num frames: 25698304. Throughput: 0: 9086.0. Samples: 25694280. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 15:57:08,396][41256] Avg episode reward: [(0, '81.267')] +[2023-03-11 15:57:10,997][41544] Updated weights for policy 0, policy_version 50240 (0.0005) +[2023-03-11 15:57:13,385][41256] Fps is (10 sec: 9011.1, 60 sec: 9079.5, 300 sec: 9302.8). Total num frames: 25743360. Throughput: 0: 9087.0. Samples: 25721748. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 15:57:13,404][41256] Avg episode reward: [(0, '81.817')] +[2023-03-11 15:57:15,479][41544] Updated weights for policy 0, policy_version 50320 (0.0006) +[2023-03-11 15:57:18,385][41256] Fps is (10 sec: 9011.1, 60 sec: 9079.5, 300 sec: 9288.9). Total num frames: 25788416. Throughput: 0: 9102.2. Samples: 25776192. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:57:18,386][41256] Avg episode reward: [(0, '78.129')] +[2023-03-11 15:57:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000050368_25788416.pth... +[2023-03-11 15:57:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000049832_25513984.pth +[2023-03-11 15:57:19,857][41544] Updated weights for policy 0, policy_version 50400 (0.0005) +[2023-03-11 15:57:23,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9147.7, 300 sec: 9288.9). Total num frames: 25837568. Throughput: 0: 9180.4. Samples: 25834356. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:57:23,386][41256] Avg episode reward: [(0, '80.115')] +[2023-03-11 15:57:23,981][41544] Updated weights for policy 0, policy_version 50480 (0.0005) +[2023-03-11 15:57:28,144][41544] Updated weights for policy 0, policy_version 50560 (0.0005) +[2023-03-11 15:57:28,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9216.0, 300 sec: 9288.9). Total num frames: 25886720. Throughput: 0: 9234.3. Samples: 25864056. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:57:28,386][41256] Avg episode reward: [(0, '73.774')] +[2023-03-11 15:57:32,435][41544] Updated weights for policy 0, policy_version 50640 (0.0005) +[2023-03-11 15:57:33,386][41256] Fps is (10 sec: 9830.3, 60 sec: 9284.3, 300 sec: 9302.8). Total num frames: 25935872. Throughput: 0: 9324.7. Samples: 25922484. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:57:33,386][41256] Avg episode reward: [(0, '57.262')] +[2023-03-11 15:57:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000050656_25935872.pth... +[2023-03-11 15:57:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000050104_25653248.pth +[2023-03-11 15:57:36,888][41544] Updated weights for policy 0, policy_version 50720 (0.0005) +[2023-03-11 15:57:38,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9302.8). Total num frames: 25980928. Throughput: 0: 9310.6. Samples: 25976908. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:57:38,390][41256] Avg episode reward: [(0, '63.113')] +[2023-03-11 15:57:41,460][41544] Updated weights for policy 0, policy_version 50800 (0.0005) +[2023-03-11 15:57:43,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9284.3, 300 sec: 9302.8). Total num frames: 26025984. Throughput: 0: 9314.1. Samples: 26004320. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:57:43,396][41256] Avg episode reward: [(0, '77.220')] +[2023-03-11 15:57:45,824][41544] Updated weights for policy 0, policy_version 50880 (0.0005) +[2023-03-11 15:57:48,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9302.8). Total num frames: 26071040. Throughput: 0: 9326.0. Samples: 26059576. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:57:48,396][41256] Avg episode reward: [(0, '68.074')] +[2023-03-11 15:57:48,398][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000050920_26071040.pth... +[2023-03-11 15:57:48,400][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000050368_25788416.pth +[2023-03-11 15:57:50,070][41544] Updated weights for policy 0, policy_version 50960 (0.0005) +[2023-03-11 15:57:53,385][41256] Fps is (10 sec: 9830.3, 60 sec: 9352.5, 300 sec: 9316.7). Total num frames: 26124288. Throughput: 0: 9449.1. Samples: 26119492. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 15:57:53,396][41256] Avg episode reward: [(0, '72.480')] +[2023-03-11 15:57:54,162][41544] Updated weights for policy 0, policy_version 51040 (0.0005) +[2023-03-11 15:57:58,325][41544] Updated weights for policy 0, policy_version 51120 (0.0005) +[2023-03-11 15:57:58,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9420.8, 300 sec: 9330.5). Total num frames: 26173440. Throughput: 0: 9492.9. Samples: 26148928. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 15:57:58,396][41256] Avg episode reward: [(0, '75.403')] +[2023-03-11 15:58:02,451][41544] Updated weights for policy 0, policy_version 51200 (0.0004) +[2023-03-11 15:58:03,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9489.1, 300 sec: 9330.5). Total num frames: 26222592. Throughput: 0: 9596.2. Samples: 26208020. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 15:58:03,396][41256] Avg episode reward: [(0, '70.946')] +[2023-03-11 15:58:03,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000051216_26222592.pth... +[2023-03-11 15:58:03,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000050656_25935872.pth +[2023-03-11 15:58:06,642][41544] Updated weights for policy 0, policy_version 51280 (0.0004) +[2023-03-11 15:58:08,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9557.3, 300 sec: 9344.4). Total num frames: 26271744. Throughput: 0: 9618.8. Samples: 26267204. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 15:58:08,386][41256] Avg episode reward: [(0, '77.746')] +[2023-03-11 15:58:10,865][41544] Updated weights for policy 0, policy_version 51360 (0.0004) +[2023-03-11 15:58:13,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9344.4). Total num frames: 26320896. Throughput: 0: 9606.0. Samples: 26296328. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 15:58:13,386][41256] Avg episode reward: [(0, '85.758')] +[2023-03-11 15:58:15,010][41544] Updated weights for policy 0, policy_version 51440 (0.0004) +[2023-03-11 15:58:18,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9344.4). Total num frames: 26370048. Throughput: 0: 9615.5. Samples: 26355180. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 15:58:18,386][41256] Avg episode reward: [(0, '82.163')] +[2023-03-11 15:58:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000051504_26370048.pth... 
+[2023-03-11 15:58:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000050920_26071040.pth +[2023-03-11 15:58:19,174][41544] Updated weights for policy 0, policy_version 51520 (0.0004) +[2023-03-11 15:58:23,322][41544] Updated weights for policy 0, policy_version 51600 (0.0004) +[2023-03-11 15:58:23,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9344.4). Total num frames: 26419200. Throughput: 0: 9732.1. Samples: 26414852. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:58:23,386][41256] Avg episode reward: [(0, '79.317')] +[2023-03-11 15:58:27,779][41544] Updated weights for policy 0, policy_version 51680 (0.0005) +[2023-03-11 15:58:28,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9344.4). Total num frames: 26464256. Throughput: 0: 9746.3. Samples: 26442904. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:58:28,386][41256] Avg episode reward: [(0, '80.062')] +[2023-03-11 15:58:32,283][41544] Updated weights for policy 0, policy_version 51760 (0.0005) +[2023-03-11 15:58:33,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9557.3, 300 sec: 9344.4). Total num frames: 26509312. Throughput: 0: 9722.7. Samples: 26497096. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:58:33,386][41256] Avg episode reward: [(0, '71.140')] +[2023-03-11 15:58:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000051776_26509312.pth... +[2023-03-11 15:58:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000051216_26222592.pth +[2023-03-11 15:58:36,668][41544] Updated weights for policy 0, policy_version 51840 (0.0005) +[2023-03-11 15:58:38,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9625.6, 300 sec: 9330.6). Total num frames: 26558464. Throughput: 0: 9653.4. Samples: 26553892. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:58:38,386][41256] Avg episode reward: [(0, '73.285')] +[2023-03-11 15:58:41,004][41544] Updated weights for policy 0, policy_version 51920 (0.0005) +[2023-03-11 15:58:43,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9316.7). Total num frames: 26603520. Throughput: 0: 9613.6. Samples: 26581540. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:58:43,386][41256] Avg episode reward: [(0, '67.992')] +[2023-03-11 15:58:45,100][41544] Updated weights for policy 0, policy_version 52000 (0.0005) +[2023-03-11 15:58:48,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9693.9, 300 sec: 9330.5). Total num frames: 26652672. Throughput: 0: 9616.3. Samples: 26640756. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:58:48,386][41256] Avg episode reward: [(0, '60.716')] +[2023-03-11 15:58:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000052056_26652672.pth... +[2023-03-11 15:58:48,390][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000051504_26370048.pth +[2023-03-11 15:58:49,251][41544] Updated weights for policy 0, policy_version 52080 (0.0005) +[2023-03-11 15:58:53,258][41544] Updated weights for policy 0, policy_version 52160 (0.0004) +[2023-03-11 15:58:53,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9693.9, 300 sec: 9358.3). Total num frames: 26705920. Throughput: 0: 9657.3. Samples: 26701784. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:58:53,386][41256] Avg episode reward: [(0, '62.516')] +[2023-03-11 15:58:57,334][41544] Updated weights for policy 0, policy_version 52240 (0.0004) +[2023-03-11 15:58:58,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9693.9, 300 sec: 9372.2). Total num frames: 26755072. Throughput: 0: 9669.9. Samples: 26731472. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 15:58:58,386][41256] Avg episode reward: [(0, '59.538')] +[2023-03-11 15:59:01,372][41544] Updated weights for policy 0, policy_version 52320 (0.0004) +[2023-03-11 15:59:03,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9386.1). Total num frames: 26804224. Throughput: 0: 9710.9. Samples: 26792172. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 15:59:03,386][41256] Avg episode reward: [(0, '55.641')] +[2023-03-11 15:59:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000052352_26804224.pth... +[2023-03-11 15:59:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000051776_26509312.pth +[2023-03-11 15:59:05,489][41544] Updated weights for policy 0, policy_version 52400 (0.0005) +[2023-03-11 15:59:08,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9762.1, 300 sec: 9413.9). Total num frames: 26857472. Throughput: 0: 9727.9. Samples: 26852608. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 15:59:08,386][41256] Avg episode reward: [(0, '59.848')] +[2023-03-11 15:59:09,688][41544] Updated weights for policy 0, policy_version 52480 (0.0004) +[2023-03-11 15:59:13,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9400.0). Total num frames: 26902528. Throughput: 0: 9733.9. Samples: 26880928. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 15:59:13,386][41256] Avg episode reward: [(0, '58.795')] +[2023-03-11 15:59:13,871][41544] Updated weights for policy 0, policy_version 52560 (0.0005) +[2023-03-11 15:59:17,954][41544] Updated weights for policy 0, policy_version 52640 (0.0004) +[2023-03-11 15:59:18,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9427.7). Total num frames: 26955776. Throughput: 0: 9847.0. Samples: 26940212. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 15:59:18,386][41256] Avg episode reward: [(0, '60.970')] +[2023-03-11 15:59:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000052648_26955776.pth... +[2023-03-11 15:59:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000052056_26652672.pth +[2023-03-11 15:59:22,021][41544] Updated weights for policy 0, policy_version 52720 (0.0004) +[2023-03-11 15:59:23,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9762.1, 300 sec: 9441.6). Total num frames: 27004928. Throughput: 0: 9933.4. Samples: 27000896. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 15:59:23,386][41256] Avg episode reward: [(0, '62.202')] +[2023-03-11 15:59:26,108][41544] Updated weights for policy 0, policy_version 52800 (0.0004) +[2023-03-11 15:59:28,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9830.4, 300 sec: 9455.5). Total num frames: 27054080. Throughput: 0: 9981.8. Samples: 27030720. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:59:28,386][41256] Avg episode reward: [(0, '73.045')] +[2023-03-11 15:59:30,216][41544] Updated weights for policy 0, policy_version 52880 (0.0004) +[2023-03-11 15:59:33,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9469.4). Total num frames: 27103232. Throughput: 0: 9990.4. Samples: 27090324. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:59:33,386][41256] Avg episode reward: [(0, '74.664')] +[2023-03-11 15:59:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000052936_27103232.pth... 
+[2023-03-11 15:59:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000052352_26804224.pth +[2023-03-11 15:59:34,439][41544] Updated weights for policy 0, policy_version 52960 (0.0005) +[2023-03-11 15:59:38,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9898.6, 300 sec: 9469.4). Total num frames: 27152384. Throughput: 0: 9922.5. Samples: 27148296. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:59:38,386][41256] Avg episode reward: [(0, '79.965')] +[2023-03-11 15:59:38,783][41544] Updated weights for policy 0, policy_version 53040 (0.0005) +[2023-03-11 15:59:43,287][41544] Updated weights for policy 0, policy_version 53120 (0.0006) +[2023-03-11 15:59:43,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9898.7, 300 sec: 9469.4). Total num frames: 27197440. Throughput: 0: 9866.8. Samples: 27175480. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:59:43,386][41256] Avg episode reward: [(0, '78.605')] +[2023-03-11 15:59:47,519][41544] Updated weights for policy 0, policy_version 53200 (0.0005) +[2023-03-11 15:59:48,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9898.7, 300 sec: 9469.4). Total num frames: 27246592. Throughput: 0: 9765.1. Samples: 27231604. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:59:48,386][41256] Avg episode reward: [(0, '76.750')] +[2023-03-11 15:59:48,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000053216_27246592.pth... +[2023-03-11 15:59:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000052648_26955776.pth +[2023-03-11 15:59:51,584][41544] Updated weights for policy 0, policy_version 53280 (0.0004) +[2023-03-11 15:59:53,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9483.3). Total num frames: 27295744. Throughput: 0: 9762.3. Samples: 27291912. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:59:53,386][41256] Avg episode reward: [(0, '79.086')] +[2023-03-11 15:59:55,839][41544] Updated weights for policy 0, policy_version 53360 (0.0005) +[2023-03-11 15:59:58,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9830.4, 300 sec: 9497.2). Total num frames: 27344896. Throughput: 0: 9765.7. Samples: 27320384. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 15:59:58,386][41256] Avg episode reward: [(0, '81.277')] +[2023-03-11 15:59:59,991][41544] Updated weights for policy 0, policy_version 53440 (0.0004) +[2023-03-11 16:00:03,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9483.3). Total num frames: 27389952. Throughput: 0: 9734.8. Samples: 27378276. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 16:00:03,386][41256] Avg episode reward: [(0, '76.892')] +[2023-03-11 16:00:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000053496_27389952.pth... +[2023-03-11 16:00:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000052936_27103232.pth +[2023-03-11 16:00:04,364][41544] Updated weights for policy 0, policy_version 53520 (0.0005) +[2023-03-11 16:00:08,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9497.2). Total num frames: 27439104. Throughput: 0: 9646.9. Samples: 27435008. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 16:00:08,386][41256] Avg episode reward: [(0, '78.584')] +[2023-03-11 16:00:08,776][41544] Updated weights for policy 0, policy_version 53600 (0.0005) +[2023-03-11 16:00:13,208][41544] Updated weights for policy 0, policy_version 53680 (0.0005) +[2023-03-11 16:00:13,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9497.2). Total num frames: 27484160. Throughput: 0: 9594.7. Samples: 27462480. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 16:00:13,386][41256] Avg episode reward: [(0, '73.755')] +[2023-03-11 16:00:17,369][41544] Updated weights for policy 0, policy_version 53760 (0.0005) +[2023-03-11 16:00:18,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9497.2). Total num frames: 27533312. Throughput: 0: 9544.8. Samples: 27519840. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 16:00:18,386][41256] Avg episode reward: [(0, '73.565')] +[2023-03-11 16:00:18,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000053776_27533312.pth... +[2023-03-11 16:00:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000053216_27246592.pth +[2023-03-11 16:00:21,557][41544] Updated weights for policy 0, policy_version 53840 (0.0005) +[2023-03-11 16:00:23,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9497.2). Total num frames: 27582464. Throughput: 0: 9565.5. Samples: 27578744. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 16:00:23,386][41256] Avg episode reward: [(0, '66.983')] +[2023-03-11 16:00:25,626][41544] Updated weights for policy 0, policy_version 53920 (0.0004) +[2023-03-11 16:00:28,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9511.1). Total num frames: 27631616. Throughput: 0: 9634.5. Samples: 27609032. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 16:00:28,386][41256] Avg episode reward: [(0, '73.009')] +[2023-03-11 16:00:29,776][41544] Updated weights for policy 0, policy_version 54000 (0.0005) +[2023-03-11 16:00:33,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9524.9). Total num frames: 27680768. Throughput: 0: 9709.8. Samples: 27668544. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 16:00:33,386][41256] Avg episode reward: [(0, '70.675')] +[2023-03-11 16:00:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000054064_27680768.pth... +[2023-03-11 16:00:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000053496_27389952.pth +[2023-03-11 16:00:33,919][41544] Updated weights for policy 0, policy_version 54080 (0.0004) +[2023-03-11 16:00:38,016][41544] Updated weights for policy 0, policy_version 54160 (0.0004) +[2023-03-11 16:00:38,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9625.6, 300 sec: 9538.8). Total num frames: 27729920. Throughput: 0: 9690.0. Samples: 27727960. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:00:38,386][41256] Avg episode reward: [(0, '71.386')] +[2023-03-11 16:00:42,254][41544] Updated weights for policy 0, policy_version 54240 (0.0005) +[2023-03-11 16:00:43,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9693.9, 300 sec: 9538.8). Total num frames: 27779072. Throughput: 0: 9695.8. Samples: 27756696. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:00:43,386][41256] Avg episode reward: [(0, '71.887')] +[2023-03-11 16:00:46,389][41544] Updated weights for policy 0, policy_version 54320 (0.0004) +[2023-03-11 16:00:48,385][41256] Fps is (10 sec: 9830.3, 60 sec: 9693.9, 300 sec: 9552.7). Total num frames: 27828224. Throughput: 0: 9727.2. Samples: 27816000. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:00:48,386][41256] Avg episode reward: [(0, '68.947')] +[2023-03-11 16:00:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000054352_27828224.pth... 
+[2023-03-11 16:00:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000053776_27533312.pth +[2023-03-11 16:00:50,752][41544] Updated weights for policy 0, policy_version 54400 (0.0005) +[2023-03-11 16:00:53,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9552.7). Total num frames: 27877376. Throughput: 0: 9752.1. Samples: 27873852. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:00:53,396][41256] Avg episode reward: [(0, '71.775')] +[2023-03-11 16:00:54,767][41544] Updated weights for policy 0, policy_version 54480 (0.0004) +[2023-03-11 16:00:58,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9566.6). Total num frames: 27926528. Throughput: 0: 9812.1. Samples: 27904024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:00:58,396][41256] Avg episode reward: [(0, '77.220')] +[2023-03-11 16:00:58,920][41544] Updated weights for policy 0, policy_version 54560 (0.0004) +[2023-03-11 16:01:03,102][41544] Updated weights for policy 0, policy_version 54640 (0.0005) +[2023-03-11 16:01:03,386][41256] Fps is (10 sec: 9830.3, 60 sec: 9762.1, 300 sec: 9580.5). Total num frames: 27975680. Throughput: 0: 9856.5. Samples: 27963384. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:01:03,397][41256] Avg episode reward: [(0, '82.820')] +[2023-03-11 16:01:03,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000054640_27975680.pth... +[2023-03-11 16:01:03,403][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000054064_27680768.pth +[2023-03-11 16:01:07,303][41544] Updated weights for policy 0, policy_version 54720 (0.0005) +[2023-03-11 16:01:08,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9580.5). Total num frames: 28024832. Throughput: 0: 9834.3. Samples: 28021288. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:01:08,396][41256] Avg episode reward: [(0, '83.474')] +[2023-03-11 16:01:11,464][41544] Updated weights for policy 0, policy_version 54800 (0.0004) +[2023-03-11 16:01:13,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9830.4, 300 sec: 9594.4). Total num frames: 28073984. Throughput: 0: 9835.2. Samples: 28051616. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:01:13,397][41256] Avg episode reward: [(0, '85.573')] +[2023-03-11 16:01:15,743][41544] Updated weights for policy 0, policy_version 54880 (0.0005) +[2023-03-11 16:01:18,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9608.2). Total num frames: 28123136. Throughput: 0: 9788.9. Samples: 28109044. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:01:18,397][41256] Avg episode reward: [(0, '90.651')] +[2023-03-11 16:01:18,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000054928_28123136.pth... +[2023-03-11 16:01:18,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000054352_27828224.pth +[2023-03-11 16:01:20,042][41544] Updated weights for policy 0, policy_version 54960 (0.0005) +[2023-03-11 16:01:23,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9608.2). Total num frames: 28168192. Throughput: 0: 9764.7. Samples: 28167372. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:01:23,396][41256] Avg episode reward: [(0, '82.706')] +[2023-03-11 16:01:24,217][41544] Updated weights for policy 0, policy_version 55040 (0.0005) +[2023-03-11 16:01:28,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9622.1). Total num frames: 28217344. Throughput: 0: 9756.5. Samples: 28195740. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:01:28,386][41256] Avg episode reward: [(0, '82.834')] +[2023-03-11 16:01:28,474][41544] Updated weights for policy 0, policy_version 55120 (0.0004) +[2023-03-11 16:01:32,634][41544] Updated weights for policy 0, policy_version 55200 (0.0004) +[2023-03-11 16:01:33,386][41256] Fps is (10 sec: 9830.3, 60 sec: 9762.1, 300 sec: 9636.0). Total num frames: 28266496. Throughput: 0: 9739.4. Samples: 28254272. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:01:33,396][41256] Avg episode reward: [(0, '83.943')] +[2023-03-11 16:01:33,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000055208_28266496.pth... +[2023-03-11 16:01:33,403][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000054640_27975680.pth +[2023-03-11 16:01:36,857][41544] Updated weights for policy 0, policy_version 55280 (0.0005) +[2023-03-11 16:01:38,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9649.9). Total num frames: 28315648. Throughput: 0: 9756.3. Samples: 28312888. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:01:38,386][41256] Avg episode reward: [(0, '85.246')] +[2023-03-11 16:01:40,982][41544] Updated weights for policy 0, policy_version 55360 (0.0004) +[2023-03-11 16:01:43,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9762.1, 300 sec: 9663.8). Total num frames: 28364800. Throughput: 0: 9756.6. Samples: 28343072. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:01:43,386][41256] Avg episode reward: [(0, '85.741')] +[2023-03-11 16:01:45,398][41544] Updated weights for policy 0, policy_version 55440 (0.0005) +[2023-03-11 16:01:48,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9649.9). Total num frames: 28409856. Throughput: 0: 9654.8. Samples: 28397848. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:01:48,386][41256] Avg episode reward: [(0, '83.495')] +[2023-03-11 16:01:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000055488_28409856.pth... +[2023-03-11 16:01:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000054928_28123136.pth +[2023-03-11 16:01:49,965][41544] Updated weights for policy 0, policy_version 55520 (0.0005) +[2023-03-11 16:01:53,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9625.6, 300 sec: 9649.9). Total num frames: 28454912. Throughput: 0: 9561.7. Samples: 28451564. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:01:53,386][41256] Avg episode reward: [(0, '87.017')] +[2023-03-11 16:01:54,500][41544] Updated weights for policy 0, policy_version 55600 (0.0005) +[2023-03-11 16:01:58,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9557.3, 300 sec: 9649.9). Total num frames: 28499968. Throughput: 0: 9508.5. Samples: 28479500. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:01:58,386][41256] Avg episode reward: [(0, '84.828')] +[2023-03-11 16:01:59,068][41544] Updated weights for policy 0, policy_version 55680 (0.0005) +[2023-03-11 16:02:03,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9489.1, 300 sec: 9649.9). Total num frames: 28545024. Throughput: 0: 9429.6. Samples: 28533376. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:02:03,386][41256] Avg episode reward: [(0, '83.015')] +[2023-03-11 16:02:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000055752_28545024.pth... 
+[2023-03-11 16:02:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000055208_28266496.pth +[2023-03-11 16:02:03,522][41544] Updated weights for policy 0, policy_version 55760 (0.0005) +[2023-03-11 16:02:08,117][41544] Updated weights for policy 0, policy_version 55840 (0.0005) +[2023-03-11 16:02:08,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9420.8, 300 sec: 9649.9). Total num frames: 28590080. Throughput: 0: 9340.0. Samples: 28587672. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:02:08,386][41256] Avg episode reward: [(0, '82.935')] +[2023-03-11 16:02:12,315][41544] Updated weights for policy 0, policy_version 55920 (0.0005) +[2023-03-11 16:02:13,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9663.8). Total num frames: 28639232. Throughput: 0: 9330.9. Samples: 28615632. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:02:13,386][41256] Avg episode reward: [(0, '72.252')] +[2023-03-11 16:02:16,427][41544] Updated weights for policy 0, policy_version 56000 (0.0005) +[2023-03-11 16:02:18,386][41256] Fps is (10 sec: 9830.3, 60 sec: 9420.8, 300 sec: 9663.8). Total num frames: 28688384. Throughput: 0: 9375.3. Samples: 28676160. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 16:02:18,386][41256] Avg episode reward: [(0, '71.009')] +[2023-03-11 16:02:18,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000056032_28688384.pth... +[2023-03-11 16:02:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000055488_28409856.pth +[2023-03-11 16:02:20,723][41544] Updated weights for policy 0, policy_version 56080 (0.0005) +[2023-03-11 16:02:23,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9489.1, 300 sec: 9663.8). Total num frames: 28737536. Throughput: 0: 9332.1. Samples: 28732832. 
Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 16:02:23,386][41256] Avg episode reward: [(0, '76.569')] +[2023-03-11 16:02:25,169][41544] Updated weights for policy 0, policy_version 56160 (0.0006) +[2023-03-11 16:02:28,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9649.9). Total num frames: 28782592. Throughput: 0: 9266.9. Samples: 28760084. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 16:02:28,386][41256] Avg episode reward: [(0, '74.124')] +[2023-03-11 16:02:29,681][41544] Updated weights for policy 0, policy_version 56240 (0.0005) +[2023-03-11 16:02:33,386][41256] Fps is (10 sec: 9011.1, 60 sec: 9352.5, 300 sec: 9649.9). Total num frames: 28827648. Throughput: 0: 9260.4. Samples: 28814568. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 16:02:33,386][41256] Avg episode reward: [(0, '77.106')] +[2023-03-11 16:02:33,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000056304_28827648.pth... +[2023-03-11 16:02:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000055752_28545024.pth +[2023-03-11 16:02:34,220][41544] Updated weights for policy 0, policy_version 56320 (0.0005) +[2023-03-11 16:02:38,386][41256] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9649.9). Total num frames: 28872704. Throughput: 0: 9267.9. Samples: 28868620. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 16:02:38,386][41256] Avg episode reward: [(0, '81.578')] +[2023-03-11 16:02:38,783][41544] Updated weights for policy 0, policy_version 56400 (0.0005) +[2023-03-11 16:02:43,262][41544] Updated weights for policy 0, policy_version 56480 (0.0005) +[2023-03-11 16:02:43,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9216.0, 300 sec: 9649.9). Total num frames: 28917760. Throughput: 0: 9252.3. Samples: 28895852. 
Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 16:02:43,386][41256] Avg episode reward: [(0, '84.525')] +[2023-03-11 16:02:47,675][41544] Updated weights for policy 0, policy_version 56560 (0.0005) +[2023-03-11 16:02:48,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9216.0, 300 sec: 9622.1). Total num frames: 28962816. Throughput: 0: 9281.0. Samples: 28951020. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 16:02:48,386][41256] Avg episode reward: [(0, '85.960')] +[2023-03-11 16:02:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000056568_28962816.pth... +[2023-03-11 16:02:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000056032_28688384.pth +[2023-03-11 16:02:52,079][41544] Updated weights for policy 0, policy_version 56640 (0.0005) +[2023-03-11 16:02:53,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9622.1). Total num frames: 29011968. Throughput: 0: 9318.2. Samples: 29006992. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 16:02:53,386][41256] Avg episode reward: [(0, '86.674')] +[2023-03-11 16:02:56,364][41544] Updated weights for policy 0, policy_version 56720 (0.0005) +[2023-03-11 16:02:58,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9284.3, 300 sec: 9608.2). Total num frames: 29057024. Throughput: 0: 9342.9. Samples: 29036060. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 16:02:58,386][41256] Avg episode reward: [(0, '84.034')] +[2023-03-11 16:03:00,823][41544] Updated weights for policy 0, policy_version 56800 (0.0005) +[2023-03-11 16:03:03,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9594.4). Total num frames: 29102080. Throughput: 0: 9211.1. Samples: 29090660. 
Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 16:03:03,386][41256] Avg episode reward: [(0, '81.903')] +[2023-03-11 16:03:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000056840_29102080.pth... +[2023-03-11 16:03:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000056304_28827648.pth +[2023-03-11 16:03:05,312][41544] Updated weights for policy 0, policy_version 56880 (0.0005) +[2023-03-11 16:03:08,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9580.5). Total num frames: 29147136. Throughput: 0: 9170.2. Samples: 29145492. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 16:03:08,386][41256] Avg episode reward: [(0, '75.761')] +[2023-03-11 16:03:09,851][41544] Updated weights for policy 0, policy_version 56960 (0.0005) +[2023-03-11 16:03:13,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9580.5). Total num frames: 29196288. Throughput: 0: 9161.4. Samples: 29172348. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 16:03:13,386][41256] Avg episode reward: [(0, '71.423')] +[2023-03-11 16:03:14,216][41544] Updated weights for policy 0, policy_version 57040 (0.0005) +[2023-03-11 16:03:18,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9566.6). Total num frames: 29241344. Throughput: 0: 9195.1. Samples: 29228348. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 16:03:18,386][41256] Avg episode reward: [(0, '72.997')] +[2023-03-11 16:03:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000057112_29241344.pth... 
+[2023-03-11 16:03:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000056568_28962816.pth +[2023-03-11 16:03:18,719][41544] Updated weights for policy 0, policy_version 57120 (0.0005) +[2023-03-11 16:03:23,177][41544] Updated weights for policy 0, policy_version 57200 (0.0005) +[2023-03-11 16:03:23,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9147.7, 300 sec: 9566.6). Total num frames: 29286400. Throughput: 0: 9215.7. Samples: 29283324. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 16:03:23,386][41256] Avg episode reward: [(0, '75.207')] +[2023-03-11 16:03:27,547][41544] Updated weights for policy 0, policy_version 57280 (0.0005) +[2023-03-11 16:03:28,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9147.7, 300 sec: 9566.6). Total num frames: 29331456. Throughput: 0: 9229.3. Samples: 29311168. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 16:03:28,386][41256] Avg episode reward: [(0, '75.116')] +[2023-03-11 16:03:31,983][41544] Updated weights for policy 0, policy_version 57360 (0.0005) +[2023-03-11 16:03:33,385][41256] Fps is (10 sec: 9420.7, 60 sec: 9216.0, 300 sec: 9566.6). Total num frames: 29380608. Throughput: 0: 9249.2. Samples: 29367236. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 16:03:33,386][41256] Avg episode reward: [(0, '81.192')] +[2023-03-11 16:03:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000057384_29380608.pth... +[2023-03-11 16:03:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000056840_29102080.pth +[2023-03-11 16:03:36,467][41544] Updated weights for policy 0, policy_version 57440 (0.0005) +[2023-03-11 16:03:38,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9566.6). Total num frames: 29425664. Throughput: 0: 9218.0. Samples: 29421800. 
Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 16:03:38,386][41256] Avg episode reward: [(0, '75.035')] +[2023-03-11 16:03:40,861][41544] Updated weights for policy 0, policy_version 57520 (0.0005) +[2023-03-11 16:03:43,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9216.0, 300 sec: 9552.7). Total num frames: 29470720. Throughput: 0: 9204.3. Samples: 29450252. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 16:03:43,386][41256] Avg episode reward: [(0, '78.069')] +[2023-03-11 16:03:45,349][41544] Updated weights for policy 0, policy_version 57600 (0.0005) +[2023-03-11 16:03:48,386][41256] Fps is (10 sec: 9011.1, 60 sec: 9216.0, 300 sec: 9524.9). Total num frames: 29515776. Throughput: 0: 9200.3. Samples: 29504676. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 16:03:48,386][41256] Avg episode reward: [(0, '78.745')] +[2023-03-11 16:03:48,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000057648_29515776.pth... +[2023-03-11 16:03:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000057112_29241344.pth +[2023-03-11 16:03:49,651][41544] Updated weights for policy 0, policy_version 57680 (0.0004) +[2023-03-11 16:03:53,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9216.0, 300 sec: 9524.9). Total num frames: 29564928. Throughput: 0: 9298.5. Samples: 29563924. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 16:03:53,386][41256] Avg episode reward: [(0, '82.276')] +[2023-03-11 16:03:53,822][41544] Updated weights for policy 0, policy_version 57760 (0.0004) +[2023-03-11 16:03:57,915][41544] Updated weights for policy 0, policy_version 57840 (0.0004) +[2023-03-11 16:03:58,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9352.5, 300 sec: 9538.8). Total num frames: 29618176. Throughput: 0: 9362.6. Samples: 29593664. 
Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 16:03:58,386][41256] Avg episode reward: [(0, '78.104')] +[2023-03-11 16:04:02,144][41544] Updated weights for policy 0, policy_version 57920 (0.0005) +[2023-03-11 16:04:03,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9352.5, 300 sec: 9511.0). Total num frames: 29663232. Throughput: 0: 9408.6. Samples: 29651736. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:04:03,386][41256] Avg episode reward: [(0, '79.910')] +[2023-03-11 16:04:03,404][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000057944_29667328.pth... +[2023-03-11 16:04:03,405][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000057384_29380608.pth +[2023-03-11 16:04:06,294][41544] Updated weights for policy 0, policy_version 58000 (0.0005) +[2023-03-11 16:04:08,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9524.9). Total num frames: 29712384. Throughput: 0: 9507.8. Samples: 29711176. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:04:08,386][41256] Avg episode reward: [(0, '78.190')] +[2023-03-11 16:04:10,752][41544] Updated weights for policy 0, policy_version 58080 (0.0005) +[2023-03-11 16:04:13,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9497.2). Total num frames: 29757440. Throughput: 0: 9473.7. Samples: 29737484. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:04:13,386][41256] Avg episode reward: [(0, '79.962')] +[2023-03-11 16:04:15,145][41544] Updated weights for policy 0, policy_version 58160 (0.0006) +[2023-03-11 16:04:18,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9497.2). Total num frames: 29806592. Throughput: 0: 9486.8. Samples: 29794144. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:04:18,386][41256] Avg episode reward: [(0, '78.467')] +[2023-03-11 16:04:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000058216_29806592.pth... +[2023-03-11 16:04:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000057648_29515776.pth +[2023-03-11 16:04:19,548][41544] Updated weights for policy 0, policy_version 58240 (0.0005) +[2023-03-11 16:04:23,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9489.1, 300 sec: 9497.2). Total num frames: 29855744. Throughput: 0: 9541.8. Samples: 29851180. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:04:23,386][41256] Avg episode reward: [(0, '85.982')] +[2023-03-11 16:04:23,738][41544] Updated weights for policy 0, policy_version 58320 (0.0005) +[2023-03-11 16:04:28,011][41544] Updated weights for policy 0, policy_version 58400 (0.0005) +[2023-03-11 16:04:28,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9489.1, 300 sec: 9483.3). Total num frames: 29900800. Throughput: 0: 9554.3. Samples: 29880196. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:04:28,386][41256] Avg episode reward: [(0, '83.463')] +[2023-03-11 16:04:32,469][41544] Updated weights for policy 0, policy_version 58480 (0.0005) +[2023-03-11 16:04:33,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9489.1, 300 sec: 9483.3). Total num frames: 29949952. Throughput: 0: 9591.9. Samples: 29936312. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:04:33,386][41256] Avg episode reward: [(0, '75.729')] +[2023-03-11 16:04:33,388][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000058496_29949952.pth... 
+[2023-03-11 16:04:33,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000057944_29667328.pth +[2023-03-11 16:04:36,794][41544] Updated weights for policy 0, policy_version 58560 (0.0005) +[2023-03-11 16:04:38,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9483.3). Total num frames: 29995008. Throughput: 0: 9527.1. Samples: 29992644. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:04:38,386][41256] Avg episode reward: [(0, '75.642')] +[2023-03-11 16:04:41,231][41544] Updated weights for policy 0, policy_version 58640 (0.0005) +[2023-03-11 16:04:43,386][41256] Fps is (10 sec: 9011.2, 60 sec: 9489.1, 300 sec: 9469.4). Total num frames: 30040064. Throughput: 0: 9476.0. Samples: 30020084. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:04:43,386][41256] Avg episode reward: [(0, '78.263')] +[2023-03-11 16:04:45,666][41544] Updated weights for policy 0, policy_version 58720 (0.0005) +[2023-03-11 16:04:48,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9557.3, 300 sec: 9469.4). Total num frames: 30089216. Throughput: 0: 9426.4. Samples: 30075924. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:04:48,386][41256] Avg episode reward: [(0, '74.173')] +[2023-03-11 16:04:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000058768_30089216.pth... +[2023-03-11 16:04:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000058216_29806592.pth +[2023-03-11 16:04:49,891][41544] Updated weights for policy 0, policy_version 58800 (0.0005) +[2023-03-11 16:04:53,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9469.4). Total num frames: 30138368. Throughput: 0: 9393.5. Samples: 30133884. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:04:53,386][41256] Avg episode reward: [(0, '72.636')] +[2023-03-11 16:04:54,160][41544] Updated weights for policy 0, policy_version 58880 (0.0005) +[2023-03-11 16:04:58,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9420.8, 300 sec: 9469.4). Total num frames: 30183424. Throughput: 0: 9454.9. Samples: 30162956. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:04:58,386][41256] Avg episode reward: [(0, '80.111')] +[2023-03-11 16:04:58,460][41544] Updated weights for policy 0, policy_version 58960 (0.0005) +[2023-03-11 16:05:02,747][41544] Updated weights for policy 0, policy_version 59040 (0.0005) +[2023-03-11 16:05:03,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9469.4). Total num frames: 30232576. Throughput: 0: 9465.2. Samples: 30220076. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:05:03,386][41256] Avg episode reward: [(0, '79.656')] +[2023-03-11 16:05:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000059048_30232576.pth... +[2023-03-11 16:05:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000058496_29949952.pth +[2023-03-11 16:05:07,193][41544] Updated weights for policy 0, policy_version 59120 (0.0005) +[2023-03-11 16:05:08,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9469.4). Total num frames: 30277632. Throughput: 0: 9426.6. Samples: 30275376. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:05:08,386][41256] Avg episode reward: [(0, '78.175')] +[2023-03-11 16:05:11,727][41544] Updated weights for policy 0, policy_version 59200 (0.0005) +[2023-03-11 16:05:13,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9420.8, 300 sec: 9455.5). Total num frames: 30322688. Throughput: 0: 9377.2. Samples: 30302168. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:05:13,386][41256] Avg episode reward: [(0, '79.080')] +[2023-03-11 16:05:16,194][41544] Updated weights for policy 0, policy_version 59280 (0.0005) +[2023-03-11 16:05:18,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9441.6). Total num frames: 30367744. Throughput: 0: 9352.7. Samples: 30357184. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 16:05:18,386][41256] Avg episode reward: [(0, '81.857')] +[2023-03-11 16:05:18,453][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000059320_30371840.pth... +[2023-03-11 16:05:18,455][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000058768_30089216.pth +[2023-03-11 16:05:20,767][41544] Updated weights for policy 0, policy_version 59360 (0.0005) +[2023-03-11 16:05:23,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9427.7). Total num frames: 30412800. Throughput: 0: 9295.6. Samples: 30410944. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 16:05:23,386][41256] Avg episode reward: [(0, '81.342')] +[2023-03-11 16:05:25,249][41544] Updated weights for policy 0, policy_version 59440 (0.0005) +[2023-03-11 16:05:28,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9427.7). Total num frames: 30461952. Throughput: 0: 9298.4. Samples: 30438512. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 16:05:28,386][41256] Avg episode reward: [(0, '66.324')] +[2023-03-11 16:05:29,547][41544] Updated weights for policy 0, policy_version 59520 (0.0004) +[2023-03-11 16:05:33,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9352.5, 300 sec: 9427.7). Total num frames: 30511104. Throughput: 0: 9366.2. Samples: 30497404. 
Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 16:05:33,386][41256] Avg episode reward: [(0, '75.468')] +[2023-03-11 16:05:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000059592_30511104.pth... +[2023-03-11 16:05:33,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000059048_30232576.pth +[2023-03-11 16:05:33,747][41544] Updated weights for policy 0, policy_version 59600 (0.0003) +[2023-03-11 16:05:38,117][41544] Updated weights for policy 0, policy_version 59680 (0.0004) +[2023-03-11 16:05:38,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9413.9). Total num frames: 30556160. Throughput: 0: 9316.6. Samples: 30553132. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 16:05:38,386][41256] Avg episode reward: [(0, '78.411')] +[2023-03-11 16:05:42,441][41544] Updated weights for policy 0, policy_version 59760 (0.0003) +[2023-03-11 16:05:43,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9413.9). Total num frames: 30605312. Throughput: 0: 9314.9. Samples: 30582128. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 16:05:43,386][41256] Avg episode reward: [(0, '78.315')] +[2023-03-11 16:05:46,877][41544] Updated weights for policy 0, policy_version 59840 (0.0004) +[2023-03-11 16:05:48,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9400.0). Total num frames: 30650368. Throughput: 0: 9289.0. Samples: 30638080. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 16:05:48,386][41256] Avg episode reward: [(0, '79.776')] +[2023-03-11 16:05:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000059864_30650368.pth... 
+[2023-03-11 16:05:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000059320_30371840.pth +[2023-03-11 16:05:51,452][41544] Updated weights for policy 0, policy_version 59920 (0.0005) +[2023-03-11 16:05:53,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9386.1). Total num frames: 30695424. Throughput: 0: 9243.4. Samples: 30691328. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 16:05:53,386][41256] Avg episode reward: [(0, '75.907')] +[2023-03-11 16:05:56,044][41544] Updated weights for policy 0, policy_version 60000 (0.0005) +[2023-03-11 16:05:58,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9372.2). Total num frames: 30740480. Throughput: 0: 9247.1. Samples: 30718288. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 16:05:58,386][41256] Avg episode reward: [(0, '72.400')] +[2023-03-11 16:06:00,427][41544] Updated weights for policy 0, policy_version 60080 (0.0005) +[2023-03-11 16:06:03,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9216.0, 300 sec: 9358.3). Total num frames: 30785536. Throughput: 0: 9267.2. Samples: 30774208. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 16:06:03,386][41256] Avg episode reward: [(0, '77.654')] +[2023-03-11 16:06:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000060128_30785536.pth... +[2023-03-11 16:06:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000059592_30511104.pth +[2023-03-11 16:06:04,804][41544] Updated weights for policy 0, policy_version 60160 (0.0005) +[2023-03-11 16:06:08,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9216.0, 300 sec: 9344.4). Total num frames: 30830592. Throughput: 0: 9306.8. Samples: 30829748. 
Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 16:06:08,386][41256] Avg episode reward: [(0, '74.617')] +[2023-03-11 16:06:09,239][41544] Updated weights for policy 0, policy_version 60240 (0.0005) +[2023-03-11 16:06:13,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9284.3, 300 sec: 9344.4). Total num frames: 30879744. Throughput: 0: 9339.4. Samples: 30858784. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 16:06:13,396][41256] Avg episode reward: [(0, '77.351')] +[2023-03-11 16:06:13,487][41544] Updated weights for policy 0, policy_version 60320 (0.0005) +[2023-03-11 16:06:17,855][41544] Updated weights for policy 0, policy_version 60400 (0.0005) +[2023-03-11 16:06:18,386][41256] Fps is (10 sec: 9830.3, 60 sec: 9352.5, 300 sec: 9358.3). Total num frames: 30928896. Throughput: 0: 9288.7. Samples: 30915396. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 16:06:18,386][41256] Avg episode reward: [(0, '81.274')] +[2023-03-11 16:06:18,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000060408_30928896.pth... +[2023-03-11 16:06:18,393][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000059864_30650368.pth +[2023-03-11 16:06:22,114][41544] Updated weights for policy 0, policy_version 60480 (0.0005) +[2023-03-11 16:06:23,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9420.8, 300 sec: 9358.3). Total num frames: 30978048. Throughput: 0: 9326.9. Samples: 30972844. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 16:06:23,386][41256] Avg episode reward: [(0, '78.068')] +[2023-03-11 16:06:26,282][41544] Updated weights for policy 0, policy_version 60560 (0.0005) +[2023-03-11 16:06:28,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9420.8, 300 sec: 9358.3). Total num frames: 31027200. Throughput: 0: 9340.5. Samples: 31002452. 
Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 16:06:28,396][41256] Avg episode reward: [(0, '78.352')] +[2023-03-11 16:06:30,433][41544] Updated weights for policy 0, policy_version 60640 (0.0005) +[2023-03-11 16:06:33,386][41256] Fps is (10 sec: 9830.3, 60 sec: 9420.8, 300 sec: 9358.3). Total num frames: 31076352. Throughput: 0: 9412.7. Samples: 31061652. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 16:06:33,396][41256] Avg episode reward: [(0, '82.244')] +[2023-03-11 16:06:33,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000060696_31076352.pth... +[2023-03-11 16:06:33,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000060128_30785536.pth +[2023-03-11 16:06:34,574][41544] Updated weights for policy 0, policy_version 60720 (0.0005) +[2023-03-11 16:06:38,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9344.4). Total num frames: 31121408. Throughput: 0: 9481.1. Samples: 31117976. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 16:06:38,396][41256] Avg episode reward: [(0, '84.312')] +[2023-03-11 16:06:39,139][41544] Updated weights for policy 0, policy_version 60800 (0.0005) +[2023-03-11 16:06:43,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9344.4). Total num frames: 31166464. Throughput: 0: 9490.6. Samples: 31145364. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 16:06:43,396][41256] Avg episode reward: [(0, '80.353')] +[2023-03-11 16:06:43,616][41544] Updated weights for policy 0, policy_version 60880 (0.0005) +[2023-03-11 16:06:48,016][41544] Updated weights for policy 0, policy_version 60960 (0.0005) +[2023-03-11 16:06:48,386][41256] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9344.4). Total num frames: 31211520. Throughput: 0: 9480.5. Samples: 31200832. 
Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 16:06:48,397][41256] Avg episode reward: [(0, '82.167')] +[2023-03-11 16:06:48,399][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000060960_31211520.pth... +[2023-03-11 16:06:48,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000060408_30928896.pth +[2023-03-11 16:06:52,528][41544] Updated weights for policy 0, policy_version 61040 (0.0006) +[2023-03-11 16:06:53,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9344.4). Total num frames: 31256576. Throughput: 0: 9465.9. Samples: 31255716. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 16:06:53,396][41256] Avg episode reward: [(0, '83.184')] +[2023-03-11 16:06:57,083][41544] Updated weights for policy 0, policy_version 61120 (0.0006) +[2023-03-11 16:06:58,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9352.5, 300 sec: 9344.4). Total num frames: 31301632. Throughput: 0: 9403.4. Samples: 31281936. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 16:06:58,396][41256] Avg episode reward: [(0, '77.076')] +[2023-03-11 16:07:01,627][41544] Updated weights for policy 0, policy_version 61200 (0.0006) +[2023-03-11 16:07:03,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9358.3). Total num frames: 31350784. Throughput: 0: 9354.1. Samples: 31336332. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 16:07:03,386][41256] Avg episode reward: [(0, '78.894')] +[2023-03-11 16:07:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000061232_31350784.pth... 
+[2023-03-11 16:07:03,390][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000060696_31076352.pth +[2023-03-11 16:07:05,992][41544] Updated weights for policy 0, policy_version 61280 (0.0006) +[2023-03-11 16:07:08,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9344.4). Total num frames: 31395840. Throughput: 0: 9315.2. Samples: 31392028. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 16:07:08,396][41256] Avg episode reward: [(0, '79.967')] +[2023-03-11 16:07:10,536][41544] Updated weights for policy 0, policy_version 61360 (0.0006) +[2023-03-11 16:07:13,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9330.6). Total num frames: 31440896. Throughput: 0: 9266.0. Samples: 31419420. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 16:07:13,396][41256] Avg episode reward: [(0, '82.906')] +[2023-03-11 16:07:14,925][41544] Updated weights for policy 0, policy_version 61440 (0.0005) +[2023-03-11 16:07:18,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9330.5). Total num frames: 31490048. Throughput: 0: 9200.3. Samples: 31475664. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 16:07:18,396][41256] Avg episode reward: [(0, '80.658')] +[2023-03-11 16:07:18,399][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000061504_31490048.pth... +[2023-03-11 16:07:18,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000060960_31211520.pth +[2023-03-11 16:07:19,161][41544] Updated weights for policy 0, policy_version 61520 (0.0005) +[2023-03-11 16:07:23,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9330.5). Total num frames: 31535104. Throughput: 0: 9225.9. Samples: 31533140. 
Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 16:07:23,386][41256] Avg episode reward: [(0, '82.248')] +[2023-03-11 16:07:23,463][41544] Updated weights for policy 0, policy_version 61600 (0.0005) +[2023-03-11 16:07:27,725][41544] Updated weights for policy 0, policy_version 61680 (0.0005) +[2023-03-11 16:07:28,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9344.4). Total num frames: 31584256. Throughput: 0: 9264.6. Samples: 31562272. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 16:07:28,386][41256] Avg episode reward: [(0, '81.908')] +[2023-03-11 16:07:32,009][41544] Updated weights for policy 0, policy_version 61760 (0.0005) +[2023-03-11 16:07:33,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9284.3, 300 sec: 9358.3). Total num frames: 31633408. Throughput: 0: 9302.7. Samples: 31619452. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 16:07:33,386][41256] Avg episode reward: [(0, '75.598')] +[2023-03-11 16:07:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000061784_31633408.pth... +[2023-03-11 16:07:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000061232_31350784.pth +[2023-03-11 16:07:36,070][41544] Updated weights for policy 0, policy_version 61840 (0.0004) +[2023-03-11 16:07:38,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9352.5, 300 sec: 9372.2). Total num frames: 31682560. Throughput: 0: 9415.9. Samples: 31679432. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 16:07:38,386][41256] Avg episode reward: [(0, '82.968')] +[2023-03-11 16:07:40,221][41544] Updated weights for policy 0, policy_version 61920 (0.0005) +[2023-03-11 16:07:43,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9420.8, 300 sec: 9386.1). Total num frames: 31731712. Throughput: 0: 9483.6. Samples: 31708700. 
Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 16:07:43,386][41256] Avg episode reward: [(0, '87.254')] +[2023-03-11 16:07:44,499][41544] Updated weights for policy 0, policy_version 62000 (0.0005) +[2023-03-11 16:07:48,386][41256] Fps is (10 sec: 9830.3, 60 sec: 9489.1, 300 sec: 9386.1). Total num frames: 31780864. Throughput: 0: 9585.6. Samples: 31767684. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:07:48,397][41256] Avg episode reward: [(0, '83.689')] +[2023-03-11 16:07:48,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000062072_31780864.pth... +[2023-03-11 16:07:48,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000061504_31490048.pth +[2023-03-11 16:07:48,700][41544] Updated weights for policy 0, policy_version 62080 (0.0005) +[2023-03-11 16:07:53,231][41544] Updated weights for policy 0, policy_version 62160 (0.0006) +[2023-03-11 16:07:53,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9386.1). Total num frames: 31825920. Throughput: 0: 9556.3. Samples: 31822060. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:07:53,397][41256] Avg episode reward: [(0, '88.251')] +[2023-03-11 16:07:57,514][41544] Updated weights for policy 0, policy_version 62240 (0.0004) +[2023-03-11 16:07:58,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9489.1, 300 sec: 9386.1). Total num frames: 31870976. Throughput: 0: 9579.5. Samples: 31850496. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:07:58,396][41256] Avg episode reward: [(0, '77.920')] +[2023-03-11 16:08:02,017][41544] Updated weights for policy 0, policy_version 62320 (0.0005) +[2023-03-11 16:08:03,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9400.0). Total num frames: 31920128. Throughput: 0: 9579.1. Samples: 31906724. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:08:03,396][41256] Avg episode reward: [(0, '72.943')] +[2023-03-11 16:08:03,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000062344_31920128.pth... +[2023-03-11 16:08:03,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000061784_31633408.pth +[2023-03-11 16:08:06,553][41544] Updated weights for policy 0, policy_version 62400 (0.0005) +[2023-03-11 16:08:08,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9386.1). Total num frames: 31965184. Throughput: 0: 9511.4. Samples: 31961152. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:08:08,396][41256] Avg episode reward: [(0, '85.497')] +[2023-03-11 16:08:10,853][41544] Updated weights for policy 0, policy_version 62480 (0.0005) +[2023-03-11 16:08:13,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9489.1, 300 sec: 9386.1). Total num frames: 32010240. Throughput: 0: 9500.1. Samples: 31989776. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:08:13,386][41256] Avg episode reward: [(0, '86.655')] +[2023-03-11 16:08:15,399][41544] Updated weights for policy 0, policy_version 62560 (0.0005) +[2023-03-11 16:08:18,386][41256] Fps is (10 sec: 9011.2, 60 sec: 9420.8, 300 sec: 9386.1). Total num frames: 32055296. Throughput: 0: 9416.7. Samples: 32043204. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:08:18,396][41256] Avg episode reward: [(0, '82.680')] +[2023-03-11 16:08:18,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000062608_32055296.pth... 
+[2023-03-11 16:08:18,403][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000062072_31780864.pth +[2023-03-11 16:08:19,884][41544] Updated weights for policy 0, policy_version 62640 (0.0005) +[2023-03-11 16:08:23,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9420.8, 300 sec: 9386.1). Total num frames: 32100352. Throughput: 0: 9324.0. Samples: 32099012. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:08:23,396][41256] Avg episode reward: [(0, '84.114')] +[2023-03-11 16:08:24,292][41544] Updated weights for policy 0, policy_version 62720 (0.0004) +[2023-03-11 16:08:28,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9386.1). Total num frames: 32149504. Throughput: 0: 9296.0. Samples: 32127020. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:08:28,396][41256] Avg episode reward: [(0, '88.722')] +[2023-03-11 16:08:28,696][41544] Updated weights for policy 0, policy_version 62800 (0.0003) +[2023-03-11 16:08:33,012][41544] Updated weights for policy 0, policy_version 62880 (0.0004) +[2023-03-11 16:08:33,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9352.5, 300 sec: 9386.1). Total num frames: 32194560. Throughput: 0: 9222.2. Samples: 32182684. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:08:33,397][41256] Avg episode reward: [(0, '85.283')] +[2023-03-11 16:08:33,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000062880_32194560.pth... +[2023-03-11 16:08:33,401][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000062344_31920128.pth +[2023-03-11 16:08:37,599][41544] Updated weights for policy 0, policy_version 62960 (0.0003) +[2023-03-11 16:08:38,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9386.1). Total num frames: 32239616. Throughput: 0: 9240.6. Samples: 32237888. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:08:38,386][41256] Avg episode reward: [(0, '87.556')] +[2023-03-11 16:08:42,123][41544] Updated weights for policy 0, policy_version 63040 (0.0003) +[2023-03-11 16:08:43,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9216.0, 300 sec: 9386.1). Total num frames: 32284672. Throughput: 0: 9203.0. Samples: 32264628. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:08:43,386][41256] Avg episode reward: [(0, '77.984')] +[2023-03-11 16:08:46,628][41544] Updated weights for policy 0, policy_version 63120 (0.0005) +[2023-03-11 16:08:48,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9147.8, 300 sec: 9372.2). Total num frames: 32329728. Throughput: 0: 9172.0. Samples: 32319464. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:08:48,386][41256] Avg episode reward: [(0, '81.949')] +[2023-03-11 16:08:48,452][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000063152_32333824.pth... +[2023-03-11 16:08:48,454][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000062608_32055296.pth +[2023-03-11 16:08:51,127][41544] Updated weights for policy 0, policy_version 63200 (0.0005) +[2023-03-11 16:08:53,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9358.3). Total num frames: 32378880. Throughput: 0: 9190.2. Samples: 32374712. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:08:53,386][41256] Avg episode reward: [(0, '81.447')] +[2023-03-11 16:08:55,589][41544] Updated weights for policy 0, policy_version 63280 (0.0005) +[2023-03-11 16:08:58,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9358.3). Total num frames: 32423936. Throughput: 0: 9154.7. Samples: 32401736. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:08:58,386][41256] Avg episode reward: [(0, '80.640')] +[2023-03-11 16:09:00,207][41544] Updated weights for policy 0, policy_version 63360 (0.0005) +[2023-03-11 16:09:03,386][41256] Fps is (10 sec: 8601.5, 60 sec: 9079.5, 300 sec: 9330.5). Total num frames: 32464896. Throughput: 0: 9147.9. Samples: 32454860. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:09:03,386][41256] Avg episode reward: [(0, '78.193')] +[2023-03-11 16:09:03,449][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000063416_32468992.pth... +[2023-03-11 16:09:03,450][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000062880_32194560.pth +[2023-03-11 16:09:04,832][41544] Updated weights for policy 0, policy_version 63440 (0.0005) +[2023-03-11 16:09:08,385][41256] Fps is (10 sec: 8601.6, 60 sec: 9079.5, 300 sec: 9330.6). Total num frames: 32509952. Throughput: 0: 9088.9. Samples: 32508012. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:09:08,386][41256] Avg episode reward: [(0, '80.050')] +[2023-03-11 16:09:09,387][41544] Updated weights for policy 0, policy_version 63520 (0.0005) +[2023-03-11 16:09:13,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9147.7, 300 sec: 9330.6). Total num frames: 32559104. Throughput: 0: 9079.1. Samples: 32535580. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:09:13,386][41256] Avg episode reward: [(0, '81.378')] +[2023-03-11 16:09:13,796][41544] Updated weights for policy 0, policy_version 63600 (0.0005) +[2023-03-11 16:09:18,253][41544] Updated weights for policy 0, policy_version 63680 (0.0005) +[2023-03-11 16:09:18,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9147.7, 300 sec: 9316.7). Total num frames: 32604160. Throughput: 0: 9090.8. Samples: 32591768. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:09:18,386][41256] Avg episode reward: [(0, '78.750')] +[2023-03-11 16:09:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000063680_32604160.pth... +[2023-03-11 16:09:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000063152_32333824.pth +[2023-03-11 16:09:22,721][41544] Updated weights for policy 0, policy_version 63760 (0.0005) +[2023-03-11 16:09:23,385][41256] Fps is (10 sec: 9011.1, 60 sec: 9147.7, 300 sec: 9316.7). Total num frames: 32649216. Throughput: 0: 9071.5. Samples: 32646104. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:09:23,386][41256] Avg episode reward: [(0, '77.573')] +[2023-03-11 16:09:27,203][41544] Updated weights for policy 0, policy_version 63840 (0.0005) +[2023-03-11 16:09:28,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9079.5, 300 sec: 9302.8). Total num frames: 32694272. Throughput: 0: 9086.7. Samples: 32673532. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:09:28,386][41256] Avg episode reward: [(0, '85.647')] +[2023-03-11 16:09:31,611][41544] Updated weights for policy 0, policy_version 63920 (0.0005) +[2023-03-11 16:09:33,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9079.5, 300 sec: 9302.8). Total num frames: 32739328. Throughput: 0: 9103.3. Samples: 32729112. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:09:33,386][41256] Avg episode reward: [(0, '84.633')] +[2023-03-11 16:09:33,435][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000063952_32743424.pth... 
+[2023-03-11 16:09:33,436][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000063416_32468992.pth +[2023-03-11 16:09:36,065][41544] Updated weights for policy 0, policy_version 64000 (0.0005) +[2023-03-11 16:09:38,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9147.7, 300 sec: 9316.7). Total num frames: 32788480. Throughput: 0: 9105.4. Samples: 32784456. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:09:38,386][41256] Avg episode reward: [(0, '81.596')] +[2023-03-11 16:09:40,533][41544] Updated weights for policy 0, policy_version 64080 (0.0005) +[2023-03-11 16:09:43,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9147.7, 300 sec: 9302.8). Total num frames: 32833536. Throughput: 0: 9115.6. Samples: 32811940. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:09:43,386][41256] Avg episode reward: [(0, '81.958')] +[2023-03-11 16:09:45,052][41544] Updated weights for policy 0, policy_version 64160 (0.0005) +[2023-03-11 16:09:48,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9147.7, 300 sec: 9288.9). Total num frames: 32878592. Throughput: 0: 9143.2. Samples: 32866304. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:09:48,386][41256] Avg episode reward: [(0, '80.637')] +[2023-03-11 16:09:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000064216_32878592.pth... +[2023-03-11 16:09:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000063680_32604160.pth +[2023-03-11 16:09:49,556][41544] Updated weights for policy 0, policy_version 64240 (0.0005) +[2023-03-11 16:09:53,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9079.5, 300 sec: 9288.9). Total num frames: 32923648. Throughput: 0: 9178.8. Samples: 32921060. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:09:53,386][41256] Avg episode reward: [(0, '80.453')] +[2023-03-11 16:09:53,983][41544] Updated weights for policy 0, policy_version 64320 (0.0005) +[2023-03-11 16:09:58,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9079.5, 300 sec: 9275.0). Total num frames: 32968704. Throughput: 0: 9171.3. Samples: 32948288. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:09:58,386][41256] Avg episode reward: [(0, '83.816')] +[2023-03-11 16:09:58,417][41544] Updated weights for policy 0, policy_version 64400 (0.0005) +[2023-03-11 16:10:02,864][41544] Updated weights for policy 0, policy_version 64480 (0.0005) +[2023-03-11 16:10:03,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9288.9). Total num frames: 33017856. Throughput: 0: 9171.8. Samples: 33004500. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:10:03,386][41256] Avg episode reward: [(0, '81.490')] +[2023-03-11 16:10:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000064488_33017856.pth... +[2023-03-11 16:10:03,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000063952_32743424.pth +[2023-03-11 16:10:07,111][41544] Updated weights for policy 0, policy_version 64560 (0.0005) +[2023-03-11 16:10:08,385][41256] Fps is (10 sec: 9830.3, 60 sec: 9284.3, 300 sec: 9302.8). Total num frames: 33067008. Throughput: 0: 9240.2. Samples: 33061912. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:10:08,386][41256] Avg episode reward: [(0, '80.653')] +[2023-03-11 16:10:11,272][41544] Updated weights for policy 0, policy_version 64640 (0.0005) +[2023-03-11 16:10:13,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9216.0, 300 sec: 9302.8). Total num frames: 33112064. Throughput: 0: 9286.1. Samples: 33091408. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:10:13,386][41256] Avg episode reward: [(0, '83.638')] +[2023-03-11 16:10:15,485][41544] Updated weights for policy 0, policy_version 64720 (0.0005) +[2023-03-11 16:10:18,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9352.5, 300 sec: 9330.5). Total num frames: 33165312. Throughput: 0: 9350.2. Samples: 33149872. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:10:18,386][41256] Avg episode reward: [(0, '80.990')] +[2023-03-11 16:10:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000064776_33165312.pth... +[2023-03-11 16:10:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000064216_32878592.pth +[2023-03-11 16:10:19,574][41544] Updated weights for policy 0, policy_version 64800 (0.0004) +[2023-03-11 16:10:23,385][41256] Fps is (10 sec: 10239.9, 60 sec: 9420.8, 300 sec: 9330.5). Total num frames: 33214464. Throughput: 0: 9471.0. Samples: 33210652. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:10:23,386][41256] Avg episode reward: [(0, '81.462')] +[2023-03-11 16:10:23,614][41544] Updated weights for policy 0, policy_version 64880 (0.0004) +[2023-03-11 16:10:27,841][41544] Updated weights for policy 0, policy_version 64960 (0.0005) +[2023-03-11 16:10:28,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9489.1, 300 sec: 9330.5). Total num frames: 33263616. Throughput: 0: 9508.2. Samples: 33239808. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:10:28,386][41256] Avg episode reward: [(0, '78.920')] +[2023-03-11 16:10:32,035][41544] Updated weights for policy 0, policy_version 65040 (0.0005) +[2023-03-11 16:10:33,386][41256] Fps is (10 sec: 9830.3, 60 sec: 9557.3, 300 sec: 9344.4). Total num frames: 33312768. Throughput: 0: 9617.1. Samples: 33299072. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:10:33,386][41256] Avg episode reward: [(0, '81.365')] +[2023-03-11 16:10:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000065064_33312768.pth... +[2023-03-11 16:10:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000064488_33017856.pth +[2023-03-11 16:10:36,195][41544] Updated weights for policy 0, policy_version 65120 (0.0004) +[2023-03-11 16:10:38,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9344.4). Total num frames: 33361920. Throughput: 0: 9707.3. Samples: 33357888. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:10:38,386][41256] Avg episode reward: [(0, '79.581')] +[2023-03-11 16:10:40,271][41544] Updated weights for policy 0, policy_version 65200 (0.0004) +[2023-03-11 16:10:43,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9358.3). Total num frames: 33411072. Throughput: 0: 9764.2. Samples: 33387676. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:10:43,386][41256] Avg episode reward: [(0, '78.969')] +[2023-03-11 16:10:44,329][41544] Updated weights for policy 0, policy_version 65280 (0.0004) +[2023-03-11 16:10:48,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9372.2). Total num frames: 33460224. Throughput: 0: 9854.4. Samples: 33447948. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:10:48,396][41256] Avg episode reward: [(0, '78.959')] +[2023-03-11 16:10:48,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000065352_33460224.pth... 
+[2023-03-11 16:10:48,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000064776_33165312.pth +[2023-03-11 16:10:48,639][41544] Updated weights for policy 0, policy_version 65360 (0.0005) +[2023-03-11 16:10:53,072][41544] Updated weights for policy 0, policy_version 65440 (0.0005) +[2023-03-11 16:10:53,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9372.2). Total num frames: 33505280. Throughput: 0: 9811.9. Samples: 33503448. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:10:53,396][41256] Avg episode reward: [(0, '76.346')] +[2023-03-11 16:10:57,495][41544] Updated weights for policy 0, policy_version 65520 (0.0005) +[2023-03-11 16:10:58,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9762.1, 300 sec: 9386.1). Total num frames: 33554432. Throughput: 0: 9768.4. Samples: 33530988. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:10:58,386][41256] Avg episode reward: [(0, '78.543')] +[2023-03-11 16:11:01,943][41544] Updated weights for policy 0, policy_version 65600 (0.0005) +[2023-03-11 16:11:03,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9386.1). Total num frames: 33599488. Throughput: 0: 9702.1. Samples: 33586468. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:11:03,386][41256] Avg episode reward: [(0, '81.824')] +[2023-03-11 16:11:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000065624_33599488.pth... +[2023-03-11 16:11:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000065064_33312768.pth +[2023-03-11 16:11:06,373][41544] Updated weights for policy 0, policy_version 65680 (0.0005) +[2023-03-11 16:11:08,385][41256] Fps is (10 sec: 9011.1, 60 sec: 9625.6, 300 sec: 9372.2). Total num frames: 33644544. Throughput: 0: 9578.0. Samples: 33641664. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 16:11:08,386][41256] Avg episode reward: [(0, '73.673')] +[2023-03-11 16:11:10,789][41544] Updated weights for policy 0, policy_version 65760 (0.0005) +[2023-03-11 16:11:13,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9625.6, 300 sec: 9358.3). Total num frames: 33689600. Throughput: 0: 9544.4. Samples: 33669304. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 16:11:13,386][41256] Avg episode reward: [(0, '71.672')] +[2023-03-11 16:11:15,177][41544] Updated weights for policy 0, policy_version 65840 (0.0005) +[2023-03-11 16:11:18,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9358.3). Total num frames: 33738752. Throughput: 0: 9492.9. Samples: 33726252. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 16:11:18,386][41256] Avg episode reward: [(0, '79.514')] +[2023-03-11 16:11:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000065896_33738752.pth... +[2023-03-11 16:11:18,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000065352_33460224.pth +[2023-03-11 16:11:19,366][41544] Updated weights for policy 0, policy_version 65920 (0.0004) +[2023-03-11 16:11:23,385][41256] Fps is (10 sec: 9830.3, 60 sec: 9557.3, 300 sec: 9358.3). Total num frames: 33787904. Throughput: 0: 9498.5. Samples: 33785320. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 16:11:23,386][41256] Avg episode reward: [(0, '78.630')] +[2023-03-11 16:11:23,497][41544] Updated weights for policy 0, policy_version 66000 (0.0005) +[2023-03-11 16:11:27,603][41544] Updated weights for policy 0, policy_version 66080 (0.0004) +[2023-03-11 16:11:28,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9557.3, 300 sec: 9358.3). Total num frames: 33837056. Throughput: 0: 9503.7. Samples: 33815344. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 16:11:28,386][41256] Avg episode reward: [(0, '78.464')] +[2023-03-11 16:11:31,934][41544] Updated weights for policy 0, policy_version 66160 (0.0005) +[2023-03-11 16:11:33,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9372.2). Total num frames: 33886208. Throughput: 0: 9443.6. Samples: 33872908. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 16:11:33,386][41256] Avg episode reward: [(0, '80.340')] +[2023-03-11 16:11:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000066184_33886208.pth... +[2023-03-11 16:11:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000065624_33599488.pth +[2023-03-11 16:11:36,292][41544] Updated weights for policy 0, policy_version 66240 (0.0005) +[2023-03-11 16:11:38,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9372.2). Total num frames: 33931264. Throughput: 0: 9463.9. Samples: 33929324. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 16:11:38,386][41256] Avg episode reward: [(0, '79.474')] +[2023-03-11 16:11:40,671][41544] Updated weights for policy 0, policy_version 66320 (0.0005) +[2023-03-11 16:11:43,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9386.1). Total num frames: 33980416. Throughput: 0: 9475.8. Samples: 33957400. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 16:11:43,386][41256] Avg episode reward: [(0, '78.878')] +[2023-03-11 16:11:45,057][41544] Updated weights for policy 0, policy_version 66400 (0.0005) +[2023-03-11 16:11:48,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9420.8, 300 sec: 9386.1). Total num frames: 34025472. Throughput: 0: 9484.2. Samples: 34013256. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:11:48,386][41256] Avg episode reward: [(0, '78.745')] +[2023-03-11 16:11:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000066456_34025472.pth... +[2023-03-11 16:11:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000065896_33738752.pth +[2023-03-11 16:11:49,423][41544] Updated weights for policy 0, policy_version 66480 (0.0004) +[2023-03-11 16:11:53,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9420.8, 300 sec: 9386.1). Total num frames: 34070528. Throughput: 0: 9515.9. Samples: 34069880. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:11:53,386][41256] Avg episode reward: [(0, '79.541')] +[2023-03-11 16:11:53,836][41544] Updated weights for policy 0, policy_version 66560 (0.0005) +[2023-03-11 16:11:58,026][41544] Updated weights for policy 0, policy_version 66640 (0.0004) +[2023-03-11 16:11:58,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9386.1). Total num frames: 34119680. Throughput: 0: 9546.1. Samples: 34098880. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:11:58,386][41256] Avg episode reward: [(0, '78.744')] +[2023-03-11 16:12:02,395][41544] Updated weights for policy 0, policy_version 66720 (0.0005) +[2023-03-11 16:12:03,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9489.1, 300 sec: 9400.0). Total num frames: 34168832. Throughput: 0: 9542.5. Samples: 34155664. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:12:03,386][41256] Avg episode reward: [(0, '77.603')] +[2023-03-11 16:12:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000066736_34168832.pth... 
+[2023-03-11 16:12:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000066184_33886208.pth +[2023-03-11 16:12:06,803][41544] Updated weights for policy 0, policy_version 66800 (0.0005) +[2023-03-11 16:12:08,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9400.0). Total num frames: 34213888. Throughput: 0: 9464.6. Samples: 34211228. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:12:08,386][41256] Avg episode reward: [(0, '82.840')] +[2023-03-11 16:12:11,243][41544] Updated weights for policy 0, policy_version 66880 (0.0005) +[2023-03-11 16:12:13,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9489.1, 300 sec: 9386.1). Total num frames: 34258944. Throughput: 0: 9408.3. Samples: 34238720. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:12:13,386][41256] Avg episode reward: [(0, '81.397')] +[2023-03-11 16:12:15,700][41544] Updated weights for policy 0, policy_version 66960 (0.0005) +[2023-03-11 16:12:18,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9489.1, 300 sec: 9400.0). Total num frames: 34308096. Throughput: 0: 9370.3. Samples: 34294572. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:12:18,386][41256] Avg episode reward: [(0, '79.693')] +[2023-03-11 16:12:18,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000067008_34308096.pth... +[2023-03-11 16:12:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000066456_34025472.pth +[2023-03-11 16:12:20,129][41544] Updated weights for policy 0, policy_version 67040 (0.0005) +[2023-03-11 16:12:23,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9386.1). Total num frames: 34353152. Throughput: 0: 9344.4. Samples: 34349824. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:12:23,386][41256] Avg episode reward: [(0, '80.002')] +[2023-03-11 16:12:24,456][41544] Updated weights for policy 0, policy_version 67120 (0.0005) +[2023-03-11 16:12:28,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9420.8, 300 sec: 9386.1). Total num frames: 34402304. Throughput: 0: 9351.4. Samples: 34378212. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:12:28,386][41256] Avg episode reward: [(0, '77.996')] +[2023-03-11 16:12:28,776][41544] Updated weights for policy 0, policy_version 67200 (0.0005) +[2023-03-11 16:12:33,188][41544] Updated weights for policy 0, policy_version 67280 (0.0005) +[2023-03-11 16:12:33,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9372.2). Total num frames: 34447360. Throughput: 0: 9373.7. Samples: 34435072. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:12:33,386][41256] Avg episode reward: [(0, '76.342')] +[2023-03-11 16:12:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000067280_34447360.pth... +[2023-03-11 16:12:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000066736_34168832.pth +[2023-03-11 16:12:37,715][41544] Updated weights for policy 0, policy_version 67360 (0.0005) +[2023-03-11 16:12:38,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9352.5, 300 sec: 9358.3). Total num frames: 34492416. Throughput: 0: 9322.2. Samples: 34489380. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:12:38,386][41256] Avg episode reward: [(0, '78.395')] +[2023-03-11 16:12:42,153][41544] Updated weights for policy 0, policy_version 67440 (0.0005) +[2023-03-11 16:12:43,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9344.4). Total num frames: 34537472. Throughput: 0: 9292.8. Samples: 34517056. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:12:43,386][41256] Avg episode reward: [(0, '77.438')] +[2023-03-11 16:12:46,555][41544] Updated weights for policy 0, policy_version 67520 (0.0005) +[2023-03-11 16:12:48,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9352.5, 300 sec: 9358.3). Total num frames: 34586624. Throughput: 0: 9276.2. Samples: 34573092. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:12:48,386][41256] Avg episode reward: [(0, '76.765')] +[2023-03-11 16:12:48,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000067552_34586624.pth... +[2023-03-11 16:12:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000067008_34308096.pth +[2023-03-11 16:12:50,856][41544] Updated weights for policy 0, policy_version 67600 (0.0005) +[2023-03-11 16:12:53,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9358.3). Total num frames: 34631680. Throughput: 0: 9297.2. Samples: 34629600. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:12:53,386][41256] Avg episode reward: [(0, '79.703')] +[2023-03-11 16:12:55,315][41544] Updated weights for policy 0, policy_version 67680 (0.0005) +[2023-03-11 16:12:58,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9352.5, 300 sec: 9358.3). Total num frames: 34680832. Throughput: 0: 9290.0. Samples: 34656768. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:12:58,386][41256] Avg episode reward: [(0, '76.528')] +[2023-03-11 16:12:59,633][41544] Updated weights for policy 0, policy_version 67760 (0.0005) +[2023-03-11 16:13:03,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9284.3, 300 sec: 9358.3). Total num frames: 34725888. Throughput: 0: 9330.8. Samples: 34714460. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:13:03,386][41256] Avg episode reward: [(0, '72.995')] +[2023-03-11 16:13:03,427][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000067832_34729984.pth... +[2023-03-11 16:13:03,428][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000067280_34447360.pth +[2023-03-11 16:13:03,852][41544] Updated weights for policy 0, policy_version 67840 (0.0004) +[2023-03-11 16:13:08,179][41544] Updated weights for policy 0, policy_version 67920 (0.0005) +[2023-03-11 16:13:08,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9372.2). Total num frames: 34775040. Throughput: 0: 9370.2. Samples: 34771484. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:13:08,386][41256] Avg episode reward: [(0, '75.798')] +[2023-03-11 16:13:12,473][41544] Updated weights for policy 0, policy_version 68000 (0.0005) +[2023-03-11 16:13:13,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9420.8, 300 sec: 9386.1). Total num frames: 34824192. Throughput: 0: 9372.0. Samples: 34799952. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:13:13,386][41256] Avg episode reward: [(0, '78.951')] +[2023-03-11 16:13:16,762][41544] Updated weights for policy 0, policy_version 68080 (0.0005) +[2023-03-11 16:13:18,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9386.1). Total num frames: 34869248. Throughput: 0: 9386.8. Samples: 34857480. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:13:18,386][41256] Avg episode reward: [(0, '80.449')] +[2023-03-11 16:13:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000068104_34869248.pth... 
+[2023-03-11 16:13:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000067552_34586624.pth +[2023-03-11 16:13:21,119][41544] Updated weights for policy 0, policy_version 68160 (0.0005) +[2023-03-11 16:13:23,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9420.8, 300 sec: 9386.1). Total num frames: 34918400. Throughput: 0: 9442.9. Samples: 34914312. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:13:23,386][41256] Avg episode reward: [(0, '77.803')] +[2023-03-11 16:13:25,464][41544] Updated weights for policy 0, policy_version 68240 (0.0005) +[2023-03-11 16:13:28,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9386.1). Total num frames: 34963456. Throughput: 0: 9462.4. Samples: 34942864. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:13:28,386][41256] Avg episode reward: [(0, '76.002')] +[2023-03-11 16:13:29,840][41544] Updated weights for policy 0, policy_version 68320 (0.0005) +[2023-03-11 16:13:33,385][41256] Fps is (10 sec: 9420.7, 60 sec: 9420.8, 300 sec: 9400.0). Total num frames: 35012608. Throughput: 0: 9472.6. Samples: 34999356. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:13:33,386][41256] Avg episode reward: [(0, '75.258')] +[2023-03-11 16:13:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000068384_35012608.pth... +[2023-03-11 16:13:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000067832_34729984.pth +[2023-03-11 16:13:34,164][41544] Updated weights for policy 0, policy_version 68400 (0.0005) +[2023-03-11 16:13:38,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9400.0). Total num frames: 35057664. Throughput: 0: 9453.3. Samples: 35055000. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:13:38,386][41256] Avg episode reward: [(0, '71.976')] +[2023-03-11 16:13:38,564][41544] Updated weights for policy 0, policy_version 68480 (0.0005) +[2023-03-11 16:13:42,869][41544] Updated weights for policy 0, policy_version 68560 (0.0005) +[2023-03-11 16:13:43,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9413.9). Total num frames: 35106816. Throughput: 0: 9473.9. Samples: 35083096. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:13:43,386][41256] Avg episode reward: [(0, '70.236')] +[2023-03-11 16:13:47,216][41544] Updated weights for policy 0, policy_version 68640 (0.0005) +[2023-03-11 16:13:48,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9420.8, 300 sec: 9400.0). Total num frames: 35151872. Throughput: 0: 9457.2. Samples: 35140036. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:13:48,386][41256] Avg episode reward: [(0, '73.901')] +[2023-03-11 16:13:48,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000068656_35151872.pth... +[2023-03-11 16:13:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000068104_34869248.pth +[2023-03-11 16:13:51,519][41544] Updated weights for policy 0, policy_version 68720 (0.0005) +[2023-03-11 16:13:53,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9413.9). Total num frames: 35201024. Throughput: 0: 9455.7. Samples: 35196992. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:13:53,386][41256] Avg episode reward: [(0, '77.316')] +[2023-03-11 16:13:55,701][41544] Updated weights for policy 0, policy_version 68800 (0.0005) +[2023-03-11 16:13:58,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9489.1, 300 sec: 9441.6). Total num frames: 35250176. Throughput: 0: 9487.8. Samples: 35226904. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:13:58,386][41256] Avg episode reward: [(0, '78.696')] +[2023-03-11 16:13:59,916][41544] Updated weights for policy 0, policy_version 68880 (0.0005) +[2023-03-11 16:14:03,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9455.5). Total num frames: 35299328. Throughput: 0: 9492.0. Samples: 35284620. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:14:03,386][41256] Avg episode reward: [(0, '71.364')] +[2023-03-11 16:14:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000068944_35299328.pth... +[2023-03-11 16:14:03,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000068384_35012608.pth +[2023-03-11 16:14:04,182][41544] Updated weights for policy 0, policy_version 68960 (0.0005) +[2023-03-11 16:14:08,181][41544] Updated weights for policy 0, policy_version 69040 (0.0006) +[2023-03-11 16:14:08,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9455.5). Total num frames: 35348480. Throughput: 0: 9564.3. Samples: 35344704. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:14:08,386][41256] Avg episode reward: [(0, '68.865')] +[2023-03-11 16:14:12,253][41544] Updated weights for policy 0, policy_version 69120 (0.0005) +[2023-03-11 16:14:13,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9557.3, 300 sec: 9469.4). Total num frames: 35397632. Throughput: 0: 9618.0. Samples: 35375672. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:14:13,386][41256] Avg episode reward: [(0, '74.729')] +[2023-03-11 16:14:16,215][41544] Updated weights for policy 0, policy_version 69200 (0.0004) +[2023-03-11 16:14:18,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9693.9, 300 sec: 9497.2). Total num frames: 35450880. Throughput: 0: 9715.5. Samples: 35436552. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:14:18,386][41256] Avg episode reward: [(0, '73.835')] +[2023-03-11 16:14:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000069240_35450880.pth... +[2023-03-11 16:14:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000068656_35151872.pth +[2023-03-11 16:14:20,277][41544] Updated weights for policy 0, policy_version 69280 (0.0005) +[2023-03-11 16:14:23,385][41256] Fps is (10 sec: 10239.9, 60 sec: 9693.9, 300 sec: 9511.0). Total num frames: 35500032. Throughput: 0: 9800.0. Samples: 35496000. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:14:23,386][41256] Avg episode reward: [(0, '71.532')] +[2023-03-11 16:14:24,429][41544] Updated weights for policy 0, policy_version 69360 (0.0004) +[2023-03-11 16:14:28,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9762.1, 300 sec: 9524.9). Total num frames: 35549184. Throughput: 0: 9844.4. Samples: 35526092. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:14:28,386][41256] Avg episode reward: [(0, '69.072')] +[2023-03-11 16:14:28,527][41544] Updated weights for policy 0, policy_version 69440 (0.0004) +[2023-03-11 16:14:32,625][41544] Updated weights for policy 0, policy_version 69520 (0.0003) +[2023-03-11 16:14:33,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9524.9). Total num frames: 35598336. Throughput: 0: 9912.8. Samples: 35586112. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:14:33,386][41256] Avg episode reward: [(0, '73.699')] +[2023-03-11 16:14:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000069528_35598336.pth... 
+[2023-03-11 16:14:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000068944_35299328.pth +[2023-03-11 16:14:36,683][41544] Updated weights for policy 0, policy_version 69600 (0.0004) +[2023-03-11 16:14:38,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9552.7). Total num frames: 35651584. Throughput: 0: 10010.8. Samples: 35647480. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:14:38,386][41256] Avg episode reward: [(0, '75.316')] +[2023-03-11 16:14:40,777][41544] Updated weights for policy 0, policy_version 69680 (0.0004) +[2023-03-11 16:14:43,385][41256] Fps is (10 sec: 10240.1, 60 sec: 9898.7, 300 sec: 9566.6). Total num frames: 35700736. Throughput: 0: 9990.9. Samples: 35676496. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:14:43,386][41256] Avg episode reward: [(0, '70.875')] +[2023-03-11 16:14:44,883][41544] Updated weights for policy 0, policy_version 69760 (0.0004) +[2023-03-11 16:14:48,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9580.5). Total num frames: 35749888. Throughput: 0: 10049.9. Samples: 35736864. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:14:48,386][41256] Avg episode reward: [(0, '69.835')] +[2023-03-11 16:14:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000069824_35749888.pth... +[2023-03-11 16:14:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000069240_35450880.pth +[2023-03-11 16:14:48,906][41544] Updated weights for policy 0, policy_version 69840 (0.0003) +[2023-03-11 16:14:53,127][41544] Updated weights for policy 0, policy_version 69920 (0.0004) +[2023-03-11 16:14:53,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9594.4). Total num frames: 35799040. Throughput: 0: 10032.0. Samples: 35796144. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:14:53,386][41256] Avg episode reward: [(0, '74.353')] +[2023-03-11 16:14:57,353][41544] Updated weights for policy 0, policy_version 70000 (0.0005) +[2023-03-11 16:14:58,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9594.4). Total num frames: 35848192. Throughput: 0: 9993.8. Samples: 35825392. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:14:58,386][41256] Avg episode reward: [(0, '72.420')] +[2023-03-11 16:15:01,703][41544] Updated weights for policy 0, policy_version 70080 (0.0005) +[2023-03-11 16:15:03,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9898.7, 300 sec: 9580.5). Total num frames: 35893248. Throughput: 0: 9903.1. Samples: 35882192. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:15:03,386][41256] Avg episode reward: [(0, '79.077')] +[2023-03-11 16:15:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000070104_35893248.pth... +[2023-03-11 16:15:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000069528_35598336.pth +[2023-03-11 16:15:05,990][41544] Updated weights for policy 0, policy_version 70160 (0.0005) +[2023-03-11 16:15:08,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9898.7, 300 sec: 9594.4). Total num frames: 35942400. Throughput: 0: 9846.6. Samples: 35939096. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:15:08,386][41256] Avg episode reward: [(0, '72.092')] +[2023-03-11 16:15:10,337][41544] Updated weights for policy 0, policy_version 70240 (0.0005) +[2023-03-11 16:15:13,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9898.7, 300 sec: 9580.5). Total num frames: 35991552. Throughput: 0: 9810.7. Samples: 35967572. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:15:13,386][41256] Avg episode reward: [(0, '70.768')] +[2023-03-11 16:15:14,711][41544] Updated weights for policy 0, policy_version 70320 (0.0005) +[2023-03-11 16:15:18,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9566.6). Total num frames: 36036608. Throughput: 0: 9738.2. Samples: 36024332. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:15:18,386][41256] Avg episode reward: [(0, '73.340')] +[2023-03-11 16:15:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000070384_36036608.pth... +[2023-03-11 16:15:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000069824_35749888.pth +[2023-03-11 16:15:19,020][41544] Updated weights for policy 0, policy_version 70400 (0.0005) +[2023-03-11 16:15:23,312][41544] Updated weights for policy 0, policy_version 70480 (0.0005) +[2023-03-11 16:15:23,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9566.6). Total num frames: 36085760. Throughput: 0: 9638.1. Samples: 36081196. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:15:23,386][41256] Avg episode reward: [(0, '76.035')] +[2023-03-11 16:15:27,306][41544] Updated weights for policy 0, policy_version 70560 (0.0004) +[2023-03-11 16:15:28,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9762.1, 300 sec: 9566.6). Total num frames: 36134912. Throughput: 0: 9670.9. Samples: 36111684. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:15:28,386][41256] Avg episode reward: [(0, '72.819')] +[2023-03-11 16:15:31,383][41544] Updated weights for policy 0, policy_version 70640 (0.0004) +[2023-03-11 16:15:33,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9762.2, 300 sec: 9566.6). Total num frames: 36184064. Throughput: 0: 9673.0. Samples: 36172148. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:15:33,386][41256] Avg episode reward: [(0, '74.143')] +[2023-03-11 16:15:33,388][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000070672_36184064.pth... +[2023-03-11 16:15:33,390][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000070104_35893248.pth +[2023-03-11 16:15:35,429][41544] Updated weights for policy 0, policy_version 70720 (0.0004) +[2023-03-11 16:15:38,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9566.6). Total num frames: 36233216. Throughput: 0: 9685.8. Samples: 36232004. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:15:38,386][41256] Avg episode reward: [(0, '78.879')] +[2023-03-11 16:15:39,679][41544] Updated weights for policy 0, policy_version 70800 (0.0005) +[2023-03-11 16:15:43,385][41256] Fps is (10 sec: 10239.9, 60 sec: 9762.1, 300 sec: 9580.5). Total num frames: 36286464. Throughput: 0: 9686.6. Samples: 36261288. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:15:43,386][41256] Avg episode reward: [(0, '78.497')] +[2023-03-11 16:15:43,738][41544] Updated weights for policy 0, policy_version 70880 (0.0004) +[2023-03-11 16:15:47,996][41544] Updated weights for policy 0, policy_version 70960 (0.0005) +[2023-03-11 16:15:48,386][41256] Fps is (10 sec: 9830.3, 60 sec: 9693.9, 300 sec: 9580.5). Total num frames: 36331520. Throughput: 0: 9750.0. Samples: 36320940. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:15:48,386][41256] Avg episode reward: [(0, '80.177')] +[2023-03-11 16:15:48,412][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000070968_36335616.pth... 
+[2023-03-11 16:15:48,415][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000070384_36036608.pth +[2023-03-11 16:15:52,303][41544] Updated weights for policy 0, policy_version 71040 (0.0005) +[2023-03-11 16:15:53,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9693.9, 300 sec: 9580.5). Total num frames: 36380672. Throughput: 0: 9741.9. Samples: 36377480. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:15:53,386][41256] Avg episode reward: [(0, '78.204')] +[2023-03-11 16:15:56,498][41544] Updated weights for policy 0, policy_version 71120 (0.0005) +[2023-03-11 16:15:58,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9693.9, 300 sec: 9594.4). Total num frames: 36429824. Throughput: 0: 9759.5. Samples: 36406748. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:15:58,386][41256] Avg episode reward: [(0, '71.114')] +[2023-03-11 16:16:00,656][41544] Updated weights for policy 0, policy_version 71200 (0.0004) +[2023-03-11 16:16:03,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9608.2). Total num frames: 36478976. Throughput: 0: 9817.2. Samples: 36466104. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:16:03,386][41256] Avg episode reward: [(0, '72.860')] +[2023-03-11 16:16:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000071248_36478976.pth... +[2023-03-11 16:16:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000070672_36184064.pth +[2023-03-11 16:16:04,842][41544] Updated weights for policy 0, policy_version 71280 (0.0005) +[2023-03-11 16:16:08,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9622.1). Total num frames: 36528128. Throughput: 0: 9827.2. Samples: 36523420. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:16:08,386][41256] Avg episode reward: [(0, '78.263')] +[2023-03-11 16:16:09,236][41544] Updated weights for policy 0, policy_version 71360 (0.0005) +[2023-03-11 16:16:13,363][41544] Updated weights for policy 0, policy_version 71440 (0.0005) +[2023-03-11 16:16:13,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9622.1). Total num frames: 36577280. Throughput: 0: 9777.7. Samples: 36551680. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:16:13,386][41256] Avg episode reward: [(0, '77.749')] +[2023-03-11 16:16:17,429][41544] Updated weights for policy 0, policy_version 71520 (0.0004) +[2023-03-11 16:16:18,386][41256] Fps is (10 sec: 9830.3, 60 sec: 9830.4, 300 sec: 9622.1). Total num frames: 36626432. Throughput: 0: 9775.7. Samples: 36612056. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:16:18,386][41256] Avg episode reward: [(0, '76.946')] +[2023-03-11 16:16:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000071536_36626432.pth... +[2023-03-11 16:16:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000070968_36335616.pth +[2023-03-11 16:16:21,695][41544] Updated weights for policy 0, policy_version 71600 (0.0005) +[2023-03-11 16:16:23,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9622.1). Total num frames: 36675584. Throughput: 0: 9761.1. Samples: 36671252. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:16:23,386][41256] Avg episode reward: [(0, '72.136')] +[2023-03-11 16:16:25,695][41544] Updated weights for policy 0, policy_version 71680 (0.0004) +[2023-03-11 16:16:28,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9830.4, 300 sec: 9622.1). Total num frames: 36724736. Throughput: 0: 9782.9. Samples: 36701520. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:16:28,386][41256] Avg episode reward: [(0, '72.714')] +[2023-03-11 16:16:29,770][41544] Updated weights for policy 0, policy_version 71760 (0.0005) +[2023-03-11 16:16:33,385][41256] Fps is (10 sec: 10239.9, 60 sec: 9898.7, 300 sec: 9649.9). Total num frames: 36777984. Throughput: 0: 9803.8. Samples: 36762112. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:16:33,386][41256] Avg episode reward: [(0, '74.606')] +[2023-03-11 16:16:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000071832_36777984.pth... +[2023-03-11 16:16:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000071248_36478976.pth +[2023-03-11 16:16:33,754][41544] Updated weights for policy 0, policy_version 71840 (0.0004) +[2023-03-11 16:16:37,761][41544] Updated weights for policy 0, policy_version 71920 (0.0004) +[2023-03-11 16:16:38,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9649.9). Total num frames: 36827136. Throughput: 0: 9913.0. Samples: 36823564. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:16:38,386][41256] Avg episode reward: [(0, '71.740')] +[2023-03-11 16:16:41,800][41544] Updated weights for policy 0, policy_version 72000 (0.0005) +[2023-03-11 16:16:43,385][41256] Fps is (10 sec: 10240.2, 60 sec: 9898.7, 300 sec: 9677.7). Total num frames: 36880384. Throughput: 0: 9942.8. Samples: 36854172. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:16:43,386][41256] Avg episode reward: [(0, '76.708')] +[2023-03-11 16:16:45,804][41544] Updated weights for policy 0, policy_version 72080 (0.0004) +[2023-03-11 16:16:48,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 9691.6). Total num frames: 36929536. Throughput: 0: 9986.7. Samples: 36915504. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:16:48,386][41256] Avg episode reward: [(0, '76.510')] +[2023-03-11 16:16:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000072128_36929536.pth... +[2023-03-11 16:16:48,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000071536_36626432.pth +[2023-03-11 16:16:49,941][41544] Updated weights for policy 0, policy_version 72160 (0.0005) +[2023-03-11 16:16:53,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9691.6). Total num frames: 36978688. Throughput: 0: 10039.1. Samples: 36975180. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:16:53,386][41256] Avg episode reward: [(0, '73.395')] +[2023-03-11 16:16:53,953][41544] Updated weights for policy 0, policy_version 72240 (0.0004) +[2023-03-11 16:16:57,975][41544] Updated weights for policy 0, policy_version 72320 (0.0004) +[2023-03-11 16:16:58,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9705.4). Total num frames: 37031936. Throughput: 0: 10096.9. Samples: 37006040. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:16:58,386][41256] Avg episode reward: [(0, '72.697')] +[2023-03-11 16:17:02,028][41544] Updated weights for policy 0, policy_version 72400 (0.0005) +[2023-03-11 16:17:03,386][41256] Fps is (10 sec: 10239.8, 60 sec: 10035.2, 300 sec: 9719.3). Total num frames: 37081088. Throughput: 0: 10114.5. Samples: 37067208. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:17:03,386][41256] Avg episode reward: [(0, '80.586')] +[2023-03-11 16:17:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000072424_37081088.pth... 
+[2023-03-11 16:17:03,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000071832_36777984.pth +[2023-03-11 16:17:06,342][41544] Updated weights for policy 0, policy_version 72480 (0.0005) +[2023-03-11 16:17:08,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9966.9, 300 sec: 9719.3). Total num frames: 37126144. Throughput: 0: 10065.1. Samples: 37124184. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:17:08,386][41256] Avg episode reward: [(0, '84.296')] +[2023-03-11 16:17:10,551][41544] Updated weights for policy 0, policy_version 72560 (0.0004) +[2023-03-11 16:17:13,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9966.9, 300 sec: 9719.3). Total num frames: 37175296. Throughput: 0: 10047.0. Samples: 37153636. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:17:13,386][41256] Avg episode reward: [(0, '77.902')] +[2023-03-11 16:17:14,804][41544] Updated weights for policy 0, policy_version 72640 (0.0004) +[2023-03-11 16:17:18,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9733.2). Total num frames: 37224448. Throughput: 0: 9973.8. Samples: 37210932. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:17:18,386][41256] Avg episode reward: [(0, '81.031')] +[2023-03-11 16:17:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000072704_37224448.pth... +[2023-03-11 16:17:18,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000072128_36929536.pth +[2023-03-11 16:17:19,277][41544] Updated weights for policy 0, policy_version 72720 (0.0005) +[2023-03-11 16:17:23,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9898.7, 300 sec: 9719.3). Total num frames: 37269504. Throughput: 0: 9845.6. Samples: 37266616. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:17:23,386][41256] Avg episode reward: [(0, '72.374')] +[2023-03-11 16:17:23,541][41544] Updated weights for policy 0, policy_version 72800 (0.0004) +[2023-03-11 16:17:27,756][41544] Updated weights for policy 0, policy_version 72880 (0.0005) +[2023-03-11 16:17:28,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9898.7, 300 sec: 9733.2). Total num frames: 37318656. Throughput: 0: 9805.3. Samples: 37295412. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 16:17:28,386][41256] Avg episode reward: [(0, '71.243')] +[2023-03-11 16:17:31,942][41544] Updated weights for policy 0, policy_version 72960 (0.0004) +[2023-03-11 16:17:33,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9747.1). Total num frames: 37367808. Throughput: 0: 9765.9. Samples: 37354972. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 16:17:33,386][41256] Avg episode reward: [(0, '74.802')] +[2023-03-11 16:17:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000072984_37367808.pth... +[2023-03-11 16:17:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000072424_37081088.pth +[2023-03-11 16:17:36,194][41544] Updated weights for policy 0, policy_version 73040 (0.0005) +[2023-03-11 16:17:38,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9747.1). Total num frames: 37412864. Throughput: 0: 9715.5. Samples: 37412380. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 16:17:38,386][41256] Avg episode reward: [(0, '75.005')] +[2023-03-11 16:17:40,527][41544] Updated weights for policy 0, policy_version 73120 (0.0005) +[2023-03-11 16:17:43,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9693.8, 300 sec: 9747.1). Total num frames: 37462016. Throughput: 0: 9658.2. Samples: 37440660. 
Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 16:17:43,386][41256] Avg episode reward: [(0, '76.808')] +[2023-03-11 16:17:44,756][41544] Updated weights for policy 0, policy_version 73200 (0.0004) +[2023-03-11 16:17:48,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9761.0). Total num frames: 37511168. Throughput: 0: 9594.1. Samples: 37498944. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 16:17:48,386][41256] Avg episode reward: [(0, '80.621')] +[2023-03-11 16:17:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000073264_37511168.pth... +[2023-03-11 16:17:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000072704_37224448.pth +[2023-03-11 16:17:48,947][41544] Updated weights for policy 0, policy_version 73280 (0.0005) +[2023-03-11 16:17:53,071][41544] Updated weights for policy 0, policy_version 73360 (0.0003) +[2023-03-11 16:17:53,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9761.0). Total num frames: 37560320. Throughput: 0: 9647.7. Samples: 37558328. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 16:17:53,396][41256] Avg episode reward: [(0, '78.044')] +[2023-03-11 16:17:57,236][41544] Updated weights for policy 0, policy_version 73440 (0.0004) +[2023-03-11 16:17:58,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9774.9). Total num frames: 37609472. Throughput: 0: 9650.8. Samples: 37587924. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 16:17:58,397][41256] Avg episode reward: [(0, '77.820')] +[2023-03-11 16:18:01,605][41544] Updated weights for policy 0, policy_version 73520 (0.0005) +[2023-03-11 16:18:03,386][41256] Fps is (10 sec: 9830.3, 60 sec: 9625.6, 300 sec: 9774.9). Total num frames: 37658624. Throughput: 0: 9634.0. Samples: 37644464. 
Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 16:18:03,386][41256] Avg episode reward: [(0, '78.241')] +[2023-03-11 16:18:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000073552_37658624.pth... +[2023-03-11 16:18:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000072984_37367808.pth +[2023-03-11 16:18:05,932][41544] Updated weights for policy 0, policy_version 73600 (0.0005) +[2023-03-11 16:18:08,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9761.0). Total num frames: 37703680. Throughput: 0: 9652.4. Samples: 37700972. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 16:18:08,386][41256] Avg episode reward: [(0, '78.043')] +[2023-03-11 16:18:10,309][41544] Updated weights for policy 0, policy_version 73680 (0.0005) +[2023-03-11 16:18:13,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9625.6, 300 sec: 9774.9). Total num frames: 37752832. Throughput: 0: 9645.1. Samples: 37729440. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:18:13,386][41256] Avg episode reward: [(0, '76.620')] +[2023-03-11 16:18:14,553][41544] Updated weights for policy 0, policy_version 73760 (0.0005) +[2023-03-11 16:18:18,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9774.9). Total num frames: 37801984. Throughput: 0: 9609.8. Samples: 37787412. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:18:18,386][41256] Avg episode reward: [(0, '76.553')] +[2023-03-11 16:18:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000073832_37801984.pth... 
+[2023-03-11 16:18:18,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000073264_37511168.pth +[2023-03-11 16:18:18,738][41544] Updated weights for policy 0, policy_version 73840 (0.0005) +[2023-03-11 16:18:22,883][41544] Updated weights for policy 0, policy_version 73920 (0.0005) +[2023-03-11 16:18:23,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9788.7). Total num frames: 37851136. Throughput: 0: 9656.4. Samples: 37846916. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:18:23,386][41256] Avg episode reward: [(0, '71.277')] +[2023-03-11 16:18:27,048][41544] Updated weights for policy 0, policy_version 74000 (0.0005) +[2023-03-11 16:18:28,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9788.7). Total num frames: 37900288. Throughput: 0: 9673.0. Samples: 37875944. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:18:28,386][41256] Avg episode reward: [(0, '74.760')] +[2023-03-11 16:18:31,238][41544] Updated weights for policy 0, policy_version 74080 (0.0005) +[2023-03-11 16:18:33,385][41256] Fps is (10 sec: 9830.3, 60 sec: 9693.9, 300 sec: 9802.6). Total num frames: 37949440. Throughput: 0: 9689.0. Samples: 37934948. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:18:33,386][41256] Avg episode reward: [(0, '61.332')] +[2023-03-11 16:18:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000074120_37949440.pth... +[2023-03-11 16:18:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000073552_37658624.pth +[2023-03-11 16:18:35,292][41544] Updated weights for policy 0, policy_version 74160 (0.0005) +[2023-03-11 16:18:38,386][41256] Fps is (10 sec: 9830.3, 60 sec: 9762.1, 300 sec: 9802.6). Total num frames: 37998592. Throughput: 0: 9716.5. Samples: 37995572. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:18:38,386][41256] Avg episode reward: [(0, '67.047')] +[2023-03-11 16:18:39,375][41544] Updated weights for policy 0, policy_version 74240 (0.0005) +[2023-03-11 16:18:43,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9816.5). Total num frames: 38047744. Throughput: 0: 9707.0. Samples: 38024740. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:18:43,386][41256] Avg episode reward: [(0, '70.538')] +[2023-03-11 16:18:43,485][41544] Updated weights for policy 0, policy_version 74320 (0.0005) +[2023-03-11 16:18:47,620][41544] Updated weights for policy 0, policy_version 74400 (0.0005) +[2023-03-11 16:18:48,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9762.1, 300 sec: 9816.5). Total num frames: 38096896. Throughput: 0: 9788.0. Samples: 38084924. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:18:48,386][41256] Avg episode reward: [(0, '71.079')] +[2023-03-11 16:18:48,388][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000074408_38096896.pth... +[2023-03-11 16:18:48,390][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000073832_37801984.pth +[2023-03-11 16:18:51,804][41544] Updated weights for policy 0, policy_version 74480 (0.0005) +[2023-03-11 16:18:53,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9816.5). Total num frames: 38146048. Throughput: 0: 9836.7. Samples: 38143624. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:18:53,386][41256] Avg episode reward: [(0, '73.398')] +[2023-03-11 16:18:56,087][41544] Updated weights for policy 0, policy_version 74560 (0.0005) +[2023-03-11 16:18:58,385][41256] Fps is (10 sec: 9830.3, 60 sec: 9762.1, 300 sec: 9816.5). Total num frames: 38195200. Throughput: 0: 9839.1. Samples: 38172200. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:18:58,386][41256] Avg episode reward: [(0, '73.383')] +[2023-03-11 16:19:00,503][41544] Updated weights for policy 0, policy_version 74640 (0.0005) +[2023-03-11 16:19:03,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9802.6). Total num frames: 38240256. Throughput: 0: 9791.7. Samples: 38228040. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:19:03,386][41256] Avg episode reward: [(0, '75.044')] +[2023-03-11 16:19:03,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000074688_38240256.pth... +[2023-03-11 16:19:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000074120_37949440.pth +[2023-03-11 16:19:04,934][41544] Updated weights for policy 0, policy_version 74720 (0.0005) +[2023-03-11 16:19:08,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9693.9, 300 sec: 9788.7). Total num frames: 38285312. Throughput: 0: 9703.5. Samples: 38283572. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:19:08,386][41256] Avg episode reward: [(0, '83.017')] +[2023-03-11 16:19:09,387][41544] Updated weights for policy 0, policy_version 74800 (0.0005) +[2023-03-11 16:19:13,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9774.9). Total num frames: 38334464. Throughput: 0: 9678.7. Samples: 38311488. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:19:13,386][41256] Avg episode reward: [(0, '83.350')] +[2023-03-11 16:19:13,717][41544] Updated weights for policy 0, policy_version 74880 (0.0005) +[2023-03-11 16:19:18,152][41544] Updated weights for policy 0, policy_version 74960 (0.0004) +[2023-03-11 16:19:18,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9761.0). Total num frames: 38379520. Throughput: 0: 9607.7. Samples: 38367296. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:19:18,386][41256] Avg episode reward: [(0, '83.937')] +[2023-03-11 16:19:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000074960_38379520.pth... +[2023-03-11 16:19:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000074408_38096896.pth +[2023-03-11 16:19:22,487][41544] Updated weights for policy 0, policy_version 75040 (0.0005) +[2023-03-11 16:19:23,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9761.0). Total num frames: 38428672. Throughput: 0: 9523.5. Samples: 38424128. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:19:23,386][41256] Avg episode reward: [(0, '81.284')] +[2023-03-11 16:19:26,881][41544] Updated weights for policy 0, policy_version 75120 (0.0005) +[2023-03-11 16:19:28,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9747.1). Total num frames: 38473728. Throughput: 0: 9494.8. Samples: 38452004. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:19:28,386][41256] Avg episode reward: [(0, '82.368')] +[2023-03-11 16:19:31,235][41544] Updated weights for policy 0, policy_version 75200 (0.0005) +[2023-03-11 16:19:33,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9557.3, 300 sec: 9733.2). Total num frames: 38522880. Throughput: 0: 9402.8. Samples: 38508052. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:19:33,386][41256] Avg episode reward: [(0, '77.379')] +[2023-03-11 16:19:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000075240_38522880.pth... 
+[2023-03-11 16:19:33,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000074688_38240256.pth +[2023-03-11 16:19:35,581][41544] Updated weights for policy 0, policy_version 75280 (0.0005) +[2023-03-11 16:19:38,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9719.3). Total num frames: 38567936. Throughput: 0: 9339.6. Samples: 38563904. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:19:38,386][41256] Avg episode reward: [(0, '82.386')] +[2023-03-11 16:19:40,039][41544] Updated weights for policy 0, policy_version 75360 (0.0005) +[2023-03-11 16:19:43,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9420.8, 300 sec: 9705.4). Total num frames: 38612992. Throughput: 0: 9335.7. Samples: 38592304. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:19:43,386][41256] Avg episode reward: [(0, '80.515')] +[2023-03-11 16:19:44,297][41544] Updated weights for policy 0, policy_version 75440 (0.0004) +[2023-03-11 16:19:48,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9705.4). Total num frames: 38662144. Throughput: 0: 9373.4. Samples: 38649844. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:19:48,386][41256] Avg episode reward: [(0, '79.800')] +[2023-03-11 16:19:48,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000075512_38662144.pth... +[2023-03-11 16:19:48,393][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000074960_38379520.pth +[2023-03-11 16:19:48,655][41544] Updated weights for policy 0, policy_version 75520 (0.0005) +[2023-03-11 16:19:52,880][41544] Updated weights for policy 0, policy_version 75600 (0.0005) +[2023-03-11 16:19:53,386][41256] Fps is (10 sec: 9830.3, 60 sec: 9420.8, 300 sec: 9705.4). Total num frames: 38711296. Throughput: 0: 9413.9. Samples: 38707200. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:19:53,386][41256] Avg episode reward: [(0, '83.402')] +[2023-03-11 16:19:57,365][41544] Updated weights for policy 0, policy_version 75680 (0.0005) +[2023-03-11 16:19:58,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9705.4). Total num frames: 38756352. Throughput: 0: 9394.9. Samples: 38734260. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:19:58,386][41256] Avg episode reward: [(0, '75.027')] +[2023-03-11 16:20:01,816][41544] Updated weights for policy 0, policy_version 75760 (0.0005) +[2023-03-11 16:20:03,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9352.5, 300 sec: 9691.6). Total num frames: 38801408. Throughput: 0: 9375.5. Samples: 38789192. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:20:03,386][41256] Avg episode reward: [(0, '74.708')] +[2023-03-11 16:20:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000075784_38801408.pth... +[2023-03-11 16:20:03,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000075240_38522880.pth +[2023-03-11 16:20:06,040][41544] Updated weights for policy 0, policy_version 75840 (0.0005) +[2023-03-11 16:20:08,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9691.6). Total num frames: 38850560. Throughput: 0: 9386.8. Samples: 38846536. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:20:08,386][41256] Avg episode reward: [(0, '71.903')] +[2023-03-11 16:20:10,472][41544] Updated weights for policy 0, policy_version 75920 (0.0005) +[2023-03-11 16:20:13,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9691.6). Total num frames: 38895616. Throughput: 0: 9395.7. Samples: 38874808. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:20:13,386][41256] Avg episode reward: [(0, '74.983')] +[2023-03-11 16:20:14,892][41544] Updated weights for policy 0, policy_version 76000 (0.0006) +[2023-03-11 16:20:18,386][41256] Fps is (10 sec: 9011.1, 60 sec: 9352.5, 300 sec: 9677.7). Total num frames: 38940672. Throughput: 0: 9377.5. Samples: 38930040. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:20:18,386][41256] Avg episode reward: [(0, '75.047')] +[2023-03-11 16:20:18,445][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000076064_38944768.pth... +[2023-03-11 16:20:18,447][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000075512_38662144.pth +[2023-03-11 16:20:19,311][41544] Updated weights for policy 0, policy_version 76080 (0.0005) +[2023-03-11 16:20:23,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9677.7). Total num frames: 38989824. Throughput: 0: 9375.3. Samples: 38985792. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:20:23,386][41256] Avg episode reward: [(0, '74.298')] +[2023-03-11 16:20:23,665][41544] Updated weights for policy 0, policy_version 76160 (0.0004) +[2023-03-11 16:20:27,890][41544] Updated weights for policy 0, policy_version 76240 (0.0003) +[2023-03-11 16:20:28,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9420.8, 300 sec: 9677.7). Total num frames: 39038976. Throughput: 0: 9382.8. Samples: 39014528. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:20:28,386][41256] Avg episode reward: [(0, '80.066')] +[2023-03-11 16:20:32,180][41544] Updated weights for policy 0, policy_version 76320 (0.0004) +[2023-03-11 16:20:33,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9663.8). Total num frames: 39084032. Throughput: 0: 9395.5. Samples: 39072640. 
Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 16:20:33,386][41256] Avg episode reward: [(0, '75.604')] +[2023-03-11 16:20:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000076336_39084032.pth... +[2023-03-11 16:20:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000075784_38801408.pth +[2023-03-11 16:20:36,433][41544] Updated weights for policy 0, policy_version 76400 (0.0004) +[2023-03-11 16:20:38,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9649.9). Total num frames: 39133184. Throughput: 0: 9403.7. Samples: 39130364. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 16:20:38,386][41256] Avg episode reward: [(0, '77.734')] +[2023-03-11 16:20:40,718][41544] Updated weights for policy 0, policy_version 76480 (0.0004) +[2023-03-11 16:20:43,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9489.1, 300 sec: 9663.8). Total num frames: 39182336. Throughput: 0: 9433.9. Samples: 39158784. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 16:20:43,386][41256] Avg episode reward: [(0, '79.723')] +[2023-03-11 16:20:44,918][41544] Updated weights for policy 0, policy_version 76560 (0.0005) +[2023-03-11 16:20:48,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9649.9). Total num frames: 39227392. Throughput: 0: 9486.1. Samples: 39216068. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 16:20:48,386][41256] Avg episode reward: [(0, '80.613')] +[2023-03-11 16:20:48,388][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000076616_39227392.pth... 
+[2023-03-11 16:20:48,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000076064_38944768.pth +[2023-03-11 16:20:49,384][41544] Updated weights for policy 0, policy_version 76640 (0.0005) +[2023-03-11 16:20:53,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9649.9). Total num frames: 39276544. Throughput: 0: 9464.9. Samples: 39272456. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 16:20:53,396][41256] Avg episode reward: [(0, '72.974')] +[2023-03-11 16:20:53,737][41544] Updated weights for policy 0, policy_version 76720 (0.0005) +[2023-03-11 16:20:58,097][41544] Updated weights for policy 0, policy_version 76800 (0.0005) +[2023-03-11 16:20:58,385][41256] Fps is (10 sec: 9420.7, 60 sec: 9420.8, 300 sec: 9636.0). Total num frames: 39321600. Throughput: 0: 9458.9. Samples: 39300460. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 16:20:58,396][41256] Avg episode reward: [(0, '77.645')] +[2023-03-11 16:21:02,520][41544] Updated weights for policy 0, policy_version 76880 (0.0005) +[2023-03-11 16:21:03,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9636.0). Total num frames: 39370752. Throughput: 0: 9471.3. Samples: 39356248. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 16:21:03,396][41256] Avg episode reward: [(0, '79.396')] +[2023-03-11 16:21:03,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000076896_39370752.pth... +[2023-03-11 16:21:03,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000076336_39084032.pth +[2023-03-11 16:21:06,819][41544] Updated weights for policy 0, policy_version 76960 (0.0005) +[2023-03-11 16:21:08,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9622.1). Total num frames: 39415808. Throughput: 0: 9504.3. Samples: 39413484. 
Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 16:21:08,396][41256] Avg episode reward: [(0, '78.833')] +[2023-03-11 16:21:11,073][41544] Updated weights for policy 0, policy_version 77040 (0.0006) +[2023-03-11 16:21:13,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9622.1). Total num frames: 39464960. Throughput: 0: 9506.2. Samples: 39442308. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 16:21:13,397][41256] Avg episode reward: [(0, '74.245')] +[2023-03-11 16:21:15,522][41544] Updated weights for policy 0, policy_version 77120 (0.0006) +[2023-03-11 16:21:18,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9608.2). Total num frames: 39510016. Throughput: 0: 9446.6. Samples: 39497736. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:21:18,396][41256] Avg episode reward: [(0, '79.318')] +[2023-03-11 16:21:18,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000077168_39510016.pth... +[2023-03-11 16:21:18,403][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000076616_39227392.pth +[2023-03-11 16:21:19,787][41544] Updated weights for policy 0, policy_version 77200 (0.0004) +[2023-03-11 16:21:23,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9489.1, 300 sec: 9608.2). Total num frames: 39559168. Throughput: 0: 9434.0. Samples: 39554892. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:21:23,396][41256] Avg episode reward: [(0, '76.355')] +[2023-03-11 16:21:24,223][41544] Updated weights for policy 0, policy_version 77280 (0.0005) +[2023-03-11 16:21:28,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9580.5). Total num frames: 39604224. Throughput: 0: 9424.0. Samples: 39582864. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:21:28,396][41256] Avg episode reward: [(0, '75.785')] +[2023-03-11 16:21:28,611][41544] Updated weights for policy 0, policy_version 77360 (0.0005) +[2023-03-11 16:21:33,002][41544] Updated weights for policy 0, policy_version 77440 (0.0005) +[2023-03-11 16:21:33,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9420.8, 300 sec: 9566.6). Total num frames: 39649280. Throughput: 0: 9391.2. Samples: 39638672. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:21:33,396][41256] Avg episode reward: [(0, '76.587')] +[2023-03-11 16:21:33,449][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000077448_39653376.pth... +[2023-03-11 16:21:33,450][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000076896_39370752.pth +[2023-03-11 16:21:37,304][41544] Updated weights for policy 0, policy_version 77520 (0.0005) +[2023-03-11 16:21:38,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9552.7). Total num frames: 39698432. Throughput: 0: 9403.7. Samples: 39695620. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:21:38,386][41256] Avg episode reward: [(0, '77.411')] +[2023-03-11 16:21:41,584][41544] Updated weights for policy 0, policy_version 77600 (0.0005) +[2023-03-11 16:21:43,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9420.8, 300 sec: 9552.7). Total num frames: 39747584. Throughput: 0: 9419.0. Samples: 39724316. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:21:43,386][41256] Avg episode reward: [(0, '77.904')] +[2023-03-11 16:21:45,937][41544] Updated weights for policy 0, policy_version 77680 (0.0005) +[2023-03-11 16:21:48,385][41256] Fps is (10 sec: 9420.7, 60 sec: 9420.8, 300 sec: 9538.8). Total num frames: 39792640. Throughput: 0: 9426.0. Samples: 39780416. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:21:48,386][41256] Avg episode reward: [(0, '77.504')] +[2023-03-11 16:21:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000077720_39792640.pth... +[2023-03-11 16:21:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000077168_39510016.pth +[2023-03-11 16:21:50,415][41544] Updated weights for policy 0, policy_version 77760 (0.0004) +[2023-03-11 16:21:53,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9511.1). Total num frames: 39837696. Throughput: 0: 9382.0. Samples: 39835672. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:21:53,386][41256] Avg episode reward: [(0, '75.769')] +[2023-03-11 16:21:54,863][41544] Updated weights for policy 0, policy_version 77840 (0.0005) +[2023-03-11 16:21:58,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9511.1). Total num frames: 39886848. Throughput: 0: 9346.5. Samples: 39862900. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:21:58,386][41256] Avg episode reward: [(0, '78.528')] +[2023-03-11 16:21:59,215][41544] Updated weights for policy 0, policy_version 77920 (0.0005) +[2023-03-11 16:22:03,385][41256] Fps is (10 sec: 9420.7, 60 sec: 9352.5, 300 sec: 9511.1). Total num frames: 39931904. Throughput: 0: 9375.1. Samples: 39919616. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:22:03,386][41256] Avg episode reward: [(0, '75.000')] +[2023-03-11 16:22:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000077992_39931904.pth... 
+[2023-03-11 16:22:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000077448_39653376.pth +[2023-03-11 16:22:03,628][41544] Updated weights for policy 0, policy_version 78000 (0.0005) +[2023-03-11 16:22:08,131][41544] Updated weights for policy 0, policy_version 78080 (0.0005) +[2023-03-11 16:22:08,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9497.2). Total num frames: 39976960. Throughput: 0: 9321.8. Samples: 39974372. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 16:22:08,386][41256] Avg episode reward: [(0, '76.286')] +[2023-03-11 16:22:12,649][41544] Updated weights for policy 0, policy_version 78160 (0.0005) +[2023-03-11 16:22:13,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9483.3). Total num frames: 40022016. Throughput: 0: 9305.2. Samples: 40001600. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 16:22:13,386][41256] Avg episode reward: [(0, '82.606')] +[2023-03-11 16:22:17,120][41544] Updated weights for policy 0, policy_version 78240 (0.0005) +[2023-03-11 16:22:18,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9483.3). Total num frames: 40067072. Throughput: 0: 9280.4. Samples: 40056288. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 16:22:18,386][41256] Avg episode reward: [(0, '80.963')] +[2023-03-11 16:22:18,448][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000078264_40071168.pth... +[2023-03-11 16:22:18,450][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000077720_39792640.pth +[2023-03-11 16:22:21,491][41544] Updated weights for policy 0, policy_version 78320 (0.0005) +[2023-03-11 16:22:23,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9483.3). Total num frames: 40116224. Throughput: 0: 9255.9. Samples: 40112136. 
Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 16:22:23,386][41256] Avg episode reward: [(0, '83.865')] +[2023-03-11 16:22:25,937][41544] Updated weights for policy 0, policy_version 78400 (0.0005) +[2023-03-11 16:22:28,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9469.4). Total num frames: 40161280. Throughput: 0: 9245.1. Samples: 40140348. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 16:22:28,386][41256] Avg episode reward: [(0, '81.814')] +[2023-03-11 16:22:30,248][41544] Updated weights for policy 0, policy_version 78480 (0.0005) +[2023-03-11 16:22:33,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9483.3). Total num frames: 40210432. Throughput: 0: 9273.3. Samples: 40197716. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 16:22:33,386][41256] Avg episode reward: [(0, '81.029')] +[2023-03-11 16:22:33,391][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000078536_40210432.pth... +[2023-03-11 16:22:33,393][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000077992_39931904.pth +[2023-03-11 16:22:34,396][41544] Updated weights for policy 0, policy_version 78560 (0.0004) +[2023-03-11 16:22:38,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9352.5, 300 sec: 9483.3). Total num frames: 40259584. Throughput: 0: 9340.6. Samples: 40256000. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 16:22:38,386][41256] Avg episode reward: [(0, '81.057')] +[2023-03-11 16:22:38,591][41544] Updated weights for policy 0, policy_version 78640 (0.0005) +[2023-03-11 16:22:42,764][41544] Updated weights for policy 0, policy_version 78720 (0.0005) +[2023-03-11 16:22:43,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9352.5, 300 sec: 9483.3). Total num frames: 40308736. Throughput: 0: 9409.3. Samples: 40286320. 
Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 16:22:43,386][41256] Avg episode reward: [(0, '84.551')] +[2023-03-11 16:22:47,248][41544] Updated weights for policy 0, policy_version 78800 (0.0005) +[2023-03-11 16:22:48,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9469.4). Total num frames: 40353792. Throughput: 0: 9376.9. Samples: 40341576. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 16:22:48,386][41256] Avg episode reward: [(0, '80.834')] +[2023-03-11 16:22:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000078816_40353792.pth... +[2023-03-11 16:22:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000078264_40071168.pth +[2023-03-11 16:22:51,605][41544] Updated weights for policy 0, policy_version 78880 (0.0005) +[2023-03-11 16:22:53,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9455.5). Total num frames: 40398848. Throughput: 0: 9414.0. Samples: 40398004. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:22:53,386][41256] Avg episode reward: [(0, '81.955')] +[2023-03-11 16:22:56,084][41544] Updated weights for policy 0, policy_version 78960 (0.0005) +[2023-03-11 16:22:58,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9455.5). Total num frames: 40448000. Throughput: 0: 9415.8. Samples: 40425312. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:22:58,386][41256] Avg episode reward: [(0, '87.787')] +[2023-03-11 16:23:00,464][41544] Updated weights for policy 0, policy_version 79040 (0.0005) +[2023-03-11 16:23:03,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9455.5). Total num frames: 40493056. Throughput: 0: 9434.5. Samples: 40480840. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:23:03,386][41256] Avg episode reward: [(0, '85.050')] +[2023-03-11 16:23:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000079088_40493056.pth... +[2023-03-11 16:23:03,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000078536_40210432.pth +[2023-03-11 16:23:04,798][41544] Updated weights for policy 0, policy_version 79120 (0.0005) +[2023-03-11 16:23:08,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9455.5). Total num frames: 40542208. Throughput: 0: 9466.1. Samples: 40538112. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:23:08,386][41256] Avg episode reward: [(0, '86.756')] +[2023-03-11 16:23:09,231][41544] Updated weights for policy 0, policy_version 79200 (0.0005) +[2023-03-11 16:23:13,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9441.6). Total num frames: 40587264. Throughput: 0: 9445.3. Samples: 40565388. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:23:13,386][41256] Avg episode reward: [(0, '90.871')] +[2023-03-11 16:23:13,748][41544] Updated weights for policy 0, policy_version 79280 (0.0005) +[2023-03-11 16:23:18,276][41544] Updated weights for policy 0, policy_version 79360 (0.0005) +[2023-03-11 16:23:18,386][41256] Fps is (10 sec: 9011.2, 60 sec: 9420.8, 300 sec: 9427.7). Total num frames: 40632320. Throughput: 0: 9376.6. Samples: 40619664. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:23:18,386][41256] Avg episode reward: [(0, '90.592')] +[2023-03-11 16:23:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000079360_40632320.pth... 
+[2023-03-11 16:23:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000078816_40353792.pth +[2023-03-11 16:23:22,693][41544] Updated weights for policy 0, policy_version 79440 (0.0005) +[2023-03-11 16:23:23,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9413.9). Total num frames: 40677376. Throughput: 0: 9301.1. Samples: 40674548. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:23:23,386][41256] Avg episode reward: [(0, '85.429')] +[2023-03-11 16:23:26,990][41544] Updated weights for policy 0, policy_version 79520 (0.0005) +[2023-03-11 16:23:28,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9420.8, 300 sec: 9413.9). Total num frames: 40726528. Throughput: 0: 9262.0. Samples: 40703108. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:23:28,386][41256] Avg episode reward: [(0, '89.871')] +[2023-03-11 16:23:31,559][41544] Updated weights for policy 0, policy_version 79600 (0.0005) +[2023-03-11 16:23:33,386][41256] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9386.1). Total num frames: 40767488. Throughput: 0: 9250.8. Samples: 40757860. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:23:33,386][41256] Avg episode reward: [(0, '84.045')] +[2023-03-11 16:23:33,396][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000079632_40771584.pth... +[2023-03-11 16:23:33,397][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000079088_40493056.pth +[2023-03-11 16:23:36,026][41544] Updated weights for policy 0, policy_version 79680 (0.0005) +[2023-03-11 16:23:38,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9386.1). Total num frames: 40816640. Throughput: 0: 9230.1. Samples: 40813360. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:23:38,386][41256] Avg episode reward: [(0, '87.505')] +[2023-03-11 16:23:40,291][41544] Updated weights for policy 0, policy_version 79760 (0.0005) +[2023-03-11 16:23:43,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9284.3, 300 sec: 9386.1). Total num frames: 40865792. Throughput: 0: 9266.1. Samples: 40842284. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:23:43,386][41256] Avg episode reward: [(0, '82.220')] +[2023-03-11 16:23:44,609][41544] Updated weights for policy 0, policy_version 79840 (0.0005) +[2023-03-11 16:23:48,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9372.2). Total num frames: 40910848. Throughput: 0: 9292.3. Samples: 40898992. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:23:48,386][41256] Avg episode reward: [(0, '84.762')] +[2023-03-11 16:23:48,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000079904_40910848.pth... +[2023-03-11 16:23:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000079360_40632320.pth +[2023-03-11 16:23:48,932][41544] Updated weights for policy 0, policy_version 79920 (0.0005) +[2023-03-11 16:23:53,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9358.3). Total num frames: 40955904. Throughput: 0: 9242.4. Samples: 40954020. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:23:53,386][41256] Avg episode reward: [(0, '85.419')] +[2023-03-11 16:23:53,512][41544] Updated weights for policy 0, policy_version 80000 (0.0005) +[2023-03-11 16:23:58,084][41544] Updated weights for policy 0, policy_version 80080 (0.0005) +[2023-03-11 16:23:58,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9216.0, 300 sec: 9358.3). Total num frames: 41000960. Throughput: 0: 9230.3. Samples: 40980752. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:23:58,386][41256] Avg episode reward: [(0, '84.089')] +[2023-03-11 16:24:02,598][41544] Updated weights for policy 0, policy_version 80160 (0.0005) +[2023-03-11 16:24:03,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9216.0, 300 sec: 9358.3). Total num frames: 41046016. Throughput: 0: 9234.4. Samples: 41035212. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:24:03,386][41256] Avg episode reward: [(0, '82.137')] +[2023-03-11 16:24:03,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000080168_41046016.pth... +[2023-03-11 16:24:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000079632_40771584.pth +[2023-03-11 16:24:07,069][41544] Updated weights for policy 0, policy_version 80240 (0.0005) +[2023-03-11 16:24:08,386][41256] Fps is (10 sec: 9011.1, 60 sec: 9147.7, 300 sec: 9344.4). Total num frames: 41091072. Throughput: 0: 9232.5. Samples: 41090012. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:24:08,386][41256] Avg episode reward: [(0, '73.898')] +[2023-03-11 16:24:11,580][41544] Updated weights for policy 0, policy_version 80320 (0.0005) +[2023-03-11 16:24:13,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9358.3). Total num frames: 41140224. Throughput: 0: 9202.5. Samples: 41117220. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:24:13,386][41256] Avg episode reward: [(0, '77.564')] +[2023-03-11 16:24:15,810][41544] Updated weights for policy 0, policy_version 80400 (0.0005) +[2023-03-11 16:24:18,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9284.3, 300 sec: 9358.3). Total num frames: 41189376. Throughput: 0: 9260.6. Samples: 41174588. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:24:18,386][41256] Avg episode reward: [(0, '78.566')] +[2023-03-11 16:24:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000080448_41189376.pth... +[2023-03-11 16:24:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000079904_40910848.pth +[2023-03-11 16:24:20,080][41544] Updated weights for policy 0, policy_version 80480 (0.0005) +[2023-03-11 16:24:23,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9358.3). Total num frames: 41234432. Throughput: 0: 9295.8. Samples: 41231672. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:24:23,386][41256] Avg episode reward: [(0, '80.093')] +[2023-03-11 16:24:24,467][41544] Updated weights for policy 0, policy_version 80560 (0.0005) +[2023-03-11 16:24:28,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9216.0, 300 sec: 9344.4). Total num frames: 41279488. Throughput: 0: 9261.9. Samples: 41259072. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:24:28,386][41256] Avg episode reward: [(0, '83.101')] +[2023-03-11 16:24:28,960][41544] Updated weights for policy 0, policy_version 80640 (0.0005) +[2023-03-11 16:24:33,219][41544] Updated weights for policy 0, policy_version 80720 (0.0005) +[2023-03-11 16:24:33,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9358.3). Total num frames: 41328640. Throughput: 0: 9261.5. Samples: 41315760. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:24:33,386][41256] Avg episode reward: [(0, '87.465')] +[2023-03-11 16:24:33,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000080720_41328640.pth... 
+[2023-03-11 16:24:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000080168_41046016.pth +[2023-03-11 16:24:37,783][41544] Updated weights for policy 0, policy_version 80800 (0.0003) +[2023-03-11 16:24:38,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9358.3). Total num frames: 41373696. Throughput: 0: 9240.2. Samples: 41369828. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:24:38,386][41256] Avg episode reward: [(0, '90.187')] +[2023-03-11 16:24:42,396][41544] Updated weights for policy 0, policy_version 80880 (0.0004) +[2023-03-11 16:24:43,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9216.0, 300 sec: 9344.4). Total num frames: 41418752. Throughput: 0: 9249.5. Samples: 41396980. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:24:43,386][41256] Avg episode reward: [(0, '86.304')] +[2023-03-11 16:24:46,972][41544] Updated weights for policy 0, policy_version 80960 (0.0005) +[2023-03-11 16:24:48,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9216.0, 300 sec: 9330.6). Total num frames: 41463808. Throughput: 0: 9238.1. Samples: 41450924. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:24:48,386][41256] Avg episode reward: [(0, '83.815')] +[2023-03-11 16:24:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000080984_41463808.pth... +[2023-03-11 16:24:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000080448_41189376.pth +[2023-03-11 16:24:51,544][41544] Updated weights for policy 0, policy_version 81040 (0.0005) +[2023-03-11 16:24:53,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9216.0, 300 sec: 9330.6). Total num frames: 41508864. Throughput: 0: 9208.6. Samples: 41504400. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:24:53,386][41256] Avg episode reward: [(0, '80.864')] +[2023-03-11 16:24:56,148][41544] Updated weights for policy 0, policy_version 81120 (0.0005) +[2023-03-11 16:24:58,386][41256] Fps is (10 sec: 8601.5, 60 sec: 9147.7, 300 sec: 9316.7). Total num frames: 41549824. Throughput: 0: 9190.7. Samples: 41530804. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:24:58,386][41256] Avg episode reward: [(0, '86.649')] +[2023-03-11 16:25:00,748][41544] Updated weights for policy 0, policy_version 81200 (0.0005) +[2023-03-11 16:25:03,385][41256] Fps is (10 sec: 8601.5, 60 sec: 9147.7, 300 sec: 9302.8). Total num frames: 41594880. Throughput: 0: 9096.3. Samples: 41583920. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:25:03,386][41256] Avg episode reward: [(0, '85.467')] +[2023-03-11 16:25:03,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000081240_41594880.pth... +[2023-03-11 16:25:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000080720_41328640.pth +[2023-03-11 16:25:05,396][41544] Updated weights for policy 0, policy_version 81280 (0.0005) +[2023-03-11 16:25:08,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9147.7, 300 sec: 9302.8). Total num frames: 41639936. Throughput: 0: 9007.5. Samples: 41637008. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:25:08,386][41256] Avg episode reward: [(0, '84.765')] +[2023-03-11 16:25:10,020][41544] Updated weights for policy 0, policy_version 81360 (0.0005) +[2023-03-11 16:25:13,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9079.5, 300 sec: 9302.8). Total num frames: 41684992. Throughput: 0: 8999.6. Samples: 41664056. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:25:13,386][41256] Avg episode reward: [(0, '86.649')] +[2023-03-11 16:25:14,645][41544] Updated weights for policy 0, policy_version 81440 (0.0005) +[2023-03-11 16:25:18,386][41256] Fps is (10 sec: 9011.1, 60 sec: 9011.2, 300 sec: 9288.9). Total num frames: 41730048. Throughput: 0: 8918.2. Samples: 41717080. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:25:18,396][41256] Avg episode reward: [(0, '88.131')] +[2023-03-11 16:25:18,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000081504_41730048.pth... +[2023-03-11 16:25:18,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000080984_41463808.pth +[2023-03-11 16:25:19,099][41544] Updated weights for policy 0, policy_version 81520 (0.0005) +[2023-03-11 16:25:23,366][41544] Updated weights for policy 0, policy_version 81600 (0.0004) +[2023-03-11 16:25:23,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9079.5, 300 sec: 9288.9). Total num frames: 41779200. Throughput: 0: 8987.2. Samples: 41774252. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:25:23,396][41256] Avg episode reward: [(0, '88.126')] +[2023-03-11 16:25:27,706][41544] Updated weights for policy 0, policy_version 81680 (0.0005) +[2023-03-11 16:25:28,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9079.5, 300 sec: 9288.9). Total num frames: 41824256. Throughput: 0: 9008.3. Samples: 41802356. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:25:28,396][41256] Avg episode reward: [(0, '86.307')] +[2023-03-11 16:25:32,004][41544] Updated weights for policy 0, policy_version 81760 (0.0005) +[2023-03-11 16:25:33,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9079.5, 300 sec: 9288.9). Total num frames: 41873408. Throughput: 0: 9076.8. Samples: 41859380. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:25:33,396][41256] Avg episode reward: [(0, '90.797')] +[2023-03-11 16:25:33,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000081784_41873408.pth... +[2023-03-11 16:25:33,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000081240_41594880.pth +[2023-03-11 16:25:36,331][41544] Updated weights for policy 0, policy_version 81840 (0.0004) +[2023-03-11 16:25:38,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9079.5, 300 sec: 9275.0). Total num frames: 41918464. Throughput: 0: 9138.0. Samples: 41915612. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:25:38,396][41256] Avg episode reward: [(0, '89.676')] +[2023-03-11 16:25:40,779][41544] Updated weights for policy 0, policy_version 81920 (0.0005) +[2023-03-11 16:25:43,386][41256] Fps is (10 sec: 9011.2, 60 sec: 9079.5, 300 sec: 9275.0). Total num frames: 41963520. Throughput: 0: 9167.5. Samples: 41943340. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:25:43,386][41256] Avg episode reward: [(0, '85.925')] +[2023-03-11 16:25:45,256][41544] Updated weights for policy 0, policy_version 82000 (0.0005) +[2023-03-11 16:25:48,386][41256] Fps is (10 sec: 9011.2, 60 sec: 9079.5, 300 sec: 9261.1). Total num frames: 42008576. Throughput: 0: 9209.5. Samples: 41998348. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:25:48,386][41256] Avg episode reward: [(0, '88.215')] +[2023-03-11 16:25:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000082048_42008576.pth... 
+[2023-03-11 16:25:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000081504_41730048.pth +[2023-03-11 16:25:49,856][41544] Updated weights for policy 0, policy_version 82080 (0.0005) +[2023-03-11 16:25:53,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9079.5, 300 sec: 9261.1). Total num frames: 42053632. Throughput: 0: 9210.5. Samples: 42051480. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:25:53,396][41256] Avg episode reward: [(0, '87.350')] +[2023-03-11 16:25:54,420][41544] Updated weights for policy 0, policy_version 82160 (0.0005) +[2023-03-11 16:25:58,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9147.8, 300 sec: 9247.2). Total num frames: 42098688. Throughput: 0: 9226.1. Samples: 42079232. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:25:58,396][41256] Avg episode reward: [(0, '86.136')] +[2023-03-11 16:25:58,963][41544] Updated weights for policy 0, policy_version 82240 (0.0005) +[2023-03-11 16:26:03,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9147.7, 300 sec: 9247.2). Total num frames: 42143744. Throughput: 0: 9223.8. Samples: 42132152. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:26:03,396][41256] Avg episode reward: [(0, '85.199')] +[2023-03-11 16:26:03,399][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000082312_42143744.pth... +[2023-03-11 16:26:03,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000081784_41873408.pth +[2023-03-11 16:26:03,579][41544] Updated weights for policy 0, policy_version 82320 (0.0005) +[2023-03-11 16:26:08,051][41544] Updated weights for policy 0, policy_version 82400 (0.0005) +[2023-03-11 16:26:08,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9147.7, 300 sec: 9233.4). Total num frames: 42188800. Throughput: 0: 9176.9. Samples: 42187212. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:26:08,396][41256] Avg episode reward: [(0, '77.782')] +[2023-03-11 16:26:12,583][41544] Updated weights for policy 0, policy_version 82480 (0.0005) +[2023-03-11 16:26:13,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9147.7, 300 sec: 9233.4). Total num frames: 42233856. Throughput: 0: 9140.5. Samples: 42213680. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 16:26:13,386][41256] Avg episode reward: [(0, '71.412')] +[2023-03-11 16:26:17,137][41544] Updated weights for policy 0, policy_version 82560 (0.0005) +[2023-03-11 16:26:18,386][41256] Fps is (10 sec: 9011.1, 60 sec: 9147.7, 300 sec: 9219.5). Total num frames: 42278912. Throughput: 0: 9077.7. Samples: 42267876. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 16:26:18,396][41256] Avg episode reward: [(0, '78.457')] +[2023-03-11 16:26:18,399][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000082576_42278912.pth... +[2023-03-11 16:26:18,401][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000082048_42008576.pth +[2023-03-11 16:26:21,451][41544] Updated weights for policy 0, policy_version 82640 (0.0004) +[2023-03-11 16:26:23,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9147.7, 300 sec: 9233.4). Total num frames: 42328064. Throughput: 0: 9108.2. Samples: 42325480. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 16:26:23,386][41256] Avg episode reward: [(0, '76.067')] +[2023-03-11 16:26:25,621][41544] Updated weights for policy 0, policy_version 82720 (0.0004) +[2023-03-11 16:26:28,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9216.0, 300 sec: 9247.2). Total num frames: 42377216. Throughput: 0: 9143.2. Samples: 42354784. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 16:26:28,386][41256] Avg episode reward: [(0, '77.486')] +[2023-03-11 16:26:29,974][41544] Updated weights for policy 0, policy_version 82800 (0.0005) +[2023-03-11 16:26:33,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9147.7, 300 sec: 9233.4). Total num frames: 42422272. Throughput: 0: 9147.6. Samples: 42409992. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 16:26:33,386][41256] Avg episode reward: [(0, '80.795')] +[2023-03-11 16:26:33,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000082856_42422272.pth... +[2023-03-11 16:26:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000082312_42143744.pth +[2023-03-11 16:26:34,584][41544] Updated weights for policy 0, policy_version 82880 (0.0005) +[2023-03-11 16:26:38,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9147.7, 300 sec: 9219.5). Total num frames: 42467328. Throughput: 0: 9151.6. Samples: 42463304. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 16:26:38,386][41256] Avg episode reward: [(0, '82.403')] +[2023-03-11 16:26:39,216][41544] Updated weights for policy 0, policy_version 82960 (0.0005) +[2023-03-11 16:26:43,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9147.7, 300 sec: 9219.5). Total num frames: 42512384. Throughput: 0: 9134.8. Samples: 42490296. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 16:26:43,386][41256] Avg episode reward: [(0, '83.925')] +[2023-03-11 16:26:43,746][41544] Updated weights for policy 0, policy_version 83040 (0.0005) +[2023-03-11 16:26:48,113][41544] Updated weights for policy 0, policy_version 83120 (0.0005) +[2023-03-11 16:26:48,386][41256] Fps is (10 sec: 9011.1, 60 sec: 9147.7, 300 sec: 9219.5). Total num frames: 42557440. Throughput: 0: 9175.0. Samples: 42545028. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 16:26:48,386][41256] Avg episode reward: [(0, '81.246')] +[2023-03-11 16:26:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000083120_42557440.pth... +[2023-03-11 16:26:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000082576_42278912.pth +[2023-03-11 16:26:52,380][41544] Updated weights for policy 0, policy_version 83200 (0.0004) +[2023-03-11 16:26:53,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9219.5). Total num frames: 42606592. Throughput: 0: 9230.1. Samples: 42602568. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 16:26:53,386][41256] Avg episode reward: [(0, '86.002')] +[2023-03-11 16:26:56,648][41544] Updated weights for policy 0, policy_version 83280 (0.0005) +[2023-03-11 16:26:58,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9284.3, 300 sec: 9233.4). Total num frames: 42655744. Throughput: 0: 9280.5. Samples: 42631304. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 16:26:58,386][41256] Avg episode reward: [(0, '77.557')] +[2023-03-11 16:27:00,755][41544] Updated weights for policy 0, policy_version 83360 (0.0004) +[2023-03-11 16:27:03,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9352.5, 300 sec: 9247.2). Total num frames: 42704896. Throughput: 0: 9410.8. Samples: 42691364. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 16:27:03,386][41256] Avg episode reward: [(0, '75.620')] +[2023-03-11 16:27:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000083408_42704896.pth... 
+[2023-03-11 16:27:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000082856_42422272.pth +[2023-03-11 16:27:05,119][41544] Updated weights for policy 0, policy_version 83440 (0.0005) +[2023-03-11 16:27:08,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9247.2). Total num frames: 42749952. Throughput: 0: 9348.3. Samples: 42746152. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 16:27:08,386][41256] Avg episode reward: [(0, '75.951')] +[2023-03-11 16:27:09,565][41544] Updated weights for policy 0, policy_version 83520 (0.0005) +[2023-03-11 16:27:13,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9247.2). Total num frames: 42795008. Throughput: 0: 9328.0. Samples: 42774544. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 16:27:13,386][41256] Avg episode reward: [(0, '73.807')] +[2023-03-11 16:27:14,005][41544] Updated weights for policy 0, policy_version 83600 (0.0006) +[2023-03-11 16:27:18,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9233.4). Total num frames: 42840064. Throughput: 0: 9321.5. Samples: 42829460. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 16:27:18,386][41256] Avg episode reward: [(0, '75.673')] +[2023-03-11 16:27:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000083672_42840064.pth... +[2023-03-11 16:27:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000083120_42557440.pth +[2023-03-11 16:27:18,456][41544] Updated weights for policy 0, policy_version 83680 (0.0006) +[2023-03-11 16:27:22,836][41544] Updated weights for policy 0, policy_version 83760 (0.0005) +[2023-03-11 16:27:23,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9247.2). Total num frames: 42889216. Throughput: 0: 9375.1. Samples: 42885184. 
Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 16:27:23,386][41256] Avg episode reward: [(0, '79.629')] +[2023-03-11 16:27:27,247][41544] Updated weights for policy 0, policy_version 83840 (0.0005) +[2023-03-11 16:27:28,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9233.4). Total num frames: 42934272. Throughput: 0: 9399.8. Samples: 42913288. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 16:27:28,386][41256] Avg episode reward: [(0, '84.907')] +[2023-03-11 16:27:31,734][41544] Updated weights for policy 0, policy_version 83920 (0.0005) +[2023-03-11 16:27:33,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9219.5). Total num frames: 42979328. Throughput: 0: 9395.8. Samples: 42967840. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 16:27:33,386][41256] Avg episode reward: [(0, '81.357')] +[2023-03-11 16:27:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000083944_42979328.pth... +[2023-03-11 16:27:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000083408_42704896.pth +[2023-03-11 16:27:36,198][41544] Updated weights for policy 0, policy_version 84000 (0.0006) +[2023-03-11 16:27:38,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9284.3, 300 sec: 9205.6). Total num frames: 43024384. Throughput: 0: 9358.2. Samples: 43023688. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 16:27:38,386][41256] Avg episode reward: [(0, '76.966')] +[2023-03-11 16:27:40,684][41544] Updated weights for policy 0, policy_version 84080 (0.0005) +[2023-03-11 16:27:43,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9219.5). Total num frames: 43073536. Throughput: 0: 9313.2. Samples: 43050400. 
Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 16:27:43,386][41256] Avg episode reward: [(0, '70.491')] +[2023-03-11 16:27:45,063][41544] Updated weights for policy 0, policy_version 84160 (0.0006) +[2023-03-11 16:27:48,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9352.5, 300 sec: 9219.5). Total num frames: 43118592. Throughput: 0: 9222.3. Samples: 43106368. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 16:27:48,386][41256] Avg episode reward: [(0, '69.304')] +[2023-03-11 16:27:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000084216_43118592.pth... +[2023-03-11 16:27:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000083672_42840064.pth +[2023-03-11 16:27:49,491][41544] Updated weights for policy 0, policy_version 84240 (0.0006) +[2023-03-11 16:27:53,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9219.5). Total num frames: 43167744. Throughput: 0: 9259.6. Samples: 43162832. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:27:53,386][41256] Avg episode reward: [(0, '74.289')] +[2023-03-11 16:27:53,819][41544] Updated weights for policy 0, policy_version 84320 (0.0005) +[2023-03-11 16:27:58,177][41544] Updated weights for policy 0, policy_version 84400 (0.0005) +[2023-03-11 16:27:58,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9219.5). Total num frames: 43212800. Throughput: 0: 9263.0. Samples: 43191380. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:27:58,386][41256] Avg episode reward: [(0, '75.666')] +[2023-03-11 16:28:02,636][41544] Updated weights for policy 0, policy_version 84480 (0.0006) +[2023-03-11 16:28:03,385][41256] Fps is (10 sec: 9011.1, 60 sec: 9216.0, 300 sec: 9205.6). Total num frames: 43257856. Throughput: 0: 9263.4. Samples: 43246312. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:28:03,386][41256] Avg episode reward: [(0, '77.716')] +[2023-03-11 16:28:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000084488_43257856.pth... +[2023-03-11 16:28:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000083944_42979328.pth +[2023-03-11 16:28:06,725][41544] Updated weights for policy 0, policy_version 84560 (0.0005) +[2023-03-11 16:28:08,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9284.3, 300 sec: 9219.5). Total num frames: 43307008. Throughput: 0: 9348.5. Samples: 43305864. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:28:08,386][41256] Avg episode reward: [(0, '77.116')] +[2023-03-11 16:28:10,914][41544] Updated weights for policy 0, policy_version 84640 (0.0005) +[2023-03-11 16:28:13,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9420.8, 300 sec: 9247.2). Total num frames: 43360256. Throughput: 0: 9379.3. Samples: 43335356. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:28:13,386][41256] Avg episode reward: [(0, '73.983')] +[2023-03-11 16:28:15,023][41544] Updated weights for policy 0, policy_version 84720 (0.0005) +[2023-03-11 16:28:18,385][41256] Fps is (10 sec: 10239.9, 60 sec: 9489.1, 300 sec: 9261.1). Total num frames: 43409408. Throughput: 0: 9485.4. Samples: 43394684. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:28:18,386][41256] Avg episode reward: [(0, '76.390')] +[2023-03-11 16:28:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000084784_43409408.pth... 
+[2023-03-11 16:28:18,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000084216_43118592.pth +[2023-03-11 16:28:19,132][41544] Updated weights for policy 0, policy_version 84800 (0.0004) +[2023-03-11 16:28:23,275][41544] Updated weights for policy 0, policy_version 84880 (0.0004) +[2023-03-11 16:28:23,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9489.1, 300 sec: 9261.1). Total num frames: 43458560. Throughput: 0: 9572.6. Samples: 43454456. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:28:23,386][41256] Avg episode reward: [(0, '77.039')] +[2023-03-11 16:28:27,571][41544] Updated weights for policy 0, policy_version 84960 (0.0006) +[2023-03-11 16:28:28,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9275.0). Total num frames: 43503616. Throughput: 0: 9617.8. Samples: 43483200. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:28:28,386][41256] Avg episode reward: [(0, '78.592')] +[2023-03-11 16:28:31,915][41544] Updated weights for policy 0, policy_version 85040 (0.0006) +[2023-03-11 16:28:33,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9275.0). Total num frames: 43552768. Throughput: 0: 9643.4. Samples: 43540320. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:28:33,386][41256] Avg episode reward: [(0, '79.352')] +[2023-03-11 16:28:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000085064_43552768.pth... +[2023-03-11 16:28:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000084488_43257856.pth +[2023-03-11 16:28:36,125][41544] Updated weights for policy 0, policy_version 85120 (0.0005) +[2023-03-11 16:28:38,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9275.0). Total num frames: 43601920. Throughput: 0: 9661.7. Samples: 43597608. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:28:38,386][41256] Avg episode reward: [(0, '83.102')] +[2023-03-11 16:28:40,491][41544] Updated weights for policy 0, policy_version 85200 (0.0005) +[2023-03-11 16:28:43,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9275.0). Total num frames: 43646976. Throughput: 0: 9658.7. Samples: 43626020. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:28:43,386][41256] Avg episode reward: [(0, '79.557')] +[2023-03-11 16:28:44,839][41544] Updated weights for policy 0, policy_version 85280 (0.0005) +[2023-03-11 16:28:48,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9288.9). Total num frames: 43696128. Throughput: 0: 9694.5. Samples: 43682564. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 16:28:48,386][41256] Avg episode reward: [(0, '79.628')] +[2023-03-11 16:28:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000085344_43696128.pth... +[2023-03-11 16:28:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000084784_43409408.pth +[2023-03-11 16:28:49,219][41544] Updated weights for policy 0, policy_version 85360 (0.0005) +[2023-03-11 16:28:53,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9288.9). Total num frames: 43741184. Throughput: 0: 9618.3. Samples: 43738688. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 16:28:53,386][41256] Avg episode reward: [(0, '80.068')] +[2023-03-11 16:28:53,527][41544] Updated weights for policy 0, policy_version 85440 (0.0005) +[2023-03-11 16:28:57,896][41544] Updated weights for policy 0, policy_version 85520 (0.0005) +[2023-03-11 16:28:58,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9302.8). Total num frames: 43790336. Throughput: 0: 9589.2. Samples: 43766868. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 16:28:58,386][41256] Avg episode reward: [(0, '79.657')] +[2023-03-11 16:29:02,334][41544] Updated weights for policy 0, policy_version 85600 (0.0005) +[2023-03-11 16:29:03,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9302.8). Total num frames: 43835392. Throughput: 0: 9520.4. Samples: 43823104. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 16:29:03,386][41256] Avg episode reward: [(0, '79.238')] +[2023-03-11 16:29:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000085616_43835392.pth... +[2023-03-11 16:29:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000085064_43552768.pth +[2023-03-11 16:29:06,741][41544] Updated weights for policy 0, policy_version 85680 (0.0005) +[2023-03-11 16:29:08,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9557.3, 300 sec: 9288.9). Total num frames: 43880448. Throughput: 0: 9422.0. Samples: 43878448. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 16:29:08,386][41256] Avg episode reward: [(0, '79.016')] +[2023-03-11 16:29:11,023][41544] Updated weights for policy 0, policy_version 85760 (0.0005) +[2023-03-11 16:29:13,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9489.1, 300 sec: 9288.9). Total num frames: 43929600. Throughput: 0: 9422.6. Samples: 43907216. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 16:29:13,386][41256] Avg episode reward: [(0, '80.729')] +[2023-03-11 16:29:15,138][41544] Updated weights for policy 0, policy_version 85840 (0.0005) +[2023-03-11 16:29:18,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9557.3, 300 sec: 9316.7). Total num frames: 43982848. Throughput: 0: 9478.7. Samples: 43966860. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 16:29:18,386][41256] Avg episode reward: [(0, '78.353')] +[2023-03-11 16:29:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000085904_43982848.pth... +[2023-03-11 16:29:18,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000085344_43696128.pth +[2023-03-11 16:29:19,166][41544] Updated weights for policy 0, policy_version 85920 (0.0004) +[2023-03-11 16:29:23,285][41544] Updated weights for policy 0, policy_version 86000 (0.0005) +[2023-03-11 16:29:23,385][41256] Fps is (10 sec: 10239.9, 60 sec: 9557.3, 300 sec: 9330.5). Total num frames: 44032000. Throughput: 0: 9562.3. Samples: 44027912. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 16:29:23,386][41256] Avg episode reward: [(0, '83.586')] +[2023-03-11 16:29:27,656][41544] Updated weights for policy 0, policy_version 86080 (0.0004) +[2023-03-11 16:29:28,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9316.7). Total num frames: 44077056. Throughput: 0: 9563.4. Samples: 44056372. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 16:29:28,386][41256] Avg episode reward: [(0, '75.401')] +[2023-03-11 16:29:32,127][41544] Updated weights for policy 0, policy_version 86160 (0.0005) +[2023-03-11 16:29:33,386][41256] Fps is (10 sec: 9011.1, 60 sec: 9489.1, 300 sec: 9316.7). Total num frames: 44122112. Throughput: 0: 9529.6. Samples: 44111396. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 16:29:33,386][41256] Avg episode reward: [(0, '78.152')] +[2023-03-11 16:29:33,409][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000086184_44126208.pth... 
+[2023-03-11 16:29:33,411][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000085616_43835392.pth +[2023-03-11 16:29:36,294][41544] Updated weights for policy 0, policy_version 86240 (0.0004) +[2023-03-11 16:29:38,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9344.4). Total num frames: 44175360. Throughput: 0: 9593.0. Samples: 44170372. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 16:29:38,386][41256] Avg episode reward: [(0, '80.937')] +[2023-03-11 16:29:40,453][41544] Updated weights for policy 0, policy_version 86320 (0.0004) +[2023-03-11 16:29:43,385][41256] Fps is (10 sec: 10240.1, 60 sec: 9625.6, 300 sec: 9358.3). Total num frames: 44224512. Throughput: 0: 9622.1. Samples: 44199864. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 16:29:43,386][41256] Avg episode reward: [(0, '80.365')] +[2023-03-11 16:29:44,578][41544] Updated weights for policy 0, policy_version 86400 (0.0004) +[2023-03-11 16:29:48,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9372.2). Total num frames: 44273664. Throughput: 0: 9692.2. Samples: 44259252. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 16:29:48,386][41256] Avg episode reward: [(0, '78.748')] +[2023-03-11 16:29:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000086472_44273664.pth... +[2023-03-11 16:29:48,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000085904_43982848.pth +[2023-03-11 16:29:48,707][41544] Updated weights for policy 0, policy_version 86480 (0.0004) +[2023-03-11 16:29:52,829][41544] Updated weights for policy 0, policy_version 86560 (0.0004) +[2023-03-11 16:29:53,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9400.0). Total num frames: 44322816. Throughput: 0: 9785.3. Samples: 44318784. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 16:29:53,386][41256] Avg episode reward: [(0, '78.917')] +[2023-03-11 16:29:57,133][41544] Updated weights for policy 0, policy_version 86640 (0.0005) +[2023-03-11 16:29:58,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9400.0). Total num frames: 44367872. Throughput: 0: 9785.9. Samples: 44347580. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 16:29:58,386][41256] Avg episode reward: [(0, '80.914')] +[2023-03-11 16:30:01,676][41544] Updated weights for policy 0, policy_version 86720 (0.0005) +[2023-03-11 16:30:03,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9625.6, 300 sec: 9400.0). Total num frames: 44412928. Throughput: 0: 9672.5. Samples: 44402120. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 16:30:03,386][41256] Avg episode reward: [(0, '82.451')] +[2023-03-11 16:30:03,388][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000086744_44412928.pth... +[2023-03-11 16:30:03,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000086184_44126208.pth +[2023-03-11 16:30:06,207][41544] Updated weights for policy 0, policy_version 86800 (0.0004) +[2023-03-11 16:30:08,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9625.6, 300 sec: 9400.0). Total num frames: 44457984. Throughput: 0: 9530.1. Samples: 44456768. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 16:30:08,386][41256] Avg episode reward: [(0, '85.626')] +[2023-03-11 16:30:10,489][41544] Updated weights for policy 0, policy_version 86880 (0.0005) +[2023-03-11 16:30:13,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9413.9). Total num frames: 44507136. Throughput: 0: 9549.2. Samples: 44486084. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 16:30:13,386][41256] Avg episode reward: [(0, '85.998')] +[2023-03-11 16:30:14,726][41544] Updated weights for policy 0, policy_version 86960 (0.0005) +[2023-03-11 16:30:18,385][41256] Fps is (10 sec: 9830.3, 60 sec: 9557.3, 300 sec: 9413.9). Total num frames: 44556288. Throughput: 0: 9608.7. Samples: 44543788. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 16:30:18,386][41256] Avg episode reward: [(0, '80.599')] +[2023-03-11 16:30:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000087024_44556288.pth... +[2023-03-11 16:30:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000086472_44273664.pth +[2023-03-11 16:30:19,180][41544] Updated weights for policy 0, policy_version 87040 (0.0005) +[2023-03-11 16:30:23,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9413.9). Total num frames: 44601344. Throughput: 0: 9496.3. Samples: 44597704. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 16:30:23,386][41256] Avg episode reward: [(0, '84.513')] +[2023-03-11 16:30:23,652][41544] Updated weights for policy 0, policy_version 87120 (0.0005) +[2023-03-11 16:30:28,125][41544] Updated weights for policy 0, policy_version 87200 (0.0005) +[2023-03-11 16:30:28,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9489.1, 300 sec: 9400.0). Total num frames: 44646400. Throughput: 0: 9467.9. Samples: 44625920. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 16:30:28,386][41256] Avg episode reward: [(0, '86.915')] +[2023-03-11 16:30:32,462][41544] Updated weights for policy 0, policy_version 87280 (0.0005) +[2023-03-11 16:30:33,385][41256] Fps is (10 sec: 9420.7, 60 sec: 9557.3, 300 sec: 9413.9). Total num frames: 44695552. Throughput: 0: 9373.8. Samples: 44681072. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:30:33,386][41256] Avg episode reward: [(0, '84.443')] +[2023-03-11 16:30:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000087296_44695552.pth... +[2023-03-11 16:30:33,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000086744_44412928.pth +[2023-03-11 16:30:36,638][41544] Updated weights for policy 0, policy_version 87360 (0.0005) +[2023-03-11 16:30:38,385][41256] Fps is (10 sec: 9830.3, 60 sec: 9489.1, 300 sec: 9427.7). Total num frames: 44744704. Throughput: 0: 9356.4. Samples: 44739824. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:30:38,386][41256] Avg episode reward: [(0, '87.116')] +[2023-03-11 16:30:41,064][41544] Updated weights for policy 0, policy_version 87440 (0.0005) +[2023-03-11 16:30:43,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9427.7). Total num frames: 44789760. Throughput: 0: 9332.0. Samples: 44767520. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:30:43,386][41256] Avg episode reward: [(0, '84.081')] +[2023-03-11 16:30:45,607][41544] Updated weights for policy 0, policy_version 87520 (0.0005) +[2023-03-11 16:30:48,386][41256] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9427.7). Total num frames: 44834816. Throughput: 0: 9318.4. Samples: 44821448. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:30:48,386][41256] Avg episode reward: [(0, '80.433')] +[2023-03-11 16:30:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000087568_44834816.pth... 
+[2023-03-11 16:30:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000087024_44556288.pth +[2023-03-11 16:30:49,980][41544] Updated weights for policy 0, policy_version 87600 (0.0003) +[2023-03-11 16:30:53,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9427.7). Total num frames: 44879872. Throughput: 0: 9366.5. Samples: 44878260. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:30:53,386][41256] Avg episode reward: [(0, '81.156')] +[2023-03-11 16:30:54,237][41544] Updated weights for policy 0, policy_version 87680 (0.0003) +[2023-03-11 16:30:58,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9441.6). Total num frames: 44929024. Throughput: 0: 9370.7. Samples: 44907768. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:30:58,391][41256] Avg episode reward: [(0, '87.345')] +[2023-03-11 16:30:58,591][41544] Updated weights for policy 0, policy_version 87760 (0.0003) +[2023-03-11 16:31:03,060][41544] Updated weights for policy 0, policy_version 87840 (0.0004) +[2023-03-11 16:31:03,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9352.5, 300 sec: 9441.6). Total num frames: 44974080. Throughput: 0: 9319.1. Samples: 44963148. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:31:03,386][41256] Avg episode reward: [(0, '84.873')] +[2023-03-11 16:31:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000087840_44974080.pth... +[2023-03-11 16:31:03,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000087296_44695552.pth +[2023-03-11 16:31:07,369][41544] Updated weights for policy 0, policy_version 87920 (0.0003) +[2023-03-11 16:31:08,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9455.5). Total num frames: 45023232. Throughput: 0: 9371.6. Samples: 45019424. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:31:08,396][41256] Avg episode reward: [(0, '73.054')] +[2023-03-11 16:31:11,745][41544] Updated weights for policy 0, policy_version 88000 (0.0004) +[2023-03-11 16:31:13,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9352.5, 300 sec: 9455.5). Total num frames: 45068288. Throughput: 0: 9375.3. Samples: 45047808. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:31:13,386][41256] Avg episode reward: [(0, '70.146')] +[2023-03-11 16:31:16,164][41544] Updated weights for policy 0, policy_version 88080 (0.0005) +[2023-03-11 16:31:18,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9352.5, 300 sec: 9455.5). Total num frames: 45117440. Throughput: 0: 9386.7. Samples: 45103472. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:31:18,386][41256] Avg episode reward: [(0, '68.574')] +[2023-03-11 16:31:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000088120_45117440.pth... +[2023-03-11 16:31:18,393][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000087568_44834816.pth +[2023-03-11 16:31:20,422][41544] Updated weights for policy 0, policy_version 88160 (0.0005) +[2023-03-11 16:31:23,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9441.6). Total num frames: 45162496. Throughput: 0: 9338.1. Samples: 45160036. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:31:23,386][41256] Avg episode reward: [(0, '69.185')] +[2023-03-11 16:31:24,866][41544] Updated weights for policy 0, policy_version 88240 (0.0005) +[2023-03-11 16:31:28,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9455.5). Total num frames: 45211648. Throughput: 0: 9345.4. Samples: 45188064. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:31:28,386][41256] Avg episode reward: [(0, '78.310')] +[2023-03-11 16:31:29,148][41544] Updated weights for policy 0, policy_version 88320 (0.0003) +[2023-03-11 16:31:33,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9455.5). Total num frames: 45256704. Throughput: 0: 9400.9. Samples: 45244488. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:31:33,386][41256] Avg episode reward: [(0, '76.408')] +[2023-03-11 16:31:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000088392_45256704.pth... +[2023-03-11 16:31:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000087840_44974080.pth +[2023-03-11 16:31:33,506][41544] Updated weights for policy 0, policy_version 88400 (0.0004) +[2023-03-11 16:31:37,829][41544] Updated weights for policy 0, policy_version 88480 (0.0003) +[2023-03-11 16:31:38,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9469.4). Total num frames: 45305856. Throughput: 0: 9412.5. Samples: 45301824. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:31:38,386][41256] Avg episode reward: [(0, '74.533')] +[2023-03-11 16:31:42,230][41544] Updated weights for policy 0, policy_version 88560 (0.0004) +[2023-03-11 16:31:43,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9469.4). Total num frames: 45350912. Throughput: 0: 9390.3. Samples: 45330332. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:31:43,386][41256] Avg episode reward: [(0, '72.394')] +[2023-03-11 16:31:46,621][41544] Updated weights for policy 0, policy_version 88640 (0.0004) +[2023-03-11 16:31:48,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9469.4). Total num frames: 45400064. Throughput: 0: 9392.2. Samples: 45385796. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:31:48,386][41256] Avg episode reward: [(0, '74.627')] +[2023-03-11 16:31:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000088672_45400064.pth... +[2023-03-11 16:31:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000088120_45117440.pth +[2023-03-11 16:31:51,029][41544] Updated weights for policy 0, policy_version 88720 (0.0005) +[2023-03-11 16:31:53,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9455.5). Total num frames: 45445120. Throughput: 0: 9370.5. Samples: 45441096. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:31:53,386][41256] Avg episode reward: [(0, '72.656')] +[2023-03-11 16:31:55,464][41544] Updated weights for policy 0, policy_version 88800 (0.0004) +[2023-03-11 16:31:58,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9455.5). Total num frames: 45494272. Throughput: 0: 9369.0. Samples: 45469412. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:31:58,386][41256] Avg episode reward: [(0, '70.835')] +[2023-03-11 16:31:59,648][41544] Updated weights for policy 0, policy_version 88880 (0.0003) +[2023-03-11 16:32:03,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9455.5). Total num frames: 45539328. Throughput: 0: 9414.2. Samples: 45527112. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:32:03,386][41256] Avg episode reward: [(0, '76.406')] +[2023-03-11 16:32:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000088944_45539328.pth... 
+[2023-03-11 16:32:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000088392_45256704.pth +[2023-03-11 16:32:04,075][41544] Updated weights for policy 0, policy_version 88960 (0.0005) +[2023-03-11 16:32:08,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9455.5). Total num frames: 45584384. Throughput: 0: 9387.3. Samples: 45582464. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:32:08,386][41256] Avg episode reward: [(0, '76.645')] +[2023-03-11 16:32:08,490][41544] Updated weights for policy 0, policy_version 89040 (0.0006) +[2023-03-11 16:32:12,715][41544] Updated weights for policy 0, policy_version 89120 (0.0005) +[2023-03-11 16:32:13,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9469.4). Total num frames: 45633536. Throughput: 0: 9397.2. Samples: 45610940. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:32:13,386][41256] Avg episode reward: [(0, '76.867')] +[2023-03-11 16:32:17,146][41544] Updated weights for policy 0, policy_version 89200 (0.0005) +[2023-03-11 16:32:18,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9455.5). Total num frames: 45678592. Throughput: 0: 9399.9. Samples: 45667484. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 16:32:18,386][41256] Avg episode reward: [(0, '78.866')] +[2023-03-11 16:32:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000089216_45678592.pth... +[2023-03-11 16:32:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000088672_45400064.pth +[2023-03-11 16:32:21,346][41544] Updated weights for policy 0, policy_version 89280 (0.0005) +[2023-03-11 16:32:23,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9469.4). Total num frames: 45727744. Throughput: 0: 9424.1. Samples: 45725908. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 16:32:23,386][41256] Avg episode reward: [(0, '79.802')] +[2023-03-11 16:32:25,645][41544] Updated weights for policy 0, policy_version 89360 (0.0005) +[2023-03-11 16:32:28,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9420.8, 300 sec: 9483.3). Total num frames: 45776896. Throughput: 0: 9419.6. Samples: 45754212. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 16:32:28,386][41256] Avg episode reward: [(0, '80.759')] +[2023-03-11 16:32:29,991][41544] Updated weights for policy 0, policy_version 89440 (0.0005) +[2023-03-11 16:32:33,386][41256] Fps is (10 sec: 9830.3, 60 sec: 9489.1, 300 sec: 9497.2). Total num frames: 45826048. Throughput: 0: 9436.4. Samples: 45810432. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 16:32:33,386][41256] Avg episode reward: [(0, '79.402')] +[2023-03-11 16:32:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000089504_45826048.pth... +[2023-03-11 16:32:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000088944_45539328.pth +[2023-03-11 16:32:34,196][41544] Updated weights for policy 0, policy_version 89520 (0.0005) +[2023-03-11 16:32:38,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9483.3). Total num frames: 45871104. Throughput: 0: 9479.6. Samples: 45867676. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 16:32:38,386][41256] Avg episode reward: [(0, '83.230')] +[2023-03-11 16:32:38,622][41544] Updated weights for policy 0, policy_version 89600 (0.0005) +[2023-03-11 16:32:43,145][41544] Updated weights for policy 0, policy_version 89680 (0.0005) +[2023-03-11 16:32:43,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9420.8, 300 sec: 9483.3). Total num frames: 45916160. Throughput: 0: 9468.2. Samples: 45895480. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 16:32:43,386][41256] Avg episode reward: [(0, '83.516')] +[2023-03-11 16:32:47,586][41544] Updated weights for policy 0, policy_version 89760 (0.0005) +[2023-03-11 16:32:48,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9469.4). Total num frames: 45961216. Throughput: 0: 9402.4. Samples: 45950220. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 16:32:48,386][41256] Avg episode reward: [(0, '84.600')] +[2023-03-11 16:32:48,388][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000089768_45961216.pth... +[2023-03-11 16:32:48,390][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000089216_45678592.pth +[2023-03-11 16:32:51,751][41544] Updated weights for policy 0, policy_version 89840 (0.0004) +[2023-03-11 16:32:53,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9483.3). Total num frames: 46010368. Throughput: 0: 9455.0. Samples: 46007940. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 16:32:53,386][41256] Avg episode reward: [(0, '84.427')] +[2023-03-11 16:32:56,241][41544] Updated weights for policy 0, policy_version 89920 (0.0004) +[2023-03-11 16:32:58,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9483.3). Total num frames: 46055424. Throughput: 0: 9429.0. Samples: 46035244. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 16:32:58,386][41256] Avg episode reward: [(0, '83.481')] +[2023-03-11 16:33:00,748][41544] Updated weights for policy 0, policy_version 90000 (0.0005) +[2023-03-11 16:33:03,386][41256] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9469.4). Total num frames: 46100480. Throughput: 0: 9397.3. Samples: 46090364. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 16:33:03,386][41256] Avg episode reward: [(0, '83.148')] +[2023-03-11 16:33:03,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000090048_46104576.pth... +[2023-03-11 16:33:03,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000089504_45826048.pth +[2023-03-11 16:33:05,175][41544] Updated weights for policy 0, policy_version 90080 (0.0004) +[2023-03-11 16:33:08,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9455.5). Total num frames: 46149632. Throughput: 0: 9326.7. Samples: 46145608. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 16:33:08,386][41256] Avg episode reward: [(0, '84.921')] +[2023-03-11 16:33:09,571][41544] Updated weights for policy 0, policy_version 90160 (0.0005) +[2023-03-11 16:33:13,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9420.8, 300 sec: 9455.5). Total num frames: 46198784. Throughput: 0: 9326.6. Samples: 46173908. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:33:13,386][41256] Avg episode reward: [(0, '83.412')] +[2023-03-11 16:33:13,807][41544] Updated weights for policy 0, policy_version 90240 (0.0005) +[2023-03-11 16:33:18,004][41544] Updated weights for policy 0, policy_version 90320 (0.0004) +[2023-03-11 16:33:18,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9441.6). Total num frames: 46243840. Throughput: 0: 9365.6. Samples: 46231884. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:33:18,386][41256] Avg episode reward: [(0, '80.384')] +[2023-03-11 16:33:18,412][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000090328_46247936.pth... 
+[2023-03-11 16:33:18,414][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000089768_45961216.pth +[2023-03-11 16:33:22,147][41544] Updated weights for policy 0, policy_version 90400 (0.0005) +[2023-03-11 16:33:23,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9489.1, 300 sec: 9469.4). Total num frames: 46297088. Throughput: 0: 9423.8. Samples: 46291748. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:33:23,386][41256] Avg episode reward: [(0, '81.833')] +[2023-03-11 16:33:26,455][41544] Updated weights for policy 0, policy_version 90480 (0.0005) +[2023-03-11 16:33:28,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9420.8, 300 sec: 9455.5). Total num frames: 46342144. Throughput: 0: 9439.6. Samples: 46320260. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:33:28,386][41256] Avg episode reward: [(0, '83.468')] +[2023-03-11 16:33:30,948][41544] Updated weights for policy 0, policy_version 90560 (0.0005) +[2023-03-11 16:33:33,386][41256] Fps is (10 sec: 9011.1, 60 sec: 9352.5, 300 sec: 9441.6). Total num frames: 46387200. Throughput: 0: 9439.0. Samples: 46374976. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:33:33,386][41256] Avg episode reward: [(0, '82.377')] +[2023-03-11 16:33:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000090600_46387200.pth... +[2023-03-11 16:33:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000090048_46104576.pth +[2023-03-11 16:33:35,379][41544] Updated weights for policy 0, policy_version 90640 (0.0005) +[2023-03-11 16:33:38,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9441.6). Total num frames: 46432256. Throughput: 0: 9379.1. Samples: 46430000. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:33:38,386][41256] Avg episode reward: [(0, '86.112')] +[2023-03-11 16:33:39,869][41544] Updated weights for policy 0, policy_version 90720 (0.0005) +[2023-03-11 16:33:43,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9352.5, 300 sec: 9427.7). Total num frames: 46477312. Throughput: 0: 9381.1. Samples: 46457392. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:33:43,386][41256] Avg episode reward: [(0, '83.813')] +[2023-03-11 16:33:44,311][41544] Updated weights for policy 0, policy_version 90800 (0.0005) +[2023-03-11 16:33:48,385][41256] Fps is (10 sec: 9420.7, 60 sec: 9420.8, 300 sec: 9441.6). Total num frames: 46526464. Throughput: 0: 9392.3. Samples: 46513016. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:33:48,386][41256] Avg episode reward: [(0, '84.170')] +[2023-03-11 16:33:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000090872_46526464.pth... +[2023-03-11 16:33:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000090328_46247936.pth +[2023-03-11 16:33:48,714][41544] Updated weights for policy 0, policy_version 90880 (0.0005) +[2023-03-11 16:33:53,191][41544] Updated weights for policy 0, policy_version 90960 (0.0005) +[2023-03-11 16:33:53,385][41256] Fps is (10 sec: 9420.7, 60 sec: 9352.5, 300 sec: 9427.7). Total num frames: 46571520. Throughput: 0: 9386.3. Samples: 46567992. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:33:53,386][41256] Avg episode reward: [(0, '79.144')] +[2023-03-11 16:33:57,517][41544] Updated weights for policy 0, policy_version 91040 (0.0005) +[2023-03-11 16:33:58,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9352.5, 300 sec: 9427.7). Total num frames: 46616576. Throughput: 0: 9390.8. Samples: 46596492. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:33:58,386][41256] Avg episode reward: [(0, '75.779')] +[2023-03-11 16:34:01,946][41544] Updated weights for policy 0, policy_version 91120 (0.0005) +[2023-03-11 16:34:03,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9441.6). Total num frames: 46665728. Throughput: 0: 9350.7. Samples: 46652664. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:34:03,386][41256] Avg episode reward: [(0, '78.370')] +[2023-03-11 16:34:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000091144_46665728.pth... +[2023-03-11 16:34:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000090600_46387200.pth +[2023-03-11 16:34:06,415][41544] Updated weights for policy 0, policy_version 91200 (0.0005) +[2023-03-11 16:34:08,385][41256] Fps is (10 sec: 9420.7, 60 sec: 9352.5, 300 sec: 9427.7). Total num frames: 46710784. Throughput: 0: 9242.7. Samples: 46707672. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:34:08,386][41256] Avg episode reward: [(0, '80.140')] +[2023-03-11 16:34:10,872][41544] Updated weights for policy 0, policy_version 91280 (0.0005) +[2023-03-11 16:34:13,386][41256] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9400.0). Total num frames: 46755840. Throughput: 0: 9224.6. Samples: 46735368. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:34:13,386][41256] Avg episode reward: [(0, '82.903')] +[2023-03-11 16:34:15,458][41544] Updated weights for policy 0, policy_version 91360 (0.0005) +[2023-03-11 16:34:18,386][41256] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9386.1). Total num frames: 46800896. Throughput: 0: 9193.2. Samples: 46788672. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:34:18,386][41256] Avg episode reward: [(0, '76.112')] +[2023-03-11 16:34:18,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000091408_46800896.pth... +[2023-03-11 16:34:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000090872_46526464.pth +[2023-03-11 16:34:19,909][41544] Updated weights for policy 0, policy_version 91440 (0.0005) +[2023-03-11 16:34:23,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9147.7, 300 sec: 9386.1). Total num frames: 46845952. Throughput: 0: 9216.9. Samples: 46844760. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:34:23,386][41256] Avg episode reward: [(0, '68.880')] +[2023-03-11 16:34:24,362][41544] Updated weights for policy 0, policy_version 91520 (0.0005) +[2023-03-11 16:34:28,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9147.7, 300 sec: 9386.1). Total num frames: 46891008. Throughput: 0: 9206.1. Samples: 46871668. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:34:28,386][41256] Avg episode reward: [(0, '67.895')] +[2023-03-11 16:34:28,843][41544] Updated weights for policy 0, policy_version 91600 (0.0005) +[2023-03-11 16:34:33,211][41544] Updated weights for policy 0, policy_version 91680 (0.0005) +[2023-03-11 16:34:33,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9372.2). Total num frames: 46940160. Throughput: 0: 9212.5. Samples: 46927580. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:34:33,386][41256] Avg episode reward: [(0, '72.808')] +[2023-03-11 16:34:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000091680_46940160.pth... 
+[2023-03-11 16:34:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000091144_46665728.pth +[2023-03-11 16:34:37,651][41544] Updated weights for policy 0, policy_version 91760 (0.0005) +[2023-03-11 16:34:38,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9358.3). Total num frames: 46985216. Throughput: 0: 9221.8. Samples: 46982972. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:34:38,386][41256] Avg episode reward: [(0, '77.566')] +[2023-03-11 16:34:42,063][41544] Updated weights for policy 0, policy_version 91840 (0.0005) +[2023-03-11 16:34:43,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9216.0, 300 sec: 9344.4). Total num frames: 47030272. Throughput: 0: 9201.4. Samples: 47010556. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:34:43,386][41256] Avg episode reward: [(0, '72.027')] +[2023-03-11 16:34:46,497][41544] Updated weights for policy 0, policy_version 91920 (0.0005) +[2023-03-11 16:34:48,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9344.4). Total num frames: 47079424. Throughput: 0: 9198.4. Samples: 47066592. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:34:48,386][41256] Avg episode reward: [(0, '74.098')] +[2023-03-11 16:34:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000091952_47079424.pth... +[2023-03-11 16:34:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000091408_46800896.pth +[2023-03-11 16:34:50,864][41544] Updated weights for policy 0, policy_version 92000 (0.0005) +[2023-03-11 16:34:53,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9344.4). Total num frames: 47124480. Throughput: 0: 9213.1. Samples: 47122260. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:34:53,386][41256] Avg episode reward: [(0, '78.490')] +[2023-03-11 16:34:55,075][41544] Updated weights for policy 0, policy_version 92080 (0.0004) +[2023-03-11 16:34:58,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9358.3). Total num frames: 47173632. Throughput: 0: 9262.6. Samples: 47152184. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:34:58,386][41256] Avg episode reward: [(0, '78.553')] +[2023-03-11 16:34:59,496][41544] Updated weights for policy 0, policy_version 92160 (0.0004) +[2023-03-11 16:35:03,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9284.3, 300 sec: 9372.2). Total num frames: 47222784. Throughput: 0: 9352.9. Samples: 47209552. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:35:03,386][41256] Avg episode reward: [(0, '80.184')] +[2023-03-11 16:35:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000092232_47222784.pth... +[2023-03-11 16:35:03,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000091680_46940160.pth +[2023-03-11 16:35:03,596][41544] Updated weights for policy 0, policy_version 92240 (0.0005) +[2023-03-11 16:35:07,913][41544] Updated weights for policy 0, policy_version 92320 (0.0005) +[2023-03-11 16:35:08,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9352.5, 300 sec: 9372.2). Total num frames: 47271936. Throughput: 0: 9399.7. Samples: 47267748. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:35:08,386][41256] Avg episode reward: [(0, '79.558')] +[2023-03-11 16:35:12,336][41544] Updated weights for policy 0, policy_version 92400 (0.0005) +[2023-03-11 16:35:13,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9352.5, 300 sec: 9358.3). Total num frames: 47316992. Throughput: 0: 9428.0. Samples: 47295928. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:35:13,386][41256] Avg episode reward: [(0, '81.430')] +[2023-03-11 16:35:16,858][41544] Updated weights for policy 0, policy_version 92480 (0.0005) +[2023-03-11 16:35:18,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9358.3). Total num frames: 47362048. Throughput: 0: 9382.0. Samples: 47349768. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:35:18,386][41256] Avg episode reward: [(0, '80.022')] +[2023-03-11 16:35:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000092504_47362048.pth... +[2023-03-11 16:35:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000091952_47079424.pth +[2023-03-11 16:35:21,325][41544] Updated weights for policy 0, policy_version 92560 (0.0005) +[2023-03-11 16:35:23,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9358.3). Total num frames: 47407104. Throughput: 0: 9372.4. Samples: 47404728. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:35:23,386][41256] Avg episode reward: [(0, '79.684')] +[2023-03-11 16:35:25,722][41544] Updated weights for policy 0, policy_version 92640 (0.0005) +[2023-03-11 16:35:28,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9352.5, 300 sec: 9344.4). Total num frames: 47452160. Throughput: 0: 9380.5. Samples: 47432680. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:35:28,386][41256] Avg episode reward: [(0, '77.504')] +[2023-03-11 16:35:30,198][41544] Updated weights for policy 0, policy_version 92720 (0.0005) +[2023-03-11 16:35:33,385][41256] Fps is (10 sec: 9420.7, 60 sec: 9352.5, 300 sec: 9344.4). Total num frames: 47501312. Throughput: 0: 9366.0. Samples: 47488064. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:35:33,386][41256] Avg episode reward: [(0, '74.068')] +[2023-03-11 16:35:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000092776_47501312.pth... +[2023-03-11 16:35:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000092232_47222784.pth +[2023-03-11 16:35:34,648][41544] Updated weights for policy 0, policy_version 92800 (0.0005) +[2023-03-11 16:35:38,385][41256] Fps is (10 sec: 9420.7, 60 sec: 9352.5, 300 sec: 9344.4). Total num frames: 47546368. Throughput: 0: 9369.0. Samples: 47543864. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:35:38,386][41256] Avg episode reward: [(0, '76.258')] +[2023-03-11 16:35:38,929][41544] Updated weights for policy 0, policy_version 92880 (0.0005) +[2023-03-11 16:35:43,160][41544] Updated weights for policy 0, policy_version 92960 (0.0005) +[2023-03-11 16:35:43,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9358.3). Total num frames: 47595520. Throughput: 0: 9342.0. Samples: 47572572. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:35:43,386][41256] Avg episode reward: [(0, '80.592')] +[2023-03-11 16:35:47,244][41544] Updated weights for policy 0, policy_version 93040 (0.0005) +[2023-03-11 16:35:48,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9420.8, 300 sec: 9372.2). Total num frames: 47644672. Throughput: 0: 9396.3. Samples: 47632384. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:35:48,386][41256] Avg episode reward: [(0, '81.977')] +[2023-03-11 16:35:48,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000093056_47644672.pth... 
+[2023-03-11 16:35:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000092504_47362048.pth +[2023-03-11 16:35:51,461][41544] Updated weights for policy 0, policy_version 93120 (0.0005) +[2023-03-11 16:35:53,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9489.1, 300 sec: 9372.2). Total num frames: 47693824. Throughput: 0: 9404.8. Samples: 47690964. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:35:53,396][41256] Avg episode reward: [(0, '79.873')] +[2023-03-11 16:35:55,654][41544] Updated weights for policy 0, policy_version 93200 (0.0004) +[2023-03-11 16:35:58,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9489.1, 300 sec: 9386.1). Total num frames: 47742976. Throughput: 0: 9424.3. Samples: 47720024. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 16:35:58,396][41256] Avg episode reward: [(0, '81.817')] +[2023-03-11 16:35:59,807][41544] Updated weights for policy 0, policy_version 93280 (0.0005) +[2023-03-11 16:36:03,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9489.1, 300 sec: 9386.1). Total num frames: 47792128. Throughput: 0: 9555.5. Samples: 47779764. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 16:36:03,386][41256] Avg episode reward: [(0, '84.728')] +[2023-03-11 16:36:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000093344_47792128.pth... +[2023-03-11 16:36:03,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000092776_47501312.pth +[2023-03-11 16:36:04,007][41544] Updated weights for policy 0, policy_version 93360 (0.0004) +[2023-03-11 16:36:08,286][41544] Updated weights for policy 0, policy_version 93440 (0.0005) +[2023-03-11 16:36:08,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9489.1, 300 sec: 9400.0). Total num frames: 47841280. Throughput: 0: 9609.0. Samples: 47837132. 
Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 16:36:08,386][41256] Avg episode reward: [(0, '82.018')] +[2023-03-11 16:36:12,521][41544] Updated weights for policy 0, policy_version 93520 (0.0004) +[2023-03-11 16:36:13,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9557.3, 300 sec: 9400.0). Total num frames: 47890432. Throughput: 0: 9627.5. Samples: 47865920. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 16:36:13,386][41256] Avg episode reward: [(0, '82.356')] +[2023-03-11 16:36:16,714][41544] Updated weights for policy 0, policy_version 93600 (0.0005) +[2023-03-11 16:36:18,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9557.3, 300 sec: 9400.0). Total num frames: 47935488. Throughput: 0: 9689.1. Samples: 47924072. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 16:36:18,386][41256] Avg episode reward: [(0, '83.451')] +[2023-03-11 16:36:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000093632_47939584.pth... +[2023-03-11 16:36:18,390][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000093056_47644672.pth +[2023-03-11 16:36:20,892][41544] Updated weights for policy 0, policy_version 93680 (0.0005) +[2023-03-11 16:36:23,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9400.0). Total num frames: 47984640. Throughput: 0: 9754.9. Samples: 47982836. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 16:36:23,386][41256] Avg episode reward: [(0, '84.390')] +[2023-03-11 16:36:25,150][41544] Updated weights for policy 0, policy_version 93760 (0.0005) +[2023-03-11 16:36:28,385][41256] Fps is (10 sec: 9830.3, 60 sec: 9693.9, 300 sec: 9413.9). Total num frames: 48033792. Throughput: 0: 9757.2. Samples: 48011648. 
Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 16:36:28,386][41256] Avg episode reward: [(0, '83.135')] +[2023-03-11 16:36:29,515][41544] Updated weights for policy 0, policy_version 93840 (0.0005) +[2023-03-11 16:36:33,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9625.6, 300 sec: 9400.0). Total num frames: 48078848. Throughput: 0: 9666.8. Samples: 48067388. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 16:36:33,386][41256] Avg episode reward: [(0, '77.742')] +[2023-03-11 16:36:33,392][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000093912_48082944.pth... +[2023-03-11 16:36:33,393][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000093344_47792128.pth +[2023-03-11 16:36:33,804][41544] Updated weights for policy 0, policy_version 93920 (0.0005) +[2023-03-11 16:36:38,042][41544] Updated weights for policy 0, policy_version 94000 (0.0005) +[2023-03-11 16:36:38,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9413.9). Total num frames: 48128000. Throughput: 0: 9667.9. Samples: 48126020. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 16:36:38,386][41256] Avg episode reward: [(0, '77.381')] +[2023-03-11 16:36:42,543][41544] Updated weights for policy 0, policy_version 94080 (0.0005) +[2023-03-11 16:36:43,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9400.0). Total num frames: 48173056. Throughput: 0: 9634.1. Samples: 48153560. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 16:36:43,386][41256] Avg episode reward: [(0, '81.263')] +[2023-03-11 16:36:47,072][41544] Updated weights for policy 0, policy_version 94160 (0.0005) +[2023-03-11 16:36:48,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9557.3, 300 sec: 9400.0). Total num frames: 48218112. Throughput: 0: 9514.0. Samples: 48207892. 
Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 16:36:48,386][41256] Avg episode reward: [(0, '82.442')] +[2023-03-11 16:36:48,432][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000094184_48222208.pth... +[2023-03-11 16:36:48,434][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000093632_47939584.pth +[2023-03-11 16:36:51,596][41544] Updated weights for policy 0, policy_version 94240 (0.0005) +[2023-03-11 16:36:53,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9489.1, 300 sec: 9386.1). Total num frames: 48263168. Throughput: 0: 9455.4. Samples: 48262624. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 16:36:53,386][41256] Avg episode reward: [(0, '76.232')] +[2023-03-11 16:36:56,128][41544] Updated weights for policy 0, policy_version 94320 (0.0005) +[2023-03-11 16:36:58,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9400.0). Total num frames: 48312320. Throughput: 0: 9408.0. Samples: 48289280. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:36:58,386][41256] Avg episode reward: [(0, '76.107')] +[2023-03-11 16:37:00,401][41544] Updated weights for policy 0, policy_version 94400 (0.0005) +[2023-03-11 16:37:03,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9400.0). Total num frames: 48357376. Throughput: 0: 9389.4. Samples: 48346596. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:37:03,386][41256] Avg episode reward: [(0, '77.259')] +[2023-03-11 16:37:03,414][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000094456_48361472.pth... 
+[2023-03-11 16:37:03,416][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000093912_48082944.pth +[2023-03-11 16:37:04,627][41544] Updated weights for policy 0, policy_version 94480 (0.0005) +[2023-03-11 16:37:08,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9489.1, 300 sec: 9413.9). Total num frames: 48410624. Throughput: 0: 9401.7. Samples: 48405912. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:37:08,386][41256] Avg episode reward: [(0, '75.166')] +[2023-03-11 16:37:08,740][41544] Updated weights for policy 0, policy_version 94560 (0.0005) +[2023-03-11 16:37:13,197][41544] Updated weights for policy 0, policy_version 94640 (0.0004) +[2023-03-11 16:37:13,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9420.8, 300 sec: 9413.9). Total num frames: 48455680. Throughput: 0: 9410.3. Samples: 48435112. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:37:13,386][41256] Avg episode reward: [(0, '80.854')] +[2023-03-11 16:37:17,810][41544] Updated weights for policy 0, policy_version 94720 (0.0006) +[2023-03-11 16:37:18,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9420.8, 300 sec: 9400.0). Total num frames: 48500736. Throughput: 0: 9357.1. Samples: 48488456. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:37:18,386][41256] Avg episode reward: [(0, '83.313')] +[2023-03-11 16:37:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000094728_48500736.pth... +[2023-03-11 16:37:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000094184_48222208.pth +[2023-03-11 16:37:22,002][41544] Updated weights for policy 0, policy_version 94800 (0.0005) +[2023-03-11 16:37:23,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9400.0). Total num frames: 48549888. Throughput: 0: 9328.5. Samples: 48545804. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:37:23,386][41256] Avg episode reward: [(0, '83.078')] +[2023-03-11 16:37:26,233][41544] Updated weights for policy 0, policy_version 94880 (0.0004) +[2023-03-11 16:37:28,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9420.8, 300 sec: 9400.0). Total num frames: 48599040. Throughput: 0: 9354.8. Samples: 48574528. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:37:28,386][41256] Avg episode reward: [(0, '78.948')] +[2023-03-11 16:37:30,461][41544] Updated weights for policy 0, policy_version 94960 (0.0005) +[2023-03-11 16:37:33,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9489.1, 300 sec: 9413.9). Total num frames: 48648192. Throughput: 0: 9445.1. Samples: 48632920. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:37:33,386][41256] Avg episode reward: [(0, '82.887')] +[2023-03-11 16:37:33,388][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000095016_48648192.pth... +[2023-03-11 16:37:33,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000094456_48361472.pth +[2023-03-11 16:37:34,648][41544] Updated weights for policy 0, policy_version 95040 (0.0005) +[2023-03-11 16:37:38,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9413.9). Total num frames: 48693248. Throughput: 0: 9535.4. Samples: 48691716. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:37:38,386][41256] Avg episode reward: [(0, '78.397')] +[2023-03-11 16:37:38,901][41544] Updated weights for policy 0, policy_version 95120 (0.0005) +[2023-03-11 16:37:43,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9420.8, 300 sec: 9413.9). Total num frames: 48738304. Throughput: 0: 9547.3. Samples: 48718908. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:37:43,386][41256] Avg episode reward: [(0, '76.584')] +[2023-03-11 16:37:43,488][41544] Updated weights for policy 0, policy_version 95200 (0.0005) +[2023-03-11 16:37:47,812][41544] Updated weights for policy 0, policy_version 95280 (0.0005) +[2023-03-11 16:37:48,385][41256] Fps is (10 sec: 9420.7, 60 sec: 9489.1, 300 sec: 9413.9). Total num frames: 48787456. Throughput: 0: 9504.3. Samples: 48774288. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:37:48,386][41256] Avg episode reward: [(0, '78.030')] +[2023-03-11 16:37:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000095288_48787456.pth... +[2023-03-11 16:37:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000094728_48500736.pth +[2023-03-11 16:37:52,119][41544] Updated weights for policy 0, policy_version 95360 (0.0005) +[2023-03-11 16:37:53,385][41256] Fps is (10 sec: 9420.7, 60 sec: 9489.1, 300 sec: 9413.9). Total num frames: 48832512. Throughput: 0: 9460.3. Samples: 48831628. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:37:53,386][41256] Avg episode reward: [(0, '77.447')] +[2023-03-11 16:37:56,375][41544] Updated weights for policy 0, policy_version 95440 (0.0005) +[2023-03-11 16:37:58,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9489.1, 300 sec: 9427.7). Total num frames: 48881664. Throughput: 0: 9452.1. Samples: 48860456. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:37:58,386][41256] Avg episode reward: [(0, '78.383')] +[2023-03-11 16:38:00,649][41544] Updated weights for policy 0, policy_version 95520 (0.0005) +[2023-03-11 16:38:03,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9427.7). Total num frames: 48930816. Throughput: 0: 9546.6. Samples: 48918052. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:38:03,386][41256] Avg episode reward: [(0, '84.931')] +[2023-03-11 16:38:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000095568_48930816.pth... +[2023-03-11 16:38:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000095016_48648192.pth +[2023-03-11 16:38:04,883][41544] Updated weights for policy 0, policy_version 95600 (0.0005) +[2023-03-11 16:38:08,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9489.1, 300 sec: 9427.7). Total num frames: 48979968. Throughput: 0: 9554.2. Samples: 48975744. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:38:08,386][41256] Avg episode reward: [(0, '76.676')] +[2023-03-11 16:38:09,123][41544] Updated weights for policy 0, policy_version 95680 (0.0005) +[2023-03-11 16:38:13,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9427.7). Total num frames: 49025024. Throughput: 0: 9557.3. Samples: 49004608. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:38:13,386][41256] Avg episode reward: [(0, '82.712')] +[2023-03-11 16:38:13,478][41544] Updated weights for policy 0, policy_version 95760 (0.0005) +[2023-03-11 16:38:17,968][41544] Updated weights for policy 0, policy_version 95840 (0.0005) +[2023-03-11 16:38:18,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9557.3, 300 sec: 9413.9). Total num frames: 49074176. Throughput: 0: 9479.6. Samples: 49059504. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:38:18,386][41256] Avg episode reward: [(0, '84.515')] +[2023-03-11 16:38:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000095848_49074176.pth... 
+[2023-03-11 16:38:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000095288_48787456.pth +[2023-03-11 16:38:22,120][41544] Updated weights for policy 0, policy_version 95920 (0.0005) +[2023-03-11 16:38:23,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9413.9). Total num frames: 49119232. Throughput: 0: 9468.7. Samples: 49117808. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:38:23,386][41256] Avg episode reward: [(0, '81.865')] +[2023-03-11 16:38:26,372][41544] Updated weights for policy 0, policy_version 96000 (0.0005) +[2023-03-11 16:38:28,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9427.7). Total num frames: 49168384. Throughput: 0: 9515.3. Samples: 49147096. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:38:28,386][41256] Avg episode reward: [(0, '83.149')] +[2023-03-11 16:38:30,651][41544] Updated weights for policy 0, policy_version 96080 (0.0005) +[2023-03-11 16:38:33,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9489.0, 300 sec: 9441.6). Total num frames: 49217536. Throughput: 0: 9567.4. Samples: 49204820. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:38:33,386][41256] Avg episode reward: [(0, '82.803')] +[2023-03-11 16:38:33,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000096128_49217536.pth... +[2023-03-11 16:38:33,393][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000095568_48930816.pth +[2023-03-11 16:38:34,785][41544] Updated weights for policy 0, policy_version 96160 (0.0005) +[2023-03-11 16:38:38,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9455.5). Total num frames: 49266688. Throughput: 0: 9629.7. Samples: 49264964. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:38:38,386][41256] Avg episode reward: [(0, '81.365')] +[2023-03-11 16:38:38,874][41544] Updated weights for policy 0, policy_version 96240 (0.0005) +[2023-03-11 16:38:43,040][41544] Updated weights for policy 0, policy_version 96320 (0.0004) +[2023-03-11 16:38:43,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9455.5). Total num frames: 49315840. Throughput: 0: 9646.0. Samples: 49294528. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:38:43,386][41256] Avg episode reward: [(0, '79.888')] +[2023-03-11 16:38:47,578][41544] Updated weights for policy 0, policy_version 96400 (0.0005) +[2023-03-11 16:38:48,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9455.5). Total num frames: 49360896. Throughput: 0: 9606.0. Samples: 49350320. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:38:48,386][41256] Avg episode reward: [(0, '82.128')] +[2023-03-11 16:38:48,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000096408_49360896.pth... +[2023-03-11 16:38:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000095848_49074176.pth +[2023-03-11 16:38:52,172][41544] Updated weights for policy 0, policy_version 96480 (0.0005) +[2023-03-11 16:38:53,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9557.3, 300 sec: 9455.5). Total num frames: 49405952. Throughput: 0: 9509.3. Samples: 49403664. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:38:53,386][41256] Avg episode reward: [(0, '85.624')] +[2023-03-11 16:38:56,769][41544] Updated weights for policy 0, policy_version 96560 (0.0005) +[2023-03-11 16:38:58,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9489.1, 300 sec: 9441.6). Total num frames: 49451008. Throughput: 0: 9465.1. Samples: 49430536. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:38:58,386][41256] Avg episode reward: [(0, '83.844')] +[2023-03-11 16:39:01,293][41544] Updated weights for policy 0, policy_version 96640 (0.0005) +[2023-03-11 16:39:03,386][41256] Fps is (10 sec: 9011.2, 60 sec: 9420.8, 300 sec: 9441.6). Total num frames: 49496064. Throughput: 0: 9439.5. Samples: 49484280. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:39:03,386][41256] Avg episode reward: [(0, '84.090')] +[2023-03-11 16:39:03,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000096672_49496064.pth... +[2023-03-11 16:39:03,393][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000096128_49217536.pth +[2023-03-11 16:39:05,947][41544] Updated weights for policy 0, policy_version 96720 (0.0005) +[2023-03-11 16:39:08,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9441.6). Total num frames: 49541120. Throughput: 0: 9317.5. Samples: 49537096. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:39:08,386][41256] Avg episode reward: [(0, '86.319')] +[2023-03-11 16:39:10,587][41544] Updated weights for policy 0, policy_version 96800 (0.0005) +[2023-03-11 16:39:13,385][41256] Fps is (10 sec: 8601.7, 60 sec: 9284.3, 300 sec: 9427.7). Total num frames: 49582080. Throughput: 0: 9263.4. Samples: 49563948. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:39:13,386][41256] Avg episode reward: [(0, '83.483')] +[2023-03-11 16:39:15,341][41544] Updated weights for policy 0, policy_version 96880 (0.0005) +[2023-03-11 16:39:18,386][41256] Fps is (10 sec: 8601.6, 60 sec: 9216.0, 300 sec: 9427.7). Total num frames: 49627136. Throughput: 0: 9130.1. Samples: 49615676. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:39:18,386][41256] Avg episode reward: [(0, '90.344')] +[2023-03-11 16:39:18,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000096928_49627136.pth... +[2023-03-11 16:39:18,393][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000096408_49360896.pth +[2023-03-11 16:39:19,994][41544] Updated weights for policy 0, policy_version 96960 (0.0006) +[2023-03-11 16:39:23,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9216.0, 300 sec: 9427.7). Total num frames: 49672192. Throughput: 0: 8962.9. Samples: 49668292. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:39:23,386][41256] Avg episode reward: [(0, '87.459')] +[2023-03-11 16:39:24,625][41544] Updated weights for policy 0, policy_version 97040 (0.0006) +[2023-03-11 16:39:28,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9147.7, 300 sec: 9413.9). Total num frames: 49717248. Throughput: 0: 8929.3. Samples: 49696348. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:39:28,386][41256] Avg episode reward: [(0, '78.453')] +[2023-03-11 16:39:29,180][41544] Updated weights for policy 0, policy_version 97120 (0.0005) +[2023-03-11 16:39:33,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9079.5, 300 sec: 9413.9). Total num frames: 49762304. Throughput: 0: 8882.1. Samples: 49750016. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:39:33,386][41256] Avg episode reward: [(0, '77.215')] +[2023-03-11 16:39:33,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000097192_49762304.pth... 
+[2023-03-11 16:39:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000096672_49496064.pth +[2023-03-11 16:39:33,735][41544] Updated weights for policy 0, policy_version 97200 (0.0005) +[2023-03-11 16:39:38,321][41544] Updated weights for policy 0, policy_version 97280 (0.0006) +[2023-03-11 16:39:38,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9011.2, 300 sec: 9413.9). Total num frames: 49807360. Throughput: 0: 8880.0. Samples: 49803264. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:39:38,386][41256] Avg episode reward: [(0, '76.390')] +[2023-03-11 16:39:42,756][41544] Updated weights for policy 0, policy_version 97360 (0.0003) +[2023-03-11 16:39:43,385][41256] Fps is (10 sec: 9011.2, 60 sec: 8942.9, 300 sec: 9400.0). Total num frames: 49852416. Throughput: 0: 8897.2. Samples: 49830912. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:39:43,386][41256] Avg episode reward: [(0, '77.681')] +[2023-03-11 16:39:47,292][41544] Updated weights for policy 0, policy_version 97440 (0.0005) +[2023-03-11 16:39:48,386][41256] Fps is (10 sec: 9011.1, 60 sec: 8942.9, 300 sec: 9400.0). Total num frames: 49897472. Throughput: 0: 8910.6. Samples: 49885256. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:39:48,386][41256] Avg episode reward: [(0, '81.781')] +[2023-03-11 16:39:48,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000097456_49897472.pth... +[2023-03-11 16:39:48,393][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000096928_49627136.pth +[2023-03-11 16:39:51,858][41544] Updated weights for policy 0, policy_version 97520 (0.0006) +[2023-03-11 16:39:53,385][41256] Fps is (10 sec: 9011.2, 60 sec: 8942.9, 300 sec: 9386.1). Total num frames: 49942528. Throughput: 0: 8932.0. Samples: 49939036. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:39:53,386][41256] Avg episode reward: [(0, '79.644')] +[2023-03-11 16:39:56,262][41544] Updated weights for policy 0, policy_version 97600 (0.0005) +[2023-03-11 16:39:58,385][41256] Fps is (10 sec: 9011.2, 60 sec: 8942.9, 300 sec: 9372.2). Total num frames: 49987584. Throughput: 0: 8960.4. Samples: 49967168. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:39:58,386][41256] Avg episode reward: [(0, '75.972')] +[2023-03-11 16:40:00,547][41544] Updated weights for policy 0, policy_version 97680 (0.0005) +[2023-03-11 16:40:03,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9011.2, 300 sec: 9372.2). Total num frames: 50036736. Throughput: 0: 9084.3. Samples: 50024468. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:40:03,386][41256] Avg episode reward: [(0, '78.773')] +[2023-03-11 16:40:03,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000097728_50036736.pth... +[2023-03-11 16:40:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000097192_49762304.pth +[2023-03-11 16:40:04,825][41544] Updated weights for policy 0, policy_version 97760 (0.0005) +[2023-03-11 16:40:08,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9079.5, 300 sec: 9386.1). Total num frames: 50085888. Throughput: 0: 9188.9. Samples: 50081792. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:40:08,386][41256] Avg episode reward: [(0, '78.771')] +[2023-03-11 16:40:09,204][41544] Updated weights for policy 0, policy_version 97840 (0.0005) +[2023-03-11 16:40:13,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9147.7, 300 sec: 9386.1). Total num frames: 50130944. Throughput: 0: 9186.1. Samples: 50109724. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:40:13,386][41256] Avg episode reward: [(0, '76.855')] +[2023-03-11 16:40:13,575][41544] Updated weights for policy 0, policy_version 97920 (0.0006) +[2023-03-11 16:40:17,916][41544] Updated weights for policy 0, policy_version 98000 (0.0005) +[2023-03-11 16:40:18,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9400.0). Total num frames: 50180096. Throughput: 0: 9236.6. Samples: 50165664. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:40:18,386][41256] Avg episode reward: [(0, '73.846')] +[2023-03-11 16:40:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000098008_50180096.pth... +[2023-03-11 16:40:18,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000097456_49897472.pth +[2023-03-11 16:40:22,160][41544] Updated weights for policy 0, policy_version 98080 (0.0005) +[2023-03-11 16:40:23,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9216.0, 300 sec: 9400.0). Total num frames: 50225152. Throughput: 0: 9344.9. Samples: 50223784. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:40:23,386][41256] Avg episode reward: [(0, '70.195')] +[2023-03-11 16:40:26,294][41544] Updated weights for policy 0, policy_version 98160 (0.0005) +[2023-03-11 16:40:28,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9400.0). Total num frames: 50274304. Throughput: 0: 9398.2. Samples: 50253832. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:40:28,386][41256] Avg episode reward: [(0, '76.138')] +[2023-03-11 16:40:30,663][41544] Updated weights for policy 0, policy_version 98240 (0.0005) +[2023-03-11 16:40:33,385][41256] Fps is (10 sec: 9830.3, 60 sec: 9352.5, 300 sec: 9413.9). Total num frames: 50323456. Throughput: 0: 9439.8. Samples: 50310048. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:40:33,386][41256] Avg episode reward: [(0, '75.308')] +[2023-03-11 16:40:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000098288_50323456.pth... +[2023-03-11 16:40:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000097728_50036736.pth +[2023-03-11 16:40:35,129][41544] Updated weights for policy 0, policy_version 98320 (0.0005) +[2023-03-11 16:40:38,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9352.5, 300 sec: 9400.0). Total num frames: 50368512. Throughput: 0: 9459.5. Samples: 50364712. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:40:38,386][41256] Avg episode reward: [(0, '74.513')] +[2023-03-11 16:40:39,509][41544] Updated weights for policy 0, policy_version 98400 (0.0006) +[2023-03-11 16:40:43,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9400.0). Total num frames: 50417664. Throughput: 0: 9464.9. Samples: 50393088. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:40:43,386][41256] Avg episode reward: [(0, '69.570')] +[2023-03-11 16:40:43,769][41544] Updated weights for policy 0, policy_version 98480 (0.0004) +[2023-03-11 16:40:48,152][41544] Updated weights for policy 0, policy_version 98560 (0.0004) +[2023-03-11 16:40:48,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9386.1). Total num frames: 50462720. Throughput: 0: 9466.1. Samples: 50450440. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:40:48,386][41256] Avg episode reward: [(0, '71.021')] +[2023-03-11 16:40:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000098560_50462720.pth... 
+[2023-03-11 16:40:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000098008_50180096.pth +[2023-03-11 16:40:52,608][41544] Updated weights for policy 0, policy_version 98640 (0.0006) +[2023-03-11 16:40:53,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9420.8, 300 sec: 9372.2). Total num frames: 50507776. Throughput: 0: 9426.7. Samples: 50505992. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:40:53,386][41256] Avg episode reward: [(0, '73.864')] +[2023-03-11 16:40:57,031][41544] Updated weights for policy 0, policy_version 98720 (0.0005) +[2023-03-11 16:40:58,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9420.8, 300 sec: 9358.3). Total num frames: 50552832. Throughput: 0: 9413.4. Samples: 50533328. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:40:58,386][41256] Avg episode reward: [(0, '80.119')] +[2023-03-11 16:41:01,324][41544] Updated weights for policy 0, policy_version 98800 (0.0004) +[2023-03-11 16:41:03,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9489.1, 300 sec: 9372.2). Total num frames: 50606080. Throughput: 0: 9448.1. Samples: 50590828. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:41:03,386][41256] Avg episode reward: [(0, '75.043')] +[2023-03-11 16:41:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000098840_50606080.pth... +[2023-03-11 16:41:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000098288_50323456.pth +[2023-03-11 16:41:05,330][41544] Updated weights for policy 0, policy_version 98880 (0.0005) +[2023-03-11 16:41:08,386][41256] Fps is (10 sec: 10239.9, 60 sec: 9489.1, 300 sec: 9372.2). Total num frames: 50655232. Throughput: 0: 9496.1. Samples: 50651112. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:41:08,386][41256] Avg episode reward: [(0, '71.882')] +[2023-03-11 16:41:09,609][41544] Updated weights for policy 0, policy_version 98960 (0.0005) +[2023-03-11 16:41:13,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9386.1). Total num frames: 50704384. Throughput: 0: 9466.5. Samples: 50679824. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:41:13,386][41256] Avg episode reward: [(0, '70.591')] +[2023-03-11 16:41:13,643][41544] Updated weights for policy 0, policy_version 99040 (0.0005) +[2023-03-11 16:41:17,720][41544] Updated weights for policy 0, policy_version 99120 (0.0005) +[2023-03-11 16:41:18,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9386.1). Total num frames: 50753536. Throughput: 0: 9574.3. Samples: 50740892. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:41:18,386][41256] Avg episode reward: [(0, '71.117')] +[2023-03-11 16:41:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000099128_50753536.pth... +[2023-03-11 16:41:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000098560_50462720.pth +[2023-03-11 16:41:21,957][41544] Updated weights for policy 0, policy_version 99200 (0.0006) +[2023-03-11 16:41:23,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9386.1). Total num frames: 50802688. Throughput: 0: 9659.7. Samples: 50799400. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:41:23,386][41256] Avg episode reward: [(0, '70.589')] +[2023-03-11 16:41:26,165][41544] Updated weights for policy 0, policy_version 99280 (0.0004) +[2023-03-11 16:41:28,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9625.6, 300 sec: 9400.0). Total num frames: 50851840. Throughput: 0: 9662.0. Samples: 50827880. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:41:28,386][41256] Avg episode reward: [(0, '74.221')] +[2023-03-11 16:41:30,312][41544] Updated weights for policy 0, policy_version 99360 (0.0003) +[2023-03-11 16:41:33,386][41256] Fps is (10 sec: 9830.3, 60 sec: 9625.6, 300 sec: 9400.0). Total num frames: 50900992. Throughput: 0: 9706.7. Samples: 50887240. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:41:33,386][41256] Avg episode reward: [(0, '76.331')] +[2023-03-11 16:41:33,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000099416_50900992.pth... +[2023-03-11 16:41:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000098840_50606080.pth +[2023-03-11 16:41:34,686][41544] Updated weights for policy 0, policy_version 99440 (0.0003) +[2023-03-11 16:41:38,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9400.0). Total num frames: 50946048. Throughput: 0: 9705.4. Samples: 50942736. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:41:38,386][41256] Avg episode reward: [(0, '75.408')] +[2023-03-11 16:41:39,048][41544] Updated weights for policy 0, policy_version 99520 (0.0003) +[2023-03-11 16:41:43,341][41544] Updated weights for policy 0, policy_version 99600 (0.0004) +[2023-03-11 16:41:43,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9625.6, 300 sec: 9413.9). Total num frames: 50995200. Throughput: 0: 9736.0. Samples: 50971448. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:41:43,386][41256] Avg episode reward: [(0, '77.651')] +[2023-03-11 16:41:47,706][41544] Updated weights for policy 0, policy_version 99680 (0.0003) +[2023-03-11 16:41:48,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9413.9). Total num frames: 51040256. Throughput: 0: 9715.7. Samples: 51028032. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:41:48,386][41256] Avg episode reward: [(0, '78.426')] +[2023-03-11 16:41:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000099688_51040256.pth... +[2023-03-11 16:41:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000099128_50753536.pth +[2023-03-11 16:41:51,954][41544] Updated weights for policy 0, policy_version 99760 (0.0003) +[2023-03-11 16:41:53,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9413.9). Total num frames: 51089408. Throughput: 0: 9649.4. Samples: 51085336. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:41:53,386][41256] Avg episode reward: [(0, '80.175')] +[2023-03-11 16:41:56,475][41544] Updated weights for policy 0, policy_version 99840 (0.0005) +[2023-03-11 16:41:58,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9413.9). Total num frames: 51134464. Throughput: 0: 9610.1. Samples: 51112276. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:41:58,386][41256] Avg episode reward: [(0, '84.231')] +[2023-03-11 16:42:00,727][41544] Updated weights for policy 0, policy_version 99920 (0.0005) +[2023-03-11 16:42:03,385][41256] Fps is (10 sec: 9420.7, 60 sec: 9625.6, 300 sec: 9400.0). Total num frames: 51183616. Throughput: 0: 9538.9. Samples: 51170144. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:42:03,386][41256] Avg episode reward: [(0, '81.226')] +[2023-03-11 16:42:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000099968_51183616.pth... 
+[2023-03-11 16:42:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000099416_50900992.pth +[2023-03-11 16:42:04,863][41544] Updated weights for policy 0, policy_version 100000 (0.0005) +[2023-03-11 16:42:08,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9413.9). Total num frames: 51232768. Throughput: 0: 9559.9. Samples: 51229596. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:42:08,386][41256] Avg episode reward: [(0, '80.110')] +[2023-03-11 16:42:08,956][41544] Updated weights for policy 0, policy_version 100080 (0.0005) +[2023-03-11 16:42:13,057][41544] Updated weights for policy 0, policy_version 100160 (0.0005) +[2023-03-11 16:42:13,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9427.7). Total num frames: 51281920. Throughput: 0: 9601.1. Samples: 51259928. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:42:13,386][41256] Avg episode reward: [(0, '78.923')] +[2023-03-11 16:42:17,278][41544] Updated weights for policy 0, policy_version 100240 (0.0005) +[2023-03-11 16:42:18,386][41256] Fps is (10 sec: 9830.3, 60 sec: 9625.6, 300 sec: 9427.7). Total num frames: 51331072. Throughput: 0: 9588.8. Samples: 51318736. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:42:18,386][41256] Avg episode reward: [(0, '84.051')] +[2023-03-11 16:42:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000100256_51331072.pth... +[2023-03-11 16:42:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000099688_51040256.pth +[2023-03-11 16:42:21,595][41544] Updated weights for policy 0, policy_version 100320 (0.0005) +[2023-03-11 16:42:23,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9625.6, 300 sec: 9427.7). Total num frames: 51380224. Throughput: 0: 9623.0. Samples: 51375768. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:42:23,386][41256] Avg episode reward: [(0, '87.723')] +[2023-03-11 16:42:25,985][41544] Updated weights for policy 0, policy_version 100400 (0.0006) +[2023-03-11 16:42:28,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9413.9). Total num frames: 51425280. Throughput: 0: 9603.2. Samples: 51403592. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:42:28,386][41256] Avg episode reward: [(0, '86.932')] +[2023-03-11 16:42:30,244][41544] Updated weights for policy 0, policy_version 100480 (0.0005) +[2023-03-11 16:42:33,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9557.3, 300 sec: 9427.7). Total num frames: 51474432. Throughput: 0: 9634.1. Samples: 51461568. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:42:33,386][41256] Avg episode reward: [(0, '82.995')] +[2023-03-11 16:42:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000100536_51474432.pth... +[2023-03-11 16:42:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000099968_51183616.pth +[2023-03-11 16:42:34,397][41544] Updated weights for policy 0, policy_version 100560 (0.0005) +[2023-03-11 16:42:38,385][41256] Fps is (10 sec: 10240.1, 60 sec: 9693.9, 300 sec: 9455.5). Total num frames: 51527680. Throughput: 0: 9700.8. Samples: 51521872. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:42:38,386][41256] Avg episode reward: [(0, '84.896')] +[2023-03-11 16:42:38,386][41544] Updated weights for policy 0, policy_version 100640 (0.0005) +[2023-03-11 16:42:42,560][41544] Updated weights for policy 0, policy_version 100720 (0.0005) +[2023-03-11 16:42:43,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9441.6). Total num frames: 51572736. Throughput: 0: 9778.9. Samples: 51552328. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:42:43,386][41256] Avg episode reward: [(0, '85.703')] +[2023-03-11 16:42:46,911][41544] Updated weights for policy 0, policy_version 100800 (0.0005) +[2023-03-11 16:42:48,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9693.9, 300 sec: 9455.5). Total num frames: 51621888. Throughput: 0: 9762.8. Samples: 51609468. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:42:48,386][41256] Avg episode reward: [(0, '85.069')] +[2023-03-11 16:42:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000100824_51621888.pth... +[2023-03-11 16:42:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000100256_51331072.pth +[2023-03-11 16:42:51,255][41544] Updated weights for policy 0, policy_version 100880 (0.0006) +[2023-03-11 16:42:53,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9441.6). Total num frames: 51666944. Throughput: 0: 9687.1. Samples: 51665516. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:42:53,386][41256] Avg episode reward: [(0, '82.765')] +[2023-03-11 16:42:55,672][41544] Updated weights for policy 0, policy_version 100960 (0.0005) +[2023-03-11 16:42:58,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9441.6). Total num frames: 51716096. Throughput: 0: 9625.4. Samples: 51693072. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:42:58,386][41256] Avg episode reward: [(0, '82.457')] +[2023-03-11 16:42:59,961][41544] Updated weights for policy 0, policy_version 101040 (0.0005) +[2023-03-11 16:43:03,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9427.7). Total num frames: 51761152. Throughput: 0: 9571.3. Samples: 51749444. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:43:03,395][41256] Avg episode reward: [(0, '83.452')] +[2023-03-11 16:43:03,398][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000101096_51761152.pth... +[2023-03-11 16:43:03,401][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000100536_51474432.pth +[2023-03-11 16:43:04,500][41544] Updated weights for policy 0, policy_version 101120 (0.0005) +[2023-03-11 16:43:08,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9441.6). Total num frames: 51810304. Throughput: 0: 9556.8. Samples: 51805824. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:43:08,396][41256] Avg episode reward: [(0, '87.155')] +[2023-03-11 16:43:08,763][41544] Updated weights for policy 0, policy_version 101200 (0.0003) +[2023-03-11 16:43:12,978][41544] Updated weights for policy 0, policy_version 101280 (0.0004) +[2023-03-11 16:43:13,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9427.7). Total num frames: 51855360. Throughput: 0: 9579.0. Samples: 51834648. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:43:13,396][41256] Avg episode reward: [(0, '89.899')] +[2023-03-11 16:43:17,208][41544] Updated weights for policy 0, policy_version 101360 (0.0004) +[2023-03-11 16:43:18,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9441.6). Total num frames: 51904512. Throughput: 0: 9573.6. Samples: 51892380. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:43:18,386][41256] Avg episode reward: [(0, '87.965')] +[2023-03-11 16:43:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000101376_51904512.pth... 
+[2023-03-11 16:43:18,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000100824_51621888.pth +[2023-03-11 16:43:21,572][41544] Updated weights for policy 0, policy_version 101440 (0.0004) +[2023-03-11 16:43:23,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9557.3, 300 sec: 9441.6). Total num frames: 51953664. Throughput: 0: 9500.7. Samples: 51949404. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:43:23,386][41256] Avg episode reward: [(0, '87.015')] +[2023-03-11 16:43:26,069][41544] Updated weights for policy 0, policy_version 101520 (0.0006) +[2023-03-11 16:43:28,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9427.7). Total num frames: 51998720. Throughput: 0: 9423.9. Samples: 51976404. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:43:28,386][41256] Avg episode reward: [(0, '84.308')] +[2023-03-11 16:43:30,416][41544] Updated weights for policy 0, policy_version 101600 (0.0005) +[2023-03-11 16:43:33,386][41256] Fps is (10 sec: 9011.1, 60 sec: 9489.1, 300 sec: 9413.9). Total num frames: 52043776. Throughput: 0: 9389.7. Samples: 52032004. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:43:33,386][41256] Avg episode reward: [(0, '83.644')] +[2023-03-11 16:43:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000101648_52043776.pth... +[2023-03-11 16:43:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000101096_51761152.pth +[2023-03-11 16:43:34,804][41544] Updated weights for policy 0, policy_version 101680 (0.0003) +[2023-03-11 16:43:38,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9413.9). Total num frames: 52092928. Throughput: 0: 9400.4. Samples: 52088536. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:43:38,386][41256] Avg episode reward: [(0, '86.840')] +[2023-03-11 16:43:39,189][41544] Updated weights for policy 0, policy_version 101760 (0.0005) +[2023-03-11 16:43:43,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9420.8, 300 sec: 9413.9). Total num frames: 52137984. Throughput: 0: 9430.5. Samples: 52117444. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:43:43,386][41256] Avg episode reward: [(0, '86.009')] +[2023-03-11 16:43:43,668][41544] Updated weights for policy 0, policy_version 101840 (0.0005) +[2023-03-11 16:43:44,576][41500] Early stopping after 2 epochs (8 sgd steps), loss delta 0.0000003 +[2023-03-11 16:43:48,176][41544] Updated weights for policy 0, policy_version 101920 (0.0005) +[2023-03-11 16:43:48,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9413.9). Total num frames: 52183040. Throughput: 0: 9364.3. Samples: 52170836. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:43:48,386][41256] Avg episode reward: [(0, '86.699')] +[2023-03-11 16:43:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000101920_52183040.pth... +[2023-03-11 16:43:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000101376_51904512.pth +[2023-03-11 16:43:52,641][41544] Updated weights for policy 0, policy_version 102000 (0.0006) +[2023-03-11 16:43:53,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9413.9). Total num frames: 52228096. Throughput: 0: 9336.6. Samples: 52225972. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 16:43:53,386][41256] Avg episode reward: [(0, '87.959')] +[2023-03-11 16:43:56,971][41544] Updated weights for policy 0, policy_version 102080 (0.0005) +[2023-03-11 16:43:58,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9427.7). Total num frames: 52277248. Throughput: 0: 9319.8. 
Samples: 52254040. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 16:43:58,386][41256] Avg episode reward: [(0, '87.052')] +[2023-03-11 16:44:01,085][41544] Updated weights for policy 0, policy_version 102160 (0.0005) +[2023-03-11 16:44:03,386][41256] Fps is (10 sec: 9830.3, 60 sec: 9420.8, 300 sec: 9441.6). Total num frames: 52326400. Throughput: 0: 9357.4. Samples: 52313464. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 16:44:03,397][41256] Avg episode reward: [(0, '85.207')] +[2023-03-11 16:44:03,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000102200_52326400.pth... +[2023-03-11 16:44:03,403][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000101648_52043776.pth +[2023-03-11 16:44:05,138][41544] Updated weights for policy 0, policy_version 102240 (0.0005) +[2023-03-11 16:44:08,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9420.8, 300 sec: 9469.4). Total num frames: 52375552. Throughput: 0: 9415.3. Samples: 52373092. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 16:44:08,397][41256] Avg episode reward: [(0, '79.186')] +[2023-03-11 16:44:09,338][41544] Updated weights for policy 0, policy_version 102320 (0.0005) +[2023-03-11 16:44:13,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9489.1, 300 sec: 9483.3). Total num frames: 52424704. Throughput: 0: 9478.7. Samples: 52402944. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 16:44:13,396][41256] Avg episode reward: [(0, '86.651')] +[2023-03-11 16:44:13,526][41544] Updated weights for policy 0, policy_version 102400 (0.0005) +[2023-03-11 16:44:17,856][41544] Updated weights for policy 0, policy_version 102480 (0.0005) +[2023-03-11 16:44:18,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9489.1, 300 sec: 9497.2). Total num frames: 52473856. Throughput: 0: 9522.2. Samples: 52460504. 
Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 16:44:18,397][41256] Avg episode reward: [(0, '85.403')] +[2023-03-11 16:44:18,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000102488_52473856.pth... +[2023-03-11 16:44:18,403][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000101920_52183040.pth +[2023-03-11 16:44:22,311][41544] Updated weights for policy 0, policy_version 102560 (0.0005) +[2023-03-11 16:44:23,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9497.2). Total num frames: 52518912. Throughput: 0: 9492.9. Samples: 52515716. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 16:44:23,396][41256] Avg episode reward: [(0, '85.044')] +[2023-03-11 16:44:26,566][41544] Updated weights for policy 0, policy_version 102640 (0.0005) +[2023-03-11 16:44:28,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9489.1, 300 sec: 9511.1). Total num frames: 52568064. Throughput: 0: 9494.4. Samples: 52544692. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 16:44:28,386][41256] Avg episode reward: [(0, '87.292')] +[2023-03-11 16:44:30,781][41544] Updated weights for policy 0, policy_version 102720 (0.0005) +[2023-03-11 16:44:33,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9524.9). Total num frames: 52617216. Throughput: 0: 9610.0. Samples: 52603284. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 16:44:33,386][41256] Avg episode reward: [(0, '85.616')] +[2023-03-11 16:44:33,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000102768_52617216.pth... 
+[2023-03-11 16:44:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000102200_52326400.pth +[2023-03-11 16:44:34,900][41544] Updated weights for policy 0, policy_version 102800 (0.0005) +[2023-03-11 16:44:38,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9538.8). Total num frames: 52666368. Throughput: 0: 9695.7. Samples: 52662280. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 16:44:38,396][41256] Avg episode reward: [(0, '85.409')] +[2023-03-11 16:44:39,079][41544] Updated weights for policy 0, policy_version 102880 (0.0005) +[2023-03-11 16:44:43,292][41544] Updated weights for policy 0, policy_version 102960 (0.0005) +[2023-03-11 16:44:43,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9625.6, 300 sec: 9552.7). Total num frames: 52715520. Throughput: 0: 9716.5. Samples: 52691280. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 16:44:43,396][41256] Avg episode reward: [(0, '85.332')] +[2023-03-11 16:44:47,496][41544] Updated weights for policy 0, policy_version 103040 (0.0005) +[2023-03-11 16:44:48,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9552.7). Total num frames: 52760576. Throughput: 0: 9700.6. Samples: 52749992. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 16:44:48,396][41256] Avg episode reward: [(0, '83.126')] +[2023-03-11 16:44:48,409][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000103056_52764672.pth... +[2023-03-11 16:44:48,410][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000102488_52473856.pth +[2023-03-11 16:44:51,839][41544] Updated weights for policy 0, policy_version 103120 (0.0005) +[2023-03-11 16:44:53,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9566.6). Total num frames: 52809728. Throughput: 0: 9642.6. Samples: 52807008. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:44:53,396][41256] Avg episode reward: [(0, '81.133')] +[2023-03-11 16:44:55,898][41544] Updated weights for policy 0, policy_version 103200 (0.0005) +[2023-03-11 16:44:58,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9566.6). Total num frames: 52858880. Throughput: 0: 9663.0. Samples: 52837780. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:44:58,386][41256] Avg episode reward: [(0, '83.682')] +[2023-03-11 16:45:00,003][41544] Updated weights for policy 0, policy_version 103280 (0.0004) +[2023-03-11 16:45:03,386][41256] Fps is (10 sec: 10239.8, 60 sec: 9762.1, 300 sec: 9580.5). Total num frames: 52912128. Throughput: 0: 9703.4. Samples: 52897160. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:45:03,386][41256] Avg episode reward: [(0, '85.287')] +[2023-03-11 16:45:03,391][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000103344_52912128.pth... +[2023-03-11 16:45:03,394][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000102768_52617216.pth +[2023-03-11 16:45:04,217][41544] Updated weights for policy 0, policy_version 103360 (0.0005) +[2023-03-11 16:45:08,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9580.5). Total num frames: 52957184. Throughput: 0: 9741.2. Samples: 52954068. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:45:08,386][41256] Avg episode reward: [(0, '82.473')] +[2023-03-11 16:45:08,532][41544] Updated weights for policy 0, policy_version 103440 (0.0005) +[2023-03-11 16:45:12,726][41544] Updated weights for policy 0, policy_version 103520 (0.0005) +[2023-03-11 16:45:13,385][41256] Fps is (10 sec: 9421.0, 60 sec: 9693.9, 300 sec: 9580.5). Total num frames: 53006336. Throughput: 0: 9758.4. Samples: 52983820. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:45:13,386][41256] Avg episode reward: [(0, '85.056')] +[2023-03-11 16:45:17,054][41544] Updated weights for policy 0, policy_version 103600 (0.0005) +[2023-03-11 16:45:18,386][41256] Fps is (10 sec: 9830.3, 60 sec: 9693.9, 300 sec: 9594.4). Total num frames: 53055488. Throughput: 0: 9727.6. Samples: 53041024. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:45:18,386][41256] Avg episode reward: [(0, '85.513')] +[2023-03-11 16:45:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000103624_53055488.pth... +[2023-03-11 16:45:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000103056_52764672.pth +[2023-03-11 16:45:21,342][41544] Updated weights for policy 0, policy_version 103680 (0.0005) +[2023-03-11 16:45:23,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9580.5). Total num frames: 53100544. Throughput: 0: 9697.7. Samples: 53098676. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:45:23,386][41256] Avg episode reward: [(0, '83.366')] +[2023-03-11 16:45:25,551][41544] Updated weights for policy 0, policy_version 103760 (0.0005) +[2023-03-11 16:45:28,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9693.9, 300 sec: 9580.5). Total num frames: 53149696. Throughput: 0: 9702.7. Samples: 53127900. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:45:28,386][41256] Avg episode reward: [(0, '81.807')] +[2023-03-11 16:45:29,808][41544] Updated weights for policy 0, policy_version 103840 (0.0005) +[2023-03-11 16:45:33,386][41256] Fps is (10 sec: 9830.3, 60 sec: 9693.8, 300 sec: 9594.4). Total num frames: 53198848. Throughput: 0: 9669.0. Samples: 53185100. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:45:33,386][41256] Avg episode reward: [(0, '81.519')] +[2023-03-11 16:45:33,391][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000103904_53198848.pth... +[2023-03-11 16:45:33,393][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000103344_52912128.pth +[2023-03-11 16:45:34,241][41544] Updated weights for policy 0, policy_version 103920 (0.0005) +[2023-03-11 16:45:38,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9580.5). Total num frames: 53243904. Throughput: 0: 9653.4. Samples: 53241412. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:45:38,386][41256] Avg episode reward: [(0, '82.415')] +[2023-03-11 16:45:38,540][41544] Updated weights for policy 0, policy_version 104000 (0.0005) +[2023-03-11 16:45:42,960][41544] Updated weights for policy 0, policy_version 104080 (0.0006) +[2023-03-11 16:45:43,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9557.3, 300 sec: 9580.5). Total num frames: 53288960. Throughput: 0: 9594.7. Samples: 53269540. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:45:43,386][41256] Avg episode reward: [(0, '87.655')] +[2023-03-11 16:45:47,393][41544] Updated weights for policy 0, policy_version 104160 (0.0006) +[2023-03-11 16:45:48,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9594.4). Total num frames: 53338112. Throughput: 0: 9505.5. Samples: 53324908. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:45:48,397][41256] Avg episode reward: [(0, '88.495')] +[2023-03-11 16:45:48,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000104176_53338112.pth... 
+[2023-03-11 16:45:48,403][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000103624_53055488.pth +[2023-03-11 16:45:51,509][41544] Updated weights for policy 0, policy_version 104240 (0.0005) +[2023-03-11 16:45:53,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9608.2). Total num frames: 53387264. Throughput: 0: 9546.9. Samples: 53383680. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:45:53,397][41256] Avg episode reward: [(0, '88.432')] +[2023-03-11 16:45:55,541][41544] Updated weights for policy 0, policy_version 104320 (0.0005) +[2023-03-11 16:45:58,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9594.4). Total num frames: 53436416. Throughput: 0: 9583.2. Samples: 53415064. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:45:58,397][41256] Avg episode reward: [(0, '81.193')] +[2023-03-11 16:45:59,729][41544] Updated weights for policy 0, policy_version 104400 (0.0005) +[2023-03-11 16:46:03,385][41256] Fps is (10 sec: 10240.1, 60 sec: 9625.6, 300 sec: 9608.2). Total num frames: 53489664. Throughput: 0: 9607.8. Samples: 53473372. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:46:03,396][41256] Avg episode reward: [(0, '83.677')] +[2023-03-11 16:46:03,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000104472_53489664.pth... +[2023-03-11 16:46:03,401][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000103904_53198848.pth +[2023-03-11 16:46:03,814][41544] Updated weights for policy 0, policy_version 104480 (0.0005) +[2023-03-11 16:46:08,082][41544] Updated weights for policy 0, policy_version 104560 (0.0005) +[2023-03-11 16:46:08,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9594.4). Total num frames: 53534720. Throughput: 0: 9633.3. Samples: 53532172. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:46:08,386][41256] Avg episode reward: [(0, '86.338')] +[2023-03-11 16:46:12,256][41544] Updated weights for policy 0, policy_version 104640 (0.0005) +[2023-03-11 16:46:13,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9594.4). Total num frames: 53583872. Throughput: 0: 9656.4. Samples: 53562440. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:46:13,396][41256] Avg episode reward: [(0, '84.919')] +[2023-03-11 16:46:16,574][41544] Updated weights for policy 0, policy_version 104720 (0.0005) +[2023-03-11 16:46:18,386][41256] Fps is (10 sec: 9830.3, 60 sec: 9625.6, 300 sec: 9594.4). Total num frames: 53633024. Throughput: 0: 9651.4. Samples: 53619412. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:46:18,397][41256] Avg episode reward: [(0, '86.668')] +[2023-03-11 16:46:18,401][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000104752_53633024.pth... +[2023-03-11 16:46:18,403][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000104176_53338112.pth +[2023-03-11 16:46:20,907][41544] Updated weights for policy 0, policy_version 104800 (0.0004) +[2023-03-11 16:46:23,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9580.5). Total num frames: 53678080. Throughput: 0: 9662.9. Samples: 53676244. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:46:23,396][41256] Avg episode reward: [(0, '85.135')] +[2023-03-11 16:46:25,233][41544] Updated weights for policy 0, policy_version 104880 (0.0005) +[2023-03-11 16:46:28,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9625.6, 300 sec: 9580.5). Total num frames: 53727232. Throughput: 0: 9649.3. Samples: 53703760. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:46:28,396][41256] Avg episode reward: [(0, '87.770')] +[2023-03-11 16:46:29,673][41544] Updated weights for policy 0, policy_version 104960 (0.0006) +[2023-03-11 16:46:33,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9557.4, 300 sec: 9580.5). Total num frames: 53772288. Throughput: 0: 9673.8. Samples: 53760228. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:46:33,397][41256] Avg episode reward: [(0, '86.237')] +[2023-03-11 16:46:33,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000105024_53772288.pth... +[2023-03-11 16:46:33,401][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000104472_53489664.pth +[2023-03-11 16:46:33,850][41544] Updated weights for policy 0, policy_version 105040 (0.0005) +[2023-03-11 16:46:38,017][41544] Updated weights for policy 0, policy_version 105120 (0.0005) +[2023-03-11 16:46:38,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9580.5). Total num frames: 53821440. Throughput: 0: 9694.1. Samples: 53819916. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:46:38,397][41256] Avg episode reward: [(0, '87.263')] +[2023-03-11 16:46:42,126][41544] Updated weights for policy 0, policy_version 105200 (0.0005) +[2023-03-11 16:46:43,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9762.1, 300 sec: 9608.2). Total num frames: 53874688. Throughput: 0: 9658.7. Samples: 53849704. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:46:43,397][41256] Avg episode reward: [(0, '87.330')] +[2023-03-11 16:46:46,211][41544] Updated weights for policy 0, policy_version 105280 (0.0005) +[2023-03-11 16:46:48,386][41256] Fps is (10 sec: 10239.9, 60 sec: 9762.1, 300 sec: 9608.2). Total num frames: 53923840. Throughput: 0: 9691.5. Samples: 53909492. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:46:48,397][41256] Avg episode reward: [(0, '81.798')] +[2023-03-11 16:46:48,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000105320_53923840.pth... +[2023-03-11 16:46:48,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000104752_53633024.pth +[2023-03-11 16:46:50,244][41544] Updated weights for policy 0, policy_version 105360 (0.0005) +[2023-03-11 16:46:53,385][41256] Fps is (10 sec: 10240.1, 60 sec: 9830.4, 300 sec: 9636.0). Total num frames: 53977088. Throughput: 0: 9768.2. Samples: 53971740. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:46:53,396][41256] Avg episode reward: [(0, '79.177')] +[2023-03-11 16:46:54,196][41544] Updated weights for policy 0, policy_version 105440 (0.0005) +[2023-03-11 16:46:58,316][41544] Updated weights for policy 0, policy_version 105520 (0.0005) +[2023-03-11 16:46:58,385][41256] Fps is (10 sec: 10240.1, 60 sec: 9830.4, 300 sec: 9636.0). Total num frames: 54026240. Throughput: 0: 9760.8. Samples: 54001676. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:46:58,386][41256] Avg episode reward: [(0, '78.520')] +[2023-03-11 16:47:02,664][41544] Updated weights for policy 0, policy_version 105600 (0.0005) +[2023-03-11 16:47:03,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9693.9, 300 sec: 9622.1). Total num frames: 54071296. Throughput: 0: 9781.3. Samples: 54059572. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:47:03,386][41256] Avg episode reward: [(0, '84.233')] +[2023-03-11 16:47:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000105608_54071296.pth... 
+[2023-03-11 16:47:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000105024_53772288.pth +[2023-03-11 16:47:07,109][41544] Updated weights for policy 0, policy_version 105680 (0.0005) +[2023-03-11 16:47:08,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9693.9, 300 sec: 9608.2). Total num frames: 54116352. Throughput: 0: 9753.3. Samples: 54115144. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:47:08,386][41256] Avg episode reward: [(0, '84.973')] +[2023-03-11 16:47:11,613][41544] Updated weights for policy 0, policy_version 105760 (0.0005) +[2023-03-11 16:47:13,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9625.6, 300 sec: 9594.4). Total num frames: 54161408. Throughput: 0: 9738.1. Samples: 54141976. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:47:13,386][41256] Avg episode reward: [(0, '80.981')] +[2023-03-11 16:47:16,171][41544] Updated weights for policy 0, policy_version 105840 (0.0005) +[2023-03-11 16:47:18,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9557.4, 300 sec: 9580.5). Total num frames: 54206464. Throughput: 0: 9683.5. Samples: 54195984. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:47:18,386][41256] Avg episode reward: [(0, '89.116')] +[2023-03-11 16:47:18,420][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000105880_54210560.pth... +[2023-03-11 16:47:18,421][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000105320_53923840.pth +[2023-03-11 16:47:20,617][41544] Updated weights for policy 0, policy_version 105920 (0.0005) +[2023-03-11 16:47:23,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9594.4). Total num frames: 54255616. Throughput: 0: 9592.6. Samples: 54251584. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:47:23,386][41256] Avg episode reward: [(0, '81.640')] +[2023-03-11 16:47:24,940][41544] Updated weights for policy 0, policy_version 106000 (0.0005) +[2023-03-11 16:47:28,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9557.3, 300 sec: 9580.5). Total num frames: 54300672. Throughput: 0: 9568.0. Samples: 54280264. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:47:28,386][41256] Avg episode reward: [(0, '79.837')] +[2023-03-11 16:47:29,312][41544] Updated weights for policy 0, policy_version 106080 (0.0005) +[2023-03-11 16:47:33,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9566.6). Total num frames: 54349824. Throughput: 0: 9512.1. Samples: 54337536. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:47:33,386][41256] Avg episode reward: [(0, '79.313')] +[2023-03-11 16:47:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000106152_54349824.pth... +[2023-03-11 16:47:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000105608_54071296.pth +[2023-03-11 16:47:33,597][41544] Updated weights for policy 0, policy_version 106160 (0.0005) +[2023-03-11 16:47:37,972][41544] Updated weights for policy 0, policy_version 106240 (0.0005) +[2023-03-11 16:47:38,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9580.5). Total num frames: 54398976. Throughput: 0: 9379.7. Samples: 54393828. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:47:38,386][41256] Avg episode reward: [(0, '80.166')] +[2023-03-11 16:47:42,120][41544] Updated weights for policy 0, policy_version 106320 (0.0004) +[2023-03-11 16:47:43,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9489.1, 300 sec: 9566.6). Total num frames: 54444032. Throughput: 0: 9379.8. Samples: 54423768. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:47:43,386][41256] Avg episode reward: [(0, '76.711')] +[2023-03-11 16:47:46,448][41544] Updated weights for policy 0, policy_version 106400 (0.0005) +[2023-03-11 16:47:48,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9580.5). Total num frames: 54493184. Throughput: 0: 9362.8. Samples: 54480896. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:47:48,386][41256] Avg episode reward: [(0, '77.112')] +[2023-03-11 16:47:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000106432_54493184.pth... +[2023-03-11 16:47:48,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000105880_54210560.pth +[2023-03-11 16:47:50,828][41544] Updated weights for policy 0, policy_version 106480 (0.0005) +[2023-03-11 16:47:53,385][41256] Fps is (10 sec: 9420.7, 60 sec: 9352.5, 300 sec: 9566.6). Total num frames: 54538240. Throughput: 0: 9356.8. Samples: 54536200. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:47:53,386][41256] Avg episode reward: [(0, '79.420')] +[2023-03-11 16:47:55,213][41544] Updated weights for policy 0, policy_version 106560 (0.0005) +[2023-03-11 16:47:58,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9580.5). Total num frames: 54587392. Throughput: 0: 9385.6. Samples: 54564328. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:47:58,386][41256] Avg episode reward: [(0, '76.882')] +[2023-03-11 16:47:59,466][41544] Updated weights for policy 0, policy_version 106640 (0.0005) +[2023-03-11 16:48:03,386][41256] Fps is (10 sec: 9830.2, 60 sec: 9420.8, 300 sec: 9580.5). Total num frames: 54636544. Throughput: 0: 9486.8. Samples: 54622892. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:48:03,386][41256] Avg episode reward: [(0, '71.937')] +[2023-03-11 16:48:03,391][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000106712_54636544.pth... +[2023-03-11 16:48:03,393][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000106152_54349824.pth +[2023-03-11 16:48:03,777][41544] Updated weights for policy 0, policy_version 106720 (0.0005) +[2023-03-11 16:48:08,012][41544] Updated weights for policy 0, policy_version 106800 (0.0005) +[2023-03-11 16:48:08,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9420.8, 300 sec: 9580.5). Total num frames: 54681600. Throughput: 0: 9518.9. Samples: 54679932. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:48:08,386][41256] Avg episode reward: [(0, '71.701')] +[2023-03-11 16:48:12,401][41544] Updated weights for policy 0, policy_version 106880 (0.0005) +[2023-03-11 16:48:13,385][41256] Fps is (10 sec: 9421.0, 60 sec: 9489.1, 300 sec: 9580.5). Total num frames: 54730752. Throughput: 0: 9502.0. Samples: 54707856. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:48:13,386][41256] Avg episode reward: [(0, '74.822')] +[2023-03-11 16:48:16,750][41544] Updated weights for policy 0, policy_version 106960 (0.0005) +[2023-03-11 16:48:18,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9489.1, 300 sec: 9566.6). Total num frames: 54775808. Throughput: 0: 9480.9. Samples: 54764176. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:48:18,386][41256] Avg episode reward: [(0, '74.870')] +[2023-03-11 16:48:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000106984_54775808.pth... 
+[2023-03-11 16:48:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000106432_54493184.pth +[2023-03-11 16:48:21,143][41544] Updated weights for policy 0, policy_version 107040 (0.0005) +[2023-03-11 16:48:23,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9580.5). Total num frames: 54824960. Throughput: 0: 9480.4. Samples: 54820448. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:48:23,386][41256] Avg episode reward: [(0, '78.757')] +[2023-03-11 16:48:25,529][41544] Updated weights for policy 0, policy_version 107120 (0.0005) +[2023-03-11 16:48:28,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9580.5). Total num frames: 54870016. Throughput: 0: 9441.4. Samples: 54848632. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:48:28,386][41256] Avg episode reward: [(0, '74.738')] +[2023-03-11 16:48:29,944][41544] Updated weights for policy 0, policy_version 107200 (0.0005) +[2023-03-11 16:48:33,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9420.8, 300 sec: 9566.6). Total num frames: 54915072. Throughput: 0: 9400.1. Samples: 54903900. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:48:33,386][41256] Avg episode reward: [(0, '79.770')] +[2023-03-11 16:48:33,410][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000107264_54919168.pth... +[2023-03-11 16:48:33,412][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000106712_54636544.pth +[2023-03-11 16:48:34,278][41544] Updated weights for policy 0, policy_version 107280 (0.0005) +[2023-03-11 16:48:38,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9580.5). Total num frames: 54964224. Throughput: 0: 9429.9. Samples: 54960544. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:48:38,386][41256] Avg episode reward: [(0, '79.291')] +[2023-03-11 16:48:38,663][41544] Updated weights for policy 0, policy_version 107360 (0.0005) +[2023-03-11 16:48:42,999][41544] Updated weights for policy 0, policy_version 107440 (0.0005) +[2023-03-11 16:48:43,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9580.5). Total num frames: 55009280. Throughput: 0: 9434.1. Samples: 54988864. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:48:43,386][41256] Avg episode reward: [(0, '79.214')] +[2023-03-11 16:48:47,299][41544] Updated weights for policy 0, policy_version 107520 (0.0005) +[2023-03-11 16:48:48,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9594.4). Total num frames: 55058432. Throughput: 0: 9406.0. Samples: 55046160. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:48:48,386][41256] Avg episode reward: [(0, '81.206')] +[2023-03-11 16:48:48,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000107536_55058432.pth... +[2023-03-11 16:48:48,393][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000106984_54775808.pth +[2023-03-11 16:48:51,624][41544] Updated weights for policy 0, policy_version 107600 (0.0005) +[2023-03-11 16:48:53,386][41256] Fps is (10 sec: 9830.2, 60 sec: 9489.1, 300 sec: 9594.4). Total num frames: 55107584. Throughput: 0: 9405.9. Samples: 55103200. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:48:53,386][41256] Avg episode reward: [(0, '77.854')] +[2023-03-11 16:48:55,981][41544] Updated weights for policy 0, policy_version 107680 (0.0005) +[2023-03-11 16:48:58,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9580.5). Total num frames: 55152640. Throughput: 0: 9412.8. Samples: 55131432. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:48:58,386][41256] Avg episode reward: [(0, '76.752')] +[2023-03-11 16:49:00,449][41544] Updated weights for policy 0, policy_version 107760 (0.0005) +[2023-03-11 16:49:03,386][41256] Fps is (10 sec: 9011.3, 60 sec: 9352.6, 300 sec: 9566.6). Total num frames: 55197696. Throughput: 0: 9382.9. Samples: 55186408. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:49:03,386][41256] Avg episode reward: [(0, '77.329')] +[2023-03-11 16:49:03,422][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000107816_55201792.pth... +[2023-03-11 16:49:03,423][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000107264_54919168.pth +[2023-03-11 16:49:04,735][41544] Updated weights for policy 0, policy_version 107840 (0.0005) +[2023-03-11 16:49:08,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9566.6). Total num frames: 55246848. Throughput: 0: 9389.7. Samples: 55242984. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:49:08,386][41256] Avg episode reward: [(0, '77.701')] +[2023-03-11 16:49:09,071][41544] Updated weights for policy 0, policy_version 107920 (0.0005) +[2023-03-11 16:49:13,335][41544] Updated weights for policy 0, policy_version 108000 (0.0005) +[2023-03-11 16:49:13,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9420.8, 300 sec: 9566.6). Total num frames: 55296000. Throughput: 0: 9397.0. Samples: 55271496. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:49:13,386][41256] Avg episode reward: [(0, '78.034')] +[2023-03-11 16:49:17,640][41544] Updated weights for policy 0, policy_version 108080 (0.0005) +[2023-03-11 16:49:18,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9566.6). Total num frames: 55341056. Throughput: 0: 9446.1. Samples: 55328976. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:49:18,386][41256] Avg episode reward: [(0, '77.220')] +[2023-03-11 16:49:18,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000108088_55341056.pth... +[2023-03-11 16:49:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000107536_55058432.pth +[2023-03-11 16:49:21,920][41544] Updated weights for policy 0, policy_version 108160 (0.0005) +[2023-03-11 16:49:23,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9420.8, 300 sec: 9566.6). Total num frames: 55390208. Throughput: 0: 9462.9. Samples: 55386376. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:49:23,386][41256] Avg episode reward: [(0, '75.805')] +[2023-03-11 16:49:26,274][41544] Updated weights for policy 0, policy_version 108240 (0.0005) +[2023-03-11 16:49:28,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9420.8, 300 sec: 9552.7). Total num frames: 55435264. Throughput: 0: 9466.3. Samples: 55414848. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:49:28,386][41256] Avg episode reward: [(0, '74.657')] +[2023-03-11 16:49:30,672][41544] Updated weights for policy 0, policy_version 108320 (0.0005) +[2023-03-11 16:49:33,385][41256] Fps is (10 sec: 9420.7, 60 sec: 9489.1, 300 sec: 9552.7). Total num frames: 55484416. Throughput: 0: 9443.6. Samples: 55471124. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:49:33,386][41256] Avg episode reward: [(0, '75.608')] +[2023-03-11 16:49:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000108368_55484416.pth... 
+[2023-03-11 16:49:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000107816_55201792.pth +[2023-03-11 16:49:34,979][41544] Updated weights for policy 0, policy_version 108400 (0.0005) +[2023-03-11 16:49:38,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9538.8). Total num frames: 55529472. Throughput: 0: 9446.0. Samples: 55528268. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:49:38,386][41256] Avg episode reward: [(0, '70.424')] +[2023-03-11 16:49:39,309][41544] Updated weights for policy 0, policy_version 108480 (0.0005) +[2023-03-11 16:49:43,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9552.7). Total num frames: 55578624. Throughput: 0: 9438.7. Samples: 55556172. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:49:43,386][41256] Avg episode reward: [(0, '74.001')] +[2023-03-11 16:49:43,640][41544] Updated weights for policy 0, policy_version 108560 (0.0005) +[2023-03-11 16:49:48,105][41544] Updated weights for policy 0, policy_version 108640 (0.0005) +[2023-03-11 16:49:48,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9420.8, 300 sec: 9538.8). Total num frames: 55623680. Throughput: 0: 9455.6. Samples: 55611912. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:49:48,386][41256] Avg episode reward: [(0, '77.856')] +[2023-03-11 16:49:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000108640_55623680.pth... +[2023-03-11 16:49:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000108088_55341056.pth +[2023-03-11 16:49:52,555][41544] Updated weights for policy 0, policy_version 108720 (0.0005) +[2023-03-11 16:49:53,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9352.6, 300 sec: 9524.9). Total num frames: 55668736. Throughput: 0: 9436.6. Samples: 55667632. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:49:53,386][41256] Avg episode reward: [(0, '76.558')] +[2023-03-11 16:49:57,055][41544] Updated weights for policy 0, policy_version 108800 (0.0005) +[2023-03-11 16:49:58,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9352.5, 300 sec: 9497.2). Total num frames: 55713792. Throughput: 0: 9402.8. Samples: 55694620. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:49:58,386][41256] Avg episode reward: [(0, '76.481')] +[2023-03-11 16:50:01,628][41544] Updated weights for policy 0, policy_version 108880 (0.0005) +[2023-03-11 16:50:03,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9497.2). Total num frames: 55758848. Throughput: 0: 9321.5. Samples: 55748444. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:50:03,386][41256] Avg episode reward: [(0, '69.430')] +[2023-03-11 16:50:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000108904_55758848.pth... +[2023-03-11 16:50:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000108368_55484416.pth +[2023-03-11 16:50:06,192][41544] Updated weights for policy 0, policy_version 108960 (0.0005) +[2023-03-11 16:50:08,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9483.3). Total num frames: 55803904. Throughput: 0: 9259.5. Samples: 55803056. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:50:08,386][41256] Avg episode reward: [(0, '66.988')] +[2023-03-11 16:50:10,741][41544] Updated weights for policy 0, policy_version 109040 (0.0005) +[2023-03-11 16:50:13,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9216.0, 300 sec: 9469.4). Total num frames: 55848960. Throughput: 0: 9207.8. Samples: 55829200. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:50:13,386][41256] Avg episode reward: [(0, '70.151')] +[2023-03-11 16:50:15,336][41544] Updated weights for policy 0, policy_version 109120 (0.0005) +[2023-03-11 16:50:18,386][41256] Fps is (10 sec: 9011.2, 60 sec: 9216.0, 300 sec: 9469.4). Total num frames: 55894016. Throughput: 0: 9148.3. Samples: 55882796. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:50:18,386][41256] Avg episode reward: [(0, '70.214')] +[2023-03-11 16:50:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000109168_55894016.pth... +[2023-03-11 16:50:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000108640_55623680.pth +[2023-03-11 16:50:19,885][41544] Updated weights for policy 0, policy_version 109200 (0.0005) +[2023-03-11 16:50:23,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9147.7, 300 sec: 9455.5). Total num frames: 55939072. Throughput: 0: 9075.1. Samples: 55936648. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:50:23,386][41256] Avg episode reward: [(0, '70.181')] +[2023-03-11 16:50:24,461][41544] Updated weights for policy 0, policy_version 109280 (0.0005) +[2023-03-11 16:50:28,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9147.7, 300 sec: 9441.6). Total num frames: 55984128. Throughput: 0: 9056.4. Samples: 55963712. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:50:28,386][41256] Avg episode reward: [(0, '70.554')] +[2023-03-11 16:50:29,071][41544] Updated weights for policy 0, policy_version 109360 (0.0005) +[2023-03-11 16:50:33,386][41256] Fps is (10 sec: 9011.2, 60 sec: 9079.5, 300 sec: 9441.6). Total num frames: 56029184. Throughput: 0: 9001.8. Samples: 56016992. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:50:33,386][41256] Avg episode reward: [(0, '73.207')] +[2023-03-11 16:50:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000109432_56029184.pth... +[2023-03-11 16:50:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000108904_55758848.pth +[2023-03-11 16:50:33,670][41544] Updated weights for policy 0, policy_version 109440 (0.0005) +[2023-03-11 16:50:38,221][41544] Updated weights for policy 0, policy_version 109520 (0.0005) +[2023-03-11 16:50:38,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9079.5, 300 sec: 9441.6). Total num frames: 56074240. Throughput: 0: 8952.7. Samples: 56070504. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:50:38,386][41256] Avg episode reward: [(0, '77.722')] +[2023-03-11 16:50:42,684][41544] Updated weights for policy 0, policy_version 109600 (0.0005) +[2023-03-11 16:50:43,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9011.2, 300 sec: 9427.7). Total num frames: 56119296. Throughput: 0: 8982.1. Samples: 56098816. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:50:43,386][41256] Avg episode reward: [(0, '77.751')] +[2023-03-11 16:50:47,264][41544] Updated weights for policy 0, policy_version 109680 (0.0005) +[2023-03-11 16:50:48,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9011.2, 300 sec: 9413.9). Total num frames: 56164352. Throughput: 0: 8970.9. Samples: 56152136. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:50:48,386][41256] Avg episode reward: [(0, '73.071')] +[2023-03-11 16:50:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000109696_56164352.pth... 
+[2023-03-11 16:50:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000109168_55894016.pth +[2023-03-11 16:50:51,785][41544] Updated weights for policy 0, policy_version 109760 (0.0005) +[2023-03-11 16:50:53,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9011.2, 300 sec: 9400.0). Total num frames: 56209408. Throughput: 0: 8968.9. Samples: 56206656. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:50:53,386][41256] Avg episode reward: [(0, '71.563')] +[2023-03-11 16:50:56,285][41544] Updated weights for policy 0, policy_version 109840 (0.0005) +[2023-03-11 16:50:58,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9011.2, 300 sec: 9372.2). Total num frames: 56254464. Throughput: 0: 8996.6. Samples: 56234048. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 16:50:58,396][41256] Avg episode reward: [(0, '74.325')] +[2023-03-11 16:51:00,735][41544] Updated weights for policy 0, policy_version 109920 (0.0005) +[2023-03-11 16:51:03,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9011.2, 300 sec: 9372.2). Total num frames: 56299520. Throughput: 0: 9023.0. Samples: 56288832. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 16:51:03,396][41256] Avg episode reward: [(0, '72.838')] +[2023-03-11 16:51:03,399][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000109960_56299520.pth... +[2023-03-11 16:51:03,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000109432_56029184.pth +[2023-03-11 16:51:05,197][41544] Updated weights for policy 0, policy_version 110000 (0.0005) +[2023-03-11 16:51:08,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9011.2, 300 sec: 9358.3). Total num frames: 56344576. Throughput: 0: 9050.8. Samples: 56343932. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 16:51:08,396][41256] Avg episode reward: [(0, '73.184')] +[2023-03-11 16:51:09,789][41544] Updated weights for policy 0, policy_version 110080 (0.0005) +[2023-03-11 16:51:13,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9011.2, 300 sec: 9344.4). Total num frames: 56389632. Throughput: 0: 9034.3. Samples: 56370256. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 16:51:13,396][41256] Avg episode reward: [(0, '75.169')] +[2023-03-11 16:51:14,292][41544] Updated weights for policy 0, policy_version 110160 (0.0005) +[2023-03-11 16:51:18,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9079.5, 300 sec: 9358.3). Total num frames: 56438784. Throughput: 0: 9079.1. Samples: 56425552. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 16:51:18,396][41256] Avg episode reward: [(0, '74.998')] +[2023-03-11 16:51:18,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000110232_56438784.pth... +[2023-03-11 16:51:18,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000109696_56164352.pth +[2023-03-11 16:51:18,788][41544] Updated weights for policy 0, policy_version 110240 (0.0005) +[2023-03-11 16:51:23,283][41544] Updated weights for policy 0, policy_version 110320 (0.0005) +[2023-03-11 16:51:23,385][41256] Fps is (10 sec: 9420.7, 60 sec: 9079.5, 300 sec: 9344.4). Total num frames: 56483840. Throughput: 0: 9095.6. Samples: 56479808. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 16:51:23,396][41256] Avg episode reward: [(0, '75.640')] +[2023-03-11 16:51:27,790][41544] Updated weights for policy 0, policy_version 110400 (0.0005) +[2023-03-11 16:51:28,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9079.5, 300 sec: 9344.4). Total num frames: 56528896. Throughput: 0: 9072.5. Samples: 56507080. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 16:51:28,396][41256] Avg episode reward: [(0, '74.727')] +[2023-03-11 16:51:32,325][41544] Updated weights for policy 0, policy_version 110480 (0.0005) +[2023-03-11 16:51:33,386][41256] Fps is (10 sec: 9011.2, 60 sec: 9079.5, 300 sec: 9330.5). Total num frames: 56573952. Throughput: 0: 9100.9. Samples: 56561676. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 16:51:33,396][41256] Avg episode reward: [(0, '73.817')] +[2023-03-11 16:51:33,399][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000110496_56573952.pth... +[2023-03-11 16:51:33,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000109960_56299520.pth +[2023-03-11 16:51:36,829][41544] Updated weights for policy 0, policy_version 110560 (0.0005) +[2023-03-11 16:51:38,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9079.5, 300 sec: 9302.8). Total num frames: 56619008. Throughput: 0: 9088.0. Samples: 56615616. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 16:51:38,396][41256] Avg episode reward: [(0, '79.460')] +[2023-03-11 16:51:41,293][41544] Updated weights for policy 0, policy_version 110640 (0.0005) +[2023-03-11 16:51:43,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9079.5, 300 sec: 9288.9). Total num frames: 56664064. Throughput: 0: 9101.1. Samples: 56643596. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 16:51:43,386][41256] Avg episode reward: [(0, '80.664')] +[2023-03-11 16:51:45,759][41544] Updated weights for policy 0, policy_version 110720 (0.0005) +[2023-03-11 16:51:48,385][41256] Fps is (10 sec: 9011.1, 60 sec: 9079.5, 300 sec: 9261.1). Total num frames: 56709120. Throughput: 0: 9099.7. Samples: 56698320. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 16:51:48,386][41256] Avg episode reward: [(0, '80.589')] +[2023-03-11 16:51:48,388][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000110760_56709120.pth... +[2023-03-11 16:51:48,390][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000110232_56438784.pth +[2023-03-11 16:51:50,197][41544] Updated weights for policy 0, policy_version 110800 (0.0005) +[2023-03-11 16:51:53,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9147.7, 300 sec: 9261.1). Total num frames: 56758272. Throughput: 0: 9118.0. Samples: 56754240. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 16:51:53,386][41256] Avg episode reward: [(0, '78.926')] +[2023-03-11 16:51:54,635][41544] Updated weights for policy 0, policy_version 110880 (0.0005) +[2023-03-11 16:51:58,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9147.7, 300 sec: 9261.1). Total num frames: 56803328. Throughput: 0: 9150.7. Samples: 56782040. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 16:51:58,386][41256] Avg episode reward: [(0, '80.354')] +[2023-03-11 16:51:59,120][41544] Updated weights for policy 0, policy_version 110960 (0.0005) +[2023-03-11 16:52:03,386][41256] Fps is (10 sec: 9011.1, 60 sec: 9147.7, 300 sec: 9261.1). Total num frames: 56848384. Throughput: 0: 9135.1. Samples: 56836632. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 16:52:03,386][41256] Avg episode reward: [(0, '81.653')] +[2023-03-11 16:52:03,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000111032_56848384.pth... 
+[2023-03-11 16:52:03,393][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000110496_56573952.pth +[2023-03-11 16:52:03,512][41544] Updated weights for policy 0, policy_version 111040 (0.0005) +[2023-03-11 16:52:07,914][41544] Updated weights for policy 0, policy_version 111120 (0.0005) +[2023-03-11 16:52:08,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9275.0). Total num frames: 56897536. Throughput: 0: 9189.2. Samples: 56893320. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 16:52:08,386][41256] Avg episode reward: [(0, '80.823')] +[2023-03-11 16:52:12,286][41544] Updated weights for policy 0, policy_version 111200 (0.0005) +[2023-03-11 16:52:13,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9275.0). Total num frames: 56942592. Throughput: 0: 9208.5. Samples: 56921464. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 16:52:13,386][41256] Avg episode reward: [(0, '79.515')] +[2023-03-11 16:52:16,568][41544] Updated weights for policy 0, policy_version 111280 (0.0005) +[2023-03-11 16:52:18,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9275.0). Total num frames: 56991744. Throughput: 0: 9253.2. Samples: 56978068. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 16:52:18,386][41256] Avg episode reward: [(0, '78.697')] +[2023-03-11 16:52:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000111312_56991744.pth... +[2023-03-11 16:52:18,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000110760_56709120.pth +[2023-03-11 16:52:20,856][41544] Updated weights for policy 0, policy_version 111360 (0.0005) +[2023-03-11 16:52:23,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9216.0, 300 sec: 9275.0). Total num frames: 57036800. Throughput: 0: 9330.9. Samples: 57035508. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 16:52:23,386][41256] Avg episode reward: [(0, '76.960')] +[2023-03-11 16:52:25,228][41544] Updated weights for policy 0, policy_version 111440 (0.0005) +[2023-03-11 16:52:28,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9275.0). Total num frames: 57085952. Throughput: 0: 9317.0. Samples: 57062860. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 16:52:28,386][41256] Avg episode reward: [(0, '81.429')] +[2023-03-11 16:52:29,328][41544] Updated weights for policy 0, policy_version 111520 (0.0004) +[2023-03-11 16:52:33,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9352.6, 300 sec: 9275.0). Total num frames: 57135104. Throughput: 0: 9439.3. Samples: 57123088. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 16:52:33,386][41256] Avg episode reward: [(0, '79.357')] +[2023-03-11 16:52:33,388][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000111592_57135104.pth... +[2023-03-11 16:52:33,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000111032_56848384.pth +[2023-03-11 16:52:33,459][41544] Updated weights for policy 0, policy_version 111600 (0.0005) +[2023-03-11 16:52:37,702][41544] Updated weights for policy 0, policy_version 111680 (0.0005) +[2023-03-11 16:52:38,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9420.8, 300 sec: 9288.9). Total num frames: 57184256. Throughput: 0: 9492.2. Samples: 57181388. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 16:52:38,386][41256] Avg episode reward: [(0, '82.137')] +[2023-03-11 16:52:41,933][41544] Updated weights for policy 0, policy_version 111760 (0.0003) +[2023-03-11 16:52:43,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9489.1, 300 sec: 9288.9). Total num frames: 57233408. Throughput: 0: 9518.0. Samples: 57210352. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 16:52:43,386][41256] Avg episode reward: [(0, '82.549')] +[2023-03-11 16:52:46,058][41544] Updated weights for policy 0, policy_version 111840 (0.0003) +[2023-03-11 16:52:48,385][41256] Fps is (10 sec: 9830.3, 60 sec: 9557.3, 300 sec: 9302.8). Total num frames: 57282560. Throughput: 0: 9625.9. Samples: 57269796. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 16:52:48,386][41256] Avg episode reward: [(0, '81.423')] +[2023-03-11 16:52:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000111880_57282560.pth... +[2023-03-11 16:52:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000111312_56991744.pth +[2023-03-11 16:52:50,500][41544] Updated weights for policy 0, policy_version 111920 (0.0005) +[2023-03-11 16:52:53,385][41256] Fps is (10 sec: 9420.7, 60 sec: 9489.1, 300 sec: 9288.9). Total num frames: 57327616. Throughput: 0: 9570.1. Samples: 57323976. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 16:52:53,386][41256] Avg episode reward: [(0, '81.315')] +[2023-03-11 16:52:55,000][41544] Updated weights for policy 0, policy_version 112000 (0.0006) +[2023-03-11 16:52:58,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9489.1, 300 sec: 9275.0). Total num frames: 57372672. Throughput: 0: 9572.1. Samples: 57352208. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:52:58,386][41256] Avg episode reward: [(0, '84.617')] +[2023-03-11 16:52:59,471][41544] Updated weights for policy 0, policy_version 112080 (0.0006) +[2023-03-11 16:53:03,386][41256] Fps is (10 sec: 9011.2, 60 sec: 9489.1, 300 sec: 9275.0). Total num frames: 57417728. Throughput: 0: 9514.7. Samples: 57406228. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:53:03,386][41256] Avg episode reward: [(0, '81.765')] +[2023-03-11 16:53:03,419][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000112152_57421824.pth... +[2023-03-11 16:53:03,421][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000111592_57135104.pth +[2023-03-11 16:53:03,827][41544] Updated weights for policy 0, policy_version 112160 (0.0005) +[2023-03-11 16:53:08,096][41544] Updated weights for policy 0, policy_version 112240 (0.0004) +[2023-03-11 16:53:08,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9275.0). Total num frames: 57466880. Throughput: 0: 9521.0. Samples: 57463956. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:53:08,386][41256] Avg episode reward: [(0, '80.802')] +[2023-03-11 16:53:12,241][41544] Updated weights for policy 0, policy_version 112320 (0.0005) +[2023-03-11 16:53:13,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9288.9). Total num frames: 57516032. Throughput: 0: 9580.0. Samples: 57493960. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:53:13,386][41256] Avg episode reward: [(0, '81.055')] +[2023-03-11 16:53:16,383][41544] Updated weights for policy 0, policy_version 112400 (0.0005) +[2023-03-11 16:53:18,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9288.9). Total num frames: 57565184. Throughput: 0: 9556.3. Samples: 57553124. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:53:18,386][41256] Avg episode reward: [(0, '82.560')] +[2023-03-11 16:53:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000112432_57565184.pth... 
+[2023-03-11 16:53:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000111880_57282560.pth +[2023-03-11 16:53:20,611][41544] Updated weights for policy 0, policy_version 112480 (0.0005) +[2023-03-11 16:53:23,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9302.8). Total num frames: 57614336. Throughput: 0: 9538.6. Samples: 57610624. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:53:23,386][41256] Avg episode reward: [(0, '82.415')] +[2023-03-11 16:53:24,849][41544] Updated weights for policy 0, policy_version 112560 (0.0004) +[2023-03-11 16:53:28,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9625.6, 300 sec: 9316.7). Total num frames: 57663488. Throughput: 0: 9558.9. Samples: 57640504. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:53:28,386][41256] Avg episode reward: [(0, '81.882')] +[2023-03-11 16:53:29,143][41544] Updated weights for policy 0, policy_version 112640 (0.0005) +[2023-03-11 16:53:33,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9557.3, 300 sec: 9302.8). Total num frames: 57708544. Throughput: 0: 9478.5. Samples: 57696328. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:53:33,386][41256] Avg episode reward: [(0, '80.369')] +[2023-03-11 16:53:33,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000112712_57708544.pth... +[2023-03-11 16:53:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000112152_57421824.pth +[2023-03-11 16:53:33,517][41544] Updated weights for policy 0, policy_version 112720 (0.0005) +[2023-03-11 16:53:37,655][41544] Updated weights for policy 0, policy_version 112800 (0.0005) +[2023-03-11 16:53:38,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9316.7). Total num frames: 57757696. Throughput: 0: 9585.6. Samples: 57755328. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:53:38,386][41256] Avg episode reward: [(0, '82.350')] +[2023-03-11 16:53:41,857][41544] Updated weights for policy 0, policy_version 112880 (0.0005) +[2023-03-11 16:53:43,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9557.3, 300 sec: 9316.7). Total num frames: 57806848. Throughput: 0: 9610.1. Samples: 57784664. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:53:43,386][41256] Avg episode reward: [(0, '81.523')] +[2023-03-11 16:53:46,050][41544] Updated weights for policy 0, policy_version 112960 (0.0005) +[2023-03-11 16:53:48,386][41256] Fps is (10 sec: 9830.3, 60 sec: 9557.3, 300 sec: 9316.7). Total num frames: 57856000. Throughput: 0: 9715.8. Samples: 57843440. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:53:48,386][41256] Avg episode reward: [(0, '81.272')] +[2023-03-11 16:53:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000113000_57856000.pth... +[2023-03-11 16:53:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000112432_57565184.pth +[2023-03-11 16:53:50,308][41544] Updated weights for policy 0, policy_version 113040 (0.0005) +[2023-03-11 16:53:53,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9316.7). Total num frames: 57901056. Throughput: 0: 9686.5. Samples: 57899848. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:53:53,386][41256] Avg episode reward: [(0, '86.332')] +[2023-03-11 16:53:54,832][41544] Updated weights for policy 0, policy_version 113120 (0.0005) +[2023-03-11 16:53:58,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9625.6, 300 sec: 9330.6). Total num frames: 57950208. Throughput: 0: 9612.3. Samples: 57926512. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:53:58,386][41256] Avg episode reward: [(0, '85.419')] +[2023-03-11 16:53:59,125][41544] Updated weights for policy 0, policy_version 113200 (0.0004) +[2023-03-11 16:54:03,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9316.7). Total num frames: 57995264. Throughput: 0: 9552.5. Samples: 57982984. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:54:03,386][41256] Avg episode reward: [(0, '84.742')] +[2023-03-11 16:54:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000113272_57995264.pth... +[2023-03-11 16:54:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000112712_57708544.pth +[2023-03-11 16:54:03,623][41544] Updated weights for policy 0, policy_version 113280 (0.0005) +[2023-03-11 16:54:07,756][41544] Updated weights for policy 0, policy_version 113360 (0.0004) +[2023-03-11 16:54:08,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9316.7). Total num frames: 58044416. Throughput: 0: 9561.6. Samples: 58040896. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:54:08,386][41256] Avg episode reward: [(0, '87.465')] +[2023-03-11 16:54:11,956][41544] Updated weights for policy 0, policy_version 113440 (0.0005) +[2023-03-11 16:54:13,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9625.6, 300 sec: 9330.6). Total num frames: 58093568. Throughput: 0: 9541.3. Samples: 58069864. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:54:13,386][41256] Avg episode reward: [(0, '85.207')] +[2023-03-11 16:54:16,010][41544] Updated weights for policy 0, policy_version 113520 (0.0004) +[2023-03-11 16:54:18,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9330.5). Total num frames: 58142720. Throughput: 0: 9648.2. Samples: 58130496. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:54:18,386][41256] Avg episode reward: [(0, '84.770')] +[2023-03-11 16:54:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000113560_58142720.pth... +[2023-03-11 16:54:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000113000_57856000.pth +[2023-03-11 16:54:20,155][41544] Updated weights for policy 0, policy_version 113600 (0.0004) +[2023-03-11 16:54:23,385][41256] Fps is (10 sec: 9830.3, 60 sec: 9625.6, 300 sec: 9344.4). Total num frames: 58191872. Throughput: 0: 9673.1. Samples: 58190616. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:54:23,386][41256] Avg episode reward: [(0, '84.597')] +[2023-03-11 16:54:24,228][41544] Updated weights for policy 0, policy_version 113680 (0.0004) +[2023-03-11 16:54:28,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9625.6, 300 sec: 9344.4). Total num frames: 58241024. Throughput: 0: 9678.4. Samples: 58220192. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:54:28,386][41256] Avg episode reward: [(0, '85.345')] +[2023-03-11 16:54:28,597][41544] Updated weights for policy 0, policy_version 113760 (0.0005) +[2023-03-11 16:54:32,989][41544] Updated weights for policy 0, policy_version 113840 (0.0005) +[2023-03-11 16:54:33,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9344.4). Total num frames: 58286080. Throughput: 0: 9599.8. Samples: 58275432. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:54:33,386][41256] Avg episode reward: [(0, '81.570')] +[2023-03-11 16:54:33,440][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000113848_58290176.pth... 
+[2023-03-11 16:54:33,441][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000113272_57995264.pth +[2023-03-11 16:54:37,396][41544] Updated weights for policy 0, policy_version 113920 (0.0005) +[2023-03-11 16:54:38,385][41256] Fps is (10 sec: 9420.7, 60 sec: 9625.6, 300 sec: 9344.4). Total num frames: 58335232. Throughput: 0: 9585.6. Samples: 58331200. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:54:38,386][41256] Avg episode reward: [(0, '78.949')] +[2023-03-11 16:54:41,640][41544] Updated weights for policy 0, policy_version 114000 (0.0005) +[2023-03-11 16:54:43,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9358.3). Total num frames: 58384384. Throughput: 0: 9630.4. Samples: 58359880. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:54:43,386][41256] Avg episode reward: [(0, '80.277')] +[2023-03-11 16:54:46,068][41544] Updated weights for policy 0, policy_version 114080 (0.0005) +[2023-03-11 16:54:48,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9358.3). Total num frames: 58429440. Throughput: 0: 9636.5. Samples: 58416628. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:54:48,386][41256] Avg episode reward: [(0, '81.164')] +[2023-03-11 16:54:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000114120_58429440.pth... +[2023-03-11 16:54:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000113560_58142720.pth +[2023-03-11 16:54:50,357][41544] Updated weights for policy 0, policy_version 114160 (0.0005) +[2023-03-11 16:54:53,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9557.3, 300 sec: 9358.3). Total num frames: 58474496. Throughput: 0: 9611.7. Samples: 58473420. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:54:53,386][41256] Avg episode reward: [(0, '83.034')] +[2023-03-11 16:54:54,775][41544] Updated weights for policy 0, policy_version 114240 (0.0005) +[2023-03-11 16:54:58,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9557.3, 300 sec: 9372.2). Total num frames: 58523648. Throughput: 0: 9571.1. Samples: 58500564. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:54:58,386][41256] Avg episode reward: [(0, '82.509')] +[2023-03-11 16:54:59,205][41544] Updated weights for policy 0, policy_version 114320 (0.0005) +[2023-03-11 16:55:03,385][41256] Fps is (10 sec: 9420.7, 60 sec: 9557.3, 300 sec: 9372.2). Total num frames: 58568704. Throughput: 0: 9486.3. Samples: 58557380. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:55:03,386][41256] Avg episode reward: [(0, '80.758')] +[2023-03-11 16:55:03,419][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000114400_58572800.pth... +[2023-03-11 16:55:03,419][41544] Updated weights for policy 0, policy_version 114400 (0.0005) +[2023-03-11 16:55:03,420][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000113848_58290176.pth +[2023-03-11 16:55:07,562][41544] Updated weights for policy 0, policy_version 114480 (0.0004) +[2023-03-11 16:55:08,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9400.0). Total num frames: 58621952. Throughput: 0: 9464.7. Samples: 58616528. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:55:08,386][41256] Avg episode reward: [(0, '78.951')] +[2023-03-11 16:55:11,694][41544] Updated weights for policy 0, policy_version 114560 (0.0005) +[2023-03-11 16:55:13,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9557.3, 300 sec: 9400.0). Total num frames: 58667008. Throughput: 0: 9472.1. Samples: 58646436. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:55:13,386][41256] Avg episode reward: [(0, '75.771')] +[2023-03-11 16:55:16,120][41544] Updated weights for policy 0, policy_version 114640 (0.0005) +[2023-03-11 16:55:18,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9413.9). Total num frames: 58716160. Throughput: 0: 9499.7. Samples: 58702920. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:55:18,386][41256] Avg episode reward: [(0, '72.359')] +[2023-03-11 16:55:18,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000114680_58716160.pth... +[2023-03-11 16:55:18,393][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000114120_58429440.pth +[2023-03-11 16:55:20,505][41544] Updated weights for policy 0, policy_version 114720 (0.0005) +[2023-03-11 16:55:23,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9413.9). Total num frames: 58761216. Throughput: 0: 9476.9. Samples: 58757660. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:55:23,386][41256] Avg episode reward: [(0, '76.651')] +[2023-03-11 16:55:24,996][41544] Updated weights for policy 0, policy_version 114800 (0.0005) +[2023-03-11 16:55:28,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9420.8, 300 sec: 9413.9). Total num frames: 58806272. Throughput: 0: 9464.9. Samples: 58785800. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:55:28,386][41256] Avg episode reward: [(0, '79.808')] +[2023-03-11 16:55:29,470][41544] Updated weights for policy 0, policy_version 114880 (0.0005) +[2023-03-11 16:55:33,386][41256] Fps is (10 sec: 9011.1, 60 sec: 9420.8, 300 sec: 9413.9). Total num frames: 58851328. Throughput: 0: 9397.3. Samples: 58839508. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:55:33,386][41256] Avg episode reward: [(0, '82.229')] +[2023-03-11 16:55:33,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000114944_58851328.pth... +[2023-03-11 16:55:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000114400_58572800.pth +[2023-03-11 16:55:33,989][41544] Updated weights for policy 0, policy_version 114960 (0.0005) +[2023-03-11 16:55:38,386][41256] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9413.9). Total num frames: 58896384. Throughput: 0: 9383.3. Samples: 58895668. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:55:38,386][41256] Avg episode reward: [(0, '84.389')] +[2023-03-11 16:55:38,389][41544] Updated weights for policy 0, policy_version 115040 (0.0005) +[2023-03-11 16:55:42,829][41544] Updated weights for policy 0, policy_version 115120 (0.0005) +[2023-03-11 16:55:43,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9427.7). Total num frames: 58945536. Throughput: 0: 9398.2. Samples: 58923484. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:55:43,386][41256] Avg episode reward: [(0, '82.707')] +[2023-03-11 16:55:47,237][41544] Updated weights for policy 0, policy_version 115200 (0.0005) +[2023-03-11 16:55:48,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9427.7). Total num frames: 58990592. Throughput: 0: 9360.3. Samples: 58978596. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:55:48,386][41256] Avg episode reward: [(0, '81.111')] +[2023-03-11 16:55:48,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000115216_58990592.pth... 
+[2023-03-11 16:55:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000114680_58716160.pth +[2023-03-11 16:55:51,777][41544] Updated weights for policy 0, policy_version 115280 (0.0005) +[2023-03-11 16:55:53,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9427.7). Total num frames: 59035648. Throughput: 0: 9260.7. Samples: 59033260. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:55:53,386][41256] Avg episode reward: [(0, '84.113')] +[2023-03-11 16:55:56,292][41544] Updated weights for policy 0, policy_version 115360 (0.0005) +[2023-03-11 16:55:58,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9284.3, 300 sec: 9427.7). Total num frames: 59080704. Throughput: 0: 9195.5. Samples: 59060236. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 16:55:58,386][41256] Avg episode reward: [(0, '83.575')] +[2023-03-11 16:56:00,708][41544] Updated weights for policy 0, policy_version 115440 (0.0005) +[2023-03-11 16:56:03,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9441.6). Total num frames: 59129856. Throughput: 0: 9170.1. Samples: 59115576. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:56:03,386][41256] Avg episode reward: [(0, '81.123')] +[2023-03-11 16:56:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000115488_59129856.pth... +[2023-03-11 16:56:03,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000114944_58851328.pth +[2023-03-11 16:56:05,132][41544] Updated weights for policy 0, policy_version 115520 (0.0005) +[2023-03-11 16:56:08,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9441.6). Total num frames: 59174912. Throughput: 0: 9182.8. Samples: 59170888. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:56:08,386][41256] Avg episode reward: [(0, '81.940')] +[2023-03-11 16:56:09,617][41544] Updated weights for policy 0, policy_version 115600 (0.0005) +[2023-03-11 16:56:13,386][41256] Fps is (10 sec: 9011.2, 60 sec: 9216.0, 300 sec: 9427.7). Total num frames: 59219968. Throughput: 0: 9171.7. Samples: 59198528. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:56:13,386][41256] Avg episode reward: [(0, '87.980')] +[2023-03-11 16:56:14,088][41544] Updated weights for policy 0, policy_version 115680 (0.0005) +[2023-03-11 16:56:18,386][41256] Fps is (10 sec: 9011.2, 60 sec: 9147.7, 300 sec: 9427.7). Total num frames: 59265024. Throughput: 0: 9201.4. Samples: 59253572. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:56:18,386][41256] Avg episode reward: [(0, '85.293')] +[2023-03-11 16:56:18,456][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000115760_59269120.pth... +[2023-03-11 16:56:18,457][41544] Updated weights for policy 0, policy_version 115760 (0.0005) +[2023-03-11 16:56:18,459][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000115216_58990592.pth +[2023-03-11 16:56:22,604][41544] Updated weights for policy 0, policy_version 115840 (0.0005) +[2023-03-11 16:56:23,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9441.6). Total num frames: 59314176. Throughput: 0: 9259.0. Samples: 59312324. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:56:23,396][41256] Avg episode reward: [(0, '84.167')] +[2023-03-11 16:56:26,676][41544] Updated weights for policy 0, policy_version 115920 (0.0004) +[2023-03-11 16:56:28,385][41256] Fps is (10 sec: 10240.1, 60 sec: 9352.5, 300 sec: 9469.4). Total num frames: 59367424. Throughput: 0: 9320.6. Samples: 59342912. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:56:28,396][41256] Avg episode reward: [(0, '86.291')] +[2023-03-11 16:56:30,808][41544] Updated weights for policy 0, policy_version 116000 (0.0005) +[2023-03-11 16:56:33,386][41256] Fps is (10 sec: 10240.0, 60 sec: 9420.8, 300 sec: 9483.3). Total num frames: 59416576. Throughput: 0: 9416.9. Samples: 59402356. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:56:33,397][41256] Avg episode reward: [(0, '83.502')] +[2023-03-11 16:56:33,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000116048_59416576.pth... +[2023-03-11 16:56:33,403][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000115488_59129856.pth +[2023-03-11 16:56:34,944][41544] Updated weights for policy 0, policy_version 116080 (0.0005) +[2023-03-11 16:56:38,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9489.1, 300 sec: 9497.2). Total num frames: 59465728. Throughput: 0: 9519.4. Samples: 59461632. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:56:38,386][41256] Avg episode reward: [(0, '84.813')] +[2023-03-11 16:56:39,103][41544] Updated weights for policy 0, policy_version 116160 (0.0004) +[2023-03-11 16:56:43,223][41544] Updated weights for policy 0, policy_version 116240 (0.0004) +[2023-03-11 16:56:43,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9489.1, 300 sec: 9511.1). Total num frames: 59514880. Throughput: 0: 9568.4. Samples: 59490816. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:56:43,386][41256] Avg episode reward: [(0, '86.416')] +[2023-03-11 16:56:47,480][41544] Updated weights for policy 0, policy_version 116320 (0.0005) +[2023-03-11 16:56:48,386][41256] Fps is (10 sec: 9830.3, 60 sec: 9557.3, 300 sec: 9511.0). Total num frames: 59564032. Throughput: 0: 9645.3. Samples: 59549616. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:56:48,386][41256] Avg episode reward: [(0, '87.369')] +[2023-03-11 16:56:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000116336_59564032.pth... +[2023-03-11 16:56:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000115760_59269120.pth +[2023-03-11 16:56:51,711][41544] Updated weights for policy 0, policy_version 116400 (0.0005) +[2023-03-11 16:56:53,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9524.9). Total num frames: 59613184. Throughput: 0: 9709.8. Samples: 59607828. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:56:53,386][41256] Avg episode reward: [(0, '85.358')] +[2023-03-11 16:56:55,889][41544] Updated weights for policy 0, policy_version 116480 (0.0004) +[2023-03-11 16:56:58,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9524.9). Total num frames: 59658240. Throughput: 0: 9757.4. Samples: 59637612. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:56:58,386][41256] Avg episode reward: [(0, '83.256')] +[2023-03-11 16:57:00,127][41544] Updated weights for policy 0, policy_version 116560 (0.0005) +[2023-03-11 16:57:03,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9524.9). Total num frames: 59707392. Throughput: 0: 9808.1. Samples: 59694936. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 16:57:03,386][41256] Avg episode reward: [(0, '82.934')] +[2023-03-11 16:57:03,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000116616_59707392.pth... 
+[2023-03-11 16:57:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000116048_59416576.pth +[2023-03-11 16:57:04,473][41544] Updated weights for policy 0, policy_version 116640 (0.0005) +[2023-03-11 16:57:08,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9625.6, 300 sec: 9524.9). Total num frames: 59752448. Throughput: 0: 9715.7. Samples: 59749528. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 16:57:08,386][41256] Avg episode reward: [(0, '84.768')] +[2023-03-11 16:57:09,100][41544] Updated weights for policy 0, policy_version 116720 (0.0005) +[2023-03-11 16:57:13,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9625.6, 300 sec: 9511.1). Total num frames: 59797504. Throughput: 0: 9643.0. Samples: 59776848. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 16:57:13,386][41256] Avg episode reward: [(0, '87.685')] +[2023-03-11 16:57:13,584][41544] Updated weights for policy 0, policy_version 116800 (0.0005) +[2023-03-11 16:57:18,121][41544] Updated weights for policy 0, policy_version 116880 (0.0005) +[2023-03-11 16:57:18,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9625.6, 300 sec: 9511.0). Total num frames: 59842560. Throughput: 0: 9528.1. Samples: 59831120. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 16:57:18,386][41256] Avg episode reward: [(0, '86.565')] +[2023-03-11 16:57:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000116880_59842560.pth... +[2023-03-11 16:57:18,390][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000116336_59564032.pth +[2023-03-11 16:57:22,625][41544] Updated weights for policy 0, policy_version 116960 (0.0005) +[2023-03-11 16:57:23,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9557.3, 300 sec: 9497.2). Total num frames: 59887616. Throughput: 0: 9422.1. Samples: 59885628. 
Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 16:57:23,386][41256] Avg episode reward: [(0, '84.967')] +[2023-03-11 16:57:26,756][41544] Updated weights for policy 0, policy_version 117040 (0.0005) +[2023-03-11 16:57:28,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9497.2). Total num frames: 59936768. Throughput: 0: 9436.0. Samples: 59915436. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 16:57:28,386][41256] Avg episode reward: [(0, '81.615')] +[2023-03-11 16:57:31,057][41544] Updated weights for policy 0, policy_version 117120 (0.0005) +[2023-03-11 16:57:33,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9489.1, 300 sec: 9497.2). Total num frames: 59985920. Throughput: 0: 9402.1. Samples: 59972712. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 16:57:33,386][41256] Avg episode reward: [(0, '80.753')] +[2023-03-11 16:57:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000117160_59985920.pth... +[2023-03-11 16:57:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000116616_59707392.pth +[2023-03-11 16:57:35,544][41544] Updated weights for policy 0, policy_version 117200 (0.0005) +[2023-03-11 16:57:38,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9483.3). Total num frames: 60030976. Throughput: 0: 9313.9. Samples: 60026952. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 16:57:38,386][41256] Avg episode reward: [(0, '81.769')] +[2023-03-11 16:57:40,056][41544] Updated weights for policy 0, policy_version 117280 (0.0005) +[2023-03-11 16:57:43,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9483.3). Total num frames: 60080128. Throughput: 0: 9280.1. Samples: 60055216. 
Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 16:57:43,386][41256] Avg episode reward: [(0, '83.965')] +[2023-03-11 16:57:44,230][41544] Updated weights for policy 0, policy_version 117360 (0.0004) +[2023-03-11 16:57:48,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9352.5, 300 sec: 9483.3). Total num frames: 60125184. Throughput: 0: 9295.1. Samples: 60113216. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 16:57:48,386][41256] Avg episode reward: [(0, '88.184')] +[2023-03-11 16:57:48,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000117432_60125184.pth... +[2023-03-11 16:57:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000116880_59842560.pth +[2023-03-11 16:57:48,467][41544] Updated weights for policy 0, policy_version 117440 (0.0005) +[2023-03-11 16:57:52,975][41544] Updated weights for policy 0, policy_version 117520 (0.0005) +[2023-03-11 16:57:53,385][41256] Fps is (10 sec: 9011.1, 60 sec: 9284.3, 300 sec: 9483.3). Total num frames: 60170240. Throughput: 0: 9329.5. Samples: 60169356. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 16:57:53,386][41256] Avg episode reward: [(0, '83.220')] +[2023-03-11 16:57:57,436][41544] Updated weights for policy 0, policy_version 117600 (0.0005) +[2023-03-11 16:57:58,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9497.2). Total num frames: 60219392. Throughput: 0: 9325.8. Samples: 60196512. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 16:57:58,386][41256] Avg episode reward: [(0, '87.093')] +[2023-03-11 16:58:01,957][41544] Updated weights for policy 0, policy_version 117680 (0.0005) +[2023-03-11 16:58:03,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9483.3). Total num frames: 60264448. Throughput: 0: 9340.4. Samples: 60251440. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:58:03,386][41256] Avg episode reward: [(0, '86.227')] +[2023-03-11 16:58:03,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000117704_60264448.pth... +[2023-03-11 16:58:03,393][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000117160_59985920.pth +[2023-03-11 16:58:06,426][41544] Updated weights for policy 0, policy_version 117760 (0.0005) +[2023-03-11 16:58:08,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9469.4). Total num frames: 60309504. Throughput: 0: 9343.6. Samples: 60306088. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:58:08,386][41256] Avg episode reward: [(0, '85.414')] +[2023-03-11 16:58:10,929][41544] Updated weights for policy 0, policy_version 117840 (0.0005) +[2023-03-11 16:58:13,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9455.5). Total num frames: 60354560. Throughput: 0: 9290.1. Samples: 60333492. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:58:13,386][41256] Avg episode reward: [(0, '85.568')] +[2023-03-11 16:58:15,339][41544] Updated weights for policy 0, policy_version 117920 (0.0005) +[2023-03-11 16:58:18,386][41256] Fps is (10 sec: 9011.1, 60 sec: 9284.2, 300 sec: 9441.6). Total num frames: 60399616. Throughput: 0: 9239.8. Samples: 60388504. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:58:18,386][41256] Avg episode reward: [(0, '86.135')] +[2023-03-11 16:58:18,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000117968_60399616.pth... 
+[2023-03-11 16:58:18,393][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000117432_60125184.pth +[2023-03-11 16:58:19,794][41544] Updated weights for policy 0, policy_version 118000 (0.0005) +[2023-03-11 16:58:23,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9284.3, 300 sec: 9427.7). Total num frames: 60444672. Throughput: 0: 9275.6. Samples: 60444352. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:58:23,386][41256] Avg episode reward: [(0, '85.604')] +[2023-03-11 16:58:24,284][41544] Updated weights for policy 0, policy_version 118080 (0.0005) +[2023-03-11 16:58:28,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9284.3, 300 sec: 9441.6). Total num frames: 60493824. Throughput: 0: 9253.2. Samples: 60471612. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:58:28,386][41256] Avg episode reward: [(0, '85.489')] +[2023-03-11 16:58:28,670][41544] Updated weights for policy 0, policy_version 118160 (0.0005) +[2023-03-11 16:58:32,939][41544] Updated weights for policy 0, policy_version 118240 (0.0004) +[2023-03-11 16:58:33,385][41256] Fps is (10 sec: 9830.3, 60 sec: 9284.3, 300 sec: 9441.6). Total num frames: 60542976. Throughput: 0: 9211.9. Samples: 60527752. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:58:33,386][41256] Avg episode reward: [(0, '83.634')] +[2023-03-11 16:58:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000118248_60542976.pth... +[2023-03-11 16:58:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000117704_60264448.pth +[2023-03-11 16:58:37,176][41544] Updated weights for policy 0, policy_version 118320 (0.0005) +[2023-03-11 16:58:38,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9427.7). Total num frames: 60588032. Throughput: 0: 9265.8. Samples: 60586316. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:58:38,386][41256] Avg episode reward: [(0, '83.788')] +[2023-03-11 16:58:41,712][41544] Updated weights for policy 0, policy_version 118400 (0.0005) +[2023-03-11 16:58:43,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9216.0, 300 sec: 9413.9). Total num frames: 60633088. Throughput: 0: 9251.4. Samples: 60612824. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:58:43,386][41256] Avg episode reward: [(0, '87.370')] +[2023-03-11 16:58:45,931][41544] Updated weights for policy 0, policy_version 118480 (0.0004) +[2023-03-11 16:58:48,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9427.7). Total num frames: 60682240. Throughput: 0: 9307.6. Samples: 60670280. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:58:48,386][41256] Avg episode reward: [(0, '83.482')] +[2023-03-11 16:58:48,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000118520_60682240.pth... +[2023-03-11 16:58:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000117968_60399616.pth +[2023-03-11 16:58:50,312][41544] Updated weights for policy 0, policy_version 118560 (0.0005) +[2023-03-11 16:58:53,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9413.9). Total num frames: 60727296. Throughput: 0: 9318.7. Samples: 60725428. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:58:53,386][41256] Avg episode reward: [(0, '87.123')] +[2023-03-11 16:58:54,889][41544] Updated weights for policy 0, policy_version 118640 (0.0005) +[2023-03-11 16:58:58,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9216.0, 300 sec: 9413.9). Total num frames: 60772352. Throughput: 0: 9298.9. Samples: 60751944. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:58:58,386][41256] Avg episode reward: [(0, '91.579')] +[2023-03-11 16:58:59,435][41544] Updated weights for policy 0, policy_version 118720 (0.0005) +[2023-03-11 16:59:03,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9216.0, 300 sec: 9400.0). Total num frames: 60817408. Throughput: 0: 9285.3. Samples: 60806340. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 16:59:03,386][41256] Avg episode reward: [(0, '84.592')] +[2023-03-11 16:59:03,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000118784_60817408.pth... +[2023-03-11 16:59:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000118248_60542976.pth +[2023-03-11 16:59:03,996][41544] Updated weights for policy 0, policy_version 118800 (0.0005) +[2023-03-11 16:59:08,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9216.0, 300 sec: 9386.1). Total num frames: 60862464. Throughput: 0: 9255.0. Samples: 60860828. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:59:08,386][41256] Avg episode reward: [(0, '83.683')] +[2023-03-11 16:59:08,474][41544] Updated weights for policy 0, policy_version 118880 (0.0005) +[2023-03-11 16:59:12,927][41544] Updated weights for policy 0, policy_version 118960 (0.0005) +[2023-03-11 16:59:13,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9216.0, 300 sec: 9372.2). Total num frames: 60907520. Throughput: 0: 9257.2. Samples: 60888184. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:59:13,386][41256] Avg episode reward: [(0, '88.716')] +[2023-03-11 16:59:17,416][41544] Updated weights for policy 0, policy_version 119040 (0.0005) +[2023-03-11 16:59:18,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9372.2). Total num frames: 60956672. Throughput: 0: 9238.9. Samples: 60943504. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:59:18,386][41256] Avg episode reward: [(0, '86.189')] +[2023-03-11 16:59:18,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000119056_60956672.pth... +[2023-03-11 16:59:18,393][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000118520_60682240.pth +[2023-03-11 16:59:21,868][41544] Updated weights for policy 0, policy_version 119120 (0.0005) +[2023-03-11 16:59:23,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9358.3). Total num frames: 61001728. Throughput: 0: 9151.0. Samples: 60998112. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:59:23,386][41256] Avg episode reward: [(0, '86.697')] +[2023-03-11 16:59:26,146][41544] Updated weights for policy 0, policy_version 119200 (0.0005) +[2023-03-11 16:59:28,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9372.2). Total num frames: 61050880. Throughput: 0: 9204.0. Samples: 61027004. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:59:28,386][41256] Avg episode reward: [(0, '82.480')] +[2023-03-11 16:59:30,281][41544] Updated weights for policy 0, policy_version 119280 (0.0004) +[2023-03-11 16:59:33,385][41256] Fps is (10 sec: 9830.3, 60 sec: 9284.3, 300 sec: 9372.2). Total num frames: 61100032. Throughput: 0: 9267.6. Samples: 61087324. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:59:33,386][41256] Avg episode reward: [(0, '82.624')] +[2023-03-11 16:59:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000119336_61100032.pth... 
+[2023-03-11 16:59:33,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000118784_60817408.pth +[2023-03-11 16:59:34,421][41544] Updated weights for policy 0, policy_version 119360 (0.0005) +[2023-03-11 16:59:37,803][41500] Early stopping after 2 epochs (8 sgd steps), loss delta 0.0000005 +[2023-03-11 16:59:38,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9352.5, 300 sec: 9372.2). Total num frames: 61149184. Throughput: 0: 9327.4. Samples: 61145160. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:59:38,386][41256] Avg episode reward: [(0, '80.594')] +[2023-03-11 16:59:38,706][41544] Updated weights for policy 0, policy_version 119440 (0.0005) +[2023-03-11 16:59:43,138][41544] Updated weights for policy 0, policy_version 119520 (0.0004) +[2023-03-11 16:59:43,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9352.5, 300 sec: 9372.2). Total num frames: 61194240. Throughput: 0: 9354.8. Samples: 61172912. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:59:43,386][41256] Avg episode reward: [(0, '81.987')] +[2023-03-11 16:59:47,516][41544] Updated weights for policy 0, policy_version 119600 (0.0005) +[2023-03-11 16:59:48,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9372.2). Total num frames: 61239296. Throughput: 0: 9391.0. Samples: 61228936. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:59:48,386][41256] Avg episode reward: [(0, '83.213')] +[2023-03-11 16:59:48,419][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000119616_61243392.pth... +[2023-03-11 16:59:48,420][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000119056_60956672.pth +[2023-03-11 16:59:52,087][41544] Updated weights for policy 0, policy_version 119680 (0.0005) +[2023-03-11 16:59:53,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9358.3). 
Total num frames: 61284352. Throughput: 0: 9388.1. Samples: 61283292. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:59:53,386][41256] Avg episode reward: [(0, '80.932')] +[2023-03-11 16:59:56,576][41544] Updated weights for policy 0, policy_version 119760 (0.0005) +[2023-03-11 16:59:58,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9358.3). Total num frames: 61329408. Throughput: 0: 9384.1. Samples: 61310468. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 16:59:58,386][41256] Avg episode reward: [(0, '78.436')] +[2023-03-11 17:00:01,145][41544] Updated weights for policy 0, policy_version 119840 (0.0005) +[2023-03-11 17:00:03,386][41256] Fps is (10 sec: 9011.1, 60 sec: 9284.3, 300 sec: 9330.5). Total num frames: 61374464. Throughput: 0: 9347.8. Samples: 61364156. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:00:03,386][41256] Avg episode reward: [(0, '82.688')] +[2023-03-11 17:00:03,413][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000119880_61378560.pth... +[2023-03-11 17:00:03,415][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000119336_61100032.pth +[2023-03-11 17:00:05,667][41544] Updated weights for policy 0, policy_version 119920 (0.0005) +[2023-03-11 17:00:08,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9330.5). Total num frames: 61419520. Throughput: 0: 9361.0. Samples: 61419356. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:00:08,386][41256] Avg episode reward: [(0, '85.790')] +[2023-03-11 17:00:10,177][41544] Updated weights for policy 0, policy_version 120000 (0.0005) +[2023-03-11 17:00:13,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9330.5). Total num frames: 61468672. Throughput: 0: 9318.4. Samples: 61446332. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:00:13,386][41256] Avg episode reward: [(0, '84.071')] +[2023-03-11 17:00:14,670][41544] Updated weights for policy 0, policy_version 120080 (0.0005) +[2023-03-11 17:00:18,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9284.3, 300 sec: 9330.5). Total num frames: 61513728. Throughput: 0: 9188.3. Samples: 61500800. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:00:18,386][41256] Avg episode reward: [(0, '84.244')] +[2023-03-11 17:00:18,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000120144_61513728.pth... +[2023-03-11 17:00:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000119616_61243392.pth +[2023-03-11 17:00:19,206][41544] Updated weights for policy 0, policy_version 120160 (0.0005) +[2023-03-11 17:00:23,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9330.5). Total num frames: 61558784. Throughput: 0: 9100.6. Samples: 61554688. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:00:23,386][41256] Avg episode reward: [(0, '85.871')] +[2023-03-11 17:00:23,797][41544] Updated weights for policy 0, policy_version 120240 (0.0005) +[2023-03-11 17:00:28,361][41544] Updated weights for policy 0, policy_version 120320 (0.0005) +[2023-03-11 17:00:28,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9216.0, 300 sec: 9330.6). Total num frames: 61603840. Throughput: 0: 9074.2. Samples: 61581252. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:00:28,386][41256] Avg episode reward: [(0, '85.264')] +[2023-03-11 17:00:32,831][41544] Updated weights for policy 0, policy_version 120400 (0.0005) +[2023-03-11 17:00:33,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9147.7, 300 sec: 9330.5). Total num frames: 61648896. Throughput: 0: 9037.1. Samples: 61635604. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:00:33,386][41256] Avg episode reward: [(0, '80.341')] +[2023-03-11 17:00:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000120408_61648896.pth... +[2023-03-11 17:00:33,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000119880_61378560.pth +[2023-03-11 17:00:37,244][41544] Updated weights for policy 0, policy_version 120480 (0.0005) +[2023-03-11 17:00:38,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9079.5, 300 sec: 9316.7). Total num frames: 61693952. Throughput: 0: 9062.2. Samples: 61691092. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:00:38,386][41256] Avg episode reward: [(0, '85.373')] +[2023-03-11 17:00:41,625][41544] Updated weights for policy 0, policy_version 120560 (0.0005) +[2023-03-11 17:00:43,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9147.7, 300 sec: 9330.6). Total num frames: 61743104. Throughput: 0: 9074.0. Samples: 61718796. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:00:43,386][41256] Avg episode reward: [(0, '86.452')] +[2023-03-11 17:00:46,109][41544] Updated weights for policy 0, policy_version 120640 (0.0005) +[2023-03-11 17:00:48,385][41256] Fps is (10 sec: 9420.7, 60 sec: 9147.7, 300 sec: 9330.5). Total num frames: 61788160. Throughput: 0: 9135.1. Samples: 61775236. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:00:48,386][41256] Avg episode reward: [(0, '82.506')] +[2023-03-11 17:00:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000120680_61788160.pth... 
+[2023-03-11 17:00:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000120144_61513728.pth +[2023-03-11 17:00:50,468][41544] Updated weights for policy 0, policy_version 120720 (0.0005) +[2023-03-11 17:00:53,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9147.7, 300 sec: 9330.5). Total num frames: 61833216. Throughput: 0: 9144.8. Samples: 61830872. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:00:53,386][41256] Avg episode reward: [(0, '82.611')] +[2023-03-11 17:00:54,911][41544] Updated weights for policy 0, policy_version 120800 (0.0005) +[2023-03-11 17:00:58,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9147.7, 300 sec: 9316.7). Total num frames: 61878272. Throughput: 0: 9145.2. Samples: 61857864. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:00:58,386][41256] Avg episode reward: [(0, '85.041')] +[2023-03-11 17:00:59,489][41544] Updated weights for policy 0, policy_version 120880 (0.0005) +[2023-03-11 17:01:03,386][41256] Fps is (10 sec: 9011.2, 60 sec: 9147.7, 300 sec: 9316.7). Total num frames: 61923328. Throughput: 0: 9130.6. Samples: 61911676. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:01:03,386][41256] Avg episode reward: [(0, '86.065')] +[2023-03-11 17:01:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000120944_61923328.pth... +[2023-03-11 17:01:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000120408_61648896.pth +[2023-03-11 17:01:04,083][41544] Updated weights for policy 0, policy_version 120960 (0.0005) +[2023-03-11 17:01:08,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9147.7, 300 sec: 9316.7). Total num frames: 61968384. Throughput: 0: 9118.4. Samples: 61965016. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:01:08,386][41256] Avg episode reward: [(0, '88.269')] +[2023-03-11 17:01:08,677][41544] Updated weights for policy 0, policy_version 121040 (0.0005) +[2023-03-11 17:01:13,263][41544] Updated weights for policy 0, policy_version 121120 (0.0005) +[2023-03-11 17:01:13,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9079.5, 300 sec: 9316.7). Total num frames: 62013440. Throughput: 0: 9123.3. Samples: 61991800. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:01:13,386][41256] Avg episode reward: [(0, '87.859')] +[2023-03-11 17:01:17,793][41544] Updated weights for policy 0, policy_version 121200 (0.0005) +[2023-03-11 17:01:18,386][41256] Fps is (10 sec: 9011.2, 60 sec: 9079.5, 300 sec: 9302.8). Total num frames: 62058496. Throughput: 0: 9124.5. Samples: 62046208. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:01:18,386][41256] Avg episode reward: [(0, '87.401')] +[2023-03-11 17:01:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000121208_62058496.pth... +[2023-03-11 17:01:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000120680_61788160.pth +[2023-03-11 17:01:22,319][41544] Updated weights for policy 0, policy_version 121280 (0.0005) +[2023-03-11 17:01:23,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9079.5, 300 sec: 9275.0). Total num frames: 62103552. Throughput: 0: 9081.4. Samples: 62099756. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:01:23,386][41256] Avg episode reward: [(0, '85.924')] +[2023-03-11 17:01:26,912][41544] Updated weights for policy 0, policy_version 121360 (0.0005) +[2023-03-11 17:01:28,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9079.5, 300 sec: 9261.1). Total num frames: 62148608. Throughput: 0: 9072.9. Samples: 62127076. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:01:28,386][41256] Avg episode reward: [(0, '89.172')] +[2023-03-11 17:01:31,527][41544] Updated weights for policy 0, policy_version 121440 (0.0005) +[2023-03-11 17:01:33,386][41256] Fps is (10 sec: 9011.2, 60 sec: 9079.5, 300 sec: 9247.2). Total num frames: 62193664. Throughput: 0: 9002.6. Samples: 62180352. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:01:33,386][41256] Avg episode reward: [(0, '89.223')] +[2023-03-11 17:01:33,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000121472_62193664.pth... +[2023-03-11 17:01:33,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000120944_61923328.pth +[2023-03-11 17:01:36,040][41544] Updated weights for policy 0, policy_version 121520 (0.0005) +[2023-03-11 17:01:38,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9079.5, 300 sec: 9233.4). Total num frames: 62238720. Throughput: 0: 8973.7. Samples: 62234688. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:01:38,386][41256] Avg episode reward: [(0, '80.695')] +[2023-03-11 17:01:40,417][41544] Updated weights for policy 0, policy_version 121600 (0.0005) +[2023-03-11 17:01:43,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9011.2, 300 sec: 9219.5). Total num frames: 62283776. Throughput: 0: 9009.8. Samples: 62263304. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:01:43,386][41256] Avg episode reward: [(0, '79.613')] +[2023-03-11 17:01:44,930][41544] Updated weights for policy 0, policy_version 121680 (0.0005) +[2023-03-11 17:01:48,386][41256] Fps is (10 sec: 9011.1, 60 sec: 9011.2, 300 sec: 9205.6). Total num frames: 62328832. Throughput: 0: 9048.1. Samples: 62318840. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:01:48,386][41256] Avg episode reward: [(0, '86.055')] +[2023-03-11 17:01:48,399][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000121744_62332928.pth... +[2023-03-11 17:01:48,401][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000121208_62058496.pth +[2023-03-11 17:01:49,284][41544] Updated weights for policy 0, policy_version 121760 (0.0005) +[2023-03-11 17:01:53,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9079.5, 300 sec: 9219.5). Total num frames: 62377984. Throughput: 0: 9113.3. Samples: 62375116. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:01:53,386][41256] Avg episode reward: [(0, '86.224')] +[2023-03-11 17:01:53,639][41544] Updated weights for policy 0, policy_version 121840 (0.0005) +[2023-03-11 17:01:58,197][41544] Updated weights for policy 0, policy_version 121920 (0.0005) +[2023-03-11 17:01:58,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9079.5, 300 sec: 9205.6). Total num frames: 62423040. Throughput: 0: 9124.4. Samples: 62402396. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:01:58,386][41256] Avg episode reward: [(0, '84.243')] +[2023-03-11 17:02:02,653][41544] Updated weights for policy 0, policy_version 122000 (0.0005) +[2023-03-11 17:02:03,386][41256] Fps is (10 sec: 9011.2, 60 sec: 9079.5, 300 sec: 9205.6). Total num frames: 62468096. Throughput: 0: 9104.5. Samples: 62455912. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:02:03,386][41256] Avg episode reward: [(0, '86.949')] +[2023-03-11 17:02:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000122008_62468096.pth... 
+[2023-03-11 17:02:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000121472_62193664.pth +[2023-03-11 17:02:07,201][41544] Updated weights for policy 0, policy_version 122080 (0.0005) +[2023-03-11 17:02:08,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9079.5, 300 sec: 9205.6). Total num frames: 62513152. Throughput: 0: 9131.6. Samples: 62510676. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:02:08,386][41256] Avg episode reward: [(0, '83.985')] +[2023-03-11 17:02:11,740][41544] Updated weights for policy 0, policy_version 122160 (0.0005) +[2023-03-11 17:02:13,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9079.5, 300 sec: 9205.6). Total num frames: 62558208. Throughput: 0: 9125.9. Samples: 62537740. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 17:02:13,386][41256] Avg episode reward: [(0, '83.819')] +[2023-03-11 17:02:16,176][41544] Updated weights for policy 0, policy_version 122240 (0.0005) +[2023-03-11 17:02:18,386][41256] Fps is (10 sec: 9011.2, 60 sec: 9079.5, 300 sec: 9205.6). Total num frames: 62603264. Throughput: 0: 9170.5. Samples: 62593024. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 17:02:18,386][41256] Avg episode reward: [(0, '84.245')] +[2023-03-11 17:02:18,410][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000122280_62607360.pth... +[2023-03-11 17:02:18,412][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000121744_62332928.pth +[2023-03-11 17:02:20,736][41544] Updated weights for policy 0, policy_version 122320 (0.0005) +[2023-03-11 17:02:23,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9079.5, 300 sec: 9191.7). Total num frames: 62648320. Throughput: 0: 9170.6. Samples: 62647364. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 17:02:23,386][41256] Avg episode reward: [(0, '79.842')] +[2023-03-11 17:02:25,277][41544] Updated weights for policy 0, policy_version 122400 (0.0005) +[2023-03-11 17:02:28,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9079.5, 300 sec: 9177.8). Total num frames: 62693376. Throughput: 0: 9124.0. Samples: 62673884. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 17:02:28,386][41256] Avg episode reward: [(0, '83.582')] +[2023-03-11 17:02:29,743][41544] Updated weights for policy 0, policy_version 122480 (0.0005) +[2023-03-11 17:02:33,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9147.7, 300 sec: 9191.7). Total num frames: 62742528. Throughput: 0: 9116.5. Samples: 62729084. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 17:02:33,386][41256] Avg episode reward: [(0, '82.616')] +[2023-03-11 17:02:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000122544_62742528.pth... +[2023-03-11 17:02:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000122008_62468096.pth +[2023-03-11 17:02:34,224][41544] Updated weights for policy 0, policy_version 122560 (0.0005) +[2023-03-11 17:02:38,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9147.7, 300 sec: 9177.8). Total num frames: 62787584. Throughput: 0: 9076.5. Samples: 62783560. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 17:02:38,386][41256] Avg episode reward: [(0, '82.562')] +[2023-03-11 17:02:38,704][41544] Updated weights for policy 0, policy_version 122640 (0.0005) +[2023-03-11 17:02:43,148][41544] Updated weights for policy 0, policy_version 122720 (0.0005) +[2023-03-11 17:02:43,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9147.7, 300 sec: 9177.8). Total num frames: 62832640. Throughput: 0: 9100.6. Samples: 62811924. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 17:02:43,386][41256] Avg episode reward: [(0, '81.390')] +[2023-03-11 17:02:47,584][41544] Updated weights for policy 0, policy_version 122800 (0.0005) +[2023-03-11 17:02:48,386][41256] Fps is (10 sec: 9011.2, 60 sec: 9147.7, 300 sec: 9177.8). Total num frames: 62877696. Throughput: 0: 9134.7. Samples: 62866972. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 17:02:48,386][41256] Avg episode reward: [(0, '82.788')] +[2023-03-11 17:02:48,443][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000122816_62881792.pth... +[2023-03-11 17:02:48,446][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000122280_62607360.pth +[2023-03-11 17:02:51,948][41544] Updated weights for policy 0, policy_version 122880 (0.0005) +[2023-03-11 17:02:53,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9147.7, 300 sec: 9177.8). Total num frames: 62926848. Throughput: 0: 9158.7. Samples: 62922816. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 17:02:53,396][41256] Avg episode reward: [(0, '82.392')] +[2023-03-11 17:02:56,444][41544] Updated weights for policy 0, policy_version 122960 (0.0005) +[2023-03-11 17:02:58,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9147.7, 300 sec: 9177.8). Total num frames: 62971904. Throughput: 0: 9169.5. Samples: 62950368. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 17:02:58,396][41256] Avg episode reward: [(0, '82.018')] +[2023-03-11 17:03:00,910][41544] Updated weights for policy 0, policy_version 123040 (0.0005) +[2023-03-11 17:03:03,386][41256] Fps is (10 sec: 9011.1, 60 sec: 9147.7, 300 sec: 9177.8). Total num frames: 63016960. Throughput: 0: 9156.7. Samples: 63005076. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 17:03:03,397][41256] Avg episode reward: [(0, '81.937')] +[2023-03-11 17:03:03,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000123080_63016960.pth... +[2023-03-11 17:03:03,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000122544_62742528.pth +[2023-03-11 17:03:05,324][41544] Updated weights for policy 0, policy_version 123120 (0.0005) +[2023-03-11 17:03:08,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9147.7, 300 sec: 9177.8). Total num frames: 63062016. Throughput: 0: 9174.0. Samples: 63060196. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 17:03:08,396][41256] Avg episode reward: [(0, '84.849')] +[2023-03-11 17:03:09,773][41544] Updated weights for policy 0, policy_version 123200 (0.0005) +[2023-03-11 17:03:13,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9191.7). Total num frames: 63111168. Throughput: 0: 9209.1. Samples: 63088292. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:03:13,396][41256] Avg episode reward: [(0, '81.582')] +[2023-03-11 17:03:13,903][41544] Updated weights for policy 0, policy_version 123280 (0.0005) +[2023-03-11 17:03:17,967][41544] Updated weights for policy 0, policy_version 123360 (0.0005) +[2023-03-11 17:03:18,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9284.3, 300 sec: 9205.6). Total num frames: 63160320. Throughput: 0: 9330.2. Samples: 63148944. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:03:18,396][41256] Avg episode reward: [(0, '84.951')] +[2023-03-11 17:03:18,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000123368_63164416.pth... 
+[2023-03-11 17:03:18,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000122816_62881792.pth +[2023-03-11 17:03:22,192][41544] Updated weights for policy 0, policy_version 123440 (0.0005) +[2023-03-11 17:03:23,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9352.5, 300 sec: 9205.6). Total num frames: 63209472. Throughput: 0: 9423.3. Samples: 63207608. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:03:23,396][41256] Avg episode reward: [(0, '86.631')] +[2023-03-11 17:03:26,403][41544] Updated weights for policy 0, policy_version 123520 (0.0005) +[2023-03-11 17:03:28,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9489.1, 300 sec: 9219.5). Total num frames: 63262720. Throughput: 0: 9430.8. Samples: 63236308. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:03:28,396][41256] Avg episode reward: [(0, '84.857')] +[2023-03-11 17:03:30,416][41544] Updated weights for policy 0, policy_version 123600 (0.0004) +[2023-03-11 17:03:33,386][41256] Fps is (10 sec: 10239.9, 60 sec: 9489.1, 300 sec: 9233.4). Total num frames: 63311872. Throughput: 0: 9567.8. Samples: 63297524. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:03:33,396][41256] Avg episode reward: [(0, '81.988')] +[2023-03-11 17:03:33,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000123656_63311872.pth... +[2023-03-11 17:03:33,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000123080_63016960.pth +[2023-03-11 17:03:34,333][41544] Updated weights for policy 0, policy_version 123680 (0.0004) +[2023-03-11 17:03:38,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9247.2). Total num frames: 63361024. Throughput: 0: 9694.3. Samples: 63359060. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:03:38,396][41256] Avg episode reward: [(0, '81.118')] +[2023-03-11 17:03:38,507][41544] Updated weights for policy 0, policy_version 123760 (0.0005) +[2023-03-11 17:03:42,933][41544] Updated weights for policy 0, policy_version 123840 (0.0005) +[2023-03-11 17:03:43,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9557.3, 300 sec: 9233.4). Total num frames: 63406080. Throughput: 0: 9687.7. Samples: 63386316. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:03:43,386][41256] Avg episode reward: [(0, '86.809')] +[2023-03-11 17:03:47,492][41544] Updated weights for policy 0, policy_version 123920 (0.0005) +[2023-03-11 17:03:48,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9557.3, 300 sec: 9233.4). Total num frames: 63451136. Throughput: 0: 9697.8. Samples: 63441476. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:03:48,386][41256] Avg episode reward: [(0, '86.563')] +[2023-03-11 17:03:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000123936_63455232.pth... +[2023-03-11 17:03:48,390][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000123368_63164416.pth +[2023-03-11 17:03:52,051][41544] Updated weights for policy 0, policy_version 124000 (0.0005) +[2023-03-11 17:03:53,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9489.1, 300 sec: 9233.4). Total num frames: 63496192. Throughput: 0: 9672.4. Samples: 63495452. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:03:53,386][41256] Avg episode reward: [(0, '84.845')] +[2023-03-11 17:03:56,636][41544] Updated weights for policy 0, policy_version 124080 (0.0005) +[2023-03-11 17:03:58,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9489.1, 300 sec: 9233.4). Total num frames: 63541248. Throughput: 0: 9629.7. Samples: 63521628. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:03:58,386][41256] Avg episode reward: [(0, '87.223')] +[2023-03-11 17:04:01,152][41544] Updated weights for policy 0, policy_version 124160 (0.0005) +[2023-03-11 17:04:03,386][41256] Fps is (10 sec: 9011.1, 60 sec: 9489.1, 300 sec: 9233.4). Total num frames: 63586304. Throughput: 0: 9491.4. Samples: 63576056. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:04:03,386][41256] Avg episode reward: [(0, '84.802')] +[2023-03-11 17:04:03,444][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000124200_63590400.pth... +[2023-03-11 17:04:03,446][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000123656_63311872.pth +[2023-03-11 17:04:05,719][41544] Updated weights for policy 0, policy_version 124240 (0.0005) +[2023-03-11 17:04:08,385][41256] Fps is (10 sec: 9420.7, 60 sec: 9557.3, 300 sec: 9247.2). Total num frames: 63635456. Throughput: 0: 9416.9. Samples: 63631368. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:04:08,386][41256] Avg episode reward: [(0, '82.405')] +[2023-03-11 17:04:10,099][41544] Updated weights for policy 0, policy_version 124320 (0.0005) +[2023-03-11 17:04:13,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9233.4). Total num frames: 63680512. Throughput: 0: 9396.3. Samples: 63659140. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:04:13,386][41256] Avg episode reward: [(0, '84.341')] +[2023-03-11 17:04:14,574][41544] Updated weights for policy 0, policy_version 124400 (0.0005) +[2023-03-11 17:04:18,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9420.8, 300 sec: 9233.4). Total num frames: 63725568. Throughput: 0: 9242.1. Samples: 63713416. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:04:18,386][41256] Avg episode reward: [(0, '80.821')] +[2023-03-11 17:04:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000124464_63725568.pth... +[2023-03-11 17:04:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000123936_63455232.pth +[2023-03-11 17:04:18,942][41544] Updated weights for policy 0, policy_version 124480 (0.0005) +[2023-03-11 17:04:22,995][41544] Updated weights for policy 0, policy_version 124560 (0.0004) +[2023-03-11 17:04:23,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9420.8, 300 sec: 9233.4). Total num frames: 63774720. Throughput: 0: 9196.4. Samples: 63772896. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:04:23,386][41256] Avg episode reward: [(0, '82.737')] +[2023-03-11 17:04:27,475][41544] Updated weights for policy 0, policy_version 124640 (0.0005) +[2023-03-11 17:04:28,386][41256] Fps is (10 sec: 9830.3, 60 sec: 9352.5, 300 sec: 9233.4). Total num frames: 63823872. Throughput: 0: 9208.1. Samples: 63800680. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:04:28,386][41256] Avg episode reward: [(0, '78.768')] +[2023-03-11 17:04:31,939][41544] Updated weights for policy 0, policy_version 124720 (0.0005) +[2023-03-11 17:04:33,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9284.3, 300 sec: 9219.5). Total num frames: 63868928. Throughput: 0: 9216.5. Samples: 63856220. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:04:33,386][41256] Avg episode reward: [(0, '79.509')] +[2023-03-11 17:04:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000124744_63868928.pth... 
+[2023-03-11 17:04:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000124200_63590400.pth +[2023-03-11 17:04:36,513][41544] Updated weights for policy 0, policy_version 124800 (0.0005) +[2023-03-11 17:04:38,386][41256] Fps is (10 sec: 9011.2, 60 sec: 9216.0, 300 sec: 9219.5). Total num frames: 63913984. Throughput: 0: 9209.7. Samples: 63909888. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:04:38,386][41256] Avg episode reward: [(0, '79.320')] +[2023-03-11 17:04:40,992][41544] Updated weights for policy 0, policy_version 124880 (0.0005) +[2023-03-11 17:04:43,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9216.0, 300 sec: 9219.5). Total num frames: 63959040. Throughput: 0: 9236.0. Samples: 63937248. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:04:43,386][41256] Avg episode reward: [(0, '82.894')] +[2023-03-11 17:04:45,520][41544] Updated weights for policy 0, policy_version 124960 (0.0005) +[2023-03-11 17:04:48,386][41256] Fps is (10 sec: 9011.2, 60 sec: 9216.0, 300 sec: 9219.5). Total num frames: 64004096. Throughput: 0: 9239.2. Samples: 63991820. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:04:48,386][41256] Avg episode reward: [(0, '82.779')] +[2023-03-11 17:04:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000125008_64004096.pth... +[2023-03-11 17:04:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000124464_63725568.pth +[2023-03-11 17:04:49,978][41544] Updated weights for policy 0, policy_version 125040 (0.0005) +[2023-03-11 17:04:53,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9216.0, 300 sec: 9219.5). Total num frames: 64049152. Throughput: 0: 9214.3. Samples: 64046012. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:04:53,386][41256] Avg episode reward: [(0, '82.585')] +[2023-03-11 17:04:54,502][41544] Updated weights for policy 0, policy_version 125120 (0.0005) +[2023-03-11 17:04:58,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9216.0, 300 sec: 9219.5). Total num frames: 64094208. Throughput: 0: 9213.2. Samples: 64073736. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:04:58,386][41256] Avg episode reward: [(0, '85.084')] +[2023-03-11 17:04:59,031][41544] Updated weights for policy 0, policy_version 125200 (0.0005) +[2023-03-11 17:05:03,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9216.0, 300 sec: 9219.5). Total num frames: 64139264. Throughput: 0: 9191.8. Samples: 64127048. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:05:03,386][41256] Avg episode reward: [(0, '82.081')] +[2023-03-11 17:05:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000125272_64139264.pth... +[2023-03-11 17:05:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000124744_63868928.pth +[2023-03-11 17:05:03,631][41544] Updated weights for policy 0, policy_version 125280 (0.0005) +[2023-03-11 17:05:08,111][41544] Updated weights for policy 0, policy_version 125360 (0.0005) +[2023-03-11 17:05:08,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9147.7, 300 sec: 9205.6). Total num frames: 64184320. Throughput: 0: 9086.5. Samples: 64181788. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:05:08,386][41256] Avg episode reward: [(0, '79.945')] +[2023-03-11 17:05:12,612][41544] Updated weights for policy 0, policy_version 125440 (0.0005) +[2023-03-11 17:05:13,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9147.7, 300 sec: 9205.6). Total num frames: 64229376. Throughput: 0: 9072.9. Samples: 64208960. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:05:13,386][41256] Avg episode reward: [(0, '77.381')] +[2023-03-11 17:05:17,116][41544] Updated weights for policy 0, policy_version 125520 (0.0005) +[2023-03-11 17:05:18,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9147.7, 300 sec: 9205.6). Total num frames: 64274432. Throughput: 0: 9050.2. Samples: 64263480. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:05:18,386][41256] Avg episode reward: [(0, '75.313')] +[2023-03-11 17:05:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000125536_64274432.pth... +[2023-03-11 17:05:18,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000125008_64004096.pth +[2023-03-11 17:05:21,582][41544] Updated weights for policy 0, policy_version 125600 (0.0005) +[2023-03-11 17:05:23,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9147.7, 300 sec: 9219.5). Total num frames: 64323584. Throughput: 0: 9094.9. Samples: 64319160. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:05:23,386][41256] Avg episode reward: [(0, '80.359')] +[2023-03-11 17:05:26,018][41544] Updated weights for policy 0, policy_version 125680 (0.0005) +[2023-03-11 17:05:28,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9079.5, 300 sec: 9219.5). Total num frames: 64368640. Throughput: 0: 9098.8. Samples: 64346696. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:05:28,386][41256] Avg episode reward: [(0, '81.050')] +[2023-03-11 17:05:30,513][41544] Updated weights for policy 0, policy_version 125760 (0.0005) +[2023-03-11 17:05:33,386][41256] Fps is (10 sec: 9011.2, 60 sec: 9079.5, 300 sec: 9219.5). Total num frames: 64413696. Throughput: 0: 9102.0. Samples: 64401408. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:05:33,386][41256] Avg episode reward: [(0, '83.801')] +[2023-03-11 17:05:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000125808_64413696.pth... +[2023-03-11 17:05:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000125272_64139264.pth +[2023-03-11 17:05:35,049][41544] Updated weights for policy 0, policy_version 125840 (0.0005) +[2023-03-11 17:05:38,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9079.5, 300 sec: 9205.6). Total num frames: 64458752. Throughput: 0: 9086.4. Samples: 64454900. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:05:38,386][41256] Avg episode reward: [(0, '82.738')] +[2023-03-11 17:05:39,520][41544] Updated weights for policy 0, policy_version 125920 (0.0005) +[2023-03-11 17:05:43,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9147.7, 300 sec: 9219.5). Total num frames: 64507904. Throughput: 0: 9107.5. Samples: 64483572. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:05:43,386][41256] Avg episode reward: [(0, '79.058')] +[2023-03-11 17:05:43,701][41544] Updated weights for policy 0, policy_version 126000 (0.0005) +[2023-03-11 17:05:47,878][41544] Updated weights for policy 0, policy_version 126080 (0.0005) +[2023-03-11 17:05:48,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9216.0, 300 sec: 9233.4). Total num frames: 64557056. Throughput: 0: 9234.0. Samples: 64542580. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:05:48,386][41256] Avg episode reward: [(0, '79.770')] +[2023-03-11 17:05:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000126088_64557056.pth... 
+[2023-03-11 17:05:48,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000125536_64274432.pth +[2023-03-11 17:05:52,156][41544] Updated weights for policy 0, policy_version 126160 (0.0005) +[2023-03-11 17:05:53,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9216.0, 300 sec: 9233.4). Total num frames: 64602112. Throughput: 0: 9304.2. Samples: 64600476. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:05:53,386][41256] Avg episode reward: [(0, '80.318')] +[2023-03-11 17:05:56,387][41544] Updated weights for policy 0, policy_version 126240 (0.0005) +[2023-03-11 17:05:58,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9247.2). Total num frames: 64651264. Throughput: 0: 9351.5. Samples: 64629780. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:05:58,386][41256] Avg episode reward: [(0, '80.463')] +[2023-03-11 17:06:00,639][41544] Updated weights for policy 0, policy_version 126320 (0.0004) +[2023-03-11 17:06:03,385][41256] Fps is (10 sec: 9830.3, 60 sec: 9352.5, 300 sec: 9261.1). Total num frames: 64700416. Throughput: 0: 9431.0. Samples: 64687876. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:06:03,386][41256] Avg episode reward: [(0, '78.033')] +[2023-03-11 17:06:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000126368_64700416.pth... +[2023-03-11 17:06:03,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000125808_64413696.pth +[2023-03-11 17:06:04,864][41544] Updated weights for policy 0, policy_version 126400 (0.0005) +[2023-03-11 17:06:08,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9420.8, 300 sec: 9275.0). Total num frames: 64749568. Throughput: 0: 9475.0. Samples: 64745536. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:06:08,386][41256] Avg episode reward: [(0, '82.936')] +[2023-03-11 17:06:09,116][41544] Updated weights for policy 0, policy_version 126480 (0.0005) +[2023-03-11 17:06:13,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9275.0). Total num frames: 64794624. Throughput: 0: 9499.1. Samples: 64774156. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:06:13,386][41256] Avg episode reward: [(0, '81.707')] +[2023-03-11 17:06:13,453][41544] Updated weights for policy 0, policy_version 126560 (0.0005) +[2023-03-11 17:06:17,912][41544] Updated weights for policy 0, policy_version 126640 (0.0005) +[2023-03-11 17:06:18,386][41256] Fps is (10 sec: 9420.6, 60 sec: 9489.0, 300 sec: 9288.9). Total num frames: 64843776. Throughput: 0: 9529.8. Samples: 64830248. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:06:18,386][41256] Avg episode reward: [(0, '75.601')] +[2023-03-11 17:06:18,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000126648_64843776.pth... +[2023-03-11 17:06:18,393][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000126088_64557056.pth +[2023-03-11 17:06:22,421][41544] Updated weights for policy 0, policy_version 126720 (0.0005) +[2023-03-11 17:06:23,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9288.9). Total num frames: 64888832. Throughput: 0: 9552.1. Samples: 64884744. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:06:23,386][41256] Avg episode reward: [(0, '73.092')] +[2023-03-11 17:06:26,893][41544] Updated weights for policy 0, policy_version 126800 (0.0005) +[2023-03-11 17:06:28,385][41256] Fps is (10 sec: 9011.4, 60 sec: 9420.8, 300 sec: 9288.9). Total num frames: 64933888. Throughput: 0: 9519.4. Samples: 64911944. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:06:28,386][41256] Avg episode reward: [(0, '76.187')] +[2023-03-11 17:06:31,351][41544] Updated weights for policy 0, policy_version 126880 (0.0005) +[2023-03-11 17:06:33,386][41256] Fps is (10 sec: 9011.2, 60 sec: 9420.8, 300 sec: 9288.9). Total num frames: 64978944. Throughput: 0: 9426.8. Samples: 64966788. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:06:33,386][41256] Avg episode reward: [(0, '75.230')] +[2023-03-11 17:06:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000126912_64978944.pth... +[2023-03-11 17:06:33,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000126368_64700416.pth +[2023-03-11 17:06:35,820][41544] Updated weights for policy 0, policy_version 126960 (0.0005) +[2023-03-11 17:06:38,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9420.8, 300 sec: 9288.9). Total num frames: 65024000. Throughput: 0: 9374.3. Samples: 65022320. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:06:38,386][41256] Avg episode reward: [(0, '77.508')] +[2023-03-11 17:06:40,287][41544] Updated weights for policy 0, policy_version 127040 (0.0005) +[2023-03-11 17:06:43,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9352.5, 300 sec: 9288.9). Total num frames: 65069056. Throughput: 0: 9321.6. Samples: 65049252. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:06:43,386][41256] Avg episode reward: [(0, '76.001')] +[2023-03-11 17:06:44,792][41544] Updated weights for policy 0, policy_version 127120 (0.0005) +[2023-03-11 17:06:48,386][41256] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9275.0). Total num frames: 65114112. Throughput: 0: 9242.8. Samples: 65103804. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:06:48,386][41256] Avg episode reward: [(0, '75.430')] +[2023-03-11 17:06:48,399][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000127184_65118208.pth... +[2023-03-11 17:06:48,401][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000126648_64843776.pth +[2023-03-11 17:06:49,263][41544] Updated weights for policy 0, policy_version 127200 (0.0005) +[2023-03-11 17:06:53,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9288.9). Total num frames: 65163264. Throughput: 0: 9192.0. Samples: 65159176. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:06:53,386][41256] Avg episode reward: [(0, '73.768')] +[2023-03-11 17:06:53,766][41544] Updated weights for policy 0, policy_version 127280 (0.0005) +[2023-03-11 17:06:58,207][41544] Updated weights for policy 0, policy_version 127360 (0.0005) +[2023-03-11 17:06:58,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9288.9). Total num frames: 65208320. Throughput: 0: 9157.6. Samples: 65186248. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:06:58,386][41256] Avg episode reward: [(0, '80.192')] +[2023-03-11 17:07:02,676][41544] Updated weights for policy 0, policy_version 127440 (0.0005) +[2023-03-11 17:07:03,386][41256] Fps is (10 sec: 9011.2, 60 sec: 9216.0, 300 sec: 9288.9). Total num frames: 65253376. Throughput: 0: 9131.4. Samples: 65241160. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:07:03,386][41256] Avg episode reward: [(0, '79.104')] +[2023-03-11 17:07:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000127448_65253376.pth... 
+[2023-03-11 17:07:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000126912_64978944.pth +[2023-03-11 17:07:07,128][41544] Updated weights for policy 0, policy_version 127520 (0.0005) +[2023-03-11 17:07:08,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9147.7, 300 sec: 9288.9). Total num frames: 65298432. Throughput: 0: 9163.4. Samples: 65297096. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:07:08,386][41256] Avg episode reward: [(0, '76.265')] +[2023-03-11 17:07:11,599][41544] Updated weights for policy 0, policy_version 127600 (0.0005) +[2023-03-11 17:07:13,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9147.7, 300 sec: 9288.9). Total num frames: 65343488. Throughput: 0: 9162.2. Samples: 65324244. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:07:13,386][41256] Avg episode reward: [(0, '79.253')] +[2023-03-11 17:07:16,072][41544] Updated weights for policy 0, policy_version 127680 (0.0005) +[2023-03-11 17:07:18,386][41256] Fps is (10 sec: 9420.6, 60 sec: 9147.7, 300 sec: 9302.8). Total num frames: 65392640. Throughput: 0: 9174.0. Samples: 65379620. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:07:18,386][41256] Avg episode reward: [(0, '78.423')] +[2023-03-11 17:07:18,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000127720_65392640.pth... +[2023-03-11 17:07:18,393][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000127184_65118208.pth +[2023-03-11 17:07:20,612][41544] Updated weights for policy 0, policy_version 127760 (0.0004) +[2023-03-11 17:07:23,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9147.7, 300 sec: 9302.8). Total num frames: 65437696. Throughput: 0: 9139.7. Samples: 65433608. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 17:07:23,386][41256] Avg episode reward: [(0, '75.161')] +[2023-03-11 17:07:25,088][41544] Updated weights for policy 0, policy_version 127840 (0.0005) +[2023-03-11 17:07:28,385][41256] Fps is (10 sec: 9011.4, 60 sec: 9147.7, 300 sec: 9288.9). Total num frames: 65482752. Throughput: 0: 9150.6. Samples: 65461028. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 17:07:28,386][41256] Avg episode reward: [(0, '74.515')] +[2023-03-11 17:07:29,543][41544] Updated weights for policy 0, policy_version 127920 (0.0005) +[2023-03-11 17:07:33,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9147.7, 300 sec: 9288.9). Total num frames: 65527808. Throughput: 0: 9167.1. Samples: 65516324. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 17:07:33,386][41256] Avg episode reward: [(0, '74.967')] +[2023-03-11 17:07:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000127984_65527808.pth... +[2023-03-11 17:07:33,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000127448_65253376.pth +[2023-03-11 17:07:33,945][41544] Updated weights for policy 0, policy_version 128000 (0.0005) +[2023-03-11 17:07:38,333][41544] Updated weights for policy 0, policy_version 128080 (0.0005) +[2023-03-11 17:07:38,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9216.0, 300 sec: 9302.8). Total num frames: 65576960. Throughput: 0: 9178.5. Samples: 65572208. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 17:07:38,386][41256] Avg episode reward: [(0, '74.105')] +[2023-03-11 17:07:42,631][41544] Updated weights for policy 0, policy_version 128160 (0.0005) +[2023-03-11 17:07:43,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9302.8). Total num frames: 65622016. Throughput: 0: 9212.7. Samples: 65600820. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 17:07:43,386][41256] Avg episode reward: [(0, '75.001')] +[2023-03-11 17:07:46,720][41544] Updated weights for policy 0, policy_version 128240 (0.0004) +[2023-03-11 17:07:48,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9302.8). Total num frames: 65671168. Throughput: 0: 9304.9. Samples: 65659880. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 17:07:48,386][41256] Avg episode reward: [(0, '74.522')] +[2023-03-11 17:07:48,395][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000128272_65675264.pth... +[2023-03-11 17:07:48,397][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000127720_65392640.pth +[2023-03-11 17:07:50,900][41544] Updated weights for policy 0, policy_version 128320 (0.0005) +[2023-03-11 17:07:53,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9352.5, 300 sec: 9330.6). Total num frames: 65724416. Throughput: 0: 9385.9. Samples: 65719460. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 17:07:53,386][41256] Avg episode reward: [(0, '73.843')] +[2023-03-11 17:07:55,036][41544] Updated weights for policy 0, policy_version 128400 (0.0005) +[2023-03-11 17:07:58,386][41256] Fps is (10 sec: 10240.0, 60 sec: 9420.8, 300 sec: 9344.4). Total num frames: 65773568. Throughput: 0: 9436.6. Samples: 65748892. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 17:07:58,386][41256] Avg episode reward: [(0, '72.828')] +[2023-03-11 17:07:59,172][41544] Updated weights for policy 0, policy_version 128480 (0.0005) +[2023-03-11 17:08:03,270][41544] Updated weights for policy 0, policy_version 128560 (0.0004) +[2023-03-11 17:08:03,386][41256] Fps is (10 sec: 9830.3, 60 sec: 9489.1, 300 sec: 9358.3). Total num frames: 65822720. Throughput: 0: 9525.7. Samples: 65808276. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 17:08:03,386][41256] Avg episode reward: [(0, '74.540')] +[2023-03-11 17:08:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000128560_65822720.pth... +[2023-03-11 17:08:03,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000127984_65527808.pth +[2023-03-11 17:08:07,431][41544] Updated weights for policy 0, policy_version 128640 (0.0004) +[2023-03-11 17:08:08,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9557.3, 300 sec: 9358.3). Total num frames: 65871872. Throughput: 0: 9649.6. Samples: 65867840. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 17:08:08,386][41256] Avg episode reward: [(0, '73.381')] +[2023-03-11 17:08:11,524][41544] Updated weights for policy 0, policy_version 128720 (0.0005) +[2023-03-11 17:08:13,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9625.6, 300 sec: 9358.3). Total num frames: 65921024. Throughput: 0: 9702.4. Samples: 65897636. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 17:08:13,386][41256] Avg episode reward: [(0, '67.598')] +[2023-03-11 17:08:15,700][41544] Updated weights for policy 0, policy_version 128800 (0.0005) +[2023-03-11 17:08:18,386][41256] Fps is (10 sec: 9830.3, 60 sec: 9625.6, 300 sec: 9358.3). Total num frames: 65970176. Throughput: 0: 9794.5. Samples: 65957076. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 17:08:18,386][41256] Avg episode reward: [(0, '68.993')] +[2023-03-11 17:08:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000128848_65970176.pth... 
+[2023-03-11 17:08:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000128272_65675264.pth +[2023-03-11 17:08:19,865][41544] Updated weights for policy 0, policy_version 128880 (0.0005) +[2023-03-11 17:08:23,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9693.9, 300 sec: 9344.4). Total num frames: 66019328. Throughput: 0: 9846.4. Samples: 66015296. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 17:08:23,386][41256] Avg episode reward: [(0, '74.404')] +[2023-03-11 17:08:24,113][41544] Updated weights for policy 0, policy_version 128960 (0.0005) +[2023-03-11 17:08:28,272][41544] Updated weights for policy 0, policy_version 129040 (0.0005) +[2023-03-11 17:08:28,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9344.4). Total num frames: 66068480. Throughput: 0: 9855.0. Samples: 66044296. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 17:08:28,386][41256] Avg episode reward: [(0, '76.072')] +[2023-03-11 17:08:32,386][41544] Updated weights for policy 0, policy_version 129120 (0.0004) +[2023-03-11 17:08:33,386][41256] Fps is (10 sec: 9830.3, 60 sec: 9830.4, 300 sec: 9344.4). Total num frames: 66117632. Throughput: 0: 9868.3. Samples: 66103952. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 17:08:33,386][41256] Avg episode reward: [(0, '78.791')] +[2023-03-11 17:08:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000129136_66117632.pth... +[2023-03-11 17:08:33,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000128560_65822720.pth +[2023-03-11 17:08:36,614][41544] Updated weights for policy 0, policy_version 129200 (0.0005) +[2023-03-11 17:08:38,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9358.3). Total num frames: 66166784. Throughput: 0: 9851.1. Samples: 66162760. 
Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 17:08:38,386][41256] Avg episode reward: [(0, '77.872')] +[2023-03-11 17:08:40,723][41544] Updated weights for policy 0, policy_version 129280 (0.0004) +[2023-03-11 17:08:43,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9898.7, 300 sec: 9372.2). Total num frames: 66215936. Throughput: 0: 9850.0. Samples: 66192140. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 17:08:43,386][41256] Avg episode reward: [(0, '77.573')] +[2023-03-11 17:08:45,024][41544] Updated weights for policy 0, policy_version 129360 (0.0005) +[2023-03-11 17:08:48,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9830.4, 300 sec: 9372.2). Total num frames: 66260992. Throughput: 0: 9793.0. Samples: 66248960. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 17:08:48,386][41256] Avg episode reward: [(0, '73.510')] +[2023-03-11 17:08:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000129416_66260992.pth... +[2023-03-11 17:08:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000128848_65970176.pth +[2023-03-11 17:08:49,363][41544] Updated weights for policy 0, policy_version 129440 (0.0005) +[2023-03-11 17:08:53,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9386.1). Total num frames: 66310144. Throughput: 0: 9736.4. Samples: 66305976. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 17:08:53,386][41256] Avg episode reward: [(0, '68.864')] +[2023-03-11 17:08:53,816][41544] Updated weights for policy 0, policy_version 129520 (0.0005) +[2023-03-11 17:08:58,125][41544] Updated weights for policy 0, policy_version 129600 (0.0005) +[2023-03-11 17:08:58,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9386.1). Total num frames: 66355200. Throughput: 0: 9684.5. Samples: 66333440. 
Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 17:08:58,386][41256] Avg episode reward: [(0, '73.947')] +[2023-03-11 17:09:02,500][41544] Updated weights for policy 0, policy_version 129680 (0.0005) +[2023-03-11 17:09:03,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9693.9, 300 sec: 9386.1). Total num frames: 66404352. Throughput: 0: 9623.2. Samples: 66390120. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 17:09:03,386][41256] Avg episode reward: [(0, '78.057')] +[2023-03-11 17:09:03,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000129696_66404352.pth... +[2023-03-11 17:09:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000129136_66117632.pth +[2023-03-11 17:09:06,823][41544] Updated weights for policy 0, policy_version 129760 (0.0005) +[2023-03-11 17:09:08,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9386.1). Total num frames: 66449408. Throughput: 0: 9576.4. Samples: 66446236. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 17:09:08,386][41256] Avg episode reward: [(0, '77.125')] +[2023-03-11 17:09:11,292][41544] Updated weights for policy 0, policy_version 129840 (0.0005) +[2023-03-11 17:09:13,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9557.3, 300 sec: 9386.1). Total num frames: 66494464. Throughput: 0: 9550.0. Samples: 66474048. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 17:09:13,386][41256] Avg episode reward: [(0, '83.695')] +[2023-03-11 17:09:15,683][41544] Updated weights for policy 0, policy_version 129920 (0.0005) +[2023-03-11 17:09:18,386][41256] Fps is (10 sec: 9011.2, 60 sec: 9489.1, 300 sec: 9372.2). Total num frames: 66539520. Throughput: 0: 9463.7. Samples: 66529820. 
Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 17:09:18,386][41256] Avg episode reward: [(0, '83.526')] +[2023-03-11 17:09:18,399][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000129968_66543616.pth... +[2023-03-11 17:09:18,401][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000129416_66260992.pth +[2023-03-11 17:09:20,137][41544] Updated weights for policy 0, policy_version 130000 (0.0005) +[2023-03-11 17:09:23,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9372.2). Total num frames: 66588672. Throughput: 0: 9388.4. Samples: 66585240. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 17:09:23,396][41256] Avg episode reward: [(0, '83.843')] +[2023-03-11 17:09:24,520][41544] Updated weights for policy 0, policy_version 130080 (0.0005) +[2023-03-11 17:09:28,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9372.2). Total num frames: 66633728. Throughput: 0: 9359.4. Samples: 66613312. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:09:28,396][41256] Avg episode reward: [(0, '77.473')] +[2023-03-11 17:09:28,949][41544] Updated weights for policy 0, policy_version 130160 (0.0005) +[2023-03-11 17:09:33,385][41544] Updated weights for policy 0, policy_version 130240 (0.0005) +[2023-03-11 17:09:33,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9386.1). Total num frames: 66682880. Throughput: 0: 9323.2. Samples: 66668504. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:09:33,397][41256] Avg episode reward: [(0, '78.998')] +[2023-03-11 17:09:33,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000130240_66682880.pth... 
+[2023-03-11 17:09:33,401][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000129696_66404352.pth +[2023-03-11 17:09:37,713][41544] Updated weights for policy 0, policy_version 130320 (0.0005) +[2023-03-11 17:09:38,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9386.1). Total num frames: 66727936. Throughput: 0: 9311.2. Samples: 66724980. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:09:38,397][41256] Avg episode reward: [(0, '81.372')] +[2023-03-11 17:09:41,895][41544] Updated weights for policy 0, policy_version 130400 (0.0005) +[2023-03-11 17:09:43,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9400.0). Total num frames: 66777088. Throughput: 0: 9357.8. Samples: 66754540. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:09:43,397][41256] Avg episode reward: [(0, '74.891')] +[2023-03-11 17:09:46,115][41544] Updated weights for policy 0, policy_version 130480 (0.0005) +[2023-03-11 17:09:48,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9420.8, 300 sec: 9413.9). Total num frames: 66826240. Throughput: 0: 9389.9. Samples: 66812664. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:09:48,396][41256] Avg episode reward: [(0, '76.917')] +[2023-03-11 17:09:48,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000130520_66826240.pth... +[2023-03-11 17:09:48,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000129968_66543616.pth +[2023-03-11 17:09:50,483][41544] Updated weights for policy 0, policy_version 130560 (0.0005) +[2023-03-11 17:09:53,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9413.9). Total num frames: 66871296. Throughput: 0: 9381.9. Samples: 66868420. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:09:53,397][41256] Avg episode reward: [(0, '82.444')] +[2023-03-11 17:09:54,977][41544] Updated weights for policy 0, policy_version 130640 (0.0005) +[2023-03-11 17:09:58,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9413.9). Total num frames: 66916352. Throughput: 0: 9375.6. Samples: 66895948. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:09:58,386][41256] Avg episode reward: [(0, '83.491')] +[2023-03-11 17:09:59,399][41544] Updated weights for policy 0, policy_version 130720 (0.0005) +[2023-03-11 17:10:03,386][41256] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9413.9). Total num frames: 66961408. Throughput: 0: 9351.1. Samples: 66950620. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:10:03,386][41256] Avg episode reward: [(0, '81.661')] +[2023-03-11 17:10:03,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000130784_66961408.pth... +[2023-03-11 17:10:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000130240_66682880.pth +[2023-03-11 17:10:03,925][41544] Updated weights for policy 0, policy_version 130800 (0.0005) +[2023-03-11 17:10:08,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9413.9). Total num frames: 67006464. Throughput: 0: 9335.8. Samples: 67005352. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:10:08,386][41256] Avg episode reward: [(0, '82.405')] +[2023-03-11 17:10:08,455][41544] Updated weights for policy 0, policy_version 130880 (0.0005) +[2023-03-11 17:10:12,993][41544] Updated weights for policy 0, policy_version 130960 (0.0005) +[2023-03-11 17:10:13,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9284.3, 300 sec: 9413.9). Total num frames: 67051520. Throughput: 0: 9303.6. Samples: 67031972. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:10:13,386][41256] Avg episode reward: [(0, '82.608')] +[2023-03-11 17:10:17,399][41544] Updated weights for policy 0, policy_version 131040 (0.0005) +[2023-03-11 17:10:18,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9352.5, 300 sec: 9413.9). Total num frames: 67100672. Throughput: 0: 9320.4. Samples: 67087924. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:10:18,386][41256] Avg episode reward: [(0, '79.523')] +[2023-03-11 17:10:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000131056_67100672.pth... +[2023-03-11 17:10:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000130520_66826240.pth +[2023-03-11 17:10:21,780][41544] Updated weights for policy 0, policy_version 131120 (0.0005) +[2023-03-11 17:10:23,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9413.9). Total num frames: 67145728. Throughput: 0: 9293.7. Samples: 67143196. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:10:23,386][41256] Avg episode reward: [(0, '78.398')] +[2023-03-11 17:10:26,232][41544] Updated weights for policy 0, policy_version 131200 (0.0006) +[2023-03-11 17:10:28,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9413.9). Total num frames: 67190784. Throughput: 0: 9247.6. Samples: 67170680. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:10:28,386][41256] Avg episode reward: [(0, '82.068')] +[2023-03-11 17:10:30,639][41544] Updated weights for policy 0, policy_version 131280 (0.0006) +[2023-03-11 17:10:33,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9427.7). Total num frames: 67239936. Throughput: 0: 9192.2. Samples: 67226312. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:10:33,386][41256] Avg episode reward: [(0, '84.399')] +[2023-03-11 17:10:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000131328_67239936.pth... +[2023-03-11 17:10:33,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000130784_66961408.pth +[2023-03-11 17:10:35,189][41544] Updated weights for policy 0, policy_version 131360 (0.0006) +[2023-03-11 17:10:38,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9413.9). Total num frames: 67284992. Throughput: 0: 9166.3. Samples: 67280904. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:10:38,386][41256] Avg episode reward: [(0, '81.742')] +[2023-03-11 17:10:39,659][41544] Updated weights for policy 0, policy_version 131440 (0.0006) +[2023-03-11 17:10:43,386][41256] Fps is (10 sec: 9011.2, 60 sec: 9216.0, 300 sec: 9400.0). Total num frames: 67330048. Throughput: 0: 9167.5. Samples: 67308484. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:10:43,386][41256] Avg episode reward: [(0, '78.777')] +[2023-03-11 17:10:44,070][41544] Updated weights for policy 0, policy_version 131520 (0.0006) +[2023-03-11 17:10:48,386][41256] Fps is (10 sec: 9011.1, 60 sec: 9147.7, 300 sec: 9400.0). Total num frames: 67375104. Throughput: 0: 9169.9. Samples: 67363264. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:10:48,386][41256] Avg episode reward: [(0, '79.275')] +[2023-03-11 17:10:48,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000131592_67375104.pth... 
+[2023-03-11 17:10:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000131056_67100672.pth +[2023-03-11 17:10:48,547][41544] Updated weights for policy 0, policy_version 131600 (0.0005) +[2023-03-11 17:10:53,007][41544] Updated weights for policy 0, policy_version 131680 (0.0006) +[2023-03-11 17:10:53,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9147.7, 300 sec: 9386.1). Total num frames: 67420160. Throughput: 0: 9194.4. Samples: 67419100. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:10:53,386][41256] Avg episode reward: [(0, '82.527')] +[2023-03-11 17:10:57,412][41544] Updated weights for policy 0, policy_version 131760 (0.0006) +[2023-03-11 17:10:58,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9216.0, 300 sec: 9386.1). Total num frames: 67469312. Throughput: 0: 9217.0. Samples: 67446736. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:10:58,386][41256] Avg episode reward: [(0, '80.700')] +[2023-03-11 17:11:01,543][41544] Updated weights for policy 0, policy_version 131840 (0.0005) +[2023-03-11 17:11:03,386][41256] Fps is (10 sec: 9830.3, 60 sec: 9284.3, 300 sec: 9386.1). Total num frames: 67518464. Throughput: 0: 9269.4. Samples: 67505048. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:11:03,397][41256] Avg episode reward: [(0, '84.843')] +[2023-03-11 17:11:03,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000131872_67518464.pth... +[2023-03-11 17:11:03,403][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000131328_67239936.pth +[2023-03-11 17:11:05,829][41544] Updated weights for policy 0, policy_version 131920 (0.0003) +[2023-03-11 17:11:08,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9352.5, 300 sec: 9400.0). Total num frames: 67567616. Throughput: 0: 9338.0. Samples: 67563408. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:11:08,396][41256] Avg episode reward: [(0, '83.606')] +[2023-03-11 17:11:10,013][41544] Updated weights for policy 0, policy_version 132000 (0.0004) +[2023-03-11 17:11:13,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9420.8, 300 sec: 9400.0). Total num frames: 67616768. Throughput: 0: 9367.4. Samples: 67592212. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:11:13,386][41256] Avg episode reward: [(0, '83.168')] +[2023-03-11 17:11:14,175][41544] Updated weights for policy 0, policy_version 132080 (0.0004) +[2023-03-11 17:11:18,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9400.0). Total num frames: 67661824. Throughput: 0: 9412.9. Samples: 67649892. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:11:18,386][41256] Avg episode reward: [(0, '81.857')] +[2023-03-11 17:11:18,445][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000132160_67665920.pth... +[2023-03-11 17:11:18,445][41544] Updated weights for policy 0, policy_version 132160 (0.0005) +[2023-03-11 17:11:18,447][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000131592_67375104.pth +[2023-03-11 17:11:22,791][41544] Updated weights for policy 0, policy_version 132240 (0.0005) +[2023-03-11 17:11:23,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9413.9). Total num frames: 67710976. Throughput: 0: 9469.5. Samples: 67707032. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:11:23,386][41256] Avg episode reward: [(0, '83.048')] +[2023-03-11 17:11:27,168][41544] Updated weights for policy 0, policy_version 132320 (0.0005) +[2023-03-11 17:11:28,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9413.9). Total num frames: 67756032. Throughput: 0: 9491.8. Samples: 67735616. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:11:28,386][41256] Avg episode reward: [(0, '81.315')] +[2023-03-11 17:11:31,516][41544] Updated weights for policy 0, policy_version 132400 (0.0004) +[2023-03-11 17:11:33,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9427.7). Total num frames: 67805184. Throughput: 0: 9530.5. Samples: 67792136. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:11:33,386][41256] Avg episode reward: [(0, '77.694')] +[2023-03-11 17:11:33,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000132432_67805184.pth... +[2023-03-11 17:11:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000131872_67518464.pth +[2023-03-11 17:11:35,865][41544] Updated weights for policy 0, policy_version 132480 (0.0005) +[2023-03-11 17:11:38,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9420.8, 300 sec: 9427.7). Total num frames: 67850240. Throughput: 0: 9542.4. Samples: 67848508. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:11:38,386][41256] Avg episode reward: [(0, '76.749')] +[2023-03-11 17:11:40,208][41544] Updated weights for policy 0, policy_version 132560 (0.0005) +[2023-03-11 17:11:43,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9441.6). Total num frames: 67899392. Throughput: 0: 9554.8. Samples: 67876704. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:11:43,386][41256] Avg episode reward: [(0, '78.274')] +[2023-03-11 17:11:44,600][41544] Updated weights for policy 0, policy_version 132640 (0.0005) +[2023-03-11 17:11:48,385][41256] Fps is (10 sec: 9420.7, 60 sec: 9489.1, 300 sec: 9427.7). Total num frames: 67944448. Throughput: 0: 9493.0. Samples: 67932232. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:11:48,386][41256] Avg episode reward: [(0, '78.191')] +[2023-03-11 17:11:48,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000132704_67944448.pth... +[2023-03-11 17:11:48,393][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000132160_67665920.pth +[2023-03-11 17:11:49,061][41544] Updated weights for policy 0, policy_version 132720 (0.0005) +[2023-03-11 17:11:53,386][41256] Fps is (10 sec: 9011.1, 60 sec: 9489.1, 300 sec: 9427.7). Total num frames: 67989504. Throughput: 0: 9443.4. Samples: 67988360. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:11:53,386][41256] Avg episode reward: [(0, '80.419')] +[2023-03-11 17:11:53,425][41544] Updated weights for policy 0, policy_version 132800 (0.0005) +[2023-03-11 17:11:57,870][41544] Updated weights for policy 0, policy_version 132880 (0.0005) +[2023-03-11 17:11:58,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9441.6). Total num frames: 68038656. Throughput: 0: 9417.8. Samples: 68016012. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:11:58,386][41256] Avg episode reward: [(0, '82.068')] +[2023-03-11 17:12:02,341][41544] Updated weights for policy 0, policy_version 132960 (0.0005) +[2023-03-11 17:12:03,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9441.6). Total num frames: 68083712. Throughput: 0: 9367.4. Samples: 68071424. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:12:03,386][41256] Avg episode reward: [(0, '78.742')] +[2023-03-11 17:12:03,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000132976_68083712.pth... 
+[2023-03-11 17:12:03,393][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000132432_67805184.pth +[2023-03-11 17:12:06,816][41544] Updated weights for policy 0, policy_version 133040 (0.0005) +[2023-03-11 17:12:08,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9352.5, 300 sec: 9441.6). Total num frames: 68128768. Throughput: 0: 9306.8. Samples: 68125836. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:12:08,386][41256] Avg episode reward: [(0, '83.570')] +[2023-03-11 17:12:11,249][41544] Updated weights for policy 0, policy_version 133120 (0.0005) +[2023-03-11 17:12:13,386][41256] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9427.7). Total num frames: 68173824. Throughput: 0: 9284.4. Samples: 68153416. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:12:13,386][41256] Avg episode reward: [(0, '81.852')] +[2023-03-11 17:12:15,531][41544] Updated weights for policy 0, policy_version 133200 (0.0005) +[2023-03-11 17:12:18,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9352.5, 300 sec: 9441.6). Total num frames: 68222976. Throughput: 0: 9311.6. Samples: 68211160. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:12:18,386][41256] Avg episode reward: [(0, '81.972')] +[2023-03-11 17:12:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000133248_68222976.pth... +[2023-03-11 17:12:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000132704_67944448.pth +[2023-03-11 17:12:19,739][41544] Updated weights for policy 0, policy_version 133280 (0.0005) +[2023-03-11 17:12:23,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9352.5, 300 sec: 9455.5). Total num frames: 68272128. Throughput: 0: 9353.8. Samples: 68269432. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:12:23,386][41256] Avg episode reward: [(0, '81.025')] +[2023-03-11 17:12:23,952][41544] Updated weights for policy 0, policy_version 133360 (0.0005) +[2023-03-11 17:12:28,073][41544] Updated weights for policy 0, policy_version 133440 (0.0004) +[2023-03-11 17:12:28,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9420.8, 300 sec: 9469.4). Total num frames: 68321280. Throughput: 0: 9390.9. Samples: 68299292. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:12:28,386][41256] Avg episode reward: [(0, '81.929')] +[2023-03-11 17:12:32,286][41544] Updated weights for policy 0, policy_version 133520 (0.0004) +[2023-03-11 17:12:33,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9420.8, 300 sec: 9469.4). Total num frames: 68370432. Throughput: 0: 9460.3. Samples: 68357944. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 17:12:33,386][41256] Avg episode reward: [(0, '79.990')] +[2023-03-11 17:12:33,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000133536_68370432.pth... +[2023-03-11 17:12:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000132976_68083712.pth +[2023-03-11 17:12:36,478][41544] Updated weights for policy 0, policy_version 133600 (0.0005) +[2023-03-11 17:12:38,385][41256] Fps is (10 sec: 9830.3, 60 sec: 9489.1, 300 sec: 9483.3). Total num frames: 68419584. Throughput: 0: 9508.1. Samples: 68416224. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 17:12:38,397][41256] Avg episode reward: [(0, '81.573')] +[2023-03-11 17:12:40,714][41544] Updated weights for policy 0, policy_version 133680 (0.0005) +[2023-03-11 17:12:43,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9489.1, 300 sec: 9483.3). Total num frames: 68468736. Throughput: 0: 9538.6. Samples: 68445248. 
Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 17:12:43,396][41256] Avg episode reward: [(0, '83.425')] +[2023-03-11 17:12:44,814][41544] Updated weights for policy 0, policy_version 133760 (0.0004) +[2023-03-11 17:12:48,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9469.4). Total num frames: 68517888. Throughput: 0: 9641.6. Samples: 68505296. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 17:12:48,396][41256] Avg episode reward: [(0, '82.828')] +[2023-03-11 17:12:48,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000133824_68517888.pth... +[2023-03-11 17:12:48,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000133248_68222976.pth +[2023-03-11 17:12:49,046][41544] Updated weights for policy 0, policy_version 133840 (0.0004) +[2023-03-11 17:12:53,326][41544] Updated weights for policy 0, policy_version 133920 (0.0005) +[2023-03-11 17:12:53,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9469.4). Total num frames: 68567040. Throughput: 0: 9707.6. Samples: 68562680. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 17:12:53,397][41256] Avg episode reward: [(0, '81.755')] +[2023-03-11 17:12:57,561][41544] Updated weights for policy 0, policy_version 134000 (0.0005) +[2023-03-11 17:12:58,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9455.5). Total num frames: 68612096. Throughput: 0: 9733.2. Samples: 68591412. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 17:12:58,402][41256] Avg episode reward: [(0, '81.040')] +[2023-03-11 17:13:01,904][41544] Updated weights for policy 0, policy_version 134080 (0.0005) +[2023-03-11 17:13:03,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9455.5). Total num frames: 68661248. Throughput: 0: 9720.8. Samples: 68648596. 
Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 17:13:03,397][41256] Avg episode reward: [(0, '87.271')] +[2023-03-11 17:13:03,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000134104_68661248.pth... +[2023-03-11 17:13:03,403][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000133536_68370432.pth +[2023-03-11 17:13:06,434][41544] Updated weights for policy 0, policy_version 134160 (0.0005) +[2023-03-11 17:13:08,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9441.6). Total num frames: 68706304. Throughput: 0: 9622.3. Samples: 68702436. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 17:13:08,397][41256] Avg episode reward: [(0, '87.820')] +[2023-03-11 17:13:10,870][41544] Updated weights for policy 0, policy_version 134240 (0.0005) +[2023-03-11 17:13:13,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9625.6, 300 sec: 9427.7). Total num frames: 68751360. Throughput: 0: 9591.0. Samples: 68730888. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 17:13:13,386][41256] Avg episode reward: [(0, '89.805')] +[2023-03-11 17:13:15,419][41544] Updated weights for policy 0, policy_version 134320 (0.0005) +[2023-03-11 17:13:18,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9427.7). Total num frames: 68800512. Throughput: 0: 9500.6. Samples: 68785472. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 17:13:18,386][41256] Avg episode reward: [(0, '85.891')] +[2023-03-11 17:13:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000134376_68800512.pth... 
+[2023-03-11 17:13:18,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000133824_68517888.pth +[2023-03-11 17:13:19,607][41544] Updated weights for policy 0, policy_version 134400 (0.0004) +[2023-03-11 17:13:23,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9413.9). Total num frames: 68845568. Throughput: 0: 9451.6. Samples: 68841544. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 17:13:23,397][41256] Avg episode reward: [(0, '90.751')] +[2023-03-11 17:13:24,152][41544] Updated weights for policy 0, policy_version 134480 (0.0005) +[2023-03-11 17:13:28,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9489.1, 300 sec: 9400.0). Total num frames: 68890624. Throughput: 0: 9422.0. Samples: 68869236. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 17:13:28,390][41256] Avg episode reward: [(0, '85.068')] +[2023-03-11 17:13:28,674][41544] Updated weights for policy 0, policy_version 134560 (0.0006) +[2023-03-11 17:13:33,192][41544] Updated weights for policy 0, policy_version 134640 (0.0005) +[2023-03-11 17:13:33,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9420.8, 300 sec: 9386.1). Total num frames: 68935680. Throughput: 0: 9292.6. Samples: 68923464. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 17:13:33,396][41256] Avg episode reward: [(0, '82.780')] +[2023-03-11 17:13:33,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000134640_68935680.pth... +[2023-03-11 17:13:33,403][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000134104_68661248.pth +[2023-03-11 17:13:37,602][41544] Updated weights for policy 0, policy_version 134720 (0.0004) +[2023-03-11 17:13:38,386][41256] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9372.2). Total num frames: 68980736. Throughput: 0: 9252.7. Samples: 68979052. 
Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 17:13:38,397][41256] Avg episode reward: [(0, '85.862')] +[2023-03-11 17:13:41,729][41544] Updated weights for policy 0, policy_version 134800 (0.0004) +[2023-03-11 17:13:43,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9386.1). Total num frames: 69029888. Throughput: 0: 9271.3. Samples: 69008620. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 17:13:43,386][41256] Avg episode reward: [(0, '85.892')] +[2023-03-11 17:13:45,948][41544] Updated weights for policy 0, policy_version 134880 (0.0005) +[2023-03-11 17:13:48,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9352.5, 300 sec: 9386.1). Total num frames: 69079040. Throughput: 0: 9302.0. Samples: 69067184. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 17:13:48,386][41256] Avg episode reward: [(0, '83.511')] +[2023-03-11 17:13:48,416][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000134928_69083136.pth... +[2023-03-11 17:13:48,418][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000134376_68800512.pth +[2023-03-11 17:13:50,087][41544] Updated weights for policy 0, policy_version 134960 (0.0005) +[2023-03-11 17:13:53,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9420.8, 300 sec: 9413.9). Total num frames: 69132288. Throughput: 0: 9443.8. Samples: 69127408. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 17:13:53,386][41256] Avg episode reward: [(0, '83.374')] +[2023-03-11 17:13:54,162][41544] Updated weights for policy 0, policy_version 135040 (0.0005) +[2023-03-11 17:13:58,316][41544] Updated weights for policy 0, policy_version 135120 (0.0005) +[2023-03-11 17:13:58,385][41256] Fps is (10 sec: 10240.1, 60 sec: 9489.1, 300 sec: 9413.9). Total num frames: 69181440. Throughput: 0: 9467.6. Samples: 69156928. 
Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 17:13:58,386][41256] Avg episode reward: [(0, '82.637')] +[2023-03-11 17:14:02,631][41544] Updated weights for policy 0, policy_version 135200 (0.0005) +[2023-03-11 17:14:03,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9413.9). Total num frames: 69226496. Throughput: 0: 9548.6. Samples: 69215160. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 17:14:03,386][41256] Avg episode reward: [(0, '85.632')] +[2023-03-11 17:14:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000135208_69226496.pth... +[2023-03-11 17:14:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000134640_68935680.pth +[2023-03-11 17:14:06,877][41544] Updated weights for policy 0, policy_version 135280 (0.0005) +[2023-03-11 17:14:08,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9427.7). Total num frames: 69275648. Throughput: 0: 9572.4. Samples: 69272300. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 17:14:08,386][41256] Avg episode reward: [(0, '83.038')] +[2023-03-11 17:14:10,940][41544] Updated weights for policy 0, policy_version 135360 (0.0005) +[2023-03-11 17:14:13,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9441.6). Total num frames: 69324800. Throughput: 0: 9654.5. Samples: 69303688. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 17:14:13,386][41256] Avg episode reward: [(0, '85.331')] +[2023-03-11 17:14:15,123][41544] Updated weights for policy 0, policy_version 135440 (0.0005) +[2023-03-11 17:14:18,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9441.6). Total num frames: 69373952. Throughput: 0: 9739.2. Samples: 69361728. 
Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 17:14:18,386][41256] Avg episode reward: [(0, '88.336')] +[2023-03-11 17:14:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000135496_69373952.pth... +[2023-03-11 17:14:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000134928_69083136.pth +[2023-03-11 17:14:19,338][41544] Updated weights for policy 0, policy_version 135520 (0.0005) +[2023-03-11 17:14:23,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9455.5). Total num frames: 69423104. Throughput: 0: 9790.8. Samples: 69419636. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 17:14:23,386][41256] Avg episode reward: [(0, '88.587')] +[2023-03-11 17:14:23,628][41544] Updated weights for policy 0, policy_version 135600 (0.0005) +[2023-03-11 17:14:28,197][41544] Updated weights for policy 0, policy_version 135680 (0.0005) +[2023-03-11 17:14:28,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9441.6). Total num frames: 69468160. Throughput: 0: 9753.4. Samples: 69447524. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 17:14:28,386][41256] Avg episode reward: [(0, '87.050')] +[2023-03-11 17:14:32,628][41544] Updated weights for policy 0, policy_version 135760 (0.0006) +[2023-03-11 17:14:33,386][41256] Fps is (10 sec: 9011.2, 60 sec: 9625.6, 300 sec: 9441.6). Total num frames: 69513216. Throughput: 0: 9659.1. Samples: 69501844. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 17:14:33,386][41256] Avg episode reward: [(0, '89.267')] +[2023-03-11 17:14:33,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000135768_69513216.pth... 
+[2023-03-11 17:14:33,393][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000135208_69226496.pth +[2023-03-11 17:14:37,076][41544] Updated weights for policy 0, policy_version 135840 (0.0005) +[2023-03-11 17:14:38,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9441.6). Total num frames: 69562368. Throughput: 0: 9556.4. Samples: 69557448. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:14:38,386][41256] Avg episode reward: [(0, '89.574')] +[2023-03-11 17:14:41,412][41544] Updated weights for policy 0, policy_version 135920 (0.0005) +[2023-03-11 17:14:43,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9427.7). Total num frames: 69607424. Throughput: 0: 9538.2. Samples: 69586148. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:14:43,386][41256] Avg episode reward: [(0, '86.641')] +[2023-03-11 17:14:45,868][41544] Updated weights for policy 0, policy_version 136000 (0.0006) +[2023-03-11 17:14:48,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9557.3, 300 sec: 9427.7). Total num frames: 69652480. Throughput: 0: 9458.8. Samples: 69640804. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:14:48,386][41256] Avg episode reward: [(0, '90.027')] +[2023-03-11 17:14:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000136040_69652480.pth... +[2023-03-11 17:14:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000135496_69373952.pth +[2023-03-11 17:14:50,321][41544] Updated weights for policy 0, policy_version 136080 (0.0006) +[2023-03-11 17:14:53,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9441.6). Total num frames: 69701632. Throughput: 0: 9428.7. Samples: 69696592. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:14:53,386][41256] Avg episode reward: [(0, '91.069')] +[2023-03-11 17:14:54,552][41544] Updated weights for policy 0, policy_version 136160 (0.0005) +[2023-03-11 17:14:58,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9441.6). Total num frames: 69746688. Throughput: 0: 9389.5. Samples: 69726216. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:14:58,386][41256] Avg episode reward: [(0, '89.716')] +[2023-03-11 17:14:58,969][41544] Updated weights for policy 0, policy_version 136240 (0.0006) +[2023-03-11 17:15:03,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9420.8, 300 sec: 9441.6). Total num frames: 69791744. Throughput: 0: 9322.5. Samples: 69781240. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:15:03,386][41256] Avg episode reward: [(0, '88.625')] +[2023-03-11 17:15:03,441][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000136320_69795840.pth... +[2023-03-11 17:15:03,442][41544] Updated weights for policy 0, policy_version 136320 (0.0006) +[2023-03-11 17:15:03,444][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000135768_69513216.pth +[2023-03-11 17:15:07,898][41544] Updated weights for policy 0, policy_version 136400 (0.0006) +[2023-03-11 17:15:08,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9455.5). Total num frames: 69840896. Throughput: 0: 9266.7. Samples: 69836640. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:15:08,386][41256] Avg episode reward: [(0, '86.373')] +[2023-03-11 17:15:12,360][41544] Updated weights for policy 0, policy_version 136480 (0.0006) +[2023-03-11 17:15:13,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9441.6). Total num frames: 69885952. Throughput: 0: 9251.6. Samples: 69863848. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:15:13,386][41256] Avg episode reward: [(0, '87.762')] +[2023-03-11 17:15:16,736][41544] Updated weights for policy 0, policy_version 136560 (0.0004) +[2023-03-11 17:15:18,386][41256] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9441.6). Total num frames: 69931008. Throughput: 0: 9282.0. Samples: 69919536. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:15:18,386][41256] Avg episode reward: [(0, '88.573')] +[2023-03-11 17:15:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000136584_69931008.pth... +[2023-03-11 17:15:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000136040_69652480.pth +[2023-03-11 17:15:21,208][41544] Updated weights for policy 0, policy_version 136640 (0.0004) +[2023-03-11 17:15:23,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9455.5). Total num frames: 69980160. Throughput: 0: 9300.1. Samples: 69975952. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:15:23,386][41256] Avg episode reward: [(0, '87.993')] +[2023-03-11 17:15:25,491][41544] Updated weights for policy 0, policy_version 136720 (0.0003) +[2023-03-11 17:15:28,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9441.6). Total num frames: 70025216. Throughput: 0: 9288.5. Samples: 70004132. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:15:28,386][41256] Avg episode reward: [(0, '93.318')] +[2023-03-11 17:15:29,634][41544] Updated weights for policy 0, policy_version 136800 (0.0004) +[2023-03-11 17:15:33,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9352.5, 300 sec: 9455.5). Total num frames: 70074368. Throughput: 0: 9376.5. Samples: 70062748. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:15:33,386][41256] Avg episode reward: [(0, '90.472')] +[2023-03-11 17:15:33,394][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000136872_70078464.pth... +[2023-03-11 17:15:33,396][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000136320_69795840.pth +[2023-03-11 17:15:33,826][41544] Updated weights for policy 0, policy_version 136880 (0.0005) +[2023-03-11 17:15:38,074][41544] Updated weights for policy 0, policy_version 136960 (0.0005) +[2023-03-11 17:15:38,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9352.5, 300 sec: 9469.4). Total num frames: 70123520. Throughput: 0: 9438.6. Samples: 70121328. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:15:38,386][41256] Avg episode reward: [(0, '88.809')] +[2023-03-11 17:15:42,194][41544] Updated weights for policy 0, policy_version 137040 (0.0005) +[2023-03-11 17:15:43,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9420.8, 300 sec: 9483.3). Total num frames: 70172672. Throughput: 0: 9450.0. Samples: 70151468. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:15:43,386][41256] Avg episode reward: [(0, '88.982')] +[2023-03-11 17:15:46,338][41544] Updated weights for policy 0, policy_version 137120 (0.0005) +[2023-03-11 17:15:48,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9489.1, 300 sec: 9497.2). Total num frames: 70221824. Throughput: 0: 9526.6. Samples: 70209936. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:15:48,386][41256] Avg episode reward: [(0, '92.036')] +[2023-03-11 17:15:48,388][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000137160_70225920.pth... 
+[2023-03-11 17:15:48,390][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000136584_69931008.pth +[2023-03-11 17:15:50,452][41544] Updated weights for policy 0, policy_version 137200 (0.0005) +[2023-03-11 17:15:53,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9557.3, 300 sec: 9511.1). Total num frames: 70275072. Throughput: 0: 9647.7. Samples: 70270788. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:15:53,386][41256] Avg episode reward: [(0, '90.294')] +[2023-03-11 17:15:54,480][41544] Updated weights for policy 0, policy_version 137280 (0.0005) +[2023-03-11 17:15:58,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9625.6, 300 sec: 9511.1). Total num frames: 70324224. Throughput: 0: 9707.7. Samples: 70300696. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:15:58,386][41256] Avg episode reward: [(0, '86.055')] +[2023-03-11 17:15:58,560][41544] Updated weights for policy 0, policy_version 137360 (0.0005) +[2023-03-11 17:16:02,609][41544] Updated weights for policy 0, policy_version 137440 (0.0005) +[2023-03-11 17:16:03,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9511.1). Total num frames: 70373376. Throughput: 0: 9813.9. Samples: 70361160. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:16:03,386][41256] Avg episode reward: [(0, '86.793')] +[2023-03-11 17:16:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000137448_70373376.pth... +[2023-03-11 17:16:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000136872_70078464.pth +[2023-03-11 17:16:07,115][41544] Updated weights for policy 0, policy_version 137520 (0.0005) +[2023-03-11 17:16:08,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9625.6, 300 sec: 9497.2). Total num frames: 70418432. Throughput: 0: 9803.7. Samples: 70417120. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:16:08,386][41256] Avg episode reward: [(0, '86.641')] +[2023-03-11 17:16:11,575][41544] Updated weights for policy 0, policy_version 137600 (0.0005) +[2023-03-11 17:16:13,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9625.6, 300 sec: 9497.2). Total num frames: 70463488. Throughput: 0: 9788.3. Samples: 70444604. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:16:13,386][41256] Avg episode reward: [(0, '85.776')] +[2023-03-11 17:16:16,142][41544] Updated weights for policy 0, policy_version 137680 (0.0005) +[2023-03-11 17:16:18,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9625.6, 300 sec: 9483.3). Total num frames: 70508544. Throughput: 0: 9678.0. Samples: 70498256. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:16:18,386][41256] Avg episode reward: [(0, '88.660')] +[2023-03-11 17:16:18,388][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000137712_70508544.pth... +[2023-03-11 17:16:18,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000137160_70225920.pth +[2023-03-11 17:16:20,813][41544] Updated weights for policy 0, policy_version 137760 (0.0005) +[2023-03-11 17:16:23,385][41256] Fps is (10 sec: 9011.1, 60 sec: 9557.3, 300 sec: 9483.3). Total num frames: 70553600. Throughput: 0: 9544.4. Samples: 70550828. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:16:23,386][41256] Avg episode reward: [(0, '89.001')] +[2023-03-11 17:16:25,438][41544] Updated weights for policy 0, policy_version 137840 (0.0005) +[2023-03-11 17:16:28,386][41256] Fps is (10 sec: 9011.2, 60 sec: 9557.3, 300 sec: 9469.4). Total num frames: 70598656. Throughput: 0: 9480.2. Samples: 70578076. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:16:28,386][41256] Avg episode reward: [(0, '90.640')] +[2023-03-11 17:16:30,658][41544] Updated weights for policy 0, policy_version 137920 (0.0006) +[2023-03-11 17:16:33,386][41256] Fps is (10 sec: 7782.3, 60 sec: 9284.3, 300 sec: 9427.7). Total num frames: 70631424. Throughput: 0: 9180.1. Samples: 70623040. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:16:33,386][41256] Avg episode reward: [(0, '89.614')] +[2023-03-11 17:16:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000137952_70631424.pth... +[2023-03-11 17:16:33,393][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000137448_70373376.pth +[2023-03-11 17:16:37,003][41544] Updated weights for policy 0, policy_version 138000 (0.0005) +[2023-03-11 17:16:38,385][41256] Fps is (10 sec: 6963.2, 60 sec: 9079.5, 300 sec: 9386.1). Total num frames: 70668288. Throughput: 0: 8733.7. Samples: 70663804. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:16:38,386][41256] Avg episode reward: [(0, '90.985')] +[2023-03-11 17:16:41,477][41544] Updated weights for policy 0, policy_version 138080 (0.0005) +[2023-03-11 17:16:43,385][41256] Fps is (10 sec: 8192.1, 60 sec: 9011.2, 300 sec: 9386.1). Total num frames: 70713344. Throughput: 0: 8683.7. Samples: 70691464. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:16:43,386][41256] Avg episode reward: [(0, '89.981')] +[2023-03-11 17:16:45,841][41544] Updated weights for policy 0, policy_version 138160 (0.0005) +[2023-03-11 17:16:48,385][41256] Fps is (10 sec: 9011.2, 60 sec: 8942.9, 300 sec: 9386.1). Total num frames: 70758400. Throughput: 0: 8575.6. Samples: 70747060. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:16:48,386][41256] Avg episode reward: [(0, '88.888')] +[2023-03-11 17:16:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000138200_70758400.pth... +[2023-03-11 17:16:48,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000137712_70508544.pth +[2023-03-11 17:16:50,250][41544] Updated weights for policy 0, policy_version 138240 (0.0005) +[2023-03-11 17:16:53,385][41256] Fps is (10 sec: 9011.2, 60 sec: 8806.4, 300 sec: 9372.2). Total num frames: 70803456. Throughput: 0: 8545.1. Samples: 70801648. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:16:53,386][41256] Avg episode reward: [(0, '89.420')] +[2023-03-11 17:16:54,822][41544] Updated weights for policy 0, policy_version 138320 (0.0005) +[2023-03-11 17:16:58,385][41256] Fps is (10 sec: 8601.6, 60 sec: 8669.9, 300 sec: 9358.3). Total num frames: 70844416. Throughput: 0: 8550.1. Samples: 70829360. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:16:58,386][41256] Avg episode reward: [(0, '86.187')] +[2023-03-11 17:17:00,375][41544] Updated weights for policy 0, policy_version 138400 (0.0005) +[2023-03-11 17:17:03,386][41256] Fps is (10 sec: 7372.7, 60 sec: 8396.8, 300 sec: 9316.7). Total num frames: 70877184. Throughput: 0: 8256.9. Samples: 70869816. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:17:03,386][41256] Avg episode reward: [(0, '86.847')] +[2023-03-11 17:17:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000138432_70877184.pth... 
+[2023-03-11 17:17:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000137952_70631424.pth +[2023-03-11 17:17:06,010][41544] Updated weights for policy 0, policy_version 138480 (0.0005) +[2023-03-11 17:17:08,385][41256] Fps is (10 sec: 7782.4, 60 sec: 8396.8, 300 sec: 9316.7). Total num frames: 70922240. Throughput: 0: 8191.7. Samples: 70919456. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:17:08,386][41256] Avg episode reward: [(0, '88.735')] +[2023-03-11 17:17:10,128][41544] Updated weights for policy 0, policy_version 138560 (0.0005) +[2023-03-11 17:17:13,385][41256] Fps is (10 sec: 9830.5, 60 sec: 8533.3, 300 sec: 9330.6). Total num frames: 70975488. Throughput: 0: 8268.5. Samples: 70950160. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:17:13,386][41256] Avg episode reward: [(0, '88.868')] +[2023-03-11 17:17:14,123][41544] Updated weights for policy 0, policy_version 138640 (0.0004) +[2023-03-11 17:17:18,211][41544] Updated weights for policy 0, policy_version 138720 (0.0005) +[2023-03-11 17:17:18,386][41256] Fps is (10 sec: 10240.0, 60 sec: 8601.6, 300 sec: 9330.5). Total num frames: 71024640. Throughput: 0: 8620.4. Samples: 71010960. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:17:18,386][41256] Avg episode reward: [(0, '87.950')] +[2023-03-11 17:17:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000138720_71024640.pth... +[2023-03-11 17:17:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000138200_70758400.pth +[2023-03-11 17:17:22,167][41544] Updated weights for policy 0, policy_version 138800 (0.0005) +[2023-03-11 17:17:23,385][41256] Fps is (10 sec: 9830.4, 60 sec: 8669.9, 300 sec: 9330.6). Total num frames: 71073792. Throughput: 0: 9083.0. Samples: 71072540. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:17:23,386][41256] Avg episode reward: [(0, '87.633')] +[2023-03-11 17:17:26,488][41544] Updated weights for policy 0, policy_version 138880 (0.0005) +[2023-03-11 17:17:28,385][41256] Fps is (10 sec: 9830.4, 60 sec: 8738.1, 300 sec: 9330.6). Total num frames: 71122944. Throughput: 0: 9093.5. Samples: 71100672. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:17:28,386][41256] Avg episode reward: [(0, '89.122')] +[2023-03-11 17:17:30,758][41544] Updated weights for policy 0, policy_version 138960 (0.0005) +[2023-03-11 17:17:33,386][41256] Fps is (10 sec: 9830.3, 60 sec: 9011.2, 300 sec: 9330.5). Total num frames: 71172096. Throughput: 0: 9137.0. Samples: 71158224. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:17:33,386][41256] Avg episode reward: [(0, '88.529')] +[2023-03-11 17:17:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000139008_71172096.pth... +[2023-03-11 17:17:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000138432_70877184.pth +[2023-03-11 17:17:35,036][41544] Updated weights for policy 0, policy_version 139040 (0.0005) +[2023-03-11 17:17:38,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9147.7, 300 sec: 9316.7). Total num frames: 71217152. Throughput: 0: 9192.8. Samples: 71215324. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:17:38,386][41256] Avg episode reward: [(0, '85.555')] +[2023-03-11 17:17:39,349][41544] Updated weights for policy 0, policy_version 139120 (0.0005) +[2023-03-11 17:17:43,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9147.7, 300 sec: 9302.8). Total num frames: 71262208. Throughput: 0: 9182.8. Samples: 71242584. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:17:43,386][41256] Avg episode reward: [(0, '89.861')] +[2023-03-11 17:17:43,824][41544] Updated weights for policy 0, policy_version 139200 (0.0005) +[2023-03-11 17:17:48,285][41544] Updated weights for policy 0, policy_version 139280 (0.0005) +[2023-03-11 17:17:48,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9302.8). Total num frames: 71311360. Throughput: 0: 9528.4. Samples: 71298592. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:17:48,386][41256] Avg episode reward: [(0, '89.425')] +[2023-03-11 17:17:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000139280_71311360.pth... +[2023-03-11 17:17:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000138720_71024640.pth +[2023-03-11 17:17:52,903][41544] Updated weights for policy 0, policy_version 139360 (0.0005) +[2023-03-11 17:17:53,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9302.8). Total num frames: 71356416. Throughput: 0: 9614.2. Samples: 71352096. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:17:53,386][41256] Avg episode reward: [(0, '90.273')] +[2023-03-11 17:17:57,496][41544] Updated weights for policy 0, policy_version 139440 (0.0005) +[2023-03-11 17:17:58,386][41256] Fps is (10 sec: 8601.6, 60 sec: 9216.0, 300 sec: 9275.0). Total num frames: 71397376. Throughput: 0: 9515.0. Samples: 71378336. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:17:58,386][41256] Avg episode reward: [(0, '91.634')] +[2023-03-11 17:18:02,185][41544] Updated weights for policy 0, policy_version 139520 (0.0005) +[2023-03-11 17:18:03,386][41256] Fps is (10 sec: 8601.5, 60 sec: 9420.8, 300 sec: 9275.0). Total num frames: 71442432. Throughput: 0: 9337.2. Samples: 71431136. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:18:03,386][41256] Avg episode reward: [(0, '91.429')] +[2023-03-11 17:18:03,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000139536_71442432.pth... +[2023-03-11 17:18:03,393][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000139008_71172096.pth +[2023-03-11 17:18:06,748][41544] Updated weights for policy 0, policy_version 139600 (0.0005) +[2023-03-11 17:18:08,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9420.8, 300 sec: 9275.0). Total num frames: 71487488. Throughput: 0: 9170.3. Samples: 71485204. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:18:08,386][41256] Avg episode reward: [(0, '92.284')] +[2023-03-11 17:18:11,249][41544] Updated weights for policy 0, policy_version 139680 (0.0005) +[2023-03-11 17:18:13,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9261.1). Total num frames: 71532544. Throughput: 0: 9144.2. Samples: 71512160. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:18:13,386][41256] Avg episode reward: [(0, '90.462')] +[2023-03-11 17:18:15,776][41544] Updated weights for policy 0, policy_version 139760 (0.0005) +[2023-03-11 17:18:18,386][41256] Fps is (10 sec: 9011.2, 60 sec: 9216.0, 300 sec: 9261.1). Total num frames: 71577600. Throughput: 0: 9075.6. Samples: 71566624. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:18:18,386][41256] Avg episode reward: [(0, '92.599')] +[2023-03-11 17:18:18,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000139800_71577600.pth... 
+[2023-03-11 17:18:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000139280_71311360.pth +[2023-03-11 17:18:20,340][41544] Updated weights for policy 0, policy_version 139840 (0.0005) +[2023-03-11 17:18:23,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9275.0). Total num frames: 71626752. Throughput: 0: 9036.8. Samples: 71621980. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:18:23,386][41256] Avg episode reward: [(0, '91.188')] +[2023-03-11 17:18:24,656][41544] Updated weights for policy 0, policy_version 139920 (0.0005) +[2023-03-11 17:18:28,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9147.7, 300 sec: 9275.0). Total num frames: 71671808. Throughput: 0: 9065.1. Samples: 71650516. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:18:28,386][41256] Avg episode reward: [(0, '91.048')] +[2023-03-11 17:18:28,942][41544] Updated weights for policy 0, policy_version 140000 (0.0005) +[2023-03-11 17:18:33,252][41544] Updated weights for policy 0, policy_version 140080 (0.0005) +[2023-03-11 17:18:33,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9147.7, 300 sec: 9288.9). Total num frames: 71720960. Throughput: 0: 9087.3. Samples: 71707520. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:18:33,386][41256] Avg episode reward: [(0, '94.658')] +[2023-03-11 17:18:33,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000140080_71720960.pth... +[2023-03-11 17:18:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000139536_71442432.pth +[2023-03-11 17:18:37,545][41544] Updated weights for policy 0, policy_version 140160 (0.0005) +[2023-03-11 17:18:38,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9147.7, 300 sec: 9275.0). Total num frames: 71766016. Throughput: 0: 9170.6. Samples: 71764772. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:18:38,386][41256] Avg episode reward: [(0, '92.254')] +[2023-03-11 17:18:41,864][41544] Updated weights for policy 0, policy_version 140240 (0.0005) +[2023-03-11 17:18:43,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9275.0). Total num frames: 71815168. Throughput: 0: 9208.1. Samples: 71792700. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:18:43,386][41256] Avg episode reward: [(0, '91.324')] +[2023-03-11 17:18:46,114][41544] Updated weights for policy 0, policy_version 140320 (0.0005) +[2023-03-11 17:18:48,386][41256] Fps is (10 sec: 9830.3, 60 sec: 9216.0, 300 sec: 9261.1). Total num frames: 71864320. Throughput: 0: 9335.8. Samples: 71851248. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 17:18:48,386][41256] Avg episode reward: [(0, '88.964')] +[2023-03-11 17:18:48,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000140360_71864320.pth... +[2023-03-11 17:18:48,393][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000139800_71577600.pth +[2023-03-11 17:18:50,604][41544] Updated weights for policy 0, policy_version 140400 (0.0005) +[2023-03-11 17:18:53,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9216.0, 300 sec: 9247.2). Total num frames: 71909376. Throughput: 0: 9328.6. Samples: 71904992. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 17:18:53,386][41256] Avg episode reward: [(0, '91.029')] +[2023-03-11 17:18:55,180][41544] Updated weights for policy 0, policy_version 140480 (0.0005) +[2023-03-11 17:18:58,386][41256] Fps is (10 sec: 8601.6, 60 sec: 9216.0, 300 sec: 9233.4). Total num frames: 71950336. Throughput: 0: 9326.9. Samples: 71931872. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 17:18:58,386][41256] Avg episode reward: [(0, '91.755')] +[2023-03-11 17:18:59,776][41544] Updated weights for policy 0, policy_version 140560 (0.0005) +[2023-03-11 17:19:03,385][41256] Fps is (10 sec: 9011.1, 60 sec: 9284.3, 300 sec: 9233.4). Total num frames: 71999488. Throughput: 0: 9312.1. Samples: 71985668. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 17:19:03,386][41256] Avg episode reward: [(0, '94.837')] +[2023-03-11 17:19:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000140624_71999488.pth... +[2023-03-11 17:19:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000140080_71720960.pth +[2023-03-11 17:19:04,263][41544] Updated weights for policy 0, policy_version 140640 (0.0005) +[2023-03-11 17:19:08,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9284.3, 300 sec: 9219.5). Total num frames: 72044544. Throughput: 0: 9309.3. Samples: 72040900. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 17:19:08,386][41256] Avg episode reward: [(0, '91.415')] +[2023-03-11 17:19:08,630][41544] Updated weights for policy 0, policy_version 140720 (0.0004) +[2023-03-11 17:19:12,878][41544] Updated weights for policy 0, policy_version 140800 (0.0005) +[2023-03-11 17:19:13,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9219.5). Total num frames: 72093696. Throughput: 0: 9321.8. Samples: 72069996. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 17:19:13,386][41256] Avg episode reward: [(0, '90.375')] +[2023-03-11 17:19:17,251][41544] Updated weights for policy 0, policy_version 140880 (0.0005) +[2023-03-11 17:19:18,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9205.6). Total num frames: 72138752. Throughput: 0: 9311.3. Samples: 72126528. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 17:19:18,386][41256] Avg episode reward: [(0, '92.013')] +[2023-03-11 17:19:18,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000140896_72138752.pth... +[2023-03-11 17:19:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000140360_71864320.pth +[2023-03-11 17:19:21,603][41544] Updated weights for policy 0, policy_version 140960 (0.0005) +[2023-03-11 17:19:23,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9219.5). Total num frames: 72187904. Throughput: 0: 9312.1. Samples: 72183816. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 17:19:23,386][41256] Avg episode reward: [(0, '90.160')] +[2023-03-11 17:19:25,828][41544] Updated weights for policy 0, policy_version 141040 (0.0005) +[2023-03-11 17:19:28,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9219.5). Total num frames: 72232960. Throughput: 0: 9329.9. Samples: 72212544. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 17:19:28,386][41256] Avg episode reward: [(0, '88.830')] +[2023-03-11 17:19:30,211][41544] Updated weights for policy 0, policy_version 141120 (0.0005) +[2023-03-11 17:19:33,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9219.5). Total num frames: 72282112. Throughput: 0: 9269.8. Samples: 72268388. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 17:19:33,386][41256] Avg episode reward: [(0, '90.011')] +[2023-03-11 17:19:33,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000141176_72282112.pth... 
+[2023-03-11 17:19:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000140624_71999488.pth +[2023-03-11 17:19:34,533][41544] Updated weights for policy 0, policy_version 141200 (0.0005) +[2023-03-11 17:19:38,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9219.5). Total num frames: 72327168. Throughput: 0: 9353.3. Samples: 72325892. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 17:19:38,386][41256] Avg episode reward: [(0, '89.211')] +[2023-03-11 17:19:38,888][41544] Updated weights for policy 0, policy_version 141280 (0.0003) +[2023-03-11 17:19:43,336][41544] Updated weights for policy 0, policy_version 141360 (0.0004) +[2023-03-11 17:19:43,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9233.4). Total num frames: 72376320. Throughput: 0: 9362.2. Samples: 72353168. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 17:19:43,386][41256] Avg episode reward: [(0, '90.279')] +[2023-03-11 17:19:47,749][41544] Updated weights for policy 0, policy_version 141440 (0.0005) +[2023-03-11 17:19:48,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9219.5). Total num frames: 72421376. Throughput: 0: 9409.5. Samples: 72409096. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 17:19:48,386][41256] Avg episode reward: [(0, '87.296')] +[2023-03-11 17:19:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000141448_72421376.pth... +[2023-03-11 17:19:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000140896_72138752.pth +[2023-03-11 17:19:52,031][41544] Updated weights for policy 0, policy_version 141520 (0.0004) +[2023-03-11 17:19:53,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9233.4). Total num frames: 72470528. Throughput: 0: 9451.0. Samples: 72466196. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 17:19:53,386][41256] Avg episode reward: [(0, '87.018')] +[2023-03-11 17:19:56,344][41544] Updated weights for policy 0, policy_version 141600 (0.0005) +[2023-03-11 17:19:58,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9420.8, 300 sec: 9233.4). Total num frames: 72515584. Throughput: 0: 9446.9. Samples: 72495104. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 17:19:58,386][41256] Avg episode reward: [(0, '85.780')] +[2023-03-11 17:20:00,812][41544] Updated weights for policy 0, policy_version 141680 (0.0005) +[2023-03-11 17:20:03,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9219.5). Total num frames: 72560640. Throughput: 0: 9400.5. Samples: 72549552. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 17:20:03,386][41256] Avg episode reward: [(0, '89.282')] +[2023-03-11 17:20:03,401][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000141728_72564736.pth... +[2023-03-11 17:20:03,403][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000141176_72282112.pth +[2023-03-11 17:20:05,156][41544] Updated weights for policy 0, policy_version 141760 (0.0005) +[2023-03-11 17:20:08,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9233.4). Total num frames: 72609792. Throughput: 0: 9382.5. Samples: 72606028. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 17:20:08,386][41256] Avg episode reward: [(0, '87.910')] +[2023-03-11 17:20:09,545][41544] Updated weights for policy 0, policy_version 141840 (0.0005) +[2023-03-11 17:20:13,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9233.4). Total num frames: 72654848. Throughput: 0: 9373.4. Samples: 72634348. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 17:20:13,386][41256] Avg episode reward: [(0, '84.585')] +[2023-03-11 17:20:14,016][41544] Updated weights for policy 0, policy_version 141920 (0.0004) +[2023-03-11 17:20:18,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9352.6, 300 sec: 9219.5). Total num frames: 72699904. Throughput: 0: 9337.8. Samples: 72688588. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 17:20:18,386][41256] Avg episode reward: [(0, '86.248')] +[2023-03-11 17:20:18,454][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000142000_72704000.pth... +[2023-03-11 17:20:18,455][41544] Updated weights for policy 0, policy_version 142000 (0.0005) +[2023-03-11 17:20:18,456][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000141448_72421376.pth +[2023-03-11 17:20:22,840][41544] Updated weights for policy 0, policy_version 142080 (0.0005) +[2023-03-11 17:20:23,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9233.4). Total num frames: 72749056. Throughput: 0: 9314.1. Samples: 72745024. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 17:20:23,386][41256] Avg episode reward: [(0, '84.672')] +[2023-03-11 17:20:27,271][41544] Updated weights for policy 0, policy_version 142160 (0.0005) +[2023-03-11 17:20:28,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9219.5). Total num frames: 72794112. Throughput: 0: 9335.4. Samples: 72773260. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 17:20:28,386][41256] Avg episode reward: [(0, '83.599')] +[2023-03-11 17:20:31,765][41544] Updated weights for policy 0, policy_version 142240 (0.0005) +[2023-03-11 17:20:33,386][41256] Fps is (10 sec: 9011.1, 60 sec: 9284.3, 300 sec: 9205.6). Total num frames: 72839168. Throughput: 0: 9294.1. Samples: 72827332. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 17:20:33,386][41256] Avg episode reward: [(0, '84.505')] +[2023-03-11 17:20:33,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000142264_72839168.pth... +[2023-03-11 17:20:33,393][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000141728_72564736.pth +[2023-03-11 17:20:36,259][41544] Updated weights for policy 0, policy_version 142320 (0.0005) +[2023-03-11 17:20:38,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9284.3, 300 sec: 9191.7). Total num frames: 72884224. Throughput: 0: 9260.7. Samples: 72882928. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 17:20:38,386][41256] Avg episode reward: [(0, '85.010')] +[2023-03-11 17:20:40,678][41544] Updated weights for policy 0, policy_version 142400 (0.0005) +[2023-03-11 17:20:43,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9191.7). Total num frames: 72933376. Throughput: 0: 9227.0. Samples: 72910320. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 17:20:43,386][41256] Avg episode reward: [(0, '84.211')] +[2023-03-11 17:20:45,016][41544] Updated weights for policy 0, policy_version 142480 (0.0005) +[2023-03-11 17:20:48,385][41256] Fps is (10 sec: 9420.7, 60 sec: 9284.3, 300 sec: 9163.9). Total num frames: 72978432. Throughput: 0: 9278.9. Samples: 72967104. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 17:20:48,386][41256] Avg episode reward: [(0, '83.576')] +[2023-03-11 17:20:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000142536_72978432.pth... 
+[2023-03-11 17:20:48,390][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000142000_72704000.pth +[2023-03-11 17:20:49,259][41544] Updated weights for policy 0, policy_version 142560 (0.0005) +[2023-03-11 17:20:53,359][41544] Updated weights for policy 0, policy_version 142640 (0.0004) +[2023-03-11 17:20:53,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9352.5, 300 sec: 9177.8). Total num frames: 73031680. Throughput: 0: 9343.0. Samples: 73026464. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 17:20:53,397][41256] Avg episode reward: [(0, '81.238')] +[2023-03-11 17:20:57,482][41544] Updated weights for policy 0, policy_version 142720 (0.0005) +[2023-03-11 17:20:58,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9420.8, 300 sec: 9177.8). Total num frames: 73080832. Throughput: 0: 9377.1. Samples: 73056320. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:20:58,386][41256] Avg episode reward: [(0, '82.423')] +[2023-03-11 17:21:01,753][41544] Updated weights for policy 0, policy_version 142800 (0.0005) +[2023-03-11 17:21:03,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9420.8, 300 sec: 9177.8). Total num frames: 73125888. Throughput: 0: 9458.5. Samples: 73114224. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:21:03,386][41256] Avg episode reward: [(0, '80.709')] +[2023-03-11 17:21:03,391][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000142824_73125888.pth... +[2023-03-11 17:21:03,393][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000142264_72839168.pth +[2023-03-11 17:21:06,076][41544] Updated weights for policy 0, policy_version 142880 (0.0005) +[2023-03-11 17:21:08,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9191.7). Total num frames: 73175040. Throughput: 0: 9466.5. Samples: 73171016. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:21:08,386][41256] Avg episode reward: [(0, '82.003')] +[2023-03-11 17:21:10,500][41544] Updated weights for policy 0, policy_version 142960 (0.0005) +[2023-03-11 17:21:13,385][41256] Fps is (10 sec: 9421.0, 60 sec: 9420.8, 300 sec: 9191.7). Total num frames: 73220096. Throughput: 0: 9466.2. Samples: 73199240. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:21:13,386][41256] Avg episode reward: [(0, '80.031')] +[2023-03-11 17:21:14,833][41544] Updated weights for policy 0, policy_version 143040 (0.0005) +[2023-03-11 17:21:18,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9489.0, 300 sec: 9205.6). Total num frames: 73269248. Throughput: 0: 9522.3. Samples: 73255836. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:21:18,386][41256] Avg episode reward: [(0, '77.676')] +[2023-03-11 17:21:18,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000143104_73269248.pth... +[2023-03-11 17:21:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000142536_72978432.pth +[2023-03-11 17:21:19,009][41544] Updated weights for policy 0, policy_version 143120 (0.0004) +[2023-03-11 17:21:23,215][41544] Updated weights for policy 0, policy_version 143200 (0.0005) +[2023-03-11 17:21:23,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9489.1, 300 sec: 9219.5). Total num frames: 73318400. Throughput: 0: 9589.1. Samples: 73314440. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:21:23,386][41256] Avg episode reward: [(0, '82.356')] +[2023-03-11 17:21:27,448][41544] Updated weights for policy 0, policy_version 143280 (0.0005) +[2023-03-11 17:21:28,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9275.0). Total num frames: 73367552. Throughput: 0: 9638.1. Samples: 73344036. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:21:28,386][41256] Avg episode reward: [(0, '80.217')] +[2023-03-11 17:21:31,781][41544] Updated weights for policy 0, policy_version 143360 (0.0004) +[2023-03-11 17:21:33,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9302.8). Total num frames: 73412608. Throughput: 0: 9633.4. Samples: 73400608. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:21:33,386][41256] Avg episode reward: [(0, '76.978')] +[2023-03-11 17:21:33,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000143384_73412608.pth... +[2023-03-11 17:21:33,393][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000142824_73125888.pth +[2023-03-11 17:21:36,274][41544] Updated weights for policy 0, policy_version 143440 (0.0005) +[2023-03-11 17:21:38,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9557.3, 300 sec: 9302.8). Total num frames: 73457664. Throughput: 0: 9547.7. Samples: 73456112. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:21:38,386][41256] Avg episode reward: [(0, '79.990')] +[2023-03-11 17:21:40,759][41544] Updated weights for policy 0, policy_version 143520 (0.0005) +[2023-03-11 17:21:43,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9489.1, 300 sec: 9302.8). Total num frames: 73502720. Throughput: 0: 9477.0. Samples: 73482784. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:21:43,386][41256] Avg episode reward: [(0, '74.975')] +[2023-03-11 17:21:45,193][41544] Updated weights for policy 0, policy_version 143600 (0.0005) +[2023-03-11 17:21:48,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9316.7). Total num frames: 73551872. Throughput: 0: 9441.1. Samples: 73539072. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:21:48,386][41256] Avg episode reward: [(0, '75.997')] +[2023-03-11 17:21:48,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000143656_73551872.pth... +[2023-03-11 17:21:48,393][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000143104_73269248.pth +[2023-03-11 17:21:49,583][41544] Updated weights for policy 0, policy_version 143680 (0.0005) +[2023-03-11 17:21:53,385][41256] Fps is (10 sec: 9420.7, 60 sec: 9420.8, 300 sec: 9330.5). Total num frames: 73596928. Throughput: 0: 9413.9. Samples: 73594640. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:21:53,386][41256] Avg episode reward: [(0, '76.275')] +[2023-03-11 17:21:53,984][41544] Updated weights for policy 0, policy_version 143760 (0.0005) +[2023-03-11 17:21:58,247][41544] Updated weights for policy 0, policy_version 143840 (0.0005) +[2023-03-11 17:21:58,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9386.1). Total num frames: 73646080. Throughput: 0: 9396.2. Samples: 73622068. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:21:58,386][41256] Avg episode reward: [(0, '78.807')] +[2023-03-11 17:22:02,327][41544] Updated weights for policy 0, policy_version 143920 (0.0005) +[2023-03-11 17:22:03,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9489.1, 300 sec: 9400.0). Total num frames: 73695232. Throughput: 0: 9482.5. Samples: 73682548. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:22:03,386][41256] Avg episode reward: [(0, '78.235')] +[2023-03-11 17:22:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000143936_73695232.pth... 
+[2023-03-11 17:22:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000143384_73412608.pth +[2023-03-11 17:22:06,322][41544] Updated weights for policy 0, policy_version 144000 (0.0005) +[2023-03-11 17:22:08,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9489.1, 300 sec: 9386.1). Total num frames: 73744384. Throughput: 0: 9522.8. Samples: 73742968. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:22:08,386][41256] Avg episode reward: [(0, '78.763')] +[2023-03-11 17:22:10,575][41544] Updated weights for policy 0, policy_version 144080 (0.0005) +[2023-03-11 17:22:13,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9386.1). Total num frames: 73793536. Throughput: 0: 9501.3. Samples: 73771596. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:22:13,386][41256] Avg episode reward: [(0, '81.125')] +[2023-03-11 17:22:14,859][41544] Updated weights for policy 0, policy_version 144160 (0.0006) +[2023-03-11 17:22:18,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9386.1). Total num frames: 73842688. Throughput: 0: 9519.8. Samples: 73829000. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:22:18,386][41256] Avg episode reward: [(0, '83.848')] +[2023-03-11 17:22:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000144224_73842688.pth... +[2023-03-11 17:22:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000143656_73551872.pth +[2023-03-11 17:22:19,000][41544] Updated weights for policy 0, policy_version 144240 (0.0005) +[2023-03-11 17:22:23,077][41544] Updated weights for policy 0, policy_version 144320 (0.0005) +[2023-03-11 17:22:23,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9557.3, 300 sec: 9386.1). Total num frames: 73891840. Throughput: 0: 9626.9. Samples: 73889324. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:22:23,386][41256] Avg episode reward: [(0, '79.914')] +[2023-03-11 17:22:27,256][41544] Updated weights for policy 0, policy_version 144400 (0.0005) +[2023-03-11 17:22:28,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9386.1). Total num frames: 73940992. Throughput: 0: 9709.8. Samples: 73919724. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:22:28,386][41256] Avg episode reward: [(0, '83.660')] +[2023-03-11 17:22:31,643][41544] Updated weights for policy 0, policy_version 144480 (0.0006) +[2023-03-11 17:22:33,385][41256] Fps is (10 sec: 9830.3, 60 sec: 9625.6, 300 sec: 9400.0). Total num frames: 73990144. Throughput: 0: 9703.2. Samples: 73975716. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:22:33,386][41256] Avg episode reward: [(0, '87.271')] +[2023-03-11 17:22:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000144512_73990144.pth... +[2023-03-11 17:22:33,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000143936_73695232.pth +[2023-03-11 17:22:35,983][41544] Updated weights for policy 0, policy_version 144560 (0.0005) +[2023-03-11 17:22:38,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9400.0). Total num frames: 74035200. Throughput: 0: 9726.9. Samples: 74032352. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:22:38,386][41256] Avg episode reward: [(0, '84.099')] +[2023-03-11 17:22:40,227][41544] Updated weights for policy 0, policy_version 144640 (0.0004) +[2023-03-11 17:22:43,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9625.6, 300 sec: 9386.1). Total num frames: 74080256. Throughput: 0: 9759.2. Samples: 74061232. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:22:43,386][41256] Avg episode reward: [(0, '86.219')] +[2023-03-11 17:22:44,819][41544] Updated weights for policy 0, policy_version 144720 (0.0005) +[2023-03-11 17:22:48,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9557.3, 300 sec: 9386.1). Total num frames: 74125312. Throughput: 0: 9595.7. Samples: 74114352. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:22:48,386][41256] Avg episode reward: [(0, '85.286')] +[2023-03-11 17:22:48,388][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000144776_74125312.pth... +[2023-03-11 17:22:48,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000144224_73842688.pth +[2023-03-11 17:22:49,372][41544] Updated weights for policy 0, policy_version 144800 (0.0006) +[2023-03-11 17:22:53,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9625.6, 300 sec: 9413.9). Total num frames: 74174464. Throughput: 0: 9496.2. Samples: 74170296. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:22:53,386][41256] Avg episode reward: [(0, '85.465')] +[2023-03-11 17:22:53,795][41544] Updated weights for policy 0, policy_version 144880 (0.0005) +[2023-03-11 17:22:58,154][41544] Updated weights for policy 0, policy_version 144960 (0.0005) +[2023-03-11 17:22:58,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9413.9). Total num frames: 74219520. Throughput: 0: 9470.7. Samples: 74197776. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:22:58,386][41256] Avg episode reward: [(0, '87.040')] +[2023-03-11 17:23:02,529][41544] Updated weights for policy 0, policy_version 145040 (0.0005) +[2023-03-11 17:23:03,386][41256] Fps is (10 sec: 9011.1, 60 sec: 9489.1, 300 sec: 9413.9). Total num frames: 74264576. Throughput: 0: 9445.9. Samples: 74254068. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:23:03,386][41256] Avg episode reward: [(0, '84.791')] +[2023-03-11 17:23:03,405][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000145056_74268672.pth... +[2023-03-11 17:23:03,406][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000144512_73990144.pth +[2023-03-11 17:23:06,861][41544] Updated weights for policy 0, policy_version 145120 (0.0005) +[2023-03-11 17:23:08,385][41256] Fps is (10 sec: 9420.7, 60 sec: 9489.1, 300 sec: 9427.7). Total num frames: 74313728. Throughput: 0: 9363.5. Samples: 74310680. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:23:08,386][41256] Avg episode reward: [(0, '87.602')] +[2023-03-11 17:23:10,960][41544] Updated weights for policy 0, policy_version 145200 (0.0005) +[2023-03-11 17:23:13,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9489.1, 300 sec: 9441.6). Total num frames: 74362880. Throughput: 0: 9363.1. Samples: 74341064. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:23:13,386][41256] Avg episode reward: [(0, '82.797')] +[2023-03-11 17:23:15,071][41544] Updated weights for policy 0, policy_version 145280 (0.0005) +[2023-03-11 17:23:18,386][41256] Fps is (10 sec: 10240.0, 60 sec: 9557.3, 300 sec: 9455.5). Total num frames: 74416128. Throughput: 0: 9463.2. Samples: 74401560. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:23:18,386][41256] Avg episode reward: [(0, '85.941')] +[2023-03-11 17:23:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000145344_74416128.pth... 
+[2023-03-11 17:23:18,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000144776_74125312.pth +[2023-03-11 17:23:19,055][41544] Updated weights for policy 0, policy_version 145360 (0.0004) +[2023-03-11 17:23:23,109][41544] Updated weights for policy 0, policy_version 145440 (0.0004) +[2023-03-11 17:23:23,385][41256] Fps is (10 sec: 10240.1, 60 sec: 9557.3, 300 sec: 9469.4). Total num frames: 74465280. Throughput: 0: 9555.3. Samples: 74462340. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:23:23,386][41256] Avg episode reward: [(0, '84.079')] +[2023-03-11 17:23:27,231][41544] Updated weights for policy 0, policy_version 145520 (0.0004) +[2023-03-11 17:23:28,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9469.4). Total num frames: 74514432. Throughput: 0: 9588.9. Samples: 74492732. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:23:28,386][41256] Avg episode reward: [(0, '83.686')] +[2023-03-11 17:23:31,409][41544] Updated weights for policy 0, policy_version 145600 (0.0005) +[2023-03-11 17:23:33,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9483.3). Total num frames: 74563584. Throughput: 0: 9711.3. Samples: 74551360. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:23:33,386][41256] Avg episode reward: [(0, '86.442')] +[2023-03-11 17:23:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000145632_74563584.pth... +[2023-03-11 17:23:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000145056_74268672.pth +[2023-03-11 17:23:35,796][41544] Updated weights for policy 0, policy_version 145680 (0.0005) +[2023-03-11 17:23:38,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9469.4). Total num frames: 74608640. Throughput: 0: 9699.7. Samples: 74606784. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:23:38,386][41256] Avg episode reward: [(0, '85.092')] +[2023-03-11 17:23:40,348][41544] Updated weights for policy 0, policy_version 145760 (0.0005) +[2023-03-11 17:23:43,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9557.3, 300 sec: 9455.5). Total num frames: 74653696. Throughput: 0: 9678.0. Samples: 74633288. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:23:43,386][41256] Avg episode reward: [(0, '85.308')] +[2023-03-11 17:23:44,859][41544] Updated weights for policy 0, policy_version 145840 (0.0005) +[2023-03-11 17:23:48,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9625.6, 300 sec: 9469.4). Total num frames: 74702848. Throughput: 0: 9654.3. Samples: 74688512. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:23:48,386][41256] Avg episode reward: [(0, '85.802')] +[2023-03-11 17:23:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000145904_74702848.pth... +[2023-03-11 17:23:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000145344_74416128.pth +[2023-03-11 17:23:49,207][41544] Updated weights for policy 0, policy_version 145920 (0.0005) +[2023-03-11 17:23:53,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9483.3). Total num frames: 74747904. Throughput: 0: 9665.2. Samples: 74745616. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:23:53,386][41256] Avg episode reward: [(0, '82.665')] +[2023-03-11 17:23:53,519][41544] Updated weights for policy 0, policy_version 146000 (0.0006) +[2023-03-11 17:23:57,921][41544] Updated weights for policy 0, policy_version 146080 (0.0006) +[2023-03-11 17:23:58,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9625.6, 300 sec: 9483.3). Total num frames: 74797056. Throughput: 0: 9612.6. Samples: 74773632. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:23:58,386][41256] Avg episode reward: [(0, '85.507')] +[2023-03-11 17:24:02,238][41544] Updated weights for policy 0, policy_version 146160 (0.0004) +[2023-03-11 17:24:03,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9483.3). Total num frames: 74842112. Throughput: 0: 9518.6. Samples: 74829896. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:24:03,386][41256] Avg episode reward: [(0, '84.246')] +[2023-03-11 17:24:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000146176_74842112.pth... +[2023-03-11 17:24:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000145632_74563584.pth +[2023-03-11 17:24:06,331][41544] Updated weights for policy 0, policy_version 146240 (0.0003) +[2023-03-11 17:24:08,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9483.3). Total num frames: 74891264. Throughput: 0: 9490.0. Samples: 74889388. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:24:08,386][41256] Avg episode reward: [(0, '85.192')] +[2023-03-11 17:24:10,581][41544] Updated weights for policy 0, policy_version 146320 (0.0003) +[2023-03-11 17:24:13,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9625.6, 300 sec: 9497.2). Total num frames: 74940416. Throughput: 0: 9461.8. Samples: 74918512. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:24:13,386][41256] Avg episode reward: [(0, '87.216')] +[2023-03-11 17:24:14,832][41544] Updated weights for policy 0, policy_version 146400 (0.0003) +[2023-03-11 17:24:18,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9497.2). Total num frames: 74989568. Throughput: 0: 9437.2. Samples: 74976032. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:24:18,386][41256] Avg episode reward: [(0, '85.176')] +[2023-03-11 17:24:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000146464_74989568.pth... +[2023-03-11 17:24:18,390][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000145904_74702848.pth +[2023-03-11 17:24:19,258][41544] Updated weights for policy 0, policy_version 146480 (0.0004) +[2023-03-11 17:24:23,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9497.2). Total num frames: 75034624. Throughput: 0: 9419.6. Samples: 75030664. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:24:23,386][41256] Avg episode reward: [(0, '86.808')] +[2023-03-11 17:24:23,660][41544] Updated weights for policy 0, policy_version 146560 (0.0005) +[2023-03-11 17:24:28,119][41544] Updated weights for policy 0, policy_version 146640 (0.0005) +[2023-03-11 17:24:28,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9420.8, 300 sec: 9483.3). Total num frames: 75079680. Throughput: 0: 9464.7. Samples: 75059200. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:24:28,386][41256] Avg episode reward: [(0, '85.354')] +[2023-03-11 17:24:32,390][41544] Updated weights for policy 0, policy_version 146720 (0.0005) +[2023-03-11 17:24:33,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9420.8, 300 sec: 9497.2). Total num frames: 75128832. Throughput: 0: 9490.2. Samples: 75115572. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:24:33,386][41256] Avg episode reward: [(0, '84.255')] +[2023-03-11 17:24:33,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000146736_75128832.pth... 
+[2023-03-11 17:24:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000146176_74842112.pth +[2023-03-11 17:24:36,783][41544] Updated weights for policy 0, policy_version 146800 (0.0004) +[2023-03-11 17:24:38,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9483.3). Total num frames: 75173888. Throughput: 0: 9460.9. Samples: 75171356. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:24:38,386][41256] Avg episode reward: [(0, '86.251')] +[2023-03-11 17:24:41,247][41544] Updated weights for policy 0, policy_version 146880 (0.0006) +[2023-03-11 17:24:43,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9420.8, 300 sec: 9483.3). Total num frames: 75218944. Throughput: 0: 9442.4. Samples: 75198540. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:24:43,386][41256] Avg episode reward: [(0, '83.896')] +[2023-03-11 17:24:45,725][41544] Updated weights for policy 0, policy_version 146960 (0.0005) +[2023-03-11 17:24:48,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9483.3). Total num frames: 75268096. Throughput: 0: 9433.1. Samples: 75254384. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:24:48,386][41256] Avg episode reward: [(0, '83.441')] +[2023-03-11 17:24:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000147008_75268096.pth... +[2023-03-11 17:24:48,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000146464_74989568.pth +[2023-03-11 17:24:50,133][41544] Updated weights for policy 0, policy_version 147040 (0.0005) +[2023-03-11 17:24:53,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9483.3). Total num frames: 75313152. Throughput: 0: 9327.4. Samples: 75309120. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:24:53,386][41256] Avg episode reward: [(0, '82.605')] +[2023-03-11 17:24:54,572][41544] Updated weights for policy 0, policy_version 147120 (0.0005) +[2023-03-11 17:24:58,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9497.2). Total num frames: 75362304. Throughput: 0: 9317.5. Samples: 75337800. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:24:58,386][41256] Avg episode reward: [(0, '82.087')] +[2023-03-11 17:24:58,705][41544] Updated weights for policy 0, policy_version 147200 (0.0005) +[2023-03-11 17:25:02,776][41544] Updated weights for policy 0, policy_version 147280 (0.0003) +[2023-03-11 17:25:03,386][41256] Fps is (10 sec: 9830.3, 60 sec: 9489.1, 300 sec: 9497.2). Total num frames: 75411456. Throughput: 0: 9379.3. Samples: 75398100. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:25:03,386][41256] Avg episode reward: [(0, '82.384')] +[2023-03-11 17:25:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000147288_75411456.pth... +[2023-03-11 17:25:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000146736_75128832.pth +[2023-03-11 17:25:07,047][41544] Updated weights for policy 0, policy_version 147360 (0.0005) +[2023-03-11 17:25:08,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9489.1, 300 sec: 9511.0). Total num frames: 75460608. Throughput: 0: 9463.3. Samples: 75456512. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:25:08,386][41256] Avg episode reward: [(0, '84.711')] +[2023-03-11 17:25:11,413][41544] Updated weights for policy 0, policy_version 147440 (0.0005) +[2023-03-11 17:25:13,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9420.8, 300 sec: 9511.0). Total num frames: 75505664. Throughput: 0: 9442.1. Samples: 75484096. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:25:13,386][41256] Avg episode reward: [(0, '83.668')] +[2023-03-11 17:25:15,827][41544] Updated weights for policy 0, policy_version 147520 (0.0005) +[2023-03-11 17:25:18,386][41256] Fps is (10 sec: 9011.1, 60 sec: 9352.5, 300 sec: 9497.2). Total num frames: 75550720. Throughput: 0: 9424.8. Samples: 75539688. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:25:18,386][41256] Avg episode reward: [(0, '86.892')] +[2023-03-11 17:25:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000147560_75550720.pth... +[2023-03-11 17:25:18,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000147008_75268096.pth +[2023-03-11 17:25:20,236][41544] Updated weights for policy 0, policy_version 147600 (0.0005) +[2023-03-11 17:25:23,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9511.0). Total num frames: 75599872. Throughput: 0: 9427.1. Samples: 75595576. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:25:23,386][41256] Avg episode reward: [(0, '88.153')] +[2023-03-11 17:25:24,700][41544] Updated weights for policy 0, policy_version 147680 (0.0003) +[2023-03-11 17:25:28,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9511.1). Total num frames: 75644928. Throughput: 0: 9435.4. Samples: 75623132. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:25:28,386][41256] Avg episode reward: [(0, '86.377')] +[2023-03-11 17:25:29,069][41544] Updated weights for policy 0, policy_version 147760 (0.0005) +[2023-03-11 17:25:33,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9511.0). Total num frames: 75689984. Throughput: 0: 9430.2. Samples: 75678744. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:25:33,386][41256] Avg episode reward: [(0, '87.786')] +[2023-03-11 17:25:33,447][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000147840_75694080.pth... +[2023-03-11 17:25:33,448][41544] Updated weights for policy 0, policy_version 147840 (0.0005) +[2023-03-11 17:25:33,449][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000147288_75411456.pth +[2023-03-11 17:25:37,905][41544] Updated weights for policy 0, policy_version 147920 (0.0005) +[2023-03-11 17:25:38,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9511.1). Total num frames: 75739136. Throughput: 0: 9464.4. Samples: 75735020. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:25:38,386][41256] Avg episode reward: [(0, '88.396')] +[2023-03-11 17:25:42,296][41544] Updated weights for policy 0, policy_version 148000 (0.0005) +[2023-03-11 17:25:43,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9511.1). Total num frames: 75784192. Throughput: 0: 9432.7. Samples: 75762272. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:25:43,386][41256] Avg episode reward: [(0, '86.974')] +[2023-03-11 17:25:46,648][41544] Updated weights for policy 0, policy_version 148080 (0.0004) +[2023-03-11 17:25:48,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9483.3). Total num frames: 75829248. Throughput: 0: 9349.6. Samples: 75818832. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:25:48,386][41256] Avg episode reward: [(0, '86.800')] +[2023-03-11 17:25:48,444][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000148112_75833344.pth... 
+[2023-03-11 17:25:48,445][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000147560_75550720.pth +[2023-03-11 17:25:51,107][41544] Updated weights for policy 0, policy_version 148160 (0.0004) +[2023-03-11 17:25:53,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9483.3). Total num frames: 75878400. Throughput: 0: 9280.1. Samples: 75874116. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:25:53,386][41256] Avg episode reward: [(0, '89.536')] +[2023-03-11 17:25:55,588][41544] Updated weights for policy 0, policy_version 148240 (0.0005) +[2023-03-11 17:25:58,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9483.3). Total num frames: 75923456. Throughput: 0: 9271.8. Samples: 75901328. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:25:58,386][41256] Avg episode reward: [(0, '85.999')] +[2023-03-11 17:25:59,988][41544] Updated weights for policy 0, policy_version 148320 (0.0005) +[2023-03-11 17:26:03,386][41256] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9469.4). Total num frames: 75968512. Throughput: 0: 9270.1. Samples: 75956844. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:26:03,386][41256] Avg episode reward: [(0, '87.941')] +[2023-03-11 17:26:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000148376_75968512.pth... +[2023-03-11 17:26:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000147840_75694080.pth +[2023-03-11 17:26:04,380][41544] Updated weights for policy 0, policy_version 148400 (0.0005) +[2023-03-11 17:26:08,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9483.3). Total num frames: 76017664. Throughput: 0: 9284.4. Samples: 76013376. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:26:08,386][41256] Avg episode reward: [(0, '86.743')] +[2023-03-11 17:26:08,765][41544] Updated weights for policy 0, policy_version 148480 (0.0005) +[2023-03-11 17:26:13,245][41544] Updated weights for policy 0, policy_version 148560 (0.0005) +[2023-03-11 17:26:13,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9284.3, 300 sec: 9469.4). Total num frames: 76062720. Throughput: 0: 9286.6. Samples: 76041028. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:26:13,386][41256] Avg episode reward: [(0, '86.384')] +[2023-03-11 17:26:17,792][41544] Updated weights for policy 0, policy_version 148640 (0.0005) +[2023-03-11 17:26:18,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9455.5). Total num frames: 76107776. Throughput: 0: 9261.2. Samples: 76095496. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:26:18,386][41256] Avg episode reward: [(0, '85.091')] +[2023-03-11 17:26:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000148648_76107776.pth... +[2023-03-11 17:26:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000148112_75833344.pth +[2023-03-11 17:26:22,315][41544] Updated weights for policy 0, policy_version 148720 (0.0005) +[2023-03-11 17:26:23,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9216.0, 300 sec: 9441.6). Total num frames: 76152832. Throughput: 0: 9202.1. Samples: 76149112. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:26:23,386][41256] Avg episode reward: [(0, '85.578')] +[2023-03-11 17:26:26,809][41544] Updated weights for policy 0, policy_version 148800 (0.0005) +[2023-03-11 17:26:28,386][41256] Fps is (10 sec: 9011.1, 60 sec: 9216.0, 300 sec: 9441.6). Total num frames: 76197888. Throughput: 0: 9214.8. Samples: 76176940. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:26:28,386][41256] Avg episode reward: [(0, '85.398')] +[2023-03-11 17:26:31,345][41544] Updated weights for policy 0, policy_version 148880 (0.0005) +[2023-03-11 17:26:33,386][41256] Fps is (10 sec: 9011.1, 60 sec: 9216.0, 300 sec: 9441.6). Total num frames: 76242944. Throughput: 0: 9153.2. Samples: 76230728. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:26:33,386][41256] Avg episode reward: [(0, '84.717')] +[2023-03-11 17:26:33,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000148912_76242944.pth... +[2023-03-11 17:26:33,393][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000148376_75968512.pth +[2023-03-11 17:26:35,795][41544] Updated weights for policy 0, policy_version 148960 (0.0005) +[2023-03-11 17:26:38,386][41256] Fps is (10 sec: 9011.3, 60 sec: 9147.7, 300 sec: 9441.6). Total num frames: 76288000. Throughput: 0: 9169.2. Samples: 76286732. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:26:38,386][41256] Avg episode reward: [(0, '84.998')] +[2023-03-11 17:26:40,217][41544] Updated weights for policy 0, policy_version 149040 (0.0005) +[2023-03-11 17:26:43,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9216.0, 300 sec: 9441.6). Total num frames: 76337152. Throughput: 0: 9180.0. Samples: 76314428. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:26:43,386][41256] Avg episode reward: [(0, '84.330')] +[2023-03-11 17:26:44,688][41544] Updated weights for policy 0, policy_version 149120 (0.0005) +[2023-03-11 17:26:48,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9441.6). Total num frames: 76382208. Throughput: 0: 9181.2. Samples: 76370000. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:26:48,386][41256] Avg episode reward: [(0, '86.671')] +[2023-03-11 17:26:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000149184_76382208.pth... +[2023-03-11 17:26:48,390][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000148648_76107776.pth +[2023-03-11 17:26:48,860][41544] Updated weights for policy 0, policy_version 149200 (0.0004) +[2023-03-11 17:26:52,994][41544] Updated weights for policy 0, policy_version 149280 (0.0004) +[2023-03-11 17:26:53,385][41256] Fps is (10 sec: 9420.7, 60 sec: 9216.0, 300 sec: 9441.6). Total num frames: 76431360. Throughput: 0: 9256.8. Samples: 76429932. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:26:53,386][41256] Avg episode reward: [(0, '84.569')] +[2023-03-11 17:26:57,197][41544] Updated weights for policy 0, policy_version 149360 (0.0005) +[2023-03-11 17:26:58,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9284.3, 300 sec: 9441.6). Total num frames: 76480512. Throughput: 0: 9290.5. Samples: 76459100. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:26:58,386][41256] Avg episode reward: [(0, '86.891')] +[2023-03-11 17:27:01,345][41544] Updated weights for policy 0, policy_version 149440 (0.0005) +[2023-03-11 17:27:03,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9352.5, 300 sec: 9441.6). Total num frames: 76529664. Throughput: 0: 9390.7. Samples: 76518076. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:27:03,386][41256] Avg episode reward: [(0, '88.469')] +[2023-03-11 17:27:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000149472_76529664.pth... 
+[2023-03-11 17:27:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000148912_76242944.pth +[2023-03-11 17:27:05,645][41544] Updated weights for policy 0, policy_version 149520 (0.0005) +[2023-03-11 17:27:08,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9352.5, 300 sec: 9441.6). Total num frames: 76578816. Throughput: 0: 9459.4. Samples: 76574784. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:27:08,386][41256] Avg episode reward: [(0, '85.372')] +[2023-03-11 17:27:10,047][41544] Updated weights for policy 0, policy_version 149600 (0.0005) +[2023-03-11 17:27:13,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9352.5, 300 sec: 9427.7). Total num frames: 76623872. Throughput: 0: 9473.4. Samples: 76603244. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:27:13,386][41256] Avg episode reward: [(0, '88.077')] +[2023-03-11 17:27:14,255][41544] Updated weights for policy 0, policy_version 149680 (0.0005) +[2023-03-11 17:27:18,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9427.7). Total num frames: 76673024. Throughput: 0: 9557.9. Samples: 76660832. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:27:18,386][41256] Avg episode reward: [(0, '86.834')] +[2023-03-11 17:27:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000149752_76673024.pth... +[2023-03-11 17:27:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000149184_76382208.pth +[2023-03-11 17:27:18,594][41544] Updated weights for policy 0, policy_version 149760 (0.0005) +[2023-03-11 17:27:23,080][41544] Updated weights for policy 0, policy_version 149840 (0.0006) +[2023-03-11 17:27:23,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9413.9). Total num frames: 76718080. Throughput: 0: 9539.2. Samples: 76715996. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:27:23,386][41256] Avg episode reward: [(0, '85.942')] +[2023-03-11 17:27:27,513][41544] Updated weights for policy 0, policy_version 149920 (0.0005) +[2023-03-11 17:27:28,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9413.9). Total num frames: 76767232. Throughput: 0: 9517.8. Samples: 76742728. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:27:28,386][41256] Avg episode reward: [(0, '88.698')] +[2023-03-11 17:27:31,739][41544] Updated weights for policy 0, policy_version 150000 (0.0005) +[2023-03-11 17:27:33,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9489.1, 300 sec: 9413.9). Total num frames: 76812288. Throughput: 0: 9572.8. Samples: 76800776. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:27:33,386][41256] Avg episode reward: [(0, '84.601')] +[2023-03-11 17:27:33,423][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000150032_76816384.pth... +[2023-03-11 17:27:33,424][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000149472_76529664.pth +[2023-03-11 17:27:36,131][41544] Updated weights for policy 0, policy_version 150080 (0.0005) +[2023-03-11 17:27:38,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9427.7). Total num frames: 76861440. Throughput: 0: 9498.1. Samples: 76857344. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:27:38,386][41256] Avg episode reward: [(0, '87.319')] +[2023-03-11 17:27:40,361][41544] Updated weights for policy 0, policy_version 150160 (0.0005) +[2023-03-11 17:27:43,385][41256] Fps is (10 sec: 9830.3, 60 sec: 9557.3, 300 sec: 9441.6). Total num frames: 76910592. Throughput: 0: 9496.3. Samples: 76886432. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:27:43,386][41256] Avg episode reward: [(0, '85.764')] +[2023-03-11 17:27:44,461][41544] Updated weights for policy 0, policy_version 150240 (0.0004) +[2023-03-11 17:27:48,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9427.7). Total num frames: 76955648. Throughput: 0: 9492.5. Samples: 76945240. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:27:48,386][41256] Avg episode reward: [(0, '88.287')] +[2023-03-11 17:27:48,458][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000150312_76959744.pth... +[2023-03-11 17:27:48,461][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000149752_76673024.pth +[2023-03-11 17:27:48,904][41544] Updated weights for policy 0, policy_version 150320 (0.0005) +[2023-03-11 17:27:53,360][41544] Updated weights for policy 0, policy_version 150400 (0.0005) +[2023-03-11 17:27:53,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9441.6). Total num frames: 77004800. Throughput: 0: 9459.2. Samples: 77000448. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:27:53,386][41256] Avg episode reward: [(0, '84.943')] +[2023-03-11 17:27:57,841][41544] Updated weights for policy 0, policy_version 150480 (0.0005) +[2023-03-11 17:27:58,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9441.6). Total num frames: 77049856. Throughput: 0: 9431.8. Samples: 77027676. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:27:58,386][41256] Avg episode reward: [(0, '88.307')] +[2023-03-11 17:28:02,384][41544] Updated weights for policy 0, policy_version 150560 (0.0005) +[2023-03-11 17:28:03,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9420.8, 300 sec: 9427.7). Total num frames: 77094912. Throughput: 0: 9365.2. Samples: 77082264. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:28:03,386][41256] Avg episode reward: [(0, '88.112')] +[2023-03-11 17:28:03,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000150576_77094912.pth... +[2023-03-11 17:28:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000150032_76816384.pth +[2023-03-11 17:28:06,867][41544] Updated weights for policy 0, policy_version 150640 (0.0005) +[2023-03-11 17:28:08,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9413.9). Total num frames: 77139968. Throughput: 0: 9342.7. Samples: 77136416. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:28:08,386][41256] Avg episode reward: [(0, '85.872')] +[2023-03-11 17:28:11,368][41544] Updated weights for policy 0, policy_version 150720 (0.0005) +[2023-03-11 17:28:13,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9386.1). Total num frames: 77185024. Throughput: 0: 9370.2. Samples: 77164388. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:28:13,386][41256] Avg episode reward: [(0, '85.136')] +[2023-03-11 17:28:15,715][41544] Updated weights for policy 0, policy_version 150800 (0.0005) +[2023-03-11 17:28:18,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9386.1). Total num frames: 77234176. Throughput: 0: 9319.5. Samples: 77220152. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 17:28:18,386][41256] Avg episode reward: [(0, '85.682')] +[2023-03-11 17:28:18,388][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000150848_77234176.pth... 
+[2023-03-11 17:28:18,390][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000150312_76959744.pth +[2023-03-11 17:28:19,959][41544] Updated weights for policy 0, policy_version 150880 (0.0005) +[2023-03-11 17:28:23,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9420.8, 300 sec: 9386.1). Total num frames: 77283328. Throughput: 0: 9356.0. Samples: 77278364. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 17:28:23,386][41256] Avg episode reward: [(0, '86.067')] +[2023-03-11 17:28:24,226][41544] Updated weights for policy 0, policy_version 150960 (0.0005) +[2023-03-11 17:28:28,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9372.2). Total num frames: 77328384. Throughput: 0: 9359.5. Samples: 77307608. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 17:28:28,386][41256] Avg episode reward: [(0, '86.346')] +[2023-03-11 17:28:28,396][41544] Updated weights for policy 0, policy_version 151040 (0.0005) +[2023-03-11 17:28:32,575][41544] Updated weights for policy 0, policy_version 151120 (0.0005) +[2023-03-11 17:28:33,386][41256] Fps is (10 sec: 9830.3, 60 sec: 9489.0, 300 sec: 9400.0). Total num frames: 77381632. Throughput: 0: 9343.5. Samples: 77365696. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 17:28:33,386][41256] Avg episode reward: [(0, '89.261')] +[2023-03-11 17:28:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000151136_77381632.pth... +[2023-03-11 17:28:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000150576_77094912.pth +[2023-03-11 17:28:36,836][41544] Updated weights for policy 0, policy_version 151200 (0.0004) +[2023-03-11 17:28:38,385][41256] Fps is (10 sec: 9830.3, 60 sec: 9420.8, 300 sec: 9400.0). Total num frames: 77426688. Throughput: 0: 9404.9. Samples: 77423668. 
Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 17:28:38,386][41256] Avg episode reward: [(0, '86.208')] +[2023-03-11 17:28:41,199][41544] Updated weights for policy 0, policy_version 151280 (0.0006) +[2023-03-11 17:28:43,386][41256] Fps is (10 sec: 9011.2, 60 sec: 9352.5, 300 sec: 9386.1). Total num frames: 77471744. Throughput: 0: 9420.2. Samples: 77451588. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 17:28:43,386][41256] Avg episode reward: [(0, '86.032')] +[2023-03-11 17:28:45,675][41544] Updated weights for policy 0, policy_version 151360 (0.0005) +[2023-03-11 17:28:48,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9400.0). Total num frames: 77520896. Throughput: 0: 9440.7. Samples: 77507096. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 17:28:48,386][41256] Avg episode reward: [(0, '84.454')] +[2023-03-11 17:28:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000151408_77520896.pth... +[2023-03-11 17:28:48,393][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000150848_77234176.pth +[2023-03-11 17:28:49,949][41544] Updated weights for policy 0, policy_version 151440 (0.0005) +[2023-03-11 17:28:53,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9386.1). Total num frames: 77565952. Throughput: 0: 9523.5. Samples: 77564976. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 17:28:53,386][41256] Avg episode reward: [(0, '88.998')] +[2023-03-11 17:28:54,307][41544] Updated weights for policy 0, policy_version 151520 (0.0005) +[2023-03-11 17:28:58,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9400.0). Total num frames: 77615104. Throughput: 0: 9500.3. Samples: 77591904. 
Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 17:28:58,386][41256] Avg episode reward: [(0, '86.861')] +[2023-03-11 17:28:58,768][41544] Updated weights for policy 0, policy_version 151600 (0.0005) +[2023-03-11 17:29:03,274][41544] Updated weights for policy 0, policy_version 151680 (0.0006) +[2023-03-11 17:29:03,386][41256] Fps is (10 sec: 9420.9, 60 sec: 9420.8, 300 sec: 9386.1). Total num frames: 77660160. Throughput: 0: 9499.1. Samples: 77647612. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 17:29:03,386][41256] Avg episode reward: [(0, '90.369')] +[2023-03-11 17:29:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000151680_77660160.pth... +[2023-03-11 17:29:03,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000151136_77381632.pth +[2023-03-11 17:29:07,582][41544] Updated weights for policy 0, policy_version 151760 (0.0005) +[2023-03-11 17:29:08,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9420.8, 300 sec: 9372.2). Total num frames: 77705216. Throughput: 0: 9449.3. Samples: 77703580. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 17:29:08,386][41256] Avg episode reward: [(0, '88.785')] +[2023-03-11 17:29:11,760][41544] Updated weights for policy 0, policy_version 151840 (0.0005) +[2023-03-11 17:29:13,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9372.2). Total num frames: 77754368. Throughput: 0: 9449.1. Samples: 77732820. Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 17:29:13,386][41256] Avg episode reward: [(0, '88.075')] +[2023-03-11 17:29:16,104][41544] Updated weights for policy 0, policy_version 151920 (0.0005) +[2023-03-11 17:29:18,386][41256] Fps is (10 sec: 9830.3, 60 sec: 9489.1, 300 sec: 9386.1). Total num frames: 77803520. Throughput: 0: 9429.8. Samples: 77790036. 
Policy #0 lag: (min: 3.0, avg: 3.0, max: 3.0) +[2023-03-11 17:29:18,386][41256] Avg episode reward: [(0, '87.583')] +[2023-03-11 17:29:18,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000151960_77803520.pth... +[2023-03-11 17:29:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000151408_77520896.pth +[2023-03-11 17:29:20,242][41544] Updated weights for policy 0, policy_version 152000 (0.0005) +[2023-03-11 17:29:23,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9489.1, 300 sec: 9400.0). Total num frames: 77852672. Throughput: 0: 9447.6. Samples: 77848808. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:29:23,386][41256] Avg episode reward: [(0, '89.166')] +[2023-03-11 17:29:24,417][41544] Updated weights for policy 0, policy_version 152080 (0.0005) +[2023-03-11 17:29:28,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9557.3, 300 sec: 9400.0). Total num frames: 77901824. Throughput: 0: 9487.1. Samples: 77878508. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:29:28,386][41256] Avg episode reward: [(0, '87.650')] +[2023-03-11 17:29:28,610][41544] Updated weights for policy 0, policy_version 152160 (0.0005) +[2023-03-11 17:29:32,829][41544] Updated weights for policy 0, policy_version 152240 (0.0005) +[2023-03-11 17:29:33,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9489.1, 300 sec: 9413.9). Total num frames: 77950976. Throughput: 0: 9575.3. Samples: 77937984. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:29:33,386][41256] Avg episode reward: [(0, '90.817')] +[2023-03-11 17:29:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000152248_77950976.pth... 
+[2023-03-11 17:29:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000151680_77660160.pth +[2023-03-11 17:29:37,272][41544] Updated weights for policy 0, policy_version 152320 (0.0005) +[2023-03-11 17:29:38,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9413.9). Total num frames: 77996032. Throughput: 0: 9509.0. Samples: 77992880. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:29:38,386][41256] Avg episode reward: [(0, '87.226')] +[2023-03-11 17:29:41,393][41544] Updated weights for policy 0, policy_version 152400 (0.0005) +[2023-03-11 17:29:43,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9557.4, 300 sec: 9413.9). Total num frames: 78045184. Throughput: 0: 9589.0. Samples: 78023408. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:29:43,386][41256] Avg episode reward: [(0, '87.898')] +[2023-03-11 17:29:45,451][41544] Updated weights for policy 0, policy_version 152480 (0.0005) +[2023-03-11 17:29:48,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9427.7). Total num frames: 78094336. Throughput: 0: 9667.9. Samples: 78082668. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:29:48,386][41256] Avg episode reward: [(0, '87.272')] +[2023-03-11 17:29:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000152528_78094336.pth... +[2023-03-11 17:29:48,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000151960_77803520.pth +[2023-03-11 17:29:49,849][41544] Updated weights for policy 0, policy_version 152560 (0.0005) +[2023-03-11 17:29:53,385][41256] Fps is (10 sec: 9830.3, 60 sec: 9625.6, 300 sec: 9427.7). Total num frames: 78143488. Throughput: 0: 9683.2. Samples: 78139324. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:29:53,386][41256] Avg episode reward: [(0, '87.877')] +[2023-03-11 17:29:54,233][41544] Updated weights for policy 0, policy_version 152640 (0.0005) +[2023-03-11 17:29:58,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9557.4, 300 sec: 9413.9). Total num frames: 78188544. Throughput: 0: 9653.4. Samples: 78167224. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:29:58,386][41256] Avg episode reward: [(0, '89.834')] +[2023-03-11 17:29:58,623][41544] Updated weights for policy 0, policy_version 152720 (0.0005) +[2023-03-11 17:30:03,032][41544] Updated weights for policy 0, policy_version 152800 (0.0005) +[2023-03-11 17:30:03,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9557.3, 300 sec: 9400.0). Total num frames: 78233600. Throughput: 0: 9605.6. Samples: 78222288. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:30:03,386][41256] Avg episode reward: [(0, '88.904')] +[2023-03-11 17:30:03,388][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000152800_78233600.pth... +[2023-03-11 17:30:03,390][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000152248_77950976.pth +[2023-03-11 17:30:07,412][41544] Updated weights for policy 0, policy_version 152880 (0.0005) +[2023-03-11 17:30:08,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9413.9). Total num frames: 78282752. Throughput: 0: 9553.6. Samples: 78278720. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:30:08,386][41256] Avg episode reward: [(0, '86.038')] +[2023-03-11 17:30:11,874][41544] Updated weights for policy 0, policy_version 152960 (0.0005) +[2023-03-11 17:30:13,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9413.9). Total num frames: 78327808. Throughput: 0: 9518.0. Samples: 78306820. 
Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:30:13,386][41256] Avg episode reward: [(0, '87.748')] +[2023-03-11 17:30:16,311][41544] Updated weights for policy 0, policy_version 153040 (0.0005) +[2023-03-11 17:30:18,385][41256] Fps is (10 sec: 9011.1, 60 sec: 9489.1, 300 sec: 9400.0). Total num frames: 78372864. Throughput: 0: 9399.7. Samples: 78360972. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:30:18,386][41256] Avg episode reward: [(0, '90.836')] +[2023-03-11 17:30:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000153072_78372864.pth... +[2023-03-11 17:30:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000152528_78094336.pth +[2023-03-11 17:30:20,813][41544] Updated weights for policy 0, policy_version 153120 (0.0005) +[2023-03-11 17:30:23,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9420.8, 300 sec: 9400.0). Total num frames: 78417920. Throughput: 0: 9406.1. Samples: 78416156. Policy #0 lag: (min: 1.0, avg: 1.0, max: 1.0) +[2023-03-11 17:30:23,386][41256] Avg episode reward: [(0, '90.219')] +[2023-03-11 17:30:25,288][41544] Updated weights for policy 0, policy_version 153200 (0.0005) +[2023-03-11 17:30:28,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9352.5, 300 sec: 9400.0). Total num frames: 78462976. Throughput: 0: 9333.9. Samples: 78443432. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 17:30:28,386][41256] Avg episode reward: [(0, '88.865')] +[2023-03-11 17:30:29,753][41544] Updated weights for policy 0, policy_version 153280 (0.0005) +[2023-03-11 17:30:33,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9400.0). Total num frames: 78512128. Throughput: 0: 9239.2. Samples: 78498432. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 17:30:33,386][41256] Avg episode reward: [(0, '89.730')] +[2023-03-11 17:30:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000153344_78512128.pth... +[2023-03-11 17:30:33,390][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000152800_78233600.pth +[2023-03-11 17:30:34,220][41544] Updated weights for policy 0, policy_version 153360 (0.0005) +[2023-03-11 17:30:38,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9400.0). Total num frames: 78557184. Throughput: 0: 9204.7. Samples: 78553536. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 17:30:38,386][41256] Avg episode reward: [(0, '88.636')] +[2023-03-11 17:30:38,688][41544] Updated weights for policy 0, policy_version 153440 (0.0005) +[2023-03-11 17:30:43,124][41544] Updated weights for policy 0, policy_version 153520 (0.0005) +[2023-03-11 17:30:43,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9400.0). Total num frames: 78602240. Throughput: 0: 9199.1. Samples: 78581184. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 17:30:43,386][41256] Avg episode reward: [(0, '89.025')] +[2023-03-11 17:30:47,514][41544] Updated weights for policy 0, policy_version 153600 (0.0005) +[2023-03-11 17:30:48,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9216.0, 300 sec: 9386.1). Total num frames: 78647296. Throughput: 0: 9206.8. Samples: 78636592. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 17:30:48,396][41256] Avg episode reward: [(0, '85.372')] +[2023-03-11 17:30:48,419][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000153616_78651392.pth... 
+[2023-03-11 17:30:48,420][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000153072_78372864.pth +[2023-03-11 17:30:52,021][41544] Updated weights for policy 0, policy_version 153680 (0.0005) +[2023-03-11 17:30:53,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9400.0). Total num frames: 78696448. Throughput: 0: 9188.8. Samples: 78692216. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 17:30:53,396][41256] Avg episode reward: [(0, '88.023')] +[2023-03-11 17:30:56,507][41544] Updated weights for policy 0, policy_version 153760 (0.0005) +[2023-03-11 17:30:58,385][41256] Fps is (10 sec: 9420.7, 60 sec: 9216.0, 300 sec: 9400.0). Total num frames: 78741504. Throughput: 0: 9161.0. Samples: 78719064. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 17:30:58,396][41256] Avg episode reward: [(0, '90.447')] +[2023-03-11 17:31:01,137][41544] Updated weights for policy 0, policy_version 153840 (0.0005) +[2023-03-11 17:31:03,385][41256] Fps is (10 sec: 8601.6, 60 sec: 9147.7, 300 sec: 9372.2). Total num frames: 78782464. Throughput: 0: 9141.4. Samples: 78772336. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 17:31:03,396][41256] Avg episode reward: [(0, '91.811')] +[2023-03-11 17:31:03,433][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000153880_78786560.pth... +[2023-03-11 17:31:03,434][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000153344_78512128.pth +[2023-03-11 17:31:05,681][41544] Updated weights for policy 0, policy_version 153920 (0.0005) +[2023-03-11 17:31:08,385][41256] Fps is (10 sec: 8601.6, 60 sec: 9079.5, 300 sec: 9372.2). Total num frames: 78827520. Throughput: 0: 9130.0. Samples: 78827008. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 17:31:08,396][41256] Avg episode reward: [(0, '89.474')] +[2023-03-11 17:31:10,153][41544] Updated weights for policy 0, policy_version 154000 (0.0005) +[2023-03-11 17:31:13,385][41256] Fps is (10 sec: 9420.7, 60 sec: 9147.7, 300 sec: 9386.1). Total num frames: 78876672. Throughput: 0: 9130.7. Samples: 78854316. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 17:31:13,396][41256] Avg episode reward: [(0, '87.909')] +[2023-03-11 17:31:14,668][41544] Updated weights for policy 0, policy_version 154080 (0.0005) +[2023-03-11 17:31:18,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9147.7, 300 sec: 9386.1). Total num frames: 78921728. Throughput: 0: 9133.5. Samples: 78909440. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 17:31:18,396][41256] Avg episode reward: [(0, '85.684')] +[2023-03-11 17:31:18,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000154144_78921728.pth... +[2023-03-11 17:31:18,403][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000153616_78651392.pth +[2023-03-11 17:31:19,119][41544] Updated weights for policy 0, policy_version 154160 (0.0005) +[2023-03-11 17:31:23,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9147.7, 300 sec: 9386.1). Total num frames: 78966784. Throughput: 0: 9137.6. Samples: 78964728. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 17:31:23,396][41256] Avg episode reward: [(0, '88.310')] +[2023-03-11 17:31:23,504][41544] Updated weights for policy 0, policy_version 154240 (0.0005) +[2023-03-11 17:31:27,871][41544] Updated weights for policy 0, policy_version 154320 (0.0005) +[2023-03-11 17:31:28,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9400.0). Total num frames: 79015936. Throughput: 0: 9149.6. Samples: 78992916. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 17:31:28,396][41256] Avg episode reward: [(0, '90.024')] +[2023-03-11 17:31:32,261][41544] Updated weights for policy 0, policy_version 154400 (0.0005) +[2023-03-11 17:31:33,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9147.7, 300 sec: 9400.0). Total num frames: 79060992. Throughput: 0: 9159.5. Samples: 79048768. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:31:33,396][41256] Avg episode reward: [(0, '90.866')] +[2023-03-11 17:31:33,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000154416_79060992.pth... +[2023-03-11 17:31:33,403][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000153880_78786560.pth +[2023-03-11 17:31:36,787][41544] Updated weights for policy 0, policy_version 154480 (0.0005) +[2023-03-11 17:31:38,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9147.7, 300 sec: 9386.1). Total num frames: 79106048. Throughput: 0: 9137.9. Samples: 79103420. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:31:38,396][41256] Avg episode reward: [(0, '88.953')] +[2023-03-11 17:31:41,211][41544] Updated weights for policy 0, policy_version 154560 (0.0005) +[2023-03-11 17:31:43,386][41256] Fps is (10 sec: 9011.1, 60 sec: 9147.7, 300 sec: 9386.1). Total num frames: 79151104. Throughput: 0: 9159.4. Samples: 79131236. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:31:43,396][41256] Avg episode reward: [(0, '89.501')] +[2023-03-11 17:31:45,607][41544] Updated weights for policy 0, policy_version 154640 (0.0005) +[2023-03-11 17:31:48,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9386.1). Total num frames: 79200256. Throughput: 0: 9229.6. Samples: 79187668. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:31:48,396][41256] Avg episode reward: [(0, '89.591')] +[2023-03-11 17:31:48,399][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000154688_79200256.pth... +[2023-03-11 17:31:48,401][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000154144_78921728.pth +[2023-03-11 17:31:50,018][41544] Updated weights for policy 0, policy_version 154720 (0.0005) +[2023-03-11 17:31:53,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9147.7, 300 sec: 9372.2). Total num frames: 79245312. Throughput: 0: 9242.1. Samples: 79242900. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:31:53,396][41256] Avg episode reward: [(0, '86.376')] +[2023-03-11 17:31:54,440][41544] Updated weights for policy 0, policy_version 154800 (0.0005) +[2023-03-11 17:31:58,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9216.0, 300 sec: 9372.2). Total num frames: 79294464. Throughput: 0: 9242.3. Samples: 79270220. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:31:58,386][41256] Avg episode reward: [(0, '90.897')] +[2023-03-11 17:31:58,808][41544] Updated weights for policy 0, policy_version 154880 (0.0005) +[2023-03-11 17:32:03,128][41544] Updated weights for policy 0, policy_version 154960 (0.0005) +[2023-03-11 17:32:03,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9358.3). Total num frames: 79339520. Throughput: 0: 9285.7. Samples: 79327296. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:32:03,386][41256] Avg episode reward: [(0, '87.809')] +[2023-03-11 17:32:03,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000154960_79339520.pth... 
+[2023-03-11 17:32:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000154416_79060992.pth +[2023-03-11 17:32:07,494][41544] Updated weights for policy 0, policy_version 155040 (0.0005) +[2023-03-11 17:32:08,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9352.5, 300 sec: 9372.2). Total num frames: 79388672. Throughput: 0: 9320.2. Samples: 79384140. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:32:08,386][41256] Avg episode reward: [(0, '90.406')] +[2023-03-11 17:32:11,797][41544] Updated weights for policy 0, policy_version 155120 (0.0005) +[2023-03-11 17:32:13,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9358.3). Total num frames: 79433728. Throughput: 0: 9330.7. Samples: 79412796. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:32:13,386][41256] Avg episode reward: [(0, '90.334')] +[2023-03-11 17:32:16,153][41544] Updated weights for policy 0, policy_version 155200 (0.0005) +[2023-03-11 17:32:18,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9372.2). Total num frames: 79482880. Throughput: 0: 9341.1. Samples: 79469120. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:32:18,386][41256] Avg episode reward: [(0, '89.147')] +[2023-03-11 17:32:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000155240_79482880.pth... +[2023-03-11 17:32:18,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000154688_79200256.pth +[2023-03-11 17:32:20,544][41544] Updated weights for policy 0, policy_version 155280 (0.0005) +[2023-03-11 17:32:23,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9358.3). Total num frames: 79527936. Throughput: 0: 9352.6. Samples: 79524288. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:32:23,386][41256] Avg episode reward: [(0, '91.312')] +[2023-03-11 17:32:25,057][41544] Updated weights for policy 0, policy_version 155360 (0.0005) +[2023-03-11 17:32:28,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9358.3). Total num frames: 79572992. Throughput: 0: 9346.8. Samples: 79551844. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:32:28,386][41256] Avg episode reward: [(0, '91.013')] +[2023-03-11 17:32:29,629][41544] Updated weights for policy 0, policy_version 155440 (0.0005) +[2023-03-11 17:32:33,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9344.4). Total num frames: 79618048. Throughput: 0: 9289.1. Samples: 79605680. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:32:33,386][41256] Avg episode reward: [(0, '90.488')] +[2023-03-11 17:32:33,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000155504_79618048.pth... +[2023-03-11 17:32:33,393][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000154960_79339520.pth +[2023-03-11 17:32:34,141][41544] Updated weights for policy 0, policy_version 155520 (0.0005) +[2023-03-11 17:32:38,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9330.5). Total num frames: 79663104. Throughput: 0: 9257.2. Samples: 79659476. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:32:38,386][41256] Avg episode reward: [(0, '92.659')] +[2023-03-11 17:32:38,680][41544] Updated weights for policy 0, policy_version 155600 (0.0005) +[2023-03-11 17:32:43,232][41544] Updated weights for policy 0, policy_version 155680 (0.0005) +[2023-03-11 17:32:43,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9330.6). Total num frames: 79708160. Throughput: 0: 9268.3. Samples: 79687296. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:32:43,386][41256] Avg episode reward: [(0, '92.883')] +[2023-03-11 17:32:47,673][41544] Updated weights for policy 0, policy_version 155760 (0.0004) +[2023-03-11 17:32:48,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9216.0, 300 sec: 9316.7). Total num frames: 79753216. Throughput: 0: 9193.2. Samples: 79740992. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:32:48,386][41256] Avg episode reward: [(0, '94.390')] +[2023-03-11 17:32:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000155768_79753216.pth... +[2023-03-11 17:32:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000155240_79482880.pth +[2023-03-11 17:32:52,229][41544] Updated weights for policy 0, policy_version 155840 (0.0004) +[2023-03-11 17:32:53,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9216.0, 300 sec: 9316.7). Total num frames: 79798272. Throughput: 0: 9138.4. Samples: 79795368. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:32:53,386][41256] Avg episode reward: [(0, '92.062')] +[2023-03-11 17:32:56,780][41544] Updated weights for policy 0, policy_version 155920 (0.0005) +[2023-03-11 17:32:58,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9147.7, 300 sec: 9316.7). Total num frames: 79843328. Throughput: 0: 9111.7. Samples: 79822824. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:32:58,386][41256] Avg episode reward: [(0, '94.238')] +[2023-03-11 17:33:01,330][41544] Updated weights for policy 0, policy_version 156000 (0.0005) +[2023-03-11 17:33:03,386][41256] Fps is (10 sec: 9011.1, 60 sec: 9147.7, 300 sec: 9316.7). Total num frames: 79888384. Throughput: 0: 9051.0. Samples: 79876416. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:33:03,386][41256] Avg episode reward: [(0, '91.610')] +[2023-03-11 17:33:03,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000156032_79888384.pth... +[2023-03-11 17:33:03,393][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000155504_79618048.pth +[2023-03-11 17:33:05,899][41544] Updated weights for policy 0, policy_version 156080 (0.0005) +[2023-03-11 17:33:08,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9079.5, 300 sec: 9316.7). Total num frames: 79933440. Throughput: 0: 9025.4. Samples: 79930432. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:33:08,386][41256] Avg episode reward: [(0, '94.970')] +[2023-03-11 17:33:10,458][41544] Updated weights for policy 0, policy_version 156160 (0.0005) +[2023-03-11 17:33:13,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9079.5, 300 sec: 9302.8). Total num frames: 79978496. Throughput: 0: 9020.6. Samples: 79957772. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:33:13,386][41256] Avg episode reward: [(0, '93.656')] +[2023-03-11 17:33:14,968][41544] Updated weights for policy 0, policy_version 156240 (0.0005) +[2023-03-11 17:33:18,386][41256] Fps is (10 sec: 9011.2, 60 sec: 9011.2, 300 sec: 9288.9). Total num frames: 80023552. Throughput: 0: 9020.1. Samples: 80011584. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:33:18,386][41256] Avg episode reward: [(0, '96.218')] +[2023-03-11 17:33:18,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000156296_80023552.pth... 
+[2023-03-11 17:33:18,393][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000155768_79753216.pth +[2023-03-11 17:33:19,515][41544] Updated weights for policy 0, policy_version 156320 (0.0005) +[2023-03-11 17:33:23,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9011.2, 300 sec: 9288.9). Total num frames: 80068608. Throughput: 0: 9010.4. Samples: 80064944. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:33:23,386][41256] Avg episode reward: [(0, '93.630')] +[2023-03-11 17:33:24,119][41544] Updated weights for policy 0, policy_version 156400 (0.0005) +[2023-03-11 17:33:28,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9011.2, 300 sec: 9261.1). Total num frames: 80113664. Throughput: 0: 9009.8. Samples: 80092736. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:33:28,386][41256] Avg episode reward: [(0, '90.287')] +[2023-03-11 17:33:28,701][41544] Updated weights for policy 0, policy_version 156480 (0.0005) +[2023-03-11 17:33:33,248][41544] Updated weights for policy 0, policy_version 156560 (0.0005) +[2023-03-11 17:33:33,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9011.2, 300 sec: 9261.1). Total num frames: 80158720. Throughput: 0: 9009.3. Samples: 80146412. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:33:33,386][41256] Avg episode reward: [(0, '93.252')] +[2023-03-11 17:33:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000156560_80158720.pth... +[2023-03-11 17:33:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000156032_79888384.pth +[2023-03-11 17:33:37,780][41544] Updated weights for policy 0, policy_version 156640 (0.0005) +[2023-03-11 17:33:38,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9011.2, 300 sec: 9261.1). Total num frames: 80203776. Throughput: 0: 8990.8. Samples: 80199956. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 17:33:38,386][41256] Avg episode reward: [(0, '94.782')] +[2023-03-11 17:33:42,309][41544] Updated weights for policy 0, policy_version 156720 (0.0005) +[2023-03-11 17:33:43,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9011.2, 300 sec: 9247.2). Total num frames: 80248832. Throughput: 0: 9005.2. Samples: 80228060. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 17:33:43,386][41256] Avg episode reward: [(0, '93.164')] +[2023-03-11 17:33:46,732][41544] Updated weights for policy 0, policy_version 156800 (0.0005) +[2023-03-11 17:33:48,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9011.2, 300 sec: 9247.2). Total num frames: 80293888. Throughput: 0: 9022.9. Samples: 80282448. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 17:33:48,386][41256] Avg episode reward: [(0, '94.826')] +[2023-03-11 17:33:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000156824_80293888.pth... +[2023-03-11 17:33:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000156296_80023552.pth +[2023-03-11 17:33:51,230][41544] Updated weights for policy 0, policy_version 156880 (0.0005) +[2023-03-11 17:33:53,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9011.2, 300 sec: 9233.4). Total num frames: 80338944. Throughput: 0: 9046.9. Samples: 80337544. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 17:33:53,386][41256] Avg episode reward: [(0, '93.685')] +[2023-03-11 17:33:55,727][41544] Updated weights for policy 0, policy_version 156960 (0.0005) +[2023-03-11 17:33:58,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9011.2, 300 sec: 9233.4). Total num frames: 80384000. Throughput: 0: 9036.7. Samples: 80364424. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 17:33:58,386][41256] Avg episode reward: [(0, '92.443')] +[2023-03-11 17:34:00,183][41544] Updated weights for policy 0, policy_version 157040 (0.0005) +[2023-03-11 17:34:03,385][41256] Fps is (10 sec: 9420.7, 60 sec: 9079.5, 300 sec: 9247.2). Total num frames: 80433152. Throughput: 0: 9077.2. Samples: 80420056. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 17:34:03,386][41256] Avg episode reward: [(0, '94.600')] +[2023-03-11 17:34:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000157096_80433152.pth... +[2023-03-11 17:34:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000156560_80158720.pth +[2023-03-11 17:34:04,630][41544] Updated weights for policy 0, policy_version 157120 (0.0005) +[2023-03-11 17:34:08,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9079.5, 300 sec: 9233.4). Total num frames: 80478208. Throughput: 0: 9094.2. Samples: 80474184. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 17:34:08,386][41256] Avg episode reward: [(0, '94.030')] +[2023-03-11 17:34:09,115][41544] Updated weights for policy 0, policy_version 157200 (0.0005) +[2023-03-11 17:34:13,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9079.5, 300 sec: 9219.5). Total num frames: 80523264. Throughput: 0: 9098.2. Samples: 80502156. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 17:34:13,386][41256] Avg episode reward: [(0, '93.900')] +[2023-03-11 17:34:13,626][41544] Updated weights for policy 0, policy_version 157280 (0.0005) +[2023-03-11 17:34:18,041][41544] Updated weights for policy 0, policy_version 157360 (0.0005) +[2023-03-11 17:34:18,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9079.5, 300 sec: 9205.6). Total num frames: 80568320. Throughput: 0: 9129.0. Samples: 80557216. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 17:34:18,386][41256] Avg episode reward: [(0, '91.872')] +[2023-03-11 17:34:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000157360_80568320.pth... +[2023-03-11 17:34:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000156824_80293888.pth +[2023-03-11 17:34:22,512][41544] Updated weights for policy 0, policy_version 157440 (0.0005) +[2023-03-11 17:34:23,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9079.5, 300 sec: 9191.7). Total num frames: 80613376. Throughput: 0: 9169.3. Samples: 80612572. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 17:34:23,386][41256] Avg episode reward: [(0, '90.581')] +[2023-03-11 17:34:26,930][41544] Updated weights for policy 0, policy_version 157520 (0.0005) +[2023-03-11 17:34:28,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9147.7, 300 sec: 9191.7). Total num frames: 80662528. Throughput: 0: 9151.0. Samples: 80639856. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 17:34:28,386][41256] Avg episode reward: [(0, '91.614')] +[2023-03-11 17:34:31,328][41544] Updated weights for policy 0, policy_version 157600 (0.0005) +[2023-03-11 17:34:33,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9147.7, 300 sec: 9191.7). Total num frames: 80707584. Throughput: 0: 9186.3. Samples: 80695832. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 17:34:33,386][41256] Avg episode reward: [(0, '90.186')] +[2023-03-11 17:34:33,421][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000157640_80711680.pth... 
+[2023-03-11 17:34:33,423][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000157096_80433152.pth +[2023-03-11 17:34:35,609][41544] Updated weights for policy 0, policy_version 157680 (0.0005) +[2023-03-11 17:34:38,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9191.7). Total num frames: 80756736. Throughput: 0: 9224.3. Samples: 80752640. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 17:34:38,386][41256] Avg episode reward: [(0, '92.664')] +[2023-03-11 17:34:40,144][41544] Updated weights for policy 0, policy_version 157760 (0.0006) +[2023-03-11 17:34:43,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9216.0, 300 sec: 9177.8). Total num frames: 80801792. Throughput: 0: 9223.8. Samples: 80779496. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 17:34:43,386][41256] Avg episode reward: [(0, '92.972')] +[2023-03-11 17:34:44,681][41544] Updated weights for policy 0, policy_version 157840 (0.0005) +[2023-03-11 17:34:48,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9216.0, 300 sec: 9163.9). Total num frames: 80846848. Throughput: 0: 9200.2. Samples: 80834064. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 17:34:48,386][41256] Avg episode reward: [(0, '89.829')] +[2023-03-11 17:34:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000157904_80846848.pth... +[2023-03-11 17:34:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000157360_80568320.pth +[2023-03-11 17:34:49,226][41544] Updated weights for policy 0, policy_version 157920 (0.0006) +[2023-03-11 17:34:53,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9216.0, 300 sec: 9163.9). Total num frames: 80891904. Throughput: 0: 9193.2. Samples: 80887880. 
Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 17:34:53,386][41256] Avg episode reward: [(0, '90.123')] +[2023-03-11 17:34:53,687][41544] Updated weights for policy 0, policy_version 158000 (0.0005) +[2023-03-11 17:34:57,978][41544] Updated weights for policy 0, policy_version 158080 (0.0004) +[2023-03-11 17:34:58,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9177.8). Total num frames: 80941056. Throughput: 0: 9208.6. Samples: 80916544. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 17:34:58,386][41256] Avg episode reward: [(0, '89.023')] +[2023-03-11 17:35:02,372][41544] Updated weights for policy 0, policy_version 158160 (0.0004) +[2023-03-11 17:35:03,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9216.0, 300 sec: 9163.9). Total num frames: 80986112. Throughput: 0: 9248.8. Samples: 80973412. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 17:35:03,386][41256] Avg episode reward: [(0, '90.341')] +[2023-03-11 17:35:03,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000158176_80986112.pth... +[2023-03-11 17:35:03,393][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000157640_80711680.pth +[2023-03-11 17:35:06,720][41544] Updated weights for policy 0, policy_version 158240 (0.0005) +[2023-03-11 17:35:08,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9216.0, 300 sec: 9163.9). Total num frames: 81031168. Throughput: 0: 9260.4. Samples: 81029292. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 17:35:08,386][41256] Avg episode reward: [(0, '88.492')] +[2023-03-11 17:35:11,103][41544] Updated weights for policy 0, policy_version 158320 (0.0005) +[2023-03-11 17:35:13,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9177.8). Total num frames: 81080320. Throughput: 0: 9284.8. Samples: 81057672. 
Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 17:35:13,386][41256] Avg episode reward: [(0, '89.555')] +[2023-03-11 17:35:15,433][41544] Updated weights for policy 0, policy_version 158400 (0.0005) +[2023-03-11 17:35:18,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9284.3, 300 sec: 9177.8). Total num frames: 81125376. Throughput: 0: 9299.0. Samples: 81114288. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 17:35:18,386][41256] Avg episode reward: [(0, '89.002')] +[2023-03-11 17:35:18,415][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000158456_81129472.pth... +[2023-03-11 17:35:18,417][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000157904_80846848.pth +[2023-03-11 17:35:19,705][41544] Updated weights for policy 0, policy_version 158480 (0.0005) +[2023-03-11 17:35:23,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9191.7). Total num frames: 81174528. Throughput: 0: 9300.9. Samples: 81171180. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 17:35:23,386][41256] Avg episode reward: [(0, '87.619')] +[2023-03-11 17:35:23,967][41544] Updated weights for policy 0, policy_version 158560 (0.0004) +[2023-03-11 17:35:28,304][41544] Updated weights for policy 0, policy_version 158640 (0.0005) +[2023-03-11 17:35:28,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9352.5, 300 sec: 9191.7). Total num frames: 81223680. Throughput: 0: 9345.9. Samples: 81200064. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 17:35:28,386][41256] Avg episode reward: [(0, '85.800')] +[2023-03-11 17:35:32,705][41544] Updated weights for policy 0, policy_version 158720 (0.0005) +[2023-03-11 17:35:33,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9352.5, 300 sec: 9191.7). Total num frames: 81268736. Throughput: 0: 9386.5. Samples: 81256456. 
Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 17:35:33,386][41256] Avg episode reward: [(0, '86.725')] +[2023-03-11 17:35:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000158728_81268736.pth... +[2023-03-11 17:35:33,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000158176_80986112.pth +[2023-03-11 17:35:37,048][41544] Updated weights for policy 0, policy_version 158800 (0.0005) +[2023-03-11 17:35:38,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9205.6). Total num frames: 81317888. Throughput: 0: 9456.8. Samples: 81313436. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 17:35:38,386][41256] Avg episode reward: [(0, '89.349')] +[2023-03-11 17:35:41,430][41544] Updated weights for policy 0, policy_version 158880 (0.0005) +[2023-03-11 17:35:43,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9352.5, 300 sec: 9205.6). Total num frames: 81362944. Throughput: 0: 9438.8. Samples: 81341288. Policy #0 lag: (min: 5.0, avg: 5.0, max: 5.0) +[2023-03-11 17:35:43,386][41256] Avg episode reward: [(0, '88.355')] +[2023-03-11 17:35:45,674][41544] Updated weights for policy 0, policy_version 158960 (0.0004) +[2023-03-11 17:35:48,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9205.6). Total num frames: 81412096. Throughput: 0: 9463.5. Samples: 81399268. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:35:48,396][41256] Avg episode reward: [(0, '89.113')] +[2023-03-11 17:35:48,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000159008_81412096.pth... 
+[2023-03-11 17:35:48,403][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000158456_81129472.pth +[2023-03-11 17:35:49,723][41544] Updated weights for policy 0, policy_version 159040 (0.0004) +[2023-03-11 17:35:53,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9489.1, 300 sec: 9219.5). Total num frames: 81461248. Throughput: 0: 9563.1. Samples: 81459632. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:35:53,396][41256] Avg episode reward: [(0, '86.640')] +[2023-03-11 17:35:53,869][41544] Updated weights for policy 0, policy_version 159120 (0.0004) +[2023-03-11 17:35:58,071][41544] Updated weights for policy 0, policy_version 159200 (0.0004) +[2023-03-11 17:35:58,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9489.1, 300 sec: 9247.2). Total num frames: 81510400. Throughput: 0: 9587.7. Samples: 81489116. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:35:58,396][41256] Avg episode reward: [(0, '84.976')] +[2023-03-11 17:36:02,259][41544] Updated weights for policy 0, policy_version 159280 (0.0004) +[2023-03-11 17:36:03,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9261.1). Total num frames: 81559552. Throughput: 0: 9623.1. Samples: 81547328. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:36:03,397][41256] Avg episode reward: [(0, '82.198')] +[2023-03-11 17:36:03,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000159296_81559552.pth... +[2023-03-11 17:36:03,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000158728_81268736.pth +[2023-03-11 17:36:06,330][41544] Updated weights for policy 0, policy_version 159360 (0.0005) +[2023-03-11 17:36:08,385][41256] Fps is (10 sec: 10239.9, 60 sec: 9693.9, 300 sec: 9275.0). Total num frames: 81612800. Throughput: 0: 9697.8. Samples: 81607580. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:36:08,397][41256] Avg episode reward: [(0, '83.344')] +[2023-03-11 17:36:10,365][41544] Updated weights for policy 0, policy_version 159440 (0.0004) +[2023-03-11 17:36:13,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9693.9, 300 sec: 9288.9). Total num frames: 81661952. Throughput: 0: 9721.0. Samples: 81637508. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:36:13,397][41256] Avg episode reward: [(0, '84.867')] +[2023-03-11 17:36:14,449][41544] Updated weights for policy 0, policy_version 159520 (0.0005) +[2023-03-11 17:36:18,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9302.8). Total num frames: 81711104. Throughput: 0: 9816.2. Samples: 81698184. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:36:18,396][41256] Avg episode reward: [(0, '85.237')] +[2023-03-11 17:36:18,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000159592_81711104.pth... +[2023-03-11 17:36:18,404][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000159008_81412096.pth +[2023-03-11 17:36:18,664][41544] Updated weights for policy 0, policy_version 159600 (0.0005) +[2023-03-11 17:36:22,968][41544] Updated weights for policy 0, policy_version 159680 (0.0005) +[2023-03-11 17:36:23,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9288.9). Total num frames: 81756160. Throughput: 0: 9819.7. Samples: 81755324. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:36:23,396][41256] Avg episode reward: [(0, '84.359')] +[2023-03-11 17:36:27,388][41544] Updated weights for policy 0, policy_version 159760 (0.0005) +[2023-03-11 17:36:28,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9302.8). Total num frames: 81805312. Throughput: 0: 9814.5. Samples: 81782940. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:36:28,396][41256] Avg episode reward: [(0, '84.862')] +[2023-03-11 17:36:31,771][41544] Updated weights for policy 0, policy_version 159840 (0.0005) +[2023-03-11 17:36:33,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9302.8). Total num frames: 81850368. Throughput: 0: 9760.4. Samples: 81838488. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:36:33,396][41256] Avg episode reward: [(0, '86.719')] +[2023-03-11 17:36:33,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000159864_81850368.pth... +[2023-03-11 17:36:33,403][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000159296_81559552.pth +[2023-03-11 17:36:36,173][41544] Updated weights for policy 0, policy_version 159920 (0.0005) +[2023-03-11 17:36:38,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9693.9, 300 sec: 9316.7). Total num frames: 81899520. Throughput: 0: 9674.8. Samples: 81894996. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:36:38,396][41256] Avg episode reward: [(0, '86.725')] +[2023-03-11 17:36:40,511][41544] Updated weights for policy 0, policy_version 160000 (0.0005) +[2023-03-11 17:36:43,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9302.8). Total num frames: 81944576. Throughput: 0: 9651.6. Samples: 81923440. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:36:43,396][41256] Avg episode reward: [(0, '81.029')] +[2023-03-11 17:36:44,837][41544] Updated weights for policy 0, policy_version 160080 (0.0005) +[2023-03-11 17:36:48,386][41256] Fps is (10 sec: 9420.6, 60 sec: 9693.8, 300 sec: 9316.7). Total num frames: 81993728. Throughput: 0: 9609.0. Samples: 81979736. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:36:48,397][41256] Avg episode reward: [(0, '86.410')] +[2023-03-11 17:36:48,402][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000160144_81993728.pth... +[2023-03-11 17:36:48,404][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000159592_81711104.pth +[2023-03-11 17:36:49,264][41544] Updated weights for policy 0, policy_version 160160 (0.0005) +[2023-03-11 17:36:53,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9302.8). Total num frames: 82038784. Throughput: 0: 9511.6. Samples: 82035600. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:36:53,386][41256] Avg episode reward: [(0, '85.491')] +[2023-03-11 17:36:53,638][41544] Updated weights for policy 0, policy_version 160240 (0.0005) +[2023-03-11 17:36:58,028][41544] Updated weights for policy 0, policy_version 160320 (0.0005) +[2023-03-11 17:36:58,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9557.3, 300 sec: 9302.8). Total num frames: 82083840. Throughput: 0: 9465.0. Samples: 82063432. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:36:58,386][41256] Avg episode reward: [(0, '83.492')] +[2023-03-11 17:37:02,484][41544] Updated weights for policy 0, policy_version 160400 (0.0005) +[2023-03-11 17:37:03,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9489.1, 300 sec: 9288.9). Total num frames: 82128896. Throughput: 0: 9352.1. Samples: 82119028. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:37:03,386][41256] Avg episode reward: [(0, '79.648')] +[2023-03-11 17:37:03,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000160416_82132992.pth... 
+[2023-03-11 17:37:03,393][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000159864_81850368.pth +[2023-03-11 17:37:06,875][41544] Updated weights for policy 0, policy_version 160480 (0.0005) +[2023-03-11 17:37:08,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9302.8). Total num frames: 82178048. Throughput: 0: 9309.4. Samples: 82174248. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:37:08,386][41256] Avg episode reward: [(0, '85.040')] +[2023-03-11 17:37:11,380][41544] Updated weights for policy 0, policy_version 160560 (0.0005) +[2023-03-11 17:37:13,385][41256] Fps is (10 sec: 9420.7, 60 sec: 9352.5, 300 sec: 9288.9). Total num frames: 82223104. Throughput: 0: 9319.3. Samples: 82202308. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:37:13,386][41256] Avg episode reward: [(0, '85.717')] +[2023-03-11 17:37:15,775][41544] Updated weights for policy 0, policy_version 160640 (0.0005) +[2023-03-11 17:37:18,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9302.8). Total num frames: 82272256. Throughput: 0: 9324.3. Samples: 82258080. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:37:18,386][41256] Avg episode reward: [(0, '87.816')] +[2023-03-11 17:37:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000160688_82272256.pth... +[2023-03-11 17:37:18,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000160144_81993728.pth +[2023-03-11 17:37:19,920][41544] Updated weights for policy 0, policy_version 160720 (0.0005) +[2023-03-11 17:37:23,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9420.8, 300 sec: 9316.7). Total num frames: 82321408. Throughput: 0: 9390.8. Samples: 82317584. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:37:23,386][41256] Avg episode reward: [(0, '85.445')] +[2023-03-11 17:37:24,012][41544] Updated weights for policy 0, policy_version 160800 (0.0005) +[2023-03-11 17:37:28,115][41544] Updated weights for policy 0, policy_version 160880 (0.0005) +[2023-03-11 17:37:28,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9420.8, 300 sec: 9330.6). Total num frames: 82370560. Throughput: 0: 9425.2. Samples: 82347572. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:37:28,386][41256] Avg episode reward: [(0, '83.656')] +[2023-03-11 17:37:32,189][41544] Updated weights for policy 0, policy_version 160960 (0.0005) +[2023-03-11 17:37:33,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9489.1, 300 sec: 9344.4). Total num frames: 82419712. Throughput: 0: 9508.7. Samples: 82407628. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:37:33,386][41256] Avg episode reward: [(0, '84.170')] +[2023-03-11 17:37:33,401][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000160984_82423808.pth... +[2023-03-11 17:37:33,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000160416_82132992.pth +[2023-03-11 17:37:36,255][41544] Updated weights for policy 0, policy_version 161040 (0.0005) +[2023-03-11 17:37:38,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9557.3, 300 sec: 9372.2). Total num frames: 82472960. Throughput: 0: 9619.0. Samples: 82468456. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:37:38,386][41256] Avg episode reward: [(0, '85.309')] +[2023-03-11 17:37:40,387][41544] Updated weights for policy 0, policy_version 161120 (0.0005) +[2023-03-11 17:37:43,385][41256] Fps is (10 sec: 10240.1, 60 sec: 9625.6, 300 sec: 9386.1). Total num frames: 82522112. Throughput: 0: 9653.3. Samples: 82497832. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:37:43,386][41256] Avg episode reward: [(0, '86.284')] +[2023-03-11 17:37:44,440][41544] Updated weights for policy 0, policy_version 161200 (0.0005) +[2023-03-11 17:37:48,386][41256] Fps is (10 sec: 9830.3, 60 sec: 9625.6, 300 sec: 9400.0). Total num frames: 82571264. Throughput: 0: 9778.0. Samples: 82559040. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:37:48,386][41256] Avg episode reward: [(0, '88.646')] +[2023-03-11 17:37:48,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000161272_82571264.pth... +[2023-03-11 17:37:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000160688_82272256.pth +[2023-03-11 17:37:48,469][41544] Updated weights for policy 0, policy_version 161280 (0.0005) +[2023-03-11 17:37:52,530][41544] Updated weights for policy 0, policy_version 161360 (0.0005) +[2023-03-11 17:37:53,385][41256] Fps is (10 sec: 9830.3, 60 sec: 9693.9, 300 sec: 9413.9). Total num frames: 82620416. Throughput: 0: 9899.6. Samples: 82619732. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:37:53,386][41256] Avg episode reward: [(0, '86.547')] +[2023-03-11 17:37:56,743][41544] Updated weights for policy 0, policy_version 161440 (0.0005) +[2023-03-11 17:37:58,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9762.1, 300 sec: 9427.7). Total num frames: 82669568. Throughput: 0: 9909.7. Samples: 82648244. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:37:58,386][41256] Avg episode reward: [(0, '84.992')] +[2023-03-11 17:38:00,855][41544] Updated weights for policy 0, policy_version 161520 (0.0005) +[2023-03-11 17:38:03,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9455.5). Total num frames: 82722816. Throughput: 0: 9991.6. Samples: 82707704. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:38:03,386][41256] Avg episode reward: [(0, '88.531')] +[2023-03-11 17:38:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000161568_82722816.pth... +[2023-03-11 17:38:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000160984_82423808.pth +[2023-03-11 17:38:05,039][41544] Updated weights for policy 0, policy_version 161600 (0.0005) +[2023-03-11 17:38:08,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9469.4). Total num frames: 82771968. Throughput: 0: 9988.0. Samples: 82767044. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:38:08,386][41256] Avg episode reward: [(0, '87.671')] +[2023-03-11 17:38:09,212][41544] Updated weights for policy 0, policy_version 161680 (0.0005) +[2023-03-11 17:38:13,334][41544] Updated weights for policy 0, policy_version 161760 (0.0005) +[2023-03-11 17:38:13,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9483.3). Total num frames: 82821120. Throughput: 0: 9969.2. Samples: 82796188. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:38:13,386][41256] Avg episode reward: [(0, '83.421')] +[2023-03-11 17:38:17,475][41544] Updated weights for policy 0, policy_version 161840 (0.0005) +[2023-03-11 17:38:18,386][41256] Fps is (10 sec: 9830.3, 60 sec: 9966.9, 300 sec: 9497.2). Total num frames: 82870272. Throughput: 0: 9960.1. Samples: 82855832. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:38:18,386][41256] Avg episode reward: [(0, '86.862')] +[2023-03-11 17:38:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000161856_82870272.pth... 
+[2023-03-11 17:38:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000161272_82571264.pth +[2023-03-11 17:38:21,576][41544] Updated weights for policy 0, policy_version 161920 (0.0005) +[2023-03-11 17:38:23,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9511.0). Total num frames: 82919424. Throughput: 0: 9930.7. Samples: 82915340. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:38:23,386][41256] Avg episode reward: [(0, '86.063')] +[2023-03-11 17:38:25,720][41544] Updated weights for policy 0, policy_version 162000 (0.0005) +[2023-03-11 17:38:28,386][41256] Fps is (10 sec: 9830.3, 60 sec: 9966.9, 300 sec: 9524.9). Total num frames: 82968576. Throughput: 0: 9937.4. Samples: 82945016. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:38:28,386][41256] Avg episode reward: [(0, '88.955')] +[2023-03-11 17:38:30,100][41544] Updated weights for policy 0, policy_version 162080 (0.0005) +[2023-03-11 17:38:33,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9898.7, 300 sec: 9524.9). Total num frames: 83013632. Throughput: 0: 9830.4. Samples: 83001408. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:38:33,386][41256] Avg episode reward: [(0, '86.260')] +[2023-03-11 17:38:33,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000162136_83013632.pth... +[2023-03-11 17:38:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000161568_82722816.pth +[2023-03-11 17:38:34,345][41544] Updated weights for policy 0, policy_version 162160 (0.0005) +[2023-03-11 17:38:38,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9830.4, 300 sec: 9538.8). Total num frames: 83062784. Throughput: 0: 9777.9. Samples: 83059736. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:38:38,386][41256] Avg episode reward: [(0, '84.625')] +[2023-03-11 17:38:38,573][41544] Updated weights for policy 0, policy_version 162240 (0.0005) +[2023-03-11 17:38:42,729][41544] Updated weights for policy 0, policy_version 162320 (0.0005) +[2023-03-11 17:38:43,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9830.4, 300 sec: 9552.7). Total num frames: 83111936. Throughput: 0: 9799.9. Samples: 83089240. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:38:43,386][41256] Avg episode reward: [(0, '85.798')] +[2023-03-11 17:38:46,834][41544] Updated weights for policy 0, policy_version 162400 (0.0005) +[2023-03-11 17:38:48,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9566.6). Total num frames: 83161088. Throughput: 0: 9803.6. Samples: 83148864. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:38:48,386][41256] Avg episode reward: [(0, '85.382')] +[2023-03-11 17:38:48,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000162424_83161088.pth... +[2023-03-11 17:38:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000161856_82870272.pth +[2023-03-11 17:38:50,948][41544] Updated weights for policy 0, policy_version 162480 (0.0005) +[2023-03-11 17:38:53,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9594.4). Total num frames: 83214336. Throughput: 0: 9823.0. Samples: 83209080. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:38:53,386][41256] Avg episode reward: [(0, '88.466')] +[2023-03-11 17:38:55,088][41544] Updated weights for policy 0, policy_version 162560 (0.0005) +[2023-03-11 17:38:58,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9580.5). Total num frames: 83259392. Throughput: 0: 9828.0. Samples: 83238448. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:38:58,386][41256] Avg episode reward: [(0, '82.671')] +[2023-03-11 17:38:59,364][41544] Updated weights for policy 0, policy_version 162640 (0.0005) +[2023-03-11 17:39:03,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9762.1, 300 sec: 9594.4). Total num frames: 83308544. Throughput: 0: 9801.2. Samples: 83296884. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:39:03,386][41256] Avg episode reward: [(0, '83.936')] +[2023-03-11 17:39:03,411][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000162720_83312640.pth... +[2023-03-11 17:39:03,411][41544] Updated weights for policy 0, policy_version 162720 (0.0005) +[2023-03-11 17:39:03,413][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000162136_83013632.pth +[2023-03-11 17:39:07,417][41544] Updated weights for policy 0, policy_version 162800 (0.0004) +[2023-03-11 17:39:08,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 9622.1). Total num frames: 83361792. Throughput: 0: 9831.7. Samples: 83357768. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:39:08,386][41256] Avg episode reward: [(0, '84.519')] +[2023-03-11 17:39:11,358][41544] Updated weights for policy 0, policy_version 162880 (0.0004) +[2023-03-11 17:39:13,385][41256] Fps is (10 sec: 10649.6, 60 sec: 9898.7, 300 sec: 9649.9). Total num frames: 83415040. Throughput: 0: 9864.4. Samples: 83388912. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:39:13,386][41256] Avg episode reward: [(0, '82.879')] +[2023-03-11 17:39:15,372][41544] Updated weights for policy 0, policy_version 162960 (0.0005) +[2023-03-11 17:39:18,386][41256] Fps is (10 sec: 10239.9, 60 sec: 9898.7, 300 sec: 9663.8). Total num frames: 83464192. Throughput: 0: 9982.1. Samples: 83450604. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:39:18,386][41256] Avg episode reward: [(0, '84.129')] +[2023-03-11 17:39:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000163016_83464192.pth... +[2023-03-11 17:39:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000162424_83161088.pth +[2023-03-11 17:39:19,410][41544] Updated weights for policy 0, policy_version 163040 (0.0005) +[2023-03-11 17:39:23,370][41544] Updated weights for policy 0, policy_version 163120 (0.0004) +[2023-03-11 17:39:23,385][41256] Fps is (10 sec: 10240.1, 60 sec: 9966.9, 300 sec: 9677.7). Total num frames: 83517440. Throughput: 0: 10056.5. Samples: 83512276. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:39:23,386][41256] Avg episode reward: [(0, '85.729')] +[2023-03-11 17:39:27,334][41544] Updated weights for policy 0, policy_version 163200 (0.0004) +[2023-03-11 17:39:28,385][41256] Fps is (10 sec: 10240.1, 60 sec: 9967.0, 300 sec: 9691.6). Total num frames: 83566592. Throughput: 0: 10084.0. Samples: 83543020. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:39:28,386][41256] Avg episode reward: [(0, '85.239')] +[2023-03-11 17:39:31,383][41544] Updated weights for policy 0, policy_version 163280 (0.0004) +[2023-03-11 17:39:33,385][41256] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 9691.6). Total num frames: 83615744. Throughput: 0: 10103.7. Samples: 83603528. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:39:33,386][41256] Avg episode reward: [(0, '82.967')] +[2023-03-11 17:39:33,411][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000163320_83619840.pth... 
+[2023-03-11 17:39:33,412][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000162720_83312640.pth +[2023-03-11 17:39:35,381][41544] Updated weights for policy 0, policy_version 163360 (0.0004) +[2023-03-11 17:39:38,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 9719.3). Total num frames: 83668992. Throughput: 0: 10130.8. Samples: 83664968. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:39:38,386][41256] Avg episode reward: [(0, '84.573')] +[2023-03-11 17:39:39,451][41544] Updated weights for policy 0, policy_version 163440 (0.0005) +[2023-03-11 17:39:43,385][41256] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 9733.2). Total num frames: 83718144. Throughput: 0: 10152.8. Samples: 83695324. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:39:43,386][41256] Avg episode reward: [(0, '85.645')] +[2023-03-11 17:39:43,506][41544] Updated weights for policy 0, policy_version 163520 (0.0005) +[2023-03-11 17:39:47,559][41544] Updated weights for policy 0, policy_version 163600 (0.0004) +[2023-03-11 17:39:48,385][41256] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 9747.1). Total num frames: 83767296. Throughput: 0: 10193.2. Samples: 83755576. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:39:48,386][41256] Avg episode reward: [(0, '85.281')] +[2023-03-11 17:39:48,406][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000163616_83771392.pth... +[2023-03-11 17:39:48,408][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000163016_83464192.pth +[2023-03-11 17:39:51,693][41544] Updated weights for policy 0, policy_version 163680 (0.0005) +[2023-03-11 17:39:53,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 9761.0). Total num frames: 83820544. Throughput: 0: 10177.9. Samples: 83815772. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:39:53,386][41256] Avg episode reward: [(0, '87.020')] +[2023-03-11 17:39:55,870][41544] Updated weights for policy 0, policy_version 163760 (0.0005) +[2023-03-11 17:39:58,385][41256] Fps is (10 sec: 9830.3, 60 sec: 10103.5, 300 sec: 9761.0). Total num frames: 83865600. Throughput: 0: 10138.3. Samples: 83845136. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:39:58,386][41256] Avg episode reward: [(0, '87.833')] +[2023-03-11 17:40:00,049][41544] Updated weights for policy 0, policy_version 163840 (0.0005) +[2023-03-11 17:40:03,386][41256] Fps is (10 sec: 9830.4, 60 sec: 10171.7, 300 sec: 9788.7). Total num frames: 83918848. Throughput: 0: 10079.8. Samples: 83904196. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:40:03,386][41256] Avg episode reward: [(0, '85.243')] +[2023-03-11 17:40:03,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000163904_83918848.pth... +[2023-03-11 17:40:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000163320_83619840.pth +[2023-03-11 17:40:04,114][41544] Updated weights for policy 0, policy_version 163920 (0.0004) +[2023-03-11 17:40:08,064][41544] Updated weights for policy 0, policy_version 164000 (0.0005) +[2023-03-11 17:40:08,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 9788.7). Total num frames: 83968000. Throughput: 0: 10073.1. Samples: 83965568. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:40:08,386][41256] Avg episode reward: [(0, '86.489')] +[2023-03-11 17:40:11,980][41544] Updated weights for policy 0, policy_version 164080 (0.0004) +[2023-03-11 17:40:13,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 9816.5). Total num frames: 84021248. Throughput: 0: 10083.6. Samples: 83996784. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:40:13,386][41256] Avg episode reward: [(0, '85.707')] +[2023-03-11 17:40:16,025][41544] Updated weights for policy 0, policy_version 164160 (0.0004) +[2023-03-11 17:40:18,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 9816.5). Total num frames: 84070400. Throughput: 0: 10102.0. Samples: 84058120. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:40:18,386][41256] Avg episode reward: [(0, '88.881')] +[2023-03-11 17:40:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000164200_84070400.pth... +[2023-03-11 17:40:18,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000163616_83771392.pth +[2023-03-11 17:40:20,098][41544] Updated weights for policy 0, policy_version 164240 (0.0005) +[2023-03-11 17:40:23,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 9830.4). Total num frames: 84123648. Throughput: 0: 10085.0. Samples: 84118792. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:40:23,386][41256] Avg episode reward: [(0, '87.264')] +[2023-03-11 17:40:24,144][41544] Updated weights for policy 0, policy_version 164320 (0.0004) +[2023-03-11 17:40:28,183][41544] Updated weights for policy 0, policy_version 164400 (0.0004) +[2023-03-11 17:40:28,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 9844.3). Total num frames: 84172800. Throughput: 0: 10071.2. Samples: 84148528. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:40:28,386][41256] Avg episode reward: [(0, '86.111')] +[2023-03-11 17:40:32,260][41544] Updated weights for policy 0, policy_version 164480 (0.0004) +[2023-03-11 17:40:33,385][41256] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 9844.3). Total num frames: 84221952. Throughput: 0: 10092.3. Samples: 84209728. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:40:33,386][41256] Avg episode reward: [(0, '89.418')] +[2023-03-11 17:40:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000164496_84221952.pth... +[2023-03-11 17:40:33,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000163904_83918848.pth +[2023-03-11 17:40:36,397][41544] Updated weights for policy 0, policy_version 164560 (0.0005) +[2023-03-11 17:40:38,385][41256] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 9858.2). Total num frames: 84271104. Throughput: 0: 10064.5. Samples: 84268676. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:40:38,386][41256] Avg episode reward: [(0, '86.999')] +[2023-03-11 17:40:40,577][41544] Updated weights for policy 0, policy_version 164640 (0.0005) +[2023-03-11 17:40:43,385][41256] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 9858.2). Total num frames: 84320256. Throughput: 0: 10073.4. Samples: 84298440. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:40:43,386][41256] Avg episode reward: [(0, '87.987')] +[2023-03-11 17:40:44,773][41544] Updated weights for policy 0, policy_version 164720 (0.0005) +[2023-03-11 17:40:48,386][41256] Fps is (10 sec: 9830.3, 60 sec: 10035.2, 300 sec: 9858.2). Total num frames: 84369408. Throughput: 0: 10039.7. Samples: 84355984. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:40:48,397][41256] Avg episode reward: [(0, '87.483')] +[2023-03-11 17:40:48,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000164784_84369408.pth... 
+[2023-03-11 17:40:48,403][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000164200_84070400.pth +[2023-03-11 17:40:49,147][41544] Updated weights for policy 0, policy_version 164800 (0.0005) +[2023-03-11 17:40:53,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9898.7, 300 sec: 9844.3). Total num frames: 84414464. Throughput: 0: 9919.2. Samples: 84411932. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:40:53,397][41256] Avg episode reward: [(0, '85.902')] +[2023-03-11 17:40:53,506][41544] Updated weights for policy 0, policy_version 164880 (0.0005) +[2023-03-11 17:40:58,023][41544] Updated weights for policy 0, policy_version 164960 (0.0005) +[2023-03-11 17:40:58,385][41256] Fps is (10 sec: 9011.4, 60 sec: 9898.7, 300 sec: 9830.4). Total num frames: 84459520. Throughput: 0: 9843.9. Samples: 84439760. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:40:58,396][41256] Avg episode reward: [(0, '86.025')] +[2023-03-11 17:41:02,425][41544] Updated weights for policy 0, policy_version 165040 (0.0005) +[2023-03-11 17:41:03,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9830.4, 300 sec: 9816.5). Total num frames: 84508672. Throughput: 0: 9720.3. Samples: 84495532. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:41:03,396][41256] Avg episode reward: [(0, '86.030')] +[2023-03-11 17:41:03,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000165056_84508672.pth... +[2023-03-11 17:41:03,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000164496_84221952.pth +[2023-03-11 17:41:06,918][41544] Updated weights for policy 0, policy_version 165120 (0.0005) +[2023-03-11 17:41:08,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9762.1, 300 sec: 9802.6). Total num frames: 84553728. Throughput: 0: 9575.8. Samples: 84549704. 
Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 17:41:08,396][41256] Avg episode reward: [(0, '87.987')] +[2023-03-11 17:41:11,341][41544] Updated weights for policy 0, policy_version 165200 (0.0005) +[2023-03-11 17:41:13,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9625.6, 300 sec: 9788.7). Total num frames: 84598784. Throughput: 0: 9550.6. Samples: 84578304. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 17:41:13,396][41256] Avg episode reward: [(0, '86.112')] +[2023-03-11 17:41:15,774][41544] Updated weights for policy 0, policy_version 165280 (0.0005) +[2023-03-11 17:41:18,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9557.3, 300 sec: 9788.7). Total num frames: 84643840. Throughput: 0: 9406.0. Samples: 84633000. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 17:41:18,396][41256] Avg episode reward: [(0, '85.258')] +[2023-03-11 17:41:18,458][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000165328_84647936.pth... +[2023-03-11 17:41:18,460][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000164784_84369408.pth +[2023-03-11 17:41:20,271][41544] Updated weights for policy 0, policy_version 165360 (0.0005) +[2023-03-11 17:41:23,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9788.7). Total num frames: 84692992. Throughput: 0: 9338.2. Samples: 84688896. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 17:41:23,396][41256] Avg episode reward: [(0, '86.038')] +[2023-03-11 17:41:24,615][41544] Updated weights for policy 0, policy_version 165440 (0.0005) +[2023-03-11 17:41:28,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9788.7). Total num frames: 84738048. Throughput: 0: 9308.4. Samples: 84717316. 
Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 17:41:28,396][41256] Avg episode reward: [(0, '86.966')] +[2023-03-11 17:41:29,048][41544] Updated weights for policy 0, policy_version 165520 (0.0005) +[2023-03-11 17:41:33,361][41544] Updated weights for policy 0, policy_version 165600 (0.0005) +[2023-03-11 17:41:33,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9420.8, 300 sec: 9788.7). Total num frames: 84787200. Throughput: 0: 9259.2. Samples: 84772648. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 17:41:33,397][41256] Avg episode reward: [(0, '83.562')] +[2023-03-11 17:41:33,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000165600_84787200.pth... +[2023-03-11 17:41:33,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000165056_84508672.pth +[2023-03-11 17:41:37,708][41544] Updated weights for policy 0, policy_version 165680 (0.0005) +[2023-03-11 17:41:38,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9788.7). Total num frames: 84832256. Throughput: 0: 9275.6. Samples: 84829336. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 17:41:38,396][41256] Avg episode reward: [(0, '84.211')] +[2023-03-11 17:41:42,017][41544] Updated weights for policy 0, policy_version 165760 (0.0005) +[2023-03-11 17:41:43,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9352.5, 300 sec: 9788.8). Total num frames: 84881408. Throughput: 0: 9285.3. Samples: 84857600. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 17:41:43,386][41256] Avg episode reward: [(0, '82.826')] +[2023-03-11 17:41:46,457][41544] Updated weights for policy 0, policy_version 165840 (0.0005) +[2023-03-11 17:41:48,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9788.7). Total num frames: 84926464. Throughput: 0: 9302.0. Samples: 84914120. 
Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 17:41:48,386][41256] Avg episode reward: [(0, '83.702')] +[2023-03-11 17:41:48,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000165872_84926464.pth... +[2023-03-11 17:41:48,393][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000165328_84647936.pth +[2023-03-11 17:41:50,868][41544] Updated weights for policy 0, policy_version 165920 (0.0005) +[2023-03-11 17:41:53,385][41256] Fps is (10 sec: 9011.1, 60 sec: 9284.3, 300 sec: 9788.7). Total num frames: 84971520. Throughput: 0: 9335.8. Samples: 84969816. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 17:41:53,396][41256] Avg episode reward: [(0, '80.659')] +[2023-03-11 17:41:55,228][41544] Updated weights for policy 0, policy_version 166000 (0.0005) +[2023-03-11 17:41:58,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9802.6). Total num frames: 85020672. Throughput: 0: 9327.0. Samples: 84998020. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 17:41:58,396][41256] Avg episode reward: [(0, '84.643')] +[2023-03-11 17:41:59,497][41544] Updated weights for policy 0, policy_version 166080 (0.0005) +[2023-03-11 17:42:03,386][41256] Fps is (10 sec: 9830.3, 60 sec: 9352.5, 300 sec: 9802.6). Total num frames: 85069824. Throughput: 0: 9371.9. Samples: 85054736. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 17:42:03,396][41256] Avg episode reward: [(0, '83.430')] +[2023-03-11 17:42:03,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000166152_85069824.pth... 
+[2023-03-11 17:42:03,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000165600_84787200.pth +[2023-03-11 17:42:03,783][41544] Updated weights for policy 0, policy_version 166160 (0.0005) +[2023-03-11 17:42:07,845][41544] Updated weights for policy 0, policy_version 166240 (0.0004) +[2023-03-11 17:42:08,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9420.8, 300 sec: 9816.5). Total num frames: 85118976. Throughput: 0: 9466.5. Samples: 85114888. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 17:42:08,396][41256] Avg episode reward: [(0, '83.825')] +[2023-03-11 17:42:11,943][41544] Updated weights for policy 0, policy_version 166320 (0.0004) +[2023-03-11 17:42:13,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9489.1, 300 sec: 9816.5). Total num frames: 85168128. Throughput: 0: 9488.5. Samples: 85144300. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:42:13,396][41256] Avg episode reward: [(0, '81.624')] +[2023-03-11 17:42:15,956][41544] Updated weights for policy 0, policy_version 166400 (0.0004) +[2023-03-11 17:42:18,386][41256] Fps is (10 sec: 9830.3, 60 sec: 9557.3, 300 sec: 9816.5). Total num frames: 85217280. Throughput: 0: 9624.4. Samples: 85205748. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:42:18,397][41256] Avg episode reward: [(0, '81.346')] +[2023-03-11 17:42:18,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000166448_85221376.pth... +[2023-03-11 17:42:18,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000165872_84926464.pth +[2023-03-11 17:42:19,982][41544] Updated weights for policy 0, policy_version 166480 (0.0004) +[2023-03-11 17:42:23,386][41256] Fps is (10 sec: 10240.0, 60 sec: 9625.6, 300 sec: 9830.4). Total num frames: 85270528. Throughput: 0: 9712.3. Samples: 85266392. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:42:23,397][41256] Avg episode reward: [(0, '82.313')] +[2023-03-11 17:42:24,097][41544] Updated weights for policy 0, policy_version 166560 (0.0005) +[2023-03-11 17:42:28,261][41544] Updated weights for policy 0, policy_version 166640 (0.0005) +[2023-03-11 17:42:28,385][41256] Fps is (10 sec: 10240.1, 60 sec: 9693.9, 300 sec: 9830.4). Total num frames: 85319680. Throughput: 0: 9733.7. Samples: 85295616. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:42:28,396][41256] Avg episode reward: [(0, '84.404')] +[2023-03-11 17:42:32,390][41544] Updated weights for policy 0, policy_version 166720 (0.0005) +[2023-03-11 17:42:33,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9693.9, 300 sec: 9816.5). Total num frames: 85368832. Throughput: 0: 9809.9. Samples: 85355564. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:42:33,396][41256] Avg episode reward: [(0, '83.714')] +[2023-03-11 17:42:33,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000166736_85368832.pth... +[2023-03-11 17:42:33,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000166152_85069824.pth +[2023-03-11 17:42:36,500][41544] Updated weights for policy 0, policy_version 166800 (0.0005) +[2023-03-11 17:42:38,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9816.5). Total num frames: 85417984. Throughput: 0: 9884.8. Samples: 85414632. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:42:38,396][41256] Avg episode reward: [(0, '81.024')] +[2023-03-11 17:42:40,578][41544] Updated weights for policy 0, policy_version 166880 (0.0005) +[2023-03-11 17:42:43,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9816.5). Total num frames: 85467136. Throughput: 0: 9936.1. Samples: 85445144. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:42:43,396][41256] Avg episode reward: [(0, '81.198')] +[2023-03-11 17:42:44,692][41544] Updated weights for policy 0, policy_version 166960 (0.0005) +[2023-03-11 17:42:48,386][41256] Fps is (10 sec: 10239.9, 60 sec: 9898.7, 300 sec: 9830.4). Total num frames: 85520384. Throughput: 0: 10014.6. Samples: 85505392. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:42:48,396][41256] Avg episode reward: [(0, '79.921')] +[2023-03-11 17:42:48,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000167032_85520384.pth... +[2023-03-11 17:42:48,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000166448_85221376.pth +[2023-03-11 17:42:48,680][41544] Updated weights for policy 0, policy_version 167040 (0.0005) +[2023-03-11 17:42:52,844][41544] Updated weights for policy 0, policy_version 167120 (0.0005) +[2023-03-11 17:42:53,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 9830.4). Total num frames: 85569536. Throughput: 0: 10012.5. Samples: 85565448. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:42:53,396][41256] Avg episode reward: [(0, '79.313')] +[2023-03-11 17:42:57,167][41544] Updated weights for policy 0, policy_version 167200 (0.0005) +[2023-03-11 17:42:58,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9898.7, 300 sec: 9802.6). Total num frames: 85614592. Throughput: 0: 9996.0. Samples: 85594120. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:42:58,386][41256] Avg episode reward: [(0, '80.523')] +[2023-03-11 17:43:01,525][41544] Updated weights for policy 0, policy_version 167280 (0.0005) +[2023-03-11 17:43:03,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9898.7, 300 sec: 9802.6). Total num frames: 85663744. Throughput: 0: 9885.6. Samples: 85650600. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:43:03,386][41256] Avg episode reward: [(0, '79.288')] +[2023-03-11 17:43:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000167312_85663744.pth... +[2023-03-11 17:43:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000166736_85368832.pth +[2023-03-11 17:43:05,876][41544] Updated weights for policy 0, policy_version 167360 (0.0005) +[2023-03-11 17:43:08,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9830.4, 300 sec: 9788.7). Total num frames: 85708800. Throughput: 0: 9796.9. Samples: 85707252. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:43:08,386][41256] Avg episode reward: [(0, '78.169')] +[2023-03-11 17:43:10,160][41544] Updated weights for policy 0, policy_version 167440 (0.0005) +[2023-03-11 17:43:13,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9830.4, 300 sec: 9788.7). Total num frames: 85757952. Throughput: 0: 9777.4. Samples: 85735600. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:43:13,386][41256] Avg episode reward: [(0, '78.604')] +[2023-03-11 17:43:14,504][41544] Updated weights for policy 0, policy_version 167520 (0.0005) +[2023-03-11 17:43:18,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9762.1, 300 sec: 9774.9). Total num frames: 85803008. Throughput: 0: 9697.3. Samples: 85791944. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:43:18,386][41256] Avg episode reward: [(0, '80.334')] +[2023-03-11 17:43:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000167584_85803008.pth... 
+[2023-03-11 17:43:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000167032_85520384.pth +[2023-03-11 17:43:18,913][41544] Updated weights for policy 0, policy_version 167600 (0.0006) +[2023-03-11 17:43:23,271][41544] Updated weights for policy 0, policy_version 167680 (0.0005) +[2023-03-11 17:43:23,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9774.9). Total num frames: 85852160. Throughput: 0: 9633.2. Samples: 85848128. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:43:23,386][41256] Avg episode reward: [(0, '79.869')] +[2023-03-11 17:43:27,721][41544] Updated weights for policy 0, policy_version 167760 (0.0005) +[2023-03-11 17:43:28,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9625.6, 300 sec: 9774.9). Total num frames: 85897216. Throughput: 0: 9589.9. Samples: 85876688. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:43:28,386][41256] Avg episode reward: [(0, '80.855')] +[2023-03-11 17:43:32,057][41544] Updated weights for policy 0, policy_version 167840 (0.0005) +[2023-03-11 17:43:33,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9774.9). Total num frames: 85946368. Throughput: 0: 9476.7. Samples: 85931844. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:43:33,386][41256] Avg episode reward: [(0, '80.138')] +[2023-03-11 17:43:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000167864_85946368.pth... +[2023-03-11 17:43:33,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000167312_85663744.pth +[2023-03-11 17:43:36,433][41544] Updated weights for policy 0, policy_version 167920 (0.0005) +[2023-03-11 17:43:38,385][41256] Fps is (10 sec: 9420.7, 60 sec: 9557.3, 300 sec: 9761.0). Total num frames: 85991424. Throughput: 0: 9381.8. Samples: 85987628. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:43:38,386][41256] Avg episode reward: [(0, '77.188')] +[2023-03-11 17:43:40,797][41544] Updated weights for policy 0, policy_version 168000 (0.0005) +[2023-03-11 17:43:43,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9489.1, 300 sec: 9747.1). Total num frames: 86036480. Throughput: 0: 9377.2. Samples: 86016096. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:43:43,386][41256] Avg episode reward: [(0, '76.442')] +[2023-03-11 17:43:45,197][41544] Updated weights for policy 0, policy_version 168080 (0.0004) +[2023-03-11 17:43:48,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9420.8, 300 sec: 9733.2). Total num frames: 86085632. Throughput: 0: 9382.3. Samples: 86072804. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:43:48,386][41256] Avg episode reward: [(0, '79.268')] +[2023-03-11 17:43:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000168136_86085632.pth... +[2023-03-11 17:43:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000167584_85803008.pth +[2023-03-11 17:43:49,616][41544] Updated weights for policy 0, policy_version 168160 (0.0005) +[2023-03-11 17:43:53,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9733.2). Total num frames: 86130688. Throughput: 0: 9341.9. Samples: 86127640. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:43:53,386][41256] Avg episode reward: [(0, '77.829')] +[2023-03-11 17:43:54,044][41544] Updated weights for policy 0, policy_version 168240 (0.0005) +[2023-03-11 17:43:58,385][41256] Fps is (10 sec: 9011.3, 60 sec: 9352.5, 300 sec: 9719.3). Total num frames: 86175744. Throughput: 0: 9327.5. Samples: 86155336. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:43:58,386][41256] Avg episode reward: [(0, '80.158')] +[2023-03-11 17:43:58,460][41544] Updated weights for policy 0, policy_version 168320 (0.0005) +[2023-03-11 17:44:02,822][41544] Updated weights for policy 0, policy_version 168400 (0.0005) +[2023-03-11 17:44:03,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9352.5, 300 sec: 9705.4). Total num frames: 86224896. Throughput: 0: 9340.5. Samples: 86212268. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:44:03,386][41256] Avg episode reward: [(0, '79.373')] +[2023-03-11 17:44:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000168408_86224896.pth... +[2023-03-11 17:44:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000167864_85946368.pth +[2023-03-11 17:44:07,228][41544] Updated weights for policy 0, policy_version 168480 (0.0005) +[2023-03-11 17:44:08,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9677.7). Total num frames: 86269952. Throughput: 0: 9317.3. Samples: 86267408. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:44:08,386][41256] Avg episode reward: [(0, '79.485')] +[2023-03-11 17:44:11,654][41544] Updated weights for policy 0, policy_version 168560 (0.0005) +[2023-03-11 17:44:13,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9352.5, 300 sec: 9677.7). Total num frames: 86319104. Throughput: 0: 9295.6. Samples: 86294992. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:44:13,386][41256] Avg episode reward: [(0, '79.083')] +[2023-03-11 17:44:15,944][41544] Updated weights for policy 0, policy_version 168640 (0.0005) +[2023-03-11 17:44:18,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9352.5, 300 sec: 9649.9). Total num frames: 86364160. Throughput: 0: 9335.4. Samples: 86351936. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:44:18,386][41256] Avg episode reward: [(0, '79.634')] +[2023-03-11 17:44:18,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000168680_86364160.pth... +[2023-03-11 17:44:18,393][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000168136_86085632.pth +[2023-03-11 17:44:20,367][41544] Updated weights for policy 0, policy_version 168720 (0.0005) +[2023-03-11 17:44:23,385][41256] Fps is (10 sec: 9011.2, 60 sec: 9284.3, 300 sec: 9636.0). Total num frames: 86409216. Throughput: 0: 9339.2. Samples: 86407892. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:44:23,386][41256] Avg episode reward: [(0, '83.641')] +[2023-03-11 17:44:24,798][41544] Updated weights for policy 0, policy_version 168800 (0.0005) +[2023-03-11 17:44:28,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9352.5, 300 sec: 9636.0). Total num frames: 86458368. Throughput: 0: 9307.4. Samples: 86434928. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:44:28,386][41256] Avg episode reward: [(0, '83.739')] +[2023-03-11 17:44:29,193][41544] Updated weights for policy 0, policy_version 168880 (0.0005) +[2023-03-11 17:44:33,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9284.3, 300 sec: 9608.2). Total num frames: 86503424. Throughput: 0: 9306.9. Samples: 86491612. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:44:33,386][41256] Avg episode reward: [(0, '82.918')] +[2023-03-11 17:44:33,388][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000168952_86503424.pth... 
+[2023-03-11 17:44:33,390][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000168408_86224896.pth +[2023-03-11 17:44:33,446][41544] Updated weights for policy 0, policy_version 168960 (0.0005) +[2023-03-11 17:44:37,475][41544] Updated weights for policy 0, policy_version 169040 (0.0004) +[2023-03-11 17:44:38,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9420.8, 300 sec: 9622.1). Total num frames: 86556672. Throughput: 0: 9436.1. Samples: 86552264. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:44:38,386][41256] Avg episode reward: [(0, '81.404')] +[2023-03-11 17:44:41,626][41544] Updated weights for policy 0, policy_version 169120 (0.0005) +[2023-03-11 17:44:43,386][41256] Fps is (10 sec: 10239.9, 60 sec: 9489.1, 300 sec: 9622.1). Total num frames: 86605824. Throughput: 0: 9468.4. Samples: 86581416. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:44:43,386][41256] Avg episode reward: [(0, '79.465')] +[2023-03-11 17:44:45,713][41544] Updated weights for policy 0, policy_version 169200 (0.0004) +[2023-03-11 17:44:48,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9489.1, 300 sec: 9608.2). Total num frames: 86654976. Throughput: 0: 9544.0. Samples: 86641748. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:44:48,386][41256] Avg episode reward: [(0, '76.843')] +[2023-03-11 17:44:48,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000169248_86654976.pth... +[2023-03-11 17:44:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000168680_86364160.pth +[2023-03-11 17:44:49,932][41544] Updated weights for policy 0, policy_version 169280 (0.0005) +[2023-03-11 17:44:53,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9489.1, 300 sec: 9608.2). Total num frames: 86700032. Throughput: 0: 9587.0. Samples: 86698824. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:44:53,386][41256] Avg episode reward: [(0, '80.121')] +[2023-03-11 17:44:54,325][41544] Updated weights for policy 0, policy_version 169360 (0.0005) +[2023-03-11 17:44:58,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9594.4). Total num frames: 86749184. Throughput: 0: 9605.4. Samples: 86727236. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:44:58,386][41256] Avg episode reward: [(0, '80.357')] +[2023-03-11 17:44:58,612][41544] Updated weights for policy 0, policy_version 169440 (0.0005) +[2023-03-11 17:45:02,938][41544] Updated weights for policy 0, policy_version 169520 (0.0005) +[2023-03-11 17:45:03,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9557.4, 300 sec: 9594.4). Total num frames: 86798336. Throughput: 0: 9596.4. Samples: 86783772. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:45:03,386][41256] Avg episode reward: [(0, '79.970')] +[2023-03-11 17:45:03,388][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000169528_86798336.pth... +[2023-03-11 17:45:03,390][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000168952_86503424.pth +[2023-03-11 17:45:07,369][41544] Updated weights for policy 0, policy_version 169600 (0.0005) +[2023-03-11 17:45:08,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9557.3, 300 sec: 9566.6). Total num frames: 86843392. Throughput: 0: 9588.2. Samples: 86839360. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:45:08,386][41256] Avg episode reward: [(0, '81.436')] +[2023-03-11 17:45:11,804][41544] Updated weights for policy 0, policy_version 169680 (0.0005) +[2023-03-11 17:45:13,385][41256] Fps is (10 sec: 9011.1, 60 sec: 9489.1, 300 sec: 9552.7). Total num frames: 86888448. Throughput: 0: 9614.9. Samples: 86867600. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:45:13,386][41256] Avg episode reward: [(0, '81.119')] +[2023-03-11 17:45:15,880][41544] Updated weights for policy 0, policy_version 169760 (0.0004) +[2023-03-11 17:45:18,386][41256] Fps is (10 sec: 9830.3, 60 sec: 9625.6, 300 sec: 9552.7). Total num frames: 86941696. Throughput: 0: 9666.5. Samples: 86926604. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:45:18,386][41256] Avg episode reward: [(0, '80.623')] +[2023-03-11 17:45:18,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000169808_86941696.pth... +[2023-03-11 17:45:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000169248_86654976.pth +[2023-03-11 17:45:19,965][41544] Updated weights for policy 0, policy_version 169840 (0.0004) +[2023-03-11 17:45:23,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9693.9, 300 sec: 9552.7). Total num frames: 86990848. Throughput: 0: 9656.9. Samples: 86986824. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:45:23,386][41256] Avg episode reward: [(0, '81.815')] +[2023-03-11 17:45:24,008][41544] Updated weights for policy 0, policy_version 169920 (0.0004) +[2023-03-11 17:45:28,089][41544] Updated weights for policy 0, policy_version 170000 (0.0005) +[2023-03-11 17:45:28,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9693.9, 300 sec: 9552.7). Total num frames: 87040000. Throughput: 0: 9684.3. Samples: 87017208. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:45:28,386][41256] Avg episode reward: [(0, '81.947')] +[2023-03-11 17:45:32,133][41544] Updated weights for policy 0, policy_version 170080 (0.0004) +[2023-03-11 17:45:33,386][41256] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 9566.6). Total num frames: 87093248. Throughput: 0: 9694.0. Samples: 87077976. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:45:33,386][41256] Avg episode reward: [(0, '82.923')] +[2023-03-11 17:45:33,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000170104_87093248.pth... +[2023-03-11 17:45:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000169528_86798336.pth +[2023-03-11 17:45:36,203][41544] Updated weights for policy 0, policy_version 170160 (0.0005) +[2023-03-11 17:45:38,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9762.1, 300 sec: 9566.6). Total num frames: 87142400. Throughput: 0: 9767.6. Samples: 87138368. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:45:38,386][41256] Avg episode reward: [(0, '81.928')] +[2023-03-11 17:45:40,297][41544] Updated weights for policy 0, policy_version 170240 (0.0004) +[2023-03-11 17:45:43,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9762.2, 300 sec: 9566.6). Total num frames: 87191552. Throughput: 0: 9798.7. Samples: 87168176. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:45:43,386][41256] Avg episode reward: [(0, '82.005')] +[2023-03-11 17:45:44,359][41544] Updated weights for policy 0, policy_version 170320 (0.0004) +[2023-03-11 17:45:48,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9580.5). Total num frames: 87240704. Throughput: 0: 9887.0. Samples: 87228688. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:45:48,386][41256] Avg episode reward: [(0, '81.122')] +[2023-03-11 17:45:48,423][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000170400_87244800.pth... 
+[2023-03-11 17:45:48,424][41544] Updated weights for policy 0, policy_version 170400 (0.0005) +[2023-03-11 17:45:48,425][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000169808_86941696.pth +[2023-03-11 17:45:52,444][41544] Updated weights for policy 0, policy_version 170480 (0.0004) +[2023-03-11 17:45:53,386][41256] Fps is (10 sec: 10239.9, 60 sec: 9898.7, 300 sec: 9608.2). Total num frames: 87293952. Throughput: 0: 10012.4. Samples: 87289920. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:45:53,397][41256] Avg episode reward: [(0, '81.765')] +[2023-03-11 17:45:56,533][41544] Updated weights for policy 0, policy_version 170560 (0.0005) +[2023-03-11 17:45:58,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9608.2). Total num frames: 87343104. Throughput: 0: 10049.5. Samples: 87319828. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:45:58,396][41256] Avg episode reward: [(0, '80.336')] +[2023-03-11 17:46:00,605][41544] Updated weights for policy 0, policy_version 170640 (0.0005) +[2023-03-11 17:46:03,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9898.7, 300 sec: 9622.1). Total num frames: 87392256. Throughput: 0: 10076.2. Samples: 87380032. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:46:03,396][41256] Avg episode reward: [(0, '80.756')] +[2023-03-11 17:46:03,399][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000170688_87392256.pth... +[2023-03-11 17:46:03,400][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000170104_87093248.pth +[2023-03-11 17:46:04,668][41544] Updated weights for policy 0, policy_version 170720 (0.0005) +[2023-03-11 17:46:08,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9649.9). Total num frames: 87445504. Throughput: 0: 10100.3. Samples: 87441336. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:46:08,396][41256] Avg episode reward: [(0, '82.259')] +[2023-03-11 17:46:08,658][41544] Updated weights for policy 0, policy_version 170800 (0.0004) +[2023-03-11 17:46:12,758][41544] Updated weights for policy 0, policy_version 170880 (0.0005) +[2023-03-11 17:46:13,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 9663.8). Total num frames: 87494656. Throughput: 0: 10085.6. Samples: 87471060. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:46:13,396][41256] Avg episode reward: [(0, '81.170')] +[2023-03-11 17:46:16,834][41544] Updated weights for policy 0, policy_version 170960 (0.0005) +[2023-03-11 17:46:18,385][41256] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 9663.8). Total num frames: 87543808. Throughput: 0: 10078.9. Samples: 87531528. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:46:18,396][41256] Avg episode reward: [(0, '83.239')] +[2023-03-11 17:46:18,399][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000170984_87543808.pth... +[2023-03-11 17:46:18,400][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000170400_87244800.pth +[2023-03-11 17:46:20,904][41544] Updated weights for policy 0, policy_version 171040 (0.0005) +[2023-03-11 17:46:23,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 9691.6). Total num frames: 87597056. Throughput: 0: 10091.2. Samples: 87592472. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:46:23,396][41256] Avg episode reward: [(0, '82.567')] +[2023-03-11 17:46:24,943][41544] Updated weights for policy 0, policy_version 171120 (0.0005) +[2023-03-11 17:46:28,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 9691.6). Total num frames: 87646208. Throughput: 0: 10093.8. Samples: 87622396. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:46:28,396][41256] Avg episode reward: [(0, '81.977')] +[2023-03-11 17:46:28,976][41544] Updated weights for policy 0, policy_version 171200 (0.0004) +[2023-03-11 17:46:32,951][41544] Updated weights for policy 0, policy_version 171280 (0.0004) +[2023-03-11 17:46:33,386][41256] Fps is (10 sec: 10239.9, 60 sec: 10103.5, 300 sec: 9719.3). Total num frames: 87699456. Throughput: 0: 10107.0. Samples: 87683504. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:46:33,397][41256] Avg episode reward: [(0, '80.584')] +[2023-03-11 17:46:33,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000171288_87699456.pth... +[2023-03-11 17:46:33,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000170688_87392256.pth +[2023-03-11 17:46:36,896][41544] Updated weights for policy 0, policy_version 171360 (0.0004) +[2023-03-11 17:46:38,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 9719.3). Total num frames: 87748608. Throughput: 0: 10136.4. Samples: 87746056. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:46:38,386][41256] Avg episode reward: [(0, '79.613')] +[2023-03-11 17:46:40,875][41544] Updated weights for policy 0, policy_version 171440 (0.0004) +[2023-03-11 17:46:43,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 9747.1). Total num frames: 87801856. Throughput: 0: 10164.2. Samples: 87777216. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:46:43,386][41256] Avg episode reward: [(0, '77.560')] +[2023-03-11 17:46:44,880][41544] Updated weights for policy 0, policy_version 171520 (0.0004) +[2023-03-11 17:46:48,386][41256] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 9761.0). Total num frames: 87851008. Throughput: 0: 10187.1. Samples: 87838452. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:46:48,386][41256] Avg episode reward: [(0, '78.747')] +[2023-03-11 17:46:48,421][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000171592_87855104.pth... +[2023-03-11 17:46:48,422][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000170984_87543808.pth +[2023-03-11 17:46:48,829][41544] Updated weights for policy 0, policy_version 171600 (0.0004) +[2023-03-11 17:46:52,806][41544] Updated weights for policy 0, policy_version 171680 (0.0005) +[2023-03-11 17:46:53,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 9774.9). Total num frames: 87904256. Throughput: 0: 10197.7. Samples: 87900232. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:46:53,386][41256] Avg episode reward: [(0, '78.875')] +[2023-03-11 17:46:56,793][41544] Updated weights for policy 0, policy_version 171760 (0.0004) +[2023-03-11 17:46:58,385][41256] Fps is (10 sec: 10240.1, 60 sec: 10171.7, 300 sec: 9774.9). Total num frames: 87953408. Throughput: 0: 10232.0. Samples: 87931500. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:46:58,386][41256] Avg episode reward: [(0, '79.625')] +[2023-03-11 17:47:00,845][41544] Updated weights for policy 0, policy_version 171840 (0.0004) +[2023-03-11 17:47:03,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 9788.7). Total num frames: 88006656. Throughput: 0: 10234.7. Samples: 87992088. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:47:03,386][41256] Avg episode reward: [(0, '79.712')] +[2023-03-11 17:47:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000171888_88006656.pth... 
+[2023-03-11 17:47:03,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000171288_87699456.pth +[2023-03-11 17:47:04,846][41544] Updated weights for policy 0, policy_version 171920 (0.0004) +[2023-03-11 17:47:08,385][41256] Fps is (10 sec: 10240.1, 60 sec: 10171.7, 300 sec: 9788.7). Total num frames: 88055808. Throughput: 0: 10242.9. Samples: 88053404. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:47:08,386][41256] Avg episode reward: [(0, '80.830')] +[2023-03-11 17:47:08,875][41544] Updated weights for policy 0, policy_version 172000 (0.0005) +[2023-03-11 17:47:12,929][41544] Updated weights for policy 0, policy_version 172080 (0.0005) +[2023-03-11 17:47:13,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 9802.6). Total num frames: 88109056. Throughput: 0: 10254.4. Samples: 88083844. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:47:13,386][41256] Avg episode reward: [(0, '79.120')] +[2023-03-11 17:47:16,942][41544] Updated weights for policy 0, policy_version 172160 (0.0004) +[2023-03-11 17:47:18,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 9788.7). Total num frames: 88158208. Throughput: 0: 10251.5. Samples: 88144820. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:47:18,386][41256] Avg episode reward: [(0, '78.980')] +[2023-03-11 17:47:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000172184_88158208.pth... +[2023-03-11 17:47:18,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000171592_87855104.pth +[2023-03-11 17:47:21,021][41544] Updated weights for policy 0, policy_version 172240 (0.0005) +[2023-03-11 17:47:23,385][41256] Fps is (10 sec: 9830.3, 60 sec: 10171.7, 300 sec: 9788.7). Total num frames: 88207360. Throughput: 0: 10203.4. Samples: 88205208. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:47:23,386][41256] Avg episode reward: [(0, '78.914')] +[2023-03-11 17:47:25,096][41544] Updated weights for policy 0, policy_version 172320 (0.0005) +[2023-03-11 17:47:28,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 9802.6). Total num frames: 88260608. Throughput: 0: 10185.8. Samples: 88235576. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:47:28,386][41256] Avg episode reward: [(0, '79.689')] +[2023-03-11 17:47:29,040][41544] Updated weights for policy 0, policy_version 172400 (0.0005) +[2023-03-11 17:47:32,965][41544] Updated weights for policy 0, policy_version 172480 (0.0004) +[2023-03-11 17:47:33,386][41256] Fps is (10 sec: 10649.6, 60 sec: 10240.0, 300 sec: 9816.5). Total num frames: 88313856. Throughput: 0: 10202.0. Samples: 88297544. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:47:33,386][41256] Avg episode reward: [(0, '76.694')] +[2023-03-11 17:47:33,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000172488_88313856.pth... +[2023-03-11 17:47:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000171888_88006656.pth +[2023-03-11 17:47:36,942][41544] Updated weights for policy 0, policy_version 172560 (0.0004) +[2023-03-11 17:47:38,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 9816.5). Total num frames: 88363008. Throughput: 0: 10206.5. Samples: 88359524. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:47:38,386][41256] Avg episode reward: [(0, '76.843')] +[2023-03-11 17:47:40,934][41544] Updated weights for policy 0, policy_version 172640 (0.0004) +[2023-03-11 17:47:43,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 9816.5). Total num frames: 88416256. Throughput: 0: 10207.6. Samples: 88390840. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:47:43,386][41256] Avg episode reward: [(0, '76.043')] +[2023-03-11 17:47:44,953][41544] Updated weights for policy 0, policy_version 172720 (0.0005) +[2023-03-11 17:47:48,386][41256] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 9816.5). Total num frames: 88465408. Throughput: 0: 10221.9. Samples: 88452076. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:47:48,386][41256] Avg episode reward: [(0, '79.122')] +[2023-03-11 17:47:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000172784_88465408.pth... +[2023-03-11 17:47:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000172184_88158208.pth +[2023-03-11 17:47:48,908][41544] Updated weights for policy 0, policy_version 172800 (0.0004) +[2023-03-11 17:47:52,851][41544] Updated weights for policy 0, policy_version 172880 (0.0004) +[2023-03-11 17:47:53,385][41256] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 9844.3). Total num frames: 88518656. Throughput: 0: 10248.2. Samples: 88514572. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:47:53,386][41256] Avg episode reward: [(0, '76.645')] +[2023-03-11 17:47:56,829][41544] Updated weights for policy 0, policy_version 172960 (0.0004) +[2023-03-11 17:47:58,385][41256] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 9858.2). Total num frames: 88571904. Throughput: 0: 10257.9. Samples: 88545448. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:47:58,386][41256] Avg episode reward: [(0, '80.537')] +[2023-03-11 17:47:59,555][41500] Early stopping after 2 epochs (8 sgd steps), loss delta 0.0000003 +[2023-03-11 17:48:00,775][41544] Updated weights for policy 0, policy_version 173040 (0.0004) +[2023-03-11 17:48:03,386][41256] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 9872.0). Total num frames: 88621056. Throughput: 0: 10283.9. 
Samples: 88607596. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:48:03,386][41256] Avg episode reward: [(0, '78.785')] +[2023-03-11 17:48:03,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000173088_88621056.pth... +[2023-03-11 17:48:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000172488_88313856.pth +[2023-03-11 17:48:04,810][41544] Updated weights for policy 0, policy_version 173120 (0.0004) +[2023-03-11 17:48:08,385][41256] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 9872.1). Total num frames: 88670208. Throughput: 0: 10298.2. Samples: 88668628. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:48:08,386][41256] Avg episode reward: [(0, '79.509')] +[2023-03-11 17:48:08,798][41544] Updated weights for policy 0, policy_version 173200 (0.0004) +[2023-03-11 17:48:12,812][41544] Updated weights for policy 0, policy_version 173280 (0.0004) +[2023-03-11 17:48:13,385][41256] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 9899.8). Total num frames: 88723456. Throughput: 0: 10297.2. Samples: 88698952. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:48:13,386][41256] Avg episode reward: [(0, '81.038')] +[2023-03-11 17:48:16,885][41544] Updated weights for policy 0, policy_version 173360 (0.0005) +[2023-03-11 17:48:18,386][41256] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 9899.8). Total num frames: 88772608. Throughput: 0: 10280.5. Samples: 88760168. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:48:18,386][41256] Avg episode reward: [(0, '79.218')] +[2023-03-11 17:48:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000173384_88772608.pth... 
+[2023-03-11 17:48:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000172784_88465408.pth +[2023-03-11 17:48:20,874][41544] Updated weights for policy 0, policy_version 173440 (0.0005) +[2023-03-11 17:48:23,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 9927.6). Total num frames: 88825856. Throughput: 0: 10274.3. Samples: 88821868. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:48:23,386][41256] Avg episode reward: [(0, '82.222')] +[2023-03-11 17:48:24,793][41544] Updated weights for policy 0, policy_version 173520 (0.0004) +[2023-03-11 17:48:28,385][41256] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 9941.5). Total num frames: 88879104. Throughput: 0: 10262.8. Samples: 88852668. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:48:28,386][41256] Avg episode reward: [(0, '80.031')] +[2023-03-11 17:48:28,773][41544] Updated weights for policy 0, policy_version 173600 (0.0004) +[2023-03-11 17:48:32,783][41544] Updated weights for policy 0, policy_version 173680 (0.0005) +[2023-03-11 17:48:33,386][41256] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 9955.4). Total num frames: 88928256. Throughput: 0: 10273.1. Samples: 88914368. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:48:33,386][41256] Avg episode reward: [(0, '82.071')] +[2023-03-11 17:48:33,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000173688_88928256.pth... +[2023-03-11 17:48:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000173088_88621056.pth +[2023-03-11 17:48:36,775][41544] Updated weights for policy 0, policy_version 173760 (0.0004) +[2023-03-11 17:48:38,385][41256] Fps is (10 sec: 9830.4, 60 sec: 10240.0, 300 sec: 9969.2). Total num frames: 88977408. Throughput: 0: 10251.8. Samples: 88975904. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:48:38,386][41256] Avg episode reward: [(0, '78.920')] +[2023-03-11 17:48:40,776][41544] Updated weights for policy 0, policy_version 173840 (0.0005) +[2023-03-11 17:48:43,385][41256] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 9983.1). Total num frames: 89030656. Throughput: 0: 10243.8. Samples: 89006420. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:48:43,386][41256] Avg episode reward: [(0, '79.875')] +[2023-03-11 17:48:44,730][41544] Updated weights for policy 0, policy_version 173920 (0.0004) +[2023-03-11 17:48:48,386][41256] Fps is (10 sec: 10649.5, 60 sec: 10308.3, 300 sec: 10010.9). Total num frames: 89083904. Throughput: 0: 10234.8. Samples: 89068160. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:48:48,386][41256] Avg episode reward: [(0, '79.480')] +[2023-03-11 17:48:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000173992_89083904.pth... +[2023-03-11 17:48:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000173384_88772608.pth +[2023-03-11 17:48:48,725][41544] Updated weights for policy 0, policy_version 174000 (0.0005) +[2023-03-11 17:48:52,734][41544] Updated weights for policy 0, policy_version 174080 (0.0005) +[2023-03-11 17:48:53,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10024.8). Total num frames: 89133056. Throughput: 0: 10249.2. Samples: 89129840. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:48:53,386][41256] Avg episode reward: [(0, '80.065')] +[2023-03-11 17:48:56,702][41544] Updated weights for policy 0, policy_version 174160 (0.0005) +[2023-03-11 17:48:58,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 10038.7). Total num frames: 89186304. Throughput: 0: 10276.4. Samples: 89161388. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:48:58,386][41256] Avg episode reward: [(0, '77.772')] +[2023-03-11 17:49:00,642][41544] Updated weights for policy 0, policy_version 174240 (0.0004) +[2023-03-11 17:49:03,386][41256] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 10052.6). Total num frames: 89235456. Throughput: 0: 10290.3. Samples: 89223232. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:49:03,386][41256] Avg episode reward: [(0, '78.297')] +[2023-03-11 17:49:03,398][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000174296_89239552.pth... +[2023-03-11 17:49:03,400][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000173688_88928256.pth +[2023-03-11 17:49:04,584][41544] Updated weights for policy 0, policy_version 174320 (0.0004) +[2023-03-11 17:49:08,386][41256] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10066.4). Total num frames: 89288704. Throughput: 0: 10297.9. Samples: 89285276. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:49:08,386][41256] Avg episode reward: [(0, '80.199')] +[2023-03-11 17:49:08,563][41544] Updated weights for policy 0, policy_version 174400 (0.0005) +[2023-03-11 17:49:12,521][41544] Updated weights for policy 0, policy_version 174480 (0.0005) +[2023-03-11 17:49:13,385][41256] Fps is (10 sec: 10649.6, 60 sec: 10308.3, 300 sec: 10094.2). Total num frames: 89341952. Throughput: 0: 10305.6. Samples: 89316420. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:49:13,386][41256] Avg episode reward: [(0, '80.008')] +[2023-03-11 17:49:16,393][41544] Updated weights for policy 0, policy_version 174560 (0.0004) +[2023-03-11 17:49:18,386][41256] Fps is (10 sec: 10649.6, 60 sec: 10376.5, 300 sec: 10122.0). Total num frames: 89395200. Throughput: 0: 10329.9. Samples: 89379212. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:49:18,386][41256] Avg episode reward: [(0, '78.270')] +[2023-03-11 17:49:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000174600_89395200.pth... +[2023-03-11 17:49:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000173992_89083904.pth +[2023-03-11 17:49:20,335][41544] Updated weights for policy 0, policy_version 174640 (0.0004) +[2023-03-11 17:49:23,386][41256] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10122.0). Total num frames: 89444352. Throughput: 0: 10341.2. Samples: 89441260. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:49:23,386][41256] Avg episode reward: [(0, '78.616')] +[2023-03-11 17:49:24,338][41544] Updated weights for policy 0, policy_version 174720 (0.0005) +[2023-03-11 17:49:28,262][41544] Updated weights for policy 0, policy_version 174800 (0.0005) +[2023-03-11 17:49:28,385][41256] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 10149.7). Total num frames: 89497600. Throughput: 0: 10350.3. Samples: 89472184. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:49:28,386][41256] Avg episode reward: [(0, '78.751')] +[2023-03-11 17:49:32,199][41544] Updated weights for policy 0, policy_version 174880 (0.0005) +[2023-03-11 17:49:33,386][41256] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 10135.9). Total num frames: 89546752. Throughput: 0: 10365.6. Samples: 89534612. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:49:33,386][41256] Avg episode reward: [(0, '79.339')] +[2023-03-11 17:49:33,406][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000174904_89550848.pth... 
+[2023-03-11 17:49:33,408][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000174296_89239552.pth +[2023-03-11 17:49:36,158][41544] Updated weights for policy 0, policy_version 174960 (0.0005) +[2023-03-11 17:49:38,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 10149.7). Total num frames: 89600000. Throughput: 0: 10381.8. Samples: 89597020. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:49:38,386][41256] Avg episode reward: [(0, '77.749')] +[2023-03-11 17:49:40,164][41544] Updated weights for policy 0, policy_version 175040 (0.0004) +[2023-03-11 17:49:43,385][41256] Fps is (10 sec: 10649.7, 60 sec: 10376.5, 300 sec: 10163.6). Total num frames: 89653248. Throughput: 0: 10352.0. Samples: 89627228. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:49:43,386][41256] Avg episode reward: [(0, '79.817')] +[2023-03-11 17:49:44,138][41544] Updated weights for policy 0, policy_version 175120 (0.0004) +[2023-03-11 17:49:48,331][41544] Updated weights for policy 0, policy_version 175200 (0.0005) +[2023-03-11 17:49:48,386][41256] Fps is (10 sec: 10239.9, 60 sec: 10308.2, 300 sec: 10177.5). Total num frames: 89702400. Throughput: 0: 10331.5. Samples: 89688152. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 17:49:48,386][41256] Avg episode reward: [(0, '78.345')] +[2023-03-11 17:49:48,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000175200_89702400.pth... +[2023-03-11 17:49:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000174600_89395200.pth +[2023-03-11 17:49:52,644][41544] Updated weights for policy 0, policy_version 175280 (0.0006) +[2023-03-11 17:49:53,385][41256] Fps is (10 sec: 9420.8, 60 sec: 10240.0, 300 sec: 10163.6). Total num frames: 89747456. Throughput: 0: 10219.8. Samples: 89745164. 
Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 17:49:53,386][41256] Avg episode reward: [(0, '77.939')] +[2023-03-11 17:49:56,898][41544] Updated weights for policy 0, policy_version 175360 (0.0005) +[2023-03-11 17:49:58,385][41256] Fps is (10 sec: 9420.9, 60 sec: 10171.7, 300 sec: 10163.6). Total num frames: 89796608. Throughput: 0: 10165.7. Samples: 89773876. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 17:49:58,386][41256] Avg episode reward: [(0, '79.566')] +[2023-03-11 17:50:00,995][41544] Updated weights for policy 0, policy_version 175440 (0.0005) +[2023-03-11 17:50:03,386][41256] Fps is (10 sec: 9830.3, 60 sec: 10171.7, 300 sec: 10177.5). Total num frames: 89845760. Throughput: 0: 10096.1. Samples: 89833536. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 17:50:03,386][41256] Avg episode reward: [(0, '77.261')] +[2023-03-11 17:50:03,421][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000175488_89849856.pth... +[2023-03-11 17:50:03,423][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000174904_89550848.pth +[2023-03-11 17:50:05,116][41544] Updated weights for policy 0, policy_version 175520 (0.0005) +[2023-03-11 17:50:08,386][41256] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 10191.4). Total num frames: 89894912. Throughput: 0: 10044.1. Samples: 89893244. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 17:50:08,386][41256] Avg episode reward: [(0, '78.271')] +[2023-03-11 17:50:09,189][41544] Updated weights for policy 0, policy_version 175600 (0.0005) +[2023-03-11 17:50:13,245][41544] Updated weights for policy 0, policy_version 175680 (0.0005) +[2023-03-11 17:50:13,385][41256] Fps is (10 sec: 10240.1, 60 sec: 10103.5, 300 sec: 10191.4). Total num frames: 89948160. Throughput: 0: 10032.5. Samples: 89923648. 
Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 17:50:13,386][41256] Avg episode reward: [(0, '75.962')] +[2023-03-11 17:50:17,348][41544] Updated weights for policy 0, policy_version 175760 (0.0005) +[2023-03-11 17:50:18,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10191.4). Total num frames: 89997312. Throughput: 0: 10000.4. Samples: 89984628. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 17:50:18,386][41256] Avg episode reward: [(0, '76.223')] +[2023-03-11 17:50:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000175776_89997312.pth... +[2023-03-11 17:50:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000175200_89702400.pth +[2023-03-11 17:50:21,465][41544] Updated weights for policy 0, policy_version 175840 (0.0005) +[2023-03-11 17:50:23,386][41256] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 10191.4). Total num frames: 90046464. Throughput: 0: 9939.2. Samples: 90044284. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 17:50:23,386][41256] Avg episode reward: [(0, '76.397')] +[2023-03-11 17:50:25,412][41544] Updated weights for policy 0, policy_version 175920 (0.0004) +[2023-03-11 17:50:28,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10191.4). Total num frames: 90099712. Throughput: 0: 9955.3. Samples: 90075216. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 17:50:28,386][41256] Avg episode reward: [(0, '74.931')] +[2023-03-11 17:50:29,374][41544] Updated weights for policy 0, policy_version 176000 (0.0005) +[2023-03-11 17:50:33,362][41544] Updated weights for policy 0, policy_version 176080 (0.0005) +[2023-03-11 17:50:33,385][41256] Fps is (10 sec: 10649.6, 60 sec: 10103.5, 300 sec: 10205.3). Total num frames: 90152960. Throughput: 0: 9972.7. Samples: 90136924. 
Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 17:50:33,386][41256] Avg episode reward: [(0, '75.441')] +[2023-03-11 17:50:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000176080_90152960.pth... +[2023-03-11 17:50:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000175488_89849856.pth +[2023-03-11 17:50:37,404][41544] Updated weights for policy 0, policy_version 176160 (0.0005) +[2023-03-11 17:50:38,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 10205.3). Total num frames: 90202112. Throughput: 0: 10064.8. Samples: 90198080. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 17:50:38,386][41256] Avg episode reward: [(0, '77.639')] +[2023-03-11 17:50:41,611][41544] Updated weights for policy 0, policy_version 176240 (0.0005) +[2023-03-11 17:50:43,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 10205.3). Total num frames: 90251264. Throughput: 0: 10067.5. Samples: 90226912. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 17:50:43,386][41256] Avg episode reward: [(0, '78.094')] +[2023-03-11 17:50:45,853][41544] Updated weights for policy 0, policy_version 176320 (0.0005) +[2023-03-11 17:50:48,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9898.7, 300 sec: 10177.5). Total num frames: 90296320. Throughput: 0: 10038.4. Samples: 90285264. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 17:50:48,386][41256] Avg episode reward: [(0, '77.521')] +[2023-03-11 17:50:48,440][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000176368_90300416.pth... 
+[2023-03-11 17:50:48,442][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000175776_89997312.pth +[2023-03-11 17:50:50,100][41544] Updated weights for policy 0, policy_version 176400 (0.0005) +[2023-03-11 17:50:53,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9966.9, 300 sec: 10177.5). Total num frames: 90345472. Throughput: 0: 10005.2. Samples: 90343476. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:50:53,386][41256] Avg episode reward: [(0, '76.564')] +[2023-03-11 17:50:54,270][41544] Updated weights for policy 0, policy_version 176480 (0.0005) +[2023-03-11 17:50:58,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9966.9, 300 sec: 10177.5). Total num frames: 90394624. Throughput: 0: 9983.3. Samples: 90372896. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:50:58,386][41256] Avg episode reward: [(0, '75.146')] +[2023-03-11 17:50:58,487][41544] Updated weights for policy 0, policy_version 176560 (0.0005) +[2023-03-11 17:51:02,656][41544] Updated weights for policy 0, policy_version 176640 (0.0005) +[2023-03-11 17:51:03,386][41256] Fps is (10 sec: 9830.3, 60 sec: 9966.9, 300 sec: 10163.6). Total num frames: 90443776. Throughput: 0: 9930.5. Samples: 90431500. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:51:03,386][41256] Avg episode reward: [(0, '74.454')] +[2023-03-11 17:51:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000176648_90443776.pth... +[2023-03-11 17:51:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000176080_90152960.pth +[2023-03-11 17:51:06,878][41544] Updated weights for policy 0, policy_version 176720 (0.0005) +[2023-03-11 17:51:08,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 10163.6). Total num frames: 90492928. Throughput: 0: 9900.9. Samples: 90489824. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:51:08,386][41256] Avg episode reward: [(0, '76.491')] +[2023-03-11 17:51:11,036][41544] Updated weights for policy 0, policy_version 176800 (0.0005) +[2023-03-11 17:51:13,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9898.7, 300 sec: 10163.6). Total num frames: 90542080. Throughput: 0: 9877.6. Samples: 90519708. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:51:13,386][41256] Avg episode reward: [(0, '78.508')] +[2023-03-11 17:51:15,173][41544] Updated weights for policy 0, policy_version 176880 (0.0004) +[2023-03-11 17:51:18,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10149.7). Total num frames: 90591232. Throughput: 0: 9823.0. Samples: 90578960. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:51:18,386][41256] Avg episode reward: [(0, '75.715')] +[2023-03-11 17:51:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000176936_90591232.pth... +[2023-03-11 17:51:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000176368_90300416.pth +[2023-03-11 17:51:19,337][41544] Updated weights for policy 0, policy_version 176960 (0.0005) +[2023-03-11 17:51:23,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 10149.7). Total num frames: 90640384. Throughput: 0: 9761.8. Samples: 90637360. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:51:23,386][41256] Avg episode reward: [(0, '75.788')] +[2023-03-11 17:51:23,570][41544] Updated weights for policy 0, policy_version 177040 (0.0005) +[2023-03-11 17:51:27,629][41544] Updated weights for policy 0, policy_version 177120 (0.0004) +[2023-03-11 17:51:28,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10135.9). Total num frames: 90689536. Throughput: 0: 9773.9. Samples: 90666736. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:51:28,386][41256] Avg episode reward: [(0, '76.384')] +[2023-03-11 17:51:31,811][41544] Updated weights for policy 0, policy_version 177200 (0.0005) +[2023-03-11 17:51:33,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 10135.9). Total num frames: 90738688. Throughput: 0: 9804.6. Samples: 90726472. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:51:33,386][41256] Avg episode reward: [(0, '76.251')] +[2023-03-11 17:51:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000177224_90738688.pth... +[2023-03-11 17:51:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000176648_90443776.pth +[2023-03-11 17:51:36,040][41544] Updated weights for policy 0, policy_version 177280 (0.0005) +[2023-03-11 17:51:38,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 10122.0). Total num frames: 90787840. Throughput: 0: 9809.0. Samples: 90784880. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:51:38,386][41256] Avg episode reward: [(0, '79.898')] +[2023-03-11 17:51:40,232][41544] Updated weights for policy 0, policy_version 177360 (0.0005) +[2023-03-11 17:51:43,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 10122.0). Total num frames: 90836992. Throughput: 0: 9813.9. Samples: 90814524. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:51:43,386][41256] Avg episode reward: [(0, '77.528')] +[2023-03-11 17:51:44,324][41544] Updated weights for policy 0, policy_version 177440 (0.0005) +[2023-03-11 17:51:48,386][41256] Fps is (10 sec: 9830.3, 60 sec: 9830.4, 300 sec: 10108.1). Total num frames: 90886144. Throughput: 0: 9831.7. Samples: 90873928. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:51:48,386][41256] Avg episode reward: [(0, '74.908')] +[2023-03-11 17:51:48,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000177512_90886144.pth... +[2023-03-11 17:51:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000176936_90591232.pth +[2023-03-11 17:51:48,496][41544] Updated weights for policy 0, policy_version 177520 (0.0005) +[2023-03-11 17:51:52,734][41544] Updated weights for policy 0, policy_version 177600 (0.0005) +[2023-03-11 17:51:53,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10108.1). Total num frames: 90935296. Throughput: 0: 9822.9. Samples: 90931856. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:51:53,386][41256] Avg episode reward: [(0, '74.175')] +[2023-03-11 17:51:56,943][41544] Updated weights for policy 0, policy_version 177680 (0.0004) +[2023-03-11 17:51:58,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10094.2). Total num frames: 90984448. Throughput: 0: 9813.6. Samples: 90961320. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:51:58,386][41256] Avg episode reward: [(0, '76.058')] +[2023-03-11 17:52:01,130][41544] Updated weights for policy 0, policy_version 177760 (0.0005) +[2023-03-11 17:52:03,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10094.2). Total num frames: 91033600. Throughput: 0: 9803.1. Samples: 91020100. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:52:03,386][41256] Avg episode reward: [(0, '78.259')] +[2023-03-11 17:52:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000177800_91033600.pth... 
+[2023-03-11 17:52:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000177224_90738688.pth +[2023-03-11 17:52:05,341][41544] Updated weights for policy 0, policy_version 177840 (0.0005) +[2023-03-11 17:52:08,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10080.3). Total num frames: 91082752. Throughput: 0: 9806.8. Samples: 91078668. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:52:08,386][41256] Avg episode reward: [(0, '77.107')] +[2023-03-11 17:52:09,547][41544] Updated weights for policy 0, policy_version 177920 (0.0005) +[2023-03-11 17:52:13,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10080.3). Total num frames: 91131904. Throughput: 0: 9791.2. Samples: 91107340. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:52:13,386][41256] Avg episode reward: [(0, '73.405')] +[2023-03-11 17:52:13,727][41544] Updated weights for policy 0, policy_version 178000 (0.0005) +[2023-03-11 17:52:17,942][41544] Updated weights for policy 0, policy_version 178080 (0.0005) +[2023-03-11 17:52:18,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10080.3). Total num frames: 91181056. Throughput: 0: 9769.3. Samples: 91166088. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:52:18,396][41256] Avg episode reward: [(0, '74.463')] +[2023-03-11 17:52:18,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000178088_91181056.pth... +[2023-03-11 17:52:18,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000177512_90886144.pth +[2023-03-11 17:52:22,179][41544] Updated weights for policy 0, policy_version 178160 (0.0005) +[2023-03-11 17:52:23,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 10052.6). Total num frames: 91226112. Throughput: 0: 9779.5. Samples: 91224956. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:52:23,396][41256] Avg episode reward: [(0, '76.310')] +[2023-03-11 17:52:26,336][41544] Updated weights for policy 0, policy_version 178240 (0.0005) +[2023-03-11 17:52:28,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 10038.7). Total num frames: 91275264. Throughput: 0: 9766.8. Samples: 91254032. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:52:28,396][41256] Avg episode reward: [(0, '76.601')] +[2023-03-11 17:52:30,525][41544] Updated weights for policy 0, policy_version 178320 (0.0005) +[2023-03-11 17:52:33,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 10038.7). Total num frames: 91324416. Throughput: 0: 9746.6. Samples: 91312524. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:52:33,396][41256] Avg episode reward: [(0, '74.256')] +[2023-03-11 17:52:33,443][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000178376_91328512.pth... +[2023-03-11 17:52:33,445][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000177800_91033600.pth +[2023-03-11 17:52:34,644][41544] Updated weights for policy 0, policy_version 178400 (0.0004) +[2023-03-11 17:52:38,385][41256] Fps is (10 sec: 10240.1, 60 sec: 9830.4, 300 sec: 10038.7). Total num frames: 91377664. Throughput: 0: 9816.2. Samples: 91373584. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:52:38,396][41256] Avg episode reward: [(0, '74.338')] +[2023-03-11 17:52:38,660][41544] Updated weights for policy 0, policy_version 178480 (0.0003) +[2023-03-11 17:52:42,636][41544] Updated weights for policy 0, policy_version 178560 (0.0003) +[2023-03-11 17:52:43,385][41256] Fps is (10 sec: 10240.1, 60 sec: 9830.4, 300 sec: 10038.7). Total num frames: 91426816. Throughput: 0: 9843.9. Samples: 91404296. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:52:43,386][41256] Avg episode reward: [(0, '75.175')] +[2023-03-11 17:52:46,767][41544] Updated weights for policy 0, policy_version 178640 (0.0004) +[2023-03-11 17:52:48,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10024.8). Total num frames: 91475968. Throughput: 0: 9867.8. Samples: 91464152. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:52:48,386][41256] Avg episode reward: [(0, '74.263')] +[2023-03-11 17:52:48,397][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000178672_91480064.pth... +[2023-03-11 17:52:48,399][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000178088_91181056.pth +[2023-03-11 17:52:50,925][41544] Updated weights for policy 0, policy_version 178720 (0.0005) +[2023-03-11 17:52:53,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10010.9). Total num frames: 91525120. Throughput: 0: 9882.7. Samples: 91523388. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:52:53,386][41256] Avg episode reward: [(0, '75.380')] +[2023-03-11 17:52:55,172][41544] Updated weights for policy 0, policy_version 178800 (0.0006) +[2023-03-11 17:52:58,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 10010.9). Total num frames: 91574272. Throughput: 0: 9889.5. Samples: 91552368. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:52:58,386][41256] Avg episode reward: [(0, '76.050')] +[2023-03-11 17:52:59,421][41544] Updated weights for policy 0, policy_version 178880 (0.0005) +[2023-03-11 17:53:03,385][41256] Fps is (10 sec: 9830.3, 60 sec: 9830.4, 300 sec: 10010.9). Total num frames: 91623424. Throughput: 0: 9878.8. Samples: 91610632. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 17:53:03,386][41256] Avg episode reward: [(0, '75.180')] +[2023-03-11 17:53:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000178952_91623424.pth... +[2023-03-11 17:53:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000178376_91328512.pth +[2023-03-11 17:53:03,668][41544] Updated weights for policy 0, policy_version 178960 (0.0005) +[2023-03-11 17:53:07,867][41544] Updated weights for policy 0, policy_version 179040 (0.0006) +[2023-03-11 17:53:08,386][41256] Fps is (10 sec: 9830.3, 60 sec: 9830.4, 300 sec: 9997.0). Total num frames: 91672576. Throughput: 0: 9856.4. Samples: 91668496. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 17:53:08,386][41256] Avg episode reward: [(0, '76.758')] +[2023-03-11 17:53:12,083][41544] Updated weights for policy 0, policy_version 179120 (0.0005) +[2023-03-11 17:53:13,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9997.0). Total num frames: 91721728. Throughput: 0: 9847.4. Samples: 91697164. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 17:53:13,386][41256] Avg episode reward: [(0, '74.304')] +[2023-03-11 17:53:16,264][41544] Updated weights for policy 0, policy_version 179200 (0.0005) +[2023-03-11 17:53:18,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9983.1). Total num frames: 91770880. Throughput: 0: 9856.1. Samples: 91756048. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 17:53:18,386][41256] Avg episode reward: [(0, '73.614')] +[2023-03-11 17:53:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000179240_91770880.pth... 
+[2023-03-11 17:53:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000178672_91480064.pth +[2023-03-11 17:53:20,382][41544] Updated weights for policy 0, policy_version 179280 (0.0005) +[2023-03-11 17:53:23,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9898.7, 300 sec: 9969.2). Total num frames: 91820032. Throughput: 0: 9813.5. Samples: 91815192. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 17:53:23,386][41256] Avg episode reward: [(0, '75.147')] +[2023-03-11 17:53:24,667][41544] Updated weights for policy 0, policy_version 179360 (0.0005) +[2023-03-11 17:53:28,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9830.4, 300 sec: 9955.4). Total num frames: 91865088. Throughput: 0: 9773.3. Samples: 91844096. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 17:53:28,386][41256] Avg episode reward: [(0, '73.722')] +[2023-03-11 17:53:28,859][41544] Updated weights for policy 0, policy_version 179440 (0.0006) +[2023-03-11 17:53:33,083][41544] Updated weights for policy 0, policy_version 179520 (0.0006) +[2023-03-11 17:53:33,385][41256] Fps is (10 sec: 9420.7, 60 sec: 9830.4, 300 sec: 9955.4). Total num frames: 91914240. Throughput: 0: 9732.0. Samples: 91902092. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 17:53:33,386][41256] Avg episode reward: [(0, '75.877')] +[2023-03-11 17:53:33,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000179520_91914240.pth... +[2023-03-11 17:53:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000178952_91623424.pth +[2023-03-11 17:53:37,312][41544] Updated weights for policy 0, policy_version 179600 (0.0005) +[2023-03-11 17:53:38,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9941.5). Total num frames: 91963392. Throughput: 0: 9705.6. Samples: 91960140. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 17:53:38,386][41256] Avg episode reward: [(0, '76.719')] +[2023-03-11 17:53:41,515][41544] Updated weights for policy 0, policy_version 179680 (0.0005) +[2023-03-11 17:53:43,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9927.6). Total num frames: 92012544. Throughput: 0: 9720.5. Samples: 91989792. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 17:53:43,386][41256] Avg episode reward: [(0, '77.523')] +[2023-03-11 17:53:45,729][41544] Updated weights for policy 0, policy_version 179760 (0.0005) +[2023-03-11 17:53:48,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9927.6). Total num frames: 92061696. Throughput: 0: 9725.9. Samples: 92048296. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 17:53:48,386][41256] Avg episode reward: [(0, '75.373')] +[2023-03-11 17:53:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000179808_92061696.pth... +[2023-03-11 17:53:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000179240_91770880.pth +[2023-03-11 17:53:49,944][41544] Updated weights for policy 0, policy_version 179840 (0.0005) +[2023-03-11 17:53:53,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9913.7). Total num frames: 92110848. Throughput: 0: 9739.0. Samples: 92106752. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 17:53:53,386][41256] Avg episode reward: [(0, '76.193')] +[2023-03-11 17:53:54,065][41544] Updated weights for policy 0, policy_version 179920 (0.0005) +[2023-03-11 17:53:58,167][41544] Updated weights for policy 0, policy_version 180000 (0.0004) +[2023-03-11 17:53:58,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9913.7). Total num frames: 92160000. Throughput: 0: 9765.3. Samples: 92136604. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 17:53:58,386][41256] Avg episode reward: [(0, '75.525')] +[2023-03-11 17:54:02,206][41544] Updated weights for policy 0, policy_version 180080 (0.0005) +[2023-03-11 17:54:03,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9899.8). Total num frames: 92209152. Throughput: 0: 9797.5. Samples: 92196936. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 17:54:03,386][41256] Avg episode reward: [(0, '74.706')] +[2023-03-11 17:54:03,393][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000180104_92213248.pth... +[2023-03-11 17:54:03,395][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000179520_91914240.pth +[2023-03-11 17:54:06,247][41544] Updated weights for policy 0, policy_version 180160 (0.0005) +[2023-03-11 17:54:08,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 9899.8). Total num frames: 92262400. Throughput: 0: 9847.3. Samples: 92258320. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 17:54:08,386][41256] Avg episode reward: [(0, '75.289')] +[2023-03-11 17:54:10,169][41544] Updated weights for policy 0, policy_version 180240 (0.0005) +[2023-03-11 17:54:13,385][41256] Fps is (10 sec: 10649.5, 60 sec: 9898.7, 300 sec: 9899.8). Total num frames: 92315648. Throughput: 0: 9901.8. Samples: 92289676. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:54:13,386][41256] Avg episode reward: [(0, '75.888')] +[2023-03-11 17:54:14,120][41544] Updated weights for policy 0, policy_version 180320 (0.0005) +[2023-03-11 17:54:18,138][41544] Updated weights for policy 0, policy_version 180400 (0.0005) +[2023-03-11 17:54:18,386][41256] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9899.8). Total num frames: 92364800. Throughput: 0: 9998.5. Samples: 92352024. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:54:18,386][41256] Avg episode reward: [(0, '75.172')] +[2023-03-11 17:54:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000180400_92364800.pth... +[2023-03-11 17:54:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000179808_92061696.pth +[2023-03-11 17:54:22,080][41544] Updated weights for policy 0, policy_version 180480 (0.0005) +[2023-03-11 17:54:23,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 9899.8). Total num frames: 92418048. Throughput: 0: 10084.7. Samples: 92413952. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:54:23,396][41256] Avg episode reward: [(0, '75.230')] +[2023-03-11 17:54:26,013][41544] Updated weights for policy 0, policy_version 180560 (0.0005) +[2023-03-11 17:54:28,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9899.8). Total num frames: 92467200. Throughput: 0: 10107.3. Samples: 92444620. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:54:28,396][41256] Avg episode reward: [(0, '75.239')] +[2023-03-11 17:54:30,004][41544] Updated weights for policy 0, policy_version 180640 (0.0005) +[2023-03-11 17:54:33,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 9899.8). Total num frames: 92520448. Throughput: 0: 10182.0. Samples: 92506484. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:54:33,396][41256] Avg episode reward: [(0, '75.709')] +[2023-03-11 17:54:33,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000180704_92520448.pth... 
+[2023-03-11 17:54:33,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000180104_92213248.pth +[2023-03-11 17:54:34,002][41544] Updated weights for policy 0, policy_version 180720 (0.0005) +[2023-03-11 17:54:37,960][41544] Updated weights for policy 0, policy_version 180800 (0.0005) +[2023-03-11 17:54:38,385][41256] Fps is (10 sec: 10649.6, 60 sec: 10171.7, 300 sec: 9899.8). Total num frames: 92573696. Throughput: 0: 10261.2. Samples: 92568508. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:54:38,397][41256] Avg episode reward: [(0, '77.990')] +[2023-03-11 17:54:41,914][41544] Updated weights for policy 0, policy_version 180880 (0.0005) +[2023-03-11 17:54:43,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10171.7, 300 sec: 9899.8). Total num frames: 92622848. Throughput: 0: 10290.0. Samples: 92599656. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:54:43,396][41256] Avg episode reward: [(0, '78.238')] +[2023-03-11 17:54:45,960][41544] Updated weights for policy 0, policy_version 180960 (0.0005) +[2023-03-11 17:54:48,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10240.0, 300 sec: 9927.6). Total num frames: 92676096. Throughput: 0: 10300.8. Samples: 92660472. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:54:48,396][41256] Avg episode reward: [(0, '77.934')] +[2023-03-11 17:54:48,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000181008_92676096.pth... +[2023-03-11 17:54:48,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000180400_92364800.pth +[2023-03-11 17:54:49,840][41544] Updated weights for policy 0, policy_version 181040 (0.0005) +[2023-03-11 17:54:53,386][41256] Fps is (10 sec: 10649.5, 60 sec: 10308.3, 300 sec: 9941.5). Total num frames: 92729344. Throughput: 0: 10353.4. Samples: 92724224. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:54:53,397][41256] Avg episode reward: [(0, '78.230')] +[2023-03-11 17:54:53,774][41544] Updated weights for policy 0, policy_version 181120 (0.0005) +[2023-03-11 17:54:57,756][41544] Updated weights for policy 0, policy_version 181200 (0.0005) +[2023-03-11 17:54:58,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 9941.5). Total num frames: 92778496. Throughput: 0: 10319.1. Samples: 92754036. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:54:58,386][41256] Avg episode reward: [(0, '79.068')] +[2023-03-11 17:55:01,763][41544] Updated weights for policy 0, policy_version 181280 (0.0005) +[2023-03-11 17:55:03,386][41256] Fps is (10 sec: 10240.0, 60 sec: 10376.5, 300 sec: 9955.4). Total num frames: 92831744. Throughput: 0: 10303.7. Samples: 92815692. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:55:03,386][41256] Avg episode reward: [(0, '77.404')] +[2023-03-11 17:55:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000181312_92831744.pth... +[2023-03-11 17:55:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000180704_92520448.pth +[2023-03-11 17:55:05,742][41544] Updated weights for policy 0, policy_version 181360 (0.0005) +[2023-03-11 17:55:08,385][41256] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 9941.5). Total num frames: 92880896. Throughput: 0: 10301.1. Samples: 92877500. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:55:08,386][41256] Avg episode reward: [(0, '78.010')] +[2023-03-11 17:55:09,712][41544] Updated weights for policy 0, policy_version 181440 (0.0005) +[2023-03-11 17:55:13,385][41256] Fps is (10 sec: 10240.1, 60 sec: 10308.3, 300 sec: 9955.4). Total num frames: 92934144. Throughput: 0: 10311.0. Samples: 92908616. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:55:13,386][41256] Avg episode reward: [(0, '77.422')] +[2023-03-11 17:55:13,732][41544] Updated weights for policy 0, policy_version 181520 (0.0005) +[2023-03-11 17:55:17,726][41544] Updated weights for policy 0, policy_version 181600 (0.0005) +[2023-03-11 17:55:18,386][41256] Fps is (10 sec: 10239.9, 60 sec: 10308.3, 300 sec: 9955.4). Total num frames: 92983296. Throughput: 0: 10305.2. Samples: 92970220. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:55:18,386][41256] Avg episode reward: [(0, '76.970')] +[2023-03-11 17:55:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000181608_92983296.pth... +[2023-03-11 17:55:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000181008_92676096.pth +[2023-03-11 17:55:21,727][41544] Updated weights for policy 0, policy_version 181680 (0.0005) +[2023-03-11 17:55:23,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 9955.4). Total num frames: 93036544. Throughput: 0: 10295.6. Samples: 93031812. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:55:23,386][41256] Avg episode reward: [(0, '75.727')] +[2023-03-11 17:55:25,674][41544] Updated weights for policy 0, policy_version 181760 (0.0005) +[2023-03-11 17:55:28,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 9941.5). Total num frames: 93085696. Throughput: 0: 10290.2. Samples: 93062716. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:55:28,386][41256] Avg episode reward: [(0, '76.056')] +[2023-03-11 17:55:29,691][41544] Updated weights for policy 0, policy_version 181840 (0.0005) +[2023-03-11 17:55:33,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10308.3, 300 sec: 9955.4). Total num frames: 93138944. Throughput: 0: 10304.2. Samples: 93124160. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:55:33,386][41256] Avg episode reward: [(0, '77.694')] +[2023-03-11 17:55:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000181912_93138944.pth... +[2023-03-11 17:55:33,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000181312_92831744.pth +[2023-03-11 17:55:33,676][41544] Updated weights for policy 0, policy_version 181920 (0.0005) +[2023-03-11 17:55:37,655][41544] Updated weights for policy 0, policy_version 182000 (0.0005) +[2023-03-11 17:55:38,385][41256] Fps is (10 sec: 10240.1, 60 sec: 10240.0, 300 sec: 9955.4). Total num frames: 93188096. Throughput: 0: 10259.0. Samples: 93185876. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:55:38,386][41256] Avg episode reward: [(0, '76.244')] +[2023-03-11 17:55:41,730][41544] Updated weights for policy 0, policy_version 182080 (0.0004) +[2023-03-11 17:55:43,385][41256] Fps is (10 sec: 9830.5, 60 sec: 10240.0, 300 sec: 9969.3). Total num frames: 93237248. Throughput: 0: 10277.7. Samples: 93216532. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:55:43,386][41256] Avg episode reward: [(0, '77.251')] +[2023-03-11 17:55:45,808][41544] Updated weights for policy 0, policy_version 182160 (0.0003) +[2023-03-11 17:55:48,386][41256] Fps is (10 sec: 10239.9, 60 sec: 10240.0, 300 sec: 9983.1). Total num frames: 93290496. Throughput: 0: 10226.0. Samples: 93275864. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:55:48,386][41256] Avg episode reward: [(0, '75.448')] +[2023-03-11 17:55:48,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000182208_93290496.pth... 
+[2023-03-11 17:55:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000181608_92983296.pth +[2023-03-11 17:55:49,980][41544] Updated weights for policy 0, policy_version 182240 (0.0003) +[2023-03-11 17:55:53,385][41256] Fps is (10 sec: 10239.9, 60 sec: 10171.7, 300 sec: 9983.1). Total num frames: 93339648. Throughput: 0: 10180.5. Samples: 93335624. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:55:53,396][41256] Avg episode reward: [(0, '76.452')] +[2023-03-11 17:55:54,108][41544] Updated weights for policy 0, policy_version 182320 (0.0005) +[2023-03-11 17:55:58,316][41544] Updated weights for policy 0, policy_version 182400 (0.0005) +[2023-03-11 17:55:58,385][41256] Fps is (10 sec: 9830.5, 60 sec: 10171.7, 300 sec: 9983.1). Total num frames: 93388800. Throughput: 0: 10133.5. Samples: 93364624. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:55:58,386][41256] Avg episode reward: [(0, '76.777')] +[2023-03-11 17:56:02,286][41544] Updated weights for policy 0, policy_version 182480 (0.0005) +[2023-03-11 17:56:03,385][41256] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 9983.1). Total num frames: 93437952. Throughput: 0: 10114.0. Samples: 93425348. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:56:03,396][41256] Avg episode reward: [(0, '77.118')] +[2023-03-11 17:56:03,399][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000182496_93437952.pth... +[2023-03-11 17:56:03,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000181912_93138944.pth +[2023-03-11 17:56:06,627][41544] Updated weights for policy 0, policy_version 182560 (0.0006) +[2023-03-11 17:56:08,385][41256] Fps is (10 sec: 9830.4, 60 sec: 10103.5, 300 sec: 9983.1). Total num frames: 93487104. Throughput: 0: 10021.2. Samples: 93482768. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:56:08,396][41256] Avg episode reward: [(0, '76.180')] +[2023-03-11 17:56:10,916][41544] Updated weights for policy 0, policy_version 182640 (0.0005) +[2023-03-11 17:56:13,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9966.9, 300 sec: 9969.2). Total num frames: 93532160. Throughput: 0: 9972.7. Samples: 93511488. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:56:13,386][41256] Avg episode reward: [(0, '76.209')] +[2023-03-11 17:56:15,273][41544] Updated weights for policy 0, policy_version 182720 (0.0005) +[2023-03-11 17:56:18,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9966.9, 300 sec: 9969.2). Total num frames: 93581312. Throughput: 0: 9860.6. Samples: 93567888. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:56:18,396][41256] Avg episode reward: [(0, '77.762')] +[2023-03-11 17:56:18,399][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000182776_93581312.pth... +[2023-03-11 17:56:18,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000182208_93290496.pth +[2023-03-11 17:56:19,560][41544] Updated weights for policy 0, policy_version 182800 (0.0005) +[2023-03-11 17:56:23,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9830.4, 300 sec: 9955.4). Total num frames: 93626368. Throughput: 0: 9758.6. Samples: 93625012. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 17:56:23,386][41256] Avg episode reward: [(0, '78.824')] +[2023-03-11 17:56:23,867][41544] Updated weights for policy 0, policy_version 182880 (0.0005) +[2023-03-11 17:56:28,092][41544] Updated weights for policy 0, policy_version 182960 (0.0005) +[2023-03-11 17:56:28,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9830.4, 300 sec: 9955.4). Total num frames: 93675520. Throughput: 0: 9725.5. Samples: 93654180. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 17:56:28,386][41256] Avg episode reward: [(0, '76.823')] +[2023-03-11 17:56:32,314][41544] Updated weights for policy 0, policy_version 183040 (0.0005) +[2023-03-11 17:56:33,385][41256] Fps is (10 sec: 9830.3, 60 sec: 9762.1, 300 sec: 9955.4). Total num frames: 93724672. Throughput: 0: 9700.5. Samples: 93712384. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 17:56:33,386][41256] Avg episode reward: [(0, '75.865')] +[2023-03-11 17:56:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000183056_93724672.pth... +[2023-03-11 17:56:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000182496_93437952.pth +[2023-03-11 17:56:36,559][41544] Updated weights for policy 0, policy_version 183120 (0.0005) +[2023-03-11 17:56:38,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9955.4). Total num frames: 93773824. Throughput: 0: 9648.4. Samples: 93769800. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 17:56:38,386][41256] Avg episode reward: [(0, '76.443')] +[2023-03-11 17:56:40,828][41544] Updated weights for policy 0, policy_version 183200 (0.0005) +[2023-03-11 17:56:43,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9762.1, 300 sec: 9955.4). Total num frames: 93822976. Throughput: 0: 9640.9. Samples: 93798464. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 17:56:43,386][41256] Avg episode reward: [(0, '76.353')] +[2023-03-11 17:56:45,071][41544] Updated weights for policy 0, policy_version 183280 (0.0005) +[2023-03-11 17:56:48,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9941.5). Total num frames: 93868032. Throughput: 0: 9566.0. Samples: 93855816. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 17:56:48,386][41256] Avg episode reward: [(0, '73.946')] +[2023-03-11 17:56:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000183336_93868032.pth... +[2023-03-11 17:56:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000182776_93581312.pth +[2023-03-11 17:56:49,396][41544] Updated weights for policy 0, policy_version 183360 (0.0005) +[2023-03-11 17:56:53,385][41256] Fps is (10 sec: 9420.7, 60 sec: 9625.6, 300 sec: 9941.5). Total num frames: 93917184. Throughput: 0: 9564.1. Samples: 93913152. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 17:56:53,386][41256] Avg episode reward: [(0, '74.691')] +[2023-03-11 17:56:53,660][41544] Updated weights for policy 0, policy_version 183440 (0.0005) +[2023-03-11 17:56:57,967][41544] Updated weights for policy 0, policy_version 183520 (0.0005) +[2023-03-11 17:56:58,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9941.5). Total num frames: 93966336. Throughput: 0: 9570.8. Samples: 93942172. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 17:56:58,386][41256] Avg episode reward: [(0, '74.576')] +[2023-03-11 17:57:02,192][41544] Updated weights for policy 0, policy_version 183600 (0.0005) +[2023-03-11 17:57:03,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9927.6). Total num frames: 94011392. Throughput: 0: 9595.1. Samples: 93999668. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 17:57:03,386][41256] Avg episode reward: [(0, '75.088')] +[2023-03-11 17:57:03,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000183616_94011392.pth... 
+[2023-03-11 17:57:03,393][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000183056_93724672.pth +[2023-03-11 17:57:06,523][41544] Updated weights for policy 0, policy_version 183680 (0.0005) +[2023-03-11 17:57:08,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9927.6). Total num frames: 94060544. Throughput: 0: 9593.8. Samples: 94056732. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 17:57:08,386][41256] Avg episode reward: [(0, '74.132')] +[2023-03-11 17:57:10,800][41544] Updated weights for policy 0, policy_version 183760 (0.0005) +[2023-03-11 17:57:13,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9913.7). Total num frames: 94105600. Throughput: 0: 9578.0. Samples: 94085192. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 17:57:13,386][41256] Avg episode reward: [(0, '75.371')] +[2023-03-11 17:57:15,178][41544] Updated weights for policy 0, policy_version 183840 (0.0005) +[2023-03-11 17:57:18,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9927.6). Total num frames: 94154752. Throughput: 0: 9551.2. Samples: 94142188. Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 17:57:18,386][41256] Avg episode reward: [(0, '76.205')] +[2023-03-11 17:57:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000183896_94154752.pth... +[2023-03-11 17:57:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000183336_93868032.pth +[2023-03-11 17:57:19,483][41544] Updated weights for policy 0, policy_version 183920 (0.0005) +[2023-03-11 17:57:22,083][41500] Early stopping after 2 epochs (8 sgd steps), loss delta 0.0000004 +[2023-03-11 17:57:23,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9625.6, 300 sec: 9927.6). Total num frames: 94203904. Throughput: 0: 9547.0. Samples: 94199416. 
Policy #0 lag: (min: 6.0, avg: 6.0, max: 6.0) +[2023-03-11 17:57:23,386][41256] Avg episode reward: [(0, '74.670')] +[2023-03-11 17:57:23,775][41544] Updated weights for policy 0, policy_version 184000 (0.0005) +[2023-03-11 17:57:28,047][41544] Updated weights for policy 0, policy_version 184080 (0.0005) +[2023-03-11 17:57:28,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9913.7). Total num frames: 94248960. Throughput: 0: 9555.9. Samples: 94228480. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:57:28,386][41256] Avg episode reward: [(0, '77.250')] +[2023-03-11 17:57:32,432][41544] Updated weights for policy 0, policy_version 184160 (0.0005) +[2023-03-11 17:57:33,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9557.3, 300 sec: 9899.8). Total num frames: 94298112. Throughput: 0: 9524.0. Samples: 94284396. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:57:33,386][41256] Avg episode reward: [(0, '75.110')] +[2023-03-11 17:57:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000184176_94298112.pth... +[2023-03-11 17:57:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000183616_94011392.pth +[2023-03-11 17:57:36,714][41544] Updated weights for policy 0, policy_version 184240 (0.0005) +[2023-03-11 17:57:38,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9885.9). Total num frames: 94343168. Throughput: 0: 9524.5. Samples: 94341756. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:57:38,386][41256] Avg episode reward: [(0, '74.394')] +[2023-03-11 17:57:41,063][41544] Updated weights for policy 0, policy_version 184320 (0.0005) +[2023-03-11 17:57:43,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9885.9). Total num frames: 94392320. Throughput: 0: 9503.7. Samples: 94369840. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:57:43,386][41256] Avg episode reward: [(0, '74.389')] +[2023-03-11 17:57:45,393][41544] Updated weights for policy 0, policy_version 184400 (0.0005) +[2023-03-11 17:57:48,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9885.9). Total num frames: 94441472. Throughput: 0: 9493.4. Samples: 94426872. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:57:48,386][41256] Avg episode reward: [(0, '75.097')] +[2023-03-11 17:57:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000184456_94441472.pth... +[2023-03-11 17:57:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000183896_94154752.pth +[2023-03-11 17:57:49,629][41544] Updated weights for policy 0, policy_version 184480 (0.0005) +[2023-03-11 17:57:53,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9872.1). Total num frames: 94486528. Throughput: 0: 9503.6. Samples: 94484396. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:57:53,386][41256] Avg episode reward: [(0, '75.333')] +[2023-03-11 17:57:53,929][41544] Updated weights for policy 0, policy_version 184560 (0.0005) +[2023-03-11 17:57:58,239][41544] Updated weights for policy 0, policy_version 184640 (0.0005) +[2023-03-11 17:57:58,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9872.1). Total num frames: 94535680. Throughput: 0: 9506.0. Samples: 94512964. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:57:58,386][41256] Avg episode reward: [(0, '75.601')] +[2023-03-11 17:58:02,529][41544] Updated weights for policy 0, policy_version 184720 (0.0005) +[2023-03-11 17:58:03,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9872.1). Total num frames: 94584832. Throughput: 0: 9502.1. Samples: 94569784. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:58:03,386][41256] Avg episode reward: [(0, '75.634')] +[2023-03-11 17:58:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000184736_94584832.pth... +[2023-03-11 17:58:03,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000184176_94298112.pth +[2023-03-11 17:58:06,806][41544] Updated weights for policy 0, policy_version 184800 (0.0005) +[2023-03-11 17:58:08,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9858.2). Total num frames: 94629888. Throughput: 0: 9510.4. Samples: 94627384. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:58:08,386][41256] Avg episode reward: [(0, '75.472')] +[2023-03-11 17:58:11,103][41544] Updated weights for policy 0, policy_version 184880 (0.0005) +[2023-03-11 17:58:13,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9858.2). Total num frames: 94679040. Throughput: 0: 9502.0. Samples: 94656072. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:58:13,386][41256] Avg episode reward: [(0, '73.470')] +[2023-03-11 17:58:15,348][41544] Updated weights for policy 0, policy_version 184960 (0.0005) +[2023-03-11 17:58:18,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9858.2). Total num frames: 94728192. Throughput: 0: 9556.6. Samples: 94714444. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:58:18,386][41256] Avg episode reward: [(0, '74.579')] +[2023-03-11 17:58:18,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000185016_94728192.pth... 
+[2023-03-11 17:58:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000184456_94441472.pth +[2023-03-11 17:58:19,604][41544] Updated weights for policy 0, policy_version 185040 (0.0005) +[2023-03-11 17:58:23,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9858.2). Total num frames: 94773248. Throughput: 0: 9541.3. Samples: 94771116. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:58:23,386][41256] Avg episode reward: [(0, '74.509')] +[2023-03-11 17:58:23,942][41544] Updated weights for policy 0, policy_version 185120 (0.0005) +[2023-03-11 17:58:28,227][41544] Updated weights for policy 0, policy_version 185200 (0.0005) +[2023-03-11 17:58:28,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9858.2). Total num frames: 94822400. Throughput: 0: 9546.1. Samples: 94799416. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 17:58:28,386][41256] Avg episode reward: [(0, '73.919')] +[2023-03-11 17:58:32,517][41544] Updated weights for policy 0, policy_version 185280 (0.0005) +[2023-03-11 17:58:33,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9489.1, 300 sec: 9844.3). Total num frames: 94867456. Throughput: 0: 9553.8. Samples: 94856792. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:58:33,386][41256] Avg episode reward: [(0, '74.909')] +[2023-03-11 17:58:33,399][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000185296_94871552.pth... +[2023-03-11 17:58:33,401][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000184736_94584832.pth +[2023-03-11 17:58:36,820][41544] Updated weights for policy 0, policy_version 185360 (0.0005) +[2023-03-11 17:58:38,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9844.3). Total num frames: 94916608. Throughput: 0: 9540.1. Samples: 94913700. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:58:38,386][41256] Avg episode reward: [(0, '74.579')] +[2023-03-11 17:58:41,137][41544] Updated weights for policy 0, policy_version 185440 (0.0005) +[2023-03-11 17:58:43,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9844.3). Total num frames: 94965760. Throughput: 0: 9543.9. Samples: 94942440. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:58:43,386][41256] Avg episode reward: [(0, '74.544')] +[2023-03-11 17:58:45,409][41544] Updated weights for policy 0, policy_version 185520 (0.0005) +[2023-03-11 17:58:48,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9489.1, 300 sec: 9830.4). Total num frames: 95010816. Throughput: 0: 9556.5. Samples: 94999828. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:58:48,386][41256] Avg episode reward: [(0, '71.956')] +[2023-03-11 17:58:48,404][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000185576_95014912.pth... +[2023-03-11 17:58:48,406][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000185016_94728192.pth +[2023-03-11 17:58:49,681][41544] Updated weights for policy 0, policy_version 185600 (0.0005) +[2023-03-11 17:58:53,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9830.4). Total num frames: 95059968. Throughput: 0: 9553.8. Samples: 95057304. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:58:53,386][41256] Avg episode reward: [(0, '72.437')] +[2023-03-11 17:58:53,973][41544] Updated weights for policy 0, policy_version 185680 (0.0005) +[2023-03-11 17:58:58,052][41544] Updated weights for policy 0, policy_version 185760 (0.0005) +[2023-03-11 17:58:58,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9557.3, 300 sec: 9830.4). Total num frames: 95109120. Throughput: 0: 9574.5. Samples: 95086924. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:58:58,386][41256] Avg episode reward: [(0, '73.156')] +[2023-03-11 17:59:02,368][41544] Updated weights for policy 0, policy_version 185840 (0.0005) +[2023-03-11 17:59:03,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9557.3, 300 sec: 9816.5). Total num frames: 95158272. Throughput: 0: 9582.0. Samples: 95145632. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:59:03,386][41256] Avg episode reward: [(0, '73.134')] +[2023-03-11 17:59:03,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000185856_95158272.pth... +[2023-03-11 17:59:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000185296_94871552.pth +[2023-03-11 17:59:06,617][41544] Updated weights for policy 0, policy_version 185920 (0.0005) +[2023-03-11 17:59:08,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9802.6). Total num frames: 95207424. Throughput: 0: 9603.6. Samples: 95203280. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:59:08,386][41256] Avg episode reward: [(0, '74.474')] +[2023-03-11 17:59:10,885][41544] Updated weights for policy 0, policy_version 186000 (0.0006) +[2023-03-11 17:59:13,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9788.7). Total num frames: 95252480. Throughput: 0: 9613.0. Samples: 95232000. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:59:13,386][41256] Avg episode reward: [(0, '75.166')] +[2023-03-11 17:59:15,171][41544] Updated weights for policy 0, policy_version 186080 (0.0005) +[2023-03-11 17:59:18,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9774.9). Total num frames: 95301632. Throughput: 0: 9612.4. Samples: 95289352. 
Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:59:18,386][41256] Avg episode reward: [(0, '75.401')] +[2023-03-11 17:59:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000186136_95301632.pth... +[2023-03-11 17:59:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000185576_95014912.pth +[2023-03-11 17:59:19,452][41544] Updated weights for policy 0, policy_version 186160 (0.0005) +[2023-03-11 17:59:23,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9774.9). Total num frames: 95350784. Throughput: 0: 9622.0. Samples: 95346688. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:59:23,386][41256] Avg episode reward: [(0, '74.956')] +[2023-03-11 17:59:23,766][41544] Updated weights for policy 0, policy_version 186240 (0.0005) +[2023-03-11 17:59:28,015][41544] Updated weights for policy 0, policy_version 186320 (0.0005) +[2023-03-11 17:59:28,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9557.3, 300 sec: 9747.1). Total num frames: 95395840. Throughput: 0: 9618.8. Samples: 95375288. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:59:28,386][41256] Avg episode reward: [(0, '74.331')] +[2023-03-11 17:59:32,263][41544] Updated weights for policy 0, policy_version 186400 (0.0005) +[2023-03-11 17:59:33,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9733.2). Total num frames: 95444992. Throughput: 0: 9620.9. Samples: 95432768. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:59:33,386][41256] Avg episode reward: [(0, '75.615')] +[2023-03-11 17:59:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000186416_95444992.pth... 
+[2023-03-11 17:59:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000185856_95158272.pth +[2023-03-11 17:59:36,500][41544] Updated weights for policy 0, policy_version 186480 (0.0005) +[2023-03-11 17:59:38,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9733.2). Total num frames: 95494144. Throughput: 0: 9625.0. Samples: 95490428. Policy #0 lag: (min: 4.0, avg: 4.0, max: 4.0) +[2023-03-11 17:59:38,386][41256] Avg episode reward: [(0, '74.515')] +[2023-03-11 17:59:40,632][41544] Updated weights for policy 0, policy_version 186560 (0.0005) +[2023-03-11 17:59:43,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9625.6, 300 sec: 9719.3). Total num frames: 95543296. Throughput: 0: 9639.2. Samples: 95520688. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 17:59:43,386][41256] Avg episode reward: [(0, '73.772')] +[2023-03-11 17:59:44,581][41544] Updated weights for policy 0, policy_version 186640 (0.0004) +[2023-03-11 17:59:48,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9705.4). Total num frames: 95592448. Throughput: 0: 9685.3. Samples: 95581472. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 17:59:48,386][41256] Avg episode reward: [(0, '75.085')] +[2023-03-11 17:59:48,395][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000186712_95596544.pth... +[2023-03-11 17:59:48,397][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000186136_95301632.pth +[2023-03-11 17:59:48,809][41544] Updated weights for policy 0, policy_version 186720 (0.0005) +[2023-03-11 17:59:53,092][41544] Updated weights for policy 0, policy_version 186800 (0.0005) +[2023-03-11 17:59:53,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9705.4). Total num frames: 95641600. Throughput: 0: 9690.1. Samples: 95639336. 
Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 17:59:53,386][41256] Avg episode reward: [(0, '73.974')] +[2023-03-11 17:59:57,436][41544] Updated weights for policy 0, policy_version 186880 (0.0005) +[2023-03-11 17:59:58,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9693.9, 300 sec: 9691.6). Total num frames: 95690752. Throughput: 0: 9682.8. Samples: 95667724. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 17:59:58,386][41256] Avg episode reward: [(0, '75.680')] +[2023-03-11 18:00:01,632][41544] Updated weights for policy 0, policy_version 186960 (0.0004) +[2023-03-11 18:00:03,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9691.6). Total num frames: 95739904. Throughput: 0: 9696.7. Samples: 95725704. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 18:00:03,396][41256] Avg episode reward: [(0, '75.584')] +[2023-03-11 18:00:03,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000186992_95739904.pth... +[2023-03-11 18:00:03,403][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000186416_95444992.pth +[2023-03-11 18:00:05,816][41544] Updated weights for policy 0, policy_version 187040 (0.0005) +[2023-03-11 18:00:08,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9625.6, 300 sec: 9663.8). Total num frames: 95784960. Throughput: 0: 9708.9. Samples: 95783588. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 18:00:08,396][41256] Avg episode reward: [(0, '76.251')] +[2023-03-11 18:00:10,193][41544] Updated weights for policy 0, policy_version 187120 (0.0005) +[2023-03-11 18:00:13,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9663.8). Total num frames: 95834112. Throughput: 0: 9691.0. Samples: 95811384. 
Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 18:00:13,397][41256] Avg episode reward: [(0, '76.308')] +[2023-03-11 18:00:14,444][41544] Updated weights for policy 0, policy_version 187200 (0.0005) +[2023-03-11 18:00:18,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9649.9). Total num frames: 95883264. Throughput: 0: 9719.4. Samples: 95870140. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 18:00:18,396][41256] Avg episode reward: [(0, '72.741')] +[2023-03-11 18:00:18,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000187272_95883264.pth... +[2023-03-11 18:00:18,403][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000186712_95596544.pth +[2023-03-11 18:00:18,651][41544] Updated weights for policy 0, policy_version 187280 (0.0005) +[2023-03-11 18:00:22,865][41544] Updated weights for policy 0, policy_version 187360 (0.0005) +[2023-03-11 18:00:23,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9649.9). Total num frames: 95932416. Throughput: 0: 9731.2. Samples: 95928332. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 18:00:23,386][41256] Avg episode reward: [(0, '72.465')] +[2023-03-11 18:00:27,086][41544] Updated weights for policy 0, policy_version 187440 (0.0005) +[2023-03-11 18:00:28,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9693.9, 300 sec: 9622.1). Total num frames: 95977472. Throughput: 0: 9697.1. Samples: 95957056. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 18:00:28,386][41256] Avg episode reward: [(0, '73.081')] +[2023-03-11 18:00:31,287][41544] Updated weights for policy 0, policy_version 187520 (0.0005) +[2023-03-11 18:00:33,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9693.9, 300 sec: 9622.1). Total num frames: 96026624. Throughput: 0: 9647.6. Samples: 96015612. 
Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 18:00:33,386][41256] Avg episode reward: [(0, '72.795')] +[2023-03-11 18:00:33,399][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000187560_96030720.pth... +[2023-03-11 18:00:33,401][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000186992_95739904.pth +[2023-03-11 18:00:35,495][41544] Updated weights for policy 0, policy_version 187600 (0.0004) +[2023-03-11 18:00:38,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9693.9, 300 sec: 9622.1). Total num frames: 96075776. Throughput: 0: 9639.7. Samples: 96073124. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 18:00:38,386][41256] Avg episode reward: [(0, '70.233')] +[2023-03-11 18:00:39,685][41544] Updated weights for policy 0, policy_version 187680 (0.0004) +[2023-03-11 18:00:43,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9762.1, 300 sec: 9622.1). Total num frames: 96129024. Throughput: 0: 9695.6. Samples: 96104028. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) +[2023-03-11 18:00:43,386][41256] Avg episode reward: [(0, '71.406')] +[2023-03-11 18:00:43,776][41544] Updated weights for policy 0, policy_version 187760 (0.0003) +[2023-03-11 18:00:47,963][41544] Updated weights for policy 0, policy_version 187840 (0.0005) +[2023-03-11 18:00:48,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9762.1, 300 sec: 9622.1). Total num frames: 96178176. Throughput: 0: 9720.9. Samples: 96163144. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:00:48,386][41256] Avg episode reward: [(0, '69.848')] +[2023-03-11 18:00:48,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000187848_96178176.pth... 
+[2023-03-11 18:00:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000187272_95883264.pth +[2023-03-11 18:00:52,210][41544] Updated weights for policy 0, policy_version 187920 (0.0005) +[2023-03-11 18:00:53,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9608.2). Total num frames: 96223232. Throughput: 0: 9718.0. Samples: 96220896. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:00:53,397][41256] Avg episode reward: [(0, '70.601')] +[2023-03-11 18:00:56,413][41544] Updated weights for policy 0, policy_version 188000 (0.0005) +[2023-03-11 18:00:58,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9608.2). Total num frames: 96272384. Throughput: 0: 9759.3. Samples: 96250552. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:00:58,396][41256] Avg episode reward: [(0, '69.639')] +[2023-03-11 18:01:00,544][41544] Updated weights for policy 0, policy_version 188080 (0.0005) +[2023-03-11 18:01:03,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9762.1, 300 sec: 9622.1). Total num frames: 96325632. Throughput: 0: 9774.0. Samples: 96309972. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:01:03,396][41256] Avg episode reward: [(0, '68.466')] +[2023-03-11 18:01:03,399][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000188136_96325632.pth... +[2023-03-11 18:01:03,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000187560_96030720.pth +[2023-03-11 18:01:04,529][41544] Updated weights for policy 0, policy_version 188160 (0.0004) +[2023-03-11 18:01:08,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 9636.0). Total num frames: 96374784. Throughput: 0: 9853.6. Samples: 96371744. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:01:08,396][41256] Avg episode reward: [(0, '69.741')] +[2023-03-11 18:01:08,552][41544] Updated weights for policy 0, policy_version 188240 (0.0005) +[2023-03-11 18:01:12,587][41544] Updated weights for policy 0, policy_version 188320 (0.0004) +[2023-03-11 18:01:13,385][41256] Fps is (10 sec: 10240.1, 60 sec: 9898.7, 300 sec: 9649.9). Total num frames: 96428032. Throughput: 0: 9876.5. Samples: 96401496. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:01:13,386][41256] Avg episode reward: [(0, '71.276')] +[2023-03-11 18:01:16,609][41544] Updated weights for policy 0, policy_version 188400 (0.0005) +[2023-03-11 18:01:18,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9663.8). Total num frames: 96477184. Throughput: 0: 9948.6. Samples: 96463300. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:01:18,386][41256] Avg episode reward: [(0, '71.997')] +[2023-03-11 18:01:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000188432_96477184.pth... +[2023-03-11 18:01:18,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000187848_96178176.pth +[2023-03-11 18:01:20,541][41544] Updated weights for policy 0, policy_version 188480 (0.0004) +[2023-03-11 18:01:23,385][41256] Fps is (10 sec: 10239.9, 60 sec: 9966.9, 300 sec: 9677.7). Total num frames: 96530432. Throughput: 0: 10072.8. Samples: 96526400. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:01:23,397][41256] Avg episode reward: [(0, '71.865')] +[2023-03-11 18:01:24,491][41544] Updated weights for policy 0, policy_version 188560 (0.0004) +[2023-03-11 18:01:28,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9677.7). Total num frames: 96579584. Throughput: 0: 10030.8. Samples: 96555416. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:01:28,396][41256] Avg episode reward: [(0, '72.539')] +[2023-03-11 18:01:28,723][41544] Updated weights for policy 0, policy_version 188640 (0.0004) +[2023-03-11 18:01:33,004][41544] Updated weights for policy 0, policy_version 188720 (0.0005) +[2023-03-11 18:01:33,386][41256] Fps is (10 sec: 9420.7, 60 sec: 9966.9, 300 sec: 9663.8). Total num frames: 96624640. Throughput: 0: 10004.5. Samples: 96613348. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:01:33,397][41256] Avg episode reward: [(0, '71.636')] +[2023-03-11 18:01:33,435][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000188728_96628736.pth... +[2023-03-11 18:01:33,437][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000188136_96325632.pth +[2023-03-11 18:01:37,241][41544] Updated weights for policy 0, policy_version 188800 (0.0005) +[2023-03-11 18:01:38,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9966.9, 300 sec: 9663.8). Total num frames: 96673792. Throughput: 0: 10015.1. Samples: 96671576. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:01:38,396][41256] Avg episode reward: [(0, '72.930')] +[2023-03-11 18:01:41,499][41544] Updated weights for policy 0, policy_version 188880 (0.0005) +[2023-03-11 18:01:43,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9898.7, 300 sec: 9677.7). Total num frames: 96722944. Throughput: 0: 10000.3. Samples: 96700564. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:01:43,396][41256] Avg episode reward: [(0, '71.820')] +[2023-03-11 18:01:45,462][41544] Updated weights for policy 0, policy_version 188960 (0.0005) +[2023-03-11 18:01:48,386][41256] Fps is (10 sec: 10239.9, 60 sec: 9966.9, 300 sec: 9691.6). Total num frames: 96776192. Throughput: 0: 10049.5. Samples: 96762200. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:01:48,396][41256] Avg episode reward: [(0, '71.004')] +[2023-03-11 18:01:48,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000189016_96776192.pth... +[2023-03-11 18:01:48,403][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000188432_96477184.pth +[2023-03-11 18:01:49,371][41544] Updated weights for policy 0, policy_version 189040 (0.0005) +[2023-03-11 18:01:53,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10035.2, 300 sec: 9691.6). Total num frames: 96825344. Throughput: 0: 10033.7. Samples: 96823260. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:01:53,397][41256] Avg episode reward: [(0, '72.509')] +[2023-03-11 18:01:53,492][41544] Updated weights for policy 0, policy_version 189120 (0.0005) +[2023-03-11 18:01:57,699][41544] Updated weights for policy 0, policy_version 189200 (0.0005) +[2023-03-11 18:01:58,385][41256] Fps is (10 sec: 9830.5, 60 sec: 10035.2, 300 sec: 9705.4). Total num frames: 96874496. Throughput: 0: 10023.7. Samples: 96852564. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:01:58,396][41256] Avg episode reward: [(0, '73.988')] +[2023-03-11 18:02:01,964][41544] Updated weights for policy 0, policy_version 189280 (0.0005) +[2023-03-11 18:02:03,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9705.4). Total num frames: 96923648. Throughput: 0: 9932.8. Samples: 96910276. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:02:03,386][41256] Avg episode reward: [(0, '72.266')] +[2023-03-11 18:02:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000189304_96923648.pth... 
+[2023-03-11 18:02:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000188728_96628736.pth +[2023-03-11 18:02:06,161][41544] Updated weights for policy 0, policy_version 189360 (0.0005) +[2023-03-11 18:02:08,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9719.3). Total num frames: 96972800. Throughput: 0: 9856.4. Samples: 96969936. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:02:08,386][41256] Avg episode reward: [(0, '73.874')] +[2023-03-11 18:02:10,071][41544] Updated weights for policy 0, policy_version 189440 (0.0004) +[2023-03-11 18:02:13,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 9733.2). Total num frames: 97026048. Throughput: 0: 9910.3. Samples: 97001380. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:02:13,386][41256] Avg episode reward: [(0, '71.114')] +[2023-03-11 18:02:14,110][41544] Updated weights for policy 0, policy_version 189520 (0.0004) +[2023-03-11 18:02:18,321][41544] Updated weights for policy 0, policy_version 189600 (0.0005) +[2023-03-11 18:02:18,386][41256] Fps is (10 sec: 10240.0, 60 sec: 9966.9, 300 sec: 9733.2). Total num frames: 97075200. Throughput: 0: 9938.8. Samples: 97060592. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:02:18,386][41256] Avg episode reward: [(0, '72.915')] +[2023-03-11 18:02:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000189600_97075200.pth... +[2023-03-11 18:02:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000189016_96776192.pth +[2023-03-11 18:02:22,523][41544] Updated weights for policy 0, policy_version 189680 (0.0005) +[2023-03-11 18:02:23,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9747.1). Total num frames: 97124352. Throughput: 0: 9960.3. Samples: 97119792. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:02:23,386][41256] Avg episode reward: [(0, '72.467')] +[2023-03-11 18:02:26,719][41544] Updated weights for policy 0, policy_version 189760 (0.0005) +[2023-03-11 18:02:28,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9830.4, 300 sec: 9733.2). Total num frames: 97169408. Throughput: 0: 9963.6. Samples: 97148928. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:02:28,386][41256] Avg episode reward: [(0, '72.430')] +[2023-03-11 18:02:30,978][41544] Updated weights for policy 0, policy_version 189840 (0.0005) +[2023-03-11 18:02:33,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9898.7, 300 sec: 9747.1). Total num frames: 97218560. Throughput: 0: 9872.9. Samples: 97206480. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:02:33,386][41256] Avg episode reward: [(0, '71.620')] +[2023-03-11 18:02:33,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000189880_97218560.pth... +[2023-03-11 18:02:33,393][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000189304_96923648.pth +[2023-03-11 18:02:35,107][41544] Updated weights for policy 0, policy_version 189920 (0.0005) +[2023-03-11 18:02:38,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9747.1). Total num frames: 97267712. Throughput: 0: 9840.1. Samples: 97266064. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:02:38,386][41256] Avg episode reward: [(0, '72.500')] +[2023-03-11 18:02:39,244][41544] Updated weights for policy 0, policy_version 190000 (0.0005) +[2023-03-11 18:02:43,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9747.1). Total num frames: 97316864. Throughput: 0: 9850.3. Samples: 97295828. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:02:43,386][41256] Avg episode reward: [(0, '71.641')] +[2023-03-11 18:02:43,417][41544] Updated weights for policy 0, policy_version 190080 (0.0005) +[2023-03-11 18:02:47,706][41544] Updated weights for policy 0, policy_version 190160 (0.0005) +[2023-03-11 18:02:48,386][41256] Fps is (10 sec: 9830.3, 60 sec: 9830.4, 300 sec: 9761.0). Total num frames: 97366016. Throughput: 0: 9854.7. Samples: 97353736. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:02:48,386][41256] Avg episode reward: [(0, '73.236')] +[2023-03-11 18:02:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000190168_97366016.pth... +[2023-03-11 18:02:48,391][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000189600_97075200.pth +[2023-03-11 18:02:51,860][41544] Updated weights for policy 0, policy_version 190240 (0.0005) +[2023-03-11 18:02:53,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9830.4, 300 sec: 9761.0). Total num frames: 97415168. Throughput: 0: 9822.1. Samples: 97411932. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:02:53,386][41256] Avg episode reward: [(0, '72.842')] +[2023-03-11 18:02:55,988][41544] Updated weights for policy 0, policy_version 190320 (0.0005) +[2023-03-11 18:02:58,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9830.4, 300 sec: 9761.0). Total num frames: 97464320. Throughput: 0: 9804.4. Samples: 97442576. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:02:58,386][41256] Avg episode reward: [(0, '71.394')] +[2023-03-11 18:02:59,934][41544] Updated weights for policy 0, policy_version 190400 (0.0004) +[2023-03-11 18:03:03,386][41256] Fps is (10 sec: 10239.9, 60 sec: 9898.7, 300 sec: 9788.7). Total num frames: 97517568. Throughput: 0: 9855.6. Samples: 97504092. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:03:03,386][41256] Avg episode reward: [(0, '71.611')] +[2023-03-11 18:03:03,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000190464_97517568.pth... +[2023-03-11 18:03:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000189880_97218560.pth +[2023-03-11 18:03:03,890][41544] Updated weights for policy 0, policy_version 190480 (0.0004) +[2023-03-11 18:03:07,803][41544] Updated weights for policy 0, policy_version 190560 (0.0004) +[2023-03-11 18:03:08,385][41256] Fps is (10 sec: 10649.6, 60 sec: 9966.9, 300 sec: 9802.6). Total num frames: 97570816. Throughput: 0: 9933.3. Samples: 97566792. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:03:08,386][41256] Avg episode reward: [(0, '73.189')] +[2023-03-11 18:03:11,831][41544] Updated weights for policy 0, policy_version 190640 (0.0004) +[2023-03-11 18:03:13,385][41256] Fps is (10 sec: 10240.1, 60 sec: 9898.7, 300 sec: 9802.6). Total num frames: 97619968. Throughput: 0: 9988.0. Samples: 97598388. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:03:13,386][41256] Avg episode reward: [(0, '73.876')] +[2023-03-11 18:03:16,047][41544] Updated weights for policy 0, policy_version 190720 (0.0005) +[2023-03-11 18:03:18,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9816.5). Total num frames: 97669120. Throughput: 0: 10004.5. Samples: 97656684. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:03:18,386][41256] Avg episode reward: [(0, '73.508')] +[2023-03-11 18:03:18,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000190760_97669120.pth... 
+[2023-03-11 18:03:18,393][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000190168_97366016.pth +[2023-03-11 18:03:20,216][41544] Updated weights for policy 0, policy_version 190800 (0.0005) +[2023-03-11 18:03:23,385][41256] Fps is (10 sec: 9830.3, 60 sec: 9898.7, 300 sec: 9816.5). Total num frames: 97718272. Throughput: 0: 9982.1. Samples: 97715260. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:03:23,386][41256] Avg episode reward: [(0, '72.794')] +[2023-03-11 18:03:24,430][41544] Updated weights for policy 0, policy_version 190880 (0.0005) +[2023-03-11 18:03:28,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9830.4). Total num frames: 97767424. Throughput: 0: 9956.1. Samples: 97743852. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:03:28,386][41256] Avg episode reward: [(0, '72.814')] +[2023-03-11 18:03:28,639][41544] Updated weights for policy 0, policy_version 190960 (0.0005) +[2023-03-11 18:03:32,871][41544] Updated weights for policy 0, policy_version 191040 (0.0005) +[2023-03-11 18:03:33,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9830.4). Total num frames: 97816576. Throughput: 0: 9978.4. Samples: 97802764. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:03:33,386][41256] Avg episode reward: [(0, '73.719')] +[2023-03-11 18:03:33,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000191048_97816576.pth... +[2023-03-11 18:03:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000190464_97517568.pth +[2023-03-11 18:03:37,051][41544] Updated weights for policy 0, policy_version 191120 (0.0005) +[2023-03-11 18:03:38,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9830.4). Total num frames: 97865728. Throughput: 0: 9989.8. Samples: 97861472. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:03:38,386][41256] Avg episode reward: [(0, '73.366')] +[2023-03-11 18:03:41,300][41544] Updated weights for policy 0, policy_version 191200 (0.0005) +[2023-03-11 18:03:43,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9898.7, 300 sec: 9830.4). Total num frames: 97910784. Throughput: 0: 9949.8. Samples: 97890316. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:03:43,386][41256] Avg episode reward: [(0, '73.930')] +[2023-03-11 18:03:45,497][41544] Updated weights for policy 0, policy_version 191280 (0.0004) +[2023-03-11 18:03:48,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9898.7, 300 sec: 9830.4). Total num frames: 97959936. Throughput: 0: 9863.6. Samples: 97947952. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:03:48,386][41256] Avg episode reward: [(0, '73.715')] +[2023-03-11 18:03:48,453][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000191336_97964032.pth... +[2023-03-11 18:03:48,455][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000190760_97669120.pth +[2023-03-11 18:03:49,681][41544] Updated weights for policy 0, policy_version 191360 (0.0005) +[2023-03-11 18:03:53,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9830.4). Total num frames: 98009088. Throughput: 0: 9775.4. Samples: 98006684. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:03:53,386][41256] Avg episode reward: [(0, '73.758')] +[2023-03-11 18:03:53,928][41544] Updated weights for policy 0, policy_version 191440 (0.0005) +[2023-03-11 18:03:58,065][41544] Updated weights for policy 0, policy_version 191520 (0.0005) +[2023-03-11 18:03:58,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9898.7, 300 sec: 9830.4). Total num frames: 98058240. Throughput: 0: 9738.7. Samples: 98036628. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:03:58,386][41256] Avg episode reward: [(0, '73.524')] +[2023-03-11 18:04:02,279][41544] Updated weights for policy 0, policy_version 191600 (0.0005) +[2023-03-11 18:04:03,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9830.4). Total num frames: 98107392. Throughput: 0: 9742.7. Samples: 98095104. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 18:04:03,386][41256] Avg episode reward: [(0, '72.338')] +[2023-03-11 18:04:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000191616_98107392.pth... +[2023-03-11 18:04:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000191048_97816576.pth +[2023-03-11 18:04:06,550][41544] Updated weights for policy 0, policy_version 191680 (0.0004) +[2023-03-11 18:04:08,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9844.3). Total num frames: 98156544. Throughput: 0: 9740.5. Samples: 98153584. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 18:04:08,386][41256] Avg episode reward: [(0, '73.582')] +[2023-03-11 18:04:10,426][41544] Updated weights for policy 0, policy_version 191760 (0.0003) +[2023-03-11 18:04:13,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 9858.2). Total num frames: 98209792. Throughput: 0: 9809.7. Samples: 98185288. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 18:04:13,386][41256] Avg episode reward: [(0, '75.206')] +[2023-03-11 18:04:14,402][41544] Updated weights for policy 0, policy_version 191840 (0.0004) +[2023-03-11 18:04:18,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9830.4, 300 sec: 9858.2). Total num frames: 98258944. Throughput: 0: 9865.7. Samples: 98246720. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 18:04:18,396][41256] Avg episode reward: [(0, '74.468')] +[2023-03-11 18:04:18,401][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000191920_98263040.pth... +[2023-03-11 18:04:18,401][41544] Updated weights for policy 0, policy_version 191920 (0.0005) +[2023-03-11 18:04:18,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000191336_97964032.pth +[2023-03-11 18:04:22,455][41544] Updated weights for policy 0, policy_version 192000 (0.0005) +[2023-03-11 18:04:23,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9885.9). Total num frames: 98312192. Throughput: 0: 9925.0. Samples: 98308096. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 18:04:23,386][41256] Avg episode reward: [(0, '73.329')] +[2023-03-11 18:04:26,646][41544] Updated weights for policy 0, policy_version 192080 (0.0005) +[2023-03-11 18:04:28,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9898.7, 300 sec: 9885.9). Total num frames: 98361344. Throughput: 0: 9922.6. Samples: 98336832. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 18:04:28,396][41256] Avg episode reward: [(0, '73.963')] +[2023-03-11 18:04:30,824][41544] Updated weights for policy 0, policy_version 192160 (0.0005) +[2023-03-11 18:04:33,385][41256] Fps is (10 sec: 9830.3, 60 sec: 9898.7, 300 sec: 9885.9). Total num frames: 98410496. Throughput: 0: 9951.4. Samples: 98395764. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 18:04:33,396][41256] Avg episode reward: [(0, '73.539')] +[2023-03-11 18:04:33,400][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000192208_98410496.pth... 
+[2023-03-11 18:04:33,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000191616_98107392.pth +[2023-03-11 18:04:34,990][41544] Updated weights for policy 0, policy_version 192240 (0.0005) +[2023-03-11 18:04:38,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9885.9). Total num frames: 98459648. Throughput: 0: 9986.1. Samples: 98456060. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 18:04:38,396][41256] Avg episode reward: [(0, '74.018')] +[2023-03-11 18:04:38,947][41544] Updated weights for policy 0, policy_version 192320 (0.0004) +[2023-03-11 18:04:42,952][41544] Updated weights for policy 0, policy_version 192400 (0.0005) +[2023-03-11 18:04:43,385][41256] Fps is (10 sec: 10240.1, 60 sec: 10035.2, 300 sec: 9899.8). Total num frames: 98512896. Throughput: 0: 10015.0. Samples: 98487304. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 18:04:43,395][41256] Avg episode reward: [(0, '72.085')] +[2023-03-11 18:04:46,950][41544] Updated weights for policy 0, policy_version 192480 (0.0005) +[2023-03-11 18:04:48,385][41256] Fps is (10 sec: 10239.9, 60 sec: 10035.2, 300 sec: 9899.8). Total num frames: 98562048. Throughput: 0: 10080.6. Samples: 98548732. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 18:04:48,396][41256] Avg episode reward: [(0, '73.844')] +[2023-03-11 18:04:48,399][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000192504_98562048.pth... +[2023-03-11 18:04:48,402][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000191920_98263040.pth +[2023-03-11 18:04:51,059][41544] Updated weights for policy 0, policy_version 192560 (0.0005) +[2023-03-11 18:04:53,385][41256] Fps is (10 sec: 9830.3, 60 sec: 10035.2, 300 sec: 9899.8). Total num frames: 98611200. Throughput: 0: 10098.6. Samples: 98608020. 
Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 18:04:53,386][41256] Avg episode reward: [(0, '72.951')] +[2023-03-11 18:04:55,240][41544] Updated weights for policy 0, policy_version 192640 (0.0004) +[2023-03-11 18:04:58,385][41256] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 9899.8). Total num frames: 98660352. Throughput: 0: 10048.8. Samples: 98637484. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 18:04:58,386][41256] Avg episode reward: [(0, '73.286')] +[2023-03-11 18:04:59,324][41544] Updated weights for policy 0, policy_version 192720 (0.0004) +[2023-03-11 18:05:03,216][41544] Updated weights for policy 0, policy_version 192800 (0.0004) +[2023-03-11 18:05:03,385][41256] Fps is (10 sec: 10240.0, 60 sec: 10103.5, 300 sec: 9927.6). Total num frames: 98713600. Throughput: 0: 10051.4. Samples: 98699032. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 18:05:03,386][41256] Avg episode reward: [(0, '72.680')] +[2023-03-11 18:05:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000192800_98713600.pth... +[2023-03-11 18:05:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000192208_98410496.pth +[2023-03-11 18:05:07,161][41544] Updated weights for policy 0, policy_version 192880 (0.0004) +[2023-03-11 18:05:08,385][41256] Fps is (10 sec: 10649.7, 60 sec: 10171.7, 300 sec: 9941.5). Total num frames: 98766848. Throughput: 0: 10081.2. Samples: 98761748. Policy #0 lag: (min: 2.0, avg: 2.0, max: 2.0) +[2023-03-11 18:05:08,386][41256] Avg episode reward: [(0, '71.018')] +[2023-03-11 18:05:11,048][41544] Updated weights for policy 0, policy_version 192960 (0.0004) +[2023-03-11 18:05:13,385][41256] Fps is (10 sec: 10240.1, 60 sec: 10103.5, 300 sec: 9941.5). Total num frames: 98816000. Throughput: 0: 10141.2. Samples: 98793184. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:05:13,386][41256] Avg episode reward: [(0, '75.066')] +[2023-03-11 18:05:15,102][41544] Updated weights for policy 0, policy_version 193040 (0.0005) +[2023-03-11 18:05:18,386][41256] Fps is (10 sec: 9830.3, 60 sec: 10103.5, 300 sec: 9941.5). Total num frames: 98865152. Throughput: 0: 10161.6. Samples: 98853036. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:05:18,386][41256] Avg episode reward: [(0, '74.227')] +[2023-03-11 18:05:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000193096_98865152.pth... +[2023-03-11 18:05:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000192504_98562048.pth +[2023-03-11 18:05:19,308][41544] Updated weights for policy 0, policy_version 193120 (0.0005) +[2023-03-11 18:05:23,385][41256] Fps is (10 sec: 9830.3, 60 sec: 10035.2, 300 sec: 9955.4). Total num frames: 98914304. Throughput: 0: 10140.8. Samples: 98912396. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:05:23,386][41256] Avg episode reward: [(0, '74.804')] +[2023-03-11 18:05:23,432][41544] Updated weights for policy 0, policy_version 193200 (0.0005) +[2023-03-11 18:05:27,631][41544] Updated weights for policy 0, policy_version 193280 (0.0005) +[2023-03-11 18:05:28,385][41256] Fps is (10 sec: 9830.5, 60 sec: 10035.2, 300 sec: 9955.4). Total num frames: 98963456. Throughput: 0: 10103.8. Samples: 98941976. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:05:28,386][41256] Avg episode reward: [(0, '73.126')] +[2023-03-11 18:05:31,807][41544] Updated weights for policy 0, policy_version 193360 (0.0005) +[2023-03-11 18:05:33,386][41256] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 9955.4). Total num frames: 99012608. Throughput: 0: 10036.9. Samples: 99000392. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:05:33,386][41256] Avg episode reward: [(0, '73.481')] +[2023-03-11 18:05:33,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000193384_99012608.pth... +[2023-03-11 18:05:33,393][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000192800_98713600.pth +[2023-03-11 18:05:36,056][41544] Updated weights for policy 0, policy_version 193440 (0.0005) +[2023-03-11 18:05:38,385][41256] Fps is (10 sec: 9830.4, 60 sec: 10035.2, 300 sec: 9941.5). Total num frames: 99061760. Throughput: 0: 10015.6. Samples: 99058720. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:05:38,386][41256] Avg episode reward: [(0, '74.314')] +[2023-03-11 18:05:40,243][41544] Updated weights for policy 0, policy_version 193520 (0.0005) +[2023-03-11 18:05:43,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9941.5). Total num frames: 99110912. Throughput: 0: 10013.3. Samples: 99088084. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:05:43,386][41256] Avg episode reward: [(0, '75.126')] +[2023-03-11 18:05:44,405][41544] Updated weights for policy 0, policy_version 193600 (0.0005) +[2023-03-11 18:05:48,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9955.4). Total num frames: 99160064. Throughput: 0: 9953.3. Samples: 99146932. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:05:48,386][41256] Avg episode reward: [(0, '74.220')] +[2023-03-11 18:05:48,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000193672_99160064.pth... 
+[2023-03-11 18:05:48,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000193096_98865152.pth +[2023-03-11 18:05:48,664][41544] Updated weights for policy 0, policy_version 193680 (0.0005) +[2023-03-11 18:05:52,861][41544] Updated weights for policy 0, policy_version 193760 (0.0005) +[2023-03-11 18:05:53,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9955.4). Total num frames: 99209216. Throughput: 0: 9852.9. Samples: 99205128. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:05:53,386][41256] Avg episode reward: [(0, '74.999')] +[2023-03-11 18:05:57,055][41544] Updated weights for policy 0, policy_version 193840 (0.0005) +[2023-03-11 18:05:58,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9966.9, 300 sec: 9941.5). Total num frames: 99258368. Throughput: 0: 9795.2. Samples: 99233968. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:05:58,386][41256] Avg episode reward: [(0, '73.789')] +[2023-03-11 18:06:01,232][41544] Updated weights for policy 0, policy_version 193920 (0.0005) +[2023-03-11 18:06:03,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9898.7, 300 sec: 9941.5). Total num frames: 99307520. Throughput: 0: 9777.2. Samples: 99293012. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:06:03,386][41256] Avg episode reward: [(0, '74.377')] +[2023-03-11 18:06:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000193960_99307520.pth... +[2023-03-11 18:06:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000193384_99012608.pth +[2023-03-11 18:06:05,420][41544] Updated weights for policy 0, policy_version 194000 (0.0005) +[2023-03-11 18:06:08,386][41256] Fps is (10 sec: 9830.4, 60 sec: 9830.4, 300 sec: 9927.6). Total num frames: 99356672. Throughput: 0: 9767.4. Samples: 99351928. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:06:08,386][41256] Avg episode reward: [(0, '75.091')] +[2023-03-11 18:06:09,665][41544] Updated weights for policy 0, policy_version 194080 (0.0005) +[2023-03-11 18:06:13,385][41256] Fps is (10 sec: 9420.9, 60 sec: 9762.1, 300 sec: 9913.7). Total num frames: 99401728. Throughput: 0: 9755.6. Samples: 99380976. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:06:13,386][41256] Avg episode reward: [(0, '75.126')] +[2023-03-11 18:06:13,869][41544] Updated weights for policy 0, policy_version 194160 (0.0005) +[2023-03-11 18:06:18,139][41544] Updated weights for policy 0, policy_version 194240 (0.0005) +[2023-03-11 18:06:18,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9762.1, 300 sec: 9899.8). Total num frames: 99450880. Throughput: 0: 9738.1. Samples: 99438608. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:06:18,386][41256] Avg episode reward: [(0, '74.663')] +[2023-03-11 18:06:18,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000194240_99450880.pth... +[2023-03-11 18:06:18,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000193672_99160064.pth +[2023-03-11 18:06:22,359][41544] Updated weights for policy 0, policy_version 194320 (0.0005) +[2023-03-11 18:06:23,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9899.8). Total num frames: 99500032. Throughput: 0: 9731.7. Samples: 99496644. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:06:23,386][41256] Avg episode reward: [(0, '74.131')] +[2023-03-11 18:06:26,544][41544] Updated weights for policy 0, policy_version 194400 (0.0005) +[2023-03-11 18:06:28,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9913.7). Total num frames: 99549184. Throughput: 0: 9729.1. Samples: 99525892. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:06:28,386][41256] Avg episode reward: [(0, '73.951')] +[2023-03-11 18:06:30,697][41544] Updated weights for policy 0, policy_version 194480 (0.0005) +[2023-03-11 18:06:33,386][41256] Fps is (10 sec: 9830.3, 60 sec: 9762.1, 300 sec: 9913.7). Total num frames: 99598336. Throughput: 0: 9736.7. Samples: 99585084. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:06:33,386][41256] Avg episode reward: [(0, '74.352')] +[2023-03-11 18:06:33,390][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000194528_99598336.pth... +[2023-03-11 18:06:33,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000193960_99307520.pth +[2023-03-11 18:06:34,923][41544] Updated weights for policy 0, policy_version 194560 (0.0005) +[2023-03-11 18:06:38,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9913.7). Total num frames: 99647488. Throughput: 0: 9740.8. Samples: 99643464. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:06:38,386][41256] Avg episode reward: [(0, '74.694')] +[2023-03-11 18:06:39,109][41544] Updated weights for policy 0, policy_version 194640 (0.0005) +[2023-03-11 18:06:43,299][41544] Updated weights for policy 0, policy_version 194720 (0.0005) +[2023-03-11 18:06:43,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9899.8). Total num frames: 99696640. Throughput: 0: 9737.2. Samples: 99672144. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:06:43,386][41256] Avg episode reward: [(0, '74.169')] +[2023-03-11 18:06:47,570][41544] Updated weights for policy 0, policy_version 194800 (0.0005) +[2023-03-11 18:06:48,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9885.9). Total num frames: 99741696. Throughput: 0: 9718.9. Samples: 99730364. 
Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:06:48,386][41256] Avg episode reward: [(0, '75.993')] +[2023-03-11 18:06:48,432][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000194816_99745792.pth... +[2023-03-11 18:06:48,434][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000194240_99450880.pth +[2023-03-11 18:06:51,707][41544] Updated weights for policy 0, policy_version 194880 (0.0005) +[2023-03-11 18:06:53,385][41256] Fps is (10 sec: 9830.4, 60 sec: 9762.1, 300 sec: 9899.8). Total num frames: 99794944. Throughput: 0: 9732.1. Samples: 99789872. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:06:53,386][41256] Avg episode reward: [(0, '73.995')] +[2023-03-11 18:06:55,909][41544] Updated weights for policy 0, policy_version 194960 (0.0005) +[2023-03-11 18:06:58,385][41256] Fps is (10 sec: 10240.0, 60 sec: 9762.1, 300 sec: 9899.8). Total num frames: 99844096. Throughput: 0: 9741.5. Samples: 99819344. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:06:58,386][41256] Avg episode reward: [(0, '76.272')] +[2023-03-11 18:07:00,081][41544] Updated weights for policy 0, policy_version 195040 (0.0005) +[2023-03-11 18:07:03,386][41256] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9885.9). Total num frames: 99889152. Throughput: 0: 9749.5. Samples: 99877336. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:07:03,386][41256] Avg episode reward: [(0, '72.545')] +[2023-03-11 18:07:03,389][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000195096_99889152.pth... 
+[2023-03-11 18:07:03,392][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000194528_99598336.pth +[2023-03-11 18:07:04,313][41544] Updated weights for policy 0, policy_version 195120 (0.0005) +[2023-03-11 18:07:08,385][41256] Fps is (10 sec: 9420.8, 60 sec: 9693.9, 300 sec: 9872.1). Total num frames: 99938304. Throughput: 0: 9753.0. Samples: 99935528. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:07:08,386][41256] Avg episode reward: [(0, '75.860')] +[2023-03-11 18:07:08,516][41544] Updated weights for policy 0, policy_version 195200 (0.0005) +[2023-03-11 18:07:12,724][41544] Updated weights for policy 0, policy_version 195280 (0.0005) +[2023-03-11 18:07:13,385][41256] Fps is (10 sec: 9830.5, 60 sec: 9762.1, 300 sec: 9872.1). Total num frames: 99987456. Throughput: 0: 9759.7. Samples: 99965080. Policy #0 lag: (min: 7.0, avg: 7.0, max: 7.0) +[2023-03-11 18:07:13,386][41256] Avg episode reward: [(0, '74.681')] +[2023-03-11 18:07:14,823][41500] Early stopping after 2 epochs (8 sgd steps), loss delta 0.0000000 +[2023-03-11 18:07:15,241][41500] Early stopping after 2 epochs (8 sgd steps), loss delta 0.0000000 +[2023-03-11 18:07:15,242][41546] Stopping RolloutWorker_w2... +[2023-03-11 18:07:15,242][41583] Stopping RolloutWorker_w5... +[2023-03-11 18:07:15,242][41549] Stopping RolloutWorker_w3... +[2023-03-11 18:07:15,242][41546] Loop rollout_proc2_evt_loop terminating... +[2023-03-11 18:07:15,242][41583] Loop rollout_proc5_evt_loop terminating... +[2023-03-11 18:07:15,242][41548] Stopping RolloutWorker_w4... +[2023-03-11 18:07:15,242][41549] Loop rollout_proc3_evt_loop terminating... +[2023-03-11 18:07:15,242][41500] Stopping Batcher_0... +[2023-03-11 18:07:15,242][41548] Loop rollout_proc4_evt_loop terminating... +[2023-03-11 18:07:15,242][41550] Stopping RolloutWorker_w6... +[2023-03-11 18:07:15,242][41547] Stopping RolloutWorker_w0... 
+[2023-03-11 18:07:15,242][41256] Component RolloutWorker_w5 stopped! +[2023-03-11 18:07:15,242][41572] Stopping RolloutWorker_w7... +[2023-03-11 18:07:15,242][41550] Loop rollout_proc6_evt_loop terminating... +[2023-03-11 18:07:15,242][41500] Loop batcher_evt_loop terminating... +[2023-03-11 18:07:15,242][41547] Loop rollout_proc0_evt_loop terminating... +[2023-03-11 18:07:15,242][41545] Stopping RolloutWorker_w1... +[2023-03-11 18:07:15,242][41572] Loop rollout_proc7_evt_loop terminating... +[2023-03-11 18:07:15,242][41256] Component RolloutWorker_w2 stopped! +[2023-03-11 18:07:15,242][41545] Loop rollout_proc1_evt_loop terminating... +[2023-03-11 18:07:15,242][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000195328_100007936.pth... +[2023-03-11 18:07:15,243][41256] Component RolloutWorker_w3 stopped! +[2023-03-11 18:07:15,243][41256] Component Batcher_0 stopped! +[2023-03-11 18:07:15,243][41256] Component RolloutWorker_w4 stopped! +[2023-03-11 18:07:15,243][41256] Component RolloutWorker_w6 stopped! +[2023-03-11 18:07:15,243][41256] Component RolloutWorker_w0 stopped! +[2023-03-11 18:07:15,243][41256] Component RolloutWorker_w7 stopped! +[2023-03-11 18:07:15,244][41256] Component RolloutWorker_w1 stopped! +[2023-03-11 18:07:15,244][41500] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000194816_99745792.pth +[2023-03-11 18:07:15,244][41500] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/stick-pull-v2/checkpoint_p0/checkpoint_000195328_100007936.pth... +[2023-03-11 18:07:15,246][41500] Stopping LearnerWorker_p0... +[2023-03-11 18:07:15,246][41500] Loop learner_proc0_evt_loop terminating... +[2023-03-11 18:07:15,246][41256] Component LearnerWorker_p0 stopped! +[2023-03-11 18:07:15,263][41544] Weights refcount: 2 0 +[2023-03-11 18:07:15,263][41544] Stopping InferenceWorker_p0-w0... 
+[2023-03-11 18:07:15,264][41544] Loop inference_proc0-0_evt_loop terminating... +[2023-03-11 18:07:15,264][41256] Component InferenceWorker_p0-w0 stopped! +[2023-03-11 18:07:15,265][41256] Waiting for process learner_proc0 to stop... +[2023-03-11 18:07:15,682][41256] Waiting for process inference_proc0-0 to join... +[2023-03-11 18:07:15,682][41256] Waiting for process rollout_proc0 to join... +[2023-03-11 18:07:15,683][41256] Waiting for process rollout_proc1 to join... +[2023-03-11 18:07:15,683][41256] Waiting for process rollout_proc2 to join... +[2023-03-11 18:07:15,684][41256] Waiting for process rollout_proc3 to join... +[2023-03-11 18:07:15,684][41256] Waiting for process rollout_proc4 to join... +[2023-03-11 18:07:15,684][41256] Waiting for process rollout_proc5 to join... +[2023-03-11 18:07:15,685][41256] Waiting for process rollout_proc6 to join... +[2023-03-11 18:07:15,685][41256] Waiting for process rollout_proc7 to join... +[2023-03-11 18:07:15,685][41256] Batcher 0 profile tree view: +batching: 17.7316, releasing_batches: 15.0058 +[2023-03-11 18:07:15,685][41256] InferenceWorker_p0-w0 profile tree view: +wait_policy: 0.0000 + wait_policy_total: 4086.2930 +update_model: 110.5235 + weight_update: 0.0005 +one_step: 0.0007 + handle_policy_step: 5515.9914 + deserialize: 230.5263, stack: 59.0635, obs_to_device_normalize: 997.2117, forward: 2748.5211, send_messages: 383.6733 + prepare_outputs: 616.0213 + to_cpu: 99.0715 +[2023-03-11 18:07:15,685][41256] Learner 0 profile tree view: +misc: 0.0954, prepare_batch: 91.1412 +train: 1179.9865 + epoch_init: 0.3734, minibatch_init: 12.2620, losses_postprocess: 12.1599, kl_divergence: 4.3480, after_optimizer: 4.9471 + calculate_losses: 485.9633 + losses_init: 0.3247, forward_head: 240.2692, bptt_initial: 1.2741, bptt: 1.2388, tail: 114.2932, advantages_returns: 8.8129, losses: 105.6194 + update: 644.2314 + clip: 56.8388 +[2023-03-11 18:07:15,686][41256] RolloutWorker_w0 profile tree view: +wait_for_trajectories: 
2.7724, enqueue_policy_requests: 127.3107, env_step: 7536.8992, overhead: 306.1504, complete_rollouts: 3.2780 +save_policy_outputs: 328.3066 + split_output_tensors: 162.0075 +[2023-03-11 18:07:15,686][41256] RolloutWorker_w7 profile tree view: +wait_for_trajectories: 2.8058, enqueue_policy_requests: 128.8893, env_step: 7506.4859, overhead: 307.0077, complete_rollouts: 3.3760 +save_policy_outputs: 321.2704 + split_output_tensors: 158.9147 +[2023-03-11 18:07:15,686][41256] Loop Runner_EvtLoop terminating... +[2023-03-11 18:07:15,686][41256] Runner profile tree view: +main_loop: 10404.9287 +[2023-03-11 18:07:15,686][41256] Collected {0: 100007936}, FPS: 9611.6