[2023-03-07 07:35:31,381][155126] Saving configuration to /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/config.json... [2023-03-07 07:35:31,396][155126] Rollout worker 0 uses device cpu [2023-03-07 07:35:31,396][155126] Rollout worker 1 uses device cpu [2023-03-07 07:35:31,396][155126] Rollout worker 2 uses device cpu [2023-03-07 07:35:31,396][155126] Rollout worker 3 uses device cpu [2023-03-07 07:35:31,396][155126] Rollout worker 4 uses device cpu [2023-03-07 07:35:31,396][155126] Rollout worker 5 uses device cpu [2023-03-07 07:35:31,397][155126] Rollout worker 6 uses device cpu [2023-03-07 07:35:31,397][155126] Rollout worker 7 uses device cpu [2023-03-07 07:35:31,397][155126] Rollout worker 8 uses device cpu [2023-03-07 07:35:31,397][155126] Rollout worker 9 uses device cpu [2023-03-07 07:35:31,397][155126] Rollout worker 10 uses device cpu [2023-03-07 07:35:31,397][155126] Rollout worker 11 uses device cpu [2023-03-07 07:35:31,397][155126] Rollout worker 12 uses device cpu [2023-03-07 07:35:31,397][155126] Rollout worker 13 uses device cpu [2023-03-07 07:35:31,398][155126] Rollout worker 14 uses device cpu [2023-03-07 07:35:31,398][155126] Rollout worker 15 uses device cpu [2023-03-07 07:35:31,398][155126] Rollout worker 16 uses device cpu [2023-03-07 07:35:31,398][155126] Rollout worker 17 uses device cpu [2023-03-07 07:35:31,398][155126] Rollout worker 18 uses device cpu [2023-03-07 07:35:31,398][155126] Rollout worker 19 uses device cpu [2023-03-07 07:35:31,398][155126] Rollout worker 20 uses device cpu [2023-03-07 07:35:31,398][155126] Rollout worker 21 uses device cpu [2023-03-07 07:35:31,398][155126] Rollout worker 22 uses device cpu [2023-03-07 07:35:31,399][155126] Rollout worker 23 uses device cpu [2023-03-07 07:35:31,399][155126] Rollout worker 24 uses device cpu [2023-03-07 07:35:31,399][155126] Rollout worker 25 uses device cpu [2023-03-07 07:35:31,399][155126] Rollout worker 26 uses device cpu [2023-03-07 
07:35:31,399][155126] Rollout worker 27 uses device cpu [2023-03-07 07:35:31,399][155126] Rollout worker 28 uses device cpu [2023-03-07 07:35:31,399][155126] Rollout worker 29 uses device cpu [2023-03-07 07:35:31,399][155126] Rollout worker 30 uses device cpu [2023-03-07 07:35:31,399][155126] Rollout worker 31 uses device cpu [2023-03-07 07:35:31,412][155126] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-03-07 07:35:31,413][155126] InferenceWorker_p0-w0: min num requests: 10 [2023-03-07 07:35:31,495][155126] Starting all processes... [2023-03-07 07:35:31,496][155126] Starting process learner_proc0 [2023-03-07 07:35:31,546][155126] Starting all processes... [2023-03-07 07:35:31,593][155126] Starting process inference_proc0-0 [2023-03-07 07:35:31,601][155126] Starting process rollout_proc0 [2023-03-07 07:35:31,602][155126] Starting process rollout_proc1 [2023-03-07 07:35:31,602][155126] Starting process rollout_proc2 [2023-03-07 07:35:31,602][155126] Starting process rollout_proc3 [2023-03-07 07:35:31,603][155126] Starting process rollout_proc4 [2023-03-07 07:35:31,603][155126] Starting process rollout_proc5 [2023-03-07 07:35:31,603][155126] Starting process rollout_proc6 [2023-03-07 07:35:31,603][155126] Starting process rollout_proc7 [2023-03-07 07:35:31,604][155126] Starting process rollout_proc8 [2023-03-07 07:35:31,605][155126] Starting process rollout_proc9 [2023-03-07 07:35:31,611][155126] Starting process rollout_proc10 [2023-03-07 07:35:31,612][155126] Starting process rollout_proc11 [2023-03-07 07:35:31,612][155126] Starting process rollout_proc12 [2023-03-07 07:35:31,613][155126] Starting process rollout_proc13 [2023-03-07 07:35:31,614][155126] Starting process rollout_proc14 [2023-03-07 07:35:31,614][155126] Starting process rollout_proc15 [2023-03-07 07:35:31,620][155126] Starting process rollout_proc16 [2023-03-07 07:35:31,620][155126] Starting process rollout_proc17 [2023-03-07 07:35:31,626][155126] Starting process rollout_proc18 
[2023-03-07 07:35:31,631][155126] Starting process rollout_proc19
[2023-03-07 07:35:31,640][155126] Starting process rollout_proc20
[2023-03-07 07:35:31,646][155126] Starting process rollout_proc21
[2023-03-07 07:35:31,785][155126] Starting process rollout_proc22
[2023-03-07 07:35:31,786][155126] Starting process rollout_proc23
[2023-03-07 07:35:31,794][155126] Starting process rollout_proc24
[2023-03-07 07:35:31,802][155126] Starting process rollout_proc25
[2023-03-07 07:35:31,835][155126] Starting process rollout_proc26
[2023-03-07 07:35:31,844][155126] Starting process rollout_proc27
[2023-03-07 07:35:31,844][155126] Starting process rollout_proc28
[2023-03-07 07:35:31,845][155126] Starting process rollout_proc29
[2023-03-07 07:35:31,845][155126] Starting process rollout_proc30
[2023-03-07 07:35:31,845][155126] Starting process rollout_proc31
[2023-03-07 07:35:33,455][155401] Using GPUs [0] for process 0 (actually maps to GPUs [0])
[2023-03-07 07:35:33,455][155401] Set environment var CUDA_VISIBLE_DEVICES to '0' (GPU indices [0]) for learning process 0
[2023-03-07 07:35:33,465][155401] Num visible devices: 1
[2023-03-07 07:35:33,490][155454] Worker 1 uses CPU cores [1]
[2023-03-07 07:35:33,490][155401] WARNING! It is generally recommended to enable Fixed KL loss (https://arxiv.org/pdf/1707.06347.pdf) for continuous action tasks to avoid potential numerical issues. I.e. set --kl_loss_coeff=0.1
[2023-03-07 07:35:33,490][155401] Starting seed is not provided
[2023-03-07 07:35:33,490][155401] Using GPUs [0] for process 0 (actually maps to GPUs [0])
[2023-03-07 07:35:33,491][155401] Initializing actor-critic model on device cuda:0
[2023-03-07 07:35:33,491][155401] RunningMeanStd input shape: (39,)
[2023-03-07 07:35:33,491][155401] RunningMeanStd input shape: (1,)
[2023-03-07 07:35:33,592][155401] Created Actor Critic model with architecture:
[2023-03-07 07:35:33,593][155401] ActorCriticSharedWeights(
  (obs_normalizer): ObservationNormalizer(
    (running_mean_std): RunningMeanStdDictInPlace(
      (running_mean_std): ModuleDict(
        (obs): RunningMeanStdInPlace()
      )
    )
  )
  (returns_normalizer): RecursiveScriptModule(original_name=RunningMeanStdInPlace)
  (encoder): MultiInputEncoder(
    (encoders): ModuleDict(
      (obs): MlpEncoder(
        (mlp_head): RecursiveScriptModule(
          original_name=Sequential
          (0): RecursiveScriptModule(original_name=Linear)
          (1): RecursiveScriptModule(original_name=ELU)
          (2): RecursiveScriptModule(original_name=Linear)
          (3): RecursiveScriptModule(original_name=ELU)
        )
      )
    )
  )
  (core): ModelCoreRNN(
    (core): GRU(512, 512)
  )
  (decoder): MlpDecoder(
    (mlp): Identity()
  )
  (critic_linear): Linear(in_features=512, out_features=1, bias=True)
  (action_parameterization): ActionParameterizationDefault(
    (distribution_linear): Linear(in_features=512, out_features=8, bias=True)
  )
)
[2023-03-07 07:35:33,667][155457] Worker 4 uses CPU cores [4]
[2023-03-07 07:35:33,705][155455] Worker 2 uses CPU cores [2]
[2023-03-07 07:35:33,927][155682] Worker 18 uses CPU cores [18]
[2023-03-07 07:35:34,023][156073] Worker 30 uses CPU cores [30]
[2023-03-07 07:35:34,036][155680] Worker 15 uses CPU cores [15]
[2023-03-07 07:35:34,243][155683] Worker 16 uses CPU cores [16]
[2023-03-07 07:35:34,271][156005] Worker 23 uses CPU cores [23]
[2023-03-07 07:35:34,327][155675] Worker 12 uses CPU cores [12]
[2023-03-07 07:35:34,547][155684] Worker 7 uses CPU cores [7]
[2023-03-07 
07:35:34,598][156041] Worker 29 uses CPU cores [29] [2023-03-07 07:35:34,767][155458] Worker 5 uses CPU cores [5] [2023-03-07 07:35:34,939][155960] Worker 22 uses CPU cores [22] [2023-03-07 07:35:35,076][156006] Worker 24 uses CPU cores [24] [2023-03-07 07:35:35,175][156039] Worker 27 uses CPU cores [27] [2023-03-07 07:35:35,181][155401] Using optimizer [2023-03-07 07:35:35,182][155401] No checkpoints found [2023-03-07 07:35:35,182][155401] Did not load from checkpoint, starting from scratch! [2023-03-07 07:35:35,182][155401] Initialized policy 0 weights for model version 0 [2023-03-07 07:35:35,183][155401] LearnerWorker_p0 finished initialization! [2023-03-07 07:35:35,184][155401] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-03-07 07:35:35,280][155689] Worker 17 uses CPU cores [17] [2023-03-07 07:35:35,443][155706] Worker 20 uses CPU cores [20] [2023-03-07 07:35:35,483][156076] Worker 31 uses CPU cores [31] [2023-03-07 07:35:35,530][155687] Worker 14 uses CPU cores [14] [2023-03-07 07:35:35,791][155679] Worker 9 uses CPU cores [9] [2023-03-07 07:35:35,792][155686] Worker 11 uses CPU cores [11] [2023-03-07 07:35:35,913][155456] Worker 3 uses CPU cores [3] [2023-03-07 07:35:35,926][155452] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-03-07 07:35:35,926][155452] Set environment var CUDA_VISIBLE_DEVICES to '0' (GPU indices [0]) for inference process 0 [2023-03-07 07:35:35,936][155452] Num visible devices: 1 [2023-03-07 07:35:36,032][155452] RunningMeanStd input shape: (39,) [2023-03-07 07:35:36,032][155452] RunningMeanStd input shape: (1,) [2023-03-07 07:35:36,035][156038] Worker 26 uses CPU cores [26] [2023-03-07 07:35:36,183][156074] Worker 28 uses CPU cores [28] [2023-03-07 07:35:36,289][155674] Worker 10 uses CPU cores [10] [2023-03-07 07:35:36,290][155688] Worker 19 uses CPU cores [19] [2023-03-07 07:35:36,426][156075] Worker 25 uses CPU cores [25] [2023-03-07 07:35:36,435][155724] Worker 13 uses CPU cores [13] [2023-03-07 
07:35:36,464][155673] Worker 6 uses CPU cores [6] [2023-03-07 07:35:36,587][155681] Worker 21 uses CPU cores [21] [2023-03-07 07:35:36,692][155126] Inference worker 0-0 is ready! [2023-03-07 07:35:36,693][155126] All inference workers are ready! Signal rollout workers to start! [2023-03-07 07:35:36,778][155453] Worker 0 uses CPU cores [0] [2023-03-07 07:35:36,921][155685] Worker 8 uses CPU cores [8] [2023-03-07 07:35:38,247][156006] Decorrelating experience for 0 frames... [2023-03-07 07:35:38,265][155683] Decorrelating experience for 0 frames... [2023-03-07 07:35:38,367][155126] Fps is (10 sec: nan, 60 sec: nan, 300 sec: nan). Total num frames: 0. Throughput: 0: nan. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0) [2023-03-07 07:35:38,390][155458] Decorrelating experience for 0 frames... [2023-03-07 07:35:38,390][155679] Decorrelating experience for 0 frames... [2023-03-07 07:35:38,436][155687] Decorrelating experience for 0 frames... [2023-03-07 07:35:38,461][155960] Decorrelating experience for 0 frames... [2023-03-07 07:35:38,481][155456] Decorrelating experience for 0 frames... [2023-03-07 07:35:38,513][156041] Decorrelating experience for 0 frames... [2023-03-07 07:35:38,549][155674] Decorrelating experience for 0 frames... [2023-03-07 07:35:38,550][156038] Decorrelating experience for 0 frames... [2023-03-07 07:35:38,568][155680] Decorrelating experience for 0 frames... [2023-03-07 07:35:38,568][156076] Decorrelating experience for 0 frames... [2023-03-07 07:35:38,584][155675] Decorrelating experience for 0 frames... [2023-03-07 07:35:38,588][155724] Decorrelating experience for 0 frames... [2023-03-07 07:35:38,591][156039] Decorrelating experience for 0 frames... [2023-03-07 07:35:38,603][155454] Decorrelating experience for 0 frames... [2023-03-07 07:35:38,603][155689] Decorrelating experience for 0 frames... [2023-03-07 07:35:38,606][155688] Decorrelating experience for 0 frames... 
[2023-03-07 07:35:38,610][155457] Decorrelating experience for 0 frames... [2023-03-07 07:35:38,610][155706] Decorrelating experience for 0 frames... [2023-03-07 07:35:38,614][156005] Decorrelating experience for 0 frames... [2023-03-07 07:35:38,615][155684] Decorrelating experience for 0 frames... [2023-03-07 07:35:38,625][155682] Decorrelating experience for 0 frames... [2023-03-07 07:35:38,625][155455] Decorrelating experience for 0 frames... [2023-03-07 07:35:38,638][156074] Decorrelating experience for 0 frames... [2023-03-07 07:35:38,652][155673] Decorrelating experience for 0 frames... [2023-03-07 07:35:38,653][156073] Decorrelating experience for 0 frames... [2023-03-07 07:35:38,657][156075] Decorrelating experience for 0 frames... [2023-03-07 07:35:38,657][155686] Decorrelating experience for 0 frames... [2023-03-07 07:35:38,856][155681] Decorrelating experience for 0 frames... [2023-03-07 07:35:39,036][155453] Decorrelating experience for 0 frames... [2023-03-07 07:35:39,185][155685] Decorrelating experience for 0 frames... [2023-03-07 07:35:40,023][156006] Decorrelating experience for 32 frames... [2023-03-07 07:35:40,036][155683] Decorrelating experience for 32 frames... [2023-03-07 07:35:40,192][155679] Decorrelating experience for 32 frames... [2023-03-07 07:35:40,213][155458] Decorrelating experience for 32 frames... [2023-03-07 07:35:40,234][155687] Decorrelating experience for 32 frames... [2023-03-07 07:35:40,292][155960] Decorrelating experience for 32 frames... [2023-03-07 07:35:40,296][156041] Decorrelating experience for 32 frames... [2023-03-07 07:35:40,312][155456] Decorrelating experience for 32 frames... [2023-03-07 07:35:40,345][155724] Decorrelating experience for 32 frames... [2023-03-07 07:35:40,355][156075] Decorrelating experience for 32 frames... [2023-03-07 07:35:40,373][156073] Decorrelating experience for 32 frames... [2023-03-07 07:35:40,376][155680] Decorrelating experience for 32 frames... 
[2023-03-07 07:35:40,378][156076] Decorrelating experience for 32 frames... [2023-03-07 07:35:40,395][156038] Decorrelating experience for 32 frames... [2023-03-07 07:35:40,399][155674] Decorrelating experience for 32 frames... [2023-03-07 07:35:40,404][155688] Decorrelating experience for 32 frames... [2023-03-07 07:35:40,409][156039] Decorrelating experience for 32 frames... [2023-03-07 07:35:40,417][155673] Decorrelating experience for 32 frames... [2023-03-07 07:35:40,419][155675] Decorrelating experience for 32 frames... [2023-03-07 07:35:40,428][155454] Decorrelating experience for 32 frames... [2023-03-07 07:35:40,428][155689] Decorrelating experience for 32 frames... [2023-03-07 07:35:40,438][155706] Decorrelating experience for 32 frames... [2023-03-07 07:35:40,438][155684] Decorrelating experience for 32 frames... [2023-03-07 07:35:40,438][155457] Decorrelating experience for 32 frames... [2023-03-07 07:35:40,443][155686] Decorrelating experience for 32 frames... [2023-03-07 07:35:40,471][155682] Decorrelating experience for 32 frames... [2023-03-07 07:35:40,475][155455] Decorrelating experience for 32 frames... [2023-03-07 07:35:40,487][155681] Decorrelating experience for 32 frames... [2023-03-07 07:35:40,494][156005] Decorrelating experience for 32 frames... [2023-03-07 07:35:40,509][156074] Decorrelating experience for 32 frames... [2023-03-07 07:35:40,555][155453] Decorrelating experience for 32 frames... [2023-03-07 07:35:40,799][155685] Decorrelating experience for 32 frames... [2023-03-07 07:35:40,864][155401] Signal inference workers to stop experience collection... [2023-03-07 07:35:40,868][155452] InferenceWorker_p0-w0: stopping experience collection [2023-03-07 07:35:41,152][155401] Signal inference workers to resume experience collection... 
[2023-03-07 07:35:41,153][155452] InferenceWorker_p0-w0: resuming experience collection [2023-03-07 07:35:42,326][155452] Updated weights for policy 0, policy_version 10 (0.0217) [2023-03-07 07:35:43,080][155452] Updated weights for policy 0, policy_version 20 (0.0006) [2023-03-07 07:35:43,367][155126] Fps is (10 sec: 4710.5, 60 sec: 4710.5, 300 sec: 4710.5). Total num frames: 23552. Throughput: 0: 3377.8. Samples: 16889. Policy #0 lag: (min: 0.0, avg: 1.5, max: 3.0) [2023-03-07 07:35:43,893][155452] Updated weights for policy 0, policy_version 30 (0.0006) [2023-03-07 07:35:44,630][155452] Updated weights for policy 0, policy_version 40 (0.0005) [2023-03-07 07:35:45,389][155452] Updated weights for policy 0, policy_version 50 (0.0006) [2023-03-07 07:35:46,153][155452] Updated weights for policy 0, policy_version 60 (0.0006) [2023-03-07 07:35:46,953][155452] Updated weights for policy 0, policy_version 70 (0.0006) [2023-03-07 07:35:47,725][155452] Updated weights for policy 0, policy_version 80 (0.0006) [2023-03-07 07:35:48,367][155126] Fps is (10 sec: 9011.3, 60 sec: 9011.3, 300 sec: 9011.3). Total num frames: 90112. Throughput: 0: 5656.4. Samples: 56564. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:35:48,367][155126] Avg episode reward: [(0, '195.768')] [2023-03-07 07:35:48,491][155452] Updated weights for policy 0, policy_version 90 (0.0007) [2023-03-07 07:35:49,264][155452] Updated weights for policy 0, policy_version 100 (0.0006) [2023-03-07 07:35:50,006][155452] Updated weights for policy 0, policy_version 110 (0.0006) [2023-03-07 07:35:50,765][155452] Updated weights for policy 0, policy_version 120 (0.0006) [2023-03-07 07:35:51,408][155126] Heartbeat connected on Batcher_0 [2023-03-07 07:35:51,410][155126] Heartbeat connected on LearnerWorker_p0 [2023-03-07 07:35:51,415][155126] Heartbeat connected on InferenceWorker_p0-w0 [2023-03-07 07:35:51,416][155126] Heartbeat connected on RolloutWorker_w0 [2023-03-07 07:35:51,417][155126] Heartbeat connected on RolloutWorker_w1 [2023-03-07 07:35:51,419][155126] Heartbeat connected on RolloutWorker_w2 [2023-03-07 07:35:51,421][155126] Heartbeat connected on RolloutWorker_w3 [2023-03-07 07:35:51,422][155126] Heartbeat connected on RolloutWorker_w4 [2023-03-07 07:35:51,424][155126] Heartbeat connected on RolloutWorker_w5 [2023-03-07 07:35:51,426][155126] Heartbeat connected on RolloutWorker_w6 [2023-03-07 07:35:51,428][155126] Heartbeat connected on RolloutWorker_w7 [2023-03-07 07:35:51,430][155126] Heartbeat connected on RolloutWorker_w8 [2023-03-07 07:35:51,432][155126] Heartbeat connected on RolloutWorker_w9 [2023-03-07 07:35:51,434][155126] Heartbeat connected on RolloutWorker_w10 [2023-03-07 07:35:51,458][155126] Heartbeat connected on RolloutWorker_w11 [2023-03-07 07:35:51,459][155126] Heartbeat connected on RolloutWorker_w12 [2023-03-07 07:35:51,462][155126] Heartbeat connected on RolloutWorker_w13 [2023-03-07 07:35:51,463][155126] Heartbeat connected on RolloutWorker_w14 [2023-03-07 07:35:51,465][155126] Heartbeat connected on RolloutWorker_w15 [2023-03-07 07:35:51,467][155126] Heartbeat connected on RolloutWorker_w16 [2023-03-07 
07:35:51,469][155126] Heartbeat connected on RolloutWorker_w17 [2023-03-07 07:35:51,471][155126] Heartbeat connected on RolloutWorker_w18 [2023-03-07 07:35:51,473][155126] Heartbeat connected on RolloutWorker_w19 [2023-03-07 07:35:51,475][155126] Heartbeat connected on RolloutWorker_w20 [2023-03-07 07:35:51,476][155126] Heartbeat connected on RolloutWorker_w21 [2023-03-07 07:35:51,477][155126] Heartbeat connected on RolloutWorker_w22 [2023-03-07 07:35:51,480][155126] Heartbeat connected on RolloutWorker_w23 [2023-03-07 07:35:51,481][155126] Heartbeat connected on RolloutWorker_w24 [2023-03-07 07:35:51,484][155126] Heartbeat connected on RolloutWorker_w25 [2023-03-07 07:35:51,486][155126] Heartbeat connected on RolloutWorker_w26 [2023-03-07 07:35:51,487][155126] Heartbeat connected on RolloutWorker_w27 [2023-03-07 07:35:51,488][155126] Heartbeat connected on RolloutWorker_w28 [2023-03-07 07:35:51,492][155126] Heartbeat connected on RolloutWorker_w30 [2023-03-07 07:35:51,494][155126] Heartbeat connected on RolloutWorker_w31 [2023-03-07 07:35:51,494][155126] Heartbeat connected on RolloutWorker_w29 [2023-03-07 07:35:51,551][155452] Updated weights for policy 0, policy_version 130 (0.0006) [2023-03-07 07:35:52,326][155452] Updated weights for policy 0, policy_version 140 (0.0006) [2023-03-07 07:35:53,076][155452] Updated weights for policy 0, policy_version 150 (0.0007) [2023-03-07 07:35:53,367][155126] Fps is (10 sec: 13312.0, 60 sec: 10444.8, 300 sec: 10444.8). Total num frames: 156672. Throughput: 0: 9072.6. Samples: 136088. Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) [2023-03-07 07:35:53,367][155126] Avg episode reward: [(0, '257.859')] [2023-03-07 07:35:53,368][155401] Saving new best policy, reward=257.859! 
[2023-03-07 07:35:53,892][155452] Updated weights for policy 0, policy_version 160 (0.0007) [2023-03-07 07:35:54,656][155452] Updated weights for policy 0, policy_version 170 (0.0006) [2023-03-07 07:35:55,433][155452] Updated weights for policy 0, policy_version 180 (0.0007) [2023-03-07 07:35:56,204][155452] Updated weights for policy 0, policy_version 190 (0.0006) [2023-03-07 07:35:56,977][155452] Updated weights for policy 0, policy_version 200 (0.0006) [2023-03-07 07:35:57,746][155452] Updated weights for policy 0, policy_version 210 (0.0006) [2023-03-07 07:35:58,367][155126] Fps is (10 sec: 13209.6, 60 sec: 11110.4, 300 sec: 11110.4). Total num frames: 222208. Throughput: 0: 10775.7. Samples: 215513. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 07:35:58,368][155126] Avg episode reward: [(0, '250.108')] [2023-03-07 07:35:58,550][155452] Updated weights for policy 0, policy_version 220 (0.0007) [2023-03-07 07:35:59,316][155452] Updated weights for policy 0, policy_version 230 (0.0007) [2023-03-07 07:36:00,077][155452] Updated weights for policy 0, policy_version 240 (0.0006) [2023-03-07 07:36:00,868][155452] Updated weights for policy 0, policy_version 250 (0.0005) [2023-03-07 07:36:01,625][155452] Updated weights for policy 0, policy_version 260 (0.0006) [2023-03-07 07:36:02,391][155452] Updated weights for policy 0, policy_version 270 (0.0006) [2023-03-07 07:36:03,180][155452] Updated weights for policy 0, policy_version 280 (0.0006) [2023-03-07 07:36:03,367][155126] Fps is (10 sec: 13209.4, 60 sec: 11550.7, 300 sec: 11550.7). Total num frames: 288768. Throughput: 0: 10199.6. Samples: 254992. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:36:03,368][155126] Avg episode reward: [(0, '454.405')] [2023-03-07 07:36:03,368][155401] Saving new best policy, reward=454.405! 
[2023-03-07 07:36:03,985][155452] Updated weights for policy 0, policy_version 290 (0.0006) [2023-03-07 07:36:04,767][155452] Updated weights for policy 0, policy_version 300 (0.0006) [2023-03-07 07:36:05,540][155452] Updated weights for policy 0, policy_version 310 (0.0006) [2023-03-07 07:36:06,345][155452] Updated weights for policy 0, policy_version 320 (0.0006) [2023-03-07 07:36:07,113][155452] Updated weights for policy 0, policy_version 330 (0.0005) [2023-03-07 07:36:07,884][155452] Updated weights for policy 0, policy_version 340 (0.0006) [2023-03-07 07:36:08,367][155126] Fps is (10 sec: 13209.7, 60 sec: 11810.2, 300 sec: 11810.2). Total num frames: 354304. Throughput: 0: 11120.8. Samples: 333622. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:36:08,367][155126] Avg episode reward: [(0, '384.947')] [2023-03-07 07:36:08,674][155452] Updated weights for policy 0, policy_version 350 (0.0005) [2023-03-07 07:36:09,439][155452] Updated weights for policy 0, policy_version 360 (0.0006) [2023-03-07 07:36:10,207][155452] Updated weights for policy 0, policy_version 370 (0.0006) [2023-03-07 07:36:11,022][155452] Updated weights for policy 0, policy_version 380 (0.0007) [2023-03-07 07:36:11,780][155452] Updated weights for policy 0, policy_version 390 (0.0005) [2023-03-07 07:36:12,557][155452] Updated weights for policy 0, policy_version 400 (0.0007) [2023-03-07 07:36:13,359][155452] Updated weights for policy 0, policy_version 410 (0.0006) [2023-03-07 07:36:13,367][155126] Fps is (10 sec: 13107.4, 60 sec: 11995.4, 300 sec: 11995.4). Total num frames: 419840. Throughput: 0: 11791.4. Samples: 412700. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:36:13,368][155126] Avg episode reward: [(0, '660.406')] [2023-03-07 07:36:13,368][155401] Saving new best policy, reward=660.406! 
[2023-03-07 07:36:14,134][155452] Updated weights for policy 0, policy_version 420 (0.0005) [2023-03-07 07:36:14,915][155452] Updated weights for policy 0, policy_version 430 (0.0006) [2023-03-07 07:36:15,693][155452] Updated weights for policy 0, policy_version 440 (0.0006) [2023-03-07 07:36:16,493][155452] Updated weights for policy 0, policy_version 450 (0.0006) [2023-03-07 07:36:17,254][155452] Updated weights for policy 0, policy_version 460 (0.0006) [2023-03-07 07:36:18,032][155452] Updated weights for policy 0, policy_version 470 (0.0006) [2023-03-07 07:36:18,367][155126] Fps is (10 sec: 13107.0, 60 sec: 12134.4, 300 sec: 12134.4). Total num frames: 485376. Throughput: 0: 11296.2. Samples: 451847. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 07:36:18,368][155126] Avg episode reward: [(0, '604.278')] [2023-03-07 07:36:18,817][155452] Updated weights for policy 0, policy_version 480 (0.0007) [2023-03-07 07:36:19,582][155452] Updated weights for policy 0, policy_version 490 (0.0006) [2023-03-07 07:36:20,362][155452] Updated weights for policy 0, policy_version 500 (0.0007) [2023-03-07 07:36:21,128][155452] Updated weights for policy 0, policy_version 510 (0.0006) [2023-03-07 07:36:21,885][155452] Updated weights for policy 0, policy_version 520 (0.0007) [2023-03-07 07:36:22,672][155452] Updated weights for policy 0, policy_version 530 (0.0006) [2023-03-07 07:36:23,367][155126] Fps is (10 sec: 13107.2, 60 sec: 12242.5, 300 sec: 12242.5). Total num frames: 550912. Throughput: 0: 11802.9. Samples: 531131. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:36:23,367][155126] Avg episode reward: [(0, '312.129')] [2023-03-07 07:36:23,438][155452] Updated weights for policy 0, policy_version 540 (0.0006) [2023-03-07 07:36:24,211][155452] Updated weights for policy 0, policy_version 550 (0.0007) [2023-03-07 07:36:24,973][155452] Updated weights for policy 0, policy_version 560 (0.0007) [2023-03-07 07:36:25,758][155452] Updated weights for policy 0, policy_version 570 (0.0006) [2023-03-07 07:36:26,533][155452] Updated weights for policy 0, policy_version 580 (0.0005) [2023-03-07 07:36:27,312][155452] Updated weights for policy 0, policy_version 590 (0.0007) [2023-03-07 07:36:28,098][155452] Updated weights for policy 0, policy_version 600 (0.0007) [2023-03-07 07:36:28,367][155126] Fps is (10 sec: 13209.7, 60 sec: 12349.5, 300 sec: 12349.5). Total num frames: 617472. Throughput: 0: 13187.3. Samples: 610317. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:36:28,367][155126] Avg episode reward: [(0, '412.642')] [2023-03-07 07:36:28,878][155452] Updated weights for policy 0, policy_version 610 (0.0006) [2023-03-07 07:36:29,642][155452] Updated weights for policy 0, policy_version 620 (0.0006) [2023-03-07 07:36:30,437][155452] Updated weights for policy 0, policy_version 630 (0.0006) [2023-03-07 07:36:31,214][155452] Updated weights for policy 0, policy_version 640 (0.0006) [2023-03-07 07:36:31,987][155452] Updated weights for policy 0, policy_version 650 (0.0006) [2023-03-07 07:36:32,776][155452] Updated weights for policy 0, policy_version 660 (0.0007) [2023-03-07 07:36:33,367][155126] Fps is (10 sec: 13209.5, 60 sec: 12418.3, 300 sec: 12418.3). Total num frames: 683008. Throughput: 0: 13186.6. Samples: 649963. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:36:33,368][155126] Avg episode reward: [(0, '312.501')] [2023-03-07 07:36:33,544][155452] Updated weights for policy 0, policy_version 670 (0.0006) [2023-03-07 07:36:34,322][155452] Updated weights for policy 0, policy_version 680 (0.0006) [2023-03-07 07:36:35,085][155452] Updated weights for policy 0, policy_version 690 (0.0007) [2023-03-07 07:36:35,865][155452] Updated weights for policy 0, policy_version 700 (0.0006) [2023-03-07 07:36:36,619][155452] Updated weights for policy 0, policy_version 710 (0.0006) [2023-03-07 07:36:37,422][155452] Updated weights for policy 0, policy_version 720 (0.0006) [2023-03-07 07:36:38,188][155452] Updated weights for policy 0, policy_version 730 (0.0006) [2023-03-07 07:36:38,367][155126] Fps is (10 sec: 13209.5, 60 sec: 12492.8, 300 sec: 12492.8). Total num frames: 749568. Throughput: 0: 13181.6. Samples: 729263. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:36:38,368][155126] Avg episode reward: [(0, '495.061')] [2023-03-07 07:36:38,962][155452] Updated weights for policy 0, policy_version 740 (0.0006) [2023-03-07 07:36:39,750][155452] Updated weights for policy 0, policy_version 750 (0.0007) [2023-03-07 07:36:40,523][155452] Updated weights for policy 0, policy_version 760 (0.0006) [2023-03-07 07:36:41,288][155452] Updated weights for policy 0, policy_version 770 (0.0006) [2023-03-07 07:36:42,073][155452] Updated weights for policy 0, policy_version 780 (0.0005) [2023-03-07 07:36:42,849][155452] Updated weights for policy 0, policy_version 790 (0.0006) [2023-03-07 07:36:43,367][155126] Fps is (10 sec: 13209.8, 60 sec: 13192.5, 300 sec: 12540.1). Total num frames: 815104. Throughput: 0: 13171.4. Samples: 808225. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:36:43,367][155126] Avg episode reward: [(0, '514.759')] [2023-03-07 07:36:43,607][155452] Updated weights for policy 0, policy_version 800 (0.0006) [2023-03-07 07:36:44,388][155452] Updated weights for policy 0, policy_version 810 (0.0007) [2023-03-07 07:36:45,160][155452] Updated weights for policy 0, policy_version 820 (0.0007) [2023-03-07 07:36:45,935][155452] Updated weights for policy 0, policy_version 830 (0.0005) [2023-03-07 07:36:46,730][155452] Updated weights for policy 0, policy_version 840 (0.0007) [2023-03-07 07:36:47,504][155452] Updated weights for policy 0, policy_version 850 (0.0006) [2023-03-07 07:36:48,277][155452] Updated weights for policy 0, policy_version 860 (0.0006) [2023-03-07 07:36:48,367][155126] Fps is (10 sec: 13209.8, 60 sec: 13192.5, 300 sec: 12595.2). Total num frames: 881664. Throughput: 0: 13177.8. Samples: 847992. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:36:48,367][155126] Avg episode reward: [(0, '273.085')] [2023-03-07 07:36:49,049][155452] Updated weights for policy 0, policy_version 870 (0.0005) [2023-03-07 07:36:49,829][155452] Updated weights for policy 0, policy_version 880 (0.0006) [2023-03-07 07:36:50,606][155452] Updated weights for policy 0, policy_version 890 (0.0006) [2023-03-07 07:36:51,369][155452] Updated weights for policy 0, policy_version 900 (0.0006) [2023-03-07 07:36:52,163][155452] Updated weights for policy 0, policy_version 910 (0.0006) [2023-03-07 07:36:52,923][155452] Updated weights for policy 0, policy_version 920 (0.0006) [2023-03-07 07:36:53,367][155126] Fps is (10 sec: 13209.4, 60 sec: 13175.5, 300 sec: 12629.3). Total num frames: 947200. Throughput: 0: 13189.0. Samples: 927130. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:36:53,368][155126] Avg episode reward: [(0, '201.452')] [2023-03-07 07:36:53,694][155452] Updated weights for policy 0, policy_version 930 (0.0005) [2023-03-07 07:36:54,485][155452] Updated weights for policy 0, policy_version 940 (0.0006) [2023-03-07 07:36:55,241][155452] Updated weights for policy 0, policy_version 950 (0.0006) [2023-03-07 07:36:56,007][155452] Updated weights for policy 0, policy_version 960 (0.0007) [2023-03-07 07:36:56,797][155452] Updated weights for policy 0, policy_version 970 (0.0006) [2023-03-07 07:36:57,568][155452] Updated weights for policy 0, policy_version 980 (0.0006) [2023-03-07 07:36:58,338][155452] Updated weights for policy 0, policy_version 990 (0.0006) [2023-03-07 07:36:58,367][155126] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 12672.0). Total num frames: 1013760. Throughput: 0: 13196.3. Samples: 1006532. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:36:58,367][155126] Avg episode reward: [(0, '256.337')] [2023-03-07 07:36:59,141][155452] Updated weights for policy 0, policy_version 1000 (0.0006) [2023-03-07 07:36:59,906][155452] Updated weights for policy 0, policy_version 1010 (0.0006) [2023-03-07 07:37:00,675][155452] Updated weights for policy 0, policy_version 1020 (0.0006) [2023-03-07 07:37:01,454][155452] Updated weights for policy 0, policy_version 1030 (0.0006) [2023-03-07 07:37:02,232][155452] Updated weights for policy 0, policy_version 1040 (0.0005) [2023-03-07 07:37:03,007][155452] Updated weights for policy 0, policy_version 1050 (0.0007) [2023-03-07 07:37:03,367][155126] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 12697.6). Total num frames: 1079296. Throughput: 0: 13204.6. Samples: 1046054. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 07:37:03,368][155126] Avg episode reward: [(0, '310.052')] [2023-03-07 07:37:03,768][155452] Updated weights for policy 0, policy_version 1060 (0.0006) [2023-03-07 07:37:04,551][155452] Updated weights for policy 0, policy_version 1070 (0.0006) [2023-03-07 07:37:05,338][155452] Updated weights for policy 0, policy_version 1080 (0.0008) [2023-03-07 07:37:06,134][155452] Updated weights for policy 0, policy_version 1090 (0.0006) [2023-03-07 07:37:06,912][155452] Updated weights for policy 0, policy_version 1100 (0.0007) [2023-03-07 07:37:07,695][155452] Updated weights for policy 0, policy_version 1110 (0.0006) [2023-03-07 07:37:08,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13175.4, 300 sec: 12720.4). Total num frames: 1144832. Throughput: 0: 13192.8. Samples: 1124806. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 07:37:08,367][155126] Avg episode reward: [(0, '600.198')] [2023-03-07 07:37:08,479][155452] Updated weights for policy 0, policy_version 1120 (0.0006) [2023-03-07 07:37:09,265][155452] Updated weights for policy 0, policy_version 1130 (0.0006) [2023-03-07 07:37:10,037][155452] Updated weights for policy 0, policy_version 1140 (0.0005) [2023-03-07 07:37:10,812][155452] Updated weights for policy 0, policy_version 1150 (0.0006) [2023-03-07 07:37:11,580][155452] Updated weights for policy 0, policy_version 1160 (0.0007) [2023-03-07 07:37:12,353][155452] Updated weights for policy 0, policy_version 1170 (0.0006) [2023-03-07 07:37:13,149][155452] Updated weights for policy 0, policy_version 1180 (0.0006) [2023-03-07 07:37:13,367][155126] Fps is (10 sec: 13209.8, 60 sec: 13192.6, 300 sec: 12751.5). Total num frames: 1211392. Throughput: 0: 13190.8. Samples: 1203903. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:37:13,367][155126] Avg episode reward: [(0, '656.590')] [2023-03-07 07:37:13,902][155452] Updated weights for policy 0, policy_version 1190 (0.0007) [2023-03-07 07:37:14,674][155452] Updated weights for policy 0, policy_version 1200 (0.0006) [2023-03-07 07:37:15,441][155452] Updated weights for policy 0, policy_version 1210 (0.0007) [2023-03-07 07:37:16,215][155452] Updated weights for policy 0, policy_version 1220 (0.0006) [2023-03-07 07:37:17,001][155452] Updated weights for policy 0, policy_version 1230 (0.0006) [2023-03-07 07:37:17,780][155452] Updated weights for policy 0, policy_version 1240 (0.0006) [2023-03-07 07:37:18,367][155126] Fps is (10 sec: 13209.7, 60 sec: 13192.6, 300 sec: 12769.3). Total num frames: 1276928. Throughput: 0: 13193.3. Samples: 1243661. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:37:18,367][155126] Avg episode reward: [(0, '375.602')] [2023-03-07 07:37:18,556][155452] Updated weights for policy 0, policy_version 1250 (0.0007) [2023-03-07 07:37:19,322][155452] Updated weights for policy 0, policy_version 1260 (0.0006) [2023-03-07 07:37:20,105][155452] Updated weights for policy 0, policy_version 1270 (0.0006) [2023-03-07 07:37:20,892][155452] Updated weights for policy 0, policy_version 1280 (0.0006) [2023-03-07 07:37:21,649][155452] Updated weights for policy 0, policy_version 1290 (0.0005) [2023-03-07 07:37:22,420][155452] Updated weights for policy 0, policy_version 1300 (0.0005) [2023-03-07 07:37:23,183][155452] Updated weights for policy 0, policy_version 1310 (0.0006) [2023-03-07 07:37:23,367][155126] Fps is (10 sec: 13209.5, 60 sec: 13209.6, 300 sec: 12795.1). Total num frames: 1343488. Throughput: 0: 13191.6. Samples: 1322885. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:37:23,367][155126] Avg episode reward: [(0, '621.659')] [2023-03-07 07:37:23,946][155452] Updated weights for policy 0, policy_version 1320 (0.0006) [2023-03-07 07:37:24,730][155452] Updated weights for policy 0, policy_version 1330 (0.0006) [2023-03-07 07:37:25,511][155452] Updated weights for policy 0, policy_version 1340 (0.0006) [2023-03-07 07:37:26,269][155452] Updated weights for policy 0, policy_version 1350 (0.0007) [2023-03-07 07:37:27,040][155452] Updated weights for policy 0, policy_version 1360 (0.0006) [2023-03-07 07:37:27,810][155452] Updated weights for policy 0, policy_version 1370 (0.0006) [2023-03-07 07:37:28,367][155126] Fps is (10 sec: 13209.5, 60 sec: 13192.5, 300 sec: 12809.3). Total num frames: 1409024. Throughput: 0: 13207.0. Samples: 1402541. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:37:28,367][155126] Avg episode reward: [(0, '421.945')] [2023-03-07 07:37:28,371][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000001377_1410048.pth... [2023-03-07 07:37:28,578][155452] Updated weights for policy 0, policy_version 1380 (0.0006) [2023-03-07 07:37:29,362][155452] Updated weights for policy 0, policy_version 1390 (0.0006) [2023-03-07 07:37:30,143][155452] Updated weights for policy 0, policy_version 1400 (0.0007) [2023-03-07 07:37:30,930][155452] Updated weights for policy 0, policy_version 1410 (0.0006) [2023-03-07 07:37:31,712][155452] Updated weights for policy 0, policy_version 1420 (0.0006) [2023-03-07 07:37:32,497][155452] Updated weights for policy 0, policy_version 1430 (0.0005) [2023-03-07 07:37:33,277][155452] Updated weights for policy 0, policy_version 1440 (0.0006) [2023-03-07 07:37:33,367][155126] Fps is (10 sec: 13209.6, 60 sec: 13209.6, 300 sec: 12831.2). Total num frames: 1475584. Throughput: 0: 13197.4. Samples: 1441873. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 07:37:33,367][155126] Avg episode reward: [(0, '402.794')] [2023-03-07 07:37:34,038][155452] Updated weights for policy 0, policy_version 1450 (0.0006) [2023-03-07 07:37:34,834][155452] Updated weights for policy 0, policy_version 1460 (0.0007) [2023-03-07 07:37:35,618][155452] Updated weights for policy 0, policy_version 1470 (0.0006) [2023-03-07 07:37:36,390][155452] Updated weights for policy 0, policy_version 1480 (0.0006) [2023-03-07 07:37:37,173][155452] Updated weights for policy 0, policy_version 1490 (0.0006) [2023-03-07 07:37:37,955][155452] Updated weights for policy 0, policy_version 1500 (0.0005) [2023-03-07 07:37:38,367][155126] Fps is (10 sec: 13209.6, 60 sec: 13192.6, 300 sec: 12842.7). Total num frames: 1541120. Throughput: 0: 13194.3. Samples: 1520871. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:37:38,367][155126] Avg episode reward: [(0, '279.215')] [2023-03-07 07:37:38,721][155452] Updated weights for policy 0, policy_version 1510 (0.0006) [2023-03-07 07:37:39,493][155452] Updated weights for policy 0, policy_version 1520 (0.0006) [2023-03-07 07:37:40,273][155452] Updated weights for policy 0, policy_version 1530 (0.0006) [2023-03-07 07:37:41,049][155452] Updated weights for policy 0, policy_version 1540 (0.0006) [2023-03-07 07:37:41,818][155452] Updated weights for policy 0, policy_version 1550 (0.0006) [2023-03-07 07:37:42,595][155452] Updated weights for policy 0, policy_version 1560 (0.0006) [2023-03-07 07:37:43,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13192.5, 300 sec: 12853.2). Total num frames: 1606656. Throughput: 0: 13186.2. Samples: 1599911. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:37:43,368][155126] Avg episode reward: [(0, '648.801')] [2023-03-07 07:37:43,401][155452] Updated weights for policy 0, policy_version 1570 (0.0006) [2023-03-07 07:37:44,167][155452] Updated weights for policy 0, policy_version 1580 (0.0006) [2023-03-07 07:37:44,942][155452] Updated weights for policy 0, policy_version 1590 (0.0006) [2023-03-07 07:37:45,733][155452] Updated weights for policy 0, policy_version 1600 (0.0006) [2023-03-07 07:37:46,505][155452] Updated weights for policy 0, policy_version 1610 (0.0006) [2023-03-07 07:37:47,292][155452] Updated weights for policy 0, policy_version 1620 (0.0006) [2023-03-07 07:37:48,070][155452] Updated weights for policy 0, policy_version 1630 (0.0006) [2023-03-07 07:37:48,367][155126] Fps is (10 sec: 13107.0, 60 sec: 13175.4, 300 sec: 12863.0). Total num frames: 1672192. Throughput: 0: 13181.6. Samples: 1639228. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:37:48,368][155126] Avg episode reward: [(0, '928.711')] [2023-03-07 07:37:48,378][155401] Saving new best policy, reward=928.711! [2023-03-07 07:37:48,837][155452] Updated weights for policy 0, policy_version 1640 (0.0005) [2023-03-07 07:37:49,613][155452] Updated weights for policy 0, policy_version 1650 (0.0006) [2023-03-07 07:37:50,388][155452] Updated weights for policy 0, policy_version 1660 (0.0006) [2023-03-07 07:37:51,170][155452] Updated weights for policy 0, policy_version 1670 (0.0006) [2023-03-07 07:37:51,956][155452] Updated weights for policy 0, policy_version 1680 (0.0006) [2023-03-07 07:37:52,746][155452] Updated weights for policy 0, policy_version 1690 (0.0006) [2023-03-07 07:37:53,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13175.5, 300 sec: 12872.1). Total num frames: 1737728. Throughput: 0: 13185.6. Samples: 1718159. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:37:53,367][155126] Avg episode reward: [(0, '870.398')] [2023-03-07 07:37:53,525][155452] Updated weights for policy 0, policy_version 1700 (0.0006) [2023-03-07 07:37:54,288][155452] Updated weights for policy 0, policy_version 1710 (0.0006) [2023-03-07 07:37:55,096][155452] Updated weights for policy 0, policy_version 1720 (0.0007) [2023-03-07 07:37:55,861][155452] Updated weights for policy 0, policy_version 1730 (0.0006) [2023-03-07 07:37:56,632][155452] Updated weights for policy 0, policy_version 1740 (0.0006) [2023-03-07 07:37:57,424][155452] Updated weights for policy 0, policy_version 1750 (0.0006) [2023-03-07 07:37:58,201][155452] Updated weights for policy 0, policy_version 1760 (0.0006) [2023-03-07 07:37:58,367][155126] Fps is (10 sec: 13209.9, 60 sec: 13175.5, 300 sec: 12887.8). Total num frames: 1804288. Throughput: 0: 13175.0. Samples: 1796780. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:37:58,367][155126] Avg episode reward: [(0, '871.910')] [2023-03-07 07:37:58,972][155452] Updated weights for policy 0, policy_version 1770 (0.0007) [2023-03-07 07:37:59,757][155452] Updated weights for policy 0, policy_version 1780 (0.0006) [2023-03-07 07:38:00,524][155452] Updated weights for policy 0, policy_version 1790 (0.0006) [2023-03-07 07:38:01,302][155452] Updated weights for policy 0, policy_version 1800 (0.0006) [2023-03-07 07:38:02,087][155452] Updated weights for policy 0, policy_version 1810 (0.0006) [2023-03-07 07:38:02,871][155452] Updated weights for policy 0, policy_version 1820 (0.0006) [2023-03-07 07:38:03,367][155126] Fps is (10 sec: 13209.5, 60 sec: 13175.5, 300 sec: 12895.3). Total num frames: 1869824. Throughput: 0: 13175.2. Samples: 1836546. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 07:38:03,368][155126] Avg episode reward: [(0, '759.321')] [2023-03-07 07:38:03,651][155452] Updated weights for policy 0, policy_version 1830 (0.0006) [2023-03-07 07:38:04,429][155452] Updated weights for policy 0, policy_version 1840 (0.0005) [2023-03-07 07:38:05,205][155452] Updated weights for policy 0, policy_version 1850 (0.0006) [2023-03-07 07:38:05,985][155452] Updated weights for policy 0, policy_version 1860 (0.0007) [2023-03-07 07:38:06,753][155452] Updated weights for policy 0, policy_version 1870 (0.0006) [2023-03-07 07:38:07,541][155452] Updated weights for policy 0, policy_version 1880 (0.0006) [2023-03-07 07:38:08,333][155452] Updated weights for policy 0, policy_version 1890 (0.0006) [2023-03-07 07:38:08,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13175.5, 300 sec: 12902.4). Total num frames: 1935360. Throughput: 0: 13168.3. Samples: 1915456. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:38:08,367][155126] Avg episode reward: [(0, '755.557')] [2023-03-07 07:38:09,080][155452] Updated weights for policy 0, policy_version 1900 (0.0007) [2023-03-07 07:38:09,890][155452] Updated weights for policy 0, policy_version 1910 (0.0007) [2023-03-07 07:38:10,664][155452] Updated weights for policy 0, policy_version 1920 (0.0006) [2023-03-07 07:38:11,453][155452] Updated weights for policy 0, policy_version 1930 (0.0006) [2023-03-07 07:38:12,249][155452] Updated weights for policy 0, policy_version 1940 (0.0007) [2023-03-07 07:38:13,025][155452] Updated weights for policy 0, policy_version 1950 (0.0006) [2023-03-07 07:38:13,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 12909.0). Total num frames: 2000896. Throughput: 0: 13143.0. Samples: 1993977. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 07:38:13,368][155126] Avg episode reward: [(0, '855.398')] [2023-03-07 07:38:13,819][155452] Updated weights for policy 0, policy_version 1960 (0.0007) [2023-03-07 07:38:14,611][155452] Updated weights for policy 0, policy_version 1970 (0.0006) [2023-03-07 07:38:15,390][155452] Updated weights for policy 0, policy_version 1980 (0.0006) [2023-03-07 07:38:16,174][155452] Updated weights for policy 0, policy_version 1990 (0.0006) [2023-03-07 07:38:16,971][155452] Updated weights for policy 0, policy_version 2000 (0.0007) [2023-03-07 07:38:17,766][155452] Updated weights for policy 0, policy_version 2010 (0.0006) [2023-03-07 07:38:18,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13141.3, 300 sec: 12908.8). Total num frames: 2065408. Throughput: 0: 13132.0. Samples: 2032811. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 07:38:18,367][155126] Avg episode reward: [(0, '870.075')] [2023-03-07 07:38:18,526][155452] Updated weights for policy 0, policy_version 2020 (0.0006) [2023-03-07 07:38:19,310][155452] Updated weights for policy 0, policy_version 2030 (0.0006) [2023-03-07 07:38:20,093][155452] Updated weights for policy 0, policy_version 2040 (0.0006) [2023-03-07 07:38:20,861][155452] Updated weights for policy 0, policy_version 2050 (0.0007) [2023-03-07 07:38:21,645][155452] Updated weights for policy 0, policy_version 2060 (0.0006) [2023-03-07 07:38:22,418][155452] Updated weights for policy 0, policy_version 2070 (0.0006) [2023-03-07 07:38:23,197][155452] Updated weights for policy 0, policy_version 2080 (0.0007) [2023-03-07 07:38:23,367][155126] Fps is (10 sec: 13107.4, 60 sec: 13141.3, 300 sec: 12921.0). Total num frames: 2131968. Throughput: 0: 13123.8. Samples: 2111442. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:38:23,368][155126] Avg episode reward: [(0, '950.141')] [2023-03-07 07:38:23,368][155401] Saving new best policy, reward=950.141! 
[2023-03-07 07:38:23,981][155452] Updated weights for policy 0, policy_version 2090 (0.0006) [2023-03-07 07:38:24,758][155452] Updated weights for policy 0, policy_version 2100 (0.0006) [2023-03-07 07:38:25,541][155452] Updated weights for policy 0, policy_version 2110 (0.0006) [2023-03-07 07:38:26,322][155452] Updated weights for policy 0, policy_version 2120 (0.0007) [2023-03-07 07:38:27,110][155452] Updated weights for policy 0, policy_version 2130 (0.0006) [2023-03-07 07:38:27,888][155452] Updated weights for policy 0, policy_version 2140 (0.0006) [2023-03-07 07:38:28,367][155126] Fps is (10 sec: 13209.5, 60 sec: 13141.3, 300 sec: 12926.5). Total num frames: 2197504. Throughput: 0: 13120.1. Samples: 2190317. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:38:28,368][155126] Avg episode reward: [(0, '1097.012')] [2023-03-07 07:38:28,383][155401] Saving new best policy, reward=1097.012! [2023-03-07 07:38:28,661][155452] Updated weights for policy 0, policy_version 2150 (0.0006) [2023-03-07 07:38:29,450][155452] Updated weights for policy 0, policy_version 2160 (0.0007) [2023-03-07 07:38:30,224][155452] Updated weights for policy 0, policy_version 2170 (0.0006) [2023-03-07 07:38:31,013][155452] Updated weights for policy 0, policy_version 2180 (0.0007) [2023-03-07 07:38:31,815][155452] Updated weights for policy 0, policy_version 2190 (0.0006) [2023-03-07 07:38:32,582][155452] Updated weights for policy 0, policy_version 2200 (0.0007) [2023-03-07 07:38:33,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13107.2, 300 sec: 12925.8). Total num frames: 2262016. Throughput: 0: 13117.0. Samples: 2229491. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 07:38:33,367][155126] Avg episode reward: [(0, '494.974')] [2023-03-07 07:38:33,371][155452] Updated weights for policy 0, policy_version 2210 (0.0006) [2023-03-07 07:38:34,149][155452] Updated weights for policy 0, policy_version 2220 (0.0007) [2023-03-07 07:38:34,901][155452] Updated weights for policy 0, policy_version 2230 (0.0006) [2023-03-07 07:38:35,692][155452] Updated weights for policy 0, policy_version 2240 (0.0006) [2023-03-07 07:38:36,470][155452] Updated weights for policy 0, policy_version 2250 (0.0006) [2023-03-07 07:38:37,242][155452] Updated weights for policy 0, policy_version 2260 (0.0006) [2023-03-07 07:38:38,023][155452] Updated weights for policy 0, policy_version 2270 (0.0005) [2023-03-07 07:38:38,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13124.2, 300 sec: 12936.5). Total num frames: 2328576. Throughput: 0: 13114.0. Samples: 2308290. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:38:38,368][155126] Avg episode reward: [(0, '567.778')] [2023-03-07 07:38:38,798][155452] Updated weights for policy 0, policy_version 2280 (0.0006) [2023-03-07 07:38:39,572][155452] Updated weights for policy 0, policy_version 2290 (0.0006) [2023-03-07 07:38:40,348][155452] Updated weights for policy 0, policy_version 2300 (0.0005) [2023-03-07 07:38:41,128][155452] Updated weights for policy 0, policy_version 2310 (0.0005) [2023-03-07 07:38:41,912][155452] Updated weights for policy 0, policy_version 2320 (0.0007) [2023-03-07 07:38:42,692][155452] Updated weights for policy 0, policy_version 2330 (0.0007) [2023-03-07 07:38:43,367][155126] Fps is (10 sec: 13209.5, 60 sec: 13124.3, 300 sec: 12941.1). Total num frames: 2394112. Throughput: 0: 13124.8. Samples: 2387398. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:38:43,368][155126] Avg episode reward: [(0, '586.911')] [2023-03-07 07:38:43,479][155452] Updated weights for policy 0, policy_version 2340 (0.0006) [2023-03-07 07:38:44,250][155452] Updated weights for policy 0, policy_version 2350 (0.0006) [2023-03-07 07:38:45,000][155452] Updated weights for policy 0, policy_version 2360 (0.0006) [2023-03-07 07:38:45,793][155452] Updated weights for policy 0, policy_version 2370 (0.0006) [2023-03-07 07:38:46,570][155452] Updated weights for policy 0, policy_version 2380 (0.0006) [2023-03-07 07:38:47,349][155452] Updated weights for policy 0, policy_version 2390 (0.0006) [2023-03-07 07:38:48,128][155452] Updated weights for policy 0, policy_version 2400 (0.0006) [2023-03-07 07:38:48,367][155126] Fps is (10 sec: 13209.8, 60 sec: 13141.4, 300 sec: 12950.9). Total num frames: 2460672. Throughput: 0: 13119.7. Samples: 2426932. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:38:48,367][155126] Avg episode reward: [(0, '505.721')] [2023-03-07 07:38:48,891][155452] Updated weights for policy 0, policy_version 2410 (0.0006) [2023-03-07 07:38:49,671][155452] Updated weights for policy 0, policy_version 2420 (0.0006) [2023-03-07 07:38:50,456][155452] Updated weights for policy 0, policy_version 2430 (0.0007) [2023-03-07 07:38:51,220][155452] Updated weights for policy 0, policy_version 2440 (0.0007) [2023-03-07 07:38:51,998][155452] Updated weights for policy 0, policy_version 2450 (0.0006) [2023-03-07 07:38:52,762][155452] Updated weights for policy 0, policy_version 2460 (0.0007) [2023-03-07 07:38:53,367][155126] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 12954.9). Total num frames: 2526208. Throughput: 0: 13125.5. Samples: 2506104. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:38:53,368][155126] Avg episode reward: [(0, '474.138')] [2023-03-07 07:38:53,545][155452] Updated weights for policy 0, policy_version 2470 (0.0006) [2023-03-07 07:38:54,332][155452] Updated weights for policy 0, policy_version 2480 (0.0005) [2023-03-07 07:38:55,080][155452] Updated weights for policy 0, policy_version 2490 (0.0006) [2023-03-07 07:38:55,862][155452] Updated weights for policy 0, policy_version 2500 (0.0006) [2023-03-07 07:38:56,649][155452] Updated weights for policy 0, policy_version 2510 (0.0007) [2023-03-07 07:38:57,418][155452] Updated weights for policy 0, policy_version 2520 (0.0006) [2023-03-07 07:38:58,209][155452] Updated weights for policy 0, policy_version 2530 (0.0006) [2023-03-07 07:38:58,367][155126] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 12963.8). Total num frames: 2592768. Throughput: 0: 13140.7. Samples: 2585305. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:38:58,367][155126] Avg episode reward: [(0, '510.790')] [2023-03-07 07:38:58,983][155452] Updated weights for policy 0, policy_version 2540 (0.0005) [2023-03-07 07:38:59,777][155452] Updated weights for policy 0, policy_version 2550 (0.0005) [2023-03-07 07:39:00,561][155452] Updated weights for policy 0, policy_version 2560 (0.0005) [2023-03-07 07:39:01,320][155452] Updated weights for policy 0, policy_version 2570 (0.0005) [2023-03-07 07:39:02,089][155452] Updated weights for policy 0, policy_version 2580 (0.0006) [2023-03-07 07:39:02,871][155452] Updated weights for policy 0, policy_version 2590 (0.0006) [2023-03-07 07:39:03,367][155126] Fps is (10 sec: 13209.8, 60 sec: 13141.4, 300 sec: 12967.3). Total num frames: 2658304. Throughput: 0: 13158.0. Samples: 2624919. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:39:03,367][155126] Avg episode reward: [(0, '537.410')] [2023-03-07 07:39:03,658][155452] Updated weights for policy 0, policy_version 2600 (0.0005) [2023-03-07 07:39:04,411][155452] Updated weights for policy 0, policy_version 2610 (0.0006) [2023-03-07 07:39:05,201][155452] Updated weights for policy 0, policy_version 2620 (0.0006) [2023-03-07 07:39:05,984][155452] Updated weights for policy 0, policy_version 2630 (0.0007) [2023-03-07 07:39:06,747][155452] Updated weights for policy 0, policy_version 2640 (0.0006) [2023-03-07 07:39:07,526][155452] Updated weights for policy 0, policy_version 2650 (0.0006) [2023-03-07 07:39:08,279][155452] Updated weights for policy 0, policy_version 2660 (0.0005) [2023-03-07 07:39:08,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 12970.7). Total num frames: 2723840. Throughput: 0: 13165.6. Samples: 2703894. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:39:08,367][155126] Avg episode reward: [(0, '590.288')] [2023-03-07 07:39:09,061][155452] Updated weights for policy 0, policy_version 2670 (0.0005) [2023-03-07 07:39:09,823][155452] Updated weights for policy 0, policy_version 2680 (0.0008) [2023-03-07 07:39:10,600][155452] Updated weights for policy 0, policy_version 2690 (0.0006) [2023-03-07 07:39:11,378][155452] Updated weights for policy 0, policy_version 2700 (0.0006) [2023-03-07 07:39:12,163][155452] Updated weights for policy 0, policy_version 2710 (0.0006) [2023-03-07 07:39:12,933][155452] Updated weights for policy 0, policy_version 2720 (0.0006) [2023-03-07 07:39:13,367][155126] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 12978.6). Total num frames: 2790400. Throughput: 0: 13177.0. Samples: 2783282. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:39:13,367][155126] Avg episode reward: [(0, '653.651')] [2023-03-07 07:39:13,717][155452] Updated weights for policy 0, policy_version 2730 (0.0006) [2023-03-07 07:39:14,468][155452] Updated weights for policy 0, policy_version 2740 (0.0006) [2023-03-07 07:39:15,261][155452] Updated weights for policy 0, policy_version 2750 (0.0006) [2023-03-07 07:39:16,045][155452] Updated weights for policy 0, policy_version 2760 (0.0006) [2023-03-07 07:39:16,808][155452] Updated weights for policy 0, policy_version 2770 (0.0006) [2023-03-07 07:39:17,585][155452] Updated weights for policy 0, policy_version 2780 (0.0006) [2023-03-07 07:39:18,364][155452] Updated weights for policy 0, policy_version 2790 (0.0006) [2023-03-07 07:39:18,367][155126] Fps is (10 sec: 13312.1, 60 sec: 13192.5, 300 sec: 12986.2). Total num frames: 2856960. Throughput: 0: 13189.6. Samples: 2823020. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:39:18,375][155126] Avg episode reward: [(0, '508.891')] [2023-03-07 07:39:19,158][155452] Updated weights for policy 0, policy_version 2800 (0.0006) [2023-03-07 07:39:19,931][155452] Updated weights for policy 0, policy_version 2810 (0.0006) [2023-03-07 07:39:20,716][155452] Updated weights for policy 0, policy_version 2820 (0.0006) [2023-03-07 07:39:21,482][155452] Updated weights for policy 0, policy_version 2830 (0.0006) [2023-03-07 07:39:22,261][155452] Updated weights for policy 0, policy_version 2840 (0.0006) [2023-03-07 07:39:23,041][155452] Updated weights for policy 0, policy_version 2850 (0.0006) [2023-03-07 07:39:23,367][155126] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 12988.9). Total num frames: 2922496. Throughput: 0: 13197.4. Samples: 2902170. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:39:23,378][155126] Avg episode reward: [(0, '580.985')] [2023-03-07 07:39:23,821][155452] Updated weights for policy 0, policy_version 2860 (0.0005) [2023-03-07 07:39:24,584][155452] Updated weights for policy 0, policy_version 2870 (0.0007) [2023-03-07 07:39:25,374][155452] Updated weights for policy 0, policy_version 2880 (0.0006) [2023-03-07 07:39:26,169][155452] Updated weights for policy 0, policy_version 2890 (0.0006) [2023-03-07 07:39:26,922][155452] Updated weights for policy 0, policy_version 2900 (0.0006) [2023-03-07 07:39:27,692][155452] Updated weights for policy 0, policy_version 2910 (0.0006) [2023-03-07 07:39:28,367][155126] Fps is (10 sec: 13106.9, 60 sec: 13175.4, 300 sec: 12991.4). Total num frames: 2988032. Throughput: 0: 13190.4. Samples: 2980969. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:39:28,378][155126] Avg episode reward: [(0, '546.692')] [2023-03-07 07:39:28,383][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000002918_2988032.pth... [2023-03-07 07:39:28,478][155452] Updated weights for policy 0, policy_version 2920 (0.0006) [2023-03-07 07:39:29,270][155452] Updated weights for policy 0, policy_version 2930 (0.0006) [2023-03-07 07:39:30,039][155452] Updated weights for policy 0, policy_version 2940 (0.0006) [2023-03-07 07:39:30,824][155452] Updated weights for policy 0, policy_version 2950 (0.0006) [2023-03-07 07:39:31,601][155452] Updated weights for policy 0, policy_version 2960 (0.0005) [2023-03-07 07:39:32,384][155452] Updated weights for policy 0, policy_version 2970 (0.0006) [2023-03-07 07:39:33,166][155452] Updated weights for policy 0, policy_version 2980 (0.0006) [2023-03-07 07:39:33,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13192.5, 300 sec: 12993.9). Total num frames: 3053568. Throughput: 0: 13188.7. Samples: 3020423. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:39:33,367][155126] Avg episode reward: [(0, '610.576')] [2023-03-07 07:39:33,937][155452] Updated weights for policy 0, policy_version 2990 (0.0006) [2023-03-07 07:39:34,709][155452] Updated weights for policy 0, policy_version 3000 (0.0006) [2023-03-07 07:39:35,473][155452] Updated weights for policy 0, policy_version 3010 (0.0006) [2023-03-07 07:39:36,255][155452] Updated weights for policy 0, policy_version 3020 (0.0006) [2023-03-07 07:39:37,033][155452] Updated weights for policy 0, policy_version 3030 (0.0006) [2023-03-07 07:39:37,805][155452] Updated weights for policy 0, policy_version 3040 (0.0006) [2023-03-07 07:39:38,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13175.5, 300 sec: 12996.3). Total num frames: 3119104. Throughput: 0: 13188.1. Samples: 3099568. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:39:38,378][155126] Avg episode reward: [(0, '521.444')] [2023-03-07 07:39:38,570][155452] Updated weights for policy 0, policy_version 3050 (0.0006) [2023-03-07 07:39:39,362][155452] Updated weights for policy 0, policy_version 3060 (0.0006) [2023-03-07 07:39:40,142][155452] Updated weights for policy 0, policy_version 3070 (0.0006) [2023-03-07 07:39:40,921][155452] Updated weights for policy 0, policy_version 3080 (0.0006) [2023-03-07 07:39:41,697][155452] Updated weights for policy 0, policy_version 3090 (0.0007) [2023-03-07 07:39:42,478][155452] Updated weights for policy 0, policy_version 3100 (0.0006) [2023-03-07 07:39:43,265][155452] Updated weights for policy 0, policy_version 3110 (0.0006) [2023-03-07 07:39:43,367][155126] Fps is (10 sec: 13209.6, 60 sec: 13192.6, 300 sec: 13002.7). Total num frames: 3185664. Throughput: 0: 13183.4. Samples: 3178556. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:39:43,367][155126] Avg episode reward: [(0, '605.224')] [2023-03-07 07:39:44,028][155452] Updated weights for policy 0, policy_version 3120 (0.0006) [2023-03-07 07:39:44,804][155452] Updated weights for policy 0, policy_version 3130 (0.0006) [2023-03-07 07:39:45,589][155452] Updated weights for policy 0, policy_version 3140 (0.0006) [2023-03-07 07:39:46,359][155452] Updated weights for policy 0, policy_version 3150 (0.0007) [2023-03-07 07:39:47,143][155452] Updated weights for policy 0, policy_version 3160 (0.0006) [2023-03-07 07:39:47,915][155452] Updated weights for policy 0, policy_version 3170 (0.0005) [2023-03-07 07:39:48,367][155126] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13004.8). Total num frames: 3251200. Throughput: 0: 13180.4. Samples: 3218038. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:39:48,367][155126] Avg episode reward: [(0, '475.885')] [2023-03-07 07:39:48,672][155452] Updated weights for policy 0, policy_version 3180 (0.0007) [2023-03-07 07:39:49,450][155452] Updated weights for policy 0, policy_version 3190 (0.0006) [2023-03-07 07:39:50,228][155452] Updated weights for policy 0, policy_version 3200 (0.0007) [2023-03-07 07:39:50,993][155452] Updated weights for policy 0, policy_version 3210 (0.0008) [2023-03-07 07:39:51,769][155452] Updated weights for policy 0, policy_version 3220 (0.0006) [2023-03-07 07:39:52,557][155452] Updated weights for policy 0, policy_version 3230 (0.0006) [2023-03-07 07:39:53,327][155452] Updated weights for policy 0, policy_version 3240 (0.0006) [2023-03-07 07:39:53,367][155126] Fps is (10 sec: 13209.5, 60 sec: 13192.5, 300 sec: 13010.8). Total num frames: 3317760. Throughput: 0: 13189.6. Samples: 3297426. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:39:53,368][155126] Avg episode reward: [(0, '466.966')] [2023-03-07 07:39:54,089][155452] Updated weights for policy 0, policy_version 3250 (0.0006) [2023-03-07 07:39:54,862][155452] Updated weights for policy 0, policy_version 3260 (0.0007) [2023-03-07 07:39:55,633][155452] Updated weights for policy 0, policy_version 3270 (0.0006) [2023-03-07 07:39:56,406][155452] Updated weights for policy 0, policy_version 3280 (0.0006) [2023-03-07 07:39:57,190][155452] Updated weights for policy 0, policy_version 3290 (0.0006) [2023-03-07 07:39:57,990][155452] Updated weights for policy 0, policy_version 3300 (0.0007) [2023-03-07 07:39:58,367][155126] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13012.7). Total num frames: 3383296. Throughput: 0: 13186.5. Samples: 3376674. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:39:58,367][155126] Avg episode reward: [(0, '544.584')] [2023-03-07 07:39:58,744][155452] Updated weights for policy 0, policy_version 3310 (0.0005) [2023-03-07 07:39:59,529][155452] Updated weights for policy 0, policy_version 3320 (0.0006) [2023-03-07 07:40:00,291][155452] Updated weights for policy 0, policy_version 3330 (0.0005) [2023-03-07 07:40:01,047][155452] Updated weights for policy 0, policy_version 3340 (0.0006) [2023-03-07 07:40:01,825][155452] Updated weights for policy 0, policy_version 3350 (0.0006) [2023-03-07 07:40:02,615][155452] Updated weights for policy 0, policy_version 3360 (0.0006) [2023-03-07 07:40:03,367][155126] Fps is (10 sec: 13209.7, 60 sec: 13192.5, 300 sec: 13018.3). Total num frames: 3449856. Throughput: 0: 13190.2. Samples: 3416580. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 07:40:03,368][155126] Avg episode reward: [(0, '482.941')] [2023-03-07 07:40:03,388][155452] Updated weights for policy 0, policy_version 3370 (0.0007) [2023-03-07 07:40:04,154][155452] Updated weights for policy 0, policy_version 3380 (0.0006) [2023-03-07 07:40:04,942][155452] Updated weights for policy 0, policy_version 3390 (0.0007) [2023-03-07 07:40:05,722][155452] Updated weights for policy 0, policy_version 3400 (0.0006) [2023-03-07 07:40:06,503][155452] Updated weights for policy 0, policy_version 3410 (0.0006) [2023-03-07 07:40:07,284][155452] Updated weights for policy 0, policy_version 3420 (0.0006) [2023-03-07 07:40:08,045][155452] Updated weights for policy 0, policy_version 3430 (0.0006) [2023-03-07 07:40:08,367][155126] Fps is (10 sec: 13312.1, 60 sec: 13209.6, 300 sec: 13023.8). Total num frames: 3516416. Throughput: 0: 13188.4. Samples: 3495646. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:40:08,367][155126] Avg episode reward: [(0, '564.021')] [2023-03-07 07:40:08,838][155452] Updated weights for policy 0, policy_version 3440 (0.0006) [2023-03-07 07:40:09,626][155452] Updated weights for policy 0, policy_version 3450 (0.0006) [2023-03-07 07:40:10,395][155452] Updated weights for policy 0, policy_version 3460 (0.0006) [2023-03-07 07:40:11,175][155452] Updated weights for policy 0, policy_version 3470 (0.0006) [2023-03-07 07:40:11,941][155452] Updated weights for policy 0, policy_version 3480 (0.0006) [2023-03-07 07:40:12,733][155452] Updated weights for policy 0, policy_version 3490 (0.0006) [2023-03-07 07:40:13,367][155126] Fps is (10 sec: 13209.5, 60 sec: 13192.5, 300 sec: 13025.3). Total num frames: 3581952. Throughput: 0: 13194.4. Samples: 3574718. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:40:13,367][155126] Avg episode reward: [(0, '631.982')] [2023-03-07 07:40:13,506][155452] Updated weights for policy 0, policy_version 3500 (0.0006) [2023-03-07 07:40:14,273][155452] Updated weights for policy 0, policy_version 3510 (0.0005) [2023-03-07 07:40:15,070][155452] Updated weights for policy 0, policy_version 3520 (0.0007) [2023-03-07 07:40:15,820][155452] Updated weights for policy 0, policy_version 3530 (0.0007) [2023-03-07 07:40:16,618][155452] Updated weights for policy 0, policy_version 3540 (0.0006) [2023-03-07 07:40:17,389][155452] Updated weights for policy 0, policy_version 3550 (0.0006) [2023-03-07 07:40:18,159][155452] Updated weights for policy 0, policy_version 3560 (0.0006) [2023-03-07 07:40:18,367][155126] Fps is (10 sec: 13106.9, 60 sec: 13175.4, 300 sec: 13026.7). Total num frames: 3647488. Throughput: 0: 13195.9. Samples: 3614240. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:40:18,368][155126] Avg episode reward: [(0, '657.715')] [2023-03-07 07:40:18,937][155452] Updated weights for policy 0, policy_version 3570 (0.0006) [2023-03-07 07:40:19,721][155452] Updated weights for policy 0, policy_version 3580 (0.0006) [2023-03-07 07:40:20,511][155452] Updated weights for policy 0, policy_version 3590 (0.0006) [2023-03-07 07:40:21,298][155452] Updated weights for policy 0, policy_version 3600 (0.0006) [2023-03-07 07:40:22,080][155452] Updated weights for policy 0, policy_version 3610 (0.0007) [2023-03-07 07:40:22,859][155452] Updated weights for policy 0, policy_version 3620 (0.0006) [2023-03-07 07:40:23,367][155126] Fps is (10 sec: 13107.4, 60 sec: 13175.5, 300 sec: 13028.2). Total num frames: 3713024. Throughput: 0: 13186.8. Samples: 3692974. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:40:23,367][155126] Avg episode reward: [(0, '671.509')] [2023-03-07 07:40:23,639][155452] Updated weights for policy 0, policy_version 3630 (0.0007) [2023-03-07 07:40:24,405][155452] Updated weights for policy 0, policy_version 3640 (0.0005) [2023-03-07 07:40:25,191][155452] Updated weights for policy 0, policy_version 3650 (0.0006) [2023-03-07 07:40:25,974][155452] Updated weights for policy 0, policy_version 3660 (0.0006) [2023-03-07 07:40:26,755][155452] Updated weights for policy 0, policy_version 3670 (0.0006) [2023-03-07 07:40:27,513][155452] Updated weights for policy 0, policy_version 3680 (0.0006) [2023-03-07 07:40:28,334][155452] Updated weights for policy 0, policy_version 3690 (0.0005) [2023-03-07 07:40:28,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13175.5, 300 sec: 13029.5). Total num frames: 3778560. Throughput: 0: 13186.3. Samples: 3771942. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:40:28,368][155126] Avg episode reward: [(0, '668.602')] [2023-03-07 07:40:29,115][155452] Updated weights for policy 0, policy_version 3700 (0.0006) [2023-03-07 07:40:29,895][155452] Updated weights for policy 0, policy_version 3710 (0.0007) [2023-03-07 07:40:30,677][155452] Updated weights for policy 0, policy_version 3720 (0.0006) [2023-03-07 07:40:31,463][155452] Updated weights for policy 0, policy_version 3730 (0.0005) [2023-03-07 07:40:32,236][155452] Updated weights for policy 0, policy_version 3740 (0.0006) [2023-03-07 07:40:33,021][155452] Updated weights for policy 0, policy_version 3750 (0.0006) [2023-03-07 07:40:33,367][155126] Fps is (10 sec: 13107.0, 60 sec: 13175.4, 300 sec: 13030.8). Total num frames: 3844096. Throughput: 0: 13170.7. Samples: 3810719. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 07:40:33,368][155126] Avg episode reward: [(0, '951.571')] [2023-03-07 07:40:33,813][155452] Updated weights for policy 0, policy_version 3760 (0.0006) [2023-03-07 07:40:34,610][155452] Updated weights for policy 0, policy_version 3770 (0.0006) [2023-03-07 07:40:35,384][155452] Updated weights for policy 0, policy_version 3780 (0.0006) [2023-03-07 07:40:36,170][155452] Updated weights for policy 0, policy_version 3790 (0.0005) [2023-03-07 07:40:36,951][155452] Updated weights for policy 0, policy_version 3800 (0.0006) [2023-03-07 07:40:37,723][155452] Updated weights for policy 0, policy_version 3810 (0.0006) [2023-03-07 07:40:38,367][155126] Fps is (10 sec: 13107.4, 60 sec: 13175.5, 300 sec: 13173.2). Total num frames: 3909632. Throughput: 0: 13152.6. Samples: 3889291. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 07:40:38,367][155126] Avg episode reward: [(0, '1314.405')] [2023-03-07 07:40:38,370][155401] Saving new best policy, reward=1314.405! [2023-03-07 07:40:38,517][155452] Updated weights for policy 0, policy_version 3820 (0.0006) [2023-03-07 07:40:39,296][155452] Updated weights for policy 0, policy_version 3830 (0.0006) [2023-03-07 07:40:40,080][155452] Updated weights for policy 0, policy_version 3840 (0.0006) [2023-03-07 07:40:40,850][155452] Updated weights for policy 0, policy_version 3850 (0.0006) [2023-03-07 07:40:41,641][155452] Updated weights for policy 0, policy_version 3860 (0.0006) [2023-03-07 07:40:42,450][155452] Updated weights for policy 0, policy_version 3870 (0.0006) [2023-03-07 07:40:43,228][155452] Updated weights for policy 0, policy_version 3880 (0.0005) [2023-03-07 07:40:43,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13141.3, 300 sec: 13166.2). Total num frames: 3974144. Throughput: 0: 13128.7. Samples: 3967466. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:40:43,367][155126] Avg episode reward: [(0, '985.663')] [2023-03-07 07:40:44,022][155452] Updated weights for policy 0, policy_version 3890 (0.0006) [2023-03-07 07:40:44,827][155452] Updated weights for policy 0, policy_version 3900 (0.0007) [2023-03-07 07:40:45,594][155452] Updated weights for policy 0, policy_version 3910 (0.0006) [2023-03-07 07:40:46,404][155452] Updated weights for policy 0, policy_version 3920 (0.0007) [2023-03-07 07:40:47,186][155452] Updated weights for policy 0, policy_version 3930 (0.0006) [2023-03-07 07:40:47,993][155452] Updated weights for policy 0, policy_version 3940 (0.0007) [2023-03-07 07:40:48,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13141.3, 300 sec: 13162.7). Total num frames: 4039680. Throughput: 0: 13107.3. Samples: 4006408. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:40:48,368][155126] Avg episode reward: [(0, '1337.732')] [2023-03-07 07:40:48,372][155401] Saving new best policy, reward=1337.732! [2023-03-07 07:40:48,771][155452] Updated weights for policy 0, policy_version 3950 (0.0006) [2023-03-07 07:40:49,561][155452] Updated weights for policy 0, policy_version 3960 (0.0006) [2023-03-07 07:40:50,343][155452] Updated weights for policy 0, policy_version 3970 (0.0005) [2023-03-07 07:40:51,151][155452] Updated weights for policy 0, policy_version 3980 (0.0006) [2023-03-07 07:40:51,936][155452] Updated weights for policy 0, policy_version 3990 (0.0005) [2023-03-07 07:40:52,704][155452] Updated weights for policy 0, policy_version 4000 (0.0006) [2023-03-07 07:40:53,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13107.2, 300 sec: 13159.3). Total num frames: 4104192. Throughput: 0: 13074.5. Samples: 4083999. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:40:53,378][155126] Avg episode reward: [(0, '1226.979')] [2023-03-07 07:40:53,497][155452] Updated weights for policy 0, policy_version 4010 (0.0005) [2023-03-07 07:40:54,285][155452] Updated weights for policy 0, policy_version 4020 (0.0006) [2023-03-07 07:40:55,076][155452] Updated weights for policy 0, policy_version 4030 (0.0006) [2023-03-07 07:40:55,844][155452] Updated weights for policy 0, policy_version 4040 (0.0007) [2023-03-07 07:40:56,647][155452] Updated weights for policy 0, policy_version 4050 (0.0006) [2023-03-07 07:40:57,442][155452] Updated weights for policy 0, policy_version 4060 (0.0006) [2023-03-07 07:40:58,223][155452] Updated weights for policy 0, policy_version 4070 (0.0007) [2023-03-07 07:40:58,367][155126] Fps is (10 sec: 12902.3, 60 sec: 13090.1, 300 sec: 13152.3). Total num frames: 4168704. Throughput: 0: 13054.2. Samples: 4162157. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:40:58,378][155126] Avg episode reward: [(0, '1194.062')] [2023-03-07 07:40:59,008][155452] Updated weights for policy 0, policy_version 4080 (0.0007) [2023-03-07 07:40:59,797][155452] Updated weights for policy 0, policy_version 4090 (0.0007) [2023-03-07 07:41:00,565][155452] Updated weights for policy 0, policy_version 4100 (0.0006) [2023-03-07 07:41:01,353][155452] Updated weights for policy 0, policy_version 4110 (0.0006) [2023-03-07 07:41:02,120][155452] Updated weights for policy 0, policy_version 4120 (0.0006) [2023-03-07 07:41:02,925][155452] Updated weights for policy 0, policy_version 4130 (0.0006) [2023-03-07 07:41:03,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13073.1, 300 sec: 13152.3). Total num frames: 4234240. Throughput: 0: 13044.9. Samples: 4201258. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:41:03,367][155126] Avg episode reward: [(0, '1448.456')] [2023-03-07 07:41:03,390][155401] Saving new best policy, reward=1448.456! 
[2023-03-07 07:41:03,713][155452] Updated weights for policy 0, policy_version 4140 (0.0006) [2023-03-07 07:41:04,498][155452] Updated weights for policy 0, policy_version 4150 (0.0005) [2023-03-07 07:41:05,273][155452] Updated weights for policy 0, policy_version 4160 (0.0006) [2023-03-07 07:41:06,086][155452] Updated weights for policy 0, policy_version 4170 (0.0007) [2023-03-07 07:41:06,860][155452] Updated weights for policy 0, policy_version 4180 (0.0007) [2023-03-07 07:41:07,633][155452] Updated weights for policy 0, policy_version 4190 (0.0007) [2023-03-07 07:41:08,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13152.3). Total num frames: 4299776. Throughput: 0: 13035.5. Samples: 4279573. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:41:08,367][155126] Avg episode reward: [(0, '1389.370')] [2023-03-07 07:41:08,426][155452] Updated weights for policy 0, policy_version 4200 (0.0006) [2023-03-07 07:41:09,188][155452] Updated weights for policy 0, policy_version 4210 (0.0006) [2023-03-07 07:41:09,977][155452] Updated weights for policy 0, policy_version 4220 (0.0006) [2023-03-07 07:41:10,770][155452] Updated weights for policy 0, policy_version 4230 (0.0006) [2023-03-07 07:41:11,558][155452] Updated weights for policy 0, policy_version 4240 (0.0006) [2023-03-07 07:41:12,324][155452] Updated weights for policy 0, policy_version 4250 (0.0006) [2023-03-07 07:41:13,097][155452] Updated weights for policy 0, policy_version 4260 (0.0006) [2023-03-07 07:41:13,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13152.3). Total num frames: 4365312. Throughput: 0: 13028.0. Samples: 4358203. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:41:13,368][155126] Avg episode reward: [(0, '1142.643')] [2023-03-07 07:41:13,875][155452] Updated weights for policy 0, policy_version 4270 (0.0006) [2023-03-07 07:41:14,669][155452] Updated weights for policy 0, policy_version 4280 (0.0006) [2023-03-07 07:41:15,440][155452] Updated weights for policy 0, policy_version 4290 (0.0006) [2023-03-07 07:41:16,225][155452] Updated weights for policy 0, policy_version 4300 (0.0006) [2023-03-07 07:41:17,009][155452] Updated weights for policy 0, policy_version 4310 (0.0006) [2023-03-07 07:41:17,782][155452] Updated weights for policy 0, policy_version 4320 (0.0006) [2023-03-07 07:41:18,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13152.3). Total num frames: 4430848. Throughput: 0: 13040.3. Samples: 4397533. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 07:41:18,378][155126] Avg episode reward: [(0, '1191.124')] [2023-03-07 07:41:18,587][155452] Updated weights for policy 0, policy_version 4330 (0.0006) [2023-03-07 07:41:19,361][155452] Updated weights for policy 0, policy_version 4340 (0.0006) [2023-03-07 07:41:20,156][155452] Updated weights for policy 0, policy_version 4350 (0.0008) [2023-03-07 07:41:20,937][155452] Updated weights for policy 0, policy_version 4360 (0.0006) [2023-03-07 07:41:21,718][155452] Updated weights for policy 0, policy_version 4370 (0.0006) [2023-03-07 07:41:22,513][155452] Updated weights for policy 0, policy_version 4380 (0.0006) [2023-03-07 07:41:23,290][155452] Updated weights for policy 0, policy_version 4390 (0.0006) [2023-03-07 07:41:23,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13145.4). Total num frames: 4495360. Throughput: 0: 13031.8. Samples: 4475722. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:41:23,367][155126] Avg episode reward: [(0, '1351.382')] [2023-03-07 07:41:24,076][155452] Updated weights for policy 0, policy_version 4400 (0.0006) [2023-03-07 07:41:24,864][155452] Updated weights for policy 0, policy_version 4410 (0.0006) [2023-03-07 07:41:25,645][155452] Updated weights for policy 0, policy_version 4420 (0.0006) [2023-03-07 07:41:26,440][155452] Updated weights for policy 0, policy_version 4430 (0.0006) [2023-03-07 07:41:27,223][155452] Updated weights for policy 0, policy_version 4440 (0.0006) [2023-03-07 07:41:28,013][155452] Updated weights for policy 0, policy_version 4450 (0.0006) [2023-03-07 07:41:28,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13145.4). Total num frames: 4560896. Throughput: 0: 13032.3. Samples: 4553922. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:41:28,367][155126] Avg episode reward: [(0, '1545.476')] [2023-03-07 07:41:28,371][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000004454_4560896.pth... [2023-03-07 07:41:28,400][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000001377_1410048.pth [2023-03-07 07:41:28,403][155401] Saving new best policy, reward=1545.476! 
[2023-03-07 07:41:28,808][155452] Updated weights for policy 0, policy_version 4460 (0.0006) [2023-03-07 07:41:29,577][155452] Updated weights for policy 0, policy_version 4470 (0.0005) [2023-03-07 07:41:30,374][155452] Updated weights for policy 0, policy_version 4480 (0.0006) [2023-03-07 07:41:31,165][155452] Updated weights for policy 0, policy_version 4490 (0.0006) [2023-03-07 07:41:31,947][155452] Updated weights for policy 0, policy_version 4500 (0.0006) [2023-03-07 07:41:32,718][155452] Updated weights for policy 0, policy_version 4510 (0.0005) [2023-03-07 07:41:33,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13141.9). Total num frames: 4626432. Throughput: 0: 13036.2. Samples: 4593036. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:41:33,367][155126] Avg episode reward: [(0, '1592.521')] [2023-03-07 07:41:33,368][155401] Saving new best policy, reward=1592.521! [2023-03-07 07:41:33,513][155452] Updated weights for policy 0, policy_version 4520 (0.0006) [2023-03-07 07:41:34,299][155452] Updated weights for policy 0, policy_version 4530 (0.0006) [2023-03-07 07:41:35,096][155452] Updated weights for policy 0, policy_version 4540 (0.0006) [2023-03-07 07:41:35,885][155452] Updated weights for policy 0, policy_version 4550 (0.0006) [2023-03-07 07:41:36,675][155452] Updated weights for policy 0, policy_version 4560 (0.0006) [2023-03-07 07:41:37,472][155452] Updated weights for policy 0, policy_version 4570 (0.0006) [2023-03-07 07:41:38,255][155452] Updated weights for policy 0, policy_version 4580 (0.0006) [2023-03-07 07:41:38,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13138.4). Total num frames: 4690944. Throughput: 0: 13042.3. Samples: 4670902. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:41:38,368][155126] Avg episode reward: [(0, '1486.640')] [2023-03-07 07:41:39,045][155452] Updated weights for policy 0, policy_version 4590 (0.0006) [2023-03-07 07:41:39,851][155452] Updated weights for policy 0, policy_version 4600 (0.0006) [2023-03-07 07:41:40,616][155452] Updated weights for policy 0, policy_version 4610 (0.0006) [2023-03-07 07:41:41,406][155452] Updated weights for policy 0, policy_version 4620 (0.0005) [2023-03-07 07:41:42,176][155452] Updated weights for policy 0, policy_version 4630 (0.0006) [2023-03-07 07:41:42,962][155452] Updated weights for policy 0, policy_version 4640 (0.0006) [2023-03-07 07:41:43,367][155126] Fps is (10 sec: 12902.5, 60 sec: 13021.9, 300 sec: 13131.5). Total num frames: 4755456. Throughput: 0: 13040.6. Samples: 4748983. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:41:43,367][155126] Avg episode reward: [(0, '1593.061')] [2023-03-07 07:41:43,371][155401] Saving new best policy, reward=1593.061! [2023-03-07 07:41:43,738][155452] Updated weights for policy 0, policy_version 4650 (0.0006) [2023-03-07 07:41:44,531][155452] Updated weights for policy 0, policy_version 4660 (0.0007) [2023-03-07 07:41:45,304][155452] Updated weights for policy 0, policy_version 4670 (0.0005) [2023-03-07 07:41:46,116][155452] Updated weights for policy 0, policy_version 4680 (0.0006) [2023-03-07 07:41:46,876][155452] Updated weights for policy 0, policy_version 4690 (0.0006) [2023-03-07 07:41:47,653][155452] Updated weights for policy 0, policy_version 4700 (0.0006) [2023-03-07 07:41:48,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13131.5). Total num frames: 4820992. Throughput: 0: 13044.4. Samples: 4788254. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 07:41:48,367][155126] Avg episode reward: [(0, '1684.546')] [2023-03-07 07:41:48,372][155401] Saving new best policy, reward=1684.546! 
[2023-03-07 07:41:48,441][155452] Updated weights for policy 0, policy_version 4710 (0.0006) [2023-03-07 07:41:49,234][155452] Updated weights for policy 0, policy_version 4720 (0.0006) [2023-03-07 07:41:50,041][155452] Updated weights for policy 0, policy_version 4730 (0.0006) [2023-03-07 07:41:50,821][155452] Updated weights for policy 0, policy_version 4740 (0.0007) [2023-03-07 07:41:51,592][155452] Updated weights for policy 0, policy_version 4750 (0.0006) [2023-03-07 07:41:52,395][155452] Updated weights for policy 0, policy_version 4760 (0.0007) [2023-03-07 07:41:53,179][155452] Updated weights for policy 0, policy_version 4770 (0.0007) [2023-03-07 07:41:53,367][155126] Fps is (10 sec: 13107.0, 60 sec: 13038.9, 300 sec: 13128.0). Total num frames: 4886528. Throughput: 0: 13040.5. Samples: 4866399. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:41:53,368][155126] Avg episode reward: [(0, '2028.666')] [2023-03-07 07:41:53,368][155401] Saving new best policy, reward=2028.666! [2023-03-07 07:41:53,958][155452] Updated weights for policy 0, policy_version 4780 (0.0006) [2023-03-07 07:41:54,726][155452] Updated weights for policy 0, policy_version 4790 (0.0006) [2023-03-07 07:41:55,521][155452] Updated weights for policy 0, policy_version 4800 (0.0006) [2023-03-07 07:41:56,312][155452] Updated weights for policy 0, policy_version 4810 (0.0006) [2023-03-07 07:41:57,098][155452] Updated weights for policy 0, policy_version 4820 (0.0006) [2023-03-07 07:41:57,871][155452] Updated weights for policy 0, policy_version 4830 (0.0006) [2023-03-07 07:41:58,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13128.0). Total num frames: 4952064. Throughput: 0: 13036.9. Samples: 4944865. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:41:58,367][155126] Avg episode reward: [(0, '2061.394')] [2023-03-07 07:41:58,372][155401] Saving new best policy, reward=2061.394! 
[2023-03-07 07:41:58,646][155452] Updated weights for policy 0, policy_version 4840 (0.0006) [2023-03-07 07:41:59,425][155452] Updated weights for policy 0, policy_version 4850 (0.0006) [2023-03-07 07:42:00,212][155452] Updated weights for policy 0, policy_version 4860 (0.0006) [2023-03-07 07:42:01,000][155452] Updated weights for policy 0, policy_version 4870 (0.0007) [2023-03-07 07:42:01,785][155452] Updated weights for policy 0, policy_version 4880 (0.0007) [2023-03-07 07:42:02,553][155452] Updated weights for policy 0, policy_version 4890 (0.0006) [2023-03-07 07:42:03,350][155452] Updated weights for policy 0, policy_version 4900 (0.0006) [2023-03-07 07:42:03,367][155126] Fps is (10 sec: 13107.4, 60 sec: 13056.0, 300 sec: 13128.0). Total num frames: 5017600. Throughput: 0: 13035.2. Samples: 4984114. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 07:42:03,367][155126] Avg episode reward: [(0, '1898.328')] [2023-03-07 07:42:04,142][155452] Updated weights for policy 0, policy_version 4910 (0.0006) [2023-03-07 07:42:04,921][155452] Updated weights for policy 0, policy_version 4920 (0.0006) [2023-03-07 07:42:05,723][155452] Updated weights for policy 0, policy_version 4930 (0.0007) [2023-03-07 07:42:06,499][155452] Updated weights for policy 0, policy_version 4940 (0.0005) [2023-03-07 07:42:07,292][155452] Updated weights for policy 0, policy_version 4950 (0.0006) [2023-03-07 07:42:08,086][155452] Updated weights for policy 0, policy_version 4960 (0.0006) [2023-03-07 07:42:08,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13121.1). Total num frames: 5082112. Throughput: 0: 13037.1. Samples: 5062390. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 07:42:08,367][155126] Avg episode reward: [(0, '2108.037')] [2023-03-07 07:42:08,370][155401] Saving new best policy, reward=2108.037! 
[2023-03-07 07:42:08,873][155452] Updated weights for policy 0, policy_version 4970 (0.0006) [2023-03-07 07:42:09,645][155452] Updated weights for policy 0, policy_version 4980 (0.0006) [2023-03-07 07:42:10,425][155452] Updated weights for policy 0, policy_version 4990 (0.0006) [2023-03-07 07:42:11,209][155452] Updated weights for policy 0, policy_version 5000 (0.0005) [2023-03-07 07:42:12,006][155452] Updated weights for policy 0, policy_version 5010 (0.0006) [2023-03-07 07:42:12,784][155452] Updated weights for policy 0, policy_version 5020 (0.0006) [2023-03-07 07:42:13,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13038.9, 300 sec: 13121.1). Total num frames: 5147648. Throughput: 0: 13037.3. Samples: 5140600. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 07:42:13,368][155126] Avg episode reward: [(0, '1980.004')] [2023-03-07 07:42:13,569][155452] Updated weights for policy 0, policy_version 5030 (0.0005) [2023-03-07 07:42:14,368][155452] Updated weights for policy 0, policy_version 5040 (0.0006) [2023-03-07 07:42:15,144][155452] Updated weights for policy 0, policy_version 5050 (0.0006) [2023-03-07 07:42:15,921][155452] Updated weights for policy 0, policy_version 5060 (0.0006) [2023-03-07 07:42:16,725][155452] Updated weights for policy 0, policy_version 5070 (0.0006) [2023-03-07 07:42:17,492][155452] Updated weights for policy 0, policy_version 5080 (0.0006) [2023-03-07 07:42:18,297][155452] Updated weights for policy 0, policy_version 5090 (0.0006) [2023-03-07 07:42:18,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13114.1). Total num frames: 5212160. Throughput: 0: 13033.1. Samples: 5179527. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:42:18,367][155126] Avg episode reward: [(0, '1957.472')] [2023-03-07 07:42:19,085][155452] Updated weights for policy 0, policy_version 5100 (0.0006) [2023-03-07 07:42:19,882][155452] Updated weights for policy 0, policy_version 5110 (0.0006) [2023-03-07 07:42:20,694][155452] Updated weights for policy 0, policy_version 5120 (0.0007) [2023-03-07 07:42:21,458][155452] Updated weights for policy 0, policy_version 5130 (0.0006) [2023-03-07 07:42:22,250][155452] Updated weights for policy 0, policy_version 5140 (0.0006) [2023-03-07 07:42:23,037][155452] Updated weights for policy 0, policy_version 5150 (0.0005) [2023-03-07 07:42:23,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13038.9, 300 sec: 13114.1). Total num frames: 5277696. Throughput: 0: 13036.5. Samples: 5257544. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 07:42:23,367][155126] Avg episode reward: [(0, '2103.607')] [2023-03-07 07:42:23,835][155452] Updated weights for policy 0, policy_version 5160 (0.0006) [2023-03-07 07:42:24,605][155452] Updated weights for policy 0, policy_version 5170 (0.0005) [2023-03-07 07:42:25,386][155452] Updated weights for policy 0, policy_version 5180 (0.0006) [2023-03-07 07:42:26,163][155452] Updated weights for policy 0, policy_version 5190 (0.0006) [2023-03-07 07:42:26,966][155452] Updated weights for policy 0, policy_version 5200 (0.0007) [2023-03-07 07:42:27,755][155452] Updated weights for policy 0, policy_version 5210 (0.0006) [2023-03-07 07:42:28,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13107.2). Total num frames: 5342208. Throughput: 0: 13039.3. Samples: 5335753. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 07:42:28,368][155126] Avg episode reward: [(0, '2031.497')] [2023-03-07 07:42:28,549][155452] Updated weights for policy 0, policy_version 5220 (0.0006) [2023-03-07 07:42:29,322][155452] Updated weights for policy 0, policy_version 5230 (0.0006) [2023-03-07 07:42:30,098][155452] Updated weights for policy 0, policy_version 5240 (0.0006) [2023-03-07 07:42:30,893][155452] Updated weights for policy 0, policy_version 5250 (0.0006) [2023-03-07 07:42:31,674][155452] Updated weights for policy 0, policy_version 5260 (0.0006) [2023-03-07 07:42:32,479][155452] Updated weights for policy 0, policy_version 5270 (0.0007) [2023-03-07 07:42:33,247][155452] Updated weights for policy 0, policy_version 5280 (0.0006) [2023-03-07 07:42:33,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13107.2). Total num frames: 5407744. Throughput: 0: 13033.3. Samples: 5374754. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:42:33,367][155126] Avg episode reward: [(0, '1729.079')] [2023-03-07 07:42:34,046][155452] Updated weights for policy 0, policy_version 5290 (0.0006) [2023-03-07 07:42:34,826][155452] Updated weights for policy 0, policy_version 5300 (0.0007) [2023-03-07 07:42:35,598][155452] Updated weights for policy 0, policy_version 5310 (0.0007) [2023-03-07 07:42:36,389][155452] Updated weights for policy 0, policy_version 5320 (0.0006) [2023-03-07 07:42:37,164][155452] Updated weights for policy 0, policy_version 5330 (0.0007) [2023-03-07 07:42:37,971][155452] Updated weights for policy 0, policy_version 5340 (0.0007) [2023-03-07 07:42:38,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13038.9, 300 sec: 13107.2). Total num frames: 5473280. Throughput: 0: 13034.9. Samples: 5452967. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:42:38,367][155126] Avg episode reward: [(0, '1799.564')] [2023-03-07 07:42:38,750][155452] Updated weights for policy 0, policy_version 5350 (0.0006) [2023-03-07 07:42:39,530][155452] Updated weights for policy 0, policy_version 5360 (0.0006) [2023-03-07 07:42:40,328][155452] Updated weights for policy 0, policy_version 5370 (0.0006) [2023-03-07 07:42:41,092][155452] Updated weights for policy 0, policy_version 5380 (0.0006) [2023-03-07 07:42:41,885][155452] Updated weights for policy 0, policy_version 5390 (0.0006) [2023-03-07 07:42:42,654][155452] Updated weights for policy 0, policy_version 5400 (0.0007) [2023-03-07 07:42:43,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13107.2). Total num frames: 5538816. Throughput: 0: 13034.3. Samples: 5531409. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:42:43,367][155126] Avg episode reward: [(0, '1899.788')] [2023-03-07 07:42:43,441][155452] Updated weights for policy 0, policy_version 5410 (0.0006) [2023-03-07 07:42:44,222][155452] Updated weights for policy 0, policy_version 5420 (0.0006) [2023-03-07 07:42:45,018][155452] Updated weights for policy 0, policy_version 5430 (0.0006) [2023-03-07 07:42:45,799][155452] Updated weights for policy 0, policy_version 5440 (0.0006) [2023-03-07 07:42:46,577][155452] Updated weights for policy 0, policy_version 5450 (0.0006) [2023-03-07 07:42:47,389][155452] Updated weights for policy 0, policy_version 5460 (0.0006) [2023-03-07 07:42:48,157][155452] Updated weights for policy 0, policy_version 5470 (0.0006) [2023-03-07 07:42:48,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13103.7). Total num frames: 5603328. Throughput: 0: 13033.8. Samples: 5570637. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:42:48,367][155126] Avg episode reward: [(0, '2127.975')] [2023-03-07 07:42:48,385][155401] Saving new best policy, reward=2127.975! 
[2023-03-07 07:42:48,953][155452] Updated weights for policy 0, policy_version 5480 (0.0006) [2023-03-07 07:42:49,727][155452] Updated weights for policy 0, policy_version 5490 (0.0006) [2023-03-07 07:42:50,533][155452] Updated weights for policy 0, policy_version 5500 (0.0006) [2023-03-07 07:42:51,300][155452] Updated weights for policy 0, policy_version 5510 (0.0006) [2023-03-07 07:42:52,086][155452] Updated weights for policy 0, policy_version 5520 (0.0006) [2023-03-07 07:42:52,857][155452] Updated weights for policy 0, policy_version 5530 (0.0006) [2023-03-07 07:42:53,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13039.0, 300 sec: 13100.3). Total num frames: 5668864. Throughput: 0: 13027.9. Samples: 5648644. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:42:53,367][155126] Avg episode reward: [(0, '1796.469')] [2023-03-07 07:42:53,645][155452] Updated weights for policy 0, policy_version 5540 (0.0006) [2023-03-07 07:42:54,426][155452] Updated weights for policy 0, policy_version 5550 (0.0006) [2023-03-07 07:42:55,215][155452] Updated weights for policy 0, policy_version 5560 (0.0007) [2023-03-07 07:42:55,989][155452] Updated weights for policy 0, policy_version 5570 (0.0007) [2023-03-07 07:42:56,765][155452] Updated weights for policy 0, policy_version 5580 (0.0006) [2023-03-07 07:42:57,548][155452] Updated weights for policy 0, policy_version 5590 (0.0006) [2023-03-07 07:42:58,340][155452] Updated weights for policy 0, policy_version 5600 (0.0006) [2023-03-07 07:42:58,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13100.3). Total num frames: 5734400. Throughput: 0: 13037.3. Samples: 5727276. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:42:58,367][155126] Avg episode reward: [(0, '2127.193')] [2023-03-07 07:42:59,112][155452] Updated weights for policy 0, policy_version 5610 (0.0006) [2023-03-07 07:42:59,907][155452] Updated weights for policy 0, policy_version 5620 (0.0006) [2023-03-07 07:43:00,673][155452] Updated weights for policy 0, policy_version 5630 (0.0006) [2023-03-07 07:43:01,482][155452] Updated weights for policy 0, policy_version 5640 (0.0006) [2023-03-07 07:43:02,261][155452] Updated weights for policy 0, policy_version 5650 (0.0007) [2023-03-07 07:43:03,037][155452] Updated weights for policy 0, policy_version 5660 (0.0007) [2023-03-07 07:43:03,367][155126] Fps is (10 sec: 13107.0, 60 sec: 13038.9, 300 sec: 13100.3). Total num frames: 5799936. Throughput: 0: 13046.2. Samples: 5766606. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:43:03,378][155126] Avg episode reward: [(0, '1919.942')] [2023-03-07 07:43:03,827][155452] Updated weights for policy 0, policy_version 5670 (0.0006) [2023-03-07 07:43:04,627][155452] Updated weights for policy 0, policy_version 5680 (0.0006) [2023-03-07 07:43:05,407][155452] Updated weights for policy 0, policy_version 5690 (0.0006) [2023-03-07 07:43:06,189][155452] Updated weights for policy 0, policy_version 5700 (0.0006) [2023-03-07 07:43:06,976][155452] Updated weights for policy 0, policy_version 5710 (0.0006) [2023-03-07 07:43:07,755][155452] Updated weights for policy 0, policy_version 5720 (0.0006) [2023-03-07 07:43:08,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13096.8). Total num frames: 5864448. Throughput: 0: 13049.0. Samples: 5844749. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:43:08,367][155126] Avg episode reward: [(0, '1969.088')] [2023-03-07 07:43:08,536][155452] Updated weights for policy 0, policy_version 5730 (0.0006) [2023-03-07 07:43:09,319][155452] Updated weights for policy 0, policy_version 5740 (0.0006) [2023-03-07 07:43:10,104][155452] Updated weights for policy 0, policy_version 5750 (0.0006) [2023-03-07 07:43:10,882][155452] Updated weights for policy 0, policy_version 5760 (0.0006) [2023-03-07 07:43:11,686][155452] Updated weights for policy 0, policy_version 5770 (0.0006) [2023-03-07 07:43:12,451][155452] Updated weights for policy 0, policy_version 5780 (0.0007) [2023-03-07 07:43:13,263][155452] Updated weights for policy 0, policy_version 5790 (0.0007) [2023-03-07 07:43:13,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13039.0, 300 sec: 13100.3). Total num frames: 5929984. Throughput: 0: 13051.2. Samples: 5923057. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:43:13,378][155126] Avg episode reward: [(0, '1791.737')] [2023-03-07 07:43:14,061][155452] Updated weights for policy 0, policy_version 5800 (0.0006) [2023-03-07 07:43:14,838][155452] Updated weights for policy 0, policy_version 5810 (0.0006) [2023-03-07 07:43:15,638][155452] Updated weights for policy 0, policy_version 5820 (0.0006) [2023-03-07 07:43:16,421][155452] Updated weights for policy 0, policy_version 5830 (0.0006) [2023-03-07 07:43:17,194][155452] Updated weights for policy 0, policy_version 5840 (0.0006) [2023-03-07 07:43:17,997][155452] Updated weights for policy 0, policy_version 5850 (0.0005) [2023-03-07 07:43:18,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13093.3). Total num frames: 5994496. Throughput: 0: 13044.6. Samples: 5961760. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 07:43:18,367][155126] Avg episode reward: [(0, '1955.499')] [2023-03-07 07:43:18,789][155452] Updated weights for policy 0, policy_version 5860 (0.0006) [2023-03-07 07:43:19,560][155452] Updated weights for policy 0, policy_version 5870 (0.0006) [2023-03-07 07:43:20,337][155452] Updated weights for policy 0, policy_version 5880 (0.0005) [2023-03-07 07:43:21,133][155452] Updated weights for policy 0, policy_version 5890 (0.0006) [2023-03-07 07:43:21,917][155452] Updated weights for policy 0, policy_version 5900 (0.0006) [2023-03-07 07:43:22,704][155452] Updated weights for policy 0, policy_version 5910 (0.0006) [2023-03-07 07:43:23,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13093.3). Total num frames: 6060032. Throughput: 0: 13045.7. Samples: 6040026. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 07:43:23,367][155126] Avg episode reward: [(0, '2020.701')] [2023-03-07 07:43:23,504][155452] Updated weights for policy 0, policy_version 5920 (0.0006) [2023-03-07 07:43:24,262][155452] Updated weights for policy 0, policy_version 5930 (0.0007) [2023-03-07 07:43:25,056][155452] Updated weights for policy 0, policy_version 5940 (0.0006) [2023-03-07 07:43:25,845][155452] Updated weights for policy 0, policy_version 5950 (0.0007) [2023-03-07 07:43:26,620][155452] Updated weights for policy 0, policy_version 5960 (0.0007) [2023-03-07 07:43:27,399][155452] Updated weights for policy 0, policy_version 5970 (0.0006) [2023-03-07 07:43:28,173][155452] Updated weights for policy 0, policy_version 5980 (0.0006) [2023-03-07 07:43:28,367][155126] Fps is (10 sec: 13107.0, 60 sec: 13056.0, 300 sec: 13096.8). Total num frames: 6125568. Throughput: 0: 13049.4. Samples: 6118634. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 07:43:28,367][155126] Avg episode reward: [(0, '1860.216')] [2023-03-07 07:43:28,382][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000005982_6125568.pth... [2023-03-07 07:43:28,412][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000002918_2988032.pth [2023-03-07 07:43:28,937][155452] Updated weights for policy 0, policy_version 5990 (0.0006) [2023-03-07 07:43:29,727][155452] Updated weights for policy 0, policy_version 6000 (0.0006) [2023-03-07 07:43:30,510][155452] Updated weights for policy 0, policy_version 6010 (0.0006) [2023-03-07 07:43:31,289][155452] Updated weights for policy 0, policy_version 6020 (0.0006) [2023-03-07 07:43:32,069][155452] Updated weights for policy 0, policy_version 6030 (0.0006) [2023-03-07 07:43:32,865][155452] Updated weights for policy 0, policy_version 6040 (0.0006) [2023-03-07 07:43:33,367][155126] Fps is (10 sec: 13107.0, 60 sec: 13056.0, 300 sec: 13093.3). Total num frames: 6191104. Throughput: 0: 13050.8. Samples: 6157922. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:43:33,378][155126] Avg episode reward: [(0, '1513.022')] [2023-03-07 07:43:33,672][155452] Updated weights for policy 0, policy_version 6050 (0.0007) [2023-03-07 07:43:34,445][155452] Updated weights for policy 0, policy_version 6060 (0.0005) [2023-03-07 07:43:35,226][155452] Updated weights for policy 0, policy_version 6070 (0.0005) [2023-03-07 07:43:36,007][155452] Updated weights for policy 0, policy_version 6080 (0.0007) [2023-03-07 07:43:36,778][155452] Updated weights for policy 0, policy_version 6090 (0.0006) [2023-03-07 07:43:37,585][155452] Updated weights for policy 0, policy_version 6100 (0.0006) [2023-03-07 07:43:38,352][155452] Updated weights for policy 0, policy_version 6110 (0.0006) [2023-03-07 07:43:38,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13093.3). Total num frames: 6256640. Throughput: 0: 13058.9. Samples: 6236297. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:43:38,367][155126] Avg episode reward: [(0, '1717.173')] [2023-03-07 07:43:39,140][155452] Updated weights for policy 0, policy_version 6120 (0.0006) [2023-03-07 07:43:39,930][155452] Updated weights for policy 0, policy_version 6130 (0.0006) [2023-03-07 07:43:40,718][155452] Updated weights for policy 0, policy_version 6140 (0.0006) [2023-03-07 07:43:41,510][155452] Updated weights for policy 0, policy_version 6150 (0.0006) [2023-03-07 07:43:42,306][155452] Updated weights for policy 0, policy_version 6160 (0.0007) [2023-03-07 07:43:43,091][155452] Updated weights for policy 0, policy_version 6170 (0.0006) [2023-03-07 07:43:43,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13038.9, 300 sec: 13086.4). Total num frames: 6321152. Throughput: 0: 13043.4. Samples: 6314229. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 07:43:43,367][155126] Avg episode reward: [(0, '1789.420')] [2023-03-07 07:43:43,859][155452] Updated weights for policy 0, policy_version 6180 (0.0007) [2023-03-07 07:43:44,661][155452] Updated weights for policy 0, policy_version 6190 (0.0006) [2023-03-07 07:43:45,450][155452] Updated weights for policy 0, policy_version 6200 (0.0006) [2023-03-07 07:43:46,219][155452] Updated weights for policy 0, policy_version 6210 (0.0006) [2023-03-07 07:43:47,009][155452] Updated weights for policy 0, policy_version 6220 (0.0006) [2023-03-07 07:43:47,780][155452] Updated weights for policy 0, policy_version 6230 (0.0006) [2023-03-07 07:43:48,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13086.4). Total num frames: 6386688. Throughput: 0: 13039.2. Samples: 6353367. Policy #0 lag: (min: 0.0, avg: 1.5, max: 3.0) [2023-03-07 07:43:48,367][155126] Avg episode reward: [(0, '1886.715')] [2023-03-07 07:43:48,557][155452] Updated weights for policy 0, policy_version 6240 (0.0006) [2023-03-07 07:43:49,350][155452] Updated weights for policy 0, policy_version 6250 (0.0006) [2023-03-07 07:43:50,141][155452] Updated weights for policy 0, policy_version 6260 (0.0007) [2023-03-07 07:43:50,933][155452] Updated weights for policy 0, policy_version 6270 (0.0007) [2023-03-07 07:43:51,733][155452] Updated weights for policy 0, policy_version 6280 (0.0006) [2023-03-07 07:43:52,515][155452] Updated weights for policy 0, policy_version 6290 (0.0006) [2023-03-07 07:43:53,305][155452] Updated weights for policy 0, policy_version 6300 (0.0006) [2023-03-07 07:43:53,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13079.4). Total num frames: 6451200. Throughput: 0: 13040.0. Samples: 6431552. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 07:43:53,367][155126] Avg episode reward: [(0, '1860.322')] [2023-03-07 07:43:54,092][155452] Updated weights for policy 0, policy_version 6310 (0.0007) [2023-03-07 07:43:54,874][155452] Updated weights for policy 0, policy_version 6320 (0.0006) [2023-03-07 07:43:55,638][155452] Updated weights for policy 0, policy_version 6330 (0.0006) [2023-03-07 07:43:56,435][155452] Updated weights for policy 0, policy_version 6340 (0.0006) [2023-03-07 07:43:57,229][155452] Updated weights for policy 0, policy_version 6350 (0.0006) [2023-03-07 07:43:57,995][155452] Updated weights for policy 0, policy_version 6360 (0.0006) [2023-03-07 07:43:58,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13079.4). Total num frames: 6516736. Throughput: 0: 13042.0. Samples: 6509946. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:43:58,367][155126] Avg episode reward: [(0, '2018.795')] [2023-03-07 07:43:58,778][155452] Updated weights for policy 0, policy_version 6370 (0.0006) [2023-03-07 07:43:59,584][155452] Updated weights for policy 0, policy_version 6380 (0.0006) [2023-03-07 07:44:00,354][155452] Updated weights for policy 0, policy_version 6390 (0.0006) [2023-03-07 07:44:01,130][155452] Updated weights for policy 0, policy_version 6400 (0.0005) [2023-03-07 07:44:01,910][155452] Updated weights for policy 0, policy_version 6410 (0.0006) [2023-03-07 07:44:02,732][155452] Updated weights for policy 0, policy_version 6420 (0.0006) [2023-03-07 07:44:03,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13079.4). Total num frames: 6582272. Throughput: 0: 13050.3. Samples: 6549026. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:44:03,367][155126] Avg episode reward: [(0, '2192.451')] [2023-03-07 07:44:03,379][155401] Saving new best policy, reward=2192.451! 
[2023-03-07 07:44:03,508][155452] Updated weights for policy 0, policy_version 6430 (0.0007) [2023-03-07 07:44:04,297][155452] Updated weights for policy 0, policy_version 6440 (0.0006) [2023-03-07 07:44:05,081][155452] Updated weights for policy 0, policy_version 6450 (0.0006) [2023-03-07 07:44:05,874][155452] Updated weights for policy 0, policy_version 6460 (0.0006) [2023-03-07 07:44:06,644][155452] Updated weights for policy 0, policy_version 6470 (0.0007) [2023-03-07 07:44:07,448][155452] Updated weights for policy 0, policy_version 6480 (0.0006) [2023-03-07 07:44:08,244][155452] Updated weights for policy 0, policy_version 6490 (0.0006) [2023-03-07 07:44:08,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13072.5). Total num frames: 6646784. Throughput: 0: 13048.1. Samples: 6627193. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:44:08,367][155126] Avg episode reward: [(0, '1744.259')] [2023-03-07 07:44:09,030][155452] Updated weights for policy 0, policy_version 6500 (0.0006) [2023-03-07 07:44:09,815][155452] Updated weights for policy 0, policy_version 6510 (0.0006) [2023-03-07 07:44:10,595][155452] Updated weights for policy 0, policy_version 6520 (0.0007) [2023-03-07 07:44:11,378][155452] Updated weights for policy 0, policy_version 6530 (0.0006) [2023-03-07 07:44:12,169][155452] Updated weights for policy 0, policy_version 6540 (0.0006) [2023-03-07 07:44:12,925][155452] Updated weights for policy 0, policy_version 6550 (0.0006) [2023-03-07 07:44:13,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13069.0). Total num frames: 6712320. Throughput: 0: 13041.3. Samples: 6705492. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:44:13,367][155126] Avg episode reward: [(0, '1783.538')] [2023-03-07 07:44:13,721][155452] Updated weights for policy 0, policy_version 6560 (0.0006) [2023-03-07 07:44:14,492][155452] Updated weights for policy 0, policy_version 6570 (0.0006) [2023-03-07 07:44:15,283][155452] Updated weights for policy 0, policy_version 6580 (0.0006) [2023-03-07 07:44:16,055][155452] Updated weights for policy 0, policy_version 6590 (0.0006) [2023-03-07 07:44:16,841][155452] Updated weights for policy 0, policy_version 6600 (0.0006) [2023-03-07 07:44:17,630][155452] Updated weights for policy 0, policy_version 6610 (0.0005) [2023-03-07 07:44:18,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13069.0). Total num frames: 6777856. Throughput: 0: 13037.3. Samples: 6744599. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:44:18,367][155126] Avg episode reward: [(0, '1997.063')] [2023-03-07 07:44:18,411][155452] Updated weights for policy 0, policy_version 6620 (0.0006) [2023-03-07 07:44:19,198][155452] Updated weights for policy 0, policy_version 6630 (0.0006) [2023-03-07 07:44:19,970][155452] Updated weights for policy 0, policy_version 6640 (0.0006) [2023-03-07 07:44:20,764][155452] Updated weights for policy 0, policy_version 6650 (0.0007) [2023-03-07 07:44:21,556][155452] Updated weights for policy 0, policy_version 6660 (0.0006) [2023-03-07 07:44:22,320][155452] Updated weights for policy 0, policy_version 6670 (0.0006) [2023-03-07 07:44:23,097][155452] Updated weights for policy 0, policy_version 6680 (0.0006) [2023-03-07 07:44:23,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13069.0). Total num frames: 6843392. Throughput: 0: 13042.4. Samples: 6823206. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:44:23,367][155126] Avg episode reward: [(0, '2099.333')] [2023-03-07 07:44:23,881][155452] Updated weights for policy 0, policy_version 6690 (0.0006) [2023-03-07 07:44:24,698][155452] Updated weights for policy 0, policy_version 6700 (0.0007) [2023-03-07 07:44:25,472][155452] Updated weights for policy 0, policy_version 6710 (0.0005) [2023-03-07 07:44:26,273][155452] Updated weights for policy 0, policy_version 6720 (0.0006) [2023-03-07 07:44:27,090][155452] Updated weights for policy 0, policy_version 6730 (0.0006) [2023-03-07 07:44:27,859][155452] Updated weights for policy 0, policy_version 6740 (0.0005) [2023-03-07 07:44:28,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13065.5). Total num frames: 6907904. Throughput: 0: 13038.7. Samples: 6900972. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 07:44:28,368][155126] Avg episode reward: [(0, '2155.105')] [2023-03-07 07:44:28,636][155452] Updated weights for policy 0, policy_version 6750 (0.0006) [2023-03-07 07:44:29,439][155452] Updated weights for policy 0, policy_version 6760 (0.0007) [2023-03-07 07:44:30,222][155452] Updated weights for policy 0, policy_version 6770 (0.0006) [2023-03-07 07:44:30,994][155452] Updated weights for policy 0, policy_version 6780 (0.0006) [2023-03-07 07:44:31,797][155452] Updated weights for policy 0, policy_version 6790 (0.0006) [2023-03-07 07:44:32,575][155452] Updated weights for policy 0, policy_version 6800 (0.0007) [2023-03-07 07:44:33,350][155452] Updated weights for policy 0, policy_version 6810 (0.0006) [2023-03-07 07:44:33,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13039.0, 300 sec: 13065.6). Total num frames: 6973440. Throughput: 0: 13037.3. Samples: 6940044. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:44:33,367][155126] Avg episode reward: [(0, '2232.207')] [2023-03-07 07:44:33,378][155401] Saving new best policy, reward=2232.207! 
[2023-03-07 07:44:34,125][155452] Updated weights for policy 0, policy_version 6820 (0.0006) [2023-03-07 07:44:34,903][155452] Updated weights for policy 0, policy_version 6830 (0.0006) [2023-03-07 07:44:35,689][155452] Updated weights for policy 0, policy_version 6840 (0.0006) [2023-03-07 07:44:36,465][155452] Updated weights for policy 0, policy_version 6850 (0.0006) [2023-03-07 07:44:37,225][155452] Updated weights for policy 0, policy_version 6860 (0.0007) [2023-03-07 07:44:37,999][155452] Updated weights for policy 0, policy_version 6870 (0.0006) [2023-03-07 07:44:38,367][155126] Fps is (10 sec: 13107.4, 60 sec: 13039.0, 300 sec: 13062.1). Total num frames: 7038976. Throughput: 0: 13048.4. Samples: 7018729. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:44:38,367][155126] Avg episode reward: [(0, '1868.925')] [2023-03-07 07:44:38,780][155452] Updated weights for policy 0, policy_version 6880 (0.0006) [2023-03-07 07:44:39,559][155452] Updated weights for policy 0, policy_version 6890 (0.0007) [2023-03-07 07:44:40,349][155452] Updated weights for policy 0, policy_version 6900 (0.0005) [2023-03-07 07:44:41,135][155452] Updated weights for policy 0, policy_version 6910 (0.0006) [2023-03-07 07:44:41,922][155452] Updated weights for policy 0, policy_version 6920 (0.0006) [2023-03-07 07:44:42,705][155452] Updated weights for policy 0, policy_version 6930 (0.0006) [2023-03-07 07:44:43,367][155126] Fps is (10 sec: 13107.0, 60 sec: 13056.0, 300 sec: 13062.1). Total num frames: 7104512. Throughput: 0: 13055.7. Samples: 7097453. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:44:43,367][155126] Avg episode reward: [(0, '1842.905')] [2023-03-07 07:44:43,486][155452] Updated weights for policy 0, policy_version 6940 (0.0006) [2023-03-07 07:44:44,264][155452] Updated weights for policy 0, policy_version 6950 (0.0006) [2023-03-07 07:44:45,074][155452] Updated weights for policy 0, policy_version 6960 (0.0006) [2023-03-07 07:44:45,855][155452] Updated weights for policy 0, policy_version 6970 (0.0006) [2023-03-07 07:44:46,644][155452] Updated weights for policy 0, policy_version 6980 (0.0006) [2023-03-07 07:44:47,442][155452] Updated weights for policy 0, policy_version 6990 (0.0006) [2023-03-07 07:44:48,211][155452] Updated weights for policy 0, policy_version 7000 (0.0006) [2023-03-07 07:44:48,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 7170048. Throughput: 0: 13053.4. Samples: 7136425. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:44:48,367][155126] Avg episode reward: [(0, '1763.078')] [2023-03-07 07:44:48,998][155452] Updated weights for policy 0, policy_version 7010 (0.0006) [2023-03-07 07:44:49,791][155452] Updated weights for policy 0, policy_version 7020 (0.0005) [2023-03-07 07:44:50,575][155452] Updated weights for policy 0, policy_version 7030 (0.0006) [2023-03-07 07:44:51,367][155452] Updated weights for policy 0, policy_version 7040 (0.0007) [2023-03-07 07:44:52,162][155452] Updated weights for policy 0, policy_version 7050 (0.0006) [2023-03-07 07:44:52,947][155452] Updated weights for policy 0, policy_version 7060 (0.0006) [2023-03-07 07:44:53,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13056.0, 300 sec: 13055.1). Total num frames: 7234560. Throughput: 0: 13053.1. Samples: 7214581. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:44:53,367][155126] Avg episode reward: [(0, '1714.314')] [2023-03-07 07:44:53,721][155452] Updated weights for policy 0, policy_version 7070 (0.0006) [2023-03-07 07:44:54,505][155452] Updated weights for policy 0, policy_version 7080 (0.0005) [2023-03-07 07:44:55,283][155452] Updated weights for policy 0, policy_version 7090 (0.0006) [2023-03-07 07:44:56,068][155452] Updated weights for policy 0, policy_version 7100 (0.0006) [2023-03-07 07:44:56,854][155452] Updated weights for policy 0, policy_version 7110 (0.0006) [2023-03-07 07:44:57,657][155452] Updated weights for policy 0, policy_version 7120 (0.0006) [2023-03-07 07:44:58,367][155126] Fps is (10 sec: 12902.4, 60 sec: 13038.9, 300 sec: 13048.2). Total num frames: 7299072. Throughput: 0: 13047.1. Samples: 7292613. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:44:58,367][155126] Avg episode reward: [(0, '1828.177')] [2023-03-07 07:44:58,443][155452] Updated weights for policy 0, policy_version 7130 (0.0006) [2023-03-07 07:44:59,224][155452] Updated weights for policy 0, policy_version 7140 (0.0006) [2023-03-07 07:45:00,016][155452] Updated weights for policy 0, policy_version 7150 (0.0006) [2023-03-07 07:45:00,801][155452] Updated weights for policy 0, policy_version 7160 (0.0006) [2023-03-07 07:45:01,598][155452] Updated weights for policy 0, policy_version 7170 (0.0006) [2023-03-07 07:45:02,386][155452] Updated weights for policy 0, policy_version 7180 (0.0006) [2023-03-07 07:45:03,194][155452] Updated weights for policy 0, policy_version 7190 (0.0006) [2023-03-07 07:45:03,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13044.7). Total num frames: 7364608. Throughput: 0: 13045.7. Samples: 7331654. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:45:03,367][155126] Avg episode reward: [(0, '1965.732')] [2023-03-07 07:45:03,978][155452] Updated weights for policy 0, policy_version 7200 (0.0006) [2023-03-07 07:45:04,762][155452] Updated weights for policy 0, policy_version 7210 (0.0007) [2023-03-07 07:45:05,551][155452] Updated weights for policy 0, policy_version 7220 (0.0006) [2023-03-07 07:45:06,340][155452] Updated weights for policy 0, policy_version 7230 (0.0006) [2023-03-07 07:45:07,124][155452] Updated weights for policy 0, policy_version 7240 (0.0006) [2023-03-07 07:45:07,910][155452] Updated weights for policy 0, policy_version 7250 (0.0006) [2023-03-07 07:45:08,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13041.3). Total num frames: 7429120. Throughput: 0: 13031.0. Samples: 7409601. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:45:08,367][155126] Avg episode reward: [(0, '2300.551')] [2023-03-07 07:45:08,371][155401] Saving new best policy, reward=2300.551! [2023-03-07 07:45:08,695][155452] Updated weights for policy 0, policy_version 7260 (0.0006) [2023-03-07 07:45:09,495][155452] Updated weights for policy 0, policy_version 7270 (0.0006) [2023-03-07 07:45:10,280][155452] Updated weights for policy 0, policy_version 7280 (0.0006) [2023-03-07 07:45:11,066][155452] Updated weights for policy 0, policy_version 7290 (0.0006) [2023-03-07 07:45:11,845][155452] Updated weights for policy 0, policy_version 7300 (0.0007) [2023-03-07 07:45:12,622][155452] Updated weights for policy 0, policy_version 7310 (0.0006) [2023-03-07 07:45:13,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13041.3). Total num frames: 7494656. Throughput: 0: 13037.3. Samples: 7487650. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 07:45:13,367][155126] Avg episode reward: [(0, '2110.838')] [2023-03-07 07:45:13,413][155452] Updated weights for policy 0, policy_version 7320 (0.0007) [2023-03-07 07:45:14,193][155452] Updated weights for policy 0, policy_version 7330 (0.0006) [2023-03-07 07:45:14,984][155452] Updated weights for policy 0, policy_version 7340 (0.0006) [2023-03-07 07:45:15,767][155452] Updated weights for policy 0, policy_version 7350 (0.0005) [2023-03-07 07:45:16,558][155452] Updated weights for policy 0, policy_version 7360 (0.0006) [2023-03-07 07:45:17,350][155452] Updated weights for policy 0, policy_version 7370 (0.0006) [2023-03-07 07:45:18,135][155452] Updated weights for policy 0, policy_version 7380 (0.0006) [2023-03-07 07:45:18,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13037.8). Total num frames: 7559168. Throughput: 0: 13037.6. Samples: 7526736. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:45:18,367][155126] Avg episode reward: [(0, '2117.477')] [2023-03-07 07:45:18,915][155452] Updated weights for policy 0, policy_version 7390 (0.0006) [2023-03-07 07:45:19,710][155452] Updated weights for policy 0, policy_version 7400 (0.0006) [2023-03-07 07:45:20,487][155452] Updated weights for policy 0, policy_version 7410 (0.0006) [2023-03-07 07:45:21,282][155452] Updated weights for policy 0, policy_version 7420 (0.0006) [2023-03-07 07:45:22,082][155452] Updated weights for policy 0, policy_version 7430 (0.0007) [2023-03-07 07:45:22,850][155452] Updated weights for policy 0, policy_version 7440 (0.0006) [2023-03-07 07:45:23,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.8, 300 sec: 13037.8). Total num frames: 7624704. Throughput: 0: 13021.5. Samples: 7604699. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:45:23,368][155126] Avg episode reward: [(0, '2076.328')] [2023-03-07 07:45:23,636][155452] Updated weights for policy 0, policy_version 7450 (0.0006) [2023-03-07 07:45:24,419][155452] Updated weights for policy 0, policy_version 7460 (0.0005) [2023-03-07 07:45:25,202][155452] Updated weights for policy 0, policy_version 7470 (0.0006) [2023-03-07 07:45:25,976][155452] Updated weights for policy 0, policy_version 7480 (0.0007) [2023-03-07 07:45:26,755][155452] Updated weights for policy 0, policy_version 7490 (0.0006) [2023-03-07 07:45:27,537][155452] Updated weights for policy 0, policy_version 7500 (0.0006) [2023-03-07 07:45:28,319][155452] Updated weights for policy 0, policy_version 7510 (0.0006) [2023-03-07 07:45:28,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 7690240. Throughput: 0: 13019.5. Samples: 7683332. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:45:28,368][155126] Avg episode reward: [(0, '2046.089')] [2023-03-07 07:45:28,372][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000007510_7690240.pth... 
[2023-03-07 07:45:28,403][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000004454_4560896.pth [2023-03-07 07:45:29,094][155452] Updated weights for policy 0, policy_version 7520 (0.0006) [2023-03-07 07:45:29,873][155452] Updated weights for policy 0, policy_version 7530 (0.0007) [2023-03-07 07:45:30,665][155452] Updated weights for policy 0, policy_version 7540 (0.0005) [2023-03-07 07:45:31,434][155452] Updated weights for policy 0, policy_version 7550 (0.0006) [2023-03-07 07:45:32,240][155452] Updated weights for policy 0, policy_version 7560 (0.0006) [2023-03-07 07:45:33,017][155452] Updated weights for policy 0, policy_version 7570 (0.0006) [2023-03-07 07:45:33,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 7755776. Throughput: 0: 13026.8. Samples: 7722632. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 07:45:33,378][155126] Avg episode reward: [(0, '1957.970')] [2023-03-07 07:45:33,802][155452] Updated weights for policy 0, policy_version 7580 (0.0006) [2023-03-07 07:45:34,592][155452] Updated weights for policy 0, policy_version 7590 (0.0007) [2023-03-07 07:45:35,380][155452] Updated weights for policy 0, policy_version 7600 (0.0006) [2023-03-07 07:45:36,159][155452] Updated weights for policy 0, policy_version 7610 (0.0006) [2023-03-07 07:45:36,954][155452] Updated weights for policy 0, policy_version 7620 (0.0006) [2023-03-07 07:45:37,741][155452] Updated weights for policy 0, policy_version 7630 (0.0006) [2023-03-07 07:45:38,367][155126] Fps is (10 sec: 13107.4, 60 sec: 13038.9, 300 sec: 13041.2). Total num frames: 7821312. Throughput: 0: 13029.4. Samples: 7800902. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:45:38,367][155126] Avg episode reward: [(0, '2194.325')] [2023-03-07 07:45:38,525][155452] Updated weights for policy 0, policy_version 7640 (0.0006) [2023-03-07 07:45:39,308][155452] Updated weights for policy 0, policy_version 7650 (0.0006) [2023-03-07 07:45:40,093][155452] Updated weights for policy 0, policy_version 7660 (0.0005) [2023-03-07 07:45:40,884][155452] Updated weights for policy 0, policy_version 7670 (0.0006) [2023-03-07 07:45:41,663][155452] Updated weights for policy 0, policy_version 7680 (0.0006) [2023-03-07 07:45:42,454][155452] Updated weights for policy 0, policy_version 7690 (0.0007) [2023-03-07 07:45:43,237][155452] Updated weights for policy 0, policy_version 7700 (0.0007) [2023-03-07 07:45:43,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13037.8). Total num frames: 7885824. Throughput: 0: 13030.1. Samples: 7878968. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:45:43,367][155126] Avg episode reward: [(0, '2424.485')] [2023-03-07 07:45:43,391][155401] Saving new best policy, reward=2424.485! [2023-03-07 07:45:44,014][155452] Updated weights for policy 0, policy_version 7710 (0.0006) [2023-03-07 07:45:44,805][155452] Updated weights for policy 0, policy_version 7720 (0.0006) [2023-03-07 07:45:45,568][155452] Updated weights for policy 0, policy_version 7730 (0.0006) [2023-03-07 07:45:46,353][155452] Updated weights for policy 0, policy_version 7740 (0.0006) [2023-03-07 07:45:47,160][155452] Updated weights for policy 0, policy_version 7750 (0.0006) [2023-03-07 07:45:47,930][155452] Updated weights for policy 0, policy_version 7760 (0.0006) [2023-03-07 07:45:48,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.8, 300 sec: 13041.2). Total num frames: 7951360. Throughput: 0: 13038.4. Samples: 7918381. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:45:48,367][155126] Avg episode reward: [(0, '2180.012')] [2023-03-07 07:45:48,728][155452] Updated weights for policy 0, policy_version 7770 (0.0006) [2023-03-07 07:45:49,512][155452] Updated weights for policy 0, policy_version 7780 (0.0006) [2023-03-07 07:45:50,307][155452] Updated weights for policy 0, policy_version 7790 (0.0006) [2023-03-07 07:45:51,068][155452] Updated weights for policy 0, policy_version 7800 (0.0006) [2023-03-07 07:45:51,526][155401] KL-divergence is very high: 25903.8906 [2023-03-07 07:45:51,831][155452] Updated weights for policy 0, policy_version 7810 (0.0006) [2023-03-07 07:45:52,630][155452] Updated weights for policy 0, policy_version 7820 (0.0006) [2023-03-07 07:45:53,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13038.9, 300 sec: 13044.7). Total num frames: 8016896. Throughput: 0: 13048.3. Samples: 7996772. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:45:53,367][155126] Avg episode reward: [(0, '2255.765')] [2023-03-07 07:45:53,420][155452] Updated weights for policy 0, policy_version 7830 (0.0006) [2023-03-07 07:45:54,200][155452] Updated weights for policy 0, policy_version 7840 (0.0006) [2023-03-07 07:45:54,991][155452] Updated weights for policy 0, policy_version 7850 (0.0007) [2023-03-07 07:45:55,780][155452] Updated weights for policy 0, policy_version 7860 (0.0007) [2023-03-07 07:45:56,562][155452] Updated weights for policy 0, policy_version 7870 (0.0006) [2023-03-07 07:45:57,345][155452] Updated weights for policy 0, policy_version 7880 (0.0006) [2023-03-07 07:45:58,147][155452] Updated weights for policy 0, policy_version 7890 (0.0006) [2023-03-07 07:45:58,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13044.7). Total num frames: 8082432. Throughput: 0: 13052.7. Samples: 8075020. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:45:58,367][155126] Avg episode reward: [(0, '2005.519')] [2023-03-07 07:45:58,925][155452] Updated weights for policy 0, policy_version 7900 (0.0006) [2023-03-07 07:45:59,708][155452] Updated weights for policy 0, policy_version 7910 (0.0007) [2023-03-07 07:46:00,471][155452] Updated weights for policy 0, policy_version 7920 (0.0005) [2023-03-07 07:46:01,281][155452] Updated weights for policy 0, policy_version 7930 (0.0005) [2023-03-07 07:46:01,905][155401] KL-divergence is very high: 12505140.0000 [2023-03-07 07:46:02,068][155452] Updated weights for policy 0, policy_version 7940 (0.0006) [2023-03-07 07:46:02,849][155452] Updated weights for policy 0, policy_version 7950 (0.0006) [2023-03-07 07:46:03,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13041.2). Total num frames: 8146944. Throughput: 0: 13053.4. Samples: 8114138. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 07:46:03,368][155126] Avg episode reward: [(0, '2110.188')] [2023-03-07 07:46:03,631][155452] Updated weights for policy 0, policy_version 7960 (0.0006) [2023-03-07 07:46:04,399][155452] Updated weights for policy 0, policy_version 7970 (0.0006) [2023-03-07 07:46:05,184][155452] Updated weights for policy 0, policy_version 7980 (0.0006) [2023-03-07 07:46:05,961][155452] Updated weights for policy 0, policy_version 7990 (0.0006) [2023-03-07 07:46:06,740][155452] Updated weights for policy 0, policy_version 8000 (0.0007) [2023-03-07 07:46:07,513][155452] Updated weights for policy 0, policy_version 8010 (0.0006) [2023-03-07 07:46:08,307][155452] Updated weights for policy 0, policy_version 8020 (0.0006) [2023-03-07 07:46:08,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13041.3). Total num frames: 8212480. Throughput: 0: 13066.8. Samples: 8192705. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:46:08,378][155126] Avg episode reward: [(0, '2199.515')] [2023-03-07 07:46:09,093][155452] Updated weights for policy 0, policy_version 8030 (0.0007) [2023-03-07 07:46:09,873][155452] Updated weights for policy 0, policy_version 8040 (0.0006) [2023-03-07 07:46:10,647][155452] Updated weights for policy 0, policy_version 8050 (0.0006) [2023-03-07 07:46:11,429][155452] Updated weights for policy 0, policy_version 8060 (0.0006) [2023-03-07 07:46:12,226][155452] Updated weights for policy 0, policy_version 8070 (0.0006) [2023-03-07 07:46:13,009][155452] Updated weights for policy 0, policy_version 8080 (0.0006) [2023-03-07 07:46:13,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13041.3). Total num frames: 8278016. Throughput: 0: 13063.3. Samples: 8271177. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:46:13,367][155126] Avg episode reward: [(0, '2337.649')] [2023-03-07 07:46:13,804][155452] Updated weights for policy 0, policy_version 8090 (0.0006) [2023-03-07 07:46:14,596][155452] Updated weights for policy 0, policy_version 8100 (0.0007) [2023-03-07 07:46:15,377][155452] Updated weights for policy 0, policy_version 8110 (0.0006) [2023-03-07 07:46:16,156][155452] Updated weights for policy 0, policy_version 8120 (0.0006) [2023-03-07 07:46:16,920][155452] Updated weights for policy 0, policy_version 8130 (0.0006) [2023-03-07 07:46:17,701][155452] Updated weights for policy 0, policy_version 8140 (0.0005) [2023-03-07 07:46:18,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13073.0, 300 sec: 13044.7). Total num frames: 8343552. Throughput: 0: 13057.8. Samples: 8310235. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:46:18,368][155126] Avg episode reward: [(0, '2312.198')] [2023-03-07 07:46:18,490][155452] Updated weights for policy 0, policy_version 8150 (0.0007) [2023-03-07 07:46:19,278][155452] Updated weights for policy 0, policy_version 8160 (0.0006) [2023-03-07 07:46:20,053][155452] Updated weights for policy 0, policy_version 8170 (0.0006) [2023-03-07 07:46:20,847][155452] Updated weights for policy 0, policy_version 8180 (0.0006) [2023-03-07 07:46:21,633][155452] Updated weights for policy 0, policy_version 8190 (0.0006) [2023-03-07 07:46:22,431][155452] Updated weights for policy 0, policy_version 8200 (0.0006) [2023-03-07 07:46:23,225][155452] Updated weights for policy 0, policy_version 8210 (0.0007) [2023-03-07 07:46:23,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13041.3). Total num frames: 8408064. Throughput: 0: 13059.0. Samples: 8388559. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:46:23,367][155126] Avg episode reward: [(0, '2119.972')] [2023-03-07 07:46:23,998][155452] Updated weights for policy 0, policy_version 8220 (0.0006) [2023-03-07 07:46:24,790][155452] Updated weights for policy 0, policy_version 8230 (0.0006) [2023-03-07 07:46:25,561][155452] Updated weights for policy 0, policy_version 8240 (0.0006) [2023-03-07 07:46:26,360][155452] Updated weights for policy 0, policy_version 8250 (0.0006) [2023-03-07 07:46:27,136][155452] Updated weights for policy 0, policy_version 8260 (0.0006) [2023-03-07 07:46:27,916][155452] Updated weights for policy 0, policy_version 8270 (0.0006) [2023-03-07 07:46:28,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13041.3). Total num frames: 8473600. Throughput: 0: 13060.9. Samples: 8466706. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:46:28,367][155126] Avg episode reward: [(0, '2122.341')] [2023-03-07 07:46:28,715][155452] Updated weights for policy 0, policy_version 8280 (0.0006) [2023-03-07 07:46:29,503][155452] Updated weights for policy 0, policy_version 8290 (0.0007) [2023-03-07 07:46:30,281][155452] Updated weights for policy 0, policy_version 8300 (0.0006) [2023-03-07 07:46:31,058][155452] Updated weights for policy 0, policy_version 8310 (0.0006) [2023-03-07 07:46:31,843][155452] Updated weights for policy 0, policy_version 8320 (0.0006) [2023-03-07 07:46:32,618][155452] Updated weights for policy 0, policy_version 8330 (0.0006) [2023-03-07 07:46:33,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13044.7). Total num frames: 8539136. Throughput: 0: 13056.8. Samples: 8505937. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:46:33,367][155126] Avg episode reward: [(0, '2256.324')] [2023-03-07 07:46:33,402][155452] Updated weights for policy 0, policy_version 8340 (0.0006) [2023-03-07 07:46:34,178][155452] Updated weights for policy 0, policy_version 8350 (0.0005) [2023-03-07 07:46:34,963][155452] Updated weights for policy 0, policy_version 8360 (0.0006) [2023-03-07 07:46:35,729][155452] Updated weights for policy 0, policy_version 8370 (0.0006) [2023-03-07 07:46:36,529][155452] Updated weights for policy 0, policy_version 8380 (0.0006) [2023-03-07 07:46:37,303][155452] Updated weights for policy 0, policy_version 8390 (0.0006) [2023-03-07 07:46:38,084][155452] Updated weights for policy 0, policy_version 8400 (0.0006) [2023-03-07 07:46:38,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 8604672. Throughput: 0: 13061.7. Samples: 8584549. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:46:38,367][155126] Avg episode reward: [(0, '1864.763')] [2023-03-07 07:46:38,873][155452] Updated weights for policy 0, policy_version 8410 (0.0006) [2023-03-07 07:46:39,653][155452] Updated weights for policy 0, policy_version 8420 (0.0006) [2023-03-07 07:46:40,455][155452] Updated weights for policy 0, policy_version 8430 (0.0006) [2023-03-07 07:46:41,230][155452] Updated weights for policy 0, policy_version 8440 (0.0006) [2023-03-07 07:46:42,029][155452] Updated weights for policy 0, policy_version 8450 (0.0006) [2023-03-07 07:46:42,821][155452] Updated weights for policy 0, policy_version 8460 (0.0006) [2023-03-07 07:46:43,367][155126] Fps is (10 sec: 13107.4, 60 sec: 13073.1, 300 sec: 13048.2). Total num frames: 8670208. Throughput: 0: 13062.7. Samples: 8662840. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:46:43,367][155126] Avg episode reward: [(0, '2125.462')] [2023-03-07 07:46:43,587][155452] Updated weights for policy 0, policy_version 8470 (0.0006) [2023-03-07 07:46:44,364][155452] Updated weights for policy 0, policy_version 8480 (0.0006) [2023-03-07 07:46:45,142][155452] Updated weights for policy 0, policy_version 8490 (0.0006) [2023-03-07 07:46:45,929][155452] Updated weights for policy 0, policy_version 8500 (0.0006) [2023-03-07 07:46:46,707][155452] Updated weights for policy 0, policy_version 8510 (0.0006) [2023-03-07 07:46:47,481][155452] Updated weights for policy 0, policy_version 8520 (0.0006) [2023-03-07 07:46:48,261][155452] Updated weights for policy 0, policy_version 8530 (0.0006) [2023-03-07 07:46:48,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13048.2). Total num frames: 8735744. Throughput: 0: 13067.5. Samples: 8702176. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:46:48,367][155126] Avg episode reward: [(0, '2385.004')] [2023-03-07 07:46:49,047][155452] Updated weights for policy 0, policy_version 8540 (0.0006) [2023-03-07 07:46:49,829][155452] Updated weights for policy 0, policy_version 8550 (0.0006) [2023-03-07 07:46:50,618][155452] Updated weights for policy 0, policy_version 8560 (0.0006) [2023-03-07 07:46:51,386][155452] Updated weights for policy 0, policy_version 8570 (0.0006) [2023-03-07 07:46:52,175][155452] Updated weights for policy 0, policy_version 8580 (0.0006) [2023-03-07 07:46:52,948][155452] Updated weights for policy 0, policy_version 8590 (0.0005) [2023-03-07 07:46:53,367][155126] Fps is (10 sec: 13106.9, 60 sec: 13073.0, 300 sec: 13048.2). Total num frames: 8801280. Throughput: 0: 13072.6. Samples: 8780973. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:46:53,367][155126] Avg episode reward: [(0, '1897.357')] [2023-03-07 07:46:53,738][155452] Updated weights for policy 0, policy_version 8600 (0.0005) [2023-03-07 07:46:54,535][155452] Updated weights for policy 0, policy_version 8610 (0.0006) [2023-03-07 07:46:55,310][155452] Updated weights for policy 0, policy_version 8620 (0.0006) [2023-03-07 07:46:55,614][155401] KL-divergence is very high: 1107.9108 [2023-03-07 07:46:56,087][155452] Updated weights for policy 0, policy_version 8630 (0.0006) [2023-03-07 07:46:56,886][155452] Updated weights for policy 0, policy_version 8640 (0.0006) [2023-03-07 07:46:57,655][155452] Updated weights for policy 0, policy_version 8650 (0.0006) [2023-03-07 07:46:58,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13048.2). Total num frames: 8866816. Throughput: 0: 13071.4. Samples: 8859392. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:46:58,368][155126] Avg episode reward: [(0, '2088.213')] [2023-03-07 07:46:58,422][155452] Updated weights for policy 0, policy_version 8660 (0.0006) [2023-03-07 07:46:59,209][155452] Updated weights for policy 0, policy_version 8670 (0.0006) [2023-03-07 07:46:59,989][155452] Updated weights for policy 0, policy_version 8680 (0.0006) [2023-03-07 07:47:00,770][155452] Updated weights for policy 0, policy_version 8690 (0.0006) [2023-03-07 07:47:01,577][155452] Updated weights for policy 0, policy_version 8700 (0.0006) [2023-03-07 07:47:02,351][155452] Updated weights for policy 0, policy_version 8710 (0.0007) [2023-03-07 07:47:03,154][155452] Updated weights for policy 0, policy_version 8720 (0.0006) [2023-03-07 07:47:03,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13073.1, 300 sec: 13048.2). Total num frames: 8931328. Throughput: 0: 13080.7. Samples: 8898864. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:47:03,367][155126] Avg episode reward: [(0, '2036.725')] [2023-03-07 07:47:03,932][155452] Updated weights for policy 0, policy_version 8730 (0.0006) [2023-03-07 07:47:04,729][155452] Updated weights for policy 0, policy_version 8740 (0.0006) [2023-03-07 07:47:05,516][155452] Updated weights for policy 0, policy_version 8750 (0.0006) [2023-03-07 07:47:06,294][155452] Updated weights for policy 0, policy_version 8760 (0.0005) [2023-03-07 07:47:07,082][155452] Updated weights for policy 0, policy_version 8770 (0.0007) [2023-03-07 07:47:07,869][155452] Updated weights for policy 0, policy_version 8780 (0.0007) [2023-03-07 07:47:08,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13073.1, 300 sec: 13048.2). Total num frames: 8996864. Throughput: 0: 13072.6. Samples: 8976826. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:47:08,368][155126] Avg episode reward: [(0, '2161.858')] [2023-03-07 07:47:08,664][155452] Updated weights for policy 0, policy_version 8790 (0.0006) [2023-03-07 07:47:09,436][155452] Updated weights for policy 0, policy_version 8800 (0.0006) [2023-03-07 07:47:10,221][155452] Updated weights for policy 0, policy_version 8810 (0.0006) [2023-03-07 07:47:11,004][155452] Updated weights for policy 0, policy_version 8820 (0.0007) [2023-03-07 07:47:11,780][155452] Updated weights for policy 0, policy_version 8830 (0.0006) [2023-03-07 07:47:12,563][155452] Updated weights for policy 0, policy_version 8840 (0.0006) [2023-03-07 07:47:13,351][155452] Updated weights for policy 0, policy_version 8850 (0.0006) [2023-03-07 07:47:13,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13051.7). Total num frames: 9062400. Throughput: 0: 13075.3. Samples: 9055097. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:47:13,367][155126] Avg episode reward: [(0, '2158.714')] [2023-03-07 07:47:14,151][155452] Updated weights for policy 0, policy_version 8860 (0.0007) [2023-03-07 07:47:14,933][155452] Updated weights for policy 0, policy_version 8870 (0.0006) [2023-03-07 07:47:15,222][155401] KL-divergence is very high: 237281.9219 [2023-03-07 07:47:15,712][155452] Updated weights for policy 0, policy_version 8880 (0.0006) [2023-03-07 07:47:16,484][155452] Updated weights for policy 0, policy_version 8890 (0.0006) [2023-03-07 07:47:17,279][155452] Updated weights for policy 0, policy_version 8900 (0.0006) [2023-03-07 07:47:18,067][155452] Updated weights for policy 0, policy_version 8910 (0.0006) [2023-03-07 07:47:18,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 9126912. Throughput: 0: 13076.6. Samples: 9094385. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 07:47:18,368][155126] Avg episode reward: [(0, '2301.035')] [2023-03-07 07:47:18,854][155452] Updated weights for policy 0, policy_version 8920 (0.0006) [2023-03-07 07:47:19,657][155452] Updated weights for policy 0, policy_version 8930 (0.0006) [2023-03-07 07:47:20,410][155452] Updated weights for policy 0, policy_version 8940 (0.0007) [2023-03-07 07:47:21,210][155401] KL-divergence is very high: 1504.5315 [2023-03-07 07:47:21,217][155452] Updated weights for policy 0, policy_version 8950 (0.0006) [2023-03-07 07:47:21,361][155401] KL-divergence is very high: 209.5002 [2023-03-07 07:47:22,006][155452] Updated weights for policy 0, policy_version 8960 (0.0006) [2023-03-07 07:47:22,782][155452] Updated weights for policy 0, policy_version 8970 (0.0007) [2023-03-07 07:47:23,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13073.1, 300 sec: 13051.7). Total num frames: 9192448. Throughput: 0: 13062.3. Samples: 9172353. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 07:47:23,367][155126] Avg episode reward: [(0, '2110.896')] [2023-03-07 07:47:23,561][155452] Updated weights for policy 0, policy_version 8980 (0.0006) [2023-03-07 07:47:24,350][155452] Updated weights for policy 0, policy_version 8990 (0.0006) [2023-03-07 07:47:25,127][155452] Updated weights for policy 0, policy_version 9000 (0.0006) [2023-03-07 07:47:25,918][155452] Updated weights for policy 0, policy_version 9010 (0.0007) [2023-03-07 07:47:26,709][155452] Updated weights for policy 0, policy_version 9020 (0.0007) [2023-03-07 07:47:27,493][155452] Updated weights for policy 0, policy_version 9030 (0.0005) [2023-03-07 07:47:28,263][155452] Updated weights for policy 0, policy_version 9040 (0.0006) [2023-03-07 07:47:28,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13051.7). Total num frames: 9257984. Throughput: 0: 13062.7. Samples: 9250663. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:47:28,367][155126] Avg episode reward: [(0, '2049.884')] [2023-03-07 07:47:28,372][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000009041_9257984.pth... [2023-03-07 07:47:28,403][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000005982_6125568.pth [2023-03-07 07:47:29,057][155452] Updated weights for policy 0, policy_version 9050 (0.0006) [2023-03-07 07:47:29,825][155452] Updated weights for policy 0, policy_version 9060 (0.0006) [2023-03-07 07:47:30,611][155452] Updated weights for policy 0, policy_version 9070 (0.0007) [2023-03-07 07:47:31,395][155452] Updated weights for policy 0, policy_version 9080 (0.0006) [2023-03-07 07:47:32,185][155452] Updated weights for policy 0, policy_version 9090 (0.0006) [2023-03-07 07:47:32,964][155452] Updated weights for policy 0, policy_version 9100 (0.0006) [2023-03-07 07:47:33,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13051.7). Total num frames: 9323520. Throughput: 0: 13062.8. Samples: 9289999. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:47:33,367][155126] Avg episode reward: [(0, '2057.996')] [2023-03-07 07:47:33,754][155452] Updated weights for policy 0, policy_version 9110 (0.0006) [2023-03-07 07:47:34,540][155452] Updated weights for policy 0, policy_version 9120 (0.0006) [2023-03-07 07:47:35,323][155452] Updated weights for policy 0, policy_version 9130 (0.0006) [2023-03-07 07:47:36,095][155452] Updated weights for policy 0, policy_version 9140 (0.0007) [2023-03-07 07:47:36,882][155452] Updated weights for policy 0, policy_version 9150 (0.0007) [2023-03-07 07:47:37,676][155452] Updated weights for policy 0, policy_version 9160 (0.0007) [2023-03-07 07:47:38,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 9388032. Throughput: 0: 13053.2. Samples: 9368368. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:47:38,367][155126] Avg episode reward: [(0, '2073.735')] [2023-03-07 07:47:38,473][155452] Updated weights for policy 0, policy_version 9170 (0.0005) [2023-03-07 07:47:39,261][155452] Updated weights for policy 0, policy_version 9180 (0.0007) [2023-03-07 07:47:40,046][155452] Updated weights for policy 0, policy_version 9190 (0.0006) [2023-03-07 07:47:40,834][155452] Updated weights for policy 0, policy_version 9200 (0.0006) [2023-03-07 07:47:41,616][155452] Updated weights for policy 0, policy_version 9210 (0.0007) [2023-03-07 07:47:42,398][155452] Updated weights for policy 0, policy_version 9220 (0.0007) [2023-03-07 07:47:43,029][155401] KL-divergence is very high: 1363.5051 [2023-03-07 07:47:43,192][155452] Updated weights for policy 0, policy_version 9230 (0.0006) [2023-03-07 07:47:43,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 9453568. Throughput: 0: 13044.6. Samples: 9446396. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:47:43,367][155126] Avg episode reward: [(0, '1838.598')] [2023-03-07 07:47:43,973][155452] Updated weights for policy 0, policy_version 9240 (0.0007) [2023-03-07 07:47:44,754][155452] Updated weights for policy 0, policy_version 9250 (0.0006) [2023-03-07 07:47:45,541][155452] Updated weights for policy 0, policy_version 9260 (0.0006) [2023-03-07 07:47:46,329][155452] Updated weights for policy 0, policy_version 9270 (0.0007) [2023-03-07 07:47:47,113][155452] Updated weights for policy 0, policy_version 9280 (0.0005) [2023-03-07 07:47:47,901][155452] Updated weights for policy 0, policy_version 9290 (0.0006) [2023-03-07 07:47:48,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13048.2). Total num frames: 9518080. Throughput: 0: 13034.4. Samples: 9485413. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 07:47:48,368][155126] Avg episode reward: [(0, '1848.240')] [2023-03-07 07:47:48,681][155452] Updated weights for policy 0, policy_version 9300 (0.0008) [2023-03-07 07:47:49,473][155452] Updated weights for policy 0, policy_version 9310 (0.0007) [2023-03-07 07:47:50,251][155452] Updated weights for policy 0, policy_version 9320 (0.0006) [2023-03-07 07:47:51,048][155452] Updated weights for policy 0, policy_version 9330 (0.0007) [2023-03-07 07:47:51,837][155452] Updated weights for policy 0, policy_version 9340 (0.0007) [2023-03-07 07:47:52,608][155452] Updated weights for policy 0, policy_version 9350 (0.0006) [2023-03-07 07:47:53,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13048.2). Total num frames: 9583616. Throughput: 0: 13042.5. Samples: 9563739. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 07:47:53,367][155126] Avg episode reward: [(0, '1838.348')] [2023-03-07 07:47:53,389][155452] Updated weights for policy 0, policy_version 9360 (0.0006) [2023-03-07 07:47:54,182][155452] Updated weights for policy 0, policy_version 9370 (0.0006) [2023-03-07 07:47:54,970][155452] Updated weights for policy 0, policy_version 9380 (0.0006) [2023-03-07 07:47:55,751][155452] Updated weights for policy 0, policy_version 9390 (0.0006) [2023-03-07 07:47:56,540][155452] Updated weights for policy 0, policy_version 9400 (0.0006) [2023-03-07 07:47:57,314][155452] Updated weights for policy 0, policy_version 9410 (0.0007) [2023-03-07 07:47:58,094][155452] Updated weights for policy 0, policy_version 9420 (0.0006) [2023-03-07 07:47:58,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13048.2). Total num frames: 9649152. Throughput: 0: 13046.4. Samples: 9642186. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 07:47:58,367][155126] Avg episode reward: [(0, '1628.842')] [2023-03-07 07:47:58,881][155452] Updated weights for policy 0, policy_version 9430 (0.0006) [2023-03-07 07:47:59,662][155452] Updated weights for policy 0, policy_version 9440 (0.0006) [2023-03-07 07:48:00,442][155452] Updated weights for policy 0, policy_version 9450 (0.0006) [2023-03-07 07:48:01,233][155452] Updated weights for policy 0, policy_version 9460 (0.0006) [2023-03-07 07:48:02,011][155452] Updated weights for policy 0, policy_version 9470 (0.0006) [2023-03-07 07:48:02,782][155452] Updated weights for policy 0, policy_version 9480 (0.0006) [2023-03-07 07:48:03,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 9714688. Throughput: 0: 13044.1. Samples: 9681372. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:48:03,378][155126] Avg episode reward: [(0, '1833.316')] [2023-03-07 07:48:03,588][155452] Updated weights for policy 0, policy_version 9490 (0.0007) [2023-03-07 07:48:04,380][155452] Updated weights for policy 0, policy_version 9500 (0.0007) [2023-03-07 07:48:05,162][155452] Updated weights for policy 0, policy_version 9510 (0.0006) [2023-03-07 07:48:05,951][155452] Updated weights for policy 0, policy_version 9520 (0.0006) [2023-03-07 07:48:06,732][155452] Updated weights for policy 0, policy_version 9530 (0.0007) [2023-03-07 07:48:07,507][155452] Updated weights for policy 0, policy_version 9540 (0.0007) [2023-03-07 07:48:08,295][155452] Updated weights for policy 0, policy_version 9550 (0.0007) [2023-03-07 07:48:08,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13039.0, 300 sec: 13048.2). Total num frames: 9779200. Throughput: 0: 13050.7. Samples: 9759636. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:48:08,367][155126] Avg episode reward: [(0, '1974.828')] [2023-03-07 07:48:09,078][155452] Updated weights for policy 0, policy_version 9560 (0.0006) [2023-03-07 07:48:09,861][155452] Updated weights for policy 0, policy_version 9570 (0.0007) [2023-03-07 07:48:10,645][155452] Updated weights for policy 0, policy_version 9580 (0.0005) [2023-03-07 07:48:11,432][155452] Updated weights for policy 0, policy_version 9590 (0.0006) [2023-03-07 07:48:12,223][155452] Updated weights for policy 0, policy_version 9600 (0.0006) [2023-03-07 07:48:12,985][155452] Updated weights for policy 0, policy_version 9610 (0.0006) [2023-03-07 07:48:13,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 9844736. Throughput: 0: 13051.5. Samples: 9837983. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 07:48:13,368][155126] Avg episode reward: [(0, '1712.332')] [2023-03-07 07:48:13,780][155452] Updated weights for policy 0, policy_version 9620 (0.0007) [2023-03-07 07:48:14,538][155452] Updated weights for policy 0, policy_version 9630 (0.0005) [2023-03-07 07:48:15,306][155452] Updated weights for policy 0, policy_version 9640 (0.0006) [2023-03-07 07:48:16,100][155452] Updated weights for policy 0, policy_version 9650 (0.0006) [2023-03-07 07:48:16,869][155452] Updated weights for policy 0, policy_version 9660 (0.0006) [2023-03-07 07:48:17,647][155452] Updated weights for policy 0, policy_version 9670 (0.0006) [2023-03-07 07:48:18,367][155126] Fps is (10 sec: 13209.5, 60 sec: 13073.1, 300 sec: 13055.1). Total num frames: 9911296. Throughput: 0: 13058.6. Samples: 9877635. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 07:48:18,368][155126] Avg episode reward: [(0, '1623.606')] [2023-03-07 07:48:18,432][155452] Updated weights for policy 0, policy_version 9680 (0.0006) [2023-03-07 07:48:19,213][155452] Updated weights for policy 0, policy_version 9690 (0.0006) [2023-03-07 07:48:20,015][155452] Updated weights for policy 0, policy_version 9700 (0.0005) [2023-03-07 07:48:20,813][155452] Updated weights for policy 0, policy_version 9710 (0.0006) [2023-03-07 07:48:21,593][155452] Updated weights for policy 0, policy_version 9720 (0.0006) [2023-03-07 07:48:22,358][155452] Updated weights for policy 0, policy_version 9730 (0.0006) [2023-03-07 07:48:23,164][155452] Updated weights for policy 0, policy_version 9740 (0.0006) [2023-03-07 07:48:23,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 9975808. Throughput: 0: 13058.9. Samples: 9956018. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 07:48:23,379][155126] Avg episode reward: [(0, '1639.445')] [2023-03-07 07:48:23,954][155452] Updated weights for policy 0, policy_version 9750 (0.0006) [2023-03-07 07:48:24,737][155452] Updated weights for policy 0, policy_version 9760 (0.0007) [2023-03-07 07:48:25,523][155452] Updated weights for policy 0, policy_version 9770 (0.0007) [2023-03-07 07:48:26,296][155452] Updated weights for policy 0, policy_version 9780 (0.0006) [2023-03-07 07:48:27,083][155452] Updated weights for policy 0, policy_version 9790 (0.0005) [2023-03-07 07:48:27,866][155452] Updated weights for policy 0, policy_version 9800 (0.0006) [2023-03-07 07:48:28,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 10041344. Throughput: 0: 13061.8. Samples: 10034176. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:48:28,367][155126] Avg episode reward: [(0, '1725.329')] [2023-03-07 07:48:28,648][155452] Updated weights for policy 0, policy_version 9810 (0.0006) [2023-03-07 07:48:29,446][155452] Updated weights for policy 0, policy_version 9820 (0.0006) [2023-03-07 07:48:30,238][155452] Updated weights for policy 0, policy_version 9830 (0.0006) [2023-03-07 07:48:31,031][155452] Updated weights for policy 0, policy_version 9840 (0.0005) [2023-03-07 07:48:31,805][155452] Updated weights for policy 0, policy_version 9850 (0.0006) [2023-03-07 07:48:32,593][155452] Updated weights for policy 0, policy_version 9860 (0.0005) [2023-03-07 07:48:33,365][155452] Updated weights for policy 0, policy_version 9870 (0.0006) [2023-03-07 07:48:33,367][155126] Fps is (10 sec: 13107.5, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 10106880. Throughput: 0: 13060.6. Samples: 10073137. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:48:33,367][155126] Avg episode reward: [(0, '1563.255')] [2023-03-07 07:48:34,148][155452] Updated weights for policy 0, policy_version 9880 (0.0006) [2023-03-07 07:48:34,937][155452] Updated weights for policy 0, policy_version 9890 (0.0006) [2023-03-07 07:48:35,633][155401] KL-divergence is very high: 21847.5449 [2023-03-07 07:48:35,726][155452] Updated weights for policy 0, policy_version 9900 (0.0005) [2023-03-07 07:48:36,496][155452] Updated weights for policy 0, policy_version 9910 (0.0006) [2023-03-07 07:48:37,292][155452] Updated weights for policy 0, policy_version 9920 (0.0006) [2023-03-07 07:48:38,073][155452] Updated weights for policy 0, policy_version 9930 (0.0006) [2023-03-07 07:48:38,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 10171392. Throughput: 0: 13067.6. Samples: 10151782. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:48:38,367][155126] Avg episode reward: [(0, '1650.017')] [2023-03-07 07:48:38,850][155452] Updated weights for policy 0, policy_version 9940 (0.0006) [2023-03-07 07:48:39,643][155452] Updated weights for policy 0, policy_version 9950 (0.0006) [2023-03-07 07:48:40,414][155452] Updated weights for policy 0, policy_version 9960 (0.0006) [2023-03-07 07:48:41,177][155452] Updated weights for policy 0, policy_version 9970 (0.0005) [2023-03-07 07:48:41,972][155452] Updated weights for policy 0, policy_version 9980 (0.0006) [2023-03-07 07:48:42,750][155452] Updated weights for policy 0, policy_version 9990 (0.0006) [2023-03-07 07:48:43,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 10236928. Throughput: 0: 13068.2. Samples: 10230253. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 07:48:43,367][155126] Avg episode reward: [(0, '1810.323')] [2023-03-07 07:48:43,534][155452] Updated weights for policy 0, policy_version 10000 (0.0005) [2023-03-07 07:48:44,328][155452] Updated weights for policy 0, policy_version 10010 (0.0006) [2023-03-07 07:48:45,119][155452] Updated weights for policy 0, policy_version 10020 (0.0006) [2023-03-07 07:48:45,920][155452] Updated weights for policy 0, policy_version 10030 (0.0007) [2023-03-07 07:48:46,211][155401] KL-divergence is very high: 147240.7500 [2023-03-07 07:48:46,694][155452] Updated weights for policy 0, policy_version 10040 (0.0006) [2023-03-07 07:48:47,515][155452] Updated weights for policy 0, policy_version 10050 (0.0007) [2023-03-07 07:48:48,283][155452] Updated weights for policy 0, policy_version 10060 (0.0006) [2023-03-07 07:48:48,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13055.1). Total num frames: 10302464. Throughput: 0: 13062.9. Samples: 10269200. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 07:48:48,367][155126] Avg episode reward: [(0, '1640.293')] [2023-03-07 07:48:49,081][155452] Updated weights for policy 0, policy_version 10070 (0.0006) [2023-03-07 07:48:49,851][155452] Updated weights for policy 0, policy_version 10080 (0.0006) [2023-03-07 07:48:50,638][155452] Updated weights for policy 0, policy_version 10090 (0.0006) [2023-03-07 07:48:51,404][155452] Updated weights for policy 0, policy_version 10100 (0.0006) [2023-03-07 07:48:52,190][155452] Updated weights for policy 0, policy_version 10110 (0.0006) [2023-03-07 07:48:52,993][155452] Updated weights for policy 0, policy_version 10120 (0.0006) [2023-03-07 07:48:53,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13055.1). Total num frames: 10368000. Throughput: 0: 13063.8. Samples: 10347507. Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) [2023-03-07 07:48:53,367][155126] Avg episode reward: [(0, '1597.179')] [2023-03-07 07:48:53,750][155452] Updated weights for policy 0, policy_version 10130 (0.0006) [2023-03-07 07:48:54,528][155452] Updated weights for policy 0, policy_version 10140 (0.0006) [2023-03-07 07:48:55,318][155452] Updated weights for policy 0, policy_version 10150 (0.0006) [2023-03-07 07:48:56,085][155452] Updated weights for policy 0, policy_version 10160 (0.0006) [2023-03-07 07:48:56,875][155452] Updated weights for policy 0, policy_version 10170 (0.0006) [2023-03-07 07:48:57,641][155452] Updated weights for policy 0, policy_version 10180 (0.0006) [2023-03-07 07:48:58,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 10432512. Throughput: 0: 13070.5. Samples: 10426155. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:48:58,367][155126] Avg episode reward: [(0, '1524.342')] [2023-03-07 07:48:58,425][155452] Updated weights for policy 0, policy_version 10190 (0.0006) [2023-03-07 07:48:59,218][155452] Updated weights for policy 0, policy_version 10200 (0.0006) [2023-03-07 07:49:00,010][155452] Updated weights for policy 0, policy_version 10210 (0.0006) [2023-03-07 07:49:00,777][155452] Updated weights for policy 0, policy_version 10220 (0.0007) [2023-03-07 07:49:01,577][155452] Updated weights for policy 0, policy_version 10230 (0.0006) [2023-03-07 07:49:02,361][155452] Updated weights for policy 0, policy_version 10240 (0.0006) [2023-03-07 07:49:03,120][155452] Updated weights for policy 0, policy_version 10250 (0.0006) [2023-03-07 07:49:03,367][155126] Fps is (10 sec: 13107.0, 60 sec: 13073.1, 300 sec: 13058.6). Total num frames: 10499072. Throughput: 0: 13062.3. Samples: 10465442. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:49:03,368][155126] Avg episode reward: [(0, '1765.979')] [2023-03-07 07:49:03,900][155452] Updated weights for policy 0, policy_version 10260 (0.0007) [2023-03-07 07:49:04,707][155452] Updated weights for policy 0, policy_version 10270 (0.0006) [2023-03-07 07:49:05,485][155452] Updated weights for policy 0, policy_version 10280 (0.0006) [2023-03-07 07:49:06,253][155452] Updated weights for policy 0, policy_version 10290 (0.0005) [2023-03-07 07:49:07,037][155452] Updated weights for policy 0, policy_version 10300 (0.0007) [2023-03-07 07:49:07,834][155452] Updated weights for policy 0, policy_version 10310 (0.0006) [2023-03-07 07:49:08,367][155126] Fps is (10 sec: 13209.7, 60 sec: 13090.1, 300 sec: 13058.6). Total num frames: 10564608. Throughput: 0: 13067.4. Samples: 10544049. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 07:49:08,367][155126] Avg episode reward: [(0, '1820.893')] [2023-03-07 07:49:08,597][155452] Updated weights for policy 0, policy_version 10320 (0.0006) [2023-03-07 07:49:09,385][155452] Updated weights for policy 0, policy_version 10330 (0.0005) [2023-03-07 07:49:10,151][155452] Updated weights for policy 0, policy_version 10340 (0.0006) [2023-03-07 07:49:10,930][155452] Updated weights for policy 0, policy_version 10350 (0.0005) [2023-03-07 07:49:11,719][155452] Updated weights for policy 0, policy_version 10360 (0.0006) [2023-03-07 07:49:12,480][155452] Updated weights for policy 0, policy_version 10370 (0.0007) [2023-03-07 07:49:13,275][155452] Updated weights for policy 0, policy_version 10380 (0.0006) [2023-03-07 07:49:13,367][155126] Fps is (10 sec: 13107.4, 60 sec: 13090.2, 300 sec: 13058.6). Total num frames: 10630144. Throughput: 0: 13080.9. Samples: 10622816. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:49:13,367][155126] Avg episode reward: [(0, '1731.859')] [2023-03-07 07:49:14,047][155452] Updated weights for policy 0, policy_version 10390 (0.0006) [2023-03-07 07:49:14,819][155452] Updated weights for policy 0, policy_version 10400 (0.0007) [2023-03-07 07:49:15,602][155452] Updated weights for policy 0, policy_version 10410 (0.0006) [2023-03-07 07:49:16,391][155452] Updated weights for policy 0, policy_version 10420 (0.0006) [2023-03-07 07:49:17,164][155452] Updated weights for policy 0, policy_version 10430 (0.0006) [2023-03-07 07:49:17,938][155452] Updated weights for policy 0, policy_version 10440 (0.0007) [2023-03-07 07:49:18,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13073.1, 300 sec: 13058.6). Total num frames: 10695680. Throughput: 0: 13090.7. Samples: 10662218. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:49:18,367][155126] Avg episode reward: [(0, '1407.732')] [2023-03-07 07:49:18,726][155452] Updated weights for policy 0, policy_version 10450 (0.0006) [2023-03-07 07:49:19,494][155452] Updated weights for policy 0, policy_version 10460 (0.0005) [2023-03-07 07:49:20,291][155452] Updated weights for policy 0, policy_version 10470 (0.0006) [2023-03-07 07:49:21,073][155452] Updated weights for policy 0, policy_version 10480 (0.0007) [2023-03-07 07:49:21,858][155452] Updated weights for policy 0, policy_version 10490 (0.0006) [2023-03-07 07:49:22,645][155452] Updated weights for policy 0, policy_version 10500 (0.0005) [2023-03-07 07:49:23,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13090.2, 300 sec: 13062.1). Total num frames: 10761216. Throughput: 0: 13091.3. Samples: 10740890. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:49:23,367][155126] Avg episode reward: [(0, '1415.491')] [2023-03-07 07:49:23,428][155452] Updated weights for policy 0, policy_version 10510 (0.0005) [2023-03-07 07:49:24,195][155452] Updated weights for policy 0, policy_version 10520 (0.0007) [2023-03-07 07:49:25,007][155452] Updated weights for policy 0, policy_version 10530 (0.0007) [2023-03-07 07:49:25,773][155452] Updated weights for policy 0, policy_version 10540 (0.0006) [2023-03-07 07:49:26,556][155452] Updated weights for policy 0, policy_version 10550 (0.0007) [2023-03-07 07:49:27,327][155452] Updated weights for policy 0, policy_version 10560 (0.0006) [2023-03-07 07:49:28,102][155452] Updated weights for policy 0, policy_version 10570 (0.0007) [2023-03-07 07:49:28,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13090.1, 300 sec: 13062.1). Total num frames: 10826752. Throughput: 0: 13096.9. Samples: 10819614. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:49:28,367][155126] Avg episode reward: [(0, '1480.242')] [2023-03-07 07:49:28,372][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000010573_10826752.pth... [2023-03-07 07:49:28,402][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000007510_7690240.pth [2023-03-07 07:49:28,885][155452] Updated weights for policy 0, policy_version 10580 (0.0006) [2023-03-07 07:49:29,661][155452] Updated weights for policy 0, policy_version 10590 (0.0005) [2023-03-07 07:49:30,439][155452] Updated weights for policy 0, policy_version 10600 (0.0006) [2023-03-07 07:49:31,240][155452] Updated weights for policy 0, policy_version 10610 (0.0006) [2023-03-07 07:49:32,018][155452] Updated weights for policy 0, policy_version 10620 (0.0006) [2023-03-07 07:49:32,791][155452] Updated weights for policy 0, policy_version 10630 (0.0006) [2023-03-07 07:49:33,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13090.1, 300 sec: 13062.1). Total num frames: 10892288. Throughput: 0: 13108.7. Samples: 10859090. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:49:33,367][155126] Avg episode reward: [(0, '1478.179')] [2023-03-07 07:49:33,583][155452] Updated weights for policy 0, policy_version 10640 (0.0006) [2023-03-07 07:49:34,352][155452] Updated weights for policy 0, policy_version 10650 (0.0007) [2023-03-07 07:49:35,146][155452] Updated weights for policy 0, policy_version 10660 (0.0007) [2023-03-07 07:49:35,934][155452] Updated weights for policy 0, policy_version 10670 (0.0007) [2023-03-07 07:49:36,700][155452] Updated weights for policy 0, policy_version 10680 (0.0006) [2023-03-07 07:49:37,489][155452] Updated weights for policy 0, policy_version 10690 (0.0006) [2023-03-07 07:49:38,290][155452] Updated weights for policy 0, policy_version 10700 (0.0006) [2023-03-07 07:49:38,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13062.1). Total num frames: 10957824. Throughput: 0: 13109.2. Samples: 10937423. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:49:38,367][155126] Avg episode reward: [(0, '1590.939')] [2023-03-07 07:49:39,051][155452] Updated weights for policy 0, policy_version 10710 (0.0006) [2023-03-07 07:49:39,838][155452] Updated weights for policy 0, policy_version 10720 (0.0007) [2023-03-07 07:49:40,619][155452] Updated weights for policy 0, policy_version 10730 (0.0007) [2023-03-07 07:49:41,404][155452] Updated weights for policy 0, policy_version 10740 (0.0006) [2023-03-07 07:49:42,173][155452] Updated weights for policy 0, policy_version 10750 (0.0006) [2023-03-07 07:49:42,996][155452] Updated weights for policy 0, policy_version 10760 (0.0006) [2023-03-07 07:49:43,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13090.2, 300 sec: 13058.6). Total num frames: 11022336. Throughput: 0: 13104.7. Samples: 11015865. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:49:43,367][155126] Avg episode reward: [(0, '1471.026')] [2023-03-07 07:49:43,759][155452] Updated weights for policy 0, policy_version 10770 (0.0006) [2023-03-07 07:49:44,533][155452] Updated weights for policy 0, policy_version 10780 (0.0006) [2023-03-07 07:49:45,319][155452] Updated weights for policy 0, policy_version 10790 (0.0007) [2023-03-07 07:49:46,099][155452] Updated weights for policy 0, policy_version 10800 (0.0006) [2023-03-07 07:49:46,883][155452] Updated weights for policy 0, policy_version 10810 (0.0006) [2023-03-07 07:49:47,654][155452] Updated weights for policy 0, policy_version 10820 (0.0006) [2023-03-07 07:49:48,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13090.1, 300 sec: 13062.1). Total num frames: 11087872. Throughput: 0: 13105.3. Samples: 11055181. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 07:49:48,368][155126] Avg episode reward: [(0, '1599.535')] [2023-03-07 07:49:48,433][155452] Updated weights for policy 0, policy_version 10830 (0.0007) [2023-03-07 07:49:49,237][155452] Updated weights for policy 0, policy_version 10840 (0.0006) [2023-03-07 07:49:50,003][155452] Updated weights for policy 0, policy_version 10850 (0.0006) [2023-03-07 07:49:50,805][155452] Updated weights for policy 0, policy_version 10860 (0.0006) [2023-03-07 07:49:51,578][155452] Updated weights for policy 0, policy_version 10870 (0.0007) [2023-03-07 07:49:52,353][155452] Updated weights for policy 0, policy_version 10880 (0.0006) [2023-03-07 07:49:53,153][155452] Updated weights for policy 0, policy_version 10890 (0.0005) [2023-03-07 07:49:53,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13090.1, 300 sec: 13065.5). Total num frames: 11153408. Throughput: 0: 13104.3. Samples: 11133743. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 07:49:53,367][155126] Avg episode reward: [(0, '1610.267')] [2023-03-07 07:49:53,940][155452] Updated weights for policy 0, policy_version 10900 (0.0006) [2023-03-07 07:49:54,721][155452] Updated weights for policy 0, policy_version 10910 (0.0006) [2023-03-07 07:49:55,508][155452] Updated weights for policy 0, policy_version 10920 (0.0007) [2023-03-07 07:49:56,294][155452] Updated weights for policy 0, policy_version 10930 (0.0007) [2023-03-07 07:49:57,075][155452] Updated weights for policy 0, policy_version 10940 (0.0007) [2023-03-07 07:49:57,854][155452] Updated weights for policy 0, policy_version 10950 (0.0006) [2023-03-07 07:49:58,367][155126] Fps is (10 sec: 13107.4, 60 sec: 13107.2, 300 sec: 13065.6). Total num frames: 11218944. Throughput: 0: 13090.2. Samples: 11211876. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:49:58,367][155126] Avg episode reward: [(0, '1618.446')] [2023-03-07 07:49:58,646][155452] Updated weights for policy 0, policy_version 10960 (0.0006) [2023-03-07 07:49:59,446][155452] Updated weights for policy 0, policy_version 10970 (0.0006) [2023-03-07 07:50:00,231][155452] Updated weights for policy 0, policy_version 10980 (0.0006) [2023-03-07 07:50:01,012][155452] Updated weights for policy 0, policy_version 10990 (0.0006) [2023-03-07 07:50:01,770][155452] Updated weights for policy 0, policy_version 11000 (0.0006) [2023-03-07 07:50:02,564][155452] Updated weights for policy 0, policy_version 11010 (0.0006) [2023-03-07 07:50:03,342][155452] Updated weights for policy 0, policy_version 11020 (0.0005) [2023-03-07 07:50:03,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13090.1, 300 sec: 13069.0). Total num frames: 11284480. Throughput: 0: 13080.8. Samples: 11250856. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 2.0) [2023-03-07 07:50:03,368][155126] Avg episode reward: [(0, '1538.828')] [2023-03-07 07:50:04,124][155452] Updated weights for policy 0, policy_version 11030 (0.0006) [2023-03-07 07:50:04,914][155452] Updated weights for policy 0, policy_version 11040 (0.0006) [2023-03-07 07:50:05,699][155452] Updated weights for policy 0, policy_version 11050 (0.0006) [2023-03-07 07:50:06,486][155452] Updated weights for policy 0, policy_version 11060 (0.0005) [2023-03-07 07:50:07,252][155452] Updated weights for policy 0, policy_version 11070 (0.0006) [2023-03-07 07:50:08,041][155452] Updated weights for policy 0, policy_version 11080 (0.0007) [2023-03-07 07:50:08,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13090.1, 300 sec: 13069.0). Total num frames: 11350016. Throughput: 0: 13082.5. Samples: 11329603. Policy #0 lag: (min: 0.0, avg: 1.3, max: 2.0) [2023-03-07 07:50:08,367][155126] Avg episode reward: [(0, '1676.698')] [2023-03-07 07:50:08,805][155452] Updated weights for policy 0, policy_version 11090 (0.0006) [2023-03-07 07:50:09,601][155452] Updated weights for policy 0, policy_version 11100 (0.0006) [2023-03-07 07:50:10,383][155452] Updated weights for policy 0, policy_version 11110 (0.0006) [2023-03-07 07:50:11,178][155452] Updated weights for policy 0, policy_version 11120 (0.0006) [2023-03-07 07:50:11,970][155452] Updated weights for policy 0, policy_version 11130 (0.0006) [2023-03-07 07:50:12,753][155452] Updated weights for policy 0, policy_version 11140 (0.0006) [2023-03-07 07:50:13,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13073.0, 300 sec: 13069.0). Total num frames: 11414528. Throughput: 0: 13072.6. Samples: 11407883. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 07:50:13,367][155126] Avg episode reward: [(0, '1646.622')] [2023-03-07 07:50:13,543][155452] Updated weights for policy 0, policy_version 11150 (0.0006) [2023-03-07 07:50:14,309][155452] Updated weights for policy 0, policy_version 11160 (0.0006) [2023-03-07 07:50:15,104][155452] Updated weights for policy 0, policy_version 11170 (0.0006) [2023-03-07 07:50:15,874][155452] Updated weights for policy 0, policy_version 11180 (0.0007) [2023-03-07 07:50:16,657][155452] Updated weights for policy 0, policy_version 11190 (0.0006) [2023-03-07 07:50:17,467][155452] Updated weights for policy 0, policy_version 11200 (0.0007) [2023-03-07 07:50:18,245][155452] Updated weights for policy 0, policy_version 11210 (0.0006) [2023-03-07 07:50:18,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13073.1, 300 sec: 13069.0). Total num frames: 11480064. Throughput: 0: 13071.4. Samples: 11447304. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 07:50:18,367][155126] Avg episode reward: [(0, '1582.565')] [2023-03-07 07:50:19,026][155452] Updated weights for policy 0, policy_version 11220 (0.0005) [2023-03-07 07:50:19,816][155452] Updated weights for policy 0, policy_version 11230 (0.0007) [2023-03-07 07:50:20,578][155452] Updated weights for policy 0, policy_version 11240 (0.0008) [2023-03-07 07:50:21,361][155452] Updated weights for policy 0, policy_version 11250 (0.0006) [2023-03-07 07:50:22,143][155452] Updated weights for policy 0, policy_version 11260 (0.0006) [2023-03-07 07:50:22,914][155452] Updated weights for policy 0, policy_version 11270 (0.0006) [2023-03-07 07:50:23,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13073.0, 300 sec: 13069.0). Total num frames: 11545600. Throughput: 0: 13071.7. Samples: 11525649. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:50:23,368][155126] Avg episode reward: [(0, '1183.811')] [2023-03-07 07:50:23,718][155452] Updated weights for policy 0, policy_version 11280 (0.0006) [2023-03-07 07:50:24,504][155452] Updated weights for policy 0, policy_version 11290 (0.0006) [2023-03-07 07:50:25,285][155452] Updated weights for policy 0, policy_version 11300 (0.0006) [2023-03-07 07:50:26,069][155452] Updated weights for policy 0, policy_version 11310 (0.0007) [2023-03-07 07:50:26,873][155452] Updated weights for policy 0, policy_version 11320 (0.0006) [2023-03-07 07:50:27,661][155452] Updated weights for policy 0, policy_version 11330 (0.0006) [2023-03-07 07:50:28,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13069.0). Total num frames: 11611136. Throughput: 0: 13062.9. Samples: 11603699. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:50:28,367][155126] Avg episode reward: [(0, '1890.005')] [2023-03-07 07:50:28,424][155452] Updated weights for policy 0, policy_version 11340 (0.0006) [2023-03-07 07:50:29,211][155452] Updated weights for policy 0, policy_version 11350 (0.0006) [2023-03-07 07:50:30,015][155452] Updated weights for policy 0, policy_version 11360 (0.0006) [2023-03-07 07:50:30,786][155452] Updated weights for policy 0, policy_version 11370 (0.0006) [2023-03-07 07:50:31,558][155452] Updated weights for policy 0, policy_version 11380 (0.0007) [2023-03-07 07:50:32,352][155452] Updated weights for policy 0, policy_version 11390 (0.0007) [2023-03-07 07:50:33,114][155452] Updated weights for policy 0, policy_version 11400 (0.0005) [2023-03-07 07:50:33,367][155126] Fps is (10 sec: 13107.4, 60 sec: 13073.1, 300 sec: 13069.0). Total num frames: 11676672. Throughput: 0: 13064.3. Samples: 11643073. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:50:33,367][155126] Avg episode reward: [(0, '1508.631')] [2023-03-07 07:50:33,894][155452] Updated weights for policy 0, policy_version 11410 (0.0006) [2023-03-07 07:50:34,702][155452] Updated weights for policy 0, policy_version 11420 (0.0006) [2023-03-07 07:50:35,497][155452] Updated weights for policy 0, policy_version 11430 (0.0006) [2023-03-07 07:50:36,291][155452] Updated weights for policy 0, policy_version 11440 (0.0006) [2023-03-07 07:50:37,078][155452] Updated weights for policy 0, policy_version 11450 (0.0006) [2023-03-07 07:50:37,855][155452] Updated weights for policy 0, policy_version 11460 (0.0006) [2023-03-07 07:50:38,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13069.0). Total num frames: 11741184. Throughput: 0: 13057.0. Samples: 11721306. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:50:38,367][155126] Avg episode reward: [(0, '1646.820')] [2023-03-07 07:50:38,631][155452] Updated weights for policy 0, policy_version 11470 (0.0006) [2023-03-07 07:50:39,424][155452] Updated weights for policy 0, policy_version 11480 (0.0006) [2023-03-07 07:50:40,190][155452] Updated weights for policy 0, policy_version 11490 (0.0007) [2023-03-07 07:50:40,955][155452] Updated weights for policy 0, policy_version 11500 (0.0005) [2023-03-07 07:50:41,742][155452] Updated weights for policy 0, policy_version 11510 (0.0007) [2023-03-07 07:50:42,543][155452] Updated weights for policy 0, policy_version 11520 (0.0006) [2023-03-07 07:50:43,321][155452] Updated weights for policy 0, policy_version 11530 (0.0006) [2023-03-07 07:50:43,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13073.0, 300 sec: 13069.0). Total num frames: 11806720. Throughput: 0: 13065.4. Samples: 11799817. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:50:43,367][155126] Avg episode reward: [(0, '1296.939')] [2023-03-07 07:50:44,102][155452] Updated weights for policy 0, policy_version 11540 (0.0006) [2023-03-07 07:50:44,896][155452] Updated weights for policy 0, policy_version 11550 (0.0005) [2023-03-07 07:50:45,675][155452] Updated weights for policy 0, policy_version 11560 (0.0006) [2023-03-07 07:50:46,465][155452] Updated weights for policy 0, policy_version 11570 (0.0007) [2023-03-07 07:50:47,245][155452] Updated weights for policy 0, policy_version 11580 (0.0006) [2023-03-07 07:50:48,030][155452] Updated weights for policy 0, policy_version 11590 (0.0006) [2023-03-07 07:50:48,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13073.1, 300 sec: 13069.0). Total num frames: 11872256. Throughput: 0: 13071.4. Samples: 11839069. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:50:48,367][155126] Avg episode reward: [(0, '1229.100')] [2023-03-07 07:50:48,801][155452] Updated weights for policy 0, policy_version 11600 (0.0007) [2023-03-07 07:50:49,578][155452] Updated weights for policy 0, policy_version 11610 (0.0006) [2023-03-07 07:50:50,399][155452] Updated weights for policy 0, policy_version 11620 (0.0006) [2023-03-07 07:50:51,164][155452] Updated weights for policy 0, policy_version 11630 (0.0006) [2023-03-07 07:50:51,954][155452] Updated weights for policy 0, policy_version 11640 (0.0006) [2023-03-07 07:50:52,727][155452] Updated weights for policy 0, policy_version 11650 (0.0007) [2023-03-07 07:50:53,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13065.5). Total num frames: 11936768. Throughput: 0: 13058.5. Samples: 11917234. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:50:53,367][155126] Avg episode reward: [(0, '1806.244')] [2023-03-07 07:50:53,534][155452] Updated weights for policy 0, policy_version 11660 (0.0006) [2023-03-07 07:50:54,299][155452] Updated weights for policy 0, policy_version 11670 (0.0006) [2023-03-07 07:50:55,070][155452] Updated weights for policy 0, policy_version 11680 (0.0006) [2023-03-07 07:50:55,866][155452] Updated weights for policy 0, policy_version 11690 (0.0006) [2023-03-07 07:50:56,641][155452] Updated weights for policy 0, policy_version 11700 (0.0006) [2023-03-07 07:50:57,425][155452] Updated weights for policy 0, policy_version 11710 (0.0006) [2023-03-07 07:50:58,203][155452] Updated weights for policy 0, policy_version 11720 (0.0006) [2023-03-07 07:50:58,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13069.0). Total num frames: 12002304. Throughput: 0: 13067.2. Samples: 11995909. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:50:58,367][155126] Avg episode reward: [(0, '1543.858')] [2023-03-07 07:50:59,006][155452] Updated weights for policy 0, policy_version 11730 (0.0006) [2023-03-07 07:50:59,779][155452] Updated weights for policy 0, policy_version 11740 (0.0006) [2023-03-07 07:51:00,558][155452] Updated weights for policy 0, policy_version 11750 (0.0006) [2023-03-07 07:51:01,349][155452] Updated weights for policy 0, policy_version 11760 (0.0006) [2023-03-07 07:51:02,142][155452] Updated weights for policy 0, policy_version 11770 (0.0006) [2023-03-07 07:51:02,912][155452] Updated weights for policy 0, policy_version 11780 (0.0006) [2023-03-07 07:51:03,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13069.0). Total num frames: 12067840. Throughput: 0: 13060.6. Samples: 12035033. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 07:51:03,367][155126] Avg episode reward: [(0, '1615.372')] [2023-03-07 07:51:03,708][155452] Updated weights for policy 0, policy_version 11790 (0.0007) [2023-03-07 07:51:04,501][155452] Updated weights for policy 0, policy_version 11800 (0.0005) [2023-03-07 07:51:05,278][155452] Updated weights for policy 0, policy_version 11810 (0.0007) [2023-03-07 07:51:06,049][155452] Updated weights for policy 0, policy_version 11820 (0.0006) [2023-03-07 07:51:06,840][155452] Updated weights for policy 0, policy_version 11830 (0.0006) [2023-03-07 07:51:07,625][155452] Updated weights for policy 0, policy_version 11840 (0.0006) [2023-03-07 07:51:08,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13069.0). Total num frames: 12133376. Throughput: 0: 13058.0. Samples: 12113257. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 07:51:08,367][155126] Avg episode reward: [(0, '1413.267')] [2023-03-07 07:51:08,407][155452] Updated weights for policy 0, policy_version 11850 (0.0006) [2023-03-07 07:51:09,190][155452] Updated weights for policy 0, policy_version 11860 (0.0005) [2023-03-07 07:51:09,988][155452] Updated weights for policy 0, policy_version 11870 (0.0006) [2023-03-07 07:51:10,761][155452] Updated weights for policy 0, policy_version 11880 (0.0006) [2023-03-07 07:51:11,538][155452] Updated weights for policy 0, policy_version 11890 (0.0006) [2023-03-07 07:51:12,338][155452] Updated weights for policy 0, policy_version 11900 (0.0005) [2023-03-07 07:51:13,106][155452] Updated weights for policy 0, policy_version 11910 (0.0006) [2023-03-07 07:51:13,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13069.0). Total num frames: 12198912. Throughput: 0: 13066.2. Samples: 12191677. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:51:13,367][155126] Avg episode reward: [(0, '1306.471')] [2023-03-07 07:51:13,896][155452] Updated weights for policy 0, policy_version 11920 (0.0006) [2023-03-07 07:51:14,657][155452] Updated weights for policy 0, policy_version 11930 (0.0006) [2023-03-07 07:51:15,453][155452] Updated weights for policy 0, policy_version 11940 (0.0007) [2023-03-07 07:51:16,234][155452] Updated weights for policy 0, policy_version 11950 (0.0006) [2023-03-07 07:51:17,026][155452] Updated weights for policy 0, policy_version 11960 (0.0008) [2023-03-07 07:51:17,813][155452] Updated weights for policy 0, policy_version 11970 (0.0006) [2023-03-07 07:51:18,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13069.0). Total num frames: 12263424. Throughput: 0: 13065.3. Samples: 12231012. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:51:18,367][155126] Avg episode reward: [(0, '1793.825')] [2023-03-07 07:51:18,601][155452] Updated weights for policy 0, policy_version 11980 (0.0005) [2023-03-07 07:51:19,386][155452] Updated weights for policy 0, policy_version 11990 (0.0007) [2023-03-07 07:51:20,180][155452] Updated weights for policy 0, policy_version 12000 (0.0007) [2023-03-07 07:51:20,961][155452] Updated weights for policy 0, policy_version 12010 (0.0006) [2023-03-07 07:51:21,761][155452] Updated weights for policy 0, policy_version 12020 (0.0006) [2023-03-07 07:51:22,535][155452] Updated weights for policy 0, policy_version 12030 (0.0006) [2023-03-07 07:51:23,315][155452] Updated weights for policy 0, policy_version 12040 (0.0006) [2023-03-07 07:51:23,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13069.0). Total num frames: 12328960. Throughput: 0: 13060.3. Samples: 12309021. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 07:51:23,367][155126] Avg episode reward: [(0, '1665.249')] [2023-03-07 07:51:24,098][155452] Updated weights for policy 0, policy_version 12050 (0.0007) [2023-03-07 07:51:24,899][155452] Updated weights for policy 0, policy_version 12060 (0.0006) [2023-03-07 07:51:25,676][155452] Updated weights for policy 0, policy_version 12070 (0.0006) [2023-03-07 07:51:26,479][155452] Updated weights for policy 0, policy_version 12080 (0.0006) [2023-03-07 07:51:27,266][155452] Updated weights for policy 0, policy_version 12090 (0.0006) [2023-03-07 07:51:28,049][155452] Updated weights for policy 0, policy_version 12100 (0.0006) [2023-03-07 07:51:28,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13069.0). Total num frames: 12394496. Throughput: 0: 13052.6. Samples: 12387182. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 07:51:28,367][155126] Avg episode reward: [(0, '1453.062')] [2023-03-07 07:51:28,371][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000012104_12394496.pth... [2023-03-07 07:51:28,401][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000009041_9257984.pth [2023-03-07 07:51:28,833][155452] Updated weights for policy 0, policy_version 12110 (0.0006) [2023-03-07 07:51:29,624][155452] Updated weights for policy 0, policy_version 12120 (0.0006) [2023-03-07 07:51:30,401][155452] Updated weights for policy 0, policy_version 12130 (0.0006) [2023-03-07 07:51:31,186][155452] Updated weights for policy 0, policy_version 12140 (0.0006) [2023-03-07 07:51:31,976][155452] Updated weights for policy 0, policy_version 12150 (0.0006) [2023-03-07 07:51:32,772][155452] Updated weights for policy 0, policy_version 12160 (0.0007) [2023-03-07 07:51:33,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13038.9, 300 sec: 13065.5). Total num frames: 12459008. Throughput: 0: 13049.2. 
Samples: 12426284. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 07:51:33,368][155126] Avg episode reward: [(0, '1511.509')] [2023-03-07 07:51:33,549][155452] Updated weights for policy 0, policy_version 12170 (0.0006) [2023-03-07 07:51:34,326][155452] Updated weights for policy 0, policy_version 12180 (0.0007) [2023-03-07 07:51:35,107][155452] Updated weights for policy 0, policy_version 12190 (0.0007) [2023-03-07 07:51:35,863][155452] Updated weights for policy 0, policy_version 12200 (0.0006) [2023-03-07 07:51:36,662][155452] Updated weights for policy 0, policy_version 12210 (0.0006) [2023-03-07 07:51:37,450][155452] Updated weights for policy 0, policy_version 12220 (0.0006) [2023-03-07 07:51:38,238][155452] Updated weights for policy 0, policy_version 12230 (0.0006) [2023-03-07 07:51:38,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13065.5). Total num frames: 12524544. Throughput: 0: 13054.8. Samples: 12504701. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 07:51:38,367][155126] Avg episode reward: [(0, '1499.712')] [2023-03-07 07:51:39,030][155452] Updated weights for policy 0, policy_version 12240 (0.0006) [2023-03-07 07:51:39,813][155452] Updated weights for policy 0, policy_version 12250 (0.0007) [2023-03-07 07:51:40,594][155452] Updated weights for policy 0, policy_version 12260 (0.0006) [2023-03-07 07:51:41,381][155452] Updated weights for policy 0, policy_version 12270 (0.0006) [2023-03-07 07:51:42,171][155452] Updated weights for policy 0, policy_version 12280 (0.0007) [2023-03-07 07:51:42,947][155452] Updated weights for policy 0, policy_version 12290 (0.0006) [2023-03-07 07:51:43,367][155126] Fps is (10 sec: 13107.4, 60 sec: 13056.0, 300 sec: 13065.5). Total num frames: 12590080. Throughput: 0: 13048.2. Samples: 12583079. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:51:43,367][155126] Avg episode reward: [(0, '1311.750')] [2023-03-07 07:51:43,732][155452] Updated weights for policy 0, policy_version 12300 (0.0006) [2023-03-07 07:51:44,521][155452] Updated weights for policy 0, policy_version 12310 (0.0006) [2023-03-07 07:51:45,321][155452] Updated weights for policy 0, policy_version 12320 (0.0006) [2023-03-07 07:51:46,108][155452] Updated weights for policy 0, policy_version 12330 (0.0006) [2023-03-07 07:51:46,880][155452] Updated weights for policy 0, policy_version 12340 (0.0006) [2023-03-07 07:51:47,649][155452] Updated weights for policy 0, policy_version 12350 (0.0007) [2023-03-07 07:51:48,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13065.6). Total num frames: 12655616. Throughput: 0: 13041.7. Samples: 12621907. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:51:48,367][155126] Avg episode reward: [(0, '1203.858')] [2023-03-07 07:51:48,433][155452] Updated weights for policy 0, policy_version 12360 (0.0006) [2023-03-07 07:51:49,211][155452] Updated weights for policy 0, policy_version 12370 (0.0005) [2023-03-07 07:51:49,991][155452] Updated weights for policy 0, policy_version 12380 (0.0006) [2023-03-07 07:51:50,757][155452] Updated weights for policy 0, policy_version 12390 (0.0006) [2023-03-07 07:51:51,561][155452] Updated weights for policy 0, policy_version 12400 (0.0006) [2023-03-07 07:51:52,316][155452] Updated weights for policy 0, policy_version 12410 (0.0005) [2023-03-07 07:51:53,089][155452] Updated weights for policy 0, policy_version 12420 (0.0006) [2023-03-07 07:51:53,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13073.1, 300 sec: 13065.5). Total num frames: 12721152. Throughput: 0: 13056.2. Samples: 12700788. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:51:53,367][155126] Avg episode reward: [(0, '1215.634')] [2023-03-07 07:51:53,889][155452] Updated weights for policy 0, policy_version 12430 (0.0005) [2023-03-07 07:51:54,669][155452] Updated weights for policy 0, policy_version 12440 (0.0005) [2023-03-07 07:51:55,446][155452] Updated weights for policy 0, policy_version 12450 (0.0006) [2023-03-07 07:51:56,218][155452] Updated weights for policy 0, policy_version 12460 (0.0007) [2023-03-07 07:51:57,000][155452] Updated weights for policy 0, policy_version 12470 (0.0007) [2023-03-07 07:51:57,795][155452] Updated weights for policy 0, policy_version 12480 (0.0005) [2023-03-07 07:51:58,367][155126] Fps is (10 sec: 13107.0, 60 sec: 13073.1, 300 sec: 13069.0). Total num frames: 12786688. Throughput: 0: 13062.8. Samples: 12779503. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:51:58,368][155126] Avg episode reward: [(0, '1384.385')] [2023-03-07 07:51:58,574][155452] Updated weights for policy 0, policy_version 12490 (0.0006) [2023-03-07 07:51:59,350][155452] Updated weights for policy 0, policy_version 12500 (0.0007) [2023-03-07 07:52:00,143][155452] Updated weights for policy 0, policy_version 12510 (0.0006) [2023-03-07 07:52:00,940][155452] Updated weights for policy 0, policy_version 12520 (0.0006) [2023-03-07 07:52:01,702][155452] Updated weights for policy 0, policy_version 12530 (0.0006) [2023-03-07 07:52:02,497][155452] Updated weights for policy 0, policy_version 12540 (0.0006) [2023-03-07 07:52:03,289][155452] Updated weights for policy 0, policy_version 12550 (0.0007) [2023-03-07 07:52:03,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13065.6). Total num frames: 12851200. Throughput: 0: 13059.8. Samples: 12818701. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:52:03,367][155126] Avg episode reward: [(0, '1214.380')] [2023-03-07 07:52:04,049][155452] Updated weights for policy 0, policy_version 12560 (0.0006) [2023-03-07 07:52:04,838][155452] Updated weights for policy 0, policy_version 12570 (0.0005) [2023-03-07 07:52:05,632][155452] Updated weights for policy 0, policy_version 12580 (0.0007) [2023-03-07 07:52:06,414][155452] Updated weights for policy 0, policy_version 12590 (0.0006) [2023-03-07 07:52:07,196][155452] Updated weights for policy 0, policy_version 12600 (0.0006) [2023-03-07 07:52:07,984][155452] Updated weights for policy 0, policy_version 12610 (0.0006) [2023-03-07 07:52:08,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13065.5). Total num frames: 12916736. Throughput: 0: 13067.8. Samples: 12897072. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:52:08,367][155126] Avg episode reward: [(0, '1392.145')] [2023-03-07 07:52:08,755][155452] Updated weights for policy 0, policy_version 12620 (0.0007) [2023-03-07 07:52:09,537][155452] Updated weights for policy 0, policy_version 12630 (0.0005) [2023-03-07 07:52:10,324][155452] Updated weights for policy 0, policy_version 12640 (0.0006) [2023-03-07 07:52:11,097][155452] Updated weights for policy 0, policy_version 12650 (0.0007) [2023-03-07 07:52:11,890][155452] Updated weights for policy 0, policy_version 12660 (0.0007) [2023-03-07 07:52:12,676][155452] Updated weights for policy 0, policy_version 12670 (0.0007) [2023-03-07 07:52:13,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13069.0). Total num frames: 12982272. Throughput: 0: 13078.0. Samples: 12975691. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:52:13,367][155126] Avg episode reward: [(0, '1679.864')] [2023-03-07 07:52:13,446][155452] Updated weights for policy 0, policy_version 12680 (0.0006) [2023-03-07 07:52:14,242][155452] Updated weights for policy 0, policy_version 12690 (0.0006) [2023-03-07 07:52:15,040][155452] Updated weights for policy 0, policy_version 12700 (0.0007) [2023-03-07 07:52:15,817][155452] Updated weights for policy 0, policy_version 12710 (0.0006) [2023-03-07 07:52:16,604][155452] Updated weights for policy 0, policy_version 12720 (0.0006) [2023-03-07 07:52:17,382][155452] Updated weights for policy 0, policy_version 12730 (0.0006) [2023-03-07 07:52:18,148][155452] Updated weights for policy 0, policy_version 12740 (0.0007) [2023-03-07 07:52:18,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13069.0). Total num frames: 13047808. Throughput: 0: 13077.0. Samples: 13014749. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 07:52:18,367][155126] Avg episode reward: [(0, '1440.731')] [2023-03-07 07:52:18,937][155452] Updated weights for policy 0, policy_version 12750 (0.0006) [2023-03-07 07:52:19,726][155452] Updated weights for policy 0, policy_version 12760 (0.0007) [2023-03-07 07:52:20,511][155452] Updated weights for policy 0, policy_version 12770 (0.0006) [2023-03-07 07:52:21,301][155452] Updated weights for policy 0, policy_version 12780 (0.0006) [2023-03-07 07:52:22,087][155452] Updated weights for policy 0, policy_version 12790 (0.0006) [2023-03-07 07:52:22,882][155452] Updated weights for policy 0, policy_version 12800 (0.0006) [2023-03-07 07:52:23,367][155126] Fps is (10 sec: 13107.0, 60 sec: 13073.0, 300 sec: 13069.0). Total num frames: 13113344. Throughput: 0: 13080.1. Samples: 13093307. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 07:52:23,367][155126] Avg episode reward: [(0, '1629.769')] [2023-03-07 07:52:23,664][155452] Updated weights for policy 0, policy_version 12810 (0.0006) [2023-03-07 07:52:24,447][155452] Updated weights for policy 0, policy_version 12820 (0.0005) [2023-03-07 07:52:25,234][155452] Updated weights for policy 0, policy_version 12830 (0.0006) [2023-03-07 07:52:25,986][155452] Updated weights for policy 0, policy_version 12840 (0.0006) [2023-03-07 07:52:26,770][155452] Updated weights for policy 0, policy_version 12850 (0.0006) [2023-03-07 07:52:27,532][155452] Updated weights for policy 0, policy_version 12860 (0.0006) [2023-03-07 07:52:28,316][155452] Updated weights for policy 0, policy_version 12870 (0.0006) [2023-03-07 07:52:28,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13069.0). Total num frames: 13178880. Throughput: 0: 13086.6. Samples: 13171978. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:52:28,367][155126] Avg episode reward: [(0, '1600.266')] [2023-03-07 07:52:29,094][155452] Updated weights for policy 0, policy_version 12880 (0.0006) [2023-03-07 07:52:29,890][155452] Updated weights for policy 0, policy_version 12890 (0.0006) [2023-03-07 07:52:30,677][155452] Updated weights for policy 0, policy_version 12900 (0.0006) [2023-03-07 07:52:31,452][155452] Updated weights for policy 0, policy_version 12910 (0.0005) [2023-03-07 07:52:32,224][155452] Updated weights for policy 0, policy_version 12920 (0.0006) [2023-03-07 07:52:33,011][155452] Updated weights for policy 0, policy_version 12930 (0.0006) [2023-03-07 07:52:33,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13090.2, 300 sec: 13072.5). Total num frames: 13244416. Throughput: 0: 13098.1. Samples: 13211321. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:52:33,367][155126] Avg episode reward: [(0, '1169.642')] [2023-03-07 07:52:33,793][155452] Updated weights for policy 0, policy_version 12940 (0.0006) [2023-03-07 07:52:34,583][155452] Updated weights for policy 0, policy_version 12950 (0.0006) [2023-03-07 07:52:35,399][155452] Updated weights for policy 0, policy_version 12960 (0.0007) [2023-03-07 07:52:36,170][155452] Updated weights for policy 0, policy_version 12970 (0.0007) [2023-03-07 07:52:36,934][155452] Updated weights for policy 0, policy_version 12980 (0.0006) [2023-03-07 07:52:37,718][155452] Updated weights for policy 0, policy_version 12990 (0.0005) [2023-03-07 07:52:38,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13090.1, 300 sec: 13072.5). Total num frames: 13309952. Throughput: 0: 13085.5. Samples: 13289635. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:52:38,367][155126] Avg episode reward: [(0, '1572.182')] [2023-03-07 07:52:38,497][155452] Updated weights for policy 0, policy_version 13000 (0.0006) [2023-03-07 07:52:39,283][155452] Updated weights for policy 0, policy_version 13010 (0.0006) [2023-03-07 07:52:40,060][155452] Updated weights for policy 0, policy_version 13020 (0.0007) [2023-03-07 07:52:40,857][155452] Updated weights for policy 0, policy_version 13030 (0.0006) [2023-03-07 07:52:41,638][155452] Updated weights for policy 0, policy_version 13040 (0.0006) [2023-03-07 07:52:42,428][155452] Updated weights for policy 0, policy_version 13050 (0.0005) [2023-03-07 07:52:43,205][155452] Updated weights for policy 0, policy_version 13060 (0.0006) [2023-03-07 07:52:43,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13073.0, 300 sec: 13072.5). Total num frames: 13374464. Throughput: 0: 13082.6. Samples: 13368219. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:52:43,368][155126] Avg episode reward: [(0, '1413.375')] [2023-03-07 07:52:43,986][155452] Updated weights for policy 0, policy_version 13070 (0.0007) [2023-03-07 07:52:44,766][155452] Updated weights for policy 0, policy_version 13080 (0.0006) [2023-03-07 07:52:45,550][155452] Updated weights for policy 0, policy_version 13090 (0.0006) [2023-03-07 07:52:46,330][155452] Updated weights for policy 0, policy_version 13100 (0.0007) [2023-03-07 07:52:47,122][155452] Updated weights for policy 0, policy_version 13110 (0.0005) [2023-03-07 07:52:47,894][155452] Updated weights for policy 0, policy_version 13120 (0.0006) [2023-03-07 07:52:48,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13073.0, 300 sec: 13072.5). Total num frames: 13440000. Throughput: 0: 13085.6. Samples: 13407556. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 07:52:48,367][155126] Avg episode reward: [(0, '1642.804')] [2023-03-07 07:52:48,689][155452] Updated weights for policy 0, policy_version 13130 (0.0006) [2023-03-07 07:52:49,465][155452] Updated weights for policy 0, policy_version 13140 (0.0006) [2023-03-07 07:52:50,257][155452] Updated weights for policy 0, policy_version 13150 (0.0006) [2023-03-07 07:52:51,020][155452] Updated weights for policy 0, policy_version 13160 (0.0006) [2023-03-07 07:52:51,793][155452] Updated weights for policy 0, policy_version 13170 (0.0006) [2023-03-07 07:52:52,593][155452] Updated weights for policy 0, policy_version 13180 (0.0007) [2023-03-07 07:52:53,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13072.5). Total num frames: 13505536. Throughput: 0: 13087.3. Samples: 13486000. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 07:52:53,367][155126] Avg episode reward: [(0, '1545.683')] [2023-03-07 07:52:53,373][155452] Updated weights for policy 0, policy_version 13190 (0.0007) [2023-03-07 07:52:54,162][155452] Updated weights for policy 0, policy_version 13200 (0.0006) [2023-03-07 07:52:54,950][155452] Updated weights for policy 0, policy_version 13210 (0.0006) [2023-03-07 07:52:55,730][155452] Updated weights for policy 0, policy_version 13220 (0.0006) [2023-03-07 07:52:56,534][155452] Updated weights for policy 0, policy_version 13230 (0.0006) [2023-03-07 07:52:57,307][155452] Updated weights for policy 0, policy_version 13240 (0.0006) [2023-03-07 07:52:58,105][155452] Updated weights for policy 0, policy_version 13250 (0.0006) [2023-03-07 07:52:58,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13072.5). Total num frames: 13571072. Throughput: 0: 13075.5. Samples: 13564091. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:52:58,367][155126] Avg episode reward: [(0, '1507.379')] [2023-03-07 07:52:58,889][155452] Updated weights for policy 0, policy_version 13260 (0.0006) [2023-03-07 07:52:59,673][155452] Updated weights for policy 0, policy_version 13270 (0.0005) [2023-03-07 07:53:00,473][155452] Updated weights for policy 0, policy_version 13280 (0.0007) [2023-03-07 07:53:01,263][155452] Updated weights for policy 0, policy_version 13290 (0.0006) [2023-03-07 07:53:02,049][155452] Updated weights for policy 0, policy_version 13300 (0.0006) [2023-03-07 07:53:02,826][155452] Updated weights for policy 0, policy_version 13310 (0.0006) [2023-03-07 07:53:03,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13073.0, 300 sec: 13072.5). Total num frames: 13635584. Throughput: 0: 13073.6. Samples: 13603065. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:53:03,368][155126] Avg episode reward: [(0, '1508.298')] [2023-03-07 07:53:03,608][155452] Updated weights for policy 0, policy_version 13320 (0.0006) [2023-03-07 07:53:04,397][155452] Updated weights for policy 0, policy_version 13330 (0.0008) [2023-03-07 07:53:05,187][155452] Updated weights for policy 0, policy_version 13340 (0.0005) [2023-03-07 07:53:05,966][155452] Updated weights for policy 0, policy_version 13350 (0.0006) [2023-03-07 07:53:06,751][155452] Updated weights for policy 0, policy_version 13360 (0.0006) [2023-03-07 07:53:07,540][155452] Updated weights for policy 0, policy_version 13370 (0.0006) [2023-03-07 07:53:08,329][155452] Updated weights for policy 0, policy_version 13380 (0.0005) [2023-03-07 07:53:08,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13073.1, 300 sec: 13072.5). Total num frames: 13701120. Throughput: 0: 13064.7. Samples: 13681220. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:53:08,367][155126] Avg episode reward: [(0, '1570.415')] [2023-03-07 07:53:09,096][155452] Updated weights for policy 0, policy_version 13390 (0.0005) [2023-03-07 07:53:09,886][155452] Updated weights for policy 0, policy_version 13400 (0.0006) [2023-03-07 07:53:10,676][155452] Updated weights for policy 0, policy_version 13410 (0.0006) [2023-03-07 07:53:11,445][155452] Updated weights for policy 0, policy_version 13420 (0.0006) [2023-03-07 07:53:12,222][155452] Updated weights for policy 0, policy_version 13430 (0.0005) [2023-03-07 07:53:12,997][155452] Updated weights for policy 0, policy_version 13440 (0.0006) [2023-03-07 07:53:13,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13073.0, 300 sec: 13069.0). Total num frames: 13766656. Throughput: 0: 13066.5. Samples: 13759972. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:53:13,368][155126] Avg episode reward: [(0, '1628.802')] [2023-03-07 07:53:13,770][155452] Updated weights for policy 0, policy_version 13450 (0.0006) [2023-03-07 07:53:14,558][155452] Updated weights for policy 0, policy_version 13460 (0.0006) [2023-03-07 07:53:15,334][155452] Updated weights for policy 0, policy_version 13470 (0.0006) [2023-03-07 07:53:16,113][155452] Updated weights for policy 0, policy_version 13480 (0.0006) [2023-03-07 07:53:16,924][155452] Updated weights for policy 0, policy_version 13490 (0.0007) [2023-03-07 07:53:17,710][155452] Updated weights for policy 0, policy_version 13500 (0.0006) [2023-03-07 07:53:18,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13073.0, 300 sec: 13072.5). Total num frames: 13832192. Throughput: 0: 13064.4. Samples: 13799223. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:53:18,368][155126] Avg episode reward: [(0, '1661.247')] [2023-03-07 07:53:18,507][155452] Updated weights for policy 0, policy_version 13510 (0.0006) [2023-03-07 07:53:19,290][155452] Updated weights for policy 0, policy_version 13520 (0.0005) [2023-03-07 07:53:20,090][155452] Updated weights for policy 0, policy_version 13530 (0.0006) [2023-03-07 07:53:20,868][155452] Updated weights for policy 0, policy_version 13540 (0.0005) [2023-03-07 07:53:21,645][155452] Updated weights for policy 0, policy_version 13550 (0.0006) [2023-03-07 07:53:22,425][155452] Updated weights for policy 0, policy_version 13560 (0.0006) [2023-03-07 07:53:23,222][155452] Updated weights for policy 0, policy_version 13570 (0.0006) [2023-03-07 07:53:23,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13069.0). Total num frames: 13896704. Throughput: 0: 13055.3. Samples: 13877126. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:53:23,368][155126] Avg episode reward: [(0, '1456.531')] [2023-03-07 07:53:23,998][155452] Updated weights for policy 0, policy_version 13580 (0.0006) [2023-03-07 07:53:24,770][155452] Updated weights for policy 0, policy_version 13590 (0.0006) [2023-03-07 07:53:25,553][155452] Updated weights for policy 0, policy_version 13600 (0.0006) [2023-03-07 07:53:26,332][155452] Updated weights for policy 0, policy_version 13610 (0.0006) [2023-03-07 07:53:27,087][155452] Updated weights for policy 0, policy_version 13620 (0.0006) [2023-03-07 07:53:27,887][155452] Updated weights for policy 0, policy_version 13630 (0.0006) [2023-03-07 07:53:28,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13072.5). Total num frames: 13963264. Throughput: 0: 13062.3. Samples: 13956020. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:53:28,367][155126] Avg episode reward: [(0, '1713.387')] [2023-03-07 07:53:28,373][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000013636_13963264.pth... 
[2023-03-07 07:53:28,404][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000010573_10826752.pth [2023-03-07 07:53:28,664][155452] Updated weights for policy 0, policy_version 13640 (0.0006) [2023-03-07 07:53:29,453][155452] Updated weights for policy 0, policy_version 13650 (0.0007) [2023-03-07 07:53:30,233][155452] Updated weights for policy 0, policy_version 13660 (0.0006) [2023-03-07 07:53:31,000][155452] Updated weights for policy 0, policy_version 13670 (0.0006) [2023-03-07 07:53:31,783][155452] Updated weights for policy 0, policy_version 13680 (0.0007) [2023-03-07 07:53:32,575][155452] Updated weights for policy 0, policy_version 13690 (0.0006) [2023-03-07 07:53:33,361][155452] Updated weights for policy 0, policy_version 13700 (0.0006) [2023-03-07 07:53:33,367][155126] Fps is (10 sec: 13209.6, 60 sec: 13073.1, 300 sec: 13076.0). Total num frames: 14028800. Throughput: 0: 13061.0. Samples: 13995300. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:53:33,368][155126] Avg episode reward: [(0, '1524.549')] [2023-03-07 07:53:34,128][155452] Updated weights for policy 0, policy_version 13710 (0.0008) [2023-03-07 07:53:34,917][155452] Updated weights for policy 0, policy_version 13720 (0.0006) [2023-03-07 07:53:35,674][155452] Updated weights for policy 0, policy_version 13730 (0.0006) [2023-03-07 07:53:36,459][155452] Updated weights for policy 0, policy_version 13740 (0.0006) [2023-03-07 07:53:37,247][155452] Updated weights for policy 0, policy_version 13750 (0.0006) [2023-03-07 07:53:38,025][155452] Updated weights for policy 0, policy_version 13760 (0.0006) [2023-03-07 07:53:38,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13073.1, 300 sec: 13076.0). Total num frames: 14094336. Throughput: 0: 13070.7. Samples: 14074181. 
Policy #0 lag: (min: 0.0, avg: 0.9, max: 3.0) [2023-03-07 07:53:38,368][155126] Avg episode reward: [(0, '1487.996')] [2023-03-07 07:53:38,815][155452] Updated weights for policy 0, policy_version 13770 (0.0006) [2023-03-07 07:53:39,585][155452] Updated weights for policy 0, policy_version 13780 (0.0006) [2023-03-07 07:53:40,365][155452] Updated weights for policy 0, policy_version 13790 (0.0007) [2023-03-07 07:53:41,129][155452] Updated weights for policy 0, policy_version 13800 (0.0006) [2023-03-07 07:53:41,930][155452] Updated weights for policy 0, policy_version 13810 (0.0006) [2023-03-07 07:53:42,720][155452] Updated weights for policy 0, policy_version 13820 (0.0006) [2023-03-07 07:53:43,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13090.2, 300 sec: 13076.0). Total num frames: 14159872. Throughput: 0: 13078.6. Samples: 14152627. Policy #0 lag: (min: 0.0, avg: 0.9, max: 3.0) [2023-03-07 07:53:43,367][155126] Avg episode reward: [(0, '1584.371')] [2023-03-07 07:53:43,506][155452] Updated weights for policy 0, policy_version 13830 (0.0006) [2023-03-07 07:53:44,300][155452] Updated weights for policy 0, policy_version 13840 (0.0007) [2023-03-07 07:53:45,076][155452] Updated weights for policy 0, policy_version 13850 (0.0006) [2023-03-07 07:53:45,848][155452] Updated weights for policy 0, policy_version 13860 (0.0006) [2023-03-07 07:53:46,642][155452] Updated weights for policy 0, policy_version 13870 (0.0006) [2023-03-07 07:53:47,429][155452] Updated weights for policy 0, policy_version 13880 (0.0006) [2023-03-07 07:53:48,210][155452] Updated weights for policy 0, policy_version 13890 (0.0006) [2023-03-07 07:53:48,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13073.1, 300 sec: 13072.5). Total num frames: 14224384. Throughput: 0: 13080.8. Samples: 14191696. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:53:48,367][155126] Avg episode reward: [(0, '1580.445')] [2023-03-07 07:53:48,997][155452] Updated weights for policy 0, policy_version 13900 (0.0007) [2023-03-07 07:53:49,773][155452] Updated weights for policy 0, policy_version 13910 (0.0006) [2023-03-07 07:53:50,553][155452] Updated weights for policy 0, policy_version 13920 (0.0007) [2023-03-07 07:53:51,347][155452] Updated weights for policy 0, policy_version 13930 (0.0006) [2023-03-07 07:53:52,126][155452] Updated weights for policy 0, policy_version 13940 (0.0006) [2023-03-07 07:53:52,912][155452] Updated weights for policy 0, policy_version 13950 (0.0007) [2023-03-07 07:53:53,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13073.1, 300 sec: 13076.0). Total num frames: 14289920. Throughput: 0: 13090.8. Samples: 14270305. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:53:53,367][155126] Avg episode reward: [(0, '1611.680')] [2023-03-07 07:53:53,696][155452] Updated weights for policy 0, policy_version 13960 (0.0006) [2023-03-07 07:53:54,476][155452] Updated weights for policy 0, policy_version 13970 (0.0006) [2023-03-07 07:53:55,272][155452] Updated weights for policy 0, policy_version 13980 (0.0006) [2023-03-07 07:53:56,061][155452] Updated weights for policy 0, policy_version 13990 (0.0006) [2023-03-07 07:53:56,839][155452] Updated weights for policy 0, policy_version 14000 (0.0006) [2023-03-07 07:53:57,629][155452] Updated weights for policy 0, policy_version 14010 (0.0006) [2023-03-07 07:53:58,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13072.5). Total num frames: 14355456. Throughput: 0: 13079.0. Samples: 14348528. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 07:53:58,367][155126] Avg episode reward: [(0, '1575.962')] [2023-03-07 07:53:58,417][155452] Updated weights for policy 0, policy_version 14020 (0.0006) [2023-03-07 07:53:59,214][155452] Updated weights for policy 0, policy_version 14030 (0.0006) [2023-03-07 07:53:59,996][155452] Updated weights for policy 0, policy_version 14040 (0.0006) [2023-03-07 07:54:00,792][155452] Updated weights for policy 0, policy_version 14050 (0.0006) [2023-03-07 07:54:01,572][155452] Updated weights for policy 0, policy_version 14060 (0.0006) [2023-03-07 07:54:02,362][155452] Updated weights for policy 0, policy_version 14070 (0.0006) [2023-03-07 07:54:03,144][155452] Updated weights for policy 0, policy_version 14080 (0.0006) [2023-03-07 07:54:03,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13073.1, 300 sec: 13069.0). Total num frames: 14419968. Throughput: 0: 13066.9. Samples: 14387233. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 07:54:03,367][155126] Avg episode reward: [(0, '1524.867')] [2023-03-07 07:54:03,941][155452] Updated weights for policy 0, policy_version 14090 (0.0006) [2023-03-07 07:54:04,718][155452] Updated weights for policy 0, policy_version 14100 (0.0006) [2023-03-07 07:54:05,513][155452] Updated weights for policy 0, policy_version 14110 (0.0006) [2023-03-07 07:54:06,280][155452] Updated weights for policy 0, policy_version 14120 (0.0006) [2023-03-07 07:54:07,062][155452] Updated weights for policy 0, policy_version 14130 (0.0006) [2023-03-07 07:54:07,834][155452] Updated weights for policy 0, policy_version 14140 (0.0006) [2023-03-07 07:54:08,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13073.1, 300 sec: 13069.0). Total num frames: 14485504. Throughput: 0: 13080.3. Samples: 14465738. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 07:54:08,368][155126] Avg episode reward: [(0, '1794.213')] [2023-03-07 07:54:08,638][155452] Updated weights for policy 0, policy_version 14150 (0.0006) [2023-03-07 07:54:09,425][155452] Updated weights for policy 0, policy_version 14160 (0.0006) [2023-03-07 07:54:10,193][155452] Updated weights for policy 0, policy_version 14170 (0.0007) [2023-03-07 07:54:10,962][155452] Updated weights for policy 0, policy_version 14180 (0.0006) [2023-03-07 07:54:11,762][155452] Updated weights for policy 0, policy_version 14190 (0.0006) [2023-03-07 07:54:12,542][155452] Updated weights for policy 0, policy_version 14200 (0.0006) [2023-03-07 07:54:13,325][155452] Updated weights for policy 0, policy_version 14210 (0.0006) [2023-03-07 07:54:13,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13073.1, 300 sec: 13069.0). Total num frames: 14551040. Throughput: 0: 13065.7. Samples: 14543975. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 07:54:13,367][155126] Avg episode reward: [(0, '1593.183')] [2023-03-07 07:54:14,126][155452] Updated weights for policy 0, policy_version 14220 (0.0006) [2023-03-07 07:54:14,896][155452] Updated weights for policy 0, policy_version 14230 (0.0007) [2023-03-07 07:54:15,686][155452] Updated weights for policy 0, policy_version 14240 (0.0006) [2023-03-07 07:54:16,464][155452] Updated weights for policy 0, policy_version 14250 (0.0006) [2023-03-07 07:54:17,253][155452] Updated weights for policy 0, policy_version 14260 (0.0006) [2023-03-07 07:54:18,045][155452] Updated weights for policy 0, policy_version 14270 (0.0007) [2023-03-07 07:54:18,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13069.0). Total num frames: 14616576. Throughput: 0: 13062.1. Samples: 14583093. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:54:18,367][155126] Avg episode reward: [(0, '1734.978')] [2023-03-07 07:54:18,828][155452] Updated weights for policy 0, policy_version 14280 (0.0006) [2023-03-07 07:54:19,613][155452] Updated weights for policy 0, policy_version 14290 (0.0006) [2023-03-07 07:54:20,421][155452] Updated weights for policy 0, policy_version 14300 (0.0006) [2023-03-07 07:54:21,199][155452] Updated weights for policy 0, policy_version 14310 (0.0006) [2023-03-07 07:54:21,979][155452] Updated weights for policy 0, policy_version 14320 (0.0006) [2023-03-07 07:54:22,754][155452] Updated weights for policy 0, policy_version 14330 (0.0006) [2023-03-07 07:54:23,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13073.1, 300 sec: 13065.5). Total num frames: 14681088. Throughput: 0: 13046.2. Samples: 14661261. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:54:23,368][155126] Avg episode reward: [(0, '1471.834')] [2023-03-07 07:54:23,534][155452] Updated weights for policy 0, policy_version 14340 (0.0006) [2023-03-07 07:54:24,325][155452] Updated weights for policy 0, policy_version 14350 (0.0006) [2023-03-07 07:54:25,118][155452] Updated weights for policy 0, policy_version 14360 (0.0005) [2023-03-07 07:54:25,917][155452] Updated weights for policy 0, policy_version 14370 (0.0006) [2023-03-07 07:54:26,690][155452] Updated weights for policy 0, policy_version 14380 (0.0005) [2023-03-07 07:54:27,474][155452] Updated weights for policy 0, policy_version 14390 (0.0006) [2023-03-07 07:54:28,265][155452] Updated weights for policy 0, policy_version 14400 (0.0005) [2023-03-07 07:54:28,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13065.5). Total num frames: 14746624. Throughput: 0: 13041.9. Samples: 14739514. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 07:54:28,367][155126] Avg episode reward: [(0, '1892.644')] [2023-03-07 07:54:29,041][155452] Updated weights for policy 0, policy_version 14410 (0.0006) [2023-03-07 07:54:29,816][155452] Updated weights for policy 0, policy_version 14420 (0.0006) [2023-03-07 07:54:30,600][155452] Updated weights for policy 0, policy_version 14430 (0.0006) [2023-03-07 07:54:31,382][155452] Updated weights for policy 0, policy_version 14440 (0.0006) [2023-03-07 07:54:32,170][155452] Updated weights for policy 0, policy_version 14450 (0.0006) [2023-03-07 07:54:32,946][155452] Updated weights for policy 0, policy_version 14460 (0.0006) [2023-03-07 07:54:33,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13065.5). Total num frames: 14812160. Throughput: 0: 13047.7. Samples: 14778842. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 07:54:33,367][155126] Avg episode reward: [(0, '1627.449')] [2023-03-07 07:54:33,740][155452] Updated weights for policy 0, policy_version 14470 (0.0006) [2023-03-07 07:54:34,527][155452] Updated weights for policy 0, policy_version 14480 (0.0006) [2023-03-07 07:54:35,312][155452] Updated weights for policy 0, policy_version 14490 (0.0006) [2023-03-07 07:54:36,098][155452] Updated weights for policy 0, policy_version 14500 (0.0007) [2023-03-07 07:54:36,896][155452] Updated weights for policy 0, policy_version 14510 (0.0006) [2023-03-07 07:54:37,677][155452] Updated weights for policy 0, policy_version 14520 (0.0006) [2023-03-07 07:54:38,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13065.5). Total num frames: 14876672. Throughput: 0: 13039.5. Samples: 14857084. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:54:38,367][155126] Avg episode reward: [(0, '1714.420')] [2023-03-07 07:54:38,460][155452] Updated weights for policy 0, policy_version 14530 (0.0005) [2023-03-07 07:54:39,235][155452] Updated weights for policy 0, policy_version 14540 (0.0006) [2023-03-07 07:54:40,022][155452] Updated weights for policy 0, policy_version 14550 (0.0006) [2023-03-07 07:54:40,815][155452] Updated weights for policy 0, policy_version 14560 (0.0006) [2023-03-07 07:54:41,597][155452] Updated weights for policy 0, policy_version 14570 (0.0007) [2023-03-07 07:54:42,364][155452] Updated weights for policy 0, policy_version 14580 (0.0006) [2023-03-07 07:54:43,149][155452] Updated weights for policy 0, policy_version 14590 (0.0007) [2023-03-07 07:54:43,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13065.6). Total num frames: 14942208. Throughput: 0: 13040.3. Samples: 14935340. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:54:43,367][155126] Avg episode reward: [(0, '1834.684')] [2023-03-07 07:54:43,938][155452] Updated weights for policy 0, policy_version 14600 (0.0006) [2023-03-07 07:54:44,721][155452] Updated weights for policy 0, policy_version 14610 (0.0005) [2023-03-07 07:54:45,515][155452] Updated weights for policy 0, policy_version 14620 (0.0006) [2023-03-07 07:54:46,310][155452] Updated weights for policy 0, policy_version 14630 (0.0007) [2023-03-07 07:54:47,083][155452] Updated weights for policy 0, policy_version 14640 (0.0006) [2023-03-07 07:54:47,866][155452] Updated weights for policy 0, policy_version 14650 (0.0007) [2023-03-07 07:54:48,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13065.5). Total num frames: 15007744. Throughput: 0: 13048.8. Samples: 14974430. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 07:54:48,367][155126] Avg episode reward: [(0, '1885.861')] [2023-03-07 07:54:48,651][155452] Updated weights for policy 0, policy_version 14660 (0.0006) [2023-03-07 07:54:49,439][155452] Updated weights for policy 0, policy_version 14670 (0.0005) [2023-03-07 07:54:50,225][155452] Updated weights for policy 0, policy_version 14680 (0.0006) [2023-03-07 07:54:51,029][155452] Updated weights for policy 0, policy_version 14690 (0.0006) [2023-03-07 07:54:51,807][155452] Updated weights for policy 0, policy_version 14700 (0.0007) [2023-03-07 07:54:52,604][155452] Updated weights for policy 0, policy_version 14710 (0.0006) [2023-03-07 07:54:53,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13038.9, 300 sec: 13062.1). Total num frames: 15072256. Throughput: 0: 13040.4. Samples: 15052556. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 07:54:53,368][155126] Avg episode reward: [(0, '1861.680')] [2023-03-07 07:54:53,388][155452] Updated weights for policy 0, policy_version 14720 (0.0006) [2023-03-07 07:54:54,170][155452] Updated weights for policy 0, policy_version 14730 (0.0007) [2023-03-07 07:54:54,957][155452] Updated weights for policy 0, policy_version 14740 (0.0006) [2023-03-07 07:54:55,745][155452] Updated weights for policy 0, policy_version 14750 (0.0006) [2023-03-07 07:54:56,537][155452] Updated weights for policy 0, policy_version 14760 (0.0006) [2023-03-07 07:54:57,329][155452] Updated weights for policy 0, policy_version 14770 (0.0005) [2023-03-07 07:54:58,112][155452] Updated weights for policy 0, policy_version 14780 (0.0006) [2023-03-07 07:54:58,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13062.1). Total num frames: 15137792. Throughput: 0: 13040.1. Samples: 15130777. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:54:58,367][155126] Avg episode reward: [(0, '2060.750')] [2023-03-07 07:54:58,900][155452] Updated weights for policy 0, policy_version 14790 (0.0006) [2023-03-07 07:54:59,678][155452] Updated weights for policy 0, policy_version 14800 (0.0006) [2023-03-07 07:55:00,474][155452] Updated weights for policy 0, policy_version 14810 (0.0007) [2023-03-07 07:55:01,255][155452] Updated weights for policy 0, policy_version 14820 (0.0006) [2023-03-07 07:55:02,030][155452] Updated weights for policy 0, policy_version 14830 (0.0006) [2023-03-07 07:55:02,826][155452] Updated weights for policy 0, policy_version 14840 (0.0006) [2023-03-07 07:55:03,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13058.6). Total num frames: 15202304. Throughput: 0: 13035.9. Samples: 15169710. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:55:03,367][155126] Avg episode reward: [(0, '1814.730')] [2023-03-07 07:55:03,629][155452] Updated weights for policy 0, policy_version 14850 (0.0007) [2023-03-07 07:55:04,432][155452] Updated weights for policy 0, policy_version 14860 (0.0006) [2023-03-07 07:55:05,215][155452] Updated weights for policy 0, policy_version 14870 (0.0007) [2023-03-07 07:55:06,012][155452] Updated weights for policy 0, policy_version 14880 (0.0006) [2023-03-07 07:55:06,800][155452] Updated weights for policy 0, policy_version 14890 (0.0006) [2023-03-07 07:55:07,566][155452] Updated weights for policy 0, policy_version 14900 (0.0006) [2023-03-07 07:55:08,348][155452] Updated weights for policy 0, policy_version 14910 (0.0006) [2023-03-07 07:55:08,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13039.0, 300 sec: 13062.1). Total num frames: 15267840. Throughput: 0: 13027.7. Samples: 15247505. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 07:55:08,367][155126] Avg episode reward: [(0, '1684.101')] [2023-03-07 07:55:09,153][155452] Updated weights for policy 0, policy_version 14920 (0.0006) [2023-03-07 07:55:09,927][155452] Updated weights for policy 0, policy_version 14930 (0.0007) [2023-03-07 07:55:10,707][155452] Updated weights for policy 0, policy_version 14940 (0.0006) [2023-03-07 07:55:11,496][155452] Updated weights for policy 0, policy_version 14950 (0.0006) [2023-03-07 07:55:12,283][155452] Updated weights for policy 0, policy_version 14960 (0.0008) [2023-03-07 07:55:13,071][155452] Updated weights for policy 0, policy_version 14970 (0.0006) [2023-03-07 07:55:13,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13058.6). Total num frames: 15332352. Throughput: 0: 13028.2. Samples: 15325782. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 07:55:13,367][155126] Avg episode reward: [(0, '1620.466')] [2023-03-07 07:55:13,840][155452] Updated weights for policy 0, policy_version 14980 (0.0006) [2023-03-07 07:55:14,629][155452] Updated weights for policy 0, policy_version 14990 (0.0007) [2023-03-07 07:55:15,419][155452] Updated weights for policy 0, policy_version 15000 (0.0006) [2023-03-07 07:55:16,214][155452] Updated weights for policy 0, policy_version 15010 (0.0005) [2023-03-07 07:55:16,982][155452] Updated weights for policy 0, policy_version 15020 (0.0006) [2023-03-07 07:55:17,769][155452] Updated weights for policy 0, policy_version 15030 (0.0006) [2023-03-07 07:55:18,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13021.8, 300 sec: 13058.6). Total num frames: 15397888. Throughput: 0: 13023.2. Samples: 15364889. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 07:55:18,368][155126] Avg episode reward: [(0, '1675.419')] [2023-03-07 07:55:18,555][155452] Updated weights for policy 0, policy_version 15040 (0.0006) [2023-03-07 07:55:19,330][155452] Updated weights for policy 0, policy_version 15050 (0.0007) [2023-03-07 07:55:20,116][155452] Updated weights for policy 0, policy_version 15060 (0.0006) [2023-03-07 07:55:20,904][155452] Updated weights for policy 0, policy_version 15070 (0.0007) [2023-03-07 07:55:21,690][155452] Updated weights for policy 0, policy_version 15080 (0.0006) [2023-03-07 07:55:22,463][155452] Updated weights for policy 0, policy_version 15090 (0.0005) [2023-03-07 07:55:23,254][155452] Updated weights for policy 0, policy_version 15100 (0.0006) [2023-03-07 07:55:23,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13058.6). Total num frames: 15463424. Throughput: 0: 13022.4. Samples: 15443091. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:55:23,367][155126] Avg episode reward: [(0, '1695.085')] [2023-03-07 07:55:24,034][155452] Updated weights for policy 0, policy_version 15110 (0.0005) [2023-03-07 07:55:24,819][155452] Updated weights for policy 0, policy_version 15120 (0.0006) [2023-03-07 07:55:25,616][155452] Updated weights for policy 0, policy_version 15130 (0.0007) [2023-03-07 07:55:26,400][155452] Updated weights for policy 0, policy_version 15140 (0.0007) [2023-03-07 07:55:27,187][155452] Updated weights for policy 0, policy_version 15150 (0.0006) [2023-03-07 07:55:27,982][155452] Updated weights for policy 0, policy_version 15160 (0.0007) [2023-03-07 07:55:28,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13039.0, 300 sec: 13058.6). Total num frames: 15528960. Throughput: 0: 13026.8. Samples: 15521549. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:55:28,367][155126] Avg episode reward: [(0, '1876.514')] [2023-03-07 07:55:28,372][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000015165_15528960.pth... [2023-03-07 07:55:28,401][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000012104_12394496.pth [2023-03-07 07:55:28,763][155452] Updated weights for policy 0, policy_version 15170 (0.0006) [2023-03-07 07:55:29,528][155452] Updated weights for policy 0, policy_version 15180 (0.0006) [2023-03-07 07:55:30,329][155452] Updated weights for policy 0, policy_version 15190 (0.0007) [2023-03-07 07:55:31,106][155452] Updated weights for policy 0, policy_version 15200 (0.0006) [2023-03-07 07:55:31,915][155452] Updated weights for policy 0, policy_version 15210 (0.0007) [2023-03-07 07:55:32,673][155452] Updated weights for policy 0, policy_version 15220 (0.0006) [2023-03-07 07:55:33,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.8, 300 sec: 13058.6). Total num frames: 15593472. Throughput: 0: 13026.8. Samples: 15560635. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:55:33,368][155126] Avg episode reward: [(0, '1639.677')] [2023-03-07 07:55:33,486][155452] Updated weights for policy 0, policy_version 15230 (0.0005) [2023-03-07 07:55:34,269][155452] Updated weights for policy 0, policy_version 15240 (0.0006) [2023-03-07 07:55:35,055][155452] Updated weights for policy 0, policy_version 15250 (0.0007) [2023-03-07 07:55:35,823][155452] Updated weights for policy 0, policy_version 15260 (0.0006) [2023-03-07 07:55:36,613][155452] Updated weights for policy 0, policy_version 15270 (0.0006) [2023-03-07 07:55:37,407][155452] Updated weights for policy 0, policy_version 15280 (0.0007) [2023-03-07 07:55:38,180][155452] Updated weights for policy 0, policy_version 15290 (0.0007) [2023-03-07 07:55:38,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13039.0, 300 sec: 13058.6). Total num frames: 15659008. Throughput: 0: 13031.1. Samples: 15638952. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:55:38,367][155126] Avg episode reward: [(0, '1601.964')] [2023-03-07 07:55:38,975][155452] Updated weights for policy 0, policy_version 15300 (0.0006) [2023-03-07 07:55:39,755][155452] Updated weights for policy 0, policy_version 15310 (0.0006) [2023-03-07 07:55:40,528][155452] Updated weights for policy 0, policy_version 15320 (0.0007) [2023-03-07 07:55:41,325][155452] Updated weights for policy 0, policy_version 15330 (0.0007) [2023-03-07 07:55:42,097][155452] Updated weights for policy 0, policy_version 15340 (0.0006) [2023-03-07 07:55:42,891][155452] Updated weights for policy 0, policy_version 15350 (0.0006) [2023-03-07 07:55:43,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13021.9, 300 sec: 13055.1). Total num frames: 15723520. Throughput: 0: 13033.3. Samples: 15717275. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 07:55:43,367][155126] Avg episode reward: [(0, '1734.840')] [2023-03-07 07:55:43,694][155452] Updated weights for policy 0, policy_version 15360 (0.0006) [2023-03-07 07:55:44,465][155452] Updated weights for policy 0, policy_version 15370 (0.0006) [2023-03-07 07:55:45,253][155452] Updated weights for policy 0, policy_version 15380 (0.0006) [2023-03-07 07:55:46,043][155452] Updated weights for policy 0, policy_version 15390 (0.0006) [2023-03-07 07:55:46,817][155452] Updated weights for policy 0, policy_version 15400 (0.0007) [2023-03-07 07:55:47,594][155452] Updated weights for policy 0, policy_version 15410 (0.0006) [2023-03-07 07:55:48,361][155452] Updated weights for policy 0, policy_version 15420 (0.0006) [2023-03-07 07:55:48,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13062.1). Total num frames: 15790080. Throughput: 0: 13037.5. Samples: 15756397. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 07:55:48,367][155126] Avg episode reward: [(0, '1801.457')] [2023-03-07 07:55:49,165][155452] Updated weights for policy 0, policy_version 15430 (0.0006) [2023-03-07 07:55:49,952][155452] Updated weights for policy 0, policy_version 15440 (0.0007) [2023-03-07 07:55:50,730][155452] Updated weights for policy 0, policy_version 15450 (0.0006) [2023-03-07 07:55:51,526][155452] Updated weights for policy 0, policy_version 15460 (0.0006) [2023-03-07 07:55:52,295][155452] Updated weights for policy 0, policy_version 15470 (0.0006) [2023-03-07 07:55:53,081][155452] Updated weights for policy 0, policy_version 15480 (0.0007) [2023-03-07 07:55:53,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13039.0, 300 sec: 13058.6). Total num frames: 15854592. Throughput: 0: 13046.1. Samples: 15834579. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:55:53,367][155126] Avg episode reward: [(0, '1775.437')] [2023-03-07 07:55:53,865][155452] Updated weights for policy 0, policy_version 15490 (0.0006) [2023-03-07 07:55:54,653][155452] Updated weights for policy 0, policy_version 15500 (0.0006) [2023-03-07 07:55:55,418][155452] Updated weights for policy 0, policy_version 15510 (0.0006) [2023-03-07 07:55:56,212][155452] Updated weights for policy 0, policy_version 15520 (0.0006) [2023-03-07 07:55:57,001][155452] Updated weights for policy 0, policy_version 15530 (0.0006) [2023-03-07 07:55:57,775][155452] Updated weights for policy 0, policy_version 15540 (0.0006) [2023-03-07 07:55:58,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13058.6). Total num frames: 15920128. Throughput: 0: 13054.0. Samples: 15913212. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:55:58,367][155126] Avg episode reward: [(0, '1790.157')] [2023-03-07 07:55:58,580][155452] Updated weights for policy 0, policy_version 15550 (0.0006) [2023-03-07 07:55:59,349][155452] Updated weights for policy 0, policy_version 15560 (0.0008) [2023-03-07 07:56:00,139][155452] Updated weights for policy 0, policy_version 15570 (0.0007) [2023-03-07 07:56:00,940][155452] Updated weights for policy 0, policy_version 15580 (0.0006) [2023-03-07 07:56:01,720][155452] Updated weights for policy 0, policy_version 15590 (0.0006) [2023-03-07 07:56:02,500][155452] Updated weights for policy 0, policy_version 15600 (0.0006) [2023-03-07 07:56:03,280][155452] Updated weights for policy 0, policy_version 15610 (0.0006) [2023-03-07 07:56:03,367][155126] Fps is (10 sec: 13107.0, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 15985664. Throughput: 0: 13052.4. Samples: 15952245. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:56:03,367][155126] Avg episode reward: [(0, '1669.922')] [2023-03-07 07:56:04,077][155452] Updated weights for policy 0, policy_version 15620 (0.0006) [2023-03-07 07:56:04,849][155452] Updated weights for policy 0, policy_version 15630 (0.0006) [2023-03-07 07:56:05,648][155452] Updated weights for policy 0, policy_version 15640 (0.0007) [2023-03-07 07:56:06,429][155452] Updated weights for policy 0, policy_version 15650 (0.0007) [2023-03-07 07:56:07,229][155452] Updated weights for policy 0, policy_version 15660 (0.0007) [2023-03-07 07:56:08,014][155452] Updated weights for policy 0, policy_version 15670 (0.0007) [2023-03-07 07:56:08,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13055.1). Total num frames: 16050176. Throughput: 0: 13047.3. Samples: 16030221. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:56:08,367][155126] Avg episode reward: [(0, '1668.841')] [2023-03-07 07:56:08,806][155452] Updated weights for policy 0, policy_version 15680 (0.0006) [2023-03-07 07:56:09,580][155452] Updated weights for policy 0, policy_version 15690 (0.0006) [2023-03-07 07:56:10,367][155452] Updated weights for policy 0, policy_version 15700 (0.0007) [2023-03-07 07:56:11,165][155452] Updated weights for policy 0, policy_version 15710 (0.0005) [2023-03-07 07:56:11,949][155452] Updated weights for policy 0, policy_version 15720 (0.0006) [2023-03-07 07:56:12,725][155452] Updated weights for policy 0, policy_version 15730 (0.0006) [2023-03-07 07:56:13,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 16115712. Throughput: 0: 13044.1. Samples: 16108533. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:56:13,367][155126] Avg episode reward: [(0, '1715.023')] [2023-03-07 07:56:13,502][155452] Updated weights for policy 0, policy_version 15740 (0.0006) [2023-03-07 07:56:14,280][155452] Updated weights for policy 0, policy_version 15750 (0.0006) [2023-03-07 07:56:15,066][155452] Updated weights for policy 0, policy_version 15760 (0.0006) [2023-03-07 07:56:15,866][155452] Updated weights for policy 0, policy_version 15770 (0.0006) [2023-03-07 07:56:16,639][155452] Updated weights for policy 0, policy_version 15780 (0.0006) [2023-03-07 07:56:17,409][155452] Updated weights for policy 0, policy_version 15790 (0.0006) [2023-03-07 07:56:18,213][155452] Updated weights for policy 0, policy_version 15800 (0.0006) [2023-03-07 07:56:18,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13055.1). Total num frames: 16180224. Throughput: 0: 13044.6. Samples: 16147641. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 07:56:18,368][155126] Avg episode reward: [(0, '1365.932')] [2023-03-07 07:56:19,017][155452] Updated weights for policy 0, policy_version 15810 (0.0006) [2023-03-07 07:56:19,797][155452] Updated weights for policy 0, policy_version 15820 (0.0006) [2023-03-07 07:56:20,588][155452] Updated weights for policy 0, policy_version 15830 (0.0007) [2023-03-07 07:56:21,374][155452] Updated weights for policy 0, policy_version 15840 (0.0006) [2023-03-07 07:56:22,157][155452] Updated weights for policy 0, policy_version 15850 (0.0005) [2023-03-07 07:56:22,937][155452] Updated weights for policy 0, policy_version 15860 (0.0006) [2023-03-07 07:56:23,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13055.1). Total num frames: 16245760. Throughput: 0: 13036.5. Samples: 16225597. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 07:56:23,367][155126] Avg episode reward: [(0, '1584.373')] [2023-03-07 07:56:23,719][155452] Updated weights for policy 0, policy_version 15870 (0.0006) [2023-03-07 07:56:24,505][155452] Updated weights for policy 0, policy_version 15880 (0.0006) [2023-03-07 07:56:25,310][155452] Updated weights for policy 0, policy_version 15890 (0.0006) [2023-03-07 07:56:26,094][155452] Updated weights for policy 0, policy_version 15900 (0.0006) [2023-03-07 07:56:26,874][155452] Updated weights for policy 0, policy_version 15910 (0.0005) [2023-03-07 07:56:27,669][155452] Updated weights for policy 0, policy_version 15920 (0.0006) [2023-03-07 07:56:28,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13055.1). Total num frames: 16310272. Throughput: 0: 13032.2. Samples: 16303723. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:56:28,367][155126] Avg episode reward: [(0, '1654.648')] [2023-03-07 07:56:28,445][155452] Updated weights for policy 0, policy_version 15930 (0.0006) [2023-03-07 07:56:29,245][155452] Updated weights for policy 0, policy_version 15940 (0.0006) [2023-03-07 07:56:30,027][155452] Updated weights for policy 0, policy_version 15950 (0.0007) [2023-03-07 07:56:30,816][155452] Updated weights for policy 0, policy_version 15960 (0.0006) [2023-03-07 07:56:31,582][155452] Updated weights for policy 0, policy_version 15970 (0.0006) [2023-03-07 07:56:32,364][155452] Updated weights for policy 0, policy_version 15980 (0.0006) [2023-03-07 07:56:33,153][155452] Updated weights for policy 0, policy_version 15990 (0.0006) [2023-03-07 07:56:33,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13039.0, 300 sec: 13055.1). Total num frames: 16375808. Throughput: 0: 13031.4. Samples: 16342811. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:56:33,367][155126] Avg episode reward: [(0, '1571.655')] [2023-03-07 07:56:33,947][155452] Updated weights for policy 0, policy_version 16000 (0.0006) [2023-03-07 07:56:34,710][155452] Updated weights for policy 0, policy_version 16010 (0.0006) [2023-03-07 07:56:35,505][155452] Updated weights for policy 0, policy_version 16020 (0.0006) [2023-03-07 07:56:36,287][155452] Updated weights for policy 0, policy_version 16030 (0.0006) [2023-03-07 07:56:37,071][155452] Updated weights for policy 0, policy_version 16040 (0.0006) [2023-03-07 07:56:37,855][155452] Updated weights for policy 0, policy_version 16050 (0.0006) [2023-03-07 07:56:38,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13055.1). Total num frames: 16441344. Throughput: 0: 13037.7. Samples: 16421277. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:56:38,367][155126] Avg episode reward: [(0, '1619.699')] [2023-03-07 07:56:38,647][155452] Updated weights for policy 0, policy_version 16060 (0.0005) [2023-03-07 07:56:39,425][155452] Updated weights for policy 0, policy_version 16070 (0.0006) [2023-03-07 07:56:40,209][155452] Updated weights for policy 0, policy_version 16080 (0.0006) [2023-03-07 07:56:40,994][155452] Updated weights for policy 0, policy_version 16090 (0.0007) [2023-03-07 07:56:41,785][155452] Updated weights for policy 0, policy_version 16100 (0.0005) [2023-03-07 07:56:42,566][155452] Updated weights for policy 0, policy_version 16110 (0.0006) [2023-03-07 07:56:43,359][155452] Updated weights for policy 0, policy_version 16120 (0.0006) [2023-03-07 07:56:43,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13055.1). Total num frames: 16506880. Throughput: 0: 13031.2. Samples: 16499614. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:56:43,367][155126] Avg episode reward: [(0, '1507.660')] [2023-03-07 07:56:44,131][155452] Updated weights for policy 0, policy_version 16130 (0.0006) [2023-03-07 07:56:44,934][155452] Updated weights for policy 0, policy_version 16140 (0.0006) [2023-03-07 07:56:45,724][155452] Updated weights for policy 0, policy_version 16150 (0.0006) [2023-03-07 07:56:46,508][155452] Updated weights for policy 0, policy_version 16160 (0.0007) [2023-03-07 07:56:47,281][155452] Updated weights for policy 0, policy_version 16170 (0.0006) [2023-03-07 07:56:48,049][155452] Updated weights for policy 0, policy_version 16180 (0.0005) [2023-03-07 07:56:48,367][155126] Fps is (10 sec: 13107.4, 60 sec: 13039.0, 300 sec: 13055.1). Total num frames: 16572416. Throughput: 0: 13029.8. Samples: 16538584. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:56:48,367][155126] Avg episode reward: [(0, '1480.381')] [2023-03-07 07:56:48,837][155452] Updated weights for policy 0, policy_version 16190 (0.0006) [2023-03-07 07:56:49,619][155452] Updated weights for policy 0, policy_version 16200 (0.0006) [2023-03-07 07:56:50,404][155452] Updated weights for policy 0, policy_version 16210 (0.0006) [2023-03-07 07:56:51,172][155452] Updated weights for policy 0, policy_version 16220 (0.0007) [2023-03-07 07:56:51,961][155452] Updated weights for policy 0, policy_version 16230 (0.0006) [2023-03-07 07:56:52,745][155452] Updated weights for policy 0, policy_version 16240 (0.0006) [2023-03-07 07:56:53,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 16636928. Throughput: 0: 13046.1. Samples: 16617297. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:56:53,367][155126] Avg episode reward: [(0, '1455.863')] [2023-03-07 07:56:53,545][155452] Updated weights for policy 0, policy_version 16250 (0.0006) [2023-03-07 07:56:54,340][155452] Updated weights for policy 0, policy_version 16260 (0.0006) [2023-03-07 07:56:55,117][155452] Updated weights for policy 0, policy_version 16270 (0.0006) [2023-03-07 07:56:55,894][155452] Updated weights for policy 0, policy_version 16280 (0.0006) [2023-03-07 07:56:56,677][155452] Updated weights for policy 0, policy_version 16290 (0.0006) [2023-03-07 07:56:57,434][155452] Updated weights for policy 0, policy_version 16300 (0.0006) [2023-03-07 07:56:58,222][155452] Updated weights for policy 0, policy_version 16310 (0.0006) [2023-03-07 07:56:58,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13055.1). Total num frames: 16702464. Throughput: 0: 13047.1. Samples: 16695652. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:56:58,367][155126] Avg episode reward: [(0, '1641.951')] [2023-03-07 07:56:59,013][155452] Updated weights for policy 0, policy_version 16320 (0.0006) [2023-03-07 07:56:59,803][155452] Updated weights for policy 0, policy_version 16330 (0.0006) [2023-03-07 07:57:00,589][155452] Updated weights for policy 0, policy_version 16340 (0.0006) [2023-03-07 07:57:01,379][155452] Updated weights for policy 0, policy_version 16350 (0.0006) [2023-03-07 07:57:02,163][155452] Updated weights for policy 0, policy_version 16360 (0.0007) [2023-03-07 07:57:02,965][155452] Updated weights for policy 0, policy_version 16370 (0.0006) [2023-03-07 07:57:03,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13039.0, 300 sec: 13055.1). Total num frames: 16768000. Throughput: 0: 13050.1. Samples: 16734892. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:57:03,367][155126] Avg episode reward: [(0, '1465.879')] [2023-03-07 07:57:03,750][155452] Updated weights for policy 0, policy_version 16380 (0.0006) [2023-03-07 07:57:04,544][155452] Updated weights for policy 0, policy_version 16390 (0.0006) [2023-03-07 07:57:05,334][155452] Updated weights for policy 0, policy_version 16400 (0.0006) [2023-03-07 07:57:06,108][155452] Updated weights for policy 0, policy_version 16410 (0.0006) [2023-03-07 07:57:06,892][155452] Updated weights for policy 0, policy_version 16420 (0.0006) [2023-03-07 07:57:07,693][155452] Updated weights for policy 0, policy_version 16430 (0.0006) [2023-03-07 07:57:08,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 16832512. Throughput: 0: 13050.9. Samples: 16812891. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:57:08,367][155126] Avg episode reward: [(0, '1471.833')] [2023-03-07 07:57:08,475][155452] Updated weights for policy 0, policy_version 16440 (0.0006) [2023-03-07 07:57:09,285][155452] Updated weights for policy 0, policy_version 16450 (0.0006) [2023-03-07 07:57:10,063][155452] Updated weights for policy 0, policy_version 16460 (0.0006) [2023-03-07 07:57:10,833][155452] Updated weights for policy 0, policy_version 16470 (0.0006) [2023-03-07 07:57:11,622][155452] Updated weights for policy 0, policy_version 16480 (0.0005) [2023-03-07 07:57:12,416][155452] Updated weights for policy 0, policy_version 16490 (0.0006) [2023-03-07 07:57:13,195][155452] Updated weights for policy 0, policy_version 16500 (0.0006) [2023-03-07 07:57:13,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 16898048. Throughput: 0: 13046.0. Samples: 16890794. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:57:13,367][155126] Avg episode reward: [(0, '1428.283')] [2023-03-07 07:57:13,973][155452] Updated weights for policy 0, policy_version 16510 (0.0006) [2023-03-07 07:57:14,737][155452] Updated weights for policy 0, policy_version 16520 (0.0006) [2023-03-07 07:57:15,531][155452] Updated weights for policy 0, policy_version 16530 (0.0005) [2023-03-07 07:57:16,291][155452] Updated weights for policy 0, policy_version 16540 (0.0006) [2023-03-07 07:57:17,092][155452] Updated weights for policy 0, policy_version 16550 (0.0007) [2023-03-07 07:57:17,873][155452] Updated weights for policy 0, policy_version 16560 (0.0006) [2023-03-07 07:57:18,367][155126] Fps is (10 sec: 13107.4, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 16963584. Throughput: 0: 13056.7. Samples: 16930362. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:57:18,367][155126] Avg episode reward: [(0, '1540.298')] [2023-03-07 07:57:18,648][155452] Updated weights for policy 0, policy_version 16570 (0.0006) [2023-03-07 07:57:19,441][155452] Updated weights for policy 0, policy_version 16580 (0.0007) [2023-03-07 07:57:20,242][155452] Updated weights for policy 0, policy_version 16590 (0.0006) [2023-03-07 07:57:21,030][155452] Updated weights for policy 0, policy_version 16600 (0.0006) [2023-03-07 07:57:21,808][155452] Updated weights for policy 0, policy_version 16610 (0.0007) [2023-03-07 07:57:22,583][155452] Updated weights for policy 0, policy_version 16620 (0.0006) [2023-03-07 07:57:23,363][155452] Updated weights for policy 0, policy_version 16630 (0.0006) [2023-03-07 07:57:23,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 17029120. Throughput: 0: 13051.8. Samples: 17008606. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:57:23,367][155126] Avg episode reward: [(0, '1675.803')] [2023-03-07 07:57:24,156][155452] Updated weights for policy 0, policy_version 16640 (0.0007) [2023-03-07 07:57:24,945][155452] Updated weights for policy 0, policy_version 16650 (0.0006) [2023-03-07 07:57:25,722][155452] Updated weights for policy 0, policy_version 16660 (0.0006) [2023-03-07 07:57:26,510][155452] Updated weights for policy 0, policy_version 16670 (0.0007) [2023-03-07 07:57:27,311][155452] Updated weights for policy 0, policy_version 16680 (0.0007) [2023-03-07 07:57:28,088][155452] Updated weights for policy 0, policy_version 16690 (0.0006) [2023-03-07 07:57:28,367][155126] Fps is (10 sec: 13004.5, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 17093632. Throughput: 0: 13045.7. Samples: 17086672. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:57:28,368][155126] Avg episode reward: [(0, '1822.606')] [2023-03-07 07:57:28,373][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000016693_17093632.pth... [2023-03-07 07:57:28,405][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000013636_13963264.pth [2023-03-07 07:57:28,889][155452] Updated weights for policy 0, policy_version 16700 (0.0007) [2023-03-07 07:57:29,672][155452] Updated weights for policy 0, policy_version 16710 (0.0007) [2023-03-07 07:57:30,441][155452] Updated weights for policy 0, policy_version 16720 (0.0007) [2023-03-07 07:57:31,223][155452] Updated weights for policy 0, policy_version 16730 (0.0006) [2023-03-07 07:57:32,012][155452] Updated weights for policy 0, policy_version 16740 (0.0005) [2023-03-07 07:57:32,776][155452] Updated weights for policy 0, policy_version 16750 (0.0006) [2023-03-07 07:57:33,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 17159168. Throughput: 0: 13050.7. 
Samples: 17125865. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:57:33,367][155126] Avg episode reward: [(0, '1612.090')] [2023-03-07 07:57:33,568][155452] Updated weights for policy 0, policy_version 16760 (0.0007) [2023-03-07 07:57:34,361][155452] Updated weights for policy 0, policy_version 16770 (0.0005) [2023-03-07 07:57:35,122][155452] Updated weights for policy 0, policy_version 16780 (0.0006) [2023-03-07 07:57:35,917][155452] Updated weights for policy 0, policy_version 16790 (0.0006) [2023-03-07 07:57:36,696][155452] Updated weights for policy 0, policy_version 16800 (0.0006) [2023-03-07 07:57:37,491][155452] Updated weights for policy 0, policy_version 16810 (0.0006) [2023-03-07 07:57:38,257][155452] Updated weights for policy 0, policy_version 16820 (0.0006) [2023-03-07 07:57:38,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 17224704. Throughput: 0: 13049.0. Samples: 17204506. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:57:38,368][155126] Avg episode reward: [(0, '1417.188')] [2023-03-07 07:57:39,044][155452] Updated weights for policy 0, policy_version 16830 (0.0006) [2023-03-07 07:57:39,821][155452] Updated weights for policy 0, policy_version 16840 (0.0006) [2023-03-07 07:57:40,627][155452] Updated weights for policy 0, policy_version 16850 (0.0006) [2023-03-07 07:57:41,405][155452] Updated weights for policy 0, policy_version 16860 (0.0006) [2023-03-07 07:57:42,181][155452] Updated weights for policy 0, policy_version 16870 (0.0007) [2023-03-07 07:57:42,974][155452] Updated weights for policy 0, policy_version 16880 (0.0006) [2023-03-07 07:57:43,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 17290240. Throughput: 0: 13050.5. Samples: 17282925. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:57:43,368][155126] Avg episode reward: [(0, '1280.768')] [2023-03-07 07:57:43,767][155452] Updated weights for policy 0, policy_version 16890 (0.0007) [2023-03-07 07:57:44,539][155452] Updated weights for policy 0, policy_version 16900 (0.0006) [2023-03-07 07:57:45,324][155452] Updated weights for policy 0, policy_version 16910 (0.0006) [2023-03-07 07:57:46,098][155452] Updated weights for policy 0, policy_version 16920 (0.0006) [2023-03-07 07:57:46,869][155452] Updated weights for policy 0, policy_version 16930 (0.0006) [2023-03-07 07:57:47,660][155452] Updated weights for policy 0, policy_version 16940 (0.0006) [2023-03-07 07:57:48,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13048.2). Total num frames: 17354752. Throughput: 0: 13046.2. Samples: 17321973. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 07:57:48,367][155126] Avg episode reward: [(0, '1603.273')] [2023-03-07 07:57:48,468][155452] Updated weights for policy 0, policy_version 16950 (0.0006) [2023-03-07 07:57:49,237][155452] Updated weights for policy 0, policy_version 16960 (0.0006) [2023-03-07 07:57:50,021][155452] Updated weights for policy 0, policy_version 16970 (0.0007) [2023-03-07 07:57:50,822][155452] Updated weights for policy 0, policy_version 16980 (0.0006) [2023-03-07 07:57:51,586][155452] Updated weights for policy 0, policy_version 16990 (0.0006) [2023-03-07 07:57:52,357][155452] Updated weights for policy 0, policy_version 17000 (0.0006) [2023-03-07 07:57:53,145][155452] Updated weights for policy 0, policy_version 17010 (0.0006) [2023-03-07 07:57:53,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13051.7). Total num frames: 17421312. Throughput: 0: 13059.1. Samples: 17400548. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 07:57:53,367][155126] Avg episode reward: [(0, '1417.376')] [2023-03-07 07:57:53,920][155452] Updated weights for policy 0, policy_version 17020 (0.0006) [2023-03-07 07:57:54,695][155452] Updated weights for policy 0, policy_version 17030 (0.0006) [2023-03-07 07:57:55,476][155452] Updated weights for policy 0, policy_version 17040 (0.0006) [2023-03-07 07:57:56,263][155452] Updated weights for policy 0, policy_version 17050 (0.0006) [2023-03-07 07:57:57,036][155452] Updated weights for policy 0, policy_version 17060 (0.0006) [2023-03-07 07:57:57,821][155452] Updated weights for policy 0, policy_version 17070 (0.0006) [2023-03-07 07:57:58,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 17485824. Throughput: 0: 13078.0. Samples: 17479306. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 07:57:58,367][155126] Avg episode reward: [(0, '1605.956')] [2023-03-07 07:57:58,595][155452] Updated weights for policy 0, policy_version 17080 (0.0006) [2023-03-07 07:57:59,377][155452] Updated weights for policy 0, policy_version 17090 (0.0007) [2023-03-07 07:58:00,179][155452] Updated weights for policy 0, policy_version 17100 (0.0006) [2023-03-07 07:58:00,952][155452] Updated weights for policy 0, policy_version 17110 (0.0006) [2023-03-07 07:58:01,749][155452] Updated weights for policy 0, policy_version 17120 (0.0007) [2023-03-07 07:58:02,536][155452] Updated weights for policy 0, policy_version 17130 (0.0006) [2023-03-07 07:58:03,308][155452] Updated weights for policy 0, policy_version 17140 (0.0006) [2023-03-07 07:58:03,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 17551360. Throughput: 0: 13070.9. Samples: 17518553. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 07:58:03,367][155126] Avg episode reward: [(0, '1668.281')] [2023-03-07 07:58:04,100][155452] Updated weights for policy 0, policy_version 17150 (0.0007) [2023-03-07 07:58:04,901][155452] Updated weights for policy 0, policy_version 17160 (0.0006) [2023-03-07 07:58:05,682][155452] Updated weights for policy 0, policy_version 17170 (0.0007) [2023-03-07 07:58:06,473][155452] Updated weights for policy 0, policy_version 17180 (0.0007) [2023-03-07 07:58:07,270][155452] Updated weights for policy 0, policy_version 17190 (0.0006) [2023-03-07 07:58:08,045][155452] Updated weights for policy 0, policy_version 17200 (0.0005) [2023-03-07 07:58:08,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13051.7). Total num frames: 17616896. Throughput: 0: 13065.7. Samples: 17596562. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 07:58:08,368][155126] Avg episode reward: [(0, '1570.280')] [2023-03-07 07:58:08,837][155452] Updated weights for policy 0, policy_version 17210 (0.0007) [2023-03-07 07:58:09,617][155452] Updated weights for policy 0, policy_version 17220 (0.0006) [2023-03-07 07:58:10,405][155452] Updated weights for policy 0, policy_version 17230 (0.0006) [2023-03-07 07:58:11,180][155452] Updated weights for policy 0, policy_version 17240 (0.0006) [2023-03-07 07:58:11,980][155452] Updated weights for policy 0, policy_version 17250 (0.0006) [2023-03-07 07:58:12,757][155452] Updated weights for policy 0, policy_version 17260 (0.0006) [2023-03-07 07:58:13,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 17681408. Throughput: 0: 13067.6. Samples: 17674711. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:58:13,367][155126] Avg episode reward: [(0, '1511.031')] [2023-03-07 07:58:13,531][155452] Updated weights for policy 0, policy_version 17270 (0.0007) [2023-03-07 07:58:14,317][155452] Updated weights for policy 0, policy_version 17280 (0.0006) [2023-03-07 07:58:15,108][155452] Updated weights for policy 0, policy_version 17290 (0.0006) [2023-03-07 07:58:15,889][155452] Updated weights for policy 0, policy_version 17300 (0.0006) [2023-03-07 07:58:16,678][155452] Updated weights for policy 0, policy_version 17310 (0.0006) [2023-03-07 07:58:17,447][155452] Updated weights for policy 0, policy_version 17320 (0.0006) [2023-03-07 07:58:18,228][155452] Updated weights for policy 0, policy_version 17330 (0.0006) [2023-03-07 07:58:18,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 17746944. Throughput: 0: 13068.7. Samples: 17713959. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:58:18,367][155126] Avg episode reward: [(0, '1484.210')] [2023-03-07 07:58:19,014][155452] Updated weights for policy 0, policy_version 17340 (0.0006) [2023-03-07 07:58:19,802][155452] Updated weights for policy 0, policy_version 17350 (0.0006) [2023-03-07 07:58:20,583][155452] Updated weights for policy 0, policy_version 17360 (0.0006) [2023-03-07 07:58:21,357][155452] Updated weights for policy 0, policy_version 17370 (0.0006) [2023-03-07 07:58:22,132][155452] Updated weights for policy 0, policy_version 17380 (0.0007) [2023-03-07 07:58:22,930][155452] Updated weights for policy 0, policy_version 17390 (0.0007) [2023-03-07 07:58:23,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 17812480. Throughput: 0: 13067.7. Samples: 17792553. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:58:23,367][155126] Avg episode reward: [(0, '1675.308')] [2023-03-07 07:58:23,714][155452] Updated weights for policy 0, policy_version 17400 (0.0006) [2023-03-07 07:58:24,488][155452] Updated weights for policy 0, policy_version 17410 (0.0005) [2023-03-07 07:58:25,272][155452] Updated weights for policy 0, policy_version 17420 (0.0006) [2023-03-07 07:58:26,066][155452] Updated weights for policy 0, policy_version 17430 (0.0007) [2023-03-07 07:58:26,848][155452] Updated weights for policy 0, policy_version 17440 (0.0006) [2023-03-07 07:58:27,635][155452] Updated weights for policy 0, policy_version 17450 (0.0006) [2023-03-07 07:58:28,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13048.2). Total num frames: 17878016. Throughput: 0: 13064.5. Samples: 17870826. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:58:28,367][155126] Avg episode reward: [(0, '1625.688')] [2023-03-07 07:58:28,422][155452] Updated weights for policy 0, policy_version 17460 (0.0006) [2023-03-07 07:58:29,197][155452] Updated weights for policy 0, policy_version 17470 (0.0005) [2023-03-07 07:58:29,991][155452] Updated weights for policy 0, policy_version 17480 (0.0006) [2023-03-07 07:58:30,756][155452] Updated weights for policy 0, policy_version 17490 (0.0006) [2023-03-07 07:58:31,577][155452] Updated weights for policy 0, policy_version 17500 (0.0006) [2023-03-07 07:58:32,347][155452] Updated weights for policy 0, policy_version 17510 (0.0006) [2023-03-07 07:58:33,111][155452] Updated weights for policy 0, policy_version 17520 (0.0006) [2023-03-07 07:58:33,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13048.2). Total num frames: 17943552. Throughput: 0: 13067.9. Samples: 17910027. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:58:33,367][155126] Avg episode reward: [(0, '1671.593')] [2023-03-07 07:58:33,899][155452] Updated weights for policy 0, policy_version 17530 (0.0006) [2023-03-07 07:58:34,685][155452] Updated weights for policy 0, policy_version 17540 (0.0006) [2023-03-07 07:58:35,470][155452] Updated weights for policy 0, policy_version 17550 (0.0006) [2023-03-07 07:58:36,258][155452] Updated weights for policy 0, policy_version 17560 (0.0005) [2023-03-07 07:58:37,042][155452] Updated weights for policy 0, policy_version 17570 (0.0006) [2023-03-07 07:58:37,825][155452] Updated weights for policy 0, policy_version 17580 (0.0006) [2023-03-07 07:58:38,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13044.7). Total num frames: 18008064. Throughput: 0: 13066.2. Samples: 17988527. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:58:38,367][155126] Avg episode reward: [(0, '1693.806')] [2023-03-07 07:58:38,602][155452] Updated weights for policy 0, policy_version 17590 (0.0006) [2023-03-07 07:58:39,390][155452] Updated weights for policy 0, policy_version 17600 (0.0006) [2023-03-07 07:58:40,185][155452] Updated weights for policy 0, policy_version 17610 (0.0007) [2023-03-07 07:58:40,969][155452] Updated weights for policy 0, policy_version 17620 (0.0006) [2023-03-07 07:58:41,749][155452] Updated weights for policy 0, policy_version 17630 (0.0006) [2023-03-07 07:58:42,537][155452] Updated weights for policy 0, policy_version 17640 (0.0006) [2023-03-07 07:58:43,325][155452] Updated weights for policy 0, policy_version 17650 (0.0006) [2023-03-07 07:58:43,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 18073600. Throughput: 0: 13052.8. Samples: 18066682. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:58:43,368][155126] Avg episode reward: [(0, '1720.313')] [2023-03-07 07:58:44,111][155452] Updated weights for policy 0, policy_version 17660 (0.0006) [2023-03-07 07:58:44,881][155452] Updated weights for policy 0, policy_version 17670 (0.0006) [2023-03-07 07:58:45,667][155452] Updated weights for policy 0, policy_version 17680 (0.0006) [2023-03-07 07:58:46,459][155452] Updated weights for policy 0, policy_version 17690 (0.0006) [2023-03-07 07:58:47,250][155452] Updated weights for policy 0, policy_version 17700 (0.0006) [2023-03-07 07:58:48,026][155452] Updated weights for policy 0, policy_version 17710 (0.0007) [2023-03-07 07:58:48,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13073.1, 300 sec: 13048.2). Total num frames: 18139136. Throughput: 0: 13052.1. Samples: 18105900. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:58:48,367][155126] Avg episode reward: [(0, '1629.881')] [2023-03-07 07:58:48,798][155452] Updated weights for policy 0, policy_version 17720 (0.0007) [2023-03-07 07:58:49,626][155452] Updated weights for policy 0, policy_version 17730 (0.0006) [2023-03-07 07:58:50,388][155452] Updated weights for policy 0, policy_version 17740 (0.0006) [2023-03-07 07:58:51,175][155452] Updated weights for policy 0, policy_version 17750 (0.0006) [2023-03-07 07:58:51,966][155452] Updated weights for policy 0, policy_version 17760 (0.0006) [2023-03-07 07:58:52,757][155452] Updated weights for policy 0, policy_version 17770 (0.0007) [2023-03-07 07:58:53,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 18204672. Throughput: 0: 13054.0. Samples: 18183990. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:58:53,368][155126] Avg episode reward: [(0, '1554.130')] [2023-03-07 07:58:53,526][155452] Updated weights for policy 0, policy_version 17780 (0.0006) [2023-03-07 07:58:54,289][155452] Updated weights for policy 0, policy_version 17790 (0.0006) [2023-03-07 07:58:55,099][155452] Updated weights for policy 0, policy_version 17800 (0.0008) [2023-03-07 07:58:55,877][155452] Updated weights for policy 0, policy_version 17810 (0.0006) [2023-03-07 07:58:56,657][155452] Updated weights for policy 0, policy_version 17820 (0.0005) [2023-03-07 07:58:57,454][155452] Updated weights for policy 0, policy_version 17830 (0.0007) [2023-03-07 07:58:58,245][155452] Updated weights for policy 0, policy_version 17840 (0.0006) [2023-03-07 07:58:58,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 18269184. Throughput: 0: 13059.5. Samples: 18262388. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:58:58,367][155126] Avg episode reward: [(0, '1458.110')] [2023-03-07 07:58:59,037][155452] Updated weights for policy 0, policy_version 17850 (0.0006) [2023-03-07 07:58:59,805][155452] Updated weights for policy 0, policy_version 17860 (0.0005) [2023-03-07 07:59:00,598][155452] Updated weights for policy 0, policy_version 17870 (0.0006) [2023-03-07 07:59:01,381][155452] Updated weights for policy 0, policy_version 17880 (0.0006) [2023-03-07 07:59:02,161][155452] Updated weights for policy 0, policy_version 17890 (0.0006) [2023-03-07 07:59:02,937][155452] Updated weights for policy 0, policy_version 17900 (0.0006) [2023-03-07 07:59:03,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 18334720. Throughput: 0: 13053.1. Samples: 18301351. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:59:03,368][155126] Avg episode reward: [(0, '1548.155')] [2023-03-07 07:59:03,711][155452] Updated weights for policy 0, policy_version 17910 (0.0006) [2023-03-07 07:59:04,500][155452] Updated weights for policy 0, policy_version 17920 (0.0006) [2023-03-07 07:59:05,288][155452] Updated weights for policy 0, policy_version 17930 (0.0006) [2023-03-07 07:59:06,072][155452] Updated weights for policy 0, policy_version 17940 (0.0006) [2023-03-07 07:59:06,870][155452] Updated weights for policy 0, policy_version 17950 (0.0006) [2023-03-07 07:59:07,654][155452] Updated weights for policy 0, policy_version 17960 (0.0006) [2023-03-07 07:59:08,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13039.0, 300 sec: 13044.7). Total num frames: 18399232. Throughput: 0: 13051.4. Samples: 18379866. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:59:08,367][155126] Avg episode reward: [(0, '1601.721')] [2023-03-07 07:59:08,469][155452] Updated weights for policy 0, policy_version 17970 (0.0006) [2023-03-07 07:59:09,242][155452] Updated weights for policy 0, policy_version 17980 (0.0007) [2023-03-07 07:59:10,025][155452] Updated weights for policy 0, policy_version 17990 (0.0005) [2023-03-07 07:59:10,822][155452] Updated weights for policy 0, policy_version 18000 (0.0006) [2023-03-07 07:59:11,578][155452] Updated weights for policy 0, policy_version 18010 (0.0006) [2023-03-07 07:59:12,356][155452] Updated weights for policy 0, policy_version 18020 (0.0006) [2023-03-07 07:59:13,180][155452] Updated weights for policy 0, policy_version 18030 (0.0005) [2023-03-07 07:59:13,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13044.7). Total num frames: 18464768. Throughput: 0: 13045.9. Samples: 18457894. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:59:13,367][155126] Avg episode reward: [(0, '1498.464')] [2023-03-07 07:59:13,956][155452] Updated weights for policy 0, policy_version 18040 (0.0006) [2023-03-07 07:59:14,732][155452] Updated weights for policy 0, policy_version 18050 (0.0005) [2023-03-07 07:59:15,532][155452] Updated weights for policy 0, policy_version 18060 (0.0007) [2023-03-07 07:59:16,311][155452] Updated weights for policy 0, policy_version 18070 (0.0006) [2023-03-07 07:59:17,088][155452] Updated weights for policy 0, policy_version 18080 (0.0005) [2023-03-07 07:59:17,883][155452] Updated weights for policy 0, policy_version 18090 (0.0007) [2023-03-07 07:59:18,367][155126] Fps is (10 sec: 13106.8, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 18530304. Throughput: 0: 13044.7. Samples: 18497043. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:59:18,368][155126] Avg episode reward: [(0, '1640.906')] [2023-03-07 07:59:18,649][155452] Updated weights for policy 0, policy_version 18100 (0.0006) [2023-03-07 07:59:19,424][155452] Updated weights for policy 0, policy_version 18110 (0.0006) [2023-03-07 07:59:20,203][155452] Updated weights for policy 0, policy_version 18120 (0.0006) [2023-03-07 07:59:20,976][155452] Updated weights for policy 0, policy_version 18130 (0.0006) [2023-03-07 07:59:21,770][155452] Updated weights for policy 0, policy_version 18140 (0.0006) [2023-03-07 07:59:22,540][155452] Updated weights for policy 0, policy_version 18150 (0.0007) [2023-03-07 07:59:23,340][155452] Updated weights for policy 0, policy_version 18160 (0.0007) [2023-03-07 07:59:23,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 18595840. Throughput: 0: 13048.8. Samples: 18575727. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:59:23,368][155126] Avg episode reward: [(0, '1763.071')] [2023-03-07 07:59:24,116][155452] Updated weights for policy 0, policy_version 18170 (0.0005) [2023-03-07 07:59:24,878][155452] Updated weights for policy 0, policy_version 18180 (0.0006) [2023-03-07 07:59:25,691][155452] Updated weights for policy 0, policy_version 18190 (0.0006) [2023-03-07 07:59:26,490][155452] Updated weights for policy 0, policy_version 18200 (0.0006) [2023-03-07 07:59:27,269][155452] Updated weights for policy 0, policy_version 18210 (0.0005) [2023-03-07 07:59:28,058][155452] Updated weights for policy 0, policy_version 18220 (0.0007) [2023-03-07 07:59:28,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 18661376. Throughput: 0: 13049.9. Samples: 18653929. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:59:28,375][155126] Avg episode reward: [(0, '1635.990')] [2023-03-07 07:59:28,380][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000018224_18661376.pth... [2023-03-07 07:59:28,411][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000015165_15528960.pth [2023-03-07 07:59:28,840][155452] Updated weights for policy 0, policy_version 18230 (0.0006) [2023-03-07 07:59:29,614][155452] Updated weights for policy 0, policy_version 18240 (0.0006) [2023-03-07 07:59:30,393][155452] Updated weights for policy 0, policy_version 18250 (0.0006) [2023-03-07 07:59:31,187][155452] Updated weights for policy 0, policy_version 18260 (0.0006) [2023-03-07 07:59:31,966][155452] Updated weights for policy 0, policy_version 18270 (0.0006) [2023-03-07 07:59:32,760][155452] Updated weights for policy 0, policy_version 18280 (0.0007) [2023-03-07 07:59:33,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13038.9, 300 sec: 13048.2). Total num frames: 18725888. Throughput: 0: 13051.3. 
Samples: 18693208. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 07:59:33,367][155126] Avg episode reward: [(0, '1512.633')] [2023-03-07 07:59:33,542][155452] Updated weights for policy 0, policy_version 18290 (0.0006) [2023-03-07 07:59:34,309][155452] Updated weights for policy 0, policy_version 18300 (0.0006) [2023-03-07 07:59:35,104][155452] Updated weights for policy 0, policy_version 18310 (0.0006) [2023-03-07 07:59:35,891][155452] Updated weights for policy 0, policy_version 18320 (0.0006) [2023-03-07 07:59:36,678][155452] Updated weights for policy 0, policy_version 18330 (0.0006) [2023-03-07 07:59:37,454][155452] Updated weights for policy 0, policy_version 18340 (0.0006) [2023-03-07 07:59:38,239][155452] Updated weights for policy 0, policy_version 18350 (0.0007) [2023-03-07 07:59:38,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 18791424. Throughput: 0: 13055.2. Samples: 18771477. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:59:38,367][155126] Avg episode reward: [(0, '1513.372')] [2023-03-07 07:59:39,020][155452] Updated weights for policy 0, policy_version 18360 (0.0007) [2023-03-07 07:59:39,810][155452] Updated weights for policy 0, policy_version 18370 (0.0007) [2023-03-07 07:59:40,594][155452] Updated weights for policy 0, policy_version 18380 (0.0006) [2023-03-07 07:59:41,362][155452] Updated weights for policy 0, policy_version 18390 (0.0006) [2023-03-07 07:59:42,158][155452] Updated weights for policy 0, policy_version 18400 (0.0006) [2023-03-07 07:59:42,947][155452] Updated weights for policy 0, policy_version 18410 (0.0006) [2023-03-07 07:59:43,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 18856960. Throughput: 0: 13057.3. Samples: 18849965. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:59:43,367][155126] Avg episode reward: [(0, '1432.383')] [2023-03-07 07:59:43,740][155452] Updated weights for policy 0, policy_version 18420 (0.0006) [2023-03-07 07:59:44,516][155452] Updated weights for policy 0, policy_version 18430 (0.0006) [2023-03-07 07:59:45,298][155452] Updated weights for policy 0, policy_version 18440 (0.0006) [2023-03-07 07:59:46,075][155452] Updated weights for policy 0, policy_version 18450 (0.0006) [2023-03-07 07:59:46,842][155452] Updated weights for policy 0, policy_version 18460 (0.0006) [2023-03-07 07:59:47,639][155452] Updated weights for policy 0, policy_version 18470 (0.0006) [2023-03-07 07:59:48,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 18922496. Throughput: 0: 13061.8. Samples: 18889133. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:59:48,367][155126] Avg episode reward: [(0, '1511.103')] [2023-03-07 07:59:48,412][155452] Updated weights for policy 0, policy_version 18480 (0.0007) [2023-03-07 07:59:49,209][155452] Updated weights for policy 0, policy_version 18490 (0.0006) [2023-03-07 07:59:49,995][155452] Updated weights for policy 0, policy_version 18500 (0.0005) [2023-03-07 07:59:50,772][155452] Updated weights for policy 0, policy_version 18510 (0.0006) [2023-03-07 07:59:51,552][155452] Updated weights for policy 0, policy_version 18520 (0.0007) [2023-03-07 07:59:52,335][155452] Updated weights for policy 0, policy_version 18530 (0.0006) [2023-03-07 07:59:53,106][155452] Updated weights for policy 0, policy_version 18540 (0.0006) [2023-03-07 07:59:53,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 18988032. Throughput: 0: 13063.5. Samples: 18967724. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:59:53,367][155126] Avg episode reward: [(0, '1336.896')] [2023-03-07 07:59:53,876][155452] Updated weights for policy 0, policy_version 18550 (0.0008) [2023-03-07 07:59:54,657][155452] Updated weights for policy 0, policy_version 18560 (0.0006) [2023-03-07 07:59:55,457][155452] Updated weights for policy 0, policy_version 18570 (0.0006) [2023-03-07 07:59:56,216][155452] Updated weights for policy 0, policy_version 18580 (0.0007) [2023-03-07 07:59:57,010][155452] Updated weights for policy 0, policy_version 18590 (0.0006) [2023-03-07 07:59:57,791][155452] Updated weights for policy 0, policy_version 18600 (0.0006) [2023-03-07 07:59:58,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13055.1). Total num frames: 19053568. Throughput: 0: 13076.8. Samples: 19046350. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 07:59:58,367][155126] Avg episode reward: [(0, '1581.835')] [2023-03-07 07:59:58,579][155452] Updated weights for policy 0, policy_version 18610 (0.0006) [2023-03-07 07:59:59,371][155452] Updated weights for policy 0, policy_version 18620 (0.0006) [2023-03-07 08:00:00,157][155452] Updated weights for policy 0, policy_version 18630 (0.0006) [2023-03-07 08:00:00,920][155452] Updated weights for policy 0, policy_version 18640 (0.0006) [2023-03-07 08:00:01,697][155452] Updated weights for policy 0, policy_version 18650 (0.0007) [2023-03-07 08:00:02,478][155452] Updated weights for policy 0, policy_version 18660 (0.0006) [2023-03-07 08:00:03,259][155452] Updated weights for policy 0, policy_version 18670 (0.0007) [2023-03-07 08:00:03,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13055.1). Total num frames: 19119104. Throughput: 0: 13077.8. Samples: 19085541. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 4.0) [2023-03-07 08:00:03,367][155126] Avg episode reward: [(0, '1594.060')] [2023-03-07 08:00:04,045][155452] Updated weights for policy 0, policy_version 18680 (0.0006) [2023-03-07 08:00:04,841][155452] Updated weights for policy 0, policy_version 18690 (0.0006) [2023-03-07 08:00:05,633][155452] Updated weights for policy 0, policy_version 18700 (0.0007) [2023-03-07 08:00:06,426][155452] Updated weights for policy 0, policy_version 18710 (0.0007) [2023-03-07 08:00:07,209][155452] Updated weights for policy 0, policy_version 18720 (0.0006) [2023-03-07 08:00:08,001][155452] Updated weights for policy 0, policy_version 18730 (0.0006) [2023-03-07 08:00:08,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13073.0, 300 sec: 13055.1). Total num frames: 19183616. Throughput: 0: 13070.7. Samples: 19163908. Policy #0 lag: (min: 0.0, avg: 1.2, max: 4.0) [2023-03-07 08:00:08,368][155126] Avg episode reward: [(0, '1413.073')] [2023-03-07 08:00:08,781][155452] Updated weights for policy 0, policy_version 18740 (0.0006) [2023-03-07 08:00:09,578][155452] Updated weights for policy 0, policy_version 18750 (0.0006) [2023-03-07 08:00:10,365][155452] Updated weights for policy 0, policy_version 18760 (0.0006) [2023-03-07 08:00:11,161][155452] Updated weights for policy 0, policy_version 18770 (0.0007) [2023-03-07 08:00:11,938][155452] Updated weights for policy 0, policy_version 18780 (0.0006) [2023-03-07 08:00:12,732][155452] Updated weights for policy 0, policy_version 18790 (0.0006) [2023-03-07 08:00:13,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13073.1, 300 sec: 13055.1). Total num frames: 19249152. Throughput: 0: 13067.6. Samples: 19241969. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 4.0) [2023-03-07 08:00:13,367][155126] Avg episode reward: [(0, '1600.564')] [2023-03-07 08:00:13,506][155452] Updated weights for policy 0, policy_version 18800 (0.0006) [2023-03-07 08:00:14,282][155452] Updated weights for policy 0, policy_version 18810 (0.0006) [2023-03-07 08:00:15,069][155452] Updated weights for policy 0, policy_version 18820 (0.0007) [2023-03-07 08:00:15,849][155452] Updated weights for policy 0, policy_version 18830 (0.0006) [2023-03-07 08:00:16,649][155452] Updated weights for policy 0, policy_version 18840 (0.0007) [2023-03-07 08:00:17,435][155452] Updated weights for policy 0, policy_version 18850 (0.0006) [2023-03-07 08:00:18,208][155452] Updated weights for policy 0, policy_version 18860 (0.0006) [2023-03-07 08:00:18,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13055.1). Total num frames: 19314688. Throughput: 0: 13066.2. Samples: 19281190. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:00:18,368][155126] Avg episode reward: [(0, '1472.859')] [2023-03-07 08:00:18,994][155452] Updated weights for policy 0, policy_version 18870 (0.0006) [2023-03-07 08:00:19,789][155452] Updated weights for policy 0, policy_version 18880 (0.0006) [2023-03-07 08:00:20,573][155452] Updated weights for policy 0, policy_version 18890 (0.0006) [2023-03-07 08:00:21,361][155452] Updated weights for policy 0, policy_version 18900 (0.0006) [2023-03-07 08:00:22,145][155452] Updated weights for policy 0, policy_version 18910 (0.0007) [2023-03-07 08:00:22,913][155452] Updated weights for policy 0, policy_version 18920 (0.0006) [2023-03-07 08:00:23,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 19379200. Throughput: 0: 13064.9. Samples: 19359397. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:00:23,367][155126] Avg episode reward: [(0, '1554.367')] [2023-03-07 08:00:23,711][155452] Updated weights for policy 0, policy_version 18930 (0.0006) [2023-03-07 08:00:24,509][155452] Updated weights for policy 0, policy_version 18940 (0.0006) [2023-03-07 08:00:25,294][155452] Updated weights for policy 0, policy_version 18950 (0.0007) [2023-03-07 08:00:26,073][155452] Updated weights for policy 0, policy_version 18960 (0.0006) [2023-03-07 08:00:26,849][155452] Updated weights for policy 0, policy_version 18970 (0.0006) [2023-03-07 08:00:27,650][155452] Updated weights for policy 0, policy_version 18980 (0.0007) [2023-03-07 08:00:28,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13055.1). Total num frames: 19444736. Throughput: 0: 13059.7. Samples: 19437653. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:00:28,367][155126] Avg episode reward: [(0, '1503.681')] [2023-03-07 08:00:28,421][155452] Updated weights for policy 0, policy_version 18990 (0.0005) [2023-03-07 08:00:29,211][155452] Updated weights for policy 0, policy_version 19000 (0.0006) [2023-03-07 08:00:29,978][155452] Updated weights for policy 0, policy_version 19010 (0.0006) [2023-03-07 08:00:30,754][155452] Updated weights for policy 0, policy_version 19020 (0.0007) [2023-03-07 08:00:31,553][155452] Updated weights for policy 0, policy_version 19030 (0.0006) [2023-03-07 08:00:32,329][155452] Updated weights for policy 0, policy_version 19040 (0.0006) [2023-03-07 08:00:33,102][155452] Updated weights for policy 0, policy_version 19050 (0.0006) [2023-03-07 08:00:33,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13073.0, 300 sec: 13055.1). Total num frames: 19510272. Throughput: 0: 13063.8. Samples: 19477006. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:00:33,368][155126] Avg episode reward: [(0, '1768.931')] [2023-03-07 08:00:33,870][155452] Updated weights for policy 0, policy_version 19060 (0.0006) [2023-03-07 08:00:34,676][155452] Updated weights for policy 0, policy_version 19070 (0.0006) [2023-03-07 08:00:35,453][155452] Updated weights for policy 0, policy_version 19080 (0.0006) [2023-03-07 08:00:36,259][155452] Updated weights for policy 0, policy_version 19090 (0.0006) [2023-03-07 08:00:37,039][155452] Updated weights for policy 0, policy_version 19100 (0.0006) [2023-03-07 08:00:37,829][155452] Updated weights for policy 0, policy_version 19110 (0.0006) [2023-03-07 08:00:38,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13055.1). Total num frames: 19574784. Throughput: 0: 13052.6. Samples: 19555091. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:00:38,368][155126] Avg episode reward: [(0, '1763.797')] [2023-03-07 08:00:38,615][155452] Updated weights for policy 0, policy_version 19120 (0.0005) [2023-03-07 08:00:39,425][155452] Updated weights for policy 0, policy_version 19130 (0.0006) [2023-03-07 08:00:40,195][155452] Updated weights for policy 0, policy_version 19140 (0.0006) [2023-03-07 08:00:40,979][155452] Updated weights for policy 0, policy_version 19150 (0.0005) [2023-03-07 08:00:41,761][155452] Updated weights for policy 0, policy_version 19160 (0.0006) [2023-03-07 08:00:42,541][155452] Updated weights for policy 0, policy_version 19170 (0.0005) [2023-03-07 08:00:43,319][155452] Updated weights for policy 0, policy_version 19180 (0.0006) [2023-03-07 08:00:43,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 19640320. Throughput: 0: 13047.5. Samples: 19633486. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:00:43,367][155126] Avg episode reward: [(0, '1746.887')] [2023-03-07 08:00:44,122][155452] Updated weights for policy 0, policy_version 19190 (0.0006) [2023-03-07 08:00:44,896][155452] Updated weights for policy 0, policy_version 19200 (0.0007) [2023-03-07 08:00:45,688][155452] Updated weights for policy 0, policy_version 19210 (0.0006) [2023-03-07 08:00:46,479][155452] Updated weights for policy 0, policy_version 19220 (0.0006) [2023-03-07 08:00:47,284][155452] Updated weights for policy 0, policy_version 19230 (0.0006) [2023-03-07 08:00:48,062][155452] Updated weights for policy 0, policy_version 19240 (0.0005) [2023-03-07 08:00:48,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 19704832. Throughput: 0: 13041.4. Samples: 19672405. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:00:48,367][155126] Avg episode reward: [(0, '1679.192')] [2023-03-07 08:00:48,844][155452] Updated weights for policy 0, policy_version 19250 (0.0006) [2023-03-07 08:00:49,640][155452] Updated weights for policy 0, policy_version 19260 (0.0006) [2023-03-07 08:00:50,435][155452] Updated weights for policy 0, policy_version 19270 (0.0006) [2023-03-07 08:00:51,222][155452] Updated weights for policy 0, policy_version 19280 (0.0006) [2023-03-07 08:00:52,017][155452] Updated weights for policy 0, policy_version 19290 (0.0006) [2023-03-07 08:00:52,794][155452] Updated weights for policy 0, policy_version 19300 (0.0006) [2023-03-07 08:00:53,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 19770368. Throughput: 0: 13026.2. Samples: 19750084. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:00:53,367][155126] Avg episode reward: [(0, '2072.811')] [2023-03-07 08:00:53,583][155452] Updated weights for policy 0, policy_version 19310 (0.0007) [2023-03-07 08:00:54,364][155452] Updated weights for policy 0, policy_version 19320 (0.0006) [2023-03-07 08:00:55,141][155452] Updated weights for policy 0, policy_version 19330 (0.0006) [2023-03-07 08:00:55,934][155452] Updated weights for policy 0, policy_version 19340 (0.0006) [2023-03-07 08:00:56,719][155452] Updated weights for policy 0, policy_version 19350 (0.0006) [2023-03-07 08:00:57,502][155452] Updated weights for policy 0, policy_version 19360 (0.0007) [2023-03-07 08:00:58,275][155452] Updated weights for policy 0, policy_version 19370 (0.0006) [2023-03-07 08:00:58,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 19835904. Throughput: 0: 13035.8. Samples: 19828579. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:00:58,368][155126] Avg episode reward: [(0, '1741.333')] [2023-03-07 08:00:59,064][155452] Updated weights for policy 0, policy_version 19380 (0.0006) [2023-03-07 08:00:59,845][155452] Updated weights for policy 0, policy_version 19390 (0.0006) [2023-03-07 08:01:00,639][155452] Updated weights for policy 0, policy_version 19400 (0.0006) [2023-03-07 08:01:01,432][155452] Updated weights for policy 0, policy_version 19410 (0.0007) [2023-03-07 08:01:02,210][155452] Updated weights for policy 0, policy_version 19420 (0.0007) [2023-03-07 08:01:02,990][155452] Updated weights for policy 0, policy_version 19430 (0.0006) [2023-03-07 08:01:03,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13051.7). Total num frames: 19900416. Throughput: 0: 13034.7. Samples: 19867750. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:01:03,367][155126] Avg episode reward: [(0, '1825.233')] [2023-03-07 08:01:03,776][155452] Updated weights for policy 0, policy_version 19440 (0.0007) [2023-03-07 08:01:04,542][155452] Updated weights for policy 0, policy_version 19450 (0.0006) [2023-03-07 08:01:05,324][155452] Updated weights for policy 0, policy_version 19460 (0.0006) [2023-03-07 08:01:06,105][155452] Updated weights for policy 0, policy_version 19470 (0.0006) [2023-03-07 08:01:06,888][155452] Updated weights for policy 0, policy_version 19480 (0.0007) [2023-03-07 08:01:07,678][155452] Updated weights for policy 0, policy_version 19490 (0.0006) [2023-03-07 08:01:08,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13039.0, 300 sec: 13051.7). Total num frames: 19965952. Throughput: 0: 13040.4. Samples: 19946216. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:01:08,367][155126] Avg episode reward: [(0, '2029.703')] [2023-03-07 08:01:08,462][155452] Updated weights for policy 0, policy_version 19500 (0.0006) [2023-03-07 08:01:09,240][155452] Updated weights for policy 0, policy_version 19510 (0.0006) [2023-03-07 08:01:10,013][155452] Updated weights for policy 0, policy_version 19520 (0.0006) [2023-03-07 08:01:10,817][155452] Updated weights for policy 0, policy_version 19530 (0.0006) [2023-03-07 08:01:11,600][155452] Updated weights for policy 0, policy_version 19540 (0.0006) [2023-03-07 08:01:12,368][155452] Updated weights for policy 0, policy_version 19550 (0.0006) [2023-03-07 08:01:13,175][155452] Updated weights for policy 0, policy_version 19560 (0.0006) [2023-03-07 08:01:13,367][155126] Fps is (10 sec: 13107.0, 60 sec: 13038.9, 300 sec: 13055.1). Total num frames: 20031488. Throughput: 0: 13043.0. Samples: 20024589. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:01:13,367][155126] Avg episode reward: [(0, '1898.031')] [2023-03-07 08:01:13,953][155452] Updated weights for policy 0, policy_version 19570 (0.0006) [2023-03-07 08:01:14,747][155452] Updated weights for policy 0, policy_version 19580 (0.0007) [2023-03-07 08:01:15,521][155452] Updated weights for policy 0, policy_version 19590 (0.0006) [2023-03-07 08:01:16,311][155452] Updated weights for policy 0, policy_version 19600 (0.0006) [2023-03-07 08:01:17,109][155452] Updated weights for policy 0, policy_version 19610 (0.0007) [2023-03-07 08:01:17,899][155452] Updated weights for policy 0, policy_version 19620 (0.0006) [2023-03-07 08:01:18,367][155126] Fps is (10 sec: 13004.5, 60 sec: 13021.9, 300 sec: 13051.7). Total num frames: 20096000. Throughput: 0: 13035.2. Samples: 20063592. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:01:18,368][155126] Avg episode reward: [(0, '1732.578')] [2023-03-07 08:01:18,688][155452] Updated weights for policy 0, policy_version 19630 (0.0007) [2023-03-07 08:01:19,458][155452] Updated weights for policy 0, policy_version 19640 (0.0006) [2023-03-07 08:01:20,250][155452] Updated weights for policy 0, policy_version 19650 (0.0008) [2023-03-07 08:01:21,056][155452] Updated weights for policy 0, policy_version 19660 (0.0006) [2023-03-07 08:01:21,832][155452] Updated weights for policy 0, policy_version 19670 (0.0006) [2023-03-07 08:01:22,608][155452] Updated weights for policy 0, policy_version 19680 (0.0007) [2023-03-07 08:01:23,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13055.1). Total num frames: 20161536. Throughput: 0: 13037.2. Samples: 20141765. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:01:23,367][155126] Avg episode reward: [(0, '1824.035')] [2023-03-07 08:01:23,387][155452] Updated weights for policy 0, policy_version 19690 (0.0006) [2023-03-07 08:01:24,161][155452] Updated weights for policy 0, policy_version 19700 (0.0006) [2023-03-07 08:01:24,932][155452] Updated weights for policy 0, policy_version 19710 (0.0006) [2023-03-07 08:01:25,721][155452] Updated weights for policy 0, policy_version 19720 (0.0007) [2023-03-07 08:01:26,493][155452] Updated weights for policy 0, policy_version 19730 (0.0006) [2023-03-07 08:01:27,289][155452] Updated weights for policy 0, policy_version 19740 (0.0008) [2023-03-07 08:01:28,076][155452] Updated weights for policy 0, policy_version 19750 (0.0006) [2023-03-07 08:01:28,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13055.1). Total num frames: 20227072. Throughput: 0: 13046.2. Samples: 20220564. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:01:28,368][155126] Avg episode reward: [(0, '1814.235')] [2023-03-07 08:01:28,373][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000019753_20227072.pth... 
[2023-03-07 08:01:28,403][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000016693_17093632.pth [2023-03-07 08:01:28,860][155452] Updated weights for policy 0, policy_version 19760 (0.0007) [2023-03-07 08:01:29,643][155452] Updated weights for policy 0, policy_version 19770 (0.0006) [2023-03-07 08:01:30,440][155452] Updated weights for policy 0, policy_version 19780 (0.0007) [2023-03-07 08:01:31,238][155452] Updated weights for policy 0, policy_version 19790 (0.0007) [2023-03-07 08:01:32,029][155452] Updated weights for policy 0, policy_version 19800 (0.0006) [2023-03-07 08:01:32,802][155452] Updated weights for policy 0, policy_version 19810 (0.0006) [2023-03-07 08:01:33,367][155126] Fps is (10 sec: 13107.4, 60 sec: 13039.0, 300 sec: 13055.1). Total num frames: 20292608. Throughput: 0: 13039.3. Samples: 20259173. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:01:33,367][155126] Avg episode reward: [(0, '1749.066')] [2023-03-07 08:01:33,589][155452] Updated weights for policy 0, policy_version 19820 (0.0006) [2023-03-07 08:01:34,377][155452] Updated weights for policy 0, policy_version 19830 (0.0006) [2023-03-07 08:01:35,155][155452] Updated weights for policy 0, policy_version 19840 (0.0006) [2023-03-07 08:01:35,935][155452] Updated weights for policy 0, policy_version 19850 (0.0006) [2023-03-07 08:01:36,712][155452] Updated weights for policy 0, policy_version 19860 (0.0006) [2023-03-07 08:01:37,497][155452] Updated weights for policy 0, policy_version 19870 (0.0006) [2023-03-07 08:01:38,299][155452] Updated weights for policy 0, policy_version 19880 (0.0006) [2023-03-07 08:01:38,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 20357120. Throughput: 0: 13057.2. Samples: 20337659. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:01:38,367][155126] Avg episode reward: [(0, '1636.625')] [2023-03-07 08:01:39,077][155452] Updated weights for policy 0, policy_version 19890 (0.0006) [2023-03-07 08:01:39,862][155452] Updated weights for policy 0, policy_version 19900 (0.0006) [2023-03-07 08:01:40,657][155452] Updated weights for policy 0, policy_version 19910 (0.0006) [2023-03-07 08:01:41,457][155452] Updated weights for policy 0, policy_version 19920 (0.0006) [2023-03-07 08:01:42,221][155452] Updated weights for policy 0, policy_version 19930 (0.0006) [2023-03-07 08:01:43,042][155452] Updated weights for policy 0, policy_version 19940 (0.0007) [2023-03-07 08:01:43,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 20422656. Throughput: 0: 13046.2. Samples: 20415657. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:01:43,368][155126] Avg episode reward: [(0, '1522.029')] [2023-03-07 08:01:43,825][155452] Updated weights for policy 0, policy_version 19950 (0.0006) [2023-03-07 08:01:44,598][155452] Updated weights for policy 0, policy_version 19960 (0.0005) [2023-03-07 08:01:45,401][155452] Updated weights for policy 0, policy_version 19970 (0.0007) [2023-03-07 08:01:46,173][155452] Updated weights for policy 0, policy_version 19980 (0.0006) [2023-03-07 08:01:46,950][155452] Updated weights for policy 0, policy_version 19990 (0.0007) [2023-03-07 08:01:47,762][155452] Updated weights for policy 0, policy_version 20000 (0.0006) [2023-03-07 08:01:48,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 20487168. Throughput: 0: 13042.4. Samples: 20454658. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:01:48,367][155126] Avg episode reward: [(0, '1819.245')] [2023-03-07 08:01:48,549][155452] Updated weights for policy 0, policy_version 20010 (0.0006) [2023-03-07 08:01:49,346][155452] Updated weights for policy 0, policy_version 20020 (0.0005) [2023-03-07 08:01:50,122][155452] Updated weights for policy 0, policy_version 20030 (0.0007) [2023-03-07 08:01:50,893][155452] Updated weights for policy 0, policy_version 20040 (0.0006) [2023-03-07 08:01:51,686][155452] Updated weights for policy 0, policy_version 20050 (0.0006) [2023-03-07 08:01:52,465][155452] Updated weights for policy 0, policy_version 20060 (0.0006) [2023-03-07 08:01:53,248][155452] Updated weights for policy 0, policy_version 20070 (0.0005) [2023-03-07 08:01:53,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 20552704. Throughput: 0: 13029.7. Samples: 20532554. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:01:53,367][155126] Avg episode reward: [(0, '1958.745')] [2023-03-07 08:01:54,040][155452] Updated weights for policy 0, policy_version 20080 (0.0006) [2023-03-07 08:01:54,833][155452] Updated weights for policy 0, policy_version 20090 (0.0006) [2023-03-07 08:01:55,600][155452] Updated weights for policy 0, policy_version 20100 (0.0007) [2023-03-07 08:01:56,391][155452] Updated weights for policy 0, policy_version 20110 (0.0006) [2023-03-07 08:01:57,169][155452] Updated weights for policy 0, policy_version 20120 (0.0006) [2023-03-07 08:01:57,970][155452] Updated weights for policy 0, policy_version 20130 (0.0006) [2023-03-07 08:01:58,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13039.0, 300 sec: 13051.7). Total num frames: 20618240. Throughput: 0: 13028.2. Samples: 20610859. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:01:58,368][155126] Avg episode reward: [(0, '1777.536')] [2023-03-07 08:01:58,738][155452] Updated weights for policy 0, policy_version 20140 (0.0006) [2023-03-07 08:01:59,533][155452] Updated weights for policy 0, policy_version 20150 (0.0006) [2023-03-07 08:02:00,322][155452] Updated weights for policy 0, policy_version 20160 (0.0006) [2023-03-07 08:02:01,099][155452] Updated weights for policy 0, policy_version 20170 (0.0007) [2023-03-07 08:02:01,879][155452] Updated weights for policy 0, policy_version 20180 (0.0006) [2023-03-07 08:02:02,685][155452] Updated weights for policy 0, policy_version 20190 (0.0008) [2023-03-07 08:02:03,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 20682752. Throughput: 0: 13033.6. Samples: 20650102. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:02:03,367][155126] Avg episode reward: [(0, '1807.806')] [2023-03-07 08:02:03,453][155452] Updated weights for policy 0, policy_version 20200 (0.0006) [2023-03-07 08:02:04,230][155452] Updated weights for policy 0, policy_version 20210 (0.0006) [2023-03-07 08:02:05,006][155452] Updated weights for policy 0, policy_version 20220 (0.0007) [2023-03-07 08:02:05,784][155452] Updated weights for policy 0, policy_version 20230 (0.0006) [2023-03-07 08:02:06,572][155452] Updated weights for policy 0, policy_version 20240 (0.0006) [2023-03-07 08:02:07,357][155452] Updated weights for policy 0, policy_version 20250 (0.0006) [2023-03-07 08:02:08,128][155452] Updated weights for policy 0, policy_version 20260 (0.0007) [2023-03-07 08:02:08,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13055.1). Total num frames: 20749312. Throughput: 0: 13043.3. Samples: 20728711. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:02:08,367][155126] Avg episode reward: [(0, '1795.950')] [2023-03-07 08:02:08,896][155452] Updated weights for policy 0, policy_version 20270 (0.0005) [2023-03-07 08:02:09,687][155452] Updated weights for policy 0, policy_version 20280 (0.0006) [2023-03-07 08:02:10,488][155452] Updated weights for policy 0, policy_version 20290 (0.0006) [2023-03-07 08:02:11,270][155452] Updated weights for policy 0, policy_version 20300 (0.0007) [2023-03-07 08:02:12,053][155452] Updated weights for policy 0, policy_version 20310 (0.0006) [2023-03-07 08:02:12,838][155452] Updated weights for policy 0, policy_version 20320 (0.0006) [2023-03-07 08:02:13,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 20813824. Throughput: 0: 13032.1. Samples: 20807007. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:02:13,367][155126] Avg episode reward: [(0, '1881.417')] [2023-03-07 08:02:13,641][155452] Updated weights for policy 0, policy_version 20330 (0.0006) [2023-03-07 08:02:14,410][155452] Updated weights for policy 0, policy_version 20340 (0.0005) [2023-03-07 08:02:15,192][155452] Updated weights for policy 0, policy_version 20350 (0.0005) [2023-03-07 08:02:15,974][155452] Updated weights for policy 0, policy_version 20360 (0.0007) [2023-03-07 08:02:16,758][155452] Updated weights for policy 0, policy_version 20370 (0.0006) [2023-03-07 08:02:17,537][155452] Updated weights for policy 0, policy_version 20380 (0.0005) [2023-03-07 08:02:18,301][155452] Updated weights for policy 0, policy_version 20390 (0.0007) [2023-03-07 08:02:18,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 20879360. Throughput: 0: 13046.3. Samples: 20846258. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:02:18,368][155126] Avg episode reward: [(0, '1475.225')] [2023-03-07 08:02:19,093][155452] Updated weights for policy 0, policy_version 20400 (0.0007) [2023-03-07 08:02:19,884][155452] Updated weights for policy 0, policy_version 20410 (0.0006) [2023-03-07 08:02:20,665][155452] Updated weights for policy 0, policy_version 20420 (0.0006) [2023-03-07 08:02:21,442][155452] Updated weights for policy 0, policy_version 20430 (0.0006) [2023-03-07 08:02:22,228][155452] Updated weights for policy 0, policy_version 20440 (0.0006) [2023-03-07 08:02:23,007][155452] Updated weights for policy 0, policy_version 20450 (0.0006) [2023-03-07 08:02:23,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13055.1). Total num frames: 20944896. Throughput: 0: 13048.2. Samples: 20924830. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:02:23,368][155126] Avg episode reward: [(0, '1580.988')] [2023-03-07 08:02:23,793][155452] Updated weights for policy 0, policy_version 20460 (0.0006) [2023-03-07 08:02:24,579][155452] Updated weights for policy 0, policy_version 20470 (0.0006) [2023-03-07 08:02:25,366][155452] Updated weights for policy 0, policy_version 20480 (0.0007) [2023-03-07 08:02:26,161][155452] Updated weights for policy 0, policy_version 20490 (0.0006) [2023-03-07 08:02:26,951][155452] Updated weights for policy 0, policy_version 20500 (0.0007) [2023-03-07 08:02:27,744][155452] Updated weights for policy 0, policy_version 20510 (0.0008) [2023-03-07 08:02:28,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 21009408. Throughput: 0: 13049.3. Samples: 21002875. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:02:28,368][155126] Avg episode reward: [(0, '1528.778')] [2023-03-07 08:02:28,529][155452] Updated weights for policy 0, policy_version 20520 (0.0006) [2023-03-07 08:02:29,313][155452] Updated weights for policy 0, policy_version 20530 (0.0007) [2023-03-07 08:02:30,108][155452] Updated weights for policy 0, policy_version 20540 (0.0007) [2023-03-07 08:02:30,885][155452] Updated weights for policy 0, policy_version 20550 (0.0006) [2023-03-07 08:02:31,690][155452] Updated weights for policy 0, policy_version 20560 (0.0006) [2023-03-07 08:02:32,445][155452] Updated weights for policy 0, policy_version 20570 (0.0007) [2023-03-07 08:02:33,233][155452] Updated weights for policy 0, policy_version 20580 (0.0006) [2023-03-07 08:02:33,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 21074944. Throughput: 0: 13050.5. Samples: 21041932. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:02:33,367][155126] Avg episode reward: [(0, '1625.150')] [2023-03-07 08:02:34,030][155452] Updated weights for policy 0, policy_version 20590 (0.0006) [2023-03-07 08:02:34,818][155452] Updated weights for policy 0, policy_version 20600 (0.0005) [2023-03-07 08:02:35,597][155452] Updated weights for policy 0, policy_version 20610 (0.0006) [2023-03-07 08:02:36,397][155452] Updated weights for policy 0, policy_version 20620 (0.0007) [2023-03-07 08:02:37,186][155452] Updated weights for policy 0, policy_version 20630 (0.0005) [2023-03-07 08:02:37,965][155452] Updated weights for policy 0, policy_version 20640 (0.0006) [2023-03-07 08:02:38,367][155126] Fps is (10 sec: 13107.4, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 21140480. Throughput: 0: 13056.8. Samples: 21120109. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:02:38,367][155126] Avg episode reward: [(0, '1835.870')] [2023-03-07 08:02:38,746][155452] Updated weights for policy 0, policy_version 20650 (0.0006) [2023-03-07 08:02:39,533][155452] Updated weights for policy 0, policy_version 20660 (0.0006) [2023-03-07 08:02:40,313][155452] Updated weights for policy 0, policy_version 20670 (0.0006) [2023-03-07 08:02:41,109][155452] Updated weights for policy 0, policy_version 20680 (0.0006) [2023-03-07 08:02:41,889][155452] Updated weights for policy 0, policy_version 20690 (0.0006) [2023-03-07 08:02:42,685][155452] Updated weights for policy 0, policy_version 20700 (0.0006) [2023-03-07 08:02:43,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 21204992. Throughput: 0: 13054.9. Samples: 21198330. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:02:43,368][155126] Avg episode reward: [(0, '1499.853')] [2023-03-07 08:02:43,474][155452] Updated weights for policy 0, policy_version 20710 (0.0006) [2023-03-07 08:02:44,276][155452] Updated weights for policy 0, policy_version 20720 (0.0006) [2023-03-07 08:02:45,053][155452] Updated weights for policy 0, policy_version 20730 (0.0006) [2023-03-07 08:02:45,832][155452] Updated weights for policy 0, policy_version 20740 (0.0005) [2023-03-07 08:02:46,608][155452] Updated weights for policy 0, policy_version 20750 (0.0006) [2023-03-07 08:02:47,392][155452] Updated weights for policy 0, policy_version 20760 (0.0006) [2023-03-07 08:02:48,177][155452] Updated weights for policy 0, policy_version 20770 (0.0006) [2023-03-07 08:02:48,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 21270528. Throughput: 0: 13051.6. Samples: 21237424. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 08:02:48,367][155126] Avg episode reward: [(0, '1547.646')] [2023-03-07 08:02:48,946][155452] Updated weights for policy 0, policy_version 20780 (0.0006) [2023-03-07 08:02:49,735][155452] Updated weights for policy 0, policy_version 20790 (0.0007) [2023-03-07 08:02:50,505][155452] Updated weights for policy 0, policy_version 20800 (0.0008) [2023-03-07 08:02:51,288][155452] Updated weights for policy 0, policy_version 20810 (0.0005) [2023-03-07 08:02:52,083][155452] Updated weights for policy 0, policy_version 20820 (0.0005) [2023-03-07 08:02:52,862][155452] Updated weights for policy 0, policy_version 20830 (0.0006) [2023-03-07 08:02:53,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 21336064. Throughput: 0: 13049.0. Samples: 21315917. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 08:02:53,367][155126] Avg episode reward: [(0, '1513.169')] [2023-03-07 08:02:53,634][155452] Updated weights for policy 0, policy_version 20840 (0.0006) [2023-03-07 08:02:54,432][155452] Updated weights for policy 0, policy_version 20850 (0.0006) [2023-03-07 08:02:55,214][155452] Updated weights for policy 0, policy_version 20860 (0.0006) [2023-03-07 08:02:56,014][155452] Updated weights for policy 0, policy_version 20870 (0.0006) [2023-03-07 08:02:56,816][155452] Updated weights for policy 0, policy_version 20880 (0.0007) [2023-03-07 08:02:57,589][155452] Updated weights for policy 0, policy_version 20890 (0.0007) [2023-03-07 08:02:58,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13048.2). Total num frames: 21400576. Throughput: 0: 13044.6. Samples: 21394013. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 08:02:58,367][155126] Avg episode reward: [(0, '1412.914')] [2023-03-07 08:02:58,377][155452] Updated weights for policy 0, policy_version 20900 (0.0007) [2023-03-07 08:02:59,166][155452] Updated weights for policy 0, policy_version 20910 (0.0006) [2023-03-07 08:02:59,932][155452] Updated weights for policy 0, policy_version 20920 (0.0006) [2023-03-07 08:03:00,728][155452] Updated weights for policy 0, policy_version 20930 (0.0007) [2023-03-07 08:03:01,501][155452] Updated weights for policy 0, policy_version 20940 (0.0006) [2023-03-07 08:03:02,303][155452] Updated weights for policy 0, policy_version 20950 (0.0006) [2023-03-07 08:03:03,086][155452] Updated weights for policy 0, policy_version 20960 (0.0006) [2023-03-07 08:03:03,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 21466112. Throughput: 0: 13042.4. Samples: 21433162. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 08:03:03,367][155126] Avg episode reward: [(0, '1669.094')] [2023-03-07 08:03:03,872][155452] Updated weights for policy 0, policy_version 20970 (0.0006) [2023-03-07 08:03:04,674][155452] Updated weights for policy 0, policy_version 20980 (0.0005) [2023-03-07 08:03:05,438][155452] Updated weights for policy 0, policy_version 20990 (0.0006) [2023-03-07 08:03:06,238][155452] Updated weights for policy 0, policy_version 21000 (0.0006) [2023-03-07 08:03:07,044][155452] Updated weights for policy 0, policy_version 21010 (0.0006) [2023-03-07 08:03:07,807][155452] Updated weights for policy 0, policy_version 21020 (0.0007) [2023-03-07 08:03:08,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 21531648. Throughput: 0: 13031.8. Samples: 21511262. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 08:03:08,367][155126] Avg episode reward: [(0, '1662.463')] [2023-03-07 08:03:08,595][155452] Updated weights for policy 0, policy_version 21030 (0.0006) [2023-03-07 08:03:09,377][155452] Updated weights for policy 0, policy_version 21040 (0.0006) [2023-03-07 08:03:10,158][155452] Updated weights for policy 0, policy_version 21050 (0.0006) [2023-03-07 08:03:10,934][155452] Updated weights for policy 0, policy_version 21060 (0.0006) [2023-03-07 08:03:11,733][155452] Updated weights for policy 0, policy_version 21070 (0.0006) [2023-03-07 08:03:12,509][155452] Updated weights for policy 0, policy_version 21080 (0.0006) [2023-03-07 08:03:13,300][155452] Updated weights for policy 0, policy_version 21090 (0.0007) [2023-03-07 08:03:13,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 21597184. Throughput: 0: 13038.9. Samples: 21589623. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 08:03:13,367][155126] Avg episode reward: [(0, '1578.649')] [2023-03-07 08:03:14,078][155452] Updated weights for policy 0, policy_version 21100 (0.0006) [2023-03-07 08:03:14,874][155452] Updated weights for policy 0, policy_version 21110 (0.0006) [2023-03-07 08:03:15,674][155452] Updated weights for policy 0, policy_version 21120 (0.0006) [2023-03-07 08:03:16,452][155452] Updated weights for policy 0, policy_version 21130 (0.0006) [2023-03-07 08:03:17,252][155452] Updated weights for policy 0, policy_version 21140 (0.0006) [2023-03-07 08:03:18,042][155452] Updated weights for policy 0, policy_version 21150 (0.0006) [2023-03-07 08:03:18,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13048.2). Total num frames: 21661696. Throughput: 0: 13036.4. Samples: 21628568. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:03:18,368][155126] Avg episode reward: [(0, '1631.131')] [2023-03-07 08:03:18,810][155452] Updated weights for policy 0, policy_version 21160 (0.0006) [2023-03-07 08:03:19,593][155452] Updated weights for policy 0, policy_version 21170 (0.0006) [2023-03-07 08:03:20,382][155452] Updated weights for policy 0, policy_version 21180 (0.0007) [2023-03-07 08:03:21,156][155452] Updated weights for policy 0, policy_version 21190 (0.0006) [2023-03-07 08:03:21,937][155452] Updated weights for policy 0, policy_version 21200 (0.0006) [2023-03-07 08:03:22,737][155452] Updated weights for policy 0, policy_version 21210 (0.0006) [2023-03-07 08:03:23,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13039.0, 300 sec: 13048.2). Total num frames: 21727232. Throughput: 0: 13040.6. Samples: 21706937. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:03:23,367][155126] Avg episode reward: [(0, '1718.500')] [2023-03-07 08:03:23,514][155452] Updated weights for policy 0, policy_version 21220 (0.0005) [2023-03-07 08:03:24,301][155452] Updated weights for policy 0, policy_version 21230 (0.0006) [2023-03-07 08:03:25,072][155452] Updated weights for policy 0, policy_version 21240 (0.0006) [2023-03-07 08:03:25,856][155452] Updated weights for policy 0, policy_version 21250 (0.0006) [2023-03-07 08:03:26,641][155452] Updated weights for policy 0, policy_version 21260 (0.0006) [2023-03-07 08:03:27,417][155452] Updated weights for policy 0, policy_version 21270 (0.0006) [2023-03-07 08:03:28,201][155452] Updated weights for policy 0, policy_version 21280 (0.0006) [2023-03-07 08:03:28,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 21792768. Throughput: 0: 13047.6. Samples: 21785471. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 4.0) [2023-03-07 08:03:28,367][155126] Avg episode reward: [(0, '1752.520')] [2023-03-07 08:03:28,373][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000021282_21792768.pth... [2023-03-07 08:03:28,402][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000018224_18661376.pth [2023-03-07 08:03:28,973][155452] Updated weights for policy 0, policy_version 21290 (0.0006) [2023-03-07 08:03:29,755][155452] Updated weights for policy 0, policy_version 21300 (0.0008) [2023-03-07 08:03:30,546][155452] Updated weights for policy 0, policy_version 21310 (0.0006) [2023-03-07 08:03:31,305][155452] Updated weights for policy 0, policy_version 21320 (0.0007) [2023-03-07 08:03:32,096][155452] Updated weights for policy 0, policy_version 21330 (0.0006) [2023-03-07 08:03:32,862][155452] Updated weights for policy 0, policy_version 21340 (0.0006) [2023-03-07 08:03:33,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 21858304. Throughput: 0: 13050.5. Samples: 21824699. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 4.0) [2023-03-07 08:03:33,367][155126] Avg episode reward: [(0, '1624.754')] [2023-03-07 08:03:33,656][155452] Updated weights for policy 0, policy_version 21350 (0.0007) [2023-03-07 08:03:34,431][155452] Updated weights for policy 0, policy_version 21360 (0.0006) [2023-03-07 08:03:35,207][155452] Updated weights for policy 0, policy_version 21370 (0.0006) [2023-03-07 08:03:35,982][155452] Updated weights for policy 0, policy_version 21380 (0.0007) [2023-03-07 08:03:36,758][155452] Updated weights for policy 0, policy_version 21390 (0.0006) [2023-03-07 08:03:37,554][155452] Updated weights for policy 0, policy_version 21400 (0.0006) [2023-03-07 08:03:38,326][155452] Updated weights for policy 0, policy_version 21410 (0.0006) [2023-03-07 08:03:38,367][155126] Fps is (10 sec: 13107.4, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 21923840. Throughput: 0: 13061.2. Samples: 21903670. Policy #0 lag: (min: 0.0, avg: 1.0, max: 4.0) [2023-03-07 08:03:38,367][155126] Avg episode reward: [(0, '1707.060')] [2023-03-07 08:03:39,125][155452] Updated weights for policy 0, policy_version 21420 (0.0007) [2023-03-07 08:03:39,914][155452] Updated weights for policy 0, policy_version 21430 (0.0005) [2023-03-07 08:03:40,692][155452] Updated weights for policy 0, policy_version 21440 (0.0007) [2023-03-07 08:03:41,491][155452] Updated weights for policy 0, policy_version 21450 (0.0006) [2023-03-07 08:03:42,267][155452] Updated weights for policy 0, policy_version 21460 (0.0007) [2023-03-07 08:03:43,061][155452] Updated weights for policy 0, policy_version 21470 (0.0007) [2023-03-07 08:03:43,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 21988352. Throughput: 0: 13063.9. Samples: 21981888. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:03:43,368][155126] Avg episode reward: [(0, '1666.398')] [2023-03-07 08:03:43,874][155452] Updated weights for policy 0, policy_version 21480 (0.0006) [2023-03-07 08:03:44,661][155452] Updated weights for policy 0, policy_version 21490 (0.0006) [2023-03-07 08:03:45,444][155452] Updated weights for policy 0, policy_version 21500 (0.0006) [2023-03-07 08:03:46,230][155452] Updated weights for policy 0, policy_version 21510 (0.0007) [2023-03-07 08:03:47,006][155452] Updated weights for policy 0, policy_version 21520 (0.0007) [2023-03-07 08:03:47,805][155452] Updated weights for policy 0, policy_version 21530 (0.0007) [2023-03-07 08:03:48,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 22053888. Throughput: 0: 13055.9. Samples: 22020679. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:03:48,368][155126] Avg episode reward: [(0, '1879.886')] [2023-03-07 08:03:48,598][155452] Updated weights for policy 0, policy_version 21540 (0.0006) [2023-03-07 08:03:49,388][155452] Updated weights for policy 0, policy_version 21550 (0.0006) [2023-03-07 08:03:50,166][155452] Updated weights for policy 0, policy_version 21560 (0.0007) [2023-03-07 08:03:50,955][155452] Updated weights for policy 0, policy_version 21570 (0.0007) [2023-03-07 08:03:51,735][155452] Updated weights for policy 0, policy_version 21580 (0.0006) [2023-03-07 08:03:52,502][155452] Updated weights for policy 0, policy_version 21590 (0.0006) [2023-03-07 08:03:53,299][155452] Updated weights for policy 0, policy_version 21600 (0.0006) [2023-03-07 08:03:53,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 22119424. Throughput: 0: 13055.8. Samples: 22098774. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:03:53,367][155126] Avg episode reward: [(0, '1741.544')] [2023-03-07 08:03:54,068][155452] Updated weights for policy 0, policy_version 21610 (0.0007) [2023-03-07 08:03:54,846][155452] Updated weights for policy 0, policy_version 21620 (0.0006) [2023-03-07 08:03:55,628][155452] Updated weights for policy 0, policy_version 21630 (0.0006) [2023-03-07 08:03:56,401][155452] Updated weights for policy 0, policy_version 21640 (0.0006) [2023-03-07 08:03:57,201][155452] Updated weights for policy 0, policy_version 21650 (0.0006) [2023-03-07 08:03:57,992][155452] Updated weights for policy 0, policy_version 21660 (0.0006) [2023-03-07 08:03:58,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 22183936. Throughput: 0: 13058.4. Samples: 22177251. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:03:58,367][155126] Avg episode reward: [(0, '1769.780')] [2023-03-07 08:03:58,771][155452] Updated weights for policy 0, policy_version 21670 (0.0006) [2023-03-07 08:03:59,581][155452] Updated weights for policy 0, policy_version 21680 (0.0007) [2023-03-07 08:04:00,353][155452] Updated weights for policy 0, policy_version 21690 (0.0006) [2023-03-07 08:04:01,158][155452] Updated weights for policy 0, policy_version 21700 (0.0006) [2023-03-07 08:04:01,932][155452] Updated weights for policy 0, policy_version 21710 (0.0007) [2023-03-07 08:04:02,717][155452] Updated weights for policy 0, policy_version 21720 (0.0006) [2023-03-07 08:04:03,367][155126] Fps is (10 sec: 12902.4, 60 sec: 13038.9, 300 sec: 13048.2). Total num frames: 22248448. Throughput: 0: 13058.3. Samples: 22216191. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:04:03,367][155126] Avg episode reward: [(0, '1743.892')] [2023-03-07 08:04:03,521][155452] Updated weights for policy 0, policy_version 21730 (0.0006) [2023-03-07 08:04:04,300][155452] Updated weights for policy 0, policy_version 21740 (0.0006) [2023-03-07 08:04:05,068][155452] Updated weights for policy 0, policy_version 21750 (0.0006) [2023-03-07 08:04:05,871][155452] Updated weights for policy 0, policy_version 21760 (0.0006) [2023-03-07 08:04:06,665][155452] Updated weights for policy 0, policy_version 21770 (0.0006) [2023-03-07 08:04:07,441][155452] Updated weights for policy 0, policy_version 21780 (0.0007) [2023-03-07 08:04:08,226][155452] Updated weights for policy 0, policy_version 21790 (0.0006) [2023-03-07 08:04:08,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13048.2). Total num frames: 22313984. Throughput: 0: 13052.7. Samples: 22294311. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:04:08,367][155126] Avg episode reward: [(0, '1701.926')] [2023-03-07 08:04:09,005][155452] Updated weights for policy 0, policy_version 21800 (0.0006) [2023-03-07 08:04:09,790][155452] Updated weights for policy 0, policy_version 21810 (0.0006) [2023-03-07 08:04:10,575][155452] Updated weights for policy 0, policy_version 21820 (0.0006) [2023-03-07 08:04:11,372][155452] Updated weights for policy 0, policy_version 21830 (0.0007) [2023-03-07 08:04:12,152][155452] Updated weights for policy 0, policy_version 21840 (0.0006) [2023-03-07 08:04:12,949][155452] Updated weights for policy 0, policy_version 21850 (0.0006) [2023-03-07 08:04:13,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13048.2). Total num frames: 22379520. Throughput: 0: 13042.1. Samples: 22372364. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:04:13,367][155126] Avg episode reward: [(0, '1671.943')] [2023-03-07 08:04:13,740][155452] Updated weights for policy 0, policy_version 21860 (0.0006) [2023-03-07 08:04:14,535][155452] Updated weights for policy 0, policy_version 21870 (0.0006) [2023-03-07 08:04:15,312][155452] Updated weights for policy 0, policy_version 21880 (0.0006) [2023-03-07 08:04:16,105][155452] Updated weights for policy 0, policy_version 21890 (0.0007) [2023-03-07 08:04:16,889][155452] Updated weights for policy 0, policy_version 21900 (0.0006) [2023-03-07 08:04:17,688][155452] Updated weights for policy 0, policy_version 21910 (0.0006) [2023-03-07 08:04:18,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13044.7). Total num frames: 22444032. Throughput: 0: 13037.6. Samples: 22411392. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:04:18,367][155126] Avg episode reward: [(0, '1546.501')] [2023-03-07 08:04:18,465][155452] Updated weights for policy 0, policy_version 21920 (0.0006) [2023-03-07 08:04:19,228][155452] Updated weights for policy 0, policy_version 21930 (0.0006) [2023-03-07 08:04:20,003][155452] Updated weights for policy 0, policy_version 21940 (0.0007) [2023-03-07 08:04:20,801][155452] Updated weights for policy 0, policy_version 21950 (0.0007) [2023-03-07 08:04:21,576][155452] Updated weights for policy 0, policy_version 21960 (0.0006) [2023-03-07 08:04:22,364][155452] Updated weights for policy 0, policy_version 21970 (0.0006) [2023-03-07 08:04:23,142][155452] Updated weights for policy 0, policy_version 21980 (0.0006) [2023-03-07 08:04:23,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13044.7). Total num frames: 22509568. Throughput: 0: 13024.2. Samples: 22489759. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:04:23,367][155126] Avg episode reward: [(0, '1416.785')] [2023-03-07 08:04:23,925][155452] Updated weights for policy 0, policy_version 21990 (0.0006) [2023-03-07 08:04:24,699][155452] Updated weights for policy 0, policy_version 22000 (0.0006) [2023-03-07 08:04:25,512][155452] Updated weights for policy 0, policy_version 22010 (0.0006) [2023-03-07 08:04:26,293][155452] Updated weights for policy 0, policy_version 22020 (0.0006) [2023-03-07 08:04:27,058][155452] Updated weights for policy 0, policy_version 22030 (0.0006) [2023-03-07 08:04:27,873][155452] Updated weights for policy 0, policy_version 22040 (0.0006) [2023-03-07 08:04:28,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13048.2). Total num frames: 22575104. Throughput: 0: 13029.0. Samples: 22568191. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:04:28,367][155126] Avg episode reward: [(0, '1608.332')] [2023-03-07 08:04:28,660][155452] Updated weights for policy 0, policy_version 22050 (0.0006) [2023-03-07 08:04:29,429][155452] Updated weights for policy 0, policy_version 22060 (0.0006) [2023-03-07 08:04:30,211][155452] Updated weights for policy 0, policy_version 22070 (0.0006) [2023-03-07 08:04:30,992][155452] Updated weights for policy 0, policy_version 22080 (0.0006) [2023-03-07 08:04:31,777][155452] Updated weights for policy 0, policy_version 22090 (0.0005) [2023-03-07 08:04:32,555][155452] Updated weights for policy 0, policy_version 22100 (0.0007) [2023-03-07 08:04:33,338][155452] Updated weights for policy 0, policy_version 22110 (0.0006) [2023-03-07 08:04:33,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13039.0, 300 sec: 13048.2). Total num frames: 22640640. Throughput: 0: 13037.5. Samples: 22607366. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:04:33,367][155126] Avg episode reward: [(0, '1673.245')] [2023-03-07 08:04:34,132][155452] Updated weights for policy 0, policy_version 22120 (0.0006) [2023-03-07 08:04:34,925][155452] Updated weights for policy 0, policy_version 22130 (0.0006) [2023-03-07 08:04:35,717][155452] Updated weights for policy 0, policy_version 22140 (0.0006) [2023-03-07 08:04:36,509][155452] Updated weights for policy 0, policy_version 22150 (0.0007) [2023-03-07 08:04:37,277][155452] Updated weights for policy 0, policy_version 22160 (0.0006) [2023-03-07 08:04:38,088][155452] Updated weights for policy 0, policy_version 22170 (0.0006) [2023-03-07 08:04:38,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13044.7). Total num frames: 22705152. Throughput: 0: 13031.9. Samples: 22685212. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:04:38,367][155126] Avg episode reward: [(0, '1664.454')] [2023-03-07 08:04:38,867][155452] Updated weights for policy 0, policy_version 22180 (0.0005) [2023-03-07 08:04:39,638][155452] Updated weights for policy 0, policy_version 22190 (0.0006) [2023-03-07 08:04:40,434][155452] Updated weights for policy 0, policy_version 22200 (0.0006) [2023-03-07 08:04:41,210][155452] Updated weights for policy 0, policy_version 22210 (0.0006) [2023-03-07 08:04:41,990][155452] Updated weights for policy 0, policy_version 22220 (0.0006) [2023-03-07 08:04:42,777][155452] Updated weights for policy 0, policy_version 22230 (0.0005) [2023-03-07 08:04:43,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13039.0, 300 sec: 13044.7). Total num frames: 22770688. Throughput: 0: 13032.8. Samples: 22763724. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:04:43,367][155126] Avg episode reward: [(0, '2009.086')] [2023-03-07 08:04:43,561][155452] Updated weights for policy 0, policy_version 22240 (0.0006) [2023-03-07 08:04:44,333][155452] Updated weights for policy 0, policy_version 22250 (0.0006) [2023-03-07 08:04:45,126][155452] Updated weights for policy 0, policy_version 22260 (0.0006) [2023-03-07 08:04:45,908][155452] Updated weights for policy 0, policy_version 22270 (0.0006) [2023-03-07 08:04:46,694][155452] Updated weights for policy 0, policy_version 22280 (0.0007) [2023-03-07 08:04:47,465][155452] Updated weights for policy 0, policy_version 22290 (0.0006) [2023-03-07 08:04:48,263][155452] Updated weights for policy 0, policy_version 22300 (0.0006) [2023-03-07 08:04:48,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13044.7). Total num frames: 22836224. Throughput: 0: 13039.8. Samples: 22802984. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:04:48,367][155126] Avg episode reward: [(0, '1729.599')] [2023-03-07 08:04:49,043][155452] Updated weights for policy 0, policy_version 22310 (0.0007) [2023-03-07 08:04:49,840][155452] Updated weights for policy 0, policy_version 22320 (0.0007) [2023-03-07 08:04:50,606][155452] Updated weights for policy 0, policy_version 22330 (0.0006) [2023-03-07 08:04:51,393][155452] Updated weights for policy 0, policy_version 22340 (0.0006) [2023-03-07 08:04:52,178][155452] Updated weights for policy 0, policy_version 22350 (0.0006) [2023-03-07 08:04:52,969][155452] Updated weights for policy 0, policy_version 22360 (0.0006) [2023-03-07 08:04:53,367][155126] Fps is (10 sec: 13107.0, 60 sec: 13038.9, 300 sec: 13044.7). Total num frames: 22901760. Throughput: 0: 13043.8. Samples: 22881281. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:04:53,368][155126] Avg episode reward: [(0, '1870.447')] [2023-03-07 08:04:53,748][155452] Updated weights for policy 0, policy_version 22370 (0.0007) [2023-03-07 08:04:54,542][155452] Updated weights for policy 0, policy_version 22380 (0.0006) [2023-03-07 08:04:55,295][155452] Updated weights for policy 0, policy_version 22390 (0.0007) [2023-03-07 08:04:56,089][155452] Updated weights for policy 0, policy_version 22400 (0.0006) [2023-03-07 08:04:56,877][155452] Updated weights for policy 0, policy_version 22410 (0.0006) [2023-03-07 08:04:57,662][155452] Updated weights for policy 0, policy_version 22420 (0.0006) [2023-03-07 08:04:58,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13044.7). Total num frames: 22967296. Throughput: 0: 13053.7. Samples: 22959781. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:04:58,378][155126] Avg episode reward: [(0, '1672.860')] [2023-03-07 08:04:58,448][155452] Updated weights for policy 0, policy_version 22430 (0.0008) [2023-03-07 08:04:59,234][155452] Updated weights for policy 0, policy_version 22440 (0.0007) [2023-03-07 08:05:00,015][155452] Updated weights for policy 0, policy_version 22450 (0.0006) [2023-03-07 08:05:00,807][155452] Updated weights for policy 0, policy_version 22460 (0.0006) [2023-03-07 08:05:01,595][155452] Updated weights for policy 0, policy_version 22470 (0.0006) [2023-03-07 08:05:02,377][155452] Updated weights for policy 0, policy_version 22480 (0.0006) [2023-03-07 08:05:03,161][155452] Updated weights for policy 0, policy_version 22490 (0.0006) [2023-03-07 08:05:03,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13044.7). Total num frames: 23031808. Throughput: 0: 13057.0. Samples: 22998959. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:05:03,367][155126] Avg episode reward: [(0, '1723.779')] [2023-03-07 08:05:03,962][155452] Updated weights for policy 0, policy_version 22500 (0.0006) [2023-03-07 08:05:04,727][155452] Updated weights for policy 0, policy_version 22510 (0.0006) [2023-03-07 08:05:05,534][155452] Updated weights for policy 0, policy_version 22520 (0.0006) [2023-03-07 08:05:06,294][155452] Updated weights for policy 0, policy_version 22530 (0.0006) [2023-03-07 08:05:07,087][155452] Updated weights for policy 0, policy_version 22540 (0.0006) [2023-03-07 08:05:07,864][155452] Updated weights for policy 0, policy_version 22550 (0.0008) [2023-03-07 08:05:08,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13044.7). Total num frames: 23097344. Throughput: 0: 13052.5. Samples: 23077120. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:05:08,367][155126] Avg episode reward: [(0, '1735.199')] [2023-03-07 08:05:08,685][155452] Updated weights for policy 0, policy_version 22560 (0.0006) [2023-03-07 08:05:09,468][155452] Updated weights for policy 0, policy_version 22570 (0.0007) [2023-03-07 08:05:10,234][155452] Updated weights for policy 0, policy_version 22580 (0.0006) [2023-03-07 08:05:11,003][155452] Updated weights for policy 0, policy_version 22590 (0.0006) [2023-03-07 08:05:11,814][155452] Updated weights for policy 0, policy_version 22600 (0.0006) [2023-03-07 08:05:12,634][155452] Updated weights for policy 0, policy_version 22610 (0.0006) [2023-03-07 08:05:13,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13041.2). Total num frames: 23161856. Throughput: 0: 13034.0. Samples: 23154720. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:05:13,368][155126] Avg episode reward: [(0, '1580.679')] [2023-03-07 08:05:13,444][155452] Updated weights for policy 0, policy_version 22620 (0.0006) [2023-03-07 08:05:14,265][155452] Updated weights for policy 0, policy_version 22630 (0.0006) [2023-03-07 08:05:15,081][155452] Updated weights for policy 0, policy_version 22640 (0.0006) [2023-03-07 08:05:15,891][155452] Updated weights for policy 0, policy_version 22650 (0.0007) [2023-03-07 08:05:16,669][155452] Updated weights for policy 0, policy_version 22660 (0.0006) [2023-03-07 08:05:17,486][155452] Updated weights for policy 0, policy_version 22670 (0.0006) [2023-03-07 08:05:18,286][155452] Updated weights for policy 0, policy_version 22680 (0.0007) [2023-03-07 08:05:18,367][155126] Fps is (10 sec: 12697.6, 60 sec: 13004.8, 300 sec: 13034.3). Total num frames: 23224320. Throughput: 0: 13000.2. Samples: 23192374. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:05:18,367][155126] Avg episode reward: [(0, '1552.335')] [2023-03-07 08:05:19,057][155452] Updated weights for policy 0, policy_version 22690 (0.0006) [2023-03-07 08:05:19,847][155452] Updated weights for policy 0, policy_version 22700 (0.0006) [2023-03-07 08:05:20,626][155452] Updated weights for policy 0, policy_version 22710 (0.0006) [2023-03-07 08:05:21,407][155452] Updated weights for policy 0, policy_version 22720 (0.0007) [2023-03-07 08:05:22,197][155452] Updated weights for policy 0, policy_version 22730 (0.0007) [2023-03-07 08:05:22,988][155452] Updated weights for policy 0, policy_version 22740 (0.0006) [2023-03-07 08:05:23,367][155126] Fps is (10 sec: 12800.1, 60 sec: 13004.8, 300 sec: 13034.3). Total num frames: 23289856. Throughput: 0: 13001.8. Samples: 23270294. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:05:23,367][155126] Avg episode reward: [(0, '1824.622')] [2023-03-07 08:05:23,778][155452] Updated weights for policy 0, policy_version 22750 (0.0006) [2023-03-07 08:05:24,565][155452] Updated weights for policy 0, policy_version 22760 (0.0006) [2023-03-07 08:05:25,357][155452] Updated weights for policy 0, policy_version 22770 (0.0006) [2023-03-07 08:05:26,133][155452] Updated weights for policy 0, policy_version 22780 (0.0006) [2023-03-07 08:05:26,927][155452] Updated weights for policy 0, policy_version 22790 (0.0006) [2023-03-07 08:05:27,699][155452] Updated weights for policy 0, policy_version 22800 (0.0007) [2023-03-07 08:05:28,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13004.8, 300 sec: 13034.3). Total num frames: 23355392. Throughput: 0: 12991.8. Samples: 23348354. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:05:28,367][155126] Avg episode reward: [(0, '1621.990')] [2023-03-07 08:05:28,371][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000022808_23355392.pth... 
[2023-03-07 08:05:28,401][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000019753_20227072.pth [2023-03-07 08:05:28,489][155452] Updated weights for policy 0, policy_version 22810 (0.0006) [2023-03-07 08:05:29,279][155452] Updated weights for policy 0, policy_version 22820 (0.0005) [2023-03-07 08:05:30,073][155452] Updated weights for policy 0, policy_version 22830 (0.0005) [2023-03-07 08:05:30,853][155452] Updated weights for policy 0, policy_version 22840 (0.0006) [2023-03-07 08:05:31,637][155452] Updated weights for policy 0, policy_version 22850 (0.0007) [2023-03-07 08:05:32,416][155452] Updated weights for policy 0, policy_version 22860 (0.0005) [2023-03-07 08:05:33,218][155452] Updated weights for policy 0, policy_version 22870 (0.0006) [2023-03-07 08:05:33,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13004.8, 300 sec: 13037.8). Total num frames: 23420928. Throughput: 0: 12988.3. Samples: 23387457. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:05:33,367][155126] Avg episode reward: [(0, '1836.093')] [2023-03-07 08:05:33,993][155452] Updated weights for policy 0, policy_version 22880 (0.0007) [2023-03-07 08:05:34,768][155452] Updated weights for policy 0, policy_version 22890 (0.0006) [2023-03-07 08:05:35,554][155452] Updated weights for policy 0, policy_version 22900 (0.0007) [2023-03-07 08:05:36,359][155452] Updated weights for policy 0, policy_version 22910 (0.0007) [2023-03-07 08:05:37,147][155452] Updated weights for policy 0, policy_version 22920 (0.0006) [2023-03-07 08:05:37,942][155452] Updated weights for policy 0, policy_version 22930 (0.0006) [2023-03-07 08:05:38,367][155126] Fps is (10 sec: 13004.5, 60 sec: 13004.8, 300 sec: 13034.3). Total num frames: 23485440. Throughput: 0: 12981.7. Samples: 23465458. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:05:38,368][155126] Avg episode reward: [(0, '1715.133')] [2023-03-07 08:05:38,717][155452] Updated weights for policy 0, policy_version 22940 (0.0006) [2023-03-07 08:05:39,491][155452] Updated weights for policy 0, policy_version 22950 (0.0006) [2023-03-07 08:05:40,277][155452] Updated weights for policy 0, policy_version 22960 (0.0007) [2023-03-07 08:05:41,063][155452] Updated weights for policy 0, policy_version 22970 (0.0007) [2023-03-07 08:05:41,873][155452] Updated weights for policy 0, policy_version 22980 (0.0005) [2023-03-07 08:05:42,660][155452] Updated weights for policy 0, policy_version 22990 (0.0006) [2023-03-07 08:05:43,367][155126] Fps is (10 sec: 12902.6, 60 sec: 12987.7, 300 sec: 13034.3). Total num frames: 23549952. Throughput: 0: 12974.0. Samples: 23543609. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:05:43,367][155126] Avg episode reward: [(0, '1973.379')] [2023-03-07 08:05:43,458][155452] Updated weights for policy 0, policy_version 23000 (0.0006) [2023-03-07 08:05:44,245][155452] Updated weights for policy 0, policy_version 23010 (0.0006) [2023-03-07 08:05:45,032][155452] Updated weights for policy 0, policy_version 23020 (0.0007) [2023-03-07 08:05:45,802][155452] Updated weights for policy 0, policy_version 23030 (0.0006) [2023-03-07 08:05:46,601][155452] Updated weights for policy 0, policy_version 23040 (0.0005) [2023-03-07 08:05:47,383][155452] Updated weights for policy 0, policy_version 23050 (0.0006) [2023-03-07 08:05:48,156][155452] Updated weights for policy 0, policy_version 23060 (0.0006) [2023-03-07 08:05:48,367][155126] Fps is (10 sec: 13005.0, 60 sec: 12987.7, 300 sec: 13034.3). Total num frames: 23615488. Throughput: 0: 12968.3. Samples: 23582532. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:05:48,367][155126] Avg episode reward: [(0, '1788.303')] [2023-03-07 08:05:48,934][155452] Updated weights for policy 0, policy_version 23070 (0.0006) [2023-03-07 08:05:49,718][155452] Updated weights for policy 0, policy_version 23080 (0.0007) [2023-03-07 08:05:50,504][155452] Updated weights for policy 0, policy_version 23090 (0.0006) [2023-03-07 08:05:51,294][155452] Updated weights for policy 0, policy_version 23100 (0.0006) [2023-03-07 08:05:52,079][155452] Updated weights for policy 0, policy_version 23110 (0.0006) [2023-03-07 08:05:52,864][155452] Updated weights for policy 0, policy_version 23120 (0.0006) [2023-03-07 08:05:53,367][155126] Fps is (10 sec: 13107.0, 60 sec: 12987.7, 300 sec: 13034.3). Total num frames: 23681024. Throughput: 0: 12973.6. Samples: 23660931. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:05:53,367][155126] Avg episode reward: [(0, '1754.690')] [2023-03-07 08:05:53,655][155452] Updated weights for policy 0, policy_version 23130 (0.0006) [2023-03-07 08:05:54,456][155452] Updated weights for policy 0, policy_version 23140 (0.0006) [2023-03-07 08:05:55,259][155452] Updated weights for policy 0, policy_version 23150 (0.0006) [2023-03-07 08:05:56,037][155452] Updated weights for policy 0, policy_version 23160 (0.0006) [2023-03-07 08:05:56,835][155452] Updated weights for policy 0, policy_version 23170 (0.0006) [2023-03-07 08:05:57,611][155452] Updated weights for policy 0, policy_version 23180 (0.0006) [2023-03-07 08:05:58,367][155126] Fps is (10 sec: 13004.7, 60 sec: 12970.7, 300 sec: 13034.3). Total num frames: 23745536. Throughput: 0: 12975.9. Samples: 23738636. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:05:58,368][155126] Avg episode reward: [(0, '1728.602')] [2023-03-07 08:05:58,400][155452] Updated weights for policy 0, policy_version 23190 (0.0006) [2023-03-07 08:05:59,187][155452] Updated weights for policy 0, policy_version 23200 (0.0007) [2023-03-07 08:05:59,966][155452] Updated weights for policy 0, policy_version 23210 (0.0007) [2023-03-07 08:06:00,772][155452] Updated weights for policy 0, policy_version 23220 (0.0006) [2023-03-07 08:06:01,547][155452] Updated weights for policy 0, policy_version 23230 (0.0006) [2023-03-07 08:06:02,336][155452] Updated weights for policy 0, policy_version 23240 (0.0006) [2023-03-07 08:06:03,102][155452] Updated weights for policy 0, policy_version 23250 (0.0006) [2023-03-07 08:06:03,367][155126] Fps is (10 sec: 13005.0, 60 sec: 12987.8, 300 sec: 13034.3). Total num frames: 23811072. Throughput: 0: 13004.6. Samples: 23777582. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:06:03,367][155126] Avg episode reward: [(0, '1822.098')] [2023-03-07 08:06:03,906][155452] Updated weights for policy 0, policy_version 23260 (0.0006) [2023-03-07 08:06:04,682][155452] Updated weights for policy 0, policy_version 23270 (0.0006) [2023-03-07 08:06:05,467][155452] Updated weights for policy 0, policy_version 23280 (0.0006) [2023-03-07 08:06:06,245][155452] Updated weights for policy 0, policy_version 23290 (0.0006) [2023-03-07 08:06:07,039][155452] Updated weights for policy 0, policy_version 23300 (0.0006) [2023-03-07 08:06:07,809][155452] Updated weights for policy 0, policy_version 23310 (0.0007) [2023-03-07 08:06:08,367][155126] Fps is (10 sec: 13005.0, 60 sec: 12970.7, 300 sec: 13030.8). Total num frames: 23875584. Throughput: 0: 13017.0. Samples: 23856055. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:06:08,367][155126] Avg episode reward: [(0, '1552.516')] [2023-03-07 08:06:08,606][155452] Updated weights for policy 0, policy_version 23320 (0.0006) [2023-03-07 08:06:09,381][155452] Updated weights for policy 0, policy_version 23330 (0.0006) [2023-03-07 08:06:10,172][155452] Updated weights for policy 0, policy_version 23340 (0.0007) [2023-03-07 08:06:10,976][155452] Updated weights for policy 0, policy_version 23350 (0.0005) [2023-03-07 08:06:11,746][155452] Updated weights for policy 0, policy_version 23360 (0.0006) [2023-03-07 08:06:12,561][155452] Updated weights for policy 0, policy_version 23370 (0.0007) [2023-03-07 08:06:13,332][155452] Updated weights for policy 0, policy_version 23380 (0.0006) [2023-03-07 08:06:13,367][155126] Fps is (10 sec: 13004.6, 60 sec: 12987.8, 300 sec: 13034.3). Total num frames: 23941120. Throughput: 0: 13016.8. Samples: 23934111. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:06:13,367][155126] Avg episode reward: [(0, '1696.400')] [2023-03-07 08:06:14,118][155452] Updated weights for policy 0, policy_version 23390 (0.0007) [2023-03-07 08:06:14,896][155452] Updated weights for policy 0, policy_version 23400 (0.0006) [2023-03-07 08:06:15,684][155452] Updated weights for policy 0, policy_version 23410 (0.0006) [2023-03-07 08:06:16,480][155452] Updated weights for policy 0, policy_version 23420 (0.0006) [2023-03-07 08:06:17,254][155452] Updated weights for policy 0, policy_version 23430 (0.0006) [2023-03-07 08:06:18,038][155452] Updated weights for policy 0, policy_version 23440 (0.0007) [2023-03-07 08:06:18,367][155126] Fps is (10 sec: 13107.0, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 24006656. Throughput: 0: 13019.7. Samples: 23973346. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:06:18,367][155126] Avg episode reward: [(0, '1794.052')] [2023-03-07 08:06:18,819][155452] Updated weights for policy 0, policy_version 23450 (0.0006) [2023-03-07 08:06:19,621][155452] Updated weights for policy 0, policy_version 23460 (0.0007) [2023-03-07 08:06:20,388][155452] Updated weights for policy 0, policy_version 23470 (0.0006) [2023-03-07 08:06:21,197][155452] Updated weights for policy 0, policy_version 23480 (0.0007) [2023-03-07 08:06:21,970][155452] Updated weights for policy 0, policy_version 23490 (0.0007) [2023-03-07 08:06:22,732][155452] Updated weights for policy 0, policy_version 23500 (0.0006) [2023-03-07 08:06:23,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13030.8). Total num frames: 24071168. Throughput: 0: 13023.9. Samples: 24051533. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:06:23,368][155126] Avg episode reward: [(0, '1653.756')] [2023-03-07 08:06:23,534][155452] Updated weights for policy 0, policy_version 23510 (0.0006) [2023-03-07 08:06:24,293][155452] Updated weights for policy 0, policy_version 23520 (0.0007) [2023-03-07 08:06:25,065][155452] Updated weights for policy 0, policy_version 23530 (0.0006) [2023-03-07 08:06:25,852][155452] Updated weights for policy 0, policy_version 23540 (0.0006) [2023-03-07 08:06:26,635][155452] Updated weights for policy 0, policy_version 23550 (0.0006) [2023-03-07 08:06:27,408][155452] Updated weights for policy 0, policy_version 23560 (0.0006) [2023-03-07 08:06:28,187][155452] Updated weights for policy 0, policy_version 23570 (0.0006) [2023-03-07 08:06:28,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 24137728. Throughput: 0: 13039.5. Samples: 24130390. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:06:28,367][155126] Avg episode reward: [(0, '1575.405')] [2023-03-07 08:06:28,983][155452] Updated weights for policy 0, policy_version 23580 (0.0006) [2023-03-07 08:06:29,773][155452] Updated weights for policy 0, policy_version 23590 (0.0006) [2023-03-07 08:06:30,566][155452] Updated weights for policy 0, policy_version 23600 (0.0007) [2023-03-07 08:06:31,349][155452] Updated weights for policy 0, policy_version 23610 (0.0007) [2023-03-07 08:06:32,121][155452] Updated weights for policy 0, policy_version 23620 (0.0005) [2023-03-07 08:06:32,909][155452] Updated weights for policy 0, policy_version 23630 (0.0006) [2023-03-07 08:06:33,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 24202240. Throughput: 0: 13044.0. Samples: 24169511. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:06:33,368][155126] Avg episode reward: [(0, '1669.556')] [2023-03-07 08:06:33,713][155452] Updated weights for policy 0, policy_version 23640 (0.0007) [2023-03-07 08:06:34,497][155452] Updated weights for policy 0, policy_version 23650 (0.0006) [2023-03-07 08:06:35,300][155452] Updated weights for policy 0, policy_version 23660 (0.0006) [2023-03-07 08:06:36,064][155452] Updated weights for policy 0, policy_version 23670 (0.0006) [2023-03-07 08:06:36,868][155452] Updated weights for policy 0, policy_version 23680 (0.0007) [2023-03-07 08:06:37,635][155452] Updated weights for policy 0, policy_version 23690 (0.0006) [2023-03-07 08:06:38,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13039.0, 300 sec: 13034.3). Total num frames: 24267776. Throughput: 0: 13035.6. Samples: 24247530. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:06:38,367][155126] Avg episode reward: [(0, '1434.479')] [2023-03-07 08:06:38,424][155452] Updated weights for policy 0, policy_version 23700 (0.0007) [2023-03-07 08:06:39,205][155452] Updated weights for policy 0, policy_version 23710 (0.0006) [2023-03-07 08:06:39,986][155452] Updated weights for policy 0, policy_version 23720 (0.0007) [2023-03-07 08:06:40,774][155452] Updated weights for policy 0, policy_version 23730 (0.0005) [2023-03-07 08:06:41,569][155452] Updated weights for policy 0, policy_version 23740 (0.0006) [2023-03-07 08:06:42,362][155452] Updated weights for policy 0, policy_version 23750 (0.0006) [2023-03-07 08:06:43,133][155452] Updated weights for policy 0, policy_version 23760 (0.0006) [2023-03-07 08:06:43,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 24332288. Throughput: 0: 13047.6. Samples: 24325779. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:06:43,367][155126] Avg episode reward: [(0, '1648.792')] [2023-03-07 08:06:43,915][155452] Updated weights for policy 0, policy_version 23770 (0.0006) [2023-03-07 08:06:44,709][155452] Updated weights for policy 0, policy_version 23780 (0.0006) [2023-03-07 08:06:45,480][155452] Updated weights for policy 0, policy_version 23790 (0.0006) [2023-03-07 08:06:46,278][155452] Updated weights for policy 0, policy_version 23800 (0.0006) [2023-03-07 08:06:47,082][155452] Updated weights for policy 0, policy_version 23810 (0.0006) [2023-03-07 08:06:47,869][155452] Updated weights for policy 0, policy_version 23820 (0.0006) [2023-03-07 08:06:48,367][155126] Fps is (10 sec: 13004.5, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 24397824. Throughput: 0: 13051.6. Samples: 24364908. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:06:48,368][155126] Avg episode reward: [(0, '1537.744')] [2023-03-07 08:06:48,658][155452] Updated weights for policy 0, policy_version 23830 (0.0006) [2023-03-07 08:06:49,466][155452] Updated weights for policy 0, policy_version 23840 (0.0006) [2023-03-07 08:06:50,241][155452] Updated weights for policy 0, policy_version 23850 (0.0006) [2023-03-07 08:06:51,021][155452] Updated weights for policy 0, policy_version 23860 (0.0006) [2023-03-07 08:06:51,810][155452] Updated weights for policy 0, policy_version 23870 (0.0007) [2023-03-07 08:06:52,601][155452] Updated weights for policy 0, policy_version 23880 (0.0006) [2023-03-07 08:06:53,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13030.8). Total num frames: 24462336. Throughput: 0: 13035.0. Samples: 24442631. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:06:53,367][155126] Avg episode reward: [(0, '1755.014')] [2023-03-07 08:06:53,375][155452] Updated weights for policy 0, policy_version 23890 (0.0006) [2023-03-07 08:06:54,168][155452] Updated weights for policy 0, policy_version 23900 (0.0006) [2023-03-07 08:06:54,966][155452] Updated weights for policy 0, policy_version 23910 (0.0006) [2023-03-07 08:06:55,750][155452] Updated weights for policy 0, policy_version 23920 (0.0006) [2023-03-07 08:06:56,531][155452] Updated weights for policy 0, policy_version 23930 (0.0007) [2023-03-07 08:06:57,308][155452] Updated weights for policy 0, policy_version 23940 (0.0006) [2023-03-07 08:06:58,102][155452] Updated weights for policy 0, policy_version 23950 (0.0005) [2023-03-07 08:06:58,367][155126] Fps is (10 sec: 13005.1, 60 sec: 13039.0, 300 sec: 13034.3). Total num frames: 24527872. Throughput: 0: 13038.6. Samples: 24520847. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:06:58,367][155126] Avg episode reward: [(0, '1852.938')] [2023-03-07 08:06:58,903][155452] Updated weights for policy 0, policy_version 23960 (0.0006) [2023-03-07 08:06:59,671][155452] Updated weights for policy 0, policy_version 23970 (0.0006) [2023-03-07 08:07:00,449][155452] Updated weights for policy 0, policy_version 23980 (0.0006) [2023-03-07 08:07:01,231][155452] Updated weights for policy 0, policy_version 23990 (0.0006) [2023-03-07 08:07:02,020][155452] Updated weights for policy 0, policy_version 24000 (0.0006) [2023-03-07 08:07:02,807][155452] Updated weights for policy 0, policy_version 24010 (0.0008) [2023-03-07 08:07:03,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.8, 300 sec: 13027.4). Total num frames: 24592384. Throughput: 0: 13036.7. Samples: 24559995. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:07:03,367][155126] Avg episode reward: [(0, '1671.941')] [2023-03-07 08:07:03,581][155452] Updated weights for policy 0, policy_version 24020 (0.0006) [2023-03-07 08:07:04,402][155452] Updated weights for policy 0, policy_version 24030 (0.0005) [2023-03-07 08:07:05,180][155452] Updated weights for policy 0, policy_version 24040 (0.0006) [2023-03-07 08:07:05,953][155452] Updated weights for policy 0, policy_version 24050 (0.0006) [2023-03-07 08:07:06,755][155452] Updated weights for policy 0, policy_version 24060 (0.0006) [2023-03-07 08:07:07,527][155452] Updated weights for policy 0, policy_version 24070 (0.0006) [2023-03-07 08:07:08,304][155452] Updated weights for policy 0, policy_version 24080 (0.0006) [2023-03-07 08:07:08,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13030.8). Total num frames: 24657920. Throughput: 0: 13035.9. Samples: 24638147. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:07:08,367][155126] Avg episode reward: [(0, '1542.423')] [2023-03-07 08:07:09,086][155452] Updated weights for policy 0, policy_version 24090 (0.0006) [2023-03-07 08:07:09,864][155452] Updated weights for policy 0, policy_version 24100 (0.0007) [2023-03-07 08:07:10,649][155452] Updated weights for policy 0, policy_version 24110 (0.0007) [2023-03-07 08:07:11,424][155452] Updated weights for policy 0, policy_version 24120 (0.0006) [2023-03-07 08:07:12,197][155452] Updated weights for policy 0, policy_version 24130 (0.0006) [2023-03-07 08:07:12,985][155452] Updated weights for policy 0, policy_version 24140 (0.0006) [2023-03-07 08:07:13,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13039.0, 300 sec: 13030.8). Total num frames: 24723456. Throughput: 0: 13033.2. Samples: 24716881. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:07:13,367][155126] Avg episode reward: [(0, '1642.627')] [2023-03-07 08:07:13,776][155452] Updated weights for policy 0, policy_version 24150 (0.0005) [2023-03-07 08:07:14,563][155452] Updated weights for policy 0, policy_version 24160 (0.0006) [2023-03-07 08:07:15,345][155452] Updated weights for policy 0, policy_version 24170 (0.0006) [2023-03-07 08:07:16,151][155452] Updated weights for policy 0, policy_version 24180 (0.0006) [2023-03-07 08:07:16,933][155452] Updated weights for policy 0, policy_version 24190 (0.0006) [2023-03-07 08:07:17,722][155452] Updated weights for policy 0, policy_version 24200 (0.0006) [2023-03-07 08:07:18,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13039.0, 300 sec: 13030.8). Total num frames: 24788992. Throughput: 0: 13029.3. Samples: 24755827. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:07:18,367][155126] Avg episode reward: [(0, '1751.597')] [2023-03-07 08:07:18,507][155452] Updated weights for policy 0, policy_version 24210 (0.0006) [2023-03-07 08:07:19,280][155452] Updated weights for policy 0, policy_version 24220 (0.0006) [2023-03-07 08:07:20,065][155452] Updated weights for policy 0, policy_version 24230 (0.0006) [2023-03-07 08:07:20,846][155452] Updated weights for policy 0, policy_version 24240 (0.0006) [2023-03-07 08:07:21,641][155452] Updated weights for policy 0, policy_version 24250 (0.0007) [2023-03-07 08:07:22,423][155452] Updated weights for policy 0, policy_version 24260 (0.0007) [2023-03-07 08:07:23,216][155452] Updated weights for policy 0, policy_version 24270 (0.0007) [2023-03-07 08:07:23,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13039.0, 300 sec: 13030.8). Total num frames: 24853504. Throughput: 0: 13032.7. Samples: 24834000. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:07:23,367][155126] Avg episode reward: [(0, '1802.541')] [2023-03-07 08:07:23,995][155452] Updated weights for policy 0, policy_version 24280 (0.0005) [2023-03-07 08:07:24,782][155452] Updated weights for policy 0, policy_version 24290 (0.0006) [2023-03-07 08:07:25,555][155452] Updated weights for policy 0, policy_version 24300 (0.0006) [2023-03-07 08:07:26,359][155452] Updated weights for policy 0, policy_version 24310 (0.0006) [2023-03-07 08:07:27,139][155452] Updated weights for policy 0, policy_version 24320 (0.0007) [2023-03-07 08:07:27,922][155452] Updated weights for policy 0, policy_version 24330 (0.0006) [2023-03-07 08:07:28,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13030.8). Total num frames: 24919040. Throughput: 0: 13029.1. Samples: 24912088. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:07:28,367][155126] Avg episode reward: [(0, '1774.574')] [2023-03-07 08:07:28,372][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000024335_24919040.pth... [2023-03-07 08:07:28,402][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000021282_21792768.pth [2023-03-07 08:07:28,709][155452] Updated weights for policy 0, policy_version 24340 (0.0006) [2023-03-07 08:07:29,493][155452] Updated weights for policy 0, policy_version 24350 (0.0007) [2023-03-07 08:07:30,281][155452] Updated weights for policy 0, policy_version 24360 (0.0006) [2023-03-07 08:07:31,048][155452] Updated weights for policy 0, policy_version 24370 (0.0006) [2023-03-07 08:07:31,859][155452] Updated weights for policy 0, policy_version 24380 (0.0008) [2023-03-07 08:07:32,652][155452] Updated weights for policy 0, policy_version 24390 (0.0006) [2023-03-07 08:07:33,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13021.9, 300 sec: 13027.4). Total num frames: 24983552. Throughput: 0: 13030.1. Samples: 24951263. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:07:33,368][155126] Avg episode reward: [(0, '1822.084')] [2023-03-07 08:07:33,443][155452] Updated weights for policy 0, policy_version 24400 (0.0006) [2023-03-07 08:07:34,237][155452] Updated weights for policy 0, policy_version 24410 (0.0007) [2023-03-07 08:07:35,027][155452] Updated weights for policy 0, policy_version 24420 (0.0007) [2023-03-07 08:07:35,802][155452] Updated weights for policy 0, policy_version 24430 (0.0007) [2023-03-07 08:07:36,582][155452] Updated weights for policy 0, policy_version 24440 (0.0007) [2023-03-07 08:07:37,369][155452] Updated weights for policy 0, policy_version 24450 (0.0006) [2023-03-07 08:07:38,168][155452] Updated weights for policy 0, policy_version 24460 (0.0006) [2023-03-07 08:07:38,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13030.8). Total num frames: 25049088. Throughput: 0: 13032.4. Samples: 25029090. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:07:38,367][155126] Avg episode reward: [(0, '1673.522')] [2023-03-07 08:07:38,968][155452] Updated weights for policy 0, policy_version 24470 (0.0006) [2023-03-07 08:07:39,728][155452] Updated weights for policy 0, policy_version 24480 (0.0006) [2023-03-07 08:07:40,526][155452] Updated weights for policy 0, policy_version 24490 (0.0006) [2023-03-07 08:07:41,306][155452] Updated weights for policy 0, policy_version 24500 (0.0006) [2023-03-07 08:07:42,078][155452] Updated weights for policy 0, policy_version 24510 (0.0006) [2023-03-07 08:07:42,873][155452] Updated weights for policy 0, policy_version 24520 (0.0006) [2023-03-07 08:07:43,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13038.9, 300 sec: 13030.8). Total num frames: 25114624. Throughput: 0: 13034.5. Samples: 25107398. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:07:43,367][155126] Avg episode reward: [(0, '1686.971')] [2023-03-07 08:07:43,633][155452] Updated weights for policy 0, policy_version 24530 (0.0006) [2023-03-07 08:07:44,406][155452] Updated weights for policy 0, policy_version 24540 (0.0006) [2023-03-07 08:07:45,195][155452] Updated weights for policy 0, policy_version 24550 (0.0006) [2023-03-07 08:07:45,986][155452] Updated weights for policy 0, policy_version 24560 (0.0007) [2023-03-07 08:07:46,776][155452] Updated weights for policy 0, policy_version 24570 (0.0006) [2023-03-07 08:07:47,566][155452] Updated weights for policy 0, policy_version 24580 (0.0007) [2023-03-07 08:07:48,339][155452] Updated weights for policy 0, policy_version 24590 (0.0006) [2023-03-07 08:07:48,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13039.0, 300 sec: 13030.8). Total num frames: 25180160. Throughput: 0: 13040.9. Samples: 25146837. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:07:48,367][155126] Avg episode reward: [(0, '1658.812')] [2023-03-07 08:07:49,141][155452] Updated weights for policy 0, policy_version 24600 (0.0007) [2023-03-07 08:07:49,906][155452] Updated weights for policy 0, policy_version 24610 (0.0006) [2023-03-07 08:07:50,678][155452] Updated weights for policy 0, policy_version 24620 (0.0006) [2023-03-07 08:07:51,478][155452] Updated weights for policy 0, policy_version 24630 (0.0006) [2023-03-07 08:07:52,254][155452] Updated weights for policy 0, policy_version 24640 (0.0006) [2023-03-07 08:07:53,035][155452] Updated weights for policy 0, policy_version 24650 (0.0005) [2023-03-07 08:07:53,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13034.3). Total num frames: 25245696. Throughput: 0: 13048.7. Samples: 25225341. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 08:07:53,367][155126] Avg episode reward: [(0, '1769.754')] [2023-03-07 08:07:53,822][155452] Updated weights for policy 0, policy_version 24660 (0.0006) [2023-03-07 08:07:54,617][155452] Updated weights for policy 0, policy_version 24670 (0.0006) [2023-03-07 08:07:55,397][155452] Updated weights for policy 0, policy_version 24680 (0.0006) [2023-03-07 08:07:56,185][155452] Updated weights for policy 0, policy_version 24690 (0.0007) [2023-03-07 08:07:56,961][155452] Updated weights for policy 0, policy_version 24700 (0.0005) [2023-03-07 08:07:57,729][155452] Updated weights for policy 0, policy_version 24710 (0.0006) [2023-03-07 08:07:58,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13034.3). Total num frames: 25311232. Throughput: 0: 13042.4. Samples: 25303792. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 08:07:58,368][155126] Avg episode reward: [(0, '1548.494')] [2023-03-07 08:07:58,520][155452] Updated weights for policy 0, policy_version 24720 (0.0007) [2023-03-07 08:07:59,303][155452] Updated weights for policy 0, policy_version 24730 (0.0007) [2023-03-07 08:08:00,101][155452] Updated weights for policy 0, policy_version 24740 (0.0006) [2023-03-07 08:08:00,880][155452] Updated weights for policy 0, policy_version 24750 (0.0006) [2023-03-07 08:08:01,663][155452] Updated weights for policy 0, policy_version 24760 (0.0006) [2023-03-07 08:08:02,447][155452] Updated weights for policy 0, policy_version 24770 (0.0006) [2023-03-07 08:08:03,233][155452] Updated weights for policy 0, policy_version 24780 (0.0006) [2023-03-07 08:08:03,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13030.8). Total num frames: 25375744. Throughput: 0: 13043.1. Samples: 25342769. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 08:08:03,367][155126] Avg episode reward: [(0, '1750.033')] [2023-03-07 08:08:04,017][155452] Updated weights for policy 0, policy_version 24790 (0.0006) [2023-03-07 08:08:04,802][155452] Updated weights for policy 0, policy_version 24800 (0.0006) [2023-03-07 08:08:05,590][155452] Updated weights for policy 0, policy_version 24810 (0.0006) [2023-03-07 08:08:06,362][155452] Updated weights for policy 0, policy_version 24820 (0.0006) [2023-03-07 08:08:07,143][155452] Updated weights for policy 0, policy_version 24830 (0.0006) [2023-03-07 08:08:07,946][155452] Updated weights for policy 0, policy_version 24840 (0.0006) [2023-03-07 08:08:08,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13056.0, 300 sec: 13030.8). Total num frames: 25441280. Throughput: 0: 13047.4. Samples: 25421135. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:08:08,367][155126] Avg episode reward: [(0, '1848.194')] [2023-03-07 08:08:08,733][155452] Updated weights for policy 0, policy_version 24850 (0.0006) [2023-03-07 08:08:09,513][155452] Updated weights for policy 0, policy_version 24860 (0.0006) [2023-03-07 08:08:10,302][155452] Updated weights for policy 0, policy_version 24870 (0.0006) [2023-03-07 08:08:11,091][155452] Updated weights for policy 0, policy_version 24880 (0.0006) [2023-03-07 08:08:11,874][155452] Updated weights for policy 0, policy_version 24890 (0.0007) [2023-03-07 08:08:12,648][155452] Updated weights for policy 0, policy_version 24900 (0.0006) [2023-03-07 08:08:13,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13034.3). Total num frames: 25506816. Throughput: 0: 13053.4. Samples: 25499491. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:08:13,368][155126] Avg episode reward: [(0, '1495.512')] [2023-03-07 08:08:13,434][155452] Updated weights for policy 0, policy_version 24910 (0.0006) [2023-03-07 08:08:14,234][155452] Updated weights for policy 0, policy_version 24920 (0.0006) [2023-03-07 08:08:15,017][155452] Updated weights for policy 0, policy_version 24930 (0.0006) [2023-03-07 08:08:15,805][155452] Updated weights for policy 0, policy_version 24940 (0.0006) [2023-03-07 08:08:16,592][155452] Updated weights for policy 0, policy_version 24950 (0.0006) [2023-03-07 08:08:17,384][155452] Updated weights for policy 0, policy_version 24960 (0.0006) [2023-03-07 08:08:18,163][155452] Updated weights for policy 0, policy_version 24970 (0.0006) [2023-03-07 08:08:18,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13030.8). Total num frames: 25571328. Throughput: 0: 13052.2. Samples: 25538612. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:08:18,367][155126] Avg episode reward: [(0, '1717.337')] [2023-03-07 08:08:18,962][155452] Updated weights for policy 0, policy_version 24980 (0.0007) [2023-03-07 08:08:19,719][155452] Updated weights for policy 0, policy_version 24990 (0.0007) [2023-03-07 08:08:20,509][155452] Updated weights for policy 0, policy_version 25000 (0.0006) [2023-03-07 08:08:21,279][155452] Updated weights for policy 0, policy_version 25010 (0.0006) [2023-03-07 08:08:22,073][155452] Updated weights for policy 0, policy_version 25020 (0.0006) [2023-03-07 08:08:22,848][155452] Updated weights for policy 0, policy_version 25030 (0.0006) [2023-03-07 08:08:23,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13056.0, 300 sec: 13030.8). Total num frames: 25636864. Throughput: 0: 13060.3. Samples: 25616803. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:08:23,367][155126] Avg episode reward: [(0, '1734.000')] [2023-03-07 08:08:23,633][155452] Updated weights for policy 0, policy_version 25040 (0.0006) [2023-03-07 08:08:24,418][155452] Updated weights for policy 0, policy_version 25050 (0.0006) [2023-03-07 08:08:25,202][155452] Updated weights for policy 0, policy_version 25060 (0.0006) [2023-03-07 08:08:25,982][155452] Updated weights for policy 0, policy_version 25070 (0.0006) [2023-03-07 08:08:26,769][155452] Updated weights for policy 0, policy_version 25080 (0.0006) [2023-03-07 08:08:27,543][155452] Updated weights for policy 0, policy_version 25090 (0.0007) [2023-03-07 08:08:28,343][155452] Updated weights for policy 0, policy_version 25100 (0.0006) [2023-03-07 08:08:28,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13030.8). Total num frames: 25702400. Throughput: 0: 13066.0. Samples: 25695370. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:08:28,367][155126] Avg episode reward: [(0, '1463.832')] [2023-03-07 08:08:29,113][155452] Updated weights for policy 0, policy_version 25110 (0.0006) [2023-03-07 08:08:29,909][155452] Updated weights for policy 0, policy_version 25120 (0.0005) [2023-03-07 08:08:30,672][155452] Updated weights for policy 0, policy_version 25130 (0.0007) [2023-03-07 08:08:31,462][155452] Updated weights for policy 0, policy_version 25140 (0.0006) [2023-03-07 08:08:32,242][155452] Updated weights for policy 0, policy_version 25150 (0.0006) [2023-03-07 08:08:33,027][155452] Updated weights for policy 0, policy_version 25160 (0.0007) [2023-03-07 08:08:33,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13030.8). Total num frames: 25767936. Throughput: 0: 13060.8. Samples: 25734572. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:08:33,367][155126] Avg episode reward: [(0, '1445.936')] [2023-03-07 08:08:33,800][155452] Updated weights for policy 0, policy_version 25170 (0.0006) [2023-03-07 08:08:34,602][155452] Updated weights for policy 0, policy_version 25180 (0.0006) [2023-03-07 08:08:35,381][155452] Updated weights for policy 0, policy_version 25190 (0.0007) [2023-03-07 08:08:36,157][155452] Updated weights for policy 0, policy_version 25200 (0.0006) [2023-03-07 08:08:36,942][155452] Updated weights for policy 0, policy_version 25210 (0.0007) [2023-03-07 08:08:37,716][155452] Updated weights for policy 0, policy_version 25220 (0.0006) [2023-03-07 08:08:38,367][155126] Fps is (10 sec: 13107.0, 60 sec: 13073.0, 300 sec: 13034.3). Total num frames: 25833472. Throughput: 0: 13062.2. Samples: 25813140. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:08:38,368][155126] Avg episode reward: [(0, '1762.046')] [2023-03-07 08:08:38,514][155452] Updated weights for policy 0, policy_version 25230 (0.0006) [2023-03-07 08:08:39,310][155452] Updated weights for policy 0, policy_version 25240 (0.0006) [2023-03-07 08:08:40,091][155452] Updated weights for policy 0, policy_version 25250 (0.0006) [2023-03-07 08:08:40,868][155452] Updated weights for policy 0, policy_version 25260 (0.0006) [2023-03-07 08:08:41,639][155452] Updated weights for policy 0, policy_version 25270 (0.0006) [2023-03-07 08:08:42,421][155452] Updated weights for policy 0, policy_version 25280 (0.0006) [2023-03-07 08:08:43,205][155452] Updated weights for policy 0, policy_version 25290 (0.0006) [2023-03-07 08:08:43,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13034.3). Total num frames: 25899008. Throughput: 0: 13061.5. Samples: 25891558. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:08:43,367][155126] Avg episode reward: [(0, '1562.632')] [2023-03-07 08:08:43,998][155452] Updated weights for policy 0, policy_version 25300 (0.0006) [2023-03-07 08:08:44,809][155452] Updated weights for policy 0, policy_version 25310 (0.0006) [2023-03-07 08:08:45,569][155452] Updated weights for policy 0, policy_version 25320 (0.0007) [2023-03-07 08:08:46,365][155452] Updated weights for policy 0, policy_version 25330 (0.0006) [2023-03-07 08:08:47,130][155452] Updated weights for policy 0, policy_version 25340 (0.0007) [2023-03-07 08:08:47,905][155452] Updated weights for policy 0, policy_version 25350 (0.0006) [2023-03-07 08:08:48,367][155126] Fps is (10 sec: 13005.1, 60 sec: 13056.0, 300 sec: 13030.8). Total num frames: 25963520. Throughput: 0: 13062.0. Samples: 25930559. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:08:48,367][155126] Avg episode reward: [(0, '1579.406')] [2023-03-07 08:08:48,699][155452] Updated weights for policy 0, policy_version 25360 (0.0006) [2023-03-07 08:08:49,477][155452] Updated weights for policy 0, policy_version 25370 (0.0006) [2023-03-07 08:08:50,255][155452] Updated weights for policy 0, policy_version 25380 (0.0007) [2023-03-07 08:08:51,037][155452] Updated weights for policy 0, policy_version 25390 (0.0007) [2023-03-07 08:08:51,819][155452] Updated weights for policy 0, policy_version 25400 (0.0006) [2023-03-07 08:08:52,597][155452] Updated weights for policy 0, policy_version 25410 (0.0006) [2023-03-07 08:08:53,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13056.0, 300 sec: 13034.3). Total num frames: 26029056. Throughput: 0: 13074.5. Samples: 26009489. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:08:53,368][155126] Avg episode reward: [(0, '1753.541')] [2023-03-07 08:08:53,369][155452] Updated weights for policy 0, policy_version 25420 (0.0006) [2023-03-07 08:08:54,166][155452] Updated weights for policy 0, policy_version 25430 (0.0006) [2023-03-07 08:08:54,969][155452] Updated weights for policy 0, policy_version 25440 (0.0006) [2023-03-07 08:08:55,754][155452] Updated weights for policy 0, policy_version 25450 (0.0006) [2023-03-07 08:08:56,530][155452] Updated weights for policy 0, policy_version 25460 (0.0007) [2023-03-07 08:08:57,307][155452] Updated weights for policy 0, policy_version 25470 (0.0006) [2023-03-07 08:08:58,097][155452] Updated weights for policy 0, policy_version 25480 (0.0006) [2023-03-07 08:08:58,367][155126] Fps is (10 sec: 13107.0, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 26094592. Throughput: 0: 13070.3. Samples: 26087655. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:08:58,367][155126] Avg episode reward: [(0, '1659.148')] [2023-03-07 08:08:58,869][155452] Updated weights for policy 0, policy_version 25490 (0.0006) [2023-03-07 08:08:59,655][155452] Updated weights for policy 0, policy_version 25500 (0.0007) [2023-03-07 08:09:00,444][155452] Updated weights for policy 0, policy_version 25510 (0.0006) [2023-03-07 08:09:01,232][155452] Updated weights for policy 0, policy_version 25520 (0.0008) [2023-03-07 08:09:02,004][155452] Updated weights for policy 0, policy_version 25530 (0.0006) [2023-03-07 08:09:02,779][155452] Updated weights for policy 0, policy_version 25540 (0.0006) [2023-03-07 08:09:03,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13037.8). Total num frames: 26160128. Throughput: 0: 13071.7. Samples: 26126840. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:09:03,367][155126] Avg episode reward: [(0, '1695.321')] [2023-03-07 08:09:03,555][155452] Updated weights for policy 0, policy_version 25550 (0.0006) [2023-03-07 08:09:04,345][155452] Updated weights for policy 0, policy_version 25560 (0.0005) [2023-03-07 08:09:05,137][155452] Updated weights for policy 0, policy_version 25570 (0.0006) [2023-03-07 08:09:05,933][155452] Updated weights for policy 0, policy_version 25580 (0.0006) [2023-03-07 08:09:06,713][155452] Updated weights for policy 0, policy_version 25590 (0.0007) [2023-03-07 08:09:07,522][155452] Updated weights for policy 0, policy_version 25600 (0.0006) [2023-03-07 08:09:08,305][155452] Updated weights for policy 0, policy_version 25610 (0.0006) [2023-03-07 08:09:08,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13034.3). Total num frames: 26224640. Throughput: 0: 13077.7. Samples: 26205299. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:09:08,367][155126] Avg episode reward: [(0, '1819.190')] [2023-03-07 08:09:09,089][155452] Updated weights for policy 0, policy_version 25620 (0.0006) [2023-03-07 08:09:09,882][155452] Updated weights for policy 0, policy_version 25630 (0.0006) [2023-03-07 08:09:10,658][155452] Updated weights for policy 0, policy_version 25640 (0.0005) [2023-03-07 08:09:11,447][155452] Updated weights for policy 0, policy_version 25650 (0.0006) [2023-03-07 08:09:12,216][155452] Updated weights for policy 0, policy_version 25660 (0.0006) [2023-03-07 08:09:12,998][155452] Updated weights for policy 0, policy_version 25670 (0.0006) [2023-03-07 08:09:13,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 26290176. Throughput: 0: 13065.0. Samples: 26283295. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:09:13,368][155126] Avg episode reward: [(0, '1714.019')] [2023-03-07 08:09:13,807][155452] Updated weights for policy 0, policy_version 25680 (0.0006) [2023-03-07 08:09:14,572][155452] Updated weights for policy 0, policy_version 25690 (0.0006) [2023-03-07 08:09:15,362][155452] Updated weights for policy 0, policy_version 25700 (0.0006) [2023-03-07 08:09:16,148][155452] Updated weights for policy 0, policy_version 25710 (0.0007) [2023-03-07 08:09:16,923][155452] Updated weights for policy 0, policy_version 25720 (0.0006) [2023-03-07 08:09:17,713][155452] Updated weights for policy 0, policy_version 25730 (0.0006) [2023-03-07 08:09:18,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13037.8). Total num frames: 26355712. Throughput: 0: 13058.7. Samples: 26322213. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:09:18,367][155126] Avg episode reward: [(0, '1723.143')] [2023-03-07 08:09:18,495][155452] Updated weights for policy 0, policy_version 25740 (0.0007) [2023-03-07 08:09:19,276][155452] Updated weights for policy 0, policy_version 25750 (0.0007) [2023-03-07 08:09:20,071][155452] Updated weights for policy 0, policy_version 25760 (0.0006) [2023-03-07 08:09:20,839][155452] Updated weights for policy 0, policy_version 25770 (0.0006) [2023-03-07 08:09:21,612][155452] Updated weights for policy 0, policy_version 25780 (0.0006) [2023-03-07 08:09:22,405][155452] Updated weights for policy 0, policy_version 25790 (0.0006) [2023-03-07 08:09:23,186][155452] Updated weights for policy 0, policy_version 25800 (0.0006) [2023-03-07 08:09:23,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13037.8). Total num frames: 26421248. Throughput: 0: 13064.0. Samples: 26401016. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:09:23,367][155126] Avg episode reward: [(0, '1619.502')] [2023-03-07 08:09:23,965][155452] Updated weights for policy 0, policy_version 25810 (0.0006) [2023-03-07 08:09:24,749][155452] Updated weights for policy 0, policy_version 25820 (0.0006) [2023-03-07 08:09:25,534][155452] Updated weights for policy 0, policy_version 25830 (0.0006) [2023-03-07 08:09:26,327][155452] Updated weights for policy 0, policy_version 25840 (0.0006) [2023-03-07 08:09:27,112][155452] Updated weights for policy 0, policy_version 25850 (0.0006) [2023-03-07 08:09:27,880][155452] Updated weights for policy 0, policy_version 25860 (0.0006) [2023-03-07 08:09:28,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13037.8). Total num frames: 26486784. Throughput: 0: 13064.5. Samples: 26479462. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:09:28,367][155126] Avg episode reward: [(0, '1653.608')] [2023-03-07 08:09:28,371][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000025866_26486784.pth... [2023-03-07 08:09:28,404][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000022808_23355392.pth [2023-03-07 08:09:28,675][155452] Updated weights for policy 0, policy_version 25870 (0.0007) [2023-03-07 08:09:29,480][155452] Updated weights for policy 0, policy_version 25880 (0.0006) [2023-03-07 08:09:30,265][155452] Updated weights for policy 0, policy_version 25890 (0.0005) [2023-03-07 08:09:31,056][155452] Updated weights for policy 0, policy_version 25900 (0.0006) [2023-03-07 08:09:31,839][155452] Updated weights for policy 0, policy_version 25910 (0.0006) [2023-03-07 08:09:32,637][155452] Updated weights for policy 0, policy_version 25920 (0.0006) [2023-03-07 08:09:33,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 26551296. Throughput: 0: 13057.1. 
Samples: 26518128. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:09:33,367][155126] Avg episode reward: [(0, '1706.731')] [2023-03-07 08:09:33,419][155452] Updated weights for policy 0, policy_version 25930 (0.0006) [2023-03-07 08:09:34,195][155452] Updated weights for policy 0, policy_version 25940 (0.0006) [2023-03-07 08:09:34,985][155452] Updated weights for policy 0, policy_version 25950 (0.0005) [2023-03-07 08:09:35,763][155452] Updated weights for policy 0, policy_version 25960 (0.0006) [2023-03-07 08:09:36,548][155452] Updated weights for policy 0, policy_version 25970 (0.0006) [2023-03-07 08:09:37,353][155452] Updated weights for policy 0, policy_version 25980 (0.0006) [2023-03-07 08:09:38,135][155452] Updated weights for policy 0, policy_version 25990 (0.0006) [2023-03-07 08:09:38,367][155126] Fps is (10 sec: 12902.3, 60 sec: 13039.0, 300 sec: 13034.3). Total num frames: 26615808. Throughput: 0: 13041.9. Samples: 26596376. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:09:38,367][155126] Avg episode reward: [(0, '1861.898')] [2023-03-07 08:09:38,923][155452] Updated weights for policy 0, policy_version 26000 (0.0006) [2023-03-07 08:09:39,704][155452] Updated weights for policy 0, policy_version 26010 (0.0006) [2023-03-07 08:09:40,484][155452] Updated weights for policy 0, policy_version 26020 (0.0007) [2023-03-07 08:09:41,275][155452] Updated weights for policy 0, policy_version 26030 (0.0007) [2023-03-07 08:09:42,057][155452] Updated weights for policy 0, policy_version 26040 (0.0007) [2023-03-07 08:09:42,846][155452] Updated weights for policy 0, policy_version 26050 (0.0006) [2023-03-07 08:09:43,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 26681344. Throughput: 0: 13042.2. Samples: 26674554. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:09:43,367][155126] Avg episode reward: [(0, '1727.748')] [2023-03-07 08:09:43,629][155452] Updated weights for policy 0, policy_version 26060 (0.0006) [2023-03-07 08:09:44,417][155452] Updated weights for policy 0, policy_version 26070 (0.0006) [2023-03-07 08:09:45,203][155452] Updated weights for policy 0, policy_version 26080 (0.0006) [2023-03-07 08:09:45,989][155452] Updated weights for policy 0, policy_version 26090 (0.0006) [2023-03-07 08:09:46,789][155452] Updated weights for policy 0, policy_version 26100 (0.0007) [2023-03-07 08:09:47,569][155452] Updated weights for policy 0, policy_version 26110 (0.0006) [2023-03-07 08:09:48,328][155452] Updated weights for policy 0, policy_version 26120 (0.0006) [2023-03-07 08:09:48,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13034.3). Total num frames: 26746880. Throughput: 0: 13040.4. Samples: 26713655. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:09:48,367][155126] Avg episode reward: [(0, '1675.425')] [2023-03-07 08:09:49,121][155452] Updated weights for policy 0, policy_version 26130 (0.0006) [2023-03-07 08:09:49,901][155452] Updated weights for policy 0, policy_version 26140 (0.0006) [2023-03-07 08:09:50,696][155452] Updated weights for policy 0, policy_version 26150 (0.0006) [2023-03-07 08:09:51,474][155452] Updated weights for policy 0, policy_version 26160 (0.0006) [2023-03-07 08:09:52,241][155452] Updated weights for policy 0, policy_version 26170 (0.0007) [2023-03-07 08:09:53,025][155452] Updated weights for policy 0, policy_version 26180 (0.0006) [2023-03-07 08:09:53,367][155126] Fps is (10 sec: 13107.0, 60 sec: 13056.0, 300 sec: 13034.3). Total num frames: 26812416. Throughput: 0: 13040.4. Samples: 26792119. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:09:53,368][155126] Avg episode reward: [(0, '1730.229')] [2023-03-07 08:09:53,817][155452] Updated weights for policy 0, policy_version 26190 (0.0006) [2023-03-07 08:09:54,600][155452] Updated weights for policy 0, policy_version 26200 (0.0007) [2023-03-07 08:09:55,382][155452] Updated weights for policy 0, policy_version 26210 (0.0006) [2023-03-07 08:09:56,152][155452] Updated weights for policy 0, policy_version 26220 (0.0006) [2023-03-07 08:09:56,936][155452] Updated weights for policy 0, policy_version 26230 (0.0006) [2023-03-07 08:09:57,722][155452] Updated weights for policy 0, policy_version 26240 (0.0006) [2023-03-07 08:09:58,367][155126] Fps is (10 sec: 13107.0, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 26877952. Throughput: 0: 13051.3. Samples: 26870604. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:09:58,368][155126] Avg episode reward: [(0, '1879.602')] [2023-03-07 08:09:58,506][155452] Updated weights for policy 0, policy_version 26250 (0.0006) [2023-03-07 08:09:59,283][155452] Updated weights for policy 0, policy_version 26260 (0.0006) [2023-03-07 08:10:00,062][155452] Updated weights for policy 0, policy_version 26270 (0.0007) [2023-03-07 08:10:00,849][155452] Updated weights for policy 0, policy_version 26280 (0.0006) [2023-03-07 08:10:01,642][155452] Updated weights for policy 0, policy_version 26290 (0.0006) [2023-03-07 08:10:02,427][155452] Updated weights for policy 0, policy_version 26300 (0.0006) [2023-03-07 08:10:03,240][155452] Updated weights for policy 0, policy_version 26310 (0.0006) [2023-03-07 08:10:03,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 26942464. Throughput: 0: 13059.3. Samples: 26909885. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:10:03,368][155126] Avg episode reward: [(0, '1519.756')] [2023-03-07 08:10:04,017][155452] Updated weights for policy 0, policy_version 26320 (0.0006) [2023-03-07 08:10:04,805][155452] Updated weights for policy 0, policy_version 26330 (0.0006) [2023-03-07 08:10:05,570][155452] Updated weights for policy 0, policy_version 26340 (0.0006) [2023-03-07 08:10:06,353][155452] Updated weights for policy 0, policy_version 26350 (0.0006) [2023-03-07 08:10:07,146][155452] Updated weights for policy 0, policy_version 26360 (0.0006) [2023-03-07 08:10:07,945][155452] Updated weights for policy 0, policy_version 26370 (0.0006) [2023-03-07 08:10:08,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 27008000. Throughput: 0: 13046.1. Samples: 26988094. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:10:08,368][155126] Avg episode reward: [(0, '1644.727')] [2023-03-07 08:10:08,718][155452] Updated weights for policy 0, policy_version 26380 (0.0006) [2023-03-07 08:10:09,517][155452] Updated weights for policy 0, policy_version 26390 (0.0006) [2023-03-07 08:10:10,301][155452] Updated weights for policy 0, policy_version 26400 (0.0007) [2023-03-07 08:10:11,070][155452] Updated weights for policy 0, policy_version 26410 (0.0006) [2023-03-07 08:10:11,853][155452] Updated weights for policy 0, policy_version 26420 (0.0007) [2023-03-07 08:10:12,635][155452] Updated weights for policy 0, policy_version 26430 (0.0006) [2023-03-07 08:10:13,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 27073536. Throughput: 0: 13044.2. Samples: 27066451. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:10:13,368][155126] Avg episode reward: [(0, '1774.282')] [2023-03-07 08:10:13,412][155452] Updated weights for policy 0, policy_version 26440 (0.0006) [2023-03-07 08:10:14,209][155452] Updated weights for policy 0, policy_version 26450 (0.0006) [2023-03-07 08:10:15,008][155452] Updated weights for policy 0, policy_version 26460 (0.0006) [2023-03-07 08:10:15,777][155452] Updated weights for policy 0, policy_version 26470 (0.0006) [2023-03-07 08:10:16,570][155452] Updated weights for policy 0, policy_version 26480 (0.0006) [2023-03-07 08:10:17,341][155452] Updated weights for policy 0, policy_version 26490 (0.0006) [2023-03-07 08:10:18,150][155452] Updated weights for policy 0, policy_version 26500 (0.0006) [2023-03-07 08:10:18,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 27139072. Throughput: 0: 13053.7. Samples: 27105549. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:10:18,367][155126] Avg episode reward: [(0, '1764.377')] [2023-03-07 08:10:18,926][155452] Updated weights for policy 0, policy_version 26510 (0.0006) [2023-03-07 08:10:19,707][155452] Updated weights for policy 0, policy_version 26520 (0.0006) [2023-03-07 08:10:20,464][155452] Updated weights for policy 0, policy_version 26530 (0.0005) [2023-03-07 08:10:21,253][155452] Updated weights for policy 0, policy_version 26540 (0.0006) [2023-03-07 08:10:22,055][155452] Updated weights for policy 0, policy_version 26550 (0.0006) [2023-03-07 08:10:22,840][155452] Updated weights for policy 0, policy_version 26560 (0.0006) [2023-03-07 08:10:23,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13044.7). Total num frames: 27203584. Throughput: 0: 13051.3. Samples: 27183683. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 4.0) [2023-03-07 08:10:23,367][155126] Avg episode reward: [(0, '1590.844')] [2023-03-07 08:10:23,637][155452] Updated weights for policy 0, policy_version 26570 (0.0006) [2023-03-07 08:10:24,409][155452] Updated weights for policy 0, policy_version 26580 (0.0006) [2023-03-07 08:10:25,212][155452] Updated weights for policy 0, policy_version 26590 (0.0006) [2023-03-07 08:10:25,986][155452] Updated weights for policy 0, policy_version 26600 (0.0007) [2023-03-07 08:10:26,780][155452] Updated weights for policy 0, policy_version 26610 (0.0006) [2023-03-07 08:10:27,558][155452] Updated weights for policy 0, policy_version 26620 (0.0005) [2023-03-07 08:10:28,338][155452] Updated weights for policy 0, policy_version 26630 (0.0006) [2023-03-07 08:10:28,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13038.9, 300 sec: 13044.7). Total num frames: 27269120. Throughput: 0: 13052.8. Samples: 27261928. Policy #0 lag: (min: 0.0, avg: 1.3, max: 4.0) [2023-03-07 08:10:28,367][155126] Avg episode reward: [(0, '1538.895')] [2023-03-07 08:10:29,124][155452] Updated weights for policy 0, policy_version 26640 (0.0006) [2023-03-07 08:10:29,914][155452] Updated weights for policy 0, policy_version 26650 (0.0006) [2023-03-07 08:10:30,712][155452] Updated weights for policy 0, policy_version 26660 (0.0006) [2023-03-07 08:10:31,505][155452] Updated weights for policy 0, policy_version 26670 (0.0006) [2023-03-07 08:10:32,298][155452] Updated weights for policy 0, policy_version 26680 (0.0006) [2023-03-07 08:10:33,092][155452] Updated weights for policy 0, policy_version 26690 (0.0006) [2023-03-07 08:10:33,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13044.7). Total num frames: 27333632. Throughput: 0: 13053.0. Samples: 27301040. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 4.0) [2023-03-07 08:10:33,367][155126] Avg episode reward: [(0, '1688.571')] [2023-03-07 08:10:33,874][155452] Updated weights for policy 0, policy_version 26700 (0.0006) [2023-03-07 08:10:34,669][155452] Updated weights for policy 0, policy_version 26710 (0.0006) [2023-03-07 08:10:35,444][155452] Updated weights for policy 0, policy_version 26720 (0.0006) [2023-03-07 08:10:36,228][155452] Updated weights for policy 0, policy_version 26730 (0.0006) [2023-03-07 08:10:37,005][155452] Updated weights for policy 0, policy_version 26740 (0.0007) [2023-03-07 08:10:37,788][155452] Updated weights for policy 0, policy_version 26750 (0.0006) [2023-03-07 08:10:38,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 27399168. Throughput: 0: 13041.7. Samples: 27378994. Policy #0 lag: (min: 0.0, avg: 1.3, max: 4.0) [2023-03-07 08:10:38,367][155126] Avg episode reward: [(0, '1737.499')] [2023-03-07 08:10:38,573][155452] Updated weights for policy 0, policy_version 26760 (0.0006) [2023-03-07 08:10:39,379][155452] Updated weights for policy 0, policy_version 26770 (0.0006) [2023-03-07 08:10:40,164][155452] Updated weights for policy 0, policy_version 26780 (0.0007) [2023-03-07 08:10:40,932][155452] Updated weights for policy 0, policy_version 26790 (0.0006) [2023-03-07 08:10:41,697][155452] Updated weights for policy 0, policy_version 26800 (0.0007) [2023-03-07 08:10:42,493][155452] Updated weights for policy 0, policy_version 26810 (0.0006) [2023-03-07 08:10:43,276][155452] Updated weights for policy 0, policy_version 26820 (0.0006) [2023-03-07 08:10:43,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 27464704. Throughput: 0: 13040.2. Samples: 27457411. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:10:43,367][155126] Avg episode reward: [(0, '1610.663')] [2023-03-07 08:10:44,055][155452] Updated weights for policy 0, policy_version 26830 (0.0006) [2023-03-07 08:10:44,835][155452] Updated weights for policy 0, policy_version 26840 (0.0006) [2023-03-07 08:10:45,623][155452] Updated weights for policy 0, policy_version 26850 (0.0005) [2023-03-07 08:10:46,401][155452] Updated weights for policy 0, policy_version 26860 (0.0006) [2023-03-07 08:10:47,192][155452] Updated weights for policy 0, policy_version 26870 (0.0006) [2023-03-07 08:10:47,996][155452] Updated weights for policy 0, policy_version 26880 (0.0006) [2023-03-07 08:10:48,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 27530240. Throughput: 0: 13043.1. Samples: 27496821. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:10:48,367][155126] Avg episode reward: [(0, '1863.946')] [2023-03-07 08:10:48,763][155452] Updated weights for policy 0, policy_version 26890 (0.0006) [2023-03-07 08:10:49,547][155452] Updated weights for policy 0, policy_version 26900 (0.0006) [2023-03-07 08:10:50,328][155452] Updated weights for policy 0, policy_version 26910 (0.0008) [2023-03-07 08:10:51,115][155452] Updated weights for policy 0, policy_version 26920 (0.0006) [2023-03-07 08:10:51,889][155452] Updated weights for policy 0, policy_version 26930 (0.0006) [2023-03-07 08:10:52,676][155452] Updated weights for policy 0, policy_version 26940 (0.0008) [2023-03-07 08:10:53,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13039.0, 300 sec: 13048.2). Total num frames: 27594752. Throughput: 0: 13042.7. Samples: 27575013. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:10:53,367][155126] Avg episode reward: [(0, '1791.327')] [2023-03-07 08:10:53,471][155452] Updated weights for policy 0, policy_version 26950 (0.0006) [2023-03-07 08:10:54,243][155452] Updated weights for policy 0, policy_version 26960 (0.0006) [2023-03-07 08:10:55,032][155452] Updated weights for policy 0, policy_version 26970 (0.0007) [2023-03-07 08:10:55,804][155452] Updated weights for policy 0, policy_version 26980 (0.0006) [2023-03-07 08:10:56,601][155452] Updated weights for policy 0, policy_version 26990 (0.0007) [2023-03-07 08:10:57,389][155452] Updated weights for policy 0, policy_version 27000 (0.0006) [2023-03-07 08:10:58,177][155452] Updated weights for policy 0, policy_version 27010 (0.0006) [2023-03-07 08:10:58,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13038.9, 300 sec: 13048.2). Total num frames: 27660288. Throughput: 0: 13037.6. Samples: 27653144. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 08:10:58,368][155126] Avg episode reward: [(0, '1706.649')] [2023-03-07 08:10:58,965][155452] Updated weights for policy 0, policy_version 27020 (0.0006) [2023-03-07 08:10:59,778][155452] Updated weights for policy 0, policy_version 27030 (0.0006) [2023-03-07 08:11:00,562][155452] Updated weights for policy 0, policy_version 27040 (0.0006) [2023-03-07 08:11:01,348][155452] Updated weights for policy 0, policy_version 27050 (0.0006) [2023-03-07 08:11:02,127][155452] Updated weights for policy 0, policy_version 27060 (0.0008) [2023-03-07 08:11:02,923][155452] Updated weights for policy 0, policy_version 27070 (0.0006) [2023-03-07 08:11:03,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13039.0, 300 sec: 13048.2). Total num frames: 27724800. Throughput: 0: 13026.3. Samples: 27691731. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 08:11:03,367][155126] Avg episode reward: [(0, '1733.509')] [2023-03-07 08:11:03,706][155452] Updated weights for policy 0, policy_version 27080 (0.0005) [2023-03-07 08:11:04,491][155452] Updated weights for policy 0, policy_version 27090 (0.0007) [2023-03-07 08:11:05,277][155452] Updated weights for policy 0, policy_version 27100 (0.0006) [2023-03-07 08:11:06,041][155452] Updated weights for policy 0, policy_version 27110 (0.0006) [2023-03-07 08:11:06,829][155452] Updated weights for policy 0, policy_version 27120 (0.0007) [2023-03-07 08:11:07,635][155452] Updated weights for policy 0, policy_version 27130 (0.0006) [2023-03-07 08:11:08,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13039.0, 300 sec: 13048.2). Total num frames: 27790336. Throughput: 0: 13037.9. Samples: 27770389. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 08:11:08,367][155126] Avg episode reward: [(0, '1764.587')] [2023-03-07 08:11:08,418][155452] Updated weights for policy 0, policy_version 27140 (0.0006) [2023-03-07 08:11:09,206][155452] Updated weights for policy 0, policy_version 27150 (0.0006) [2023-03-07 08:11:09,998][155452] Updated weights for policy 0, policy_version 27160 (0.0005) [2023-03-07 08:11:10,789][155452] Updated weights for policy 0, policy_version 27170 (0.0007) [2023-03-07 08:11:11,565][155452] Updated weights for policy 0, policy_version 27180 (0.0006) [2023-03-07 08:11:12,346][155452] Updated weights for policy 0, policy_version 27190 (0.0006) [2023-03-07 08:11:13,133][155452] Updated weights for policy 0, policy_version 27200 (0.0006) [2023-03-07 08:11:13,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13044.7). Total num frames: 27854848. Throughput: 0: 13033.5. Samples: 27848436. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 08:11:13,367][155126] Avg episode reward: [(0, '1790.854')] [2023-03-07 08:11:13,913][155452] Updated weights for policy 0, policy_version 27210 (0.0006) [2023-03-07 08:11:14,686][155452] Updated weights for policy 0, policy_version 27220 (0.0005) [2023-03-07 08:11:15,470][155452] Updated weights for policy 0, policy_version 27230 (0.0006) [2023-03-07 08:11:16,260][155452] Updated weights for policy 0, policy_version 27240 (0.0006) [2023-03-07 08:11:17,058][155452] Updated weights for policy 0, policy_version 27250 (0.0006) [2023-03-07 08:11:17,840][155452] Updated weights for policy 0, policy_version 27260 (0.0006) [2023-03-07 08:11:18,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13039.0, 300 sec: 13051.7). Total num frames: 27921408. Throughput: 0: 13037.6. Samples: 27887732. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:11:18,367][155126] Avg episode reward: [(0, '1732.929')] [2023-03-07 08:11:18,628][155452] Updated weights for policy 0, policy_version 27270 (0.0006) [2023-03-07 08:11:19,399][155452] Updated weights for policy 0, policy_version 27280 (0.0006) [2023-03-07 08:11:20,184][155452] Updated weights for policy 0, policy_version 27290 (0.0007) [2023-03-07 08:11:20,974][155452] Updated weights for policy 0, policy_version 27300 (0.0006) [2023-03-07 08:11:21,763][155452] Updated weights for policy 0, policy_version 27310 (0.0006) [2023-03-07 08:11:22,555][155452] Updated weights for policy 0, policy_version 27320 (0.0005) [2023-03-07 08:11:23,347][155452] Updated weights for policy 0, policy_version 27330 (0.0007) [2023-03-07 08:11:23,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13044.7). Total num frames: 27985920. Throughput: 0: 13039.7. Samples: 27965783. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:11:23,367][155126] Avg episode reward: [(0, '1775.368')] [2023-03-07 08:11:24,118][155452] Updated weights for policy 0, policy_version 27340 (0.0006) [2023-03-07 08:11:24,893][155452] Updated weights for policy 0, policy_version 27350 (0.0006) [2023-03-07 08:11:25,683][155452] Updated weights for policy 0, policy_version 27360 (0.0007) [2023-03-07 08:11:26,452][155452] Updated weights for policy 0, policy_version 27370 (0.0006) [2023-03-07 08:11:27,237][155452] Updated weights for policy 0, policy_version 27380 (0.0006) [2023-03-07 08:11:28,009][155452] Updated weights for policy 0, policy_version 27390 (0.0006) [2023-03-07 08:11:28,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13048.2). Total num frames: 28051456. Throughput: 0: 13043.3. Samples: 28044360. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:11:28,367][155126] Avg episode reward: [(0, '1977.160')] [2023-03-07 08:11:28,371][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000027394_28051456.pth... [2023-03-07 08:11:28,401][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000024335_24919040.pth [2023-03-07 08:11:28,826][155452] Updated weights for policy 0, policy_version 27400 (0.0006) [2023-03-07 08:11:29,592][155452] Updated weights for policy 0, policy_version 27410 (0.0005) [2023-03-07 08:11:30,379][155452] Updated weights for policy 0, policy_version 27420 (0.0006) [2023-03-07 08:11:31,159][155452] Updated weights for policy 0, policy_version 27430 (0.0006) [2023-03-07 08:11:31,930][155452] Updated weights for policy 0, policy_version 27440 (0.0006) [2023-03-07 08:11:32,730][155452] Updated weights for policy 0, policy_version 27450 (0.0007) [2023-03-07 08:11:33,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 28116992. Throughput: 0: 13038.4. 
Samples: 28083549. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:11:33,367][155126] Avg episode reward: [(0, '1555.491')] [2023-03-07 08:11:33,506][155452] Updated weights for policy 0, policy_version 27460 (0.0006) [2023-03-07 08:11:34,295][155452] Updated weights for policy 0, policy_version 27470 (0.0006) [2023-03-07 08:11:35,076][155452] Updated weights for policy 0, policy_version 27480 (0.0007) [2023-03-07 08:11:35,864][155452] Updated weights for policy 0, policy_version 27490 (0.0006) [2023-03-07 08:11:36,648][155452] Updated weights for policy 0, policy_version 27500 (0.0006) [2023-03-07 08:11:37,418][155452] Updated weights for policy 0, policy_version 27510 (0.0006) [2023-03-07 08:11:38,192][155452] Updated weights for policy 0, policy_version 27520 (0.0007) [2023-03-07 08:11:38,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 28182528. Throughput: 0: 13045.0. Samples: 28162038. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:11:38,367][155126] Avg episode reward: [(0, '1493.694')] [2023-03-07 08:11:38,999][155452] Updated weights for policy 0, policy_version 27530 (0.0007) [2023-03-07 08:11:39,778][155452] Updated weights for policy 0, policy_version 27540 (0.0006) [2023-03-07 08:11:40,564][155452] Updated weights for policy 0, policy_version 27550 (0.0006) [2023-03-07 08:11:41,364][155452] Updated weights for policy 0, policy_version 27560 (0.0006) [2023-03-07 08:11:42,141][155452] Updated weights for policy 0, policy_version 27570 (0.0006) [2023-03-07 08:11:42,911][155452] Updated weights for policy 0, policy_version 27580 (0.0006) [2023-03-07 08:11:43,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13038.9, 300 sec: 13048.2). Total num frames: 28247040. Throughput: 0: 13049.7. Samples: 28240380. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:11:43,367][155126] Avg episode reward: [(0, '1670.188')] [2023-03-07 08:11:43,692][155452] Updated weights for policy 0, policy_version 27590 (0.0006) [2023-03-07 08:11:44,493][155452] Updated weights for policy 0, policy_version 27600 (0.0006) [2023-03-07 08:11:45,271][155452] Updated weights for policy 0, policy_version 27610 (0.0006) [2023-03-07 08:11:46,061][155452] Updated weights for policy 0, policy_version 27620 (0.0007) [2023-03-07 08:11:46,859][155452] Updated weights for policy 0, policy_version 27630 (0.0006) [2023-03-07 08:11:47,638][155452] Updated weights for policy 0, policy_version 27640 (0.0006) [2023-03-07 08:11:48,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 28312576. Throughput: 0: 13061.5. Samples: 28279497. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:11:48,367][155126] Avg episode reward: [(0, '1429.753')] [2023-03-07 08:11:48,427][155452] Updated weights for policy 0, policy_version 27650 (0.0006) [2023-03-07 08:11:49,203][155452] Updated weights for policy 0, policy_version 27660 (0.0006) [2023-03-07 08:11:50,003][155452] Updated weights for policy 0, policy_version 27670 (0.0006) [2023-03-07 08:11:50,806][155452] Updated weights for policy 0, policy_version 27680 (0.0006) [2023-03-07 08:11:51,573][155452] Updated weights for policy 0, policy_version 27690 (0.0006) [2023-03-07 08:11:52,349][155452] Updated weights for policy 0, policy_version 27700 (0.0006) [2023-03-07 08:11:53,156][155452] Updated weights for policy 0, policy_version 27710 (0.0006) [2023-03-07 08:11:53,367][155126] Fps is (10 sec: 13005.1, 60 sec: 13038.9, 300 sec: 13048.2). Total num frames: 28377088. Throughput: 0: 13047.7. Samples: 28357534. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:11:53,367][155126] Avg episode reward: [(0, '1784.467')] [2023-03-07 08:11:53,927][155452] Updated weights for policy 0, policy_version 27720 (0.0006) [2023-03-07 08:11:54,715][155452] Updated weights for policy 0, policy_version 27730 (0.0006) [2023-03-07 08:11:55,508][155452] Updated weights for policy 0, policy_version 27740 (0.0007) [2023-03-07 08:11:56,296][155452] Updated weights for policy 0, policy_version 27750 (0.0006) [2023-03-07 08:11:57,076][155452] Updated weights for policy 0, policy_version 27760 (0.0006) [2023-03-07 08:11:57,850][155452] Updated weights for policy 0, policy_version 27770 (0.0006) [2023-03-07 08:11:58,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13039.0, 300 sec: 13051.7). Total num frames: 28442624. Throughput: 0: 13049.7. Samples: 28435674. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:11:58,367][155126] Avg episode reward: [(0, '1631.390')] [2023-03-07 08:11:58,641][155452] Updated weights for policy 0, policy_version 27780 (0.0006) [2023-03-07 08:11:59,433][155452] Updated weights for policy 0, policy_version 27790 (0.0006) [2023-03-07 08:12:00,233][155452] Updated weights for policy 0, policy_version 27800 (0.0006) [2023-03-07 08:12:01,010][155452] Updated weights for policy 0, policy_version 27810 (0.0006) [2023-03-07 08:12:01,793][155452] Updated weights for policy 0, policy_version 27820 (0.0006) [2023-03-07 08:12:02,571][155452] Updated weights for policy 0, policy_version 27830 (0.0005) [2023-03-07 08:12:03,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13039.0, 300 sec: 13048.2). Total num frames: 28507136. Throughput: 0: 13040.5. Samples: 28474555. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:12:03,375][155452] Updated weights for policy 0, policy_version 27840 (0.0006) [2023-03-07 08:12:03,378][155126] Avg episode reward: [(0, '1718.172')] [2023-03-07 08:12:04,161][155452] Updated weights for policy 0, policy_version 27850 (0.0007) [2023-03-07 08:12:04,928][155452] Updated weights for policy 0, policy_version 27860 (0.0006) [2023-03-07 08:12:05,699][155452] Updated weights for policy 0, policy_version 27870 (0.0007) [2023-03-07 08:12:06,485][155452] Updated weights for policy 0, policy_version 27880 (0.0006) [2023-03-07 08:12:07,275][155452] Updated weights for policy 0, policy_version 27890 (0.0006) [2023-03-07 08:12:08,047][155452] Updated weights for policy 0, policy_version 27900 (0.0006) [2023-03-07 08:12:08,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13048.2). Total num frames: 28572672. Throughput: 0: 13053.1. Samples: 28553172. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:12:08,367][155126] Avg episode reward: [(0, '1632.012')] [2023-03-07 08:12:08,835][155452] Updated weights for policy 0, policy_version 27910 (0.0006) [2023-03-07 08:12:09,630][155452] Updated weights for policy 0, policy_version 27920 (0.0006) [2023-03-07 08:12:10,413][155452] Updated weights for policy 0, policy_version 27930 (0.0006) [2023-03-07 08:12:11,202][155452] Updated weights for policy 0, policy_version 27940 (0.0006) [2023-03-07 08:12:11,991][155452] Updated weights for policy 0, policy_version 27950 (0.0006) [2023-03-07 08:12:12,775][155452] Updated weights for policy 0, policy_version 27960 (0.0007) [2023-03-07 08:12:13,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 28638208. Throughput: 0: 13045.1. Samples: 28631389. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:12:13,367][155126] Avg episode reward: [(0, '1636.010')] [2023-03-07 08:12:13,548][155452] Updated weights for policy 0, policy_version 27970 (0.0006) [2023-03-07 08:12:14,328][155452] Updated weights for policy 0, policy_version 27980 (0.0006) [2023-03-07 08:12:15,105][155452] Updated weights for policy 0, policy_version 27990 (0.0006) [2023-03-07 08:12:15,899][155452] Updated weights for policy 0, policy_version 28000 (0.0006) [2023-03-07 08:12:16,682][155452] Updated weights for policy 0, policy_version 28010 (0.0006) [2023-03-07 08:12:17,471][155452] Updated weights for policy 0, policy_version 28020 (0.0007) [2023-03-07 08:12:18,243][155452] Updated weights for policy 0, policy_version 28030 (0.0007) [2023-03-07 08:12:18,367][155126] Fps is (10 sec: 13107.0, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 28703744. Throughput: 0: 13047.6. Samples: 28670694. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:12:18,378][155126] Avg episode reward: [(0, '1664.347')] [2023-03-07 08:12:19,026][155452] Updated weights for policy 0, policy_version 28040 (0.0005) [2023-03-07 08:12:19,813][155452] Updated weights for policy 0, policy_version 28050 (0.0006) [2023-03-07 08:12:20,593][155452] Updated weights for policy 0, policy_version 28060 (0.0005) [2023-03-07 08:12:21,374][155452] Updated weights for policy 0, policy_version 28070 (0.0006) [2023-03-07 08:12:22,167][155452] Updated weights for policy 0, policy_version 28080 (0.0005) [2023-03-07 08:12:22,946][155452] Updated weights for policy 0, policy_version 28090 (0.0006) [2023-03-07 08:12:23,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 28769280. Throughput: 0: 13047.5. Samples: 28749176. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:12:23,368][155126] Avg episode reward: [(0, '1487.581')] [2023-03-07 08:12:23,717][155452] Updated weights for policy 0, policy_version 28100 (0.0006) [2023-03-07 08:12:24,514][155452] Updated weights for policy 0, policy_version 28110 (0.0005) [2023-03-07 08:12:25,306][155452] Updated weights for policy 0, policy_version 28120 (0.0006) [2023-03-07 08:12:26,095][155452] Updated weights for policy 0, policy_version 28130 (0.0006) [2023-03-07 08:12:26,878][155452] Updated weights for policy 0, policy_version 28140 (0.0006) [2023-03-07 08:12:27,681][155452] Updated weights for policy 0, policy_version 28150 (0.0006) [2023-03-07 08:12:28,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 28833792. Throughput: 0: 13037.8. Samples: 28827080. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:12:28,367][155126] Avg episode reward: [(0, '1392.162')] [2023-03-07 08:12:28,468][155452] Updated weights for policy 0, policy_version 28160 (0.0006) [2023-03-07 08:12:29,255][155452] Updated weights for policy 0, policy_version 28170 (0.0006) [2023-03-07 08:12:30,046][155452] Updated weights for policy 0, policy_version 28180 (0.0007) [2023-03-07 08:12:30,806][155452] Updated weights for policy 0, policy_version 28190 (0.0005) [2023-03-07 08:12:31,608][155452] Updated weights for policy 0, policy_version 28200 (0.0007) [2023-03-07 08:12:32,404][155452] Updated weights for policy 0, policy_version 28210 (0.0005) [2023-03-07 08:12:33,194][155452] Updated weights for policy 0, policy_version 28220 (0.0006) [2023-03-07 08:12:33,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 28899328. Throughput: 0: 13039.7. Samples: 28866285. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:12:33,368][155126] Avg episode reward: [(0, '1611.940')] [2023-03-07 08:12:33,985][155452] Updated weights for policy 0, policy_version 28230 (0.0006) [2023-03-07 08:12:34,769][155452] Updated weights for policy 0, policy_version 28240 (0.0006) [2023-03-07 08:12:35,572][155452] Updated weights for policy 0, policy_version 28250 (0.0006) [2023-03-07 08:12:36,369][155452] Updated weights for policy 0, policy_version 28260 (0.0007) [2023-03-07 08:12:37,133][155452] Updated weights for policy 0, policy_version 28270 (0.0006) [2023-03-07 08:12:37,921][155452] Updated weights for policy 0, policy_version 28280 (0.0006) [2023-03-07 08:12:38,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13048.2). Total num frames: 28963840. Throughput: 0: 13034.4. Samples: 28944081. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:12:38,367][155126] Avg episode reward: [(0, '1427.849')] [2023-03-07 08:12:38,695][155452] Updated weights for policy 0, policy_version 28290 (0.0006) [2023-03-07 08:12:39,486][155452] Updated weights for policy 0, policy_version 28300 (0.0006) [2023-03-07 08:12:40,261][155452] Updated weights for policy 0, policy_version 28310 (0.0006) [2023-03-07 08:12:41,037][155452] Updated weights for policy 0, policy_version 28320 (0.0006) [2023-03-07 08:12:41,831][155452] Updated weights for policy 0, policy_version 28330 (0.0006) [2023-03-07 08:12:42,618][155452] Updated weights for policy 0, policy_version 28340 (0.0006) [2023-03-07 08:12:43,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13039.0, 300 sec: 13048.2). Total num frames: 29029376. Throughput: 0: 13041.6. Samples: 29022545. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:12:43,368][155126] Avg episode reward: [(0, '1665.229')] [2023-03-07 08:12:43,399][155452] Updated weights for policy 0, policy_version 28350 (0.0007) [2023-03-07 08:12:44,185][155452] Updated weights for policy 0, policy_version 28360 (0.0006) [2023-03-07 08:12:44,978][155452] Updated weights for policy 0, policy_version 28370 (0.0006) [2023-03-07 08:12:45,755][155452] Updated weights for policy 0, policy_version 28380 (0.0006) [2023-03-07 08:12:46,557][155452] Updated weights for policy 0, policy_version 28390 (0.0006) [2023-03-07 08:12:47,331][155452] Updated weights for policy 0, policy_version 28400 (0.0006) [2023-03-07 08:12:48,115][155452] Updated weights for policy 0, policy_version 28410 (0.0006) [2023-03-07 08:12:48,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13048.2). Total num frames: 29094912. Throughput: 0: 13047.6. Samples: 29061699. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:12:48,368][155126] Avg episode reward: [(0, '1653.676')] [2023-03-07 08:12:48,894][155452] Updated weights for policy 0, policy_version 28420 (0.0006) [2023-03-07 08:12:49,674][155452] Updated weights for policy 0, policy_version 28430 (0.0006) [2023-03-07 08:12:50,464][155452] Updated weights for policy 0, policy_version 28440 (0.0006) [2023-03-07 08:12:51,243][155452] Updated weights for policy 0, policy_version 28450 (0.0006) [2023-03-07 08:12:52,018][155452] Updated weights for policy 0, policy_version 28460 (0.0006) [2023-03-07 08:12:52,811][155452] Updated weights for policy 0, policy_version 28470 (0.0006) [2023-03-07 08:12:53,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13044.7). Total num frames: 29159424. Throughput: 0: 13040.8. Samples: 29140009. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:12:53,367][155126] Avg episode reward: [(0, '1557.911')] [2023-03-07 08:12:53,603][155452] Updated weights for policy 0, policy_version 28480 (0.0006) [2023-03-07 08:12:54,386][155452] Updated weights for policy 0, policy_version 28490 (0.0006) [2023-03-07 08:12:55,178][155452] Updated weights for policy 0, policy_version 28500 (0.0006) [2023-03-07 08:12:55,983][155452] Updated weights for policy 0, policy_version 28510 (0.0007) [2023-03-07 08:12:56,758][155452] Updated weights for policy 0, policy_version 28520 (0.0006) [2023-03-07 08:12:57,561][155452] Updated weights for policy 0, policy_version 28530 (0.0005) [2023-03-07 08:12:58,340][155452] Updated weights for policy 0, policy_version 28540 (0.0006) [2023-03-07 08:12:58,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13038.9, 300 sec: 13048.2). Total num frames: 29224960. Throughput: 0: 13034.6. Samples: 29217945. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:12:58,367][155126] Avg episode reward: [(0, '1745.156')] [2023-03-07 08:12:59,136][155452] Updated weights for policy 0, policy_version 28550 (0.0006) [2023-03-07 08:12:59,921][155452] Updated weights for policy 0, policy_version 28560 (0.0006) [2023-03-07 08:13:00,722][155452] Updated weights for policy 0, policy_version 28570 (0.0006) [2023-03-07 08:13:01,500][155452] Updated weights for policy 0, policy_version 28580 (0.0006) [2023-03-07 08:13:02,292][155452] Updated weights for policy 0, policy_version 28590 (0.0007) [2023-03-07 08:13:03,076][155452] Updated weights for policy 0, policy_version 28600 (0.0006) [2023-03-07 08:13:03,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13044.7). Total num frames: 29289472. Throughput: 0: 13027.5. Samples: 29256930. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:13:03,367][155126] Avg episode reward: [(0, '1728.638')] [2023-03-07 08:13:03,849][155452] Updated weights for policy 0, policy_version 28610 (0.0006) [2023-03-07 08:13:04,612][155452] Updated weights for policy 0, policy_version 28620 (0.0006) [2023-03-07 08:13:05,414][155452] Updated weights for policy 0, policy_version 28630 (0.0007) [2023-03-07 08:13:06,206][155452] Updated weights for policy 0, policy_version 28640 (0.0007) [2023-03-07 08:13:06,989][155452] Updated weights for policy 0, policy_version 28650 (0.0006) [2023-03-07 08:13:07,769][155452] Updated weights for policy 0, policy_version 28660 (0.0006) [2023-03-07 08:13:08,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13038.9, 300 sec: 13044.7). Total num frames: 29355008. Throughput: 0: 13020.2. Samples: 29335084. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 08:13:08,367][155126] Avg episode reward: [(0, '1799.958')] [2023-03-07 08:13:08,557][155452] Updated weights for policy 0, policy_version 28670 (0.0006) [2023-03-07 08:13:09,345][155452] Updated weights for policy 0, policy_version 28680 (0.0006) [2023-03-07 08:13:10,126][155452] Updated weights for policy 0, policy_version 28690 (0.0005) [2023-03-07 08:13:10,918][155452] Updated weights for policy 0, policy_version 28700 (0.0007) [2023-03-07 08:13:11,709][155452] Updated weights for policy 0, policy_version 28710 (0.0007) [2023-03-07 08:13:12,534][155452] Updated weights for policy 0, policy_version 28720 (0.0007) [2023-03-07 08:13:13,315][155452] Updated weights for policy 0, policy_version 28730 (0.0006) [2023-03-07 08:13:13,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13044.7). Total num frames: 29419520. Throughput: 0: 13018.2. Samples: 29412900. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 08:13:13,367][155126] Avg episode reward: [(0, '1670.994')] [2023-03-07 08:13:14,085][155452] Updated weights for policy 0, policy_version 28740 (0.0006) [2023-03-07 08:13:14,890][155452] Updated weights for policy 0, policy_version 28750 (0.0006) [2023-03-07 08:13:15,656][155452] Updated weights for policy 0, policy_version 28760 (0.0006) [2023-03-07 08:13:16,456][155452] Updated weights for policy 0, policy_version 28770 (0.0006) [2023-03-07 08:13:17,240][155452] Updated weights for policy 0, policy_version 28780 (0.0006) [2023-03-07 08:13:18,017][155452] Updated weights for policy 0, policy_version 28790 (0.0006) [2023-03-07 08:13:18,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13044.7). Total num frames: 29485056. Throughput: 0: 13014.1. Samples: 29451919. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 08:13:18,367][155126] Avg episode reward: [(0, '1900.959')] [2023-03-07 08:13:18,801][155452] Updated weights for policy 0, policy_version 28800 (0.0006) [2023-03-07 08:13:19,584][155452] Updated weights for policy 0, policy_version 28810 (0.0006) [2023-03-07 08:13:20,361][155452] Updated weights for policy 0, policy_version 28820 (0.0007) [2023-03-07 08:13:21,156][155452] Updated weights for policy 0, policy_version 28830 (0.0006) [2023-03-07 08:13:21,946][155452] Updated weights for policy 0, policy_version 28840 (0.0006) [2023-03-07 08:13:22,715][155452] Updated weights for policy 0, policy_version 28850 (0.0006) [2023-03-07 08:13:23,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13021.9, 300 sec: 13044.7). Total num frames: 29550592. Throughput: 0: 13025.0. Samples: 29530204. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 08:13:23,367][155126] Avg episode reward: [(0, '1786.672')] [2023-03-07 08:13:23,508][155452] Updated weights for policy 0, policy_version 28860 (0.0006) [2023-03-07 08:13:24,290][155452] Updated weights for policy 0, policy_version 28870 (0.0006) [2023-03-07 08:13:25,082][155452] Updated weights for policy 0, policy_version 28880 (0.0006) [2023-03-07 08:13:25,865][155452] Updated weights for policy 0, policy_version 28890 (0.0006) [2023-03-07 08:13:26,657][155452] Updated weights for policy 0, policy_version 28900 (0.0006) [2023-03-07 08:13:27,437][155452] Updated weights for policy 0, policy_version 28910 (0.0006) [2023-03-07 08:13:28,229][155452] Updated weights for policy 0, policy_version 28920 (0.0005) [2023-03-07 08:13:28,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13041.2). Total num frames: 29615104. Throughput: 0: 13018.5. Samples: 29608376. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:13:28,367][155126] Avg episode reward: [(0, '1540.062')] [2023-03-07 08:13:28,372][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000028921_29615104.pth... 
[2023-03-07 08:13:28,401][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000025866_26486784.pth [2023-03-07 08:13:29,016][155452] Updated weights for policy 0, policy_version 28930 (0.0005) [2023-03-07 08:13:29,799][155452] Updated weights for policy 0, policy_version 28940 (0.0007) [2023-03-07 08:13:30,581][155452] Updated weights for policy 0, policy_version 28950 (0.0006) [2023-03-07 08:13:31,380][155452] Updated weights for policy 0, policy_version 28960 (0.0006) [2023-03-07 08:13:32,171][155452] Updated weights for policy 0, policy_version 28970 (0.0006) [2023-03-07 08:13:32,948][155452] Updated weights for policy 0, policy_version 28980 (0.0007) [2023-03-07 08:13:33,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13021.9, 300 sec: 13041.2). Total num frames: 29680640. Throughput: 0: 13018.8. Samples: 29647547. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:13:33,368][155126] Avg episode reward: [(0, '1732.727')] [2023-03-07 08:13:33,730][155452] Updated weights for policy 0, policy_version 28990 (0.0006) [2023-03-07 08:13:34,530][155452] Updated weights for policy 0, policy_version 29000 (0.0006) [2023-03-07 08:13:35,322][155452] Updated weights for policy 0, policy_version 29010 (0.0006) [2023-03-07 08:13:36,090][155452] Updated weights for policy 0, policy_version 29020 (0.0006) [2023-03-07 08:13:36,871][155452] Updated weights for policy 0, policy_version 29030 (0.0006) [2023-03-07 08:13:37,648][155452] Updated weights for policy 0, policy_version 29040 (0.0006) [2023-03-07 08:13:38,367][155126] Fps is (10 sec: 13107.4, 60 sec: 13038.9, 300 sec: 13041.2). Total num frames: 29746176. Throughput: 0: 13016.0. Samples: 29725730. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:13:38,367][155126] Avg episode reward: [(0, '1674.025')] [2023-03-07 08:13:38,434][155452] Updated weights for policy 0, policy_version 29050 (0.0006) [2023-03-07 08:13:39,211][155452] Updated weights for policy 0, policy_version 29060 (0.0007) [2023-03-07 08:13:39,985][155452] Updated weights for policy 0, policy_version 29070 (0.0006) [2023-03-07 08:13:40,766][155452] Updated weights for policy 0, policy_version 29080 (0.0006) [2023-03-07 08:13:41,554][155452] Updated weights for policy 0, policy_version 29090 (0.0006) [2023-03-07 08:13:42,331][155452] Updated weights for policy 0, policy_version 29100 (0.0006) [2023-03-07 08:13:43,134][155452] Updated weights for policy 0, policy_version 29110 (0.0006) [2023-03-07 08:13:43,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13021.9, 300 sec: 13041.2). Total num frames: 29810688. Throughput: 0: 13029.5. Samples: 29804271. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:13:43,367][155126] Avg episode reward: [(0, '1717.231')] [2023-03-07 08:13:43,910][155452] Updated weights for policy 0, policy_version 29120 (0.0006) [2023-03-07 08:13:44,689][155452] Updated weights for policy 0, policy_version 29130 (0.0006) [2023-03-07 08:13:45,483][155452] Updated weights for policy 0, policy_version 29140 (0.0007) [2023-03-07 08:13:46,262][155452] Updated weights for policy 0, policy_version 29150 (0.0006) [2023-03-07 08:13:47,056][155452] Updated weights for policy 0, policy_version 29160 (0.0006) [2023-03-07 08:13:47,857][155452] Updated weights for policy 0, policy_version 29170 (0.0006) [2023-03-07 08:13:48,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13041.3). Total num frames: 29876224. Throughput: 0: 13031.7. Samples: 29843354. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:13:48,367][155126] Avg episode reward: [(0, '1653.848')] [2023-03-07 08:13:48,640][155452] Updated weights for policy 0, policy_version 29180 (0.0006) [2023-03-07 08:13:49,417][155452] Updated weights for policy 0, policy_version 29190 (0.0006) [2023-03-07 08:13:50,218][155452] Updated weights for policy 0, policy_version 29200 (0.0007) [2023-03-07 08:13:50,978][155452] Updated weights for policy 0, policy_version 29210 (0.0006) [2023-03-07 08:13:51,782][155452] Updated weights for policy 0, policy_version 29220 (0.0007) [2023-03-07 08:13:52,562][155452] Updated weights for policy 0, policy_version 29230 (0.0006) [2023-03-07 08:13:53,349][155452] Updated weights for policy 0, policy_version 29240 (0.0007) [2023-03-07 08:13:53,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13041.2). Total num frames: 29941760. Throughput: 0: 13032.2. Samples: 29921533. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:13:53,367][155126] Avg episode reward: [(0, '1810.682')] [2023-03-07 08:13:54,111][155452] Updated weights for policy 0, policy_version 29250 (0.0006) [2023-03-07 08:13:54,894][155452] Updated weights for policy 0, policy_version 29260 (0.0006) [2023-03-07 08:13:55,685][155452] Updated weights for policy 0, policy_version 29270 (0.0007) [2023-03-07 08:13:56,469][155452] Updated weights for policy 0, policy_version 29280 (0.0006) [2023-03-07 08:13:57,246][155452] Updated weights for policy 0, policy_version 29290 (0.0006) [2023-03-07 08:13:58,048][155452] Updated weights for policy 0, policy_version 29300 (0.0006) [2023-03-07 08:13:58,367][155126] Fps is (10 sec: 13107.0, 60 sec: 13038.9, 300 sec: 13041.2). Total num frames: 30007296. Throughput: 0: 13046.7. Samples: 30000003. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:13:58,368][155126] Avg episode reward: [(0, '1955.080')] [2023-03-07 08:13:58,827][155452] Updated weights for policy 0, policy_version 29310 (0.0006) [2023-03-07 08:13:59,604][155452] Updated weights for policy 0, policy_version 29320 (0.0006) [2023-03-07 08:14:00,388][155452] Updated weights for policy 0, policy_version 29330 (0.0006) [2023-03-07 08:14:01,191][155452] Updated weights for policy 0, policy_version 29340 (0.0006) [2023-03-07 08:14:01,977][155452] Updated weights for policy 0, policy_version 29350 (0.0006) [2023-03-07 08:14:02,750][155452] Updated weights for policy 0, policy_version 29360 (0.0006) [2023-03-07 08:14:03,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13041.2). Total num frames: 30071808. Throughput: 0: 13047.3. Samples: 30039048. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:14:03,367][155126] Avg episode reward: [(0, '1931.592')] [2023-03-07 08:14:03,529][155452] Updated weights for policy 0, policy_version 29370 (0.0007) [2023-03-07 08:14:04,309][155452] Updated weights for policy 0, policy_version 29380 (0.0006) [2023-03-07 08:14:05,099][155452] Updated weights for policy 0, policy_version 29390 (0.0006) [2023-03-07 08:14:05,897][155452] Updated weights for policy 0, policy_version 29400 (0.0006) [2023-03-07 08:14:06,678][155452] Updated weights for policy 0, policy_version 29410 (0.0006) [2023-03-07 08:14:07,471][155452] Updated weights for policy 0, policy_version 29420 (0.0006) [2023-03-07 08:14:08,281][155452] Updated weights for policy 0, policy_version 29430 (0.0006) [2023-03-07 08:14:08,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13041.2). Total num frames: 30137344. Throughput: 0: 13045.7. Samples: 30117264. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:14:08,368][155126] Avg episode reward: [(0, '1720.424')] [2023-03-07 08:14:09,046][155452] Updated weights for policy 0, policy_version 29440 (0.0005) [2023-03-07 08:14:09,834][155452] Updated weights for policy 0, policy_version 29450 (0.0006) [2023-03-07 08:14:10,606][155452] Updated weights for policy 0, policy_version 29460 (0.0006) [2023-03-07 08:14:11,383][155452] Updated weights for policy 0, policy_version 29470 (0.0006) [2023-03-07 08:14:12,161][155452] Updated weights for policy 0, policy_version 29480 (0.0006) [2023-03-07 08:14:12,948][155452] Updated weights for policy 0, policy_version 29490 (0.0007) [2023-03-07 08:14:13,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 30201856. Throughput: 0: 13048.4. Samples: 30195553. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:14:13,367][155126] Avg episode reward: [(0, '1770.595')] [2023-03-07 08:14:13,734][155452] Updated weights for policy 0, policy_version 29500 (0.0007) [2023-03-07 08:14:14,515][155452] Updated weights for policy 0, policy_version 29510 (0.0006) [2023-03-07 08:14:15,308][155452] Updated weights for policy 0, policy_version 29520 (0.0006) [2023-03-07 08:14:16,102][155452] Updated weights for policy 0, policy_version 29530 (0.0006) [2023-03-07 08:14:16,913][155452] Updated weights for policy 0, policy_version 29540 (0.0007) [2023-03-07 08:14:17,671][155452] Updated weights for policy 0, policy_version 29550 (0.0006) [2023-03-07 08:14:18,367][155126] Fps is (10 sec: 13107.4, 60 sec: 13056.0, 300 sec: 13041.2). Total num frames: 30268416. Throughput: 0: 13046.8. Samples: 30234651. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:14:18,367][155126] Avg episode reward: [(0, '1927.108')] [2023-03-07 08:14:18,458][155452] Updated weights for policy 0, policy_version 29560 (0.0006) [2023-03-07 08:14:19,257][155452] Updated weights for policy 0, policy_version 29570 (0.0006) [2023-03-07 08:14:20,040][155452] Updated weights for policy 0, policy_version 29580 (0.0006) [2023-03-07 08:14:20,813][155452] Updated weights for policy 0, policy_version 29590 (0.0006) [2023-03-07 08:14:21,626][155452] Updated weights for policy 0, policy_version 29600 (0.0006) [2023-03-07 08:14:22,398][155452] Updated weights for policy 0, policy_version 29610 (0.0006) [2023-03-07 08:14:23,199][155452] Updated weights for policy 0, policy_version 29620 (0.0006) [2023-03-07 08:14:23,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 30332928. Throughput: 0: 13044.8. Samples: 30312749. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 08:14:23,367][155126] Avg episode reward: [(0, '1905.505')] [2023-03-07 08:14:23,994][155452] Updated weights for policy 0, policy_version 29630 (0.0006) [2023-03-07 08:14:24,790][155452] Updated weights for policy 0, policy_version 29640 (0.0006) [2023-03-07 08:14:25,569][155452] Updated weights for policy 0, policy_version 29650 (0.0006) [2023-03-07 08:14:26,320][155452] Updated weights for policy 0, policy_version 29660 (0.0007) [2023-03-07 08:14:27,110][155452] Updated weights for policy 0, policy_version 29670 (0.0006) [2023-03-07 08:14:27,898][155452] Updated weights for policy 0, policy_version 29680 (0.0006) [2023-03-07 08:14:28,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13041.3). Total num frames: 30398464. Throughput: 0: 13037.8. Samples: 30390969. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 08:14:28,367][155126] Avg episode reward: [(0, '1652.651')] [2023-03-07 08:14:28,680][155452] Updated weights for policy 0, policy_version 29690 (0.0006) [2023-03-07 08:14:29,466][155452] Updated weights for policy 0, policy_version 29700 (0.0007) [2023-03-07 08:14:30,268][155452] Updated weights for policy 0, policy_version 29710 (0.0007) [2023-03-07 08:14:31,054][155452] Updated weights for policy 0, policy_version 29720 (0.0008) [2023-03-07 08:14:31,845][155452] Updated weights for policy 0, policy_version 29730 (0.0006) [2023-03-07 08:14:32,612][155452] Updated weights for policy 0, policy_version 29740 (0.0007) [2023-03-07 08:14:33,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13039.0, 300 sec: 13041.2). Total num frames: 30462976. Throughput: 0: 13038.7. Samples: 30430096. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 08:14:33,367][155126] Avg episode reward: [(0, '1995.812')] [2023-03-07 08:14:33,392][155452] Updated weights for policy 0, policy_version 29750 (0.0006) [2023-03-07 08:14:34,174][155452] Updated weights for policy 0, policy_version 29760 (0.0006) [2023-03-07 08:14:34,952][155452] Updated weights for policy 0, policy_version 29770 (0.0006) [2023-03-07 08:14:35,742][155452] Updated weights for policy 0, policy_version 29780 (0.0007) [2023-03-07 08:14:36,529][155452] Updated weights for policy 0, policy_version 29790 (0.0007) [2023-03-07 08:14:37,309][155452] Updated weights for policy 0, policy_version 29800 (0.0007) [2023-03-07 08:14:38,096][155452] Updated weights for policy 0, policy_version 29810 (0.0006) [2023-03-07 08:14:38,367][155126] Fps is (10 sec: 13004.5, 60 sec: 13038.9, 300 sec: 13041.2). Total num frames: 30528512. Throughput: 0: 13040.7. Samples: 30508365. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 08:14:38,368][155126] Avg episode reward: [(0, '1689.945')] [2023-03-07 08:14:38,875][155452] Updated weights for policy 0, policy_version 29820 (0.0006) [2023-03-07 08:14:39,666][155452] Updated weights for policy 0, policy_version 29830 (0.0006) [2023-03-07 08:14:40,444][155452] Updated weights for policy 0, policy_version 29840 (0.0006) [2023-03-07 08:14:41,210][155452] Updated weights for policy 0, policy_version 29850 (0.0006) [2023-03-07 08:14:41,996][155452] Updated weights for policy 0, policy_version 29860 (0.0006) [2023-03-07 08:14:42,781][155452] Updated weights for policy 0, policy_version 29870 (0.0006) [2023-03-07 08:14:43,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13041.2). Total num frames: 30594048. Throughput: 0: 13042.9. Samples: 30586934. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:14:43,367][155126] Avg episode reward: [(0, '1687.947')] [2023-03-07 08:14:43,574][155452] Updated weights for policy 0, policy_version 29880 (0.0006) [2023-03-07 08:14:44,362][155452] Updated weights for policy 0, policy_version 29890 (0.0006) [2023-03-07 08:14:45,170][155452] Updated weights for policy 0, policy_version 29900 (0.0006) [2023-03-07 08:14:45,958][155452] Updated weights for policy 0, policy_version 29910 (0.0007) [2023-03-07 08:14:46,753][155452] Updated weights for policy 0, policy_version 29920 (0.0006) [2023-03-07 08:14:47,530][155452] Updated weights for policy 0, policy_version 29930 (0.0007) [2023-03-07 08:14:48,317][155452] Updated weights for policy 0, policy_version 29940 (0.0006) [2023-03-07 08:14:48,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 30658560. Throughput: 0: 13040.8. Samples: 30625886. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:14:48,367][155126] Avg episode reward: [(0, '1755.904')] [2023-03-07 08:14:49,103][155452] Updated weights for policy 0, policy_version 29950 (0.0006) [2023-03-07 08:14:49,889][155452] Updated weights for policy 0, policy_version 29960 (0.0006) [2023-03-07 08:14:50,664][155452] Updated weights for policy 0, policy_version 29970 (0.0005) [2023-03-07 08:14:51,456][155452] Updated weights for policy 0, policy_version 29980 (0.0007) [2023-03-07 08:14:52,256][155452] Updated weights for policy 0, policy_version 29990 (0.0005) [2023-03-07 08:14:53,048][155452] Updated weights for policy 0, policy_version 30000 (0.0007) [2023-03-07 08:14:53,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 30724096. Throughput: 0: 13034.7. Samples: 30703823. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:14:53,368][155126] Avg episode reward: [(0, '1933.856')] [2023-03-07 08:14:53,851][155452] Updated weights for policy 0, policy_version 30010 (0.0006) [2023-03-07 08:14:54,618][155452] Updated weights for policy 0, policy_version 30020 (0.0006) [2023-03-07 08:14:55,410][155452] Updated weights for policy 0, policy_version 30030 (0.0006) [2023-03-07 08:14:56,195][155452] Updated weights for policy 0, policy_version 30040 (0.0006) [2023-03-07 08:14:56,993][155452] Updated weights for policy 0, policy_version 30050 (0.0006) [2023-03-07 08:14:57,794][155452] Updated weights for policy 0, policy_version 30060 (0.0006) [2023-03-07 08:14:58,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13037.8). Total num frames: 30788608. Throughput: 0: 13019.2. Samples: 30781419. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:14:58,367][155126] Avg episode reward: [(0, '1862.962')] [2023-03-07 08:14:58,589][155452] Updated weights for policy 0, policy_version 30070 (0.0006) [2023-03-07 08:14:59,376][155452] Updated weights for policy 0, policy_version 30080 (0.0006) [2023-03-07 08:15:00,174][155452] Updated weights for policy 0, policy_version 30090 (0.0006) [2023-03-07 08:15:00,944][155452] Updated weights for policy 0, policy_version 30100 (0.0006) [2023-03-07 08:15:01,722][155452] Updated weights for policy 0, policy_version 30110 (0.0007) [2023-03-07 08:15:02,517][155452] Updated weights for policy 0, policy_version 30120 (0.0007) [2023-03-07 08:15:03,299][155452] Updated weights for policy 0, policy_version 30130 (0.0006) [2023-03-07 08:15:03,367][155126] Fps is (10 sec: 12902.4, 60 sec: 13021.8, 300 sec: 13034.3). Total num frames: 30853120. Throughput: 0: 13017.9. Samples: 30820458. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:15:03,368][155126] Avg episode reward: [(0, '1690.710')] [2023-03-07 08:15:04,112][155452] Updated weights for policy 0, policy_version 30140 (0.0006) [2023-03-07 08:15:04,883][155452] Updated weights for policy 0, policy_version 30150 (0.0006) [2023-03-07 08:15:05,668][155452] Updated weights for policy 0, policy_version 30160 (0.0006) [2023-03-07 08:15:06,455][155452] Updated weights for policy 0, policy_version 30170 (0.0006) [2023-03-07 08:15:07,246][155452] Updated weights for policy 0, policy_version 30180 (0.0006) [2023-03-07 08:15:08,022][155452] Updated weights for policy 0, policy_version 30190 (0.0006) [2023-03-07 08:15:08,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 30918656. Throughput: 0: 13017.3. Samples: 30898527. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:15:08,367][155126] Avg episode reward: [(0, '1825.776')] [2023-03-07 08:15:08,822][155452] Updated weights for policy 0, policy_version 30200 (0.0006) [2023-03-07 08:15:09,595][155452] Updated weights for policy 0, policy_version 30210 (0.0006) [2023-03-07 08:15:10,378][155452] Updated weights for policy 0, policy_version 30220 (0.0007) [2023-03-07 08:15:11,173][155452] Updated weights for policy 0, policy_version 30230 (0.0006) [2023-03-07 08:15:11,935][155452] Updated weights for policy 0, policy_version 30240 (0.0006) [2023-03-07 08:15:12,716][155452] Updated weights for policy 0, policy_version 30250 (0.0006) [2023-03-07 08:15:13,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 30984192. Throughput: 0: 13020.0. Samples: 30976871. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:15:13,378][155126] Avg episode reward: [(0, '1885.676')] [2023-03-07 08:15:13,513][155452] Updated weights for policy 0, policy_version 30260 (0.0006) [2023-03-07 08:15:14,299][155452] Updated weights for policy 0, policy_version 30270 (0.0005) [2023-03-07 08:15:15,089][155452] Updated weights for policy 0, policy_version 30280 (0.0006) [2023-03-07 08:15:15,877][155452] Updated weights for policy 0, policy_version 30290 (0.0007) [2023-03-07 08:15:16,652][155452] Updated weights for policy 0, policy_version 30300 (0.0007) [2023-03-07 08:15:17,434][155452] Updated weights for policy 0, policy_version 30310 (0.0006) [2023-03-07 08:15:18,217][155452] Updated weights for policy 0, policy_version 30320 (0.0006) [2023-03-07 08:15:18,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13004.8, 300 sec: 13034.3). Total num frames: 31048704. Throughput: 0: 13018.4. Samples: 31015923. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:15:18,367][155126] Avg episode reward: [(0, '1879.546')] [2023-03-07 08:15:19,005][155452] Updated weights for policy 0, policy_version 30330 (0.0007) [2023-03-07 08:15:19,792][155452] Updated weights for policy 0, policy_version 30340 (0.0006) [2023-03-07 08:15:20,574][155452] Updated weights for policy 0, policy_version 30350 (0.0007) [2023-03-07 08:15:21,360][155452] Updated weights for policy 0, policy_version 30360 (0.0006) [2023-03-07 08:15:22,154][155452] Updated weights for policy 0, policy_version 30370 (0.0006) [2023-03-07 08:15:22,929][155452] Updated weights for policy 0, policy_version 30380 (0.0006) [2023-03-07 08:15:23,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 31114240. Throughput: 0: 13018.1. Samples: 31094178. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:15:23,367][155126] Avg episode reward: [(0, '1803.341')] [2023-03-07 08:15:23,706][155452] Updated weights for policy 0, policy_version 30390 (0.0007) [2023-03-07 08:15:24,489][155452] Updated weights for policy 0, policy_version 30400 (0.0006) [2023-03-07 08:15:25,274][155452] Updated weights for policy 0, policy_version 30410 (0.0007) [2023-03-07 08:15:26,074][155452] Updated weights for policy 0, policy_version 30420 (0.0007) [2023-03-07 08:15:26,852][155452] Updated weights for policy 0, policy_version 30430 (0.0006) [2023-03-07 08:15:27,640][155452] Updated weights for policy 0, policy_version 30440 (0.0006) [2023-03-07 08:15:28,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13021.8, 300 sec: 13037.8). Total num frames: 31179776. Throughput: 0: 13014.9. Samples: 31172603. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:15:28,367][155126] Avg episode reward: [(0, '1660.964')] [2023-03-07 08:15:28,371][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000030449_31179776.pth... 
[2023-03-07 08:15:28,402][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000027394_28051456.pth [2023-03-07 08:15:28,421][155452] Updated weights for policy 0, policy_version 30450 (0.0007) [2023-03-07 08:15:29,199][155452] Updated weights for policy 0, policy_version 30460 (0.0006) [2023-03-07 08:15:29,981][155452] Updated weights for policy 0, policy_version 30470 (0.0007) [2023-03-07 08:15:30,761][155452] Updated weights for policy 0, policy_version 30480 (0.0006) [2023-03-07 08:15:31,548][155452] Updated weights for policy 0, policy_version 30490 (0.0006) [2023-03-07 08:15:32,319][155452] Updated weights for policy 0, policy_version 30500 (0.0007) [2023-03-07 08:15:33,100][155452] Updated weights for policy 0, policy_version 30510 (0.0006) [2023-03-07 08:15:33,367][155126] Fps is (10 sec: 13107.0, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 31245312. Throughput: 0: 13023.6. Samples: 31211949. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:15:33,368][155126] Avg episode reward: [(0, '1647.515')] [2023-03-07 08:15:33,905][155452] Updated weights for policy 0, policy_version 30520 (0.0007) [2023-03-07 08:15:34,677][155452] Updated weights for policy 0, policy_version 30530 (0.0006) [2023-03-07 08:15:35,448][155452] Updated weights for policy 0, policy_version 30540 (0.0007) [2023-03-07 08:15:36,250][155452] Updated weights for policy 0, policy_version 30550 (0.0006) [2023-03-07 08:15:37,023][155452] Updated weights for policy 0, policy_version 30560 (0.0006) [2023-03-07 08:15:37,804][155452] Updated weights for policy 0, policy_version 30570 (0.0006) [2023-03-07 08:15:38,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13039.0, 300 sec: 13037.8). Total num frames: 31310848. Throughput: 0: 13037.1. Samples: 31290493. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:15:38,367][155126] Avg episode reward: [(0, '1698.536')] [2023-03-07 08:15:38,585][155452] Updated weights for policy 0, policy_version 30580 (0.0007) [2023-03-07 08:15:39,365][155452] Updated weights for policy 0, policy_version 30590 (0.0006) [2023-03-07 08:15:40,150][155452] Updated weights for policy 0, policy_version 30600 (0.0006) [2023-03-07 08:15:40,926][155452] Updated weights for policy 0, policy_version 30610 (0.0006) [2023-03-07 08:15:41,723][155452] Updated weights for policy 0, policy_version 30620 (0.0006) [2023-03-07 08:15:42,493][155452] Updated weights for policy 0, policy_version 30630 (0.0006) [2023-03-07 08:15:43,289][155452] Updated weights for policy 0, policy_version 30640 (0.0007) [2023-03-07 08:15:43,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 31376384. Throughput: 0: 13056.5. Samples: 31368959. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:15:43,367][155126] Avg episode reward: [(0, '1809.677')] [2023-03-07 08:15:44,058][155452] Updated weights for policy 0, policy_version 30650 (0.0006) [2023-03-07 08:15:44,841][155452] Updated weights for policy 0, policy_version 30660 (0.0006) [2023-03-07 08:15:45,632][155452] Updated weights for policy 0, policy_version 30670 (0.0006) [2023-03-07 08:15:46,418][155452] Updated weights for policy 0, policy_version 30680 (0.0006) [2023-03-07 08:15:47,200][155452] Updated weights for policy 0, policy_version 30690 (0.0006) [2023-03-07 08:15:47,989][155452] Updated weights for policy 0, policy_version 30700 (0.0007) [2023-03-07 08:15:48,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13039.0, 300 sec: 13037.8). Total num frames: 31440896. Throughput: 0: 13060.8. Samples: 31408194. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:15:48,367][155126] Avg episode reward: [(0, '1825.967')] [2023-03-07 08:15:48,772][155452] Updated weights for policy 0, policy_version 30710 (0.0006) [2023-03-07 08:15:49,545][155452] Updated weights for policy 0, policy_version 30720 (0.0006) [2023-03-07 08:15:50,337][155452] Updated weights for policy 0, policy_version 30730 (0.0006) [2023-03-07 08:15:51,098][155452] Updated weights for policy 0, policy_version 30740 (0.0006) [2023-03-07 08:15:51,877][155452] Updated weights for policy 0, policy_version 30750 (0.0006) [2023-03-07 08:15:52,684][155452] Updated weights for policy 0, policy_version 30760 (0.0006) [2023-03-07 08:15:53,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13039.0, 300 sec: 13037.8). Total num frames: 31506432. Throughput: 0: 13073.8. Samples: 31486849. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:15:53,367][155126] Avg episode reward: [(0, '1650.372')] [2023-03-07 08:15:53,451][155452] Updated weights for policy 0, policy_version 30770 (0.0006) [2023-03-07 08:15:54,250][155452] Updated weights for policy 0, policy_version 30780 (0.0007) [2023-03-07 08:15:55,021][155452] Updated weights for policy 0, policy_version 30790 (0.0007) [2023-03-07 08:15:55,826][155452] Updated weights for policy 0, policy_version 30800 (0.0006) [2023-03-07 08:15:56,603][155452] Updated weights for policy 0, policy_version 30810 (0.0006) [2023-03-07 08:15:57,396][155452] Updated weights for policy 0, policy_version 30820 (0.0006) [2023-03-07 08:15:58,189][155452] Updated weights for policy 0, policy_version 30830 (0.0006) [2023-03-07 08:15:58,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13041.2). Total num frames: 31571968. Throughput: 0: 13065.6. Samples: 31564823. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:15:58,368][155126] Avg episode reward: [(0, '1758.930')] [2023-03-07 08:15:58,964][155452] Updated weights for policy 0, policy_version 30840 (0.0006) [2023-03-07 08:15:59,762][155452] Updated weights for policy 0, policy_version 30850 (0.0006) [2023-03-07 08:16:00,536][155452] Updated weights for policy 0, policy_version 30860 (0.0006) [2023-03-07 08:16:01,326][155452] Updated weights for policy 0, policy_version 30870 (0.0007) [2023-03-07 08:16:02,101][155452] Updated weights for policy 0, policy_version 30880 (0.0007) [2023-03-07 08:16:02,905][155452] Updated weights for policy 0, policy_version 30890 (0.0006) [2023-03-07 08:16:03,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13041.3). Total num frames: 31637504. Throughput: 0: 13066.7. Samples: 31603924. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:16:03,367][155126] Avg episode reward: [(0, '1587.590')] [2023-03-07 08:16:03,676][155452] Updated weights for policy 0, policy_version 30900 (0.0007) [2023-03-07 08:16:04,446][155452] Updated weights for policy 0, policy_version 30910 (0.0006) [2023-03-07 08:16:05,242][155452] Updated weights for policy 0, policy_version 30920 (0.0006) [2023-03-07 08:16:06,008][155452] Updated weights for policy 0, policy_version 30930 (0.0006) [2023-03-07 08:16:06,798][155452] Updated weights for policy 0, policy_version 30940 (0.0006) [2023-03-07 08:16:07,559][155452] Updated weights for policy 0, policy_version 30950 (0.0005) [2023-03-07 08:16:08,346][155452] Updated weights for policy 0, policy_version 30960 (0.0007) [2023-03-07 08:16:08,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13073.0, 300 sec: 13044.7). Total num frames: 31703040. Throughput: 0: 13076.8. Samples: 31682637. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:16:08,367][155126] Avg episode reward: [(0, '1464.736')] [2023-03-07 08:16:09,151][155452] Updated weights for policy 0, policy_version 30970 (0.0005) [2023-03-07 08:16:09,924][155452] Updated weights for policy 0, policy_version 30980 (0.0007) [2023-03-07 08:16:10,727][155452] Updated weights for policy 0, policy_version 30990 (0.0006) [2023-03-07 08:16:11,505][155452] Updated weights for policy 0, policy_version 31000 (0.0006) [2023-03-07 08:16:12,287][155452] Updated weights for policy 0, policy_version 31010 (0.0006) [2023-03-07 08:16:13,072][155452] Updated weights for policy 0, policy_version 31020 (0.0006) [2023-03-07 08:16:13,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 31767552. Throughput: 0: 13069.2. Samples: 31760719. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:16:13,378][155126] Avg episode reward: [(0, '1598.533')] [2023-03-07 08:16:13,866][155452] Updated weights for policy 0, policy_version 31030 (0.0006) [2023-03-07 08:16:14,641][155452] Updated weights for policy 0, policy_version 31040 (0.0006) [2023-03-07 08:16:15,425][155452] Updated weights for policy 0, policy_version 31050 (0.0005) [2023-03-07 08:16:16,219][155452] Updated weights for policy 0, policy_version 31060 (0.0007) [2023-03-07 08:16:16,991][155452] Updated weights for policy 0, policy_version 31070 (0.0006) [2023-03-07 08:16:17,781][155452] Updated weights for policy 0, policy_version 31080 (0.0006) [2023-03-07 08:16:18,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13073.0, 300 sec: 13041.2). Total num frames: 31833088. Throughput: 0: 13067.2. Samples: 31799975. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:16:18,378][155126] Avg episode reward: [(0, '1416.492')] [2023-03-07 08:16:18,562][155452] Updated weights for policy 0, policy_version 31090 (0.0006) [2023-03-07 08:16:19,349][155452] Updated weights for policy 0, policy_version 31100 (0.0006) [2023-03-07 08:16:20,121][155452] Updated weights for policy 0, policy_version 31110 (0.0007) [2023-03-07 08:16:20,902][155452] Updated weights for policy 0, policy_version 31120 (0.0006) [2023-03-07 08:16:21,685][155452] Updated weights for policy 0, policy_version 31130 (0.0006) [2023-03-07 08:16:22,455][155452] Updated weights for policy 0, policy_version 31140 (0.0006) [2023-03-07 08:16:23,243][155452] Updated weights for policy 0, policy_version 31150 (0.0006) [2023-03-07 08:16:23,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13041.2). Total num frames: 31898624. Throughput: 0: 13068.7. Samples: 31878587. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:16:23,367][155126] Avg episode reward: [(0, '1447.942')] [2023-03-07 08:16:24,041][155452] Updated weights for policy 0, policy_version 31160 (0.0006) [2023-03-07 08:16:24,811][155452] Updated weights for policy 0, policy_version 31170 (0.0006) [2023-03-07 08:16:25,582][155452] Updated weights for policy 0, policy_version 31180 (0.0006) [2023-03-07 08:16:26,385][155452] Updated weights for policy 0, policy_version 31190 (0.0005) [2023-03-07 08:16:27,176][155452] Updated weights for policy 0, policy_version 31200 (0.0006) [2023-03-07 08:16:27,961][155452] Updated weights for policy 0, policy_version 31210 (0.0006) [2023-03-07 08:16:28,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13041.2). Total num frames: 31964160. Throughput: 0: 13068.9. Samples: 31957059. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:16:28,368][155126] Avg episode reward: [(0, '1764.772')] [2023-03-07 08:16:28,760][155452] Updated weights for policy 0, policy_version 31220 (0.0007) [2023-03-07 08:16:29,549][155452] Updated weights for policy 0, policy_version 31230 (0.0007) [2023-03-07 08:16:30,320][155452] Updated weights for policy 0, policy_version 31240 (0.0006) [2023-03-07 08:16:31,102][155452] Updated weights for policy 0, policy_version 31250 (0.0006) [2023-03-07 08:16:31,895][155452] Updated weights for policy 0, policy_version 31260 (0.0006) [2023-03-07 08:16:32,670][155452] Updated weights for policy 0, policy_version 31270 (0.0007) [2023-03-07 08:16:33,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13073.1, 300 sec: 13041.2). Total num frames: 32029696. Throughput: 0: 13063.5. Samples: 31996052. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:16:33,378][155126] Avg episode reward: [(0, '1899.190')] [2023-03-07 08:16:33,446][155452] Updated weights for policy 0, policy_version 31280 (0.0006) [2023-03-07 08:16:34,233][155452] Updated weights for policy 0, policy_version 31290 (0.0006) [2023-03-07 08:16:35,026][155452] Updated weights for policy 0, policy_version 31300 (0.0006) [2023-03-07 08:16:35,804][155452] Updated weights for policy 0, policy_version 31310 (0.0008) [2023-03-07 08:16:36,590][155452] Updated weights for policy 0, policy_version 31320 (0.0005) [2023-03-07 08:16:37,366][155452] Updated weights for policy 0, policy_version 31330 (0.0006) [2023-03-07 08:16:38,146][155452] Updated weights for policy 0, policy_version 31340 (0.0007) [2023-03-07 08:16:38,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13056.0, 300 sec: 13041.2). Total num frames: 32094208. Throughput: 0: 13058.3. Samples: 32074473. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:16:38,378][155126] Avg episode reward: [(0, '1893.563')] [2023-03-07 08:16:38,937][155452] Updated weights for policy 0, policy_version 31350 (0.0006) [2023-03-07 08:16:39,727][155452] Updated weights for policy 0, policy_version 31360 (0.0007) [2023-03-07 08:16:40,499][155452] Updated weights for policy 0, policy_version 31370 (0.0006) [2023-03-07 08:16:41,301][155452] Updated weights for policy 0, policy_version 31380 (0.0006) [2023-03-07 08:16:42,084][155452] Updated weights for policy 0, policy_version 31390 (0.0006) [2023-03-07 08:16:42,873][155452] Updated weights for policy 0, policy_version 31400 (0.0006) [2023-03-07 08:16:43,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13041.2). Total num frames: 32159744. Throughput: 0: 13061.4. Samples: 32152586. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:16:43,379][155126] Avg episode reward: [(0, '1748.975')] [2023-03-07 08:16:43,647][155452] Updated weights for policy 0, policy_version 31410 (0.0006) [2023-03-07 08:16:44,446][155452] Updated weights for policy 0, policy_version 31420 (0.0006) [2023-03-07 08:16:45,209][155452] Updated weights for policy 0, policy_version 31430 (0.0006) [2023-03-07 08:16:46,013][155452] Updated weights for policy 0, policy_version 31440 (0.0005) [2023-03-07 08:16:46,788][155452] Updated weights for policy 0, policy_version 31450 (0.0006) [2023-03-07 08:16:47,559][155452] Updated weights for policy 0, policy_version 31460 (0.0006) [2023-03-07 08:16:48,344][155452] Updated weights for policy 0, policy_version 31470 (0.0006) [2023-03-07 08:16:48,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13073.0, 300 sec: 13044.7). Total num frames: 32225280. Throughput: 0: 13062.0. Samples: 32191719. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:16:48,378][155126] Avg episode reward: [(0, '1653.284')] [2023-03-07 08:16:49,127][155452] Updated weights for policy 0, policy_version 31480 (0.0006) [2023-03-07 08:16:49,900][155452] Updated weights for policy 0, policy_version 31490 (0.0006) [2023-03-07 08:16:50,696][155452] Updated weights for policy 0, policy_version 31500 (0.0006) [2023-03-07 08:16:51,463][155452] Updated weights for policy 0, policy_version 31510 (0.0006) [2023-03-07 08:16:52,250][155452] Updated weights for policy 0, policy_version 31520 (0.0006) [2023-03-07 08:16:53,049][155452] Updated weights for policy 0, policy_version 31530 (0.0006) [2023-03-07 08:16:53,367][155126] Fps is (10 sec: 13107.4, 60 sec: 13073.1, 300 sec: 13044.7). Total num frames: 32290816. Throughput: 0: 13064.6. Samples: 32270540. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:16:53,367][155126] Avg episode reward: [(0, '1528.444')] [2023-03-07 08:16:53,823][155452] Updated weights for policy 0, policy_version 31540 (0.0006) [2023-03-07 08:16:54,610][155452] Updated weights for policy 0, policy_version 31550 (0.0007) [2023-03-07 08:16:55,394][155452] Updated weights for policy 0, policy_version 31560 (0.0005) [2023-03-07 08:16:56,176][155452] Updated weights for policy 0, policy_version 31570 (0.0006) [2023-03-07 08:16:56,982][155452] Updated weights for policy 0, policy_version 31580 (0.0005) [2023-03-07 08:16:57,754][155452] Updated weights for policy 0, policy_version 31590 (0.0006) [2023-03-07 08:16:58,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13044.7). Total num frames: 32355328. Throughput: 0: 13067.1. Samples: 32348740. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:16:58,368][155126] Avg episode reward: [(0, '1646.085')] [2023-03-07 08:16:58,534][155452] Updated weights for policy 0, policy_version 31600 (0.0007) [2023-03-07 08:16:59,345][155452] Updated weights for policy 0, policy_version 31610 (0.0006) [2023-03-07 08:17:00,128][155452] Updated weights for policy 0, policy_version 31620 (0.0006) [2023-03-07 08:17:00,920][155452] Updated weights for policy 0, policy_version 31630 (0.0006) [2023-03-07 08:17:01,690][155452] Updated weights for policy 0, policy_version 31640 (0.0006) [2023-03-07 08:17:02,483][155452] Updated weights for policy 0, policy_version 31650 (0.0007) [2023-03-07 08:17:03,271][155452] Updated weights for policy 0, policy_version 31660 (0.0006) [2023-03-07 08:17:03,367][155126] Fps is (10 sec: 13004.4, 60 sec: 13055.9, 300 sec: 13044.7). Total num frames: 32420864. Throughput: 0: 13057.8. Samples: 32387576. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 08:17:03,368][155126] Avg episode reward: [(0, '1752.503')] [2023-03-07 08:17:04,051][155452] Updated weights for policy 0, policy_version 31670 (0.0006) [2023-03-07 08:17:04,853][155452] Updated weights for policy 0, policy_version 31680 (0.0006) [2023-03-07 08:17:05,628][155452] Updated weights for policy 0, policy_version 31690 (0.0006) [2023-03-07 08:17:06,424][155452] Updated weights for policy 0, policy_version 31700 (0.0006) [2023-03-07 08:17:07,209][155452] Updated weights for policy 0, policy_version 31710 (0.0006) [2023-03-07 08:17:07,990][155452] Updated weights for policy 0, policy_version 31720 (0.0006) [2023-03-07 08:17:08,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13041.2). Total num frames: 32485376. Throughput: 0: 13043.8. Samples: 32465558. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 08:17:08,368][155126] Avg episode reward: [(0, '1469.122')] [2023-03-07 08:17:08,778][155452] Updated weights for policy 0, policy_version 31730 (0.0006) [2023-03-07 08:17:09,574][155452] Updated weights for policy 0, policy_version 31740 (0.0006) [2023-03-07 08:17:10,350][155452] Updated weights for policy 0, policy_version 31750 (0.0006) [2023-03-07 08:17:11,132][155452] Updated weights for policy 0, policy_version 31760 (0.0006) [2023-03-07 08:17:11,913][155452] Updated weights for policy 0, policy_version 31770 (0.0007) [2023-03-07 08:17:12,705][155452] Updated weights for policy 0, policy_version 31780 (0.0006) [2023-03-07 08:17:13,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13056.0, 300 sec: 13041.3). Total num frames: 32550912. Throughput: 0: 13047.1. Samples: 32544178. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 08:17:13,367][155126] Avg episode reward: [(0, '1639.481')] [2023-03-07 08:17:13,478][155452] Updated weights for policy 0, policy_version 31790 (0.0007) [2023-03-07 08:17:14,250][155452] Updated weights for policy 0, policy_version 31800 (0.0006) [2023-03-07 08:17:15,021][155452] Updated weights for policy 0, policy_version 31810 (0.0006) [2023-03-07 08:17:15,807][155452] Updated weights for policy 0, policy_version 31820 (0.0006) [2023-03-07 08:17:16,593][155452] Updated weights for policy 0, policy_version 31830 (0.0006) [2023-03-07 08:17:17,370][155452] Updated weights for policy 0, policy_version 31840 (0.0006) [2023-03-07 08:17:18,179][155452] Updated weights for policy 0, policy_version 31850 (0.0006) [2023-03-07 08:17:18,367][155126] Fps is (10 sec: 13107.4, 60 sec: 13056.0, 300 sec: 13041.3). Total num frames: 32616448. Throughput: 0: 13055.8. Samples: 32583561. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 08:17:18,367][155126] Avg episode reward: [(0, '1388.476')] [2023-03-07 08:17:18,949][155452] Updated weights for policy 0, policy_version 31860 (0.0007) [2023-03-07 08:17:19,717][155452] Updated weights for policy 0, policy_version 31870 (0.0007) [2023-03-07 08:17:20,517][155452] Updated weights for policy 0, policy_version 31880 (0.0006) [2023-03-07 08:17:20,656][155401] KL-divergence is very high: 199.3102 [2023-03-07 08:17:21,293][155452] Updated weights for policy 0, policy_version 31890 (0.0006) [2023-03-07 08:17:22,074][155452] Updated weights for policy 0, policy_version 31900 (0.0007) [2023-03-07 08:17:22,855][155452] Updated weights for policy 0, policy_version 31910 (0.0006) [2023-03-07 08:17:23,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13044.7). Total num frames: 32681984. Throughput: 0: 13055.1. Samples: 32661952. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:17:23,367][155126] Avg episode reward: [(0, '1511.044')] [2023-03-07 08:17:23,640][155452] Updated weights for policy 0, policy_version 31920 (0.0006) [2023-03-07 08:17:24,422][155452] Updated weights for policy 0, policy_version 31930 (0.0006) [2023-03-07 08:17:25,218][155452] Updated weights for policy 0, policy_version 31940 (0.0006) [2023-03-07 08:17:25,987][155452] Updated weights for policy 0, policy_version 31950 (0.0006) [2023-03-07 08:17:26,775][155452] Updated weights for policy 0, policy_version 31960 (0.0006) [2023-03-07 08:17:27,566][155452] Updated weights for policy 0, policy_version 31970 (0.0006) [2023-03-07 08:17:28,325][155452] Updated weights for policy 0, policy_version 31980 (0.0006) [2023-03-07 08:17:28,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13044.7). Total num frames: 32747520. Throughput: 0: 13060.9. Samples: 32740324. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:17:28,367][155126] Avg episode reward: [(0, '1345.084')] [2023-03-07 08:17:28,371][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000031980_32747520.pth... [2023-03-07 08:17:28,401][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000028921_29615104.pth [2023-03-07 08:17:29,109][155452] Updated weights for policy 0, policy_version 31990 (0.0006) [2023-03-07 08:17:29,897][155452] Updated weights for policy 0, policy_version 32000 (0.0005) [2023-03-07 08:17:30,676][155452] Updated weights for policy 0, policy_version 32010 (0.0006) [2023-03-07 08:17:31,455][155452] Updated weights for policy 0, policy_version 32020 (0.0006) [2023-03-07 08:17:32,242][155452] Updated weights for policy 0, policy_version 32030 (0.0006) [2023-03-07 08:17:33,033][155452] Updated weights for policy 0, policy_version 32040 (0.0007) [2023-03-07 08:17:33,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 32813056. Throughput: 0: 13066.6. Samples: 32779714. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:17:33,367][155126] Avg episode reward: [(0, '1535.225')] [2023-03-07 08:17:33,805][155452] Updated weights for policy 0, policy_version 32050 (0.0006) [2023-03-07 08:17:34,590][155452] Updated weights for policy 0, policy_version 32060 (0.0006) [2023-03-07 08:17:35,377][155452] Updated weights for policy 0, policy_version 32070 (0.0006) [2023-03-07 08:17:36,157][155452] Updated weights for policy 0, policy_version 32080 (0.0006) [2023-03-07 08:17:36,940][155452] Updated weights for policy 0, policy_version 32090 (0.0005) [2023-03-07 08:17:37,733][155452] Updated weights for policy 0, policy_version 32100 (0.0006) [2023-03-07 08:17:38,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13073.1, 300 sec: 13048.2). Total num frames: 32878592. Throughput: 0: 13059.5. 
Samples: 32858219. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:17:38,367][155126] Avg episode reward: [(0, '1440.440')] [2023-03-07 08:17:38,515][155452] Updated weights for policy 0, policy_version 32110 (0.0006) [2023-03-07 08:17:39,297][155452] Updated weights for policy 0, policy_version 32120 (0.0007) [2023-03-07 08:17:40,081][155452] Updated weights for policy 0, policy_version 32130 (0.0007) [2023-03-07 08:17:40,857][155452] Updated weights for policy 0, policy_version 32140 (0.0006) [2023-03-07 08:17:41,658][155452] Updated weights for policy 0, policy_version 32150 (0.0006) [2023-03-07 08:17:42,450][155452] Updated weights for policy 0, policy_version 32160 (0.0006) [2023-03-07 08:17:43,223][155452] Updated weights for policy 0, policy_version 32170 (0.0006) [2023-03-07 08:17:43,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13048.2). Total num frames: 32944128. Throughput: 0: 13056.9. Samples: 32936299. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:17:43,367][155126] Avg episode reward: [(0, '1506.213')] [2023-03-07 08:17:43,995][155452] Updated weights for policy 0, policy_version 32180 (0.0006) [2023-03-07 08:17:44,788][155452] Updated weights for policy 0, policy_version 32190 (0.0007) [2023-03-07 08:17:45,562][155452] Updated weights for policy 0, policy_version 32200 (0.0005) [2023-03-07 08:17:46,345][155452] Updated weights for policy 0, policy_version 32210 (0.0006) [2023-03-07 08:17:47,141][155452] Updated weights for policy 0, policy_version 32220 (0.0006) [2023-03-07 08:17:47,914][155452] Updated weights for policy 0, policy_version 32230 (0.0006) [2023-03-07 08:17:48,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 33008640. Throughput: 0: 13069.6. Samples: 32975704. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:17:48,378][155126] Avg episode reward: [(0, '1547.588')] [2023-03-07 08:17:48,714][155452] Updated weights for policy 0, policy_version 32240 (0.0006) [2023-03-07 08:17:49,504][155452] Updated weights for policy 0, policy_version 32250 (0.0006) [2023-03-07 08:17:50,289][155452] Updated weights for policy 0, policy_version 32260 (0.0006) [2023-03-07 08:17:51,067][155452] Updated weights for policy 0, policy_version 32270 (0.0006) [2023-03-07 08:17:51,853][155452] Updated weights for policy 0, policy_version 32280 (0.0006) [2023-03-07 08:17:52,612][155452] Updated weights for policy 0, policy_version 32290 (0.0006) [2023-03-07 08:17:53,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 33074176. Throughput: 0: 13077.5. Samples: 33054046. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:17:53,378][155126] Avg episode reward: [(0, '1501.679')] [2023-03-07 08:17:53,417][155452] Updated weights for policy 0, policy_version 32300 (0.0006) [2023-03-07 08:17:54,211][155452] Updated weights for policy 0, policy_version 32310 (0.0006) [2023-03-07 08:17:55,009][155452] Updated weights for policy 0, policy_version 32320 (0.0006) [2023-03-07 08:17:55,777][155452] Updated weights for policy 0, policy_version 32330 (0.0006) [2023-03-07 08:17:56,577][155452] Updated weights for policy 0, policy_version 32340 (0.0006) [2023-03-07 08:17:57,356][155452] Updated weights for policy 0, policy_version 32350 (0.0006) [2023-03-07 08:17:58,149][155452] Updated weights for policy 0, policy_version 32360 (0.0006) [2023-03-07 08:17:58,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 33138688. Throughput: 0: 13065.3. Samples: 33132115. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:17:58,378][155126] Avg episode reward: [(0, '1738.648')] [2023-03-07 08:17:58,953][155452] Updated weights for policy 0, policy_version 32370 (0.0006) [2023-03-07 08:17:59,738][155452] Updated weights for policy 0, policy_version 32380 (0.0006) [2023-03-07 08:18:00,513][155452] Updated weights for policy 0, policy_version 32390 (0.0007) [2023-03-07 08:18:01,295][155452] Updated weights for policy 0, policy_version 32400 (0.0005) [2023-03-07 08:18:02,093][155452] Updated weights for policy 0, policy_version 32410 (0.0006) [2023-03-07 08:18:02,873][155452] Updated weights for policy 0, policy_version 32420 (0.0006) [2023-03-07 08:18:03,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 33204224. Throughput: 0: 13055.4. Samples: 33171055. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:18:03,367][155126] Avg episode reward: [(0, '1422.594')] [2023-03-07 08:18:03,664][155452] Updated weights for policy 0, policy_version 32430 (0.0006) [2023-03-07 08:18:04,447][155452] Updated weights for policy 0, policy_version 32440 (0.0006) [2023-03-07 08:18:05,234][155452] Updated weights for policy 0, policy_version 32450 (0.0006) [2023-03-07 08:18:06,022][155452] Updated weights for policy 0, policy_version 32460 (0.0006) [2023-03-07 08:18:06,802][155452] Updated weights for policy 0, policy_version 32470 (0.0006) [2023-03-07 08:18:07,595][155452] Updated weights for policy 0, policy_version 32480 (0.0006) [2023-03-07 08:18:08,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 33268736. Throughput: 0: 13051.0. Samples: 33249245. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:18:08,378][155452] Updated weights for policy 0, policy_version 32490 (0.0007) [2023-03-07 08:18:08,367][155126] Avg episode reward: [(0, '1539.704')] [2023-03-07 08:18:09,174][155452] Updated weights for policy 0, policy_version 32500 (0.0007) [2023-03-07 08:18:09,957][155452] Updated weights for policy 0, policy_version 32510 (0.0007) [2023-03-07 08:18:10,741][155452] Updated weights for policy 0, policy_version 32520 (0.0006) [2023-03-07 08:18:11,531][155452] Updated weights for policy 0, policy_version 32530 (0.0006) [2023-03-07 08:18:12,309][155452] Updated weights for policy 0, policy_version 32540 (0.0006) [2023-03-07 08:18:13,099][155452] Updated weights for policy 0, policy_version 32550 (0.0006) [2023-03-07 08:18:13,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 33334272. Throughput: 0: 13043.1. Samples: 33327263. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:18:13,367][155126] Avg episode reward: [(0, '1536.766')] [2023-03-07 08:18:13,877][155452] Updated weights for policy 0, policy_version 32560 (0.0006) [2023-03-07 08:18:14,649][155452] Updated weights for policy 0, policy_version 32570 (0.0006) [2023-03-07 08:18:15,428][155452] Updated weights for policy 0, policy_version 32580 (0.0006) [2023-03-07 08:18:16,230][155452] Updated weights for policy 0, policy_version 32590 (0.0006) [2023-03-07 08:18:17,005][155452] Updated weights for policy 0, policy_version 32600 (0.0006) [2023-03-07 08:18:17,779][155452] Updated weights for policy 0, policy_version 32610 (0.0006) [2023-03-07 08:18:18,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 33399808. Throughput: 0: 13042.8. Samples: 33366641. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:18:18,367][155126] Avg episode reward: [(0, '1777.790')] [2023-03-07 08:18:18,572][155452] Updated weights for policy 0, policy_version 32620 (0.0006) [2023-03-07 08:18:19,359][155452] Updated weights for policy 0, policy_version 32630 (0.0007) [2023-03-07 08:18:20,141][155452] Updated weights for policy 0, policy_version 32640 (0.0006) [2023-03-07 08:18:20,913][155452] Updated weights for policy 0, policy_version 32650 (0.0006) [2023-03-07 08:18:21,714][155452] Updated weights for policy 0, policy_version 32660 (0.0006) [2023-03-07 08:18:22,495][155452] Updated weights for policy 0, policy_version 32670 (0.0006) [2023-03-07 08:18:23,286][155452] Updated weights for policy 0, policy_version 32680 (0.0006) [2023-03-07 08:18:23,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 33465344. Throughput: 0: 13038.0. Samples: 33444927. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:18:23,367][155126] Avg episode reward: [(0, '1785.871')] [2023-03-07 08:18:24,072][155452] Updated weights for policy 0, policy_version 32690 (0.0006) [2023-03-07 08:18:24,851][155452] Updated weights for policy 0, policy_version 32700 (0.0007) [2023-03-07 08:18:25,620][155452] Updated weights for policy 0, policy_version 32710 (0.0007) [2023-03-07 08:18:26,414][155452] Updated weights for policy 0, policy_version 32720 (0.0006) [2023-03-07 08:18:27,188][155452] Updated weights for policy 0, policy_version 32730 (0.0007) [2023-03-07 08:18:27,966][155452] Updated weights for policy 0, policy_version 32740 (0.0007) [2023-03-07 08:18:28,367][155126] Fps is (10 sec: 13107.0, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 33530880. Throughput: 0: 13046.9. Samples: 33523411. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:18:28,368][155126] Avg episode reward: [(0, '1618.174')] [2023-03-07 08:18:28,761][155452] Updated weights for policy 0, policy_version 32750 (0.0006) [2023-03-07 08:18:29,555][155452] Updated weights for policy 0, policy_version 32760 (0.0007) [2023-03-07 08:18:30,338][155452] Updated weights for policy 0, policy_version 32770 (0.0006) [2023-03-07 08:18:31,119][155452] Updated weights for policy 0, policy_version 32780 (0.0006) [2023-03-07 08:18:31,905][155452] Updated weights for policy 0, policy_version 32790 (0.0007) [2023-03-07 08:18:32,685][155452] Updated weights for policy 0, policy_version 32800 (0.0006) [2023-03-07 08:18:33,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13048.2). Total num frames: 33595392. Throughput: 0: 13039.1. Samples: 33562466. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:18:33,368][155126] Avg episode reward: [(0, '1756.891')] [2023-03-07 08:18:33,483][155452] Updated weights for policy 0, policy_version 32810 (0.0006) [2023-03-07 08:18:34,267][155452] Updated weights for policy 0, policy_version 32820 (0.0006) [2023-03-07 08:18:35,038][155452] Updated weights for policy 0, policy_version 32830 (0.0006) [2023-03-07 08:18:35,833][155452] Updated weights for policy 0, policy_version 32840 (0.0006) [2023-03-07 08:18:36,609][155452] Updated weights for policy 0, policy_version 32850 (0.0005) [2023-03-07 08:18:37,405][155452] Updated weights for policy 0, policy_version 32860 (0.0006) [2023-03-07 08:18:38,191][155452] Updated weights for policy 0, policy_version 32870 (0.0006) [2023-03-07 08:18:38,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 33660928. Throughput: 0: 13038.2. Samples: 33640765. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:18:38,367][155126] Avg episode reward: [(0, '1523.509')] [2023-03-07 08:18:38,988][155452] Updated weights for policy 0, policy_version 32880 (0.0007) [2023-03-07 08:18:39,748][155452] Updated weights for policy 0, policy_version 32890 (0.0007) [2023-03-07 08:18:40,553][155452] Updated weights for policy 0, policy_version 32900 (0.0006) [2023-03-07 08:18:41,320][155452] Updated weights for policy 0, policy_version 32910 (0.0006) [2023-03-07 08:18:42,093][155452] Updated weights for policy 0, policy_version 32920 (0.0006) [2023-03-07 08:18:42,881][155452] Updated weights for policy 0, policy_version 32930 (0.0006) [2023-03-07 08:18:43,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 33726464. Throughput: 0: 13047.3. Samples: 33719245. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:18:43,368][155126] Avg episode reward: [(0, '1787.506')] [2023-03-07 08:18:43,661][155452] Updated weights for policy 0, policy_version 32940 (0.0006) [2023-03-07 08:18:44,445][155452] Updated weights for policy 0, policy_version 32950 (0.0007) [2023-03-07 08:18:45,233][155452] Updated weights for policy 0, policy_version 32960 (0.0006) [2023-03-07 08:18:46,021][155452] Updated weights for policy 0, policy_version 32970 (0.0006) [2023-03-07 08:18:46,817][155452] Updated weights for policy 0, policy_version 32980 (0.0006) [2023-03-07 08:18:47,602][155452] Updated weights for policy 0, policy_version 32990 (0.0006) [2023-03-07 08:18:48,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13048.2). Total num frames: 33790976. Throughput: 0: 13052.0. Samples: 33758394. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:18:48,367][155126] Avg episode reward: [(0, '1765.177')] [2023-03-07 08:18:48,374][155452] Updated weights for policy 0, policy_version 33000 (0.0006) [2023-03-07 08:18:49,151][155452] Updated weights for policy 0, policy_version 33010 (0.0006) [2023-03-07 08:18:49,946][155452] Updated weights for policy 0, policy_version 33020 (0.0006) [2023-03-07 08:18:50,721][155452] Updated weights for policy 0, policy_version 33030 (0.0007) [2023-03-07 08:18:51,523][155452] Updated weights for policy 0, policy_version 33040 (0.0006) [2023-03-07 08:18:52,301][155452] Updated weights for policy 0, policy_version 33050 (0.0006) [2023-03-07 08:18:53,071][155452] Updated weights for policy 0, policy_version 33060 (0.0006) [2023-03-07 08:18:53,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13048.2). Total num frames: 33856512. Throughput: 0: 13054.8. Samples: 33836711. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:18:53,368][155126] Avg episode reward: [(0, '1606.423')] [2023-03-07 08:18:53,862][155452] Updated weights for policy 0, policy_version 33070 (0.0006) [2023-03-07 08:18:54,635][155452] Updated weights for policy 0, policy_version 33080 (0.0006) [2023-03-07 08:18:55,404][155452] Updated weights for policy 0, policy_version 33090 (0.0008) [2023-03-07 08:18:56,202][155452] Updated weights for policy 0, policy_version 33100 (0.0006) [2023-03-07 08:18:56,983][155452] Updated weights for policy 0, policy_version 33110 (0.0006) [2023-03-07 08:18:57,767][155452] Updated weights for policy 0, policy_version 33120 (0.0006) [2023-03-07 08:18:58,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 33922048. Throughput: 0: 13069.3. Samples: 33915383. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:18:58,367][155126] Avg episode reward: [(0, '1755.165')] [2023-03-07 08:18:58,544][155452] Updated weights for policy 0, policy_version 33130 (0.0006) [2023-03-07 08:18:59,325][155452] Updated weights for policy 0, policy_version 33140 (0.0006) [2023-03-07 08:19:00,121][155452] Updated weights for policy 0, policy_version 33150 (0.0006) [2023-03-07 08:19:00,907][155452] Updated weights for policy 0, policy_version 33160 (0.0007) [2023-03-07 08:19:01,681][155452] Updated weights for policy 0, policy_version 33170 (0.0007) [2023-03-07 08:19:02,444][155452] Updated weights for policy 0, policy_version 33180 (0.0006) [2023-03-07 08:19:03,238][155452] Updated weights for policy 0, policy_version 33190 (0.0007) [2023-03-07 08:19:03,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13051.7). Total num frames: 33987584. Throughput: 0: 13061.8. Samples: 33954423. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:19:03,367][155126] Avg episode reward: [(0, '1971.348')] [2023-03-07 08:19:04,016][155452] Updated weights for policy 0, policy_version 33200 (0.0006) [2023-03-07 08:19:04,790][155452] Updated weights for policy 0, policy_version 33210 (0.0006) [2023-03-07 08:19:05,565][155452] Updated weights for policy 0, policy_version 33220 (0.0006) [2023-03-07 08:19:06,355][155452] Updated weights for policy 0, policy_version 33230 (0.0006) [2023-03-07 08:19:07,141][155452] Updated weights for policy 0, policy_version 33240 (0.0006) [2023-03-07 08:19:07,924][155452] Updated weights for policy 0, policy_version 33250 (0.0005) [2023-03-07 08:19:08,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13055.1). Total num frames: 34053120. Throughput: 0: 13074.6. Samples: 34033284. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:19:08,367][155126] Avg episode reward: [(0, '1634.766')] [2023-03-07 08:19:08,690][155452] Updated weights for policy 0, policy_version 33260 (0.0006) [2023-03-07 08:19:09,465][155452] Updated weights for policy 0, policy_version 33270 (0.0006) [2023-03-07 08:19:10,255][155452] Updated weights for policy 0, policy_version 33280 (0.0006) [2023-03-07 08:19:11,044][155452] Updated weights for policy 0, policy_version 33290 (0.0006) [2023-03-07 08:19:11,829][155452] Updated weights for policy 0, policy_version 33300 (0.0006) [2023-03-07 08:19:12,600][155452] Updated weights for policy 0, policy_version 33310 (0.0007) [2023-03-07 08:19:13,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13051.7). Total num frames: 34118656. Throughput: 0: 13079.4. Samples: 34111984. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:19:13,367][155126] Avg episode reward: [(0, '1598.808')] [2023-03-07 08:19:13,382][155452] Updated weights for policy 0, policy_version 33320 (0.0006) [2023-03-07 08:19:14,166][155452] Updated weights for policy 0, policy_version 33330 (0.0006) [2023-03-07 08:19:14,944][155452] Updated weights for policy 0, policy_version 33340 (0.0006) [2023-03-07 08:19:15,720][155452] Updated weights for policy 0, policy_version 33350 (0.0006) [2023-03-07 08:19:16,500][155452] Updated weights for policy 0, policy_version 33360 (0.0006) [2023-03-07 08:19:17,295][155452] Updated weights for policy 0, policy_version 33370 (0.0006) [2023-03-07 08:19:18,062][155452] Updated weights for policy 0, policy_version 33380 (0.0006) [2023-03-07 08:19:18,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13055.1). Total num frames: 34184192. Throughput: 0: 13084.3. Samples: 34151258. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:19:18,367][155126] Avg episode reward: [(0, '1674.669')] [2023-03-07 08:19:18,857][155452] Updated weights for policy 0, policy_version 33390 (0.0006) [2023-03-07 08:19:19,641][155452] Updated weights for policy 0, policy_version 33400 (0.0007) [2023-03-07 08:19:20,429][155452] Updated weights for policy 0, policy_version 33410 (0.0007) [2023-03-07 08:19:21,216][155452] Updated weights for policy 0, policy_version 33420 (0.0006) [2023-03-07 08:19:22,017][155452] Updated weights for policy 0, policy_version 33430 (0.0006) [2023-03-07 08:19:22,798][155452] Updated weights for policy 0, policy_version 33440 (0.0006) [2023-03-07 08:19:23,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13055.1). Total num frames: 34249728. Throughput: 0: 13087.2. Samples: 34229689. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:19:23,367][155126] Avg episode reward: [(0, '1707.767')] [2023-03-07 08:19:23,578][155452] Updated weights for policy 0, policy_version 33450 (0.0006) [2023-03-07 08:19:24,385][155452] Updated weights for policy 0, policy_version 33460 (0.0006) [2023-03-07 08:19:25,168][155452] Updated weights for policy 0, policy_version 33470 (0.0007) [2023-03-07 08:19:25,955][155452] Updated weights for policy 0, policy_version 33480 (0.0006) [2023-03-07 08:19:26,755][155452] Updated weights for policy 0, policy_version 33490 (0.0006) [2023-03-07 08:19:27,539][155452] Updated weights for policy 0, policy_version 33500 (0.0006) [2023-03-07 08:19:28,321][155452] Updated weights for policy 0, policy_version 33510 (0.0006) [2023-03-07 08:19:28,367][155126] Fps is (10 sec: 13004.5, 60 sec: 13056.0, 300 sec: 13055.1). Total num frames: 34314240. Throughput: 0: 13067.2. Samples: 34307268. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:19:28,368][155126] Avg episode reward: [(0, '1690.123')] [2023-03-07 08:19:28,372][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000033510_34314240.pth... [2023-03-07 08:19:28,403][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000030449_31179776.pth [2023-03-07 08:19:29,109][155452] Updated weights for policy 0, policy_version 33520 (0.0007) [2023-03-07 08:19:29,906][155452] Updated weights for policy 0, policy_version 33530 (0.0006) [2023-03-07 08:19:30,689][155452] Updated weights for policy 0, policy_version 33540 (0.0006) [2023-03-07 08:19:31,466][155452] Updated weights for policy 0, policy_version 33550 (0.0006) [2023-03-07 08:19:32,258][155452] Updated weights for policy 0, policy_version 33560 (0.0005) [2023-03-07 08:19:33,036][155452] Updated weights for policy 0, policy_version 33570 (0.0006) [2023-03-07 08:19:33,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13073.1, 300 sec: 13055.1). Total num frames: 34379776. Throughput: 0: 13065.7. Samples: 34346350. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:19:33,367][155126] Avg episode reward: [(0, '1788.345')] [2023-03-07 08:19:33,819][155452] Updated weights for policy 0, policy_version 33580 (0.0006) [2023-03-07 08:19:34,603][155452] Updated weights for policy 0, policy_version 33590 (0.0006) [2023-03-07 08:19:35,386][155452] Updated weights for policy 0, policy_version 33600 (0.0006) [2023-03-07 08:19:36,186][155452] Updated weights for policy 0, policy_version 33610 (0.0006) [2023-03-07 08:19:36,957][155452] Updated weights for policy 0, policy_version 33620 (0.0006) [2023-03-07 08:19:37,739][155452] Updated weights for policy 0, policy_version 33630 (0.0006) [2023-03-07 08:19:38,367][155126] Fps is (10 sec: 13107.4, 60 sec: 13073.1, 300 sec: 13055.1). Total num frames: 34445312. Throughput: 0: 13068.3. 
Samples: 34424783. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:19:38,367][155126] Avg episode reward: [(0, '1888.150')] [2023-03-07 08:19:38,513][155452] Updated weights for policy 0, policy_version 33640 (0.0006) [2023-03-07 08:19:39,316][155452] Updated weights for policy 0, policy_version 33650 (0.0006) [2023-03-07 08:19:40,099][155452] Updated weights for policy 0, policy_version 33660 (0.0006) [2023-03-07 08:19:40,874][155452] Updated weights for policy 0, policy_version 33670 (0.0006) [2023-03-07 08:19:41,653][155452] Updated weights for policy 0, policy_version 33680 (0.0007) [2023-03-07 08:19:42,430][155452] Updated weights for policy 0, policy_version 33690 (0.0005) [2023-03-07 08:19:43,209][155452] Updated weights for policy 0, policy_version 33700 (0.0006) [2023-03-07 08:19:43,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13073.1, 300 sec: 13058.6). Total num frames: 34510848. Throughput: 0: 13062.5. Samples: 34503194. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:19:43,368][155126] Avg episode reward: [(0, '1558.425')] [2023-03-07 08:19:43,998][155452] Updated weights for policy 0, policy_version 33710 (0.0006) [2023-03-07 08:19:44,810][155452] Updated weights for policy 0, policy_version 33720 (0.0007) [2023-03-07 08:19:45,583][155452] Updated weights for policy 0, policy_version 33730 (0.0006) [2023-03-07 08:19:46,352][155452] Updated weights for policy 0, policy_version 33740 (0.0006) [2023-03-07 08:19:47,162][155452] Updated weights for policy 0, policy_version 33750 (0.0006) [2023-03-07 08:19:47,929][155452] Updated weights for policy 0, policy_version 33760 (0.0006) [2023-03-07 08:19:48,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13073.1, 300 sec: 13055.1). Total num frames: 34575360. Throughput: 0: 13066.9. Samples: 34542432. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:19:48,367][155126] Avg episode reward: [(0, '1788.486')] [2023-03-07 08:19:48,706][155452] Updated weights for policy 0, policy_version 33770 (0.0005) [2023-03-07 08:19:49,497][155452] Updated weights for policy 0, policy_version 33780 (0.0007) [2023-03-07 08:19:50,280][155452] Updated weights for policy 0, policy_version 33790 (0.0006) [2023-03-07 08:19:51,050][155452] Updated weights for policy 0, policy_version 33800 (0.0007) [2023-03-07 08:19:51,849][155452] Updated weights for policy 0, policy_version 33810 (0.0007) [2023-03-07 08:19:52,617][155452] Updated weights for policy 0, policy_version 33820 (0.0006) [2023-03-07 08:19:53,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13073.1, 300 sec: 13058.6). Total num frames: 34640896. Throughput: 0: 13054.7. Samples: 34620745. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:19:53,367][155126] Avg episode reward: [(0, '1817.657')] [2023-03-07 08:19:53,415][155452] Updated weights for policy 0, policy_version 33830 (0.0006) [2023-03-07 08:19:54,197][155452] Updated weights for policy 0, policy_version 33840 (0.0006) [2023-03-07 08:19:54,971][155452] Updated weights for policy 0, policy_version 33850 (0.0006) [2023-03-07 08:19:55,749][155452] Updated weights for policy 0, policy_version 33860 (0.0006) [2023-03-07 08:19:56,538][155452] Updated weights for policy 0, policy_version 33870 (0.0006) [2023-03-07 08:19:57,315][155452] Updated weights for policy 0, policy_version 33880 (0.0007) [2023-03-07 08:19:58,082][155452] Updated weights for policy 0, policy_version 33890 (0.0007) [2023-03-07 08:19:58,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13062.1). Total num frames: 34706432. Throughput: 0: 13054.8. Samples: 34699452. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:19:58,367][155126] Avg episode reward: [(0, '1817.694')] [2023-03-07 08:19:58,873][155452] Updated weights for policy 0, policy_version 33900 (0.0007) [2023-03-07 08:19:59,662][155452] Updated weights for policy 0, policy_version 33910 (0.0006) [2023-03-07 08:20:00,444][155452] Updated weights for policy 0, policy_version 33920 (0.0006) [2023-03-07 08:20:01,239][155452] Updated weights for policy 0, policy_version 33930 (0.0006) [2023-03-07 08:20:02,029][155452] Updated weights for policy 0, policy_version 33940 (0.0006) [2023-03-07 08:20:02,806][155452] Updated weights for policy 0, policy_version 33950 (0.0006) [2023-03-07 08:20:03,367][155126] Fps is (10 sec: 13107.4, 60 sec: 13073.1, 300 sec: 13062.1). Total num frames: 34771968. Throughput: 0: 13050.6. Samples: 34738533. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:20:03,367][155126] Avg episode reward: [(0, '2098.166')] [2023-03-07 08:20:03,602][155452] Updated weights for policy 0, policy_version 33960 (0.0006) [2023-03-07 08:20:04,380][155452] Updated weights for policy 0, policy_version 33970 (0.0006) [2023-03-07 08:20:05,161][155452] Updated weights for policy 0, policy_version 33980 (0.0006) [2023-03-07 08:20:05,949][155452] Updated weights for policy 0, policy_version 33990 (0.0006) [2023-03-07 08:20:06,741][155452] Updated weights for policy 0, policy_version 34000 (0.0006) [2023-03-07 08:20:07,518][155452] Updated weights for policy 0, policy_version 34010 (0.0006) [2023-03-07 08:20:08,292][155452] Updated weights for policy 0, policy_version 34020 (0.0007) [2023-03-07 08:20:08,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13062.1). Total num frames: 34837504. Throughput: 0: 13045.9. Samples: 34816753. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:20:08,367][155126] Avg episode reward: [(0, '1748.392')] [2023-03-07 08:20:09,083][155452] Updated weights for policy 0, policy_version 34030 (0.0006) [2023-03-07 08:20:09,876][155452] Updated weights for policy 0, policy_version 34040 (0.0006) [2023-03-07 08:20:10,657][155452] Updated weights for policy 0, policy_version 34050 (0.0006) [2023-03-07 08:20:11,455][155452] Updated weights for policy 0, policy_version 34060 (0.0006) [2023-03-07 08:20:12,219][155452] Updated weights for policy 0, policy_version 34070 (0.0006) [2023-03-07 08:20:13,010][155452] Updated weights for policy 0, policy_version 34080 (0.0006) [2023-03-07 08:20:13,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13062.1). Total num frames: 34902016. Throughput: 0: 13060.9. Samples: 34895008. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:20:13,367][155126] Avg episode reward: [(0, '1592.310')] [2023-03-07 08:20:13,797][155452] Updated weights for policy 0, policy_version 34090 (0.0006) [2023-03-07 08:20:14,588][155452] Updated weights for policy 0, policy_version 34100 (0.0007) [2023-03-07 08:20:15,378][155452] Updated weights for policy 0, policy_version 34110 (0.0006) [2023-03-07 08:20:16,146][155452] Updated weights for policy 0, policy_version 34120 (0.0007) [2023-03-07 08:20:16,950][155452] Updated weights for policy 0, policy_version 34130 (0.0006) [2023-03-07 08:20:17,754][155452] Updated weights for policy 0, policy_version 34140 (0.0007) [2023-03-07 08:20:18,367][155126] Fps is (10 sec: 12902.2, 60 sec: 13038.9, 300 sec: 13058.6). Total num frames: 34966528. Throughput: 0: 13062.9. Samples: 34934181. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:20:18,367][155126] Avg episode reward: [(0, '1675.227')] [2023-03-07 08:20:18,525][155452] Updated weights for policy 0, policy_version 34150 (0.0006) [2023-03-07 08:20:19,316][155452] Updated weights for policy 0, policy_version 34160 (0.0005) [2023-03-07 08:20:20,093][155452] Updated weights for policy 0, policy_version 34170 (0.0006) [2023-03-07 08:20:20,872][155452] Updated weights for policy 0, policy_version 34180 (0.0006) [2023-03-07 08:20:21,637][155452] Updated weights for policy 0, policy_version 34190 (0.0006) [2023-03-07 08:20:22,426][155452] Updated weights for policy 0, policy_version 34200 (0.0006) [2023-03-07 08:20:23,202][155452] Updated weights for policy 0, policy_version 34210 (0.0006) [2023-03-07 08:20:23,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13062.1). Total num frames: 35033088. Throughput: 0: 13063.2. Samples: 35012627. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:20:23,367][155126] Avg episode reward: [(0, '1776.169')] [2023-03-07 08:20:23,984][155452] Updated weights for policy 0, policy_version 34220 (0.0006) [2023-03-07 08:20:24,763][155452] Updated weights for policy 0, policy_version 34230 (0.0006) [2023-03-07 08:20:25,540][155452] Updated weights for policy 0, policy_version 34240 (0.0006) [2023-03-07 08:20:26,306][155452] Updated weights for policy 0, policy_version 34250 (0.0006) [2023-03-07 08:20:27,096][155452] Updated weights for policy 0, policy_version 34260 (0.0007) [2023-03-07 08:20:27,865][155452] Updated weights for policy 0, policy_version 34270 (0.0006) [2023-03-07 08:20:28,367][155126] Fps is (10 sec: 13209.5, 60 sec: 13073.1, 300 sec: 13062.1). Total num frames: 35098624. Throughput: 0: 13072.6. Samples: 35091461. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:20:28,368][155126] Avg episode reward: [(0, '1674.444')] [2023-03-07 08:20:28,650][155452] Updated weights for policy 0, policy_version 34280 (0.0006) [2023-03-07 08:20:29,436][155452] Updated weights for policy 0, policy_version 34290 (0.0006) [2023-03-07 08:20:30,222][155452] Updated weights for policy 0, policy_version 34300 (0.0006) [2023-03-07 08:20:31,019][155452] Updated weights for policy 0, policy_version 34310 (0.0006) [2023-03-07 08:20:31,799][155452] Updated weights for policy 0, policy_version 34320 (0.0006) [2023-03-07 08:20:32,580][155452] Updated weights for policy 0, policy_version 34330 (0.0005) [2023-03-07 08:20:33,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 35163136. Throughput: 0: 13068.0. Samples: 35130494. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:20:33,367][155126] Avg episode reward: [(0, '1613.070')] [2023-03-07 08:20:33,369][155452] Updated weights for policy 0, policy_version 34340 (0.0006) [2023-03-07 08:20:34,154][155452] Updated weights for policy 0, policy_version 34350 (0.0006) [2023-03-07 08:20:34,932][155452] Updated weights for policy 0, policy_version 34360 (0.0009) [2023-03-07 08:20:35,714][155452] Updated weights for policy 0, policy_version 34370 (0.0006) [2023-03-07 08:20:36,506][155452] Updated weights for policy 0, policy_version 34380 (0.0006) [2023-03-07 08:20:37,285][155452] Updated weights for policy 0, policy_version 34390 (0.0006) [2023-03-07 08:20:38,077][155452] Updated weights for policy 0, policy_version 34400 (0.0008) [2023-03-07 08:20:38,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 35228672. Throughput: 0: 13069.4. Samples: 35208867. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:20:38,368][155126] Avg episode reward: [(0, '1893.206')] [2023-03-07 08:20:38,844][155452] Updated weights for policy 0, policy_version 34410 (0.0006) [2023-03-07 08:20:39,629][155452] Updated weights for policy 0, policy_version 34420 (0.0006) [2023-03-07 08:20:40,452][155452] Updated weights for policy 0, policy_version 34430 (0.0006) [2023-03-07 08:20:41,228][155452] Updated weights for policy 0, policy_version 34440 (0.0006) [2023-03-07 08:20:42,017][155452] Updated weights for policy 0, policy_version 34450 (0.0006) [2023-03-07 08:20:42,794][155452] Updated weights for policy 0, policy_version 34460 (0.0006) [2023-03-07 08:20:43,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13062.1). Total num frames: 35294208. Throughput: 0: 13059.5. Samples: 35287131. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:20:43,367][155126] Avg episode reward: [(0, '1652.761')] [2023-03-07 08:20:43,601][155452] Updated weights for policy 0, policy_version 34470 (0.0007) [2023-03-07 08:20:44,379][155452] Updated weights for policy 0, policy_version 34480 (0.0007) [2023-03-07 08:20:45,162][155452] Updated weights for policy 0, policy_version 34490 (0.0006) [2023-03-07 08:20:45,939][155452] Updated weights for policy 0, policy_version 34500 (0.0006) [2023-03-07 08:20:46,716][155452] Updated weights for policy 0, policy_version 34510 (0.0006) [2023-03-07 08:20:47,498][155452] Updated weights for policy 0, policy_version 34520 (0.0006) [2023-03-07 08:20:48,281][155452] Updated weights for policy 0, policy_version 34530 (0.0006) [2023-03-07 08:20:48,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 35358720. Throughput: 0: 13057.4. Samples: 35326119. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:20:48,367][155126] Avg episode reward: [(0, '1714.633')] [2023-03-07 08:20:49,077][155452] Updated weights for policy 0, policy_version 34540 (0.0006) [2023-03-07 08:20:49,846][155452] Updated weights for policy 0, policy_version 34550 (0.0006) [2023-03-07 08:20:50,649][155452] Updated weights for policy 0, policy_version 34560 (0.0007) [2023-03-07 08:20:51,408][155452] Updated weights for policy 0, policy_version 34570 (0.0007) [2023-03-07 08:20:52,200][155452] Updated weights for policy 0, policy_version 34580 (0.0006) [2023-03-07 08:20:52,992][155452] Updated weights for policy 0, policy_version 34590 (0.0007) [2023-03-07 08:20:53,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 35424256. Throughput: 0: 13063.0. Samples: 35404590. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:20:53,367][155126] Avg episode reward: [(0, '1814.000')] [2023-03-07 08:20:53,774][155452] Updated weights for policy 0, policy_version 34600 (0.0006) [2023-03-07 08:20:54,555][155452] Updated weights for policy 0, policy_version 34610 (0.0006) [2023-03-07 08:20:55,342][155452] Updated weights for policy 0, policy_version 34620 (0.0006) [2023-03-07 08:20:56,131][155452] Updated weights for policy 0, policy_version 34630 (0.0006) [2023-03-07 08:20:56,905][155452] Updated weights for policy 0, policy_version 34640 (0.0007) [2023-03-07 08:20:57,701][155452] Updated weights for policy 0, policy_version 34650 (0.0006) [2023-03-07 08:20:58,367][155126] Fps is (10 sec: 13106.9, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 35489792. Throughput: 0: 13065.5. Samples: 35482961. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:20:58,378][155126] Avg episode reward: [(0, '1526.461')] [2023-03-07 08:20:58,477][155452] Updated weights for policy 0, policy_version 34660 (0.0007) [2023-03-07 08:20:59,271][155452] Updated weights for policy 0, policy_version 34670 (0.0006) [2023-03-07 08:21:00,048][155452] Updated weights for policy 0, policy_version 34680 (0.0006) [2023-03-07 08:21:00,823][155452] Updated weights for policy 0, policy_version 34690 (0.0006) [2023-03-07 08:21:01,611][155452] Updated weights for policy 0, policy_version 34700 (0.0006) [2023-03-07 08:21:02,403][155452] Updated weights for policy 0, policy_version 34710 (0.0006) [2023-03-07 08:21:03,176][155452] Updated weights for policy 0, policy_version 34720 (0.0006) [2023-03-07 08:21:03,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 35555328. Throughput: 0: 13066.7. Samples: 35522181. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:21:03,367][155126] Avg episode reward: [(0, '1898.090')] [2023-03-07 08:21:03,980][155452] Updated weights for policy 0, policy_version 34730 (0.0006) [2023-03-07 08:21:04,766][155452] Updated weights for policy 0, policy_version 34740 (0.0006) [2023-03-07 08:21:05,548][155452] Updated weights for policy 0, policy_version 34750 (0.0006) [2023-03-07 08:21:06,332][155452] Updated weights for policy 0, policy_version 34760 (0.0006) [2023-03-07 08:21:07,116][155452] Updated weights for policy 0, policy_version 34770 (0.0006) [2023-03-07 08:21:07,891][155452] Updated weights for policy 0, policy_version 34780 (0.0006) [2023-03-07 08:21:08,367][155126] Fps is (10 sec: 13107.4, 60 sec: 13056.0, 300 sec: 13062.1). Total num frames: 35620864. Throughput: 0: 13061.1. Samples: 35600377. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:21:08,367][155126] Avg episode reward: [(0, '1881.871')] [2023-03-07 08:21:08,670][155452] Updated weights for policy 0, policy_version 34790 (0.0006) [2023-03-07 08:21:09,459][155452] Updated weights for policy 0, policy_version 34800 (0.0006) [2023-03-07 08:21:10,246][155452] Updated weights for policy 0, policy_version 34810 (0.0006) [2023-03-07 08:21:11,050][155452] Updated weights for policy 0, policy_version 34820 (0.0006) [2023-03-07 08:21:11,842][155452] Updated weights for policy 0, policy_version 34830 (0.0006) [2023-03-07 08:21:12,637][155452] Updated weights for policy 0, policy_version 34840 (0.0006) [2023-03-07 08:21:13,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 35685376. Throughput: 0: 13043.0. Samples: 35678396. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:21:13,367][155126] Avg episode reward: [(0, '1869.258')] [2023-03-07 08:21:13,404][155452] Updated weights for policy 0, policy_version 34850 (0.0006) [2023-03-07 08:21:14,189][155452] Updated weights for policy 0, policy_version 34860 (0.0007) [2023-03-07 08:21:14,963][155452] Updated weights for policy 0, policy_version 34870 (0.0006) [2023-03-07 08:21:15,749][155452] Updated weights for policy 0, policy_version 34880 (0.0005) [2023-03-07 08:21:16,511][155452] Updated weights for policy 0, policy_version 34890 (0.0006) [2023-03-07 08:21:17,309][155452] Updated weights for policy 0, policy_version 34900 (0.0007) [2023-03-07 08:21:18,093][155452] Updated weights for policy 0, policy_version 34910 (0.0006) [2023-03-07 08:21:18,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13073.1, 300 sec: 13058.6). Total num frames: 35750912. Throughput: 0: 13052.9. Samples: 35717874. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:21:18,368][155126] Avg episode reward: [(0, '1676.849')] [2023-03-07 08:21:18,887][155452] Updated weights for policy 0, policy_version 34920 (0.0005) [2023-03-07 08:21:19,676][155452] Updated weights for policy 0, policy_version 34930 (0.0006) [2023-03-07 08:21:20,472][155452] Updated weights for policy 0, policy_version 34940 (0.0007) [2023-03-07 08:21:21,245][155452] Updated weights for policy 0, policy_version 34950 (0.0006) [2023-03-07 08:21:22,045][155452] Updated weights for policy 0, policy_version 34960 (0.0006) [2023-03-07 08:21:22,822][155452] Updated weights for policy 0, policy_version 34970 (0.0007) [2023-03-07 08:21:23,367][155126] Fps is (10 sec: 13107.4, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 35816448. Throughput: 0: 13044.2. Samples: 35795853. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:21:23,367][155126] Avg episode reward: [(0, '1922.388')] [2023-03-07 08:21:23,613][155452] Updated weights for policy 0, policy_version 34980 (0.0006) [2023-03-07 08:21:24,394][155452] Updated weights for policy 0, policy_version 34990 (0.0005) [2023-03-07 08:21:25,162][155452] Updated weights for policy 0, policy_version 35000 (0.0006) [2023-03-07 08:21:25,955][155452] Updated weights for policy 0, policy_version 35010 (0.0007) [2023-03-07 08:21:26,730][155452] Updated weights for policy 0, policy_version 35020 (0.0006) [2023-03-07 08:21:27,512][155452] Updated weights for policy 0, policy_version 35030 (0.0005) [2023-03-07 08:21:28,300][155452] Updated weights for policy 0, policy_version 35040 (0.0006) [2023-03-07 08:21:28,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13055.1). Total num frames: 35880960. Throughput: 0: 13051.2. Samples: 35874434. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:21:28,368][155126] Avg episode reward: [(0, '1888.747')] [2023-03-07 08:21:28,386][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000035041_35881984.pth... [2023-03-07 08:21:28,417][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000031980_32747520.pth [2023-03-07 08:21:29,095][155452] Updated weights for policy 0, policy_version 35050 (0.0006) [2023-03-07 08:21:29,890][155452] Updated weights for policy 0, policy_version 35060 (0.0006) [2023-03-07 08:21:30,670][155452] Updated weights for policy 0, policy_version 35070 (0.0006) [2023-03-07 08:21:31,457][155452] Updated weights for policy 0, policy_version 35080 (0.0007) [2023-03-07 08:21:32,247][155452] Updated weights for policy 0, policy_version 35090 (0.0006) [2023-03-07 08:21:33,015][155452] Updated weights for policy 0, policy_version 35100 (0.0006) [2023-03-07 08:21:33,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 35946496. Throughput: 0: 13047.3. Samples: 35913250. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:21:33,367][155126] Avg episode reward: [(0, '2153.318')] [2023-03-07 08:21:33,813][155452] Updated weights for policy 0, policy_version 35110 (0.0007) [2023-03-07 08:21:34,600][155452] Updated weights for policy 0, policy_version 35120 (0.0006) [2023-03-07 08:21:35,361][155452] Updated weights for policy 0, policy_version 35130 (0.0006) [2023-03-07 08:21:36,169][155452] Updated weights for policy 0, policy_version 35140 (0.0006) [2023-03-07 08:21:36,949][155452] Updated weights for policy 0, policy_version 35150 (0.0006) [2023-03-07 08:21:37,721][155452] Updated weights for policy 0, policy_version 35160 (0.0006) [2023-03-07 08:21:38,367][155126] Fps is (10 sec: 13107.4, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 36012032. Throughput: 0: 13042.8. 
Samples: 35991515. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:21:38,367][155126] Avg episode reward: [(0, '1927.636')] [2023-03-07 08:21:38,509][155452] Updated weights for policy 0, policy_version 35170 (0.0005) [2023-03-07 08:21:39,308][155452] Updated weights for policy 0, policy_version 35180 (0.0006) [2023-03-07 08:21:40,079][155452] Updated weights for policy 0, policy_version 35190 (0.0006) [2023-03-07 08:21:40,860][155452] Updated weights for policy 0, policy_version 35200 (0.0006) [2023-03-07 08:21:41,641][155452] Updated weights for policy 0, policy_version 35210 (0.0006) [2023-03-07 08:21:42,410][155452] Updated weights for policy 0, policy_version 35220 (0.0006) [2023-03-07 08:21:43,197][155452] Updated weights for policy 0, policy_version 35230 (0.0006) [2023-03-07 08:21:43,367][155126] Fps is (10 sec: 13107.4, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 36077568. Throughput: 0: 13047.4. Samples: 36070092. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:21:43,367][155126] Avg episode reward: [(0, '1997.294')] [2023-03-07 08:21:44,006][155452] Updated weights for policy 0, policy_version 35240 (0.0006) [2023-03-07 08:21:44,789][155452] Updated weights for policy 0, policy_version 35250 (0.0006) [2023-03-07 08:21:45,577][155452] Updated weights for policy 0, policy_version 35260 (0.0006) [2023-03-07 08:21:46,364][155452] Updated weights for policy 0, policy_version 35270 (0.0007) [2023-03-07 08:21:47,150][155452] Updated weights for policy 0, policy_version 35280 (0.0006) [2023-03-07 08:21:47,921][155452] Updated weights for policy 0, policy_version 35290 (0.0006) [2023-03-07 08:21:48,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13055.1). Total num frames: 36142080. Throughput: 0: 13042.6. Samples: 36109100. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:21:48,367][155126] Avg episode reward: [(0, '1813.338')] [2023-03-07 08:21:48,730][155452] Updated weights for policy 0, policy_version 35300 (0.0006) [2023-03-07 08:21:49,505][155452] Updated weights for policy 0, policy_version 35310 (0.0006) [2023-03-07 08:21:50,277][155452] Updated weights for policy 0, policy_version 35320 (0.0007) [2023-03-07 08:21:51,078][155452] Updated weights for policy 0, policy_version 35330 (0.0006) [2023-03-07 08:21:51,850][155452] Updated weights for policy 0, policy_version 35340 (0.0006) [2023-03-07 08:21:52,636][155452] Updated weights for policy 0, policy_version 35350 (0.0006) [2023-03-07 08:21:53,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13058.6). Total num frames: 36207616. Throughput: 0: 13045.2. Samples: 36187410. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:21:53,367][155126] Avg episode reward: [(0, '1910.253')] [2023-03-07 08:21:53,409][155452] Updated weights for policy 0, policy_version 35360 (0.0006) [2023-03-07 08:21:54,188][155452] Updated weights for policy 0, policy_version 35370 (0.0007) [2023-03-07 08:21:55,026][155452] Updated weights for policy 0, policy_version 35380 (0.0006) [2023-03-07 08:21:55,777][155452] Updated weights for policy 0, policy_version 35390 (0.0007) [2023-03-07 08:21:56,560][155452] Updated weights for policy 0, policy_version 35400 (0.0006) [2023-03-07 08:21:57,356][155452] Updated weights for policy 0, policy_version 35410 (0.0006) [2023-03-07 08:21:58,142][155452] Updated weights for policy 0, policy_version 35420 (0.0006) [2023-03-07 08:21:58,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13039.0, 300 sec: 13055.1). Total num frames: 36272128. Throughput: 0: 13047.4. Samples: 36265526. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:21:58,367][155126] Avg episode reward: [(0, '1893.979')] [2023-03-07 08:21:58,931][155452] Updated weights for policy 0, policy_version 35430 (0.0007) [2023-03-07 08:21:59,707][155452] Updated weights for policy 0, policy_version 35440 (0.0006) [2023-03-07 08:22:00,500][155452] Updated weights for policy 0, policy_version 35450 (0.0006) [2023-03-07 08:22:01,294][155452] Updated weights for policy 0, policy_version 35460 (0.0006) [2023-03-07 08:22:02,061][155452] Updated weights for policy 0, policy_version 35470 (0.0006) [2023-03-07 08:22:02,845][155452] Updated weights for policy 0, policy_version 35480 (0.0006) [2023-03-07 08:22:03,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13058.6). Total num frames: 36337664. Throughput: 0: 13037.2. Samples: 36304550. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:22:03,367][155126] Avg episode reward: [(0, '2009.946')] [2023-03-07 08:22:03,659][155452] Updated weights for policy 0, policy_version 35490 (0.0007) [2023-03-07 08:22:04,438][155452] Updated weights for policy 0, policy_version 35500 (0.0006) [2023-03-07 08:22:05,199][155452] Updated weights for policy 0, policy_version 35510 (0.0006) [2023-03-07 08:22:06,016][155452] Updated weights for policy 0, policy_version 35520 (0.0007) [2023-03-07 08:22:06,813][155452] Updated weights for policy 0, policy_version 35530 (0.0006) [2023-03-07 08:22:07,593][155452] Updated weights for policy 0, policy_version 35540 (0.0006) [2023-03-07 08:22:08,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13055.1). Total num frames: 36402176. Throughput: 0: 13039.4. Samples: 36382629. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:22:08,367][155126] Avg episode reward: [(0, '1859.117')] [2023-03-07 08:22:08,392][155452] Updated weights for policy 0, policy_version 35550 (0.0006) [2023-03-07 08:22:09,179][155452] Updated weights for policy 0, policy_version 35560 (0.0006) [2023-03-07 08:22:09,963][155452] Updated weights for policy 0, policy_version 35570 (0.0006) [2023-03-07 08:22:10,767][155452] Updated weights for policy 0, policy_version 35580 (0.0006) [2023-03-07 08:22:11,539][155452] Updated weights for policy 0, policy_version 35590 (0.0006) [2023-03-07 08:22:12,319][155452] Updated weights for policy 0, policy_version 35600 (0.0007) [2023-03-07 08:22:13,117][155452] Updated weights for policy 0, policy_version 35610 (0.0007) [2023-03-07 08:22:13,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13055.1). Total num frames: 36467712. Throughput: 0: 13027.6. Samples: 36460678. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:22:13,367][155126] Avg episode reward: [(0, '1807.308')] [2023-03-07 08:22:13,894][155452] Updated weights for policy 0, policy_version 35620 (0.0007) [2023-03-07 08:22:14,678][155452] Updated weights for policy 0, policy_version 35630 (0.0005) [2023-03-07 08:22:15,467][155452] Updated weights for policy 0, policy_version 35640 (0.0006) [2023-03-07 08:22:16,243][155452] Updated weights for policy 0, policy_version 35650 (0.0006) [2023-03-07 08:22:17,030][155452] Updated weights for policy 0, policy_version 35660 (0.0006) [2023-03-07 08:22:17,810][155452] Updated weights for policy 0, policy_version 35670 (0.0006) [2023-03-07 08:22:18,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13055.1). Total num frames: 36533248. Throughput: 0: 13033.7. Samples: 36499769. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:22:18,368][155126] Avg episode reward: [(0, '1914.171')] [2023-03-07 08:22:18,609][155452] Updated weights for policy 0, policy_version 35680 (0.0006) [2023-03-07 08:22:19,372][155452] Updated weights for policy 0, policy_version 35690 (0.0006) [2023-03-07 08:22:20,172][155452] Updated weights for policy 0, policy_version 35700 (0.0006) [2023-03-07 08:22:20,970][155452] Updated weights for policy 0, policy_version 35710 (0.0006) [2023-03-07 08:22:21,730][155452] Updated weights for policy 0, policy_version 35720 (0.0006) [2023-03-07 08:22:22,513][155452] Updated weights for policy 0, policy_version 35730 (0.0007) [2023-03-07 08:22:23,308][155452] Updated weights for policy 0, policy_version 35740 (0.0007) [2023-03-07 08:22:23,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.8, 300 sec: 13051.7). Total num frames: 36597760. Throughput: 0: 13034.9. Samples: 36578089. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:22:23,368][155126] Avg episode reward: [(0, '2054.318')] [2023-03-07 08:22:24,089][155452] Updated weights for policy 0, policy_version 35750 (0.0006) [2023-03-07 08:22:24,879][155452] Updated weights for policy 0, policy_version 35760 (0.0006) [2023-03-07 08:22:25,662][155452] Updated weights for policy 0, policy_version 35770 (0.0006) [2023-03-07 08:22:26,437][155452] Updated weights for policy 0, policy_version 35780 (0.0006) [2023-03-07 08:22:27,211][155452] Updated weights for policy 0, policy_version 35790 (0.0006) [2023-03-07 08:22:27,986][155452] Updated weights for policy 0, policy_version 35800 (0.0005) [2023-03-07 08:22:28,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 36663296. Throughput: 0: 13033.0. Samples: 36656582. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:22:28,368][155126] Avg episode reward: [(0, '1888.493')] [2023-03-07 08:22:28,767][155452] Updated weights for policy 0, policy_version 35810 (0.0006) [2023-03-07 08:22:29,549][155452] Updated weights for policy 0, policy_version 35820 (0.0006) [2023-03-07 08:22:30,338][155452] Updated weights for policy 0, policy_version 35830 (0.0006) [2023-03-07 08:22:31,138][155452] Updated weights for policy 0, policy_version 35840 (0.0006) [2023-03-07 08:22:31,915][155452] Updated weights for policy 0, policy_version 35850 (0.0006) [2023-03-07 08:22:32,705][155452] Updated weights for policy 0, policy_version 35860 (0.0006) [2023-03-07 08:22:33,367][155126] Fps is (10 sec: 13107.4, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 36728832. Throughput: 0: 13035.6. Samples: 36695703. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:22:33,367][155126] Avg episode reward: [(0, '1748.931')] [2023-03-07 08:22:33,499][155452] Updated weights for policy 0, policy_version 35870 (0.0006) [2023-03-07 08:22:34,300][155452] Updated weights for policy 0, policy_version 35880 (0.0006) [2023-03-07 08:22:35,092][155452] Updated weights for policy 0, policy_version 35890 (0.0006) [2023-03-07 08:22:35,869][155452] Updated weights for policy 0, policy_version 35900 (0.0006) [2023-03-07 08:22:36,652][155452] Updated weights for policy 0, policy_version 35910 (0.0006) [2023-03-07 08:22:37,431][155452] Updated weights for policy 0, policy_version 35920 (0.0006) [2023-03-07 08:22:38,240][155452] Updated weights for policy 0, policy_version 35930 (0.0006) [2023-03-07 08:22:38,367][155126] Fps is (10 sec: 13005.1, 60 sec: 13021.9, 300 sec: 13048.2). Total num frames: 36793344. Throughput: 0: 13027.8. Samples: 36773659. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:22:38,367][155126] Avg episode reward: [(0, '1757.112')] [2023-03-07 08:22:39,003][155452] Updated weights for policy 0, policy_version 35940 (0.0006) [2023-03-07 08:22:39,796][155452] Updated weights for policy 0, policy_version 35950 (0.0007) [2023-03-07 08:22:40,573][155452] Updated weights for policy 0, policy_version 35960 (0.0006) [2023-03-07 08:22:41,365][155452] Updated weights for policy 0, policy_version 35970 (0.0007) [2023-03-07 08:22:42,162][155452] Updated weights for policy 0, policy_version 35980 (0.0006) [2023-03-07 08:22:42,966][155452] Updated weights for policy 0, policy_version 35990 (0.0006) [2023-03-07 08:22:43,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.8, 300 sec: 13051.7). Total num frames: 36858880. Throughput: 0: 13027.0. Samples: 36851743. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:22:43,378][155126] Avg episode reward: [(0, '1819.396')] [2023-03-07 08:22:43,747][155452] Updated weights for policy 0, policy_version 36000 (0.0007) [2023-03-07 08:22:44,535][155452] Updated weights for policy 0, policy_version 36010 (0.0006) [2023-03-07 08:22:45,300][155452] Updated weights for policy 0, policy_version 36020 (0.0007) [2023-03-07 08:22:46,080][155452] Updated weights for policy 0, policy_version 36030 (0.0006) [2023-03-07 08:22:46,861][155452] Updated weights for policy 0, policy_version 36040 (0.0005) [2023-03-07 08:22:47,637][155452] Updated weights for policy 0, policy_version 36050 (0.0007) [2023-03-07 08:22:48,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 36924416. Throughput: 0: 13030.3. Samples: 36890913. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:22:48,367][155126] Avg episode reward: [(0, '1841.735')] [2023-03-07 08:22:48,418][155452] Updated weights for policy 0, policy_version 36060 (0.0006) [2023-03-07 08:22:49,200][155452] Updated weights for policy 0, policy_version 36070 (0.0007) [2023-03-07 08:22:49,996][155452] Updated weights for policy 0, policy_version 36080 (0.0007) [2023-03-07 08:22:50,778][155452] Updated weights for policy 0, policy_version 36090 (0.0007) [2023-03-07 08:22:51,560][155452] Updated weights for policy 0, policy_version 36100 (0.0006) [2023-03-07 08:22:52,337][155452] Updated weights for policy 0, policy_version 36110 (0.0005) [2023-03-07 08:22:53,134][155452] Updated weights for policy 0, policy_version 36120 (0.0006) [2023-03-07 08:22:53,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13039.0, 300 sec: 13055.1). Total num frames: 36989952. Throughput: 0: 13042.0. Samples: 36969519. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:22:53,367][155126] Avg episode reward: [(0, '2003.008')] [2023-03-07 08:22:53,917][155452] Updated weights for policy 0, policy_version 36130 (0.0007) [2023-03-07 08:22:54,713][155452] Updated weights for policy 0, policy_version 36140 (0.0007) [2023-03-07 08:22:55,493][155452] Updated weights for policy 0, policy_version 36150 (0.0006) [2023-03-07 08:22:56,270][155452] Updated weights for policy 0, policy_version 36160 (0.0006) [2023-03-07 08:22:57,054][155452] Updated weights for policy 0, policy_version 36170 (0.0006) [2023-03-07 08:22:57,849][155452] Updated weights for policy 0, policy_version 36180 (0.0006) [2023-03-07 08:22:58,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 37054464. Throughput: 0: 13042.9. Samples: 37047606. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:22:58,367][155126] Avg episode reward: [(0, '1797.569')] [2023-03-07 08:22:58,637][155452] Updated weights for policy 0, policy_version 36190 (0.0006) [2023-03-07 08:22:59,412][155452] Updated weights for policy 0, policy_version 36200 (0.0007) [2023-03-07 08:23:00,197][155452] Updated weights for policy 0, policy_version 36210 (0.0006) [2023-03-07 08:23:00,980][155452] Updated weights for policy 0, policy_version 36220 (0.0006) [2023-03-07 08:23:01,783][155452] Updated weights for policy 0, policy_version 36230 (0.0006) [2023-03-07 08:23:02,551][155452] Updated weights for policy 0, policy_version 36240 (0.0006) [2023-03-07 08:23:03,343][155452] Updated weights for policy 0, policy_version 36250 (0.0007) [2023-03-07 08:23:03,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13038.9, 300 sec: 13055.1). Total num frames: 37120000. Throughput: 0: 13039.4. Samples: 37086542. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:23:03,368][155126] Avg episode reward: [(0, '1847.333')] [2023-03-07 08:23:04,123][155452] Updated weights for policy 0, policy_version 36260 (0.0005) [2023-03-07 08:23:04,929][155452] Updated weights for policy 0, policy_version 36270 (0.0006) [2023-03-07 08:23:05,697][155452] Updated weights for policy 0, policy_version 36280 (0.0006) [2023-03-07 08:23:06,495][155452] Updated weights for policy 0, policy_version 36290 (0.0006) [2023-03-07 08:23:07,292][155452] Updated weights for policy 0, policy_version 36300 (0.0006) [2023-03-07 08:23:08,063][155452] Updated weights for policy 0, policy_version 36310 (0.0006) [2023-03-07 08:23:08,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 37184512. Throughput: 0: 13040.3. Samples: 37164903. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:23:08,368][155126] Avg episode reward: [(0, '1852.517')] [2023-03-07 08:23:08,863][155452] Updated weights for policy 0, policy_version 36320 (0.0006) [2023-03-07 08:23:09,645][155452] Updated weights for policy 0, policy_version 36330 (0.0006) [2023-03-07 08:23:10,430][155452] Updated weights for policy 0, policy_version 36340 (0.0007) [2023-03-07 08:23:11,225][155452] Updated weights for policy 0, policy_version 36350 (0.0006) [2023-03-07 08:23:12,017][155452] Updated weights for policy 0, policy_version 36360 (0.0006) [2023-03-07 08:23:12,804][155452] Updated weights for policy 0, policy_version 36370 (0.0005) [2023-03-07 08:23:13,367][155126] Fps is (10 sec: 12902.5, 60 sec: 13021.9, 300 sec: 13048.2). Total num frames: 37249024. Throughput: 0: 13021.1. Samples: 37242532. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:23:13,367][155126] Avg episode reward: [(0, '1779.684')] [2023-03-07 08:23:13,618][155452] Updated weights for policy 0, policy_version 36380 (0.0007) [2023-03-07 08:23:14,418][155452] Updated weights for policy 0, policy_version 36390 (0.0006) [2023-03-07 08:23:15,209][155452] Updated weights for policy 0, policy_version 36400 (0.0006) [2023-03-07 08:23:15,979][155452] Updated weights for policy 0, policy_version 36410 (0.0007) [2023-03-07 08:23:16,768][155452] Updated weights for policy 0, policy_version 36420 (0.0006) [2023-03-07 08:23:17,547][155452] Updated weights for policy 0, policy_version 36430 (0.0005) [2023-03-07 08:23:18,323][155452] Updated weights for policy 0, policy_version 36440 (0.0006) [2023-03-07 08:23:18,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13048.2). Total num frames: 37314560. Throughput: 0: 13013.5. Samples: 37281310. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:23:18,367][155126] Avg episode reward: [(0, '1781.410')] [2023-03-07 08:23:19,109][155452] Updated weights for policy 0, policy_version 36450 (0.0007) [2023-03-07 08:23:19,885][155452] Updated weights for policy 0, policy_version 36460 (0.0006) [2023-03-07 08:23:20,677][155452] Updated weights for policy 0, policy_version 36470 (0.0006) [2023-03-07 08:23:21,470][155452] Updated weights for policy 0, policy_version 36480 (0.0007) [2023-03-07 08:23:22,242][155452] Updated weights for policy 0, policy_version 36490 (0.0006) [2023-03-07 08:23:23,011][155452] Updated weights for policy 0, policy_version 36500 (0.0006) [2023-03-07 08:23:23,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13039.0, 300 sec: 13048.2). Total num frames: 37380096. Throughput: 0: 13028.5. Samples: 37359943. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:23:23,367][155126] Avg episode reward: [(0, '1994.319')] [2023-03-07 08:23:23,806][155452] Updated weights for policy 0, policy_version 36510 (0.0007) [2023-03-07 08:23:24,594][155452] Updated weights for policy 0, policy_version 36520 (0.0006) [2023-03-07 08:23:25,378][155452] Updated weights for policy 0, policy_version 36530 (0.0006) [2023-03-07 08:23:26,152][155452] Updated weights for policy 0, policy_version 36540 (0.0007) [2023-03-07 08:23:26,954][155452] Updated weights for policy 0, policy_version 36550 (0.0006) [2023-03-07 08:23:27,744][155452] Updated weights for policy 0, policy_version 36560 (0.0006) [2023-03-07 08:23:28,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 37445632. Throughput: 0: 13034.2. Samples: 37438282. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:23:28,368][155126] Avg episode reward: [(0, '1894.597')] [2023-03-07 08:23:28,371][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000036568_37445632.pth... 
[2023-03-07 08:23:28,402][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000033510_34314240.pth [2023-03-07 08:23:28,518][155452] Updated weights for policy 0, policy_version 36570 (0.0006) [2023-03-07 08:23:29,299][155452] Updated weights for policy 0, policy_version 36580 (0.0006) [2023-03-07 08:23:30,101][155452] Updated weights for policy 0, policy_version 36590 (0.0006) [2023-03-07 08:23:30,868][155452] Updated weights for policy 0, policy_version 36600 (0.0006) [2023-03-07 08:23:31,655][155452] Updated weights for policy 0, policy_version 36610 (0.0006) [2023-03-07 08:23:32,428][155452] Updated weights for policy 0, policy_version 36620 (0.0005) [2023-03-07 08:23:33,223][155452] Updated weights for policy 0, policy_version 36630 (0.0005) [2023-03-07 08:23:33,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13048.2). Total num frames: 37510144. Throughput: 0: 13030.7. Samples: 37477295. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:23:33,367][155126] Avg episode reward: [(0, '2051.097')] [2023-03-07 08:23:34,004][155452] Updated weights for policy 0, policy_version 36640 (0.0007) [2023-03-07 08:23:34,799][155452] Updated weights for policy 0, policy_version 36650 (0.0006) [2023-03-07 08:23:35,574][155452] Updated weights for policy 0, policy_version 36660 (0.0007) [2023-03-07 08:23:36,367][155452] Updated weights for policy 0, policy_version 36670 (0.0006) [2023-03-07 08:23:37,151][155452] Updated weights for policy 0, policy_version 36680 (0.0006) [2023-03-07 08:23:37,924][155452] Updated weights for policy 0, policy_version 36690 (0.0006) [2023-03-07 08:23:38,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13038.9, 300 sec: 13048.2). Total num frames: 37575680. Throughput: 0: 13023.7. Samples: 37555587. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:23:38,367][155126] Avg episode reward: [(0, '1983.423')] [2023-03-07 08:23:38,698][155452] Updated weights for policy 0, policy_version 36700 (0.0007) [2023-03-07 08:23:39,504][155452] Updated weights for policy 0, policy_version 36710 (0.0006) [2023-03-07 08:23:40,290][155452] Updated weights for policy 0, policy_version 36720 (0.0006) [2023-03-07 08:23:41,072][155452] Updated weights for policy 0, policy_version 36730 (0.0006) [2023-03-07 08:23:41,852][155452] Updated weights for policy 0, policy_version 36740 (0.0006) [2023-03-07 08:23:42,635][155452] Updated weights for policy 0, policy_version 36750 (0.0006) [2023-03-07 08:23:43,367][155126] Fps is (10 sec: 13107.0, 60 sec: 13038.9, 300 sec: 13051.7). Total num frames: 37641216. Throughput: 0: 13034.0. Samples: 37634138. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:23:43,367][155126] Avg episode reward: [(0, '1855.761')] [2023-03-07 08:23:43,424][155452] Updated weights for policy 0, policy_version 36760 (0.0006) [2023-03-07 08:23:44,221][155452] Updated weights for policy 0, policy_version 36770 (0.0006) [2023-03-07 08:23:45,011][155452] Updated weights for policy 0, policy_version 36780 (0.0006) [2023-03-07 08:23:45,799][155452] Updated weights for policy 0, policy_version 36790 (0.0006) [2023-03-07 08:23:46,581][155452] Updated weights for policy 0, policy_version 36800 (0.0006) [2023-03-07 08:23:47,351][155452] Updated weights for policy 0, policy_version 36810 (0.0006) [2023-03-07 08:23:48,138][155452] Updated weights for policy 0, policy_version 36820 (0.0006) [2023-03-07 08:23:48,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13048.2). Total num frames: 37705728. Throughput: 0: 13032.0. Samples: 37672979. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:23:48,367][155126] Avg episode reward: [(0, '1896.562')] [2023-03-07 08:23:48,925][155452] Updated weights for policy 0, policy_version 36830 (0.0006) [2023-03-07 08:23:49,717][155452] Updated weights for policy 0, policy_version 36840 (0.0006) [2023-03-07 08:23:50,490][155452] Updated weights for policy 0, policy_version 36850 (0.0006) [2023-03-07 08:23:51,297][155452] Updated weights for policy 0, policy_version 36860 (0.0006) [2023-03-07 08:23:52,077][155452] Updated weights for policy 0, policy_version 36870 (0.0006) [2023-03-07 08:23:52,858][155452] Updated weights for policy 0, policy_version 36880 (0.0006) [2023-03-07 08:23:53,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13048.2). Total num frames: 37771264. Throughput: 0: 13030.5. Samples: 37751272. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:23:53,367][155126] Avg episode reward: [(0, '1653.406')] [2023-03-07 08:23:53,647][155452] Updated weights for policy 0, policy_version 36890 (0.0007) [2023-03-07 08:23:54,421][155452] Updated weights for policy 0, policy_version 36900 (0.0006) [2023-03-07 08:23:55,206][155452] Updated weights for policy 0, policy_version 36910 (0.0006) [2023-03-07 08:23:55,996][155452] Updated weights for policy 0, policy_version 36920 (0.0006) [2023-03-07 08:23:56,767][155452] Updated weights for policy 0, policy_version 36930 (0.0006) [2023-03-07 08:23:57,546][155452] Updated weights for policy 0, policy_version 36940 (0.0006) [2023-03-07 08:23:58,338][155452] Updated weights for policy 0, policy_version 36950 (0.0007) [2023-03-07 08:23:58,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13048.2). Total num frames: 37836800. Throughput: 0: 13049.1. Samples: 37829743. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:23:58,367][155126] Avg episode reward: [(0, '1582.159')] [2023-03-07 08:23:59,121][155452] Updated weights for policy 0, policy_version 36960 (0.0006) [2023-03-07 08:23:59,915][155452] Updated weights for policy 0, policy_version 36970 (0.0006) [2023-03-07 08:24:00,688][155452] Updated weights for policy 0, policy_version 36980 (0.0007) [2023-03-07 08:24:01,496][155452] Updated weights for policy 0, policy_version 36990 (0.0008) [2023-03-07 08:24:02,278][155452] Updated weights for policy 0, policy_version 37000 (0.0006) [2023-03-07 08:24:03,068][155452] Updated weights for policy 0, policy_version 37010 (0.0007) [2023-03-07 08:24:03,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13044.7). Total num frames: 37901312. Throughput: 0: 13051.9. Samples: 37868644. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:24:03,367][155126] Avg episode reward: [(0, '1695.765')] [2023-03-07 08:24:03,864][155452] Updated weights for policy 0, policy_version 37020 (0.0006) [2023-03-07 08:24:04,630][155452] Updated weights for policy 0, policy_version 37030 (0.0006) [2023-03-07 08:24:05,422][155452] Updated weights for policy 0, policy_version 37040 (0.0006) [2023-03-07 08:24:06,217][155452] Updated weights for policy 0, policy_version 37050 (0.0006) [2023-03-07 08:24:06,986][155452] Updated weights for policy 0, policy_version 37060 (0.0006) [2023-03-07 08:24:07,756][155452] Updated weights for policy 0, policy_version 37070 (0.0006) [2023-03-07 08:24:08,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13039.0, 300 sec: 13044.7). Total num frames: 37966848. Throughput: 0: 13046.4. Samples: 37947034. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:24:08,367][155126] Avg episode reward: [(0, '1799.526')] [2023-03-07 08:24:08,547][155452] Updated weights for policy 0, policy_version 37080 (0.0007) [2023-03-07 08:24:09,332][155452] Updated weights for policy 0, policy_version 37090 (0.0006) [2023-03-07 08:24:09,703][155401] KL-divergence is very high: 255.0155 [2023-03-07 08:24:10,105][155452] Updated weights for policy 0, policy_version 37100 (0.0006) [2023-03-07 08:24:10,881][155452] Updated weights for policy 0, policy_version 37110 (0.0006) [2023-03-07 08:24:11,264][155401] KL-divergence is very high: 10784.0264 [2023-03-07 08:24:11,661][155452] Updated weights for policy 0, policy_version 37120 (0.0006) [2023-03-07 08:24:12,284][155401] KL-divergence is very high: 22836.3633 [2023-03-07 08:24:12,440][155401] KL-divergence is very high: 862573.1250 [2023-03-07 08:24:12,447][155452] Updated weights for policy 0, policy_version 37130 (0.0006) [2023-03-07 08:24:12,830][155401] KL-divergence is very high: 647.4448 [2023-03-07 08:24:12,919][155401] KL-divergence is very high: 312.9594 [2023-03-07 08:24:13,229][155452] Updated weights for policy 0, policy_version 37140 (0.0006) [2023-03-07 08:24:13,302][155401] KL-divergence is very high: 1202.9775 [2023-03-07 08:24:13,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13044.7). Total num frames: 38032384. Throughput: 0: 13051.7. Samples: 38025605. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:24:13,367][155126] Avg episode reward: [(0, '1870.615')] [2023-03-07 08:24:13,922][155401] KL-divergence is very high: 1246.6958 [2023-03-07 08:24:14,004][155452] Updated weights for policy 0, policy_version 37150 (0.0006) [2023-03-07 08:24:14,796][155452] Updated weights for policy 0, policy_version 37160 (0.0007) [2023-03-07 08:24:15,585][155452] Updated weights for policy 0, policy_version 37170 (0.0006) [2023-03-07 08:24:16,349][155401] KL-divergence is very high: 2581.5032 [2023-03-07 08:24:16,357][155452] Updated weights for policy 0, policy_version 37180 (0.0006) [2023-03-07 08:24:17,135][155452] Updated weights for policy 0, policy_version 37190 (0.0006) [2023-03-07 08:24:17,920][155452] Updated weights for policy 0, policy_version 37200 (0.0006) [2023-03-07 08:24:18,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13044.7). Total num frames: 38097920. Throughput: 0: 13061.8. Samples: 38065075. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:24:18,367][155126] Avg episode reward: [(0, '1801.793')] [2023-03-07 08:24:18,699][155452] Updated weights for policy 0, policy_version 37210 (0.0006) [2023-03-07 08:24:19,485][155452] Updated weights for policy 0, policy_version 37220 (0.0006) [2023-03-07 08:24:20,280][155452] Updated weights for policy 0, policy_version 37230 (0.0007) [2023-03-07 08:24:21,058][155452] Updated weights for policy 0, policy_version 37240 (0.0005) [2023-03-07 08:24:21,839][155452] Updated weights for policy 0, policy_version 37250 (0.0006) [2023-03-07 08:24:22,628][155452] Updated weights for policy 0, policy_version 37260 (0.0007) [2023-03-07 08:24:23,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 38163456. Throughput: 0: 13061.6. Samples: 38143358. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:24:23,378][155126] Avg episode reward: [(0, '1941.573')] [2023-03-07 08:24:23,407][155452] Updated weights for policy 0, policy_version 37270 (0.0007) [2023-03-07 08:24:23,549][155401] KL-divergence is very high: 322.3274 [2023-03-07 08:24:24,189][155452] Updated weights for policy 0, policy_version 37280 (0.0005) [2023-03-07 08:24:24,960][155452] Updated weights for policy 0, policy_version 37290 (0.0006) [2023-03-07 08:24:25,734][155452] Updated weights for policy 0, policy_version 37300 (0.0006) [2023-03-07 08:24:26,540][155452] Updated weights for policy 0, policy_version 37310 (0.0006) [2023-03-07 08:24:27,311][155452] Updated weights for policy 0, policy_version 37320 (0.0006) [2023-03-07 08:24:28,109][155452] Updated weights for policy 0, policy_version 37330 (0.0006) [2023-03-07 08:24:28,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 38228992. Throughput: 0: 13061.9. Samples: 38221923. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:24:28,367][155126] Avg episode reward: [(0, '1858.183')] [2023-03-07 08:24:28,897][155452] Updated weights for policy 0, policy_version 37340 (0.0006) [2023-03-07 08:24:29,670][155452] Updated weights for policy 0, policy_version 37350 (0.0007) [2023-03-07 08:24:30,466][155452] Updated weights for policy 0, policy_version 37360 (0.0006) [2023-03-07 08:24:31,242][155452] Updated weights for policy 0, policy_version 37370 (0.0006) [2023-03-07 08:24:32,034][155452] Updated weights for policy 0, policy_version 37380 (0.0006) [2023-03-07 08:24:32,817][155452] Updated weights for policy 0, policy_version 37390 (0.0006) [2023-03-07 08:24:33,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13048.2). Total num frames: 38294528. Throughput: 0: 13068.3. Samples: 38261054. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:24:33,367][155126] Avg episode reward: [(0, '1949.848')] [2023-03-07 08:24:33,604][155452] Updated weights for policy 0, policy_version 37400 (0.0006) [2023-03-07 08:24:34,382][155452] Updated weights for policy 0, policy_version 37410 (0.0007) [2023-03-07 08:24:35,198][155452] Updated weights for policy 0, policy_version 37420 (0.0007) [2023-03-07 08:24:35,969][155452] Updated weights for policy 0, policy_version 37430 (0.0006) [2023-03-07 08:24:36,725][155452] Updated weights for policy 0, policy_version 37440 (0.0006) [2023-03-07 08:24:37,525][155452] Updated weights for policy 0, policy_version 37450 (0.0006) [2023-03-07 08:24:38,281][155452] Updated weights for policy 0, policy_version 37460 (0.0006) [2023-03-07 08:24:38,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13044.7). Total num frames: 38359040. Throughput: 0: 13068.3. Samples: 38339344. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:24:38,367][155126] Avg episode reward: [(0, '2061.981')] [2023-03-07 08:24:39,079][155452] Updated weights for policy 0, policy_version 37470 (0.0006) [2023-03-07 08:24:39,866][155452] Updated weights for policy 0, policy_version 37480 (0.0006) [2023-03-07 08:24:40,626][155452] Updated weights for policy 0, policy_version 37490 (0.0006) [2023-03-07 08:24:41,413][155452] Updated weights for policy 0, policy_version 37500 (0.0006) [2023-03-07 08:24:42,189][155452] Updated weights for policy 0, policy_version 37510 (0.0006) [2023-03-07 08:24:42,977][155452] Updated weights for policy 0, policy_version 37520 (0.0006) [2023-03-07 08:24:43,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 38424576. Throughput: 0: 13073.9. Samples: 38418067. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:24:43,368][155126] Avg episode reward: [(0, '1834.459')] [2023-03-07 08:24:43,765][155452] Updated weights for policy 0, policy_version 37530 (0.0006) [2023-03-07 08:24:44,562][155452] Updated weights for policy 0, policy_version 37540 (0.0007) [2023-03-07 08:24:45,349][155452] Updated weights for policy 0, policy_version 37550 (0.0007) [2023-03-07 08:24:46,137][155452] Updated weights for policy 0, policy_version 37560 (0.0006) [2023-03-07 08:24:46,927][155452] Updated weights for policy 0, policy_version 37570 (0.0006) [2023-03-07 08:24:47,710][155452] Updated weights for policy 0, policy_version 37580 (0.0006) [2023-03-07 08:24:48,367][155126] Fps is (10 sec: 13107.0, 60 sec: 13073.0, 300 sec: 13048.2). Total num frames: 38490112. Throughput: 0: 13076.0. Samples: 38457064. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:24:48,368][155126] Avg episode reward: [(0, '1857.043')] [2023-03-07 08:24:48,489][155452] Updated weights for policy 0, policy_version 37590 (0.0006) [2023-03-07 08:24:49,288][155452] Updated weights for policy 0, policy_version 37600 (0.0006) [2023-03-07 08:24:50,072][155452] Updated weights for policy 0, policy_version 37610 (0.0006) [2023-03-07 08:24:50,858][155452] Updated weights for policy 0, policy_version 37620 (0.0006) [2023-03-07 08:24:51,655][155452] Updated weights for policy 0, policy_version 37630 (0.0007) [2023-03-07 08:24:52,439][155452] Updated weights for policy 0, policy_version 37640 (0.0006) [2023-03-07 08:24:53,225][155452] Updated weights for policy 0, policy_version 37650 (0.0006) [2023-03-07 08:24:53,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13044.7). Total num frames: 38554624. Throughput: 0: 13069.3. Samples: 38535151. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:24:53,367][155126] Avg episode reward: [(0, '1705.320')] [2023-03-07 08:24:54,025][155452] Updated weights for policy 0, policy_version 37660 (0.0005) [2023-03-07 08:24:54,802][155452] Updated weights for policy 0, policy_version 37670 (0.0006) [2023-03-07 08:24:55,591][155452] Updated weights for policy 0, policy_version 37680 (0.0006) [2023-03-07 08:24:56,382][155452] Updated weights for policy 0, policy_version 37690 (0.0006) [2023-03-07 08:24:57,171][155452] Updated weights for policy 0, policy_version 37700 (0.0006) [2023-03-07 08:24:57,964][155452] Updated weights for policy 0, policy_version 37710 (0.0006) [2023-03-07 08:24:58,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13056.0, 300 sec: 13044.7). Total num frames: 38620160. Throughput: 0: 13051.7. Samples: 38612933. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:24:58,367][155126] Avg episode reward: [(0, '1796.562')] [2023-03-07 08:24:58,744][155452] Updated weights for policy 0, policy_version 37720 (0.0006) [2023-03-07 08:24:59,528][155452] Updated weights for policy 0, policy_version 37730 (0.0007) [2023-03-07 08:25:00,318][155452] Updated weights for policy 0, policy_version 37740 (0.0006) [2023-03-07 08:25:01,129][155452] Updated weights for policy 0, policy_version 37750 (0.0006) [2023-03-07 08:25:01,914][155452] Updated weights for policy 0, policy_version 37760 (0.0006) [2023-03-07 08:25:02,711][155452] Updated weights for policy 0, policy_version 37770 (0.0007) [2023-03-07 08:25:03,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13041.2). Total num frames: 38684672. Throughput: 0: 13040.8. Samples: 38651911. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:25:03,367][155126] Avg episode reward: [(0, '1659.840')] [2023-03-07 08:25:03,481][155452] Updated weights for policy 0, policy_version 37780 (0.0005) [2023-03-07 08:25:04,272][155452] Updated weights for policy 0, policy_version 37790 (0.0005) [2023-03-07 08:25:05,068][155452] Updated weights for policy 0, policy_version 37800 (0.0006) [2023-03-07 08:25:05,854][155452] Updated weights for policy 0, policy_version 37810 (0.0006) [2023-03-07 08:25:06,641][155452] Updated weights for policy 0, policy_version 37820 (0.0006) [2023-03-07 08:25:07,428][155452] Updated weights for policy 0, policy_version 37830 (0.0006) [2023-03-07 08:25:08,190][155452] Updated weights for policy 0, policy_version 37840 (0.0006) [2023-03-07 08:25:08,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13044.7). Total num frames: 38750208. Throughput: 0: 13032.7. Samples: 38729830. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:25:08,367][155126] Avg episode reward: [(0, '1823.814')] [2023-03-07 08:25:08,990][155452] Updated weights for policy 0, policy_version 37850 (0.0007) [2023-03-07 08:25:09,797][155452] Updated weights for policy 0, policy_version 37860 (0.0006) [2023-03-07 08:25:10,595][155452] Updated weights for policy 0, policy_version 37870 (0.0006) [2023-03-07 08:25:11,398][155452] Updated weights for policy 0, policy_version 37880 (0.0006) [2023-03-07 08:25:12,172][155452] Updated weights for policy 0, policy_version 37890 (0.0006) [2023-03-07 08:25:12,934][155452] Updated weights for policy 0, policy_version 37900 (0.0006) [2023-03-07 08:25:13,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13044.7). Total num frames: 38814720. Throughput: 0: 13016.0. Samples: 38807645. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:25:13,367][155126] Avg episode reward: [(0, '1991.836')] [2023-03-07 08:25:13,731][155452] Updated weights for policy 0, policy_version 37910 (0.0006) [2023-03-07 08:25:14,509][155452] Updated weights for policy 0, policy_version 37920 (0.0006) [2023-03-07 08:25:15,294][155452] Updated weights for policy 0, policy_version 37930 (0.0005) [2023-03-07 08:25:16,086][155452] Updated weights for policy 0, policy_version 37940 (0.0006) [2023-03-07 08:25:16,873][155452] Updated weights for policy 0, policy_version 37950 (0.0007) [2023-03-07 08:25:17,672][155452] Updated weights for policy 0, policy_version 37960 (0.0007) [2023-03-07 08:25:18,367][155126] Fps is (10 sec: 12902.3, 60 sec: 13021.9, 300 sec: 13037.8). Total num frames: 38879232. Throughput: 0: 13012.8. Samples: 38846629. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:25:18,367][155126] Avg episode reward: [(0, '1958.690')] [2023-03-07 08:25:18,467][155452] Updated weights for policy 0, policy_version 37970 (0.0006) [2023-03-07 08:25:19,237][155452] Updated weights for policy 0, policy_version 37980 (0.0006) [2023-03-07 08:25:20,030][155452] Updated weights for policy 0, policy_version 37990 (0.0007) [2023-03-07 08:25:20,809][155452] Updated weights for policy 0, policy_version 38000 (0.0007) [2023-03-07 08:25:21,592][155452] Updated weights for policy 0, policy_version 38010 (0.0006) [2023-03-07 08:25:22,393][155452] Updated weights for policy 0, policy_version 38020 (0.0006) [2023-03-07 08:25:23,182][155452] Updated weights for policy 0, policy_version 38030 (0.0006) [2023-03-07 08:25:23,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13037.8). Total num frames: 38944768. Throughput: 0: 13008.3. Samples: 38924716. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:25:23,367][155126] Avg episode reward: [(0, '2044.769')] [2023-03-07 08:25:23,937][155452] Updated weights for policy 0, policy_version 38040 (0.0006) [2023-03-07 08:25:24,729][155452] Updated weights for policy 0, policy_version 38050 (0.0006) [2023-03-07 08:25:25,511][155452] Updated weights for policy 0, policy_version 38060 (0.0007) [2023-03-07 08:25:26,321][155452] Updated weights for policy 0, policy_version 38070 (0.0006) [2023-03-07 08:25:27,119][155452] Updated weights for policy 0, policy_version 38080 (0.0006) [2023-03-07 08:25:27,881][155452] Updated weights for policy 0, policy_version 38090 (0.0006) [2023-03-07 08:25:28,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13021.8, 300 sec: 13041.2). Total num frames: 39010304. Throughput: 0: 12999.6. Samples: 39003050. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:25:28,367][155126] Avg episode reward: [(0, '1975.005')] [2023-03-07 08:25:28,372][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000038096_39010304.pth... [2023-03-07 08:25:28,405][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000035041_35881984.pth [2023-03-07 08:25:28,665][155452] Updated weights for policy 0, policy_version 38100 (0.0006) [2023-03-07 08:25:29,446][155452] Updated weights for policy 0, policy_version 38110 (0.0006) [2023-03-07 08:25:30,228][155452] Updated weights for policy 0, policy_version 38120 (0.0007) [2023-03-07 08:25:31,018][155452] Updated weights for policy 0, policy_version 38130 (0.0007) [2023-03-07 08:25:31,816][155452] Updated weights for policy 0, policy_version 38140 (0.0006) [2023-03-07 08:25:32,593][155452] Updated weights for policy 0, policy_version 38150 (0.0006) [2023-03-07 08:25:33,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13004.8, 300 sec: 13037.8). Total num frames: 39074816. Throughput: 0: 13001.1. 
Samples: 39042112. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:25:33,367][155126] Avg episode reward: [(0, '1960.915')] [2023-03-07 08:25:33,405][155452] Updated weights for policy 0, policy_version 38160 (0.0006) [2023-03-07 08:25:34,182][155452] Updated weights for policy 0, policy_version 38170 (0.0006) [2023-03-07 08:25:34,966][155452] Updated weights for policy 0, policy_version 38180 (0.0006) [2023-03-07 08:25:35,757][155452] Updated weights for policy 0, policy_version 38190 (0.0006) [2023-03-07 08:25:36,550][155452] Updated weights for policy 0, policy_version 38200 (0.0005) [2023-03-07 08:25:37,357][155452] Updated weights for policy 0, policy_version 38210 (0.0006) [2023-03-07 08:25:38,140][155452] Updated weights for policy 0, policy_version 38220 (0.0006) [2023-03-07 08:25:38,367][155126] Fps is (10 sec: 12902.6, 60 sec: 13004.8, 300 sec: 13034.3). Total num frames: 39139328. Throughput: 0: 12994.2. Samples: 39119888. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:25:38,367][155126] Avg episode reward: [(0, '2002.379')] [2023-03-07 08:25:38,926][155452] Updated weights for policy 0, policy_version 38230 (0.0007) [2023-03-07 08:25:39,709][155452] Updated weights for policy 0, policy_version 38240 (0.0006) [2023-03-07 08:25:40,507][155452] Updated weights for policy 0, policy_version 38250 (0.0006) [2023-03-07 08:25:41,275][155452] Updated weights for policy 0, policy_version 38260 (0.0006) [2023-03-07 08:25:42,069][155452] Updated weights for policy 0, policy_version 38270 (0.0006) [2023-03-07 08:25:42,873][155452] Updated weights for policy 0, policy_version 38280 (0.0006) [2023-03-07 08:25:43,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13004.8, 300 sec: 13037.8). Total num frames: 39204864. Throughput: 0: 13000.2. Samples: 39197942. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:25:43,367][155126] Avg episode reward: [(0, '1893.460')] [2023-03-07 08:25:43,641][155452] Updated weights for policy 0, policy_version 38290 (0.0005) [2023-03-07 08:25:44,418][155452] Updated weights for policy 0, policy_version 38300 (0.0007) [2023-03-07 08:25:45,212][155452] Updated weights for policy 0, policy_version 38310 (0.0007) [2023-03-07 08:25:45,993][155452] Updated weights for policy 0, policy_version 38320 (0.0006) [2023-03-07 08:25:46,774][155452] Updated weights for policy 0, policy_version 38330 (0.0006) [2023-03-07 08:25:47,538][155452] Updated weights for policy 0, policy_version 38340 (0.0006) [2023-03-07 08:25:48,342][155452] Updated weights for policy 0, policy_version 38350 (0.0006) [2023-03-07 08:25:48,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13004.8, 300 sec: 13037.8). Total num frames: 39270400. Throughput: 0: 13002.9. Samples: 39237043. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:25:48,368][155126] Avg episode reward: [(0, '1871.715')] [2023-03-07 08:25:49,110][155452] Updated weights for policy 0, policy_version 38360 (0.0006) [2023-03-07 08:25:49,902][155452] Updated weights for policy 0, policy_version 38370 (0.0006) [2023-03-07 08:25:50,674][155452] Updated weights for policy 0, policy_version 38380 (0.0006) [2023-03-07 08:25:51,459][155452] Updated weights for policy 0, policy_version 38390 (0.0006) [2023-03-07 08:25:52,255][155452] Updated weights for policy 0, policy_version 38400 (0.0006) [2023-03-07 08:25:53,034][155452] Updated weights for policy 0, policy_version 38410 (0.0006) [2023-03-07 08:25:53,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13021.9, 300 sec: 13037.8). Total num frames: 39335936. Throughput: 0: 13019.2. Samples: 39315694. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:25:53,367][155126] Avg episode reward: [(0, '1810.603')] [2023-03-07 08:25:53,824][155452] Updated weights for policy 0, policy_version 38420 (0.0006) [2023-03-07 08:25:54,612][155452] Updated weights for policy 0, policy_version 38430 (0.0006) [2023-03-07 08:25:55,377][155452] Updated weights for policy 0, policy_version 38440 (0.0006) [2023-03-07 08:25:56,180][155452] Updated weights for policy 0, policy_version 38450 (0.0007) [2023-03-07 08:25:56,977][155452] Updated weights for policy 0, policy_version 38460 (0.0006) [2023-03-07 08:25:57,757][155452] Updated weights for policy 0, policy_version 38470 (0.0007) [2023-03-07 08:25:58,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13004.8, 300 sec: 13034.3). Total num frames: 39400448. Throughput: 0: 13026.1. Samples: 39393820. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:25:58,378][155126] Avg episode reward: [(0, '1797.945')] [2023-03-07 08:25:58,537][155452] Updated weights for policy 0, policy_version 38480 (0.0005) [2023-03-07 08:25:59,311][155452] Updated weights for policy 0, policy_version 38490 (0.0006) [2023-03-07 08:26:00,120][155452] Updated weights for policy 0, policy_version 38500 (0.0006) [2023-03-07 08:26:00,903][155452] Updated weights for policy 0, policy_version 38510 (0.0007) [2023-03-07 08:26:01,666][155452] Updated weights for policy 0, policy_version 38520 (0.0006) [2023-03-07 08:26:02,461][155452] Updated weights for policy 0, policy_version 38530 (0.0006) [2023-03-07 08:26:03,225][155452] Updated weights for policy 0, policy_version 38540 (0.0006) [2023-03-07 08:26:03,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 39465984. Throughput: 0: 13029.0. Samples: 39432932. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:26:03,378][155126] Avg episode reward: [(0, '1604.552')] [2023-03-07 08:26:04,014][155452] Updated weights for policy 0, policy_version 38550 (0.0006) [2023-03-07 08:26:04,808][155452] Updated weights for policy 0, policy_version 38560 (0.0006) [2023-03-07 08:26:05,579][155452] Updated weights for policy 0, policy_version 38570 (0.0007) [2023-03-07 08:26:06,379][155452] Updated weights for policy 0, policy_version 38580 (0.0006) [2023-03-07 08:26:07,156][155452] Updated weights for policy 0, policy_version 38590 (0.0006) [2023-03-07 08:26:07,930][155452] Updated weights for policy 0, policy_version 38600 (0.0006) [2023-03-07 08:26:08,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13021.9, 300 sec: 13037.8). Total num frames: 39531520. Throughput: 0: 13036.9. Samples: 39511377. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:26:08,367][155126] Avg episode reward: [(0, '1675.747')] [2023-03-07 08:26:08,717][155452] Updated weights for policy 0, policy_version 38610 (0.0005) [2023-03-07 08:26:09,488][155452] Updated weights for policy 0, policy_version 38620 (0.0007) [2023-03-07 08:26:10,263][155452] Updated weights for policy 0, policy_version 38630 (0.0006) [2023-03-07 08:26:11,033][155452] Updated weights for policy 0, policy_version 38640 (0.0006) [2023-03-07 08:26:11,816][155452] Updated weights for policy 0, policy_version 38650 (0.0006) [2023-03-07 08:26:12,617][155452] Updated weights for policy 0, policy_version 38660 (0.0007) [2023-03-07 08:26:13,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 39597056. Throughput: 0: 13046.3. Samples: 39590132. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:26:13,367][155126] Avg episode reward: [(0, '1822.091')] [2023-03-07 08:26:13,389][155452] Updated weights for policy 0, policy_version 38670 (0.0006) [2023-03-07 08:26:14,168][155452] Updated weights for policy 0, policy_version 38680 (0.0006) [2023-03-07 08:26:14,953][155452] Updated weights for policy 0, policy_version 38690 (0.0006) [2023-03-07 08:26:15,740][155452] Updated weights for policy 0, policy_version 38700 (0.0006) [2023-03-07 08:26:16,527][155452] Updated weights for policy 0, policy_version 38710 (0.0006) [2023-03-07 08:26:17,305][155452] Updated weights for policy 0, policy_version 38720 (0.0006) [2023-03-07 08:26:18,099][155452] Updated weights for policy 0, policy_version 38730 (0.0006) [2023-03-07 08:26:18,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 39662592. Throughput: 0: 13053.9. Samples: 39629538. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:26:18,367][155126] Avg episode reward: [(0, '1827.225')] [2023-03-07 08:26:18,881][155452] Updated weights for policy 0, policy_version 38740 (0.0006) [2023-03-07 08:26:19,682][155452] Updated weights for policy 0, policy_version 38750 (0.0006) [2023-03-07 08:26:20,445][155452] Updated weights for policy 0, policy_version 38760 (0.0006) [2023-03-07 08:26:21,228][155452] Updated weights for policy 0, policy_version 38770 (0.0006) [2023-03-07 08:26:22,029][155452] Updated weights for policy 0, policy_version 38780 (0.0006) [2023-03-07 08:26:22,786][155452] Updated weights for policy 0, policy_version 38790 (0.0006) [2023-03-07 08:26:23,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13041.3). Total num frames: 39728128. Throughput: 0: 13063.9. Samples: 39707764. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:26:23,367][155126] Avg episode reward: [(0, '1626.427')] [2023-03-07 08:26:23,566][155452] Updated weights for policy 0, policy_version 38800 (0.0006) [2023-03-07 08:26:24,350][155452] Updated weights for policy 0, policy_version 38810 (0.0006) [2023-03-07 08:26:25,144][155452] Updated weights for policy 0, policy_version 38820 (0.0006) [2023-03-07 08:26:25,929][155452] Updated weights for policy 0, policy_version 38830 (0.0006) [2023-03-07 08:26:26,715][155452] Updated weights for policy 0, policy_version 38840 (0.0007) [2023-03-07 08:26:27,505][155452] Updated weights for policy 0, policy_version 38850 (0.0006) [2023-03-07 08:26:28,284][155452] Updated weights for policy 0, policy_version 38860 (0.0006) [2023-03-07 08:26:28,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13041.3). Total num frames: 39793664. Throughput: 0: 13070.4. Samples: 39786109. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:26:28,367][155126] Avg episode reward: [(0, '1995.810')] [2023-03-07 08:26:29,075][155452] Updated weights for policy 0, policy_version 38870 (0.0007) [2023-03-07 08:26:29,856][155452] Updated weights for policy 0, policy_version 38880 (0.0007) [2023-03-07 08:26:30,630][155452] Updated weights for policy 0, policy_version 38890 (0.0006) [2023-03-07 08:26:31,416][155452] Updated weights for policy 0, policy_version 38900 (0.0006) [2023-03-07 08:26:32,210][155452] Updated weights for policy 0, policy_version 38910 (0.0006) [2023-03-07 08:26:32,994][155452] Updated weights for policy 0, policy_version 38920 (0.0006) [2023-03-07 08:26:33,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 39858176. Throughput: 0: 13076.7. Samples: 39825493. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:26:33,367][155126] Avg episode reward: [(0, '1831.984')] [2023-03-07 08:26:33,791][155452] Updated weights for policy 0, policy_version 38930 (0.0006) [2023-03-07 08:26:34,578][155452] Updated weights for policy 0, policy_version 38940 (0.0006) [2023-03-07 08:26:35,370][155452] Updated weights for policy 0, policy_version 38950 (0.0006) [2023-03-07 08:26:36,157][155452] Updated weights for policy 0, policy_version 38960 (0.0006) [2023-03-07 08:26:36,942][155452] Updated weights for policy 0, policy_version 38970 (0.0007) [2023-03-07 08:26:37,719][155452] Updated weights for policy 0, policy_version 38980 (0.0006) [2023-03-07 08:26:38,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13073.0, 300 sec: 13037.8). Total num frames: 39923712. Throughput: 0: 13060.9. Samples: 39903433. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:26:38,368][155126] Avg episode reward: [(0, '1898.978')] [2023-03-07 08:26:38,505][155452] Updated weights for policy 0, policy_version 38990 (0.0006) [2023-03-07 08:26:39,299][155452] Updated weights for policy 0, policy_version 39000 (0.0006) [2023-03-07 08:26:40,082][155452] Updated weights for policy 0, policy_version 39010 (0.0006) [2023-03-07 08:26:40,859][155452] Updated weights for policy 0, policy_version 39020 (0.0006) [2023-03-07 08:26:41,653][155452] Updated weights for policy 0, policy_version 39030 (0.0006) [2023-03-07 08:26:42,440][155452] Updated weights for policy 0, policy_version 39040 (0.0006) [2023-03-07 08:26:43,197][155452] Updated weights for policy 0, policy_version 39050 (0.0006) [2023-03-07 08:26:43,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13041.3). Total num frames: 39989248. Throughput: 0: 13061.5. Samples: 39981589. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:26:43,367][155126] Avg episode reward: [(0, '2093.532')] [2023-03-07 08:26:43,997][155452] Updated weights for policy 0, policy_version 39060 (0.0007) [2023-03-07 08:26:44,791][155452] Updated weights for policy 0, policy_version 39070 (0.0007) [2023-03-07 08:26:45,569][155452] Updated weights for policy 0, policy_version 39080 (0.0005) [2023-03-07 08:26:46,351][155452] Updated weights for policy 0, policy_version 39090 (0.0007) [2023-03-07 08:26:47,154][155452] Updated weights for policy 0, policy_version 39100 (0.0007) [2023-03-07 08:26:47,928][155452] Updated weights for policy 0, policy_version 39110 (0.0006) [2023-03-07 08:26:48,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 40053760. Throughput: 0: 13063.3. Samples: 40020780. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:26:48,367][155126] Avg episode reward: [(0, '1996.198')] [2023-03-07 08:26:48,705][155452] Updated weights for policy 0, policy_version 39120 (0.0006) [2023-03-07 08:26:49,489][155452] Updated weights for policy 0, policy_version 39130 (0.0006) [2023-03-07 08:26:50,269][155452] Updated weights for policy 0, policy_version 39140 (0.0006) [2023-03-07 08:26:51,023][155452] Updated weights for policy 0, policy_version 39150 (0.0006) [2023-03-07 08:26:51,819][155452] Updated weights for policy 0, policy_version 39160 (0.0007) [2023-03-07 08:26:52,621][155452] Updated weights for policy 0, policy_version 39170 (0.0006) [2023-03-07 08:26:53,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13041.2). Total num frames: 40119296. Throughput: 0: 13068.1. Samples: 40099442. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:26:53,367][155126] Avg episode reward: [(0, '1888.908')] [2023-03-07 08:26:53,391][155452] Updated weights for policy 0, policy_version 39180 (0.0006) [2023-03-07 08:26:54,197][155452] Updated weights for policy 0, policy_version 39190 (0.0006) [2023-03-07 08:26:54,974][155452] Updated weights for policy 0, policy_version 39200 (0.0006) [2023-03-07 08:26:55,764][155452] Updated weights for policy 0, policy_version 39210 (0.0006) [2023-03-07 08:26:56,544][155452] Updated weights for policy 0, policy_version 39220 (0.0006) [2023-03-07 08:26:57,326][155452] Updated weights for policy 0, policy_version 39230 (0.0006) [2023-03-07 08:26:58,123][155452] Updated weights for policy 0, policy_version 39240 (0.0006) [2023-03-07 08:26:58,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13073.0, 300 sec: 13041.2). Total num frames: 40184832. Throughput: 0: 13055.1. Samples: 40177614. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:26:58,367][155126] Avg episode reward: [(0, '1730.498')] [2023-03-07 08:26:58,915][155452] Updated weights for policy 0, policy_version 39250 (0.0006) [2023-03-07 08:26:59,694][155452] Updated weights for policy 0, policy_version 39260 (0.0006) [2023-03-07 08:27:00,469][155452] Updated weights for policy 0, policy_version 39270 (0.0006) [2023-03-07 08:27:01,266][155452] Updated weights for policy 0, policy_version 39280 (0.0006) [2023-03-07 08:27:02,043][155452] Updated weights for policy 0, policy_version 39290 (0.0006) [2023-03-07 08:27:02,830][155452] Updated weights for policy 0, policy_version 39300 (0.0006) [2023-03-07 08:27:03,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13041.2). Total num frames: 40249344. Throughput: 0: 13047.6. Samples: 40216678. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:27:03,368][155126] Avg episode reward: [(0, '2021.895')] [2023-03-07 08:27:03,620][155452] Updated weights for policy 0, policy_version 39310 (0.0007) [2023-03-07 08:27:04,406][155452] Updated weights for policy 0, policy_version 39320 (0.0006) [2023-03-07 08:27:05,186][155452] Updated weights for policy 0, policy_version 39330 (0.0006) [2023-03-07 08:27:05,961][155452] Updated weights for policy 0, policy_version 39340 (0.0007) [2023-03-07 08:27:06,764][155452] Updated weights for policy 0, policy_version 39350 (0.0005) [2023-03-07 08:27:07,556][155452] Updated weights for policy 0, policy_version 39360 (0.0007) [2023-03-07 08:27:08,327][155452] Updated weights for policy 0, policy_version 39370 (0.0006) [2023-03-07 08:27:08,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13056.0, 300 sec: 13041.3). Total num frames: 40314880. Throughput: 0: 13048.4. Samples: 40294944. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:27:08,367][155126] Avg episode reward: [(0, '1844.618')] [2023-03-07 08:27:09,139][155452] Updated weights for policy 0, policy_version 39380 (0.0006) [2023-03-07 08:27:09,924][155452] Updated weights for policy 0, policy_version 39390 (0.0006) [2023-03-07 08:27:10,698][155452] Updated weights for policy 0, policy_version 39400 (0.0006) [2023-03-07 08:27:11,492][155452] Updated weights for policy 0, policy_version 39410 (0.0006) [2023-03-07 08:27:12,275][155452] Updated weights for policy 0, policy_version 39420 (0.0006) [2023-03-07 08:27:13,055][155452] Updated weights for policy 0, policy_version 39430 (0.0006) [2023-03-07 08:27:13,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13041.3). Total num frames: 40380416. Throughput: 0: 13039.8. Samples: 40372900. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:27:13,368][155126] Avg episode reward: [(0, '1838.926')] [2023-03-07 08:27:13,847][155452] Updated weights for policy 0, policy_version 39440 (0.0006) [2023-03-07 08:27:14,627][155452] Updated weights for policy 0, policy_version 39450 (0.0006) [2023-03-07 08:27:15,413][155452] Updated weights for policy 0, policy_version 39460 (0.0006) [2023-03-07 08:27:16,206][155452] Updated weights for policy 0, policy_version 39470 (0.0006) [2023-03-07 08:27:16,993][155452] Updated weights for policy 0, policy_version 39480 (0.0006) [2023-03-07 08:27:17,785][155452] Updated weights for policy 0, policy_version 39490 (0.0007) [2023-03-07 08:27:18,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13041.2). Total num frames: 40444928. Throughput: 0: 13033.0. Samples: 40411978. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:27:18,368][155126] Avg episode reward: [(0, '1935.129')] [2023-03-07 08:27:18,597][155452] Updated weights for policy 0, policy_version 39500 (0.0006) [2023-03-07 08:27:19,365][155452] Updated weights for policy 0, policy_version 39510 (0.0006) [2023-03-07 08:27:20,158][155452] Updated weights for policy 0, policy_version 39520 (0.0006) [2023-03-07 08:27:20,959][155452] Updated weights for policy 0, policy_version 39530 (0.0006) [2023-03-07 08:27:21,734][155452] Updated weights for policy 0, policy_version 39540 (0.0007) [2023-03-07 08:27:22,506][155452] Updated weights for policy 0, policy_version 39550 (0.0005) [2023-03-07 08:27:23,310][155452] Updated weights for policy 0, policy_version 39560 (0.0006) [2023-03-07 08:27:23,367][155126] Fps is (10 sec: 12902.5, 60 sec: 13021.9, 300 sec: 13037.8). Total num frames: 40509440. Throughput: 0: 13030.3. Samples: 40489797. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:27:23,367][155126] Avg episode reward: [(0, '1840.434')] [2023-03-07 08:27:24,082][155452] Updated weights for policy 0, policy_version 39570 (0.0006) [2023-03-07 08:27:24,861][155452] Updated weights for policy 0, policy_version 39580 (0.0007) [2023-03-07 08:27:25,642][155452] Updated weights for policy 0, policy_version 39590 (0.0007) [2023-03-07 08:27:26,428][155452] Updated weights for policy 0, policy_version 39600 (0.0006) [2023-03-07 08:27:27,203][155452] Updated weights for policy 0, policy_version 39610 (0.0006) [2023-03-07 08:27:27,988][155452] Updated weights for policy 0, policy_version 39620 (0.0008) [2023-03-07 08:27:28,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13037.8). Total num frames: 40574976. Throughput: 0: 13036.2. Samples: 40568219. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:27:28,367][155126] Avg episode reward: [(0, '1843.442')] [2023-03-07 08:27:28,371][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000039624_40574976.pth... [2023-03-07 08:27:28,402][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000036568_37445632.pth [2023-03-07 08:27:28,773][155452] Updated weights for policy 0, policy_version 39630 (0.0006) [2023-03-07 08:27:29,548][155452] Updated weights for policy 0, policy_version 39640 (0.0006) [2023-03-07 08:27:30,351][155452] Updated weights for policy 0, policy_version 39650 (0.0006) [2023-03-07 08:27:31,126][155452] Updated weights for policy 0, policy_version 39660 (0.0006) [2023-03-07 08:27:31,922][155452] Updated weights for policy 0, policy_version 39670 (0.0006) [2023-03-07 08:27:32,714][155452] Updated weights for policy 0, policy_version 39680 (0.0006) [2023-03-07 08:27:33,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13041.2). Total num frames: 40640512. Throughput: 0: 13034.9. 
Samples: 40607348. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:27:33,367][155126] Avg episode reward: [(0, '1970.547')] [2023-03-07 08:27:33,512][155452] Updated weights for policy 0, policy_version 39690 (0.0006) [2023-03-07 08:27:34,299][155452] Updated weights for policy 0, policy_version 39700 (0.0007) [2023-03-07 08:27:35,078][155452] Updated weights for policy 0, policy_version 39710 (0.0006) [2023-03-07 08:27:35,869][155452] Updated weights for policy 0, policy_version 39720 (0.0007) [2023-03-07 08:27:36,650][155452] Updated weights for policy 0, policy_version 39730 (0.0007) [2023-03-07 08:27:37,429][155452] Updated weights for policy 0, policy_version 39740 (0.0007) [2023-03-07 08:27:38,211][155452] Updated weights for policy 0, policy_version 39750 (0.0006) [2023-03-07 08:27:38,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13041.2). Total num frames: 40706048. Throughput: 0: 13023.3. Samples: 40685493. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:27:38,378][155126] Avg episode reward: [(0, '2021.227')] [2023-03-07 08:27:38,972][155452] Updated weights for policy 0, policy_version 39760 (0.0006) [2023-03-07 08:27:39,759][155452] Updated weights for policy 0, policy_version 39770 (0.0005) [2023-03-07 08:27:40,529][155452] Updated weights for policy 0, policy_version 39780 (0.0006) [2023-03-07 08:27:41,330][155452] Updated weights for policy 0, policy_version 39790 (0.0006) [2023-03-07 08:27:42,129][155452] Updated weights for policy 0, policy_version 39800 (0.0005) [2023-03-07 08:27:42,920][155452] Updated weights for policy 0, policy_version 39810 (0.0007) [2023-03-07 08:27:43,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.8, 300 sec: 13037.8). Total num frames: 40770560. Throughput: 0: 13025.7. Samples: 40763769. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:27:43,368][155126] Avg episode reward: [(0, '1884.131')] [2023-03-07 08:27:43,714][155452] Updated weights for policy 0, policy_version 39820 (0.0006) [2023-03-07 08:27:44,504][155452] Updated weights for policy 0, policy_version 39830 (0.0007) [2023-03-07 08:27:45,300][155452] Updated weights for policy 0, policy_version 39840 (0.0006) [2023-03-07 08:27:46,089][155452] Updated weights for policy 0, policy_version 39850 (0.0005) [2023-03-07 08:27:46,851][155452] Updated weights for policy 0, policy_version 39860 (0.0006) [2023-03-07 08:27:47,660][155452] Updated weights for policy 0, policy_version 39870 (0.0007) [2023-03-07 08:27:48,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13039.0, 300 sec: 13037.8). Total num frames: 40836096. Throughput: 0: 13020.8. Samples: 40802613. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:27:48,367][155126] Avg episode reward: [(0, '1812.312')] [2023-03-07 08:27:48,449][155452] Updated weights for policy 0, policy_version 39880 (0.0006) [2023-03-07 08:27:49,230][155452] Updated weights for policy 0, policy_version 39890 (0.0006) [2023-03-07 08:27:50,006][155452] Updated weights for policy 0, policy_version 39900 (0.0006) [2023-03-07 08:27:50,813][155452] Updated weights for policy 0, policy_version 39910 (0.0007) [2023-03-07 08:27:51,571][155452] Updated weights for policy 0, policy_version 39920 (0.0006) [2023-03-07 08:27:52,374][155452] Updated weights for policy 0, policy_version 39930 (0.0006) [2023-03-07 08:27:53,175][155452] Updated weights for policy 0, policy_version 39940 (0.0006) [2023-03-07 08:27:53,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.8, 300 sec: 13037.8). Total num frames: 40900608. Throughput: 0: 13021.8. Samples: 40880927. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:27:53,367][155126] Avg episode reward: [(0, '1987.839')] [2023-03-07 08:27:53,944][155452] Updated weights for policy 0, policy_version 39950 (0.0006) [2023-03-07 08:27:54,739][155452] Updated weights for policy 0, policy_version 39960 (0.0007) [2023-03-07 08:27:55,532][155452] Updated weights for policy 0, policy_version 39970 (0.0006) [2023-03-07 08:27:56,313][155452] Updated weights for policy 0, policy_version 39980 (0.0006) [2023-03-07 08:27:57,088][155452] Updated weights for policy 0, policy_version 39990 (0.0006) [2023-03-07 08:27:57,881][155452] Updated weights for policy 0, policy_version 40000 (0.0006) [2023-03-07 08:27:58,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13037.8). Total num frames: 40966144. Throughput: 0: 13023.4. Samples: 40958951. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:27:58,367][155126] Avg episode reward: [(0, '1830.584')] [2023-03-07 08:27:58,673][155452] Updated weights for policy 0, policy_version 40010 (0.0006) [2023-03-07 08:27:59,447][155452] Updated weights for policy 0, policy_version 40020 (0.0006) [2023-03-07 08:28:00,229][155452] Updated weights for policy 0, policy_version 40030 (0.0007) [2023-03-07 08:28:01,010][155452] Updated weights for policy 0, policy_version 40040 (0.0006) [2023-03-07 08:28:01,806][155452] Updated weights for policy 0, policy_version 40050 (0.0006) [2023-03-07 08:28:02,593][155452] Updated weights for policy 0, policy_version 40060 (0.0005) [2023-03-07 08:28:03,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13037.8). Total num frames: 41030656. Throughput: 0: 13026.2. Samples: 40998158. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:28:03,367][155126] Avg episode reward: [(0, '1756.323')] [2023-03-07 08:28:03,397][155452] Updated weights for policy 0, policy_version 40070 (0.0006) [2023-03-07 08:28:04,176][155452] Updated weights for policy 0, policy_version 40080 (0.0006) [2023-03-07 08:28:04,961][155452] Updated weights for policy 0, policy_version 40090 (0.0007) [2023-03-07 08:28:05,736][155452] Updated weights for policy 0, policy_version 40100 (0.0006) [2023-03-07 08:28:06,518][155452] Updated weights for policy 0, policy_version 40110 (0.0006) [2023-03-07 08:28:07,309][155452] Updated weights for policy 0, policy_version 40120 (0.0006) [2023-03-07 08:28:08,071][155452] Updated weights for policy 0, policy_version 40130 (0.0006) [2023-03-07 08:28:08,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13041.2). Total num frames: 41096192. Throughput: 0: 13028.0. Samples: 41076057. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:28:08,367][155126] Avg episode reward: [(0, '2017.749')] [2023-03-07 08:28:08,859][155452] Updated weights for policy 0, policy_version 40140 (0.0006) [2023-03-07 08:28:09,646][155452] Updated weights for policy 0, policy_version 40150 (0.0007) [2023-03-07 08:28:10,420][155452] Updated weights for policy 0, policy_version 40160 (0.0006) [2023-03-07 08:28:11,193][155452] Updated weights for policy 0, policy_version 40170 (0.0007) [2023-03-07 08:28:11,989][155452] Updated weights for policy 0, policy_version 40180 (0.0006) [2023-03-07 08:28:12,777][155452] Updated weights for policy 0, policy_version 40190 (0.0007) [2023-03-07 08:28:13,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13021.9, 300 sec: 13041.3). Total num frames: 41161728. Throughput: 0: 13033.7. Samples: 41154734. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:28:13,367][155126] Avg episode reward: [(0, '1892.752')] [2023-03-07 08:28:13,549][155452] Updated weights for policy 0, policy_version 40200 (0.0006) [2023-03-07 08:28:14,356][155452] Updated weights for policy 0, policy_version 40210 (0.0006) [2023-03-07 08:28:15,133][155452] Updated weights for policy 0, policy_version 40220 (0.0006) [2023-03-07 08:28:15,902][155452] Updated weights for policy 0, policy_version 40230 (0.0006) [2023-03-07 08:28:16,697][155452] Updated weights for policy 0, policy_version 40240 (0.0006) [2023-03-07 08:28:17,479][155452] Updated weights for policy 0, policy_version 40250 (0.0006) [2023-03-07 08:28:18,263][155452] Updated weights for policy 0, policy_version 40260 (0.0006) [2023-03-07 08:28:18,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13039.0, 300 sec: 13041.2). Total num frames: 41227264. Throughput: 0: 13035.1. Samples: 41193928. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:28:18,367][155126] Avg episode reward: [(0, '1792.731')] [2023-03-07 08:28:19,044][155452] Updated weights for policy 0, policy_version 40270 (0.0006) [2023-03-07 08:28:19,840][155452] Updated weights for policy 0, policy_version 40280 (0.0007) [2023-03-07 08:28:20,615][155452] Updated weights for policy 0, policy_version 40290 (0.0006) [2023-03-07 08:28:21,405][155452] Updated weights for policy 0, policy_version 40300 (0.0006) [2023-03-07 08:28:22,197][155452] Updated weights for policy 0, policy_version 40310 (0.0006) [2023-03-07 08:28:22,981][155452] Updated weights for policy 0, policy_version 40320 (0.0007) [2023-03-07 08:28:23,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13041.3). Total num frames: 41292800. Throughput: 0: 13043.1. Samples: 41272432. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:28:23,367][155126] Avg episode reward: [(0, '2032.451')] [2023-03-07 08:28:23,759][155452] Updated weights for policy 0, policy_version 40330 (0.0007) [2023-03-07 08:28:24,534][155452] Updated weights for policy 0, policy_version 40340 (0.0006) [2023-03-07 08:28:25,330][155452] Updated weights for policy 0, policy_version 40350 (0.0006) [2023-03-07 08:28:26,119][155452] Updated weights for policy 0, policy_version 40360 (0.0006) [2023-03-07 08:28:26,917][155452] Updated weights for policy 0, policy_version 40370 (0.0007) [2023-03-07 08:28:27,698][155452] Updated weights for policy 0, policy_version 40380 (0.0006) [2023-03-07 08:28:28,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13041.2). Total num frames: 41357312. Throughput: 0: 13037.9. Samples: 41350471. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:28:28,367][155126] Avg episode reward: [(0, '1885.755')] [2023-03-07 08:28:28,471][155452] Updated weights for policy 0, policy_version 40390 (0.0006) [2023-03-07 08:28:29,258][155452] Updated weights for policy 0, policy_version 40400 (0.0006) [2023-03-07 08:28:30,031][155452] Updated weights for policy 0, policy_version 40410 (0.0006) [2023-03-07 08:28:30,806][155452] Updated weights for policy 0, policy_version 40420 (0.0006) [2023-03-07 08:28:31,605][155452] Updated weights for policy 0, policy_version 40430 (0.0006) [2023-03-07 08:28:32,382][155452] Updated weights for policy 0, policy_version 40440 (0.0007) [2023-03-07 08:28:33,170][155452] Updated weights for policy 0, policy_version 40450 (0.0005) [2023-03-07 08:28:33,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13041.2). Total num frames: 41422848. Throughput: 0: 13048.7. Samples: 41389804. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:28:33,367][155126] Avg episode reward: [(0, '1812.692')] [2023-03-07 08:28:33,936][155452] Updated weights for policy 0, policy_version 40460 (0.0006) [2023-03-07 08:28:34,729][155452] Updated weights for policy 0, policy_version 40470 (0.0005) [2023-03-07 08:28:35,503][155452] Updated weights for policy 0, policy_version 40480 (0.0005) [2023-03-07 08:28:36,283][155452] Updated weights for policy 0, policy_version 40490 (0.0006) [2023-03-07 08:28:37,082][155452] Updated weights for policy 0, policy_version 40500 (0.0006) [2023-03-07 08:28:37,874][155452] Updated weights for policy 0, policy_version 40510 (0.0006) [2023-03-07 08:28:38,367][155126] Fps is (10 sec: 13107.0, 60 sec: 13038.9, 300 sec: 13041.2). Total num frames: 41488384. Throughput: 0: 13052.6. Samples: 41468294. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:28:38,368][155126] Avg episode reward: [(0, '1943.097')] [2023-03-07 08:28:38,654][155452] Updated weights for policy 0, policy_version 40520 (0.0006) [2023-03-07 08:28:39,434][155452] Updated weights for policy 0, policy_version 40530 (0.0006) [2023-03-07 08:28:40,225][155452] Updated weights for policy 0, policy_version 40540 (0.0006) [2023-03-07 08:28:40,998][155452] Updated weights for policy 0, policy_version 40550 (0.0007) [2023-03-07 08:28:41,799][155452] Updated weights for policy 0, policy_version 40560 (0.0006) [2023-03-07 08:28:42,578][155452] Updated weights for policy 0, policy_version 40570 (0.0006) [2023-03-07 08:28:43,359][155452] Updated weights for policy 0, policy_version 40580 (0.0006) [2023-03-07 08:28:43,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13044.7). Total num frames: 41553920. Throughput: 0: 13059.6. Samples: 41546634. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:28:43,367][155126] Avg episode reward: [(0, '1917.879')] [2023-03-07 08:28:44,134][155452] Updated weights for policy 0, policy_version 40590 (0.0006) [2023-03-07 08:28:44,914][155452] Updated weights for policy 0, policy_version 40600 (0.0007) [2023-03-07 08:28:45,704][155452] Updated weights for policy 0, policy_version 40610 (0.0006) [2023-03-07 08:28:46,481][155452] Updated weights for policy 0, policy_version 40620 (0.0006) [2023-03-07 08:28:47,260][155452] Updated weights for policy 0, policy_version 40630 (0.0006) [2023-03-07 08:28:48,060][155452] Updated weights for policy 0, policy_version 40640 (0.0006) [2023-03-07 08:28:48,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13038.9, 300 sec: 13041.2). Total num frames: 41618432. Throughput: 0: 13062.2. Samples: 41585955. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:28:48,367][155126] Avg episode reward: [(0, '1876.883')] [2023-03-07 08:28:48,838][155452] Updated weights for policy 0, policy_version 40650 (0.0006) [2023-03-07 08:28:49,636][155452] Updated weights for policy 0, policy_version 40660 (0.0006) [2023-03-07 08:28:50,423][155452] Updated weights for policy 0, policy_version 40670 (0.0006) [2023-03-07 08:28:51,185][155452] Updated weights for policy 0, policy_version 40680 (0.0006) [2023-03-07 08:28:51,975][155452] Updated weights for policy 0, policy_version 40690 (0.0007) [2023-03-07 08:28:52,747][155452] Updated weights for policy 0, policy_version 40700 (0.0006) [2023-03-07 08:28:53,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13041.3). Total num frames: 41683968. Throughput: 0: 13071.6. Samples: 41664280. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:28:53,367][155126] Avg episode reward: [(0, '2030.436')] [2023-03-07 08:28:53,530][155452] Updated weights for policy 0, policy_version 40710 (0.0006) [2023-03-07 08:28:54,305][155452] Updated weights for policy 0, policy_version 40720 (0.0006) [2023-03-07 08:28:55,083][155452] Updated weights for policy 0, policy_version 40730 (0.0007) [2023-03-07 08:28:55,873][155452] Updated weights for policy 0, policy_version 40740 (0.0006) [2023-03-07 08:28:56,681][155452] Updated weights for policy 0, policy_version 40750 (0.0006) [2023-03-07 08:28:57,452][155452] Updated weights for policy 0, policy_version 40760 (0.0006) [2023-03-07 08:28:58,234][155452] Updated weights for policy 0, policy_version 40770 (0.0005) [2023-03-07 08:28:58,367][155126] Fps is (10 sec: 13107.0, 60 sec: 13056.0, 300 sec: 13044.7). Total num frames: 41749504. Throughput: 0: 13065.8. Samples: 41742694. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:28:58,368][155126] Avg episode reward: [(0, '1921.949')] [2023-03-07 08:28:59,024][155452] Updated weights for policy 0, policy_version 40780 (0.0007) [2023-03-07 08:28:59,808][155452] Updated weights for policy 0, policy_version 40790 (0.0006) [2023-03-07 08:29:00,569][155452] Updated weights for policy 0, policy_version 40800 (0.0006) [2023-03-07 08:29:01,374][155452] Updated weights for policy 0, policy_version 40810 (0.0007) [2023-03-07 08:29:02,139][155452] Updated weights for policy 0, policy_version 40820 (0.0006) [2023-03-07 08:29:02,944][155452] Updated weights for policy 0, policy_version 40830 (0.0006) [2023-03-07 08:29:03,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13044.7). Total num frames: 41815040. Throughput: 0: 13067.7. Samples: 41781974. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:29:03,367][155126] Avg episode reward: [(0, '1927.720')] [2023-03-07 08:29:03,720][155452] Updated weights for policy 0, policy_version 40840 (0.0006) [2023-03-07 08:29:04,498][155452] Updated weights for policy 0, policy_version 40850 (0.0007) [2023-03-07 08:29:05,290][155452] Updated weights for policy 0, policy_version 40860 (0.0006) [2023-03-07 08:29:06,087][155452] Updated weights for policy 0, policy_version 40870 (0.0005) [2023-03-07 08:29:06,859][155452] Updated weights for policy 0, policy_version 40880 (0.0007) [2023-03-07 08:29:07,673][155452] Updated weights for policy 0, policy_version 40890 (0.0007) [2023-03-07 08:29:08,367][155126] Fps is (10 sec: 13107.4, 60 sec: 13073.1, 300 sec: 13044.7). Total num frames: 41880576. Throughput: 0: 13063.0. Samples: 41860264. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:29:08,367][155126] Avg episode reward: [(0, '1820.579')] [2023-03-07 08:29:08,434][155452] Updated weights for policy 0, policy_version 40900 (0.0007) [2023-03-07 08:29:09,209][155452] Updated weights for policy 0, policy_version 40910 (0.0006) [2023-03-07 08:29:09,996][155452] Updated weights for policy 0, policy_version 40920 (0.0007) [2023-03-07 08:29:10,770][155452] Updated weights for policy 0, policy_version 40930 (0.0006) [2023-03-07 08:29:11,561][155452] Updated weights for policy 0, policy_version 40940 (0.0007) [2023-03-07 08:29:12,350][155452] Updated weights for policy 0, policy_version 40950 (0.0006) [2023-03-07 08:29:13,133][155452] Updated weights for policy 0, policy_version 40960 (0.0006) [2023-03-07 08:29:13,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13044.7). Total num frames: 41946112. Throughput: 0: 13066.6. Samples: 41938467. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:29:13,367][155126] Avg episode reward: [(0, '1847.373')] [2023-03-07 08:29:13,925][155452] Updated weights for policy 0, policy_version 40970 (0.0006) [2023-03-07 08:29:14,703][155452] Updated weights for policy 0, policy_version 40980 (0.0007) [2023-03-07 08:29:15,500][155452] Updated weights for policy 0, policy_version 40990 (0.0006) [2023-03-07 08:29:16,273][155452] Updated weights for policy 0, policy_version 41000 (0.0007) [2023-03-07 08:29:17,047][155452] Updated weights for policy 0, policy_version 41010 (0.0006) [2023-03-07 08:29:17,831][155452] Updated weights for policy 0, policy_version 41020 (0.0005) [2023-03-07 08:29:18,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13056.0, 300 sec: 13041.2). Total num frames: 42010624. Throughput: 0: 13065.9. Samples: 41977772. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:29:18,367][155126] Avg episode reward: [(0, '1677.548')] [2023-03-07 08:29:18,625][155452] Updated weights for policy 0, policy_version 41030 (0.0006) [2023-03-07 08:29:19,408][155452] Updated weights for policy 0, policy_version 41040 (0.0005) [2023-03-07 08:29:20,184][155452] Updated weights for policy 0, policy_version 41050 (0.0006) [2023-03-07 08:29:20,962][155452] Updated weights for policy 0, policy_version 41060 (0.0006) [2023-03-07 08:29:21,736][155452] Updated weights for policy 0, policy_version 41070 (0.0006) [2023-03-07 08:29:22,513][155452] Updated weights for policy 0, policy_version 41080 (0.0007) [2023-03-07 08:29:23,300][155452] Updated weights for policy 0, policy_version 41090 (0.0007) [2023-03-07 08:29:23,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13041.3). Total num frames: 42076160. Throughput: 0: 13069.8. Samples: 42056431. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 08:29:23,367][155126] Avg episode reward: [(0, '2039.138')] [2023-03-07 08:29:24,091][155452] Updated weights for policy 0, policy_version 41100 (0.0006) [2023-03-07 08:29:24,866][155452] Updated weights for policy 0, policy_version 41110 (0.0006) [2023-03-07 08:29:25,653][155452] Updated weights for policy 0, policy_version 41120 (0.0006) [2023-03-07 08:29:26,447][155452] Updated weights for policy 0, policy_version 41130 (0.0005) [2023-03-07 08:29:27,231][155452] Updated weights for policy 0, policy_version 41140 (0.0007) [2023-03-07 08:29:28,018][155452] Updated weights for policy 0, policy_version 41150 (0.0005) [2023-03-07 08:29:28,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13073.0, 300 sec: 13041.2). Total num frames: 42141696. Throughput: 0: 13068.0. Samples: 42134697. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 08:29:28,367][155126] Avg episode reward: [(0, '1953.333')] [2023-03-07 08:29:28,371][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000041154_42141696.pth... [2023-03-07 08:29:28,402][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000038096_39010304.pth [2023-03-07 08:29:28,792][155452] Updated weights for policy 0, policy_version 41160 (0.0007) [2023-03-07 08:29:29,582][155452] Updated weights for policy 0, policy_version 41170 (0.0006) [2023-03-07 08:29:30,363][155452] Updated weights for policy 0, policy_version 41180 (0.0006) [2023-03-07 08:29:31,133][155452] Updated weights for policy 0, policy_version 41190 (0.0006) [2023-03-07 08:29:31,926][155452] Updated weights for policy 0, policy_version 41200 (0.0006) [2023-03-07 08:29:32,725][155452] Updated weights for policy 0, policy_version 41210 (0.0006) [2023-03-07 08:29:33,367][155126] Fps is (10 sec: 13106.9, 60 sec: 13073.0, 300 sec: 13044.7). Total num frames: 42207232. Throughput: 0: 13067.1. 
Samples: 42173978. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 08:29:33,368][155126] Avg episode reward: [(0, '1894.430')] [2023-03-07 08:29:33,501][155452] Updated weights for policy 0, policy_version 41220 (0.0006) [2023-03-07 08:29:34,269][155452] Updated weights for policy 0, policy_version 41230 (0.0007) [2023-03-07 08:29:35,057][155452] Updated weights for policy 0, policy_version 41240 (0.0007) [2023-03-07 08:29:35,853][155452] Updated weights for policy 0, policy_version 41250 (0.0006) [2023-03-07 08:29:36,636][155452] Updated weights for policy 0, policy_version 41260 (0.0006) [2023-03-07 08:29:37,414][155452] Updated weights for policy 0, policy_version 41270 (0.0006) [2023-03-07 08:29:38,198][155452] Updated weights for policy 0, policy_version 41280 (0.0006) [2023-03-07 08:29:38,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13073.1, 300 sec: 13044.7). Total num frames: 42272768. Throughput: 0: 13067.0. Samples: 42252296. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 08:29:38,368][155126] Avg episode reward: [(0, '1951.974')] [2023-03-07 08:29:38,990][155452] Updated weights for policy 0, policy_version 41290 (0.0006) [2023-03-07 08:29:39,772][155452] Updated weights for policy 0, policy_version 41300 (0.0006) [2023-03-07 08:29:40,559][155452] Updated weights for policy 0, policy_version 41310 (0.0006) [2023-03-07 08:29:41,355][155452] Updated weights for policy 0, policy_version 41320 (0.0006) [2023-03-07 08:29:42,144][155452] Updated weights for policy 0, policy_version 41330 (0.0006) [2023-03-07 08:29:42,925][155452] Updated weights for policy 0, policy_version 41340 (0.0005) [2023-03-07 08:29:43,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13041.3). Total num frames: 42337280. Throughput: 0: 13059.4. Samples: 42330368. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 08:29:43,367][155126] Avg episode reward: [(0, '1894.402')] [2023-03-07 08:29:43,717][155452] Updated weights for policy 0, policy_version 41350 (0.0006) [2023-03-07 08:29:44,494][155452] Updated weights for policy 0, policy_version 41360 (0.0006) [2023-03-07 08:29:45,268][155452] Updated weights for policy 0, policy_version 41370 (0.0006) [2023-03-07 08:29:46,061][155452] Updated weights for policy 0, policy_version 41380 (0.0006) [2023-03-07 08:29:46,856][155452] Updated weights for policy 0, policy_version 41390 (0.0006) [2023-03-07 08:29:47,642][155452] Updated weights for policy 0, policy_version 41400 (0.0006) [2023-03-07 08:29:48,367][155126] Fps is (10 sec: 13005.1, 60 sec: 13073.1, 300 sec: 13044.7). Total num frames: 42402816. Throughput: 0: 13058.3. Samples: 42369598. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:29:48,367][155126] Avg episode reward: [(0, '1832.686')] [2023-03-07 08:29:48,427][155452] Updated weights for policy 0, policy_version 41410 (0.0006) [2023-03-07 08:29:49,224][155452] Updated weights for policy 0, policy_version 41420 (0.0006) [2023-03-07 08:29:50,017][155452] Updated weights for policy 0, policy_version 41430 (0.0006) [2023-03-07 08:29:50,796][155452] Updated weights for policy 0, policy_version 41440 (0.0006) [2023-03-07 08:29:51,564][155452] Updated weights for policy 0, policy_version 41450 (0.0005) [2023-03-07 08:29:52,393][155452] Updated weights for policy 0, policy_version 41460 (0.0007) [2023-03-07 08:29:53,175][155452] Updated weights for policy 0, policy_version 41470 (0.0006) [2023-03-07 08:29:53,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13041.2). Total num frames: 42467328. Throughput: 0: 13053.8. Samples: 42447688. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:29:53,367][155126] Avg episode reward: [(0, '1776.338')] [2023-03-07 08:29:53,954][155452] Updated weights for policy 0, policy_version 41480 (0.0006) [2023-03-07 08:29:54,744][155452] Updated weights for policy 0, policy_version 41490 (0.0006) [2023-03-07 08:29:55,529][155452] Updated weights for policy 0, policy_version 41500 (0.0006) [2023-03-07 08:29:56,316][155452] Updated weights for policy 0, policy_version 41510 (0.0005) [2023-03-07 08:29:57,111][155452] Updated weights for policy 0, policy_version 41520 (0.0006) [2023-03-07 08:29:57,886][155452] Updated weights for policy 0, policy_version 41530 (0.0006) [2023-03-07 08:29:58,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13044.7). Total num frames: 42532864. Throughput: 0: 13047.2. Samples: 42525590. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:29:58,367][155126] Avg episode reward: [(0, '1652.949')] [2023-03-07 08:29:58,680][155452] Updated weights for policy 0, policy_version 41540 (0.0008) [2023-03-07 08:29:59,455][155452] Updated weights for policy 0, policy_version 41550 (0.0006) [2023-03-07 08:30:00,238][155452] Updated weights for policy 0, policy_version 41560 (0.0006) [2023-03-07 08:30:01,033][155452] Updated weights for policy 0, policy_version 41570 (0.0006) [2023-03-07 08:30:01,792][155452] Updated weights for policy 0, policy_version 41580 (0.0006) [2023-03-07 08:30:02,584][155452] Updated weights for policy 0, policy_version 41590 (0.0006) [2023-03-07 08:30:03,352][155452] Updated weights for policy 0, policy_version 41600 (0.0006) [2023-03-07 08:30:03,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13044.7). Total num frames: 42598400. Throughput: 0: 13042.7. Samples: 42564696. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:30:03,368][155126] Avg episode reward: [(0, '1637.696')] [2023-03-07 08:30:04,122][155452] Updated weights for policy 0, policy_version 41610 (0.0007) [2023-03-07 08:30:04,918][155452] Updated weights for policy 0, policy_version 41620 (0.0007) [2023-03-07 08:30:05,703][155452] Updated weights for policy 0, policy_version 41630 (0.0007) [2023-03-07 08:30:06,478][155452] Updated weights for policy 0, policy_version 41640 (0.0006) [2023-03-07 08:30:07,269][155452] Updated weights for policy 0, policy_version 41650 (0.0007) [2023-03-07 08:30:08,045][155452] Updated weights for policy 0, policy_version 41660 (0.0007) [2023-03-07 08:30:08,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13044.7). Total num frames: 42662912. Throughput: 0: 13043.4. Samples: 42643386. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:30:08,367][155126] Avg episode reward: [(0, '1587.834')] [2023-03-07 08:30:08,862][155452] Updated weights for policy 0, policy_version 41670 (0.0006) [2023-03-07 08:30:09,646][155452] Updated weights for policy 0, policy_version 41680 (0.0007) [2023-03-07 08:30:10,429][155452] Updated weights for policy 0, policy_version 41690 (0.0006) [2023-03-07 08:30:11,235][155452] Updated weights for policy 0, policy_version 41700 (0.0006) [2023-03-07 08:30:12,025][155452] Updated weights for policy 0, policy_version 41710 (0.0007) [2023-03-07 08:30:12,804][155452] Updated weights for policy 0, policy_version 41720 (0.0006) [2023-03-07 08:30:13,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13048.2). Total num frames: 42728448. Throughput: 0: 13030.4. Samples: 42721064. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:30:13,367][155126] Avg episode reward: [(0, '1551.234')] [2023-03-07 08:30:13,607][155452] Updated weights for policy 0, policy_version 41730 (0.0007) [2023-03-07 08:30:14,395][155452] Updated weights for policy 0, policy_version 41740 (0.0007) [2023-03-07 08:30:15,164][155452] Updated weights for policy 0, policy_version 41750 (0.0005) [2023-03-07 08:30:15,961][155452] Updated weights for policy 0, policy_version 41760 (0.0006) [2023-03-07 08:30:16,724][155452] Updated weights for policy 0, policy_version 41770 (0.0006) [2023-03-07 08:30:17,508][155452] Updated weights for policy 0, policy_version 41780 (0.0006) [2023-03-07 08:30:18,284][155452] Updated weights for policy 0, policy_version 41790 (0.0007) [2023-03-07 08:30:18,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13048.2). Total num frames: 42793984. Throughput: 0: 13027.1. Samples: 42760198. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:30:18,367][155126] Avg episode reward: [(0, '1490.413')] [2023-03-07 08:30:19,074][155452] Updated weights for policy 0, policy_version 41800 (0.0006) [2023-03-07 08:30:19,868][155452] Updated weights for policy 0, policy_version 41810 (0.0006) [2023-03-07 08:30:20,651][155452] Updated weights for policy 0, policy_version 41820 (0.0006) [2023-03-07 08:30:21,437][155452] Updated weights for policy 0, policy_version 41830 (0.0006) [2023-03-07 08:30:22,206][155452] Updated weights for policy 0, policy_version 41840 (0.0006) [2023-03-07 08:30:23,017][155452] Updated weights for policy 0, policy_version 41850 (0.0007) [2023-03-07 08:30:23,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13044.7). Total num frames: 42858496. Throughput: 0: 13028.7. Samples: 42838586. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:30:23,368][155126] Avg episode reward: [(0, '1693.219')] [2023-03-07 08:30:23,794][155452] Updated weights for policy 0, policy_version 41860 (0.0007) [2023-03-07 08:30:24,578][155452] Updated weights for policy 0, policy_version 41870 (0.0006) [2023-03-07 08:30:25,365][155452] Updated weights for policy 0, policy_version 41880 (0.0006) [2023-03-07 08:30:26,160][155452] Updated weights for policy 0, policy_version 41890 (0.0006) [2023-03-07 08:30:26,930][155452] Updated weights for policy 0, policy_version 41900 (0.0007) [2023-03-07 08:30:27,707][155452] Updated weights for policy 0, policy_version 41910 (0.0006) [2023-03-07 08:30:28,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13048.2). Total num frames: 42924032. Throughput: 0: 13037.0. Samples: 42917036. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:30:28,367][155126] Avg episode reward: [(0, '1367.667')] [2023-03-07 08:30:28,489][155452] Updated weights for policy 0, policy_version 41920 (0.0006) [2023-03-07 08:30:29,294][155452] Updated weights for policy 0, policy_version 41930 (0.0006) [2023-03-07 08:30:30,078][155452] Updated weights for policy 0, policy_version 41940 (0.0007) [2023-03-07 08:30:30,869][155452] Updated weights for policy 0, policy_version 41950 (0.0007) [2023-03-07 08:30:31,677][155452] Updated weights for policy 0, policy_version 41960 (0.0006) [2023-03-07 08:30:32,447][155452] Updated weights for policy 0, policy_version 41970 (0.0006) [2023-03-07 08:30:33,220][155452] Updated weights for policy 0, policy_version 41980 (0.0006) [2023-03-07 08:30:33,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13048.2). Total num frames: 42988544. Throughput: 0: 13026.7. Samples: 42955802. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:30:33,367][155126] Avg episode reward: [(0, '1593.606')] [2023-03-07 08:30:34,005][155452] Updated weights for policy 0, policy_version 41990 (0.0007) [2023-03-07 08:30:34,799][155452] Updated weights for policy 0, policy_version 42000 (0.0007) [2023-03-07 08:30:35,587][155452] Updated weights for policy 0, policy_version 42010 (0.0006) [2023-03-07 08:30:36,363][155452] Updated weights for policy 0, policy_version 42020 (0.0006) [2023-03-07 08:30:37,149][155452] Updated weights for policy 0, policy_version 42030 (0.0006) [2023-03-07 08:30:37,931][155452] Updated weights for policy 0, policy_version 42040 (0.0006) [2023-03-07 08:30:38,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13048.2). Total num frames: 43054080. Throughput: 0: 13029.2. Samples: 43034002. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:30:38,368][155126] Avg episode reward: [(0, '1710.821')] [2023-03-07 08:30:38,711][155452] Updated weights for policy 0, policy_version 42050 (0.0005) [2023-03-07 08:30:39,506][155452] Updated weights for policy 0, policy_version 42060 (0.0006) [2023-03-07 08:30:40,289][155452] Updated weights for policy 0, policy_version 42070 (0.0006) [2023-03-07 08:30:41,076][155452] Updated weights for policy 0, policy_version 42080 (0.0006) [2023-03-07 08:30:41,868][155452] Updated weights for policy 0, policy_version 42090 (0.0006) [2023-03-07 08:30:42,653][155452] Updated weights for policy 0, policy_version 42100 (0.0006) [2023-03-07 08:30:43,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13048.2). Total num frames: 43119616. Throughput: 0: 13035.9. Samples: 43112205. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:30:43,368][155126] Avg episode reward: [(0, '1665.272')] [2023-03-07 08:30:43,430][155452] Updated weights for policy 0, policy_version 42110 (0.0006) [2023-03-07 08:30:44,210][155452] Updated weights for policy 0, policy_version 42120 (0.0006) [2023-03-07 08:30:44,982][155452] Updated weights for policy 0, policy_version 42130 (0.0006) [2023-03-07 08:30:45,781][155452] Updated weights for policy 0, policy_version 42140 (0.0006) [2023-03-07 08:30:46,566][155452] Updated weights for policy 0, policy_version 42150 (0.0005) [2023-03-07 08:30:47,348][155452] Updated weights for policy 0, policy_version 42160 (0.0006) [2023-03-07 08:30:48,134][155452] Updated weights for policy 0, policy_version 42170 (0.0006) [2023-03-07 08:30:48,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13048.2). Total num frames: 43185152. Throughput: 0: 13043.8. Samples: 43151666. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:30:48,368][155126] Avg episode reward: [(0, '1700.033')] [2023-03-07 08:30:48,921][155452] Updated weights for policy 0, policy_version 42180 (0.0006) [2023-03-07 08:30:49,693][155452] Updated weights for policy 0, policy_version 42190 (0.0006) [2023-03-07 08:30:50,470][155452] Updated weights for policy 0, policy_version 42200 (0.0006) [2023-03-07 08:30:51,265][155452] Updated weights for policy 0, policy_version 42210 (0.0007) [2023-03-07 08:30:52,039][155452] Updated weights for policy 0, policy_version 42220 (0.0006) [2023-03-07 08:30:52,838][155452] Updated weights for policy 0, policy_version 42230 (0.0006) [2023-03-07 08:30:53,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13048.2). Total num frames: 43249664. Throughput: 0: 13035.3. Samples: 43229974. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:30:53,367][155126] Avg episode reward: [(0, '1508.731')] [2023-03-07 08:30:53,626][155452] Updated weights for policy 0, policy_version 42240 (0.0007) [2023-03-07 08:30:54,415][155452] Updated weights for policy 0, policy_version 42250 (0.0006) [2023-03-07 08:30:55,202][155452] Updated weights for policy 0, policy_version 42260 (0.0006) [2023-03-07 08:30:55,988][155452] Updated weights for policy 0, policy_version 42270 (0.0006) [2023-03-07 08:30:56,786][155452] Updated weights for policy 0, policy_version 42280 (0.0006) [2023-03-07 08:30:57,565][155452] Updated weights for policy 0, policy_version 42290 (0.0005) [2023-03-07 08:30:58,361][155452] Updated weights for policy 0, policy_version 42300 (0.0006) [2023-03-07 08:30:58,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13038.9, 300 sec: 13048.2). Total num frames: 43315200. Throughput: 0: 13038.9. Samples: 43307813. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:30:58,378][155126] Avg episode reward: [(0, '1626.563')] [2023-03-07 08:30:59,136][155452] Updated weights for policy 0, policy_version 42310 (0.0006) [2023-03-07 08:30:59,935][155452] Updated weights for policy 0, policy_version 42320 (0.0006) [2023-03-07 08:31:00,713][155452] Updated weights for policy 0, policy_version 42330 (0.0006) [2023-03-07 08:31:01,491][155452] Updated weights for policy 0, policy_version 42340 (0.0006) [2023-03-07 08:31:02,274][155452] Updated weights for policy 0, policy_version 42350 (0.0006) [2023-03-07 08:31:03,056][155452] Updated weights for policy 0, policy_version 42360 (0.0006) [2023-03-07 08:31:03,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13039.0, 300 sec: 13048.2). Total num frames: 43380736. Throughput: 0: 13038.6. Samples: 43346935. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:31:03,367][155126] Avg episode reward: [(0, '1856.399')] [2023-03-07 08:31:03,856][155452] Updated weights for policy 0, policy_version 42370 (0.0005) [2023-03-07 08:31:04,648][155452] Updated weights for policy 0, policy_version 42380 (0.0006) [2023-03-07 08:31:05,414][155452] Updated weights for policy 0, policy_version 42390 (0.0006) [2023-03-07 08:31:06,223][155452] Updated weights for policy 0, policy_version 42400 (0.0006) [2023-03-07 08:31:06,990][155452] Updated weights for policy 0, policy_version 42410 (0.0006) [2023-03-07 08:31:07,769][155452] Updated weights for policy 0, policy_version 42420 (0.0006) [2023-03-07 08:31:08,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13038.9, 300 sec: 13044.7). Total num frames: 43445248. Throughput: 0: 13037.8. Samples: 43425286. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:31:08,367][155126] Avg episode reward: [(0, '1825.884')] [2023-03-07 08:31:08,558][155452] Updated weights for policy 0, policy_version 42430 (0.0007) [2023-03-07 08:31:09,359][155452] Updated weights for policy 0, policy_version 42440 (0.0006) [2023-03-07 08:31:10,140][155452] Updated weights for policy 0, policy_version 42450 (0.0006) [2023-03-07 08:31:10,947][155452] Updated weights for policy 0, policy_version 42460 (0.0006) [2023-03-07 08:31:11,720][155452] Updated weights for policy 0, policy_version 42470 (0.0006) [2023-03-07 08:31:12,512][155452] Updated weights for policy 0, policy_version 42480 (0.0006) [2023-03-07 08:31:13,285][155452] Updated weights for policy 0, policy_version 42490 (0.0007) [2023-03-07 08:31:13,367][155126] Fps is (10 sec: 12902.4, 60 sec: 13021.9, 300 sec: 13041.3). Total num frames: 43509760. Throughput: 0: 13029.4. Samples: 43503357. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:31:13,367][155126] Avg episode reward: [(0, '1662.515')] [2023-03-07 08:31:14,057][155452] Updated weights for policy 0, policy_version 42500 (0.0006) [2023-03-07 08:31:14,854][155452] Updated weights for policy 0, policy_version 42510 (0.0006) [2023-03-07 08:31:15,628][155452] Updated weights for policy 0, policy_version 42520 (0.0006) [2023-03-07 08:31:16,413][155452] Updated weights for policy 0, policy_version 42530 (0.0007) [2023-03-07 08:31:17,198][155452] Updated weights for policy 0, policy_version 42540 (0.0006) [2023-03-07 08:31:17,997][155452] Updated weights for policy 0, policy_version 42550 (0.0006) [2023-03-07 08:31:18,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13041.2). Total num frames: 43575296. Throughput: 0: 13041.1. Samples: 43542650. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:31:18,367][155126] Avg episode reward: [(0, '1837.505')] [2023-03-07 08:31:18,768][155452] Updated weights for policy 0, policy_version 42560 (0.0006) [2023-03-07 08:31:19,553][155452] Updated weights for policy 0, policy_version 42570 (0.0007) [2023-03-07 08:31:20,333][155452] Updated weights for policy 0, policy_version 42580 (0.0006) [2023-03-07 08:31:21,105][155452] Updated weights for policy 0, policy_version 42590 (0.0006) [2023-03-07 08:31:21,897][155452] Updated weights for policy 0, policy_version 42600 (0.0007) [2023-03-07 08:31:22,686][155452] Updated weights for policy 0, policy_version 42610 (0.0006) [2023-03-07 08:31:23,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13041.2). Total num frames: 43640832. Throughput: 0: 13043.6. Samples: 43620962. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:31:23,367][155126] Avg episode reward: [(0, '1653.976')] [2023-03-07 08:31:23,487][155452] Updated weights for policy 0, policy_version 42620 (0.0006) [2023-03-07 08:31:24,261][155452] Updated weights for policy 0, policy_version 42630 (0.0005) [2023-03-07 08:31:25,047][155452] Updated weights for policy 0, policy_version 42640 (0.0006) [2023-03-07 08:31:25,845][155452] Updated weights for policy 0, policy_version 42650 (0.0006) [2023-03-07 08:31:26,609][155452] Updated weights for policy 0, policy_version 42660 (0.0007) [2023-03-07 08:31:27,419][155452] Updated weights for policy 0, policy_version 42670 (0.0006) [2023-03-07 08:31:28,183][155452] Updated weights for policy 0, policy_version 42680 (0.0006) [2023-03-07 08:31:28,367][155126] Fps is (10 sec: 13107.0, 60 sec: 13038.9, 300 sec: 13044.7). Total num frames: 43706368. Throughput: 0: 13043.9. Samples: 43699183. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:31:28,368][155126] Avg episode reward: [(0, '1717.797')] [2023-03-07 08:31:28,373][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000042682_43706368.pth... 
[2023-03-07 08:31:28,403][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000039624_40574976.pth [2023-03-07 08:31:28,965][155452] Updated weights for policy 0, policy_version 42690 (0.0006) [2023-03-07 08:31:29,759][155452] Updated weights for policy 0, policy_version 42700 (0.0006) [2023-03-07 08:31:30,546][155452] Updated weights for policy 0, policy_version 42710 (0.0006) [2023-03-07 08:31:31,336][155452] Updated weights for policy 0, policy_version 42720 (0.0006) [2023-03-07 08:31:32,131][155452] Updated weights for policy 0, policy_version 42730 (0.0005) [2023-03-07 08:31:32,898][155452] Updated weights for policy 0, policy_version 42740 (0.0006) [2023-03-07 08:31:33,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13041.2). Total num frames: 43770880. Throughput: 0: 13037.3. Samples: 43738344. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:31:33,367][155126] Avg episode reward: [(0, '1729.252')] [2023-03-07 08:31:33,695][155452] Updated weights for policy 0, policy_version 42750 (0.0006) [2023-03-07 08:31:34,475][155452] Updated weights for policy 0, policy_version 42760 (0.0006) [2023-03-07 08:31:35,258][155452] Updated weights for policy 0, policy_version 42770 (0.0006) [2023-03-07 08:31:36,047][155452] Updated weights for policy 0, policy_version 42780 (0.0006) [2023-03-07 08:31:36,838][155452] Updated weights for policy 0, policy_version 42790 (0.0006) [2023-03-07 08:31:37,634][155452] Updated weights for policy 0, policy_version 42800 (0.0006) [2023-03-07 08:31:38,367][155126] Fps is (10 sec: 13005.1, 60 sec: 13039.0, 300 sec: 13041.2). Total num frames: 43836416. Throughput: 0: 13032.5. Samples: 43816433. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:31:38,367][155126] Avg episode reward: [(0, '1732.294')] [2023-03-07 08:31:38,422][155452] Updated weights for policy 0, policy_version 42810 (0.0005) [2023-03-07 08:31:39,228][155452] Updated weights for policy 0, policy_version 42820 (0.0006) [2023-03-07 08:31:40,006][155452] Updated weights for policy 0, policy_version 42830 (0.0006) [2023-03-07 08:31:40,804][155452] Updated weights for policy 0, policy_version 42840 (0.0005) [2023-03-07 08:31:41,596][155452] Updated weights for policy 0, policy_version 42850 (0.0006) [2023-03-07 08:31:42,382][155452] Updated weights for policy 0, policy_version 42860 (0.0006) [2023-03-07 08:31:43,175][155452] Updated weights for policy 0, policy_version 42870 (0.0006) [2023-03-07 08:31:43,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13041.3). Total num frames: 43900928. Throughput: 0: 13024.9. Samples: 43893931. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:31:43,367][155126] Avg episode reward: [(0, '1873.597')] [2023-03-07 08:31:43,976][155452] Updated weights for policy 0, policy_version 42880 (0.0007) [2023-03-07 08:31:44,782][155452] Updated weights for policy 0, policy_version 42890 (0.0006) [2023-03-07 08:31:45,549][155452] Updated weights for policy 0, policy_version 42900 (0.0006) [2023-03-07 08:31:46,322][155452] Updated weights for policy 0, policy_version 42910 (0.0006) [2023-03-07 08:31:47,109][155452] Updated weights for policy 0, policy_version 42920 (0.0007) [2023-03-07 08:31:47,892][155452] Updated weights for policy 0, policy_version 42930 (0.0006) [2023-03-07 08:31:48,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13041.2). Total num frames: 43966464. Throughput: 0: 13020.9. Samples: 43932878. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:31:48,367][155126] Avg episode reward: [(0, '1616.391')] [2023-03-07 08:31:48,665][155452] Updated weights for policy 0, policy_version 42940 (0.0006) [2023-03-07 08:31:49,464][155452] Updated weights for policy 0, policy_version 42950 (0.0007) [2023-03-07 08:31:50,254][155452] Updated weights for policy 0, policy_version 42960 (0.0006) [2023-03-07 08:31:51,055][155452] Updated weights for policy 0, policy_version 42970 (0.0006) [2023-03-07 08:31:51,840][155452] Updated weights for policy 0, policy_version 42980 (0.0006) [2023-03-07 08:31:52,638][155452] Updated weights for policy 0, policy_version 42990 (0.0006) [2023-03-07 08:31:53,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13037.8). Total num frames: 44030976. Throughput: 0: 13011.6. Samples: 44010804. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:31:53,367][155126] Avg episode reward: [(0, '1635.249')] [2023-03-07 08:31:53,414][155452] Updated weights for policy 0, policy_version 43000 (0.0006) [2023-03-07 08:31:54,197][155452] Updated weights for policy 0, policy_version 43010 (0.0006) [2023-03-07 08:31:54,983][155452] Updated weights for policy 0, policy_version 43020 (0.0006) [2023-03-07 08:31:55,782][155452] Updated weights for policy 0, policy_version 43030 (0.0006) [2023-03-07 08:31:56,589][155452] Updated weights for policy 0, policy_version 43040 (0.0005) [2023-03-07 08:31:57,361][155452] Updated weights for policy 0, policy_version 43050 (0.0006) [2023-03-07 08:31:58,166][155452] Updated weights for policy 0, policy_version 43060 (0.0006) [2023-03-07 08:31:58,367][155126] Fps is (10 sec: 12902.4, 60 sec: 13004.8, 300 sec: 13037.8). Total num frames: 44095488. Throughput: 0: 13007.5. Samples: 44088695. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:31:58,367][155126] Avg episode reward: [(0, '1715.756')] [2023-03-07 08:31:58,974][155452] Updated weights for policy 0, policy_version 43070 (0.0007) [2023-03-07 08:31:59,756][155452] Updated weights for policy 0, policy_version 43080 (0.0006) [2023-03-07 08:32:00,533][155452] Updated weights for policy 0, policy_version 43090 (0.0005) [2023-03-07 08:32:01,320][155452] Updated weights for policy 0, policy_version 43100 (0.0006) [2023-03-07 08:32:02,104][155452] Updated weights for policy 0, policy_version 43110 (0.0006) [2023-03-07 08:32:02,890][155452] Updated weights for policy 0, policy_version 43120 (0.0006) [2023-03-07 08:32:03,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13037.8). Total num frames: 44161024. Throughput: 0: 12995.7. Samples: 44127456. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:32:03,367][155126] Avg episode reward: [(0, '1593.354')] [2023-03-07 08:32:03,696][155452] Updated weights for policy 0, policy_version 43130 (0.0007) [2023-03-07 08:32:04,488][155452] Updated weights for policy 0, policy_version 43140 (0.0006) [2023-03-07 08:32:05,261][155452] Updated weights for policy 0, policy_version 43150 (0.0006) [2023-03-07 08:32:06,058][155452] Updated weights for policy 0, policy_version 43160 (0.0007) [2023-03-07 08:32:06,840][155452] Updated weights for policy 0, policy_version 43170 (0.0007) [2023-03-07 08:32:07,618][155452] Updated weights for policy 0, policy_version 43180 (0.0006) [2023-03-07 08:32:08,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13004.8, 300 sec: 13034.3). Total num frames: 44225536. Throughput: 0: 12994.4. Samples: 44205711. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:32:08,367][155126] Avg episode reward: [(0, '1495.369')] [2023-03-07 08:32:08,390][155452] Updated weights for policy 0, policy_version 43190 (0.0006) [2023-03-07 08:32:09,198][155452] Updated weights for policy 0, policy_version 43200 (0.0006) [2023-03-07 08:32:09,977][155452] Updated weights for policy 0, policy_version 43210 (0.0006) [2023-03-07 08:32:10,778][155452] Updated weights for policy 0, policy_version 43220 (0.0006) [2023-03-07 08:32:11,559][155452] Updated weights for policy 0, policy_version 43230 (0.0006) [2023-03-07 08:32:12,333][155452] Updated weights for policy 0, policy_version 43240 (0.0006) [2023-03-07 08:32:13,130][155452] Updated weights for policy 0, policy_version 43250 (0.0007) [2023-03-07 08:32:13,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13021.8, 300 sec: 13037.8). Total num frames: 44291072. Throughput: 0: 12990.9. Samples: 44283771. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:32:13,368][155126] Avg episode reward: [(0, '1584.922')] [2023-03-07 08:32:13,914][155452] Updated weights for policy 0, policy_version 43260 (0.0006) [2023-03-07 08:32:14,707][155452] Updated weights for policy 0, policy_version 43270 (0.0006) [2023-03-07 08:32:15,491][155452] Updated weights for policy 0, policy_version 43280 (0.0006) [2023-03-07 08:32:16,290][155452] Updated weights for policy 0, policy_version 43290 (0.0006) [2023-03-07 08:32:17,070][155452] Updated weights for policy 0, policy_version 43300 (0.0006) [2023-03-07 08:32:17,853][155452] Updated weights for policy 0, policy_version 43310 (0.0005) [2023-03-07 08:32:18,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13037.8). Total num frames: 44355584. Throughput: 0: 12985.5. Samples: 44322693. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:32:18,367][155126] Avg episode reward: [(0, '1625.191')] [2023-03-07 08:32:18,654][155452] Updated weights for policy 0, policy_version 43320 (0.0007) [2023-03-07 08:32:19,416][155452] Updated weights for policy 0, policy_version 43330 (0.0006) [2023-03-07 08:32:20,202][155452] Updated weights for policy 0, policy_version 43340 (0.0006) [2023-03-07 08:32:20,964][155452] Updated weights for policy 0, policy_version 43350 (0.0006) [2023-03-07 08:32:21,755][155452] Updated weights for policy 0, policy_version 43360 (0.0006) [2023-03-07 08:32:22,546][155452] Updated weights for policy 0, policy_version 43370 (0.0006) [2023-03-07 08:32:23,334][155452] Updated weights for policy 0, policy_version 43380 (0.0006) [2023-03-07 08:32:23,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13004.8, 300 sec: 13037.8). Total num frames: 44421120. Throughput: 0: 12989.8. Samples: 44400975. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:32:23,367][155126] Avg episode reward: [(0, '1465.149')] [2023-03-07 08:32:24,114][155452] Updated weights for policy 0, policy_version 43390 (0.0006) [2023-03-07 08:32:24,882][155452] Updated weights for policy 0, policy_version 43400 (0.0007) [2023-03-07 08:32:25,669][155452] Updated weights for policy 0, policy_version 43410 (0.0006) [2023-03-07 08:32:26,469][155452] Updated weights for policy 0, policy_version 43420 (0.0007) [2023-03-07 08:32:27,263][155452] Updated weights for policy 0, policy_version 43430 (0.0006) [2023-03-07 08:32:28,040][155452] Updated weights for policy 0, policy_version 43440 (0.0006) [2023-03-07 08:32:28,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13004.8, 300 sec: 13037.8). Total num frames: 44486656. Throughput: 0: 13010.5. Samples: 44479403. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:32:28,367][155126] Avg episode reward: [(0, '1665.799')] [2023-03-07 08:32:28,798][155452] Updated weights for policy 0, policy_version 43450 (0.0006) [2023-03-07 08:32:29,613][155452] Updated weights for policy 0, policy_version 43460 (0.0005) [2023-03-07 08:32:30,378][155452] Updated weights for policy 0, policy_version 43470 (0.0007) [2023-03-07 08:32:31,169][155452] Updated weights for policy 0, policy_version 43480 (0.0007) [2023-03-07 08:32:31,939][155452] Updated weights for policy 0, policy_version 43490 (0.0006) [2023-03-07 08:32:32,711][155452] Updated weights for policy 0, policy_version 43500 (0.0005) [2023-03-07 08:32:33,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13021.9, 300 sec: 13037.8). Total num frames: 44552192. Throughput: 0: 13018.8. Samples: 44518722. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:32:33,367][155126] Avg episode reward: [(0, '1749.797')] [2023-03-07 08:32:33,505][155452] Updated weights for policy 0, policy_version 43510 (0.0006) [2023-03-07 08:32:34,277][155452] Updated weights for policy 0, policy_version 43520 (0.0006) [2023-03-07 08:32:35,052][155452] Updated weights for policy 0, policy_version 43530 (0.0006) [2023-03-07 08:32:35,837][155452] Updated weights for policy 0, policy_version 43540 (0.0006) [2023-03-07 08:32:36,622][155452] Updated weights for policy 0, policy_version 43550 (0.0006) [2023-03-07 08:32:37,405][155452] Updated weights for policy 0, policy_version 43560 (0.0006) [2023-03-07 08:32:38,192][155452] Updated weights for policy 0, policy_version 43570 (0.0006) [2023-03-07 08:32:38,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13021.8, 300 sec: 13041.2). Total num frames: 44617728. Throughput: 0: 13038.1. Samples: 44597519. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:32:38,368][155126] Avg episode reward: [(0, '1672.822')] [2023-03-07 08:32:38,959][155452] Updated weights for policy 0, policy_version 43580 (0.0006) [2023-03-07 08:32:39,758][155452] Updated weights for policy 0, policy_version 43590 (0.0006) [2023-03-07 08:32:40,553][155452] Updated weights for policy 0, policy_version 43600 (0.0006) [2023-03-07 08:32:41,341][155452] Updated weights for policy 0, policy_version 43610 (0.0006) [2023-03-07 08:32:42,117][155452] Updated weights for policy 0, policy_version 43620 (0.0006) [2023-03-07 08:32:42,913][155452] Updated weights for policy 0, policy_version 43630 (0.0006) [2023-03-07 08:32:43,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13021.8, 300 sec: 13037.8). Total num frames: 44682240. Throughput: 0: 13043.7. Samples: 44675661. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:32:43,368][155126] Avg episode reward: [(0, '1789.745')] [2023-03-07 08:32:43,701][155452] Updated weights for policy 0, policy_version 43640 (0.0005) [2023-03-07 08:32:44,486][155452] Updated weights for policy 0, policy_version 43650 (0.0005) [2023-03-07 08:32:45,266][155452] Updated weights for policy 0, policy_version 43660 (0.0007) [2023-03-07 08:32:46,054][155452] Updated weights for policy 0, policy_version 43670 (0.0005) [2023-03-07 08:32:46,843][155452] Updated weights for policy 0, policy_version 43680 (0.0006) [2023-03-07 08:32:47,621][155452] Updated weights for policy 0, policy_version 43690 (0.0006) [2023-03-07 08:32:48,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13041.2). Total num frames: 44747776. Throughput: 0: 13048.7. Samples: 44714650. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:32:48,378][155126] Avg episode reward: [(0, '1606.832')] [2023-03-07 08:32:48,415][155452] Updated weights for policy 0, policy_version 43700 (0.0006) [2023-03-07 08:32:49,192][155452] Updated weights for policy 0, policy_version 43710 (0.0006) [2023-03-07 08:32:49,971][155452] Updated weights for policy 0, policy_version 43720 (0.0007) [2023-03-07 08:32:50,764][155452] Updated weights for policy 0, policy_version 43730 (0.0006) [2023-03-07 08:32:51,540][155452] Updated weights for policy 0, policy_version 43740 (0.0006) [2023-03-07 08:32:52,329][155452] Updated weights for policy 0, policy_version 43750 (0.0006) [2023-03-07 08:32:53,116][155452] Updated weights for policy 0, policy_version 43760 (0.0006) [2023-03-07 08:32:53,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13041.2). Total num frames: 44813312. Throughput: 0: 13053.6. Samples: 44793126. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:32:53,368][155126] Avg episode reward: [(0, '1679.550')] [2023-03-07 08:32:53,883][155452] Updated weights for policy 0, policy_version 43770 (0.0006) [2023-03-07 08:32:54,674][155452] Updated weights for policy 0, policy_version 43780 (0.0005) [2023-03-07 08:32:55,448][155452] Updated weights for policy 0, policy_version 43790 (0.0006) [2023-03-07 08:32:56,236][155452] Updated weights for policy 0, policy_version 43800 (0.0006) [2023-03-07 08:32:57,036][155452] Updated weights for policy 0, policy_version 43810 (0.0006) [2023-03-07 08:32:57,814][155452] Updated weights for policy 0, policy_version 43820 (0.0005) [2023-03-07 08:32:58,367][155126] Fps is (10 sec: 13107.4, 60 sec: 13056.0, 300 sec: 13044.7). Total num frames: 44878848. Throughput: 0: 13060.9. Samples: 44871511. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:32:58,367][155126] Avg episode reward: [(0, '1792.319')] [2023-03-07 08:32:58,589][155452] Updated weights for policy 0, policy_version 43830 (0.0006) [2023-03-07 08:32:59,374][155452] Updated weights for policy 0, policy_version 43840 (0.0006) [2023-03-07 08:33:00,158][155452] Updated weights for policy 0, policy_version 43850 (0.0006) [2023-03-07 08:33:00,952][155452] Updated weights for policy 0, policy_version 43860 (0.0006) [2023-03-07 08:33:01,757][155452] Updated weights for policy 0, policy_version 43870 (0.0006) [2023-03-07 08:33:02,539][155452] Updated weights for policy 0, policy_version 43880 (0.0006) [2023-03-07 08:33:03,328][155452] Updated weights for policy 0, policy_version 43890 (0.0006) [2023-03-07 08:33:03,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13041.2). Total num frames: 44943360. Throughput: 0: 13063.3. Samples: 44910542. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:33:03,368][155126] Avg episode reward: [(0, '1700.390')] [2023-03-07 08:33:04,116][155452] Updated weights for policy 0, policy_version 43900 (0.0006) [2023-03-07 08:33:04,909][155452] Updated weights for policy 0, policy_version 43910 (0.0006) [2023-03-07 08:33:05,701][155452] Updated weights for policy 0, policy_version 43920 (0.0006) [2023-03-07 08:33:06,486][155452] Updated weights for policy 0, policy_version 43930 (0.0006) [2023-03-07 08:33:07,261][155452] Updated weights for policy 0, policy_version 43940 (0.0006) [2023-03-07 08:33:08,076][155452] Updated weights for policy 0, policy_version 43950 (0.0006) [2023-03-07 08:33:08,367][155126] Fps is (10 sec: 12902.3, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 45007872. Throughput: 0: 13054.8. Samples: 44988440. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:33:08,368][155126] Avg episode reward: [(0, '1663.817')] [2023-03-07 08:33:08,851][155452] Updated weights for policy 0, policy_version 43960 (0.0007) [2023-03-07 08:33:09,634][155452] Updated weights for policy 0, policy_version 43970 (0.0006) [2023-03-07 08:33:10,420][155452] Updated weights for policy 0, policy_version 43980 (0.0006) [2023-03-07 08:33:11,206][155452] Updated weights for policy 0, policy_version 43990 (0.0008) [2023-03-07 08:33:12,011][155452] Updated weights for policy 0, policy_version 44000 (0.0006) [2023-03-07 08:33:12,790][155452] Updated weights for policy 0, policy_version 44010 (0.0006) [2023-03-07 08:33:13,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 45073408. Throughput: 0: 13043.8. Samples: 45066375. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:33:13,367][155126] Avg episode reward: [(0, '1907.335')] [2023-03-07 08:33:13,574][155452] Updated weights for policy 0, policy_version 44020 (0.0006) [2023-03-07 08:33:14,355][155452] Updated weights for policy 0, policy_version 44030 (0.0006) [2023-03-07 08:33:15,128][155452] Updated weights for policy 0, policy_version 44040 (0.0006) [2023-03-07 08:33:15,914][155452] Updated weights for policy 0, policy_version 44050 (0.0007) [2023-03-07 08:33:16,713][155452] Updated weights for policy 0, policy_version 44060 (0.0006) [2023-03-07 08:33:17,477][155452] Updated weights for policy 0, policy_version 44070 (0.0006) [2023-03-07 08:33:18,275][155452] Updated weights for policy 0, policy_version 44080 (0.0006) [2023-03-07 08:33:18,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 45138944. Throughput: 0: 13043.8. Samples: 45105695. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:33:18,368][155126] Avg episode reward: [(0, '1806.603')] [2023-03-07 08:33:19,071][155452] Updated weights for policy 0, policy_version 44090 (0.0005) [2023-03-07 08:33:19,865][155452] Updated weights for policy 0, policy_version 44100 (0.0006) [2023-03-07 08:33:20,650][155452] Updated weights for policy 0, policy_version 44110 (0.0007) [2023-03-07 08:33:21,443][155452] Updated weights for policy 0, policy_version 44120 (0.0006) [2023-03-07 08:33:22,229][155452] Updated weights for policy 0, policy_version 44130 (0.0006) [2023-03-07 08:33:23,026][155452] Updated weights for policy 0, policy_version 44140 (0.0006) [2023-03-07 08:33:23,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 45203456. Throughput: 0: 13020.4. Samples: 45183434. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:33:23,368][155126] Avg episode reward: [(0, '1818.874')] [2023-03-07 08:33:23,830][155452] Updated weights for policy 0, policy_version 44150 (0.0005) [2023-03-07 08:33:24,598][155452] Updated weights for policy 0, policy_version 44160 (0.0006) [2023-03-07 08:33:25,385][155452] Updated weights for policy 0, policy_version 44170 (0.0006) [2023-03-07 08:33:26,182][155452] Updated weights for policy 0, policy_version 44180 (0.0007) [2023-03-07 08:33:26,961][155452] Updated weights for policy 0, policy_version 44190 (0.0006) [2023-03-07 08:33:27,729][155452] Updated weights for policy 0, policy_version 44200 (0.0006) [2023-03-07 08:33:28,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 45268992. Throughput: 0: 13016.3. Samples: 45261395. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:33:28,368][155126] Avg episode reward: [(0, '1718.773')] [2023-03-07 08:33:28,371][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000044208_45268992.pth... 
[2023-03-07 08:33:28,401][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000041154_42141696.pth [2023-03-07 08:33:28,526][155452] Updated weights for policy 0, policy_version 44210 (0.0006) [2023-03-07 08:33:29,297][155452] Updated weights for policy 0, policy_version 44220 (0.0006) [2023-03-07 08:33:30,114][155452] Updated weights for policy 0, policy_version 44230 (0.0006) [2023-03-07 08:33:30,896][155452] Updated weights for policy 0, policy_version 44240 (0.0006) [2023-03-07 08:33:31,672][155452] Updated weights for policy 0, policy_version 44250 (0.0007) [2023-03-07 08:33:32,475][155452] Updated weights for policy 0, policy_version 44260 (0.0007) [2023-03-07 08:33:33,277][155452] Updated weights for policy 0, policy_version 44270 (0.0007) [2023-03-07 08:33:33,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 45333504. Throughput: 0: 13019.4. Samples: 45300520. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:33:33,367][155126] Avg episode reward: [(0, '1959.152')] [2023-03-07 08:33:34,055][155452] Updated weights for policy 0, policy_version 44280 (0.0006) [2023-03-07 08:33:34,838][155452] Updated weights for policy 0, policy_version 44290 (0.0006) [2023-03-07 08:33:35,628][155452] Updated weights for policy 0, policy_version 44300 (0.0007) [2023-03-07 08:33:36,411][155452] Updated weights for policy 0, policy_version 44310 (0.0006) [2023-03-07 08:33:37,201][155452] Updated weights for policy 0, policy_version 44320 (0.0006) [2023-03-07 08:33:37,992][155452] Updated weights for policy 0, policy_version 44330 (0.0005) [2023-03-07 08:33:38,367][155126] Fps is (10 sec: 12902.7, 60 sec: 13004.8, 300 sec: 13030.8). Total num frames: 45398016. Throughput: 0: 13006.0. Samples: 45378393. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:33:38,367][155126] Avg episode reward: [(0, '1755.699')] [2023-03-07 08:33:38,785][155452] Updated weights for policy 0, policy_version 44340 (0.0006) [2023-03-07 08:33:39,579][155452] Updated weights for policy 0, policy_version 44350 (0.0006) [2023-03-07 08:33:40,357][155452] Updated weights for policy 0, policy_version 44360 (0.0007) [2023-03-07 08:33:41,145][155452] Updated weights for policy 0, policy_version 44370 (0.0006) [2023-03-07 08:33:41,947][155452] Updated weights for policy 0, policy_version 44380 (0.0006) [2023-03-07 08:33:42,744][155452] Updated weights for policy 0, policy_version 44390 (0.0006) [2023-03-07 08:33:43,367][155126] Fps is (10 sec: 12902.2, 60 sec: 13004.8, 300 sec: 13030.8). Total num frames: 45462528. Throughput: 0: 12992.1. Samples: 45456156. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:33:43,368][155126] Avg episode reward: [(0, '1695.635')] [2023-03-07 08:33:43,527][155452] Updated weights for policy 0, policy_version 44400 (0.0006) [2023-03-07 08:33:44,317][155452] Updated weights for policy 0, policy_version 44410 (0.0006) [2023-03-07 08:33:45,098][155452] Updated weights for policy 0, policy_version 44420 (0.0006) [2023-03-07 08:33:45,886][155452] Updated weights for policy 0, policy_version 44430 (0.0006) [2023-03-07 08:33:46,668][155452] Updated weights for policy 0, policy_version 44440 (0.0007) [2023-03-07 08:33:47,445][155452] Updated weights for policy 0, policy_version 44450 (0.0006) [2023-03-07 08:33:48,233][155452] Updated weights for policy 0, policy_version 44460 (0.0006) [2023-03-07 08:33:48,367][155126] Fps is (10 sec: 13004.5, 60 sec: 13004.8, 300 sec: 13030.8). Total num frames: 45528064. Throughput: 0: 12992.9. Samples: 45495222. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:33:48,367][155126] Avg episode reward: [(0, '1920.942')] [2023-03-07 08:33:49,019][155452] Updated weights for policy 0, policy_version 44470 (0.0007) [2023-03-07 08:33:49,785][155452] Updated weights for policy 0, policy_version 44480 (0.0006) [2023-03-07 08:33:50,577][155452] Updated weights for policy 0, policy_version 44490 (0.0007) [2023-03-07 08:33:51,378][155452] Updated weights for policy 0, policy_version 44500 (0.0006) [2023-03-07 08:33:52,154][155452] Updated weights for policy 0, policy_version 44510 (0.0006) [2023-03-07 08:33:52,946][155452] Updated weights for policy 0, policy_version 44520 (0.0005) [2023-03-07 08:33:53,367][155126] Fps is (10 sec: 13107.4, 60 sec: 13004.8, 300 sec: 13030.8). Total num frames: 45593600. Throughput: 0: 13001.2. Samples: 45573491. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:33:53,367][155126] Avg episode reward: [(0, '1923.380')] [2023-03-07 08:33:53,724][155452] Updated weights for policy 0, policy_version 44530 (0.0006) [2023-03-07 08:33:54,525][155452] Updated weights for policy 0, policy_version 44540 (0.0006) [2023-03-07 08:33:55,295][155452] Updated weights for policy 0, policy_version 44550 (0.0006) [2023-03-07 08:33:56,078][155452] Updated weights for policy 0, policy_version 44560 (0.0006) [2023-03-07 08:33:56,858][155452] Updated weights for policy 0, policy_version 44570 (0.0007) [2023-03-07 08:33:57,654][155452] Updated weights for policy 0, policy_version 44580 (0.0006) [2023-03-07 08:33:58,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13004.8, 300 sec: 13030.8). Total num frames: 45659136. Throughput: 0: 13012.4. Samples: 45651936. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:33:58,368][155126] Avg episode reward: [(0, '1776.194')] [2023-03-07 08:33:58,430][155452] Updated weights for policy 0, policy_version 44590 (0.0007) [2023-03-07 08:33:59,201][155452] Updated weights for policy 0, policy_version 44600 (0.0006) [2023-03-07 08:33:59,991][155452] Updated weights for policy 0, policy_version 44610 (0.0006) [2023-03-07 08:34:00,771][155452] Updated weights for policy 0, policy_version 44620 (0.0006) [2023-03-07 08:34:01,570][155452] Updated weights for policy 0, policy_version 44630 (0.0007) [2023-03-07 08:34:02,359][155452] Updated weights for policy 0, policy_version 44640 (0.0006) [2023-03-07 08:34:03,142][155452] Updated weights for policy 0, policy_version 44650 (0.0006) [2023-03-07 08:34:03,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13027.4). Total num frames: 45723648. Throughput: 0: 13009.0. Samples: 45691099. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:34:03,367][155126] Avg episode reward: [(0, '1853.457')] [2023-03-07 08:34:03,949][155452] Updated weights for policy 0, policy_version 44660 (0.0006) [2023-03-07 08:34:04,731][155452] Updated weights for policy 0, policy_version 44670 (0.0006) [2023-03-07 08:34:05,519][155452] Updated weights for policy 0, policy_version 44680 (0.0007) [2023-03-07 08:34:06,312][155452] Updated weights for policy 0, policy_version 44690 (0.0006) [2023-03-07 08:34:07,097][155452] Updated weights for policy 0, policy_version 44700 (0.0006) [2023-03-07 08:34:07,894][155452] Updated weights for policy 0, policy_version 44710 (0.0007) [2023-03-07 08:34:08,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13027.4). Total num frames: 45789184. Throughput: 0: 13011.5. Samples: 45768953. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:34:08,378][155126] Avg episode reward: [(0, '1991.908')] [2023-03-07 08:34:08,667][155452] Updated weights for policy 0, policy_version 44720 (0.0006) [2023-03-07 08:34:09,453][155452] Updated weights for policy 0, policy_version 44730 (0.0006) [2023-03-07 08:34:10,238][155452] Updated weights for policy 0, policy_version 44740 (0.0005) [2023-03-07 08:34:11,024][155452] Updated weights for policy 0, policy_version 44750 (0.0006) [2023-03-07 08:34:11,804][155452] Updated weights for policy 0, policy_version 44760 (0.0006) [2023-03-07 08:34:12,592][155452] Updated weights for policy 0, policy_version 44770 (0.0006) [2023-03-07 08:34:13,367][155452] Updated weights for policy 0, policy_version 44780 (0.0007) [2023-03-07 08:34:13,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13021.9, 300 sec: 13030.8). Total num frames: 45854720. Throughput: 0: 13016.8. Samples: 45847151. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:34:13,378][155126] Avg episode reward: [(0, '1802.253')] [2023-03-07 08:34:14,167][155452] Updated weights for policy 0, policy_version 44790 (0.0006) [2023-03-07 08:34:14,952][155452] Updated weights for policy 0, policy_version 44800 (0.0007) [2023-03-07 08:34:15,740][155452] Updated weights for policy 0, policy_version 44810 (0.0006) [2023-03-07 08:34:16,516][155452] Updated weights for policy 0, policy_version 44820 (0.0006) [2023-03-07 08:34:17,301][155452] Updated weights for policy 0, policy_version 44830 (0.0006) [2023-03-07 08:34:18,091][155452] Updated weights for policy 0, policy_version 44840 (0.0007) [2023-03-07 08:34:18,367][155126] Fps is (10 sec: 13005.1, 60 sec: 13004.9, 300 sec: 13027.4). Total num frames: 45919232. Throughput: 0: 13012.6. Samples: 45886086. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 08:34:18,378][155126] Avg episode reward: [(0, '1838.216')] [2023-03-07 08:34:18,891][155452] Updated weights for policy 0, policy_version 44850 (0.0006) [2023-03-07 08:34:19,679][155452] Updated weights for policy 0, policy_version 44860 (0.0006) [2023-03-07 08:34:20,467][155452] Updated weights for policy 0, policy_version 44870 (0.0006) [2023-03-07 08:34:21,247][155452] Updated weights for policy 0, policy_version 44880 (0.0006) [2023-03-07 08:34:22,034][155452] Updated weights for policy 0, policy_version 44890 (0.0006) [2023-03-07 08:34:22,821][155452] Updated weights for policy 0, policy_version 44900 (0.0006) [2023-03-07 08:34:23,367][155126] Fps is (10 sec: 12902.4, 60 sec: 13004.8, 300 sec: 13023.9). Total num frames: 45983744. Throughput: 0: 13016.6. Samples: 45964143. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 08:34:23,378][155126] Avg episode reward: [(0, '1925.340')] [2023-03-07 08:34:23,613][155452] Updated weights for policy 0, policy_version 44910 (0.0006) [2023-03-07 08:34:24,385][155452] Updated weights for policy 0, policy_version 44920 (0.0006) [2023-03-07 08:34:25,159][155452] Updated weights for policy 0, policy_version 44930 (0.0006) [2023-03-07 08:34:25,949][155452] Updated weights for policy 0, policy_version 44940 (0.0006) [2023-03-07 08:34:26,749][155452] Updated weights for policy 0, policy_version 44950 (0.0006) [2023-03-07 08:34:27,552][155452] Updated weights for policy 0, policy_version 44960 (0.0006) [2023-03-07 08:34:28,327][155452] Updated weights for policy 0, policy_version 44970 (0.0006) [2023-03-07 08:34:28,367][155126] Fps is (10 sec: 13004.5, 60 sec: 13004.8, 300 sec: 13023.9). Total num frames: 46049280. Throughput: 0: 13027.2. Samples: 46042379. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 08:34:28,368][155126] Avg episode reward: [(0, '1781.409')] [2023-03-07 08:34:29,098][155452] Updated weights for policy 0, policy_version 44980 (0.0006) [2023-03-07 08:34:29,891][155452] Updated weights for policy 0, policy_version 44990 (0.0006) [2023-03-07 08:34:30,647][155452] Updated weights for policy 0, policy_version 45000 (0.0007) [2023-03-07 08:34:31,455][155452] Updated weights for policy 0, policy_version 45010 (0.0006) [2023-03-07 08:34:32,245][155452] Updated weights for policy 0, policy_version 45020 (0.0006) [2023-03-07 08:34:33,042][155452] Updated weights for policy 0, policy_version 45030 (0.0005) [2023-03-07 08:34:33,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13021.8, 300 sec: 13023.9). Total num frames: 46114816. Throughput: 0: 13036.9. Samples: 46081883. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 08:34:33,367][155126] Avg episode reward: [(0, '1791.594')] [2023-03-07 08:34:33,829][155452] Updated weights for policy 0, policy_version 45040 (0.0006) [2023-03-07 08:34:34,612][155452] Updated weights for policy 0, policy_version 45050 (0.0006) [2023-03-07 08:34:35,397][155452] Updated weights for policy 0, policy_version 45060 (0.0006) [2023-03-07 08:34:36,184][155452] Updated weights for policy 0, policy_version 45070 (0.0006) [2023-03-07 08:34:36,971][155452] Updated weights for policy 0, policy_version 45080 (0.0007) [2023-03-07 08:34:37,773][155452] Updated weights for policy 0, policy_version 45090 (0.0007) [2023-03-07 08:34:38,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.8, 300 sec: 13023.9). Total num frames: 46179328. Throughput: 0: 13026.0. Samples: 46159663. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 08:34:38,367][155126] Avg episode reward: [(0, '1847.678')] [2023-03-07 08:34:38,539][155452] Updated weights for policy 0, policy_version 45100 (0.0006) [2023-03-07 08:34:39,330][155452] Updated weights for policy 0, policy_version 45110 (0.0006) [2023-03-07 08:34:40,129][155452] Updated weights for policy 0, policy_version 45120 (0.0007) [2023-03-07 08:34:40,930][155452] Updated weights for policy 0, policy_version 45130 (0.0006) [2023-03-07 08:34:41,694][155452] Updated weights for policy 0, policy_version 45140 (0.0006) [2023-03-07 08:34:42,486][155452] Updated weights for policy 0, policy_version 45150 (0.0006) [2023-03-07 08:34:43,273][155452] Updated weights for policy 0, policy_version 45160 (0.0006) [2023-03-07 08:34:43,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13039.0, 300 sec: 13023.9). Total num frames: 46244864. Throughput: 0: 13014.4. Samples: 46237581. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 08:34:43,367][155126] Avg episode reward: [(0, '1930.031')] [2023-03-07 08:34:44,047][155452] Updated weights for policy 0, policy_version 45170 (0.0005) [2023-03-07 08:34:44,856][155452] Updated weights for policy 0, policy_version 45180 (0.0006) [2023-03-07 08:34:45,629][155452] Updated weights for policy 0, policy_version 45190 (0.0006) [2023-03-07 08:34:46,414][155452] Updated weights for policy 0, policy_version 45200 (0.0006) [2023-03-07 08:34:47,211][155452] Updated weights for policy 0, policy_version 45210 (0.0007) [2023-03-07 08:34:47,999][155452] Updated weights for policy 0, policy_version 45220 (0.0006) [2023-03-07 08:34:48,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13023.9). Total num frames: 46309376. Throughput: 0: 13012.4. Samples: 46276656. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:34:48,367][155126] Avg episode reward: [(0, '2037.079')] [2023-03-07 08:34:48,781][155452] Updated weights for policy 0, policy_version 45230 (0.0006) [2023-03-07 08:34:49,568][155452] Updated weights for policy 0, policy_version 45240 (0.0005) [2023-03-07 08:34:50,344][155452] Updated weights for policy 0, policy_version 45250 (0.0007) [2023-03-07 08:34:51,144][155452] Updated weights for policy 0, policy_version 45260 (0.0006) [2023-03-07 08:34:51,927][155452] Updated weights for policy 0, policy_version 45270 (0.0006) [2023-03-07 08:34:52,704][155452] Updated weights for policy 0, policy_version 45280 (0.0006) [2023-03-07 08:34:53,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.8, 300 sec: 13023.9). Total num frames: 46374912. Throughput: 0: 13019.3. Samples: 46354820. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:34:53,367][155126] Avg episode reward: [(0, '1804.173')] [2023-03-07 08:34:53,507][155452] Updated weights for policy 0, policy_version 45290 (0.0006) [2023-03-07 08:34:54,293][155452] Updated weights for policy 0, policy_version 45300 (0.0006) [2023-03-07 08:34:55,062][155452] Updated weights for policy 0, policy_version 45310 (0.0006) [2023-03-07 08:34:55,851][155452] Updated weights for policy 0, policy_version 45320 (0.0006) [2023-03-07 08:34:56,644][155452] Updated weights for policy 0, policy_version 45330 (0.0006) [2023-03-07 08:34:57,422][155452] Updated weights for policy 0, policy_version 45340 (0.0006) [2023-03-07 08:34:58,195][155452] Updated weights for policy 0, policy_version 45350 (0.0006) [2023-03-07 08:34:58,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13021.9, 300 sec: 13023.9). Total num frames: 46440448. Throughput: 0: 13018.4. Samples: 46432977. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:34:58,367][155126] Avg episode reward: [(0, '1716.519')] [2023-03-07 08:34:58,974][155452] Updated weights for policy 0, policy_version 45360 (0.0005) [2023-03-07 08:34:59,783][155452] Updated weights for policy 0, policy_version 45370 (0.0006) [2023-03-07 08:35:00,573][155452] Updated weights for policy 0, policy_version 45380 (0.0006) [2023-03-07 08:35:01,341][155452] Updated weights for policy 0, policy_version 45390 (0.0006) [2023-03-07 08:35:02,138][155452] Updated weights for policy 0, policy_version 45400 (0.0007) [2023-03-07 08:35:02,915][155452] Updated weights for policy 0, policy_version 45410 (0.0007) [2023-03-07 08:35:03,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.8, 300 sec: 13023.9). Total num frames: 46504960. Throughput: 0: 13022.8. Samples: 46472116. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:35:03,367][155126] Avg episode reward: [(0, '1832.286')] [2023-03-07 08:35:03,691][155452] Updated weights for policy 0, policy_version 45420 (0.0006) [2023-03-07 08:35:04,467][155452] Updated weights for policy 0, policy_version 45430 (0.0006) [2023-03-07 08:35:05,250][155452] Updated weights for policy 0, policy_version 45440 (0.0006) [2023-03-07 08:35:06,034][155452] Updated weights for policy 0, policy_version 45450 (0.0006) [2023-03-07 08:35:06,826][155452] Updated weights for policy 0, policy_version 45460 (0.0006) [2023-03-07 08:35:07,616][155452] Updated weights for policy 0, policy_version 45470 (0.0006) [2023-03-07 08:35:08,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13023.9). Total num frames: 46570496. Throughput: 0: 13034.3. Samples: 46550688. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:35:08,367][155126] Avg episode reward: [(0, '1728.174')] [2023-03-07 08:35:08,405][155452] Updated weights for policy 0, policy_version 45480 (0.0006) [2023-03-07 08:35:09,193][155452] Updated weights for policy 0, policy_version 45490 (0.0006) [2023-03-07 08:35:09,952][155452] Updated weights for policy 0, policy_version 45500 (0.0006) [2023-03-07 08:35:10,761][155452] Updated weights for policy 0, policy_version 45510 (0.0007) [2023-03-07 08:35:11,536][155452] Updated weights for policy 0, policy_version 45520 (0.0005) [2023-03-07 08:35:12,313][155452] Updated weights for policy 0, policy_version 45530 (0.0006) [2023-03-07 08:35:13,115][155452] Updated weights for policy 0, policy_version 45540 (0.0006) [2023-03-07 08:35:13,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13021.8, 300 sec: 13023.9). Total num frames: 46636032. Throughput: 0: 13032.4. Samples: 46628838. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:35:13,368][155126] Avg episode reward: [(0, '1744.128')] [2023-03-07 08:35:13,903][155452] Updated weights for policy 0, policy_version 45550 (0.0008) [2023-03-07 08:35:14,694][155452] Updated weights for policy 0, policy_version 45560 (0.0006) [2023-03-07 08:35:15,483][155452] Updated weights for policy 0, policy_version 45570 (0.0006) [2023-03-07 08:35:16,263][155452] Updated weights for policy 0, policy_version 45580 (0.0006) [2023-03-07 08:35:17,048][155452] Updated weights for policy 0, policy_version 45590 (0.0006) [2023-03-07 08:35:17,818][155452] Updated weights for policy 0, policy_version 45600 (0.0006) [2023-03-07 08:35:18,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13021.8, 300 sec: 13023.9). Total num frames: 46700544. Throughput: 0: 13018.7. Samples: 46667727. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:35:18,367][155126] Avg episode reward: [(0, '1765.905')] [2023-03-07 08:35:18,618][155452] Updated weights for policy 0, policy_version 45610 (0.0006) [2023-03-07 08:35:19,398][155452] Updated weights for policy 0, policy_version 45620 (0.0006) [2023-03-07 08:35:20,190][155452] Updated weights for policy 0, policy_version 45630 (0.0006) [2023-03-07 08:35:20,980][155452] Updated weights for policy 0, policy_version 45640 (0.0006) [2023-03-07 08:35:21,763][155452] Updated weights for policy 0, policy_version 45650 (0.0007) [2023-03-07 08:35:22,538][155452] Updated weights for policy 0, policy_version 45660 (0.0006) [2023-03-07 08:35:23,340][155452] Updated weights for policy 0, policy_version 45670 (0.0006) [2023-03-07 08:35:23,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13023.9). Total num frames: 46766080. Throughput: 0: 13030.3. Samples: 46746027. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:35:23,367][155126] Avg episode reward: [(0, '1709.557')] [2023-03-07 08:35:24,136][155452] Updated weights for policy 0, policy_version 45680 (0.0006) [2023-03-07 08:35:24,931][155452] Updated weights for policy 0, policy_version 45690 (0.0006) [2023-03-07 08:35:25,713][155452] Updated weights for policy 0, policy_version 45700 (0.0005) [2023-03-07 08:35:26,518][155452] Updated weights for policy 0, policy_version 45710 (0.0006) [2023-03-07 08:35:27,292][155452] Updated weights for policy 0, policy_version 45720 (0.0006) [2023-03-07 08:35:28,058][155452] Updated weights for policy 0, policy_version 45730 (0.0006) [2023-03-07 08:35:28,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13023.9). Total num frames: 46830592. Throughput: 0: 13030.6. Samples: 46823958. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:35:28,367][155126] Avg episode reward: [(0, '1655.206')] [2023-03-07 08:35:28,371][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000045734_46831616.pth... [2023-03-07 08:35:28,402][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000042682_43706368.pth [2023-03-07 08:35:28,850][155452] Updated weights for policy 0, policy_version 45740 (0.0006) [2023-03-07 08:35:29,649][155452] Updated weights for policy 0, policy_version 45750 (0.0006) [2023-03-07 08:35:30,438][155452] Updated weights for policy 0, policy_version 45760 (0.0006) [2023-03-07 08:35:31,249][155452] Updated weights for policy 0, policy_version 45770 (0.0006) [2023-03-07 08:35:32,038][155452] Updated weights for policy 0, policy_version 45780 (0.0007) [2023-03-07 08:35:32,812][155452] Updated weights for policy 0, policy_version 45790 (0.0007) [2023-03-07 08:35:33,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13023.9). Total num frames: 46896128. Throughput: 0: 13021.1. Samples: 46862605. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:35:33,367][155126] Avg episode reward: [(0, '1674.499')] [2023-03-07 08:35:33,599][155452] Updated weights for policy 0, policy_version 45800 (0.0005) [2023-03-07 08:35:34,390][155452] Updated weights for policy 0, policy_version 45810 (0.0007) [2023-03-07 08:35:35,184][155452] Updated weights for policy 0, policy_version 45820 (0.0006) [2023-03-07 08:35:35,976][155452] Updated weights for policy 0, policy_version 45830 (0.0007) [2023-03-07 08:35:36,744][155452] Updated weights for policy 0, policy_version 45840 (0.0006) [2023-03-07 08:35:37,554][155452] Updated weights for policy 0, policy_version 45850 (0.0006) [2023-03-07 08:35:38,338][155452] Updated weights for policy 0, policy_version 45860 (0.0007) [2023-03-07 08:35:38,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 46960640. Throughput: 0: 13021.9. Samples: 46940804. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:35:38,367][155126] Avg episode reward: [(0, '1756.547')] [2023-03-07 08:35:39,122][155452] Updated weights for policy 0, policy_version 45870 (0.0006) [2023-03-07 08:35:39,913][155452] Updated weights for policy 0, policy_version 45880 (0.0007) [2023-03-07 08:35:40,706][155452] Updated weights for policy 0, policy_version 45890 (0.0006) [2023-03-07 08:35:41,475][155452] Updated weights for policy 0, policy_version 45900 (0.0006) [2023-03-07 08:35:42,276][155452] Updated weights for policy 0, policy_version 45910 (0.0006) [2023-03-07 08:35:43,074][155452] Updated weights for policy 0, policy_version 45920 (0.0005) [2023-03-07 08:35:43,367][155126] Fps is (10 sec: 12902.4, 60 sec: 13004.8, 300 sec: 13017.0). Total num frames: 47025152. Throughput: 0: 13008.9. Samples: 47018379. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:35:43,367][155126] Avg episode reward: [(0, '1872.843')] [2023-03-07 08:35:43,871][155452] Updated weights for policy 0, policy_version 45930 (0.0006) [2023-03-07 08:35:44,666][155452] Updated weights for policy 0, policy_version 45940 (0.0006) [2023-03-07 08:35:45,441][155452] Updated weights for policy 0, policy_version 45950 (0.0006) [2023-03-07 08:35:46,215][155452] Updated weights for policy 0, policy_version 45960 (0.0006) [2023-03-07 08:35:47,007][155452] Updated weights for policy 0, policy_version 45970 (0.0006) [2023-03-07 08:35:47,778][155452] Updated weights for policy 0, policy_version 45980 (0.0006) [2023-03-07 08:35:48,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 47090688. Throughput: 0: 13006.4. Samples: 47057405. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:35:48,368][155126] Avg episode reward: [(0, '1943.808')] [2023-03-07 08:35:48,585][155452] Updated weights for policy 0, policy_version 45990 (0.0007) [2023-03-07 08:35:49,366][155452] Updated weights for policy 0, policy_version 46000 (0.0006) [2023-03-07 08:35:50,157][155452] Updated weights for policy 0, policy_version 46010 (0.0006) [2023-03-07 08:35:50,927][155452] Updated weights for policy 0, policy_version 46020 (0.0006) [2023-03-07 08:35:51,734][155452] Updated weights for policy 0, policy_version 46030 (0.0007) [2023-03-07 08:35:52,499][155452] Updated weights for policy 0, policy_version 46040 (0.0006) [2023-03-07 08:35:53,267][155452] Updated weights for policy 0, policy_version 46050 (0.0005) [2023-03-07 08:35:53,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 47156224. Throughput: 0: 13003.4. Samples: 47135838. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:35:53,367][155126] Avg episode reward: [(0, '1664.306')] [2023-03-07 08:35:54,065][155452] Updated weights for policy 0, policy_version 46060 (0.0006) [2023-03-07 08:35:54,852][155452] Updated weights for policy 0, policy_version 46070 (0.0006) [2023-03-07 08:35:55,639][155452] Updated weights for policy 0, policy_version 46080 (0.0006) [2023-03-07 08:35:56,437][155452] Updated weights for policy 0, policy_version 46090 (0.0006) [2023-03-07 08:35:57,240][155452] Updated weights for policy 0, policy_version 46100 (0.0006) [2023-03-07 08:35:58,022][155452] Updated weights for policy 0, policy_version 46110 (0.0006) [2023-03-07 08:35:58,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13004.8, 300 sec: 13016.9). Total num frames: 47220736. Throughput: 0: 12999.1. Samples: 47213798. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:35:58,367][155126] Avg episode reward: [(0, '1660.440')] [2023-03-07 08:35:58,805][155452] Updated weights for policy 0, policy_version 46120 (0.0006) [2023-03-07 08:35:59,587][155452] Updated weights for policy 0, policy_version 46130 (0.0006) [2023-03-07 08:36:00,397][155452] Updated weights for policy 0, policy_version 46140 (0.0006) [2023-03-07 08:36:01,168][155452] Updated weights for policy 0, policy_version 46150 (0.0006) [2023-03-07 08:36:01,953][155452] Updated weights for policy 0, policy_version 46160 (0.0006) [2023-03-07 08:36:02,735][155452] Updated weights for policy 0, policy_version 46170 (0.0006) [2023-03-07 08:36:03,367][155126] Fps is (10 sec: 12902.3, 60 sec: 13004.8, 300 sec: 13017.0). Total num frames: 47285248. Throughput: 0: 13002.4. Samples: 47252834. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:36:03,367][155126] Avg episode reward: [(0, '1654.803')] [2023-03-07 08:36:03,539][155452] Updated weights for policy 0, policy_version 46180 (0.0007) [2023-03-07 08:36:04,317][155452] Updated weights for policy 0, policy_version 46190 (0.0006) [2023-03-07 08:36:05,098][155452] Updated weights for policy 0, policy_version 46200 (0.0006) [2023-03-07 08:36:05,870][155452] Updated weights for policy 0, policy_version 46210 (0.0006) [2023-03-07 08:36:06,646][155452] Updated weights for policy 0, policy_version 46220 (0.0006) [2023-03-07 08:36:07,420][155452] Updated weights for policy 0, policy_version 46230 (0.0007) [2023-03-07 08:36:08,216][155452] Updated weights for policy 0, policy_version 46240 (0.0006) [2023-03-07 08:36:08,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13021.9, 300 sec: 13023.9). Total num frames: 47351808. Throughput: 0: 13002.6. Samples: 47331141. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:36:08,367][155126] Avg episode reward: [(0, '1802.506')] [2023-03-07 08:36:09,005][155452] Updated weights for policy 0, policy_version 46250 (0.0006) [2023-03-07 08:36:09,792][155452] Updated weights for policy 0, policy_version 46260 (0.0007) [2023-03-07 08:36:10,570][155452] Updated weights for policy 0, policy_version 46270 (0.0007) [2023-03-07 08:36:11,365][155452] Updated weights for policy 0, policy_version 46280 (0.0006) [2023-03-07 08:36:12,153][155452] Updated weights for policy 0, policy_version 46290 (0.0006) [2023-03-07 08:36:12,927][155452] Updated weights for policy 0, policy_version 46300 (0.0006) [2023-03-07 08:36:13,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13004.8, 300 sec: 13020.4). Total num frames: 47416320. Throughput: 0: 13011.9. Samples: 47409492. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:36:13,367][155126] Avg episode reward: [(0, '1691.688')] [2023-03-07 08:36:13,714][155452] Updated weights for policy 0, policy_version 46310 (0.0006) [2023-03-07 08:36:14,504][155452] Updated weights for policy 0, policy_version 46320 (0.0006) [2023-03-07 08:36:15,280][155452] Updated weights for policy 0, policy_version 46330 (0.0006) [2023-03-07 08:36:16,069][155452] Updated weights for policy 0, policy_version 46340 (0.0006) [2023-03-07 08:36:16,840][155452] Updated weights for policy 0, policy_version 46350 (0.0007) [2023-03-07 08:36:17,638][155452] Updated weights for policy 0, policy_version 46360 (0.0006) [2023-03-07 08:36:18,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 47481856. Throughput: 0: 13018.5. Samples: 47448440. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:36:18,367][155126] Avg episode reward: [(0, '1826.359')] [2023-03-07 08:36:18,425][155452] Updated weights for policy 0, policy_version 46370 (0.0006) [2023-03-07 08:36:19,198][155452] Updated weights for policy 0, policy_version 46380 (0.0006) [2023-03-07 08:36:19,989][155452] Updated weights for policy 0, policy_version 46390 (0.0006) [2023-03-07 08:36:20,759][155452] Updated weights for policy 0, policy_version 46400 (0.0005) [2023-03-07 08:36:21,550][155452] Updated weights for policy 0, policy_version 46410 (0.0006) [2023-03-07 08:36:22,322][155452] Updated weights for policy 0, policy_version 46420 (0.0006) [2023-03-07 08:36:23,109][155452] Updated weights for policy 0, policy_version 46430 (0.0006) [2023-03-07 08:36:23,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 47547392. Throughput: 0: 13030.1. Samples: 47527160. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:36:23,367][155126] Avg episode reward: [(0, '1714.623')] [2023-03-07 08:36:23,904][155452] Updated weights for policy 0, policy_version 46440 (0.0007) [2023-03-07 08:36:24,697][155452] Updated weights for policy 0, policy_version 46450 (0.0006) [2023-03-07 08:36:25,480][155452] Updated weights for policy 0, policy_version 46460 (0.0006) [2023-03-07 08:36:26,258][155452] Updated weights for policy 0, policy_version 46470 (0.0006) [2023-03-07 08:36:27,066][155452] Updated weights for policy 0, policy_version 46480 (0.0007) [2023-03-07 08:36:27,829][155452] Updated weights for policy 0, policy_version 46490 (0.0007) [2023-03-07 08:36:28,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 47611904. Throughput: 0: 13043.9. Samples: 47605354. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:36:28,367][155126] Avg episode reward: [(0, '1823.500')] [2023-03-07 08:36:28,615][155452] Updated weights for policy 0, policy_version 46500 (0.0007) [2023-03-07 08:36:29,401][155452] Updated weights for policy 0, policy_version 46510 (0.0006) [2023-03-07 08:36:30,183][155452] Updated weights for policy 0, policy_version 46520 (0.0006) [2023-03-07 08:36:30,977][155452] Updated weights for policy 0, policy_version 46530 (0.0006) [2023-03-07 08:36:31,742][155452] Updated weights for policy 0, policy_version 46540 (0.0006) [2023-03-07 08:36:32,534][155452] Updated weights for policy 0, policy_version 46550 (0.0007) [2023-03-07 08:36:33,321][155452] Updated weights for policy 0, policy_version 46560 (0.0006) [2023-03-07 08:36:33,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 47677440. Throughput: 0: 13048.0. Samples: 47644565. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:36:33,367][155126] Avg episode reward: [(0, '1811.671')] [2023-03-07 08:36:34,102][155452] Updated weights for policy 0, policy_version 46570 (0.0006) [2023-03-07 08:36:34,885][155452] Updated weights for policy 0, policy_version 46580 (0.0007) [2023-03-07 08:36:35,662][155452] Updated weights for policy 0, policy_version 46590 (0.0006) [2023-03-07 08:36:36,439][155452] Updated weights for policy 0, policy_version 46600 (0.0006) [2023-03-07 08:36:37,258][155452] Updated weights for policy 0, policy_version 46610 (0.0006) [2023-03-07 08:36:38,038][155452] Updated weights for policy 0, policy_version 46620 (0.0007) [2023-03-07 08:36:38,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13023.9). Total num frames: 47742976. Throughput: 0: 13045.2. Samples: 47722873. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:36:38,367][155126] Avg episode reward: [(0, '1884.320')] [2023-03-07 08:36:38,826][155452] Updated weights for policy 0, policy_version 46630 (0.0006) [2023-03-07 08:36:39,619][155452] Updated weights for policy 0, policy_version 46640 (0.0007) [2023-03-07 08:36:40,395][155452] Updated weights for policy 0, policy_version 46650 (0.0006) [2023-03-07 08:36:41,173][155452] Updated weights for policy 0, policy_version 46660 (0.0006) [2023-03-07 08:36:41,959][155452] Updated weights for policy 0, policy_version 46670 (0.0006) [2023-03-07 08:36:42,737][155452] Updated weights for policy 0, policy_version 46680 (0.0006) [2023-03-07 08:36:43,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13020.4). Total num frames: 47807488. Throughput: 0: 13049.8. Samples: 47801039. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:36:43,367][155126] Avg episode reward: [(0, '1889.461')] [2023-03-07 08:36:43,529][155452] Updated weights for policy 0, policy_version 46690 (0.0007) [2023-03-07 08:36:44,311][155452] Updated weights for policy 0, policy_version 46700 (0.0006) [2023-03-07 08:36:45,083][155452] Updated weights for policy 0, policy_version 46710 (0.0006) [2023-03-07 08:36:45,889][155452] Updated weights for policy 0, policy_version 46720 (0.0006) [2023-03-07 08:36:46,659][155452] Updated weights for policy 0, policy_version 46730 (0.0006) [2023-03-07 08:36:47,446][155452] Updated weights for policy 0, policy_version 46740 (0.0005) [2023-03-07 08:36:48,229][155452] Updated weights for policy 0, policy_version 46750 (0.0006) [2023-03-07 08:36:48,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13023.9). Total num frames: 47873024. Throughput: 0: 13051.3. Samples: 47840144. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:36:48,367][155126] Avg episode reward: [(0, '2069.233')] [2023-03-07 08:36:49,006][155452] Updated weights for policy 0, policy_version 46760 (0.0006) [2023-03-07 08:36:49,788][155452] Updated weights for policy 0, policy_version 46770 (0.0007) [2023-03-07 08:36:50,594][155452] Updated weights for policy 0, policy_version 46780 (0.0006) [2023-03-07 08:36:51,368][155452] Updated weights for policy 0, policy_version 46790 (0.0007) [2023-03-07 08:36:52,161][155452] Updated weights for policy 0, policy_version 46800 (0.0006) [2023-03-07 08:36:52,959][155452] Updated weights for policy 0, policy_version 46810 (0.0006) [2023-03-07 08:36:53,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13021.8, 300 sec: 13023.9). Total num frames: 47937536. Throughput: 0: 13049.7. Samples: 47918379. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:36:53,368][155126] Avg episode reward: [(0, '1860.549')] [2023-03-07 08:36:53,743][155452] Updated weights for policy 0, policy_version 46820 (0.0006) [2023-03-07 08:36:54,528][155452] Updated weights for policy 0, policy_version 46830 (0.0007) [2023-03-07 08:36:55,324][155452] Updated weights for policy 0, policy_version 46840 (0.0006) [2023-03-07 08:36:56,105][155452] Updated weights for policy 0, policy_version 46850 (0.0007) [2023-03-07 08:36:56,883][155452] Updated weights for policy 0, policy_version 46860 (0.0006) [2023-03-07 08:36:57,662][155452] Updated weights for policy 0, policy_version 46870 (0.0006) [2023-03-07 08:36:58,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13023.9). Total num frames: 48003072. Throughput: 0: 13044.9. Samples: 47996513. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:36:58,368][155126] Avg episode reward: [(0, '2010.979')] [2023-03-07 08:36:58,451][155452] Updated weights for policy 0, policy_version 46880 (0.0006) [2023-03-07 08:36:59,241][155452] Updated weights for policy 0, policy_version 46890 (0.0006) [2023-03-07 08:37:00,013][155452] Updated weights for policy 0, policy_version 46900 (0.0006) [2023-03-07 08:37:00,805][155452] Updated weights for policy 0, policy_version 46910 (0.0006) [2023-03-07 08:37:01,560][155452] Updated weights for policy 0, policy_version 46920 (0.0007) [2023-03-07 08:37:02,366][155452] Updated weights for policy 0, policy_version 46930 (0.0006) [2023-03-07 08:37:03,146][155452] Updated weights for policy 0, policy_version 46940 (0.0006) [2023-03-07 08:37:03,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13027.4). Total num frames: 48068608. Throughput: 0: 13053.0. Samples: 48035823. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:37:03,367][155126] Avg episode reward: [(0, '1966.437')] [2023-03-07 08:37:03,928][155452] Updated weights for policy 0, policy_version 46950 (0.0006) [2023-03-07 08:37:04,707][155452] Updated weights for policy 0, policy_version 46960 (0.0006) [2023-03-07 08:37:05,485][155452] Updated weights for policy 0, policy_version 46970 (0.0006) [2023-03-07 08:37:06,276][155452] Updated weights for policy 0, policy_version 46980 (0.0006) [2023-03-07 08:37:07,059][155452] Updated weights for policy 0, policy_version 46990 (0.0006) [2023-03-07 08:37:07,854][155452] Updated weights for policy 0, policy_version 47000 (0.0006) [2023-03-07 08:37:08,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13027.4). Total num frames: 48134144. Throughput: 0: 13045.0. Samples: 48114185. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:37:08,367][155126] Avg episode reward: [(0, '1796.092')] [2023-03-07 08:37:08,648][155452] Updated weights for policy 0, policy_version 47010 (0.0007) [2023-03-07 08:37:09,419][155452] Updated weights for policy 0, policy_version 47020 (0.0007) [2023-03-07 08:37:10,205][155452] Updated weights for policy 0, policy_version 47030 (0.0006) [2023-03-07 08:37:10,984][155452] Updated weights for policy 0, policy_version 47040 (0.0007) [2023-03-07 08:37:11,772][155452] Updated weights for policy 0, policy_version 47050 (0.0006) [2023-03-07 08:37:12,563][155452] Updated weights for policy 0, policy_version 47060 (0.0006) [2023-03-07 08:37:13,347][155452] Updated weights for policy 0, policy_version 47070 (0.0006) [2023-03-07 08:37:13,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13030.8). Total num frames: 48199680. Throughput: 0: 13049.0. Samples: 48192560. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:37:13,367][155126] Avg episode reward: [(0, '1803.769')] [2023-03-07 08:37:14,125][155452] Updated weights for policy 0, policy_version 47080 (0.0006) [2023-03-07 08:37:14,908][155452] Updated weights for policy 0, policy_version 47090 (0.0006) [2023-03-07 08:37:15,686][155452] Updated weights for policy 0, policy_version 47100 (0.0006) [2023-03-07 08:37:16,467][155452] Updated weights for policy 0, policy_version 47110 (0.0006) [2023-03-07 08:37:17,252][155452] Updated weights for policy 0, policy_version 47120 (0.0007) [2023-03-07 08:37:18,029][155452] Updated weights for policy 0, policy_version 47130 (0.0006) [2023-03-07 08:37:18,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13030.8). Total num frames: 48265216. Throughput: 0: 13048.6. Samples: 48231753. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:37:18,368][155126] Avg episode reward: [(0, '1673.581')] [2023-03-07 08:37:18,803][155452] Updated weights for policy 0, policy_version 47140 (0.0006) [2023-03-07 08:37:19,597][155452] Updated weights for policy 0, policy_version 47150 (0.0006) [2023-03-07 08:37:20,389][155452] Updated weights for policy 0, policy_version 47160 (0.0006) [2023-03-07 08:37:21,173][155452] Updated weights for policy 0, policy_version 47170 (0.0006) [2023-03-07 08:37:21,953][155452] Updated weights for policy 0, policy_version 47180 (0.0006) [2023-03-07 08:37:22,730][155452] Updated weights for policy 0, policy_version 47190 (0.0007) [2023-03-07 08:37:23,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13030.8). Total num frames: 48330752. Throughput: 0: 13052.7. Samples: 48310245. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:37:23,367][155126] Avg episode reward: [(0, '1810.032')] [2023-03-07 08:37:23,533][155452] Updated weights for policy 0, policy_version 47200 (0.0006) [2023-03-07 08:37:24,317][155452] Updated weights for policy 0, policy_version 47210 (0.0006) [2023-03-07 08:37:25,106][155452] Updated weights for policy 0, policy_version 47220 (0.0007) [2023-03-07 08:37:25,895][155452] Updated weights for policy 0, policy_version 47230 (0.0006) [2023-03-07 08:37:26,685][155452] Updated weights for policy 0, policy_version 47240 (0.0006) [2023-03-07 08:37:27,463][155452] Updated weights for policy 0, policy_version 47250 (0.0007) [2023-03-07 08:37:28,242][155452] Updated weights for policy 0, policy_version 47260 (0.0007) [2023-03-07 08:37:28,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13027.4). Total num frames: 48395264. Throughput: 0: 13051.5. Samples: 48388356. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:37:28,368][155126] Avg episode reward: [(0, '1941.906')] [2023-03-07 08:37:28,373][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000047261_48395264.pth... 
[2023-03-07 08:37:28,405][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000044208_45268992.pth [2023-03-07 08:37:29,021][155452] Updated weights for policy 0, policy_version 47270 (0.0006) [2023-03-07 08:37:29,818][155452] Updated weights for policy 0, policy_version 47280 (0.0006) [2023-03-07 08:37:30,587][155452] Updated weights for policy 0, policy_version 47290 (0.0006) [2023-03-07 08:37:31,390][155452] Updated weights for policy 0, policy_version 47300 (0.0007) [2023-03-07 08:37:32,177][155452] Updated weights for policy 0, policy_version 47310 (0.0006) [2023-03-07 08:37:32,973][155452] Updated weights for policy 0, policy_version 47320 (0.0006) [2023-03-07 08:37:33,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13027.4). Total num frames: 48460800. Throughput: 0: 13054.4. Samples: 48427588. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:37:33,367][155126] Avg episode reward: [(0, '1796.353')] [2023-03-07 08:37:33,750][155452] Updated weights for policy 0, policy_version 47330 (0.0006) [2023-03-07 08:37:34,534][155452] Updated weights for policy 0, policy_version 47340 (0.0006) [2023-03-07 08:37:35,323][155452] Updated weights for policy 0, policy_version 47350 (0.0006) [2023-03-07 08:37:36,103][155452] Updated weights for policy 0, policy_version 47360 (0.0006) [2023-03-07 08:37:36,888][155452] Updated weights for policy 0, policy_version 47370 (0.0006) [2023-03-07 08:37:37,669][155452] Updated weights for policy 0, policy_version 47380 (0.0007) [2023-03-07 08:37:38,367][155126] Fps is (10 sec: 13107.4, 60 sec: 13056.0, 300 sec: 13030.8). Total num frames: 48526336. Throughput: 0: 13055.1. Samples: 48505857. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:37:38,369][155126] Avg episode reward: [(0, '1800.568')] [2023-03-07 08:37:38,460][155452] Updated weights for policy 0, policy_version 47390 (0.0006) [2023-03-07 08:37:39,243][155452] Updated weights for policy 0, policy_version 47400 (0.0007) [2023-03-07 08:37:40,027][155452] Updated weights for policy 0, policy_version 47410 (0.0007) [2023-03-07 08:37:40,806][155452] Updated weights for policy 0, policy_version 47420 (0.0006) [2023-03-07 08:37:41,586][155452] Updated weights for policy 0, policy_version 47430 (0.0007) [2023-03-07 08:37:42,358][155452] Updated weights for policy 0, policy_version 47440 (0.0006) [2023-03-07 08:37:43,149][155452] Updated weights for policy 0, policy_version 47450 (0.0006) [2023-03-07 08:37:43,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13027.4). Total num frames: 48590848. Throughput: 0: 13059.2. Samples: 48584177. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:37:43,378][155126] Avg episode reward: [(0, '1828.043')] [2023-03-07 08:37:43,940][155452] Updated weights for policy 0, policy_version 47460 (0.0006) [2023-03-07 08:37:44,717][155452] Updated weights for policy 0, policy_version 47470 (0.0005) [2023-03-07 08:37:45,514][155452] Updated weights for policy 0, policy_version 47480 (0.0006) [2023-03-07 08:37:46,293][155452] Updated weights for policy 0, policy_version 47490 (0.0006) [2023-03-07 08:37:47,077][155452] Updated weights for policy 0, policy_version 47500 (0.0007) [2023-03-07 08:37:47,868][155452] Updated weights for policy 0, policy_version 47510 (0.0006) [2023-03-07 08:37:48,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13027.4). Total num frames: 48656384. Throughput: 0: 13055.2. Samples: 48623306. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:37:48,367][155126] Avg episode reward: [(0, '1844.707')] [2023-03-07 08:37:48,642][155452] Updated weights for policy 0, policy_version 47520 (0.0006) [2023-03-07 08:37:49,441][155452] Updated weights for policy 0, policy_version 47530 (0.0006) [2023-03-07 08:37:50,221][155452] Updated weights for policy 0, policy_version 47540 (0.0006) [2023-03-07 08:37:51,015][155452] Updated weights for policy 0, policy_version 47550 (0.0006) [2023-03-07 08:37:51,806][155452] Updated weights for policy 0, policy_version 47560 (0.0006) [2023-03-07 08:37:52,574][155452] Updated weights for policy 0, policy_version 47570 (0.0006) [2023-03-07 08:37:53,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13023.9). Total num frames: 48720896. Throughput: 0: 13047.4. Samples: 48701320. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:37:53,367][155126] Avg episode reward: [(0, '1815.504')] [2023-03-07 08:37:53,376][155452] Updated weights for policy 0, policy_version 47580 (0.0007) [2023-03-07 08:37:54,171][155452] Updated weights for policy 0, policy_version 47590 (0.0006) [2023-03-07 08:37:54,954][155452] Updated weights for policy 0, policy_version 47600 (0.0007) [2023-03-07 08:37:55,743][155452] Updated weights for policy 0, policy_version 47610 (0.0006) [2023-03-07 08:37:56,523][155452] Updated weights for policy 0, policy_version 47620 (0.0006) [2023-03-07 08:37:57,312][155452] Updated weights for policy 0, policy_version 47630 (0.0006) [2023-03-07 08:37:58,090][155452] Updated weights for policy 0, policy_version 47640 (0.0006) [2023-03-07 08:37:58,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13027.4). Total num frames: 48786432. Throughput: 0: 13041.3. Samples: 48779419. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:37:58,367][155126] Avg episode reward: [(0, '2054.159')] [2023-03-07 08:37:58,888][155452] Updated weights for policy 0, policy_version 47650 (0.0006) [2023-03-07 08:37:59,672][155452] Updated weights for policy 0, policy_version 47660 (0.0007) [2023-03-07 08:38:00,451][155452] Updated weights for policy 0, policy_version 47670 (0.0006) [2023-03-07 08:38:01,229][155452] Updated weights for policy 0, policy_version 47680 (0.0006) [2023-03-07 08:38:02,011][155452] Updated weights for policy 0, policy_version 47690 (0.0006) [2023-03-07 08:38:02,799][155452] Updated weights for policy 0, policy_version 47700 (0.0006) [2023-03-07 08:38:03,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13030.8). Total num frames: 48851968. Throughput: 0: 13038.6. Samples: 48818489. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:38:03,367][155126] Avg episode reward: [(0, '2091.003')] [2023-03-07 08:38:03,600][155452] Updated weights for policy 0, policy_version 47710 (0.0006) [2023-03-07 08:38:04,398][155452] Updated weights for policy 0, policy_version 47720 (0.0006) [2023-03-07 08:38:05,173][155452] Updated weights for policy 0, policy_version 47730 (0.0007) [2023-03-07 08:38:05,960][155452] Updated weights for policy 0, policy_version 47740 (0.0006) [2023-03-07 08:38:06,758][155452] Updated weights for policy 0, policy_version 47750 (0.0006) [2023-03-07 08:38:07,531][155452] Updated weights for policy 0, policy_version 47760 (0.0006) [2023-03-07 08:38:08,305][155452] Updated weights for policy 0, policy_version 47770 (0.0006) [2023-03-07 08:38:08,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13027.4). Total num frames: 48916480. Throughput: 0: 13030.7. Samples: 48896628. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:38:08,367][155126] Avg episode reward: [(0, '1866.017')] [2023-03-07 08:38:09,098][155452] Updated weights for policy 0, policy_version 47780 (0.0006) [2023-03-07 08:38:09,877][155452] Updated weights for policy 0, policy_version 47790 (0.0006) [2023-03-07 08:38:10,654][155452] Updated weights for policy 0, policy_version 47800 (0.0006) [2023-03-07 08:38:11,434][155452] Updated weights for policy 0, policy_version 47810 (0.0007) [2023-03-07 08:38:12,211][155452] Updated weights for policy 0, policy_version 47820 (0.0007) [2023-03-07 08:38:12,992][155452] Updated weights for policy 0, policy_version 47830 (0.0006) [2023-03-07 08:38:13,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13027.4). Total num frames: 48982016. Throughput: 0: 13040.4. Samples: 48975174. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:38:13,367][155126] Avg episode reward: [(0, '1922.478')] [2023-03-07 08:38:13,793][155452] Updated weights for policy 0, policy_version 47840 (0.0007) [2023-03-07 08:38:14,577][155452] Updated weights for policy 0, policy_version 47850 (0.0007) [2023-03-07 08:38:15,369][155452] Updated weights for policy 0, policy_version 47860 (0.0007) [2023-03-07 08:38:16,163][155452] Updated weights for policy 0, policy_version 47870 (0.0006) [2023-03-07 08:38:16,944][155452] Updated weights for policy 0, policy_version 47880 (0.0006) [2023-03-07 08:38:17,725][155452] Updated weights for policy 0, policy_version 47890 (0.0006) [2023-03-07 08:38:18,367][155126] Fps is (10 sec: 13107.0, 60 sec: 13038.9, 300 sec: 13030.8). Total num frames: 49047552. Throughput: 0: 13033.5. Samples: 49014099. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:38:18,368][155126] Avg episode reward: [(0, '1918.271')] [2023-03-07 08:38:18,511][155452] Updated weights for policy 0, policy_version 47900 (0.0005) [2023-03-07 08:38:19,298][155452] Updated weights for policy 0, policy_version 47910 (0.0007) [2023-03-07 08:38:20,081][155452] Updated weights for policy 0, policy_version 47920 (0.0007) [2023-03-07 08:38:20,874][155452] Updated weights for policy 0, policy_version 47930 (0.0006) [2023-03-07 08:38:21,674][155452] Updated weights for policy 0, policy_version 47940 (0.0006) [2023-03-07 08:38:22,449][155452] Updated weights for policy 0, policy_version 47950 (0.0007) [2023-03-07 08:38:23,257][155452] Updated weights for policy 0, policy_version 47960 (0.0005) [2023-03-07 08:38:23,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.8, 300 sec: 13027.4). Total num frames: 49112064. Throughput: 0: 13026.0. Samples: 49092030. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:38:23,368][155126] Avg episode reward: [(0, '1981.552')] [2023-03-07 08:38:24,038][155452] Updated weights for policy 0, policy_version 47970 (0.0006) [2023-03-07 08:38:24,826][155452] Updated weights for policy 0, policy_version 47980 (0.0006) [2023-03-07 08:38:25,614][155452] Updated weights for policy 0, policy_version 47990 (0.0006) [2023-03-07 08:38:26,410][155452] Updated weights for policy 0, policy_version 48000 (0.0006) [2023-03-07 08:38:27,210][155452] Updated weights for policy 0, policy_version 48010 (0.0006) [2023-03-07 08:38:28,006][155452] Updated weights for policy 0, policy_version 48020 (0.0006) [2023-03-07 08:38:28,367][155126] Fps is (10 sec: 12902.5, 60 sec: 13021.9, 300 sec: 13027.4). Total num frames: 49176576. Throughput: 0: 13011.0. Samples: 49169673. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:38:28,367][155126] Avg episode reward: [(0, '1990.716')] [2023-03-07 08:38:28,784][155452] Updated weights for policy 0, policy_version 48030 (0.0006) [2023-03-07 08:38:29,583][155452] Updated weights for policy 0, policy_version 48040 (0.0006) [2023-03-07 08:38:30,350][155452] Updated weights for policy 0, policy_version 48050 (0.0007) [2023-03-07 08:38:31,134][155452] Updated weights for policy 0, policy_version 48060 (0.0007) [2023-03-07 08:38:31,930][155452] Updated weights for policy 0, policy_version 48070 (0.0006) [2023-03-07 08:38:32,713][155452] Updated weights for policy 0, policy_version 48080 (0.0007) [2023-03-07 08:38:33,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.8, 300 sec: 13030.8). Total num frames: 49242112. Throughput: 0: 13015.7. Samples: 49209015. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:38:33,368][155126] Avg episode reward: [(0, '2038.113')] [2023-03-07 08:38:33,495][155452] Updated weights for policy 0, policy_version 48090 (0.0006) [2023-03-07 08:38:34,290][155452] Updated weights for policy 0, policy_version 48100 (0.0006) [2023-03-07 08:38:35,076][155452] Updated weights for policy 0, policy_version 48110 (0.0007) [2023-03-07 08:38:35,859][155452] Updated weights for policy 0, policy_version 48120 (0.0007) [2023-03-07 08:38:36,654][155452] Updated weights for policy 0, policy_version 48130 (0.0007) [2023-03-07 08:38:37,431][155452] Updated weights for policy 0, policy_version 48140 (0.0008) [2023-03-07 08:38:38,232][155452] Updated weights for policy 0, policy_version 48150 (0.0006) [2023-03-07 08:38:38,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13004.8, 300 sec: 13030.8). Total num frames: 49306624. Throughput: 0: 13017.0. Samples: 49287085. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:38:38,368][155126] Avg episode reward: [(0, '1906.008')] [2023-03-07 08:38:39,021][155452] Updated weights for policy 0, policy_version 48160 (0.0006) [2023-03-07 08:38:39,797][155452] Updated weights for policy 0, policy_version 48170 (0.0007) [2023-03-07 08:38:40,597][155452] Updated weights for policy 0, policy_version 48180 (0.0006) [2023-03-07 08:38:41,372][155452] Updated weights for policy 0, policy_version 48190 (0.0006) [2023-03-07 08:38:42,154][155452] Updated weights for policy 0, policy_version 48200 (0.0006) [2023-03-07 08:38:42,953][155452] Updated weights for policy 0, policy_version 48210 (0.0005) [2023-03-07 08:38:43,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13030.8). Total num frames: 49372160. Throughput: 0: 13014.7. Samples: 49365081. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:38:43,367][155126] Avg episode reward: [(0, '1942.949')] [2023-03-07 08:38:43,723][155452] Updated weights for policy 0, policy_version 48220 (0.0007) [2023-03-07 08:38:44,527][155452] Updated weights for policy 0, policy_version 48230 (0.0006) [2023-03-07 08:38:45,316][155452] Updated weights for policy 0, policy_version 48240 (0.0007) [2023-03-07 08:38:46,094][155452] Updated weights for policy 0, policy_version 48250 (0.0006) [2023-03-07 08:38:46,882][155452] Updated weights for policy 0, policy_version 48260 (0.0006) [2023-03-07 08:38:47,663][155452] Updated weights for policy 0, policy_version 48270 (0.0006) [2023-03-07 08:38:48,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13021.9, 300 sec: 13030.8). Total num frames: 49437696. Throughput: 0: 13014.5. Samples: 49404139. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:38:48,378][155126] Avg episode reward: [(0, '1842.012')] [2023-03-07 08:38:48,452][155452] Updated weights for policy 0, policy_version 48280 (0.0006) [2023-03-07 08:38:49,235][155452] Updated weights for policy 0, policy_version 48290 (0.0006) [2023-03-07 08:38:50,023][155452] Updated weights for policy 0, policy_version 48300 (0.0006) [2023-03-07 08:38:50,802][155452] Updated weights for policy 0, policy_version 48310 (0.0005) [2023-03-07 08:38:51,601][155452] Updated weights for policy 0, policy_version 48320 (0.0006) [2023-03-07 08:38:52,385][155452] Updated weights for policy 0, policy_version 48330 (0.0006) [2023-03-07 08:38:53,162][155452] Updated weights for policy 0, policy_version 48340 (0.0005) [2023-03-07 08:38:53,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13027.4). Total num frames: 49502208. Throughput: 0: 13014.3. Samples: 49482270. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:38:53,367][155126] Avg episode reward: [(0, '1928.877')] [2023-03-07 08:38:53,941][155452] Updated weights for policy 0, policy_version 48350 (0.0006) [2023-03-07 08:38:54,720][155452] Updated weights for policy 0, policy_version 48360 (0.0006) [2023-03-07 08:38:55,502][155452] Updated weights for policy 0, policy_version 48370 (0.0006) [2023-03-07 08:38:56,285][155452] Updated weights for policy 0, policy_version 48380 (0.0006) [2023-03-07 08:38:57,074][155452] Updated weights for policy 0, policy_version 48390 (0.0006) [2023-03-07 08:38:57,852][155452] Updated weights for policy 0, policy_version 48400 (0.0007) [2023-03-07 08:38:58,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13030.8). Total num frames: 49567744. Throughput: 0: 13012.7. Samples: 49560745. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:38:58,367][155126] Avg episode reward: [(0, '1852.531')] [2023-03-07 08:38:58,634][155452] Updated weights for policy 0, policy_version 48410 (0.0007) [2023-03-07 08:38:59,443][155452] Updated weights for policy 0, policy_version 48420 (0.0006) [2023-03-07 08:39:00,229][155452] Updated weights for policy 0, policy_version 48430 (0.0006) [2023-03-07 08:39:01,017][155452] Updated weights for policy 0, policy_version 48440 (0.0006) [2023-03-07 08:39:01,803][155452] Updated weights for policy 0, policy_version 48450 (0.0006) [2023-03-07 08:39:02,605][155452] Updated weights for policy 0, policy_version 48460 (0.0007) [2023-03-07 08:39:03,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13027.4). Total num frames: 49632256. Throughput: 0: 13015.3. Samples: 49599785. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:39:03,367][155126] Avg episode reward: [(0, '1816.355')] [2023-03-07 08:39:03,371][155452] Updated weights for policy 0, policy_version 48470 (0.0006) [2023-03-07 08:39:04,157][155452] Updated weights for policy 0, policy_version 48480 (0.0007) [2023-03-07 08:39:04,933][155452] Updated weights for policy 0, policy_version 48490 (0.0006) [2023-03-07 08:39:05,733][155452] Updated weights for policy 0, policy_version 48500 (0.0006) [2023-03-07 08:39:06,516][155452] Updated weights for policy 0, policy_version 48510 (0.0006) [2023-03-07 08:39:07,302][155452] Updated weights for policy 0, policy_version 48520 (0.0006) [2023-03-07 08:39:08,082][155452] Updated weights for policy 0, policy_version 48530 (0.0006) [2023-03-07 08:39:08,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.8, 300 sec: 13027.4). Total num frames: 49697792. Throughput: 0: 13016.6. Samples: 49677777. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:39:08,367][155126] Avg episode reward: [(0, '1702.761')] [2023-03-07 08:39:08,880][155452] Updated weights for policy 0, policy_version 48540 (0.0006) [2023-03-07 08:39:09,683][155452] Updated weights for policy 0, policy_version 48550 (0.0006) [2023-03-07 08:39:10,467][155452] Updated weights for policy 0, policy_version 48560 (0.0006) [2023-03-07 08:39:11,238][155452] Updated weights for policy 0, policy_version 48570 (0.0006) [2023-03-07 08:39:12,027][155452] Updated weights for policy 0, policy_version 48580 (0.0007) [2023-03-07 08:39:12,817][155452] Updated weights for policy 0, policy_version 48590 (0.0006) [2023-03-07 08:39:13,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13021.9, 300 sec: 13030.8). Total num frames: 49763328. Throughput: 0: 13027.8. Samples: 49755926. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:39:13,368][155126] Avg episode reward: [(0, '1706.034')] [2023-03-07 08:39:13,583][155452] Updated weights for policy 0, policy_version 48600 (0.0007) [2023-03-07 08:39:14,379][155452] Updated weights for policy 0, policy_version 48610 (0.0006) [2023-03-07 08:39:15,178][155452] Updated weights for policy 0, policy_version 48620 (0.0006) [2023-03-07 08:39:15,951][155452] Updated weights for policy 0, policy_version 48630 (0.0005) [2023-03-07 08:39:16,726][155452] Updated weights for policy 0, policy_version 48640 (0.0006) [2023-03-07 08:39:17,506][155452] Updated weights for policy 0, policy_version 48650 (0.0006) [2023-03-07 08:39:18,283][155452] Updated weights for policy 0, policy_version 48660 (0.0006) [2023-03-07 08:39:18,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13004.8, 300 sec: 13030.8). Total num frames: 49827840. Throughput: 0: 13024.6. Samples: 49795118. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:39:18,367][155126] Avg episode reward: [(0, '1941.885')] [2023-03-07 08:39:19,068][155452] Updated weights for policy 0, policy_version 48670 (0.0006) [2023-03-07 08:39:19,857][155452] Updated weights for policy 0, policy_version 48680 (0.0007) [2023-03-07 08:39:20,633][155452] Updated weights for policy 0, policy_version 48690 (0.0006) [2023-03-07 08:39:21,428][155452] Updated weights for policy 0, policy_version 48700 (0.0006) [2023-03-07 08:39:22,211][155452] Updated weights for policy 0, policy_version 48710 (0.0006) [2023-03-07 08:39:22,988][155452] Updated weights for policy 0, policy_version 48720 (0.0007) [2023-03-07 08:39:23,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13030.8). Total num frames: 49893376. Throughput: 0: 13037.5. Samples: 49873772. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:39:23,367][155126] Avg episode reward: [(0, '1690.355')] [2023-03-07 08:39:23,768][155452] Updated weights for policy 0, policy_version 48730 (0.0006) [2023-03-07 08:39:24,580][155452] Updated weights for policy 0, policy_version 48740 (0.0006) [2023-03-07 08:39:25,340][155452] Updated weights for policy 0, policy_version 48750 (0.0006) [2023-03-07 08:39:26,123][155452] Updated weights for policy 0, policy_version 48760 (0.0005) [2023-03-07 08:39:26,916][155452] Updated weights for policy 0, policy_version 48770 (0.0006) [2023-03-07 08:39:27,708][155452] Updated weights for policy 0, policy_version 48780 (0.0006) [2023-03-07 08:39:28,367][155126] Fps is (10 sec: 13107.0, 60 sec: 13038.9, 300 sec: 13030.8). Total num frames: 49958912. Throughput: 0: 13041.9. Samples: 49951967. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:39:28,368][155126] Avg episode reward: [(0, '1897.067')] [2023-03-07 08:39:28,372][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000048788_49958912.pth... 
[2023-03-07 08:39:28,403][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000045734_46831616.pth [2023-03-07 08:39:28,478][155452] Updated weights for policy 0, policy_version 48790 (0.0006) [2023-03-07 08:39:29,280][155452] Updated weights for policy 0, policy_version 48800 (0.0006) [2023-03-07 08:39:30,068][155452] Updated weights for policy 0, policy_version 48810 (0.0007) [2023-03-07 08:39:30,857][155452] Updated weights for policy 0, policy_version 48820 (0.0006) [2023-03-07 08:39:31,653][155452] Updated weights for policy 0, policy_version 48830 (0.0006) [2023-03-07 08:39:32,433][155452] Updated weights for policy 0, policy_version 48840 (0.0006) [2023-03-07 08:39:33,211][155452] Updated weights for policy 0, policy_version 48850 (0.0007) [2023-03-07 08:39:33,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13039.0, 300 sec: 13034.3). Total num frames: 50024448. Throughput: 0: 13043.1. Samples: 49991078. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:39:33,367][155126] Avg episode reward: [(0, '1904.652')] [2023-03-07 08:39:34,000][155452] Updated weights for policy 0, policy_version 48860 (0.0006) [2023-03-07 08:39:34,799][155452] Updated weights for policy 0, policy_version 48870 (0.0006) [2023-03-07 08:39:35,567][155452] Updated weights for policy 0, policy_version 48880 (0.0006) [2023-03-07 08:39:36,348][155452] Updated weights for policy 0, policy_version 48890 (0.0006) [2023-03-07 08:39:37,143][155452] Updated weights for policy 0, policy_version 48900 (0.0006) [2023-03-07 08:39:37,926][155452] Updated weights for policy 0, policy_version 48910 (0.0006) [2023-03-07 08:39:38,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13030.8). Total num frames: 50088960. Throughput: 0: 13041.4. Samples: 50069133. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:39:38,368][155126] Avg episode reward: [(0, '1678.816')] [2023-03-07 08:39:38,697][155452] Updated weights for policy 0, policy_version 48920 (0.0006) [2023-03-07 08:39:39,503][155452] Updated weights for policy 0, policy_version 48930 (0.0006) [2023-03-07 08:39:40,285][155452] Updated weights for policy 0, policy_version 48940 (0.0006) [2023-03-07 08:39:41,058][155452] Updated weights for policy 0, policy_version 48950 (0.0006) [2023-03-07 08:39:41,865][155452] Updated weights for policy 0, policy_version 48960 (0.0006) [2023-03-07 08:39:42,641][155452] Updated weights for policy 0, policy_version 48970 (0.0006) [2023-03-07 08:39:43,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13039.0, 300 sec: 13034.3). Total num frames: 50154496. Throughput: 0: 13034.1. Samples: 50147281. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:39:43,367][155126] Avg episode reward: [(0, '1954.459')] [2023-03-07 08:39:43,428][155452] Updated weights for policy 0, policy_version 48980 (0.0007) [2023-03-07 08:39:44,224][155452] Updated weights for policy 0, policy_version 48990 (0.0007) [2023-03-07 08:39:45,006][155452] Updated weights for policy 0, policy_version 49000 (0.0006) [2023-03-07 08:39:45,802][155452] Updated weights for policy 0, policy_version 49010 (0.0006) [2023-03-07 08:39:46,571][155452] Updated weights for policy 0, policy_version 49020 (0.0007) [2023-03-07 08:39:47,352][155452] Updated weights for policy 0, policy_version 49030 (0.0006) [2023-03-07 08:39:48,142][155452] Updated weights for policy 0, policy_version 49040 (0.0006) [2023-03-07 08:39:48,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13030.8). Total num frames: 50219008. Throughput: 0: 13035.0. Samples: 50186361. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:39:48,367][155126] Avg episode reward: [(0, '1715.106')] [2023-03-07 08:39:48,932][155452] Updated weights for policy 0, policy_version 49050 (0.0006) [2023-03-07 08:39:49,726][155452] Updated weights for policy 0, policy_version 49060 (0.0007) [2023-03-07 08:39:50,514][155452] Updated weights for policy 0, policy_version 49070 (0.0007) [2023-03-07 08:39:51,278][155452] Updated weights for policy 0, policy_version 49080 (0.0006) [2023-03-07 08:39:52,081][155452] Updated weights for policy 0, policy_version 49090 (0.0006) [2023-03-07 08:39:52,850][155452] Updated weights for policy 0, policy_version 49100 (0.0006) [2023-03-07 08:39:53,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13030.8). Total num frames: 50284544. Throughput: 0: 13036.7. Samples: 50264430. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:39:53,367][155126] Avg episode reward: [(0, '1566.408')] [2023-03-07 08:39:53,647][155452] Updated weights for policy 0, policy_version 49110 (0.0006) [2023-03-07 08:39:54,430][155452] Updated weights for policy 0, policy_version 49120 (0.0006) [2023-03-07 08:39:55,205][155452] Updated weights for policy 0, policy_version 49130 (0.0006) [2023-03-07 08:39:56,001][155452] Updated weights for policy 0, policy_version 49140 (0.0006) [2023-03-07 08:39:56,778][155452] Updated weights for policy 0, policy_version 49150 (0.0006) [2023-03-07 08:39:57,548][155452] Updated weights for policy 0, policy_version 49160 (0.0006) [2023-03-07 08:39:58,345][155452] Updated weights for policy 0, policy_version 49170 (0.0006) [2023-03-07 08:39:58,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 50350080. Throughput: 0: 13046.3. Samples: 50343008. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:39:58,367][155126] Avg episode reward: [(0, '1631.470')] [2023-03-07 08:39:59,128][155452] Updated weights for policy 0, policy_version 49180 (0.0006) [2023-03-07 08:39:59,916][155452] Updated weights for policy 0, policy_version 49190 (0.0006) [2023-03-07 08:40:00,717][155452] Updated weights for policy 0, policy_version 49200 (0.0005) [2023-03-07 08:40:01,490][155452] Updated weights for policy 0, policy_version 49210 (0.0006) [2023-03-07 08:40:02,269][155452] Updated weights for policy 0, policy_version 49220 (0.0007) [2023-03-07 08:40:03,043][155452] Updated weights for policy 0, policy_version 49230 (0.0006) [2023-03-07 08:40:03,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13034.3). Total num frames: 50415616. Throughput: 0: 13039.6. Samples: 50381900. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:40:03,367][155126] Avg episode reward: [(0, '1817.463')] [2023-03-07 08:40:03,840][155452] Updated weights for policy 0, policy_version 49240 (0.0006) [2023-03-07 08:40:04,619][155452] Updated weights for policy 0, policy_version 49250 (0.0006) [2023-03-07 08:40:05,423][155452] Updated weights for policy 0, policy_version 49260 (0.0007) [2023-03-07 08:40:06,213][155452] Updated weights for policy 0, policy_version 49270 (0.0006) [2023-03-07 08:40:06,990][155452] Updated weights for policy 0, policy_version 49280 (0.0007) [2023-03-07 08:40:07,777][155452] Updated weights for policy 0, policy_version 49290 (0.0006) [2023-03-07 08:40:08,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13038.9, 300 sec: 13030.8). Total num frames: 50480128. Throughput: 0: 13027.0. Samples: 50459987. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:40:08,368][155126] Avg episode reward: [(0, '1766.602')] [2023-03-07 08:40:08,554][155452] Updated weights for policy 0, policy_version 49300 (0.0006) [2023-03-07 08:40:09,327][155452] Updated weights for policy 0, policy_version 49310 (0.0006) [2023-03-07 08:40:10,107][155452] Updated weights for policy 0, policy_version 49320 (0.0006) [2023-03-07 08:40:10,897][155452] Updated weights for policy 0, policy_version 49330 (0.0006) [2023-03-07 08:40:11,678][155452] Updated weights for policy 0, policy_version 49340 (0.0006) [2023-03-07 08:40:12,443][155452] Updated weights for policy 0, policy_version 49350 (0.0006) [2023-03-07 08:40:13,258][155452] Updated weights for policy 0, policy_version 49360 (0.0006) [2023-03-07 08:40:13,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 50545664. Throughput: 0: 13040.7. Samples: 50538796. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:40:13,367][155126] Avg episode reward: [(0, '1882.114')] [2023-03-07 08:40:14,026][155452] Updated weights for policy 0, policy_version 49370 (0.0007) [2023-03-07 08:40:14,811][155452] Updated weights for policy 0, policy_version 49380 (0.0006) [2023-03-07 08:40:15,610][155452] Updated weights for policy 0, policy_version 49390 (0.0007) [2023-03-07 08:40:16,409][155452] Updated weights for policy 0, policy_version 49400 (0.0006) [2023-03-07 08:40:17,184][155452] Updated weights for policy 0, policy_version 49410 (0.0005) [2023-03-07 08:40:17,979][155452] Updated weights for policy 0, policy_version 49420 (0.0007) [2023-03-07 08:40:18,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13038.9, 300 sec: 13030.8). Total num frames: 50610176. Throughput: 0: 13036.3. Samples: 50577710. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:40:18,367][155126] Avg episode reward: [(0, '1745.321')] [2023-03-07 08:40:18,777][155452] Updated weights for policy 0, policy_version 49430 (0.0006) [2023-03-07 08:40:19,547][155452] Updated weights for policy 0, policy_version 49440 (0.0006) [2023-03-07 08:40:20,354][155452] Updated weights for policy 0, policy_version 49450 (0.0006) [2023-03-07 08:40:21,142][155452] Updated weights for policy 0, policy_version 49460 (0.0006) [2023-03-07 08:40:21,932][155452] Updated weights for policy 0, policy_version 49470 (0.0006) [2023-03-07 08:40:22,714][155452] Updated weights for policy 0, policy_version 49480 (0.0007) [2023-03-07 08:40:23,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 50675712. Throughput: 0: 13030.6. Samples: 50655511. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:40:23,368][155126] Avg episode reward: [(0, '1744.077')] [2023-03-07 08:40:23,493][155452] Updated weights for policy 0, policy_version 49490 (0.0006) [2023-03-07 08:40:24,294][155452] Updated weights for policy 0, policy_version 49500 (0.0007) [2023-03-07 08:40:25,098][155452] Updated weights for policy 0, policy_version 49510 (0.0006) [2023-03-07 08:40:25,860][155452] Updated weights for policy 0, policy_version 49520 (0.0007) [2023-03-07 08:40:26,649][155452] Updated weights for policy 0, policy_version 49530 (0.0006) [2023-03-07 08:40:27,443][155452] Updated weights for policy 0, policy_version 49540 (0.0006) [2023-03-07 08:40:28,232][155452] Updated weights for policy 0, policy_version 49550 (0.0005) [2023-03-07 08:40:28,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13030.8). Total num frames: 50740224. Throughput: 0: 13027.3. Samples: 50733511. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:40:28,367][155126] Avg episode reward: [(0, '1686.445')] [2023-03-07 08:40:29,002][155452] Updated weights for policy 0, policy_version 49560 (0.0006) [2023-03-07 08:40:29,778][155452] Updated weights for policy 0, policy_version 49570 (0.0006) [2023-03-07 08:40:30,559][155452] Updated weights for policy 0, policy_version 49580 (0.0007) [2023-03-07 08:40:31,352][155452] Updated weights for policy 0, policy_version 49590 (0.0006) [2023-03-07 08:40:32,137][155452] Updated weights for policy 0, policy_version 49600 (0.0007) [2023-03-07 08:40:32,915][155452] Updated weights for policy 0, policy_version 49610 (0.0006) [2023-03-07 08:40:33,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 50805760. Throughput: 0: 13034.4. Samples: 50772910. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:40:33,367][155126] Avg episode reward: [(0, '1670.791')] [2023-03-07 08:40:33,723][155452] Updated weights for policy 0, policy_version 49620 (0.0006) [2023-03-07 08:40:34,515][155452] Updated weights for policy 0, policy_version 49630 (0.0006) [2023-03-07 08:40:35,286][155452] Updated weights for policy 0, policy_version 49640 (0.0006) [2023-03-07 08:40:36,079][155452] Updated weights for policy 0, policy_version 49650 (0.0006) [2023-03-07 08:40:36,874][155452] Updated weights for policy 0, policy_version 49660 (0.0006) [2023-03-07 08:40:37,649][155452] Updated weights for policy 0, policy_version 49670 (0.0007) [2023-03-07 08:40:38,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 50871296. Throughput: 0: 13032.8. Samples: 50850907. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:40:38,367][155126] Avg episode reward: [(0, '1855.030')] [2023-03-07 08:40:38,423][155452] Updated weights for policy 0, policy_version 49680 (0.0006) [2023-03-07 08:40:39,191][155452] Updated weights for policy 0, policy_version 49690 (0.0006) [2023-03-07 08:40:39,981][155452] Updated weights for policy 0, policy_version 49700 (0.0006) [2023-03-07 08:40:40,758][155452] Updated weights for policy 0, policy_version 49710 (0.0005) [2023-03-07 08:40:41,546][155452] Updated weights for policy 0, policy_version 49720 (0.0006) [2023-03-07 08:40:42,318][155452] Updated weights for policy 0, policy_version 49730 (0.0006) [2023-03-07 08:40:43,105][155452] Updated weights for policy 0, policy_version 49740 (0.0006) [2023-03-07 08:40:43,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 50936832. Throughput: 0: 13039.9. Samples: 50929804. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:40:43,368][155126] Avg episode reward: [(0, '1858.321')] [2023-03-07 08:40:43,882][155452] Updated weights for policy 0, policy_version 49750 (0.0006) [2023-03-07 08:40:44,662][155452] Updated weights for policy 0, policy_version 49760 (0.0006) [2023-03-07 08:40:45,450][155452] Updated weights for policy 0, policy_version 49770 (0.0006) [2023-03-07 08:40:46,225][155452] Updated weights for policy 0, policy_version 49780 (0.0006) [2023-03-07 08:40:47,027][155452] Updated weights for policy 0, policy_version 49790 (0.0006) [2023-03-07 08:40:47,800][155452] Updated weights for policy 0, policy_version 49800 (0.0006) [2023-03-07 08:40:48,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 51002368. Throughput: 0: 13045.4. Samples: 50968942. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:40:48,367][155126] Avg episode reward: [(0, '1845.324')] [2023-03-07 08:40:48,593][155452] Updated weights for policy 0, policy_version 49810 (0.0007) [2023-03-07 08:40:49,383][155452] Updated weights for policy 0, policy_version 49820 (0.0006) [2023-03-07 08:40:50,157][155452] Updated weights for policy 0, policy_version 49830 (0.0006) [2023-03-07 08:40:50,953][155452] Updated weights for policy 0, policy_version 49840 (0.0006) [2023-03-07 08:40:51,740][155452] Updated weights for policy 0, policy_version 49850 (0.0006) [2023-03-07 08:40:52,526][155452] Updated weights for policy 0, policy_version 49860 (0.0006) [2023-03-07 08:40:53,304][155452] Updated weights for policy 0, policy_version 49870 (0.0006) [2023-03-07 08:40:53,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 51066880. Throughput: 0: 13049.8. Samples: 51047225. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:40:53,367][155126] Avg episode reward: [(0, '1795.459')] [2023-03-07 08:40:54,098][155452] Updated weights for policy 0, policy_version 49880 (0.0006) [2023-03-07 08:40:54,888][155452] Updated weights for policy 0, policy_version 49890 (0.0006) [2023-03-07 08:40:55,682][155452] Updated weights for policy 0, policy_version 49900 (0.0005) [2023-03-07 08:40:56,473][155452] Updated weights for policy 0, policy_version 49910 (0.0006) [2023-03-07 08:40:57,251][155452] Updated weights for policy 0, policy_version 49920 (0.0006) [2023-03-07 08:40:58,018][155452] Updated weights for policy 0, policy_version 49930 (0.0007) [2023-03-07 08:40:58,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13041.2). Total num frames: 51132416. Throughput: 0: 13034.5. Samples: 51125347. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:40:58,378][155126] Avg episode reward: [(0, '2096.242')] [2023-03-07 08:40:58,797][155452] Updated weights for policy 0, policy_version 49940 (0.0005) [2023-03-07 08:40:59,596][155452] Updated weights for policy 0, policy_version 49950 (0.0006) [2023-03-07 08:41:00,361][155452] Updated weights for policy 0, policy_version 49960 (0.0006) [2023-03-07 08:41:01,145][155452] Updated weights for policy 0, policy_version 49970 (0.0006) [2023-03-07 08:41:01,939][155452] Updated weights for policy 0, policy_version 49980 (0.0006) [2023-03-07 08:41:02,735][155452] Updated weights for policy 0, policy_version 49990 (0.0006) [2023-03-07 08:41:03,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 51196928. Throughput: 0: 13039.4. Samples: 51164485. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:41:03,367][155126] Avg episode reward: [(0, '1964.481')] [2023-03-07 08:41:03,530][155452] Updated weights for policy 0, policy_version 50000 (0.0006) [2023-03-07 08:41:04,304][155452] Updated weights for policy 0, policy_version 50010 (0.0006) [2023-03-07 08:41:05,094][155452] Updated weights for policy 0, policy_version 50020 (0.0005) [2023-03-07 08:41:05,866][155452] Updated weights for policy 0, policy_version 50030 (0.0007) [2023-03-07 08:41:06,678][155452] Updated weights for policy 0, policy_version 50040 (0.0006) [2023-03-07 08:41:07,471][155452] Updated weights for policy 0, policy_version 50050 (0.0006) [2023-03-07 08:41:08,233][155452] Updated weights for policy 0, policy_version 50060 (0.0006) [2023-03-07 08:41:08,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13039.0, 300 sec: 13037.8). Total num frames: 51262464. Throughput: 0: 13047.8. Samples: 51242659. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:41:08,367][155126] Avg episode reward: [(0, '2016.239')] [2023-03-07 08:41:09,030][155452] Updated weights for policy 0, policy_version 50070 (0.0006) [2023-03-07 08:41:09,816][155452] Updated weights for policy 0, policy_version 50080 (0.0006) [2023-03-07 08:41:10,576][155452] Updated weights for policy 0, policy_version 50090 (0.0006) [2023-03-07 08:41:11,370][155452] Updated weights for policy 0, policy_version 50100 (0.0006) [2023-03-07 08:41:12,157][155452] Updated weights for policy 0, policy_version 50110 (0.0006) [2023-03-07 08:41:12,939][155452] Updated weights for policy 0, policy_version 50120 (0.0006) [2023-03-07 08:41:13,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 51328000. Throughput: 0: 13053.3. Samples: 51320909. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:41:13,367][155126] Avg episode reward: [(0, '1850.340')] [2023-03-07 08:41:13,741][155452] Updated weights for policy 0, policy_version 50130 (0.0006) [2023-03-07 08:41:14,521][155452] Updated weights for policy 0, policy_version 50140 (0.0007) [2023-03-07 08:41:15,317][155452] Updated weights for policy 0, policy_version 50150 (0.0006) [2023-03-07 08:41:16,097][155452] Updated weights for policy 0, policy_version 50160 (0.0007) [2023-03-07 08:41:16,890][155452] Updated weights for policy 0, policy_version 50170 (0.0006) [2023-03-07 08:41:17,670][155452] Updated weights for policy 0, policy_version 50180 (0.0007) [2023-03-07 08:41:18,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 51393536. Throughput: 0: 13044.8. Samples: 51359927. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:41:18,367][155126] Avg episode reward: [(0, '2103.217')] [2023-03-07 08:41:18,454][155452] Updated weights for policy 0, policy_version 50190 (0.0006) [2023-03-07 08:41:19,242][155452] Updated weights for policy 0, policy_version 50200 (0.0006) [2023-03-07 08:41:20,045][155452] Updated weights for policy 0, policy_version 50210 (0.0007) [2023-03-07 08:41:20,813][155452] Updated weights for policy 0, policy_version 50220 (0.0006) [2023-03-07 08:41:21,609][155452] Updated weights for policy 0, policy_version 50230 (0.0006) [2023-03-07 08:41:22,365][155452] Updated weights for policy 0, policy_version 50240 (0.0006) [2023-03-07 08:41:23,152][155452] Updated weights for policy 0, policy_version 50250 (0.0006) [2023-03-07 08:41:23,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 51458048. Throughput: 0: 13048.1. Samples: 51438073. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:41:23,368][155126] Avg episode reward: [(0, '1805.262')] [2023-03-07 08:41:23,958][155452] Updated weights for policy 0, policy_version 50260 (0.0006) [2023-03-07 08:41:24,736][155452] Updated weights for policy 0, policy_version 50270 (0.0006) [2023-03-07 08:41:25,510][155452] Updated weights for policy 0, policy_version 50280 (0.0006) [2023-03-07 08:41:26,316][155452] Updated weights for policy 0, policy_version 50290 (0.0006) [2023-03-07 08:41:27,114][155452] Updated weights for policy 0, policy_version 50300 (0.0005) [2023-03-07 08:41:27,885][155452] Updated weights for policy 0, policy_version 50310 (0.0006) [2023-03-07 08:41:28,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 51523584. Throughput: 0: 13032.8. Samples: 51516279. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:41:28,367][155126] Avg episode reward: [(0, '2061.820')] [2023-03-07 08:41:28,371][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000050316_51523584.pth... [2023-03-07 08:41:28,401][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000047261_48395264.pth [2023-03-07 08:41:28,663][155452] Updated weights for policy 0, policy_version 50320 (0.0006) [2023-03-07 08:41:29,457][155452] Updated weights for policy 0, policy_version 50330 (0.0007) [2023-03-07 08:41:30,241][155452] Updated weights for policy 0, policy_version 50340 (0.0006) [2023-03-07 08:41:31,041][155452] Updated weights for policy 0, policy_version 50350 (0.0006) [2023-03-07 08:41:31,821][155452] Updated weights for policy 0, policy_version 50360 (0.0006) [2023-03-07 08:41:32,610][155452] Updated weights for policy 0, policy_version 50370 (0.0006) [2023-03-07 08:41:33,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13039.0, 300 sec: 13034.3). Total num frames: 51588096. Throughput: 0: 13032.5. Samples: 51555401. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:41:33,367][155126] Avg episode reward: [(0, '1885.746')] [2023-03-07 08:41:33,385][155452] Updated weights for policy 0, policy_version 50380 (0.0006) [2023-03-07 08:41:34,162][155452] Updated weights for policy 0, policy_version 50390 (0.0006) [2023-03-07 08:41:34,947][155452] Updated weights for policy 0, policy_version 50400 (0.0006) [2023-03-07 08:41:35,739][155452] Updated weights for policy 0, policy_version 50410 (0.0006) [2023-03-07 08:41:36,510][155452] Updated weights for policy 0, policy_version 50420 (0.0006) [2023-03-07 08:41:37,306][155452] Updated weights for policy 0, policy_version 50430 (0.0006) [2023-03-07 08:41:38,086][155452] Updated weights for policy 0, policy_version 50440 (0.0006) [2023-03-07 08:41:38,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 51653632. Throughput: 0: 13032.2. Samples: 51633673. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:41:38,367][155126] Avg episode reward: [(0, '1947.695')] [2023-03-07 08:41:38,864][155452] Updated weights for policy 0, policy_version 50450 (0.0006) [2023-03-07 08:41:39,655][155452] Updated weights for policy 0, policy_version 50460 (0.0006) [2023-03-07 08:41:40,430][155452] Updated weights for policy 0, policy_version 50470 (0.0006) [2023-03-07 08:41:41,238][155452] Updated weights for policy 0, policy_version 50480 (0.0006) [2023-03-07 08:41:42,029][155452] Updated weights for policy 0, policy_version 50490 (0.0006) [2023-03-07 08:41:42,814][155452] Updated weights for policy 0, policy_version 50500 (0.0006) [2023-03-07 08:41:43,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 51718144. Throughput: 0: 13032.9. Samples: 51711827. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:41:43,367][155126] Avg episode reward: [(0, '1776.991')] [2023-03-07 08:41:43,621][155452] Updated weights for policy 0, policy_version 50510 (0.0006) [2023-03-07 08:41:44,396][155452] Updated weights for policy 0, policy_version 50520 (0.0008) [2023-03-07 08:41:45,171][155452] Updated weights for policy 0, policy_version 50530 (0.0006) [2023-03-07 08:41:45,962][155452] Updated weights for policy 0, policy_version 50540 (0.0006) [2023-03-07 08:41:46,734][155452] Updated weights for policy 0, policy_version 50550 (0.0006) [2023-03-07 08:41:47,510][155452] Updated weights for policy 0, policy_version 50560 (0.0006) [2023-03-07 08:41:48,295][155452] Updated weights for policy 0, policy_version 50570 (0.0006) [2023-03-07 08:41:48,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13037.8). Total num frames: 51783680. Throughput: 0: 13028.7. Samples: 51750776. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:41:48,367][155126] Avg episode reward: [(0, '1961.227')] [2023-03-07 08:41:49,109][155452] Updated weights for policy 0, policy_version 50580 (0.0006) [2023-03-07 08:41:49,893][155452] Updated weights for policy 0, policy_version 50590 (0.0006) [2023-03-07 08:41:50,658][155452] Updated weights for policy 0, policy_version 50600 (0.0006) [2023-03-07 08:41:51,457][155452] Updated weights for policy 0, policy_version 50610 (0.0007) [2023-03-07 08:41:52,237][155452] Updated weights for policy 0, policy_version 50620 (0.0006) [2023-03-07 08:41:53,009][155452] Updated weights for policy 0, policy_version 50630 (0.0006) [2023-03-07 08:41:53,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 51849216. Throughput: 0: 13036.0. Samples: 51829278. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:41:53,367][155126] Avg episode reward: [(0, '1809.898')] [2023-03-07 08:41:53,804][155452] Updated weights for policy 0, policy_version 50640 (0.0006) [2023-03-07 08:41:54,591][155452] Updated weights for policy 0, policy_version 50650 (0.0006) [2023-03-07 08:41:55,382][155452] Updated weights for policy 0, policy_version 50660 (0.0006) [2023-03-07 08:41:56,174][155452] Updated weights for policy 0, policy_version 50670 (0.0006) [2023-03-07 08:41:56,954][155452] Updated weights for policy 0, policy_version 50680 (0.0006) [2023-03-07 08:41:57,744][155452] Updated weights for policy 0, policy_version 50690 (0.0006) [2023-03-07 08:41:58,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 51913728. Throughput: 0: 13030.0. Samples: 51907259. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:41:58,367][155126] Avg episode reward: [(0, '1941.459')] [2023-03-07 08:41:58,539][155452] Updated weights for policy 0, policy_version 50700 (0.0007) [2023-03-07 08:41:59,311][155452] Updated weights for policy 0, policy_version 50710 (0.0006) [2023-03-07 08:42:00,099][155452] Updated weights for policy 0, policy_version 50720 (0.0007) [2023-03-07 08:42:00,885][155452] Updated weights for policy 0, policy_version 50730 (0.0005) [2023-03-07 08:42:01,694][155452] Updated weights for policy 0, policy_version 50740 (0.0006) [2023-03-07 08:42:02,490][155452] Updated weights for policy 0, policy_version 50750 (0.0006) [2023-03-07 08:42:03,273][155452] Updated weights for policy 0, policy_version 50760 (0.0007) [2023-03-07 08:42:03,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 51979264. Throughput: 0: 13028.2. Samples: 51946196. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:42:03,367][155126] Avg episode reward: [(0, '1784.636')] [2023-03-07 08:42:04,046][155452] Updated weights for policy 0, policy_version 50770 (0.0006) [2023-03-07 08:42:04,836][155452] Updated weights for policy 0, policy_version 50780 (0.0006) [2023-03-07 08:42:05,617][155452] Updated weights for policy 0, policy_version 50790 (0.0006) [2023-03-07 08:42:06,393][155452] Updated weights for policy 0, policy_version 50800 (0.0006) [2023-03-07 08:42:07,186][155452] Updated weights for policy 0, policy_version 50810 (0.0006) [2023-03-07 08:42:07,969][155452] Updated weights for policy 0, policy_version 50820 (0.0005) [2023-03-07 08:42:08,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 52044800. Throughput: 0: 13027.2. Samples: 52024297. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:42:08,367][155126] Avg episode reward: [(0, '1888.036')] [2023-03-07 08:42:08,748][155452] Updated weights for policy 0, policy_version 50830 (0.0006) [2023-03-07 08:42:09,537][155452] Updated weights for policy 0, policy_version 50840 (0.0007) [2023-03-07 08:42:10,305][155452] Updated weights for policy 0, policy_version 50850 (0.0006) [2023-03-07 08:42:11,068][155452] Updated weights for policy 0, policy_version 50860 (0.0006) [2023-03-07 08:42:11,865][155452] Updated weights for policy 0, policy_version 50870 (0.0005) [2023-03-07 08:42:12,628][155452] Updated weights for policy 0, policy_version 50880 (0.0006) [2023-03-07 08:42:13,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 52110336. Throughput: 0: 13045.3. Samples: 52103317. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:42:13,367][155126] Avg episode reward: [(0, '1883.997')] [2023-03-07 08:42:13,409][155452] Updated weights for policy 0, policy_version 50890 (0.0006) [2023-03-07 08:42:14,193][155452] Updated weights for policy 0, policy_version 50900 (0.0006) [2023-03-07 08:42:14,971][155452] Updated weights for policy 0, policy_version 50910 (0.0005) [2023-03-07 08:42:15,780][155452] Updated weights for policy 0, policy_version 50920 (0.0006) [2023-03-07 08:42:16,565][155452] Updated weights for policy 0, policy_version 50930 (0.0007) [2023-03-07 08:42:17,334][155452] Updated weights for policy 0, policy_version 50940 (0.0006) [2023-03-07 08:42:18,121][155452] Updated weights for policy 0, policy_version 50950 (0.0006) [2023-03-07 08:42:18,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 52175872. Throughput: 0: 13045.2. Samples: 52142436. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:42:18,367][155126] Avg episode reward: [(0, '1838.712')] [2023-03-07 08:42:18,898][155452] Updated weights for policy 0, policy_version 50960 (0.0007) [2023-03-07 08:42:19,686][155452] Updated weights for policy 0, policy_version 50970 (0.0006) [2023-03-07 08:42:20,467][155452] Updated weights for policy 0, policy_version 50980 (0.0007) [2023-03-07 08:42:21,258][155452] Updated weights for policy 0, policy_version 50990 (0.0007) [2023-03-07 08:42:22,054][155452] Updated weights for policy 0, policy_version 51000 (0.0006) [2023-03-07 08:42:22,830][155452] Updated weights for policy 0, policy_version 51010 (0.0006) [2023-03-07 08:42:23,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 52240384. Throughput: 0: 13045.1. Samples: 52220704. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:42:23,367][155126] Avg episode reward: [(0, '2028.755')] [2023-03-07 08:42:23,615][155452] Updated weights for policy 0, policy_version 51020 (0.0006) [2023-03-07 08:42:24,416][155452] Updated weights for policy 0, policy_version 51030 (0.0007) [2023-03-07 08:42:25,226][155452] Updated weights for policy 0, policy_version 51040 (0.0006) [2023-03-07 08:42:26,016][155452] Updated weights for policy 0, policy_version 51050 (0.0006) [2023-03-07 08:42:26,779][155452] Updated weights for policy 0, policy_version 51060 (0.0006) [2023-03-07 08:42:27,573][155452] Updated weights for policy 0, policy_version 51070 (0.0007) [2023-03-07 08:42:28,363][155452] Updated weights for policy 0, policy_version 51080 (0.0006) [2023-03-07 08:42:28,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 52305920. Throughput: 0: 13038.2. Samples: 52298545. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:42:28,368][155126] Avg episode reward: [(0, '1920.135')] [2023-03-07 08:42:29,148][155452] Updated weights for policy 0, policy_version 51090 (0.0006) [2023-03-07 08:42:29,923][155452] Updated weights for policy 0, policy_version 51100 (0.0006) [2023-03-07 08:42:30,726][155452] Updated weights for policy 0, policy_version 51110 (0.0006) [2023-03-07 08:42:31,502][155452] Updated weights for policy 0, policy_version 51120 (0.0005) [2023-03-07 08:42:32,293][155452] Updated weights for policy 0, policy_version 51130 (0.0007) [2023-03-07 08:42:33,064][155452] Updated weights for policy 0, policy_version 51140 (0.0007) [2023-03-07 08:42:33,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13030.8). Total num frames: 52370432. Throughput: 0: 13039.4. Samples: 52337548. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:42:33,367][155126] Avg episode reward: [(0, '1883.887')] [2023-03-07 08:42:33,859][155452] Updated weights for policy 0, policy_version 51150 (0.0006) [2023-03-07 08:42:34,619][155452] Updated weights for policy 0, policy_version 51160 (0.0006) [2023-03-07 08:42:35,413][155452] Updated weights for policy 0, policy_version 51170 (0.0006) [2023-03-07 08:42:36,188][155452] Updated weights for policy 0, policy_version 51180 (0.0005) [2023-03-07 08:42:36,978][155452] Updated weights for policy 0, policy_version 51190 (0.0007) [2023-03-07 08:42:37,736][155452] Updated weights for policy 0, policy_version 51200 (0.0006) [2023-03-07 08:42:38,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 52435968. Throughput: 0: 13047.1. Samples: 52416397. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:42:38,367][155126] Avg episode reward: [(0, '2124.550')] [2023-03-07 08:42:38,540][155452] Updated weights for policy 0, policy_version 51210 (0.0006) [2023-03-07 08:42:39,330][155452] Updated weights for policy 0, policy_version 51220 (0.0006) [2023-03-07 08:42:40,099][155452] Updated weights for policy 0, policy_version 51230 (0.0006) [2023-03-07 08:42:40,876][155452] Updated weights for policy 0, policy_version 51240 (0.0006) [2023-03-07 08:42:41,670][155452] Updated weights for policy 0, policy_version 51250 (0.0007) [2023-03-07 08:42:42,464][155452] Updated weights for policy 0, policy_version 51260 (0.0006) [2023-03-07 08:42:43,258][155452] Updated weights for policy 0, policy_version 51270 (0.0007) [2023-03-07 08:42:43,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13034.3). Total num frames: 52501504. Throughput: 0: 13051.1. Samples: 52494557. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:42:43,367][155126] Avg episode reward: [(0, '2013.497')] [2023-03-07 08:42:44,032][155452] Updated weights for policy 0, policy_version 51280 (0.0007) [2023-03-07 08:42:44,830][155452] Updated weights for policy 0, policy_version 51290 (0.0005) [2023-03-07 08:42:45,607][155452] Updated weights for policy 0, policy_version 51300 (0.0006) [2023-03-07 08:42:46,397][155452] Updated weights for policy 0, policy_version 51310 (0.0006) [2023-03-07 08:42:47,179][155452] Updated weights for policy 0, policy_version 51320 (0.0006) [2023-03-07 08:42:47,968][155452] Updated weights for policy 0, policy_version 51330 (0.0005) [2023-03-07 08:42:48,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 52567040. Throughput: 0: 13056.2. Samples: 52533725. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:42:48,378][155126] Avg episode reward: [(0, '2046.123')] [2023-03-07 08:42:48,759][155452] Updated weights for policy 0, policy_version 51340 (0.0006) [2023-03-07 08:42:49,531][155452] Updated weights for policy 0, policy_version 51350 (0.0006) [2023-03-07 08:42:50,329][155452] Updated weights for policy 0, policy_version 51360 (0.0007) [2023-03-07 08:42:51,093][155452] Updated weights for policy 0, policy_version 51370 (0.0007) [2023-03-07 08:42:51,869][155452] Updated weights for policy 0, policy_version 51380 (0.0007) [2023-03-07 08:42:52,647][155452] Updated weights for policy 0, policy_version 51390 (0.0006) [2023-03-07 08:42:53,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 52632576. Throughput: 0: 13061.6. Samples: 52612071. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:42:53,367][155126] Avg episode reward: [(0, '1796.675')] [2023-03-07 08:42:53,447][155452] Updated weights for policy 0, policy_version 51400 (0.0006) [2023-03-07 08:42:54,226][155452] Updated weights for policy 0, policy_version 51410 (0.0006) [2023-03-07 08:42:55,014][155452] Updated weights for policy 0, policy_version 51420 (0.0006) [2023-03-07 08:42:55,803][155452] Updated weights for policy 0, policy_version 51430 (0.0006) [2023-03-07 08:42:56,595][155452] Updated weights for policy 0, policy_version 51440 (0.0006) [2023-03-07 08:42:57,393][155452] Updated weights for policy 0, policy_version 51450 (0.0007) [2023-03-07 08:42:58,162][155452] Updated weights for policy 0, policy_version 51460 (0.0008) [2023-03-07 08:42:58,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13034.3). Total num frames: 52697088. Throughput: 0: 13044.3. Samples: 52690311. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:42:58,367][155126] Avg episode reward: [(0, '2101.321')] [2023-03-07 08:42:58,944][155452] Updated weights for policy 0, policy_version 51470 (0.0006) [2023-03-07 08:42:59,732][155452] Updated weights for policy 0, policy_version 51480 (0.0006) [2023-03-07 08:43:00,509][155452] Updated weights for policy 0, policy_version 51490 (0.0006) [2023-03-07 08:43:01,297][155452] Updated weights for policy 0, policy_version 51500 (0.0006) [2023-03-07 08:43:02,084][155452] Updated weights for policy 0, policy_version 51510 (0.0007) [2023-03-07 08:43:02,854][155452] Updated weights for policy 0, policy_version 51520 (0.0006) [2023-03-07 08:43:03,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 52762624. Throughput: 0: 13046.6. Samples: 52729534. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:43:03,367][155126] Avg episode reward: [(0, '1858.073')] [2023-03-07 08:43:03,649][155452] Updated weights for policy 0, policy_version 51530 (0.0006) [2023-03-07 08:43:04,428][155452] Updated weights for policy 0, policy_version 51540 (0.0006) [2023-03-07 08:43:05,201][155452] Updated weights for policy 0, policy_version 51550 (0.0007) [2023-03-07 08:43:05,986][155452] Updated weights for policy 0, policy_version 51560 (0.0007) [2023-03-07 08:43:06,756][155452] Updated weights for policy 0, policy_version 51570 (0.0006) [2023-03-07 08:43:07,564][155452] Updated weights for policy 0, policy_version 51580 (0.0006) [2023-03-07 08:43:08,349][155452] Updated weights for policy 0, policy_version 51590 (0.0006) [2023-03-07 08:43:08,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 52828160. Throughput: 0: 13051.4. Samples: 52808016. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:43:08,368][155126] Avg episode reward: [(0, '1982.572')] [2023-03-07 08:43:09,147][155452] Updated weights for policy 0, policy_version 51600 (0.0007) [2023-03-07 08:43:09,924][155452] Updated weights for policy 0, policy_version 51610 (0.0005) [2023-03-07 08:43:10,695][155452] Updated weights for policy 0, policy_version 51620 (0.0007) [2023-03-07 08:43:11,470][155452] Updated weights for policy 0, policy_version 51630 (0.0006) [2023-03-07 08:43:12,258][155452] Updated weights for policy 0, policy_version 51640 (0.0006) [2023-03-07 08:43:13,049][155452] Updated weights for policy 0, policy_version 51650 (0.0006) [2023-03-07 08:43:13,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 52893696. Throughput: 0: 13064.3. Samples: 52886440. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:43:13,367][155126] Avg episode reward: [(0, '1877.498')] [2023-03-07 08:43:13,832][155452] Updated weights for policy 0, policy_version 51660 (0.0007) [2023-03-07 08:43:14,622][155452] Updated weights for policy 0, policy_version 51670 (0.0006) [2023-03-07 08:43:15,398][155452] Updated weights for policy 0, policy_version 51680 (0.0006) [2023-03-07 08:43:16,202][155452] Updated weights for policy 0, policy_version 51690 (0.0006) [2023-03-07 08:43:16,978][155452] Updated weights for policy 0, policy_version 51700 (0.0006) [2023-03-07 08:43:17,756][155452] Updated weights for policy 0, policy_version 51710 (0.0006) [2023-03-07 08:43:18,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 52958208. Throughput: 0: 13062.9. Samples: 52925379. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:43:18,367][155126] Avg episode reward: [(0, '1966.082')] [2023-03-07 08:43:18,545][155452] Updated weights for policy 0, policy_version 51720 (0.0007) [2023-03-07 08:43:19,346][155452] Updated weights for policy 0, policy_version 51730 (0.0006) [2023-03-07 08:43:20,128][155452] Updated weights for policy 0, policy_version 51740 (0.0006) [2023-03-07 08:43:20,924][155452] Updated weights for policy 0, policy_version 51750 (0.0006) [2023-03-07 08:43:21,703][155452] Updated weights for policy 0, policy_version 51760 (0.0005) [2023-03-07 08:43:22,469][155452] Updated weights for policy 0, policy_version 51770 (0.0006) [2023-03-07 08:43:23,264][155452] Updated weights for policy 0, policy_version 51780 (0.0006) [2023-03-07 08:43:23,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13041.2). Total num frames: 53023744. Throughput: 0: 13048.3. Samples: 53003569. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:43:23,368][155126] Avg episode reward: [(0, '2020.423')] [2023-03-07 08:43:24,069][155452] Updated weights for policy 0, policy_version 51790 (0.0007) [2023-03-07 08:43:24,838][155452] Updated weights for policy 0, policy_version 51800 (0.0007) [2023-03-07 08:43:25,622][155452] Updated weights for policy 0, policy_version 51810 (0.0006) [2023-03-07 08:43:26,407][155452] Updated weights for policy 0, policy_version 51820 (0.0006) [2023-03-07 08:43:27,198][155452] Updated weights for policy 0, policy_version 51830 (0.0006) [2023-03-07 08:43:27,976][155452] Updated weights for policy 0, policy_version 51840 (0.0006) [2023-03-07 08:43:28,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13039.0, 300 sec: 13037.8). Total num frames: 53088256. Throughput: 0: 13047.5. Samples: 53081694. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:43:28,367][155126] Avg episode reward: [(0, '1927.652')] [2023-03-07 08:43:28,375][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000051845_53089280.pth... [2023-03-07 08:43:28,405][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000048788_49958912.pth [2023-03-07 08:43:28,752][155452] Updated weights for policy 0, policy_version 51850 (0.0006) [2023-03-07 08:43:29,552][155452] Updated weights for policy 0, policy_version 51860 (0.0006) [2023-03-07 08:43:30,339][155452] Updated weights for policy 0, policy_version 51870 (0.0006) [2023-03-07 08:43:31,127][155452] Updated weights for policy 0, policy_version 51880 (0.0007) [2023-03-07 08:43:31,906][155452] Updated weights for policy 0, policy_version 51890 (0.0005) [2023-03-07 08:43:32,691][155452] Updated weights for policy 0, policy_version 51900 (0.0006) [2023-03-07 08:43:33,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13041.2). Total num frames: 53153792. Throughput: 0: 13045.4. 
Samples: 53120770. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:43:33,367][155126] Avg episode reward: [(0, '2083.868')] [2023-03-07 08:43:33,485][155452] Updated weights for policy 0, policy_version 51910 (0.0006) [2023-03-07 08:43:34,272][155452] Updated weights for policy 0, policy_version 51920 (0.0006) [2023-03-07 08:43:35,063][155452] Updated weights for policy 0, policy_version 51930 (0.0007) [2023-03-07 08:43:35,840][155452] Updated weights for policy 0, policy_version 51940 (0.0006) [2023-03-07 08:43:36,618][155452] Updated weights for policy 0, policy_version 51950 (0.0006) [2023-03-07 08:43:37,402][155452] Updated weights for policy 0, policy_version 51960 (0.0006) [2023-03-07 08:43:38,197][155452] Updated weights for policy 0, policy_version 51970 (0.0006) [2023-03-07 08:43:38,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13041.3). Total num frames: 53219328. Throughput: 0: 13040.3. Samples: 53198882. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:43:38,367][155126] Avg episode reward: [(0, '1990.861')] [2023-03-07 08:43:38,991][155452] Updated weights for policy 0, policy_version 51980 (0.0006) [2023-03-07 08:43:39,778][155452] Updated weights for policy 0, policy_version 51990 (0.0006) [2023-03-07 08:43:40,585][155452] Updated weights for policy 0, policy_version 52000 (0.0006) [2023-03-07 08:43:41,367][155452] Updated weights for policy 0, policy_version 52010 (0.0006) [2023-03-07 08:43:42,167][155452] Updated weights for policy 0, policy_version 52020 (0.0006) [2023-03-07 08:43:42,960][155452] Updated weights for policy 0, policy_version 52030 (0.0005) [2023-03-07 08:43:43,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 53283840. Throughput: 0: 13033.7. Samples: 53276829. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:43:43,368][155126] Avg episode reward: [(0, '1871.702')] [2023-03-07 08:43:43,737][155452] Updated weights for policy 0, policy_version 52040 (0.0007) [2023-03-07 08:43:44,503][155452] Updated weights for policy 0, policy_version 52050 (0.0006) [2023-03-07 08:43:45,304][155452] Updated weights for policy 0, policy_version 52060 (0.0006) [2023-03-07 08:43:46,068][155452] Updated weights for policy 0, policy_version 52070 (0.0006) [2023-03-07 08:43:46,868][155452] Updated weights for policy 0, policy_version 52080 (0.0006) [2023-03-07 08:43:47,658][155452] Updated weights for policy 0, policy_version 52090 (0.0006) [2023-03-07 08:43:48,367][155126] Fps is (10 sec: 12902.3, 60 sec: 13021.9, 300 sec: 13037.8). Total num frames: 53348352. Throughput: 0: 13031.2. Samples: 53315939. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:43:48,368][155126] Avg episode reward: [(0, '1900.361')] [2023-03-07 08:43:48,441][155452] Updated weights for policy 0, policy_version 52100 (0.0006) [2023-03-07 08:43:49,221][155452] Updated weights for policy 0, policy_version 52110 (0.0006) [2023-03-07 08:43:50,018][155452] Updated weights for policy 0, policy_version 52120 (0.0006) [2023-03-07 08:43:50,793][155452] Updated weights for policy 0, policy_version 52130 (0.0006) [2023-03-07 08:43:51,573][155452] Updated weights for policy 0, policy_version 52140 (0.0007) [2023-03-07 08:43:52,362][155452] Updated weights for policy 0, policy_version 52150 (0.0006) [2023-03-07 08:43:53,142][155452] Updated weights for policy 0, policy_version 52160 (0.0007) [2023-03-07 08:43:53,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13037.8). Total num frames: 53413888. Throughput: 0: 13024.0. Samples: 53394095. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:43:53,367][155126] Avg episode reward: [(0, '1968.310')] [2023-03-07 08:43:53,927][155452] Updated weights for policy 0, policy_version 52170 (0.0006) [2023-03-07 08:43:54,723][155452] Updated weights for policy 0, policy_version 52180 (0.0006) [2023-03-07 08:43:55,506][155452] Updated weights for policy 0, policy_version 52190 (0.0008) [2023-03-07 08:43:56,286][155452] Updated weights for policy 0, policy_version 52200 (0.0007) [2023-03-07 08:43:57,098][155452] Updated weights for policy 0, policy_version 52210 (0.0008) [2023-03-07 08:43:57,888][155452] Updated weights for policy 0, policy_version 52220 (0.0006) [2023-03-07 08:43:58,367][155126] Fps is (10 sec: 13107.5, 60 sec: 13039.0, 300 sec: 13041.3). Total num frames: 53479424. Throughput: 0: 13014.2. Samples: 53472078. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:43:58,367][155126] Avg episode reward: [(0, '1970.503')] [2023-03-07 08:43:58,686][155452] Updated weights for policy 0, policy_version 52230 (0.0005) [2023-03-07 08:43:59,462][155452] Updated weights for policy 0, policy_version 52240 (0.0007) [2023-03-07 08:44:00,247][155452] Updated weights for policy 0, policy_version 52250 (0.0006) [2023-03-07 08:44:01,038][155452] Updated weights for policy 0, policy_version 52260 (0.0006) [2023-03-07 08:44:01,811][155452] Updated weights for policy 0, policy_version 52270 (0.0006) [2023-03-07 08:44:02,587][155452] Updated weights for policy 0, policy_version 52280 (0.0006) [2023-03-07 08:44:03,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13037.8). Total num frames: 53543936. Throughput: 0: 13016.8. Samples: 53511137. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:44:03,368][155126] Avg episode reward: [(0, '2000.272')] [2023-03-07 08:44:03,393][155452] Updated weights for policy 0, policy_version 52290 (0.0006) [2023-03-07 08:44:04,170][155452] Updated weights for policy 0, policy_version 52300 (0.0006) [2023-03-07 08:44:04,944][155452] Updated weights for policy 0, policy_version 52310 (0.0006) [2023-03-07 08:44:05,735][155452] Updated weights for policy 0, policy_version 52320 (0.0006) [2023-03-07 08:44:06,515][155452] Updated weights for policy 0, policy_version 52330 (0.0006) [2023-03-07 08:44:07,305][155452] Updated weights for policy 0, policy_version 52340 (0.0006) [2023-03-07 08:44:08,077][155452] Updated weights for policy 0, policy_version 52350 (0.0006) [2023-03-07 08:44:08,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13021.9, 300 sec: 13037.8). Total num frames: 53609472. Throughput: 0: 13018.6. Samples: 53589405. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:44:08,368][155126] Avg episode reward: [(0, '1883.911')] [2023-03-07 08:44:08,865][155452] Updated weights for policy 0, policy_version 52360 (0.0006) [2023-03-07 08:44:09,665][155452] Updated weights for policy 0, policy_version 52370 (0.0006) [2023-03-07 08:44:10,427][155452] Updated weights for policy 0, policy_version 52380 (0.0005) [2023-03-07 08:44:11,220][155452] Updated weights for policy 0, policy_version 52390 (0.0007) [2023-03-07 08:44:12,009][155452] Updated weights for policy 0, policy_version 52400 (0.0006) [2023-03-07 08:44:12,810][155452] Updated weights for policy 0, policy_version 52410 (0.0006) [2023-03-07 08:44:13,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13021.9, 300 sec: 13041.2). Total num frames: 53675008. Throughput: 0: 13024.8. Samples: 53667809. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:44:13,367][155126] Avg episode reward: [(0, '1976.625')] [2023-03-07 08:44:13,582][155452] Updated weights for policy 0, policy_version 52420 (0.0006) [2023-03-07 08:44:14,362][155452] Updated weights for policy 0, policy_version 52430 (0.0006) [2023-03-07 08:44:15,149][155452] Updated weights for policy 0, policy_version 52440 (0.0006) [2023-03-07 08:44:15,937][155452] Updated weights for policy 0, policy_version 52450 (0.0007) [2023-03-07 08:44:16,744][155452] Updated weights for policy 0, policy_version 52460 (0.0006) [2023-03-07 08:44:17,507][155452] Updated weights for policy 0, policy_version 52470 (0.0006) [2023-03-07 08:44:18,313][155452] Updated weights for policy 0, policy_version 52480 (0.0006) [2023-03-07 08:44:18,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13037.8). Total num frames: 53739520. Throughput: 0: 13025.7. Samples: 53706928. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:44:18,368][155126] Avg episode reward: [(0, '1903.290')] [2023-03-07 08:44:19,091][155452] Updated weights for policy 0, policy_version 52490 (0.0006) [2023-03-07 08:44:19,866][155452] Updated weights for policy 0, policy_version 52500 (0.0007) [2023-03-07 08:44:20,671][155452] Updated weights for policy 0, policy_version 52510 (0.0006) [2023-03-07 08:44:21,464][155452] Updated weights for policy 0, policy_version 52520 (0.0006) [2023-03-07 08:44:22,255][155452] Updated weights for policy 0, policy_version 52530 (0.0006) [2023-03-07 08:44:23,054][155452] Updated weights for policy 0, policy_version 52540 (0.0006) [2023-03-07 08:44:23,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13037.8). Total num frames: 53805056. Throughput: 0: 13014.7. Samples: 53784543. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:44:23,367][155126] Avg episode reward: [(0, '1838.292')] [2023-03-07 08:44:23,839][155452] Updated weights for policy 0, policy_version 52550 (0.0006) [2023-03-07 08:44:24,623][155452] Updated weights for policy 0, policy_version 52560 (0.0007) [2023-03-07 08:44:25,420][155452] Updated weights for policy 0, policy_version 52570 (0.0007) [2023-03-07 08:44:26,194][155452] Updated weights for policy 0, policy_version 52580 (0.0007) [2023-03-07 08:44:26,998][155452] Updated weights for policy 0, policy_version 52590 (0.0006) [2023-03-07 08:44:27,792][155452] Updated weights for policy 0, policy_version 52600 (0.0006) [2023-03-07 08:44:28,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 53869568. Throughput: 0: 13013.8. Samples: 53862449. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:44:28,367][155126] Avg episode reward: [(0, '1820.874')] [2023-03-07 08:44:28,585][155452] Updated weights for policy 0, policy_version 52610 (0.0006) [2023-03-07 08:44:29,375][155452] Updated weights for policy 0, policy_version 52620 (0.0006) [2023-03-07 08:44:30,167][155452] Updated weights for policy 0, policy_version 52630 (0.0006) [2023-03-07 08:44:30,940][155452] Updated weights for policy 0, policy_version 52640 (0.0006) [2023-03-07 08:44:31,738][155452] Updated weights for policy 0, policy_version 52650 (0.0006) [2023-03-07 08:44:32,528][155452] Updated weights for policy 0, policy_version 52660 (0.0006) [2023-03-07 08:44:33,298][155452] Updated weights for policy 0, policy_version 52670 (0.0006) [2023-03-07 08:44:33,367][155126] Fps is (10 sec: 12902.3, 60 sec: 13004.8, 300 sec: 13034.3). Total num frames: 53934080. Throughput: 0: 13010.2. Samples: 53901397. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:44:33,367][155126] Avg episode reward: [(0, '1877.978')] [2023-03-07 08:44:34,097][155452] Updated weights for policy 0, policy_version 52680 (0.0006) [2023-03-07 08:44:34,864][155452] Updated weights for policy 0, policy_version 52690 (0.0005) [2023-03-07 08:44:35,661][155452] Updated weights for policy 0, policy_version 52700 (0.0007) [2023-03-07 08:44:36,458][155452] Updated weights for policy 0, policy_version 52710 (0.0007) [2023-03-07 08:44:37,221][155452] Updated weights for policy 0, policy_version 52720 (0.0006) [2023-03-07 08:44:38,017][155452] Updated weights for policy 0, policy_version 52730 (0.0006) [2023-03-07 08:44:38,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13004.8, 300 sec: 13034.3). Total num frames: 53999616. Throughput: 0: 13008.0. Samples: 53979453. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:44:38,367][155126] Avg episode reward: [(0, '1803.666')] [2023-03-07 08:44:38,815][155452] Updated weights for policy 0, policy_version 52740 (0.0006) [2023-03-07 08:44:39,597][155452] Updated weights for policy 0, policy_version 52750 (0.0006) [2023-03-07 08:44:40,395][155452] Updated weights for policy 0, policy_version 52760 (0.0006) [2023-03-07 08:44:41,166][155452] Updated weights for policy 0, policy_version 52770 (0.0006) [2023-03-07 08:44:41,953][155452] Updated weights for policy 0, policy_version 52780 (0.0006) [2023-03-07 08:44:42,753][155452] Updated weights for policy 0, policy_version 52790 (0.0006) [2023-03-07 08:44:43,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13034.3). Total num frames: 54064128. Throughput: 0: 13010.2. Samples: 54057538. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:44:43,367][155126] Avg episode reward: [(0, '2059.122')] [2023-03-07 08:44:43,535][155452] Updated weights for policy 0, policy_version 52800 (0.0007) [2023-03-07 08:44:44,313][155452] Updated weights for policy 0, policy_version 52810 (0.0006) [2023-03-07 08:44:45,118][155452] Updated weights for policy 0, policy_version 52820 (0.0006) [2023-03-07 08:44:45,893][155452] Updated weights for policy 0, policy_version 52830 (0.0007) [2023-03-07 08:44:46,682][155452] Updated weights for policy 0, policy_version 52840 (0.0006) [2023-03-07 08:44:47,461][155452] Updated weights for policy 0, policy_version 52850 (0.0006) [2023-03-07 08:44:48,259][155452] Updated weights for policy 0, policy_version 52860 (0.0006) [2023-03-07 08:44:48,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 54129664. Throughput: 0: 13012.1. Samples: 54096683. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:44:48,367][155126] Avg episode reward: [(0, '2038.339')] [2023-03-07 08:44:49,043][155452] Updated weights for policy 0, policy_version 52870 (0.0007) [2023-03-07 08:44:49,834][155452] Updated weights for policy 0, policy_version 52880 (0.0006) [2023-03-07 08:44:50,610][155452] Updated weights for policy 0, policy_version 52890 (0.0008) [2023-03-07 08:44:51,410][155452] Updated weights for policy 0, policy_version 52900 (0.0006) [2023-03-07 08:44:52,181][155452] Updated weights for policy 0, policy_version 52910 (0.0006) [2023-03-07 08:44:52,964][155452] Updated weights for policy 0, policy_version 52920 (0.0006) [2023-03-07 08:44:53,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 54195200. Throughput: 0: 13005.6. Samples: 54174654. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:44:53,367][155126] Avg episode reward: [(0, '2077.136')] [2023-03-07 08:44:53,751][155452] Updated weights for policy 0, policy_version 52930 (0.0006) [2023-03-07 08:44:54,535][155452] Updated weights for policy 0, policy_version 52940 (0.0006) [2023-03-07 08:44:55,333][155452] Updated weights for policy 0, policy_version 52950 (0.0006) [2023-03-07 08:44:56,111][155452] Updated weights for policy 0, policy_version 52960 (0.0006) [2023-03-07 08:44:56,891][155452] Updated weights for policy 0, policy_version 52970 (0.0006) [2023-03-07 08:44:57,681][155452] Updated weights for policy 0, policy_version 52980 (0.0006) [2023-03-07 08:44:58,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13004.8, 300 sec: 13030.8). Total num frames: 54259712. Throughput: 0: 13003.0. Samples: 54252943. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:44:58,367][155126] Avg episode reward: [(0, '2011.897')] [2023-03-07 08:44:58,477][155452] Updated weights for policy 0, policy_version 52990 (0.0007) [2023-03-07 08:44:59,250][155452] Updated weights for policy 0, policy_version 53000 (0.0006) [2023-03-07 08:45:00,045][155452] Updated weights for policy 0, policy_version 53010 (0.0006) [2023-03-07 08:45:00,834][155452] Updated weights for policy 0, policy_version 53020 (0.0006) [2023-03-07 08:45:01,634][155452] Updated weights for policy 0, policy_version 53030 (0.0006) [2023-03-07 08:45:02,414][155452] Updated weights for policy 0, policy_version 53040 (0.0006) [2023-03-07 08:45:03,223][155452] Updated weights for policy 0, policy_version 53050 (0.0006) [2023-03-07 08:45:03,367][155126] Fps is (10 sec: 12902.3, 60 sec: 13004.8, 300 sec: 13030.8). Total num frames: 54324224. Throughput: 0: 12999.1. Samples: 54291887. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:45:03,367][155126] Avg episode reward: [(0, '1928.543')] [2023-03-07 08:45:04,001][155452] Updated weights for policy 0, policy_version 53060 (0.0007) [2023-03-07 08:45:04,789][155452] Updated weights for policy 0, policy_version 53070 (0.0006) [2023-03-07 08:45:05,570][155452] Updated weights for policy 0, policy_version 53080 (0.0006) [2023-03-07 08:45:06,335][155452] Updated weights for policy 0, policy_version 53090 (0.0006) [2023-03-07 08:45:07,132][155452] Updated weights for policy 0, policy_version 53100 (0.0006) [2023-03-07 08:45:07,927][155452] Updated weights for policy 0, policy_version 53110 (0.0007) [2023-03-07 08:45:08,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13030.8). Total num frames: 54389760. Throughput: 0: 13009.5. Samples: 54369971. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:45:08,367][155126] Avg episode reward: [(0, '2013.258')] [2023-03-07 08:45:08,719][155452] Updated weights for policy 0, policy_version 53120 (0.0006) [2023-03-07 08:45:09,503][155452] Updated weights for policy 0, policy_version 53130 (0.0007) [2023-03-07 08:45:10,289][155452] Updated weights for policy 0, policy_version 53140 (0.0006) [2023-03-07 08:45:11,073][155452] Updated weights for policy 0, policy_version 53150 (0.0006) [2023-03-07 08:45:11,851][155452] Updated weights for policy 0, policy_version 53160 (0.0006) [2023-03-07 08:45:12,640][155452] Updated weights for policy 0, policy_version 53170 (0.0007) [2023-03-07 08:45:13,367][155126] Fps is (10 sec: 13107.4, 60 sec: 13004.8, 300 sec: 13034.3). Total num frames: 54455296. Throughput: 0: 13013.5. Samples: 54448054. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:45:13,367][155126] Avg episode reward: [(0, '1861.877')] [2023-03-07 08:45:13,411][155452] Updated weights for policy 0, policy_version 53180 (0.0006) [2023-03-07 08:45:14,204][155452] Updated weights for policy 0, policy_version 53190 (0.0007) [2023-03-07 08:45:14,978][155452] Updated weights for policy 0, policy_version 53200 (0.0007) [2023-03-07 08:45:15,757][155452] Updated weights for policy 0, policy_version 53210 (0.0006) [2023-03-07 08:45:16,548][155452] Updated weights for policy 0, policy_version 53220 (0.0007) [2023-03-07 08:45:17,339][155452] Updated weights for policy 0, policy_version 53230 (0.0006) [2023-03-07 08:45:18,147][155452] Updated weights for policy 0, policy_version 53240 (0.0006) [2023-03-07 08:45:18,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 54520832. Throughput: 0: 13022.2. Samples: 54487397. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:45:18,367][155126] Avg episode reward: [(0, '2161.578')] [2023-03-07 08:45:18,918][155452] Updated weights for policy 0, policy_version 53250 (0.0006) [2023-03-07 08:45:19,691][155452] Updated weights for policy 0, policy_version 53260 (0.0006) [2023-03-07 08:45:20,509][155452] Updated weights for policy 0, policy_version 53270 (0.0006) [2023-03-07 08:45:21,268][155452] Updated weights for policy 0, policy_version 53280 (0.0006) [2023-03-07 08:45:22,067][155452] Updated weights for policy 0, policy_version 53290 (0.0006) [2023-03-07 08:45:22,867][155452] Updated weights for policy 0, policy_version 53300 (0.0006) [2023-03-07 08:45:23,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13004.8, 300 sec: 13034.3). Total num frames: 54585344. Throughput: 0: 13019.6. Samples: 54565338. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:45:23,367][155126] Avg episode reward: [(0, '2094.762')] [2023-03-07 08:45:23,639][155452] Updated weights for policy 0, policy_version 53310 (0.0006) [2023-03-07 08:45:24,424][155452] Updated weights for policy 0, policy_version 53320 (0.0007) [2023-03-07 08:45:25,186][155452] Updated weights for policy 0, policy_version 53330 (0.0006) [2023-03-07 08:45:25,990][155452] Updated weights for policy 0, policy_version 53340 (0.0007) [2023-03-07 08:45:26,780][155452] Updated weights for policy 0, policy_version 53350 (0.0006) [2023-03-07 08:45:27,567][155452] Updated weights for policy 0, policy_version 53360 (0.0006) [2023-03-07 08:45:28,352][155452] Updated weights for policy 0, policy_version 53370 (0.0006) [2023-03-07 08:45:28,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 54650880. Throughput: 0: 13019.9. Samples: 54643435. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:45:28,367][155126] Avg episode reward: [(0, '1945.939')] [2023-03-07 08:45:28,371][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000053370_54650880.pth... 
[2023-03-07 08:45:28,402][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000050316_51523584.pth [2023-03-07 08:45:29,134][155452] Updated weights for policy 0, policy_version 53380 (0.0007) [2023-03-07 08:45:29,920][155452] Updated weights for policy 0, policy_version 53390 (0.0006) [2023-03-07 08:45:30,726][155452] Updated weights for policy 0, policy_version 53400 (0.0006) [2023-03-07 08:45:31,506][155452] Updated weights for policy 0, policy_version 53410 (0.0007) [2023-03-07 08:45:32,296][155452] Updated weights for policy 0, policy_version 53420 (0.0006) [2023-03-07 08:45:33,072][155452] Updated weights for policy 0, policy_version 53430 (0.0007) [2023-03-07 08:45:33,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13030.8). Total num frames: 54715392. Throughput: 0: 13019.9. Samples: 54682578. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 08:45:33,367][155126] Avg episode reward: [(0, '1965.544')] [2023-03-07 08:45:33,848][155452] Updated weights for policy 0, policy_version 53440 (0.0007) [2023-03-07 08:45:34,647][155452] Updated weights for policy 0, policy_version 53450 (0.0008) [2023-03-07 08:45:35,429][155452] Updated weights for policy 0, policy_version 53460 (0.0006) [2023-03-07 08:45:36,228][155452] Updated weights for policy 0, policy_version 53470 (0.0006) [2023-03-07 08:45:37,016][155452] Updated weights for policy 0, policy_version 53480 (0.0006) [2023-03-07 08:45:37,783][155452] Updated weights for policy 0, policy_version 53490 (0.0008) [2023-03-07 08:45:38,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.8, 300 sec: 13030.8). Total num frames: 54780928. Throughput: 0: 13022.2. Samples: 54760655. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 08:45:38,367][155126] Avg episode reward: [(0, '1858.424')] [2023-03-07 08:45:38,562][155452] Updated weights for policy 0, policy_version 53500 (0.0006) [2023-03-07 08:45:39,372][155452] Updated weights for policy 0, policy_version 53510 (0.0006) [2023-03-07 08:45:40,154][155452] Updated weights for policy 0, policy_version 53520 (0.0006) [2023-03-07 08:45:40,949][155452] Updated weights for policy 0, policy_version 53530 (0.0006) [2023-03-07 08:45:41,722][155452] Updated weights for policy 0, policy_version 53540 (0.0006) [2023-03-07 08:45:42,509][155452] Updated weights for policy 0, policy_version 53550 (0.0007) [2023-03-07 08:45:43,305][155452] Updated weights for policy 0, policy_version 53560 (0.0006) [2023-03-07 08:45:43,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13027.4). Total num frames: 54845440. Throughput: 0: 13022.0. Samples: 54838931. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 08:45:43,367][155126] Avg episode reward: [(0, '2046.944')] [2023-03-07 08:45:44,105][155452] Updated weights for policy 0, policy_version 53570 (0.0007) [2023-03-07 08:45:44,898][155452] Updated weights for policy 0, policy_version 53580 (0.0006) [2023-03-07 08:45:45,677][155452] Updated weights for policy 0, policy_version 53590 (0.0006) [2023-03-07 08:45:46,470][155452] Updated weights for policy 0, policy_version 53600 (0.0006) [2023-03-07 08:45:47,252][155452] Updated weights for policy 0, policy_version 53610 (0.0005) [2023-03-07 08:45:48,049][155452] Updated weights for policy 0, policy_version 53620 (0.0006) [2023-03-07 08:45:48,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13030.8). Total num frames: 54910976. Throughput: 0: 13019.5. Samples: 54877765. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 08:45:48,367][155126] Avg episode reward: [(0, '1901.850')] [2023-03-07 08:45:48,825][155452] Updated weights for policy 0, policy_version 53630 (0.0007) [2023-03-07 08:45:49,605][155452] Updated weights for policy 0, policy_version 53640 (0.0007) [2023-03-07 08:45:50,402][155452] Updated weights for policy 0, policy_version 53650 (0.0007) [2023-03-07 08:45:51,188][155452] Updated weights for policy 0, policy_version 53660 (0.0006) [2023-03-07 08:45:51,968][155452] Updated weights for policy 0, policy_version 53670 (0.0006) [2023-03-07 08:45:52,777][155452] Updated weights for policy 0, policy_version 53680 (0.0006) [2023-03-07 08:45:53,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13027.4). Total num frames: 54975488. Throughput: 0: 13016.9. Samples: 54955731. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 08:45:53,367][155126] Avg episode reward: [(0, '2042.579')] [2023-03-07 08:45:53,549][155452] Updated weights for policy 0, policy_version 53690 (0.0005) [2023-03-07 08:45:54,354][155452] Updated weights for policy 0, policy_version 53700 (0.0006) [2023-03-07 08:45:55,118][155452] Updated weights for policy 0, policy_version 53710 (0.0006) [2023-03-07 08:45:55,899][155452] Updated weights for policy 0, policy_version 53720 (0.0005) [2023-03-07 08:45:56,696][155452] Updated weights for policy 0, policy_version 53730 (0.0005) [2023-03-07 08:45:57,465][155452] Updated weights for policy 0, policy_version 53740 (0.0006) [2023-03-07 08:45:58,250][155452] Updated weights for policy 0, policy_version 53750 (0.0005) [2023-03-07 08:45:58,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.8, 300 sec: 13030.8). Total num frames: 55041024. Throughput: 0: 13021.4. Samples: 55034022. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 08:45:58,368][155126] Avg episode reward: [(0, '1807.085')] [2023-03-07 08:45:59,050][155452] Updated weights for policy 0, policy_version 53760 (0.0006) [2023-03-07 08:45:59,837][155452] Updated weights for policy 0, policy_version 53770 (0.0006) [2023-03-07 08:46:00,600][155452] Updated weights for policy 0, policy_version 53780 (0.0006) [2023-03-07 08:46:01,386][155452] Updated weights for policy 0, policy_version 53790 (0.0006) [2023-03-07 08:46:02,164][155452] Updated weights for policy 0, policy_version 53800 (0.0006) [2023-03-07 08:46:02,947][155452] Updated weights for policy 0, policy_version 53810 (0.0007) [2023-03-07 08:46:03,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13039.0, 300 sec: 13030.8). Total num frames: 55106560. Throughput: 0: 13017.1. Samples: 55073165. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:46:03,367][155126] Avg episode reward: [(0, '1905.705')] [2023-03-07 08:46:03,746][155452] Updated weights for policy 0, policy_version 53820 (0.0006) [2023-03-07 08:46:04,519][155452] Updated weights for policy 0, policy_version 53830 (0.0007) [2023-03-07 08:46:05,301][155452] Updated weights for policy 0, policy_version 53840 (0.0007) [2023-03-07 08:46:06,083][155452] Updated weights for policy 0, policy_version 53850 (0.0006) [2023-03-07 08:46:06,871][155452] Updated weights for policy 0, policy_version 53860 (0.0007) [2023-03-07 08:46:07,663][155452] Updated weights for policy 0, policy_version 53870 (0.0007) [2023-03-07 08:46:08,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13021.9, 300 sec: 13027.4). Total num frames: 55171072. Throughput: 0: 13028.9. Samples: 55151639. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:46:08,378][155126] Avg episode reward: [(0, '1889.412')] [2023-03-07 08:46:08,446][155452] Updated weights for policy 0, policy_version 53880 (0.0006) [2023-03-07 08:46:09,241][155452] Updated weights for policy 0, policy_version 53890 (0.0006) [2023-03-07 08:46:10,020][155452] Updated weights for policy 0, policy_version 53900 (0.0007) [2023-03-07 08:46:10,808][155452] Updated weights for policy 0, policy_version 53910 (0.0006) [2023-03-07 08:46:11,606][155452] Updated weights for policy 0, policy_version 53920 (0.0006) [2023-03-07 08:46:12,389][155452] Updated weights for policy 0, policy_version 53930 (0.0006) [2023-03-07 08:46:13,149][155452] Updated weights for policy 0, policy_version 53940 (0.0006) [2023-03-07 08:46:13,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13021.8, 300 sec: 13027.4). Total num frames: 55236608. Throughput: 0: 13025.9. Samples: 55229602. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:46:13,368][155126] Avg episode reward: [(0, '1942.367')] [2023-03-07 08:46:13,950][155452] Updated weights for policy 0, policy_version 53950 (0.0006) [2023-03-07 08:46:14,730][155452] Updated weights for policy 0, policy_version 53960 (0.0006) [2023-03-07 08:46:15,506][155452] Updated weights for policy 0, policy_version 53970 (0.0006) [2023-03-07 08:46:16,294][155452] Updated weights for policy 0, policy_version 53980 (0.0007) [2023-03-07 08:46:17,072][155452] Updated weights for policy 0, policy_version 53990 (0.0006) [2023-03-07 08:46:17,862][155452] Updated weights for policy 0, policy_version 54000 (0.0005) [2023-03-07 08:46:18,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13021.8, 300 sec: 13030.8). Total num frames: 55302144. Throughput: 0: 13029.9. Samples: 55268925. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:46:18,367][155126] Avg episode reward: [(0, '2055.309')] [2023-03-07 08:46:18,644][155452] Updated weights for policy 0, policy_version 54010 (0.0006) [2023-03-07 08:46:19,431][155452] Updated weights for policy 0, policy_version 54020 (0.0006) [2023-03-07 08:46:20,203][155452] Updated weights for policy 0, policy_version 54030 (0.0006) [2023-03-07 08:46:20,994][155452] Updated weights for policy 0, policy_version 54040 (0.0006) [2023-03-07 08:46:21,769][155452] Updated weights for policy 0, policy_version 54050 (0.0006) [2023-03-07 08:46:22,571][155452] Updated weights for policy 0, policy_version 54060 (0.0006) [2023-03-07 08:46:23,342][155452] Updated weights for policy 0, policy_version 54070 (0.0006) [2023-03-07 08:46:23,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13038.9, 300 sec: 13030.8). Total num frames: 55367680. Throughput: 0: 13038.1. Samples: 55347371. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:46:23,367][155126] Avg episode reward: [(0, '2022.133')] [2023-03-07 08:46:24,132][155452] Updated weights for policy 0, policy_version 54080 (0.0006) [2023-03-07 08:46:24,911][155452] Updated weights for policy 0, policy_version 54090 (0.0006) [2023-03-07 08:46:25,682][155452] Updated weights for policy 0, policy_version 54100 (0.0006) [2023-03-07 08:46:26,479][155452] Updated weights for policy 0, policy_version 54110 (0.0005) [2023-03-07 08:46:27,263][155452] Updated weights for policy 0, policy_version 54120 (0.0007) [2023-03-07 08:46:28,049][155452] Updated weights for policy 0, policy_version 54130 (0.0006) [2023-03-07 08:46:28,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13039.0, 300 sec: 13034.3). Total num frames: 55433216. Throughput: 0: 13039.6. Samples: 55425711. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:46:28,367][155126] Avg episode reward: [(0, '2009.935')] [2023-03-07 08:46:28,829][155452] Updated weights for policy 0, policy_version 54140 (0.0006) [2023-03-07 08:46:29,631][155452] Updated weights for policy 0, policy_version 54150 (0.0006) [2023-03-07 08:46:30,429][155452] Updated weights for policy 0, policy_version 54160 (0.0006) [2023-03-07 08:46:31,222][155452] Updated weights for policy 0, policy_version 54170 (0.0006) [2023-03-07 08:46:32,013][155452] Updated weights for policy 0, policy_version 54180 (0.0006) [2023-03-07 08:46:32,804][155452] Updated weights for policy 0, policy_version 54190 (0.0006) [2023-03-07 08:46:33,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13030.8). Total num frames: 55497728. Throughput: 0: 13041.4. Samples: 55464626. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:46:33,367][155126] Avg episode reward: [(0, '1844.585')] [2023-03-07 08:46:33,600][155452] Updated weights for policy 0, policy_version 54200 (0.0006) [2023-03-07 08:46:34,374][155452] Updated weights for policy 0, policy_version 54210 (0.0006) [2023-03-07 08:46:35,176][155452] Updated weights for policy 0, policy_version 54220 (0.0007) [2023-03-07 08:46:35,947][155452] Updated weights for policy 0, policy_version 54230 (0.0006) [2023-03-07 08:46:36,725][155452] Updated weights for policy 0, policy_version 54240 (0.0006) [2023-03-07 08:46:37,511][155452] Updated weights for policy 0, policy_version 54250 (0.0006) [2023-03-07 08:46:38,280][155452] Updated weights for policy 0, policy_version 54260 (0.0006) [2023-03-07 08:46:38,367][155126] Fps is (10 sec: 12902.3, 60 sec: 13021.9, 300 sec: 13030.8). Total num frames: 55562240. Throughput: 0: 13041.0. Samples: 55542575. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:46:38,367][155126] Avg episode reward: [(0, '1739.992')] [2023-03-07 08:46:39,090][155452] Updated weights for policy 0, policy_version 54270 (0.0006) [2023-03-07 08:46:39,878][155452] Updated weights for policy 0, policy_version 54280 (0.0006) [2023-03-07 08:46:40,662][155452] Updated weights for policy 0, policy_version 54290 (0.0007) [2023-03-07 08:46:41,445][155452] Updated weights for policy 0, policy_version 54300 (0.0006) [2023-03-07 08:46:42,222][155452] Updated weights for policy 0, policy_version 54310 (0.0006) [2023-03-07 08:46:43,009][155452] Updated weights for policy 0, policy_version 54320 (0.0006) [2023-03-07 08:46:43,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13030.8). Total num frames: 55627776. Throughput: 0: 13041.1. Samples: 55620871. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:46:43,368][155126] Avg episode reward: [(0, '1978.055')] [2023-03-07 08:46:43,791][155452] Updated weights for policy 0, policy_version 54330 (0.0005) [2023-03-07 08:46:44,578][155452] Updated weights for policy 0, policy_version 54340 (0.0007) [2023-03-07 08:46:45,375][155452] Updated weights for policy 0, policy_version 54350 (0.0007) [2023-03-07 08:46:46,144][155452] Updated weights for policy 0, policy_version 54360 (0.0006) [2023-03-07 08:46:46,922][155452] Updated weights for policy 0, policy_version 54370 (0.0006) [2023-03-07 08:46:47,713][155452] Updated weights for policy 0, policy_version 54380 (0.0006) [2023-03-07 08:46:48,367][155126] Fps is (10 sec: 13107.0, 60 sec: 13038.9, 300 sec: 13030.8). Total num frames: 55693312. Throughput: 0: 13038.5. Samples: 55659900. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:46:48,367][155126] Avg episode reward: [(0, '2003.860')] [2023-03-07 08:46:48,497][155452] Updated weights for policy 0, policy_version 54390 (0.0007) [2023-03-07 08:46:49,276][155452] Updated weights for policy 0, policy_version 54400 (0.0006) [2023-03-07 08:46:50,073][155452] Updated weights for policy 0, policy_version 54410 (0.0006) [2023-03-07 08:46:50,856][155452] Updated weights for policy 0, policy_version 54420 (0.0006) [2023-03-07 08:46:51,641][155452] Updated weights for policy 0, policy_version 54430 (0.0006) [2023-03-07 08:46:52,403][155452] Updated weights for policy 0, policy_version 54440 (0.0006) [2023-03-07 08:46:53,181][155452] Updated weights for policy 0, policy_version 54450 (0.0006) [2023-03-07 08:46:53,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13034.3). Total num frames: 55758848. Throughput: 0: 13040.0. Samples: 55738438. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:46:53,367][155126] Avg episode reward: [(0, '1984.097')] [2023-03-07 08:46:53,954][155452] Updated weights for policy 0, policy_version 54460 (0.0006) [2023-03-07 08:46:54,740][155452] Updated weights for policy 0, policy_version 54470 (0.0006) [2023-03-07 08:46:55,517][155452] Updated weights for policy 0, policy_version 54480 (0.0006) [2023-03-07 08:46:56,328][155452] Updated weights for policy 0, policy_version 54490 (0.0007) [2023-03-07 08:46:57,105][155452] Updated weights for policy 0, policy_version 54500 (0.0006) [2023-03-07 08:46:57,885][155452] Updated weights for policy 0, policy_version 54510 (0.0006) [2023-03-07 08:46:58,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13034.3). Total num frames: 55824384. Throughput: 0: 13049.9. Samples: 55816846. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:46:58,367][155126] Avg episode reward: [(0, '2127.538')] [2023-03-07 08:46:58,699][155452] Updated weights for policy 0, policy_version 54520 (0.0005) [2023-03-07 08:46:59,484][155452] Updated weights for policy 0, policy_version 54530 (0.0008) [2023-03-07 08:47:00,262][155452] Updated weights for policy 0, policy_version 54540 (0.0006) [2023-03-07 08:47:01,047][155452] Updated weights for policy 0, policy_version 54550 (0.0006) [2023-03-07 08:47:01,827][155452] Updated weights for policy 0, policy_version 54560 (0.0006) [2023-03-07 08:47:02,601][155452] Updated weights for policy 0, policy_version 54570 (0.0006) [2023-03-07 08:47:03,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13030.8). Total num frames: 55888896. Throughput: 0: 13043.4. Samples: 55855879. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:47:03,367][155126] Avg episode reward: [(0, '2062.833')] [2023-03-07 08:47:03,385][155452] Updated weights for policy 0, policy_version 54580 (0.0007) [2023-03-07 08:47:04,166][155452] Updated weights for policy 0, policy_version 54590 (0.0006) [2023-03-07 08:47:04,960][155452] Updated weights for policy 0, policy_version 54600 (0.0006) [2023-03-07 08:47:05,762][155452] Updated weights for policy 0, policy_version 54610 (0.0006) [2023-03-07 08:47:06,539][155452] Updated weights for policy 0, policy_version 54620 (0.0006) [2023-03-07 08:47:07,327][155452] Updated weights for policy 0, policy_version 54630 (0.0006) [2023-03-07 08:47:08,113][155452] Updated weights for policy 0, policy_version 54640 (0.0006) [2023-03-07 08:47:08,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13030.8). Total num frames: 55954432. Throughput: 0: 13040.6. Samples: 55934197. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:47:08,367][155126] Avg episode reward: [(0, '2109.311')] [2023-03-07 08:47:08,908][155452] Updated weights for policy 0, policy_version 54650 (0.0007) [2023-03-07 08:47:09,684][155452] Updated weights for policy 0, policy_version 54660 (0.0006) [2023-03-07 08:47:10,462][155452] Updated weights for policy 0, policy_version 54670 (0.0007) [2023-03-07 08:47:11,229][155452] Updated weights for policy 0, policy_version 54680 (0.0006) [2023-03-07 08:47:12,016][155452] Updated weights for policy 0, policy_version 54690 (0.0007) [2023-03-07 08:47:12,790][155452] Updated weights for policy 0, policy_version 54700 (0.0006) [2023-03-07 08:47:13,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13030.8). Total num frames: 56019968. Throughput: 0: 13048.8. Samples: 56012909. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:47:13,367][155126] Avg episode reward: [(0, '1877.975')] [2023-03-07 08:47:13,563][155452] Updated weights for policy 0, policy_version 54710 (0.0006) [2023-03-07 08:47:14,365][155452] Updated weights for policy 0, policy_version 54720 (0.0006) [2023-03-07 08:47:15,132][155452] Updated weights for policy 0, policy_version 54730 (0.0007) [2023-03-07 08:47:15,923][155452] Updated weights for policy 0, policy_version 54740 (0.0005) [2023-03-07 08:47:16,702][155452] Updated weights for policy 0, policy_version 54750 (0.0006) [2023-03-07 08:47:17,492][155452] Updated weights for policy 0, policy_version 54760 (0.0006) [2023-03-07 08:47:18,280][155452] Updated weights for policy 0, policy_version 54770 (0.0005) [2023-03-07 08:47:18,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13034.3). Total num frames: 56085504. Throughput: 0: 13056.7. Samples: 56052176. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:47:18,367][155126] Avg episode reward: [(0, '2079.026')] [2023-03-07 08:47:19,064][155452] Updated weights for policy 0, policy_version 54780 (0.0006) [2023-03-07 08:47:19,858][155452] Updated weights for policy 0, policy_version 54790 (0.0006) [2023-03-07 08:47:20,654][155452] Updated weights for policy 0, policy_version 54800 (0.0006) [2023-03-07 08:47:21,433][155452] Updated weights for policy 0, policy_version 54810 (0.0006) [2023-03-07 08:47:22,224][155452] Updated weights for policy 0, policy_version 54820 (0.0006) [2023-03-07 08:47:23,009][155452] Updated weights for policy 0, policy_version 54830 (0.0006) [2023-03-07 08:47:23,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13030.8). Total num frames: 56150016. Throughput: 0: 13060.9. Samples: 56130316. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:47:23,367][155126] Avg episode reward: [(0, '2006.208')] [2023-03-07 08:47:23,784][155452] Updated weights for policy 0, policy_version 54840 (0.0007) [2023-03-07 08:47:24,575][155452] Updated weights for policy 0, policy_version 54850 (0.0006) [2023-03-07 08:47:25,360][155452] Updated weights for policy 0, policy_version 54860 (0.0006) [2023-03-07 08:47:26,131][155452] Updated weights for policy 0, policy_version 54870 (0.0006) [2023-03-07 08:47:26,921][155452] Updated weights for policy 0, policy_version 54880 (0.0007) [2023-03-07 08:47:27,717][155452] Updated weights for policy 0, policy_version 54890 (0.0006) [2023-03-07 08:47:28,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 56215552. Throughput: 0: 13054.1. Samples: 56208305. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:47:28,368][155126] Avg episode reward: [(0, '1863.511')] [2023-03-07 08:47:28,372][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000054898_56215552.pth... 
[2023-03-07 08:47:28,404][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000051845_53089280.pth [2023-03-07 08:47:28,500][155452] Updated weights for policy 0, policy_version 54900 (0.0006) [2023-03-07 08:47:29,296][155452] Updated weights for policy 0, policy_version 54910 (0.0007) [2023-03-07 08:47:30,086][155452] Updated weights for policy 0, policy_version 54920 (0.0006) [2023-03-07 08:47:30,865][155452] Updated weights for policy 0, policy_version 54930 (0.0006) [2023-03-07 08:47:31,656][155452] Updated weights for policy 0, policy_version 54940 (0.0007) [2023-03-07 08:47:32,449][155452] Updated weights for policy 0, policy_version 54950 (0.0007) [2023-03-07 08:47:33,225][155452] Updated weights for policy 0, policy_version 54960 (0.0006) [2023-03-07 08:47:33,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13030.8). Total num frames: 56280064. Throughput: 0: 13058.2. Samples: 56247521. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:47:33,368][155126] Avg episode reward: [(0, '2113.327')] [2023-03-07 08:47:34,013][155452] Updated weights for policy 0, policy_version 54970 (0.0007) [2023-03-07 08:47:34,802][155452] Updated weights for policy 0, policy_version 54980 (0.0006) [2023-03-07 08:47:35,587][155452] Updated weights for policy 0, policy_version 54990 (0.0006) [2023-03-07 08:47:36,358][155452] Updated weights for policy 0, policy_version 55000 (0.0006) [2023-03-07 08:47:37,134][155452] Updated weights for policy 0, policy_version 55010 (0.0006) [2023-03-07 08:47:37,913][155452] Updated weights for policy 0, policy_version 55020 (0.0007) [2023-03-07 08:47:38,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13056.0, 300 sec: 13030.8). Total num frames: 56345600. Throughput: 0: 13049.9. Samples: 56325685. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:47:38,367][155126] Avg episode reward: [(0, '1898.962')] [2023-03-07 08:47:38,699][155452] Updated weights for policy 0, policy_version 55030 (0.0006) [2023-03-07 08:47:39,485][155452] Updated weights for policy 0, policy_version 55040 (0.0007) [2023-03-07 08:47:40,282][155452] Updated weights for policy 0, policy_version 55050 (0.0007) [2023-03-07 08:47:41,069][155452] Updated weights for policy 0, policy_version 55060 (0.0006) [2023-03-07 08:47:41,859][155452] Updated weights for policy 0, policy_version 55070 (0.0006) [2023-03-07 08:47:42,650][155452] Updated weights for policy 0, policy_version 55080 (0.0007) [2023-03-07 08:47:43,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13030.8). Total num frames: 56411136. Throughput: 0: 13045.4. Samples: 56403892. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:47:43,368][155126] Avg episode reward: [(0, '2187.843')] [2023-03-07 08:47:43,416][155452] Updated weights for policy 0, policy_version 55090 (0.0006) [2023-03-07 08:47:44,210][155452] Updated weights for policy 0, policy_version 55100 (0.0006) [2023-03-07 08:47:45,009][155452] Updated weights for policy 0, policy_version 55110 (0.0006) [2023-03-07 08:47:45,794][155452] Updated weights for policy 0, policy_version 55120 (0.0006) [2023-03-07 08:47:46,584][155452] Updated weights for policy 0, policy_version 55130 (0.0006) [2023-03-07 08:47:47,354][155452] Updated weights for policy 0, policy_version 55140 (0.0008) [2023-03-07 08:47:48,142][155452] Updated weights for policy 0, policy_version 55150 (0.0007) [2023-03-07 08:47:48,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13027.4). Total num frames: 56475648. Throughput: 0: 13045.0. Samples: 56442905. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:47:48,367][155126] Avg episode reward: [(0, '1872.534')] [2023-03-07 08:47:48,924][155452] Updated weights for policy 0, policy_version 55160 (0.0006) [2023-03-07 08:47:49,717][155452] Updated weights for policy 0, policy_version 55170 (0.0006) [2023-03-07 08:47:50,506][155452] Updated weights for policy 0, policy_version 55180 (0.0007) [2023-03-07 08:47:51,293][155452] Updated weights for policy 0, policy_version 55190 (0.0007) [2023-03-07 08:47:52,075][155452] Updated weights for policy 0, policy_version 55200 (0.0006) [2023-03-07 08:47:52,853][155452] Updated weights for policy 0, policy_version 55210 (0.0006) [2023-03-07 08:47:53,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13038.9, 300 sec: 13030.8). Total num frames: 56541184. Throughput: 0: 13044.9. Samples: 56521216. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:47:53,367][155126] Avg episode reward: [(0, '2020.629')] [2023-03-07 08:47:53,639][155452] Updated weights for policy 0, policy_version 55220 (0.0006) [2023-03-07 08:47:53,718][155401] KL-divergence is very high: 1984.1348 [2023-03-07 08:47:53,876][155401] KL-divergence is very high: 113.3578 [2023-03-07 08:47:54,431][155452] Updated weights for policy 0, policy_version 55230 (0.0006) [2023-03-07 08:47:55,215][155452] Updated weights for policy 0, policy_version 55240 (0.0006) [2023-03-07 08:47:55,999][155452] Updated weights for policy 0, policy_version 55250 (0.0006) [2023-03-07 08:47:56,779][155452] Updated weights for policy 0, policy_version 55260 (0.0006) [2023-03-07 08:47:57,565][155452] Updated weights for policy 0, policy_version 55270 (0.0007) [2023-03-07 08:47:58,354][155452] Updated weights for policy 0, policy_version 55280 (0.0006) [2023-03-07 08:47:58,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13038.9, 300 sec: 13030.8). Total num frames: 56606720. Throughput: 0: 13034.9. Samples: 56599479. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:47:58,367][155126] Avg episode reward: [(0, '1924.596')] [2023-03-07 08:47:59,143][155452] Updated weights for policy 0, policy_version 55290 (0.0007) [2023-03-07 08:47:59,932][155452] Updated weights for policy 0, policy_version 55300 (0.0006) [2023-03-07 08:48:00,723][155452] Updated weights for policy 0, policy_version 55310 (0.0006) [2023-03-07 08:48:01,525][155452] Updated weights for policy 0, policy_version 55320 (0.0007) [2023-03-07 08:48:02,297][155452] Updated weights for policy 0, policy_version 55330 (0.0006) [2023-03-07 08:48:03,090][155452] Updated weights for policy 0, policy_version 55340 (0.0006) [2023-03-07 08:48:03,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13027.4). Total num frames: 56671232. Throughput: 0: 13029.7. Samples: 56638514. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:48:03,367][155126] Avg episode reward: [(0, '2045.073')] [2023-03-07 08:48:03,864][155452] Updated weights for policy 0, policy_version 55350 (0.0006) [2023-03-07 08:48:04,651][155452] Updated weights for policy 0, policy_version 55360 (0.0007) [2023-03-07 08:48:05,432][155452] Updated weights for policy 0, policy_version 55370 (0.0006) [2023-03-07 08:48:06,221][155452] Updated weights for policy 0, policy_version 55380 (0.0005) [2023-03-07 08:48:07,025][155452] Updated weights for policy 0, policy_version 55390 (0.0006) [2023-03-07 08:48:07,809][155452] Updated weights for policy 0, policy_version 55400 (0.0006) [2023-03-07 08:48:08,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13038.9, 300 sec: 13027.4). Total num frames: 56736768. Throughput: 0: 13027.2. Samples: 56716542. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:48:08,368][155126] Avg episode reward: [(0, '1977.124')] [2023-03-07 08:48:08,605][155452] Updated weights for policy 0, policy_version 55410 (0.0007) [2023-03-07 08:48:09,393][155452] Updated weights for policy 0, policy_version 55420 (0.0006) [2023-03-07 08:48:10,174][155452] Updated weights for policy 0, policy_version 55430 (0.0006) [2023-03-07 08:48:10,970][155452] Updated weights for policy 0, policy_version 55440 (0.0006) [2023-03-07 08:48:11,768][155452] Updated weights for policy 0, policy_version 55450 (0.0006) [2023-03-07 08:48:12,540][155452] Updated weights for policy 0, policy_version 55460 (0.0006) [2023-03-07 08:48:13,322][155452] Updated weights for policy 0, policy_version 55470 (0.0006) [2023-03-07 08:48:13,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13027.4). Total num frames: 56801280. Throughput: 0: 13024.9. Samples: 56794426. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:48:13,368][155126] Avg episode reward: [(0, '2087.926')] [2023-03-07 08:48:14,102][155452] Updated weights for policy 0, policy_version 55480 (0.0005) [2023-03-07 08:48:14,887][155452] Updated weights for policy 0, policy_version 55490 (0.0007) [2023-03-07 08:48:15,656][155452] Updated weights for policy 0, policy_version 55500 (0.0006) [2023-03-07 08:48:16,457][155452] Updated weights for policy 0, policy_version 55510 (0.0006) [2023-03-07 08:48:17,246][155452] Updated weights for policy 0, policy_version 55520 (0.0006) [2023-03-07 08:48:18,025][155452] Updated weights for policy 0, policy_version 55530 (0.0006) [2023-03-07 08:48:18,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.8, 300 sec: 13027.4). Total num frames: 56866816. Throughput: 0: 13024.9. Samples: 56833642. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:48:18,368][155126] Avg episode reward: [(0, '1880.418')] [2023-03-07 08:48:18,827][155452] Updated weights for policy 0, policy_version 55540 (0.0006) [2023-03-07 08:48:19,616][155452] Updated weights for policy 0, policy_version 55550 (0.0006) [2023-03-07 08:48:20,431][155452] Updated weights for policy 0, policy_version 55560 (0.0006) [2023-03-07 08:48:21,193][155452] Updated weights for policy 0, policy_version 55570 (0.0006) [2023-03-07 08:48:21,994][155452] Updated weights for policy 0, policy_version 55580 (0.0006) [2023-03-07 08:48:22,770][155452] Updated weights for policy 0, policy_version 55590 (0.0006) [2023-03-07 08:48:23,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13027.4). Total num frames: 56931328. Throughput: 0: 13015.9. Samples: 56911401. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:48:23,367][155126] Avg episode reward: [(0, '1979.844')] [2023-03-07 08:48:23,556][155452] Updated weights for policy 0, policy_version 55600 (0.0006) [2023-03-07 08:48:24,344][155452] Updated weights for policy 0, policy_version 55610 (0.0006) [2023-03-07 08:48:25,126][155452] Updated weights for policy 0, policy_version 55620 (0.0005) [2023-03-07 08:48:25,927][155452] Updated weights for policy 0, policy_version 55630 (0.0006) [2023-03-07 08:48:26,726][155452] Updated weights for policy 0, policy_version 55640 (0.0006) [2023-03-07 08:48:27,497][155452] Updated weights for policy 0, policy_version 55650 (0.0006) [2023-03-07 08:48:28,281][155452] Updated weights for policy 0, policy_version 55660 (0.0006) [2023-03-07 08:48:28,367][155126] Fps is (10 sec: 12902.6, 60 sec: 13004.8, 300 sec: 13023.9). Total num frames: 56995840. Throughput: 0: 13012.2. Samples: 56989442. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:48:28,367][155126] Avg episode reward: [(0, '1999.310')] [2023-03-07 08:48:29,083][155452] Updated weights for policy 0, policy_version 55670 (0.0007) [2023-03-07 08:48:29,861][155452] Updated weights for policy 0, policy_version 55680 (0.0005) [2023-03-07 08:48:30,660][155452] Updated weights for policy 0, policy_version 55690 (0.0007) [2023-03-07 08:48:31,429][155452] Updated weights for policy 0, policy_version 55700 (0.0006) [2023-03-07 08:48:32,198][155452] Updated weights for policy 0, policy_version 55710 (0.0006) [2023-03-07 08:48:33,010][155452] Updated weights for policy 0, policy_version 55720 (0.0006) [2023-03-07 08:48:33,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13023.9). Total num frames: 57061376. Throughput: 0: 13012.2. Samples: 57028456. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:48:33,367][155126] Avg episode reward: [(0, '2015.489')] [2023-03-07 08:48:33,807][155452] Updated weights for policy 0, policy_version 55730 (0.0006) [2023-03-07 08:48:34,574][155452] Updated weights for policy 0, policy_version 55740 (0.0006) [2023-03-07 08:48:35,354][155452] Updated weights for policy 0, policy_version 55750 (0.0006) [2023-03-07 08:48:36,157][155452] Updated weights for policy 0, policy_version 55760 (0.0007) [2023-03-07 08:48:36,934][155452] Updated weights for policy 0, policy_version 55770 (0.0006) [2023-03-07 08:48:37,713][155452] Updated weights for policy 0, policy_version 55780 (0.0006) [2023-03-07 08:48:38,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13021.9, 300 sec: 13027.4). Total num frames: 57126912. Throughput: 0: 13009.5. Samples: 57106646. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:48:38,367][155126] Avg episode reward: [(0, '2030.440')] [2023-03-07 08:48:38,494][155452] Updated weights for policy 0, policy_version 55790 (0.0007) [2023-03-07 08:48:39,282][155452] Updated weights for policy 0, policy_version 55800 (0.0007) [2023-03-07 08:48:40,077][155452] Updated weights for policy 0, policy_version 55810 (0.0006) [2023-03-07 08:48:40,857][155452] Updated weights for policy 0, policy_version 55820 (0.0006) [2023-03-07 08:48:41,646][155452] Updated weights for policy 0, policy_version 55830 (0.0006) [2023-03-07 08:48:42,430][155452] Updated weights for policy 0, policy_version 55840 (0.0007) [2023-03-07 08:48:43,233][155452] Updated weights for policy 0, policy_version 55850 (0.0006) [2023-03-07 08:48:43,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13004.8, 300 sec: 13027.4). Total num frames: 57191424. Throughput: 0: 13011.5. Samples: 57184997. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:48:43,367][155126] Avg episode reward: [(0, '1924.503')] [2023-03-07 08:48:44,001][155452] Updated weights for policy 0, policy_version 55860 (0.0006) [2023-03-07 08:48:44,774][155452] Updated weights for policy 0, policy_version 55870 (0.0006) [2023-03-07 08:48:45,548][155452] Updated weights for policy 0, policy_version 55880 (0.0007) [2023-03-07 08:48:46,350][155452] Updated weights for policy 0, policy_version 55890 (0.0006) [2023-03-07 08:48:47,138][155452] Updated weights for policy 0, policy_version 55900 (0.0006) [2023-03-07 08:48:47,917][155452] Updated weights for policy 0, policy_version 55910 (0.0006) [2023-03-07 08:48:48,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13027.4). Total num frames: 57256960. Throughput: 0: 13010.3. Samples: 57223978. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:48:48,367][155126] Avg episode reward: [(0, '1730.610')] [2023-03-07 08:48:48,707][155452] Updated weights for policy 0, policy_version 55920 (0.0006) [2023-03-07 08:48:49,503][155452] Updated weights for policy 0, policy_version 55930 (0.0006) [2023-03-07 08:48:50,297][155452] Updated weights for policy 0, policy_version 55940 (0.0006) [2023-03-07 08:48:51,064][155401] KL-divergence is very high: 429.9638 [2023-03-07 08:48:51,069][155452] Updated weights for policy 0, policy_version 55950 (0.0006) [2023-03-07 08:48:51,872][155452] Updated weights for policy 0, policy_version 55960 (0.0007) [2023-03-07 08:48:52,628][155452] Updated weights for policy 0, policy_version 55970 (0.0006) [2023-03-07 08:48:53,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13021.8, 300 sec: 13027.4). Total num frames: 57322496. Throughput: 0: 13015.3. Samples: 57302230. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:48:53,367][155126] Avg episode reward: [(0, '1717.838')] [2023-03-07 08:48:53,422][155452] Updated weights for policy 0, policy_version 55980 (0.0006) [2023-03-07 08:48:54,213][155452] Updated weights for policy 0, policy_version 55990 (0.0006) [2023-03-07 08:48:54,982][155452] Updated weights for policy 0, policy_version 56000 (0.0007) [2023-03-07 08:48:55,769][155452] Updated weights for policy 0, policy_version 56010 (0.0005) [2023-03-07 08:48:56,571][155452] Updated weights for policy 0, policy_version 56020 (0.0006) [2023-03-07 08:48:57,330][155452] Updated weights for policy 0, policy_version 56030 (0.0006) [2023-03-07 08:48:58,126][155452] Updated weights for policy 0, policy_version 56040 (0.0006) [2023-03-07 08:48:58,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13021.9, 300 sec: 13030.8). Total num frames: 57388032. Throughput: 0: 13027.5. Samples: 57380664. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:48:58,367][155126] Avg episode reward: [(0, '1434.996')] [2023-03-07 08:48:58,919][155452] Updated weights for policy 0, policy_version 56050 (0.0007) [2023-03-07 08:48:59,698][155452] Updated weights for policy 0, policy_version 56060 (0.0006) [2023-03-07 08:49:00,492][155452] Updated weights for policy 0, policy_version 56070 (0.0007) [2023-03-07 08:49:01,271][155452] Updated weights for policy 0, policy_version 56080 (0.0006) [2023-03-07 08:49:02,034][155452] Updated weights for policy 0, policy_version 56090 (0.0006) [2023-03-07 08:49:02,827][155452] Updated weights for policy 0, policy_version 56100 (0.0006) [2023-03-07 08:49:03,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13027.4). Total num frames: 57452544. Throughput: 0: 13021.6. Samples: 57419613. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:49:03,367][155126] Avg episode reward: [(0, '1659.023')] [2023-03-07 08:49:03,613][155452] Updated weights for policy 0, policy_version 56110 (0.0006) [2023-03-07 08:49:04,395][155452] Updated weights for policy 0, policy_version 56120 (0.0006) [2023-03-07 08:49:05,191][155452] Updated weights for policy 0, policy_version 56130 (0.0007) [2023-03-07 08:49:05,976][155452] Updated weights for policy 0, policy_version 56140 (0.0006) [2023-03-07 08:49:06,767][155452] Updated weights for policy 0, policy_version 56150 (0.0006) [2023-03-07 08:49:07,541][155452] Updated weights for policy 0, policy_version 56160 (0.0006) [2023-03-07 08:49:08,320][155452] Updated weights for policy 0, policy_version 56170 (0.0006) [2023-03-07 08:49:08,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13027.4). Total num frames: 57518080. Throughput: 0: 13036.9. Samples: 57498060. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:49:08,367][155126] Avg episode reward: [(0, '1761.969')] [2023-03-07 08:49:09,095][155452] Updated weights for policy 0, policy_version 56180 (0.0006) [2023-03-07 08:49:09,879][155452] Updated weights for policy 0, policy_version 56190 (0.0006) [2023-03-07 08:49:10,662][155452] Updated weights for policy 0, policy_version 56200 (0.0006) [2023-03-07 08:49:11,447][155452] Updated weights for policy 0, policy_version 56210 (0.0006) [2023-03-07 08:49:12,245][155452] Updated weights for policy 0, policy_version 56220 (0.0006) [2023-03-07 08:49:13,025][155452] Updated weights for policy 0, policy_version 56230 (0.0006) [2023-03-07 08:49:13,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13039.0, 300 sec: 13030.8). Total num frames: 57583616. Throughput: 0: 13048.2. Samples: 57576609. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:49:13,367][155126] Avg episode reward: [(0, '1592.131')] [2023-03-07 08:49:13,798][155452] Updated weights for policy 0, policy_version 56240 (0.0006) [2023-03-07 08:49:14,588][155452] Updated weights for policy 0, policy_version 56250 (0.0006) [2023-03-07 08:49:15,373][155452] Updated weights for policy 0, policy_version 56260 (0.0006) [2023-03-07 08:49:16,154][155452] Updated weights for policy 0, policy_version 56270 (0.0006) [2023-03-07 08:49:16,954][155452] Updated weights for policy 0, policy_version 56280 (0.0006) [2023-03-07 08:49:17,733][155452] Updated weights for policy 0, policy_version 56290 (0.0006) [2023-03-07 08:49:18,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13027.4). Total num frames: 57648128. Throughput: 0: 13046.7. Samples: 57615558. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:49:18,367][155126] Avg episode reward: [(0, '1673.263')] [2023-03-07 08:49:18,523][155452] Updated weights for policy 0, policy_version 56300 (0.0006) [2023-03-07 08:49:19,310][155452] Updated weights for policy 0, policy_version 56310 (0.0007) [2023-03-07 08:49:20,091][155452] Updated weights for policy 0, policy_version 56320 (0.0005) [2023-03-07 08:49:20,887][155452] Updated weights for policy 0, policy_version 56330 (0.0006) [2023-03-07 08:49:21,677][155452] Updated weights for policy 0, policy_version 56340 (0.0006) [2023-03-07 08:49:22,469][155452] Updated weights for policy 0, policy_version 56350 (0.0006) [2023-03-07 08:49:22,850][155401] KL-divergence is very high: 108.5044 [2023-03-07 08:49:23,240][155452] Updated weights for policy 0, policy_version 56360 (0.0006) [2023-03-07 08:49:23,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13030.8). Total num frames: 57713664. Throughput: 0: 13046.1. Samples: 57693723. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:49:23,367][155126] Avg episode reward: [(0, '1795.397')] [2023-03-07 08:49:24,024][155452] Updated weights for policy 0, policy_version 56370 (0.0007) [2023-03-07 08:49:24,093][155401] KL-divergence is very high: 134.3198 [2023-03-07 08:49:24,810][155452] Updated weights for policy 0, policy_version 56380 (0.0006) [2023-03-07 08:49:25,578][155452] Updated weights for policy 0, policy_version 56390 (0.0006) [2023-03-07 08:49:26,371][155452] Updated weights for policy 0, policy_version 56400 (0.0006) [2023-03-07 08:49:26,832][155401] KL-divergence is very high: 124.5807 [2023-03-07 08:49:27,159][155452] Updated weights for policy 0, policy_version 56410 (0.0006) [2023-03-07 08:49:27,963][155452] Updated weights for policy 0, policy_version 56420 (0.0006) [2023-03-07 08:49:28,367][155126] Fps is (10 sec: 13107.0, 60 sec: 13056.0, 300 sec: 13034.3). Total num frames: 57779200. Throughput: 0: 13041.4. 
Samples: 57771862. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:49:28,368][155126] Avg episode reward: [(0, '1817.455')] [2023-03-07 08:49:28,373][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000056425_57779200.pth... [2023-03-07 08:49:28,406][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000053370_54650880.pth [2023-03-07 08:49:28,732][155452] Updated weights for policy 0, policy_version 56430 (0.0007) [2023-03-07 08:49:29,516][155452] Updated weights for policy 0, policy_version 56440 (0.0006) [2023-03-07 08:49:30,300][155452] Updated weights for policy 0, policy_version 56450 (0.0005) [2023-03-07 08:49:31,077][155452] Updated weights for policy 0, policy_version 56460 (0.0006) [2023-03-07 08:49:31,854][155452] Updated weights for policy 0, policy_version 56470 (0.0007) [2023-03-07 08:49:32,642][155452] Updated weights for policy 0, policy_version 56480 (0.0007) [2023-03-07 08:49:33,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13034.3). Total num frames: 57844736. Throughput: 0: 13050.5. Samples: 57811250. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:49:33,378][155126] Avg episode reward: [(0, '1772.631')] [2023-03-07 08:49:33,441][155452] Updated weights for policy 0, policy_version 56490 (0.0006) [2023-03-07 08:49:34,225][155452] Updated weights for policy 0, policy_version 56500 (0.0008) [2023-03-07 08:49:35,003][155452] Updated weights for policy 0, policy_version 56510 (0.0006) [2023-03-07 08:49:35,787][155452] Updated weights for policy 0, policy_version 56520 (0.0006) [2023-03-07 08:49:36,568][155452] Updated weights for policy 0, policy_version 56530 (0.0006) [2023-03-07 08:49:37,342][155452] Updated weights for policy 0, policy_version 56540 (0.0006) [2023-03-07 08:49:38,129][155452] Updated weights for policy 0, policy_version 56550 (0.0008) [2023-03-07 08:49:38,367][155126] Fps is (10 sec: 13107.5, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 57910272. Throughput: 0: 13057.6. Samples: 57889820. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:49:38,367][155126] Avg episode reward: [(0, '1772.468')] [2023-03-07 08:49:38,925][155452] Updated weights for policy 0, policy_version 56560 (0.0006) [2023-03-07 08:49:39,699][155452] Updated weights for policy 0, policy_version 56570 (0.0006) [2023-03-07 08:49:40,496][155452] Updated weights for policy 0, policy_version 56580 (0.0007) [2023-03-07 08:49:41,262][155452] Updated weights for policy 0, policy_version 56590 (0.0007) [2023-03-07 08:49:42,061][155452] Updated weights for policy 0, policy_version 56600 (0.0006) [2023-03-07 08:49:42,853][155452] Updated weights for policy 0, policy_version 56610 (0.0006) [2023-03-07 08:49:43,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13034.3). Total num frames: 57974784. Throughput: 0: 13051.8. Samples: 57967993. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:49:43,367][155126] Avg episode reward: [(0, '1627.550')] [2023-03-07 08:49:43,618][155452] Updated weights for policy 0, policy_version 56620 (0.0006) [2023-03-07 08:49:44,404][155452] Updated weights for policy 0, policy_version 56630 (0.0007) [2023-03-07 08:49:45,223][155452] Updated weights for policy 0, policy_version 56640 (0.0006) [2023-03-07 08:49:46,010][155452] Updated weights for policy 0, policy_version 56650 (0.0006) [2023-03-07 08:49:46,801][155452] Updated weights for policy 0, policy_version 56660 (0.0006) [2023-03-07 08:49:47,587][155452] Updated weights for policy 0, policy_version 56670 (0.0007) [2023-03-07 08:49:48,363][155452] Updated weights for policy 0, policy_version 56680 (0.0007) [2023-03-07 08:49:48,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13034.3). Total num frames: 58040320. Throughput: 0: 13047.7. Samples: 58006761. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:49:48,367][155126] Avg episode reward: [(0, '1821.318')] [2023-03-07 08:49:49,146][155452] Updated weights for policy 0, policy_version 56690 (0.0006) [2023-03-07 08:49:49,935][155452] Updated weights for policy 0, policy_version 56700 (0.0007) [2023-03-07 08:49:50,717][155452] Updated weights for policy 0, policy_version 56710 (0.0006) [2023-03-07 08:49:51,520][155452] Updated weights for policy 0, policy_version 56720 (0.0006) [2023-03-07 08:49:52,293][155452] Updated weights for policy 0, policy_version 56730 (0.0006) [2023-03-07 08:49:53,096][155452] Updated weights for policy 0, policy_version 56740 (0.0006) [2023-03-07 08:49:53,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 58104832. Throughput: 0: 13039.9. Samples: 58084857. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:49:53,367][155126] Avg episode reward: [(0, '1646.413')] [2023-03-07 08:49:53,882][155452] Updated weights for policy 0, policy_version 56750 (0.0006) [2023-03-07 08:49:54,663][155452] Updated weights for policy 0, policy_version 56760 (0.0006) [2023-03-07 08:49:55,433][155452] Updated weights for policy 0, policy_version 56770 (0.0006) [2023-03-07 08:49:56,218][155452] Updated weights for policy 0, policy_version 56780 (0.0006) [2023-03-07 08:49:56,997][155452] Updated weights for policy 0, policy_version 56790 (0.0007) [2023-03-07 08:49:57,782][155452] Updated weights for policy 0, policy_version 56800 (0.0006) [2023-03-07 08:49:58,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 58170368. Throughput: 0: 13038.6. Samples: 58163348. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:49:58,367][155126] Avg episode reward: [(0, '1747.856')] [2023-03-07 08:49:58,574][155452] Updated weights for policy 0, policy_version 56810 (0.0006) [2023-03-07 08:49:59,361][155452] Updated weights for policy 0, policy_version 56820 (0.0006) [2023-03-07 08:50:00,142][155452] Updated weights for policy 0, policy_version 56830 (0.0007) [2023-03-07 08:50:00,937][155452] Updated weights for policy 0, policy_version 56840 (0.0006) [2023-03-07 08:50:01,716][155452] Updated weights for policy 0, policy_version 56850 (0.0006) [2023-03-07 08:50:02,502][155452] Updated weights for policy 0, policy_version 56860 (0.0006) [2023-03-07 08:50:03,282][155452] Updated weights for policy 0, policy_version 56870 (0.0006) [2023-03-07 08:50:03,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13039.0, 300 sec: 13034.3). Total num frames: 58234880. Throughput: 0: 13040.2. Samples: 58202366. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:50:03,367][155126] Avg episode reward: [(0, '1708.214')] [2023-03-07 08:50:04,064][155452] Updated weights for policy 0, policy_version 56880 (0.0005) [2023-03-07 08:50:04,838][155452] Updated weights for policy 0, policy_version 56890 (0.0006) [2023-03-07 08:50:05,627][155452] Updated weights for policy 0, policy_version 56900 (0.0006) [2023-03-07 08:50:06,408][155452] Updated weights for policy 0, policy_version 56910 (0.0006) [2023-03-07 08:50:07,205][155452] Updated weights for policy 0, policy_version 56920 (0.0006) [2023-03-07 08:50:07,982][155452] Updated weights for policy 0, policy_version 56930 (0.0008) [2023-03-07 08:50:08,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 58300416. Throughput: 0: 13043.1. Samples: 58280661. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:50:08,367][155126] Avg episode reward: [(0, '1601.291')] [2023-03-07 08:50:08,789][155452] Updated weights for policy 0, policy_version 56940 (0.0006) [2023-03-07 08:50:09,556][155452] Updated weights for policy 0, policy_version 56950 (0.0006) [2023-03-07 08:50:10,347][155452] Updated weights for policy 0, policy_version 56960 (0.0007) [2023-03-07 08:50:11,121][155452] Updated weights for policy 0, policy_version 56970 (0.0006) [2023-03-07 08:50:11,894][155452] Updated weights for policy 0, policy_version 56980 (0.0006) [2023-03-07 08:50:12,699][155452] Updated weights for policy 0, policy_version 56990 (0.0007) [2023-03-07 08:50:13,367][155126] Fps is (10 sec: 13107.0, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 58365952. Throughput: 0: 13048.2. Samples: 58359030. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:50:13,368][155126] Avg episode reward: [(0, '1693.174')] [2023-03-07 08:50:13,497][155452] Updated weights for policy 0, policy_version 57000 (0.0006) [2023-03-07 08:50:14,279][155452] Updated weights for policy 0, policy_version 57010 (0.0006) [2023-03-07 08:50:15,065][155452] Updated weights for policy 0, policy_version 57020 (0.0006) [2023-03-07 08:50:15,833][155452] Updated weights for policy 0, policy_version 57030 (0.0006) [2023-03-07 08:50:16,615][155452] Updated weights for policy 0, policy_version 57040 (0.0006) [2023-03-07 08:50:17,410][155452] Updated weights for policy 0, policy_version 57050 (0.0007) [2023-03-07 08:50:18,202][155452] Updated weights for policy 0, policy_version 57060 (0.0006) [2023-03-07 08:50:18,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 58431488. Throughput: 0: 13041.3. Samples: 58398110. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:50:18,367][155126] Avg episode reward: [(0, '1252.107')] [2023-03-07 08:50:19,012][155452] Updated weights for policy 0, policy_version 57070 (0.0006) [2023-03-07 08:50:19,789][155452] Updated weights for policy 0, policy_version 57080 (0.0006) [2023-03-07 08:50:20,561][155452] Updated weights for policy 0, policy_version 57090 (0.0006) [2023-03-07 08:50:21,355][155452] Updated weights for policy 0, policy_version 57100 (0.0007) [2023-03-07 08:50:22,139][155452] Updated weights for policy 0, policy_version 57110 (0.0006) [2023-03-07 08:50:22,931][155452] Updated weights for policy 0, policy_version 57120 (0.0006) [2023-03-07 08:50:23,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13039.0, 300 sec: 13034.3). Total num frames: 58496000. Throughput: 0: 13028.8. Samples: 58476115. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:50:23,367][155126] Avg episode reward: [(0, '1326.855')] [2023-03-07 08:50:23,722][155452] Updated weights for policy 0, policy_version 57130 (0.0006) [2023-03-07 08:50:24,506][155452] Updated weights for policy 0, policy_version 57140 (0.0008) [2023-03-07 08:50:25,286][155452] Updated weights for policy 0, policy_version 57150 (0.0006) [2023-03-07 08:50:26,087][155452] Updated weights for policy 0, policy_version 57160 (0.0006) [2023-03-07 08:50:26,879][155452] Updated weights for policy 0, policy_version 57170 (0.0007) [2023-03-07 08:50:27,657][155452] Updated weights for policy 0, policy_version 57180 (0.0006) [2023-03-07 08:50:28,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13039.0, 300 sec: 13037.8). Total num frames: 58561536. Throughput: 0: 13029.9. Samples: 58554339. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:50:28,367][155126] Avg episode reward: [(0, '1394.193')] [2023-03-07 08:50:28,430][155452] Updated weights for policy 0, policy_version 57190 (0.0006) [2023-03-07 08:50:29,216][155452] Updated weights for policy 0, policy_version 57200 (0.0006) [2023-03-07 08:50:29,993][155452] Updated weights for policy 0, policy_version 57210 (0.0006) [2023-03-07 08:50:30,788][155452] Updated weights for policy 0, policy_version 57220 (0.0007) [2023-03-07 08:50:31,564][155452] Updated weights for policy 0, policy_version 57230 (0.0006) [2023-03-07 08:50:32,346][155452] Updated weights for policy 0, policy_version 57240 (0.0006) [2023-03-07 08:50:33,117][155452] Updated weights for policy 0, policy_version 57250 (0.0006) [2023-03-07 08:50:33,367][155126] Fps is (10 sec: 13107.0, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 58627072. Throughput: 0: 13039.0. Samples: 58593518. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:50:33,368][155126] Avg episode reward: [(0, '1447.657')] [2023-03-07 08:50:33,903][155452] Updated weights for policy 0, policy_version 57260 (0.0005) [2023-03-07 08:50:34,686][155452] Updated weights for policy 0, policy_version 57270 (0.0007) [2023-03-07 08:50:35,474][155452] Updated weights for policy 0, policy_version 57280 (0.0006) [2023-03-07 08:50:36,260][155452] Updated weights for policy 0, policy_version 57290 (0.0006) [2023-03-07 08:50:37,051][155452] Updated weights for policy 0, policy_version 57300 (0.0006) [2023-03-07 08:50:37,832][155452] Updated weights for policy 0, policy_version 57310 (0.0006) [2023-03-07 08:50:38,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13037.8). Total num frames: 58691584. Throughput: 0: 13044.2. Samples: 58671845. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:50:38,367][155126] Avg episode reward: [(0, '1418.336')] [2023-03-07 08:50:38,618][155452] Updated weights for policy 0, policy_version 57320 (0.0006) [2023-03-07 08:50:39,400][155452] Updated weights for policy 0, policy_version 57330 (0.0006) [2023-03-07 08:50:40,192][155452] Updated weights for policy 0, policy_version 57340 (0.0007) [2023-03-07 08:50:40,987][155452] Updated weights for policy 0, policy_version 57350 (0.0006) [2023-03-07 08:50:41,769][155452] Updated weights for policy 0, policy_version 57360 (0.0006) [2023-03-07 08:50:42,546][155452] Updated weights for policy 0, policy_version 57370 (0.0007) [2023-03-07 08:50:43,339][155452] Updated weights for policy 0, policy_version 57380 (0.0006) [2023-03-07 08:50:43,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 58757120. Throughput: 0: 13039.3. Samples: 58750115. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:50:43,367][155126] Avg episode reward: [(0, '1538.423')] [2023-03-07 08:50:44,123][155452] Updated weights for policy 0, policy_version 57390 (0.0006) [2023-03-07 08:50:44,906][155452] Updated weights for policy 0, policy_version 57400 (0.0006) [2023-03-07 08:50:45,685][155452] Updated weights for policy 0, policy_version 57410 (0.0006) [2023-03-07 08:50:46,469][155452] Updated weights for policy 0, policy_version 57420 (0.0007) [2023-03-07 08:50:47,249][155452] Updated weights for policy 0, policy_version 57430 (0.0006) [2023-03-07 08:50:48,026][155452] Updated weights for policy 0, policy_version 57440 (0.0008) [2023-03-07 08:50:48,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13041.2). Total num frames: 58822656. Throughput: 0: 13037.7. Samples: 58789063. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:50:48,367][155126] Avg episode reward: [(0, '1699.488')] [2023-03-07 08:50:48,839][155452] Updated weights for policy 0, policy_version 57450 (0.0006) [2023-03-07 08:50:49,625][155452] Updated weights for policy 0, policy_version 57460 (0.0006) [2023-03-07 08:50:50,392][155452] Updated weights for policy 0, policy_version 57470 (0.0006) [2023-03-07 08:50:51,188][155452] Updated weights for policy 0, policy_version 57480 (0.0006) [2023-03-07 08:50:51,973][155452] Updated weights for policy 0, policy_version 57490 (0.0006) [2023-03-07 08:50:52,757][155452] Updated weights for policy 0, policy_version 57500 (0.0006) [2023-03-07 08:50:53,367][155126] Fps is (10 sec: 13004.5, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 58887168. Throughput: 0: 13042.1. Samples: 58867559. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:50:53,368][155126] Avg episode reward: [(0, '1932.691')] [2023-03-07 08:50:53,538][155452] Updated weights for policy 0, policy_version 57510 (0.0006) [2023-03-07 08:50:54,311][155452] Updated weights for policy 0, policy_version 57520 (0.0006) [2023-03-07 08:50:55,110][155452] Updated weights for policy 0, policy_version 57530 (0.0006) [2023-03-07 08:50:55,871][155452] Updated weights for policy 0, policy_version 57540 (0.0006) [2023-03-07 08:50:56,654][155452] Updated weights for policy 0, policy_version 57550 (0.0006) [2023-03-07 08:50:57,443][155452] Updated weights for policy 0, policy_version 57560 (0.0007) [2023-03-07 08:50:58,226][155452] Updated weights for policy 0, policy_version 57570 (0.0006) [2023-03-07 08:50:58,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 58952704. Throughput: 0: 13042.3. Samples: 58945934. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:50:58,368][155126] Avg episode reward: [(0, '1595.994')] [2023-03-07 08:50:59,019][155452] Updated weights for policy 0, policy_version 57580 (0.0006) [2023-03-07 08:50:59,788][155452] Updated weights for policy 0, policy_version 57590 (0.0005) [2023-03-07 08:51:00,562][155452] Updated weights for policy 0, policy_version 57600 (0.0006) [2023-03-07 08:51:01,360][155452] Updated weights for policy 0, policy_version 57610 (0.0006) [2023-03-07 08:51:02,141][155452] Updated weights for policy 0, policy_version 57620 (0.0007) [2023-03-07 08:51:02,938][155452] Updated weights for policy 0, policy_version 57630 (0.0006) [2023-03-07 08:51:03,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13041.2). Total num frames: 59018240. Throughput: 0: 13050.2. Samples: 58985370. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:51:03,367][155126] Avg episode reward: [(0, '1604.447')] [2023-03-07 08:51:03,702][155452] Updated weights for policy 0, policy_version 57640 (0.0006) [2023-03-07 08:51:04,501][155452] Updated weights for policy 0, policy_version 57650 (0.0006) [2023-03-07 08:51:05,293][155452] Updated weights for policy 0, policy_version 57660 (0.0006) [2023-03-07 08:51:06,047][155452] Updated weights for policy 0, policy_version 57670 (0.0007) [2023-03-07 08:51:06,857][155452] Updated weights for policy 0, policy_version 57680 (0.0006) [2023-03-07 08:51:07,641][155452] Updated weights for policy 0, policy_version 57690 (0.0006) [2023-03-07 08:51:08,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13041.3). Total num frames: 59083776. Throughput: 0: 13054.3. Samples: 59063561. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:51:08,367][155126] Avg episode reward: [(0, '1443.757')] [2023-03-07 08:51:08,446][155452] Updated weights for policy 0, policy_version 57700 (0.0007) [2023-03-07 08:51:09,231][155452] Updated weights for policy 0, policy_version 57710 (0.0006) [2023-03-07 08:51:10,021][155452] Updated weights for policy 0, policy_version 57720 (0.0006) [2023-03-07 08:51:10,815][155452] Updated weights for policy 0, policy_version 57730 (0.0007) [2023-03-07 08:51:11,595][155452] Updated weights for policy 0, policy_version 57740 (0.0006) [2023-03-07 08:51:12,369][155452] Updated weights for policy 0, policy_version 57750 (0.0005) [2023-03-07 08:51:13,160][155452] Updated weights for policy 0, policy_version 57760 (0.0006) [2023-03-07 08:51:13,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 59148288. Throughput: 0: 13047.4. Samples: 59141475. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:51:13,368][155126] Avg episode reward: [(0, '1329.262')] [2023-03-07 08:51:13,937][155452] Updated weights for policy 0, policy_version 57770 (0.0006) [2023-03-07 08:51:14,727][155452] Updated weights for policy 0, policy_version 57780 (0.0006) [2023-03-07 08:51:15,513][155452] Updated weights for policy 0, policy_version 57790 (0.0007) [2023-03-07 08:51:16,284][155452] Updated weights for policy 0, policy_version 57800 (0.0006) [2023-03-07 08:51:17,070][155452] Updated weights for policy 0, policy_version 57810 (0.0006) [2023-03-07 08:51:17,850][155452] Updated weights for policy 0, policy_version 57820 (0.0006) [2023-03-07 08:51:18,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 59213824. Throughput: 0: 13048.7. Samples: 59180712. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:51:18,368][155126] Avg episode reward: [(0, '1527.660')] [2023-03-07 08:51:18,635][155452] Updated weights for policy 0, policy_version 57830 (0.0007) [2023-03-07 08:51:19,428][155452] Updated weights for policy 0, policy_version 57840 (0.0006) [2023-03-07 08:51:19,743][155401] KL-divergence is very high: 103.3727 [2023-03-07 08:51:19,983][155401] KL-divergence is very high: 319.8592 [2023-03-07 08:51:20,217][155452] Updated weights for policy 0, policy_version 57850 (0.0006) [2023-03-07 08:51:21,005][155452] Updated weights for policy 0, policy_version 57860 (0.0006) [2023-03-07 08:51:21,775][155452] Updated weights for policy 0, policy_version 57870 (0.0005) [2023-03-07 08:51:22,571][155452] Updated weights for policy 0, policy_version 57880 (0.0006) [2023-03-07 08:51:23,360][155452] Updated weights for policy 0, policy_version 57890 (0.0007) [2023-03-07 08:51:23,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 59279360. Throughput: 0: 13046.5. Samples: 59258937. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:51:23,367][155126] Avg episode reward: [(0, '1421.877')] [2023-03-07 08:51:24,152][155452] Updated weights for policy 0, policy_version 57900 (0.0005) [2023-03-07 08:51:24,945][155452] Updated weights for policy 0, policy_version 57910 (0.0006) [2023-03-07 08:51:25,730][155452] Updated weights for policy 0, policy_version 57920 (0.0006) [2023-03-07 08:51:26,501][155452] Updated weights for policy 0, policy_version 57930 (0.0006) [2023-03-07 08:51:27,307][155452] Updated weights for policy 0, policy_version 57940 (0.0006) [2023-03-07 08:51:28,090][155452] Updated weights for policy 0, policy_version 57950 (0.0006) [2023-03-07 08:51:28,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 59343872. Throughput: 0: 13039.1. Samples: 59336878. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:51:28,378][155126] Avg episode reward: [(0, '1570.490')] [2023-03-07 08:51:28,382][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000057953_59343872.pth... [2023-03-07 08:51:28,466][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000054898_56215552.pth [2023-03-07 08:51:28,876][155452] Updated weights for policy 0, policy_version 57960 (0.0007) [2023-03-07 08:51:29,672][155452] Updated weights for policy 0, policy_version 57970 (0.0007) [2023-03-07 08:51:30,452][155452] Updated weights for policy 0, policy_version 57980 (0.0006) [2023-03-07 08:51:31,252][155452] Updated weights for policy 0, policy_version 57990 (0.0006) [2023-03-07 08:51:32,030][155452] Updated weights for policy 0, policy_version 58000 (0.0006) [2023-03-07 08:51:32,812][155452] Updated weights for policy 0, policy_version 58010 (0.0006) [2023-03-07 08:51:33,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13039.0, 300 sec: 13041.2). Total num frames: 59409408. Throughput: 0: 13042.5. 
Samples: 59375974. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:51:33,367][155126] Avg episode reward: [(0, '1413.094')] [2023-03-07 08:51:33,583][155452] Updated weights for policy 0, policy_version 58020 (0.0006) [2023-03-07 08:51:34,404][155452] Updated weights for policy 0, policy_version 58030 (0.0007) [2023-03-07 08:51:35,188][155452] Updated weights for policy 0, policy_version 58040 (0.0006) [2023-03-07 08:51:35,982][155452] Updated weights for policy 0, policy_version 58050 (0.0006) [2023-03-07 08:51:36,765][155452] Updated weights for policy 0, policy_version 58060 (0.0005) [2023-03-07 08:51:37,542][155452] Updated weights for policy 0, policy_version 58070 (0.0006) [2023-03-07 08:51:38,342][155452] Updated weights for policy 0, policy_version 58080 (0.0006) [2023-03-07 08:51:38,367][155126] Fps is (10 sec: 13005.1, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 59473920. Throughput: 0: 13028.8. Samples: 59453853. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:51:38,367][155126] Avg episode reward: [(0, '1665.805')] [2023-03-07 08:51:39,141][155452] Updated weights for policy 0, policy_version 58090 (0.0006) [2023-03-07 08:51:39,922][155452] Updated weights for policy 0, policy_version 58100 (0.0006) [2023-03-07 08:51:40,682][155452] Updated weights for policy 0, policy_version 58110 (0.0006) [2023-03-07 08:51:41,471][155452] Updated weights for policy 0, policy_version 58120 (0.0006) [2023-03-07 08:51:42,268][155452] Updated weights for policy 0, policy_version 58130 (0.0005) [2023-03-07 08:51:43,052][155452] Updated weights for policy 0, policy_version 58140 (0.0008) [2023-03-07 08:51:43,190][155401] KL-divergence is very high: 105.2092 [2023-03-07 08:51:43,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 59539456. Throughput: 0: 13028.1. Samples: 59532199. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:51:43,367][155126] Avg episode reward: [(0, '1623.119')] [2023-03-07 08:51:43,836][155452] Updated weights for policy 0, policy_version 58150 (0.0006) [2023-03-07 08:51:44,621][155452] Updated weights for policy 0, policy_version 58160 (0.0007) [2023-03-07 08:51:45,382][155452] Updated weights for policy 0, policy_version 58170 (0.0006) [2023-03-07 08:51:46,156][155452] Updated weights for policy 0, policy_version 58180 (0.0005) [2023-03-07 08:51:46,953][155452] Updated weights for policy 0, policy_version 58190 (0.0006) [2023-03-07 08:51:47,725][155452] Updated weights for policy 0, policy_version 58200 (0.0006) [2023-03-07 08:51:48,367][155126] Fps is (10 sec: 13106.9, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 59604992. Throughput: 0: 13023.9. Samples: 59571445. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:51:48,368][155126] Avg episode reward: [(0, '1575.170')] [2023-03-07 08:51:48,513][155452] Updated weights for policy 0, policy_version 58210 (0.0006) [2023-03-07 08:51:49,305][155452] Updated weights for policy 0, policy_version 58220 (0.0007) [2023-03-07 08:51:50,088][155452] Updated weights for policy 0, policy_version 58230 (0.0006) [2023-03-07 08:51:50,876][155452] Updated weights for policy 0, policy_version 58240 (0.0006) [2023-03-07 08:51:51,645][155452] Updated weights for policy 0, policy_version 58250 (0.0006) [2023-03-07 08:51:52,426][155452] Updated weights for policy 0, policy_version 58260 (0.0006) [2023-03-07 08:51:53,223][155452] Updated weights for policy 0, policy_version 58270 (0.0006) [2023-03-07 08:51:53,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13039.0, 300 sec: 13034.3). Total num frames: 59669504. Throughput: 0: 13033.0. Samples: 59650047. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) [2023-03-07 08:51:53,367][155126] Avg episode reward: [(0, '1730.933')] [2023-03-07 08:51:53,998][155452] Updated weights for policy 0, policy_version 58280 (0.0006) [2023-03-07 08:51:54,800][155452] Updated weights for policy 0, policy_version 58290 (0.0006) [2023-03-07 08:51:55,581][155452] Updated weights for policy 0, policy_version 58300 (0.0006) [2023-03-07 08:51:56,369][155452] Updated weights for policy 0, policy_version 58310 (0.0006) [2023-03-07 08:51:57,164][155452] Updated weights for policy 0, policy_version 58320 (0.0006) [2023-03-07 08:51:57,943][155452] Updated weights for policy 0, policy_version 58330 (0.0006) [2023-03-07 08:51:58,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 59735040. Throughput: 0: 13033.8. Samples: 59727996. Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) [2023-03-07 08:51:58,367][155126] Avg episode reward: [(0, '1531.144')] [2023-03-07 08:51:58,721][155452] Updated weights for policy 0, policy_version 58340 (0.0006) [2023-03-07 08:51:59,501][155452] Updated weights for policy 0, policy_version 58350 (0.0006) [2023-03-07 08:52:00,290][155452] Updated weights for policy 0, policy_version 58360 (0.0006) [2023-03-07 08:52:01,065][155452] Updated weights for policy 0, policy_version 58370 (0.0007) [2023-03-07 08:52:01,865][155452] Updated weights for policy 0, policy_version 58380 (0.0006) [2023-03-07 08:52:02,652][155452] Updated weights for policy 0, policy_version 58390 (0.0006) [2023-03-07 08:52:03,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 59800576. Throughput: 0: 13033.7. Samples: 59767225. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) [2023-03-07 08:52:03,367][155126] Avg episode reward: [(0, '1672.187')] [2023-03-07 08:52:03,426][155452] Updated weights for policy 0, policy_version 58400 (0.0006) [2023-03-07 08:52:04,203][155452] Updated weights for policy 0, policy_version 58410 (0.0006) [2023-03-07 08:52:05,002][155452] Updated weights for policy 0, policy_version 58420 (0.0006) [2023-03-07 08:52:05,791][155452] Updated weights for policy 0, policy_version 58430 (0.0007) [2023-03-07 08:52:06,574][155452] Updated weights for policy 0, policy_version 58440 (0.0006) [2023-03-07 08:52:07,379][155452] Updated weights for policy 0, policy_version 58450 (0.0006) [2023-03-07 08:52:08,165][155452] Updated weights for policy 0, policy_version 58460 (0.0006) [2023-03-07 08:52:08,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 59865088. Throughput: 0: 13028.0. Samples: 59845199. Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) [2023-03-07 08:52:08,367][155126] Avg episode reward: [(0, '1678.508')] [2023-03-07 08:52:08,953][155452] Updated weights for policy 0, policy_version 58470 (0.0007) [2023-03-07 08:52:09,728][155452] Updated weights for policy 0, policy_version 58480 (0.0006) [2023-03-07 08:52:10,513][155452] Updated weights for policy 0, policy_version 58490 (0.0006) [2023-03-07 08:52:11,294][155452] Updated weights for policy 0, policy_version 58500 (0.0005) [2023-03-07 08:52:12,084][155452] Updated weights for policy 0, policy_version 58510 (0.0006) [2023-03-07 08:52:12,868][155452] Updated weights for policy 0, policy_version 58520 (0.0007) [2023-03-07 08:52:13,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 59930624. Throughput: 0: 13035.6. Samples: 59923479. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) [2023-03-07 08:52:13,368][155126] Avg episode reward: [(0, '1667.422')] [2023-03-07 08:52:13,662][155452] Updated weights for policy 0, policy_version 58530 (0.0006) [2023-03-07 08:52:14,462][155452] Updated weights for policy 0, policy_version 58540 (0.0006) [2023-03-07 08:52:15,251][155452] Updated weights for policy 0, policy_version 58550 (0.0006) [2023-03-07 08:52:16,016][155452] Updated weights for policy 0, policy_version 58560 (0.0006) [2023-03-07 08:52:16,809][155452] Updated weights for policy 0, policy_version 58570 (0.0006) [2023-03-07 08:52:17,587][155452] Updated weights for policy 0, policy_version 58580 (0.0006) [2023-03-07 08:52:18,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 59995136. Throughput: 0: 13032.9. Samples: 59962456. Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) [2023-03-07 08:52:18,368][155126] Avg episode reward: [(0, '1633.785')] [2023-03-07 08:52:18,371][155452] Updated weights for policy 0, policy_version 58590 (0.0006) [2023-03-07 08:52:19,170][155452] Updated weights for policy 0, policy_version 58600 (0.0006) [2023-03-07 08:52:19,947][155452] Updated weights for policy 0, policy_version 58610 (0.0006) [2023-03-07 08:52:20,725][155452] Updated weights for policy 0, policy_version 58620 (0.0006) [2023-03-07 08:52:21,512][155452] Updated weights for policy 0, policy_version 58630 (0.0007) [2023-03-07 08:52:22,274][155452] Updated weights for policy 0, policy_version 58640 (0.0006) [2023-03-07 08:52:23,078][155452] Updated weights for policy 0, policy_version 58650 (0.0007) [2023-03-07 08:52:23,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 60060672. Throughput: 0: 13039.6. Samples: 60040636. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) [2023-03-07 08:52:23,367][155126] Avg episode reward: [(0, '1740.843')] [2023-03-07 08:52:23,857][155452] Updated weights for policy 0, policy_version 58660 (0.0006) [2023-03-07 08:52:24,637][155452] Updated weights for policy 0, policy_version 58670 (0.0006) [2023-03-07 08:52:25,425][155452] Updated weights for policy 0, policy_version 58680 (0.0006) [2023-03-07 08:52:26,214][155452] Updated weights for policy 0, policy_version 58690 (0.0006) [2023-03-07 08:52:27,007][155452] Updated weights for policy 0, policy_version 58700 (0.0006) [2023-03-07 08:52:27,806][155452] Updated weights for policy 0, policy_version 58710 (0.0006) [2023-03-07 08:52:28,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13039.0, 300 sec: 13037.8). Total num frames: 60126208. Throughput: 0: 13041.2. Samples: 60119052. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:52:28,367][155126] Avg episode reward: [(0, '1518.223')] [2023-03-07 08:52:28,584][155452] Updated weights for policy 0, policy_version 58720 (0.0006) [2023-03-07 08:52:29,365][155452] Updated weights for policy 0, policy_version 58730 (0.0006) [2023-03-07 08:52:30,173][155452] Updated weights for policy 0, policy_version 58740 (0.0006) [2023-03-07 08:52:30,953][155452] Updated weights for policy 0, policy_version 58750 (0.0005) [2023-03-07 08:52:31,759][155452] Updated weights for policy 0, policy_version 58760 (0.0006) [2023-03-07 08:52:32,539][155452] Updated weights for policy 0, policy_version 58770 (0.0006) [2023-03-07 08:52:33,342][155452] Updated weights for policy 0, policy_version 58780 (0.0006) [2023-03-07 08:52:33,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 60190720. Throughput: 0: 13034.5. Samples: 60157994. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:52:33,367][155126] Avg episode reward: [(0, '1528.070')] [2023-03-07 08:52:34,139][155452] Updated weights for policy 0, policy_version 58790 (0.0006) [2023-03-07 08:52:34,937][155452] Updated weights for policy 0, policy_version 58800 (0.0007) [2023-03-07 08:52:35,726][155452] Updated weights for policy 0, policy_version 58810 (0.0007) [2023-03-07 08:52:36,529][155452] Updated weights for policy 0, policy_version 58820 (0.0008) [2023-03-07 08:52:37,298][155452] Updated weights for policy 0, policy_version 58830 (0.0006) [2023-03-07 08:52:38,084][155452] Updated weights for policy 0, policy_version 58840 (0.0006) [2023-03-07 08:52:38,367][155126] Fps is (10 sec: 12902.3, 60 sec: 13021.8, 300 sec: 13030.8). Total num frames: 60255232. Throughput: 0: 13009.7. Samples: 60235484. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:52:38,368][155126] Avg episode reward: [(0, '1590.008')] [2023-03-07 08:52:38,866][155452] Updated weights for policy 0, policy_version 58850 (0.0007) [2023-03-07 08:52:39,654][155452] Updated weights for policy 0, policy_version 58860 (0.0006) [2023-03-07 08:52:40,431][155452] Updated weights for policy 0, policy_version 58870 (0.0006) [2023-03-07 08:52:41,202][155452] Updated weights for policy 0, policy_version 58880 (0.0006) [2023-03-07 08:52:41,999][155452] Updated weights for policy 0, policy_version 58890 (0.0006) [2023-03-07 08:52:42,778][155452] Updated weights for policy 0, policy_version 58900 (0.0007) [2023-03-07 08:52:43,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 60320768. Throughput: 0: 13017.9. Samples: 60313803. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:52:43,368][155126] Avg episode reward: [(0, '1530.515')] [2023-03-07 08:52:43,563][155452] Updated weights for policy 0, policy_version 58910 (0.0006) [2023-03-07 08:52:44,341][155452] Updated weights for policy 0, policy_version 58920 (0.0008) [2023-03-07 08:52:45,143][155452] Updated weights for policy 0, policy_version 58930 (0.0006) [2023-03-07 08:52:45,921][155452] Updated weights for policy 0, policy_version 58940 (0.0006) [2023-03-07 08:52:46,710][155452] Updated weights for policy 0, policy_version 58950 (0.0006) [2023-03-07 08:52:47,486][155452] Updated weights for policy 0, policy_version 58960 (0.0007) [2023-03-07 08:52:48,262][155452] Updated weights for policy 0, policy_version 58970 (0.0006) [2023-03-07 08:52:48,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 60386304. Throughput: 0: 13015.1. Samples: 60352904. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:52:48,368][155126] Avg episode reward: [(0, '1557.113')] [2023-03-07 08:52:49,052][155452] Updated weights for policy 0, policy_version 58980 (0.0006) [2023-03-07 08:52:49,829][155452] Updated weights for policy 0, policy_version 58990 (0.0006) [2023-03-07 08:52:50,635][155452] Updated weights for policy 0, policy_version 59000 (0.0006) [2023-03-07 08:52:51,431][155452] Updated weights for policy 0, policy_version 59010 (0.0007) [2023-03-07 08:52:52,215][155452] Updated weights for policy 0, policy_version 59020 (0.0007) [2023-03-07 08:52:53,003][155452] Updated weights for policy 0, policy_version 59030 (0.0006) [2023-03-07 08:52:53,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13030.8). Total num frames: 60450816. Throughput: 0: 13018.4. Samples: 60431025. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:52:53,367][155126] Avg episode reward: [(0, '1717.106')] [2023-03-07 08:52:53,810][155452] Updated weights for policy 0, policy_version 59040 (0.0006) [2023-03-07 08:52:54,585][155452] Updated weights for policy 0, policy_version 59050 (0.0005) [2023-03-07 08:52:55,384][155452] Updated weights for policy 0, policy_version 59060 (0.0007) [2023-03-07 08:52:56,169][155452] Updated weights for policy 0, policy_version 59070 (0.0005) [2023-03-07 08:52:56,966][155452] Updated weights for policy 0, policy_version 59080 (0.0006) [2023-03-07 08:52:57,741][155452] Updated weights for policy 0, policy_version 59090 (0.0006) [2023-03-07 08:52:58,367][155126] Fps is (10 sec: 12902.5, 60 sec: 13004.8, 300 sec: 13030.8). Total num frames: 60515328. Throughput: 0: 13005.8. Samples: 60508740. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 08:52:58,367][155126] Avg episode reward: [(0, '1716.829')] [2023-03-07 08:52:58,530][155452] Updated weights for policy 0, policy_version 59100 (0.0007) [2023-03-07 08:52:59,316][155452] Updated weights for policy 0, policy_version 59110 (0.0007) [2023-03-07 08:53:00,111][155452] Updated weights for policy 0, policy_version 59120 (0.0006) [2023-03-07 08:53:00,893][155452] Updated weights for policy 0, policy_version 59130 (0.0006) [2023-03-07 08:53:01,681][155452] Updated weights for policy 0, policy_version 59140 (0.0007) [2023-03-07 08:53:02,448][155452] Updated weights for policy 0, policy_version 59150 (0.0006) [2023-03-07 08:53:03,230][155452] Updated weights for policy 0, policy_version 59160 (0.0006) [2023-03-07 08:53:03,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13030.8). Total num frames: 60580864. Throughput: 0: 13010.5. Samples: 60547928. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:53:03,367][155126] Avg episode reward: [(0, '1694.888')] [2023-03-07 08:53:04,012][155452] Updated weights for policy 0, policy_version 59170 (0.0006) [2023-03-07 08:53:04,791][155452] Updated weights for policy 0, policy_version 59180 (0.0006) [2023-03-07 08:53:05,565][155452] Updated weights for policy 0, policy_version 59190 (0.0006) [2023-03-07 08:53:06,362][155452] Updated weights for policy 0, policy_version 59200 (0.0006) [2023-03-07 08:53:07,134][155452] Updated weights for policy 0, policy_version 59210 (0.0005) [2023-03-07 08:53:07,930][155452] Updated weights for policy 0, policy_version 59220 (0.0006) [2023-03-07 08:53:08,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 60646400. Throughput: 0: 13018.8. Samples: 60626481. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:53:08,367][155126] Avg episode reward: [(0, '1755.426')] [2023-03-07 08:53:08,702][155452] Updated weights for policy 0, policy_version 59230 (0.0006) [2023-03-07 08:53:09,502][155452] Updated weights for policy 0, policy_version 59240 (0.0007) [2023-03-07 08:53:10,298][155452] Updated weights for policy 0, policy_version 59250 (0.0006) [2023-03-07 08:53:11,080][155452] Updated weights for policy 0, policy_version 59260 (0.0006) [2023-03-07 08:53:11,853][155452] Updated weights for policy 0, policy_version 59270 (0.0006) [2023-03-07 08:53:12,640][155452] Updated weights for policy 0, policy_version 59280 (0.0006) [2023-03-07 08:53:12,960][155401] KL-divergence is very high: 4298.4307 [2023-03-07 08:53:13,281][155401] KL-divergence is very high: 142.7043 [2023-03-07 08:53:13,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 60711936. Throughput: 0: 13012.8. Samples: 60704628. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:53:13,367][155126] Avg episode reward: [(0, '1528.150')] [2023-03-07 08:53:13,443][155452] Updated weights for policy 0, policy_version 59290 (0.0006) [2023-03-07 08:53:14,222][155452] Updated weights for policy 0, policy_version 59300 (0.0006) [2023-03-07 08:53:15,018][155452] Updated weights for policy 0, policy_version 59310 (0.0006) [2023-03-07 08:53:15,790][155452] Updated weights for policy 0, policy_version 59320 (0.0006) [2023-03-07 08:53:16,577][155452] Updated weights for policy 0, policy_version 59330 (0.0007) [2023-03-07 08:53:17,340][155452] Updated weights for policy 0, policy_version 59340 (0.0007) [2023-03-07 08:53:18,121][155452] Updated weights for policy 0, policy_version 59350 (0.0006) [2023-03-07 08:53:18,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 60776448. Throughput: 0: 13016.9. Samples: 60743756. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:53:18,367][155126] Avg episode reward: [(0, '1678.378')] [2023-03-07 08:53:18,929][155452] Updated weights for policy 0, policy_version 59360 (0.0006) [2023-03-07 08:53:19,697][155452] Updated weights for policy 0, policy_version 59370 (0.0007) [2023-03-07 08:53:20,488][155452] Updated weights for policy 0, policy_version 59380 (0.0006) [2023-03-07 08:53:21,260][155452] Updated weights for policy 0, policy_version 59390 (0.0006) [2023-03-07 08:53:22,049][155452] Updated weights for policy 0, policy_version 59400 (0.0006) [2023-03-07 08:53:22,818][155452] Updated weights for policy 0, policy_version 59410 (0.0006) [2023-03-07 08:53:23,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13037.8). Total num frames: 60841984. Throughput: 0: 13039.5. Samples: 60822262. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:53:23,368][155126] Avg episode reward: [(0, '1548.982')] [2023-03-07 08:53:23,617][155452] Updated weights for policy 0, policy_version 59420 (0.0006) [2023-03-07 08:53:24,390][155452] Updated weights for policy 0, policy_version 59430 (0.0006) [2023-03-07 08:53:25,168][155452] Updated weights for policy 0, policy_version 59440 (0.0007) [2023-03-07 08:53:25,972][155452] Updated weights for policy 0, policy_version 59450 (0.0007) [2023-03-07 08:53:26,746][155452] Updated weights for policy 0, policy_version 59460 (0.0006) [2023-03-07 08:53:27,548][155452] Updated weights for policy 0, policy_version 59470 (0.0006) [2023-03-07 08:53:28,328][155452] Updated weights for policy 0, policy_version 59480 (0.0005) [2023-03-07 08:53:28,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13021.8, 300 sec: 13037.8). Total num frames: 60907520. Throughput: 0: 13038.9. Samples: 60900555. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:53:28,367][155126] Avg episode reward: [(0, '1721.771')] [2023-03-07 08:53:28,371][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000059480_60907520.pth... 
[2023-03-07 08:53:28,402][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000056425_57779200.pth [2023-03-07 08:53:29,124][155452] Updated weights for policy 0, policy_version 59490 (0.0006) [2023-03-07 08:53:29,914][155452] Updated weights for policy 0, policy_version 59500 (0.0006) [2023-03-07 08:53:30,684][155452] Updated weights for policy 0, policy_version 59510 (0.0006) [2023-03-07 08:53:31,474][155452] Updated weights for policy 0, policy_version 59520 (0.0006) [2023-03-07 08:53:32,252][155452] Updated weights for policy 0, policy_version 59530 (0.0005) [2023-03-07 08:53:33,034][155452] Updated weights for policy 0, policy_version 59540 (0.0006) [2023-03-07 08:53:33,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 60973056. Throughput: 0: 13038.1. Samples: 60939617. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:53:33,367][155126] Avg episode reward: [(0, '1521.840')] [2023-03-07 08:53:33,826][155452] Updated weights for policy 0, policy_version 59550 (0.0007) [2023-03-07 08:53:34,612][155452] Updated weights for policy 0, policy_version 59560 (0.0006) [2023-03-07 08:53:35,389][155452] Updated weights for policy 0, policy_version 59570 (0.0007) [2023-03-07 08:53:35,468][155401] KL-divergence is very high: 482.2096 [2023-03-07 08:53:36,184][155452] Updated weights for policy 0, policy_version 59580 (0.0007) [2023-03-07 08:53:36,963][155452] Updated weights for policy 0, policy_version 59590 (0.0007) [2023-03-07 08:53:37,728][155452] Updated weights for policy 0, policy_version 59600 (0.0006) [2023-03-07 08:53:38,290][155401] KL-divergence is very high: 319.3671 [2023-03-07 08:53:38,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 61037568. Throughput: 0: 13043.8. Samples: 61017997. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:53:38,368][155126] Avg episode reward: [(0, '1399.873')] [2023-03-07 08:53:38,531][155452] Updated weights for policy 0, policy_version 59610 (0.0006) [2023-03-07 08:53:39,303][155452] Updated weights for policy 0, policy_version 59620 (0.0006) [2023-03-07 08:53:40,086][155452] Updated weights for policy 0, policy_version 59630 (0.0006) [2023-03-07 08:53:40,884][155452] Updated weights for policy 0, policy_version 59640 (0.0007) [2023-03-07 08:53:41,677][155452] Updated weights for policy 0, policy_version 59650 (0.0006) [2023-03-07 08:53:42,469][155452] Updated weights for policy 0, policy_version 59660 (0.0007) [2023-03-07 08:53:43,266][155452] Updated weights for policy 0, policy_version 59670 (0.0006) [2023-03-07 08:53:43,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13039.0, 300 sec: 13037.8). Total num frames: 61103104. Throughput: 0: 13050.5. Samples: 61096012. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:53:43,367][155126] Avg episode reward: [(0, '1405.947')] [2023-03-07 08:53:44,028][155452] Updated weights for policy 0, policy_version 59680 (0.0005) [2023-03-07 08:53:44,811][155452] Updated weights for policy 0, policy_version 59690 (0.0006) [2023-03-07 08:53:45,607][155452] Updated weights for policy 0, policy_version 59700 (0.0006) [2023-03-07 08:53:46,381][155452] Updated weights for policy 0, policy_version 59710 (0.0006) [2023-03-07 08:53:47,182][155452] Updated weights for policy 0, policy_version 59720 (0.0007) [2023-03-07 08:53:47,942][155452] Updated weights for policy 0, policy_version 59730 (0.0005) [2023-03-07 08:53:48,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 61168640. Throughput: 0: 13053.5. Samples: 61135336. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:53:48,368][155126] Avg episode reward: [(0, '1340.227')] [2023-03-07 08:53:48,741][155452] Updated weights for policy 0, policy_version 59740 (0.0007) [2023-03-07 08:53:49,504][155452] Updated weights for policy 0, policy_version 59750 (0.0007) [2023-03-07 08:53:50,305][155452] Updated weights for policy 0, policy_version 59760 (0.0006) [2023-03-07 08:53:51,099][155452] Updated weights for policy 0, policy_version 59770 (0.0006) [2023-03-07 08:53:51,878][155452] Updated weights for policy 0, policy_version 59780 (0.0007) [2023-03-07 08:53:52,667][155452] Updated weights for policy 0, policy_version 59790 (0.0006) [2023-03-07 08:53:53,065][155401] KL-divergence is very high: 4661421056.0000 [2023-03-07 08:53:53,142][155401] KL-divergence is very high: 471230.2812 [2023-03-07 08:53:53,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 61233152. Throughput: 0: 13047.9. Samples: 61213637. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:53:53,367][155126] Avg episode reward: [(0, '1561.150')] [2023-03-07 08:53:53,450][155452] Updated weights for policy 0, policy_version 59800 (0.0007) [2023-03-07 08:53:54,238][155452] Updated weights for policy 0, policy_version 59810 (0.0006) [2023-03-07 08:53:54,627][155401] KL-divergence is very high: 411.5107 [2023-03-07 08:53:55,032][155452] Updated weights for policy 0, policy_version 59820 (0.0006) [2023-03-07 08:53:55,572][155401] KL-divergence is very high: 541.7425 [2023-03-07 08:53:55,745][155401] KL-divergence is very high: 277.1457 [2023-03-07 08:53:55,814][155452] Updated weights for policy 0, policy_version 59830 (0.0006) [2023-03-07 08:53:56,620][155452] Updated weights for policy 0, policy_version 59840 (0.0007) [2023-03-07 08:53:57,431][155452] Updated weights for policy 0, policy_version 59850 (0.0006) [2023-03-07 08:53:58,194][155452] Updated weights for policy 0, policy_version 59860 (0.0006) 
[2023-03-07 08:53:58,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 61298688. Throughput: 0: 13038.1. Samples: 61291344. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:53:58,367][155126] Avg episode reward: [(0, '1449.874')] [2023-03-07 08:53:58,978][155452] Updated weights for policy 0, policy_version 59870 (0.0006) [2023-03-07 08:53:59,769][155452] Updated weights for policy 0, policy_version 59880 (0.0006) [2023-03-07 08:54:00,555][155452] Updated weights for policy 0, policy_version 59890 (0.0006) [2023-03-07 08:54:01,349][155452] Updated weights for policy 0, policy_version 59900 (0.0006) [2023-03-07 08:54:02,119][155452] Updated weights for policy 0, policy_version 59910 (0.0006) [2023-03-07 08:54:02,918][155452] Updated weights for policy 0, policy_version 59920 (0.0006) [2023-03-07 08:54:03,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 61363200. Throughput: 0: 13035.9. Samples: 61330372. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:54:03,367][155126] Avg episode reward: [(0, '1182.245')] [2023-03-07 08:54:03,705][155452] Updated weights for policy 0, policy_version 59930 (0.0006) [2023-03-07 08:54:04,487][155452] Updated weights for policy 0, policy_version 59940 (0.0006) [2023-03-07 08:54:05,261][155452] Updated weights for policy 0, policy_version 59950 (0.0006) [2023-03-07 08:54:06,056][155452] Updated weights for policy 0, policy_version 59960 (0.0006) [2023-03-07 08:54:06,837][155452] Updated weights for policy 0, policy_version 59970 (0.0005) [2023-03-07 08:54:07,626][155452] Updated weights for policy 0, policy_version 59980 (0.0006) [2023-03-07 08:54:08,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 61428736. Throughput: 0: 13032.8. Samples: 61408739. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:54:08,368][155126] Avg episode reward: [(0, '1497.542')] [2023-03-07 08:54:08,399][155452] Updated weights for policy 0, policy_version 59990 (0.0006) [2023-03-07 08:54:09,194][155452] Updated weights for policy 0, policy_version 60000 (0.0006) [2023-03-07 08:54:09,962][155452] Updated weights for policy 0, policy_version 60010 (0.0007) [2023-03-07 08:54:10,776][155452] Updated weights for policy 0, policy_version 60020 (0.0006) [2023-03-07 08:54:11,558][155452] Updated weights for policy 0, policy_version 60030 (0.0007) [2023-03-07 08:54:12,335][155452] Updated weights for policy 0, policy_version 60040 (0.0006) [2023-03-07 08:54:13,120][155452] Updated weights for policy 0, policy_version 60050 (0.0006) [2023-03-07 08:54:13,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 61493248. Throughput: 0: 13029.4. Samples: 61486875. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:54:13,367][155126] Avg episode reward: [(0, '1623.852')] [2023-03-07 08:54:13,924][155452] Updated weights for policy 0, policy_version 60060 (0.0006) [2023-03-07 08:54:14,718][155452] Updated weights for policy 0, policy_version 60070 (0.0006) [2023-03-07 08:54:15,504][155452] Updated weights for policy 0, policy_version 60080 (0.0008) [2023-03-07 08:54:16,302][155452] Updated weights for policy 0, policy_version 60090 (0.0006) [2023-03-07 08:54:17,071][155452] Updated weights for policy 0, policy_version 60100 (0.0005) [2023-03-07 08:54:17,856][155452] Updated weights for policy 0, policy_version 60110 (0.0006) [2023-03-07 08:54:18,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 61558784. Throughput: 0: 13020.4. Samples: 61525534. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:54:18,378][155126] Avg episode reward: [(0, '1646.251')] [2023-03-07 08:54:18,641][155452] Updated weights for policy 0, policy_version 60120 (0.0005) [2023-03-07 08:54:19,434][155452] Updated weights for policy 0, policy_version 60130 (0.0007) [2023-03-07 08:54:20,233][155452] Updated weights for policy 0, policy_version 60140 (0.0006) [2023-03-07 08:54:21,013][155452] Updated weights for policy 0, policy_version 60150 (0.0006) [2023-03-07 08:54:21,792][155452] Updated weights for policy 0, policy_version 60160 (0.0007) [2023-03-07 08:54:22,573][155452] Updated weights for policy 0, policy_version 60170 (0.0006) [2023-03-07 08:54:23,361][155452] Updated weights for policy 0, policy_version 60180 (0.0005) [2023-03-07 08:54:23,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 61624320. Throughput: 0: 13019.7. Samples: 61603881. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:54:23,367][155126] Avg episode reward: [(0, '1679.051')] [2023-03-07 08:54:24,134][155452] Updated weights for policy 0, policy_version 60190 (0.0006) [2023-03-07 08:54:24,929][155452] Updated weights for policy 0, policy_version 60200 (0.0006) [2023-03-07 08:54:25,698][155452] Updated weights for policy 0, policy_version 60210 (0.0006) [2023-03-07 08:54:26,471][155452] Updated weights for policy 0, policy_version 60220 (0.0006) [2023-03-07 08:54:27,269][155452] Updated weights for policy 0, policy_version 60230 (0.0006) [2023-03-07 08:54:28,045][155452] Updated weights for policy 0, policy_version 60240 (0.0007) [2023-03-07 08:54:28,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13039.0, 300 sec: 13034.3). Total num frames: 61689856. Throughput: 0: 13032.2. Samples: 61682462. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:54:28,367][155126] Avg episode reward: [(0, '1699.916')] [2023-03-07 08:54:28,824][155452] Updated weights for policy 0, policy_version 60250 (0.0008) [2023-03-07 08:54:29,620][155452] Updated weights for policy 0, policy_version 60260 (0.0006) [2023-03-07 08:54:30,393][155452] Updated weights for policy 0, policy_version 60270 (0.0007) [2023-03-07 08:54:31,179][155452] Updated weights for policy 0, policy_version 60280 (0.0006) [2023-03-07 08:54:31,961][155452] Updated weights for policy 0, policy_version 60290 (0.0006) [2023-03-07 08:54:32,757][155452] Updated weights for policy 0, policy_version 60300 (0.0006) [2023-03-07 08:54:33,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13030.8). Total num frames: 61754368. Throughput: 0: 13030.5. Samples: 61721705. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:54:33,378][155126] Avg episode reward: [(0, '1549.583')] [2023-03-07 08:54:33,549][155452] Updated weights for policy 0, policy_version 60310 (0.0006) [2023-03-07 08:54:34,323][155452] Updated weights for policy 0, policy_version 60320 (0.0006) [2023-03-07 08:54:35,095][155452] Updated weights for policy 0, policy_version 60330 (0.0005) [2023-03-07 08:54:35,889][155452] Updated weights for policy 0, policy_version 60340 (0.0007) [2023-03-07 08:54:36,689][155452] Updated weights for policy 0, policy_version 60350 (0.0006) [2023-03-07 08:54:37,464][155452] Updated weights for policy 0, policy_version 60360 (0.0006) [2023-03-07 08:54:38,245][155452] Updated weights for policy 0, policy_version 60370 (0.0006) [2023-03-07 08:54:38,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13039.0, 300 sec: 13034.3). Total num frames: 61819904. Throughput: 0: 13028.6. Samples: 61799922. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:54:38,367][155126] Avg episode reward: [(0, '1754.102')] [2023-03-07 08:54:39,033][155452] Updated weights for policy 0, policy_version 60380 (0.0006) [2023-03-07 08:54:39,814][155452] Updated weights for policy 0, policy_version 60390 (0.0006) [2023-03-07 08:54:40,598][155452] Updated weights for policy 0, policy_version 60400 (0.0005) [2023-03-07 08:54:41,378][155452] Updated weights for policy 0, policy_version 60410 (0.0006) [2023-03-07 08:54:42,152][155452] Updated weights for policy 0, policy_version 60420 (0.0007) [2023-03-07 08:54:42,945][155452] Updated weights for policy 0, policy_version 60430 (0.0006) [2023-03-07 08:54:43,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 61885440. Throughput: 0: 13046.5. Samples: 61878438. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:54:43,367][155126] Avg episode reward: [(0, '1576.868')] [2023-03-07 08:54:43,736][155452] Updated weights for policy 0, policy_version 60440 (0.0006) [2023-03-07 08:54:44,504][155452] Updated weights for policy 0, policy_version 60450 (0.0007) [2023-03-07 08:54:45,274][155452] Updated weights for policy 0, policy_version 60460 (0.0006) [2023-03-07 08:54:46,068][155452] Updated weights for policy 0, policy_version 60470 (0.0007) [2023-03-07 08:54:46,841][155452] Updated weights for policy 0, policy_version 60480 (0.0006) [2023-03-07 08:54:47,642][155452] Updated weights for policy 0, policy_version 60490 (0.0006) [2023-03-07 08:54:48,367][155126] Fps is (10 sec: 13107.4, 60 sec: 13039.0, 300 sec: 13037.8). Total num frames: 61950976. Throughput: 0: 13051.3. Samples: 61917677. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:54:48,367][155126] Avg episode reward: [(0, '1587.250')] [2023-03-07 08:54:48,431][155452] Updated weights for policy 0, policy_version 60500 (0.0006) [2023-03-07 08:54:49,217][155452] Updated weights for policy 0, policy_version 60510 (0.0006) [2023-03-07 08:54:49,992][155452] Updated weights for policy 0, policy_version 60520 (0.0006) [2023-03-07 08:54:50,767][155452] Updated weights for policy 0, policy_version 60530 (0.0006) [2023-03-07 08:54:51,556][155452] Updated weights for policy 0, policy_version 60540 (0.0006) [2023-03-07 08:54:52,343][155452] Updated weights for policy 0, policy_version 60550 (0.0006) [2023-03-07 08:54:53,137][155452] Updated weights for policy 0, policy_version 60560 (0.0007) [2023-03-07 08:54:53,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 62015488. Throughput: 0: 13049.7. Samples: 61995974. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:54:53,367][155126] Avg episode reward: [(0, '1633.107')] [2023-03-07 08:54:53,924][155452] Updated weights for policy 0, policy_version 60570 (0.0007) [2023-03-07 08:54:54,715][155452] Updated weights for policy 0, policy_version 60580 (0.0006) [2023-03-07 08:54:55,512][155452] Updated weights for policy 0, policy_version 60590 (0.0007) [2023-03-07 08:54:56,322][155452] Updated weights for policy 0, policy_version 60600 (0.0007) [2023-03-07 08:54:57,089][155452] Updated weights for policy 0, policy_version 60610 (0.0006) [2023-03-07 08:54:57,881][155452] Updated weights for policy 0, policy_version 60620 (0.0006) [2023-03-07 08:54:58,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 62081024. Throughput: 0: 13041.5. Samples: 62073742. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:54:58,367][155126] Avg episode reward: [(0, '1710.528')] [2023-03-07 08:54:58,683][155452] Updated weights for policy 0, policy_version 60630 (0.0005) [2023-03-07 08:54:59,454][155452] Updated weights for policy 0, policy_version 60640 (0.0006) [2023-03-07 08:55:00,241][155452] Updated weights for policy 0, policy_version 60650 (0.0005) [2023-03-07 08:55:01,022][155452] Updated weights for policy 0, policy_version 60660 (0.0006) [2023-03-07 08:55:01,802][155452] Updated weights for policy 0, policy_version 60670 (0.0006) [2023-03-07 08:55:02,590][155452] Updated weights for policy 0, policy_version 60680 (0.0006) [2023-03-07 08:55:03,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 62145536. Throughput: 0: 13051.7. Samples: 62112862. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:55:03,368][155126] Avg episode reward: [(0, '1809.569')] [2023-03-07 08:55:03,377][155452] Updated weights for policy 0, policy_version 60690 (0.0006) [2023-03-07 08:55:04,181][155452] Updated weights for policy 0, policy_version 60700 (0.0007) [2023-03-07 08:55:04,964][155452] Updated weights for policy 0, policy_version 60710 (0.0006) [2023-03-07 08:55:05,758][155452] Updated weights for policy 0, policy_version 60720 (0.0007) [2023-03-07 08:55:06,540][155452] Updated weights for policy 0, policy_version 60730 (0.0007) [2023-03-07 08:55:07,317][155452] Updated weights for policy 0, policy_version 60740 (0.0006) [2023-03-07 08:55:08,111][155452] Updated weights for policy 0, policy_version 60750 (0.0007) [2023-03-07 08:55:08,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 62211072. Throughput: 0: 13043.5. Samples: 62190840. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:55:08,367][155126] Avg episode reward: [(0, '1490.363')] [2023-03-07 08:55:08,871][155452] Updated weights for policy 0, policy_version 60760 (0.0006) [2023-03-07 08:55:09,665][155452] Updated weights for policy 0, policy_version 60770 (0.0006) [2023-03-07 08:55:10,458][155452] Updated weights for policy 0, policy_version 60780 (0.0006) [2023-03-07 08:55:11,236][155452] Updated weights for policy 0, policy_version 60790 (0.0006) [2023-03-07 08:55:12,022][155452] Updated weights for policy 0, policy_version 60800 (0.0006) [2023-03-07 08:55:12,799][155452] Updated weights for policy 0, policy_version 60810 (0.0006) [2023-03-07 08:55:13,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13030.8). Total num frames: 62275584. Throughput: 0: 13037.3. Samples: 62269141. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:55:13,367][155126] Avg episode reward: [(0, '1617.358')] [2023-03-07 08:55:13,584][155452] Updated weights for policy 0, policy_version 60820 (0.0006) [2023-03-07 08:55:14,371][155452] Updated weights for policy 0, policy_version 60830 (0.0006) [2023-03-07 08:55:15,173][155452] Updated weights for policy 0, policy_version 60840 (0.0006) [2023-03-07 08:55:15,953][155452] Updated weights for policy 0, policy_version 60850 (0.0006) [2023-03-07 08:55:16,751][155452] Updated weights for policy 0, policy_version 60860 (0.0006) [2023-03-07 08:55:17,530][155452] Updated weights for policy 0, policy_version 60870 (0.0005) [2023-03-07 08:55:18,311][155452] Updated weights for policy 0, policy_version 60880 (0.0006) [2023-03-07 08:55:18,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 62341120. Throughput: 0: 13035.6. Samples: 62308307. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:55:18,367][155126] Avg episode reward: [(0, '1439.098')] [2023-03-07 08:55:19,129][155452] Updated weights for policy 0, policy_version 60890 (0.0007) [2023-03-07 08:55:19,929][155452] Updated weights for policy 0, policy_version 60900 (0.0006) [2023-03-07 08:55:20,729][155452] Updated weights for policy 0, policy_version 60910 (0.0006) [2023-03-07 08:55:21,507][155452] Updated weights for policy 0, policy_version 60920 (0.0007) [2023-03-07 08:55:22,294][155452] Updated weights for policy 0, policy_version 60930 (0.0006) [2023-03-07 08:55:23,094][155452] Updated weights for policy 0, policy_version 60940 (0.0007) [2023-03-07 08:55:23,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13030.8). Total num frames: 62405632. Throughput: 0: 13018.7. Samples: 62385766. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:55:23,368][155126] Avg episode reward: [(0, '1669.029')] [2023-03-07 08:55:23,870][155452] Updated weights for policy 0, policy_version 60950 (0.0006) [2023-03-07 08:55:24,654][155452] Updated weights for policy 0, policy_version 60960 (0.0006) [2023-03-07 08:55:25,448][155452] Updated weights for policy 0, policy_version 60970 (0.0006) [2023-03-07 08:55:26,225][155452] Updated weights for policy 0, policy_version 60980 (0.0007) [2023-03-07 08:55:27,014][155452] Updated weights for policy 0, policy_version 60990 (0.0007) [2023-03-07 08:55:27,803][155452] Updated weights for policy 0, policy_version 61000 (0.0006) [2023-03-07 08:55:28,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.8, 300 sec: 13030.8). Total num frames: 62471168. Throughput: 0: 13009.7. Samples: 62463874. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 08:55:28,367][155126] Avg episode reward: [(0, '1749.479')] [2023-03-07 08:55:28,371][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000061007_62471168.pth... 
[2023-03-07 08:55:28,402][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000057953_59343872.pth [2023-03-07 08:55:28,595][155452] Updated weights for policy 0, policy_version 61010 (0.0005) [2023-03-07 08:55:29,369][155452] Updated weights for policy 0, policy_version 61020 (0.0006) [2023-03-07 08:55:30,173][155452] Updated weights for policy 0, policy_version 61030 (0.0005) [2023-03-07 08:55:30,939][155452] Updated weights for policy 0, policy_version 61040 (0.0006) [2023-03-07 08:55:31,720][155452] Updated weights for policy 0, policy_version 61050 (0.0006) [2023-03-07 08:55:32,515][155452] Updated weights for policy 0, policy_version 61060 (0.0006) [2023-03-07 08:55:33,289][155452] Updated weights for policy 0, policy_version 61070 (0.0008) [2023-03-07 08:55:33,367][155126] Fps is (10 sec: 13107.4, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 62536704. Throughput: 0: 13008.8. Samples: 62503075. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:55:33,367][155126] Avg episode reward: [(0, '1788.723')] [2023-03-07 08:55:34,088][155452] Updated weights for policy 0, policy_version 61080 (0.0006) [2023-03-07 08:55:34,868][155452] Updated weights for policy 0, policy_version 61090 (0.0006) [2023-03-07 08:55:35,640][155452] Updated weights for policy 0, policy_version 61100 (0.0007) [2023-03-07 08:55:36,425][155452] Updated weights for policy 0, policy_version 61110 (0.0006) [2023-03-07 08:55:37,207][155452] Updated weights for policy 0, policy_version 61120 (0.0006) [2023-03-07 08:55:37,995][155452] Updated weights for policy 0, policy_version 61130 (0.0006) [2023-03-07 08:55:38,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13021.9, 300 sec: 13030.8). Total num frames: 62601216. Throughput: 0: 13007.3. Samples: 62581300. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:55:38,367][155126] Avg episode reward: [(0, '1712.560')] [2023-03-07 08:55:38,782][155452] Updated weights for policy 0, policy_version 61140 (0.0005) [2023-03-07 08:55:39,571][155452] Updated weights for policy 0, policy_version 61150 (0.0006) [2023-03-07 08:55:40,373][155452] Updated weights for policy 0, policy_version 61160 (0.0006) [2023-03-07 08:55:41,186][155452] Updated weights for policy 0, policy_version 61170 (0.0006) [2023-03-07 08:55:41,971][155452] Updated weights for policy 0, policy_version 61180 (0.0006) [2023-03-07 08:55:42,744][155452] Updated weights for policy 0, policy_version 61190 (0.0006) [2023-03-07 08:55:43,367][155126] Fps is (10 sec: 12902.4, 60 sec: 13004.8, 300 sec: 13027.4). Total num frames: 62665728. Throughput: 0: 13007.5. Samples: 62659080. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:55:43,367][155126] Avg episode reward: [(0, '2026.681')] [2023-03-07 08:55:43,533][155452] Updated weights for policy 0, policy_version 61200 (0.0006) [2023-03-07 08:55:44,313][155452] Updated weights for policy 0, policy_version 61210 (0.0007) [2023-03-07 08:55:45,098][155452] Updated weights for policy 0, policy_version 61220 (0.0006) [2023-03-07 08:55:45,877][155452] Updated weights for policy 0, policy_version 61230 (0.0006) [2023-03-07 08:55:46,644][155452] Updated weights for policy 0, policy_version 61240 (0.0006) [2023-03-07 08:55:47,444][155452] Updated weights for policy 0, policy_version 61250 (0.0006) [2023-03-07 08:55:48,199][155452] Updated weights for policy 0, policy_version 61260 (0.0006) [2023-03-07 08:55:48,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13004.8, 300 sec: 13030.8). Total num frames: 62731264. Throughput: 0: 13013.8. Samples: 62698481. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:55:48,367][155126] Avg episode reward: [(0, '1746.731')] [2023-03-07 08:55:49,000][155452] Updated weights for policy 0, policy_version 61270 (0.0006) [2023-03-07 08:55:49,785][155452] Updated weights for policy 0, policy_version 61280 (0.0006) [2023-03-07 08:55:50,552][155452] Updated weights for policy 0, policy_version 61290 (0.0006) [2023-03-07 08:55:51,345][155452] Updated weights for policy 0, policy_version 61300 (0.0006) [2023-03-07 08:55:52,110][155452] Updated weights for policy 0, policy_version 61310 (0.0006) [2023-03-07 08:55:52,905][155452] Updated weights for policy 0, policy_version 61320 (0.0007) [2023-03-07 08:55:53,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13021.9, 300 sec: 13030.8). Total num frames: 62796800. Throughput: 0: 13029.7. Samples: 62777176. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:55:53,367][155126] Avg episode reward: [(0, '1667.673')] [2023-03-07 08:55:53,679][155452] Updated weights for policy 0, policy_version 61330 (0.0006) [2023-03-07 08:55:54,469][155452] Updated weights for policy 0, policy_version 61340 (0.0005) [2023-03-07 08:55:55,255][155452] Updated weights for policy 0, policy_version 61350 (0.0007) [2023-03-07 08:55:56,045][155452] Updated weights for policy 0, policy_version 61360 (0.0006) [2023-03-07 08:55:56,836][155452] Updated weights for policy 0, policy_version 61370 (0.0008) [2023-03-07 08:55:57,642][155452] Updated weights for policy 0, policy_version 61380 (0.0006) [2023-03-07 08:55:58,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13021.9, 300 sec: 13030.8). Total num frames: 62862336. Throughput: 0: 13026.5. Samples: 62855333. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:55:58,378][155126] Avg episode reward: [(0, '1709.407')] [2023-03-07 08:55:58,436][155452] Updated weights for policy 0, policy_version 61390 (0.0006) [2023-03-07 08:55:59,213][155452] Updated weights for policy 0, policy_version 61400 (0.0006) [2023-03-07 08:56:00,000][155452] Updated weights for policy 0, policy_version 61410 (0.0006) [2023-03-07 08:56:00,764][155452] Updated weights for policy 0, policy_version 61420 (0.0006) [2023-03-07 08:56:01,557][155452] Updated weights for policy 0, policy_version 61430 (0.0005) [2023-03-07 08:56:02,339][155452] Updated weights for policy 0, policy_version 61440 (0.0006) [2023-03-07 08:56:03,124][155452] Updated weights for policy 0, policy_version 61450 (0.0007) [2023-03-07 08:56:03,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13039.0, 300 sec: 13030.8). Total num frames: 62927872. Throughput: 0: 13023.9. Samples: 62894380. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:56:03,367][155126] Avg episode reward: [(0, '1624.896')] [2023-03-07 08:56:03,914][155452] Updated weights for policy 0, policy_version 61460 (0.0006) [2023-03-07 08:56:04,709][155452] Updated weights for policy 0, policy_version 61470 (0.0007) [2023-03-07 08:56:05,497][155452] Updated weights for policy 0, policy_version 61480 (0.0006) [2023-03-07 08:56:06,270][155452] Updated weights for policy 0, policy_version 61490 (0.0006) [2023-03-07 08:56:07,077][155452] Updated weights for policy 0, policy_version 61500 (0.0006) [2023-03-07 08:56:07,865][155452] Updated weights for policy 0, policy_version 61510 (0.0006) [2023-03-07 08:56:08,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13030.8). Total num frames: 62992384. Throughput: 0: 13032.8. Samples: 62972240. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:56:08,367][155126] Avg episode reward: [(0, '1840.692')] [2023-03-07 08:56:08,621][155452] Updated weights for policy 0, policy_version 61520 (0.0006) [2023-03-07 08:56:09,435][155452] Updated weights for policy 0, policy_version 61530 (0.0006) [2023-03-07 08:56:10,213][155452] Updated weights for policy 0, policy_version 61540 (0.0006) [2023-03-07 08:56:11,001][155452] Updated weights for policy 0, policy_version 61550 (0.0006) [2023-03-07 08:56:11,791][155452] Updated weights for policy 0, policy_version 61560 (0.0006) [2023-03-07 08:56:12,573][155452] Updated weights for policy 0, policy_version 61570 (0.0007) [2023-03-07 08:56:13,367][155126] Fps is (10 sec: 12902.4, 60 sec: 13021.9, 300 sec: 13027.4). Total num frames: 63056896. Throughput: 0: 13035.4. Samples: 63050465. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:56:13,367][155126] Avg episode reward: [(0, '1626.983')] [2023-03-07 08:56:13,383][155452] Updated weights for policy 0, policy_version 61580 (0.0007) [2023-03-07 08:56:14,182][155452] Updated weights for policy 0, policy_version 61590 (0.0006) [2023-03-07 08:56:14,962][155452] Updated weights for policy 0, policy_version 61600 (0.0005) [2023-03-07 08:56:15,746][155452] Updated weights for policy 0, policy_version 61610 (0.0006) [2023-03-07 08:56:16,524][155452] Updated weights for policy 0, policy_version 61620 (0.0006) [2023-03-07 08:56:17,306][155452] Updated weights for policy 0, policy_version 61630 (0.0007) [2023-03-07 08:56:18,095][155452] Updated weights for policy 0, policy_version 61640 (0.0006) [2023-03-07 08:56:18,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13027.4). Total num frames: 63122432. Throughput: 0: 13029.4. Samples: 63089400. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:56:18,368][155126] Avg episode reward: [(0, '1795.652')] [2023-03-07 08:56:18,878][155452] Updated weights for policy 0, policy_version 61650 (0.0007) [2023-03-07 08:56:19,674][155452] Updated weights for policy 0, policy_version 61660 (0.0006) [2023-03-07 08:56:20,462][155452] Updated weights for policy 0, policy_version 61670 (0.0006) [2023-03-07 08:56:21,243][155452] Updated weights for policy 0, policy_version 61680 (0.0006) [2023-03-07 08:56:22,018][155452] Updated weights for policy 0, policy_version 61690 (0.0006) [2023-03-07 08:56:22,797][155452] Updated weights for policy 0, policy_version 61700 (0.0006) [2023-03-07 08:56:23,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13030.8). Total num frames: 63187968. Throughput: 0: 13030.6. Samples: 63167676. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:56:23,367][155126] Avg episode reward: [(0, '1509.624')] [2023-03-07 08:56:23,588][155452] Updated weights for policy 0, policy_version 61710 (0.0006) [2023-03-07 08:56:24,368][155452] Updated weights for policy 0, policy_version 61720 (0.0007) [2023-03-07 08:56:25,170][155452] Updated weights for policy 0, policy_version 61730 (0.0006) [2023-03-07 08:56:25,939][155452] Updated weights for policy 0, policy_version 61740 (0.0006) [2023-03-07 08:56:26,727][155452] Updated weights for policy 0, policy_version 61750 (0.0006) [2023-03-07 08:56:27,497][155452] Updated weights for policy 0, policy_version 61760 (0.0007) [2023-03-07 08:56:28,279][155452] Updated weights for policy 0, policy_version 61770 (0.0005) [2023-03-07 08:56:28,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13027.4). Total num frames: 63252480. Throughput: 0: 13041.8. Samples: 63245965. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:56:28,368][155126] Avg episode reward: [(0, '1524.473')] [2023-03-07 08:56:29,065][155452] Updated weights for policy 0, policy_version 61780 (0.0006) [2023-03-07 08:56:29,859][155452] Updated weights for policy 0, policy_version 61790 (0.0006) [2023-03-07 08:56:30,619][155452] Updated weights for policy 0, policy_version 61800 (0.0006) [2023-03-07 08:56:31,400][155452] Updated weights for policy 0, policy_version 61810 (0.0006) [2023-03-07 08:56:32,198][155452] Updated weights for policy 0, policy_version 61820 (0.0006) [2023-03-07 08:56:32,981][155452] Updated weights for policy 0, policy_version 61830 (0.0006) [2023-03-07 08:56:33,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.8, 300 sec: 13030.8). Total num frames: 63318016. Throughput: 0: 13042.2. Samples: 63285379. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:56:33,367][155126] Avg episode reward: [(0, '1567.169')] [2023-03-07 08:56:33,762][155452] Updated weights for policy 0, policy_version 61840 (0.0006) [2023-03-07 08:56:34,559][155452] Updated weights for policy 0, policy_version 61850 (0.0006) [2023-03-07 08:56:35,347][155452] Updated weights for policy 0, policy_version 61860 (0.0006) [2023-03-07 08:56:36,138][155452] Updated weights for policy 0, policy_version 61870 (0.0006) [2023-03-07 08:56:36,928][155452] Updated weights for policy 0, policy_version 61880 (0.0006) [2023-03-07 08:56:37,710][155452] Updated weights for policy 0, policy_version 61890 (0.0006) [2023-03-07 08:56:38,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13030.8). Total num frames: 63383552. Throughput: 0: 13027.7. Samples: 63363422. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:56:38,368][155126] Avg episode reward: [(0, '1696.174')] [2023-03-07 08:56:38,494][155452] Updated weights for policy 0, policy_version 61900 (0.0006) [2023-03-07 08:56:39,250][155452] Updated weights for policy 0, policy_version 61910 (0.0005) [2023-03-07 08:56:40,057][155452] Updated weights for policy 0, policy_version 61920 (0.0006) [2023-03-07 08:56:40,825][155452] Updated weights for policy 0, policy_version 61930 (0.0006) [2023-03-07 08:56:41,631][155452] Updated weights for policy 0, policy_version 61940 (0.0006) [2023-03-07 08:56:42,402][155452] Updated weights for policy 0, policy_version 61950 (0.0006) [2023-03-07 08:56:43,170][155452] Updated weights for policy 0, policy_version 61960 (0.0006) [2023-03-07 08:56:43,367][155126] Fps is (10 sec: 13107.4, 60 sec: 13056.0, 300 sec: 13030.8). Total num frames: 63449088. Throughput: 0: 13039.8. Samples: 63442122. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:56:43,367][155126] Avg episode reward: [(0, '1549.410')] [2023-03-07 08:56:43,957][155452] Updated weights for policy 0, policy_version 61970 (0.0007) [2023-03-07 08:56:44,732][155452] Updated weights for policy 0, policy_version 61980 (0.0006) [2023-03-07 08:56:45,524][155452] Updated weights for policy 0, policy_version 61990 (0.0007) [2023-03-07 08:56:46,294][155452] Updated weights for policy 0, policy_version 62000 (0.0007) [2023-03-07 08:56:47,097][155452] Updated weights for policy 0, policy_version 62010 (0.0007) [2023-03-07 08:56:47,876][155452] Updated weights for policy 0, policy_version 62020 (0.0007) [2023-03-07 08:56:48,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13034.3). Total num frames: 63514624. Throughput: 0: 13044.6. Samples: 63481386. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:56:48,367][155126] Avg episode reward: [(0, '1732.647')] [2023-03-07 08:56:48,655][155452] Updated weights for policy 0, policy_version 62030 (0.0005) [2023-03-07 08:56:49,447][155452] Updated weights for policy 0, policy_version 62040 (0.0006) [2023-03-07 08:56:50,234][155452] Updated weights for policy 0, policy_version 62050 (0.0005) [2023-03-07 08:56:51,010][155452] Updated weights for policy 0, policy_version 62060 (0.0006) [2023-03-07 08:56:51,800][155452] Updated weights for policy 0, policy_version 62070 (0.0006) [2023-03-07 08:56:52,592][155452] Updated weights for policy 0, policy_version 62080 (0.0006) [2023-03-07 08:56:53,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13030.8). Total num frames: 63579136. Throughput: 0: 13050.4. Samples: 63559505. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:56:53,367][155126] Avg episode reward: [(0, '1704.967')] [2023-03-07 08:56:53,368][155452] Updated weights for policy 0, policy_version 62090 (0.0006) [2023-03-07 08:56:54,173][155452] Updated weights for policy 0, policy_version 62100 (0.0006) [2023-03-07 08:56:54,962][155452] Updated weights for policy 0, policy_version 62110 (0.0006) [2023-03-07 08:56:55,727][155452] Updated weights for policy 0, policy_version 62120 (0.0006) [2023-03-07 08:56:56,516][155452] Updated weights for policy 0, policy_version 62130 (0.0006) [2023-03-07 08:56:57,297][155452] Updated weights for policy 0, policy_version 62140 (0.0006) [2023-03-07 08:56:58,077][155452] Updated weights for policy 0, policy_version 62150 (0.0007) [2023-03-07 08:56:58,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13030.8). Total num frames: 63644672. Throughput: 0: 13055.1. Samples: 63637945. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:56:58,367][155126] Avg episode reward: [(0, '1641.903')] [2023-03-07 08:56:58,860][155452] Updated weights for policy 0, policy_version 62160 (0.0006) [2023-03-07 08:56:59,650][155452] Updated weights for policy 0, policy_version 62170 (0.0006) [2023-03-07 08:57:00,422][155452] Updated weights for policy 0, policy_version 62180 (0.0007) [2023-03-07 08:57:01,223][155452] Updated weights for policy 0, policy_version 62190 (0.0006) [2023-03-07 08:57:01,996][155452] Updated weights for policy 0, policy_version 62200 (0.0007) [2023-03-07 08:57:02,785][155452] Updated weights for policy 0, policy_version 62210 (0.0006) [2023-03-07 08:57:03,367][155126] Fps is (10 sec: 13107.0, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 63710208. Throughput: 0: 13064.7. Samples: 63677314. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:57:03,368][155126] Avg episode reward: [(0, '1481.485')] [2023-03-07 08:57:03,572][155452] Updated weights for policy 0, policy_version 62220 (0.0007) [2023-03-07 08:57:04,342][155452] Updated weights for policy 0, policy_version 62230 (0.0006) [2023-03-07 08:57:05,133][155452] Updated weights for policy 0, policy_version 62240 (0.0006) [2023-03-07 08:57:05,914][155452] Updated weights for policy 0, policy_version 62250 (0.0006) [2023-03-07 08:57:06,692][155452] Updated weights for policy 0, policy_version 62260 (0.0006) [2023-03-07 08:57:07,476][155452] Updated weights for policy 0, policy_version 62270 (0.0006) [2023-03-07 08:57:08,271][155452] Updated weights for policy 0, policy_version 62280 (0.0006) [2023-03-07 08:57:08,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13034.3). Total num frames: 63775744. Throughput: 0: 13065.4. Samples: 63755620. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:57:08,367][155126] Avg episode reward: [(0, '1629.788')] [2023-03-07 08:57:09,045][155452] Updated weights for policy 0, policy_version 62290 (0.0007) [2023-03-07 08:57:09,820][155452] Updated weights for policy 0, policy_version 62300 (0.0006) [2023-03-07 08:57:10,604][155452] Updated weights for policy 0, policy_version 62310 (0.0006) [2023-03-07 08:57:11,398][155452] Updated weights for policy 0, policy_version 62320 (0.0006) [2023-03-07 08:57:12,178][155452] Updated weights for policy 0, policy_version 62330 (0.0005) [2023-03-07 08:57:12,973][155452] Updated weights for policy 0, policy_version 62340 (0.0006) [2023-03-07 08:57:13,367][155126] Fps is (10 sec: 13107.4, 60 sec: 13073.1, 300 sec: 13037.8). Total num frames: 63841280. Throughput: 0: 13069.0. Samples: 63834070. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:57:13,367][155126] Avg episode reward: [(0, '1748.805')] [2023-03-07 08:57:13,763][155452] Updated weights for policy 0, policy_version 62350 (0.0006) [2023-03-07 08:57:14,556][155452] Updated weights for policy 0, policy_version 62360 (0.0006) [2023-03-07 08:57:15,329][155452] Updated weights for policy 0, policy_version 62370 (0.0006) [2023-03-07 08:57:16,104][155452] Updated weights for policy 0, policy_version 62380 (0.0006) [2023-03-07 08:57:16,894][155452] Updated weights for policy 0, policy_version 62390 (0.0007) [2023-03-07 08:57:17,673][155452] Updated weights for policy 0, policy_version 62400 (0.0006) [2023-03-07 08:57:18,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13034.3). Total num frames: 63905792. Throughput: 0: 13061.7. Samples: 63873156. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:57:18,368][155126] Avg episode reward: [(0, '1649.940')] [2023-03-07 08:57:18,467][155452] Updated weights for policy 0, policy_version 62410 (0.0006) [2023-03-07 08:57:19,258][155452] Updated weights for policy 0, policy_version 62420 (0.0006) [2023-03-07 08:57:20,033][155452] Updated weights for policy 0, policy_version 62430 (0.0006) [2023-03-07 08:57:20,848][155452] Updated weights for policy 0, policy_version 62440 (0.0007) [2023-03-07 08:57:21,609][155452] Updated weights for policy 0, policy_version 62450 (0.0006) [2023-03-07 08:57:22,386][155452] Updated weights for policy 0, policy_version 62460 (0.0006) [2023-03-07 08:57:23,177][155452] Updated weights for policy 0, policy_version 62470 (0.0007) [2023-03-07 08:57:23,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13034.3). Total num frames: 63971328. Throughput: 0: 13059.2. Samples: 63951085. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 08:57:23,367][155126] Avg episode reward: [(0, '1733.236')] [2023-03-07 08:57:23,967][155452] Updated weights for policy 0, policy_version 62480 (0.0006) [2023-03-07 08:57:24,755][155452] Updated weights for policy 0, policy_version 62490 (0.0006) [2023-03-07 08:57:25,556][155452] Updated weights for policy 0, policy_version 62500 (0.0007) [2023-03-07 08:57:26,330][155452] Updated weights for policy 0, policy_version 62510 (0.0006) [2023-03-07 08:57:27,129][155452] Updated weights for policy 0, policy_version 62520 (0.0006) [2023-03-07 08:57:27,912][155452] Updated weights for policy 0, policy_version 62530 (0.0007) [2023-03-07 08:57:28,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13056.0, 300 sec: 13034.3). Total num frames: 64035840. Throughput: 0: 13049.3. Samples: 64029340. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 08:57:28,367][155126] Avg episode reward: [(0, '1806.254')] [2023-03-07 08:57:28,372][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000062535_64035840.pth... [2023-03-07 08:57:28,406][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000059480_60907520.pth [2023-03-07 08:57:28,689][155452] Updated weights for policy 0, policy_version 62540 (0.0006) [2023-03-07 08:57:29,470][155452] Updated weights for policy 0, policy_version 62550 (0.0007) [2023-03-07 08:57:30,256][155452] Updated weights for policy 0, policy_version 62560 (0.0006) [2023-03-07 08:57:31,052][155452] Updated weights for policy 0, policy_version 62570 (0.0006) [2023-03-07 08:57:31,833][155452] Updated weights for policy 0, policy_version 62580 (0.0006) [2023-03-07 08:57:32,615][155452] Updated weights for policy 0, policy_version 62590 (0.0007) [2023-03-07 08:57:33,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 64101376. Throughput: 0: 13042.7. Samples: 64068307. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 08:57:33,367][155126] Avg episode reward: [(0, '1757.178')] [2023-03-07 08:57:33,409][155452] Updated weights for policy 0, policy_version 62600 (0.0006) [2023-03-07 08:57:34,202][155452] Updated weights for policy 0, policy_version 62610 (0.0006) [2023-03-07 08:57:34,987][155452] Updated weights for policy 0, policy_version 62620 (0.0006) [2023-03-07 08:57:35,762][155452] Updated weights for policy 0, policy_version 62630 (0.0006) [2023-03-07 08:57:36,587][155452] Updated weights for policy 0, policy_version 62640 (0.0007) [2023-03-07 08:57:37,355][155452] Updated weights for policy 0, policy_version 62650 (0.0006) [2023-03-07 08:57:38,125][155452] Updated weights for policy 0, policy_version 62660 (0.0006) [2023-03-07 08:57:38,367][155126] Fps is (10 sec: 13107.0, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 64166912. Throughput: 0: 13042.3. Samples: 64146412. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 08:57:38,368][155126] Avg episode reward: [(0, '1677.058')] [2023-03-07 08:57:38,925][155452] Updated weights for policy 0, policy_version 62670 (0.0007) [2023-03-07 08:57:39,701][155452] Updated weights for policy 0, policy_version 62680 (0.0007) [2023-03-07 08:57:40,468][155452] Updated weights for policy 0, policy_version 62690 (0.0006) [2023-03-07 08:57:41,246][155452] Updated weights for policy 0, policy_version 62700 (0.0005) [2023-03-07 08:57:42,023][155452] Updated weights for policy 0, policy_version 62710 (0.0006) [2023-03-07 08:57:42,803][155452] Updated weights for policy 0, policy_version 62720 (0.0006) [2023-03-07 08:57:43,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 64232448. Throughput: 0: 13050.3. Samples: 64225208. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 08:57:43,367][155126] Avg episode reward: [(0, '1738.274')] [2023-03-07 08:57:43,598][155452] Updated weights for policy 0, policy_version 62730 (0.0006) [2023-03-07 08:57:44,363][155452] Updated weights for policy 0, policy_version 62740 (0.0006) [2023-03-07 08:57:45,151][155452] Updated weights for policy 0, policy_version 62750 (0.0006) [2023-03-07 08:57:45,953][155452] Updated weights for policy 0, policy_version 62760 (0.0006) [2023-03-07 08:57:46,713][155452] Updated weights for policy 0, policy_version 62770 (0.0006) [2023-03-07 08:57:47,506][155452] Updated weights for policy 0, policy_version 62780 (0.0006) [2023-03-07 08:57:48,302][155452] Updated weights for policy 0, policy_version 62790 (0.0007) [2023-03-07 08:57:48,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 64296960. Throughput: 0: 13045.8. Samples: 64264373. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 08:57:48,367][155126] Avg episode reward: [(0, '1489.679')] [2023-03-07 08:57:49,085][155452] Updated weights for policy 0, policy_version 62800 (0.0007) [2023-03-07 08:57:49,881][155452] Updated weights for policy 0, policy_version 62810 (0.0006) [2023-03-07 08:57:50,663][155452] Updated weights for policy 0, policy_version 62820 (0.0006) [2023-03-07 08:57:51,441][155452] Updated weights for policy 0, policy_version 62830 (0.0006) [2023-03-07 08:57:52,235][155452] Updated weights for policy 0, policy_version 62840 (0.0006) [2023-03-07 08:57:53,030][155452] Updated weights for policy 0, policy_version 62850 (0.0006) [2023-03-07 08:57:53,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13041.2). Total num frames: 64362496. Throughput: 0: 13040.0. Samples: 64342417. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 08:57:53,367][155126] Avg episode reward: [(0, '1912.682')] [2023-03-07 08:57:53,814][155452] Updated weights for policy 0, policy_version 62860 (0.0005) [2023-03-07 08:57:54,614][155452] Updated weights for policy 0, policy_version 62870 (0.0006) [2023-03-07 08:57:55,387][155452] Updated weights for policy 0, policy_version 62880 (0.0006) [2023-03-07 08:57:56,174][155452] Updated weights for policy 0, policy_version 62890 (0.0005) [2023-03-07 08:57:56,950][155452] Updated weights for policy 0, policy_version 62900 (0.0006) [2023-03-07 08:57:57,732][155452] Updated weights for policy 0, policy_version 62910 (0.0006) [2023-03-07 08:57:58,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13041.2). Total num frames: 64428032. Throughput: 0: 13033.6. Samples: 64420583. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 08:57:58,368][155126] Avg episode reward: [(0, '1709.092')] [2023-03-07 08:57:58,504][155452] Updated weights for policy 0, policy_version 62920 (0.0006) [2023-03-07 08:57:59,273][155452] Updated weights for policy 0, policy_version 62930 (0.0006) [2023-03-07 08:58:00,061][155452] Updated weights for policy 0, policy_version 62940 (0.0006) [2023-03-07 08:58:00,861][155452] Updated weights for policy 0, policy_version 62950 (0.0006) [2023-03-07 08:58:01,630][155452] Updated weights for policy 0, policy_version 62960 (0.0006) [2023-03-07 08:58:02,437][155452] Updated weights for policy 0, policy_version 62970 (0.0006) [2023-03-07 08:58:03,213][155452] Updated weights for policy 0, policy_version 62980 (0.0006) [2023-03-07 08:58:03,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13041.2). Total num frames: 64493568. Throughput: 0: 13042.4. Samples: 64460061. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 08:58:03,367][155126] Avg episode reward: [(0, '1751.277')] [2023-03-07 08:58:03,982][155452] Updated weights for policy 0, policy_version 62990 (0.0006) [2023-03-07 08:58:04,779][155452] Updated weights for policy 0, policy_version 63000 (0.0007) [2023-03-07 08:58:05,589][155452] Updated weights for policy 0, policy_version 63010 (0.0006) [2023-03-07 08:58:06,364][155452] Updated weights for policy 0, policy_version 63020 (0.0006) [2023-03-07 08:58:07,135][155452] Updated weights for policy 0, policy_version 63030 (0.0006) [2023-03-07 08:58:07,935][155452] Updated weights for policy 0, policy_version 63040 (0.0007) [2023-03-07 08:58:08,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 64558080. Throughput: 0: 13045.5. Samples: 64538134. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 08:58:08,367][155126] Avg episode reward: [(0, '1684.714')] [2023-03-07 08:58:08,730][155452] Updated weights for policy 0, policy_version 63050 (0.0006) [2023-03-07 08:58:09,498][155452] Updated weights for policy 0, policy_version 63060 (0.0006) [2023-03-07 08:58:10,299][155452] Updated weights for policy 0, policy_version 63070 (0.0006) [2023-03-07 08:58:11,079][155452] Updated weights for policy 0, policy_version 63080 (0.0007) [2023-03-07 08:58:11,862][155452] Updated weights for policy 0, policy_version 63090 (0.0006) [2023-03-07 08:58:12,651][155452] Updated weights for policy 0, policy_version 63100 (0.0006) [2023-03-07 08:58:13,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13041.2). Total num frames: 64623616. Throughput: 0: 13044.7. Samples: 64616352. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 08:58:13,367][155126] Avg episode reward: [(0, '1725.501')] [2023-03-07 08:58:13,438][155452] Updated weights for policy 0, policy_version 63110 (0.0006) [2023-03-07 08:58:14,210][155452] Updated weights for policy 0, policy_version 63120 (0.0006) [2023-03-07 08:58:15,014][155452] Updated weights for policy 0, policy_version 63130 (0.0006) [2023-03-07 08:58:15,771][155452] Updated weights for policy 0, policy_version 63140 (0.0005) [2023-03-07 08:58:16,554][155452] Updated weights for policy 0, policy_version 63150 (0.0006) [2023-03-07 08:58:17,342][155452] Updated weights for policy 0, policy_version 63160 (0.0007) [2023-03-07 08:58:18,116][155452] Updated weights for policy 0, policy_version 63170 (0.0007) [2023-03-07 08:58:18,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13041.3). Total num frames: 64689152. Throughput: 0: 13050.6. Samples: 64655582. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 08:58:18,367][155126] Avg episode reward: [(0, '1751.477')] [2023-03-07 08:58:18,917][155452] Updated weights for policy 0, policy_version 63180 (0.0006) [2023-03-07 08:58:19,699][155452] Updated weights for policy 0, policy_version 63190 (0.0006) [2023-03-07 08:58:20,501][155452] Updated weights for policy 0, policy_version 63200 (0.0006) [2023-03-07 08:58:21,285][155452] Updated weights for policy 0, policy_version 63210 (0.0006) [2023-03-07 08:58:22,075][155452] Updated weights for policy 0, policy_version 63220 (0.0007) [2023-03-07 08:58:22,858][155452] Updated weights for policy 0, policy_version 63230 (0.0007) [2023-03-07 08:58:23,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 64753664. Throughput: 0: 13050.0. Samples: 64733662. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 08:58:23,367][155126] Avg episode reward: [(0, '2018.257')] [2023-03-07 08:58:23,641][155452] Updated weights for policy 0, policy_version 63240 (0.0006) [2023-03-07 08:58:24,434][155452] Updated weights for policy 0, policy_version 63250 (0.0006) [2023-03-07 08:58:25,210][155452] Updated weights for policy 0, policy_version 63260 (0.0006) [2023-03-07 08:58:25,978][155452] Updated weights for policy 0, policy_version 63270 (0.0006) [2023-03-07 08:58:26,769][155452] Updated weights for policy 0, policy_version 63280 (0.0006) [2023-03-07 08:58:27,550][155452] Updated weights for policy 0, policy_version 63290 (0.0006) [2023-03-07 08:58:28,358][155452] Updated weights for policy 0, policy_version 63300 (0.0006) [2023-03-07 08:58:28,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 64819200. Throughput: 0: 13037.3. Samples: 64811887. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 08:58:28,367][155126] Avg episode reward: [(0, '1905.871')] [2023-03-07 08:58:29,133][155452] Updated weights for policy 0, policy_version 63310 (0.0007) [2023-03-07 08:58:29,901][155452] Updated weights for policy 0, policy_version 63320 (0.0006) [2023-03-07 08:58:30,700][155452] Updated weights for policy 0, policy_version 63330 (0.0007) [2023-03-07 08:58:31,494][155452] Updated weights for policy 0, policy_version 63340 (0.0006) [2023-03-07 08:58:32,270][155452] Updated weights for policy 0, policy_version 63350 (0.0006) [2023-03-07 08:58:33,060][155452] Updated weights for policy 0, policy_version 63360 (0.0006) [2023-03-07 08:58:33,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 64883712. Throughput: 0: 13039.3. Samples: 64851139. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 08:58:33,367][155126] Avg episode reward: [(0, '1857.102')] [2023-03-07 08:58:33,858][155452] Updated weights for policy 0, policy_version 63370 (0.0006) [2023-03-07 08:58:34,636][155452] Updated weights for policy 0, policy_version 63380 (0.0006) [2023-03-07 08:58:35,408][155452] Updated weights for policy 0, policy_version 63390 (0.0005) [2023-03-07 08:58:36,216][155452] Updated weights for policy 0, policy_version 63400 (0.0006) [2023-03-07 08:58:37,003][155452] Updated weights for policy 0, policy_version 63410 (0.0006) [2023-03-07 08:58:37,787][155452] Updated weights for policy 0, policy_version 63420 (0.0006) [2023-03-07 08:58:38,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13039.0, 300 sec: 13037.8). Total num frames: 64949248. Throughput: 0: 13037.0. Samples: 64929082. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 08:58:38,367][155126] Avg episode reward: [(0, '1907.489')] [2023-03-07 08:58:38,573][155452] Updated weights for policy 0, policy_version 63430 (0.0006) [2023-03-07 08:58:39,369][155452] Updated weights for policy 0, policy_version 63440 (0.0006) [2023-03-07 08:58:40,161][155452] Updated weights for policy 0, policy_version 63450 (0.0006) [2023-03-07 08:58:40,933][155452] Updated weights for policy 0, policy_version 63460 (0.0006) [2023-03-07 08:58:41,722][155452] Updated weights for policy 0, policy_version 63470 (0.0006) [2023-03-07 08:58:42,507][155452] Updated weights for policy 0, policy_version 63480 (0.0006) [2023-03-07 08:58:43,297][155452] Updated weights for policy 0, policy_version 63490 (0.0006) [2023-03-07 08:58:43,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.8, 300 sec: 13034.3). Total num frames: 65013760. Throughput: 0: 13036.5. Samples: 65007224. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:58:43,367][155126] Avg episode reward: [(0, '1845.893')] [2023-03-07 08:58:44,071][155452] Updated weights for policy 0, policy_version 63500 (0.0006) [2023-03-07 08:58:44,856][155452] Updated weights for policy 0, policy_version 63510 (0.0007) [2023-03-07 08:58:45,653][155452] Updated weights for policy 0, policy_version 63520 (0.0006) [2023-03-07 08:58:46,437][155452] Updated weights for policy 0, policy_version 63530 (0.0006) [2023-03-07 08:58:47,221][155452] Updated weights for policy 0, policy_version 63540 (0.0006) [2023-03-07 08:58:47,993][155452] Updated weights for policy 0, policy_version 63550 (0.0006) [2023-03-07 08:58:48,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 65079296. Throughput: 0: 13028.6. Samples: 65046350. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:58:48,367][155126] Avg episode reward: [(0, '1749.263')] [2023-03-07 08:58:48,789][155452] Updated weights for policy 0, policy_version 63560 (0.0006) [2023-03-07 08:58:49,569][155452] Updated weights for policy 0, policy_version 63570 (0.0006) [2023-03-07 08:58:50,352][155452] Updated weights for policy 0, policy_version 63580 (0.0006) [2023-03-07 08:58:51,137][155452] Updated weights for policy 0, policy_version 63590 (0.0006) [2023-03-07 08:58:51,918][155452] Updated weights for policy 0, policy_version 63600 (0.0006) [2023-03-07 08:58:52,708][155452] Updated weights for policy 0, policy_version 63610 (0.0006) [2023-03-07 08:58:53,367][155126] Fps is (10 sec: 13107.0, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 65144832. Throughput: 0: 13033.2. Samples: 65124632. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:58:53,368][155126] Avg episode reward: [(0, '1809.005')] [2023-03-07 08:58:53,512][155452] Updated weights for policy 0, policy_version 63620 (0.0007) [2023-03-07 08:58:54,300][155452] Updated weights for policy 0, policy_version 63630 (0.0006) [2023-03-07 08:58:55,087][155452] Updated weights for policy 0, policy_version 63640 (0.0006) [2023-03-07 08:58:55,869][155452] Updated weights for policy 0, policy_version 63650 (0.0007) [2023-03-07 08:58:56,683][155452] Updated weights for policy 0, policy_version 63660 (0.0007) [2023-03-07 08:58:57,447][155452] Updated weights for policy 0, policy_version 63670 (0.0006) [2023-03-07 08:58:58,261][155452] Updated weights for policy 0, policy_version 63680 (0.0006) [2023-03-07 08:58:58,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13037.8). Total num frames: 65209344. Throughput: 0: 13026.6. Samples: 65202550. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:58:58,367][155126] Avg episode reward: [(0, '1892.002')] [2023-03-07 08:58:59,034][155452] Updated weights for policy 0, policy_version 63690 (0.0006) [2023-03-07 08:58:59,816][155452] Updated weights for policy 0, policy_version 63700 (0.0006) [2023-03-07 08:59:00,608][155452] Updated weights for policy 0, policy_version 63710 (0.0007) [2023-03-07 08:59:01,417][155452] Updated weights for policy 0, policy_version 63720 (0.0006) [2023-03-07 08:59:02,190][155452] Updated weights for policy 0, policy_version 63730 (0.0006) [2023-03-07 08:59:02,957][155452] Updated weights for policy 0, policy_version 63740 (0.0006) [2023-03-07 08:59:03,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13021.9, 300 sec: 13037.8). Total num frames: 65274880. Throughput: 0: 13021.9. Samples: 65241570. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:59:03,367][155126] Avg episode reward: [(0, '1993.685')] [2023-03-07 08:59:03,755][155452] Updated weights for policy 0, policy_version 63750 (0.0006) [2023-03-07 08:59:04,531][155452] Updated weights for policy 0, policy_version 63760 (0.0006) [2023-03-07 08:59:05,320][155452] Updated weights for policy 0, policy_version 63770 (0.0006) [2023-03-07 08:59:06,110][155452] Updated weights for policy 0, policy_version 63780 (0.0007) [2023-03-07 08:59:06,889][155452] Updated weights for policy 0, policy_version 63790 (0.0006) [2023-03-07 08:59:07,669][155452] Updated weights for policy 0, policy_version 63800 (0.0005) [2023-03-07 08:59:08,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13037.8). Total num frames: 65339392. Throughput: 0: 13021.6. Samples: 65319635. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:59:08,367][155126] Avg episode reward: [(0, '1885.053')] [2023-03-07 08:59:08,472][155452] Updated weights for policy 0, policy_version 63810 (0.0006) [2023-03-07 08:59:09,261][155452] Updated weights for policy 0, policy_version 63820 (0.0008) [2023-03-07 08:59:10,049][155452] Updated weights for policy 0, policy_version 63830 (0.0006) [2023-03-07 08:59:10,833][155452] Updated weights for policy 0, policy_version 63840 (0.0006) [2023-03-07 08:59:11,631][155452] Updated weights for policy 0, policy_version 63850 (0.0006) [2023-03-07 08:59:12,411][155452] Updated weights for policy 0, policy_version 63860 (0.0006) [2023-03-07 08:59:13,198][155452] Updated weights for policy 0, policy_version 63870 (0.0006) [2023-03-07 08:59:13,367][155126] Fps is (10 sec: 12902.3, 60 sec: 13004.8, 300 sec: 13034.3). Total num frames: 65403904. Throughput: 0: 13012.6. Samples: 65397454. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:59:13,367][155126] Avg episode reward: [(0, '1902.350')] [2023-03-07 08:59:13,983][155452] Updated weights for policy 0, policy_version 63880 (0.0006) [2023-03-07 08:59:14,773][155452] Updated weights for policy 0, policy_version 63890 (0.0007) [2023-03-07 08:59:15,557][155452] Updated weights for policy 0, policy_version 63900 (0.0005) [2023-03-07 08:59:16,356][155452] Updated weights for policy 0, policy_version 63910 (0.0006) [2023-03-07 08:59:17,126][155452] Updated weights for policy 0, policy_version 63920 (0.0006) [2023-03-07 08:59:17,903][155452] Updated weights for policy 0, policy_version 63930 (0.0007) [2023-03-07 08:59:18,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13004.8, 300 sec: 13034.3). Total num frames: 65469440. Throughput: 0: 13010.3. Samples: 65436604. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:59:18,367][155126] Avg episode reward: [(0, '1959.204')] [2023-03-07 08:59:18,710][155452] Updated weights for policy 0, policy_version 63940 (0.0006) [2023-03-07 08:59:19,483][155452] Updated weights for policy 0, policy_version 63950 (0.0006) [2023-03-07 08:59:20,284][155452] Updated weights for policy 0, policy_version 63960 (0.0006) [2023-03-07 08:59:21,058][155452] Updated weights for policy 0, policy_version 63970 (0.0006) [2023-03-07 08:59:21,829][155452] Updated weights for policy 0, policy_version 63980 (0.0007) [2023-03-07 08:59:22,634][155452] Updated weights for policy 0, policy_version 63990 (0.0006) [2023-03-07 08:59:23,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13021.8, 300 sec: 13034.3). Total num frames: 65534976. Throughput: 0: 13015.7. Samples: 65514791. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:59:23,368][155126] Avg episode reward: [(0, '2091.768')] [2023-03-07 08:59:23,417][155452] Updated weights for policy 0, policy_version 64000 (0.0005) [2023-03-07 08:59:24,211][155452] Updated weights for policy 0, policy_version 64010 (0.0006) [2023-03-07 08:59:25,009][155452] Updated weights for policy 0, policy_version 64020 (0.0005) [2023-03-07 08:59:25,816][155452] Updated weights for policy 0, policy_version 64030 (0.0007) [2023-03-07 08:59:26,595][155452] Updated weights for policy 0, policy_version 64040 (0.0007) [2023-03-07 08:59:27,369][155452] Updated weights for policy 0, policy_version 64050 (0.0006) [2023-03-07 08:59:28,140][155452] Updated weights for policy 0, policy_version 64060 (0.0007) [2023-03-07 08:59:28,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13034.3). Total num frames: 65599488. Throughput: 0: 13012.1. Samples: 65592767. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:59:28,367][155126] Avg episode reward: [(0, '1945.597')] [2023-03-07 08:59:28,378][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000064063_65600512.pth... 
[2023-03-07 08:59:28,407][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000061007_62471168.pth [2023-03-07 08:59:28,936][155452] Updated weights for policy 0, policy_version 64070 (0.0006) [2023-03-07 08:59:29,727][155452] Updated weights for policy 0, policy_version 64080 (0.0006) [2023-03-07 08:59:30,489][155452] Updated weights for policy 0, policy_version 64090 (0.0007) [2023-03-07 08:59:31,262][155452] Updated weights for policy 0, policy_version 64100 (0.0006) [2023-03-07 08:59:32,052][155452] Updated weights for policy 0, policy_version 64110 (0.0006) [2023-03-07 08:59:32,840][155452] Updated weights for policy 0, policy_version 64120 (0.0006) [2023-03-07 08:59:33,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 65665024. Throughput: 0: 13018.7. Samples: 65632191. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:59:33,367][155126] Avg episode reward: [(0, '1870.144')] [2023-03-07 08:59:33,630][155452] Updated weights for policy 0, policy_version 64130 (0.0006) [2023-03-07 08:59:34,401][155452] Updated weights for policy 0, policy_version 64140 (0.0007) [2023-03-07 08:59:35,182][155452] Updated weights for policy 0, policy_version 64150 (0.0006) [2023-03-07 08:59:35,973][155452] Updated weights for policy 0, policy_version 64160 (0.0007) [2023-03-07 08:59:36,757][155452] Updated weights for policy 0, policy_version 64170 (0.0006) [2023-03-07 08:59:37,548][155452] Updated weights for policy 0, policy_version 64180 (0.0006) [2023-03-07 08:59:38,349][155452] Updated weights for policy 0, policy_version 64190 (0.0007) [2023-03-07 08:59:38,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 65730560. Throughput: 0: 13022.7. Samples: 65710651. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:59:38,367][155126] Avg episode reward: [(0, '1853.567')] [2023-03-07 08:59:39,131][155452] Updated weights for policy 0, policy_version 64200 (0.0007) [2023-03-07 08:59:39,902][155452] Updated weights for policy 0, policy_version 64210 (0.0006) [2023-03-07 08:59:40,696][155452] Updated weights for policy 0, policy_version 64220 (0.0006) [2023-03-07 08:59:41,476][155452] Updated weights for policy 0, policy_version 64230 (0.0007) [2023-03-07 08:59:42,256][155452] Updated weights for policy 0, policy_version 64240 (0.0006) [2023-03-07 08:59:43,051][155452] Updated weights for policy 0, policy_version 64250 (0.0006) [2023-03-07 08:59:43,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 65796096. Throughput: 0: 13028.0. Samples: 65788809. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:59:43,367][155126] Avg episode reward: [(0, '2013.211')] [2023-03-07 08:59:43,816][155452] Updated weights for policy 0, policy_version 64260 (0.0006) [2023-03-07 08:59:44,601][155452] Updated weights for policy 0, policy_version 64270 (0.0007) [2023-03-07 08:59:45,381][155452] Updated weights for policy 0, policy_version 64280 (0.0005) [2023-03-07 08:59:46,151][155452] Updated weights for policy 0, policy_version 64290 (0.0006) [2023-03-07 08:59:46,958][155452] Updated weights for policy 0, policy_version 64300 (0.0006) [2023-03-07 08:59:47,724][155452] Updated weights for policy 0, policy_version 64310 (0.0006) [2023-03-07 08:59:48,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13021.8, 300 sec: 13034.3). Total num frames: 65860608. Throughput: 0: 13036.9. Samples: 65828232. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:59:48,368][155126] Avg episode reward: [(0, '2202.715')] [2023-03-07 08:59:48,530][155452] Updated weights for policy 0, policy_version 64320 (0.0007) [2023-03-07 08:59:49,332][155452] Updated weights for policy 0, policy_version 64330 (0.0006) [2023-03-07 08:59:50,118][155452] Updated weights for policy 0, policy_version 64340 (0.0006) [2023-03-07 08:59:50,905][155452] Updated weights for policy 0, policy_version 64350 (0.0006) [2023-03-07 08:59:51,693][155452] Updated weights for policy 0, policy_version 64360 (0.0006) [2023-03-07 08:59:52,476][155452] Updated weights for policy 0, policy_version 64370 (0.0007) [2023-03-07 08:59:53,260][155452] Updated weights for policy 0, policy_version 64380 (0.0007) [2023-03-07 08:59:53,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 65926144. Throughput: 0: 13031.9. Samples: 65906069. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:59:53,367][155126] Avg episode reward: [(0, '2004.501')] [2023-03-07 08:59:54,038][155452] Updated weights for policy 0, policy_version 64390 (0.0006) [2023-03-07 08:59:54,834][155452] Updated weights for policy 0, policy_version 64400 (0.0006) [2023-03-07 08:59:55,616][155452] Updated weights for policy 0, policy_version 64410 (0.0006) [2023-03-07 08:59:56,391][155452] Updated weights for policy 0, policy_version 64420 (0.0006) [2023-03-07 08:59:57,173][155452] Updated weights for policy 0, policy_version 64430 (0.0006) [2023-03-07 08:59:57,950][155452] Updated weights for policy 0, policy_version 64440 (0.0006) [2023-03-07 08:59:58,367][155126] Fps is (10 sec: 13107.4, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 65991680. Throughput: 0: 13043.6. Samples: 65984416. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 08:59:58,367][155126] Avg episode reward: [(0, '1859.405')] [2023-03-07 08:59:58,738][155452] Updated weights for policy 0, policy_version 64450 (0.0006) [2023-03-07 08:59:59,528][155452] Updated weights for policy 0, policy_version 64460 (0.0006) [2023-03-07 09:00:00,313][155452] Updated weights for policy 0, policy_version 64470 (0.0006) [2023-03-07 09:00:01,095][155452] Updated weights for policy 0, policy_version 64480 (0.0006) [2023-03-07 09:00:01,857][155452] Updated weights for policy 0, policy_version 64490 (0.0006) [2023-03-07 09:00:02,662][155452] Updated weights for policy 0, policy_version 64500 (0.0007) [2023-03-07 09:00:03,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13039.0, 300 sec: 13037.8). Total num frames: 66057216. Throughput: 0: 13044.0. Samples: 66023581. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 09:00:03,367][155126] Avg episode reward: [(0, '1820.417')] [2023-03-07 09:00:03,448][155452] Updated weights for policy 0, policy_version 64510 (0.0006) [2023-03-07 09:00:04,228][155452] Updated weights for policy 0, policy_version 64520 (0.0006) [2023-03-07 09:00:05,010][155452] Updated weights for policy 0, policy_version 64530 (0.0006) [2023-03-07 09:00:05,786][155452] Updated weights for policy 0, policy_version 64540 (0.0007) [2023-03-07 09:00:06,569][155452] Updated weights for policy 0, policy_version 64550 (0.0006) [2023-03-07 09:00:07,334][155452] Updated weights for policy 0, policy_version 64560 (0.0006) [2023-03-07 09:00:08,132][155452] Updated weights for policy 0, policy_version 64570 (0.0006) [2023-03-07 09:00:08,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13041.2). Total num frames: 66122752. Throughput: 0: 13053.2. Samples: 66102184. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 09:00:08,367][155126] Avg episode reward: [(0, '1982.271')] [2023-03-07 09:00:08,897][155452] Updated weights for policy 0, policy_version 64580 (0.0006) [2023-03-07 09:00:09,698][155452] Updated weights for policy 0, policy_version 64590 (0.0006) [2023-03-07 09:00:10,494][155452] Updated weights for policy 0, policy_version 64600 (0.0007) [2023-03-07 09:00:11,273][155452] Updated weights for policy 0, policy_version 64610 (0.0007) [2023-03-07 09:00:12,050][155452] Updated weights for policy 0, policy_version 64620 (0.0006) [2023-03-07 09:00:12,855][155452] Updated weights for policy 0, policy_version 64630 (0.0006) [2023-03-07 09:00:13,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 66187264. Throughput: 0: 13061.7. Samples: 66180544. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 09:00:13,367][155126] Avg episode reward: [(0, '1929.109')] [2023-03-07 09:00:13,638][155452] Updated weights for policy 0, policy_version 64640 (0.0006) [2023-03-07 09:00:14,419][155452] Updated weights for policy 0, policy_version 64650 (0.0006) [2023-03-07 09:00:15,202][155452] Updated weights for policy 0, policy_version 64660 (0.0006) [2023-03-07 09:00:15,983][155452] Updated weights for policy 0, policy_version 64670 (0.0006) [2023-03-07 09:00:16,765][155452] Updated weights for policy 0, policy_version 64680 (0.0005) [2023-03-07 09:00:17,546][155452] Updated weights for policy 0, policy_version 64690 (0.0006) [2023-03-07 09:00:18,327][155452] Updated weights for policy 0, policy_version 64700 (0.0007) [2023-03-07 09:00:18,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13041.3). Total num frames: 66252800. Throughput: 0: 13056.3. Samples: 66219724. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 09:00:18,367][155126] Avg episode reward: [(0, '1913.025')] [2023-03-07 09:00:19,121][155452] Updated weights for policy 0, policy_version 64710 (0.0006) [2023-03-07 09:00:19,923][155452] Updated weights for policy 0, policy_version 64720 (0.0006) [2023-03-07 09:00:20,694][155452] Updated weights for policy 0, policy_version 64730 (0.0006) [2023-03-07 09:00:21,505][155452] Updated weights for policy 0, policy_version 64740 (0.0007) [2023-03-07 09:00:22,287][155452] Updated weights for policy 0, policy_version 64750 (0.0006) [2023-03-07 09:00:23,074][155452] Updated weights for policy 0, policy_version 64760 (0.0006) [2023-03-07 09:00:23,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 66317312. Throughput: 0: 13043.6. Samples: 66297615. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 09:00:23,367][155126] Avg episode reward: [(0, '1840.202')] [2023-03-07 09:00:23,859][155452] Updated weights for policy 0, policy_version 64770 (0.0006) [2023-03-07 09:00:24,638][155452] Updated weights for policy 0, policy_version 64780 (0.0006) [2023-03-07 09:00:25,409][155452] Updated weights for policy 0, policy_version 64790 (0.0006) [2023-03-07 09:00:26,212][155452] Updated weights for policy 0, policy_version 64800 (0.0006) [2023-03-07 09:00:27,000][155452] Updated weights for policy 0, policy_version 64810 (0.0006) [2023-03-07 09:00:27,758][155452] Updated weights for policy 0, policy_version 64820 (0.0006) [2023-03-07 09:00:28,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 66382848. Throughput: 0: 13048.5. Samples: 66375993. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 09:00:28,367][155126] Avg episode reward: [(0, '1895.314')] [2023-03-07 09:00:28,541][155452] Updated weights for policy 0, policy_version 64830 (0.0007) [2023-03-07 09:00:29,349][155452] Updated weights for policy 0, policy_version 64840 (0.0007) [2023-03-07 09:00:30,124][155452] Updated weights for policy 0, policy_version 64850 (0.0006) [2023-03-07 09:00:30,892][155452] Updated weights for policy 0, policy_version 64860 (0.0007) [2023-03-07 09:00:31,657][155452] Updated weights for policy 0, policy_version 64870 (0.0006) [2023-03-07 09:00:32,466][155452] Updated weights for policy 0, policy_version 64880 (0.0006) [2023-03-07 09:00:33,257][155452] Updated weights for policy 0, policy_version 64890 (0.0007) [2023-03-07 09:00:33,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13041.2). Total num frames: 66448384. Throughput: 0: 13044.9. Samples: 66415252. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 09:00:33,367][155126] Avg episode reward: [(0, '1893.438')] [2023-03-07 09:00:34,048][155452] Updated weights for policy 0, policy_version 64900 (0.0006) [2023-03-07 09:00:34,833][155452] Updated weights for policy 0, policy_version 64910 (0.0006) [2023-03-07 09:00:35,639][155452] Updated weights for policy 0, policy_version 64920 (0.0006) [2023-03-07 09:00:36,431][155452] Updated weights for policy 0, policy_version 64930 (0.0006) [2023-03-07 09:00:37,211][155452] Updated weights for policy 0, policy_version 64940 (0.0005) [2023-03-07 09:00:38,003][155452] Updated weights for policy 0, policy_version 64950 (0.0006) [2023-03-07 09:00:38,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13041.2). Total num frames: 66512896. Throughput: 0: 13046.6. Samples: 66493166. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 09:00:38,367][155126] Avg episode reward: [(0, '1739.458')] [2023-03-07 09:00:38,789][155452] Updated weights for policy 0, policy_version 64960 (0.0007) [2023-03-07 09:00:39,585][155452] Updated weights for policy 0, policy_version 64970 (0.0006) [2023-03-07 09:00:40,373][155452] Updated weights for policy 0, policy_version 64980 (0.0006) [2023-03-07 09:00:41,160][155452] Updated weights for policy 0, policy_version 64990 (0.0006) [2023-03-07 09:00:41,924][155452] Updated weights for policy 0, policy_version 65000 (0.0006) [2023-03-07 09:00:42,713][155452] Updated weights for policy 0, policy_version 65010 (0.0006) [2023-03-07 09:00:43,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13038.9, 300 sec: 13041.2). Total num frames: 66578432. Throughput: 0: 13041.5. Samples: 66571284. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:00:43,368][155126] Avg episode reward: [(0, '1823.200')] [2023-03-07 09:00:43,517][155452] Updated weights for policy 0, policy_version 65020 (0.0006) [2023-03-07 09:00:44,280][155452] Updated weights for policy 0, policy_version 65030 (0.0006) [2023-03-07 09:00:45,050][155452] Updated weights for policy 0, policy_version 65040 (0.0006) [2023-03-07 09:00:45,844][155452] Updated weights for policy 0, policy_version 65050 (0.0007) [2023-03-07 09:00:46,637][155452] Updated weights for policy 0, policy_version 65060 (0.0006) [2023-03-07 09:00:47,417][155452] Updated weights for policy 0, policy_version 65070 (0.0006) [2023-03-07 09:00:48,196][155452] Updated weights for policy 0, policy_version 65080 (0.0006) [2023-03-07 09:00:48,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13039.0, 300 sec: 13037.8). Total num frames: 66642944. Throughput: 0: 13042.1. Samples: 66610474. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:00:48,367][155126] Avg episode reward: [(0, '1838.913')] [2023-03-07 09:00:48,981][155452] Updated weights for policy 0, policy_version 65090 (0.0006) [2023-03-07 09:00:49,774][155452] Updated weights for policy 0, policy_version 65100 (0.0006) [2023-03-07 09:00:50,558][155452] Updated weights for policy 0, policy_version 65110 (0.0005) [2023-03-07 09:00:51,331][155452] Updated weights for policy 0, policy_version 65120 (0.0006) [2023-03-07 09:00:52,119][155452] Updated weights for policy 0, policy_version 65130 (0.0006) [2023-03-07 09:00:52,907][155452] Updated weights for policy 0, policy_version 65140 (0.0006) [2023-03-07 09:00:53,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 66708480. Throughput: 0: 13036.8. Samples: 66688840. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:00:53,378][155126] Avg episode reward: [(0, '1750.029')] [2023-03-07 09:00:53,680][155452] Updated weights for policy 0, policy_version 65150 (0.0007) [2023-03-07 09:00:54,480][155452] Updated weights for policy 0, policy_version 65160 (0.0006) [2023-03-07 09:00:55,249][155452] Updated weights for policy 0, policy_version 65170 (0.0006) [2023-03-07 09:00:56,042][155452] Updated weights for policy 0, policy_version 65180 (0.0005) [2023-03-07 09:00:56,814][155452] Updated weights for policy 0, policy_version 65190 (0.0006) [2023-03-07 09:00:57,615][155452] Updated weights for policy 0, policy_version 65200 (0.0006) [2023-03-07 09:00:58,367][155126] Fps is (10 sec: 13106.8, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 66774016. Throughput: 0: 13037.3. Samples: 66767223. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:00:58,378][155126] Avg episode reward: [(0, '1799.072')] [2023-03-07 09:00:58,385][155452] Updated weights for policy 0, policy_version 65210 (0.0006) [2023-03-07 09:00:59,158][155452] Updated weights for policy 0, policy_version 65220 (0.0006) [2023-03-07 09:00:59,932][155452] Updated weights for policy 0, policy_version 65230 (0.0006) [2023-03-07 09:01:00,737][155452] Updated weights for policy 0, policy_version 65240 (0.0006) [2023-03-07 09:01:01,518][155452] Updated weights for policy 0, policy_version 65250 (0.0006) [2023-03-07 09:01:02,301][155452] Updated weights for policy 0, policy_version 65260 (0.0005) [2023-03-07 09:01:03,089][155452] Updated weights for policy 0, policy_version 65270 (0.0006) [2023-03-07 09:01:03,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13041.2). Total num frames: 66839552. Throughput: 0: 13042.3. Samples: 66806629. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:01:03,367][155126] Avg episode reward: [(0, '1796.043')] [2023-03-07 09:01:03,879][155452] Updated weights for policy 0, policy_version 65280 (0.0006) [2023-03-07 09:01:04,669][155452] Updated weights for policy 0, policy_version 65290 (0.0006) [2023-03-07 09:01:05,446][155452] Updated weights for policy 0, policy_version 65300 (0.0007) [2023-03-07 09:01:06,233][155452] Updated weights for policy 0, policy_version 65310 (0.0007) [2023-03-07 09:01:07,024][155452] Updated weights for policy 0, policy_version 65320 (0.0006) [2023-03-07 09:01:07,819][155452] Updated weights for policy 0, policy_version 65330 (0.0006) [2023-03-07 09:01:08,367][155126] Fps is (10 sec: 13107.5, 60 sec: 13038.9, 300 sec: 13044.7). Total num frames: 66905088. Throughput: 0: 13046.1. Samples: 66884690. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:01:08,367][155126] Avg episode reward: [(0, '1857.407')] [2023-03-07 09:01:08,587][155452] Updated weights for policy 0, policy_version 65340 (0.0005) [2023-03-07 09:01:09,373][155452] Updated weights for policy 0, policy_version 65350 (0.0006) [2023-03-07 09:01:10,155][155452] Updated weights for policy 0, policy_version 65360 (0.0006) [2023-03-07 09:01:10,955][155452] Updated weights for policy 0, policy_version 65370 (0.0006) [2023-03-07 09:01:11,746][155452] Updated weights for policy 0, policy_version 65380 (0.0006) [2023-03-07 09:01:12,529][155452] Updated weights for policy 0, policy_version 65390 (0.0006) [2023-03-07 09:01:13,321][155452] Updated weights for policy 0, policy_version 65400 (0.0006) [2023-03-07 09:01:13,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13038.9, 300 sec: 13041.2). Total num frames: 66969600. Throughput: 0: 13036.2. Samples: 66962624. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:01:13,378][155126] Avg episode reward: [(0, '1713.434')] [2023-03-07 09:01:14,133][155452] Updated weights for policy 0, policy_version 65410 (0.0007) [2023-03-07 09:01:14,915][155452] Updated weights for policy 0, policy_version 65420 (0.0007) [2023-03-07 09:01:15,694][155452] Updated weights for policy 0, policy_version 65430 (0.0007) [2023-03-07 09:01:16,476][155452] Updated weights for policy 0, policy_version 65440 (0.0006) [2023-03-07 09:01:17,263][155452] Updated weights for policy 0, policy_version 65450 (0.0006) [2023-03-07 09:01:18,050][155452] Updated weights for policy 0, policy_version 65460 (0.0006) [2023-03-07 09:01:18,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13041.2). Total num frames: 67035136. Throughput: 0: 13028.3. Samples: 67001524. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:01:18,367][155126] Avg episode reward: [(0, '1778.040')] [2023-03-07 09:01:18,842][155452] Updated weights for policy 0, policy_version 65470 (0.0006) [2023-03-07 09:01:19,642][155452] Updated weights for policy 0, policy_version 65480 (0.0006) [2023-03-07 09:01:20,406][155452] Updated weights for policy 0, policy_version 65490 (0.0006) [2023-03-07 09:01:21,198][155452] Updated weights for policy 0, policy_version 65500 (0.0006) [2023-03-07 09:01:21,993][155452] Updated weights for policy 0, policy_version 65510 (0.0006) [2023-03-07 09:01:22,775][155452] Updated weights for policy 0, policy_version 65520 (0.0006) [2023-03-07 09:01:23,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13041.3). Total num frames: 67099648. Throughput: 0: 13033.5. Samples: 67079672. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:01:23,367][155126] Avg episode reward: [(0, '1699.873')] [2023-03-07 09:01:23,557][155452] Updated weights for policy 0, policy_version 65530 (0.0006) [2023-03-07 09:01:24,339][155452] Updated weights for policy 0, policy_version 65540 (0.0005) [2023-03-07 09:01:25,131][155452] Updated weights for policy 0, policy_version 65550 (0.0006) [2023-03-07 09:01:25,934][155452] Updated weights for policy 0, policy_version 65560 (0.0007) [2023-03-07 09:01:26,727][155452] Updated weights for policy 0, policy_version 65570 (0.0006) [2023-03-07 09:01:27,514][155452] Updated weights for policy 0, policy_version 65580 (0.0007) [2023-03-07 09:01:28,312][155452] Updated weights for policy 0, policy_version 65590 (0.0006) [2023-03-07 09:01:28,367][155126] Fps is (10 sec: 12902.4, 60 sec: 13021.9, 300 sec: 13037.8). Total num frames: 67164160. Throughput: 0: 13027.1. Samples: 67157502. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:01:28,367][155126] Avg episode reward: [(0, '1857.725')] [2023-03-07 09:01:28,384][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000065591_67165184.pth... [2023-03-07 09:01:28,415][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000062535_64035840.pth [2023-03-07 09:01:29,104][155452] Updated weights for policy 0, policy_version 65600 (0.0006) [2023-03-07 09:01:29,878][155452] Updated weights for policy 0, policy_version 65610 (0.0007) [2023-03-07 09:01:30,665][155452] Updated weights for policy 0, policy_version 65620 (0.0006) [2023-03-07 09:01:31,454][155452] Updated weights for policy 0, policy_version 65630 (0.0007) [2023-03-07 09:01:32,235][155452] Updated weights for policy 0, policy_version 65640 (0.0006) [2023-03-07 09:01:33,037][155452] Updated weights for policy 0, policy_version 65650 (0.0006) [2023-03-07 09:01:33,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.8, 300 sec: 13037.8). Total num frames: 67229696. Throughput: 0: 13026.4. Samples: 67196663. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:01:33,378][155126] Avg episode reward: [(0, '1974.040')] [2023-03-07 09:01:33,831][155452] Updated weights for policy 0, policy_version 65660 (0.0006) [2023-03-07 09:01:34,609][155452] Updated weights for policy 0, policy_version 65670 (0.0006) [2023-03-07 09:01:35,412][155452] Updated weights for policy 0, policy_version 65680 (0.0006) [2023-03-07 09:01:36,211][155452] Updated weights for policy 0, policy_version 65690 (0.0006) [2023-03-07 09:01:37,008][155452] Updated weights for policy 0, policy_version 65700 (0.0006) [2023-03-07 09:01:37,760][155452] Updated weights for policy 0, policy_version 65710 (0.0007) [2023-03-07 09:01:38,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 67294208. Throughput: 0: 13007.6. 
Samples: 67274181. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:01:38,367][155126] Avg episode reward: [(0, '1895.946')] [2023-03-07 09:01:38,543][155452] Updated weights for policy 0, policy_version 65720 (0.0006) [2023-03-07 09:01:39,335][155452] Updated weights for policy 0, policy_version 65730 (0.0006) [2023-03-07 09:01:40,109][155452] Updated weights for policy 0, policy_version 65740 (0.0007) [2023-03-07 09:01:40,892][155452] Updated weights for policy 0, policy_version 65750 (0.0005) [2023-03-07 09:01:41,700][155452] Updated weights for policy 0, policy_version 65760 (0.0008) [2023-03-07 09:01:42,487][155452] Updated weights for policy 0, policy_version 65770 (0.0007) [2023-03-07 09:01:43,262][155452] Updated weights for policy 0, policy_version 65780 (0.0006) [2023-03-07 09:01:43,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 67359744. Throughput: 0: 13008.7. Samples: 67352612. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:01:43,378][155126] Avg episode reward: [(0, '1899.030')] [2023-03-07 09:01:44,062][155452] Updated weights for policy 0, policy_version 65790 (0.0006) [2023-03-07 09:01:44,846][155452] Updated weights for policy 0, policy_version 65800 (0.0006) [2023-03-07 09:01:45,624][155452] Updated weights for policy 0, policy_version 65810 (0.0006) [2023-03-07 09:01:46,418][155452] Updated weights for policy 0, policy_version 65820 (0.0006) [2023-03-07 09:01:47,216][155452] Updated weights for policy 0, policy_version 65830 (0.0006) [2023-03-07 09:01:47,997][155452] Updated weights for policy 0, policy_version 65840 (0.0006) [2023-03-07 09:01:48,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.8, 300 sec: 13034.3). Total num frames: 67424256. Throughput: 0: 12999.8. Samples: 67391619. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:01:48,367][155126] Avg episode reward: [(0, '2055.880')] [2023-03-07 09:01:48,797][155452] Updated weights for policy 0, policy_version 65850 (0.0006) [2023-03-07 09:01:49,586][155452] Updated weights for policy 0, policy_version 65860 (0.0006) [2023-03-07 09:01:50,361][155452] Updated weights for policy 0, policy_version 65870 (0.0006) [2023-03-07 09:01:51,147][155452] Updated weights for policy 0, policy_version 65880 (0.0006) [2023-03-07 09:01:51,943][155452] Updated weights for policy 0, policy_version 65890 (0.0006) [2023-03-07 09:01:52,720][155452] Updated weights for policy 0, policy_version 65900 (0.0007) [2023-03-07 09:01:53,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 67489792. Throughput: 0: 12995.5. Samples: 67469486. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:01:53,367][155126] Avg episode reward: [(0, '2073.136')] [2023-03-07 09:01:53,501][155452] Updated weights for policy 0, policy_version 65910 (0.0006) [2023-03-07 09:01:54,313][155452] Updated weights for policy 0, policy_version 65920 (0.0006) [2023-03-07 09:01:55,105][155452] Updated weights for policy 0, policy_version 65930 (0.0007) [2023-03-07 09:01:55,878][155452] Updated weights for policy 0, policy_version 65940 (0.0006) [2023-03-07 09:01:56,674][155452] Updated weights for policy 0, policy_version 65950 (0.0007) [2023-03-07 09:01:57,460][155452] Updated weights for policy 0, policy_version 65960 (0.0007) [2023-03-07 09:01:58,241][155452] Updated weights for policy 0, policy_version 65970 (0.0006) [2023-03-07 09:01:58,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13004.8, 300 sec: 13030.8). Total num frames: 67554304. Throughput: 0: 12995.2. Samples: 67547407. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:01:58,367][155126] Avg episode reward: [(0, '2016.941')] [2023-03-07 09:01:59,029][155452] Updated weights for policy 0, policy_version 65980 (0.0006) [2023-03-07 09:01:59,801][155452] Updated weights for policy 0, policy_version 65990 (0.0006) [2023-03-07 09:02:00,589][155452] Updated weights for policy 0, policy_version 66000 (0.0007) [2023-03-07 09:02:01,375][155452] Updated weights for policy 0, policy_version 66010 (0.0006) [2023-03-07 09:02:02,159][155452] Updated weights for policy 0, policy_version 66020 (0.0006) [2023-03-07 09:02:02,944][155452] Updated weights for policy 0, policy_version 66030 (0.0006) [2023-03-07 09:02:03,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13030.8). Total num frames: 67619840. Throughput: 0: 12998.9. Samples: 67586474. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:02:03,367][155126] Avg episode reward: [(0, '2015.547')] [2023-03-07 09:02:03,750][155452] Updated weights for policy 0, policy_version 66040 (0.0005) [2023-03-07 09:02:04,542][155452] Updated weights for policy 0, policy_version 66050 (0.0007) [2023-03-07 09:02:05,341][155452] Updated weights for policy 0, policy_version 66060 (0.0006) [2023-03-07 09:02:06,134][155452] Updated weights for policy 0, policy_version 66070 (0.0007) [2023-03-07 09:02:06,921][155452] Updated weights for policy 0, policy_version 66080 (0.0006) [2023-03-07 09:02:07,715][155452] Updated weights for policy 0, policy_version 66090 (0.0006) [2023-03-07 09:02:08,367][155126] Fps is (10 sec: 13004.7, 60 sec: 12987.7, 300 sec: 13027.4). Total num frames: 67684352. Throughput: 0: 12992.7. Samples: 67664342. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:02:08,367][155126] Avg episode reward: [(0, '1986.217')] [2023-03-07 09:02:08,499][155452] Updated weights for policy 0, policy_version 66100 (0.0006) [2023-03-07 09:02:09,298][155452] Updated weights for policy 0, policy_version 66110 (0.0006) [2023-03-07 09:02:10,076][155452] Updated weights for policy 0, policy_version 66120 (0.0006) [2023-03-07 09:02:10,860][155452] Updated weights for policy 0, policy_version 66130 (0.0007) [2023-03-07 09:02:11,647][155452] Updated weights for policy 0, policy_version 66140 (0.0006) [2023-03-07 09:02:12,434][155452] Updated weights for policy 0, policy_version 66150 (0.0007) [2023-03-07 09:02:13,217][155452] Updated weights for policy 0, policy_version 66160 (0.0006) [2023-03-07 09:02:13,367][155126] Fps is (10 sec: 12902.2, 60 sec: 12987.7, 300 sec: 13027.4). Total num frames: 67748864. Throughput: 0: 12996.2. Samples: 67742330. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:02:13,367][155126] Avg episode reward: [(0, '2066.356')] [2023-03-07 09:02:14,014][155452] Updated weights for policy 0, policy_version 66170 (0.0006) [2023-03-07 09:02:14,789][155452] Updated weights for policy 0, policy_version 66180 (0.0006) [2023-03-07 09:02:15,574][155452] Updated weights for policy 0, policy_version 66190 (0.0007) [2023-03-07 09:02:16,388][155452] Updated weights for policy 0, policy_version 66200 (0.0006) [2023-03-07 09:02:17,166][155452] Updated weights for policy 0, policy_version 66210 (0.0006) [2023-03-07 09:02:17,949][155452] Updated weights for policy 0, policy_version 66220 (0.0006) [2023-03-07 09:02:18,367][155126] Fps is (10 sec: 13004.8, 60 sec: 12987.7, 300 sec: 13027.4). Total num frames: 67814400. Throughput: 0: 12990.0. Samples: 67781214. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:02:18,367][155126] Avg episode reward: [(0, '2118.429')] [2023-03-07 09:02:18,745][155452] Updated weights for policy 0, policy_version 66230 (0.0007) [2023-03-07 09:02:19,528][155452] Updated weights for policy 0, policy_version 66240 (0.0006) [2023-03-07 09:02:20,318][155452] Updated weights for policy 0, policy_version 66250 (0.0006) [2023-03-07 09:02:21,096][155452] Updated weights for policy 0, policy_version 66260 (0.0006) [2023-03-07 09:02:21,893][155452] Updated weights for policy 0, policy_version 66270 (0.0006) [2023-03-07 09:02:22,684][155452] Updated weights for policy 0, policy_version 66280 (0.0006) [2023-03-07 09:02:23,367][155126] Fps is (10 sec: 13004.9, 60 sec: 12987.7, 300 sec: 13027.4). Total num frames: 67878912. Throughput: 0: 13002.3. Samples: 67859285. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:02:23,367][155126] Avg episode reward: [(0, '2068.800')] [2023-03-07 09:02:23,474][155452] Updated weights for policy 0, policy_version 66290 (0.0006) [2023-03-07 09:02:24,255][155452] Updated weights for policy 0, policy_version 66300 (0.0006) [2023-03-07 09:02:25,038][155452] Updated weights for policy 0, policy_version 66310 (0.0006) [2023-03-07 09:02:25,824][155452] Updated weights for policy 0, policy_version 66320 (0.0006) [2023-03-07 09:02:26,610][155452] Updated weights for policy 0, policy_version 66330 (0.0007) [2023-03-07 09:02:27,406][155452] Updated weights for policy 0, policy_version 66340 (0.0006) [2023-03-07 09:02:28,187][155452] Updated weights for policy 0, policy_version 66350 (0.0006) [2023-03-07 09:02:28,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13027.4). Total num frames: 67944448. Throughput: 0: 12990.3. Samples: 67937174. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:02:28,367][155126] Avg episode reward: [(0, '2117.940')] [2023-03-07 09:02:28,978][155452] Updated weights for policy 0, policy_version 66360 (0.0006) [2023-03-07 09:02:29,756][155452] Updated weights for policy 0, policy_version 66370 (0.0006) [2023-03-07 09:02:30,541][155452] Updated weights for policy 0, policy_version 66380 (0.0006) [2023-03-07 09:02:31,350][155452] Updated weights for policy 0, policy_version 66390 (0.0006) [2023-03-07 09:02:32,126][155452] Updated weights for policy 0, policy_version 66400 (0.0006) [2023-03-07 09:02:32,906][155452] Updated weights for policy 0, policy_version 66410 (0.0006) [2023-03-07 09:02:33,367][155126] Fps is (10 sec: 13004.7, 60 sec: 12987.7, 300 sec: 13023.9). Total num frames: 68008960. Throughput: 0: 12994.5. Samples: 67976374. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:02:33,368][155126] Avg episode reward: [(0, '2121.621')] [2023-03-07 09:02:33,698][155452] Updated weights for policy 0, policy_version 66420 (0.0006) [2023-03-07 09:02:34,485][155452] Updated weights for policy 0, policy_version 66430 (0.0006) [2023-03-07 09:02:35,296][155452] Updated weights for policy 0, policy_version 66440 (0.0007) [2023-03-07 09:02:36,081][155452] Updated weights for policy 0, policy_version 66450 (0.0005) [2023-03-07 09:02:36,881][155452] Updated weights for policy 0, policy_version 66460 (0.0006) [2023-03-07 09:02:37,661][155452] Updated weights for policy 0, policy_version 66470 (0.0006) [2023-03-07 09:02:38,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13004.8, 300 sec: 13023.9). Total num frames: 68074496. Throughput: 0: 12991.2. Samples: 68054088. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:02:38,367][155126] Avg episode reward: [(0, '1993.648')] [2023-03-07 09:02:38,472][155452] Updated weights for policy 0, policy_version 66480 (0.0006) [2023-03-07 09:02:39,264][155452] Updated weights for policy 0, policy_version 66490 (0.0006) [2023-03-07 09:02:40,042][155452] Updated weights for policy 0, policy_version 66500 (0.0006) [2023-03-07 09:02:40,834][155452] Updated weights for policy 0, policy_version 66510 (0.0006) [2023-03-07 09:02:41,625][155452] Updated weights for policy 0, policy_version 66520 (0.0006) [2023-03-07 09:02:42,394][155452] Updated weights for policy 0, policy_version 66530 (0.0005) [2023-03-07 09:02:43,181][155452] Updated weights for policy 0, policy_version 66540 (0.0006) [2023-03-07 09:02:43,367][155126] Fps is (10 sec: 13004.9, 60 sec: 12987.8, 300 sec: 13023.9). Total num frames: 68139008. Throughput: 0: 12992.8. Samples: 68132084. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:02:43,367][155126] Avg episode reward: [(0, '2059.128')] [2023-03-07 09:02:43,965][155452] Updated weights for policy 0, policy_version 66550 (0.0006) [2023-03-07 09:02:44,742][155452] Updated weights for policy 0, policy_version 66560 (0.0006) [2023-03-07 09:02:45,532][155452] Updated weights for policy 0, policy_version 66570 (0.0006) [2023-03-07 09:02:46,315][155452] Updated weights for policy 0, policy_version 66580 (0.0006) [2023-03-07 09:02:47,108][155452] Updated weights for policy 0, policy_version 66590 (0.0006) [2023-03-07 09:02:47,897][155452] Updated weights for policy 0, policy_version 66600 (0.0006) [2023-03-07 09:02:48,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13004.8, 300 sec: 13023.9). Total num frames: 68204544. Throughput: 0: 12994.1. Samples: 68171211. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:02:48,367][155126] Avg episode reward: [(0, '1945.962')] [2023-03-07 09:02:48,672][155452] Updated weights for policy 0, policy_version 66610 (0.0007) [2023-03-07 09:02:49,452][155452] Updated weights for policy 0, policy_version 66620 (0.0006) [2023-03-07 09:02:50,231][155452] Updated weights for policy 0, policy_version 66630 (0.0006) [2023-03-07 09:02:51,006][155452] Updated weights for policy 0, policy_version 66640 (0.0007) [2023-03-07 09:02:51,801][155452] Updated weights for policy 0, policy_version 66650 (0.0006) [2023-03-07 09:02:52,587][155452] Updated weights for policy 0, policy_version 66660 (0.0006) [2023-03-07 09:02:53,367][155126] Fps is (10 sec: 13004.8, 60 sec: 12987.7, 300 sec: 13020.4). Total num frames: 68269056. Throughput: 0: 13005.4. Samples: 68249583. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:02:53,367][155126] Avg episode reward: [(0, '2051.791')] [2023-03-07 09:02:53,370][155452] Updated weights for policy 0, policy_version 66670 (0.0006) [2023-03-07 09:02:54,163][155452] Updated weights for policy 0, policy_version 66680 (0.0006) [2023-03-07 09:02:54,954][155452] Updated weights for policy 0, policy_version 66690 (0.0007) [2023-03-07 09:02:55,749][155452] Updated weights for policy 0, policy_version 66700 (0.0006) [2023-03-07 09:02:56,519][155452] Updated weights for policy 0, policy_version 66710 (0.0006) [2023-03-07 09:02:57,313][155452] Updated weights for policy 0, policy_version 66720 (0.0007) [2023-03-07 09:02:58,108][155452] Updated weights for policy 0, policy_version 66730 (0.0006) [2023-03-07 09:02:58,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13020.4). Total num frames: 68334592. Throughput: 0: 13007.8. Samples: 68327680. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:02:58,367][155126] Avg episode reward: [(0, '1926.867')] [2023-03-07 09:02:58,886][155452] Updated weights for policy 0, policy_version 66740 (0.0008) [2023-03-07 09:02:59,660][155452] Updated weights for policy 0, policy_version 66750 (0.0006) [2023-03-07 09:03:00,443][155452] Updated weights for policy 0, policy_version 66760 (0.0006) [2023-03-07 09:03:01,246][155452] Updated weights for policy 0, policy_version 66770 (0.0006) [2023-03-07 09:03:02,034][155452] Updated weights for policy 0, policy_version 66780 (0.0007) [2023-03-07 09:03:02,801][155452] Updated weights for policy 0, policy_version 66790 (0.0006) [2023-03-07 09:03:03,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13004.8, 300 sec: 13023.9). Total num frames: 68400128. Throughput: 0: 13011.2. Samples: 68366720. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:03:03,367][155126] Avg episode reward: [(0, '2200.192')] [2023-03-07 09:03:03,595][155452] Updated weights for policy 0, policy_version 66800 (0.0006) [2023-03-07 09:03:04,376][155452] Updated weights for policy 0, policy_version 66810 (0.0006) [2023-03-07 09:03:05,155][155452] Updated weights for policy 0, policy_version 66820 (0.0006) [2023-03-07 09:03:05,951][155452] Updated weights for policy 0, policy_version 66830 (0.0006) [2023-03-07 09:03:06,740][155452] Updated weights for policy 0, policy_version 66840 (0.0006) [2023-03-07 09:03:07,533][155452] Updated weights for policy 0, policy_version 66850 (0.0006) [2023-03-07 09:03:08,317][155452] Updated weights for policy 0, policy_version 66860 (0.0006) [2023-03-07 09:03:08,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13020.4). Total num frames: 68464640. Throughput: 0: 13010.8. Samples: 68444769. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:03:08,367][155126] Avg episode reward: [(0, '2216.622')] [2023-03-07 09:03:09,108][155452] Updated weights for policy 0, policy_version 66870 (0.0007) [2023-03-07 09:03:09,896][155452] Updated weights for policy 0, policy_version 66880 (0.0007) [2023-03-07 09:03:10,681][155452] Updated weights for policy 0, policy_version 66890 (0.0005) [2023-03-07 09:03:11,463][155452] Updated weights for policy 0, policy_version 66900 (0.0006) [2023-03-07 09:03:12,262][155452] Updated weights for policy 0, policy_version 66910 (0.0006) [2023-03-07 09:03:13,055][155452] Updated weights for policy 0, policy_version 66920 (0.0006) [2023-03-07 09:03:13,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 68530176. Throughput: 0: 13014.8. Samples: 68522840. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:03:13,378][155126] Avg episode reward: [(0, '1916.785')] [2023-03-07 09:03:13,821][155452] Updated weights for policy 0, policy_version 66930 (0.0006) [2023-03-07 09:03:14,609][155452] Updated weights for policy 0, policy_version 66940 (0.0007) [2023-03-07 09:03:15,406][155452] Updated weights for policy 0, policy_version 66950 (0.0006) [2023-03-07 09:03:16,199][155452] Updated weights for policy 0, policy_version 66960 (0.0006) [2023-03-07 09:03:16,980][155452] Updated weights for policy 0, policy_version 66970 (0.0006) [2023-03-07 09:03:17,792][155452] Updated weights for policy 0, policy_version 66980 (0.0007) [2023-03-07 09:03:18,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13004.8, 300 sec: 13020.4). Total num frames: 68594688. Throughput: 0: 13010.6. Samples: 68561852. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:03:18,368][155126] Avg episode reward: [(0, '2004.856')] [2023-03-07 09:03:18,554][155452] Updated weights for policy 0, policy_version 66990 (0.0006) [2023-03-07 09:03:19,353][155452] Updated weights for policy 0, policy_version 67000 (0.0006) [2023-03-07 09:03:20,147][155452] Updated weights for policy 0, policy_version 67010 (0.0007) [2023-03-07 09:03:20,930][155452] Updated weights for policy 0, policy_version 67020 (0.0006) [2023-03-07 09:03:21,713][155452] Updated weights for policy 0, policy_version 67030 (0.0006) [2023-03-07 09:03:22,502][155452] Updated weights for policy 0, policy_version 67040 (0.0007) [2023-03-07 09:03:23,296][155452] Updated weights for policy 0, policy_version 67050 (0.0007) [2023-03-07 09:03:23,367][155126] Fps is (10 sec: 12902.3, 60 sec: 13004.8, 300 sec: 13017.0). Total num frames: 68659200. Throughput: 0: 13016.6. Samples: 68639838. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:03:23,367][155126] Avg episode reward: [(0, '2153.377')] [2023-03-07 09:03:24,075][155452] Updated weights for policy 0, policy_version 67060 (0.0006) [2023-03-07 09:03:24,846][155452] Updated weights for policy 0, policy_version 67070 (0.0007) [2023-03-07 09:03:25,632][155452] Updated weights for policy 0, policy_version 67080 (0.0006) [2023-03-07 09:03:26,407][155452] Updated weights for policy 0, policy_version 67090 (0.0006) [2023-03-07 09:03:27,206][155452] Updated weights for policy 0, policy_version 67100 (0.0006) [2023-03-07 09:03:27,988][155452] Updated weights for policy 0, policy_version 67110 (0.0006) [2023-03-07 09:03:28,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13004.8, 300 sec: 13020.4). Total num frames: 68724736. Throughput: 0: 13023.3. Samples: 68718132. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:03:28,378][155126] Avg episode reward: [(0, '2096.833')] [2023-03-07 09:03:28,383][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000067115_68725760.pth... [2023-03-07 09:03:28,414][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000064063_65600512.pth [2023-03-07 09:03:28,760][155452] Updated weights for policy 0, policy_version 67120 (0.0006) [2023-03-07 09:03:29,556][155452] Updated weights for policy 0, policy_version 67130 (0.0007) [2023-03-07 09:03:30,339][155452] Updated weights for policy 0, policy_version 67140 (0.0006) [2023-03-07 09:03:31,133][155452] Updated weights for policy 0, policy_version 67150 (0.0007) [2023-03-07 09:03:31,915][155452] Updated weights for policy 0, policy_version 67160 (0.0006) [2023-03-07 09:03:32,702][155452] Updated weights for policy 0, policy_version 67170 (0.0006) [2023-03-07 09:03:33,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 68790272. Throughput: 0: 13023.2. Samples: 68757257. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:03:33,367][155126] Avg episode reward: [(0, '1993.074')] [2023-03-07 09:03:33,482][155452] Updated weights for policy 0, policy_version 67180 (0.0006) [2023-03-07 09:03:34,276][155452] Updated weights for policy 0, policy_version 67190 (0.0006) [2023-03-07 09:03:35,073][155452] Updated weights for policy 0, policy_version 67200 (0.0006) [2023-03-07 09:03:35,853][155452] Updated weights for policy 0, policy_version 67210 (0.0006) [2023-03-07 09:03:36,648][155452] Updated weights for policy 0, policy_version 67220 (0.0006) [2023-03-07 09:03:37,423][155452] Updated weights for policy 0, policy_version 67230 (0.0006) [2023-03-07 09:03:38,221][155452] Updated weights for policy 0, policy_version 67240 (0.0006) [2023-03-07 09:03:38,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13020.4). Total num frames: 68854784. Throughput: 0: 13015.7. Samples: 68835292. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:03:38,367][155126] Avg episode reward: [(0, '2024.879')] [2023-03-07 09:03:38,993][155452] Updated weights for policy 0, policy_version 67250 (0.0006) [2023-03-07 09:03:39,802][155452] Updated weights for policy 0, policy_version 67260 (0.0007) [2023-03-07 09:03:40,574][155452] Updated weights for policy 0, policy_version 67270 (0.0006) [2023-03-07 09:03:41,370][155452] Updated weights for policy 0, policy_version 67280 (0.0006) [2023-03-07 09:03:42,150][155452] Updated weights for policy 0, policy_version 67290 (0.0006) [2023-03-07 09:03:42,939][155452] Updated weights for policy 0, policy_version 67300 (0.0005) [2023-03-07 09:03:43,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 68920320. Throughput: 0: 13016.6. Samples: 68913429. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:03:43,367][155126] Avg episode reward: [(0, '2060.003')] [2023-03-07 09:03:43,719][155452] Updated weights for policy 0, policy_version 67310 (0.0006) [2023-03-07 09:03:44,506][155452] Updated weights for policy 0, policy_version 67320 (0.0007) [2023-03-07 09:03:45,292][155452] Updated weights for policy 0, policy_version 67330 (0.0007) [2023-03-07 09:03:46,073][155452] Updated weights for policy 0, policy_version 67340 (0.0005) [2023-03-07 09:03:46,881][155452] Updated weights for policy 0, policy_version 67350 (0.0006) [2023-03-07 09:03:47,666][155452] Updated weights for policy 0, policy_version 67360 (0.0005) [2023-03-07 09:03:48,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 68985856. Throughput: 0: 13015.4. Samples: 68952411. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:03:48,367][155126] Avg episode reward: [(0, '2118.131')] [2023-03-07 09:03:48,456][155452] Updated weights for policy 0, policy_version 67370 (0.0006) [2023-03-07 09:03:49,238][155452] Updated weights for policy 0, policy_version 67380 (0.0006) [2023-03-07 09:03:50,020][155452] Updated weights for policy 0, policy_version 67390 (0.0007) [2023-03-07 09:03:50,805][155452] Updated weights for policy 0, policy_version 67400 (0.0006) [2023-03-07 09:03:51,598][155452] Updated weights for policy 0, policy_version 67410 (0.0006) [2023-03-07 09:03:52,412][155452] Updated weights for policy 0, policy_version 67420 (0.0006) [2023-03-07 09:03:53,189][155452] Updated weights for policy 0, policy_version 67430 (0.0007) [2023-03-07 09:03:53,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 69050368. Throughput: 0: 13010.5. Samples: 69030243. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:03:53,367][155126] Avg episode reward: [(0, '1990.675')] [2023-03-07 09:03:53,965][155452] Updated weights for policy 0, policy_version 67440 (0.0007) [2023-03-07 09:03:54,741][155452] Updated weights for policy 0, policy_version 67450 (0.0006) [2023-03-07 09:03:55,524][155452] Updated weights for policy 0, policy_version 67460 (0.0006) [2023-03-07 09:03:56,316][155452] Updated weights for policy 0, policy_version 67470 (0.0006) [2023-03-07 09:03:57,082][155452] Updated weights for policy 0, policy_version 67480 (0.0006) [2023-03-07 09:03:57,863][155452] Updated weights for policy 0, policy_version 67490 (0.0006) [2023-03-07 09:03:58,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 69115904. Throughput: 0: 13018.9. Samples: 69108692. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:03:58,368][155126] Avg episode reward: [(0, '2075.594')] [2023-03-07 09:03:58,662][155452] Updated weights for policy 0, policy_version 67500 (0.0006) [2023-03-07 09:03:59,458][155452] Updated weights for policy 0, policy_version 67510 (0.0006) [2023-03-07 09:04:00,245][155452] Updated weights for policy 0, policy_version 67520 (0.0006) [2023-03-07 09:04:01,044][155452] Updated weights for policy 0, policy_version 67530 (0.0006) [2023-03-07 09:04:01,836][155452] Updated weights for policy 0, policy_version 67540 (0.0007) [2023-03-07 09:04:02,594][155452] Updated weights for policy 0, policy_version 67550 (0.0005) [2023-03-07 09:04:03,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13020.4). Total num frames: 69180416. Throughput: 0: 13017.6. Samples: 69147644. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:04:03,367][155126] Avg episode reward: [(0, '2234.303')] [2023-03-07 09:04:03,377][155452] Updated weights for policy 0, policy_version 67560 (0.0005) [2023-03-07 09:04:04,157][155452] Updated weights for policy 0, policy_version 67570 (0.0006) [2023-03-07 09:04:04,942][155452] Updated weights for policy 0, policy_version 67580 (0.0006) [2023-03-07 09:04:05,717][155452] Updated weights for policy 0, policy_version 67590 (0.0006) [2023-03-07 09:04:06,508][155452] Updated weights for policy 0, policy_version 67600 (0.0006) [2023-03-07 09:04:07,304][155452] Updated weights for policy 0, policy_version 67610 (0.0006) [2023-03-07 09:04:08,087][155452] Updated weights for policy 0, policy_version 67620 (0.0006) [2023-03-07 09:04:08,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.8, 300 sec: 13023.9). Total num frames: 69245952. Throughput: 0: 13024.1. Samples: 69225925. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:04:08,367][155126] Avg episode reward: [(0, '2139.400')] [2023-03-07 09:04:08,870][155452] Updated weights for policy 0, policy_version 67630 (0.0005) [2023-03-07 09:04:09,662][155452] Updated weights for policy 0, policy_version 67640 (0.0006) [2023-03-07 09:04:10,457][155452] Updated weights for policy 0, policy_version 67650 (0.0007) [2023-03-07 09:04:11,245][155452] Updated weights for policy 0, policy_version 67660 (0.0007) [2023-03-07 09:04:12,022][155452] Updated weights for policy 0, policy_version 67670 (0.0006) [2023-03-07 09:04:12,824][155452] Updated weights for policy 0, policy_version 67680 (0.0007) [2023-03-07 09:04:13,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13021.9, 300 sec: 13023.9). Total num frames: 69311488. Throughput: 0: 13022.3. Samples: 69304135. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:04:13,367][155126] Avg episode reward: [(0, '2032.151')] [2023-03-07 09:04:13,584][155452] Updated weights for policy 0, policy_version 67690 (0.0007) [2023-03-07 09:04:14,366][155452] Updated weights for policy 0, policy_version 67700 (0.0006) [2023-03-07 09:04:15,157][155452] Updated weights for policy 0, policy_version 67710 (0.0006) [2023-03-07 09:04:15,942][155452] Updated weights for policy 0, policy_version 67720 (0.0006) [2023-03-07 09:04:16,729][155452] Updated weights for policy 0, policy_version 67730 (0.0006) [2023-03-07 09:04:17,497][155452] Updated weights for policy 0, policy_version 67740 (0.0005) [2023-03-07 09:04:18,275][155452] Updated weights for policy 0, policy_version 67750 (0.0006) [2023-03-07 09:04:18,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13038.9, 300 sec: 13023.9). Total num frames: 69377024. Throughput: 0: 13022.9. Samples: 69343286. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:04:18,367][155126] Avg episode reward: [(0, '1965.783')] [2023-03-07 09:04:19,085][155452] Updated weights for policy 0, policy_version 67760 (0.0006) [2023-03-07 09:04:19,859][155452] Updated weights for policy 0, policy_version 67770 (0.0006) [2023-03-07 09:04:20,666][155452] Updated weights for policy 0, policy_version 67780 (0.0006) [2023-03-07 09:04:21,447][155452] Updated weights for policy 0, policy_version 67790 (0.0007) [2023-03-07 09:04:22,230][155452] Updated weights for policy 0, policy_version 67800 (0.0006) [2023-03-07 09:04:23,034][155452] Updated weights for policy 0, policy_version 67810 (0.0006) [2023-03-07 09:04:23,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13038.9, 300 sec: 13023.9). Total num frames: 69441536. Throughput: 0: 13023.9. Samples: 69421367. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:04:23,368][155126] Avg episode reward: [(0, '2121.384')] [2023-03-07 09:04:23,814][155452] Updated weights for policy 0, policy_version 67820 (0.0007) [2023-03-07 09:04:24,590][155452] Updated weights for policy 0, policy_version 67830 (0.0006) [2023-03-07 09:04:25,411][155452] Updated weights for policy 0, policy_version 67840 (0.0006) [2023-03-07 09:04:26,191][155452] Updated weights for policy 0, policy_version 67850 (0.0006) [2023-03-07 09:04:26,984][155452] Updated weights for policy 0, policy_version 67860 (0.0007) [2023-03-07 09:04:27,773][155452] Updated weights for policy 0, policy_version 67870 (0.0006) [2023-03-07 09:04:28,367][155126] Fps is (10 sec: 12902.3, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 69506048. Throughput: 0: 13018.6. Samples: 69499268. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:04:28,367][155126] Avg episode reward: [(0, '1965.784')] [2023-03-07 09:04:28,556][155452] Updated weights for policy 0, policy_version 67880 (0.0006) [2023-03-07 09:04:29,362][155452] Updated weights for policy 0, policy_version 67890 (0.0006) [2023-03-07 09:04:30,152][155452] Updated weights for policy 0, policy_version 67900 (0.0006) [2023-03-07 09:04:30,950][155452] Updated weights for policy 0, policy_version 67910 (0.0006) [2023-03-07 09:04:31,727][155452] Updated weights for policy 0, policy_version 67920 (0.0006) [2023-03-07 09:04:32,508][155452] Updated weights for policy 0, policy_version 67930 (0.0006) [2023-03-07 09:04:33,305][155452] Updated weights for policy 0, policy_version 67940 (0.0006) [2023-03-07 09:04:33,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 69571584. Throughput: 0: 13011.9. Samples: 69537947. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:04:33,367][155126] Avg episode reward: [(0, '2038.095')] [2023-03-07 09:04:34,089][155452] Updated weights for policy 0, policy_version 67950 (0.0006) [2023-03-07 09:04:34,878][155452] Updated weights for policy 0, policy_version 67960 (0.0005) [2023-03-07 09:04:35,653][155452] Updated weights for policy 0, policy_version 67970 (0.0005) [2023-03-07 09:04:36,442][155452] Updated weights for policy 0, policy_version 67980 (0.0006) [2023-03-07 09:04:37,223][155452] Updated weights for policy 0, policy_version 67990 (0.0006) [2023-03-07 09:04:38,010][155452] Updated weights for policy 0, policy_version 68000 (0.0006) [2023-03-07 09:04:38,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13016.9). Total num frames: 69636096. Throughput: 0: 13021.6. Samples: 69616218. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:04:38,368][155126] Avg episode reward: [(0, '2064.824')] [2023-03-07 09:04:38,791][155452] Updated weights for policy 0, policy_version 68010 (0.0007) [2023-03-07 09:04:39,578][155452] Updated weights for policy 0, policy_version 68020 (0.0006) [2023-03-07 09:04:40,386][155452] Updated weights for policy 0, policy_version 68030 (0.0006) [2023-03-07 09:04:41,170][155452] Updated weights for policy 0, policy_version 68040 (0.0006) [2023-03-07 09:04:41,952][155452] Updated weights for policy 0, policy_version 68050 (0.0007) [2023-03-07 09:04:42,764][155452] Updated weights for policy 0, policy_version 68060 (0.0006) [2023-03-07 09:04:43,367][155126] Fps is (10 sec: 12902.3, 60 sec: 13004.8, 300 sec: 13017.0). Total num frames: 69700608. Throughput: 0: 13003.1. Samples: 69693829. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:04:43,367][155126] Avg episode reward: [(0, '2173.326')] [2023-03-07 09:04:43,550][155452] Updated weights for policy 0, policy_version 68070 (0.0005) [2023-03-07 09:04:44,337][155452] Updated weights for policy 0, policy_version 68080 (0.0007) [2023-03-07 09:04:45,125][155452] Updated weights for policy 0, policy_version 68090 (0.0006) [2023-03-07 09:04:45,934][155452] Updated weights for policy 0, policy_version 68100 (0.0006) [2023-03-07 09:04:46,712][155452] Updated weights for policy 0, policy_version 68110 (0.0006) [2023-03-07 09:04:47,498][155452] Updated weights for policy 0, policy_version 68120 (0.0006) [2023-03-07 09:04:48,295][155452] Updated weights for policy 0, policy_version 68130 (0.0006) [2023-03-07 09:04:48,367][155126] Fps is (10 sec: 12902.6, 60 sec: 12987.7, 300 sec: 13013.5). Total num frames: 69765120. Throughput: 0: 13002.1. Samples: 69732738. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:04:48,367][155126] Avg episode reward: [(0, '2212.549')] [2023-03-07 09:04:49,074][155452] Updated weights for policy 0, policy_version 68140 (0.0007) [2023-03-07 09:04:49,860][155452] Updated weights for policy 0, policy_version 68150 (0.0006) [2023-03-07 09:04:50,657][155452] Updated weights for policy 0, policy_version 68160 (0.0006) [2023-03-07 09:04:51,439][155452] Updated weights for policy 0, policy_version 68170 (0.0006) [2023-03-07 09:04:52,214][155452] Updated weights for policy 0, policy_version 68180 (0.0007) [2023-03-07 09:04:53,006][155452] Updated weights for policy 0, policy_version 68190 (0.0005) [2023-03-07 09:04:53,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13013.5). Total num frames: 69830656. Throughput: 0: 12996.2. Samples: 69810753. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:04:53,367][155126] Avg episode reward: [(0, '2018.877')] [2023-03-07 09:04:53,776][155452] Updated weights for policy 0, policy_version 68200 (0.0005) [2023-03-07 09:04:54,560][155452] Updated weights for policy 0, policy_version 68210 (0.0007) [2023-03-07 09:04:55,357][155452] Updated weights for policy 0, policy_version 68220 (0.0006) [2023-03-07 09:04:56,138][155452] Updated weights for policy 0, policy_version 68230 (0.0006) [2023-03-07 09:04:56,930][155452] Updated weights for policy 0, policy_version 68240 (0.0007) [2023-03-07 09:04:57,740][155452] Updated weights for policy 0, policy_version 68250 (0.0006) [2023-03-07 09:04:58,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13004.8, 300 sec: 13013.5). Total num frames: 69896192. Throughput: 0: 12992.5. Samples: 69888799. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:04:58,367][155126] Avg episode reward: [(0, '2087.468')] [2023-03-07 09:04:58,525][155452] Updated weights for policy 0, policy_version 68260 (0.0006) [2023-03-07 09:04:59,316][155452] Updated weights for policy 0, policy_version 68270 (0.0006) [2023-03-07 09:05:00,124][155452] Updated weights for policy 0, policy_version 68280 (0.0006) [2023-03-07 09:05:00,917][155452] Updated weights for policy 0, policy_version 68290 (0.0006) [2023-03-07 09:05:01,696][155452] Updated weights for policy 0, policy_version 68300 (0.0006) [2023-03-07 09:05:02,478][155452] Updated weights for policy 0, policy_version 68310 (0.0007) [2023-03-07 09:05:03,259][155452] Updated weights for policy 0, policy_version 68320 (0.0006) [2023-03-07 09:05:03,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13010.0). Total num frames: 69960704. Throughput: 0: 12980.9. Samples: 69927428. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:05:03,367][155126] Avg episode reward: [(0, '2037.122')] [2023-03-07 09:05:04,051][155452] Updated weights for policy 0, policy_version 68330 (0.0006) [2023-03-07 09:05:04,834][155452] Updated weights for policy 0, policy_version 68340 (0.0006) [2023-03-07 09:05:05,630][155452] Updated weights for policy 0, policy_version 68350 (0.0006) [2023-03-07 09:05:06,413][155452] Updated weights for policy 0, policy_version 68360 (0.0007) [2023-03-07 09:05:07,190][155452] Updated weights for policy 0, policy_version 68370 (0.0007) [2023-03-07 09:05:08,004][155452] Updated weights for policy 0, policy_version 68380 (0.0008) [2023-03-07 09:05:08,367][155126] Fps is (10 sec: 12902.4, 60 sec: 12987.8, 300 sec: 13010.0). Total num frames: 70025216. Throughput: 0: 12982.8. Samples: 70005590. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:05:08,367][155126] Avg episode reward: [(0, '2086.353')] [2023-03-07 09:05:08,781][155452] Updated weights for policy 0, policy_version 68390 (0.0006) [2023-03-07 09:05:09,577][155452] Updated weights for policy 0, policy_version 68400 (0.0006) [2023-03-07 09:05:10,359][155452] Updated weights for policy 0, policy_version 68410 (0.0006) [2023-03-07 09:05:11,137][155452] Updated weights for policy 0, policy_version 68420 (0.0006) [2023-03-07 09:05:11,926][155452] Updated weights for policy 0, policy_version 68430 (0.0006) [2023-03-07 09:05:12,711][155452] Updated weights for policy 0, policy_version 68440 (0.0006) [2023-03-07 09:05:13,367][155126] Fps is (10 sec: 13004.9, 60 sec: 12987.7, 300 sec: 13010.0). Total num frames: 70090752. Throughput: 0: 12985.9. Samples: 70083631. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:05:13,367][155126] Avg episode reward: [(0, '1878.274')] [2023-03-07 09:05:13,497][155452] Updated weights for policy 0, policy_version 68450 (0.0007) [2023-03-07 09:05:14,275][155452] Updated weights for policy 0, policy_version 68460 (0.0006) [2023-03-07 09:05:15,055][155452] Updated weights for policy 0, policy_version 68470 (0.0007) [2023-03-07 09:05:15,857][155452] Updated weights for policy 0, policy_version 68480 (0.0007) [2023-03-07 09:05:16,630][155452] Updated weights for policy 0, policy_version 68490 (0.0006) [2023-03-07 09:05:17,436][155452] Updated weights for policy 0, policy_version 68500 (0.0006) [2023-03-07 09:05:18,232][155452] Updated weights for policy 0, policy_version 68510 (0.0006) [2023-03-07 09:05:18,367][155126] Fps is (10 sec: 13004.7, 60 sec: 12970.7, 300 sec: 13010.0). Total num frames: 70155264. Throughput: 0: 12994.3. Samples: 70122692. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:05:18,367][155126] Avg episode reward: [(0, '1934.573')] [2023-03-07 09:05:19,009][155452] Updated weights for policy 0, policy_version 68520 (0.0006) [2023-03-07 09:05:19,808][155452] Updated weights for policy 0, policy_version 68530 (0.0007) [2023-03-07 09:05:20,589][155452] Updated weights for policy 0, policy_version 68540 (0.0006) [2023-03-07 09:05:21,381][155452] Updated weights for policy 0, policy_version 68550 (0.0006) [2023-03-07 09:05:22,154][155452] Updated weights for policy 0, policy_version 68560 (0.0006) [2023-03-07 09:05:22,968][155452] Updated weights for policy 0, policy_version 68570 (0.0006) [2023-03-07 09:05:23,367][155126] Fps is (10 sec: 13004.6, 60 sec: 12987.7, 300 sec: 13010.0). Total num frames: 70220800. Throughput: 0: 12985.1. Samples: 70200546. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:05:23,368][155126] Avg episode reward: [(0, '2053.095')] [2023-03-07 09:05:23,736][155452] Updated weights for policy 0, policy_version 68580 (0.0006) [2023-03-07 09:05:24,521][155452] Updated weights for policy 0, policy_version 68590 (0.0005) [2023-03-07 09:05:25,308][155452] Updated weights for policy 0, policy_version 68600 (0.0007) [2023-03-07 09:05:26,077][155452] Updated weights for policy 0, policy_version 68610 (0.0005) [2023-03-07 09:05:26,865][155452] Updated weights for policy 0, policy_version 68620 (0.0006) [2023-03-07 09:05:27,651][155452] Updated weights for policy 0, policy_version 68630 (0.0007) [2023-03-07 09:05:28,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13004.8, 300 sec: 13010.0). Total num frames: 70286336. Throughput: 0: 13003.1. Samples: 70278968. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:05:28,368][155126] Avg episode reward: [(0, '2042.369')] [2023-03-07 09:05:28,382][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000068639_70286336.pth... 
[2023-03-07 09:05:28,411][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000065591_67165184.pth [2023-03-07 09:05:28,434][155452] Updated weights for policy 0, policy_version 68640 (0.0006) [2023-03-07 09:05:29,213][155452] Updated weights for policy 0, policy_version 68650 (0.0006) [2023-03-07 09:05:30,017][155452] Updated weights for policy 0, policy_version 68660 (0.0007) [2023-03-07 09:05:30,803][155452] Updated weights for policy 0, policy_version 68670 (0.0006) [2023-03-07 09:05:31,594][155452] Updated weights for policy 0, policy_version 68680 (0.0006) [2023-03-07 09:05:32,380][155452] Updated weights for policy 0, policy_version 68690 (0.0006) [2023-03-07 09:05:33,174][155452] Updated weights for policy 0, policy_version 68700 (0.0006) [2023-03-07 09:05:33,367][155126] Fps is (10 sec: 13004.9, 60 sec: 12987.7, 300 sec: 13010.0). Total num frames: 70350848. Throughput: 0: 13002.6. Samples: 70317857. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:05:33,367][155126] Avg episode reward: [(0, '2076.312')] [2023-03-07 09:05:33,983][155452] Updated weights for policy 0, policy_version 68710 (0.0006) [2023-03-07 09:05:34,753][155452] Updated weights for policy 0, policy_version 68720 (0.0007) [2023-03-07 09:05:35,544][155452] Updated weights for policy 0, policy_version 68730 (0.0006) [2023-03-07 09:05:36,337][155452] Updated weights for policy 0, policy_version 68740 (0.0006) [2023-03-07 09:05:37,104][155452] Updated weights for policy 0, policy_version 68750 (0.0006) [2023-03-07 09:05:37,891][155452] Updated weights for policy 0, policy_version 68760 (0.0006) [2023-03-07 09:05:38,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13004.8, 300 sec: 13010.0). Total num frames: 70416384. Throughput: 0: 13005.0. Samples: 70395980. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:05:38,368][155126] Avg episode reward: [(0, '1956.036')] [2023-03-07 09:05:38,674][155452] Updated weights for policy 0, policy_version 68770 (0.0005) [2023-03-07 09:05:39,478][155452] Updated weights for policy 0, policy_version 68780 (0.0007) [2023-03-07 09:05:40,252][155452] Updated weights for policy 0, policy_version 68790 (0.0006) [2023-03-07 09:05:41,026][155452] Updated weights for policy 0, policy_version 68800 (0.0006) [2023-03-07 09:05:41,810][155452] Updated weights for policy 0, policy_version 68810 (0.0006) [2023-03-07 09:05:42,604][155452] Updated weights for policy 0, policy_version 68820 (0.0007) [2023-03-07 09:05:43,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13004.8, 300 sec: 13010.0). Total num frames: 70480896. Throughput: 0: 13005.4. Samples: 70474040. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:05:43,367][155126] Avg episode reward: [(0, '1883.386')] [2023-03-07 09:05:43,399][155452] Updated weights for policy 0, policy_version 68830 (0.0006) [2023-03-07 09:05:44,173][155452] Updated weights for policy 0, policy_version 68840 (0.0006) [2023-03-07 09:05:44,970][155452] Updated weights for policy 0, policy_version 68850 (0.0006) [2023-03-07 09:05:45,757][155452] Updated weights for policy 0, policy_version 68860 (0.0006) [2023-03-07 09:05:46,552][155452] Updated weights for policy 0, policy_version 68870 (0.0007) [2023-03-07 09:05:47,323][155452] Updated weights for policy 0, policy_version 68880 (0.0006) [2023-03-07 09:05:48,103][155452] Updated weights for policy 0, policy_version 68890 (0.0006) [2023-03-07 09:05:48,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13010.0). Total num frames: 70546432. Throughput: 0: 13011.6. Samples: 70512951. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:05:48,367][155126] Avg episode reward: [(0, '2000.447')] [2023-03-07 09:05:48,880][155452] Updated weights for policy 0, policy_version 68900 (0.0006) [2023-03-07 09:05:49,685][155452] Updated weights for policy 0, policy_version 68910 (0.0006) [2023-03-07 09:05:50,451][155452] Updated weights for policy 0, policy_version 68920 (0.0006) [2023-03-07 09:05:51,245][155452] Updated weights for policy 0, policy_version 68930 (0.0006) [2023-03-07 09:05:52,057][155452] Updated weights for policy 0, policy_version 68940 (0.0006) [2023-03-07 09:05:52,827][155452] Updated weights for policy 0, policy_version 68950 (0.0006) [2023-03-07 09:05:53,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13006.5). Total num frames: 70610944. Throughput: 0: 13013.5. Samples: 70591197. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:05:53,367][155126] Avg episode reward: [(0, '1780.805')] [2023-03-07 09:05:53,620][155452] Updated weights for policy 0, policy_version 68960 (0.0006) [2023-03-07 09:05:54,421][155452] Updated weights for policy 0, policy_version 68970 (0.0006) [2023-03-07 09:05:55,205][155452] Updated weights for policy 0, policy_version 68980 (0.0006) [2023-03-07 09:05:55,993][155452] Updated weights for policy 0, policy_version 68990 (0.0007) [2023-03-07 09:05:56,761][155452] Updated weights for policy 0, policy_version 69000 (0.0007) [2023-03-07 09:05:57,547][155452] Updated weights for policy 0, policy_version 69010 (0.0007) [2023-03-07 09:05:58,315][155452] Updated weights for policy 0, policy_version 69020 (0.0006) [2023-03-07 09:05:58,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13004.8, 300 sec: 13006.5). Total num frames: 70676480. Throughput: 0: 13021.5. Samples: 70669600. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:05:58,378][155126] Avg episode reward: [(0, '1903.788')] [2023-03-07 09:05:59,107][155452] Updated weights for policy 0, policy_version 69030 (0.0006) [2023-03-07 09:05:59,886][155452] Updated weights for policy 0, policy_version 69040 (0.0006) [2023-03-07 09:06:00,667][155452] Updated weights for policy 0, policy_version 69050 (0.0006) [2023-03-07 09:06:01,484][155452] Updated weights for policy 0, policy_version 69060 (0.0007) [2023-03-07 09:06:02,251][155452] Updated weights for policy 0, policy_version 69070 (0.0005) [2023-03-07 09:06:03,035][155452] Updated weights for policy 0, policy_version 69080 (0.0006) [2023-03-07 09:06:03,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13021.9, 300 sec: 13006.5). Total num frames: 70742016. Throughput: 0: 13023.3. Samples: 70708739. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:06:03,367][155126] Avg episode reward: [(0, '1978.353')] [2023-03-07 09:06:03,839][155452] Updated weights for policy 0, policy_version 69090 (0.0006) [2023-03-07 09:06:04,617][155452] Updated weights for policy 0, policy_version 69100 (0.0006) [2023-03-07 09:06:05,388][155452] Updated weights for policy 0, policy_version 69110 (0.0006) [2023-03-07 09:06:06,181][155452] Updated weights for policy 0, policy_version 69120 (0.0006) [2023-03-07 09:06:06,969][155452] Updated weights for policy 0, policy_version 69130 (0.0006) [2023-03-07 09:06:07,761][155452] Updated weights for policy 0, policy_version 69140 (0.0006) [2023-03-07 09:06:08,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13006.5). Total num frames: 70806528. Throughput: 0: 13031.3. Samples: 70786955. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:06:08,367][155126] Avg episode reward: [(0, '2031.131')] [2023-03-07 09:06:08,549][155452] Updated weights for policy 0, policy_version 69150 (0.0006) [2023-03-07 09:06:09,353][155452] Updated weights for policy 0, policy_version 69160 (0.0007) [2023-03-07 09:06:10,136][155452] Updated weights for policy 0, policy_version 69170 (0.0006) [2023-03-07 09:06:10,923][155452] Updated weights for policy 0, policy_version 69180 (0.0007) [2023-03-07 09:06:11,728][155452] Updated weights for policy 0, policy_version 69190 (0.0006) [2023-03-07 09:06:12,518][155452] Updated weights for policy 0, policy_version 69200 (0.0006) [2023-03-07 09:06:13,290][155452] Updated weights for policy 0, policy_version 69210 (0.0006) [2023-03-07 09:06:13,367][155126] Fps is (10 sec: 12902.5, 60 sec: 13004.8, 300 sec: 13003.1). Total num frames: 70871040. Throughput: 0: 13010.9. Samples: 70864458. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:06:13,368][155126] Avg episode reward: [(0, '1866.225')] [2023-03-07 09:06:14,073][155452] Updated weights for policy 0, policy_version 69220 (0.0006) [2023-03-07 09:06:14,857][155452] Updated weights for policy 0, policy_version 69230 (0.0006) [2023-03-07 09:06:15,646][155452] Updated weights for policy 0, policy_version 69240 (0.0006) [2023-03-07 09:06:16,445][155452] Updated weights for policy 0, policy_version 69250 (0.0007) [2023-03-07 09:06:17,226][155452] Updated weights for policy 0, policy_version 69260 (0.0006) [2023-03-07 09:06:18,019][155452] Updated weights for policy 0, policy_version 69270 (0.0007) [2023-03-07 09:06:18,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13006.5). Total num frames: 70936576. Throughput: 0: 13017.5. Samples: 70903646. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:06:18,367][155126] Avg episode reward: [(0, '1900.915')] [2023-03-07 09:06:18,813][155452] Updated weights for policy 0, policy_version 69280 (0.0006) [2023-03-07 09:06:19,594][155452] Updated weights for policy 0, policy_version 69290 (0.0006) [2023-03-07 09:06:20,387][155452] Updated weights for policy 0, policy_version 69300 (0.0006) [2023-03-07 09:06:21,179][155452] Updated weights for policy 0, policy_version 69310 (0.0006) [2023-03-07 09:06:21,966][155452] Updated weights for policy 0, policy_version 69320 (0.0006) [2023-03-07 09:06:22,746][155452] Updated weights for policy 0, policy_version 69330 (0.0006) [2023-03-07 09:06:23,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13006.5). Total num frames: 71001088. Throughput: 0: 13009.3. Samples: 70981396. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:06:23,367][155126] Avg episode reward: [(0, '1823.030')] [2023-03-07 09:06:23,539][155452] Updated weights for policy 0, policy_version 69340 (0.0007) [2023-03-07 09:06:24,327][155452] Updated weights for policy 0, policy_version 69350 (0.0007) [2023-03-07 09:06:25,127][155452] Updated weights for policy 0, policy_version 69360 (0.0006) [2023-03-07 09:06:25,922][155452] Updated weights for policy 0, policy_version 69370 (0.0006) [2023-03-07 09:06:26,714][155452] Updated weights for policy 0, policy_version 69380 (0.0007) [2023-03-07 09:06:27,486][155452] Updated weights for policy 0, policy_version 69390 (0.0007) [2023-03-07 09:06:28,281][155452] Updated weights for policy 0, policy_version 69400 (0.0006) [2023-03-07 09:06:28,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13004.8, 300 sec: 13006.5). Total num frames: 71066624. Throughput: 0: 13007.1. Samples: 71059358. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:06:28,367][155126] Avg episode reward: [(0, '1974.524')] [2023-03-07 09:06:29,072][155452] Updated weights for policy 0, policy_version 69410 (0.0006) [2023-03-07 09:06:29,874][155452] Updated weights for policy 0, policy_version 69420 (0.0006) [2023-03-07 09:06:30,663][155452] Updated weights for policy 0, policy_version 69430 (0.0006) [2023-03-07 09:06:31,434][155452] Updated weights for policy 0, policy_version 69440 (0.0006) [2023-03-07 09:06:32,212][155452] Updated weights for policy 0, policy_version 69450 (0.0006) [2023-03-07 09:06:33,004][155452] Updated weights for policy 0, policy_version 69460 (0.0006) [2023-03-07 09:06:33,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13004.8, 300 sec: 13006.5). Total num frames: 71131136. Throughput: 0: 13005.5. Samples: 71098200. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:06:33,368][155126] Avg episode reward: [(0, '1706.831')] [2023-03-07 09:06:33,801][155452] Updated weights for policy 0, policy_version 69470 (0.0006) [2023-03-07 09:06:34,578][155452] Updated weights for policy 0, policy_version 69480 (0.0006) [2023-03-07 09:06:35,352][155452] Updated weights for policy 0, policy_version 69490 (0.0006) [2023-03-07 09:06:36,141][155452] Updated weights for policy 0, policy_version 69500 (0.0006) [2023-03-07 09:06:36,923][155452] Updated weights for policy 0, policy_version 69510 (0.0006) [2023-03-07 09:06:37,710][155452] Updated weights for policy 0, policy_version 69520 (0.0007) [2023-03-07 09:06:38,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13004.8, 300 sec: 13006.5). Total num frames: 71196672. Throughput: 0: 13006.4. Samples: 71176484. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:06:38,367][155126] Avg episode reward: [(0, '1844.698')] [2023-03-07 09:06:38,497][155452] Updated weights for policy 0, policy_version 69530 (0.0006) [2023-03-07 09:06:39,290][155452] Updated weights for policy 0, policy_version 69540 (0.0006) [2023-03-07 09:06:40,061][155452] Updated weights for policy 0, policy_version 69550 (0.0006) [2023-03-07 09:06:40,853][155452] Updated weights for policy 0, policy_version 69560 (0.0007) [2023-03-07 09:06:41,648][155452] Updated weights for policy 0, policy_version 69570 (0.0005) [2023-03-07 09:06:42,439][155452] Updated weights for policy 0, policy_version 69580 (0.0006) [2023-03-07 09:06:43,214][155452] Updated weights for policy 0, policy_version 69590 (0.0006) [2023-03-07 09:06:43,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13004.8, 300 sec: 13006.5). Total num frames: 71261184. Throughput: 0: 12999.6. Samples: 71254581. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:06:43,368][155126] Avg episode reward: [(0, '1824.378')] [2023-03-07 09:06:44,028][155452] Updated weights for policy 0, policy_version 69600 (0.0007) [2023-03-07 09:06:44,810][155452] Updated weights for policy 0, policy_version 69610 (0.0005) [2023-03-07 09:06:45,605][155452] Updated weights for policy 0, policy_version 69620 (0.0006) [2023-03-07 09:06:46,401][155452] Updated weights for policy 0, policy_version 69630 (0.0006) [2023-03-07 09:06:47,160][155452] Updated weights for policy 0, policy_version 69640 (0.0007) [2023-03-07 09:06:47,976][155452] Updated weights for policy 0, policy_version 69650 (0.0007) [2023-03-07 09:06:48,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13006.5). Total num frames: 71326720. Throughput: 0: 12994.4. Samples: 71293485. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:06:48,367][155126] Avg episode reward: [(0, '1800.923')] [2023-03-07 09:06:48,751][155452] Updated weights for policy 0, policy_version 69660 (0.0006) [2023-03-07 09:06:49,542][155452] Updated weights for policy 0, policy_version 69670 (0.0006) [2023-03-07 09:06:50,336][155452] Updated weights for policy 0, policy_version 69680 (0.0006) [2023-03-07 09:06:51,110][155452] Updated weights for policy 0, policy_version 69690 (0.0006) [2023-03-07 09:06:51,884][155452] Updated weights for policy 0, policy_version 69700 (0.0006) [2023-03-07 09:06:52,657][155452] Updated weights for policy 0, policy_version 69710 (0.0007) [2023-03-07 09:06:53,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13021.9, 300 sec: 13010.0). Total num frames: 71392256. Throughput: 0: 12991.9. Samples: 71371591. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:06:53,367][155126] Avg episode reward: [(0, '1913.649')] [2023-03-07 09:06:53,447][155452] Updated weights for policy 0, policy_version 69720 (0.0006) [2023-03-07 09:06:54,217][155452] Updated weights for policy 0, policy_version 69730 (0.0006) [2023-03-07 09:06:54,980][155452] Updated weights for policy 0, policy_version 69740 (0.0006) [2023-03-07 09:06:55,762][155452] Updated weights for policy 0, policy_version 69750 (0.0007) [2023-03-07 09:06:56,549][155452] Updated weights for policy 0, policy_version 69760 (0.0007) [2023-03-07 09:06:57,339][155452] Updated weights for policy 0, policy_version 69770 (0.0006) [2023-03-07 09:06:58,141][155452] Updated weights for policy 0, policy_version 69780 (0.0006) [2023-03-07 09:06:58,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13021.9, 300 sec: 13010.0). Total num frames: 71457792. Throughput: 0: 13019.7. Samples: 71450346. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:06:58,367][155126] Avg episode reward: [(0, '1969.057')] [2023-03-07 09:06:58,909][155452] Updated weights for policy 0, policy_version 69790 (0.0006) [2023-03-07 09:06:59,718][155452] Updated weights for policy 0, policy_version 69800 (0.0007) [2023-03-07 09:07:00,498][155452] Updated weights for policy 0, policy_version 69810 (0.0006) [2023-03-07 09:07:01,285][155452] Updated weights for policy 0, policy_version 69820 (0.0006) [2023-03-07 09:07:02,060][155452] Updated weights for policy 0, policy_version 69830 (0.0006) [2023-03-07 09:07:02,862][155452] Updated weights for policy 0, policy_version 69840 (0.0006) [2023-03-07 09:07:03,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13010.0). Total num frames: 71522304. Throughput: 0: 13015.0. Samples: 71489323. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:07:03,367][155126] Avg episode reward: [(0, '1951.811')] [2023-03-07 09:07:03,649][155452] Updated weights for policy 0, policy_version 69850 (0.0007) [2023-03-07 09:07:04,421][155452] Updated weights for policy 0, policy_version 69860 (0.0007) [2023-03-07 09:07:05,210][155452] Updated weights for policy 0, policy_version 69870 (0.0006) [2023-03-07 09:07:06,003][155452] Updated weights for policy 0, policy_version 69880 (0.0007) [2023-03-07 09:07:06,803][155452] Updated weights for policy 0, policy_version 69890 (0.0006) [2023-03-07 09:07:07,580][155452] Updated weights for policy 0, policy_version 69900 (0.0007) [2023-03-07 09:07:08,360][155452] Updated weights for policy 0, policy_version 69910 (0.0006) [2023-03-07 09:07:08,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13013.5). Total num frames: 71587840. Throughput: 0: 13021.5. Samples: 71567363. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:07:08,367][155126] Avg episode reward: [(0, '2011.906')] [2023-03-07 09:07:09,138][155452] Updated weights for policy 0, policy_version 69920 (0.0007) [2023-03-07 09:07:09,945][155452] Updated weights for policy 0, policy_version 69930 (0.0006) [2023-03-07 09:07:10,733][155452] Updated weights for policy 0, policy_version 69940 (0.0007) [2023-03-07 09:07:11,503][155452] Updated weights for policy 0, policy_version 69950 (0.0007) [2023-03-07 09:07:12,303][155452] Updated weights for policy 0, policy_version 69960 (0.0006) [2023-03-07 09:07:13,085][155452] Updated weights for policy 0, policy_version 69970 (0.0005) [2023-03-07 09:07:13,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13010.0). Total num frames: 71652352. Throughput: 0: 13023.7. Samples: 71645425. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:07:13,367][155126] Avg episode reward: [(0, '2009.515')] [2023-03-07 09:07:13,857][155452] Updated weights for policy 0, policy_version 69980 (0.0006) [2023-03-07 09:07:14,656][155452] Updated weights for policy 0, policy_version 69990 (0.0006) [2023-03-07 09:07:15,462][155452] Updated weights for policy 0, policy_version 70000 (0.0006) [2023-03-07 09:07:16,232][155452] Updated weights for policy 0, policy_version 70010 (0.0006) [2023-03-07 09:07:17,018][155452] Updated weights for policy 0, policy_version 70020 (0.0006) [2023-03-07 09:07:17,792][155452] Updated weights for policy 0, policy_version 70030 (0.0006) [2023-03-07 09:07:18,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13013.5). Total num frames: 71717888. Throughput: 0: 13026.2. Samples: 71684376. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:07:18,367][155126] Avg episode reward: [(0, '1974.045')] [2023-03-07 09:07:18,592][155452] Updated weights for policy 0, policy_version 70040 (0.0007) [2023-03-07 09:07:19,369][155452] Updated weights for policy 0, policy_version 70050 (0.0007) [2023-03-07 09:07:20,157][155452] Updated weights for policy 0, policy_version 70060 (0.0006) [2023-03-07 09:07:20,941][155452] Updated weights for policy 0, policy_version 70070 (0.0007) [2023-03-07 09:07:21,707][155452] Updated weights for policy 0, policy_version 70080 (0.0007) [2023-03-07 09:07:22,501][155452] Updated weights for policy 0, policy_version 70090 (0.0006) [2023-03-07 09:07:23,286][155452] Updated weights for policy 0, policy_version 70100 (0.0006) [2023-03-07 09:07:23,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13013.5). Total num frames: 71783424. Throughput: 0: 13033.7. Samples: 71763000. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:07:23,367][155126] Avg episode reward: [(0, '1976.118')] [2023-03-07 09:07:24,082][155452] Updated weights for policy 0, policy_version 70110 (0.0006) [2023-03-07 09:07:24,862][155452] Updated weights for policy 0, policy_version 70120 (0.0006) [2023-03-07 09:07:25,645][155452] Updated weights for policy 0, policy_version 70130 (0.0006) [2023-03-07 09:07:26,429][155452] Updated weights for policy 0, policy_version 70140 (0.0006) [2023-03-07 09:07:27,212][155452] Updated weights for policy 0, policy_version 70150 (0.0006) [2023-03-07 09:07:27,988][155452] Updated weights for policy 0, policy_version 70160 (0.0006) [2023-03-07 09:07:28,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.8, 300 sec: 13013.5). Total num frames: 71847936. Throughput: 0: 13037.5. Samples: 71841267. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:07:28,367][155126] Avg episode reward: [(0, '1778.288')] [2023-03-07 09:07:28,383][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000070165_71848960.pth... [2023-03-07 09:07:28,412][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000067115_68725760.pth [2023-03-07 09:07:28,771][155452] Updated weights for policy 0, policy_version 70170 (0.0006) [2023-03-07 09:07:29,550][155452] Updated weights for policy 0, policy_version 70180 (0.0005) [2023-03-07 09:07:30,325][155452] Updated weights for policy 0, policy_version 70190 (0.0006) [2023-03-07 09:07:31,103][155452] Updated weights for policy 0, policy_version 70200 (0.0006) [2023-03-07 09:07:31,891][155452] Updated weights for policy 0, policy_version 70210 (0.0007) [2023-03-07 09:07:32,685][155452] Updated weights for policy 0, policy_version 70220 (0.0007) [2023-03-07 09:07:33,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13038.9, 300 sec: 13013.5). Total num frames: 71913472. Throughput: 0: 13048.4. Samples: 71880664. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 09:07:33,368][155126] Avg episode reward: [(0, '1632.381')] [2023-03-07 09:07:33,469][155452] Updated weights for policy 0, policy_version 70230 (0.0006) [2023-03-07 09:07:34,270][155452] Updated weights for policy 0, policy_version 70240 (0.0006) [2023-03-07 09:07:35,052][155452] Updated weights for policy 0, policy_version 70250 (0.0006) [2023-03-07 09:07:35,852][155452] Updated weights for policy 0, policy_version 70260 (0.0006) [2023-03-07 09:07:36,619][155452] Updated weights for policy 0, policy_version 70270 (0.0006) [2023-03-07 09:07:37,417][155452] Updated weights for policy 0, policy_version 70280 (0.0005) [2023-03-07 09:07:38,180][155452] Updated weights for policy 0, policy_version 70290 (0.0007) [2023-03-07 09:07:38,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13016.9). Total num frames: 71979008. Throughput: 0: 13046.7. Samples: 71958694. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 09:07:38,367][155126] Avg episode reward: [(0, '1674.982')] [2023-03-07 09:07:38,987][155452] Updated weights for policy 0, policy_version 70300 (0.0006) [2023-03-07 09:07:39,766][155452] Updated weights for policy 0, policy_version 70310 (0.0005) [2023-03-07 09:07:40,547][155452] Updated weights for policy 0, policy_version 70320 (0.0006) [2023-03-07 09:07:41,344][155452] Updated weights for policy 0, policy_version 70330 (0.0006) [2023-03-07 09:07:42,133][155452] Updated weights for policy 0, policy_version 70340 (0.0006) [2023-03-07 09:07:42,925][155452] Updated weights for policy 0, policy_version 70350 (0.0007) [2023-03-07 09:07:43,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13013.5). Total num frames: 72043520. Throughput: 0: 13031.5. Samples: 72036763. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 09:07:43,367][155126] Avg episode reward: [(0, '1694.997')] [2023-03-07 09:07:43,708][155452] Updated weights for policy 0, policy_version 70360 (0.0006) [2023-03-07 09:07:44,487][155452] Updated weights for policy 0, policy_version 70370 (0.0006) [2023-03-07 09:07:45,290][155452] Updated weights for policy 0, policy_version 70380 (0.0006) [2023-03-07 09:07:46,056][155452] Updated weights for policy 0, policy_version 70390 (0.0006) [2023-03-07 09:07:46,848][155452] Updated weights for policy 0, policy_version 70400 (0.0007) [2023-03-07 09:07:47,630][155452] Updated weights for policy 0, policy_version 70410 (0.0006) [2023-03-07 09:07:48,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13016.9). Total num frames: 72109056. Throughput: 0: 13035.6. Samples: 72075927. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 09:07:48,367][155126] Avg episode reward: [(0, '1599.972')] [2023-03-07 09:07:48,410][155452] Updated weights for policy 0, policy_version 70420 (0.0006) [2023-03-07 09:07:49,203][155452] Updated weights for policy 0, policy_version 70430 (0.0006) [2023-03-07 09:07:49,986][155452] Updated weights for policy 0, policy_version 70440 (0.0006) [2023-03-07 09:07:50,781][155452] Updated weights for policy 0, policy_version 70450 (0.0007) [2023-03-07 09:07:51,576][155452] Updated weights for policy 0, policy_version 70460 (0.0005) [2023-03-07 09:07:52,372][155452] Updated weights for policy 0, policy_version 70470 (0.0006) [2023-03-07 09:07:53,149][155452] Updated weights for policy 0, policy_version 70480 (0.0006) [2023-03-07 09:07:53,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13013.5). Total num frames: 72173568. Throughput: 0: 13034.0. Samples: 72153892. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 09:07:53,368][155126] Avg episode reward: [(0, '1705.022')] [2023-03-07 09:07:53,939][155452] Updated weights for policy 0, policy_version 70490 (0.0006) [2023-03-07 09:07:54,731][155452] Updated weights for policy 0, policy_version 70500 (0.0007) [2023-03-07 09:07:55,535][155452] Updated weights for policy 0, policy_version 70510 (0.0006) [2023-03-07 09:07:56,315][155452] Updated weights for policy 0, policy_version 70520 (0.0006) [2023-03-07 09:07:57,093][155452] Updated weights for policy 0, policy_version 70530 (0.0006) [2023-03-07 09:07:57,896][155452] Updated weights for policy 0, policy_version 70540 (0.0006) [2023-03-07 09:07:58,367][155126] Fps is (10 sec: 12902.5, 60 sec: 13004.8, 300 sec: 13010.0). Total num frames: 72238080. Throughput: 0: 13029.6. Samples: 72231754. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 09:07:58,367][155126] Avg episode reward: [(0, '1799.105')] [2023-03-07 09:07:58,685][155452] Updated weights for policy 0, policy_version 70550 (0.0006) [2023-03-07 09:07:59,478][155452] Updated weights for policy 0, policy_version 70560 (0.0006) [2023-03-07 09:08:00,256][155452] Updated weights for policy 0, policy_version 70570 (0.0006) [2023-03-07 09:08:01,043][155452] Updated weights for policy 0, policy_version 70580 (0.0006) [2023-03-07 09:08:01,838][155452] Updated weights for policy 0, policy_version 70590 (0.0006) [2023-03-07 09:08:02,619][155452] Updated weights for policy 0, policy_version 70600 (0.0007) [2023-03-07 09:08:03,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13013.5). Total num frames: 72303616. Throughput: 0: 13029.8. Samples: 72270719. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 09:08:03,367][155126] Avg episode reward: [(0, '1863.849')] [2023-03-07 09:08:03,394][155452] Updated weights for policy 0, policy_version 70610 (0.0007) [2023-03-07 09:08:04,182][155452] Updated weights for policy 0, policy_version 70620 (0.0006) [2023-03-07 09:08:04,976][155452] Updated weights for policy 0, policy_version 70630 (0.0006) [2023-03-07 09:08:05,744][155452] Updated weights for policy 0, policy_version 70640 (0.0006) [2023-03-07 09:08:06,518][155452] Updated weights for policy 0, policy_version 70650 (0.0006) [2023-03-07 09:08:07,313][155452] Updated weights for policy 0, policy_version 70660 (0.0007) [2023-03-07 09:08:08,090][155452] Updated weights for policy 0, policy_version 70670 (0.0006) [2023-03-07 09:08:08,367][155126] Fps is (10 sec: 13106.9, 60 sec: 13021.8, 300 sec: 13013.5). Total num frames: 72369152. Throughput: 0: 13021.8. Samples: 72348985. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 09:08:08,368][155126] Avg episode reward: [(0, '1753.025')] [2023-03-07 09:08:08,891][155452] Updated weights for policy 0, policy_version 70680 (0.0007) [2023-03-07 09:08:09,678][155452] Updated weights for policy 0, policy_version 70690 (0.0006) [2023-03-07 09:08:10,462][155452] Updated weights for policy 0, policy_version 70700 (0.0007) [2023-03-07 09:08:11,226][155452] Updated weights for policy 0, policy_version 70710 (0.0006) [2023-03-07 09:08:12,020][155452] Updated weights for policy 0, policy_version 70720 (0.0007) [2023-03-07 09:08:12,814][155452] Updated weights for policy 0, policy_version 70730 (0.0006) [2023-03-07 09:08:13,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13016.9). Total num frames: 72434688. Throughput: 0: 13023.8. Samples: 72427341. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 09:08:13,368][155126] Avg episode reward: [(0, '1690.696')] [2023-03-07 09:08:13,621][155452] Updated weights for policy 0, policy_version 70740 (0.0006) [2023-03-07 09:08:14,406][155452] Updated weights for policy 0, policy_version 70750 (0.0005) [2023-03-07 09:08:15,182][155452] Updated weights for policy 0, policy_version 70760 (0.0006) [2023-03-07 09:08:15,972][155452] Updated weights for policy 0, policy_version 70770 (0.0007) [2023-03-07 09:08:16,765][155452] Updated weights for policy 0, policy_version 70780 (0.0006) [2023-03-07 09:08:17,552][155452] Updated weights for policy 0, policy_version 70790 (0.0006) [2023-03-07 09:08:18,338][155452] Updated weights for policy 0, policy_version 70800 (0.0006) [2023-03-07 09:08:18,367][155126] Fps is (10 sec: 13005.1, 60 sec: 13021.9, 300 sec: 13017.0). Total num frames: 72499200. Throughput: 0: 13013.5. Samples: 72466271. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:08:18,367][155126] Avg episode reward: [(0, '1708.663')] [2023-03-07 09:08:19,124][155452] Updated weights for policy 0, policy_version 70810 (0.0006) [2023-03-07 09:08:19,894][155452] Updated weights for policy 0, policy_version 70820 (0.0006) [2023-03-07 09:08:20,701][155452] Updated weights for policy 0, policy_version 70830 (0.0006) [2023-03-07 09:08:21,489][155452] Updated weights for policy 0, policy_version 70840 (0.0006) [2023-03-07 09:08:22,257][155452] Updated weights for policy 0, policy_version 70850 (0.0006) [2023-03-07 09:08:23,043][155452] Updated weights for policy 0, policy_version 70860 (0.0006) [2023-03-07 09:08:23,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13021.9, 300 sec: 13017.0). Total num frames: 72564736. Throughput: 0: 13012.2. Samples: 72544243. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:08:23,367][155126] Avg episode reward: [(0, '1770.466')] [2023-03-07 09:08:23,834][155452] Updated weights for policy 0, policy_version 70870 (0.0007) [2023-03-07 09:08:24,626][155452] Updated weights for policy 0, policy_version 70880 (0.0006) [2023-03-07 09:08:25,392][155452] Updated weights for policy 0, policy_version 70890 (0.0006) [2023-03-07 09:08:26,179][155452] Updated weights for policy 0, policy_version 70900 (0.0006) [2023-03-07 09:08:26,952][155452] Updated weights for policy 0, policy_version 70910 (0.0006) [2023-03-07 09:08:27,746][155452] Updated weights for policy 0, policy_version 70920 (0.0007) [2023-03-07 09:08:28,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13013.5). Total num frames: 72629248. Throughput: 0: 13018.3. Samples: 72622585. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:08:28,367][155126] Avg episode reward: [(0, '2006.965')] [2023-03-07 09:08:28,541][155452] Updated weights for policy 0, policy_version 70930 (0.0006) [2023-03-07 09:08:29,332][155452] Updated weights for policy 0, policy_version 70940 (0.0006) [2023-03-07 09:08:30,107][155452] Updated weights for policy 0, policy_version 70950 (0.0007) [2023-03-07 09:08:30,905][155452] Updated weights for policy 0, policy_version 70960 (0.0006) [2023-03-07 09:08:31,685][155452] Updated weights for policy 0, policy_version 70970 (0.0006) [2023-03-07 09:08:32,474][155452] Updated weights for policy 0, policy_version 70980 (0.0006) [2023-03-07 09:08:33,259][155452] Updated weights for policy 0, policy_version 70990 (0.0006) [2023-03-07 09:08:33,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13017.0). Total num frames: 72694784. Throughput: 0: 13014.7. Samples: 72661588. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:08:33,367][155126] Avg episode reward: [(0, '1898.599')] [2023-03-07 09:08:34,046][155452] Updated weights for policy 0, policy_version 71000 (0.0006) [2023-03-07 09:08:34,840][155452] Updated weights for policy 0, policy_version 71010 (0.0006) [2023-03-07 09:08:35,621][155452] Updated weights for policy 0, policy_version 71020 (0.0006) [2023-03-07 09:08:36,402][155452] Updated weights for policy 0, policy_version 71030 (0.0006) [2023-03-07 09:08:37,198][155452] Updated weights for policy 0, policy_version 71040 (0.0006) [2023-03-07 09:08:37,984][155452] Updated weights for policy 0, policy_version 71050 (0.0006) [2023-03-07 09:08:38,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13004.8, 300 sec: 13013.5). Total num frames: 72759296. Throughput: 0: 13017.4. Samples: 72739677. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:08:38,367][155126] Avg episode reward: [(0, '1969.066')] [2023-03-07 09:08:38,763][155452] Updated weights for policy 0, policy_version 71060 (0.0008) [2023-03-07 09:08:39,549][155452] Updated weights for policy 0, policy_version 71070 (0.0006) [2023-03-07 09:08:40,325][155452] Updated weights for policy 0, policy_version 71080 (0.0006) [2023-03-07 09:08:41,130][155452] Updated weights for policy 0, policy_version 71090 (0.0006) [2023-03-07 09:08:41,918][155452] Updated weights for policy 0, policy_version 71100 (0.0007) [2023-03-07 09:08:42,718][155452] Updated weights for policy 0, policy_version 71110 (0.0007) [2023-03-07 09:08:43,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13013.5). Total num frames: 72824832. Throughput: 0: 13022.0. Samples: 72817746. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:08:43,367][155126] Avg episode reward: [(0, '2053.229')] [2023-03-07 09:08:43,503][155452] Updated weights for policy 0, policy_version 71120 (0.0006) [2023-03-07 09:08:44,278][155452] Updated weights for policy 0, policy_version 71130 (0.0006) [2023-03-07 09:08:45,065][155452] Updated weights for policy 0, policy_version 71140 (0.0006) [2023-03-07 09:08:45,859][155452] Updated weights for policy 0, policy_version 71150 (0.0007) [2023-03-07 09:08:46,667][155452] Updated weights for policy 0, policy_version 71160 (0.0006) [2023-03-07 09:08:47,453][155452] Updated weights for policy 0, policy_version 71170 (0.0006) [2023-03-07 09:08:48,222][155452] Updated weights for policy 0, policy_version 71180 (0.0006) [2023-03-07 09:08:48,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13004.8, 300 sec: 13013.5). Total num frames: 72889344. Throughput: 0: 13024.5. Samples: 72856822. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:08:48,367][155126] Avg episode reward: [(0, '1920.909')] [2023-03-07 09:08:49,003][155452] Updated weights for policy 0, policy_version 71190 (0.0005) [2023-03-07 09:08:49,800][155452] Updated weights for policy 0, policy_version 71200 (0.0006) [2023-03-07 09:08:50,578][155452] Updated weights for policy 0, policy_version 71210 (0.0006) [2023-03-07 09:08:51,362][155452] Updated weights for policy 0, policy_version 71220 (0.0006) [2023-03-07 09:08:52,154][155452] Updated weights for policy 0, policy_version 71230 (0.0007) [2023-03-07 09:08:52,930][155452] Updated weights for policy 0, policy_version 71240 (0.0005) [2023-03-07 09:08:53,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13021.9, 300 sec: 13013.5). Total num frames: 72954880. Throughput: 0: 13020.0. Samples: 72934885. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:08:53,368][155126] Avg episode reward: [(0, '1964.971')] [2023-03-07 09:08:53,716][155452] Updated weights for policy 0, policy_version 71250 (0.0007) [2023-03-07 09:08:54,520][155452] Updated weights for policy 0, policy_version 71260 (0.0007) [2023-03-07 09:08:55,304][155452] Updated weights for policy 0, policy_version 71270 (0.0007) [2023-03-07 09:08:56,101][155452] Updated weights for policy 0, policy_version 71280 (0.0007) [2023-03-07 09:08:56,900][155452] Updated weights for policy 0, policy_version 71290 (0.0006) [2023-03-07 09:08:57,697][155452] Updated weights for policy 0, policy_version 71300 (0.0006) [2023-03-07 09:08:58,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.8, 300 sec: 13013.5). Total num frames: 73019392. Throughput: 0: 13006.0. Samples: 73012609. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:08:58,378][155126] Avg episode reward: [(0, '2090.339')] [2023-03-07 09:08:58,496][155452] Updated weights for policy 0, policy_version 71310 (0.0008) [2023-03-07 09:08:59,271][155452] Updated weights for policy 0, policy_version 71320 (0.0006) [2023-03-07 09:09:00,048][155452] Updated weights for policy 0, policy_version 71330 (0.0006) [2023-03-07 09:09:00,833][155452] Updated weights for policy 0, policy_version 71340 (0.0006) [2023-03-07 09:09:01,623][155452] Updated weights for policy 0, policy_version 71350 (0.0006) [2023-03-07 09:09:02,406][155452] Updated weights for policy 0, policy_version 71360 (0.0005) [2023-03-07 09:09:03,200][155452] Updated weights for policy 0, policy_version 71370 (0.0006) [2023-03-07 09:09:03,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13013.5). Total num frames: 73084928. Throughput: 0: 13008.4. Samples: 73051649. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:09:03,368][155126] Avg episode reward: [(0, '1985.256')] [2023-03-07 09:09:03,976][155452] Updated weights for policy 0, policy_version 71380 (0.0006) [2023-03-07 09:09:04,767][155452] Updated weights for policy 0, policy_version 71390 (0.0007) [2023-03-07 09:09:05,552][155452] Updated weights for policy 0, policy_version 71400 (0.0007) [2023-03-07 09:09:06,332][155452] Updated weights for policy 0, policy_version 71410 (0.0006) [2023-03-07 09:09:07,111][155452] Updated weights for policy 0, policy_version 71420 (0.0006) [2023-03-07 09:09:07,887][155452] Updated weights for policy 0, policy_version 71430 (0.0005) [2023-03-07 09:09:08,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13021.9, 300 sec: 13013.5). Total num frames: 73150464. Throughput: 0: 13016.5. Samples: 73129987. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:09:08,367][155126] Avg episode reward: [(0, '2098.733')] [2023-03-07 09:09:08,682][155452] Updated weights for policy 0, policy_version 71440 (0.0006) [2023-03-07 09:09:09,460][155452] Updated weights for policy 0, policy_version 71450 (0.0006) [2023-03-07 09:09:10,251][155452] Updated weights for policy 0, policy_version 71460 (0.0007) [2023-03-07 09:09:11,038][155452] Updated weights for policy 0, policy_version 71470 (0.0007) [2023-03-07 09:09:11,822][155452] Updated weights for policy 0, policy_version 71480 (0.0006) [2023-03-07 09:09:12,593][155452] Updated weights for policy 0, policy_version 71490 (0.0006) [2023-03-07 09:09:13,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13004.8, 300 sec: 13010.0). Total num frames: 73214976. Throughput: 0: 13015.6. Samples: 73208287. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:09:13,367][155126] Avg episode reward: [(0, '1901.609')] [2023-03-07 09:09:13,384][155452] Updated weights for policy 0, policy_version 71500 (0.0007) [2023-03-07 09:09:14,149][155452] Updated weights for policy 0, policy_version 71510 (0.0006) [2023-03-07 09:09:14,951][155452] Updated weights for policy 0, policy_version 71520 (0.0006) [2023-03-07 09:09:15,736][155452] Updated weights for policy 0, policy_version 71530 (0.0006) [2023-03-07 09:09:16,533][155452] Updated weights for policy 0, policy_version 71540 (0.0005) [2023-03-07 09:09:17,326][155452] Updated weights for policy 0, policy_version 71550 (0.0006) [2023-03-07 09:09:18,093][155452] Updated weights for policy 0, policy_version 71560 (0.0006) [2023-03-07 09:09:18,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13013.5). Total num frames: 73280512. Throughput: 0: 13014.6. Samples: 73247247. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:09:18,367][155126] Avg episode reward: [(0, '2053.400')] [2023-03-07 09:09:18,879][155452] Updated weights for policy 0, policy_version 71570 (0.0006) [2023-03-07 09:09:19,653][155452] Updated weights for policy 0, policy_version 71580 (0.0006) [2023-03-07 09:09:20,429][155452] Updated weights for policy 0, policy_version 71590 (0.0006) [2023-03-07 09:09:21,209][155452] Updated weights for policy 0, policy_version 71600 (0.0007) [2023-03-07 09:09:21,991][155452] Updated weights for policy 0, policy_version 71610 (0.0006) [2023-03-07 09:09:22,766][155452] Updated weights for policy 0, policy_version 71620 (0.0006) [2023-03-07 09:09:23,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13021.9, 300 sec: 13017.0). Total num frames: 73346048. Throughput: 0: 13027.0. Samples: 73325892. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:09:23,367][155126] Avg episode reward: [(0, '1711.820')] [2023-03-07 09:09:23,574][155452] Updated weights for policy 0, policy_version 71630 (0.0007) [2023-03-07 09:09:24,340][155452] Updated weights for policy 0, policy_version 71640 (0.0006) [2023-03-07 09:09:25,124][155452] Updated weights for policy 0, policy_version 71650 (0.0007) [2023-03-07 09:09:25,929][155452] Updated weights for policy 0, policy_version 71660 (0.0005) [2023-03-07 09:09:26,695][155452] Updated weights for policy 0, policy_version 71670 (0.0007) [2023-03-07 09:09:27,491][155452] Updated weights for policy 0, policy_version 71680 (0.0006) [2023-03-07 09:09:28,281][155452] Updated weights for policy 0, policy_version 71690 (0.0006) [2023-03-07 09:09:28,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.8, 300 sec: 13013.5). Total num frames: 73410560. Throughput: 0: 13030.7. Samples: 73404129. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:09:28,368][155126] Avg episode reward: [(0, '1865.464')] [2023-03-07 09:09:28,372][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000071690_73410560.pth... 
[2023-03-07 09:09:28,403][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000068639_70286336.pth [2023-03-07 09:09:29,094][155452] Updated weights for policy 0, policy_version 71700 (0.0006) [2023-03-07 09:09:29,884][155452] Updated weights for policy 0, policy_version 71710 (0.0006) [2023-03-07 09:09:30,669][155452] Updated weights for policy 0, policy_version 71720 (0.0007) [2023-03-07 09:09:31,455][155452] Updated weights for policy 0, policy_version 71730 (0.0007) [2023-03-07 09:09:32,258][155452] Updated weights for policy 0, policy_version 71740 (0.0007) [2023-03-07 09:09:33,031][155452] Updated weights for policy 0, policy_version 71750 (0.0006) [2023-03-07 09:09:33,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13017.0). Total num frames: 73476096. Throughput: 0: 13023.1. Samples: 73442858. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:09:33,367][155126] Avg episode reward: [(0, '1927.716')] [2023-03-07 09:09:33,822][155452] Updated weights for policy 0, policy_version 71760 (0.0006) [2023-03-07 09:09:34,611][155452] Updated weights for policy 0, policy_version 71770 (0.0007) [2023-03-07 09:09:35,385][155452] Updated weights for policy 0, policy_version 71780 (0.0006) [2023-03-07 09:09:36,174][155452] Updated weights for policy 0, policy_version 71790 (0.0006) [2023-03-07 09:09:36,965][155452] Updated weights for policy 0, policy_version 71800 (0.0006) [2023-03-07 09:09:37,757][155452] Updated weights for policy 0, policy_version 71810 (0.0006) [2023-03-07 09:09:38,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13016.9). Total num frames: 73540608. Throughput: 0: 13025.5. Samples: 73521034. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:09:38,367][155126] Avg episode reward: [(0, '2102.258')] [2023-03-07 09:09:38,556][155452] Updated weights for policy 0, policy_version 71820 (0.0006) [2023-03-07 09:09:39,335][155452] Updated weights for policy 0, policy_version 71830 (0.0006) [2023-03-07 09:09:40,111][155452] Updated weights for policy 0, policy_version 71840 (0.0006) [2023-03-07 09:09:40,887][155452] Updated weights for policy 0, policy_version 71850 (0.0006) [2023-03-07 09:09:41,703][155452] Updated weights for policy 0, policy_version 71860 (0.0007) [2023-03-07 09:09:42,477][155452] Updated weights for policy 0, policy_version 71870 (0.0006) [2023-03-07 09:09:43,245][155452] Updated weights for policy 0, policy_version 71880 (0.0007) [2023-03-07 09:09:43,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13021.8, 300 sec: 13020.4). Total num frames: 73606144. Throughput: 0: 13034.5. Samples: 73599163. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:09:43,368][155126] Avg episode reward: [(0, '1746.248')] [2023-03-07 09:09:44,023][155452] Updated weights for policy 0, policy_version 71890 (0.0006) [2023-03-07 09:09:44,839][155452] Updated weights for policy 0, policy_version 71900 (0.0007) [2023-03-07 09:09:45,622][155452] Updated weights for policy 0, policy_version 71910 (0.0006) [2023-03-07 09:09:46,395][155452] Updated weights for policy 0, policy_version 71920 (0.0006) [2023-03-07 09:09:47,198][155452] Updated weights for policy 0, policy_version 71930 (0.0007) [2023-03-07 09:09:47,998][155452] Updated weights for policy 0, policy_version 71940 (0.0006) [2023-03-07 09:09:48,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13016.9). Total num frames: 73670656. Throughput: 0: 13032.7. Samples: 73638121. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:09:48,367][155126] Avg episode reward: [(0, '2007.047')] [2023-03-07 09:09:48,789][155452] Updated weights for policy 0, policy_version 71950 (0.0006) [2023-03-07 09:09:49,558][155452] Updated weights for policy 0, policy_version 71960 (0.0006) [2023-03-07 09:09:50,343][155452] Updated weights for policy 0, policy_version 71970 (0.0006) [2023-03-07 09:09:51,132][155452] Updated weights for policy 0, policy_version 71980 (0.0006) [2023-03-07 09:09:51,914][155452] Updated weights for policy 0, policy_version 71990 (0.0006) [2023-03-07 09:09:52,718][155452] Updated weights for policy 0, policy_version 72000 (0.0006) [2023-03-07 09:09:53,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13016.9). Total num frames: 73736192. Throughput: 0: 13024.4. Samples: 73716086. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:09:53,367][155126] Avg episode reward: [(0, '1972.039')] [2023-03-07 09:09:53,500][155452] Updated weights for policy 0, policy_version 72010 (0.0006) [2023-03-07 09:09:54,292][155452] Updated weights for policy 0, policy_version 72020 (0.0006) [2023-03-07 09:09:55,051][155452] Updated weights for policy 0, policy_version 72030 (0.0006) [2023-03-07 09:09:55,824][155452] Updated weights for policy 0, policy_version 72040 (0.0006) [2023-03-07 09:09:56,621][155452] Updated weights for policy 0, policy_version 72050 (0.0006) [2023-03-07 09:09:57,419][155452] Updated weights for policy 0, policy_version 72060 (0.0006) [2023-03-07 09:09:58,191][155452] Updated weights for policy 0, policy_version 72070 (0.0006) [2023-03-07 09:09:58,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13020.4). Total num frames: 73801728. Throughput: 0: 13023.3. Samples: 73794337. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:09:58,367][155126] Avg episode reward: [(0, '2065.308')] [2023-03-07 09:09:58,985][155452] Updated weights for policy 0, policy_version 72080 (0.0005) [2023-03-07 09:09:59,777][155452] Updated weights for policy 0, policy_version 72090 (0.0006) [2023-03-07 09:10:00,542][155452] Updated weights for policy 0, policy_version 72100 (0.0006) [2023-03-07 09:10:01,337][155452] Updated weights for policy 0, policy_version 72110 (0.0006) [2023-03-07 09:10:02,102][155452] Updated weights for policy 0, policy_version 72120 (0.0006) [2023-03-07 09:10:02,906][155452] Updated weights for policy 0, policy_version 72130 (0.0006) [2023-03-07 09:10:03,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13039.0, 300 sec: 13023.9). Total num frames: 73867264. Throughput: 0: 13027.1. Samples: 73833465. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:10:03,367][155126] Avg episode reward: [(0, '2093.285')] [2023-03-07 09:10:03,677][155452] Updated weights for policy 0, policy_version 72140 (0.0006) [2023-03-07 09:10:04,469][155452] Updated weights for policy 0, policy_version 72150 (0.0006) [2023-03-07 09:10:05,266][155452] Updated weights for policy 0, policy_version 72160 (0.0006) [2023-03-07 09:10:06,053][155452] Updated weights for policy 0, policy_version 72170 (0.0006) [2023-03-07 09:10:06,859][155452] Updated weights for policy 0, policy_version 72180 (0.0006) [2023-03-07 09:10:07,645][155452] Updated weights for policy 0, policy_version 72190 (0.0007) [2023-03-07 09:10:08,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 73931776. Throughput: 0: 13014.4. Samples: 73911540. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:10:08,367][155126] Avg episode reward: [(0, '2176.393')] [2023-03-07 09:10:08,422][155452] Updated weights for policy 0, policy_version 72200 (0.0006) [2023-03-07 09:10:09,216][155452] Updated weights for policy 0, policy_version 72210 (0.0006) [2023-03-07 09:10:10,013][155452] Updated weights for policy 0, policy_version 72220 (0.0006) [2023-03-07 09:10:10,799][155452] Updated weights for policy 0, policy_version 72230 (0.0006) [2023-03-07 09:10:11,580][155452] Updated weights for policy 0, policy_version 72240 (0.0005) [2023-03-07 09:10:12,366][155452] Updated weights for policy 0, policy_version 72250 (0.0006) [2023-03-07 09:10:13,156][155452] Updated weights for policy 0, policy_version 72260 (0.0006) [2023-03-07 09:10:13,367][155126] Fps is (10 sec: 12902.4, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 73996288. Throughput: 0: 13008.9. Samples: 73989526. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:10:13,367][155126] Avg episode reward: [(0, '1902.451')] [2023-03-07 09:10:13,954][155452] Updated weights for policy 0, policy_version 72270 (0.0006) [2023-03-07 09:10:14,753][155452] Updated weights for policy 0, policy_version 72280 (0.0006) [2023-03-07 09:10:15,537][155452] Updated weights for policy 0, policy_version 72290 (0.0006) [2023-03-07 09:10:16,318][155452] Updated weights for policy 0, policy_version 72300 (0.0006) [2023-03-07 09:10:17,119][155452] Updated weights for policy 0, policy_version 72310 (0.0006) [2023-03-07 09:10:17,882][155452] Updated weights for policy 0, policy_version 72320 (0.0006) [2023-03-07 09:10:18,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 74061824. Throughput: 0: 13010.4. Samples: 74028326. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:10:18,368][155126] Avg episode reward: [(0, '2024.503')] [2023-03-07 09:10:18,675][155452] Updated weights for policy 0, policy_version 72330 (0.0005) [2023-03-07 09:10:19,472][155452] Updated weights for policy 0, policy_version 72340 (0.0006) [2023-03-07 09:10:20,245][155452] Updated weights for policy 0, policy_version 72350 (0.0006) [2023-03-07 09:10:21,038][155452] Updated weights for policy 0, policy_version 72360 (0.0006) [2023-03-07 09:10:21,824][155452] Updated weights for policy 0, policy_version 72370 (0.0006) [2023-03-07 09:10:22,586][155452] Updated weights for policy 0, policy_version 72380 (0.0006) [2023-03-07 09:10:23,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13004.8, 300 sec: 13016.9). Total num frames: 74126336. Throughput: 0: 13012.0. Samples: 74106573. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:10:23,368][155126] Avg episode reward: [(0, '2269.581')] [2023-03-07 09:10:23,388][155452] Updated weights for policy 0, policy_version 72390 (0.0006) [2023-03-07 09:10:24,190][155452] Updated weights for policy 0, policy_version 72400 (0.0006) [2023-03-07 09:10:24,974][155452] Updated weights for policy 0, policy_version 72410 (0.0006) [2023-03-07 09:10:25,747][155452] Updated weights for policy 0, policy_version 72420 (0.0007) [2023-03-07 09:10:26,543][155452] Updated weights for policy 0, policy_version 72430 (0.0006) [2023-03-07 09:10:27,340][155452] Updated weights for policy 0, policy_version 72440 (0.0006) [2023-03-07 09:10:28,125][155452] Updated weights for policy 0, policy_version 72450 (0.0007) [2023-03-07 09:10:28,367][155126] Fps is (10 sec: 12902.4, 60 sec: 13004.8, 300 sec: 13016.9). Total num frames: 74190848. Throughput: 0: 13005.7. Samples: 74184418. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:10:28,367][155126] Avg episode reward: [(0, '2214.997')] [2023-03-07 09:10:28,905][155452] Updated weights for policy 0, policy_version 72460 (0.0006) [2023-03-07 09:10:29,690][155452] Updated weights for policy 0, policy_version 72470 (0.0006) [2023-03-07 09:10:30,481][155452] Updated weights for policy 0, policy_version 72480 (0.0006) [2023-03-07 09:10:31,281][155452] Updated weights for policy 0, policy_version 72490 (0.0006) [2023-03-07 09:10:32,056][155452] Updated weights for policy 0, policy_version 72500 (0.0006) [2023-03-07 09:10:32,845][155452] Updated weights for policy 0, policy_version 72510 (0.0006) [2023-03-07 09:10:33,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13004.8, 300 sec: 13017.0). Total num frames: 74256384. Throughput: 0: 13010.0. Samples: 74223569. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:10:33,367][155126] Avg episode reward: [(0, '2164.812')] [2023-03-07 09:10:33,629][155452] Updated weights for policy 0, policy_version 72520 (0.0005) [2023-03-07 09:10:34,421][155452] Updated weights for policy 0, policy_version 72530 (0.0006) [2023-03-07 09:10:35,214][155452] Updated weights for policy 0, policy_version 72540 (0.0006) [2023-03-07 09:10:36,031][155452] Updated weights for policy 0, policy_version 72550 (0.0006) [2023-03-07 09:10:36,814][155452] Updated weights for policy 0, policy_version 72560 (0.0006) [2023-03-07 09:10:37,594][155452] Updated weights for policy 0, policy_version 72570 (0.0006) [2023-03-07 09:10:38,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13016.9). Total num frames: 74320896. Throughput: 0: 13002.5. Samples: 74301201. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:10:38,368][155126] Avg episode reward: [(0, '2220.774')] [2023-03-07 09:10:38,398][155452] Updated weights for policy 0, policy_version 72580 (0.0006) [2023-03-07 09:10:39,177][155452] Updated weights for policy 0, policy_version 72590 (0.0006) [2023-03-07 09:10:39,960][155452] Updated weights for policy 0, policy_version 72600 (0.0007) [2023-03-07 09:10:40,743][155452] Updated weights for policy 0, policy_version 72610 (0.0007) [2023-03-07 09:10:41,532][155452] Updated weights for policy 0, policy_version 72620 (0.0007) [2023-03-07 09:10:42,315][155452] Updated weights for policy 0, policy_version 72630 (0.0006) [2023-03-07 09:10:43,101][155452] Updated weights for policy 0, policy_version 72640 (0.0006) [2023-03-07 09:10:43,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13016.9). Total num frames: 74386432. Throughput: 0: 13001.1. Samples: 74379384. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:10:43,368][155126] Avg episode reward: [(0, '2161.680')] [2023-03-07 09:10:43,891][155452] Updated weights for policy 0, policy_version 72650 (0.0006) [2023-03-07 09:10:44,678][155452] Updated weights for policy 0, policy_version 72660 (0.0006) [2023-03-07 09:10:45,470][155452] Updated weights for policy 0, policy_version 72670 (0.0006) [2023-03-07 09:10:46,239][155452] Updated weights for policy 0, policy_version 72680 (0.0008) [2023-03-07 09:10:47,038][155452] Updated weights for policy 0, policy_version 72690 (0.0006) [2023-03-07 09:10:47,828][155452] Updated weights for policy 0, policy_version 72700 (0.0007) [2023-03-07 09:10:48,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13004.8, 300 sec: 13016.9). Total num frames: 74450944. Throughput: 0: 12999.0. Samples: 74418419. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:10:48,367][155126] Avg episode reward: [(0, '2265.859')] [2023-03-07 09:10:48,613][155452] Updated weights for policy 0, policy_version 72710 (0.0006) [2023-03-07 09:10:49,397][155452] Updated weights for policy 0, policy_version 72720 (0.0005) [2023-03-07 09:10:50,186][155452] Updated weights for policy 0, policy_version 72730 (0.0006) [2023-03-07 09:10:50,961][155452] Updated weights for policy 0, policy_version 72740 (0.0006) [2023-03-07 09:10:51,745][155452] Updated weights for policy 0, policy_version 72750 (0.0006) [2023-03-07 09:10:52,529][155452] Updated weights for policy 0, policy_version 72760 (0.0006) [2023-03-07 09:10:53,316][155452] Updated weights for policy 0, policy_version 72770 (0.0005) [2023-03-07 09:10:53,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13017.0). Total num frames: 74516480. Throughput: 0: 13000.1. Samples: 74496546. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:10:53,367][155126] Avg episode reward: [(0, '2150.719')] [2023-03-07 09:10:54,081][155452] Updated weights for policy 0, policy_version 72780 (0.0006) [2023-03-07 09:10:54,867][155452] Updated weights for policy 0, policy_version 72790 (0.0006) [2023-03-07 09:10:55,636][155452] Updated weights for policy 0, policy_version 72800 (0.0007) [2023-03-07 09:10:56,446][155452] Updated weights for policy 0, policy_version 72810 (0.0006) [2023-03-07 09:10:57,242][155452] Updated weights for policy 0, policy_version 72820 (0.0006) [2023-03-07 09:10:58,022][155452] Updated weights for policy 0, policy_version 72830 (0.0005) [2023-03-07 09:10:58,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13004.8, 300 sec: 13016.9). Total num frames: 74582016. Throughput: 0: 13011.7. Samples: 74575054. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:10:58,378][155126] Avg episode reward: [(0, '2118.284')] [2023-03-07 09:10:58,802][155452] Updated weights for policy 0, policy_version 72840 (0.0006) [2023-03-07 09:10:59,579][155452] Updated weights for policy 0, policy_version 72850 (0.0006) [2023-03-07 09:11:00,366][155452] Updated weights for policy 0, policy_version 72860 (0.0006) [2023-03-07 09:11:01,155][155452] Updated weights for policy 0, policy_version 72870 (0.0006) [2023-03-07 09:11:01,964][155452] Updated weights for policy 0, policy_version 72880 (0.0006) [2023-03-07 09:11:02,740][155452] Updated weights for policy 0, policy_version 72890 (0.0007) [2023-03-07 09:11:03,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13004.8, 300 sec: 13020.4). Total num frames: 74647552. Throughput: 0: 13019.4. Samples: 74614196. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:11:03,367][155126] Avg episode reward: [(0, '2101.430')] [2023-03-07 09:11:03,516][155452] Updated weights for policy 0, policy_version 72900 (0.0006) [2023-03-07 09:11:04,314][155452] Updated weights for policy 0, policy_version 72910 (0.0006) [2023-03-07 09:11:05,109][155452] Updated weights for policy 0, policy_version 72920 (0.0006) [2023-03-07 09:11:05,889][155452] Updated weights for policy 0, policy_version 72930 (0.0006) [2023-03-07 09:11:06,677][155452] Updated weights for policy 0, policy_version 72940 (0.0006) [2023-03-07 09:11:07,474][155452] Updated weights for policy 0, policy_version 72950 (0.0006) [2023-03-07 09:11:08,274][155452] Updated weights for policy 0, policy_version 72960 (0.0006) [2023-03-07 09:11:08,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13004.8, 300 sec: 13020.4). Total num frames: 74712064. Throughput: 0: 13012.9. Samples: 74692154. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:11:08,367][155126] Avg episode reward: [(0, '2254.668')] [2023-03-07 09:11:09,055][155452] Updated weights for policy 0, policy_version 72970 (0.0005) [2023-03-07 09:11:09,830][155452] Updated weights for policy 0, policy_version 72980 (0.0006) [2023-03-07 09:11:10,611][155452] Updated weights for policy 0, policy_version 72990 (0.0006) [2023-03-07 09:11:11,402][155452] Updated weights for policy 0, policy_version 73000 (0.0006) [2023-03-07 09:11:12,191][155452] Updated weights for policy 0, policy_version 73010 (0.0006) [2023-03-07 09:11:12,990][155452] Updated weights for policy 0, policy_version 73020 (0.0006) [2023-03-07 09:11:13,367][155126] Fps is (10 sec: 12902.3, 60 sec: 13004.8, 300 sec: 13017.0). Total num frames: 74776576. Throughput: 0: 13012.9. Samples: 74769999. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:11:13,367][155126] Avg episode reward: [(0, '2358.307')] [2023-03-07 09:11:13,775][155452] Updated weights for policy 0, policy_version 73030 (0.0006) [2023-03-07 09:11:14,573][155452] Updated weights for policy 0, policy_version 73040 (0.0006) [2023-03-07 09:11:15,346][155452] Updated weights for policy 0, policy_version 73050 (0.0006) [2023-03-07 09:11:16,132][155452] Updated weights for policy 0, policy_version 73060 (0.0006) [2023-03-07 09:11:16,904][155452] Updated weights for policy 0, policy_version 73070 (0.0007) [2023-03-07 09:11:17,699][155452] Updated weights for policy 0, policy_version 73080 (0.0006) [2023-03-07 09:11:18,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13004.8, 300 sec: 13020.4). Total num frames: 74842112. Throughput: 0: 13013.2. Samples: 74809166. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:11:18,368][155126] Avg episode reward: [(0, '2274.514')] [2023-03-07 09:11:18,487][155452] Updated weights for policy 0, policy_version 73090 (0.0005) [2023-03-07 09:11:19,269][155452] Updated weights for policy 0, policy_version 73100 (0.0007) [2023-03-07 09:11:20,054][155452] Updated weights for policy 0, policy_version 73110 (0.0006) [2023-03-07 09:11:20,839][155452] Updated weights for policy 0, policy_version 73120 (0.0006) [2023-03-07 09:11:21,649][155452] Updated weights for policy 0, policy_version 73130 (0.0006) [2023-03-07 09:11:22,428][155452] Updated weights for policy 0, policy_version 73140 (0.0006) [2023-03-07 09:11:23,208][155452] Updated weights for policy 0, policy_version 73150 (0.0006) [2023-03-07 09:11:23,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 74907648. Throughput: 0: 13019.4. Samples: 74887073. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:11:23,367][155126] Avg episode reward: [(0, '2253.874')] [2023-03-07 09:11:23,994][155452] Updated weights for policy 0, policy_version 73160 (0.0006) [2023-03-07 09:11:24,794][155452] Updated weights for policy 0, policy_version 73170 (0.0006) [2023-03-07 09:11:25,557][155452] Updated weights for policy 0, policy_version 73180 (0.0006) [2023-03-07 09:11:26,343][155452] Updated weights for policy 0, policy_version 73190 (0.0006) [2023-03-07 09:11:27,141][155452] Updated weights for policy 0, policy_version 73200 (0.0007) [2023-03-07 09:11:27,925][155452] Updated weights for policy 0, policy_version 73210 (0.0006) [2023-03-07 09:11:28,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 74972160. Throughput: 0: 13019.4. Samples: 74965258. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:11:28,367][155126] Avg episode reward: [(0, '2211.636')] [2023-03-07 09:11:28,372][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000073215_74972160.pth... [2023-03-07 09:11:28,403][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000070165_71848960.pth [2023-03-07 09:11:28,706][155452] Updated weights for policy 0, policy_version 73220 (0.0006) [2023-03-07 09:11:29,506][155452] Updated weights for policy 0, policy_version 73230 (0.0006) [2023-03-07 09:11:30,292][155452] Updated weights for policy 0, policy_version 73240 (0.0006) [2023-03-07 09:11:31,072][155452] Updated weights for policy 0, policy_version 73250 (0.0006) [2023-03-07 09:11:31,866][155452] Updated weights for policy 0, policy_version 73260 (0.0006) [2023-03-07 09:11:32,662][155452] Updated weights for policy 0, policy_version 73270 (0.0006) [2023-03-07 09:11:33,367][155126] Fps is (10 sec: 12902.5, 60 sec: 13004.8, 300 sec: 13017.0). Total num frames: 75036672. Throughput: 0: 13019.6. Samples: 75004299. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:11:33,367][155126] Avg episode reward: [(0, '2145.927')] [2023-03-07 09:11:33,457][155452] Updated weights for policy 0, policy_version 73280 (0.0007) [2023-03-07 09:11:34,244][155452] Updated weights for policy 0, policy_version 73290 (0.0006) [2023-03-07 09:11:35,027][155452] Updated weights for policy 0, policy_version 73300 (0.0007) [2023-03-07 09:11:35,804][155452] Updated weights for policy 0, policy_version 73310 (0.0006) [2023-03-07 09:11:36,587][155452] Updated weights for policy 0, policy_version 73320 (0.0006) [2023-03-07 09:11:37,390][155452] Updated weights for policy 0, policy_version 73330 (0.0006) [2023-03-07 09:11:38,166][155452] Updated weights for policy 0, policy_version 73340 (0.0007) [2023-03-07 09:11:38,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13021.8, 300 sec: 13020.4). Total num frames: 75102208. Throughput: 0: 13015.6. Samples: 75082249. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:11:38,368][155126] Avg episode reward: [(0, '2129.152')] [2023-03-07 09:11:38,961][155452] Updated weights for policy 0, policy_version 73350 (0.0007) [2023-03-07 09:11:39,758][155452] Updated weights for policy 0, policy_version 73360 (0.0006) [2023-03-07 09:11:40,535][155452] Updated weights for policy 0, policy_version 73370 (0.0006) [2023-03-07 09:11:41,326][155452] Updated weights for policy 0, policy_version 73380 (0.0006) [2023-03-07 09:11:42,125][155452] Updated weights for policy 0, policy_version 73390 (0.0005) [2023-03-07 09:11:42,899][155452] Updated weights for policy 0, policy_version 73400 (0.0006) [2023-03-07 09:11:43,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13017.0). Total num frames: 75166720. Throughput: 0: 13001.0. Samples: 75160096. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:11:43,367][155126] Avg episode reward: [(0, '2201.896')] [2023-03-07 09:11:43,684][155452] Updated weights for policy 0, policy_version 73410 (0.0007) [2023-03-07 09:11:44,493][155452] Updated weights for policy 0, policy_version 73420 (0.0006) [2023-03-07 09:11:45,266][155452] Updated weights for policy 0, policy_version 73430 (0.0006) [2023-03-07 09:11:46,058][155452] Updated weights for policy 0, policy_version 73440 (0.0006) [2023-03-07 09:11:46,842][155452] Updated weights for policy 0, policy_version 73450 (0.0006) [2023-03-07 09:11:47,622][155452] Updated weights for policy 0, policy_version 73460 (0.0007) [2023-03-07 09:11:48,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13016.9). Total num frames: 75232256. Throughput: 0: 12997.3. Samples: 75199076. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:11:48,367][155126] Avg episode reward: [(0, '2165.752')] [2023-03-07 09:11:48,417][155452] Updated weights for policy 0, policy_version 73470 (0.0007) [2023-03-07 09:11:49,205][155452] Updated weights for policy 0, policy_version 73480 (0.0007) [2023-03-07 09:11:49,991][155452] Updated weights for policy 0, policy_version 73490 (0.0007) [2023-03-07 09:11:50,784][155452] Updated weights for policy 0, policy_version 73500 (0.0006) [2023-03-07 09:11:51,564][155452] Updated weights for policy 0, policy_version 73510 (0.0006) [2023-03-07 09:11:52,347][155452] Updated weights for policy 0, policy_version 73520 (0.0006) [2023-03-07 09:11:53,128][155452] Updated weights for policy 0, policy_version 73530 (0.0006) [2023-03-07 09:11:53,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13013.5). Total num frames: 75296768. Throughput: 0: 13001.8. Samples: 75277233. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:11:53,368][155126] Avg episode reward: [(0, '2005.712')] [2023-03-07 09:11:53,913][155452] Updated weights for policy 0, policy_version 73540 (0.0006) [2023-03-07 09:11:54,706][155452] Updated weights for policy 0, policy_version 73550 (0.0006) [2023-03-07 09:11:55,482][155452] Updated weights for policy 0, policy_version 73560 (0.0007) [2023-03-07 09:11:56,274][155452] Updated weights for policy 0, policy_version 73570 (0.0006) [2023-03-07 09:11:57,065][155452] Updated weights for policy 0, policy_version 73580 (0.0007) [2023-03-07 09:11:57,849][155452] Updated weights for policy 0, policy_version 73590 (0.0006) [2023-03-07 09:11:58,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13004.8, 300 sec: 13017.0). Total num frames: 75362304. Throughput: 0: 13009.0. Samples: 75355404. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:11:58,367][155126] Avg episode reward: [(0, '2081.430')] [2023-03-07 09:11:58,622][155452] Updated weights for policy 0, policy_version 73600 (0.0006) [2023-03-07 09:11:59,434][155452] Updated weights for policy 0, policy_version 73610 (0.0006) [2023-03-07 09:12:00,212][155452] Updated weights for policy 0, policy_version 73620 (0.0006) [2023-03-07 09:12:00,988][155452] Updated weights for policy 0, policy_version 73630 (0.0006) [2023-03-07 09:12:01,805][155452] Updated weights for policy 0, policy_version 73640 (0.0006) [2023-03-07 09:12:02,594][155452] Updated weights for policy 0, policy_version 73650 (0.0006) [2023-03-07 09:12:03,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13004.8, 300 sec: 13016.9). Total num frames: 75427840. Throughput: 0: 13007.1. Samples: 75394483. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:12:03,367][155126] Avg episode reward: [(0, '2075.993')] [2023-03-07 09:12:03,368][155452] Updated weights for policy 0, policy_version 73660 (0.0007) [2023-03-07 09:12:04,170][155452] Updated weights for policy 0, policy_version 73670 (0.0006) [2023-03-07 09:12:04,954][155452] Updated weights for policy 0, policy_version 73680 (0.0007) [2023-03-07 09:12:05,741][155452] Updated weights for policy 0, policy_version 73690 (0.0006) [2023-03-07 09:12:06,547][155452] Updated weights for policy 0, policy_version 73700 (0.0007) [2023-03-07 09:12:07,331][155452] Updated weights for policy 0, policy_version 73710 (0.0006) [2023-03-07 09:12:08,129][155452] Updated weights for policy 0, policy_version 73720 (0.0006) [2023-03-07 09:12:08,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13004.8, 300 sec: 13016.9). Total num frames: 75492352. Throughput: 0: 13003.8. Samples: 75472244. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:12:08,368][155126] Avg episode reward: [(0, '1947.697')] [2023-03-07 09:12:08,912][155452] Updated weights for policy 0, policy_version 73730 (0.0006) [2023-03-07 09:12:09,700][155452] Updated weights for policy 0, policy_version 73740 (0.0006) [2023-03-07 09:12:10,477][155452] Updated weights for policy 0, policy_version 73750 (0.0007) [2023-03-07 09:12:11,279][155452] Updated weights for policy 0, policy_version 73760 (0.0006) [2023-03-07 09:12:12,082][155452] Updated weights for policy 0, policy_version 73770 (0.0006) [2023-03-07 09:12:12,857][155452] Updated weights for policy 0, policy_version 73780 (0.0006) [2023-03-07 09:12:13,367][155126] Fps is (10 sec: 12902.4, 60 sec: 13004.8, 300 sec: 13013.5). Total num frames: 75556864. Throughput: 0: 12993.9. Samples: 75549984. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:12:13,367][155126] Avg episode reward: [(0, '2061.313')] [2023-03-07 09:12:13,656][155452] Updated weights for policy 0, policy_version 73790 (0.0007) [2023-03-07 09:12:14,446][155452] Updated weights for policy 0, policy_version 73800 (0.0007) [2023-03-07 09:12:15,228][155452] Updated weights for policy 0, policy_version 73810 (0.0006) [2023-03-07 09:12:16,030][155452] Updated weights for policy 0, policy_version 73820 (0.0007) [2023-03-07 09:12:16,821][155452] Updated weights for policy 0, policy_version 73830 (0.0006) [2023-03-07 09:12:17,607][155452] Updated weights for policy 0, policy_version 73840 (0.0006) [2023-03-07 09:12:18,367][155126] Fps is (10 sec: 12902.4, 60 sec: 12987.7, 300 sec: 13010.0). Total num frames: 75621376. Throughput: 0: 12987.7. Samples: 75588746. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:12:18,368][155126] Avg episode reward: [(0, '1948.200')] [2023-03-07 09:12:18,396][155452] Updated weights for policy 0, policy_version 73850 (0.0007) [2023-03-07 09:12:19,190][155452] Updated weights for policy 0, policy_version 73860 (0.0006) [2023-03-07 09:12:19,962][155452] Updated weights for policy 0, policy_version 73870 (0.0006) [2023-03-07 09:12:20,747][155452] Updated weights for policy 0, policy_version 73880 (0.0007) [2023-03-07 09:12:21,542][155452] Updated weights for policy 0, policy_version 73890 (0.0006) [2023-03-07 09:12:22,344][155452] Updated weights for policy 0, policy_version 73900 (0.0006) [2023-03-07 09:12:23,123][155452] Updated weights for policy 0, policy_version 73910 (0.0006) [2023-03-07 09:12:23,367][155126] Fps is (10 sec: 13004.9, 60 sec: 12987.8, 300 sec: 13013.5). Total num frames: 75686912. Throughput: 0: 12987.4. Samples: 75666677. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:12:23,367][155126] Avg episode reward: [(0, '1967.772')] [2023-03-07 09:12:23,919][155452] Updated weights for policy 0, policy_version 73920 (0.0006) [2023-03-07 09:12:24,688][155452] Updated weights for policy 0, policy_version 73930 (0.0006) [2023-03-07 09:12:25,485][155452] Updated weights for policy 0, policy_version 73940 (0.0006) [2023-03-07 09:12:26,280][155452] Updated weights for policy 0, policy_version 73950 (0.0006) [2023-03-07 09:12:27,085][155452] Updated weights for policy 0, policy_version 73960 (0.0006) [2023-03-07 09:12:27,871][155452] Updated weights for policy 0, policy_version 73970 (0.0006) [2023-03-07 09:12:28,367][155126] Fps is (10 sec: 13005.0, 60 sec: 12987.7, 300 sec: 13010.0). Total num frames: 75751424. Throughput: 0: 12983.7. Samples: 75744362. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:12:28,367][155126] Avg episode reward: [(0, '1956.825')] [2023-03-07 09:12:28,667][155452] Updated weights for policy 0, policy_version 73980 (0.0006) [2023-03-07 09:12:29,438][155452] Updated weights for policy 0, policy_version 73990 (0.0007) [2023-03-07 09:12:30,237][155452] Updated weights for policy 0, policy_version 74000 (0.0006) [2023-03-07 09:12:31,033][155452] Updated weights for policy 0, policy_version 74010 (0.0006) [2023-03-07 09:12:31,798][155452] Updated weights for policy 0, policy_version 74020 (0.0006) [2023-03-07 09:12:32,567][155452] Updated weights for policy 0, policy_version 74030 (0.0006) [2023-03-07 09:12:33,367][155126] Fps is (10 sec: 12902.3, 60 sec: 12987.7, 300 sec: 13006.5). Total num frames: 75815936. Throughput: 0: 12983.2. Samples: 75783319. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:12:33,367][155126] Avg episode reward: [(0, '2226.202')] [2023-03-07 09:12:33,382][155452] Updated weights for policy 0, policy_version 74040 (0.0007) [2023-03-07 09:12:34,172][155452] Updated weights for policy 0, policy_version 74050 (0.0007) [2023-03-07 09:12:34,954][155452] Updated weights for policy 0, policy_version 74060 (0.0006) [2023-03-07 09:12:35,738][155452] Updated weights for policy 0, policy_version 74070 (0.0006) [2023-03-07 09:12:36,534][155452] Updated weights for policy 0, policy_version 74080 (0.0005) [2023-03-07 09:12:37,314][155452] Updated weights for policy 0, policy_version 74090 (0.0006) [2023-03-07 09:12:38,100][155452] Updated weights for policy 0, policy_version 74100 (0.0006) [2023-03-07 09:12:38,367][155126] Fps is (10 sec: 13004.9, 60 sec: 12987.8, 300 sec: 13010.0). Total num frames: 75881472. Throughput: 0: 12980.2. Samples: 75861341. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:12:38,367][155126] Avg episode reward: [(0, '2151.243')] [2023-03-07 09:12:38,906][155452] Updated weights for policy 0, policy_version 74110 (0.0006) [2023-03-07 09:12:39,680][155452] Updated weights for policy 0, policy_version 74120 (0.0007) [2023-03-07 09:12:40,463][155452] Updated weights for policy 0, policy_version 74130 (0.0005) [2023-03-07 09:12:41,273][155452] Updated weights for policy 0, policy_version 74140 (0.0006) [2023-03-07 09:12:42,067][155452] Updated weights for policy 0, policy_version 74150 (0.0006) [2023-03-07 09:12:42,837][155452] Updated weights for policy 0, policy_version 74160 (0.0006) [2023-03-07 09:12:43,367][155126] Fps is (10 sec: 13004.8, 60 sec: 12987.7, 300 sec: 13006.5). Total num frames: 75945984. Throughput: 0: 12973.9. Samples: 75939228. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:12:43,367][155126] Avg episode reward: [(0, '2113.087')] [2023-03-07 09:12:43,647][155452] Updated weights for policy 0, policy_version 74170 (0.0006) [2023-03-07 09:12:44,428][155452] Updated weights for policy 0, policy_version 74180 (0.0006) [2023-03-07 09:12:45,192][155452] Updated weights for policy 0, policy_version 74190 (0.0007) [2023-03-07 09:12:45,993][155452] Updated weights for policy 0, policy_version 74200 (0.0006) [2023-03-07 09:12:46,775][155452] Updated weights for policy 0, policy_version 74210 (0.0006) [2023-03-07 09:12:47,568][155452] Updated weights for policy 0, policy_version 74220 (0.0006) [2023-03-07 09:12:48,367][155126] Fps is (10 sec: 12902.4, 60 sec: 12970.7, 300 sec: 13006.5). Total num frames: 76010496. Throughput: 0: 12971.2. Samples: 75978186. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:12:48,367][155126] Avg episode reward: [(0, '2127.564')] [2023-03-07 09:12:48,371][155452] Updated weights for policy 0, policy_version 74230 (0.0006) [2023-03-07 09:12:49,163][155452] Updated weights for policy 0, policy_version 74240 (0.0007) [2023-03-07 09:12:49,954][155452] Updated weights for policy 0, policy_version 74250 (0.0006) [2023-03-07 09:12:50,734][155452] Updated weights for policy 0, policy_version 74260 (0.0006) [2023-03-07 09:12:51,518][155452] Updated weights for policy 0, policy_version 74270 (0.0006) [2023-03-07 09:12:52,325][155452] Updated weights for policy 0, policy_version 74280 (0.0007) [2023-03-07 09:12:53,120][155452] Updated weights for policy 0, policy_version 74290 (0.0007) [2023-03-07 09:12:53,367][155126] Fps is (10 sec: 13004.8, 60 sec: 12987.7, 300 sec: 13010.0). Total num frames: 76076032. Throughput: 0: 12972.9. Samples: 76056020. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:12:53,367][155126] Avg episode reward: [(0, '1856.928')] [2023-03-07 09:12:53,882][155452] Updated weights for policy 0, policy_version 74300 (0.0006) [2023-03-07 09:12:54,651][155452] Updated weights for policy 0, policy_version 74310 (0.0006) [2023-03-07 09:12:55,440][155452] Updated weights for policy 0, policy_version 74320 (0.0006) [2023-03-07 09:12:56,234][155452] Updated weights for policy 0, policy_version 74330 (0.0006) [2023-03-07 09:12:57,009][155452] Updated weights for policy 0, policy_version 74340 (0.0006) [2023-03-07 09:12:57,802][155452] Updated weights for policy 0, policy_version 74350 (0.0006) [2023-03-07 09:12:58,367][155126] Fps is (10 sec: 13107.1, 60 sec: 12987.7, 300 sec: 13010.0). Total num frames: 76141568. Throughput: 0: 12983.0. Samples: 76134220. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:12:58,367][155126] Avg episode reward: [(0, '2047.813')] [2023-03-07 09:12:58,603][155452] Updated weights for policy 0, policy_version 74360 (0.0006) [2023-03-07 09:12:59,383][155452] Updated weights for policy 0, policy_version 74370 (0.0006) [2023-03-07 09:13:00,174][155452] Updated weights for policy 0, policy_version 74380 (0.0006) [2023-03-07 09:13:00,957][155452] Updated weights for policy 0, policy_version 74390 (0.0007) [2023-03-07 09:13:01,741][155452] Updated weights for policy 0, policy_version 74400 (0.0006) [2023-03-07 09:13:02,518][155452] Updated weights for policy 0, policy_version 74410 (0.0006) [2023-03-07 09:13:03,305][155452] Updated weights for policy 0, policy_version 74420 (0.0007) [2023-03-07 09:13:03,367][155126] Fps is (10 sec: 13004.7, 60 sec: 12970.7, 300 sec: 13006.5). Total num frames: 76206080. Throughput: 0: 12991.3. Samples: 76173354. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:13:03,367][155126] Avg episode reward: [(0, '1987.190')] [2023-03-07 09:13:04,106][155452] Updated weights for policy 0, policy_version 74430 (0.0006) [2023-03-07 09:13:04,878][155452] Updated weights for policy 0, policy_version 74440 (0.0006) [2023-03-07 09:13:05,671][155452] Updated weights for policy 0, policy_version 74450 (0.0006) [2023-03-07 09:13:06,444][155452] Updated weights for policy 0, policy_version 74460 (0.0006) [2023-03-07 09:13:07,228][155452] Updated weights for policy 0, policy_version 74470 (0.0006) [2023-03-07 09:13:08,015][155452] Updated weights for policy 0, policy_version 74480 (0.0006) [2023-03-07 09:13:08,367][155126] Fps is (10 sec: 13004.7, 60 sec: 12987.8, 300 sec: 13006.5). Total num frames: 76271616. Throughput: 0: 12998.9. Samples: 76251627. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:13:08,367][155126] Avg episode reward: [(0, '1934.851')] [2023-03-07 09:13:08,792][155452] Updated weights for policy 0, policy_version 74490 (0.0005) [2023-03-07 09:13:09,596][155452] Updated weights for policy 0, policy_version 74500 (0.0006) [2023-03-07 09:13:10,397][155452] Updated weights for policy 0, policy_version 74510 (0.0006) [2023-03-07 09:13:11,181][155452] Updated weights for policy 0, policy_version 74520 (0.0006) [2023-03-07 09:13:11,964][155452] Updated weights for policy 0, policy_version 74530 (0.0006) [2023-03-07 09:13:12,742][155452] Updated weights for policy 0, policy_version 74540 (0.0006) [2023-03-07 09:13:13,367][155126] Fps is (10 sec: 13004.9, 60 sec: 12987.7, 300 sec: 13006.5). Total num frames: 76336128. Throughput: 0: 13003.3. Samples: 76329512. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:13:13,367][155126] Avg episode reward: [(0, '2004.249')] [2023-03-07 09:13:13,539][155452] Updated weights for policy 0, policy_version 74550 (0.0006) [2023-03-07 09:13:14,330][155452] Updated weights for policy 0, policy_version 74560 (0.0006) [2023-03-07 09:13:15,120][155452] Updated weights for policy 0, policy_version 74570 (0.0008) [2023-03-07 09:13:15,919][155452] Updated weights for policy 0, policy_version 74580 (0.0006) [2023-03-07 09:13:16,709][155452] Updated weights for policy 0, policy_version 74590 (0.0006) [2023-03-07 09:13:17,485][155452] Updated weights for policy 0, policy_version 74600 (0.0005) [2023-03-07 09:13:18,268][155452] Updated weights for policy 0, policy_version 74610 (0.0007) [2023-03-07 09:13:18,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13004.8, 300 sec: 13006.5). Total num frames: 76401664. Throughput: 0: 12999.5. Samples: 76368297. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:13:18,367][155126] Avg episode reward: [(0, '1902.452')] [2023-03-07 09:13:19,057][155452] Updated weights for policy 0, policy_version 74620 (0.0006) [2023-03-07 09:13:19,834][155452] Updated weights for policy 0, policy_version 74630 (0.0006) [2023-03-07 09:13:20,625][155452] Updated weights for policy 0, policy_version 74640 (0.0006) [2023-03-07 09:13:21,419][155452] Updated weights for policy 0, policy_version 74650 (0.0007) [2023-03-07 09:13:22,189][155452] Updated weights for policy 0, policy_version 74660 (0.0006) [2023-03-07 09:13:22,993][155452] Updated weights for policy 0, policy_version 74670 (0.0007) [2023-03-07 09:13:23,367][155126] Fps is (10 sec: 13004.6, 60 sec: 12987.7, 300 sec: 13006.5). Total num frames: 76466176. Throughput: 0: 13005.4. Samples: 76446585. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:13:23,368][155126] Avg episode reward: [(0, '1897.838')] [2023-03-07 09:13:23,772][155452] Updated weights for policy 0, policy_version 74680 (0.0007) [2023-03-07 09:13:24,557][155452] Updated weights for policy 0, policy_version 74690 (0.0006) [2023-03-07 09:13:25,361][155452] Updated weights for policy 0, policy_version 74700 (0.0006) [2023-03-07 09:13:26,141][155452] Updated weights for policy 0, policy_version 74710 (0.0006) [2023-03-07 09:13:26,934][155452] Updated weights for policy 0, policy_version 74720 (0.0006) [2023-03-07 09:13:27,726][155452] Updated weights for policy 0, policy_version 74730 (0.0006) [2023-03-07 09:13:28,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13006.5). Total num frames: 76531712. Throughput: 0: 13009.0. Samples: 76524632. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:13:28,367][155126] Avg episode reward: [(0, '2021.866')] [2023-03-07 09:13:28,372][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000074738_76531712.pth... 
[2023-03-07 09:13:28,404][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000071690_73410560.pth [2023-03-07 09:13:28,498][155452] Updated weights for policy 0, policy_version 74740 (0.0006) [2023-03-07 09:13:29,289][155452] Updated weights for policy 0, policy_version 74750 (0.0006) [2023-03-07 09:13:30,073][155452] Updated weights for policy 0, policy_version 74760 (0.0007) [2023-03-07 09:13:30,841][155452] Updated weights for policy 0, policy_version 74770 (0.0006) [2023-03-07 09:13:31,626][155452] Updated weights for policy 0, policy_version 74780 (0.0006) [2023-03-07 09:13:32,408][155452] Updated weights for policy 0, policy_version 74790 (0.0006) [2023-03-07 09:13:33,191][155452] Updated weights for policy 0, policy_version 74800 (0.0006) [2023-03-07 09:13:33,367][155126] Fps is (10 sec: 13107.4, 60 sec: 13021.9, 300 sec: 13010.0). Total num frames: 76597248. Throughput: 0: 13015.3. Samples: 76563877. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:13:33,368][155126] Avg episode reward: [(0, '1799.352')] [2023-03-07 09:13:33,978][155452] Updated weights for policy 0, policy_version 74810 (0.0006) [2023-03-07 09:13:34,769][155452] Updated weights for policy 0, policy_version 74820 (0.0006) [2023-03-07 09:13:35,544][155452] Updated weights for policy 0, policy_version 74830 (0.0006) [2023-03-07 09:13:36,333][155452] Updated weights for policy 0, policy_version 74840 (0.0006) [2023-03-07 09:13:37,116][155452] Updated weights for policy 0, policy_version 74850 (0.0006) [2023-03-07 09:13:37,896][155452] Updated weights for policy 0, policy_version 74860 (0.0006) [2023-03-07 09:13:38,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13021.9, 300 sec: 13010.0). Total num frames: 76662784. Throughput: 0: 13027.2. Samples: 76642242. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:13:38,367][155126] Avg episode reward: [(0, '1805.795')] [2023-03-07 09:13:38,670][155452] Updated weights for policy 0, policy_version 74870 (0.0006) [2023-03-07 09:13:39,483][155452] Updated weights for policy 0, policy_version 74880 (0.0007) [2023-03-07 09:13:40,266][155452] Updated weights for policy 0, policy_version 74890 (0.0007) [2023-03-07 09:13:41,044][155452] Updated weights for policy 0, policy_version 74900 (0.0006) [2023-03-07 09:13:41,838][155452] Updated weights for policy 0, policy_version 74910 (0.0006) [2023-03-07 09:13:42,614][155452] Updated weights for policy 0, policy_version 74920 (0.0006) [2023-03-07 09:13:43,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13010.0). Total num frames: 76727296. Throughput: 0: 13028.7. Samples: 76720513. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:13:43,367][155126] Avg episode reward: [(0, '2054.853')] [2023-03-07 09:13:43,403][155452] Updated weights for policy 0, policy_version 74930 (0.0006) [2023-03-07 09:13:44,191][155452] Updated weights for policy 0, policy_version 74940 (0.0006) [2023-03-07 09:13:44,970][155452] Updated weights for policy 0, policy_version 74950 (0.0006) [2023-03-07 09:13:45,767][155452] Updated weights for policy 0, policy_version 74960 (0.0006) [2023-03-07 09:13:46,561][155452] Updated weights for policy 0, policy_version 74970 (0.0006) [2023-03-07 09:13:47,342][155452] Updated weights for policy 0, policy_version 74980 (0.0006) [2023-03-07 09:13:48,154][155452] Updated weights for policy 0, policy_version 74990 (0.0007) [2023-03-07 09:13:48,367][155126] Fps is (10 sec: 12902.3, 60 sec: 13021.9, 300 sec: 13006.5). Total num frames: 76791808. Throughput: 0: 13019.6. Samples: 76759234. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:13:48,367][155126] Avg episode reward: [(0, '2143.998')] [2023-03-07 09:13:48,930][155452] Updated weights for policy 0, policy_version 75000 (0.0006) [2023-03-07 09:13:49,733][155452] Updated weights for policy 0, policy_version 75010 (0.0006) [2023-03-07 09:13:50,503][155452] Updated weights for policy 0, policy_version 75020 (0.0006) [2023-03-07 09:13:51,306][155452] Updated weights for policy 0, policy_version 75030 (0.0006) [2023-03-07 09:13:52,107][155452] Updated weights for policy 0, policy_version 75040 (0.0006) [2023-03-07 09:13:52,895][155452] Updated weights for policy 0, policy_version 75050 (0.0006) [2023-03-07 09:13:53,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13010.0). Total num frames: 76857344. Throughput: 0: 13009.2. Samples: 76837040. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:13:53,367][155126] Avg episode reward: [(0, '2064.945')] [2023-03-07 09:13:53,695][155452] Updated weights for policy 0, policy_version 75060 (0.0006) [2023-03-07 09:13:54,500][155452] Updated weights for policy 0, policy_version 75070 (0.0007) [2023-03-07 09:13:55,266][155452] Updated weights for policy 0, policy_version 75080 (0.0006) [2023-03-07 09:13:56,048][155452] Updated weights for policy 0, policy_version 75090 (0.0006) [2023-03-07 09:13:56,834][155452] Updated weights for policy 0, policy_version 75100 (0.0007) [2023-03-07 09:13:57,638][155452] Updated weights for policy 0, policy_version 75110 (0.0006) [2023-03-07 09:13:58,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13006.5). Total num frames: 76921856. Throughput: 0: 13007.2. Samples: 76914836. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:13:58,367][155126] Avg episode reward: [(0, '1908.045')] [2023-03-07 09:13:58,433][155452] Updated weights for policy 0, policy_version 75120 (0.0006) [2023-03-07 09:13:59,231][155452] Updated weights for policy 0, policy_version 75130 (0.0006) [2023-03-07 09:14:00,029][155452] Updated weights for policy 0, policy_version 75140 (0.0006) [2023-03-07 09:14:00,809][155452] Updated weights for policy 0, policy_version 75150 (0.0006) [2023-03-07 09:14:01,599][155452] Updated weights for policy 0, policy_version 75160 (0.0006) [2023-03-07 09:14:02,379][155452] Updated weights for policy 0, policy_version 75170 (0.0006) [2023-03-07 09:14:03,173][155452] Updated weights for policy 0, policy_version 75180 (0.0006) [2023-03-07 09:14:03,367][155126] Fps is (10 sec: 12902.2, 60 sec: 13004.8, 300 sec: 13003.1). Total num frames: 76986368. Throughput: 0: 13001.8. Samples: 76953379. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:14:03,367][155126] Avg episode reward: [(0, '2055.660')] [2023-03-07 09:14:03,948][155401] KL-divergence is very high: 137.7126 [2023-03-07 09:14:03,955][155452] Updated weights for policy 0, policy_version 75190 (0.0006) [2023-03-07 09:14:04,747][155452] Updated weights for policy 0, policy_version 75200 (0.0006) [2023-03-07 09:14:05,530][155452] Updated weights for policy 0, policy_version 75210 (0.0006) [2023-03-07 09:14:06,319][155452] Updated weights for policy 0, policy_version 75220 (0.0006) [2023-03-07 09:14:07,114][155452] Updated weights for policy 0, policy_version 75230 (0.0006) [2023-03-07 09:14:07,890][155452] Updated weights for policy 0, policy_version 75240 (0.0006) [2023-03-07 09:14:08,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13006.5). Total num frames: 77051904. Throughput: 0: 12992.4. Samples: 77031242. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:14:08,367][155126] Avg episode reward: [(0, '2000.269')] [2023-03-07 09:14:08,671][155452] Updated weights for policy 0, policy_version 75250 (0.0006) [2023-03-07 09:14:09,460][155452] Updated weights for policy 0, policy_version 75260 (0.0006) [2023-03-07 09:14:10,247][155452] Updated weights for policy 0, policy_version 75270 (0.0006) [2023-03-07 09:14:11,033][155452] Updated weights for policy 0, policy_version 75280 (0.0006) [2023-03-07 09:14:11,822][155452] Updated weights for policy 0, policy_version 75290 (0.0006) [2023-03-07 09:14:12,610][155452] Updated weights for policy 0, policy_version 75300 (0.0006) [2023-03-07 09:14:12,858][155401] KL-divergence is very high: 167.7931 [2023-03-07 09:14:13,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13004.8, 300 sec: 13003.1). Total num frames: 77116416. Throughput: 0: 12996.2. Samples: 77109460. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:14:13,367][155126] Avg episode reward: [(0, '2008.097')] [2023-03-07 09:14:13,413][155452] Updated weights for policy 0, policy_version 75310 (0.0007) [2023-03-07 09:14:14,200][155452] Updated weights for policy 0, policy_version 75320 (0.0006) [2023-03-07 09:14:14,995][155452] Updated weights for policy 0, policy_version 75330 (0.0006) [2023-03-07 09:14:15,794][155452] Updated weights for policy 0, policy_version 75340 (0.0006) [2023-03-07 09:14:16,582][155452] Updated weights for policy 0, policy_version 75350 (0.0006) [2023-03-07 09:14:17,365][155452] Updated weights for policy 0, policy_version 75360 (0.0005) [2023-03-07 09:14:18,166][155452] Updated weights for policy 0, policy_version 75370 (0.0007) [2023-03-07 09:14:18,367][155126] Fps is (10 sec: 12902.5, 60 sec: 12987.7, 300 sec: 12999.6). Total num frames: 77180928. Throughput: 0: 12985.7. Samples: 77148233. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:14:18,367][155126] Avg episode reward: [(0, '1994.284')] [2023-03-07 09:14:18,967][155452] Updated weights for policy 0, policy_version 75380 (0.0006) [2023-03-07 09:14:19,748][155452] Updated weights for policy 0, policy_version 75390 (0.0007) [2023-03-07 09:14:20,529][155452] Updated weights for policy 0, policy_version 75400 (0.0006) [2023-03-07 09:14:21,320][155452] Updated weights for policy 0, policy_version 75410 (0.0006) [2023-03-07 09:14:22,107][155452] Updated weights for policy 0, policy_version 75420 (0.0006) [2023-03-07 09:14:22,897][155452] Updated weights for policy 0, policy_version 75430 (0.0006) [2023-03-07 09:14:23,367][155126] Fps is (10 sec: 12902.4, 60 sec: 12987.8, 300 sec: 12999.6). Total num frames: 77245440. Throughput: 0: 12972.8. Samples: 77226019. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:14:23,367][155126] Avg episode reward: [(0, '1837.423')] [2023-03-07 09:14:23,674][155452] Updated weights for policy 0, policy_version 75440 (0.0006) [2023-03-07 09:14:24,462][155452] Updated weights for policy 0, policy_version 75450 (0.0006) [2023-03-07 09:14:25,241][155452] Updated weights for policy 0, policy_version 75460 (0.0006) [2023-03-07 09:14:26,000][155452] Updated weights for policy 0, policy_version 75470 (0.0006) [2023-03-07 09:14:26,802][155452] Updated weights for policy 0, policy_version 75480 (0.0006) [2023-03-07 09:14:27,588][155452] Updated weights for policy 0, policy_version 75490 (0.0006) [2023-03-07 09:14:28,356][155452] Updated weights for policy 0, policy_version 75500 (0.0006) [2023-03-07 09:14:28,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13004.8, 300 sec: 13003.1). Total num frames: 77312000. Throughput: 0: 12979.3. Samples: 77304582. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:14:28,367][155126] Avg episode reward: [(0, '1756.890')] [2023-03-07 09:14:29,157][155452] Updated weights for policy 0, policy_version 75510 (0.0006) [2023-03-07 09:14:29,946][155452] Updated weights for policy 0, policy_version 75520 (0.0007) [2023-03-07 09:14:30,744][155452] Updated weights for policy 0, policy_version 75530 (0.0006) [2023-03-07 09:14:31,522][155452] Updated weights for policy 0, policy_version 75540 (0.0006) [2023-03-07 09:14:32,319][155452] Updated weights for policy 0, policy_version 75550 (0.0006) [2023-03-07 09:14:33,106][155452] Updated weights for policy 0, policy_version 75560 (0.0007) [2023-03-07 09:14:33,367][155126] Fps is (10 sec: 13107.2, 60 sec: 12987.7, 300 sec: 13003.1). Total num frames: 77376512. Throughput: 0: 12979.9. Samples: 77343330. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:14:33,368][155126] Avg episode reward: [(0, '1875.101')] [2023-03-07 09:14:33,894][155452] Updated weights for policy 0, policy_version 75570 (0.0006) [2023-03-07 09:14:34,688][155452] Updated weights for policy 0, policy_version 75580 (0.0007) [2023-03-07 09:14:35,472][155452] Updated weights for policy 0, policy_version 75590 (0.0006) [2023-03-07 09:14:36,260][155452] Updated weights for policy 0, policy_version 75600 (0.0005) [2023-03-07 09:14:37,046][155452] Updated weights for policy 0, policy_version 75610 (0.0005) [2023-03-07 09:14:37,833][155452] Updated weights for policy 0, policy_version 75620 (0.0007) [2023-03-07 09:14:38,367][155126] Fps is (10 sec: 12902.4, 60 sec: 12970.6, 300 sec: 12999.6). Total num frames: 77441024. Throughput: 0: 12984.2. Samples: 77421331. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:14:38,367][155126] Avg episode reward: [(0, '1803.984')] [2023-03-07 09:14:38,620][155452] Updated weights for policy 0, policy_version 75630 (0.0006) [2023-03-07 09:14:39,397][155452] Updated weights for policy 0, policy_version 75640 (0.0006) [2023-03-07 09:14:40,189][155452] Updated weights for policy 0, policy_version 75650 (0.0006) [2023-03-07 09:14:40,969][155452] Updated weights for policy 0, policy_version 75660 (0.0006) [2023-03-07 09:14:41,754][155452] Updated weights for policy 0, policy_version 75670 (0.0007) [2023-03-07 09:14:42,530][155452] Updated weights for policy 0, policy_version 75680 (0.0006) [2023-03-07 09:14:43,322][155452] Updated weights for policy 0, policy_version 75690 (0.0005) [2023-03-07 09:14:43,367][155126] Fps is (10 sec: 13004.8, 60 sec: 12987.7, 300 sec: 13003.1). Total num frames: 77506560. Throughput: 0: 12997.5. Samples: 77499723. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:14:43,367][155126] Avg episode reward: [(0, '1674.221')] [2023-03-07 09:14:44,106][155452] Updated weights for policy 0, policy_version 75700 (0.0005) [2023-03-07 09:14:44,905][155452] Updated weights for policy 0, policy_version 75710 (0.0007) [2023-03-07 09:14:45,678][155452] Updated weights for policy 0, policy_version 75720 (0.0007) [2023-03-07 09:14:46,478][155452] Updated weights for policy 0, policy_version 75730 (0.0007) [2023-03-07 09:14:47,262][155452] Updated weights for policy 0, policy_version 75740 (0.0006) [2023-03-07 09:14:48,046][155452] Updated weights for policy 0, policy_version 75750 (0.0006) [2023-03-07 09:14:48,367][155126] Fps is (10 sec: 13004.9, 60 sec: 12987.7, 300 sec: 12999.6). Total num frames: 77571072. Throughput: 0: 13004.9. Samples: 77538600. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:14:48,367][155126] Avg episode reward: [(0, '1882.378')] [2023-03-07 09:14:48,862][155452] Updated weights for policy 0, policy_version 75760 (0.0006) [2023-03-07 09:14:49,648][155452] Updated weights for policy 0, policy_version 75770 (0.0006) [2023-03-07 09:14:50,434][155452] Updated weights for policy 0, policy_version 75780 (0.0006) [2023-03-07 09:14:51,211][155452] Updated weights for policy 0, policy_version 75790 (0.0006) [2023-03-07 09:14:52,018][155452] Updated weights for policy 0, policy_version 75800 (0.0006) [2023-03-07 09:14:52,786][155452] Updated weights for policy 0, policy_version 75810 (0.0007) [2023-03-07 09:14:53,367][155126] Fps is (10 sec: 13004.7, 60 sec: 12987.7, 300 sec: 12999.6). Total num frames: 77636608. Throughput: 0: 13002.8. Samples: 77616368. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:14:53,367][155126] Avg episode reward: [(0, '1786.550')] [2023-03-07 09:14:53,589][155452] Updated weights for policy 0, policy_version 75820 (0.0006) [2023-03-07 09:14:54,366][155452] Updated weights for policy 0, policy_version 75830 (0.0006) [2023-03-07 09:14:55,167][155452] Updated weights for policy 0, policy_version 75840 (0.0007) [2023-03-07 09:14:55,957][155452] Updated weights for policy 0, policy_version 75850 (0.0007) [2023-03-07 09:14:56,735][155452] Updated weights for policy 0, policy_version 75860 (0.0005) [2023-03-07 09:14:57,517][155452] Updated weights for policy 0, policy_version 75870 (0.0006) [2023-03-07 09:14:58,313][155452] Updated weights for policy 0, policy_version 75880 (0.0007) [2023-03-07 09:14:58,367][155126] Fps is (10 sec: 13004.5, 60 sec: 12987.7, 300 sec: 12996.1). Total num frames: 77701120. Throughput: 0: 12997.2. Samples: 77694336. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:14:58,368][155126] Avg episode reward: [(0, '1959.078')] [2023-03-07 09:14:59,093][155452] Updated weights for policy 0, policy_version 75890 (0.0007) [2023-03-07 09:14:59,881][155452] Updated weights for policy 0, policy_version 75900 (0.0007) [2023-03-07 09:15:00,682][155452] Updated weights for policy 0, policy_version 75910 (0.0006) [2023-03-07 09:15:01,477][155452] Updated weights for policy 0, policy_version 75920 (0.0006) [2023-03-07 09:15:02,266][155452] Updated weights for policy 0, policy_version 75930 (0.0006) [2023-03-07 09:15:03,047][155452] Updated weights for policy 0, policy_version 75940 (0.0006) [2023-03-07 09:15:03,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 12999.6). Total num frames: 77766656. Throughput: 0: 12999.2. Samples: 77733198. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:15:03,367][155126] Avg episode reward: [(0, '1759.464')] [2023-03-07 09:15:03,839][155452] Updated weights for policy 0, policy_version 75950 (0.0006) [2023-03-07 09:15:04,635][155452] Updated weights for policy 0, policy_version 75960 (0.0007) [2023-03-07 09:15:05,402][155452] Updated weights for policy 0, policy_version 75970 (0.0006) [2023-03-07 09:15:06,199][155452] Updated weights for policy 0, policy_version 75980 (0.0005) [2023-03-07 09:15:06,987][155452] Updated weights for policy 0, policy_version 75990 (0.0006) [2023-03-07 09:15:07,761][155452] Updated weights for policy 0, policy_version 76000 (0.0007) [2023-03-07 09:15:08,367][155126] Fps is (10 sec: 13005.0, 60 sec: 12987.7, 300 sec: 12999.6). Total num frames: 77831168. Throughput: 0: 13006.7. Samples: 77811323. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:15:08,367][155126] Avg episode reward: [(0, '1872.166')] [2023-03-07 09:15:08,540][155452] Updated weights for policy 0, policy_version 76010 (0.0006) [2023-03-07 09:15:09,345][155452] Updated weights for policy 0, policy_version 76020 (0.0006) [2023-03-07 09:15:10,122][155452] Updated weights for policy 0, policy_version 76030 (0.0006) [2023-03-07 09:15:10,913][155452] Updated weights for policy 0, policy_version 76040 (0.0006) [2023-03-07 09:15:11,699][155452] Updated weights for policy 0, policy_version 76050 (0.0006) [2023-03-07 09:15:12,498][155452] Updated weights for policy 0, policy_version 76060 (0.0006) [2023-03-07 09:15:13,274][155452] Updated weights for policy 0, policy_version 76070 (0.0007) [2023-03-07 09:15:13,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 12999.6). Total num frames: 77896704. Throughput: 0: 12996.9. Samples: 77889444. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:15:13,368][155126] Avg episode reward: [(0, '2033.124')] [2023-03-07 09:15:14,055][155452] Updated weights for policy 0, policy_version 76080 (0.0007) [2023-03-07 09:15:14,859][155452] Updated weights for policy 0, policy_version 76090 (0.0007) [2023-03-07 09:15:15,623][155452] Updated weights for policy 0, policy_version 76100 (0.0006) [2023-03-07 09:15:16,420][155452] Updated weights for policy 0, policy_version 76110 (0.0006) [2023-03-07 09:15:17,204][155452] Updated weights for policy 0, policy_version 76120 (0.0006) [2023-03-07 09:15:18,005][155452] Updated weights for policy 0, policy_version 76130 (0.0006) [2023-03-07 09:15:18,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13004.8, 300 sec: 12999.6). Total num frames: 77961216. Throughput: 0: 13002.3. Samples: 77928433. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:15:18,368][155126] Avg episode reward: [(0, '2042.276')] [2023-03-07 09:15:18,781][155452] Updated weights for policy 0, policy_version 76140 (0.0006) [2023-03-07 09:15:19,573][155452] Updated weights for policy 0, policy_version 76150 (0.0006) [2023-03-07 09:15:20,365][155452] Updated weights for policy 0, policy_version 76160 (0.0006) [2023-03-07 09:15:21,164][155452] Updated weights for policy 0, policy_version 76170 (0.0007) [2023-03-07 09:15:21,934][155452] Updated weights for policy 0, policy_version 76180 (0.0006) [2023-03-07 09:15:22,737][155452] Updated weights for policy 0, policy_version 76190 (0.0007) [2023-03-07 09:15:23,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13003.1). Total num frames: 78026752. Throughput: 0: 13003.9. Samples: 78006506. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:15:23,367][155126] Avg episode reward: [(0, '2119.873')] [2023-03-07 09:15:23,511][155452] Updated weights for policy 0, policy_version 76200 (0.0006) [2023-03-07 09:15:24,285][155452] Updated weights for policy 0, policy_version 76210 (0.0006) [2023-03-07 09:15:25,065][155452] Updated weights for policy 0, policy_version 76220 (0.0007) [2023-03-07 09:15:25,856][155452] Updated weights for policy 0, policy_version 76230 (0.0006) [2023-03-07 09:15:26,642][155452] Updated weights for policy 0, policy_version 76240 (0.0006) [2023-03-07 09:15:27,401][155452] Updated weights for policy 0, policy_version 76250 (0.0005) [2023-03-07 09:15:28,203][155452] Updated weights for policy 0, policy_version 76260 (0.0006) [2023-03-07 09:15:28,367][155126] Fps is (10 sec: 13107.4, 60 sec: 13004.8, 300 sec: 13003.1). Total num frames: 78092288. Throughput: 0: 13006.8. Samples: 78085031. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:15:28,367][155126] Avg episode reward: [(0, '2257.765')] [2023-03-07 09:15:28,373][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000076262_78092288.pth... [2023-03-07 09:15:28,404][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000073215_74972160.pth [2023-03-07 09:15:28,980][155452] Updated weights for policy 0, policy_version 76270 (0.0006) [2023-03-07 09:15:29,769][155452] Updated weights for policy 0, policy_version 76280 (0.0007) [2023-03-07 09:15:30,551][155452] Updated weights for policy 0, policy_version 76290 (0.0006) [2023-03-07 09:15:31,337][155452] Updated weights for policy 0, policy_version 76300 (0.0006) [2023-03-07 09:15:31,424][155401] KL-divergence is very high: 223.7450 [2023-03-07 09:15:32,114][155452] Updated weights for policy 0, policy_version 76310 (0.0006) [2023-03-07 09:15:32,899][155452] Updated weights for policy 0, policy_version 76320 (0.0006) [2023-03-07 09:15:33,367][155126] Fps is (10 sec: 13107.0, 60 sec: 13021.9, 300 sec: 13006.5). Total num frames: 78157824. Throughput: 0: 13013.9. Samples: 78124227. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:15:33,367][155126] Avg episode reward: [(0, '2134.062')] [2023-03-07 09:15:33,685][155452] Updated weights for policy 0, policy_version 76330 (0.0006) [2023-03-07 09:15:34,497][155452] Updated weights for policy 0, policy_version 76340 (0.0007) [2023-03-07 09:15:35,270][155452] Updated weights for policy 0, policy_version 76350 (0.0006) [2023-03-07 09:15:36,070][155452] Updated weights for policy 0, policy_version 76360 (0.0006) [2023-03-07 09:15:36,853][155452] Updated weights for policy 0, policy_version 76370 (0.0006) [2023-03-07 09:15:37,642][155452] Updated weights for policy 0, policy_version 76380 (0.0005) [2023-03-07 09:15:38,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13003.1). Total num frames: 78222336. Throughput: 0: 13019.3. Samples: 78202237. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:15:38,367][155126] Avg episode reward: [(0, '2118.094')] [2023-03-07 09:15:38,429][155452] Updated weights for policy 0, policy_version 76390 (0.0006) [2023-03-07 09:15:39,224][155452] Updated weights for policy 0, policy_version 76400 (0.0006) [2023-03-07 09:15:40,012][155452] Updated weights for policy 0, policy_version 76410 (0.0006) [2023-03-07 09:15:40,809][155452] Updated weights for policy 0, policy_version 76420 (0.0006) [2023-03-07 09:15:41,589][155452] Updated weights for policy 0, policy_version 76430 (0.0007) [2023-03-07 09:15:42,368][155452] Updated weights for policy 0, policy_version 76440 (0.0006) [2023-03-07 09:15:43,160][155452] Updated weights for policy 0, policy_version 76450 (0.0006) [2023-03-07 09:15:43,367][155126] Fps is (10 sec: 12902.5, 60 sec: 13004.8, 300 sec: 13003.1). Total num frames: 78286848. Throughput: 0: 13013.7. Samples: 78279952. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:15:43,367][155126] Avg episode reward: [(0, '2064.869')] [2023-03-07 09:15:43,967][155452] Updated weights for policy 0, policy_version 76460 (0.0006) [2023-03-07 09:15:44,753][155452] Updated weights for policy 0, policy_version 76470 (0.0006) [2023-03-07 09:15:45,538][155452] Updated weights for policy 0, policy_version 76480 (0.0006) [2023-03-07 09:15:46,328][155452] Updated weights for policy 0, policy_version 76490 (0.0006) [2023-03-07 09:15:47,126][155452] Updated weights for policy 0, policy_version 76500 (0.0006) [2023-03-07 09:15:47,901][155452] Updated weights for policy 0, policy_version 76510 (0.0006) [2023-03-07 09:15:48,367][155126] Fps is (10 sec: 12902.2, 60 sec: 13004.8, 300 sec: 12999.6). Total num frames: 78351360. Throughput: 0: 13015.3. Samples: 78318888. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 09:15:48,368][155126] Avg episode reward: [(0, '2102.331')] [2023-03-07 09:15:48,679][155452] Updated weights for policy 0, policy_version 76520 (0.0006) [2023-03-07 09:15:49,479][155452] Updated weights for policy 0, policy_version 76530 (0.0006) [2023-03-07 09:15:50,255][155452] Updated weights for policy 0, policy_version 76540 (0.0006) [2023-03-07 09:15:51,047][155452] Updated weights for policy 0, policy_version 76550 (0.0007) [2023-03-07 09:15:51,833][155452] Updated weights for policy 0, policy_version 76560 (0.0006) [2023-03-07 09:15:52,615][155452] Updated weights for policy 0, policy_version 76570 (0.0006) [2023-03-07 09:15:53,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 12999.6). Total num frames: 78416896. Throughput: 0: 13012.1. Samples: 78396868. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 09:15:53,367][155126] Avg episode reward: [(0, '2107.170')] [2023-03-07 09:15:53,404][155452] Updated weights for policy 0, policy_version 76580 (0.0006) [2023-03-07 09:15:54,193][155452] Updated weights for policy 0, policy_version 76590 (0.0007) [2023-03-07 09:15:54,983][155452] Updated weights for policy 0, policy_version 76600 (0.0006) [2023-03-07 09:15:55,782][155452] Updated weights for policy 0, policy_version 76610 (0.0006) [2023-03-07 09:15:56,558][155452] Updated weights for policy 0, policy_version 76620 (0.0006) [2023-03-07 09:15:57,346][155452] Updated weights for policy 0, policy_version 76630 (0.0006) [2023-03-07 09:15:58,139][155452] Updated weights for policy 0, policy_version 76640 (0.0006) [2023-03-07 09:15:58,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13004.8, 300 sec: 12996.1). Total num frames: 78481408. Throughput: 0: 13009.5. Samples: 78474872. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 09:15:58,367][155126] Avg episode reward: [(0, '2084.005')] [2023-03-07 09:15:58,920][155452] Updated weights for policy 0, policy_version 76650 (0.0006) [2023-03-07 09:15:59,699][155452] Updated weights for policy 0, policy_version 76660 (0.0006) [2023-03-07 09:16:00,501][155452] Updated weights for policy 0, policy_version 76670 (0.0008) [2023-03-07 09:16:01,269][155452] Updated weights for policy 0, policy_version 76680 (0.0006) [2023-03-07 09:16:02,070][155452] Updated weights for policy 0, policy_version 76690 (0.0006) [2023-03-07 09:16:02,845][155452] Updated weights for policy 0, policy_version 76700 (0.0006) [2023-03-07 09:16:03,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13004.8, 300 sec: 12999.6). Total num frames: 78546944. Throughput: 0: 13011.1. Samples: 78513930. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 09:16:03,378][155126] Avg episode reward: [(0, '2014.625')] [2023-03-07 09:16:03,637][155452] Updated weights for policy 0, policy_version 76710 (0.0006) [2023-03-07 09:16:04,419][155452] Updated weights for policy 0, policy_version 76720 (0.0006) [2023-03-07 09:16:05,213][155452] Updated weights for policy 0, policy_version 76730 (0.0007) [2023-03-07 09:16:06,002][155452] Updated weights for policy 0, policy_version 76740 (0.0006) [2023-03-07 09:16:06,798][155452] Updated weights for policy 0, policy_version 76750 (0.0008) [2023-03-07 09:16:07,578][155452] Updated weights for policy 0, policy_version 76760 (0.0006) [2023-03-07 09:16:08,353][155452] Updated weights for policy 0, policy_version 76770 (0.0006) [2023-03-07 09:16:08,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13021.9, 300 sec: 13003.1). Total num frames: 78612480. Throughput: 0: 13011.4. Samples: 78592018. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 09:16:08,367][155126] Avg episode reward: [(0, '1965.106')] [2023-03-07 09:16:09,137][155452] Updated weights for policy 0, policy_version 76780 (0.0006) [2023-03-07 09:16:09,920][155452] Updated weights for policy 0, policy_version 76790 (0.0006) [2023-03-07 09:16:10,706][155452] Updated weights for policy 0, policy_version 76800 (0.0006) [2023-03-07 09:16:11,510][155452] Updated weights for policy 0, policy_version 76810 (0.0006) [2023-03-07 09:16:12,293][155452] Updated weights for policy 0, policy_version 76820 (0.0006) [2023-03-07 09:16:13,079][155452] Updated weights for policy 0, policy_version 76830 (0.0006) [2023-03-07 09:16:13,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 12999.6). Total num frames: 78676992. Throughput: 0: 13002.3. Samples: 78670136. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 09:16:13,367][155126] Avg episode reward: [(0, '1865.103')] [2023-03-07 09:16:13,865][155452] Updated weights for policy 0, policy_version 76840 (0.0006) [2023-03-07 09:16:14,653][155452] Updated weights for policy 0, policy_version 76850 (0.0006) [2023-03-07 09:16:15,433][155452] Updated weights for policy 0, policy_version 76860 (0.0007) [2023-03-07 09:16:16,223][155452] Updated weights for policy 0, policy_version 76870 (0.0007) [2023-03-07 09:16:16,986][155452] Updated weights for policy 0, policy_version 76880 (0.0007) [2023-03-07 09:16:17,794][155452] Updated weights for policy 0, policy_version 76890 (0.0007) [2023-03-07 09:16:18,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 12999.6). Total num frames: 78742528. Throughput: 0: 12999.7. Samples: 78709214. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 09:16:18,367][155126] Avg episode reward: [(0, '2078.125')] [2023-03-07 09:16:18,592][155452] Updated weights for policy 0, policy_version 76900 (0.0006) [2023-03-07 09:16:19,390][155452] Updated weights for policy 0, policy_version 76910 (0.0007) [2023-03-07 09:16:20,171][155452] Updated weights for policy 0, policy_version 76920 (0.0006) [2023-03-07 09:16:20,945][155452] Updated weights for policy 0, policy_version 76930 (0.0006) [2023-03-07 09:16:21,730][155452] Updated weights for policy 0, policy_version 76940 (0.0006) [2023-03-07 09:16:22,503][155452] Updated weights for policy 0, policy_version 76950 (0.0006) [2023-03-07 09:16:23,277][155452] Updated weights for policy 0, policy_version 76960 (0.0006) [2023-03-07 09:16:23,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 12999.6). Total num frames: 78807040. Throughput: 0: 13003.1. Samples: 78787376. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 09:16:23,367][155126] Avg episode reward: [(0, '2047.871')] [2023-03-07 09:16:24,092][155452] Updated weights for policy 0, policy_version 76970 (0.0006) [2023-03-07 09:16:24,870][155452] Updated weights for policy 0, policy_version 76980 (0.0006) [2023-03-07 09:16:25,672][155452] Updated weights for policy 0, policy_version 76990 (0.0006) [2023-03-07 09:16:26,462][155452] Updated weights for policy 0, policy_version 77000 (0.0006) [2023-03-07 09:16:27,230][155452] Updated weights for policy 0, policy_version 77010 (0.0007) [2023-03-07 09:16:28,010][155452] Updated weights for policy 0, policy_version 77020 (0.0006) [2023-03-07 09:16:28,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13003.1). Total num frames: 78872576. Throughput: 0: 13015.8. Samples: 78865663. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 09:16:28,367][155126] Avg episode reward: [(0, '2045.356')] [2023-03-07 09:16:28,810][155452] Updated weights for policy 0, policy_version 77030 (0.0006) [2023-03-07 09:16:29,588][155452] Updated weights for policy 0, policy_version 77040 (0.0006) [2023-03-07 09:16:30,376][155452] Updated weights for policy 0, policy_version 77050 (0.0007) [2023-03-07 09:16:31,156][155452] Updated weights for policy 0, policy_version 77060 (0.0007) [2023-03-07 09:16:31,949][155452] Updated weights for policy 0, policy_version 77070 (0.0006) [2023-03-07 09:16:32,726][155452] Updated weights for policy 0, policy_version 77080 (0.0007) [2023-03-07 09:16:33,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13004.8, 300 sec: 13003.1). Total num frames: 78938112. Throughput: 0: 13018.0. Samples: 78904696. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:16:33,367][155126] Avg episode reward: [(0, '2016.238')] [2023-03-07 09:16:33,512][155452] Updated weights for policy 0, policy_version 77090 (0.0006) [2023-03-07 09:16:34,296][155452] Updated weights for policy 0, policy_version 77100 (0.0006) [2023-03-07 09:16:35,090][155452] Updated weights for policy 0, policy_version 77110 (0.0006) [2023-03-07 09:16:35,896][155452] Updated weights for policy 0, policy_version 77120 (0.0007) [2023-03-07 09:16:36,675][155452] Updated weights for policy 0, policy_version 77130 (0.0006) [2023-03-07 09:16:37,464][155452] Updated weights for policy 0, policy_version 77140 (0.0006) [2023-03-07 09:16:38,243][155452] Updated weights for policy 0, policy_version 77150 (0.0006) [2023-03-07 09:16:38,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13004.8, 300 sec: 13003.1). Total num frames: 79002624. Throughput: 0: 13016.8. Samples: 78982626. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:16:38,368][155126] Avg episode reward: [(0, '2052.583')] [2023-03-07 09:16:39,034][155452] Updated weights for policy 0, policy_version 77160 (0.0006) [2023-03-07 09:16:39,829][155452] Updated weights for policy 0, policy_version 77170 (0.0006) [2023-03-07 09:16:40,637][155452] Updated weights for policy 0, policy_version 77180 (0.0006) [2023-03-07 09:16:41,417][155452] Updated weights for policy 0, policy_version 77190 (0.0006) [2023-03-07 09:16:42,214][155452] Updated weights for policy 0, policy_version 77200 (0.0006) [2023-03-07 09:16:43,016][155452] Updated weights for policy 0, policy_version 77210 (0.0006) [2023-03-07 09:16:43,367][155126] Fps is (10 sec: 12902.5, 60 sec: 13004.8, 300 sec: 12999.6). Total num frames: 79067136. Throughput: 0: 13008.1. Samples: 79060235. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:16:43,367][155126] Avg episode reward: [(0, '2171.104')] [2023-03-07 09:16:43,813][155452] Updated weights for policy 0, policy_version 77220 (0.0007) [2023-03-07 09:16:44,580][155452] Updated weights for policy 0, policy_version 77230 (0.0006) [2023-03-07 09:16:45,359][155452] Updated weights for policy 0, policy_version 77240 (0.0006) [2023-03-07 09:16:46,156][155452] Updated weights for policy 0, policy_version 77250 (0.0006) [2023-03-07 09:16:46,929][155452] Updated weights for policy 0, policy_version 77260 (0.0006) [2023-03-07 09:16:47,727][155452] Updated weights for policy 0, policy_version 77270 (0.0006) [2023-03-07 09:16:48,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13003.1). Total num frames: 79132672. Throughput: 0: 13009.9. Samples: 79099375. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:16:48,368][155126] Avg episode reward: [(0, '1988.898')] [2023-03-07 09:16:48,498][155452] Updated weights for policy 0, policy_version 77280 (0.0005) [2023-03-07 09:16:49,289][155452] Updated weights for policy 0, policy_version 77290 (0.0006) [2023-03-07 09:16:50,068][155452] Updated weights for policy 0, policy_version 77300 (0.0007) [2023-03-07 09:16:50,861][155452] Updated weights for policy 0, policy_version 77310 (0.0006) [2023-03-07 09:16:51,629][155452] Updated weights for policy 0, policy_version 77320 (0.0006) [2023-03-07 09:16:52,414][155452] Updated weights for policy 0, policy_version 77330 (0.0005) [2023-03-07 09:16:53,192][155452] Updated weights for policy 0, policy_version 77340 (0.0006) [2023-03-07 09:16:53,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13021.9, 300 sec: 13003.1). Total num frames: 79198208. Throughput: 0: 13017.3. Samples: 79177798. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:16:53,367][155126] Avg episode reward: [(0, '1993.178')] [2023-03-07 09:16:53,959][155452] Updated weights for policy 0, policy_version 77350 (0.0006) [2023-03-07 09:16:54,765][155452] Updated weights for policy 0, policy_version 77360 (0.0006) [2023-03-07 09:16:55,529][155452] Updated weights for policy 0, policy_version 77370 (0.0007) [2023-03-07 09:16:56,314][155452] Updated weights for policy 0, policy_version 77380 (0.0006) [2023-03-07 09:16:57,097][155452] Updated weights for policy 0, policy_version 77390 (0.0006) [2023-03-07 09:16:57,891][155452] Updated weights for policy 0, policy_version 77400 (0.0006) [2023-03-07 09:16:58,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13003.1). Total num frames: 79263744. Throughput: 0: 13029.5. Samples: 79256465. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:16:58,367][155126] Avg episode reward: [(0, '2053.004')] [2023-03-07 09:16:58,672][155452] Updated weights for policy 0, policy_version 77410 (0.0006) [2023-03-07 09:16:59,445][155452] Updated weights for policy 0, policy_version 77420 (0.0007) [2023-03-07 09:17:00,252][155452] Updated weights for policy 0, policy_version 77430 (0.0006) [2023-03-07 09:17:01,043][155452] Updated weights for policy 0, policy_version 77440 (0.0006) [2023-03-07 09:17:01,835][155452] Updated weights for policy 0, policy_version 77450 (0.0006) [2023-03-07 09:17:02,600][155452] Updated weights for policy 0, policy_version 77460 (0.0006) [2023-03-07 09:17:03,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13003.1). Total num frames: 79328256. Throughput: 0: 13027.8. Samples: 79295466. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:17:03,367][155126] Avg episode reward: [(0, '1906.155')] [2023-03-07 09:17:03,382][155452] Updated weights for policy 0, policy_version 77470 (0.0006) [2023-03-07 09:17:04,167][155452] Updated weights for policy 0, policy_version 77480 (0.0006) [2023-03-07 09:17:04,956][155452] Updated weights for policy 0, policy_version 77490 (0.0006) [2023-03-07 09:17:05,743][155452] Updated weights for policy 0, policy_version 77500 (0.0006) [2023-03-07 09:17:06,518][155452] Updated weights for policy 0, policy_version 77510 (0.0006) [2023-03-07 09:17:07,309][155452] Updated weights for policy 0, policy_version 77520 (0.0006) [2023-03-07 09:17:08,099][155452] Updated weights for policy 0, policy_version 77530 (0.0005) [2023-03-07 09:17:08,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.8, 300 sec: 13006.5). Total num frames: 79393792. Throughput: 0: 13028.9. Samples: 79373678. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:17:08,368][155126] Avg episode reward: [(0, '2049.554')] [2023-03-07 09:17:08,905][155452] Updated weights for policy 0, policy_version 77540 (0.0006) [2023-03-07 09:17:09,670][155452] Updated weights for policy 0, policy_version 77550 (0.0006) [2023-03-07 09:17:10,474][155452] Updated weights for policy 0, policy_version 77560 (0.0006) [2023-03-07 09:17:11,271][155452] Updated weights for policy 0, policy_version 77570 (0.0006) [2023-03-07 09:17:12,053][155452] Updated weights for policy 0, policy_version 77580 (0.0007) [2023-03-07 09:17:12,830][155452] Updated weights for policy 0, policy_version 77590 (0.0006) [2023-03-07 09:17:13,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13021.8, 300 sec: 13006.5). Total num frames: 79458304. Throughput: 0: 13023.6. Samples: 79451726. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:17:13,368][155126] Avg episode reward: [(0, '2144.193')] [2023-03-07 09:17:13,617][155452] Updated weights for policy 0, policy_version 77600 (0.0006) [2023-03-07 09:17:14,397][155452] Updated weights for policy 0, policy_version 77610 (0.0006) [2023-03-07 09:17:15,182][155452] Updated weights for policy 0, policy_version 77620 (0.0006) [2023-03-07 09:17:15,975][155452] Updated weights for policy 0, policy_version 77630 (0.0006) [2023-03-07 09:17:16,757][155452] Updated weights for policy 0, policy_version 77640 (0.0006) [2023-03-07 09:17:17,559][155452] Updated weights for policy 0, policy_version 77650 (0.0006) [2023-03-07 09:17:18,347][155452] Updated weights for policy 0, policy_version 77660 (0.0006) [2023-03-07 09:17:18,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13006.5). Total num frames: 79523840. Throughput: 0: 13025.4. Samples: 79490837. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:17:18,367][155126] Avg episode reward: [(0, '2080.149')] [2023-03-07 09:17:19,141][155452] Updated weights for policy 0, policy_version 77670 (0.0006) [2023-03-07 09:17:19,918][155452] Updated weights for policy 0, policy_version 77680 (0.0007) [2023-03-07 09:17:20,717][155452] Updated weights for policy 0, policy_version 77690 (0.0006) [2023-03-07 09:17:21,489][155452] Updated weights for policy 0, policy_version 77700 (0.0006) [2023-03-07 09:17:22,259][155452] Updated weights for policy 0, policy_version 77710 (0.0006) [2023-03-07 09:17:23,056][155452] Updated weights for policy 0, policy_version 77720 (0.0006) [2023-03-07 09:17:23,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13021.9, 300 sec: 13006.5). Total num frames: 79588352. Throughput: 0: 13030.0. Samples: 79568976. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:17:23,367][155126] Avg episode reward: [(0, '2222.098')] [2023-03-07 09:17:23,842][155452] Updated weights for policy 0, policy_version 77730 (0.0006) [2023-03-07 09:17:24,625][155452] Updated weights for policy 0, policy_version 77740 (0.0006) [2023-03-07 09:17:25,436][155452] Updated weights for policy 0, policy_version 77750 (0.0006) [2023-03-07 09:17:26,209][155452] Updated weights for policy 0, policy_version 77760 (0.0006) [2023-03-07 09:17:26,997][155452] Updated weights for policy 0, policy_version 77770 (0.0007) [2023-03-07 09:17:27,786][155452] Updated weights for policy 0, policy_version 77780 (0.0005) [2023-03-07 09:17:28,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.8, 300 sec: 13010.0). Total num frames: 79653888. Throughput: 0: 13038.5. Samples: 79646967. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:17:28,367][155126] Avg episode reward: [(0, '2172.286')] [2023-03-07 09:17:28,371][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000077787_79653888.pth... 
[2023-03-07 09:17:28,403][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000074738_76531712.pth [2023-03-07 09:17:28,557][155452] Updated weights for policy 0, policy_version 77790 (0.0005) [2023-03-07 09:17:29,369][155452] Updated weights for policy 0, policy_version 77800 (0.0007) [2023-03-07 09:17:30,125][155452] Updated weights for policy 0, policy_version 77810 (0.0006) [2023-03-07 09:17:30,914][155452] Updated weights for policy 0, policy_version 77820 (0.0006) [2023-03-07 09:17:31,693][155452] Updated weights for policy 0, policy_version 77830 (0.0006) [2023-03-07 09:17:32,477][155452] Updated weights for policy 0, policy_version 77840 (0.0006) [2023-03-07 09:17:33,299][155452] Updated weights for policy 0, policy_version 77850 (0.0007) [2023-03-07 09:17:33,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13021.9, 300 sec: 13010.0). Total num frames: 79719424. Throughput: 0: 13037.9. Samples: 79686078. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:17:33,367][155126] Avg episode reward: [(0, '2118.585')] [2023-03-07 09:17:34,062][155452] Updated weights for policy 0, policy_version 77860 (0.0006) [2023-03-07 09:17:34,861][155452] Updated weights for policy 0, policy_version 77870 (0.0006) [2023-03-07 09:17:35,652][155452] Updated weights for policy 0, policy_version 77880 (0.0006) [2023-03-07 09:17:36,430][155452] Updated weights for policy 0, policy_version 77890 (0.0006) [2023-03-07 09:17:37,213][155452] Updated weights for policy 0, policy_version 77900 (0.0005) [2023-03-07 09:17:38,007][155452] Updated weights for policy 0, policy_version 77910 (0.0006) [2023-03-07 09:17:38,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13021.9, 300 sec: 13010.0). Total num frames: 79783936. Throughput: 0: 13028.5. Samples: 79764081. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:17:38,378][155126] Avg episode reward: [(0, '2215.577')] [2023-03-07 09:17:38,790][155452] Updated weights for policy 0, policy_version 77920 (0.0007) [2023-03-07 09:17:39,562][155452] Updated weights for policy 0, policy_version 77930 (0.0007) [2023-03-07 09:17:40,343][155452] Updated weights for policy 0, policy_version 77940 (0.0006) [2023-03-07 09:17:41,131][155452] Updated weights for policy 0, policy_version 77950 (0.0006) [2023-03-07 09:17:41,914][155452] Updated weights for policy 0, policy_version 77960 (0.0007) [2023-03-07 09:17:42,697][155452] Updated weights for policy 0, policy_version 77970 (0.0006) [2023-03-07 09:17:43,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13038.9, 300 sec: 13013.5). Total num frames: 79849472. Throughput: 0: 13024.1. Samples: 79842550. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:17:43,378][155126] Avg episode reward: [(0, '2186.198')] [2023-03-07 09:17:43,472][155452] Updated weights for policy 0, policy_version 77980 (0.0006) [2023-03-07 09:17:44,250][155452] Updated weights for policy 0, policy_version 77990 (0.0006) [2023-03-07 09:17:45,065][155452] Updated weights for policy 0, policy_version 78000 (0.0006) [2023-03-07 09:17:45,842][155452] Updated weights for policy 0, policy_version 78010 (0.0006) [2023-03-07 09:17:46,621][155452] Updated weights for policy 0, policy_version 78020 (0.0006) [2023-03-07 09:17:47,418][155452] Updated weights for policy 0, policy_version 78030 (0.0006) [2023-03-07 09:17:48,193][155452] Updated weights for policy 0, policy_version 78040 (0.0006) [2023-03-07 09:17:48,367][155126] Fps is (10 sec: 13107.0, 60 sec: 13038.9, 300 sec: 13013.5). Total num frames: 79915008. Throughput: 0: 13024.0. Samples: 79881548. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:17:48,367][155126] Avg episode reward: [(0, '2300.753')] [2023-03-07 09:17:48,981][155452] Updated weights for policy 0, policy_version 78050 (0.0006) [2023-03-07 09:17:49,763][155452] Updated weights for policy 0, policy_version 78060 (0.0007) [2023-03-07 09:17:50,541][155452] Updated weights for policy 0, policy_version 78070 (0.0006) [2023-03-07 09:17:51,329][155452] Updated weights for policy 0, policy_version 78080 (0.0007) [2023-03-07 09:17:52,115][155452] Updated weights for policy 0, policy_version 78090 (0.0006) [2023-03-07 09:17:52,897][155452] Updated weights for policy 0, policy_version 78100 (0.0006) [2023-03-07 09:17:53,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13010.0). Total num frames: 79979520. Throughput: 0: 13028.6. Samples: 79959962. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:17:53,367][155126] Avg episode reward: [(0, '2197.335')] [2023-03-07 09:17:53,688][155452] Updated weights for policy 0, policy_version 78110 (0.0006) [2023-03-07 09:17:54,457][155452] Updated weights for policy 0, policy_version 78120 (0.0007) [2023-03-07 09:17:55,249][155452] Updated weights for policy 0, policy_version 78130 (0.0007) [2023-03-07 09:17:56,034][155452] Updated weights for policy 0, policy_version 78140 (0.0006) [2023-03-07 09:17:56,820][155452] Updated weights for policy 0, policy_version 78150 (0.0006) [2023-03-07 09:17:57,602][155452] Updated weights for policy 0, policy_version 78160 (0.0006) [2023-03-07 09:17:58,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13013.5). Total num frames: 80045056. Throughput: 0: 13035.2. Samples: 80038309. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:17:58,370][155452] Updated weights for policy 0, policy_version 78170 (0.0006) [2023-03-07 09:17:58,367][155126] Avg episode reward: [(0, '2009.923')] [2023-03-07 09:17:59,178][155452] Updated weights for policy 0, policy_version 78180 (0.0006) [2023-03-07 09:17:59,965][155452] Updated weights for policy 0, policy_version 78190 (0.0006) [2023-03-07 09:18:00,731][155452] Updated weights for policy 0, policy_version 78200 (0.0006) [2023-03-07 09:18:01,545][155452] Updated weights for policy 0, policy_version 78210 (0.0007) [2023-03-07 09:18:02,306][155452] Updated weights for policy 0, policy_version 78220 (0.0007) [2023-03-07 09:18:03,104][155452] Updated weights for policy 0, policy_version 78230 (0.0005) [2023-03-07 09:18:03,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13013.5). Total num frames: 80110592. Throughput: 0: 13037.4. Samples: 80077519. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:18:03,367][155126] Avg episode reward: [(0, '2056.164')] [2023-03-07 09:18:03,879][155452] Updated weights for policy 0, policy_version 78240 (0.0006) [2023-03-07 09:18:04,677][155452] Updated weights for policy 0, policy_version 78250 (0.0006) [2023-03-07 09:18:05,458][155452] Updated weights for policy 0, policy_version 78260 (0.0006) [2023-03-07 09:18:06,242][155452] Updated weights for policy 0, policy_version 78270 (0.0006) [2023-03-07 09:18:07,023][155452] Updated weights for policy 0, policy_version 78280 (0.0006) [2023-03-07 09:18:07,819][155452] Updated weights for policy 0, policy_version 78290 (0.0006) [2023-03-07 09:18:08,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13016.9). Total num frames: 80176128. Throughput: 0: 13037.0. Samples: 80155644. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:18:08,368][155126] Avg episode reward: [(0, '2039.802')] [2023-03-07 09:18:08,613][155452] Updated weights for policy 0, policy_version 78300 (0.0006) [2023-03-07 09:18:09,400][155452] Updated weights for policy 0, policy_version 78310 (0.0006) [2023-03-07 09:18:10,182][155452] Updated weights for policy 0, policy_version 78320 (0.0007) [2023-03-07 09:18:10,976][155452] Updated weights for policy 0, policy_version 78330 (0.0007) [2023-03-07 09:18:11,767][155452] Updated weights for policy 0, policy_version 78340 (0.0006) [2023-03-07 09:18:12,544][155452] Updated weights for policy 0, policy_version 78350 (0.0006) [2023-03-07 09:18:13,321][155452] Updated weights for policy 0, policy_version 78360 (0.0006) [2023-03-07 09:18:13,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13039.0, 300 sec: 13013.5). Total num frames: 80240640. Throughput: 0: 13038.0. Samples: 80233676. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:18:13,367][155126] Avg episode reward: [(0, '2070.577')] [2023-03-07 09:18:14,105][155452] Updated weights for policy 0, policy_version 78370 (0.0006) [2023-03-07 09:18:14,891][155452] Updated weights for policy 0, policy_version 78380 (0.0006) [2023-03-07 09:18:15,662][155452] Updated weights for policy 0, policy_version 78390 (0.0006) [2023-03-07 09:18:16,459][155452] Updated weights for policy 0, policy_version 78400 (0.0006) [2023-03-07 09:18:17,243][155452] Updated weights for policy 0, policy_version 78410 (0.0007) [2023-03-07 09:18:18,043][155452] Updated weights for policy 0, policy_version 78420 (0.0007) [2023-03-07 09:18:18,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13016.9). Total num frames: 80306176. Throughput: 0: 13039.7. Samples: 80272867. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:18:18,368][155126] Avg episode reward: [(0, '2074.268')] [2023-03-07 09:18:18,824][155452] Updated weights for policy 0, policy_version 78430 (0.0006) [2023-03-07 09:18:19,585][155452] Updated weights for policy 0, policy_version 78440 (0.0006) [2023-03-07 09:18:20,389][155452] Updated weights for policy 0, policy_version 78450 (0.0006) [2023-03-07 09:18:21,162][155452] Updated weights for policy 0, policy_version 78460 (0.0007) [2023-03-07 09:18:21,945][155452] Updated weights for policy 0, policy_version 78470 (0.0006) [2023-03-07 09:18:22,734][155452] Updated weights for policy 0, policy_version 78480 (0.0005) [2023-03-07 09:18:23,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13013.5). Total num frames: 80370688. Throughput: 0: 13046.9. Samples: 80351193. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:18:23,367][155126] Avg episode reward: [(0, '2150.250')] [2023-03-07 09:18:23,540][155452] Updated weights for policy 0, policy_version 78490 (0.0006) [2023-03-07 09:18:24,311][155452] Updated weights for policy 0, policy_version 78500 (0.0006) [2023-03-07 09:18:25,115][155452] Updated weights for policy 0, policy_version 78510 (0.0006) [2023-03-07 09:18:25,911][155452] Updated weights for policy 0, policy_version 78520 (0.0007) [2023-03-07 09:18:26,696][155452] Updated weights for policy 0, policy_version 78530 (0.0007) [2023-03-07 09:18:27,484][155452] Updated weights for policy 0, policy_version 78540 (0.0007) [2023-03-07 09:18:28,282][155452] Updated weights for policy 0, policy_version 78550 (0.0006) [2023-03-07 09:18:28,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13038.9, 300 sec: 13013.5). Total num frames: 80436224. Throughput: 0: 13032.0. Samples: 80428991. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:18:28,367][155126] Avg episode reward: [(0, '2110.942')] [2023-03-07 09:18:29,056][155452] Updated weights for policy 0, policy_version 78560 (0.0006) [2023-03-07 09:18:29,823][155452] Updated weights for policy 0, policy_version 78570 (0.0006) [2023-03-07 09:18:30,626][155452] Updated weights for policy 0, policy_version 78580 (0.0006) [2023-03-07 09:18:31,397][155452] Updated weights for policy 0, policy_version 78590 (0.0006) [2023-03-07 09:18:32,188][155452] Updated weights for policy 0, policy_version 78600 (0.0006) [2023-03-07 09:18:32,963][155452] Updated weights for policy 0, policy_version 78610 (0.0005) [2023-03-07 09:18:33,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13013.5). Total num frames: 80501760. Throughput: 0: 13038.8. Samples: 80468293. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:18:33,367][155126] Avg episode reward: [(0, '2124.006')] [2023-03-07 09:18:33,746][155452] Updated weights for policy 0, policy_version 78620 (0.0006) [2023-03-07 09:18:34,529][155452] Updated weights for policy 0, policy_version 78630 (0.0007) [2023-03-07 09:18:35,343][155452] Updated weights for policy 0, policy_version 78640 (0.0006) [2023-03-07 09:18:36,135][155452] Updated weights for policy 0, policy_version 78650 (0.0006) [2023-03-07 09:18:36,907][155452] Updated weights for policy 0, policy_version 78660 (0.0006) [2023-03-07 09:18:37,715][155452] Updated weights for policy 0, policy_version 78670 (0.0005) [2023-03-07 09:18:38,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13013.5). Total num frames: 80566272. Throughput: 0: 13028.9. Samples: 80546263. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:18:38,367][155126] Avg episode reward: [(0, '2295.502')] [2023-03-07 09:18:38,498][155452] Updated weights for policy 0, policy_version 78680 (0.0006) [2023-03-07 09:18:39,285][155452] Updated weights for policy 0, policy_version 78690 (0.0006) [2023-03-07 09:18:40,081][155452] Updated weights for policy 0, policy_version 78700 (0.0007) [2023-03-07 09:18:40,834][155452] Updated weights for policy 0, policy_version 78710 (0.0006) [2023-03-07 09:18:41,624][155452] Updated weights for policy 0, policy_version 78720 (0.0006) [2023-03-07 09:18:42,431][155452] Updated weights for policy 0, policy_version 78730 (0.0006) [2023-03-07 09:18:43,222][155452] Updated weights for policy 0, policy_version 78740 (0.0006) [2023-03-07 09:18:43,367][155126] Fps is (10 sec: 12902.5, 60 sec: 13021.9, 300 sec: 13013.5). Total num frames: 80630784. Throughput: 0: 13020.6. Samples: 80624235. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:18:43,367][155126] Avg episode reward: [(0, '2266.196')] [2023-03-07 09:18:44,007][155452] Updated weights for policy 0, policy_version 78750 (0.0007) [2023-03-07 09:18:44,795][155452] Updated weights for policy 0, policy_version 78760 (0.0006) [2023-03-07 09:18:45,590][155452] Updated weights for policy 0, policy_version 78770 (0.0006) [2023-03-07 09:18:46,370][155452] Updated weights for policy 0, policy_version 78780 (0.0006) [2023-03-07 09:18:47,151][155452] Updated weights for policy 0, policy_version 78790 (0.0007) [2023-03-07 09:18:47,969][155452] Updated weights for policy 0, policy_version 78800 (0.0007) [2023-03-07 09:18:48,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13013.5). Total num frames: 80696320. Throughput: 0: 13015.0. Samples: 80663194. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:18:48,367][155126] Avg episode reward: [(0, '2219.529')] [2023-03-07 09:18:48,733][155452] Updated weights for policy 0, policy_version 78810 (0.0007) [2023-03-07 09:18:49,522][155452] Updated weights for policy 0, policy_version 78820 (0.0006) [2023-03-07 09:18:50,302][155452] Updated weights for policy 0, policy_version 78830 (0.0006) [2023-03-07 09:18:51,070][155452] Updated weights for policy 0, policy_version 78840 (0.0006) [2023-03-07 09:18:51,853][155452] Updated weights for policy 0, policy_version 78850 (0.0006) [2023-03-07 09:18:52,639][155452] Updated weights for policy 0, policy_version 78860 (0.0006) [2023-03-07 09:18:53,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13017.0). Total num frames: 80761856. Throughput: 0: 13020.3. Samples: 80741558. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:18:53,367][155126] Avg episode reward: [(0, '2275.522')] [2023-03-07 09:18:53,410][155452] Updated weights for policy 0, policy_version 78870 (0.0005) [2023-03-07 09:18:54,197][155452] Updated weights for policy 0, policy_version 78880 (0.0006) [2023-03-07 09:18:54,998][155452] Updated weights for policy 0, policy_version 78890 (0.0006) [2023-03-07 09:18:55,784][155452] Updated weights for policy 0, policy_version 78900 (0.0006) [2023-03-07 09:18:56,558][155452] Updated weights for policy 0, policy_version 78910 (0.0006) [2023-03-07 09:18:57,358][155452] Updated weights for policy 0, policy_version 78920 (0.0006) [2023-03-07 09:18:58,128][155452] Updated weights for policy 0, policy_version 78930 (0.0006) [2023-03-07 09:18:58,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13020.4). Total num frames: 80827392. Throughput: 0: 13028.7. Samples: 80819970. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:18:58,367][155126] Avg episode reward: [(0, '1995.172')] [2023-03-07 09:18:58,899][155452] Updated weights for policy 0, policy_version 78940 (0.0006) [2023-03-07 09:18:59,677][155452] Updated weights for policy 0, policy_version 78950 (0.0008) [2023-03-07 09:19:00,470][155452] Updated weights for policy 0, policy_version 78960 (0.0005) [2023-03-07 09:19:01,228][155452] Updated weights for policy 0, policy_version 78970 (0.0006) [2023-03-07 09:19:02,010][155452] Updated weights for policy 0, policy_version 78980 (0.0006) [2023-03-07 09:19:02,798][155452] Updated weights for policy 0, policy_version 78990 (0.0007) [2023-03-07 09:19:03,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13020.4). Total num frames: 80892928. Throughput: 0: 13033.2. Samples: 80859359. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:19:03,367][155126] Avg episode reward: [(0, '2169.865')] [2023-03-07 09:19:03,570][155452] Updated weights for policy 0, policy_version 79000 (0.0007) [2023-03-07 09:19:04,360][155452] Updated weights for policy 0, policy_version 79010 (0.0006) [2023-03-07 09:19:05,155][155452] Updated weights for policy 0, policy_version 79020 (0.0006) [2023-03-07 09:19:05,927][155452] Updated weights for policy 0, policy_version 79030 (0.0006) [2023-03-07 09:19:06,726][155452] Updated weights for policy 0, policy_version 79040 (0.0007) [2023-03-07 09:19:07,514][155452] Updated weights for policy 0, policy_version 79050 (0.0006) [2023-03-07 09:19:08,291][155452] Updated weights for policy 0, policy_version 79060 (0.0006) [2023-03-07 09:19:08,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 80957440. Throughput: 0: 13037.2. Samples: 80937868. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:19:08,367][155126] Avg episode reward: [(0, '2145.054')] [2023-03-07 09:19:09,089][155452] Updated weights for policy 0, policy_version 79070 (0.0006) [2023-03-07 09:19:09,881][155452] Updated weights for policy 0, policy_version 79080 (0.0005) [2023-03-07 09:19:10,661][155452] Updated weights for policy 0, policy_version 79090 (0.0007) [2023-03-07 09:19:11,434][155452] Updated weights for policy 0, policy_version 79100 (0.0006) [2023-03-07 09:19:12,244][155452] Updated weights for policy 0, policy_version 79110 (0.0006) [2023-03-07 09:19:13,027][155452] Updated weights for policy 0, policy_version 79120 (0.0007) [2023-03-07 09:19:13,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13023.9). Total num frames: 81022976. Throughput: 0: 13040.7. Samples: 81015820. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:19:13,367][155126] Avg episode reward: [(0, '2114.867')] [2023-03-07 09:19:13,823][155452] Updated weights for policy 0, policy_version 79130 (0.0006) [2023-03-07 09:19:14,614][155452] Updated weights for policy 0, policy_version 79140 (0.0007) [2023-03-07 09:19:15,382][155452] Updated weights for policy 0, policy_version 79150 (0.0006) [2023-03-07 09:19:16,188][155452] Updated weights for policy 0, policy_version 79160 (0.0006) [2023-03-07 09:19:16,970][155452] Updated weights for policy 0, policy_version 79170 (0.0006) [2023-03-07 09:19:17,764][155452] Updated weights for policy 0, policy_version 79180 (0.0007) [2023-03-07 09:19:18,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13023.9). Total num frames: 81087488. Throughput: 0: 13035.8. Samples: 81054904. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:19:18,367][155126] Avg episode reward: [(0, '2032.663')] [2023-03-07 09:19:18,555][155452] Updated weights for policy 0, policy_version 79190 (0.0006) [2023-03-07 09:19:19,325][155452] Updated weights for policy 0, policy_version 79200 (0.0006) [2023-03-07 09:19:20,107][155452] Updated weights for policy 0, policy_version 79210 (0.0006) [2023-03-07 09:19:20,896][155452] Updated weights for policy 0, policy_version 79220 (0.0006) [2023-03-07 09:19:21,668][155452] Updated weights for policy 0, policy_version 79230 (0.0007) [2023-03-07 09:19:22,457][155452] Updated weights for policy 0, policy_version 79240 (0.0006) [2023-03-07 09:19:23,248][155452] Updated weights for policy 0, policy_version 79250 (0.0006) [2023-03-07 09:19:23,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13020.4). Total num frames: 81153024. Throughput: 0: 13042.3. Samples: 81133164. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:19:23,367][155126] Avg episode reward: [(0, '1932.797')] [2023-03-07 09:19:24,038][155452] Updated weights for policy 0, policy_version 79260 (0.0006) [2023-03-07 09:19:24,817][155452] Updated weights for policy 0, policy_version 79270 (0.0006) [2023-03-07 09:19:25,609][155452] Updated weights for policy 0, policy_version 79280 (0.0006) [2023-03-07 09:19:26,377][155452] Updated weights for policy 0, policy_version 79290 (0.0006) [2023-03-07 09:19:27,166][155452] Updated weights for policy 0, policy_version 79300 (0.0006) [2023-03-07 09:19:27,947][155452] Updated weights for policy 0, policy_version 79310 (0.0006) [2023-03-07 09:19:28,367][155126] Fps is (10 sec: 13107.0, 60 sec: 13038.9, 300 sec: 13023.9). Total num frames: 81218560. Throughput: 0: 13048.7. Samples: 81211430. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:19:28,367][155126] Avg episode reward: [(0, '1972.473')] [2023-03-07 09:19:28,371][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000079315_81218560.pth... [2023-03-07 09:19:28,402][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000076262_78092288.pth [2023-03-07 09:19:28,730][155452] Updated weights for policy 0, policy_version 79320 (0.0006) [2023-03-07 09:19:29,502][155452] Updated weights for policy 0, policy_version 79330 (0.0007) [2023-03-07 09:19:30,271][155452] Updated weights for policy 0, policy_version 79340 (0.0006) [2023-03-07 09:19:31,066][155452] Updated weights for policy 0, policy_version 79350 (0.0006) [2023-03-07 09:19:31,841][155452] Updated weights for policy 0, policy_version 79360 (0.0006) [2023-03-07 09:19:32,617][155452] Updated weights for policy 0, policy_version 79370 (0.0007) [2023-03-07 09:19:33,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13027.4). Total num frames: 81284096. Throughput: 0: 13056.7. Samples: 81250748. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:19:33,367][155126] Avg episode reward: [(0, '1948.903')] [2023-03-07 09:19:33,394][155452] Updated weights for policy 0, policy_version 79380 (0.0007) [2023-03-07 09:19:34,170][155452] Updated weights for policy 0, policy_version 79390 (0.0006) [2023-03-07 09:19:34,941][155452] Updated weights for policy 0, policy_version 79400 (0.0006) [2023-03-07 09:19:35,734][155452] Updated weights for policy 0, policy_version 79410 (0.0007) [2023-03-07 09:19:36,522][155452] Updated weights for policy 0, policy_version 79420 (0.0007) [2023-03-07 09:19:37,290][155452] Updated weights for policy 0, policy_version 79430 (0.0005) [2023-03-07 09:19:38,089][155452] Updated weights for policy 0, policy_version 79440 (0.0006) [2023-03-07 09:19:38,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13027.4). Total num frames: 81349632. Throughput: 0: 13069.2. Samples: 81329671. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:19:38,367][155126] Avg episode reward: [(0, '2149.104')] [2023-03-07 09:19:38,850][155452] Updated weights for policy 0, policy_version 79450 (0.0006) [2023-03-07 09:19:39,645][155452] Updated weights for policy 0, policy_version 79460 (0.0006) [2023-03-07 09:19:40,425][155452] Updated weights for policy 0, policy_version 79470 (0.0007) [2023-03-07 09:19:41,208][155452] Updated weights for policy 0, policy_version 79480 (0.0006) [2023-03-07 09:19:42,000][155452] Updated weights for policy 0, policy_version 79490 (0.0007) [2023-03-07 09:19:42,784][155452] Updated weights for policy 0, policy_version 79500 (0.0007) [2023-03-07 09:19:43,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13030.8). Total num frames: 81415168. Throughput: 0: 13071.8. Samples: 81408201. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:19:43,367][155126] Avg episode reward: [(0, '2043.642')] [2023-03-07 09:19:43,563][155452] Updated weights for policy 0, policy_version 79510 (0.0006) [2023-03-07 09:19:44,345][155452] Updated weights for policy 0, policy_version 79520 (0.0006) [2023-03-07 09:19:45,126][155452] Updated weights for policy 0, policy_version 79530 (0.0006) [2023-03-07 09:19:45,922][155452] Updated weights for policy 0, policy_version 79540 (0.0006) [2023-03-07 09:19:46,696][155452] Updated weights for policy 0, policy_version 79550 (0.0006) [2023-03-07 09:19:47,486][155452] Updated weights for policy 0, policy_version 79560 (0.0005) [2023-03-07 09:19:48,279][155452] Updated weights for policy 0, policy_version 79570 (0.0006) [2023-03-07 09:19:48,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13073.1, 300 sec: 13030.8). Total num frames: 81480704. Throughput: 0: 13065.3. Samples: 81447297. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:19:48,367][155126] Avg episode reward: [(0, '2223.330')] [2023-03-07 09:19:49,067][155452] Updated weights for policy 0, policy_version 79580 (0.0006) [2023-03-07 09:19:49,850][155452] Updated weights for policy 0, policy_version 79590 (0.0006) [2023-03-07 09:19:50,643][155452] Updated weights for policy 0, policy_version 79600 (0.0006) [2023-03-07 09:19:51,421][155452] Updated weights for policy 0, policy_version 79610 (0.0007) [2023-03-07 09:19:52,200][155452] Updated weights for policy 0, policy_version 79620 (0.0006) [2023-03-07 09:19:52,985][155452] Updated weights for policy 0, policy_version 79630 (0.0007) [2023-03-07 09:19:53,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13073.1, 300 sec: 13034.3). Total num frames: 81546240. Throughput: 0: 13058.7. Samples: 81525511. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:19:53,378][155126] Avg episode reward: [(0, '2057.303')] [2023-03-07 09:19:53,778][155452] Updated weights for policy 0, policy_version 79640 (0.0006) [2023-03-07 09:19:54,550][155452] Updated weights for policy 0, policy_version 79650 (0.0007) [2023-03-07 09:19:55,346][155452] Updated weights for policy 0, policy_version 79660 (0.0006) [2023-03-07 09:19:56,140][155452] Updated weights for policy 0, policy_version 79670 (0.0007) [2023-03-07 09:19:56,910][155452] Updated weights for policy 0, policy_version 79680 (0.0006) [2023-03-07 09:19:57,702][155452] Updated weights for policy 0, policy_version 79690 (0.0006) [2023-03-07 09:19:58,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13056.0, 300 sec: 13030.8). Total num frames: 81610752. Throughput: 0: 13067.1. Samples: 81603840. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:19:58,367][155126] Avg episode reward: [(0, '2046.649')] [2023-03-07 09:19:58,478][155452] Updated weights for policy 0, policy_version 79700 (0.0007) [2023-03-07 09:19:59,293][155452] Updated weights for policy 0, policy_version 79710 (0.0006) [2023-03-07 09:20:00,068][155452] Updated weights for policy 0, policy_version 79720 (0.0006) [2023-03-07 09:20:00,871][155452] Updated weights for policy 0, policy_version 79730 (0.0006) [2023-03-07 09:20:01,652][155452] Updated weights for policy 0, policy_version 79740 (0.0008) [2023-03-07 09:20:02,440][155452] Updated weights for policy 0, policy_version 79750 (0.0006) [2023-03-07 09:20:03,218][155452] Updated weights for policy 0, policy_version 79760 (0.0007) [2023-03-07 09:20:03,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13034.3). Total num frames: 81676288. Throughput: 0: 13060.9. Samples: 81642642. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:20:03,367][155126] Avg episode reward: [(0, '2183.906')] [2023-03-07 09:20:04,008][155452] Updated weights for policy 0, policy_version 79770 (0.0006) [2023-03-07 09:20:04,786][155452] Updated weights for policy 0, policy_version 79780 (0.0006) [2023-03-07 09:20:05,574][155452] Updated weights for policy 0, policy_version 79790 (0.0006) [2023-03-07 09:20:06,365][155452] Updated weights for policy 0, policy_version 79800 (0.0006) [2023-03-07 09:20:07,150][155452] Updated weights for policy 0, policy_version 79810 (0.0007) [2023-03-07 09:20:07,934][155452] Updated weights for policy 0, policy_version 79820 (0.0005) [2023-03-07 09:20:08,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13056.0, 300 sec: 13030.8). Total num frames: 81740800. Throughput: 0: 13059.0. Samples: 81720822. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:20:08,367][155126] Avg episode reward: [(0, '2057.044')] [2023-03-07 09:20:08,728][155452] Updated weights for policy 0, policy_version 79830 (0.0006) [2023-03-07 09:20:09,500][155452] Updated weights for policy 0, policy_version 79840 (0.0005) [2023-03-07 09:20:10,305][155452] Updated weights for policy 0, policy_version 79850 (0.0007) [2023-03-07 09:20:11,078][155452] Updated weights for policy 0, policy_version 79860 (0.0006) [2023-03-07 09:20:11,886][155452] Updated weights for policy 0, policy_version 79870 (0.0007) [2023-03-07 09:20:12,685][155452] Updated weights for policy 0, policy_version 79880 (0.0006) [2023-03-07 09:20:13,367][155126] Fps is (10 sec: 12902.3, 60 sec: 13038.9, 300 sec: 13030.8). Total num frames: 81805312. Throughput: 0: 13051.5. Samples: 81798749. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:20:13,367][155126] Avg episode reward: [(0, '2063.034')] [2023-03-07 09:20:13,454][155452] Updated weights for policy 0, policy_version 79890 (0.0006) [2023-03-07 09:20:14,245][155452] Updated weights for policy 0, policy_version 79900 (0.0007) [2023-03-07 09:20:15,030][155452] Updated weights for policy 0, policy_version 79910 (0.0006) [2023-03-07 09:20:15,819][155452] Updated weights for policy 0, policy_version 79920 (0.0006) [2023-03-07 09:20:16,592][155452] Updated weights for policy 0, policy_version 79930 (0.0007) [2023-03-07 09:20:17,398][155452] Updated weights for policy 0, policy_version 79940 (0.0006) [2023-03-07 09:20:18,173][155452] Updated weights for policy 0, policy_version 79950 (0.0005) [2023-03-07 09:20:18,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13056.0, 300 sec: 13030.8). Total num frames: 81870848. Throughput: 0: 13044.2. Samples: 81837738. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:20:18,367][155126] Avg episode reward: [(0, '1950.363')] [2023-03-07 09:20:18,951][155452] Updated weights for policy 0, policy_version 79960 (0.0006) [2023-03-07 09:20:19,746][155452] Updated weights for policy 0, policy_version 79970 (0.0006) [2023-03-07 09:20:20,522][155452] Updated weights for policy 0, policy_version 79980 (0.0006) [2023-03-07 09:20:21,322][155452] Updated weights for policy 0, policy_version 79990 (0.0007) [2023-03-07 09:20:22,101][155452] Updated weights for policy 0, policy_version 80000 (0.0006) [2023-03-07 09:20:22,868][155452] Updated weights for policy 0, policy_version 80010 (0.0006) [2023-03-07 09:20:23,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13030.8). Total num frames: 81936384. Throughput: 0: 13030.8. Samples: 81916057. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:20:23,367][155126] Avg episode reward: [(0, '1922.713')] [2023-03-07 09:20:23,655][155452] Updated weights for policy 0, policy_version 80020 (0.0006) [2023-03-07 09:20:24,455][155452] Updated weights for policy 0, policy_version 80030 (0.0007) [2023-03-07 09:20:25,242][155452] Updated weights for policy 0, policy_version 80040 (0.0006) [2023-03-07 09:20:25,992][155452] Updated weights for policy 0, policy_version 80050 (0.0006) [2023-03-07 09:20:26,797][155452] Updated weights for policy 0, policy_version 80060 (0.0005) [2023-03-07 09:20:27,575][155452] Updated weights for policy 0, policy_version 80070 (0.0006) [2023-03-07 09:20:28,337][155452] Updated weights for policy 0, policy_version 80080 (0.0006) [2023-03-07 09:20:28,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13056.0, 300 sec: 13030.8). Total num frames: 82001920. Throughput: 0: 13030.2. Samples: 81994562. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:20:28,367][155126] Avg episode reward: [(0, '1878.829')] [2023-03-07 09:20:29,133][155452] Updated weights for policy 0, policy_version 80090 (0.0006) [2023-03-07 09:20:29,921][155452] Updated weights for policy 0, policy_version 80100 (0.0006) [2023-03-07 09:20:30,711][155452] Updated weights for policy 0, policy_version 80110 (0.0006) [2023-03-07 09:20:31,509][155452] Updated weights for policy 0, policy_version 80120 (0.0007) [2023-03-07 09:20:32,303][155452] Updated weights for policy 0, policy_version 80130 (0.0006) [2023-03-07 09:20:33,078][155452] Updated weights for policy 0, policy_version 80140 (0.0006) [2023-03-07 09:20:33,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13030.8). Total num frames: 82066432. Throughput: 0: 13031.0. Samples: 82033693. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:20:33,367][155126] Avg episode reward: [(0, '1878.926')] [2023-03-07 09:20:33,860][155452] Updated weights for policy 0, policy_version 80150 (0.0007) [2023-03-07 09:20:34,658][155452] Updated weights for policy 0, policy_version 80160 (0.0007) [2023-03-07 09:20:35,444][155452] Updated weights for policy 0, policy_version 80170 (0.0006) [2023-03-07 09:20:36,223][155452] Updated weights for policy 0, policy_version 80180 (0.0007) [2023-03-07 09:20:37,037][155452] Updated weights for policy 0, policy_version 80190 (0.0006) [2023-03-07 09:20:37,808][155452] Updated weights for policy 0, policy_version 80200 (0.0006) [2023-03-07 09:20:38,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 82131968. Throughput: 0: 13025.7. Samples: 82111670. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:20:38,367][155126] Avg episode reward: [(0, '1832.378')] [2023-03-07 09:20:38,593][155452] Updated weights for policy 0, policy_version 80210 (0.0007) [2023-03-07 09:20:39,380][155452] Updated weights for policy 0, policy_version 80220 (0.0006) [2023-03-07 09:20:40,177][155452] Updated weights for policy 0, policy_version 80230 (0.0006) [2023-03-07 09:20:40,962][155452] Updated weights for policy 0, policy_version 80240 (0.0006) [2023-03-07 09:20:41,740][155452] Updated weights for policy 0, policy_version 80250 (0.0007) [2023-03-07 09:20:42,505][155452] Updated weights for policy 0, policy_version 80260 (0.0006) [2023-03-07 09:20:43,289][155452] Updated weights for policy 0, policy_version 80270 (0.0006) [2023-03-07 09:20:43,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13038.9, 300 sec: 13037.8). Total num frames: 82197504. Throughput: 0: 13027.8. Samples: 82190089. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:20:43,367][155126] Avg episode reward: [(0, '1891.047')] [2023-03-07 09:20:44,091][155452] Updated weights for policy 0, policy_version 80280 (0.0006) [2023-03-07 09:20:44,865][155452] Updated weights for policy 0, policy_version 80290 (0.0006) [2023-03-07 09:20:45,649][155452] Updated weights for policy 0, policy_version 80300 (0.0006) [2023-03-07 09:20:46,442][155452] Updated weights for policy 0, policy_version 80310 (0.0006) [2023-03-07 09:20:47,221][155452] Updated weights for policy 0, policy_version 80320 (0.0006) [2023-03-07 09:20:48,000][155452] Updated weights for policy 0, policy_version 80330 (0.0007) [2023-03-07 09:20:48,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.8, 300 sec: 13034.3). Total num frames: 82262016. Throughput: 0: 13033.2. Samples: 82229137. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:20:48,367][155126] Avg episode reward: [(0, '1814.808')] [2023-03-07 09:20:48,773][155452] Updated weights for policy 0, policy_version 80340 (0.0006) [2023-03-07 09:20:49,575][155452] Updated weights for policy 0, policy_version 80350 (0.0006) [2023-03-07 09:20:50,337][155452] Updated weights for policy 0, policy_version 80360 (0.0006) [2023-03-07 09:20:51,130][155452] Updated weights for policy 0, policy_version 80370 (0.0006) [2023-03-07 09:20:51,913][155452] Updated weights for policy 0, policy_version 80380 (0.0005) [2023-03-07 09:20:52,688][155452] Updated weights for policy 0, policy_version 80390 (0.0006) [2023-03-07 09:20:53,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13037.8). Total num frames: 82327552. Throughput: 0: 13039.1. Samples: 82307580. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:20:53,367][155126] Avg episode reward: [(0, '1754.197')] [2023-03-07 09:20:53,478][155452] Updated weights for policy 0, policy_version 80400 (0.0006) [2023-03-07 09:20:54,268][155452] Updated weights for policy 0, policy_version 80410 (0.0007) [2023-03-07 09:20:55,054][155452] Updated weights for policy 0, policy_version 80420 (0.0006) [2023-03-07 09:20:55,844][155452] Updated weights for policy 0, policy_version 80430 (0.0006) [2023-03-07 09:20:56,625][155452] Updated weights for policy 0, policy_version 80440 (0.0007) [2023-03-07 09:20:57,392][155452] Updated weights for policy 0, policy_version 80450 (0.0006) [2023-03-07 09:20:58,180][155452] Updated weights for policy 0, policy_version 80460 (0.0006) [2023-03-07 09:20:58,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13039.0, 300 sec: 13037.8). Total num frames: 82393088. Throughput: 0: 13046.2. Samples: 82385830. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:20:58,378][155126] Avg episode reward: [(0, '1920.836')] [2023-03-07 09:20:58,974][155452] Updated weights for policy 0, policy_version 80470 (0.0007) [2023-03-07 09:20:59,763][155452] Updated weights for policy 0, policy_version 80480 (0.0006) [2023-03-07 09:21:00,550][155452] Updated weights for policy 0, policy_version 80490 (0.0006) [2023-03-07 09:21:01,344][155452] Updated weights for policy 0, policy_version 80500 (0.0006) [2023-03-07 09:21:02,134][155452] Updated weights for policy 0, policy_version 80510 (0.0006) [2023-03-07 09:21:02,919][155452] Updated weights for policy 0, policy_version 80520 (0.0006) [2023-03-07 09:21:03,367][155126] Fps is (10 sec: 13004.5, 60 sec: 13021.8, 300 sec: 13034.3). Total num frames: 82457600. Throughput: 0: 13048.3. Samples: 82424914. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:21:03,378][155126] Avg episode reward: [(0, '1883.177')] [2023-03-07 09:21:03,701][155452] Updated weights for policy 0, policy_version 80530 (0.0006) [2023-03-07 09:21:04,472][155452] Updated weights for policy 0, policy_version 80540 (0.0006) [2023-03-07 09:21:05,276][155452] Updated weights for policy 0, policy_version 80550 (0.0007) [2023-03-07 09:21:06,044][155452] Updated weights for policy 0, policy_version 80560 (0.0006) [2023-03-07 09:21:06,852][155452] Updated weights for policy 0, policy_version 80570 (0.0006) [2023-03-07 09:21:07,631][155452] Updated weights for policy 0, policy_version 80580 (0.0006) [2023-03-07 09:21:08,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13039.0, 300 sec: 13037.8). Total num frames: 82523136. Throughput: 0: 13042.5. Samples: 82502968. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:21:08,367][155126] Avg episode reward: [(0, '2202.080')] [2023-03-07 09:21:08,422][155452] Updated weights for policy 0, policy_version 80590 (0.0006) [2023-03-07 09:21:09,206][155452] Updated weights for policy 0, policy_version 80600 (0.0006) [2023-03-07 09:21:09,990][155452] Updated weights for policy 0, policy_version 80610 (0.0006) [2023-03-07 09:21:10,765][155452] Updated weights for policy 0, policy_version 80620 (0.0006) [2023-03-07 09:21:11,561][155452] Updated weights for policy 0, policy_version 80630 (0.0006) [2023-03-07 09:21:12,344][155452] Updated weights for policy 0, policy_version 80640 (0.0006) [2023-03-07 09:21:13,135][155452] Updated weights for policy 0, policy_version 80650 (0.0006) [2023-03-07 09:21:13,367][155126] Fps is (10 sec: 13107.4, 60 sec: 13056.0, 300 sec: 13037.8). Total num frames: 82588672. Throughput: 0: 13038.4. Samples: 82581289. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:21:13,367][155126] Avg episode reward: [(0, '2064.709')] [2023-03-07 09:21:13,934][155452] Updated weights for policy 0, policy_version 80660 (0.0006) [2023-03-07 09:21:14,714][155452] Updated weights for policy 0, policy_version 80670 (0.0006) [2023-03-07 09:21:15,495][155452] Updated weights for policy 0, policy_version 80680 (0.0006) [2023-03-07 09:21:16,281][155452] Updated weights for policy 0, policy_version 80690 (0.0006) [2023-03-07 09:21:17,078][155452] Updated weights for policy 0, policy_version 80700 (0.0006) [2023-03-07 09:21:17,867][155452] Updated weights for policy 0, policy_version 80710 (0.0007) [2023-03-07 09:21:18,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13039.0, 300 sec: 13037.8). Total num frames: 82653184. Throughput: 0: 13031.4. Samples: 82620107. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:21:18,367][155126] Avg episode reward: [(0, '2160.928')] [2023-03-07 09:21:18,650][155452] Updated weights for policy 0, policy_version 80720 (0.0006) [2023-03-07 09:21:19,441][155452] Updated weights for policy 0, policy_version 80730 (0.0006) [2023-03-07 09:21:20,246][155452] Updated weights for policy 0, policy_version 80740 (0.0007) [2023-03-07 09:21:21,026][155452] Updated weights for policy 0, policy_version 80750 (0.0007) [2023-03-07 09:21:21,810][155452] Updated weights for policy 0, policy_version 80760 (0.0006) [2023-03-07 09:21:22,594][155452] Updated weights for policy 0, policy_version 80770 (0.0005) [2023-03-07 09:21:23,367][155126] Fps is (10 sec: 12902.3, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 82717696. Throughput: 0: 13030.3. Samples: 82698032. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:21:23,367][155126] Avg episode reward: [(0, '1969.102')] [2023-03-07 09:21:23,387][155452] Updated weights for policy 0, policy_version 80780 (0.0005) [2023-03-07 09:21:24,161][155452] Updated weights for policy 0, policy_version 80790 (0.0006) [2023-03-07 09:21:24,958][155452] Updated weights for policy 0, policy_version 80800 (0.0007) [2023-03-07 09:21:25,742][155452] Updated weights for policy 0, policy_version 80810 (0.0007) [2023-03-07 09:21:26,530][155452] Updated weights for policy 0, policy_version 80820 (0.0007) [2023-03-07 09:21:27,301][155452] Updated weights for policy 0, policy_version 80830 (0.0007) [2023-03-07 09:21:28,097][155452] Updated weights for policy 0, policy_version 80840 (0.0006) [2023-03-07 09:21:28,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 82783232. Throughput: 0: 13021.6. Samples: 82776060. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:21:28,367][155126] Avg episode reward: [(0, '2057.094')] [2023-03-07 09:21:28,371][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000080843_82783232.pth... 
[2023-03-07 09:21:28,403][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000077787_79653888.pth [2023-03-07 09:21:28,893][155452] Updated weights for policy 0, policy_version 80850 (0.0006) [2023-03-07 09:21:29,691][155452] Updated weights for policy 0, policy_version 80860 (0.0005) [2023-03-07 09:21:30,475][155452] Updated weights for policy 0, policy_version 80870 (0.0007) [2023-03-07 09:21:31,275][155452] Updated weights for policy 0, policy_version 80880 (0.0006) [2023-03-07 09:21:32,061][155452] Updated weights for policy 0, policy_version 80890 (0.0006) [2023-03-07 09:21:32,829][155452] Updated weights for policy 0, policy_version 80900 (0.0006) [2023-03-07 09:21:33,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 82847744. Throughput: 0: 13019.0. Samples: 82814993. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:21:33,367][155126] Avg episode reward: [(0, '2211.569')] [2023-03-07 09:21:33,613][155452] Updated weights for policy 0, policy_version 80910 (0.0006) [2023-03-07 09:21:34,402][155452] Updated weights for policy 0, policy_version 80920 (0.0006) [2023-03-07 09:21:35,177][155452] Updated weights for policy 0, policy_version 80930 (0.0006) [2023-03-07 09:21:35,965][155452] Updated weights for policy 0, policy_version 80940 (0.0007) [2023-03-07 09:21:36,753][155452] Updated weights for policy 0, policy_version 80950 (0.0007) [2023-03-07 09:21:37,540][155452] Updated weights for policy 0, policy_version 80960 (0.0006) [2023-03-07 09:21:38,317][155452] Updated weights for policy 0, policy_version 80970 (0.0006) [2023-03-07 09:21:38,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13037.8). Total num frames: 82913280. Throughput: 0: 13022.6. Samples: 82893597. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:21:38,367][155126] Avg episode reward: [(0, '1756.413')] [2023-03-07 09:21:39,113][155452] Updated weights for policy 0, policy_version 80980 (0.0006) [2023-03-07 09:21:39,890][155452] Updated weights for policy 0, policy_version 80990 (0.0006) [2023-03-07 09:21:40,669][155452] Updated weights for policy 0, policy_version 81000 (0.0007) [2023-03-07 09:21:41,452][155452] Updated weights for policy 0, policy_version 81010 (0.0006) [2023-03-07 09:21:42,230][155452] Updated weights for policy 0, policy_version 81020 (0.0006) [2023-03-07 09:21:43,016][155452] Updated weights for policy 0, policy_version 81030 (0.0007) [2023-03-07 09:21:43,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13021.9, 300 sec: 13037.8). Total num frames: 82978816. Throughput: 0: 13018.0. Samples: 82971639. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:21:43,367][155126] Avg episode reward: [(0, '1932.627')] [2023-03-07 09:21:43,801][155452] Updated weights for policy 0, policy_version 81040 (0.0006) [2023-03-07 09:21:44,583][155452] Updated weights for policy 0, policy_version 81050 (0.0006) [2023-03-07 09:21:45,375][155452] Updated weights for policy 0, policy_version 81060 (0.0006) [2023-03-07 09:21:46,174][155452] Updated weights for policy 0, policy_version 81070 (0.0006) [2023-03-07 09:21:46,953][155452] Updated weights for policy 0, policy_version 81080 (0.0006) [2023-03-07 09:21:47,749][155452] Updated weights for policy 0, policy_version 81090 (0.0006) [2023-03-07 09:21:48,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 83043328. Throughput: 0: 13018.9. Samples: 83010761. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:21:48,367][155126] Avg episode reward: [(0, '2121.808')] [2023-03-07 09:21:48,529][155452] Updated weights for policy 0, policy_version 81100 (0.0006) [2023-03-07 09:21:49,338][155452] Updated weights for policy 0, policy_version 81110 (0.0006) [2023-03-07 09:21:50,136][155452] Updated weights for policy 0, policy_version 81120 (0.0007) [2023-03-07 09:21:50,919][155452] Updated weights for policy 0, policy_version 81130 (0.0006) [2023-03-07 09:21:51,701][155452] Updated weights for policy 0, policy_version 81140 (0.0006) [2023-03-07 09:21:52,478][155452] Updated weights for policy 0, policy_version 81150 (0.0005) [2023-03-07 09:21:53,260][155452] Updated weights for policy 0, policy_version 81160 (0.0006) [2023-03-07 09:21:53,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.8, 300 sec: 13034.3). Total num frames: 83108864. Throughput: 0: 13013.5. Samples: 83088577. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:21:53,367][155126] Avg episode reward: [(0, '1954.389')] [2023-03-07 09:21:54,054][155452] Updated weights for policy 0, policy_version 81170 (0.0006) [2023-03-07 09:21:54,841][155452] Updated weights for policy 0, policy_version 81180 (0.0006) [2023-03-07 09:21:55,637][155452] Updated weights for policy 0, policy_version 81190 (0.0006) [2023-03-07 09:21:56,417][155452] Updated weights for policy 0, policy_version 81200 (0.0007) [2023-03-07 09:21:57,205][155452] Updated weights for policy 0, policy_version 81210 (0.0006) [2023-03-07 09:21:58,007][155452] Updated weights for policy 0, policy_version 81220 (0.0006) [2023-03-07 09:21:58,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13034.3). Total num frames: 83173376. Throughput: 0: 13005.8. Samples: 83166551. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:21:58,367][155126] Avg episode reward: [(0, '1943.580')] [2023-03-07 09:21:58,768][155452] Updated weights for policy 0, policy_version 81230 (0.0005) [2023-03-07 09:21:59,559][155452] Updated weights for policy 0, policy_version 81240 (0.0006) [2023-03-07 09:22:00,361][155452] Updated weights for policy 0, policy_version 81250 (0.0006) [2023-03-07 09:22:01,144][155452] Updated weights for policy 0, policy_version 81260 (0.0006) [2023-03-07 09:22:01,937][155452] Updated weights for policy 0, policy_version 81270 (0.0005) [2023-03-07 09:22:02,721][155452] Updated weights for policy 0, policy_version 81280 (0.0005) [2023-03-07 09:22:03,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 83238912. Throughput: 0: 13012.7. Samples: 83205681. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:22:03,367][155126] Avg episode reward: [(0, '1930.533')] [2023-03-07 09:22:03,511][155452] Updated weights for policy 0, policy_version 81290 (0.0007) [2023-03-07 09:22:04,291][155452] Updated weights for policy 0, policy_version 81300 (0.0006) [2023-03-07 09:22:05,081][155452] Updated weights for policy 0, policy_version 81310 (0.0007) [2023-03-07 09:22:05,894][155452] Updated weights for policy 0, policy_version 81320 (0.0006) [2023-03-07 09:22:06,674][155452] Updated weights for policy 0, policy_version 81330 (0.0006) [2023-03-07 09:22:07,454][155452] Updated weights for policy 0, policy_version 81340 (0.0006) [2023-03-07 09:22:08,242][155452] Updated weights for policy 0, policy_version 81350 (0.0005) [2023-03-07 09:22:08,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13034.3). Total num frames: 83303424. Throughput: 0: 13009.9. Samples: 83283478. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:22:08,367][155126] Avg episode reward: [(0, '1842.757')] [2023-03-07 09:22:09,022][155452] Updated weights for policy 0, policy_version 81360 (0.0007) [2023-03-07 09:22:09,801][155452] Updated weights for policy 0, policy_version 81370 (0.0006) [2023-03-07 09:22:10,613][155452] Updated weights for policy 0, policy_version 81380 (0.0006) [2023-03-07 09:22:11,382][155452] Updated weights for policy 0, policy_version 81390 (0.0005) [2023-03-07 09:22:12,161][155452] Updated weights for policy 0, policy_version 81400 (0.0006) [2023-03-07 09:22:12,953][155452] Updated weights for policy 0, policy_version 81410 (0.0006) [2023-03-07 09:22:13,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13034.3). Total num frames: 83368960. Throughput: 0: 13012.7. Samples: 83361634. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:22:13,367][155126] Avg episode reward: [(0, '1754.718')] [2023-03-07 09:22:13,743][155452] Updated weights for policy 0, policy_version 81420 (0.0006) [2023-03-07 09:22:14,549][155452] Updated weights for policy 0, policy_version 81430 (0.0006) [2023-03-07 09:22:15,321][155452] Updated weights for policy 0, policy_version 81440 (0.0006) [2023-03-07 09:22:16,111][155452] Updated weights for policy 0, policy_version 81450 (0.0006) [2023-03-07 09:22:16,906][155452] Updated weights for policy 0, policy_version 81460 (0.0006) [2023-03-07 09:22:17,681][155452] Updated weights for policy 0, policy_version 81470 (0.0006) [2023-03-07 09:22:18,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13034.3). Total num frames: 83433472. Throughput: 0: 13015.5. Samples: 83400691. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:22:18,367][155126] Avg episode reward: [(0, '1831.137')] [2023-03-07 09:22:18,443][155452] Updated weights for policy 0, policy_version 81480 (0.0006) [2023-03-07 09:22:19,232][155452] Updated weights for policy 0, policy_version 81490 (0.0006) [2023-03-07 09:22:20,022][155452] Updated weights for policy 0, policy_version 81500 (0.0007) [2023-03-07 09:22:20,807][155452] Updated weights for policy 0, policy_version 81510 (0.0006) [2023-03-07 09:22:21,623][155452] Updated weights for policy 0, policy_version 81520 (0.0006) [2023-03-07 09:22:22,398][155452] Updated weights for policy 0, policy_version 81530 (0.0006) [2023-03-07 09:22:23,166][155452] Updated weights for policy 0, policy_version 81540 (0.0006) [2023-03-07 09:22:23,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 83499008. Throughput: 0: 13003.7. Samples: 83478763. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:22:23,367][155126] Avg episode reward: [(0, '1894.750')] [2023-03-07 09:22:23,976][155452] Updated weights for policy 0, policy_version 81550 (0.0006) [2023-03-07 09:22:24,753][155452] Updated weights for policy 0, policy_version 81560 (0.0006) [2023-03-07 09:22:25,545][155452] Updated weights for policy 0, policy_version 81570 (0.0006) [2023-03-07 09:22:26,330][155452] Updated weights for policy 0, policy_version 81580 (0.0007) [2023-03-07 09:22:27,109][155452] Updated weights for policy 0, policy_version 81590 (0.0006) [2023-03-07 09:22:27,890][155452] Updated weights for policy 0, policy_version 81600 (0.0006) [2023-03-07 09:22:28,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 83564544. Throughput: 0: 13011.1. Samples: 83557138. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:22:28,367][155126] Avg episode reward: [(0, '1949.298')] [2023-03-07 09:22:28,690][155452] Updated weights for policy 0, policy_version 81610 (0.0007) [2023-03-07 09:22:29,462][155452] Updated weights for policy 0, policy_version 81620 (0.0006) [2023-03-07 09:22:30,246][155452] Updated weights for policy 0, policy_version 81630 (0.0006) [2023-03-07 09:22:31,033][155452] Updated weights for policy 0, policy_version 81640 (0.0006) [2023-03-07 09:22:31,825][155452] Updated weights for policy 0, policy_version 81650 (0.0006) [2023-03-07 09:22:32,599][155452] Updated weights for policy 0, policy_version 81660 (0.0006) [2023-03-07 09:22:33,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 83629056. Throughput: 0: 13007.6. Samples: 83596103. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:22:33,367][155126] Avg episode reward: [(0, '2099.243')] [2023-03-07 09:22:33,380][155452] Updated weights for policy 0, policy_version 81670 (0.0006) [2023-03-07 09:22:34,179][155452] Updated weights for policy 0, policy_version 81680 (0.0006) [2023-03-07 09:22:34,962][155452] Updated weights for policy 0, policy_version 81690 (0.0006) [2023-03-07 09:22:35,745][155452] Updated weights for policy 0, policy_version 81700 (0.0006) [2023-03-07 09:22:36,537][155452] Updated weights for policy 0, policy_version 81710 (0.0007) [2023-03-07 09:22:37,323][155452] Updated weights for policy 0, policy_version 81720 (0.0006) [2023-03-07 09:22:38,106][155452] Updated weights for policy 0, policy_version 81730 (0.0006) [2023-03-07 09:22:38,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13021.8, 300 sec: 13034.3). Total num frames: 83694592. Throughput: 0: 13021.9. Samples: 83674564. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:22:38,368][155126] Avg episode reward: [(0, '1874.291')] [2023-03-07 09:22:38,893][155452] Updated weights for policy 0, policy_version 81740 (0.0007) [2023-03-07 09:22:39,674][155452] Updated weights for policy 0, policy_version 81750 (0.0006) [2023-03-07 09:22:40,471][155452] Updated weights for policy 0, policy_version 81760 (0.0007) [2023-03-07 09:22:41,245][155452] Updated weights for policy 0, policy_version 81770 (0.0006) [2023-03-07 09:22:42,037][155452] Updated weights for policy 0, policy_version 81780 (0.0006) [2023-03-07 09:22:42,816][155452] Updated weights for policy 0, policy_version 81790 (0.0006) [2023-03-07 09:22:43,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 83760128. Throughput: 0: 13029.2. Samples: 83752864. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:22:43,367][155126] Avg episode reward: [(0, '1760.841')] [2023-03-07 09:22:43,595][155452] Updated weights for policy 0, policy_version 81800 (0.0006) [2023-03-07 09:22:44,380][155452] Updated weights for policy 0, policy_version 81810 (0.0007) [2023-03-07 09:22:45,170][155452] Updated weights for policy 0, policy_version 81820 (0.0006) [2023-03-07 09:22:45,966][155452] Updated weights for policy 0, policy_version 81830 (0.0006) [2023-03-07 09:22:46,748][155452] Updated weights for policy 0, policy_version 81840 (0.0006) [2023-03-07 09:22:47,530][155452] Updated weights for policy 0, policy_version 81850 (0.0006) [2023-03-07 09:22:48,303][155452] Updated weights for policy 0, policy_version 81860 (0.0007) [2023-03-07 09:22:48,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 83824640. Throughput: 0: 13027.7. Samples: 83791926. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:22:48,367][155126] Avg episode reward: [(0, '2052.760')] [2023-03-07 09:22:49,093][155452] Updated weights for policy 0, policy_version 81870 (0.0006) [2023-03-07 09:22:49,870][155452] Updated weights for policy 0, policy_version 81880 (0.0006) [2023-03-07 09:22:50,670][155452] Updated weights for policy 0, policy_version 81890 (0.0006) [2023-03-07 09:22:51,452][155452] Updated weights for policy 0, policy_version 81900 (0.0006) [2023-03-07 09:22:52,218][155452] Updated weights for policy 0, policy_version 81910 (0.0006) [2023-03-07 09:22:53,008][155452] Updated weights for policy 0, policy_version 81920 (0.0008) [2023-03-07 09:22:53,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 83890176. Throughput: 0: 13038.2. Samples: 83870196. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:22:53,367][155126] Avg episode reward: [(0, '1893.237')] [2023-03-07 09:22:53,795][155452] Updated weights for policy 0, policy_version 81930 (0.0006) [2023-03-07 09:22:54,575][155452] Updated weights for policy 0, policy_version 81940 (0.0006) [2023-03-07 09:22:55,356][155452] Updated weights for policy 0, policy_version 81950 (0.0006) [2023-03-07 09:22:56,138][155452] Updated weights for policy 0, policy_version 81960 (0.0006) [2023-03-07 09:22:56,910][155452] Updated weights for policy 0, policy_version 81970 (0.0006) [2023-03-07 09:22:57,689][155452] Updated weights for policy 0, policy_version 81980 (0.0006) [2023-03-07 09:22:58,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 83955712. Throughput: 0: 13048.6. Samples: 83948819. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:22:58,367][155126] Avg episode reward: [(0, '1871.717')] [2023-03-07 09:22:58,485][155452] Updated weights for policy 0, policy_version 81990 (0.0006) [2023-03-07 09:22:59,274][155452] Updated weights for policy 0, policy_version 82000 (0.0007) [2023-03-07 09:23:00,059][155452] Updated weights for policy 0, policy_version 82010 (0.0005) [2023-03-07 09:23:00,829][155452] Updated weights for policy 0, policy_version 82020 (0.0006) [2023-03-07 09:23:01,630][155452] Updated weights for policy 0, policy_version 82030 (0.0006) [2023-03-07 09:23:02,418][155452] Updated weights for policy 0, policy_version 82040 (0.0006) [2023-03-07 09:23:03,195][155452] Updated weights for policy 0, policy_version 82050 (0.0006) [2023-03-07 09:23:03,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 84021248. Throughput: 0: 13047.8. Samples: 83987840. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:23:03,367][155126] Avg episode reward: [(0, '1949.335')] [2023-03-07 09:23:03,999][155452] Updated weights for policy 0, policy_version 82060 (0.0006) [2023-03-07 09:23:04,792][155452] Updated weights for policy 0, policy_version 82070 (0.0006) [2023-03-07 09:23:05,570][155452] Updated weights for policy 0, policy_version 82080 (0.0005) [2023-03-07 09:23:06,357][155452] Updated weights for policy 0, policy_version 82090 (0.0006) [2023-03-07 09:23:07,156][155452] Updated weights for policy 0, policy_version 82100 (0.0006) [2023-03-07 09:23:07,945][155452] Updated weights for policy 0, policy_version 82110 (0.0007) [2023-03-07 09:23:08,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 84085760. Throughput: 0: 13046.3. Samples: 84065849. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:23:08,368][155126] Avg episode reward: [(0, '1884.120')] [2023-03-07 09:23:08,733][155452] Updated weights for policy 0, policy_version 82120 (0.0007) [2023-03-07 09:23:09,522][155452] Updated weights for policy 0, policy_version 82130 (0.0006) [2023-03-07 09:23:10,317][155452] Updated weights for policy 0, policy_version 82140 (0.0006) [2023-03-07 09:23:11,088][155452] Updated weights for policy 0, policy_version 82150 (0.0006) [2023-03-07 09:23:11,888][155452] Updated weights for policy 0, policy_version 82160 (0.0006) [2023-03-07 09:23:12,661][155452] Updated weights for policy 0, policy_version 82170 (0.0006) [2023-03-07 09:23:13,367][155126] Fps is (10 sec: 12902.2, 60 sec: 13021.8, 300 sec: 13030.8). Total num frames: 84150272. Throughput: 0: 13034.3. Samples: 84143685. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:23:13,368][155126] Avg episode reward: [(0, '2178.581')] [2023-03-07 09:23:13,442][155452] Updated weights for policy 0, policy_version 82180 (0.0006) [2023-03-07 09:23:14,251][155452] Updated weights for policy 0, policy_version 82190 (0.0006) [2023-03-07 09:23:15,041][155452] Updated weights for policy 0, policy_version 82200 (0.0006) [2023-03-07 09:23:15,804][155452] Updated weights for policy 0, policy_version 82210 (0.0006) [2023-03-07 09:23:16,594][155452] Updated weights for policy 0, policy_version 82220 (0.0006) [2023-03-07 09:23:17,391][155452] Updated weights for policy 0, policy_version 82230 (0.0006) [2023-03-07 09:23:18,169][155452] Updated weights for policy 0, policy_version 82240 (0.0006) [2023-03-07 09:23:18,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 84215808. Throughput: 0: 13038.8. Samples: 84182850. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:23:18,367][155126] Avg episode reward: [(0, '2128.160')] [2023-03-07 09:23:18,961][155452] Updated weights for policy 0, policy_version 82250 (0.0006) [2023-03-07 09:23:19,761][155452] Updated weights for policy 0, policy_version 82260 (0.0006) [2023-03-07 09:23:20,520][155452] Updated weights for policy 0, policy_version 82270 (0.0006) [2023-03-07 09:23:21,299][155452] Updated weights for policy 0, policy_version 82280 (0.0006) [2023-03-07 09:23:22,087][155452] Updated weights for policy 0, policy_version 82290 (0.0005) [2023-03-07 09:23:22,874][155452] Updated weights for policy 0, policy_version 82300 (0.0006) [2023-03-07 09:23:23,367][155126] Fps is (10 sec: 13107.4, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 84281344. Throughput: 0: 13036.1. Samples: 84261186. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:23:23,367][155126] Avg episode reward: [(0, '2259.554')] [2023-03-07 09:23:23,649][155452] Updated weights for policy 0, policy_version 82310 (0.0006) [2023-03-07 09:23:24,430][155452] Updated weights for policy 0, policy_version 82320 (0.0006) [2023-03-07 09:23:25,220][155452] Updated weights for policy 0, policy_version 82330 (0.0006) [2023-03-07 09:23:26,008][155452] Updated weights for policy 0, policy_version 82340 (0.0007) [2023-03-07 09:23:26,774][155452] Updated weights for policy 0, policy_version 82350 (0.0007) [2023-03-07 09:23:27,574][155452] Updated weights for policy 0, policy_version 82360 (0.0006) [2023-03-07 09:23:28,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.8, 300 sec: 13030.8). Total num frames: 84345856. Throughput: 0: 13033.4. Samples: 84339368. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:23:28,371][155452] Updated weights for policy 0, policy_version 82370 (0.0006) [2023-03-07 09:23:28,378][155126] Avg episode reward: [(0, '2003.029')] [2023-03-07 09:23:28,382][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000082370_84346880.pth... [2023-03-07 09:23:28,413][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000079315_81218560.pth [2023-03-07 09:23:29,159][155452] Updated weights for policy 0, policy_version 82380 (0.0006) [2023-03-07 09:23:29,949][155452] Updated weights for policy 0, policy_version 82390 (0.0006) [2023-03-07 09:23:30,742][155452] Updated weights for policy 0, policy_version 82400 (0.0006) [2023-03-07 09:23:31,538][155452] Updated weights for policy 0, policy_version 82410 (0.0006) [2023-03-07 09:23:32,331][155452] Updated weights for policy 0, policy_version 82420 (0.0006) [2023-03-07 09:23:33,101][155452] Updated weights for policy 0, policy_version 82430 (0.0006) [2023-03-07 09:23:33,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 84411392. Throughput: 0: 13029.6. Samples: 84378258. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:23:33,367][155126] Avg episode reward: [(0, '1981.923')] [2023-03-07 09:23:33,903][155452] Updated weights for policy 0, policy_version 82440 (0.0007) [2023-03-07 09:23:34,689][155452] Updated weights for policy 0, policy_version 82450 (0.0006) [2023-03-07 09:23:35,484][155452] Updated weights for policy 0, policy_version 82460 (0.0006) [2023-03-07 09:23:36,267][155452] Updated weights for policy 0, policy_version 82470 (0.0007) [2023-03-07 09:23:37,064][155452] Updated weights for policy 0, policy_version 82480 (0.0007) [2023-03-07 09:23:37,825][155452] Updated weights for policy 0, policy_version 82490 (0.0006) [2023-03-07 09:23:38,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 84475904. Throughput: 0: 13021.1. Samples: 84456144. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:23:38,367][155126] Avg episode reward: [(0, '2117.704')] [2023-03-07 09:23:38,613][155452] Updated weights for policy 0, policy_version 82500 (0.0007) [2023-03-07 09:23:39,418][155452] Updated weights for policy 0, policy_version 82510 (0.0007) [2023-03-07 09:23:40,201][155452] Updated weights for policy 0, policy_version 82520 (0.0006) [2023-03-07 09:23:40,979][155452] Updated weights for policy 0, policy_version 82530 (0.0007) [2023-03-07 09:23:41,757][155452] Updated weights for policy 0, policy_version 82540 (0.0006) [2023-03-07 09:23:42,537][155452] Updated weights for policy 0, policy_version 82550 (0.0006) [2023-03-07 09:23:43,334][155452] Updated weights for policy 0, policy_version 82560 (0.0006) [2023-03-07 09:23:43,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 84541440. Throughput: 0: 13012.8. Samples: 84534396. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:23:43,367][155126] Avg episode reward: [(0, '2199.657')] [2023-03-07 09:23:44,117][155452] Updated weights for policy 0, policy_version 82570 (0.0007) [2023-03-07 09:23:44,891][155452] Updated weights for policy 0, policy_version 82580 (0.0006) [2023-03-07 09:23:45,663][155452] Updated weights for policy 0, policy_version 82590 (0.0006) [2023-03-07 09:23:46,452][155452] Updated weights for policy 0, policy_version 82600 (0.0006) [2023-03-07 09:23:47,238][155452] Updated weights for policy 0, policy_version 82610 (0.0006) [2023-03-07 09:23:48,034][155452] Updated weights for policy 0, policy_version 82620 (0.0007) [2023-03-07 09:23:48,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 84606976. Throughput: 0: 13020.5. Samples: 84573762. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:23:48,367][155126] Avg episode reward: [(0, '1973.701')] [2023-03-07 09:23:48,808][155452] Updated weights for policy 0, policy_version 82630 (0.0007) [2023-03-07 09:23:49,590][155452] Updated weights for policy 0, policy_version 82640 (0.0006) [2023-03-07 09:23:50,379][155452] Updated weights for policy 0, policy_version 82650 (0.0006) [2023-03-07 09:23:51,177][155452] Updated weights for policy 0, policy_version 82660 (0.0006) [2023-03-07 09:23:51,959][155452] Updated weights for policy 0, policy_version 82670 (0.0007) [2023-03-07 09:23:52,739][155452] Updated weights for policy 0, policy_version 82680 (0.0006) [2023-03-07 09:23:53,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13021.9, 300 sec: 13030.8). Total num frames: 84671488. Throughput: 0: 13021.0. Samples: 84651790. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:23:53,367][155126] Avg episode reward: [(0, '1942.800')] [2023-03-07 09:23:53,541][155452] Updated weights for policy 0, policy_version 82690 (0.0006) [2023-03-07 09:23:54,325][155452] Updated weights for policy 0, policy_version 82700 (0.0007) [2023-03-07 09:23:55,109][155452] Updated weights for policy 0, policy_version 82710 (0.0006) [2023-03-07 09:23:55,885][155452] Updated weights for policy 0, policy_version 82720 (0.0006) [2023-03-07 09:23:56,693][155452] Updated weights for policy 0, policy_version 82730 (0.0006) [2023-03-07 09:23:57,466][155452] Updated weights for policy 0, policy_version 82740 (0.0005) [2023-03-07 09:23:58,244][155452] Updated weights for policy 0, policy_version 82750 (0.0006) [2023-03-07 09:23:58,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13030.8). Total num frames: 84737024. Throughput: 0: 13032.0. Samples: 84730124. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:23:58,367][155126] Avg episode reward: [(0, '1976.487')] [2023-03-07 09:23:59,036][155452] Updated weights for policy 0, policy_version 82760 (0.0006) [2023-03-07 09:23:59,808][155452] Updated weights for policy 0, policy_version 82770 (0.0006) [2023-03-07 09:24:00,586][155452] Updated weights for policy 0, policy_version 82780 (0.0006) [2023-03-07 09:24:01,382][155452] Updated weights for policy 0, policy_version 82790 (0.0006) [2023-03-07 09:24:02,160][155452] Updated weights for policy 0, policy_version 82800 (0.0006) [2023-03-07 09:24:02,941][155452] Updated weights for policy 0, policy_version 82810 (0.0006) [2023-03-07 09:24:03,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13021.9, 300 sec: 13034.3). Total num frames: 84802560. Throughput: 0: 13038.8. Samples: 84769598. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:24:03,368][155126] Avg episode reward: [(0, '2020.156')] [2023-03-07 09:24:03,725][155452] Updated weights for policy 0, policy_version 82820 (0.0005) [2023-03-07 09:24:04,496][155452] Updated weights for policy 0, policy_version 82830 (0.0006) [2023-03-07 09:24:05,302][155452] Updated weights for policy 0, policy_version 82840 (0.0007) [2023-03-07 09:24:06,080][155452] Updated weights for policy 0, policy_version 82850 (0.0007) [2023-03-07 09:24:06,858][155452] Updated weights for policy 0, policy_version 82860 (0.0006) [2023-03-07 09:24:07,641][155452] Updated weights for policy 0, policy_version 82870 (0.0006) [2023-03-07 09:24:08,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13039.0, 300 sec: 13034.3). Total num frames: 84868096. Throughput: 0: 13033.5. Samples: 84847693. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:24:08,367][155126] Avg episode reward: [(0, '2218.082')] [2023-03-07 09:24:08,425][155452] Updated weights for policy 0, policy_version 82880 (0.0006) [2023-03-07 09:24:09,202][155452] Updated weights for policy 0, policy_version 82890 (0.0006) [2023-03-07 09:24:09,993][155452] Updated weights for policy 0, policy_version 82900 (0.0006) [2023-03-07 09:24:10,771][155452] Updated weights for policy 0, policy_version 82910 (0.0006) [2023-03-07 09:24:11,562][155452] Updated weights for policy 0, policy_version 82920 (0.0006) [2023-03-07 09:24:12,345][155452] Updated weights for policy 0, policy_version 82930 (0.0006) [2023-03-07 09:24:13,142][155452] Updated weights for policy 0, policy_version 82940 (0.0007) [2023-03-07 09:24:13,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13039.0, 300 sec: 13034.3). Total num frames: 84932608. Throughput: 0: 13038.1. Samples: 84926084. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:24:13,367][155126] Avg episode reward: [(0, '1989.828')] [2023-03-07 09:24:13,934][155452] Updated weights for policy 0, policy_version 82950 (0.0007) [2023-03-07 09:24:14,723][155452] Updated weights for policy 0, policy_version 82960 (0.0006) [2023-03-07 09:24:15,506][155452] Updated weights for policy 0, policy_version 82970 (0.0006) [2023-03-07 09:24:16,290][155452] Updated weights for policy 0, policy_version 82980 (0.0006) [2023-03-07 09:24:17,071][155452] Updated weights for policy 0, policy_version 82990 (0.0005) [2023-03-07 09:24:17,859][155452] Updated weights for policy 0, policy_version 83000 (0.0007) [2023-03-07 09:24:18,367][155126] Fps is (10 sec: 13004.5, 60 sec: 13038.9, 300 sec: 13034.3). Total num frames: 84998144. Throughput: 0: 13038.2. Samples: 84964979. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:24:18,368][155126] Avg episode reward: [(0, '2090.720')] [2023-03-07 09:24:18,662][155452] Updated weights for policy 0, policy_version 83010 (0.0006) [2023-03-07 09:24:19,442][155452] Updated weights for policy 0, policy_version 83020 (0.0007) [2023-03-07 09:24:20,233][155452] Updated weights for policy 0, policy_version 83030 (0.0007) [2023-03-07 09:24:21,020][155452] Updated weights for policy 0, policy_version 83040 (0.0006) [2023-03-07 09:24:21,809][155452] Updated weights for policy 0, policy_version 83050 (0.0006) [2023-03-07 09:24:22,601][155452] Updated weights for policy 0, policy_version 83060 (0.0006) [2023-03-07 09:24:23,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13030.8). Total num frames: 85062656. Throughput: 0: 13040.1. Samples: 85042948. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:24:23,368][155126] Avg episode reward: [(0, '1928.636')] [2023-03-07 09:24:23,385][155452] Updated weights for policy 0, policy_version 83070 (0.0006) [2023-03-07 09:24:24,173][155452] Updated weights for policy 0, policy_version 83080 (0.0007) [2023-03-07 09:24:24,965][155452] Updated weights for policy 0, policy_version 83090 (0.0006) [2023-03-07 09:24:25,766][155452] Updated weights for policy 0, policy_version 83100 (0.0007) [2023-03-07 09:24:26,566][155452] Updated weights for policy 0, policy_version 83110 (0.0006) [2023-03-07 09:24:27,361][155452] Updated weights for policy 0, policy_version 83120 (0.0006) [2023-03-07 09:24:28,159][155452] Updated weights for policy 0, policy_version 83130 (0.0005) [2023-03-07 09:24:28,367][155126] Fps is (10 sec: 12902.5, 60 sec: 13021.9, 300 sec: 13027.4). Total num frames: 85127168. Throughput: 0: 13024.4. Samples: 85120493. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:24:28,368][155126] Avg episode reward: [(0, '1934.487')] [2023-03-07 09:24:28,945][155452] Updated weights for policy 0, policy_version 83140 (0.0006) [2023-03-07 09:24:29,725][155452] Updated weights for policy 0, policy_version 83150 (0.0006) [2023-03-07 09:24:30,501][155452] Updated weights for policy 0, policy_version 83160 (0.0006) [2023-03-07 09:24:31,302][155452] Updated weights for policy 0, policy_version 83170 (0.0006) [2023-03-07 09:24:32,081][155452] Updated weights for policy 0, policy_version 83180 (0.0006) [2023-03-07 09:24:32,876][155452] Updated weights for policy 0, policy_version 83190 (0.0007) [2023-03-07 09:24:33,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13027.4). Total num frames: 85192704. Throughput: 0: 13014.4. Samples: 85159411. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:24:33,367][155126] Avg episode reward: [(0, '1980.690')] [2023-03-07 09:24:33,638][155452] Updated weights for policy 0, policy_version 83200 (0.0005) [2023-03-07 09:24:34,442][155452] Updated weights for policy 0, policy_version 83210 (0.0005) [2023-03-07 09:24:35,212][155452] Updated weights for policy 0, policy_version 83220 (0.0005) [2023-03-07 09:24:36,016][155452] Updated weights for policy 0, policy_version 83230 (0.0006) [2023-03-07 09:24:36,792][155452] Updated weights for policy 0, policy_version 83240 (0.0006) [2023-03-07 09:24:37,573][155452] Updated weights for policy 0, policy_version 83250 (0.0006) [2023-03-07 09:24:38,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13023.9). Total num frames: 85257216. Throughput: 0: 13022.7. Samples: 85237812. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:24:38,367][155126] Avg episode reward: [(0, '2061.995')] [2023-03-07 09:24:38,368][155452] Updated weights for policy 0, policy_version 83260 (0.0006) [2023-03-07 09:24:39,156][155452] Updated weights for policy 0, policy_version 83270 (0.0008) [2023-03-07 09:24:39,935][155452] Updated weights for policy 0, policy_version 83280 (0.0006) [2023-03-07 09:24:40,720][155452] Updated weights for policy 0, policy_version 83290 (0.0006) [2023-03-07 09:24:41,523][155452] Updated weights for policy 0, policy_version 83300 (0.0005) [2023-03-07 09:24:42,305][155452] Updated weights for policy 0, policy_version 83310 (0.0006) [2023-03-07 09:24:43,091][155452] Updated weights for policy 0, policy_version 83320 (0.0006) [2023-03-07 09:24:43,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13023.9). Total num frames: 85322752. Throughput: 0: 13016.1. Samples: 85315849. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:24:43,368][155126] Avg episode reward: [(0, '1859.697')] [2023-03-07 09:24:43,884][155452] Updated weights for policy 0, policy_version 83330 (0.0006) [2023-03-07 09:24:44,663][155452] Updated weights for policy 0, policy_version 83340 (0.0006) [2023-03-07 09:24:45,459][155452] Updated weights for policy 0, policy_version 83350 (0.0006) [2023-03-07 09:24:46,233][155452] Updated weights for policy 0, policy_version 83360 (0.0006) [2023-03-07 09:24:47,006][155452] Updated weights for policy 0, policy_version 83370 (0.0006) [2023-03-07 09:24:47,825][155452] Updated weights for policy 0, policy_version 83380 (0.0006) [2023-03-07 09:24:48,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13004.8, 300 sec: 13020.4). Total num frames: 85387264. Throughput: 0: 13006.5. Samples: 85354888. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:24:48,367][155126] Avg episode reward: [(0, '1887.606')] [2023-03-07 09:24:48,597][155452] Updated weights for policy 0, policy_version 83390 (0.0005) [2023-03-07 09:24:49,380][155452] Updated weights for policy 0, policy_version 83400 (0.0006) [2023-03-07 09:24:50,161][155452] Updated weights for policy 0, policy_version 83410 (0.0006) [2023-03-07 09:24:50,942][155452] Updated weights for policy 0, policy_version 83420 (0.0006) [2023-03-07 09:24:51,730][155452] Updated weights for policy 0, policy_version 83430 (0.0006) [2023-03-07 09:24:52,533][155452] Updated weights for policy 0, policy_version 83440 (0.0006) [2023-03-07 09:24:53,318][155452] Updated weights for policy 0, policy_version 83450 (0.0006) [2023-03-07 09:24:53,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13023.9). Total num frames: 85452800. Throughput: 0: 13007.1. Samples: 85433012. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:24:53,367][155126] Avg episode reward: [(0, '1840.468')] [2023-03-07 09:24:54,109][155452] Updated weights for policy 0, policy_version 83460 (0.0005) [2023-03-07 09:24:54,885][155452] Updated weights for policy 0, policy_version 83470 (0.0006) [2023-03-07 09:24:55,662][155452] Updated weights for policy 0, policy_version 83480 (0.0006) [2023-03-07 09:24:56,457][155452] Updated weights for policy 0, policy_version 83490 (0.0007) [2023-03-07 09:24:57,248][155452] Updated weights for policy 0, policy_version 83500 (0.0006) [2023-03-07 09:24:58,044][155452] Updated weights for policy 0, policy_version 83510 (0.0007) [2023-03-07 09:24:58,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13020.4). Total num frames: 85517312. Throughput: 0: 12997.5. Samples: 85510970. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:24:58,367][155126] Avg episode reward: [(0, '2015.920')] [2023-03-07 09:24:58,827][155452] Updated weights for policy 0, policy_version 83520 (0.0006) [2023-03-07 09:24:59,644][155452] Updated weights for policy 0, policy_version 83530 (0.0007) [2023-03-07 09:25:00,438][155452] Updated weights for policy 0, policy_version 83540 (0.0006) [2023-03-07 09:25:01,221][155452] Updated weights for policy 0, policy_version 83550 (0.0006) [2023-03-07 09:25:02,018][155452] Updated weights for policy 0, policy_version 83560 (0.0006) [2023-03-07 09:25:02,799][155452] Updated weights for policy 0, policy_version 83570 (0.0006) [2023-03-07 09:25:03,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13023.9). Total num frames: 85582848. Throughput: 0: 12991.2. Samples: 85549584. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:25:03,367][155126] Avg episode reward: [(0, '2164.150')] [2023-03-07 09:25:03,581][155452] Updated weights for policy 0, policy_version 83580 (0.0006) [2023-03-07 09:25:04,369][155452] Updated weights for policy 0, policy_version 83590 (0.0006) [2023-03-07 09:25:05,158][155452] Updated weights for policy 0, policy_version 83600 (0.0006) [2023-03-07 09:25:05,946][155452] Updated weights for policy 0, policy_version 83610 (0.0006) [2023-03-07 09:25:06,738][155452] Updated weights for policy 0, policy_version 83620 (0.0007) [2023-03-07 09:25:07,514][155452] Updated weights for policy 0, policy_version 83630 (0.0006) [2023-03-07 09:25:08,300][155452] Updated weights for policy 0, policy_version 83640 (0.0007) [2023-03-07 09:25:08,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13004.8, 300 sec: 13027.4). Total num frames: 85648384. Throughput: 0: 12994.2. Samples: 85627686. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:25:08,367][155126] Avg episode reward: [(0, '2092.385')] [2023-03-07 09:25:09,100][155452] Updated weights for policy 0, policy_version 83650 (0.0007) [2023-03-07 09:25:09,889][155452] Updated weights for policy 0, policy_version 83660 (0.0007) [2023-03-07 09:25:10,683][155452] Updated weights for policy 0, policy_version 83670 (0.0006) [2023-03-07 09:25:11,468][155452] Updated weights for policy 0, policy_version 83680 (0.0006) [2023-03-07 09:25:12,239][155452] Updated weights for policy 0, policy_version 83690 (0.0006) [2023-03-07 09:25:13,019][155452] Updated weights for policy 0, policy_version 83700 (0.0007) [2023-03-07 09:25:13,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13004.8, 300 sec: 13023.9). Total num frames: 85712896. Throughput: 0: 13005.7. Samples: 85705750. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:25:13,367][155126] Avg episode reward: [(0, '2210.458')] [2023-03-07 09:25:13,826][155452] Updated weights for policy 0, policy_version 83710 (0.0007) [2023-03-07 09:25:14,615][155452] Updated weights for policy 0, policy_version 83720 (0.0006) [2023-03-07 09:25:15,377][155452] Updated weights for policy 0, policy_version 83730 (0.0005) [2023-03-07 09:25:16,187][155452] Updated weights for policy 0, policy_version 83740 (0.0006) [2023-03-07 09:25:16,973][155452] Updated weights for policy 0, policy_version 83750 (0.0006) [2023-03-07 09:25:17,759][155452] Updated weights for policy 0, policy_version 83760 (0.0006) [2023-03-07 09:25:18,367][155126] Fps is (10 sec: 12902.1, 60 sec: 12987.7, 300 sec: 13020.4). Total num frames: 85777408. Throughput: 0: 13008.7. Samples: 85744803. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:25:18,378][155126] Avg episode reward: [(0, '2098.146')] [2023-03-07 09:25:18,541][155452] Updated weights for policy 0, policy_version 83770 (0.0006) [2023-03-07 09:25:19,321][155452] Updated weights for policy 0, policy_version 83780 (0.0006) [2023-03-07 09:25:20,122][155452] Updated weights for policy 0, policy_version 83790 (0.0008) [2023-03-07 09:25:20,894][155452] Updated weights for policy 0, policy_version 83800 (0.0006) [2023-03-07 09:25:21,677][155452] Updated weights for policy 0, policy_version 83810 (0.0006) [2023-03-07 09:25:22,487][155452] Updated weights for policy 0, policy_version 83820 (0.0006) [2023-03-07 09:25:23,266][155452] Updated weights for policy 0, policy_version 83830 (0.0006) [2023-03-07 09:25:23,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13004.8, 300 sec: 13020.4). Total num frames: 85842944. Throughput: 0: 13000.0. Samples: 85822811. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:25:23,368][155126] Avg episode reward: [(0, '2234.901')] [2023-03-07 09:25:24,052][155452] Updated weights for policy 0, policy_version 83840 (0.0007) [2023-03-07 09:25:24,850][155452] Updated weights for policy 0, policy_version 83850 (0.0007) [2023-03-07 09:25:25,621][155452] Updated weights for policy 0, policy_version 83860 (0.0006) [2023-03-07 09:25:26,391][155452] Updated weights for policy 0, policy_version 83870 (0.0006) [2023-03-07 09:25:27,187][155452] Updated weights for policy 0, policy_version 83880 (0.0005) [2023-03-07 09:25:27,969][155452] Updated weights for policy 0, policy_version 83890 (0.0006) [2023-03-07 09:25:28,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13004.8, 300 sec: 13020.4). Total num frames: 85907456. Throughput: 0: 13004.4. Samples: 85901044. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:25:28,367][155126] Avg episode reward: [(0, '2289.610')] [2023-03-07 09:25:28,383][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000083895_85908480.pth... [2023-03-07 09:25:28,413][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000080843_82783232.pth [2023-03-07 09:25:28,758][155452] Updated weights for policy 0, policy_version 83900 (0.0006) [2023-03-07 09:25:29,537][155452] Updated weights for policy 0, policy_version 83910 (0.0006) [2023-03-07 09:25:30,337][155452] Updated weights for policy 0, policy_version 83920 (0.0006) [2023-03-07 09:25:31,125][155452] Updated weights for policy 0, policy_version 83930 (0.0007) [2023-03-07 09:25:31,925][155452] Updated weights for policy 0, policy_version 83940 (0.0007) [2023-03-07 09:25:32,723][155452] Updated weights for policy 0, policy_version 83950 (0.0006) [2023-03-07 09:25:33,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13004.8, 300 sec: 13020.4). Total num frames: 85972992. Throughput: 0: 13000.7. 
Samples: 85939919. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:25:33,367][155126] Avg episode reward: [(0, '2101.466')] [2023-03-07 09:25:33,491][155452] Updated weights for policy 0, policy_version 83960 (0.0006) [2023-03-07 09:25:34,274][155452] Updated weights for policy 0, policy_version 83970 (0.0005) [2023-03-07 09:25:35,074][155452] Updated weights for policy 0, policy_version 83980 (0.0007) [2023-03-07 09:25:35,845][155452] Updated weights for policy 0, policy_version 83990 (0.0006) [2023-03-07 09:25:36,638][155452] Updated weights for policy 0, policy_version 84000 (0.0007) [2023-03-07 09:25:37,436][155452] Updated weights for policy 0, policy_version 84010 (0.0006) [2023-03-07 09:25:38,211][155452] Updated weights for policy 0, policy_version 84020 (0.0006) [2023-03-07 09:25:38,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13016.9). Total num frames: 86037504. Throughput: 0: 12998.9. Samples: 86017961. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:25:38,367][155126] Avg episode reward: [(0, '2036.820')] [2023-03-07 09:25:39,023][155452] Updated weights for policy 0, policy_version 84030 (0.0006) [2023-03-07 09:25:39,793][155452] Updated weights for policy 0, policy_version 84040 (0.0006) [2023-03-07 09:25:40,613][155452] Updated weights for policy 0, policy_version 84050 (0.0008) [2023-03-07 09:25:41,371][155452] Updated weights for policy 0, policy_version 84060 (0.0006) [2023-03-07 09:25:42,178][155452] Updated weights for policy 0, policy_version 84070 (0.0006) [2023-03-07 09:25:42,961][155452] Updated weights for policy 0, policy_version 84080 (0.0006) [2023-03-07 09:25:43,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13020.4). Total num frames: 86103040. Throughput: 0: 12997.9. Samples: 86095874. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:25:43,367][155126] Avg episode reward: [(0, '2188.668')] [2023-03-07 09:25:43,752][155452] Updated weights for policy 0, policy_version 84090 (0.0006) [2023-03-07 09:25:44,527][155452] Updated weights for policy 0, policy_version 84100 (0.0007) [2023-03-07 09:25:45,306][155452] Updated weights for policy 0, policy_version 84110 (0.0006) [2023-03-07 09:25:46,105][155452] Updated weights for policy 0, policy_version 84120 (0.0006) [2023-03-07 09:25:46,870][155452] Updated weights for policy 0, policy_version 84130 (0.0006) [2023-03-07 09:25:47,666][155452] Updated weights for policy 0, policy_version 84140 (0.0006) [2023-03-07 09:25:48,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 86168576. Throughput: 0: 13009.2. Samples: 86134996. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:25:48,367][155126] Avg episode reward: [(0, '2127.468')] [2023-03-07 09:25:48,445][155452] Updated weights for policy 0, policy_version 84150 (0.0006) [2023-03-07 09:25:49,233][155452] Updated weights for policy 0, policy_version 84160 (0.0006) [2023-03-07 09:25:50,026][155452] Updated weights for policy 0, policy_version 84170 (0.0007) [2023-03-07 09:25:50,820][155452] Updated weights for policy 0, policy_version 84180 (0.0007) [2023-03-07 09:25:51,605][155452] Updated weights for policy 0, policy_version 84190 (0.0005) [2023-03-07 09:25:52,394][155452] Updated weights for policy 0, policy_version 84200 (0.0005) [2023-03-07 09:25:53,181][155452] Updated weights for policy 0, policy_version 84210 (0.0006) [2023-03-07 09:25:53,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13004.8, 300 sec: 13016.9). Total num frames: 86233088. Throughput: 0: 13006.7. Samples: 86212991. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:25:53,368][155126] Avg episode reward: [(0, '2163.273')] [2023-03-07 09:25:53,957][155452] Updated weights for policy 0, policy_version 84220 (0.0006) [2023-03-07 09:25:54,746][155452] Updated weights for policy 0, policy_version 84230 (0.0006) [2023-03-07 09:25:55,537][155452] Updated weights for policy 0, policy_version 84240 (0.0006) [2023-03-07 09:25:56,337][155452] Updated weights for policy 0, policy_version 84250 (0.0007) [2023-03-07 09:25:57,116][155452] Updated weights for policy 0, policy_version 84260 (0.0006) [2023-03-07 09:25:57,912][155452] Updated weights for policy 0, policy_version 84270 (0.0007) [2023-03-07 09:25:58,367][155126] Fps is (10 sec: 12902.4, 60 sec: 13004.8, 300 sec: 13017.0). Total num frames: 86297600. Throughput: 0: 13003.0. Samples: 86290886. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:25:58,367][155126] Avg episode reward: [(0, '2099.167')] [2023-03-07 09:25:58,724][155452] Updated weights for policy 0, policy_version 84280 (0.0006) [2023-03-07 09:25:59,513][155452] Updated weights for policy 0, policy_version 84290 (0.0006) [2023-03-07 09:26:00,309][155452] Updated weights for policy 0, policy_version 84300 (0.0007) [2023-03-07 09:26:01,082][155452] Updated weights for policy 0, policy_version 84310 (0.0005) [2023-03-07 09:26:01,878][155452] Updated weights for policy 0, policy_version 84320 (0.0005) [2023-03-07 09:26:02,672][155452] Updated weights for policy 0, policy_version 84330 (0.0006) [2023-03-07 09:26:03,367][155126] Fps is (10 sec: 12902.4, 60 sec: 12987.7, 300 sec: 13013.5). Total num frames: 86362112. Throughput: 0: 12995.7. Samples: 86329609. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:26:03,378][155126] Avg episode reward: [(0, '2392.891')] [2023-03-07 09:26:03,457][155452] Updated weights for policy 0, policy_version 84340 (0.0006) [2023-03-07 09:26:04,246][155452] Updated weights for policy 0, policy_version 84350 (0.0006) [2023-03-07 09:26:05,021][155452] Updated weights for policy 0, policy_version 84360 (0.0006) [2023-03-07 09:26:05,813][155452] Updated weights for policy 0, policy_version 84370 (0.0006) [2023-03-07 09:26:06,612][155452] Updated weights for policy 0, policy_version 84380 (0.0007) [2023-03-07 09:26:07,374][155452] Updated weights for policy 0, policy_version 84390 (0.0005) [2023-03-07 09:26:08,151][155452] Updated weights for policy 0, policy_version 84400 (0.0006) [2023-03-07 09:26:08,367][155126] Fps is (10 sec: 13004.8, 60 sec: 12987.7, 300 sec: 13013.5). Total num frames: 86427648. Throughput: 0: 12997.2. Samples: 86407684. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:26:08,367][155126] Avg episode reward: [(0, '2178.977')] [2023-03-07 09:26:08,959][155452] Updated weights for policy 0, policy_version 84410 (0.0006) [2023-03-07 09:26:09,752][155452] Updated weights for policy 0, policy_version 84420 (0.0006) [2023-03-07 09:26:10,525][155452] Updated weights for policy 0, policy_version 84430 (0.0006) [2023-03-07 09:26:11,315][155452] Updated weights for policy 0, policy_version 84440 (0.0006) [2023-03-07 09:26:12,115][155452] Updated weights for policy 0, policy_version 84450 (0.0007) [2023-03-07 09:26:12,889][155452] Updated weights for policy 0, policy_version 84460 (0.0006) [2023-03-07 09:26:13,367][155126] Fps is (10 sec: 13107.5, 60 sec: 13004.8, 300 sec: 13016.9). Total num frames: 86493184. Throughput: 0: 12996.1. Samples: 86485868. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:26:13,367][155126] Avg episode reward: [(0, '2085.361')] [2023-03-07 09:26:13,689][155452] Updated weights for policy 0, policy_version 84470 (0.0006) [2023-03-07 09:26:14,469][155452] Updated weights for policy 0, policy_version 84480 (0.0007) [2023-03-07 09:26:15,242][155452] Updated weights for policy 0, policy_version 84490 (0.0006) [2023-03-07 09:26:16,032][155452] Updated weights for policy 0, policy_version 84500 (0.0006) [2023-03-07 09:26:16,814][155452] Updated weights for policy 0, policy_version 84510 (0.0006) [2023-03-07 09:26:17,623][155452] Updated weights for policy 0, policy_version 84520 (0.0007) [2023-03-07 09:26:18,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13016.9). Total num frames: 86557696. Throughput: 0: 13002.2. Samples: 86525020. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:26:18,367][155126] Avg episode reward: [(0, '2243.381')] [2023-03-07 09:26:18,385][155452] Updated weights for policy 0, policy_version 84530 (0.0006) [2023-03-07 09:26:19,179][155452] Updated weights for policy 0, policy_version 84540 (0.0006) [2023-03-07 09:26:19,958][155452] Updated weights for policy 0, policy_version 84550 (0.0006) [2023-03-07 09:26:20,738][155452] Updated weights for policy 0, policy_version 84560 (0.0006) [2023-03-07 09:26:21,512][155452] Updated weights for policy 0, policy_version 84570 (0.0006) [2023-03-07 09:26:22,315][155452] Updated weights for policy 0, policy_version 84580 (0.0007) [2023-03-07 09:26:23,107][155452] Updated weights for policy 0, policy_version 84590 (0.0006) [2023-03-07 09:26:23,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13004.8, 300 sec: 13016.9). Total num frames: 86623232. Throughput: 0: 13002.4. Samples: 86603067. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:26:23,378][155126] Avg episode reward: [(0, '2083.997')] [2023-03-07 09:26:23,894][155452] Updated weights for policy 0, policy_version 84600 (0.0006) [2023-03-07 09:26:24,686][155452] Updated weights for policy 0, policy_version 84610 (0.0006) [2023-03-07 09:26:25,465][155452] Updated weights for policy 0, policy_version 84620 (0.0007) [2023-03-07 09:26:26,250][155452] Updated weights for policy 0, policy_version 84630 (0.0006) [2023-03-07 09:26:27,040][155452] Updated weights for policy 0, policy_version 84640 (0.0007) [2023-03-07 09:26:27,828][155452] Updated weights for policy 0, policy_version 84650 (0.0006) [2023-03-07 09:26:28,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13004.8, 300 sec: 13016.9). Total num frames: 86687744. Throughput: 0: 13008.0. Samples: 86681234. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:26:28,378][155126] Avg episode reward: [(0, '2074.061')] [2023-03-07 09:26:28,614][155452] Updated weights for policy 0, policy_version 84660 (0.0006) [2023-03-07 09:26:29,407][155452] Updated weights for policy 0, policy_version 84670 (0.0006) [2023-03-07 09:26:30,204][155452] Updated weights for policy 0, policy_version 84680 (0.0006) [2023-03-07 09:26:30,979][155452] Updated weights for policy 0, policy_version 84690 (0.0006) [2023-03-07 09:26:31,770][155452] Updated weights for policy 0, policy_version 84700 (0.0007) [2023-03-07 09:26:32,553][155452] Updated weights for policy 0, policy_version 84710 (0.0006) [2023-03-07 09:26:33,350][155452] Updated weights for policy 0, policy_version 84720 (0.0006) [2023-03-07 09:26:33,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13004.8, 300 sec: 13017.0). Total num frames: 86753280. Throughput: 0: 13003.5. Samples: 86720155. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:26:33,367][155126] Avg episode reward: [(0, '2115.301')] [2023-03-07 09:26:34,149][155452] Updated weights for policy 0, policy_version 84730 (0.0007) [2023-03-07 09:26:34,925][155452] Updated weights for policy 0, policy_version 84740 (0.0006) [2023-03-07 09:26:35,705][155452] Updated weights for policy 0, policy_version 84750 (0.0007) [2023-03-07 09:26:36,489][155452] Updated weights for policy 0, policy_version 84760 (0.0006) [2023-03-07 09:26:37,294][155452] Updated weights for policy 0, policy_version 84770 (0.0006) [2023-03-07 09:26:38,065][155452] Updated weights for policy 0, policy_version 84780 (0.0006) [2023-03-07 09:26:38,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13004.8, 300 sec: 13013.5). Total num frames: 86817792. Throughput: 0: 13002.4. Samples: 86798096. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:26:38,367][155126] Avg episode reward: [(0, '2289.147')] [2023-03-07 09:26:38,872][155452] Updated weights for policy 0, policy_version 84790 (0.0007) [2023-03-07 09:26:39,654][155452] Updated weights for policy 0, policy_version 84800 (0.0006) [2023-03-07 09:26:40,445][155452] Updated weights for policy 0, policy_version 84810 (0.0006) [2023-03-07 09:26:41,226][155452] Updated weights for policy 0, policy_version 84820 (0.0006) [2023-03-07 09:26:42,013][155452] Updated weights for policy 0, policy_version 84830 (0.0006) [2023-03-07 09:26:42,784][155452] Updated weights for policy 0, policy_version 84840 (0.0006) [2023-03-07 09:26:43,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13004.8, 300 sec: 13016.9). Total num frames: 86883328. Throughput: 0: 13009.6. Samples: 86876319. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:26:43,368][155126] Avg episode reward: [(0, '2139.411')] [2023-03-07 09:26:43,575][155452] Updated weights for policy 0, policy_version 84850 (0.0006) [2023-03-07 09:26:44,360][155452] Updated weights for policy 0, policy_version 84860 (0.0006) [2023-03-07 09:26:45,133][155452] Updated weights for policy 0, policy_version 84870 (0.0006) [2023-03-07 09:26:45,928][155452] Updated weights for policy 0, policy_version 84880 (0.0006) [2023-03-07 09:26:46,703][155452] Updated weights for policy 0, policy_version 84890 (0.0006) [2023-03-07 09:26:47,478][155452] Updated weights for policy 0, policy_version 84900 (0.0007) [2023-03-07 09:26:48,269][155452] Updated weights for policy 0, policy_version 84910 (0.0006) [2023-03-07 09:26:48,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13004.8, 300 sec: 13016.9). Total num frames: 86948864. Throughput: 0: 13018.6. Samples: 86915446. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:26:48,378][155126] Avg episode reward: [(0, '1994.978')] [2023-03-07 09:26:49,065][155452] Updated weights for policy 0, policy_version 84920 (0.0006) [2023-03-07 09:26:49,840][155452] Updated weights for policy 0, policy_version 84930 (0.0006) [2023-03-07 09:26:50,620][155452] Updated weights for policy 0, policy_version 84940 (0.0007) [2023-03-07 09:26:51,423][155452] Updated weights for policy 0, policy_version 84950 (0.0007) [2023-03-07 09:26:52,214][155452] Updated weights for policy 0, policy_version 84960 (0.0006) [2023-03-07 09:26:52,992][155452] Updated weights for policy 0, policy_version 84970 (0.0006) [2023-03-07 09:26:53,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13017.0). Total num frames: 87013376. Throughput: 0: 13023.2. Samples: 86993726. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:26:53,367][155126] Avg episode reward: [(0, '2010.989')] [2023-03-07 09:26:53,807][155452] Updated weights for policy 0, policy_version 84980 (0.0006) [2023-03-07 09:26:54,594][155452] Updated weights for policy 0, policy_version 84990 (0.0006) [2023-03-07 09:26:55,386][155452] Updated weights for policy 0, policy_version 85000 (0.0007) [2023-03-07 09:26:56,187][155452] Updated weights for policy 0, policy_version 85010 (0.0006) [2023-03-07 09:26:56,961][155452] Updated weights for policy 0, policy_version 85020 (0.0005) [2023-03-07 09:26:57,752][155452] Updated weights for policy 0, policy_version 85030 (0.0007) [2023-03-07 09:26:58,367][155126] Fps is (10 sec: 12902.5, 60 sec: 13004.8, 300 sec: 13013.5). Total num frames: 87077888. Throughput: 0: 13010.0. Samples: 87071319. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:26:58,367][155126] Avg episode reward: [(0, '1874.664')] [2023-03-07 09:26:58,550][155452] Updated weights for policy 0, policy_version 85040 (0.0006) [2023-03-07 09:26:59,327][155452] Updated weights for policy 0, policy_version 85050 (0.0006) [2023-03-07 09:27:00,131][155452] Updated weights for policy 0, policy_version 85060 (0.0006) [2023-03-07 09:27:00,903][155452] Updated weights for policy 0, policy_version 85070 (0.0007) [2023-03-07 09:27:01,703][155452] Updated weights for policy 0, policy_version 85080 (0.0006) [2023-03-07 09:27:02,454][155452] Updated weights for policy 0, policy_version 85090 (0.0006) [2023-03-07 09:27:03,250][155452] Updated weights for policy 0, policy_version 85100 (0.0006) [2023-03-07 09:27:03,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13016.9). Total num frames: 87143424. Throughput: 0: 13005.3. Samples: 87110260. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:27:03,367][155126] Avg episode reward: [(0, '1935.800')] [2023-03-07 09:27:04,035][155452] Updated weights for policy 0, policy_version 85110 (0.0006) [2023-03-07 09:27:04,813][155452] Updated weights for policy 0, policy_version 85120 (0.0006) [2023-03-07 09:27:05,604][155452] Updated weights for policy 0, policy_version 85130 (0.0006) [2023-03-07 09:27:06,382][155452] Updated weights for policy 0, policy_version 85140 (0.0006) [2023-03-07 09:27:07,161][155452] Updated weights for policy 0, policy_version 85150 (0.0006) [2023-03-07 09:27:07,955][155452] Updated weights for policy 0, policy_version 85160 (0.0006) [2023-03-07 09:27:08,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13021.9, 300 sec: 13016.9). Total num frames: 87208960. Throughput: 0: 13013.5. Samples: 87188675. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:27:08,367][155126] Avg episode reward: [(0, '2131.637')] [2023-03-07 09:27:08,720][155452] Updated weights for policy 0, policy_version 85170 (0.0006) [2023-03-07 09:27:09,517][155452] Updated weights for policy 0, policy_version 85180 (0.0006) [2023-03-07 09:27:10,304][155452] Updated weights for policy 0, policy_version 85190 (0.0006) [2023-03-07 09:27:11,090][155452] Updated weights for policy 0, policy_version 85200 (0.0007) [2023-03-07 09:27:11,871][155452] Updated weights for policy 0, policy_version 85210 (0.0006) [2023-03-07 09:27:12,669][155452] Updated weights for policy 0, policy_version 85220 (0.0006) [2023-03-07 09:27:13,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13017.0). Total num frames: 87273472. Throughput: 0: 13013.4. Samples: 87266834. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:27:13,367][155126] Avg episode reward: [(0, '2055.431')] [2023-03-07 09:27:13,457][155452] Updated weights for policy 0, policy_version 85230 (0.0006) [2023-03-07 09:27:14,250][155452] Updated weights for policy 0, policy_version 85240 (0.0006) [2023-03-07 09:27:15,013][155452] Updated weights for policy 0, policy_version 85250 (0.0006) [2023-03-07 09:27:15,811][155452] Updated weights for policy 0, policy_version 85260 (0.0006) [2023-03-07 09:27:16,573][155452] Updated weights for policy 0, policy_version 85270 (0.0006) [2023-03-07 09:27:17,364][155452] Updated weights for policy 0, policy_version 85280 (0.0007) [2023-03-07 09:27:18,138][155452] Updated weights for policy 0, policy_version 85290 (0.0006) [2023-03-07 09:27:18,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13021.8, 300 sec: 13016.9). Total num frames: 87339008. Throughput: 0: 13024.5. Samples: 87306258. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:27:18,378][155126] Avg episode reward: [(0, '2073.158')] [2023-03-07 09:27:18,925][155452] Updated weights for policy 0, policy_version 85300 (0.0006) [2023-03-07 09:27:19,715][155452] Updated weights for policy 0, policy_version 85310 (0.0006) [2023-03-07 09:27:20,510][155452] Updated weights for policy 0, policy_version 85320 (0.0006) [2023-03-07 09:27:21,301][155452] Updated weights for policy 0, policy_version 85330 (0.0006) [2023-03-07 09:27:22,105][155452] Updated weights for policy 0, policy_version 85340 (0.0007) [2023-03-07 09:27:22,883][155452] Updated weights for policy 0, policy_version 85350 (0.0006) [2023-03-07 09:27:23,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13021.9, 300 sec: 13016.9). Total num frames: 87404544. Throughput: 0: 13026.6. Samples: 87384291. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:27:23,367][155126] Avg episode reward: [(0, '2133.523')] [2023-03-07 09:27:23,661][155452] Updated weights for policy 0, policy_version 85360 (0.0006) [2023-03-07 09:27:24,458][155452] Updated weights for policy 0, policy_version 85370 (0.0006) [2023-03-07 09:27:25,244][155452] Updated weights for policy 0, policy_version 85380 (0.0006) [2023-03-07 09:27:26,031][155452] Updated weights for policy 0, policy_version 85390 (0.0006) [2023-03-07 09:27:26,818][155452] Updated weights for policy 0, policy_version 85400 (0.0006) [2023-03-07 09:27:27,600][155452] Updated weights for policy 0, policy_version 85410 (0.0006) [2023-03-07 09:27:28,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13016.9). Total num frames: 87469056. Throughput: 0: 13022.4. Samples: 87462327. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:27:28,368][155126] Avg episode reward: [(0, '2088.681')] [2023-03-07 09:27:28,379][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000085420_87470080.pth... 
[2023-03-07 09:27:28,380][155452] Updated weights for policy 0, policy_version 85420 (0.0006) [2023-03-07 09:27:28,412][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000082370_84346880.pth [2023-03-07 09:27:29,158][155452] Updated weights for policy 0, policy_version 85430 (0.0007) [2023-03-07 09:27:29,952][155452] Updated weights for policy 0, policy_version 85440 (0.0006) [2023-03-07 09:27:30,733][155452] Updated weights for policy 0, policy_version 85450 (0.0008) [2023-03-07 09:27:31,537][155452] Updated weights for policy 0, policy_version 85460 (0.0006) [2023-03-07 09:27:32,327][155452] Updated weights for policy 0, policy_version 85470 (0.0007) [2023-03-07 09:27:33,116][155452] Updated weights for policy 0, policy_version 85480 (0.0006) [2023-03-07 09:27:33,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.8, 300 sec: 13017.0). Total num frames: 87534592. Throughput: 0: 13024.1. Samples: 87501530. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:27:33,367][155126] Avg episode reward: [(0, '2108.801')] [2023-03-07 09:27:33,908][155452] Updated weights for policy 0, policy_version 85490 (0.0006) [2023-03-07 09:27:34,680][155452] Updated weights for policy 0, policy_version 85500 (0.0007) [2023-03-07 09:27:35,458][155452] Updated weights for policy 0, policy_version 85510 (0.0006) [2023-03-07 09:27:36,230][155452] Updated weights for policy 0, policy_version 85520 (0.0006) [2023-03-07 09:27:37,018][155452] Updated weights for policy 0, policy_version 85530 (0.0006) [2023-03-07 09:27:37,815][155452] Updated weights for policy 0, policy_version 85540 (0.0006) [2023-03-07 09:27:38,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13021.9, 300 sec: 13013.5). Total num frames: 87599104. Throughput: 0: 13021.1. Samples: 87579674. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:27:38,367][155126] Avg episode reward: [(0, '2026.102')] [2023-03-07 09:27:38,599][155452] Updated weights for policy 0, policy_version 85550 (0.0006) [2023-03-07 09:27:39,379][155452] Updated weights for policy 0, policy_version 85560 (0.0006) [2023-03-07 09:27:40,153][155452] Updated weights for policy 0, policy_version 85570 (0.0006) [2023-03-07 09:27:40,960][155452] Updated weights for policy 0, policy_version 85580 (0.0006) [2023-03-07 09:27:41,761][155452] Updated weights for policy 0, policy_version 85590 (0.0007) [2023-03-07 09:27:42,542][155452] Updated weights for policy 0, policy_version 85600 (0.0006) [2023-03-07 09:27:43,316][155452] Updated weights for policy 0, policy_version 85610 (0.0006) [2023-03-07 09:27:43,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13017.0). Total num frames: 87664640. Throughput: 0: 13031.0. Samples: 87657714. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:27:43,367][155126] Avg episode reward: [(0, '2009.496')] [2023-03-07 09:27:44,089][155452] Updated weights for policy 0, policy_version 85620 (0.0006) [2023-03-07 09:27:44,872][155452] Updated weights for policy 0, policy_version 85630 (0.0007) [2023-03-07 09:27:45,653][155452] Updated weights for policy 0, policy_version 85640 (0.0006) [2023-03-07 09:27:46,444][155452] Updated weights for policy 0, policy_version 85650 (0.0006) [2023-03-07 09:27:47,219][155452] Updated weights for policy 0, policy_version 85660 (0.0006) [2023-03-07 09:27:47,993][155452] Updated weights for policy 0, policy_version 85670 (0.0006) [2023-03-07 09:27:48,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13021.9, 300 sec: 13016.9). Total num frames: 87730176. Throughput: 0: 13041.9. Samples: 87697147. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:27:48,367][155126] Avg episode reward: [(0, '2002.614')] [2023-03-07 09:27:48,797][155452] Updated weights for policy 0, policy_version 85680 (0.0006) [2023-03-07 09:27:49,570][155452] Updated weights for policy 0, policy_version 85690 (0.0007) [2023-03-07 09:27:50,353][155452] Updated weights for policy 0, policy_version 85700 (0.0006) [2023-03-07 09:27:51,145][155452] Updated weights for policy 0, policy_version 85710 (0.0006) [2023-03-07 09:27:51,917][155452] Updated weights for policy 0, policy_version 85720 (0.0005) [2023-03-07 09:27:52,710][155452] Updated weights for policy 0, policy_version 85730 (0.0006) [2023-03-07 09:27:53,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13016.9). Total num frames: 87795712. Throughput: 0: 13043.2. Samples: 87775618. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:27:53,367][155126] Avg episode reward: [(0, '1984.104')] [2023-03-07 09:27:53,492][155452] Updated weights for policy 0, policy_version 85740 (0.0006) [2023-03-07 09:27:54,277][155452] Updated weights for policy 0, policy_version 85750 (0.0007) [2023-03-07 09:27:55,082][155452] Updated weights for policy 0, policy_version 85760 (0.0006) [2023-03-07 09:27:55,864][155452] Updated weights for policy 0, policy_version 85770 (0.0006) [2023-03-07 09:27:56,649][155452] Updated weights for policy 0, policy_version 85780 (0.0006) [2023-03-07 09:27:57,427][155452] Updated weights for policy 0, policy_version 85790 (0.0006) [2023-03-07 09:27:58,204][155452] Updated weights for policy 0, policy_version 85800 (0.0006) [2023-03-07 09:27:58,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13056.0, 300 sec: 13016.9). Total num frames: 87861248. Throughput: 0: 13043.5. Samples: 87853792. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:27:58,368][155126] Avg episode reward: [(0, '1991.845')] [2023-03-07 09:27:58,985][155452] Updated weights for policy 0, policy_version 85810 (0.0006) [2023-03-07 09:27:59,781][155452] Updated weights for policy 0, policy_version 85820 (0.0006) [2023-03-07 09:28:00,566][155452] Updated weights for policy 0, policy_version 85830 (0.0006) [2023-03-07 09:28:01,346][155452] Updated weights for policy 0, policy_version 85840 (0.0006) [2023-03-07 09:28:02,137][155452] Updated weights for policy 0, policy_version 85850 (0.0006) [2023-03-07 09:28:02,928][155452] Updated weights for policy 0, policy_version 85860 (0.0006) [2023-03-07 09:28:03,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13017.0). Total num frames: 87925760. Throughput: 0: 13037.3. Samples: 87892934. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:28:03,367][155126] Avg episode reward: [(0, '2054.545')] [2023-03-07 09:28:03,721][155452] Updated weights for policy 0, policy_version 85870 (0.0006) [2023-03-07 09:28:04,496][155452] Updated weights for policy 0, policy_version 85880 (0.0007) [2023-03-07 09:28:05,291][155452] Updated weights for policy 0, policy_version 85890 (0.0006) [2023-03-07 09:28:06,077][155452] Updated weights for policy 0, policy_version 85900 (0.0006) [2023-03-07 09:28:06,850][155452] Updated weights for policy 0, policy_version 85910 (0.0006) [2023-03-07 09:28:07,635][155452] Updated weights for policy 0, policy_version 85920 (0.0006) [2023-03-07 09:28:08,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13038.9, 300 sec: 13020.4). Total num frames: 87991296. Throughput: 0: 13041.2. Samples: 87971144. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:28:08,368][155126] Avg episode reward: [(0, '2025.427')] [2023-03-07 09:28:08,434][155452] Updated weights for policy 0, policy_version 85930 (0.0006) [2023-03-07 09:28:09,225][155452] Updated weights for policy 0, policy_version 85940 (0.0006) [2023-03-07 09:28:10,004][155452] Updated weights for policy 0, policy_version 85950 (0.0006) [2023-03-07 09:28:10,780][155452] Updated weights for policy 0, policy_version 85960 (0.0006) [2023-03-07 09:28:11,573][155452] Updated weights for policy 0, policy_version 85970 (0.0006) [2023-03-07 09:28:12,355][155452] Updated weights for policy 0, policy_version 85980 (0.0006) [2023-03-07 09:28:13,151][155452] Updated weights for policy 0, policy_version 85990 (0.0007) [2023-03-07 09:28:13,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13016.9). Total num frames: 88055808. Throughput: 0: 13040.8. Samples: 88049164. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:28:13,368][155126] Avg episode reward: [(0, '2285.721')] [2023-03-07 09:28:13,949][155452] Updated weights for policy 0, policy_version 86000 (0.0007) [2023-03-07 09:28:14,753][155452] Updated weights for policy 0, policy_version 86010 (0.0007) [2023-03-07 09:28:15,510][155452] Updated weights for policy 0, policy_version 86020 (0.0006) [2023-03-07 09:28:16,291][155452] Updated weights for policy 0, policy_version 86030 (0.0007) [2023-03-07 09:28:17,089][155452] Updated weights for policy 0, policy_version 86040 (0.0006) [2023-03-07 09:28:17,863][155452] Updated weights for policy 0, policy_version 86050 (0.0006) [2023-03-07 09:28:18,367][155126] Fps is (10 sec: 13004.5, 60 sec: 13038.9, 300 sec: 13016.9). Total num frames: 88121344. Throughput: 0: 13032.7. Samples: 88088003. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:28:18,368][155126] Avg episode reward: [(0, '2098.726')] [2023-03-07 09:28:18,652][155452] Updated weights for policy 0, policy_version 86060 (0.0006) [2023-03-07 09:28:19,446][155452] Updated weights for policy 0, policy_version 86070 (0.0006) [2023-03-07 09:28:20,251][155452] Updated weights for policy 0, policy_version 86080 (0.0006) [2023-03-07 09:28:21,021][155452] Updated weights for policy 0, policy_version 86090 (0.0007) [2023-03-07 09:28:21,821][155452] Updated weights for policy 0, policy_version 86100 (0.0007) [2023-03-07 09:28:22,605][155452] Updated weights for policy 0, policy_version 86110 (0.0006) [2023-03-07 09:28:23,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13017.0). Total num frames: 88185856. Throughput: 0: 13036.9. Samples: 88166336. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:28:23,367][155126] Avg episode reward: [(0, '2049.823')] [2023-03-07 09:28:23,381][155452] Updated weights for policy 0, policy_version 86120 (0.0006) [2023-03-07 09:28:24,171][155452] Updated weights for policy 0, policy_version 86130 (0.0006) [2023-03-07 09:28:24,951][155452] Updated weights for policy 0, policy_version 86140 (0.0006) [2023-03-07 09:28:25,738][155452] Updated weights for policy 0, policy_version 86150 (0.0006) [2023-03-07 09:28:26,525][155452] Updated weights for policy 0, policy_version 86160 (0.0006) [2023-03-07 09:28:27,301][155452] Updated weights for policy 0, policy_version 86170 (0.0006) [2023-03-07 09:28:28,078][155452] Updated weights for policy 0, policy_version 86180 (0.0007) [2023-03-07 09:28:28,367][155126] Fps is (10 sec: 13005.1, 60 sec: 13039.0, 300 sec: 13017.0). Total num frames: 88251392. Throughput: 0: 13042.1. Samples: 88244610. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:28:28,367][155126] Avg episode reward: [(0, '2155.996')] [2023-03-07 09:28:28,864][155452] Updated weights for policy 0, policy_version 86190 (0.0007) [2023-03-07 09:28:29,663][155452] Updated weights for policy 0, policy_version 86200 (0.0006) [2023-03-07 09:28:30,449][155452] Updated weights for policy 0, policy_version 86210 (0.0006) [2023-03-07 09:28:31,229][155452] Updated weights for policy 0, policy_version 86220 (0.0006) [2023-03-07 09:28:32,017][155452] Updated weights for policy 0, policy_version 86230 (0.0006) [2023-03-07 09:28:32,797][155452] Updated weights for policy 0, policy_version 86240 (0.0006) [2023-03-07 09:28:33,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13039.0, 300 sec: 13020.4). Total num frames: 88316928. Throughput: 0: 13033.3. Samples: 88283646. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:28:33,378][155126] Avg episode reward: [(0, '2132.327')] [2023-03-07 09:28:33,600][155452] Updated weights for policy 0, policy_version 86250 (0.0006) [2023-03-07 09:28:34,386][155452] Updated weights for policy 0, policy_version 86260 (0.0006) [2023-03-07 09:28:35,174][155452] Updated weights for policy 0, policy_version 86270 (0.0006) [2023-03-07 09:28:35,959][155452] Updated weights for policy 0, policy_version 86280 (0.0006) [2023-03-07 09:28:36,754][155452] Updated weights for policy 0, policy_version 86290 (0.0006) [2023-03-07 09:28:37,539][155452] Updated weights for policy 0, policy_version 86300 (0.0006) [2023-03-07 09:28:38,325][155452] Updated weights for policy 0, policy_version 86310 (0.0006) [2023-03-07 09:28:38,367][155126] Fps is (10 sec: 13004.5, 60 sec: 13038.9, 300 sec: 13016.9). Total num frames: 88381440. Throughput: 0: 13021.9. Samples: 88361607. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:28:38,378][155126] Avg episode reward: [(0, '2290.126')] [2023-03-07 09:28:39,111][155452] Updated weights for policy 0, policy_version 86320 (0.0006) [2023-03-07 09:28:39,898][155452] Updated weights for policy 0, policy_version 86330 (0.0006) [2023-03-07 09:28:40,673][155452] Updated weights for policy 0, policy_version 86340 (0.0007) [2023-03-07 09:28:41,463][155452] Updated weights for policy 0, policy_version 86350 (0.0006) [2023-03-07 09:28:42,265][155452] Updated weights for policy 0, policy_version 86360 (0.0006) [2023-03-07 09:28:43,054][155452] Updated weights for policy 0, policy_version 86370 (0.0006) [2023-03-07 09:28:43,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13017.0). Total num frames: 88446976. Throughput: 0: 13019.0. Samples: 88439645. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:28:43,367][155126] Avg episode reward: [(0, '2156.847')] [2023-03-07 09:28:43,814][155452] Updated weights for policy 0, policy_version 86380 (0.0006) [2023-03-07 09:28:44,634][155452] Updated weights for policy 0, policy_version 86390 (0.0005) [2023-03-07 09:28:45,409][155452] Updated weights for policy 0, policy_version 86400 (0.0006) [2023-03-07 09:28:46,195][155452] Updated weights for policy 0, policy_version 86410 (0.0006) [2023-03-07 09:28:46,980][155452] Updated weights for policy 0, policy_version 86420 (0.0007) [2023-03-07 09:28:47,768][155452] Updated weights for policy 0, policy_version 86430 (0.0006) [2023-03-07 09:28:48,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13021.9, 300 sec: 13016.9). Total num frames: 88511488. Throughput: 0: 13019.5. Samples: 88478810. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:28:48,367][155126] Avg episode reward: [(0, '2218.468')] [2023-03-07 09:28:48,551][155452] Updated weights for policy 0, policy_version 86440 (0.0006) [2023-03-07 09:28:49,347][155452] Updated weights for policy 0, policy_version 86450 (0.0007) [2023-03-07 09:28:50,112][155452] Updated weights for policy 0, policy_version 86460 (0.0006) [2023-03-07 09:28:50,893][155452] Updated weights for policy 0, policy_version 86470 (0.0007) [2023-03-07 09:28:51,681][155452] Updated weights for policy 0, policy_version 86480 (0.0006) [2023-03-07 09:28:52,463][155452] Updated weights for policy 0, policy_version 86490 (0.0006) [2023-03-07 09:28:53,257][155452] Updated weights for policy 0, policy_version 86500 (0.0006) [2023-03-07 09:28:53,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13016.9). Total num frames: 88577024. Throughput: 0: 13016.1. Samples: 88556871. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:28:53,367][155126] Avg episode reward: [(0, '2124.510')] [2023-03-07 09:28:54,041][155452] Updated weights for policy 0, policy_version 86510 (0.0006) [2023-03-07 09:28:54,821][155452] Updated weights for policy 0, policy_version 86520 (0.0007) [2023-03-07 09:28:55,621][155452] Updated weights for policy 0, policy_version 86530 (0.0006) [2023-03-07 09:28:56,413][155452] Updated weights for policy 0, policy_version 86540 (0.0006) [2023-03-07 09:28:57,214][155452] Updated weights for policy 0, policy_version 86550 (0.0006) [2023-03-07 09:28:57,989][155452] Updated weights for policy 0, policy_version 86560 (0.0007) [2023-03-07 09:28:58,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13004.8, 300 sec: 13013.5). Total num frames: 88641536. Throughput: 0: 13014.3. Samples: 88634806. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:28:58,367][155126] Avg episode reward: [(0, '2295.133')] [2023-03-07 09:28:58,789][155452] Updated weights for policy 0, policy_version 86570 (0.0006) [2023-03-07 09:28:59,570][155452] Updated weights for policy 0, policy_version 86580 (0.0006) [2023-03-07 09:29:00,374][155452] Updated weights for policy 0, policy_version 86590 (0.0006) [2023-03-07 09:29:01,165][155452] Updated weights for policy 0, policy_version 86600 (0.0006) [2023-03-07 09:29:01,958][155452] Updated weights for policy 0, policy_version 86610 (0.0006) [2023-03-07 09:29:02,750][155452] Updated weights for policy 0, policy_version 86620 (0.0007) [2023-03-07 09:29:03,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13021.9, 300 sec: 13013.5). Total num frames: 88707072. Throughput: 0: 13013.9. Samples: 88673625. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:29:03,367][155126] Avg episode reward: [(0, '2112.214')] [2023-03-07 09:29:03,532][155452] Updated weights for policy 0, policy_version 86630 (0.0007) [2023-03-07 09:29:04,314][155452] Updated weights for policy 0, policy_version 86640 (0.0006) [2023-03-07 09:29:05,117][155452] Updated weights for policy 0, policy_version 86650 (0.0006) [2023-03-07 09:29:05,907][155452] Updated weights for policy 0, policy_version 86660 (0.0007) [2023-03-07 09:29:06,689][155452] Updated weights for policy 0, policy_version 86670 (0.0007) [2023-03-07 09:29:07,473][155452] Updated weights for policy 0, policy_version 86680 (0.0006) [2023-03-07 09:29:08,250][155452] Updated weights for policy 0, policy_version 86690 (0.0006) [2023-03-07 09:29:08,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13004.8, 300 sec: 13013.5). Total num frames: 88771584. Throughput: 0: 13002.3. Samples: 88751441. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:29:08,367][155126] Avg episode reward: [(0, '2231.543')] [2023-03-07 09:29:09,038][155452] Updated weights for policy 0, policy_version 86700 (0.0006) [2023-03-07 09:29:09,821][155452] Updated weights for policy 0, policy_version 86710 (0.0006) [2023-03-07 09:29:10,600][155452] Updated weights for policy 0, policy_version 86720 (0.0006) [2023-03-07 09:29:11,374][155452] Updated weights for policy 0, policy_version 86730 (0.0006) [2023-03-07 09:29:12,169][155452] Updated weights for policy 0, policy_version 86740 (0.0006) [2023-03-07 09:29:12,948][155452] Updated weights for policy 0, policy_version 86750 (0.0006) [2023-03-07 09:29:13,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13021.9, 300 sec: 13013.5). Total num frames: 88837120. Throughput: 0: 13006.5. Samples: 88829904. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:29:13,367][155126] Avg episode reward: [(0, '2229.467')] [2023-03-07 09:29:13,734][155452] Updated weights for policy 0, policy_version 86760 (0.0006) [2023-03-07 09:29:14,539][155452] Updated weights for policy 0, policy_version 86770 (0.0006) [2023-03-07 09:29:15,326][155452] Updated weights for policy 0, policy_version 86780 (0.0006) [2023-03-07 09:29:16,124][155452] Updated weights for policy 0, policy_version 86790 (0.0007) [2023-03-07 09:29:16,910][155452] Updated weights for policy 0, policy_version 86800 (0.0006) [2023-03-07 09:29:17,700][155452] Updated weights for policy 0, policy_version 86810 (0.0006) [2023-03-07 09:29:18,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13013.5). Total num frames: 88901632. Throughput: 0: 13000.9. Samples: 88868688. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:29:18,367][155126] Avg episode reward: [(0, '2325.957')] [2023-03-07 09:29:18,474][155452] Updated weights for policy 0, policy_version 86820 (0.0006) [2023-03-07 09:29:19,266][155452] Updated weights for policy 0, policy_version 86830 (0.0007) [2023-03-07 09:29:20,041][155452] Updated weights for policy 0, policy_version 86840 (0.0006) [2023-03-07 09:29:20,834][155452] Updated weights for policy 0, policy_version 86850 (0.0006) [2023-03-07 09:29:21,621][155452] Updated weights for policy 0, policy_version 86860 (0.0006) [2023-03-07 09:29:22,418][155452] Updated weights for policy 0, policy_version 86870 (0.0006) [2023-03-07 09:29:23,194][155452] Updated weights for policy 0, policy_version 86880 (0.0007) [2023-03-07 09:29:23,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13017.0). Total num frames: 88967168. Throughput: 0: 13006.7. Samples: 88946906. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:29:23,368][155126] Avg episode reward: [(0, '2184.611')] [2023-03-07 09:29:23,983][155452] Updated weights for policy 0, policy_version 86890 (0.0007) [2023-03-07 09:29:24,766][155452] Updated weights for policy 0, policy_version 86900 (0.0006) [2023-03-07 09:29:25,547][155452] Updated weights for policy 0, policy_version 86910 (0.0006) [2023-03-07 09:29:26,334][155452] Updated weights for policy 0, policy_version 86920 (0.0006) [2023-03-07 09:29:27,108][155452] Updated weights for policy 0, policy_version 86930 (0.0006) [2023-03-07 09:29:27,900][155452] Updated weights for policy 0, policy_version 86940 (0.0006) [2023-03-07 09:29:28,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13021.8, 300 sec: 13016.9). Total num frames: 89032704. Throughput: 0: 13012.7. Samples: 89025219. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:29:28,367][155126] Avg episode reward: [(0, '2382.447')] [2023-03-07 09:29:28,371][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000086946_89032704.pth... [2023-03-07 09:29:28,399][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000083895_85908480.pth [2023-03-07 09:29:28,676][155452] Updated weights for policy 0, policy_version 86950 (0.0006) [2023-03-07 09:29:29,443][155452] Updated weights for policy 0, policy_version 86960 (0.0005) [2023-03-07 09:29:30,238][155452] Updated weights for policy 0, policy_version 86970 (0.0006) [2023-03-07 09:29:31,028][155452] Updated weights for policy 0, policy_version 86980 (0.0007) [2023-03-07 09:29:31,806][155452] Updated weights for policy 0, policy_version 86990 (0.0006) [2023-03-07 09:29:32,597][155452] Updated weights for policy 0, policy_version 87000 (0.0006) [2023-03-07 09:29:33,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13016.9). Total num frames: 89097216. Throughput: 0: 13017.4. Samples: 89064594. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:29:33,367][155126] Avg episode reward: [(0, '2321.417')] [2023-03-07 09:29:33,372][155452] Updated weights for policy 0, policy_version 87010 (0.0006) [2023-03-07 09:29:34,160][155452] Updated weights for policy 0, policy_version 87020 (0.0006) [2023-03-07 09:29:34,952][155452] Updated weights for policy 0, policy_version 87030 (0.0006) [2023-03-07 09:29:35,717][155452] Updated weights for policy 0, policy_version 87040 (0.0006) [2023-03-07 09:29:36,519][155452] Updated weights for policy 0, policy_version 87050 (0.0007) [2023-03-07 09:29:37,319][155452] Updated weights for policy 0, policy_version 87060 (0.0006) [2023-03-07 09:29:38,120][155452] Updated weights for policy 0, policy_version 87070 (0.0006) [2023-03-07 09:29:38,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13017.0). Total num frames: 89162752. Throughput: 0: 13021.3. Samples: 89142828. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:29:38,367][155126] Avg episode reward: [(0, '2201.270')] [2023-03-07 09:29:38,891][155452] Updated weights for policy 0, policy_version 87080 (0.0007) [2023-03-07 09:29:39,678][155452] Updated weights for policy 0, policy_version 87090 (0.0007) [2023-03-07 09:29:40,474][155452] Updated weights for policy 0, policy_version 87100 (0.0006) [2023-03-07 09:29:41,278][155452] Updated weights for policy 0, policy_version 87110 (0.0006) [2023-03-07 09:29:42,069][155452] Updated weights for policy 0, policy_version 87120 (0.0006) [2023-03-07 09:29:42,857][155452] Updated weights for policy 0, policy_version 87130 (0.0007) [2023-03-07 09:29:43,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13004.8, 300 sec: 13016.9). Total num frames: 89227264. Throughput: 0: 13015.2. Samples: 89220489. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:29:43,368][155126] Avg episode reward: [(0, '1998.425')] [2023-03-07 09:29:43,640][155452] Updated weights for policy 0, policy_version 87140 (0.0006) [2023-03-07 09:29:44,444][155452] Updated weights for policy 0, policy_version 87150 (0.0005) [2023-03-07 09:29:45,218][155452] Updated weights for policy 0, policy_version 87160 (0.0007) [2023-03-07 09:29:45,996][155452] Updated weights for policy 0, policy_version 87170 (0.0006) [2023-03-07 09:29:46,779][155452] Updated weights for policy 0, policy_version 87180 (0.0006) [2023-03-07 09:29:47,557][155452] Updated weights for policy 0, policy_version 87190 (0.0006) [2023-03-07 09:29:48,360][155452] Updated weights for policy 0, policy_version 87200 (0.0006) [2023-03-07 09:29:48,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13017.0). Total num frames: 89292800. Throughput: 0: 13017.7. Samples: 89259419. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:29:48,367][155126] Avg episode reward: [(0, '2008.175')] [2023-03-07 09:29:49,140][155452] Updated weights for policy 0, policy_version 87210 (0.0006) [2023-03-07 09:29:49,930][155452] Updated weights for policy 0, policy_version 87220 (0.0007) [2023-03-07 09:29:50,690][155452] Updated weights for policy 0, policy_version 87230 (0.0006) [2023-03-07 09:29:51,472][155452] Updated weights for policy 0, policy_version 87240 (0.0006) [2023-03-07 09:29:52,245][155452] Updated weights for policy 0, policy_version 87250 (0.0006) [2023-03-07 09:29:53,021][155452] Updated weights for policy 0, policy_version 87260 (0.0007) [2023-03-07 09:29:53,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 89358336. Throughput: 0: 13033.7. Samples: 89337959. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:29:53,368][155126] Avg episode reward: [(0, '1993.796')] [2023-03-07 09:29:53,821][155452] Updated weights for policy 0, policy_version 87270 (0.0006) [2023-03-07 09:29:54,611][155452] Updated weights for policy 0, policy_version 87280 (0.0006) [2023-03-07 09:29:55,401][155452] Updated weights for policy 0, policy_version 87290 (0.0005) [2023-03-07 09:29:56,189][155452] Updated weights for policy 0, policy_version 87300 (0.0006) [2023-03-07 09:29:56,994][155452] Updated weights for policy 0, policy_version 87310 (0.0006) [2023-03-07 09:29:57,764][155452] Updated weights for policy 0, policy_version 87320 (0.0006) [2023-03-07 09:29:58,367][155126] Fps is (10 sec: 13004.5, 60 sec: 13021.9, 300 sec: 13016.9). Total num frames: 89422848. Throughput: 0: 13027.3. Samples: 89416135. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:29:58,367][155126] Avg episode reward: [(0, '2157.289')] [2023-03-07 09:29:58,551][155452] Updated weights for policy 0, policy_version 87330 (0.0006) [2023-03-07 09:29:59,323][155452] Updated weights for policy 0, policy_version 87340 (0.0006) [2023-03-07 09:30:00,099][155452] Updated weights for policy 0, policy_version 87350 (0.0006) [2023-03-07 09:30:00,883][155452] Updated weights for policy 0, policy_version 87360 (0.0006) [2023-03-07 09:30:01,650][155452] Updated weights for policy 0, policy_version 87370 (0.0006) [2023-03-07 09:30:02,432][155452] Updated weights for policy 0, policy_version 87380 (0.0006) [2023-03-07 09:30:03,238][155452] Updated weights for policy 0, policy_version 87390 (0.0006) [2023-03-07 09:30:03,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.8, 300 sec: 13016.9). Total num frames: 89488384. Throughput: 0: 13038.1. Samples: 89455403. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:30:03,367][155126] Avg episode reward: [(0, '2045.004')] [2023-03-07 09:30:04,018][155452] Updated weights for policy 0, policy_version 87400 (0.0006) [2023-03-07 09:30:04,809][155452] Updated weights for policy 0, policy_version 87410 (0.0006) [2023-03-07 09:30:05,589][155452] Updated weights for policy 0, policy_version 87420 (0.0006) [2023-03-07 09:30:06,384][155452] Updated weights for policy 0, policy_version 87430 (0.0007) [2023-03-07 09:30:07,161][155452] Updated weights for policy 0, policy_version 87440 (0.0006) [2023-03-07 09:30:07,955][155452] Updated weights for policy 0, policy_version 87450 (0.0006) [2023-03-07 09:30:08,367][155126] Fps is (10 sec: 13107.4, 60 sec: 13038.9, 300 sec: 13020.4). Total num frames: 89553920. Throughput: 0: 13040.7. Samples: 89533737. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:30:08,367][155126] Avg episode reward: [(0, '2096.912')] [2023-03-07 09:30:08,742][155452] Updated weights for policy 0, policy_version 87460 (0.0006) [2023-03-07 09:30:09,535][155452] Updated weights for policy 0, policy_version 87470 (0.0006) [2023-03-07 09:30:10,309][155452] Updated weights for policy 0, policy_version 87480 (0.0006) [2023-03-07 09:30:11,093][155452] Updated weights for policy 0, policy_version 87490 (0.0006) [2023-03-07 09:30:11,867][155452] Updated weights for policy 0, policy_version 87500 (0.0006) [2023-03-07 09:30:12,666][155452] Updated weights for policy 0, policy_version 87510 (0.0006) [2023-03-07 09:30:13,367][155126] Fps is (10 sec: 13107.4, 60 sec: 13039.0, 300 sec: 13023.9). Total num frames: 89619456. Throughput: 0: 13039.0. Samples: 89611971. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:30:13,367][155126] Avg episode reward: [(0, '1977.166')] [2023-03-07 09:30:13,449][155452] Updated weights for policy 0, policy_version 87520 (0.0006) [2023-03-07 09:30:14,218][155452] Updated weights for policy 0, policy_version 87530 (0.0006) [2023-03-07 09:30:14,998][155452] Updated weights for policy 0, policy_version 87540 (0.0006) [2023-03-07 09:30:15,793][155452] Updated weights for policy 0, policy_version 87550 (0.0006) [2023-03-07 09:30:16,573][155452] Updated weights for policy 0, policy_version 87560 (0.0006) [2023-03-07 09:30:17,341][155452] Updated weights for policy 0, policy_version 87570 (0.0006) [2023-03-07 09:30:18,131][155452] Updated weights for policy 0, policy_version 87580 (0.0006) [2023-03-07 09:30:18,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13020.4). Total num frames: 89683968. Throughput: 0: 13037.7. Samples: 89651291. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:30:18,368][155126] Avg episode reward: [(0, '2157.804')] [2023-03-07 09:30:18,917][155452] Updated weights for policy 0, policy_version 87590 (0.0006) [2023-03-07 09:30:19,712][155452] Updated weights for policy 0, policy_version 87600 (0.0006) [2023-03-07 09:30:20,525][155452] Updated weights for policy 0, policy_version 87610 (0.0007) [2023-03-07 09:30:21,312][155452] Updated weights for policy 0, policy_version 87620 (0.0006) [2023-03-07 09:30:22,101][155452] Updated weights for policy 0, policy_version 87630 (0.0006) [2023-03-07 09:30:22,873][155452] Updated weights for policy 0, policy_version 87640 (0.0006) [2023-03-07 09:30:23,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13023.9). Total num frames: 89749504. Throughput: 0: 13031.2. Samples: 89729234. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:30:23,368][155126] Avg episode reward: [(0, '2024.724')] [2023-03-07 09:30:23,673][155452] Updated weights for policy 0, policy_version 87650 (0.0006) [2023-03-07 09:30:24,450][155452] Updated weights for policy 0, policy_version 87660 (0.0007) [2023-03-07 09:30:25,241][155452] Updated weights for policy 0, policy_version 87670 (0.0007) [2023-03-07 09:30:26,025][155452] Updated weights for policy 0, policy_version 87680 (0.0006) [2023-03-07 09:30:26,812][155452] Updated weights for policy 0, policy_version 87690 (0.0006) [2023-03-07 09:30:27,602][155452] Updated weights for policy 0, policy_version 87700 (0.0006) [2023-03-07 09:30:28,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 89814016. Throughput: 0: 13042.6. Samples: 89807407. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:30:28,367][155126] Avg episode reward: [(0, '2050.456')] [2023-03-07 09:30:28,391][155452] Updated weights for policy 0, policy_version 87710 (0.0006) [2023-03-07 09:30:29,186][155452] Updated weights for policy 0, policy_version 87720 (0.0006) [2023-03-07 09:30:29,981][155452] Updated weights for policy 0, policy_version 87730 (0.0006) [2023-03-07 09:30:30,767][155452] Updated weights for policy 0, policy_version 87740 (0.0006) [2023-03-07 09:30:31,546][155452] Updated weights for policy 0, policy_version 87750 (0.0006) [2023-03-07 09:30:32,329][155452] Updated weights for policy 0, policy_version 87760 (0.0006) [2023-03-07 09:30:33,122][155452] Updated weights for policy 0, policy_version 87770 (0.0007) [2023-03-07 09:30:33,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13039.0, 300 sec: 13023.9). Total num frames: 89879552. Throughput: 0: 13036.3. Samples: 89846053. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:30:33,367][155126] Avg episode reward: [(0, '1998.579')] [2023-03-07 09:30:33,901][155452] Updated weights for policy 0, policy_version 87780 (0.0007) [2023-03-07 09:30:34,683][155452] Updated weights for policy 0, policy_version 87790 (0.0007) [2023-03-07 09:30:35,467][155452] Updated weights for policy 0, policy_version 87800 (0.0006) [2023-03-07 09:30:36,251][155452] Updated weights for policy 0, policy_version 87810 (0.0006) [2023-03-07 09:30:37,020][155452] Updated weights for policy 0, policy_version 87820 (0.0006) [2023-03-07 09:30:37,812][155452] Updated weights for policy 0, policy_version 87830 (0.0006) [2023-03-07 09:30:38,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13023.9). Total num frames: 89945088. Throughput: 0: 13038.5. Samples: 89924693. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:30:38,368][155126] Avg episode reward: [(0, '2026.430')] [2023-03-07 09:30:38,598][155452] Updated weights for policy 0, policy_version 87840 (0.0006) [2023-03-07 09:30:39,380][155452] Updated weights for policy 0, policy_version 87850 (0.0007) [2023-03-07 09:30:40,173][155452] Updated weights for policy 0, policy_version 87860 (0.0006) [2023-03-07 09:30:40,946][155452] Updated weights for policy 0, policy_version 87870 (0.0006) [2023-03-07 09:30:41,731][155452] Updated weights for policy 0, policy_version 87880 (0.0006) [2023-03-07 09:30:42,520][155452] Updated weights for policy 0, policy_version 87890 (0.0007) [2023-03-07 09:30:43,289][155452] Updated weights for policy 0, policy_version 87900 (0.0006) [2023-03-07 09:30:43,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13039.0, 300 sec: 13020.4). Total num frames: 90009600. Throughput: 0: 13041.7. Samples: 90003009. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:30:43,367][155126] Avg episode reward: [(0, '1922.137')] [2023-03-07 09:30:44,080][155452] Updated weights for policy 0, policy_version 87910 (0.0005) [2023-03-07 09:30:44,858][155452] Updated weights for policy 0, policy_version 87920 (0.0007) [2023-03-07 09:30:45,636][155452] Updated weights for policy 0, policy_version 87930 (0.0006) [2023-03-07 09:30:46,418][155452] Updated weights for policy 0, policy_version 87940 (0.0007) [2023-03-07 09:30:47,205][155452] Updated weights for policy 0, policy_version 87950 (0.0006) [2023-03-07 09:30:48,013][155452] Updated weights for policy 0, policy_version 87960 (0.0006) [2023-03-07 09:30:48,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13038.9, 300 sec: 13023.9). Total num frames: 90075136. Throughput: 0: 13043.8. Samples: 90042373. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:30:48,367][155126] Avg episode reward: [(0, '2114.154')] [2023-03-07 09:30:48,804][155452] Updated weights for policy 0, policy_version 87970 (0.0007) [2023-03-07 09:30:49,592][155452] Updated weights for policy 0, policy_version 87980 (0.0005) [2023-03-07 09:30:50,376][155452] Updated weights for policy 0, policy_version 87990 (0.0007) [2023-03-07 09:30:51,157][155452] Updated weights for policy 0, policy_version 88000 (0.0006) [2023-03-07 09:30:51,937][155452] Updated weights for policy 0, policy_version 88010 (0.0007) [2023-03-07 09:30:52,729][155452] Updated weights for policy 0, policy_version 88020 (0.0006) [2023-03-07 09:30:53,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13027.4). Total num frames: 90140672. Throughput: 0: 13036.2. Samples: 90120367. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:30:53,367][155126] Avg episode reward: [(0, '2059.791')] [2023-03-07 09:30:53,515][155452] Updated weights for policy 0, policy_version 88030 (0.0005) [2023-03-07 09:30:54,306][155452] Updated weights for policy 0, policy_version 88040 (0.0007) [2023-03-07 09:30:55,096][155452] Updated weights for policy 0, policy_version 88050 (0.0006) [2023-03-07 09:30:55,879][155452] Updated weights for policy 0, policy_version 88060 (0.0006) [2023-03-07 09:30:56,669][155452] Updated weights for policy 0, policy_version 88070 (0.0007) [2023-03-07 09:30:57,456][155452] Updated weights for policy 0, policy_version 88080 (0.0007) [2023-03-07 09:30:58,262][155452] Updated weights for policy 0, policy_version 88090 (0.0007) [2023-03-07 09:30:58,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13027.4). Total num frames: 90205184. Throughput: 0: 13028.3. Samples: 90198249. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:30:58,367][155126] Avg episode reward: [(0, '2242.984')] [2023-03-07 09:30:59,030][155452] Updated weights for policy 0, policy_version 88100 (0.0006) [2023-03-07 09:30:59,821][155452] Updated weights for policy 0, policy_version 88110 (0.0006) [2023-03-07 09:31:00,600][155452] Updated weights for policy 0, policy_version 88120 (0.0006) [2023-03-07 09:31:01,390][155452] Updated weights for policy 0, policy_version 88130 (0.0005) [2023-03-07 09:31:02,165][155452] Updated weights for policy 0, policy_version 88140 (0.0007) [2023-03-07 09:31:02,963][155452] Updated weights for policy 0, policy_version 88150 (0.0006) [2023-03-07 09:31:03,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13027.4). Total num frames: 90270720. Throughput: 0: 13026.5. Samples: 90237484. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:31:03,367][155126] Avg episode reward: [(0, '2051.475')] [2023-03-07 09:31:03,758][155452] Updated weights for policy 0, policy_version 88160 (0.0006) [2023-03-07 09:31:04,540][155452] Updated weights for policy 0, policy_version 88170 (0.0006) [2023-03-07 09:31:05,347][155452] Updated weights for policy 0, policy_version 88180 (0.0006) [2023-03-07 09:31:06,131][155452] Updated weights for policy 0, policy_version 88190 (0.0006) [2023-03-07 09:31:06,917][155452] Updated weights for policy 0, policy_version 88200 (0.0006) [2023-03-07 09:31:07,683][155452] Updated weights for policy 0, policy_version 88210 (0.0006) [2023-03-07 09:31:08,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.8, 300 sec: 13023.9). Total num frames: 90335232. Throughput: 0: 13024.3. Samples: 90315327. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:31:08,368][155126] Avg episode reward: [(0, '2201.561')] [2023-03-07 09:31:08,496][155452] Updated weights for policy 0, policy_version 88220 (0.0007) [2023-03-07 09:31:09,280][155452] Updated weights for policy 0, policy_version 88230 (0.0007) [2023-03-07 09:31:10,057][155452] Updated weights for policy 0, policy_version 88240 (0.0006) [2023-03-07 09:31:10,862][155452] Updated weights for policy 0, policy_version 88250 (0.0007) [2023-03-07 09:31:11,632][155452] Updated weights for policy 0, policy_version 88260 (0.0006) [2023-03-07 09:31:12,440][155452] Updated weights for policy 0, policy_version 88270 (0.0005) [2023-03-07 09:31:13,208][155452] Updated weights for policy 0, policy_version 88280 (0.0006) [2023-03-07 09:31:13,367][155126] Fps is (10 sec: 12902.2, 60 sec: 13004.7, 300 sec: 13023.9). Total num frames: 90399744. Throughput: 0: 13019.2. Samples: 90393271. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:31:13,368][155126] Avg episode reward: [(0, '2155.911')] [2023-03-07 09:31:14,001][155452] Updated weights for policy 0, policy_version 88290 (0.0006) [2023-03-07 09:31:14,791][155452] Updated weights for policy 0, policy_version 88300 (0.0006) [2023-03-07 09:31:15,565][155452] Updated weights for policy 0, policy_version 88310 (0.0006) [2023-03-07 09:31:16,345][155452] Updated weights for policy 0, policy_version 88320 (0.0006) [2023-03-07 09:31:17,127][155452] Updated weights for policy 0, policy_version 88330 (0.0007) [2023-03-07 09:31:17,893][155452] Updated weights for policy 0, policy_version 88340 (0.0007) [2023-03-07 09:31:18,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13023.9). Total num frames: 90465280. Throughput: 0: 13031.1. Samples: 90432453. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:31:18,367][155126] Avg episode reward: [(0, '2147.238')] [2023-03-07 09:31:18,695][155452] Updated weights for policy 0, policy_version 88350 (0.0006) [2023-03-07 09:31:19,490][155452] Updated weights for policy 0, policy_version 88360 (0.0006) [2023-03-07 09:31:20,265][155452] Updated weights for policy 0, policy_version 88370 (0.0006) [2023-03-07 09:31:21,073][155452] Updated weights for policy 0, policy_version 88380 (0.0006) [2023-03-07 09:31:21,859][155452] Updated weights for policy 0, policy_version 88390 (0.0007) [2023-03-07 09:31:22,633][155452] Updated weights for policy 0, policy_version 88400 (0.0006) [2023-03-07 09:31:23,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13021.9, 300 sec: 13027.4). Total num frames: 90530816. Throughput: 0: 13023.4. Samples: 90510745. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:31:23,367][155126] Avg episode reward: [(0, '2153.770')] [2023-03-07 09:31:23,417][155452] Updated weights for policy 0, policy_version 88410 (0.0006) [2023-03-07 09:31:24,217][155452] Updated weights for policy 0, policy_version 88420 (0.0007) [2023-03-07 09:31:25,005][155452] Updated weights for policy 0, policy_version 88430 (0.0006) [2023-03-07 09:31:25,779][155452] Updated weights for policy 0, policy_version 88440 (0.0006) [2023-03-07 09:31:26,556][155452] Updated weights for policy 0, policy_version 88450 (0.0007) [2023-03-07 09:31:27,354][155452] Updated weights for policy 0, policy_version 88460 (0.0006) [2023-03-07 09:31:28,126][155452] Updated weights for policy 0, policy_version 88470 (0.0006) [2023-03-07 09:31:28,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13021.8, 300 sec: 13023.9). Total num frames: 90595328. Throughput: 0: 13019.9. Samples: 90588906. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:31:28,368][155126] Avg episode reward: [(0, '2019.190')] [2023-03-07 09:31:28,383][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000088473_90596352.pth... 
[2023-03-07 09:31:28,417][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000085420_87470080.pth [2023-03-07 09:31:28,936][155452] Updated weights for policy 0, policy_version 88480 (0.0007) [2023-03-07 09:31:29,707][155452] Updated weights for policy 0, policy_version 88490 (0.0006) [2023-03-07 09:31:30,502][155452] Updated weights for policy 0, policy_version 88500 (0.0006) [2023-03-07 09:31:31,283][155452] Updated weights for policy 0, policy_version 88510 (0.0006) [2023-03-07 09:31:32,076][155452] Updated weights for policy 0, policy_version 88520 (0.0006) [2023-03-07 09:31:32,880][155452] Updated weights for policy 0, policy_version 88530 (0.0006) [2023-03-07 09:31:33,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.8, 300 sec: 13027.4). Total num frames: 90660864. Throughput: 0: 13010.8. Samples: 90627860. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:31:33,367][155126] Avg episode reward: [(0, '2215.151')] [2023-03-07 09:31:33,665][155452] Updated weights for policy 0, policy_version 88540 (0.0006) [2023-03-07 09:31:34,458][155452] Updated weights for policy 0, policy_version 88550 (0.0007) [2023-03-07 09:31:35,250][155452] Updated weights for policy 0, policy_version 88560 (0.0007) [2023-03-07 09:31:36,026][155452] Updated weights for policy 0, policy_version 88570 (0.0006) [2023-03-07 09:31:36,805][155452] Updated weights for policy 0, policy_version 88580 (0.0006) [2023-03-07 09:31:37,592][155452] Updated weights for policy 0, policy_version 88590 (0.0006) [2023-03-07 09:31:38,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13004.8, 300 sec: 13023.9). Total num frames: 90725376. Throughput: 0: 13007.1. Samples: 90705689. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:31:38,367][155126] Avg episode reward: [(0, '2139.287')] [2023-03-07 09:31:38,385][155452] Updated weights for policy 0, policy_version 88600 (0.0006) [2023-03-07 09:31:39,175][155452] Updated weights for policy 0, policy_version 88610 (0.0006) [2023-03-07 09:31:39,961][155452] Updated weights for policy 0, policy_version 88620 (0.0007) [2023-03-07 09:31:40,752][155452] Updated weights for policy 0, policy_version 88630 (0.0006) [2023-03-07 09:31:41,528][155452] Updated weights for policy 0, policy_version 88640 (0.0006) [2023-03-07 09:31:42,316][155452] Updated weights for policy 0, policy_version 88650 (0.0007) [2023-03-07 09:31:43,097][155452] Updated weights for policy 0, policy_version 88660 (0.0006) [2023-03-07 09:31:43,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13023.9). Total num frames: 90790912. Throughput: 0: 13015.2. Samples: 90783933. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:31:43,367][155126] Avg episode reward: [(0, '2201.245')] [2023-03-07 09:31:43,886][155452] Updated weights for policy 0, policy_version 88670 (0.0006) [2023-03-07 09:31:44,665][155452] Updated weights for policy 0, policy_version 88680 (0.0007) [2023-03-07 09:31:45,454][155452] Updated weights for policy 0, policy_version 88690 (0.0006) [2023-03-07 09:31:46,240][155452] Updated weights for policy 0, policy_version 88700 (0.0007) [2023-03-07 09:31:47,027][155452] Updated weights for policy 0, policy_version 88710 (0.0006) [2023-03-07 09:31:47,812][155452] Updated weights for policy 0, policy_version 88720 (0.0008) [2023-03-07 09:31:48,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13021.9, 300 sec: 13027.4). Total num frames: 90856448. Throughput: 0: 13008.2. Samples: 90822853. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:31:48,367][155126] Avg episode reward: [(0, '2074.068')] [2023-03-07 09:31:48,593][155452] Updated weights for policy 0, policy_version 88730 (0.0006) [2023-03-07 09:31:49,390][155452] Updated weights for policy 0, policy_version 88740 (0.0006) [2023-03-07 09:31:50,185][155452] Updated weights for policy 0, policy_version 88750 (0.0006) [2023-03-07 09:31:50,957][155452] Updated weights for policy 0, policy_version 88760 (0.0006) [2023-03-07 09:31:51,731][155452] Updated weights for policy 0, policy_version 88770 (0.0006) [2023-03-07 09:31:52,521][155452] Updated weights for policy 0, policy_version 88780 (0.0006) [2023-03-07 09:31:53,300][155452] Updated weights for policy 0, policy_version 88790 (0.0007) [2023-03-07 09:31:53,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13004.8, 300 sec: 13027.4). Total num frames: 90920960. Throughput: 0: 13022.4. Samples: 90901332. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:31:53,367][155126] Avg episode reward: [(0, '2141.370')] [2023-03-07 09:31:54,078][155452] Updated weights for policy 0, policy_version 88800 (0.0006) [2023-03-07 09:31:54,875][155452] Updated weights for policy 0, policy_version 88810 (0.0006) [2023-03-07 09:31:55,665][155452] Updated weights for policy 0, policy_version 88820 (0.0006) [2023-03-07 09:31:56,452][155452] Updated weights for policy 0, policy_version 88830 (0.0006) [2023-03-07 09:31:57,243][155452] Updated weights for policy 0, policy_version 88840 (0.0006) [2023-03-07 09:31:58,034][155452] Updated weights for policy 0, policy_version 88850 (0.0006) [2023-03-07 09:31:58,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13027.4). Total num frames: 90986496. Throughput: 0: 13026.9. Samples: 90979478. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:31:58,367][155126] Avg episode reward: [(0, '2194.312')] [2023-03-07 09:31:58,813][155452] Updated weights for policy 0, policy_version 88860 (0.0006) [2023-03-07 09:31:59,600][155452] Updated weights for policy 0, policy_version 88870 (0.0006) [2023-03-07 09:32:00,379][155452] Updated weights for policy 0, policy_version 88880 (0.0007) [2023-03-07 09:32:01,161][155452] Updated weights for policy 0, policy_version 88890 (0.0006) [2023-03-07 09:32:01,945][155452] Updated weights for policy 0, policy_version 88900 (0.0006) [2023-03-07 09:32:02,749][155452] Updated weights for policy 0, policy_version 88910 (0.0006) [2023-03-07 09:32:03,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13004.8, 300 sec: 13023.9). Total num frames: 91051008. Throughput: 0: 13022.2. Samples: 91018452. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:32:03,367][155126] Avg episode reward: [(0, '2255.299')] [2023-03-07 09:32:03,541][155452] Updated weights for policy 0, policy_version 88920 (0.0006) [2023-03-07 09:32:04,315][155452] Updated weights for policy 0, policy_version 88930 (0.0006) [2023-03-07 09:32:05,120][155452] Updated weights for policy 0, policy_version 88940 (0.0007) [2023-03-07 09:32:05,914][155452] Updated weights for policy 0, policy_version 88950 (0.0006) [2023-03-07 09:32:06,712][155452] Updated weights for policy 0, policy_version 88960 (0.0006) [2023-03-07 09:32:07,488][155452] Updated weights for policy 0, policy_version 88970 (0.0006) [2023-03-07 09:32:08,262][155452] Updated weights for policy 0, policy_version 88980 (0.0005) [2023-03-07 09:32:08,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13027.4). Total num frames: 91116544. Throughput: 0: 13014.5. Samples: 91096397. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:32:08,367][155126] Avg episode reward: [(0, '2228.512')] [2023-03-07 09:32:09,043][155452] Updated weights for policy 0, policy_version 88990 (0.0006) [2023-03-07 09:32:09,817][155452] Updated weights for policy 0, policy_version 89000 (0.0007) [2023-03-07 09:32:10,602][155452] Updated weights for policy 0, policy_version 89010 (0.0006) [2023-03-07 09:32:11,406][155452] Updated weights for policy 0, policy_version 89020 (0.0006) [2023-03-07 09:32:12,187][155452] Updated weights for policy 0, policy_version 89030 (0.0007) [2023-03-07 09:32:12,985][155452] Updated weights for policy 0, policy_version 89040 (0.0005) [2023-03-07 09:32:13,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13021.9, 300 sec: 13023.9). Total num frames: 91181056. Throughput: 0: 13015.2. Samples: 91174589. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:32:13,367][155126] Avg episode reward: [(0, '2142.762')] [2023-03-07 09:32:13,750][155452] Updated weights for policy 0, policy_version 89050 (0.0006) [2023-03-07 09:32:14,550][155452] Updated weights for policy 0, policy_version 89060 (0.0006) [2023-03-07 09:32:15,346][155452] Updated weights for policy 0, policy_version 89070 (0.0006) [2023-03-07 09:32:16,122][155452] Updated weights for policy 0, policy_version 89080 (0.0006) [2023-03-07 09:32:16,899][155452] Updated weights for policy 0, policy_version 89090 (0.0006) [2023-03-07 09:32:17,705][155452] Updated weights for policy 0, policy_version 89100 (0.0006) [2023-03-07 09:32:18,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13023.9). Total num frames: 91246592. Throughput: 0: 13014.6. Samples: 91213518. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:32:18,367][155126] Avg episode reward: [(0, '2104.641')] [2023-03-07 09:32:18,493][155452] Updated weights for policy 0, policy_version 89110 (0.0006) [2023-03-07 09:32:19,265][155452] Updated weights for policy 0, policy_version 89120 (0.0007) [2023-03-07 09:32:20,069][155452] Updated weights for policy 0, policy_version 89130 (0.0006) [2023-03-07 09:32:20,843][155452] Updated weights for policy 0, policy_version 89140 (0.0006) [2023-03-07 09:32:21,621][155452] Updated weights for policy 0, policy_version 89150 (0.0006) [2023-03-07 09:32:22,415][155452] Updated weights for policy 0, policy_version 89160 (0.0006) [2023-03-07 09:32:23,188][155452] Updated weights for policy 0, policy_version 89170 (0.0006) [2023-03-07 09:32:23,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13021.9, 300 sec: 13027.4). Total num frames: 91312128. Throughput: 0: 13024.0. Samples: 91291770. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:32:23,378][155126] Avg episode reward: [(0, '2277.233')] [2023-03-07 09:32:23,978][155452] Updated weights for policy 0, policy_version 89180 (0.0007) [2023-03-07 09:32:24,771][155452] Updated weights for policy 0, policy_version 89190 (0.0006) [2023-03-07 09:32:25,553][155452] Updated weights for policy 0, policy_version 89200 (0.0006) [2023-03-07 09:32:26,326][155452] Updated weights for policy 0, policy_version 89210 (0.0007) [2023-03-07 09:32:27,138][155452] Updated weights for policy 0, policy_version 89220 (0.0006) [2023-03-07 09:32:27,921][155452] Updated weights for policy 0, policy_version 89230 (0.0007) [2023-03-07 09:32:28,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13023.9). Total num frames: 91376640. Throughput: 0: 13019.2. Samples: 91369798. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:32:28,367][155126] Avg episode reward: [(0, '2098.251')] [2023-03-07 09:32:28,721][155452] Updated weights for policy 0, policy_version 89240 (0.0007) [2023-03-07 09:32:29,492][155452] Updated weights for policy 0, policy_version 89250 (0.0007) [2023-03-07 09:32:30,297][155452] Updated weights for policy 0, policy_version 89260 (0.0006) [2023-03-07 09:32:31,067][155452] Updated weights for policy 0, policy_version 89270 (0.0006) [2023-03-07 09:32:31,873][155452] Updated weights for policy 0, policy_version 89280 (0.0006) [2023-03-07 09:32:32,663][155452] Updated weights for policy 0, policy_version 89290 (0.0005) [2023-03-07 09:32:33,367][155126] Fps is (10 sec: 12902.5, 60 sec: 13004.8, 300 sec: 13023.9). Total num frames: 91441152. Throughput: 0: 13020.8. Samples: 91408788. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:32:33,367][155126] Avg episode reward: [(0, '2149.410')] [2023-03-07 09:32:33,430][155452] Updated weights for policy 0, policy_version 89300 (0.0006) [2023-03-07 09:32:34,226][155452] Updated weights for policy 0, policy_version 89310 (0.0006) [2023-03-07 09:32:35,016][155452] Updated weights for policy 0, policy_version 89320 (0.0006) [2023-03-07 09:32:35,794][155452] Updated weights for policy 0, policy_version 89330 (0.0006) [2023-03-07 09:32:36,599][155452] Updated weights for policy 0, policy_version 89340 (0.0006) [2023-03-07 09:32:37,399][155452] Updated weights for policy 0, policy_version 89350 (0.0006) [2023-03-07 09:32:38,170][155452] Updated weights for policy 0, policy_version 89360 (0.0006) [2023-03-07 09:32:38,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13023.9). Total num frames: 91506688. Throughput: 0: 13007.5. Samples: 91486670. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:32:38,367][155126] Avg episode reward: [(0, '2153.778')] [2023-03-07 09:32:38,964][155452] Updated weights for policy 0, policy_version 89370 (0.0007) [2023-03-07 09:32:39,734][155452] Updated weights for policy 0, policy_version 89380 (0.0006) [2023-03-07 09:32:40,517][155452] Updated weights for policy 0, policy_version 89390 (0.0006) [2023-03-07 09:32:41,316][155452] Updated weights for policy 0, policy_version 89400 (0.0007) [2023-03-07 09:32:42,096][155452] Updated weights for policy 0, policy_version 89410 (0.0007) [2023-03-07 09:32:42,882][155452] Updated weights for policy 0, policy_version 89420 (0.0006) [2023-03-07 09:32:43,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13021.9, 300 sec: 13023.9). Total num frames: 91572224. Throughput: 0: 13010.9. Samples: 91564969. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:32:43,367][155126] Avg episode reward: [(0, '2096.752')] [2023-03-07 09:32:43,680][155452] Updated weights for policy 0, policy_version 89430 (0.0006) [2023-03-07 09:32:44,463][155452] Updated weights for policy 0, policy_version 89440 (0.0007) [2023-03-07 09:32:45,273][155452] Updated weights for policy 0, policy_version 89450 (0.0006) [2023-03-07 09:32:46,054][155452] Updated weights for policy 0, policy_version 89460 (0.0006) [2023-03-07 09:32:46,844][155452] Updated weights for policy 0, policy_version 89470 (0.0006) [2023-03-07 09:32:47,633][155452] Updated weights for policy 0, policy_version 89480 (0.0006) [2023-03-07 09:32:48,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13004.8, 300 sec: 13020.4). Total num frames: 91636736. Throughput: 0: 13004.6. Samples: 91603661. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:32:48,378][155126] Avg episode reward: [(0, '2068.816')] [2023-03-07 09:32:48,418][155452] Updated weights for policy 0, policy_version 89490 (0.0006) [2023-03-07 09:32:49,202][155452] Updated weights for policy 0, policy_version 89500 (0.0006) [2023-03-07 09:32:49,971][155452] Updated weights for policy 0, policy_version 89510 (0.0006) [2023-03-07 09:32:50,771][155452] Updated weights for policy 0, policy_version 89520 (0.0006) [2023-03-07 09:32:51,574][155452] Updated weights for policy 0, policy_version 89530 (0.0006) [2023-03-07 09:32:52,363][155452] Updated weights for policy 0, policy_version 89540 (0.0007) [2023-03-07 09:32:53,146][155452] Updated weights for policy 0, policy_version 89550 (0.0006) [2023-03-07 09:32:53,367][155126] Fps is (10 sec: 12902.3, 60 sec: 13004.8, 300 sec: 13017.0). Total num frames: 91701248. Throughput: 0: 13008.7. Samples: 91681789. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:32:53,367][155126] Avg episode reward: [(0, '2137.141')] [2023-03-07 09:32:53,934][155452] Updated weights for policy 0, policy_version 89560 (0.0006) [2023-03-07 09:32:54,719][155452] Updated weights for policy 0, policy_version 89570 (0.0006) [2023-03-07 09:32:55,504][155452] Updated weights for policy 0, policy_version 89580 (0.0006) [2023-03-07 09:32:56,309][155452] Updated weights for policy 0, policy_version 89590 (0.0006) [2023-03-07 09:32:57,092][155452] Updated weights for policy 0, policy_version 89600 (0.0006) [2023-03-07 09:32:57,870][155452] Updated weights for policy 0, policy_version 89610 (0.0007) [2023-03-07 09:32:58,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13004.8, 300 sec: 13020.4). Total num frames: 91766784. Throughput: 0: 13001.2. Samples: 91759645. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:32:58,367][155126] Avg episode reward: [(0, '2042.063')] [2023-03-07 09:32:58,667][155452] Updated weights for policy 0, policy_version 89620 (0.0006) [2023-03-07 09:32:59,441][155452] Updated weights for policy 0, policy_version 89630 (0.0005) [2023-03-07 09:33:00,236][155452] Updated weights for policy 0, policy_version 89640 (0.0006) [2023-03-07 09:33:00,998][155452] Updated weights for policy 0, policy_version 89650 (0.0006) [2023-03-07 09:33:01,795][155452] Updated weights for policy 0, policy_version 89660 (0.0006) [2023-03-07 09:33:02,590][155452] Updated weights for policy 0, policy_version 89670 (0.0006) [2023-03-07 09:33:03,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13016.9). Total num frames: 91831296. Throughput: 0: 13004.9. Samples: 91798741. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:33:03,368][155126] Avg episode reward: [(0, '2098.316')] [2023-03-07 09:33:03,385][155452] Updated weights for policy 0, policy_version 89680 (0.0007) [2023-03-07 09:33:04,162][155452] Updated weights for policy 0, policy_version 89690 (0.0006) [2023-03-07 09:33:04,948][155452] Updated weights for policy 0, policy_version 89700 (0.0006) [2023-03-07 09:33:05,723][155452] Updated weights for policy 0, policy_version 89710 (0.0005) [2023-03-07 09:33:06,505][155452] Updated weights for policy 0, policy_version 89720 (0.0006) [2023-03-07 09:33:07,290][155452] Updated weights for policy 0, policy_version 89730 (0.0006) [2023-03-07 09:33:08,080][155452] Updated weights for policy 0, policy_version 89740 (0.0006) [2023-03-07 09:33:08,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13020.4). Total num frames: 91896832. Throughput: 0: 13003.3. Samples: 91876918. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:33:08,367][155126] Avg episode reward: [(0, '2204.656')] [2023-03-07 09:33:08,873][155452] Updated weights for policy 0, policy_version 89750 (0.0006) [2023-03-07 09:33:09,635][155452] Updated weights for policy 0, policy_version 89760 (0.0006) [2023-03-07 09:33:10,435][155452] Updated weights for policy 0, policy_version 89770 (0.0006) [2023-03-07 09:33:11,226][155452] Updated weights for policy 0, policy_version 89780 (0.0006) [2023-03-07 09:33:12,008][155452] Updated weights for policy 0, policy_version 89790 (0.0006) [2023-03-07 09:33:12,800][155452] Updated weights for policy 0, policy_version 89800 (0.0006) [2023-03-07 09:33:13,367][155126] Fps is (10 sec: 13107.4, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 91962368. Throughput: 0: 13008.5. Samples: 91955180. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:33:13,367][155126] Avg episode reward: [(0, '2154.460')] [2023-03-07 09:33:13,573][155452] Updated weights for policy 0, policy_version 89810 (0.0006) [2023-03-07 09:33:14,363][155452] Updated weights for policy 0, policy_version 89820 (0.0006) [2023-03-07 09:33:15,166][155452] Updated weights for policy 0, policy_version 89830 (0.0005) [2023-03-07 09:33:15,955][155452] Updated weights for policy 0, policy_version 89840 (0.0005) [2023-03-07 09:33:16,734][155452] Updated weights for policy 0, policy_version 89850 (0.0006) [2023-03-07 09:33:17,531][155452] Updated weights for policy 0, policy_version 89860 (0.0005) [2023-03-07 09:33:18,318][155452] Updated weights for policy 0, policy_version 89870 (0.0006) [2023-03-07 09:33:18,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13004.8, 300 sec: 13020.4). Total num frames: 92026880. Throughput: 0: 13009.2. Samples: 91994204. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:33:18,367][155126] Avg episode reward: [(0, '2055.377')] [2023-03-07 09:33:19,108][155452] Updated weights for policy 0, policy_version 89880 (0.0006) [2023-03-07 09:33:19,894][155452] Updated weights for policy 0, policy_version 89890 (0.0005) [2023-03-07 09:33:20,668][155452] Updated weights for policy 0, policy_version 89900 (0.0006) [2023-03-07 09:33:21,461][155452] Updated weights for policy 0, policy_version 89910 (0.0006) [2023-03-07 09:33:22,249][155452] Updated weights for policy 0, policy_version 89920 (0.0006) [2023-03-07 09:33:23,037][155452] Updated weights for policy 0, policy_version 89930 (0.0006) [2023-03-07 09:33:23,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13004.8, 300 sec: 13020.4). Total num frames: 92092416. Throughput: 0: 13010.3. Samples: 92072132. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:33:23,368][155126] Avg episode reward: [(0, '2040.184')] [2023-03-07 09:33:23,817][155452] Updated weights for policy 0, policy_version 89940 (0.0006) [2023-03-07 09:33:24,588][155452] Updated weights for policy 0, policy_version 89950 (0.0005) [2023-03-07 09:33:25,384][155452] Updated weights for policy 0, policy_version 89960 (0.0006) [2023-03-07 09:33:26,156][155452] Updated weights for policy 0, policy_version 89970 (0.0006) [2023-03-07 09:33:26,950][155452] Updated weights for policy 0, policy_version 89980 (0.0006) [2023-03-07 09:33:27,741][155452] Updated weights for policy 0, policy_version 89990 (0.0007) [2023-03-07 09:33:28,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13004.8, 300 sec: 13016.9). Total num frames: 92156928. Throughput: 0: 13009.7. Samples: 92150409. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:33:28,368][155126] Avg episode reward: [(0, '2058.025')] [2023-03-07 09:33:28,381][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000089998_92157952.pth... 
[2023-03-07 09:33:28,412][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000086946_89032704.pth [2023-03-07 09:33:28,538][155452] Updated weights for policy 0, policy_version 90000 (0.0006) [2023-03-07 09:33:29,313][155452] Updated weights for policy 0, policy_version 90010 (0.0006) [2023-03-07 09:33:30,098][155452] Updated weights for policy 0, policy_version 90020 (0.0006) [2023-03-07 09:33:30,886][155452] Updated weights for policy 0, policy_version 90030 (0.0006) [2023-03-07 09:33:31,664][155452] Updated weights for policy 0, policy_version 90040 (0.0007) [2023-03-07 09:33:32,446][155452] Updated weights for policy 0, policy_version 90050 (0.0006) [2023-03-07 09:33:33,235][155452] Updated weights for policy 0, policy_version 90060 (0.0006) [2023-03-07 09:33:33,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.8, 300 sec: 13020.4). Total num frames: 92222464. Throughput: 0: 13018.0. Samples: 92189468. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:33:33,368][155126] Avg episode reward: [(0, '2107.776')] [2023-03-07 09:33:34,033][155452] Updated weights for policy 0, policy_version 90070 (0.0006) [2023-03-07 09:33:34,825][155452] Updated weights for policy 0, policy_version 90080 (0.0006) [2023-03-07 09:33:35,605][155452] Updated weights for policy 0, policy_version 90090 (0.0006) [2023-03-07 09:33:36,384][155452] Updated weights for policy 0, policy_version 90100 (0.0006) [2023-03-07 09:33:37,180][155452] Updated weights for policy 0, policy_version 90110 (0.0006) [2023-03-07 09:33:37,961][155452] Updated weights for policy 0, policy_version 90120 (0.0006) [2023-03-07 09:33:38,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13021.8, 300 sec: 13020.4). Total num frames: 92288000. Throughput: 0: 13022.0. Samples: 92267783. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:33:38,368][155126] Avg episode reward: [(0, '2031.883')] [2023-03-07 09:33:38,757][155452] Updated weights for policy 0, policy_version 90130 (0.0006) [2023-03-07 09:33:39,529][155452] Updated weights for policy 0, policy_version 90140 (0.0007) [2023-03-07 09:33:40,320][155452] Updated weights for policy 0, policy_version 90150 (0.0006) [2023-03-07 09:33:41,097][155452] Updated weights for policy 0, policy_version 90160 (0.0006) [2023-03-07 09:33:41,880][155452] Updated weights for policy 0, policy_version 90170 (0.0006) [2023-03-07 09:33:42,665][155452] Updated weights for policy 0, policy_version 90180 (0.0006) [2023-03-07 09:33:43,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13021.9, 300 sec: 13023.9). Total num frames: 92353536. Throughput: 0: 13030.6. Samples: 92346020. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:33:43,367][155126] Avg episode reward: [(0, '1975.610')] [2023-03-07 09:33:43,438][155452] Updated weights for policy 0, policy_version 90190 (0.0006) [2023-03-07 09:33:44,231][155452] Updated weights for policy 0, policy_version 90200 (0.0007) [2023-03-07 09:33:45,042][155452] Updated weights for policy 0, policy_version 90210 (0.0006) [2023-03-07 09:33:45,797][155452] Updated weights for policy 0, policy_version 90220 (0.0005) [2023-03-07 09:33:46,610][155452] Updated weights for policy 0, policy_version 90230 (0.0007) [2023-03-07 09:33:47,401][155452] Updated weights for policy 0, policy_version 90240 (0.0007) [2023-03-07 09:33:48,202][155452] Updated weights for policy 0, policy_version 90250 (0.0006) [2023-03-07 09:33:48,367][155126] Fps is (10 sec: 13005.1, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 92418048. Throughput: 0: 13030.7. Samples: 92385122. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:33:48,367][155126] Avg episode reward: [(0, '2056.161')] [2023-03-07 09:33:48,985][155452] Updated weights for policy 0, policy_version 90260 (0.0006) [2023-03-07 09:33:49,781][155452] Updated weights for policy 0, policy_version 90270 (0.0006) [2023-03-07 09:33:50,565][155452] Updated weights for policy 0, policy_version 90280 (0.0006) [2023-03-07 09:33:51,354][155452] Updated weights for policy 0, policy_version 90290 (0.0007) [2023-03-07 09:33:52,148][155452] Updated weights for policy 0, policy_version 90300 (0.0006) [2023-03-07 09:33:52,922][155452] Updated weights for policy 0, policy_version 90310 (0.0006) [2023-03-07 09:33:53,367][155126] Fps is (10 sec: 12902.4, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 92482560. Throughput: 0: 13014.6. Samples: 92462575. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:33:53,367][155126] Avg episode reward: [(0, '2000.814')] [2023-03-07 09:33:53,726][155452] Updated weights for policy 0, policy_version 90320 (0.0006) [2023-03-07 09:33:54,505][155452] Updated weights for policy 0, policy_version 90330 (0.0007) [2023-03-07 09:33:55,297][155452] Updated weights for policy 0, policy_version 90340 (0.0006) [2023-03-07 09:33:56,097][155452] Updated weights for policy 0, policy_version 90350 (0.0007) [2023-03-07 09:33:56,873][155452] Updated weights for policy 0, policy_version 90360 (0.0007) [2023-03-07 09:33:57,657][155452] Updated weights for policy 0, policy_version 90370 (0.0007) [2023-03-07 09:33:58,367][155126] Fps is (10 sec: 12902.3, 60 sec: 13004.8, 300 sec: 13016.9). Total num frames: 92547072. Throughput: 0: 13010.1. Samples: 92540634. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:33:58,367][155126] Avg episode reward: [(0, '2101.209')] [2023-03-07 09:33:58,458][155452] Updated weights for policy 0, policy_version 90380 (0.0006) [2023-03-07 09:33:59,244][155452] Updated weights for policy 0, policy_version 90390 (0.0008) [2023-03-07 09:34:00,034][155452] Updated weights for policy 0, policy_version 90400 (0.0006) [2023-03-07 09:34:00,821][155452] Updated weights for policy 0, policy_version 90410 (0.0006) [2023-03-07 09:34:01,587][155452] Updated weights for policy 0, policy_version 90420 (0.0006) [2023-03-07 09:34:02,378][155452] Updated weights for policy 0, policy_version 90430 (0.0006) [2023-03-07 09:34:03,169][155452] Updated weights for policy 0, policy_version 90440 (0.0006) [2023-03-07 09:34:03,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 92612608. Throughput: 0: 13007.3. Samples: 92579532. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:34:03,368][155126] Avg episode reward: [(0, '2089.474')] [2023-03-07 09:34:03,953][155452] Updated weights for policy 0, policy_version 90450 (0.0005) [2023-03-07 09:34:04,739][155452] Updated weights for policy 0, policy_version 90460 (0.0006) [2023-03-07 09:34:05,525][155452] Updated weights for policy 0, policy_version 90470 (0.0006) [2023-03-07 09:34:06,299][155452] Updated weights for policy 0, policy_version 90480 (0.0006) [2023-03-07 09:34:07,081][155452] Updated weights for policy 0, policy_version 90490 (0.0006) [2023-03-07 09:34:07,851][155452] Updated weights for policy 0, policy_version 90500 (0.0006) [2023-03-07 09:34:08,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 92678144. Throughput: 0: 13017.7. Samples: 92657931. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:34:08,367][155126] Avg episode reward: [(0, '1933.166')] [2023-03-07 09:34:08,640][155452] Updated weights for policy 0, policy_version 90510 (0.0007) [2023-03-07 09:34:09,426][155452] Updated weights for policy 0, policy_version 90520 (0.0006) [2023-03-07 09:34:10,226][155452] Updated weights for policy 0, policy_version 90530 (0.0005) [2023-03-07 09:34:11,020][155452] Updated weights for policy 0, policy_version 90540 (0.0006) [2023-03-07 09:34:11,815][155452] Updated weights for policy 0, policy_version 90550 (0.0006) [2023-03-07 09:34:12,586][155452] Updated weights for policy 0, policy_version 90560 (0.0006) [2023-03-07 09:34:13,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13021.9, 300 sec: 13023.9). Total num frames: 92743680. Throughput: 0: 13014.1. Samples: 92736040. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:34:13,367][155126] Avg episode reward: [(0, '2153.205')] [2023-03-07 09:34:13,369][155452] Updated weights for policy 0, policy_version 90570 (0.0005) [2023-03-07 09:34:14,163][155452] Updated weights for policy 0, policy_version 90580 (0.0006) [2023-03-07 09:34:14,952][155452] Updated weights for policy 0, policy_version 90590 (0.0006) [2023-03-07 09:34:15,727][155452] Updated weights for policy 0, policy_version 90600 (0.0005) [2023-03-07 09:34:16,506][155452] Updated weights for policy 0, policy_version 90610 (0.0006) [2023-03-07 09:34:17,295][155452] Updated weights for policy 0, policy_version 90620 (0.0006) [2023-03-07 09:34:18,068][155452] Updated weights for policy 0, policy_version 90630 (0.0006) [2023-03-07 09:34:18,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 92808192. Throughput: 0: 13016.9. Samples: 92775230. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:34:18,367][155126] Avg episode reward: [(0, '2160.162')] [2023-03-07 09:34:18,860][155452] Updated weights for policy 0, policy_version 90640 (0.0005) [2023-03-07 09:34:19,664][155452] Updated weights for policy 0, policy_version 90650 (0.0006) [2023-03-07 09:34:20,429][155452] Updated weights for policy 0, policy_version 90660 (0.0006) [2023-03-07 09:34:21,217][155452] Updated weights for policy 0, policy_version 90670 (0.0006) [2023-03-07 09:34:22,001][155452] Updated weights for policy 0, policy_version 90680 (0.0007) [2023-03-07 09:34:22,766][155452] Updated weights for policy 0, policy_version 90690 (0.0006) [2023-03-07 09:34:23,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 92873728. Throughput: 0: 13019.0. Samples: 92853637. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:34:23,368][155126] Avg episode reward: [(0, '2027.613')] [2023-03-07 09:34:23,561][155452] Updated weights for policy 0, policy_version 90700 (0.0007) [2023-03-07 09:34:24,344][155452] Updated weights for policy 0, policy_version 90710 (0.0006) [2023-03-07 09:34:25,124][155452] Updated weights for policy 0, policy_version 90720 (0.0006) [2023-03-07 09:34:25,906][155452] Updated weights for policy 0, policy_version 90730 (0.0006) [2023-03-07 09:34:26,682][155452] Updated weights for policy 0, policy_version 90740 (0.0005) [2023-03-07 09:34:27,477][155452] Updated weights for policy 0, policy_version 90750 (0.0007) [2023-03-07 09:34:28,261][155452] Updated weights for policy 0, policy_version 90760 (0.0006) [2023-03-07 09:34:28,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13039.0, 300 sec: 13023.9). Total num frames: 92939264. Throughput: 0: 13024.4. Samples: 92932120. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:34:28,367][155126] Avg episode reward: [(0, '1925.727')] [2023-03-07 09:34:29,064][155452] Updated weights for policy 0, policy_version 90770 (0.0006) [2023-03-07 09:34:29,843][155452] Updated weights for policy 0, policy_version 90780 (0.0007) [2023-03-07 09:34:30,617][155452] Updated weights for policy 0, policy_version 90790 (0.0005) [2023-03-07 09:34:31,394][155452] Updated weights for policy 0, policy_version 90800 (0.0006) [2023-03-07 09:34:32,179][155452] Updated weights for policy 0, policy_version 90810 (0.0006) [2023-03-07 09:34:32,966][155452] Updated weights for policy 0, policy_version 90820 (0.0006) [2023-03-07 09:34:33,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13023.9). Total num frames: 93004800. Throughput: 0: 13025.5. Samples: 92971269. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:34:33,378][155126] Avg episode reward: [(0, '2066.684')] [2023-03-07 09:34:33,749][155452] Updated weights for policy 0, policy_version 90830 (0.0007) [2023-03-07 09:34:34,539][155452] Updated weights for policy 0, policy_version 90840 (0.0006) [2023-03-07 09:34:35,346][155452] Updated weights for policy 0, policy_version 90850 (0.0006) [2023-03-07 09:34:36,117][155452] Updated weights for policy 0, policy_version 90860 (0.0006) [2023-03-07 09:34:36,916][155452] Updated weights for policy 0, policy_version 90870 (0.0007) [2023-03-07 09:34:37,694][155452] Updated weights for policy 0, policy_version 90880 (0.0006) [2023-03-07 09:34:38,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13023.9). Total num frames: 93069312. Throughput: 0: 13038.9. Samples: 93049327. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:34:38,367][155126] Avg episode reward: [(0, '1876.106')] [2023-03-07 09:34:38,501][155452] Updated weights for policy 0, policy_version 90890 (0.0006) [2023-03-07 09:34:39,274][155452] Updated weights for policy 0, policy_version 90900 (0.0006) [2023-03-07 09:34:40,059][155452] Updated weights for policy 0, policy_version 90910 (0.0006) [2023-03-07 09:34:40,857][155452] Updated weights for policy 0, policy_version 90920 (0.0006) [2023-03-07 09:34:41,630][155452] Updated weights for policy 0, policy_version 90930 (0.0006) [2023-03-07 09:34:42,414][155452] Updated weights for policy 0, policy_version 90940 (0.0007) [2023-03-07 09:34:43,199][155452] Updated weights for policy 0, policy_version 90950 (0.0006) [2023-03-07 09:34:43,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13023.9). Total num frames: 93134848. Throughput: 0: 13042.2. Samples: 93127534. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:34:43,367][155126] Avg episode reward: [(0, '2058.459')] [2023-03-07 09:34:43,979][155452] Updated weights for policy 0, policy_version 90960 (0.0006) [2023-03-07 09:34:44,753][155452] Updated weights for policy 0, policy_version 90970 (0.0006) [2023-03-07 09:34:45,541][155452] Updated weights for policy 0, policy_version 90980 (0.0007) [2023-03-07 09:34:46,341][155452] Updated weights for policy 0, policy_version 90990 (0.0006) [2023-03-07 09:34:47,125][155452] Updated weights for policy 0, policy_version 91000 (0.0006) [2023-03-07 09:34:47,914][155452] Updated weights for policy 0, policy_version 91010 (0.0006) [2023-03-07 09:34:48,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 93199360. Throughput: 0: 13048.5. Samples: 93166715. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:34:48,367][155126] Avg episode reward: [(0, '2090.355')] [2023-03-07 09:34:48,706][155452] Updated weights for policy 0, policy_version 91020 (0.0006) [2023-03-07 09:34:49,485][155452] Updated weights for policy 0, policy_version 91030 (0.0006) [2023-03-07 09:34:50,274][155452] Updated weights for policy 0, policy_version 91040 (0.0006) [2023-03-07 09:34:51,068][155452] Updated weights for policy 0, policy_version 91050 (0.0008) [2023-03-07 09:34:51,843][155452] Updated weights for policy 0, policy_version 91060 (0.0007) [2023-03-07 09:34:52,634][155452] Updated weights for policy 0, policy_version 91070 (0.0006) [2023-03-07 09:34:53,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13038.9, 300 sec: 13023.9). Total num frames: 93264896. Throughput: 0: 13039.9. Samples: 93244730. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:34:53,368][155126] Avg episode reward: [(0, '2113.497')] [2023-03-07 09:34:53,418][155452] Updated weights for policy 0, policy_version 91080 (0.0006) [2023-03-07 09:34:54,192][155452] Updated weights for policy 0, policy_version 91090 (0.0006) [2023-03-07 09:34:54,955][155452] Updated weights for policy 0, policy_version 91100 (0.0006) [2023-03-07 09:34:55,757][155452] Updated weights for policy 0, policy_version 91110 (0.0006) [2023-03-07 09:34:56,522][155452] Updated weights for policy 0, policy_version 91120 (0.0006) [2023-03-07 09:34:57,315][155452] Updated weights for policy 0, policy_version 91130 (0.0006) [2023-03-07 09:34:58,107][155452] Updated weights for policy 0, policy_version 91140 (0.0006) [2023-03-07 09:34:58,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13023.9). Total num frames: 93330432. Throughput: 0: 13051.3. Samples: 93323348. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:34:58,367][155126] Avg episode reward: [(0, '1912.674')] [2023-03-07 09:34:58,891][155452] Updated weights for policy 0, policy_version 91150 (0.0006) [2023-03-07 09:34:59,682][155452] Updated weights for policy 0, policy_version 91160 (0.0007) [2023-03-07 09:35:00,482][155452] Updated weights for policy 0, policy_version 91170 (0.0006) [2023-03-07 09:35:01,262][155452] Updated weights for policy 0, policy_version 91180 (0.0006) [2023-03-07 09:35:02,056][155452] Updated weights for policy 0, policy_version 91190 (0.0006) [2023-03-07 09:35:02,838][155452] Updated weights for policy 0, policy_version 91200 (0.0007) [2023-03-07 09:35:03,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13020.4). Total num frames: 93394944. Throughput: 0: 13042.2. Samples: 93362130. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:35:03,367][155126] Avg episode reward: [(0, '1789.169')] [2023-03-07 09:35:03,632][155452] Updated weights for policy 0, policy_version 91210 (0.0006) [2023-03-07 09:35:04,402][155452] Updated weights for policy 0, policy_version 91220 (0.0006) [2023-03-07 09:35:05,184][155452] Updated weights for policy 0, policy_version 91230 (0.0006) [2023-03-07 09:35:05,973][155452] Updated weights for policy 0, policy_version 91240 (0.0007) [2023-03-07 09:35:06,765][155452] Updated weights for policy 0, policy_version 91250 (0.0007) [2023-03-07 09:35:07,535][155452] Updated weights for policy 0, policy_version 91260 (0.0006) [2023-03-07 09:35:08,317][155452] Updated weights for policy 0, policy_version 91270 (0.0006) [2023-03-07 09:35:08,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13020.4). Total num frames: 93460480. Throughput: 0: 13039.7. Samples: 93440422. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:35:08,367][155126] Avg episode reward: [(0, '2069.373')] [2023-03-07 09:35:09,103][155452] Updated weights for policy 0, policy_version 91280 (0.0008) [2023-03-07 09:35:09,889][155452] Updated weights for policy 0, policy_version 91290 (0.0006) [2023-03-07 09:35:10,679][155452] Updated weights for policy 0, policy_version 91300 (0.0005) [2023-03-07 09:35:11,458][155452] Updated weights for policy 0, policy_version 91310 (0.0006) [2023-03-07 09:35:12,264][155452] Updated weights for policy 0, policy_version 91320 (0.0006) [2023-03-07 09:35:13,041][155452] Updated weights for policy 0, policy_version 91330 (0.0007) [2023-03-07 09:35:13,367][155126] Fps is (10 sec: 13107.4, 60 sec: 13038.9, 300 sec: 13023.9). Total num frames: 93526016. Throughput: 0: 13032.8. Samples: 93518596. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 09:35:13,367][155126] Avg episode reward: [(0, '1979.734')] [2023-03-07 09:35:13,844][155452] Updated weights for policy 0, policy_version 91340 (0.0006) [2023-03-07 09:35:14,632][155452] Updated weights for policy 0, policy_version 91350 (0.0006) [2023-03-07 09:35:15,426][155452] Updated weights for policy 0, policy_version 91360 (0.0007) [2023-03-07 09:35:16,202][155452] Updated weights for policy 0, policy_version 91370 (0.0006) [2023-03-07 09:35:17,006][155452] Updated weights for policy 0, policy_version 91380 (0.0006) [2023-03-07 09:35:17,794][155452] Updated weights for policy 0, policy_version 91390 (0.0005) [2023-03-07 09:35:18,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13039.0, 300 sec: 13020.4). Total num frames: 93590528. Throughput: 0: 13025.4. Samples: 93557412. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 09:35:18,367][155126] Avg episode reward: [(0, '1890.610')] [2023-03-07 09:35:18,562][155452] Updated weights for policy 0, policy_version 91400 (0.0006) [2023-03-07 09:35:19,358][155452] Updated weights for policy 0, policy_version 91410 (0.0005) [2023-03-07 09:35:20,141][155452] Updated weights for policy 0, policy_version 91420 (0.0006) [2023-03-07 09:35:20,923][155452] Updated weights for policy 0, policy_version 91430 (0.0007) [2023-03-07 09:35:21,702][155452] Updated weights for policy 0, policy_version 91440 (0.0007) [2023-03-07 09:35:22,501][155452] Updated weights for policy 0, policy_version 91450 (0.0006) [2023-03-07 09:35:23,287][155452] Updated weights for policy 0, policy_version 91460 (0.0006) [2023-03-07 09:35:23,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13023.9). Total num frames: 93656064. Throughput: 0: 13030.3. Samples: 93635690. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 09:35:23,367][155126] Avg episode reward: [(0, '2128.013')] [2023-03-07 09:35:24,066][155452] Updated weights for policy 0, policy_version 91470 (0.0006) [2023-03-07 09:35:24,854][155452] Updated weights for policy 0, policy_version 91480 (0.0006) [2023-03-07 09:35:25,625][155452] Updated weights for policy 0, policy_version 91490 (0.0006) [2023-03-07 09:35:26,418][155452] Updated weights for policy 0, policy_version 91500 (0.0006) [2023-03-07 09:35:27,201][155452] Updated weights for policy 0, policy_version 91510 (0.0006) [2023-03-07 09:35:27,985][155452] Updated weights for policy 0, policy_version 91520 (0.0006) [2023-03-07 09:35:28,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 93720576. Throughput: 0: 13031.8. Samples: 93713967. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 09:35:28,367][155126] Avg episode reward: [(0, '1960.515')] [2023-03-07 09:35:28,382][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000091525_93721600.pth... [2023-03-07 09:35:28,412][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000088473_90596352.pth [2023-03-07 09:35:28,777][155452] Updated weights for policy 0, policy_version 91530 (0.0006) [2023-03-07 09:35:29,581][155452] Updated weights for policy 0, policy_version 91540 (0.0005) [2023-03-07 09:35:30,363][155452] Updated weights for policy 0, policy_version 91550 (0.0006) [2023-03-07 09:35:31,143][155452] Updated weights for policy 0, policy_version 91560 (0.0005) [2023-03-07 09:35:31,927][155452] Updated weights for policy 0, policy_version 91570 (0.0007) [2023-03-07 09:35:32,720][155452] Updated weights for policy 0, policy_version 91580 (0.0007) [2023-03-07 09:35:33,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 93786112. Throughput: 0: 13023.7. Samples: 93752782. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 09:35:33,367][155126] Avg episode reward: [(0, '2082.127')] [2023-03-07 09:35:33,498][155452] Updated weights for policy 0, policy_version 91590 (0.0006) [2023-03-07 09:35:34,294][155452] Updated weights for policy 0, policy_version 91600 (0.0006) [2023-03-07 09:35:35,073][155452] Updated weights for policy 0, policy_version 91610 (0.0007) [2023-03-07 09:35:35,859][155452] Updated weights for policy 0, policy_version 91620 (0.0006) [2023-03-07 09:35:36,655][155452] Updated weights for policy 0, policy_version 91630 (0.0006) [2023-03-07 09:35:37,438][155452] Updated weights for policy 0, policy_version 91640 (0.0006) [2023-03-07 09:35:38,232][155452] Updated weights for policy 0, policy_version 91650 (0.0006) [2023-03-07 09:35:38,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 93850624. Throughput: 0: 13028.4. Samples: 93831007. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 09:35:38,368][155126] Avg episode reward: [(0, '2040.771')] [2023-03-07 09:35:39,021][155452] Updated weights for policy 0, policy_version 91660 (0.0007) [2023-03-07 09:35:39,806][155452] Updated weights for policy 0, policy_version 91670 (0.0006) [2023-03-07 09:35:40,604][155452] Updated weights for policy 0, policy_version 91680 (0.0007) [2023-03-07 09:35:41,378][155452] Updated weights for policy 0, policy_version 91690 (0.0006) [2023-03-07 09:35:42,176][155452] Updated weights for policy 0, policy_version 91700 (0.0006) [2023-03-07 09:35:42,975][155452] Updated weights for policy 0, policy_version 91710 (0.0006) [2023-03-07 09:35:43,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 93916160. Throughput: 0: 13008.0. Samples: 93908709. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 09:35:43,367][155126] Avg episode reward: [(0, '1970.187')] [2023-03-07 09:35:43,784][155452] Updated weights for policy 0, policy_version 91720 (0.0006) [2023-03-07 09:35:44,549][155452] Updated weights for policy 0, policy_version 91730 (0.0006) [2023-03-07 09:35:45,355][155452] Updated weights for policy 0, policy_version 91740 (0.0006) [2023-03-07 09:35:46,125][155452] Updated weights for policy 0, policy_version 91750 (0.0006) [2023-03-07 09:35:46,926][155452] Updated weights for policy 0, policy_version 91760 (0.0006) [2023-03-07 09:35:47,709][155452] Updated weights for policy 0, policy_version 91770 (0.0007) [2023-03-07 09:35:48,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13021.9, 300 sec: 13017.0). Total num frames: 93980672. Throughput: 0: 13007.5. Samples: 93947465. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 09:35:48,367][155126] Avg episode reward: [(0, '1976.075')] [2023-03-07 09:35:48,501][155452] Updated weights for policy 0, policy_version 91780 (0.0007) [2023-03-07 09:35:49,279][155452] Updated weights for policy 0, policy_version 91790 (0.0006) [2023-03-07 09:35:50,068][155452] Updated weights for policy 0, policy_version 91800 (0.0006) [2023-03-07 09:35:50,856][155452] Updated weights for policy 0, policy_version 91810 (0.0006) [2023-03-07 09:35:51,667][155452] Updated weights for policy 0, policy_version 91820 (0.0006) [2023-03-07 09:35:52,477][155452] Updated weights for policy 0, policy_version 91830 (0.0006) [2023-03-07 09:35:53,259][155452] Updated weights for policy 0, policy_version 91840 (0.0006) [2023-03-07 09:35:53,367][155126] Fps is (10 sec: 12902.3, 60 sec: 13004.8, 300 sec: 13017.0). Total num frames: 94045184. Throughput: 0: 12999.3. Samples: 94025392. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 09:35:53,367][155126] Avg episode reward: [(0, '1916.477')] [2023-03-07 09:35:54,041][155452] Updated weights for policy 0, policy_version 91850 (0.0005) [2023-03-07 09:35:54,818][155452] Updated weights for policy 0, policy_version 91860 (0.0006) [2023-03-07 09:35:55,578][155452] Updated weights for policy 0, policy_version 91870 (0.0006) [2023-03-07 09:35:56,357][155452] Updated weights for policy 0, policy_version 91880 (0.0006) [2023-03-07 09:35:57,151][155452] Updated weights for policy 0, policy_version 91890 (0.0006) [2023-03-07 09:35:57,929][155452] Updated weights for policy 0, policy_version 91900 (0.0006) [2023-03-07 09:35:58,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13004.8, 300 sec: 13016.9). Total num frames: 94110720. Throughput: 0: 13007.0. Samples: 94103911. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 09:35:58,367][155126] Avg episode reward: [(0, '2152.736')] [2023-03-07 09:35:58,707][155452] Updated weights for policy 0, policy_version 91910 (0.0006) [2023-03-07 09:35:59,501][155452] Updated weights for policy 0, policy_version 91920 (0.0005) [2023-03-07 09:36:00,293][155452] Updated weights for policy 0, policy_version 91930 (0.0006) [2023-03-07 09:36:01,080][155452] Updated weights for policy 0, policy_version 91940 (0.0005) [2023-03-07 09:36:01,878][155452] Updated weights for policy 0, policy_version 91950 (0.0007) [2023-03-07 09:36:02,660][155452] Updated weights for policy 0, policy_version 91960 (0.0006) [2023-03-07 09:36:03,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 94176256. Throughput: 0: 13009.3. Samples: 94142834. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 09:36:03,368][155126] Avg episode reward: [(0, '2014.058')] [2023-03-07 09:36:03,443][155452] Updated weights for policy 0, policy_version 91970 (0.0006) [2023-03-07 09:36:04,233][155452] Updated weights for policy 0, policy_version 91980 (0.0007) [2023-03-07 09:36:05,028][155452] Updated weights for policy 0, policy_version 91990 (0.0006) [2023-03-07 09:36:05,819][155452] Updated weights for policy 0, policy_version 92000 (0.0006) [2023-03-07 09:36:06,590][155452] Updated weights for policy 0, policy_version 92010 (0.0006) [2023-03-07 09:36:07,391][155452] Updated weights for policy 0, policy_version 92020 (0.0007) [2023-03-07 09:36:08,170][155452] Updated weights for policy 0, policy_version 92030 (0.0007) [2023-03-07 09:36:08,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13020.4). Total num frames: 94240768. Throughput: 0: 13002.8. Samples: 94220816. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:36:08,378][155126] Avg episode reward: [(0, '1904.533')] [2023-03-07 09:36:08,948][155452] Updated weights for policy 0, policy_version 92040 (0.0007) [2023-03-07 09:36:09,737][155452] Updated weights for policy 0, policy_version 92050 (0.0006) [2023-03-07 09:36:10,531][155452] Updated weights for policy 0, policy_version 92060 (0.0007) [2023-03-07 09:36:11,333][155452] Updated weights for policy 0, policy_version 92070 (0.0006) [2023-03-07 09:36:12,138][155452] Updated weights for policy 0, policy_version 92080 (0.0007) [2023-03-07 09:36:12,921][155452] Updated weights for policy 0, policy_version 92090 (0.0006) [2023-03-07 09:36:13,367][155126] Fps is (10 sec: 12902.5, 60 sec: 12987.7, 300 sec: 13016.9). Total num frames: 94305280. Throughput: 0: 12990.8. Samples: 94298555. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:36:13,368][155126] Avg episode reward: [(0, '1878.243')] [2023-03-07 09:36:13,709][155452] Updated weights for policy 0, policy_version 92100 (0.0007) [2023-03-07 09:36:14,507][155452] Updated weights for policy 0, policy_version 92110 (0.0006) [2023-03-07 09:36:15,296][155452] Updated weights for policy 0, policy_version 92120 (0.0007) [2023-03-07 09:36:16,081][155452] Updated weights for policy 0, policy_version 92130 (0.0005) [2023-03-07 09:36:16,856][155452] Updated weights for policy 0, policy_version 92140 (0.0006) [2023-03-07 09:36:17,639][155452] Updated weights for policy 0, policy_version 92150 (0.0007) [2023-03-07 09:36:18,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13016.9). Total num frames: 94370816. Throughput: 0: 12991.4. Samples: 94337396. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:36:18,367][155126] Avg episode reward: [(0, '2015.319')] [2023-03-07 09:36:18,421][155452] Updated weights for policy 0, policy_version 92160 (0.0006) [2023-03-07 09:36:19,214][155452] Updated weights for policy 0, policy_version 92170 (0.0006) [2023-03-07 09:36:19,980][155452] Updated weights for policy 0, policy_version 92180 (0.0006) [2023-03-07 09:36:20,776][155452] Updated weights for policy 0, policy_version 92190 (0.0008) [2023-03-07 09:36:21,567][155452] Updated weights for policy 0, policy_version 92200 (0.0006) [2023-03-07 09:36:22,339][155452] Updated weights for policy 0, policy_version 92210 (0.0006) [2023-03-07 09:36:23,112][155452] Updated weights for policy 0, policy_version 92220 (0.0006) [2023-03-07 09:36:23,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13004.8, 300 sec: 13020.4). Total num frames: 94436352. Throughput: 0: 12995.9. Samples: 94415822. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:36:23,368][155126] Avg episode reward: [(0, '1898.539')] [2023-03-07 09:36:23,913][155452] Updated weights for policy 0, policy_version 92230 (0.0007) [2023-03-07 09:36:24,699][155452] Updated weights for policy 0, policy_version 92240 (0.0006) [2023-03-07 09:36:25,481][155452] Updated weights for policy 0, policy_version 92250 (0.0007) [2023-03-07 09:36:26,272][155452] Updated weights for policy 0, policy_version 92260 (0.0006) [2023-03-07 09:36:27,061][155452] Updated weights for policy 0, policy_version 92270 (0.0006) [2023-03-07 09:36:27,829][155452] Updated weights for policy 0, policy_version 92280 (0.0007) [2023-03-07 09:36:28,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 94501888. Throughput: 0: 13014.5. Samples: 94494363. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:36:28,367][155126] Avg episode reward: [(0, '2098.640')] [2023-03-07 09:36:28,626][155452] Updated weights for policy 0, policy_version 92290 (0.0006) [2023-03-07 09:36:29,415][155452] Updated weights for policy 0, policy_version 92300 (0.0006) [2023-03-07 09:36:30,198][155452] Updated weights for policy 0, policy_version 92310 (0.0006) [2023-03-07 09:36:30,994][155452] Updated weights for policy 0, policy_version 92320 (0.0006) [2023-03-07 09:36:31,785][155452] Updated weights for policy 0, policy_version 92330 (0.0007) [2023-03-07 09:36:32,552][155452] Updated weights for policy 0, policy_version 92340 (0.0006) [2023-03-07 09:36:33,339][155452] Updated weights for policy 0, policy_version 92350 (0.0006) [2023-03-07 09:36:33,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13020.4). Total num frames: 94566400. Throughput: 0: 13016.7. Samples: 94533218. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:36:33,367][155126] Avg episode reward: [(0, '1984.606')] [2023-03-07 09:36:34,126][155452] Updated weights for policy 0, policy_version 92360 (0.0006) [2023-03-07 09:36:34,914][155452] Updated weights for policy 0, policy_version 92370 (0.0005) [2023-03-07 09:36:35,710][155452] Updated weights for policy 0, policy_version 92380 (0.0006) [2023-03-07 09:36:36,490][155452] Updated weights for policy 0, policy_version 92390 (0.0007) [2023-03-07 09:36:37,287][155452] Updated weights for policy 0, policy_version 92400 (0.0006) [2023-03-07 09:36:38,066][155452] Updated weights for policy 0, policy_version 92410 (0.0006) [2023-03-07 09:36:38,367][155126] Fps is (10 sec: 12902.1, 60 sec: 13004.8, 300 sec: 13016.9). Total num frames: 94630912. Throughput: 0: 13017.6. Samples: 94611185. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:36:38,378][155126] Avg episode reward: [(0, '1981.566')] [2023-03-07 09:36:38,836][155452] Updated weights for policy 0, policy_version 92420 (0.0006) [2023-03-07 09:36:39,638][155452] Updated weights for policy 0, policy_version 92430 (0.0006) [2023-03-07 09:36:40,402][155452] Updated weights for policy 0, policy_version 92440 (0.0007) [2023-03-07 09:36:41,174][155452] Updated weights for policy 0, policy_version 92450 (0.0006) [2023-03-07 09:36:41,975][155452] Updated weights for policy 0, policy_version 92460 (0.0005) [2023-03-07 09:36:42,774][155452] Updated weights for policy 0, policy_version 92470 (0.0006) [2023-03-07 09:36:43,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13016.9). Total num frames: 94696448. Throughput: 0: 13016.2. Samples: 94689639. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:36:43,367][155126] Avg episode reward: [(0, '1802.415')] [2023-03-07 09:36:43,557][155452] Updated weights for policy 0, policy_version 92480 (0.0007) [2023-03-07 09:36:44,348][155452] Updated weights for policy 0, policy_version 92490 (0.0006) [2023-03-07 09:36:45,130][155452] Updated weights for policy 0, policy_version 92500 (0.0006) [2023-03-07 09:36:45,926][155452] Updated weights for policy 0, policy_version 92510 (0.0005) [2023-03-07 09:36:46,697][155452] Updated weights for policy 0, policy_version 92520 (0.0006) [2023-03-07 09:36:47,512][155452] Updated weights for policy 0, policy_version 92530 (0.0007) [2023-03-07 09:36:48,273][155452] Updated weights for policy 0, policy_version 92540 (0.0006) [2023-03-07 09:36:48,367][155126] Fps is (10 sec: 13107.5, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 94761984. Throughput: 0: 13017.6. Samples: 94728623. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:36:48,367][155126] Avg episode reward: [(0, '1944.278')] [2023-03-07 09:36:49,068][155452] Updated weights for policy 0, policy_version 92550 (0.0007) [2023-03-07 09:36:49,855][155452] Updated weights for policy 0, policy_version 92560 (0.0006) [2023-03-07 09:36:50,644][155452] Updated weights for policy 0, policy_version 92570 (0.0006) [2023-03-07 09:36:51,414][155452] Updated weights for policy 0, policy_version 92580 (0.0006) [2023-03-07 09:36:52,187][155452] Updated weights for policy 0, policy_version 92590 (0.0006) [2023-03-07 09:36:52,979][155452] Updated weights for policy 0, policy_version 92600 (0.0006) [2023-03-07 09:36:53,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13038.9, 300 sec: 13020.4). Total num frames: 94827520. Throughput: 0: 13021.5. Samples: 94806782. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:36:53,367][155126] Avg episode reward: [(0, '1806.691')] [2023-03-07 09:36:53,775][155452] Updated weights for policy 0, policy_version 92610 (0.0006) [2023-03-07 09:36:54,570][155452] Updated weights for policy 0, policy_version 92620 (0.0006) [2023-03-07 09:36:55,343][155452] Updated weights for policy 0, policy_version 92630 (0.0006) [2023-03-07 09:36:56,129][155452] Updated weights for policy 0, policy_version 92640 (0.0006) [2023-03-07 09:36:56,916][155452] Updated weights for policy 0, policy_version 92650 (0.0007) [2023-03-07 09:36:57,703][155452] Updated weights for policy 0, policy_version 92660 (0.0006) [2023-03-07 09:36:58,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 94892032. Throughput: 0: 13030.0. Samples: 94884907. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:36:58,367][155126] Avg episode reward: [(0, '2100.857')] [2023-03-07 09:36:58,483][155452] Updated weights for policy 0, policy_version 92670 (0.0006) [2023-03-07 09:36:59,279][155452] Updated weights for policy 0, policy_version 92680 (0.0006) [2023-03-07 09:37:00,058][155452] Updated weights for policy 0, policy_version 92690 (0.0006) [2023-03-07 09:37:00,846][155452] Updated weights for policy 0, policy_version 92700 (0.0005) [2023-03-07 09:37:01,627][155452] Updated weights for policy 0, policy_version 92710 (0.0006) [2023-03-07 09:37:02,429][155452] Updated weights for policy 0, policy_version 92720 (0.0007) [2023-03-07 09:37:03,219][155452] Updated weights for policy 0, policy_version 92730 (0.0006) [2023-03-07 09:37:03,367][155126] Fps is (10 sec: 12902.4, 60 sec: 13004.8, 300 sec: 13017.0). Total num frames: 94956544. Throughput: 0: 13034.9. Samples: 94923966. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:37:03,367][155126] Avg episode reward: [(0, '2030.058')] [2023-03-07 09:37:04,010][155452] Updated weights for policy 0, policy_version 92740 (0.0006) [2023-03-07 09:37:04,793][155452] Updated weights for policy 0, policy_version 92750 (0.0006) [2023-03-07 09:37:05,581][155452] Updated weights for policy 0, policy_version 92760 (0.0006) [2023-03-07 09:37:06,360][155452] Updated weights for policy 0, policy_version 92770 (0.0006) [2023-03-07 09:37:07,151][155452] Updated weights for policy 0, policy_version 92780 (0.0006) [2023-03-07 09:37:07,957][155452] Updated weights for policy 0, policy_version 92790 (0.0007) [2023-03-07 09:37:08,367][155126] Fps is (10 sec: 13005.0, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 95022080. Throughput: 0: 13025.2. Samples: 95001953. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:37:08,367][155126] Avg episode reward: [(0, '2063.865')] [2023-03-07 09:37:08,717][155452] Updated weights for policy 0, policy_version 92800 (0.0006) [2023-03-07 09:37:09,498][155452] Updated weights for policy 0, policy_version 92810 (0.0006) [2023-03-07 09:37:10,285][155452] Updated weights for policy 0, policy_version 92820 (0.0006) [2023-03-07 09:37:11,060][155452] Updated weights for policy 0, policy_version 92830 (0.0007) [2023-03-07 09:37:11,841][155452] Updated weights for policy 0, policy_version 92840 (0.0007) [2023-03-07 09:37:12,628][155452] Updated weights for policy 0, policy_version 92850 (0.0006) [2023-03-07 09:37:13,367][155126] Fps is (10 sec: 13107.1, 60 sec: 13038.9, 300 sec: 13020.4). Total num frames: 95087616. Throughput: 0: 13026.6. Samples: 95080560. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:37:13,367][155126] Avg episode reward: [(0, '1964.782')] [2023-03-07 09:37:13,402][155452] Updated weights for policy 0, policy_version 92860 (0.0006) [2023-03-07 09:37:14,202][155452] Updated weights for policy 0, policy_version 92870 (0.0007) [2023-03-07 09:37:14,997][155452] Updated weights for policy 0, policy_version 92880 (0.0006) [2023-03-07 09:37:15,770][155452] Updated weights for policy 0, policy_version 92890 (0.0007) [2023-03-07 09:37:16,570][155452] Updated weights for policy 0, policy_version 92900 (0.0007) [2023-03-07 09:37:17,349][155452] Updated weights for policy 0, policy_version 92910 (0.0007) [2023-03-07 09:37:18,121][155452] Updated weights for policy 0, policy_version 92920 (0.0006) [2023-03-07 09:37:18,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13039.0, 300 sec: 13020.4). Total num frames: 95153152. Throughput: 0: 13028.7. Samples: 95119509. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:37:18,367][155126] Avg episode reward: [(0, '2040.729')] [2023-03-07 09:37:18,894][155452] Updated weights for policy 0, policy_version 92930 (0.0005) [2023-03-07 09:37:19,685][155452] Updated weights for policy 0, policy_version 92940 (0.0006) [2023-03-07 09:37:20,478][155452] Updated weights for policy 0, policy_version 92950 (0.0007) [2023-03-07 09:37:21,237][155452] Updated weights for policy 0, policy_version 92960 (0.0006) [2023-03-07 09:37:22,030][155452] Updated weights for policy 0, policy_version 92970 (0.0005) [2023-03-07 09:37:22,813][155452] Updated weights for policy 0, policy_version 92980 (0.0006) [2023-03-07 09:37:23,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13038.9, 300 sec: 13023.9). Total num frames: 95218688. Throughput: 0: 13045.3. Samples: 95198221. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:37:23,368][155126] Avg episode reward: [(0, '2064.808')] [2023-03-07 09:37:23,597][155452] Updated weights for policy 0, policy_version 92990 (0.0006) [2023-03-07 09:37:24,369][155452] Updated weights for policy 0, policy_version 93000 (0.0006) [2023-03-07 09:37:25,166][155452] Updated weights for policy 0, policy_version 93010 (0.0006) [2023-03-07 09:37:25,960][155452] Updated weights for policy 0, policy_version 93020 (0.0006) [2023-03-07 09:37:26,757][155452] Updated weights for policy 0, policy_version 93030 (0.0006) [2023-03-07 09:37:27,550][155452] Updated weights for policy 0, policy_version 93040 (0.0006) [2023-03-07 09:37:28,312][155452] Updated weights for policy 0, policy_version 93050 (0.0006) [2023-03-07 09:37:28,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.8, 300 sec: 13023.9). Total num frames: 95283200. Throughput: 0: 13036.6. Samples: 95276287. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:37:28,368][155126] Avg episode reward: [(0, '1836.532')] [2023-03-07 09:37:28,372][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000093050_95283200.pth... 
[2023-03-07 09:37:28,402][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000089998_92157952.pth [2023-03-07 09:37:29,107][155452] Updated weights for policy 0, policy_version 93060 (0.0006) [2023-03-07 09:37:29,891][155452] Updated weights for policy 0, policy_version 93070 (0.0006) [2023-03-07 09:37:30,662][155452] Updated weights for policy 0, policy_version 93080 (0.0006) [2023-03-07 09:37:31,456][155452] Updated weights for policy 0, policy_version 93090 (0.0005) [2023-03-07 09:37:32,243][155452] Updated weights for policy 0, policy_version 93100 (0.0006) [2023-03-07 09:37:33,040][155452] Updated weights for policy 0, policy_version 93110 (0.0006) [2023-03-07 09:37:33,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13023.9). Total num frames: 95348736. Throughput: 0: 13040.0. Samples: 95315422. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:37:33,367][155126] Avg episode reward: [(0, '1953.605')] [2023-03-07 09:37:33,809][155452] Updated weights for policy 0, policy_version 93120 (0.0007) [2023-03-07 09:37:34,617][155452] Updated weights for policy 0, policy_version 93130 (0.0006) [2023-03-07 09:37:35,389][155452] Updated weights for policy 0, policy_version 93140 (0.0006) [2023-03-07 09:37:36,161][155452] Updated weights for policy 0, policy_version 93150 (0.0006) [2023-03-07 09:37:36,945][155452] Updated weights for policy 0, policy_version 93160 (0.0006) [2023-03-07 09:37:37,722][155452] Updated weights for policy 0, policy_version 93170 (0.0006) [2023-03-07 09:37:38,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13056.0, 300 sec: 13023.9). Total num frames: 95414272. Throughput: 0: 13045.2. Samples: 95393818. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:37:38,367][155126] Avg episode reward: [(0, '1930.324')] [2023-03-07 09:37:38,510][155452] Updated weights for policy 0, policy_version 93180 (0.0006) [2023-03-07 09:37:39,294][155452] Updated weights for policy 0, policy_version 93190 (0.0006) [2023-03-07 09:37:40,078][155452] Updated weights for policy 0, policy_version 93200 (0.0006) [2023-03-07 09:37:40,853][155452] Updated weights for policy 0, policy_version 93210 (0.0006) [2023-03-07 09:37:41,640][155452] Updated weights for policy 0, policy_version 93220 (0.0006) [2023-03-07 09:37:42,426][155452] Updated weights for policy 0, policy_version 93230 (0.0006) [2023-03-07 09:37:43,228][155452] Updated weights for policy 0, policy_version 93240 (0.0006) [2023-03-07 09:37:43,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13023.9). Total num frames: 95478784. Throughput: 0: 13047.8. Samples: 95472055. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:37:43,367][155126] Avg episode reward: [(0, '1988.027')] [2023-03-07 09:37:43,998][155452] Updated weights for policy 0, policy_version 93250 (0.0005) [2023-03-07 09:37:44,787][155452] Updated weights for policy 0, policy_version 93260 (0.0006) [2023-03-07 09:37:45,584][155452] Updated weights for policy 0, policy_version 93270 (0.0005) [2023-03-07 09:37:46,364][155452] Updated weights for policy 0, policy_version 93280 (0.0005) [2023-03-07 09:37:47,163][155452] Updated weights for policy 0, policy_version 93290 (0.0006) [2023-03-07 09:37:47,942][155452] Updated weights for policy 0, policy_version 93300 (0.0006) [2023-03-07 09:37:48,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13027.4). Total num frames: 95544320. Throughput: 0: 13047.6. Samples: 95511108. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:37:48,367][155126] Avg episode reward: [(0, '2040.486')] [2023-03-07 09:37:48,741][155452] Updated weights for policy 0, policy_version 93310 (0.0006) [2023-03-07 09:37:49,539][155452] Updated weights for policy 0, policy_version 93320 (0.0006) [2023-03-07 09:37:50,318][155452] Updated weights for policy 0, policy_version 93330 (0.0006) [2023-03-07 09:37:51,091][155452] Updated weights for policy 0, policy_version 93340 (0.0006) [2023-03-07 09:37:51,901][155452] Updated weights for policy 0, policy_version 93350 (0.0005) [2023-03-07 09:37:52,679][155452] Updated weights for policy 0, policy_version 93360 (0.0006) [2023-03-07 09:37:53,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13023.9). Total num frames: 95608832. Throughput: 0: 13049.5. Samples: 95589182. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:37:53,367][155126] Avg episode reward: [(0, '2098.306')] [2023-03-07 09:37:53,448][155452] Updated weights for policy 0, policy_version 93370 (0.0006) [2023-03-07 09:37:54,245][155452] Updated weights for policy 0, policy_version 93380 (0.0008) [2023-03-07 09:37:55,034][155452] Updated weights for policy 0, policy_version 93390 (0.0006) [2023-03-07 09:37:55,807][155452] Updated weights for policy 0, policy_version 93400 (0.0006) [2023-03-07 09:37:56,592][155452] Updated weights for policy 0, policy_version 93410 (0.0006) [2023-03-07 09:37:57,373][155452] Updated weights for policy 0, policy_version 93420 (0.0007) [2023-03-07 09:37:58,170][155452] Updated weights for policy 0, policy_version 93430 (0.0006) [2023-03-07 09:37:58,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13027.4). Total num frames: 95674368. Throughput: 0: 13039.8. Samples: 95667353. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:37:58,367][155126] Avg episode reward: [(0, '2075.507')] [2023-03-07 09:37:58,944][155452] Updated weights for policy 0, policy_version 93440 (0.0006) [2023-03-07 09:37:59,745][155452] Updated weights for policy 0, policy_version 93450 (0.0006) [2023-03-07 09:38:00,514][155452] Updated weights for policy 0, policy_version 93460 (0.0006) [2023-03-07 09:38:01,307][155452] Updated weights for policy 0, policy_version 93470 (0.0007) [2023-03-07 09:38:02,110][155452] Updated weights for policy 0, policy_version 93480 (0.0006) [2023-03-07 09:38:02,911][155452] Updated weights for policy 0, policy_version 93490 (0.0006) [2023-03-07 09:38:03,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13038.9, 300 sec: 13023.9). Total num frames: 95738880. Throughput: 0: 13044.8. Samples: 95706522. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:38:03,367][155126] Avg episode reward: [(0, '2019.539')] [2023-03-07 09:38:03,683][155452] Updated weights for policy 0, policy_version 93500 (0.0006) [2023-03-07 09:38:04,470][155452] Updated weights for policy 0, policy_version 93510 (0.0006) [2023-03-07 09:38:05,254][155452] Updated weights for policy 0, policy_version 93520 (0.0007) [2023-03-07 09:38:06,020][155452] Updated weights for policy 0, policy_version 93530 (0.0006) [2023-03-07 09:38:06,819][155452] Updated weights for policy 0, policy_version 93540 (0.0007) [2023-03-07 09:38:07,606][155452] Updated weights for policy 0, policy_version 93550 (0.0006) [2023-03-07 09:38:08,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13038.9, 300 sec: 13023.9). Total num frames: 95804416. Throughput: 0: 13028.9. Samples: 95784520. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:38:08,367][155126] Avg episode reward: [(0, '1973.585')] [2023-03-07 09:38:08,395][155452] Updated weights for policy 0, policy_version 93560 (0.0007) [2023-03-07 09:38:09,185][155452] Updated weights for policy 0, policy_version 93570 (0.0006) [2023-03-07 09:38:09,959][155452] Updated weights for policy 0, policy_version 93580 (0.0006) [2023-03-07 09:38:10,746][155452] Updated weights for policy 0, policy_version 93590 (0.0006) [2023-03-07 09:38:11,540][155452] Updated weights for policy 0, policy_version 93600 (0.0007) [2023-03-07 09:38:12,325][155452] Updated weights for policy 0, policy_version 93610 (0.0005) [2023-03-07 09:38:13,121][155452] Updated weights for policy 0, policy_version 93620 (0.0007) [2023-03-07 09:38:13,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13021.9, 300 sec: 13023.9). Total num frames: 95868928. Throughput: 0: 13030.5. Samples: 95862661. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:38:13,367][155126] Avg episode reward: [(0, '2070.165')] [2023-03-07 09:38:13,908][155452] Updated weights for policy 0, policy_version 93630 (0.0006) [2023-03-07 09:38:14,681][155452] Updated weights for policy 0, policy_version 93640 (0.0007) [2023-03-07 09:38:15,476][155452] Updated weights for policy 0, policy_version 93650 (0.0006) [2023-03-07 09:38:16,265][155452] Updated weights for policy 0, policy_version 93660 (0.0006) [2023-03-07 09:38:17,065][155452] Updated weights for policy 0, policy_version 93670 (0.0006) [2023-03-07 09:38:17,861][155452] Updated weights for policy 0, policy_version 93680 (0.0006) [2023-03-07 09:38:18,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13023.9). Total num frames: 95934464. Throughput: 0: 13025.8. Samples: 95901585. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:38:18,367][155126] Avg episode reward: [(0, '2231.905')] [2023-03-07 09:38:18,657][155452] Updated weights for policy 0, policy_version 93690 (0.0006) [2023-03-07 09:38:19,454][155452] Updated weights for policy 0, policy_version 93700 (0.0006) [2023-03-07 09:38:20,230][155452] Updated weights for policy 0, policy_version 93710 (0.0006) [2023-03-07 09:38:21,029][155452] Updated weights for policy 0, policy_version 93720 (0.0006) [2023-03-07 09:38:21,794][155452] Updated weights for policy 0, policy_version 93730 (0.0005) [2023-03-07 09:38:22,596][155452] Updated weights for policy 0, policy_version 93740 (0.0007) [2023-03-07 09:38:23,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13004.8, 300 sec: 13023.9). Total num frames: 95998976. Throughput: 0: 13015.3. Samples: 95979505. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:38:23,367][155126] Avg episode reward: [(0, '1833.792')] [2023-03-07 09:38:23,370][155452] Updated weights for policy 0, policy_version 93750 (0.0006) [2023-03-07 09:38:24,162][155452] Updated weights for policy 0, policy_version 93760 (0.0006) [2023-03-07 09:38:24,943][155452] Updated weights for policy 0, policy_version 93770 (0.0006) [2023-03-07 09:38:25,741][155452] Updated weights for policy 0, policy_version 93780 (0.0006) [2023-03-07 09:38:26,542][155452] Updated weights for policy 0, policy_version 93790 (0.0006) [2023-03-07 09:38:27,328][155452] Updated weights for policy 0, policy_version 93800 (0.0006) [2023-03-07 09:38:28,117][155452] Updated weights for policy 0, policy_version 93810 (0.0006) [2023-03-07 09:38:28,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13023.9). Total num frames: 96064512. Throughput: 0: 13005.7. Samples: 96057313. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:38:28,367][155126] Avg episode reward: [(0, '2050.146')] [2023-03-07 09:38:28,923][155452] Updated weights for policy 0, policy_version 93820 (0.0006) [2023-03-07 09:38:29,691][155452] Updated weights for policy 0, policy_version 93830 (0.0006) [2023-03-07 09:38:30,476][155452] Updated weights for policy 0, policy_version 93840 (0.0006) [2023-03-07 09:38:31,252][155452] Updated weights for policy 0, policy_version 93850 (0.0007) [2023-03-07 09:38:32,027][155452] Updated weights for policy 0, policy_version 93860 (0.0006) [2023-03-07 09:38:32,824][155452] Updated weights for policy 0, policy_version 93870 (0.0007) [2023-03-07 09:38:33,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13004.8, 300 sec: 13020.4). Total num frames: 96129024. Throughput: 0: 13007.0. Samples: 96096422. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:38:33,367][155126] Avg episode reward: [(0, '2166.074')] [2023-03-07 09:38:33,607][155452] Updated weights for policy 0, policy_version 93880 (0.0006) [2023-03-07 09:38:34,397][155452] Updated weights for policy 0, policy_version 93890 (0.0006) [2023-03-07 09:38:35,178][155452] Updated weights for policy 0, policy_version 93900 (0.0006) [2023-03-07 09:38:35,974][155452] Updated weights for policy 0, policy_version 93910 (0.0007) [2023-03-07 09:38:36,773][155452] Updated weights for policy 0, policy_version 93920 (0.0006) [2023-03-07 09:38:37,542][155452] Updated weights for policy 0, policy_version 93930 (0.0007) [2023-03-07 09:38:38,334][155452] Updated weights for policy 0, policy_version 93940 (0.0005) [2023-03-07 09:38:38,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13020.4). Total num frames: 96194560. Throughput: 0: 13005.8. Samples: 96174442. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:38:38,367][155126] Avg episode reward: [(0, '2141.246')] [2023-03-07 09:38:39,107][155452] Updated weights for policy 0, policy_version 93950 (0.0006) [2023-03-07 09:38:39,912][155452] Updated weights for policy 0, policy_version 93960 (0.0006) [2023-03-07 09:38:40,681][155452] Updated weights for policy 0, policy_version 93970 (0.0007) [2023-03-07 09:38:41,474][155452] Updated weights for policy 0, policy_version 93980 (0.0006) [2023-03-07 09:38:42,283][155452] Updated weights for policy 0, policy_version 93990 (0.0006) [2023-03-07 09:38:43,049][155452] Updated weights for policy 0, policy_version 94000 (0.0006) [2023-03-07 09:38:43,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13020.4). Total num frames: 96259072. Throughput: 0: 13005.1. Samples: 96252584. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:38:43,367][155126] Avg episode reward: [(0, '2089.392')] [2023-03-07 09:38:43,835][155452] Updated weights for policy 0, policy_version 94010 (0.0006) [2023-03-07 09:38:44,633][155452] Updated weights for policy 0, policy_version 94020 (0.0007) [2023-03-07 09:38:45,406][155452] Updated weights for policy 0, policy_version 94030 (0.0006) [2023-03-07 09:38:46,194][155452] Updated weights for policy 0, policy_version 94040 (0.0006) [2023-03-07 09:38:46,971][155452] Updated weights for policy 0, policy_version 94050 (0.0006) [2023-03-07 09:38:47,758][155452] Updated weights for policy 0, policy_version 94060 (0.0006) [2023-03-07 09:38:48,367][155126] Fps is (10 sec: 13107.4, 60 sec: 13021.9, 300 sec: 13027.4). Total num frames: 96325632. Throughput: 0: 13006.4. Samples: 96291811. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:38:48,367][155126] Avg episode reward: [(0, '2138.554')] [2023-03-07 09:38:48,520][155452] Updated weights for policy 0, policy_version 94070 (0.0007) [2023-03-07 09:38:49,326][155452] Updated weights for policy 0, policy_version 94080 (0.0007) [2023-03-07 09:38:50,106][155452] Updated weights for policy 0, policy_version 94090 (0.0006) [2023-03-07 09:38:50,872][155452] Updated weights for policy 0, policy_version 94100 (0.0007) [2023-03-07 09:38:51,654][155452] Updated weights for policy 0, policy_version 94110 (0.0005) [2023-03-07 09:38:52,468][155452] Updated weights for policy 0, policy_version 94120 (0.0006) [2023-03-07 09:38:53,230][155452] Updated weights for policy 0, policy_version 94130 (0.0006) [2023-03-07 09:38:53,367][155126] Fps is (10 sec: 13107.2, 60 sec: 13021.9, 300 sec: 13027.4). Total num frames: 96390144. Throughput: 0: 13017.3. Samples: 96370300. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:38:53,367][155126] Avg episode reward: [(0, '2267.897')] [2023-03-07 09:38:54,015][155452] Updated weights for policy 0, policy_version 94140 (0.0006) [2023-03-07 09:38:54,806][155452] Updated weights for policy 0, policy_version 94150 (0.0006) [2023-03-07 09:38:55,606][155452] Updated weights for policy 0, policy_version 94160 (0.0006) [2023-03-07 09:38:56,392][155452] Updated weights for policy 0, policy_version 94170 (0.0007) [2023-03-07 09:38:57,178][155452] Updated weights for policy 0, policy_version 94180 (0.0006) [2023-03-07 09:38:57,972][155452] Updated weights for policy 0, policy_version 94190 (0.0007) [2023-03-07 09:38:58,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13021.9, 300 sec: 13027.4). Total num frames: 96455680. Throughput: 0: 13013.4. Samples: 96448265. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:38:58,368][155126] Avg episode reward: [(0, '2194.785')] [2023-03-07 09:38:58,745][155452] Updated weights for policy 0, policy_version 94200 (0.0006) [2023-03-07 09:38:59,518][155452] Updated weights for policy 0, policy_version 94210 (0.0007) [2023-03-07 09:39:00,338][155452] Updated weights for policy 0, policy_version 94220 (0.0007) [2023-03-07 09:39:01,107][155452] Updated weights for policy 0, policy_version 94230 (0.0006) [2023-03-07 09:39:01,906][155452] Updated weights for policy 0, policy_version 94240 (0.0007) [2023-03-07 09:39:02,677][155452] Updated weights for policy 0, policy_version 94250 (0.0006) [2023-03-07 09:39:03,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13023.9). Total num frames: 96520192. Throughput: 0: 13019.4. Samples: 96487457. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:39:03,367][155126] Avg episode reward: [(0, '2263.513')] [2023-03-07 09:39:03,474][155452] Updated weights for policy 0, policy_version 94260 (0.0006) [2023-03-07 09:39:04,261][155452] Updated weights for policy 0, policy_version 94270 (0.0006) [2023-03-07 09:39:05,042][155452] Updated weights for policy 0, policy_version 94280 (0.0006) [2023-03-07 09:39:05,842][155452] Updated weights for policy 0, policy_version 94290 (0.0006) [2023-03-07 09:39:06,633][155452] Updated weights for policy 0, policy_version 94300 (0.0006) [2023-03-07 09:39:07,426][155452] Updated weights for policy 0, policy_version 94310 (0.0005) [2023-03-07 09:39:08,202][155452] Updated weights for policy 0, policy_version 94320 (0.0006) [2023-03-07 09:39:08,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13023.9). Total num frames: 96585728. Throughput: 0: 13017.1. Samples: 96565275. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:39:08,367][155126] Avg episode reward: [(0, '2157.473')] [2023-03-07 09:39:08,996][155452] Updated weights for policy 0, policy_version 94330 (0.0006) [2023-03-07 09:39:09,785][155452] Updated weights for policy 0, policy_version 94340 (0.0007) [2023-03-07 09:39:10,558][155452] Updated weights for policy 0, policy_version 94350 (0.0007) [2023-03-07 09:39:11,351][155452] Updated weights for policy 0, policy_version 94360 (0.0006) [2023-03-07 09:39:12,154][155452] Updated weights for policy 0, policy_version 94370 (0.0006) [2023-03-07 09:39:12,945][155452] Updated weights for policy 0, policy_version 94380 (0.0006) [2023-03-07 09:39:13,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13023.9). Total num frames: 96650240. Throughput: 0: 13020.6. Samples: 96643241. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:39:13,367][155126] Avg episode reward: [(0, '1976.628')] [2023-03-07 09:39:13,731][155452] Updated weights for policy 0, policy_version 94390 (0.0007) [2023-03-07 09:39:14,510][155452] Updated weights for policy 0, policy_version 94400 (0.0006) [2023-03-07 09:39:15,301][155452] Updated weights for policy 0, policy_version 94410 (0.0006) [2023-03-07 09:39:16,091][155452] Updated weights for policy 0, policy_version 94420 (0.0006) [2023-03-07 09:39:16,860][155452] Updated weights for policy 0, policy_version 94430 (0.0007) [2023-03-07 09:39:17,642][155452] Updated weights for policy 0, policy_version 94440 (0.0006) [2023-03-07 09:39:18,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13023.9). Total num frames: 96715776. Throughput: 0: 13020.9. Samples: 96682363. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:39:18,367][155126] Avg episode reward: [(0, '2128.601')] [2023-03-07 09:39:18,429][155452] Updated weights for policy 0, policy_version 94450 (0.0006) [2023-03-07 09:39:19,214][155452] Updated weights for policy 0, policy_version 94460 (0.0006) [2023-03-07 09:39:19,993][155452] Updated weights for policy 0, policy_version 94470 (0.0006) [2023-03-07 09:39:20,786][155452] Updated weights for policy 0, policy_version 94480 (0.0006) [2023-03-07 09:39:21,588][155452] Updated weights for policy 0, policy_version 94490 (0.0006) [2023-03-07 09:39:22,372][155452] Updated weights for policy 0, policy_version 94500 (0.0006) [2023-03-07 09:39:23,139][155452] Updated weights for policy 0, policy_version 94510 (0.0006) [2023-03-07 09:39:23,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 96780288. Throughput: 0: 13024.7. Samples: 96760553. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:39:23,367][155126] Avg episode reward: [(0, '2111.705')] [2023-03-07 09:39:23,938][155452] Updated weights for policy 0, policy_version 94520 (0.0006) [2023-03-07 09:39:24,733][155452] Updated weights for policy 0, policy_version 94530 (0.0006) [2023-03-07 09:39:25,506][155452] Updated weights for policy 0, policy_version 94540 (0.0006) [2023-03-07 09:39:26,293][155452] Updated weights for policy 0, policy_version 94550 (0.0005) [2023-03-07 09:39:27,071][155452] Updated weights for policy 0, policy_version 94560 (0.0006) [2023-03-07 09:39:27,852][155452] Updated weights for policy 0, policy_version 94570 (0.0007) [2023-03-07 09:39:28,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.8, 300 sec: 13020.4). Total num frames: 96845824. Throughput: 0: 13030.0. Samples: 96838934. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:39:28,367][155126] Avg episode reward: [(0, '2095.971')] [2023-03-07 09:39:28,372][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000094576_96845824.pth... [2023-03-07 09:39:28,403][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000091525_93721600.pth [2023-03-07 09:39:28,635][155452] Updated weights for policy 0, policy_version 94580 (0.0006) [2023-03-07 09:39:29,425][155452] Updated weights for policy 0, policy_version 94590 (0.0007) [2023-03-07 09:39:30,220][155452] Updated weights for policy 0, policy_version 94600 (0.0006) [2023-03-07 09:39:30,984][155452] Updated weights for policy 0, policy_version 94610 (0.0006) [2023-03-07 09:39:31,789][155452] Updated weights for policy 0, policy_version 94620 (0.0006) [2023-03-07 09:39:32,552][155452] Updated weights for policy 0, policy_version 94630 (0.0006) [2023-03-07 09:39:33,340][155452] Updated weights for policy 0, policy_version 94640 (0.0006) [2023-03-07 09:39:33,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13039.0, 300 sec: 13023.9). Total num frames: 96911360. Throughput: 0: 13023.6. Samples: 96877872. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:39:33,367][155126] Avg episode reward: [(0, '2320.543')] [2023-03-07 09:39:34,125][155452] Updated weights for policy 0, policy_version 94650 (0.0006) [2023-03-07 09:39:34,898][155452] Updated weights for policy 0, policy_version 94660 (0.0006) [2023-03-07 09:39:35,694][155452] Updated weights for policy 0, policy_version 94670 (0.0006) [2023-03-07 09:39:36,496][155452] Updated weights for policy 0, policy_version 94680 (0.0006) [2023-03-07 09:39:37,277][155452] Updated weights for policy 0, policy_version 94690 (0.0007) [2023-03-07 09:39:38,068][155452] Updated weights for policy 0, policy_version 94700 (0.0006) [2023-03-07 09:39:38,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 96975872. Throughput: 0: 13020.7. Samples: 96956230. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:39:38,378][155126] Avg episode reward: [(0, '2068.085')] [2023-03-07 09:39:38,856][155452] Updated weights for policy 0, policy_version 94710 (0.0006) [2023-03-07 09:39:39,629][155452] Updated weights for policy 0, policy_version 94720 (0.0006) [2023-03-07 09:39:40,413][155452] Updated weights for policy 0, policy_version 94730 (0.0006) [2023-03-07 09:39:41,196][155452] Updated weights for policy 0, policy_version 94740 (0.0006) [2023-03-07 09:39:42,004][155452] Updated weights for policy 0, policy_version 94750 (0.0006) [2023-03-07 09:39:42,762][155452] Updated weights for policy 0, policy_version 94760 (0.0006) [2023-03-07 09:39:43,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13038.9, 300 sec: 13023.9). Total num frames: 97041408. Throughput: 0: 13030.0. Samples: 97034613. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:39:43,368][155126] Avg episode reward: [(0, '1862.759')] [2023-03-07 09:39:43,558][155452] Updated weights for policy 0, policy_version 94770 (0.0006) [2023-03-07 09:39:44,332][155452] Updated weights for policy 0, policy_version 94780 (0.0006) [2023-03-07 09:39:45,131][155452] Updated weights for policy 0, policy_version 94790 (0.0007) [2023-03-07 09:39:45,925][155452] Updated weights for policy 0, policy_version 94800 (0.0005) [2023-03-07 09:39:46,701][155452] Updated weights for policy 0, policy_version 94810 (0.0006) [2023-03-07 09:39:47,492][155452] Updated weights for policy 0, policy_version 94820 (0.0007) [2023-03-07 09:39:48,271][155452] Updated weights for policy 0, policy_version 94830 (0.0007) [2023-03-07 09:39:48,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13021.9, 300 sec: 13023.9). Total num frames: 97106944. Throughput: 0: 13025.1. Samples: 97073588. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:39:48,367][155126] Avg episode reward: [(0, '2104.655')] [2023-03-07 09:39:49,052][155452] Updated weights for policy 0, policy_version 94840 (0.0006) [2023-03-07 09:39:49,826][155452] Updated weights for policy 0, policy_version 94850 (0.0006) [2023-03-07 09:39:50,627][155452] Updated weights for policy 0, policy_version 94860 (0.0006) [2023-03-07 09:39:51,425][155452] Updated weights for policy 0, policy_version 94870 (0.0007) [2023-03-07 09:39:52,200][155452] Updated weights for policy 0, policy_version 94880 (0.0006) [2023-03-07 09:39:53,000][155452] Updated weights for policy 0, policy_version 94890 (0.0006) [2023-03-07 09:39:53,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 97171456. Throughput: 0: 13031.5. Samples: 97151692. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:39:53,367][155126] Avg episode reward: [(0, '2053.911')] [2023-03-07 09:39:53,790][155452] Updated weights for policy 0, policy_version 94900 (0.0006) [2023-03-07 09:39:54,568][155452] Updated weights for policy 0, policy_version 94910 (0.0006) [2023-03-07 09:39:55,361][155452] Updated weights for policy 0, policy_version 94920 (0.0006) [2023-03-07 09:39:56,145][155452] Updated weights for policy 0, policy_version 94930 (0.0006) [2023-03-07 09:39:56,923][155452] Updated weights for policy 0, policy_version 94940 (0.0006) [2023-03-07 09:39:57,716][155452] Updated weights for policy 0, policy_version 94950 (0.0006) [2023-03-07 09:39:58,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13021.9, 300 sec: 13023.9). Total num frames: 97236992. Throughput: 0: 13036.6. Samples: 97229888. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:39:58,378][155126] Avg episode reward: [(0, '2135.650')] [2023-03-07 09:39:58,493][155452] Updated weights for policy 0, policy_version 94960 (0.0006) [2023-03-07 09:39:59,296][155452] Updated weights for policy 0, policy_version 94970 (0.0006) [2023-03-07 09:40:00,094][155452] Updated weights for policy 0, policy_version 94980 (0.0006) [2023-03-07 09:40:00,881][155452] Updated weights for policy 0, policy_version 94990 (0.0006) [2023-03-07 09:40:01,673][155452] Updated weights for policy 0, policy_version 95000 (0.0006) [2023-03-07 09:40:02,451][155452] Updated weights for policy 0, policy_version 95010 (0.0006) [2023-03-07 09:40:03,244][155452] Updated weights for policy 0, policy_version 95020 (0.0008) [2023-03-07 09:40:03,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 97301504. Throughput: 0: 13031.0. Samples: 97268757. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:40:03,367][155126] Avg episode reward: [(0, '2060.449')] [2023-03-07 09:40:04,045][155452] Updated weights for policy 0, policy_version 95030 (0.0007) [2023-03-07 09:40:04,840][155452] Updated weights for policy 0, policy_version 95040 (0.0007) [2023-03-07 09:40:05,619][155452] Updated weights for policy 0, policy_version 95050 (0.0006) [2023-03-07 09:40:06,401][155452] Updated weights for policy 0, policy_version 95060 (0.0006) [2023-03-07 09:40:07,173][155452] Updated weights for policy 0, policy_version 95070 (0.0006) [2023-03-07 09:40:07,957][155452] Updated weights for policy 0, policy_version 95080 (0.0007) [2023-03-07 09:40:08,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 97367040. Throughput: 0: 13023.0. Samples: 97346590. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:40:08,367][155126] Avg episode reward: [(0, '2155.077')] [2023-03-07 09:40:08,749][155452] Updated weights for policy 0, policy_version 95090 (0.0005) [2023-03-07 09:40:09,527][155452] Updated weights for policy 0, policy_version 95100 (0.0006) [2023-03-07 09:40:10,314][155452] Updated weights for policy 0, policy_version 95110 (0.0006) [2023-03-07 09:40:11,102][155452] Updated weights for policy 0, policy_version 95120 (0.0006) [2023-03-07 09:40:11,910][155452] Updated weights for policy 0, policy_version 95130 (0.0006) [2023-03-07 09:40:12,681][155452] Updated weights for policy 0, policy_version 95140 (0.0005) [2023-03-07 09:40:13,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 97431552. Throughput: 0: 13017.5. Samples: 97424720. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:40:13,367][155126] Avg episode reward: [(0, '2287.325')] [2023-03-07 09:40:13,477][155452] Updated weights for policy 0, policy_version 95150 (0.0006) [2023-03-07 09:40:14,276][155452] Updated weights for policy 0, policy_version 95160 (0.0006) [2023-03-07 09:40:15,065][155452] Updated weights for policy 0, policy_version 95170 (0.0006) [2023-03-07 09:40:15,860][155452] Updated weights for policy 0, policy_version 95180 (0.0006) [2023-03-07 09:40:16,632][155452] Updated weights for policy 0, policy_version 95190 (0.0006) [2023-03-07 09:40:17,420][155452] Updated weights for policy 0, policy_version 95200 (0.0006) [2023-03-07 09:40:18,200][155452] Updated weights for policy 0, policy_version 95210 (0.0007) [2023-03-07 09:40:18,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 97497088. Throughput: 0: 13013.6. Samples: 97463485. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:40:18,367][155126] Avg episode reward: [(0, '2208.024')] [2023-03-07 09:40:18,986][155452] Updated weights for policy 0, policy_version 95220 (0.0006) [2023-03-07 09:40:19,798][155452] Updated weights for policy 0, policy_version 95230 (0.0006) [2023-03-07 09:40:20,581][155452] Updated weights for policy 0, policy_version 95240 (0.0007) [2023-03-07 09:40:21,375][155452] Updated weights for policy 0, policy_version 95250 (0.0006) [2023-03-07 09:40:22,159][155452] Updated weights for policy 0, policy_version 95260 (0.0005) [2023-03-07 09:40:22,940][155452] Updated weights for policy 0, policy_version 95270 (0.0006) [2023-03-07 09:40:23,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 97561600. Throughput: 0: 13005.5. Samples: 97541476. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:40:23,367][155126] Avg episode reward: [(0, '2233.344')] [2023-03-07 09:40:23,739][155452] Updated weights for policy 0, policy_version 95280 (0.0006) [2023-03-07 09:40:24,521][155452] Updated weights for policy 0, policy_version 95290 (0.0006) [2023-03-07 09:40:25,302][155452] Updated weights for policy 0, policy_version 95300 (0.0006) [2023-03-07 09:40:26,087][155452] Updated weights for policy 0, policy_version 95310 (0.0006) [2023-03-07 09:40:26,860][155452] Updated weights for policy 0, policy_version 95320 (0.0006) [2023-03-07 09:40:27,667][155452] Updated weights for policy 0, policy_version 95330 (0.0006) [2023-03-07 09:40:28,367][155126] Fps is (10 sec: 12902.4, 60 sec: 13004.8, 300 sec: 13016.9). Total num frames: 97626112. Throughput: 0: 12996.7. Samples: 97619464. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:40:28,367][155126] Avg episode reward: [(0, '2210.028')] [2023-03-07 09:40:28,471][155452] Updated weights for policy 0, policy_version 95340 (0.0006) [2023-03-07 09:40:29,256][155452] Updated weights for policy 0, policy_version 95350 (0.0006) [2023-03-07 09:40:30,053][155452] Updated weights for policy 0, policy_version 95360 (0.0006) [2023-03-07 09:40:30,842][155452] Updated weights for policy 0, policy_version 95370 (0.0006) [2023-03-07 09:40:31,638][155452] Updated weights for policy 0, policy_version 95380 (0.0006) [2023-03-07 09:40:32,436][155452] Updated weights for policy 0, policy_version 95390 (0.0007) [2023-03-07 09:40:33,221][155452] Updated weights for policy 0, policy_version 95400 (0.0006) [2023-03-07 09:40:33,367][155126] Fps is (10 sec: 12902.5, 60 sec: 12987.7, 300 sec: 13017.0). Total num frames: 97690624. Throughput: 0: 12990.3. Samples: 97658151. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:40:33,367][155126] Avg episode reward: [(0, '2194.439')] [2023-03-07 09:40:34,005][155452] Updated weights for policy 0, policy_version 95410 (0.0006) [2023-03-07 09:40:34,788][155452] Updated weights for policy 0, policy_version 95420 (0.0006) [2023-03-07 09:40:35,565][155452] Updated weights for policy 0, policy_version 95430 (0.0007) [2023-03-07 09:40:36,358][155452] Updated weights for policy 0, policy_version 95440 (0.0006) [2023-03-07 09:40:37,155][155452] Updated weights for policy 0, policy_version 95450 (0.0006) [2023-03-07 09:40:37,929][155452] Updated weights for policy 0, policy_version 95460 (0.0006) [2023-03-07 09:40:38,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13016.9). Total num frames: 97756160. Throughput: 0: 12988.5. Samples: 97736174. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:40:38,367][155126] Avg episode reward: [(0, '2157.708')] [2023-03-07 09:40:38,721][155452] Updated weights for policy 0, policy_version 95470 (0.0006) [2023-03-07 09:40:39,504][155452] Updated weights for policy 0, policy_version 95480 (0.0006) [2023-03-07 09:40:40,298][155452] Updated weights for policy 0, policy_version 95490 (0.0006) [2023-03-07 09:40:41,093][155452] Updated weights for policy 0, policy_version 95500 (0.0007) [2023-03-07 09:40:41,874][155452] Updated weights for policy 0, policy_version 95510 (0.0006) [2023-03-07 09:40:42,649][155452] Updated weights for policy 0, policy_version 95520 (0.0006) [2023-03-07 09:40:43,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13004.8, 300 sec: 13020.4). Total num frames: 97821696. Throughput: 0: 12987.1. Samples: 97814305. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:40:43,367][155126] Avg episode reward: [(0, '2247.999')] [2023-03-07 09:40:43,430][155452] Updated weights for policy 0, policy_version 95530 (0.0006) [2023-03-07 09:40:44,218][155452] Updated weights for policy 0, policy_version 95540 (0.0006) [2023-03-07 09:40:45,002][155452] Updated weights for policy 0, policy_version 95550 (0.0006) [2023-03-07 09:40:45,797][155452] Updated weights for policy 0, policy_version 95560 (0.0006) [2023-03-07 09:40:46,556][155452] Updated weights for policy 0, policy_version 95570 (0.0006) [2023-03-07 09:40:47,357][155452] Updated weights for policy 0, policy_version 95580 (0.0007) [2023-03-07 09:40:48,158][155452] Updated weights for policy 0, policy_version 95590 (0.0006) [2023-03-07 09:40:48,367][155126] Fps is (10 sec: 13004.6, 60 sec: 12987.7, 300 sec: 13020.4). Total num frames: 97886208. Throughput: 0: 12994.6. Samples: 97853514. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:40:48,368][155126] Avg episode reward: [(0, '2164.568')] [2023-03-07 09:40:48,944][155452] Updated weights for policy 0, policy_version 95600 (0.0006) [2023-03-07 09:40:49,732][155452] Updated weights for policy 0, policy_version 95610 (0.0006) [2023-03-07 09:40:50,525][155452] Updated weights for policy 0, policy_version 95620 (0.0007) [2023-03-07 09:40:51,298][155452] Updated weights for policy 0, policy_version 95630 (0.0005) [2023-03-07 09:40:52,101][155452] Updated weights for policy 0, policy_version 95640 (0.0006) [2023-03-07 09:40:52,881][155452] Updated weights for policy 0, policy_version 95650 (0.0006) [2023-03-07 09:40:53,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13004.8, 300 sec: 13020.4). Total num frames: 97951744. Throughput: 0: 12997.8. Samples: 97931492. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:40:53,367][155126] Avg episode reward: [(0, '2190.112')] [2023-03-07 09:40:53,668][155452] Updated weights for policy 0, policy_version 95660 (0.0007) [2023-03-07 09:40:54,452][155452] Updated weights for policy 0, policy_version 95670 (0.0006) [2023-03-07 09:40:55,221][155452] Updated weights for policy 0, policy_version 95680 (0.0006) [2023-03-07 09:40:56,020][155452] Updated weights for policy 0, policy_version 95690 (0.0006) [2023-03-07 09:40:56,792][155452] Updated weights for policy 0, policy_version 95700 (0.0006) [2023-03-07 09:40:57,574][155452] Updated weights for policy 0, policy_version 95710 (0.0006) [2023-03-07 09:40:58,367][155126] Fps is (10 sec: 13004.9, 60 sec: 12987.7, 300 sec: 13017.0). Total num frames: 98016256. Throughput: 0: 13000.6. Samples: 98009747. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:40:58,378][155126] Avg episode reward: [(0, '2234.876')] [2023-03-07 09:40:58,382][155452] Updated weights for policy 0, policy_version 95720 (0.0006) [2023-03-07 09:40:59,165][155452] Updated weights for policy 0, policy_version 95730 (0.0006) [2023-03-07 09:40:59,971][155452] Updated weights for policy 0, policy_version 95740 (0.0006) [2023-03-07 09:41:00,760][155452] Updated weights for policy 0, policy_version 95750 (0.0006) [2023-03-07 09:41:01,548][155452] Updated weights for policy 0, policy_version 95760 (0.0007) [2023-03-07 09:41:02,337][155452] Updated weights for policy 0, policy_version 95770 (0.0007) [2023-03-07 09:41:03,095][155452] Updated weights for policy 0, policy_version 95780 (0.0006) [2023-03-07 09:41:03,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13004.8, 300 sec: 13020.4). Total num frames: 98081792. Throughput: 0: 13000.0. Samples: 98048485. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:41:03,378][155126] Avg episode reward: [(0, '2391.223')] [2023-03-07 09:41:03,885][155452] Updated weights for policy 0, policy_version 95790 (0.0006) [2023-03-07 09:41:04,672][155452] Updated weights for policy 0, policy_version 95800 (0.0006) [2023-03-07 09:41:05,458][155452] Updated weights for policy 0, policy_version 95810 (0.0007) [2023-03-07 09:41:06,259][155452] Updated weights for policy 0, policy_version 95820 (0.0006) [2023-03-07 09:41:07,042][155452] Updated weights for policy 0, policy_version 95830 (0.0006) [2023-03-07 09:41:07,830][155452] Updated weights for policy 0, policy_version 95840 (0.0006) [2023-03-07 09:41:08,367][155126] Fps is (10 sec: 13004.7, 60 sec: 12987.7, 300 sec: 13020.4). Total num frames: 98146304. Throughput: 0: 13005.2. Samples: 98126709. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:41:08,378][155126] Avg episode reward: [(0, '2183.122')] [2023-03-07 09:41:08,625][155452] Updated weights for policy 0, policy_version 95850 (0.0006) [2023-03-07 09:41:09,412][155452] Updated weights for policy 0, policy_version 95860 (0.0006) [2023-03-07 09:41:10,185][155452] Updated weights for policy 0, policy_version 95870 (0.0006) [2023-03-07 09:41:10,982][155452] Updated weights for policy 0, policy_version 95880 (0.0007) [2023-03-07 09:41:11,770][155452] Updated weights for policy 0, policy_version 95890 (0.0007) [2023-03-07 09:41:12,561][155452] Updated weights for policy 0, policy_version 95900 (0.0007) [2023-03-07 09:41:13,359][155452] Updated weights for policy 0, policy_version 95910 (0.0006) [2023-03-07 09:41:13,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13004.8, 300 sec: 13020.4). Total num frames: 98211840. Throughput: 0: 13004.9. Samples: 98204684. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:41:13,367][155126] Avg episode reward: [(0, '2373.292')] [2023-03-07 09:41:14,133][155452] Updated weights for policy 0, policy_version 95920 (0.0006) [2023-03-07 09:41:14,934][155452] Updated weights for policy 0, policy_version 95930 (0.0006) [2023-03-07 09:41:15,698][155452] Updated weights for policy 0, policy_version 95940 (0.0006) [2023-03-07 09:41:16,498][155452] Updated weights for policy 0, policy_version 95950 (0.0006) [2023-03-07 09:41:17,276][155452] Updated weights for policy 0, policy_version 95960 (0.0006) [2023-03-07 09:41:18,051][155452] Updated weights for policy 0, policy_version 95970 (0.0006) [2023-03-07 09:41:18,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13004.8, 300 sec: 13020.4). Total num frames: 98277376. Throughput: 0: 13012.9. Samples: 98243730. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:41:18,368][155126] Avg episode reward: [(0, '2250.561')] [2023-03-07 09:41:18,840][155452] Updated weights for policy 0, policy_version 95980 (0.0006) [2023-03-07 09:41:19,617][155452] Updated weights for policy 0, policy_version 95990 (0.0006) [2023-03-07 09:41:20,391][155452] Updated weights for policy 0, policy_version 96000 (0.0007) [2023-03-07 09:41:21,192][155452] Updated weights for policy 0, policy_version 96010 (0.0006) [2023-03-07 09:41:21,971][155452] Updated weights for policy 0, policy_version 96020 (0.0006) [2023-03-07 09:41:22,757][155452] Updated weights for policy 0, policy_version 96030 (0.0006) [2023-03-07 09:41:23,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13016.9). Total num frames: 98341888. Throughput: 0: 13021.3. Samples: 98322135. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:41:23,368][155126] Avg episode reward: [(0, '2368.883')] [2023-03-07 09:41:23,553][155452] Updated weights for policy 0, policy_version 96040 (0.0006) [2023-03-07 09:41:24,342][155452] Updated weights for policy 0, policy_version 96050 (0.0006) [2023-03-07 09:41:25,119][155452] Updated weights for policy 0, policy_version 96060 (0.0006) [2023-03-07 09:41:25,902][155452] Updated weights for policy 0, policy_version 96070 (0.0006) [2023-03-07 09:41:26,702][155452] Updated weights for policy 0, policy_version 96080 (0.0006) [2023-03-07 09:41:27,478][155452] Updated weights for policy 0, policy_version 96090 (0.0007) [2023-03-07 09:41:28,268][155452] Updated weights for policy 0, policy_version 96100 (0.0006) [2023-03-07 09:41:28,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13021.8, 300 sec: 13020.4). Total num frames: 98407424. Throughput: 0: 13017.4. Samples: 98400091. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:41:28,368][155126] Avg episode reward: [(0, '2330.642')] [2023-03-07 09:41:28,372][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000096101_98407424.pth... 
[2023-03-07 09:41:28,403][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000093050_95283200.pth [2023-03-07 09:41:29,069][155452] Updated weights for policy 0, policy_version 96110 (0.0006) [2023-03-07 09:41:29,847][155452] Updated weights for policy 0, policy_version 96120 (0.0006) [2023-03-07 09:41:30,649][155452] Updated weights for policy 0, policy_version 96130 (0.0006) [2023-03-07 09:41:31,454][155452] Updated weights for policy 0, policy_version 96140 (0.0008) [2023-03-07 09:41:32,229][155452] Updated weights for policy 0, policy_version 96150 (0.0006) [2023-03-07 09:41:33,009][155452] Updated weights for policy 0, policy_version 96160 (0.0006) [2023-03-07 09:41:33,367][155126] Fps is (10 sec: 13004.9, 60 sec: 13021.9, 300 sec: 13020.4). Total num frames: 98471936. Throughput: 0: 13011.5. Samples: 98439030. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:41:33,367][155126] Avg episode reward: [(0, '2242.228')] [2023-03-07 09:41:33,801][155452] Updated weights for policy 0, policy_version 96170 (0.0007) [2023-03-07 09:41:34,597][155452] Updated weights for policy 0, policy_version 96180 (0.0006) [2023-03-07 09:41:35,386][155452] Updated weights for policy 0, policy_version 96190 (0.0006) [2023-03-07 09:41:36,176][155452] Updated weights for policy 0, policy_version 96200 (0.0007) [2023-03-07 09:41:36,949][155452] Updated weights for policy 0, policy_version 96210 (0.0006) [2023-03-07 09:41:37,741][155452] Updated weights for policy 0, policy_version 96220 (0.0006) [2023-03-07 09:41:38,367][155126] Fps is (10 sec: 12902.5, 60 sec: 13004.8, 300 sec: 13016.9). Total num frames: 98536448. Throughput: 0: 13009.9. Samples: 98516937. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:41:38,368][155126] Avg episode reward: [(0, '2263.385')] [2023-03-07 09:41:38,537][155452] Updated weights for policy 0, policy_version 96230 (0.0006) [2023-03-07 09:41:39,304][155452] Updated weights for policy 0, policy_version 96240 (0.0007) [2023-03-07 09:41:40,085][155452] Updated weights for policy 0, policy_version 96250 (0.0006) [2023-03-07 09:41:40,881][155452] Updated weights for policy 0, policy_version 96260 (0.0006) [2023-03-07 09:41:41,672][155452] Updated weights for policy 0, policy_version 96270 (0.0005) [2023-03-07 09:41:42,440][155452] Updated weights for policy 0, policy_version 96280 (0.0007) [2023-03-07 09:41:43,241][155452] Updated weights for policy 0, policy_version 96290 (0.0006) [2023-03-07 09:41:43,367][155126] Fps is (10 sec: 13004.6, 60 sec: 13004.8, 300 sec: 13016.9). Total num frames: 98601984. Throughput: 0: 13010.6. Samples: 98595227. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:41:43,368][155126] Avg episode reward: [(0, '2211.431')] [2023-03-07 09:41:44,027][155452] Updated weights for policy 0, policy_version 96300 (0.0006) [2023-03-07 09:41:44,802][155452] Updated weights for policy 0, policy_version 96310 (0.0006) [2023-03-07 09:41:45,596][155452] Updated weights for policy 0, policy_version 96320 (0.0007) [2023-03-07 09:41:46,399][155452] Updated weights for policy 0, policy_version 96330 (0.0007) [2023-03-07 09:41:47,201][155452] Updated weights for policy 0, policy_version 96340 (0.0007) [2023-03-07 09:41:47,967][155452] Updated weights for policy 0, policy_version 96350 (0.0006) [2023-03-07 09:41:48,367][155126] Fps is (10 sec: 13107.4, 60 sec: 13021.9, 300 sec: 13017.0). Total num frames: 98667520. Throughput: 0: 13016.6. Samples: 98634230. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:41:48,367][155126] Avg episode reward: [(0, '2365.525')] [2023-03-07 09:41:48,760][155452] Updated weights for policy 0, policy_version 96360 (0.0007) [2023-03-07 09:41:49,549][155452] Updated weights for policy 0, policy_version 96370 (0.0006) [2023-03-07 09:41:50,330][155452] Updated weights for policy 0, policy_version 96380 (0.0006) [2023-03-07 09:41:51,114][155452] Updated weights for policy 0, policy_version 96390 (0.0006) [2023-03-07 09:41:51,903][155452] Updated weights for policy 0, policy_version 96400 (0.0006) [2023-03-07 09:41:52,686][155452] Updated weights for policy 0, policy_version 96410 (0.0006) [2023-03-07 09:41:53,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13016.9). Total num frames: 98732032. Throughput: 0: 13007.5. Samples: 98712046. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:41:53,368][155126] Avg episode reward: [(0, '2065.674')] [2023-03-07 09:41:53,497][155452] Updated weights for policy 0, policy_version 96420 (0.0007) [2023-03-07 09:41:54,265][155452] Updated weights for policy 0, policy_version 96430 (0.0005) [2023-03-07 09:41:55,053][155452] Updated weights for policy 0, policy_version 96440 (0.0005) [2023-03-07 09:41:55,837][155452] Updated weights for policy 0, policy_version 96450 (0.0006) [2023-03-07 09:41:56,632][155452] Updated weights for policy 0, policy_version 96460 (0.0006) [2023-03-07 09:41:57,418][155452] Updated weights for policy 0, policy_version 96470 (0.0006) [2023-03-07 09:41:58,223][155452] Updated weights for policy 0, policy_version 96480 (0.0006) [2023-03-07 09:41:58,367][155126] Fps is (10 sec: 12902.3, 60 sec: 13004.8, 300 sec: 13016.9). Total num frames: 98796544. Throughput: 0: 13007.6. Samples: 98790024. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:41:58,367][155126] Avg episode reward: [(0, '2253.399')] [2023-03-07 09:41:59,018][155452] Updated weights for policy 0, policy_version 96490 (0.0007) [2023-03-07 09:41:59,820][155452] Updated weights for policy 0, policy_version 96500 (0.0006) [2023-03-07 09:42:00,624][155452] Updated weights for policy 0, policy_version 96510 (0.0006) [2023-03-07 09:42:01,409][155452] Updated weights for policy 0, policy_version 96520 (0.0005) [2023-03-07 09:42:02,189][155452] Updated weights for policy 0, policy_version 96530 (0.0006) [2023-03-07 09:42:02,989][155452] Updated weights for policy 0, policy_version 96540 (0.0007) [2023-03-07 09:42:03,367][155126] Fps is (10 sec: 12902.5, 60 sec: 12987.7, 300 sec: 13013.5). Total num frames: 98861056. Throughput: 0: 12994.7. Samples: 98828492. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:42:03,368][155126] Avg episode reward: [(0, '2111.765')] [2023-03-07 09:42:03,773][155452] Updated weights for policy 0, policy_version 96550 (0.0005) [2023-03-07 09:42:04,559][155452] Updated weights for policy 0, policy_version 96560 (0.0006) [2023-03-07 09:42:05,346][155452] Updated weights for policy 0, policy_version 96570 (0.0005) [2023-03-07 09:42:06,123][155452] Updated weights for policy 0, policy_version 96580 (0.0006) [2023-03-07 09:42:06,922][155452] Updated weights for policy 0, policy_version 96590 (0.0006) [2023-03-07 09:42:07,700][155452] Updated weights for policy 0, policy_version 96600 (0.0006) [2023-03-07 09:42:08,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13013.5). Total num frames: 98926592. Throughput: 0: 12985.1. Samples: 98906463. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:42:08,378][155126] Avg episode reward: [(0, '2148.027')] [2023-03-07 09:42:08,491][155452] Updated weights for policy 0, policy_version 96610 (0.0006) [2023-03-07 09:42:09,285][155452] Updated weights for policy 0, policy_version 96620 (0.0006) [2023-03-07 09:42:10,061][155452] Updated weights for policy 0, policy_version 96630 (0.0006) [2023-03-07 09:42:10,865][155452] Updated weights for policy 0, policy_version 96640 (0.0006) [2023-03-07 09:42:11,644][155452] Updated weights for policy 0, policy_version 96650 (0.0006) [2023-03-07 09:42:12,423][155452] Updated weights for policy 0, policy_version 96660 (0.0006) [2023-03-07 09:42:13,222][155452] Updated weights for policy 0, policy_version 96670 (0.0006) [2023-03-07 09:42:13,367][155126] Fps is (10 sec: 13004.8, 60 sec: 12987.7, 300 sec: 13010.0). Total num frames: 98991104. Throughput: 0: 12986.6. Samples: 98984489. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:42:13,378][155126] Avg episode reward: [(0, '2224.449')] [2023-03-07 09:42:14,024][155452] Updated weights for policy 0, policy_version 96680 (0.0007) [2023-03-07 09:42:14,784][155452] Updated weights for policy 0, policy_version 96690 (0.0007) [2023-03-07 09:42:15,576][155452] Updated weights for policy 0, policy_version 96700 (0.0005) [2023-03-07 09:42:16,362][155452] Updated weights for policy 0, policy_version 96710 (0.0006) [2023-03-07 09:42:17,146][155452] Updated weights for policy 0, policy_version 96720 (0.0006) [2023-03-07 09:42:17,921][155452] Updated weights for policy 0, policy_version 96730 (0.0006) [2023-03-07 09:42:18,367][155126] Fps is (10 sec: 13004.7, 60 sec: 12987.7, 300 sec: 13010.0). Total num frames: 99056640. Throughput: 0: 12992.7. Samples: 99023705. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:42:18,368][155126] Avg episode reward: [(0, '2290.785')] [2023-03-07 09:42:18,718][155452] Updated weights for policy 0, policy_version 96740 (0.0006) [2023-03-07 09:42:19,506][155452] Updated weights for policy 0, policy_version 96750 (0.0007) [2023-03-07 09:42:20,284][155452] Updated weights for policy 0, policy_version 96760 (0.0007) [2023-03-07 09:42:21,061][155452] Updated weights for policy 0, policy_version 96770 (0.0006) [2023-03-07 09:42:21,860][155452] Updated weights for policy 0, policy_version 96780 (0.0006) [2023-03-07 09:42:22,666][155452] Updated weights for policy 0, policy_version 96790 (0.0006) [2023-03-07 09:42:23,367][155126] Fps is (10 sec: 13004.9, 60 sec: 12987.7, 300 sec: 13010.0). Total num frames: 99121152. Throughput: 0: 12996.6. Samples: 99101785. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:42:23,367][155126] Avg episode reward: [(0, '2069.001')] [2023-03-07 09:42:23,473][155452] Updated weights for policy 0, policy_version 96800 (0.0006) [2023-03-07 09:42:24,250][155452] Updated weights for policy 0, policy_version 96810 (0.0006) [2023-03-07 09:42:25,046][155452] Updated weights for policy 0, policy_version 96820 (0.0006) [2023-03-07 09:42:25,846][155452] Updated weights for policy 0, policy_version 96830 (0.0006) [2023-03-07 09:42:26,626][155452] Updated weights for policy 0, policy_version 96840 (0.0006) [2023-03-07 09:42:27,403][155452] Updated weights for policy 0, policy_version 96850 (0.0005) [2023-03-07 09:42:28,202][155452] Updated weights for policy 0, policy_version 96860 (0.0006) [2023-03-07 09:42:28,367][155126] Fps is (10 sec: 13005.0, 60 sec: 12987.8, 300 sec: 13010.0). Total num frames: 99186688. Throughput: 0: 12982.6. Samples: 99179443. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:42:28,367][155126] Avg episode reward: [(0, '2109.415')] [2023-03-07 09:42:28,998][155452] Updated weights for policy 0, policy_version 96870 (0.0007) [2023-03-07 09:42:29,771][155452] Updated weights for policy 0, policy_version 96880 (0.0007) [2023-03-07 09:42:30,563][155452] Updated weights for policy 0, policy_version 96890 (0.0006) [2023-03-07 09:42:31,349][155452] Updated weights for policy 0, policy_version 96900 (0.0006) [2023-03-07 09:42:32,119][155452] Updated weights for policy 0, policy_version 96910 (0.0006) [2023-03-07 09:42:32,905][155452] Updated weights for policy 0, policy_version 96920 (0.0006) [2023-03-07 09:42:33,367][155126] Fps is (10 sec: 13004.8, 60 sec: 12987.7, 300 sec: 13006.5). Total num frames: 99251200. Throughput: 0: 12979.9. Samples: 99218325. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:42:33,367][155126] Avg episode reward: [(0, '2186.336')] [2023-03-07 09:42:33,724][155452] Updated weights for policy 0, policy_version 96930 (0.0007) [2023-03-07 09:42:34,511][155452] Updated weights for policy 0, policy_version 96940 (0.0006) [2023-03-07 09:42:35,300][155452] Updated weights for policy 0, policy_version 96950 (0.0006) [2023-03-07 09:42:36,085][155452] Updated weights for policy 0, policy_version 96960 (0.0006) [2023-03-07 09:42:36,871][155452] Updated weights for policy 0, policy_version 96970 (0.0006) [2023-03-07 09:42:37,655][155452] Updated weights for policy 0, policy_version 96980 (0.0006) [2023-03-07 09:42:38,367][155126] Fps is (10 sec: 12902.3, 60 sec: 12987.7, 300 sec: 13006.5). Total num frames: 99315712. Throughput: 0: 12984.0. Samples: 99296323. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:42:38,367][155126] Avg episode reward: [(0, '2189.026')] [2023-03-07 09:42:38,454][155452] Updated weights for policy 0, policy_version 96990 (0.0006) [2023-03-07 09:42:39,228][155452] Updated weights for policy 0, policy_version 97000 (0.0006) [2023-03-07 09:42:40,021][155452] Updated weights for policy 0, policy_version 97010 (0.0006) [2023-03-07 09:42:40,805][155452] Updated weights for policy 0, policy_version 97020 (0.0006) [2023-03-07 09:42:41,590][155452] Updated weights for policy 0, policy_version 97030 (0.0006) [2023-03-07 09:42:42,386][155452] Updated weights for policy 0, policy_version 97040 (0.0006) [2023-03-07 09:42:43,176][155452] Updated weights for policy 0, policy_version 97050 (0.0005) [2023-03-07 09:42:43,367][155126] Fps is (10 sec: 13004.9, 60 sec: 12987.8, 300 sec: 13006.5). Total num frames: 99381248. Throughput: 0: 12981.9. Samples: 99374210. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:42:43,367][155126] Avg episode reward: [(0, '2106.436')] [2023-03-07 09:42:43,950][155452] Updated weights for policy 0, policy_version 97060 (0.0006) [2023-03-07 09:42:44,735][155452] Updated weights for policy 0, policy_version 97070 (0.0006) [2023-03-07 09:42:45,510][155452] Updated weights for policy 0, policy_version 97080 (0.0006) [2023-03-07 09:42:46,313][155452] Updated weights for policy 0, policy_version 97090 (0.0006) [2023-03-07 09:42:47,098][155452] Updated weights for policy 0, policy_version 97100 (0.0006) [2023-03-07 09:42:47,900][155452] Updated weights for policy 0, policy_version 97110 (0.0007) [2023-03-07 09:42:48,367][155126] Fps is (10 sec: 13107.2, 60 sec: 12987.7, 300 sec: 13010.0). Total num frames: 99446784. Throughput: 0: 13001.2. Samples: 99413547. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:42:48,367][155126] Avg episode reward: [(0, '2306.896')] [2023-03-07 09:42:48,675][155452] Updated weights for policy 0, policy_version 97120 (0.0006) [2023-03-07 09:42:49,477][155452] Updated weights for policy 0, policy_version 97130 (0.0006) [2023-03-07 09:42:50,255][155452] Updated weights for policy 0, policy_version 97140 (0.0006) [2023-03-07 09:42:51,040][155452] Updated weights for policy 0, policy_version 97150 (0.0007) [2023-03-07 09:42:51,836][155452] Updated weights for policy 0, policy_version 97160 (0.0006) [2023-03-07 09:42:52,613][155452] Updated weights for policy 0, policy_version 97170 (0.0006) [2023-03-07 09:42:53,367][155126] Fps is (10 sec: 13004.8, 60 sec: 12987.8, 300 sec: 13006.5). Total num frames: 99511296. Throughput: 0: 12994.4. Samples: 99491209. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:42:53,367][155126] Avg episode reward: [(0, '2220.528')] [2023-03-07 09:42:53,414][155452] Updated weights for policy 0, policy_version 97180 (0.0006) [2023-03-07 09:42:54,201][155452] Updated weights for policy 0, policy_version 97190 (0.0006) [2023-03-07 09:42:54,980][155452] Updated weights for policy 0, policy_version 97200 (0.0007) [2023-03-07 09:42:55,777][155452] Updated weights for policy 0, policy_version 97210 (0.0006) [2023-03-07 09:42:56,562][155452] Updated weights for policy 0, policy_version 97220 (0.0006) [2023-03-07 09:42:57,362][155452] Updated weights for policy 0, policy_version 97230 (0.0006) [2023-03-07 09:42:58,128][155452] Updated weights for policy 0, policy_version 97240 (0.0006) [2023-03-07 09:42:58,367][155126] Fps is (10 sec: 12902.3, 60 sec: 12987.7, 300 sec: 13006.5). Total num frames: 99575808. Throughput: 0: 12993.9. Samples: 99569216. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:42:58,368][155126] Avg episode reward: [(0, '2176.709')] [2023-03-07 09:42:58,927][155452] Updated weights for policy 0, policy_version 97250 (0.0006) [2023-03-07 09:42:59,705][155452] Updated weights for policy 0, policy_version 97260 (0.0006) [2023-03-07 09:43:00,488][155452] Updated weights for policy 0, policy_version 97270 (0.0008) [2023-03-07 09:43:01,302][155452] Updated weights for policy 0, policy_version 97280 (0.0006) [2023-03-07 09:43:02,083][155452] Updated weights for policy 0, policy_version 97290 (0.0005) [2023-03-07 09:43:02,871][155452] Updated weights for policy 0, policy_version 97300 (0.0006) [2023-03-07 09:43:03,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13006.5). Total num frames: 99641344. Throughput: 0: 12989.6. Samples: 99608236. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:43:03,367][155126] Avg episode reward: [(0, '2252.351')] [2023-03-07 09:43:03,665][155452] Updated weights for policy 0, policy_version 97310 (0.0006) [2023-03-07 09:43:04,454][155452] Updated weights for policy 0, policy_version 97320 (0.0006) [2023-03-07 09:43:05,245][155452] Updated weights for policy 0, policy_version 97330 (0.0006) [2023-03-07 09:43:06,019][155452] Updated weights for policy 0, policy_version 97340 (0.0006) [2023-03-07 09:43:06,814][155452] Updated weights for policy 0, policy_version 97350 (0.0006) [2023-03-07 09:43:07,604][155452] Updated weights for policy 0, policy_version 97360 (0.0006) [2023-03-07 09:43:08,367][155126] Fps is (10 sec: 13004.8, 60 sec: 12987.7, 300 sec: 13006.5). Total num frames: 99705856. Throughput: 0: 12986.9. Samples: 99686196. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:43:08,368][155126] Avg episode reward: [(0, '2127.129')] [2023-03-07 09:43:08,397][155452] Updated weights for policy 0, policy_version 97370 (0.0006) [2023-03-07 09:43:09,174][155452] Updated weights for policy 0, policy_version 97380 (0.0006) [2023-03-07 09:43:09,955][155452] Updated weights for policy 0, policy_version 97390 (0.0007) [2023-03-07 09:43:10,745][155452] Updated weights for policy 0, policy_version 97400 (0.0007) [2023-03-07 09:43:11,542][155452] Updated weights for policy 0, policy_version 97410 (0.0006) [2023-03-07 09:43:12,314][155452] Updated weights for policy 0, policy_version 97420 (0.0006) [2023-03-07 09:43:13,090][155452] Updated weights for policy 0, policy_version 97430 (0.0007) [2023-03-07 09:43:13,367][155126] Fps is (10 sec: 13004.8, 60 sec: 13004.8, 300 sec: 13006.5). Total num frames: 99771392. Throughput: 0: 12998.3. Samples: 99764365. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:43:13,367][155126] Avg episode reward: [(0, '1977.467')] [2023-03-07 09:43:13,899][155452] Updated weights for policy 0, policy_version 97440 (0.0007) [2023-03-07 09:43:14,693][155452] Updated weights for policy 0, policy_version 97450 (0.0006) [2023-03-07 09:43:15,454][155452] Updated weights for policy 0, policy_version 97460 (0.0006) [2023-03-07 09:43:16,248][155452] Updated weights for policy 0, policy_version 97470 (0.0007) [2023-03-07 09:43:17,030][155452] Updated weights for policy 0, policy_version 97480 (0.0006) [2023-03-07 09:43:17,815][155452] Updated weights for policy 0, policy_version 97490 (0.0006) [2023-03-07 09:43:18,367][155126] Fps is (10 sec: 13004.8, 60 sec: 12987.7, 300 sec: 13006.5). Total num frames: 99835904. Throughput: 0: 13001.9. Samples: 99803410. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:43:18,367][155126] Avg episode reward: [(0, '1929.760')] [2023-03-07 09:43:18,594][155452] Updated weights for policy 0, policy_version 97500 (0.0008) [2023-03-07 09:43:19,399][155452] Updated weights for policy 0, policy_version 97510 (0.0007) [2023-03-07 09:43:20,177][155452] Updated weights for policy 0, policy_version 97520 (0.0006) [2023-03-07 09:43:20,950][155452] Updated weights for policy 0, policy_version 97530 (0.0006) [2023-03-07 09:43:21,748][155452] Updated weights for policy 0, policy_version 97540 (0.0006) [2023-03-07 09:43:22,535][155452] Updated weights for policy 0, policy_version 97550 (0.0006) [2023-03-07 09:43:23,317][155452] Updated weights for policy 0, policy_version 97560 (0.0006) [2023-03-07 09:43:23,367][155126] Fps is (10 sec: 13004.7, 60 sec: 13004.8, 300 sec: 13006.5). Total num frames: 99901440. Throughput: 0: 13004.3. Samples: 99881515. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:43:23,367][155126] Avg episode reward: [(0, '2200.758')] [2023-03-07 09:43:24,097][155452] Updated weights for policy 0, policy_version 97570 (0.0005) [2023-03-07 09:43:24,894][155452] Updated weights for policy 0, policy_version 97580 (0.0006) [2023-03-07 09:43:25,682][155452] Updated weights for policy 0, policy_version 97590 (0.0006) [2023-03-07 09:43:26,463][155452] Updated weights for policy 0, policy_version 97600 (0.0007) [2023-03-07 09:43:27,267][155452] Updated weights for policy 0, policy_version 97610 (0.0006) [2023-03-07 09:43:28,046][155452] Updated weights for policy 0, policy_version 97620 (0.0006) [2023-03-07 09:43:28,367][155126] Fps is (10 sec: 13107.3, 60 sec: 13004.8, 300 sec: 13010.0). Total num frames: 99966976. Throughput: 0: 13006.1. Samples: 99959486. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:43:28,367][155126] Avg episode reward: [(0, '2189.310')] [2023-03-07 09:43:28,371][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000097624_99966976.pth... [2023-03-07 09:43:28,401][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000094576_96845824.pth [2023-03-07 09:43:28,832][155452] Updated weights for policy 0, policy_version 97630 (0.0006) [2023-03-07 09:43:29,623][155452] Updated weights for policy 0, policy_version 97640 (0.0006) [2023-03-07 09:43:30,416][155452] Updated weights for policy 0, policy_version 97650 (0.0006) [2023-03-07 09:43:31,047][156005] Stopping RolloutWorker_w23... [2023-03-07 09:43:31,047][155458] Stopping RolloutWorker_w5... [2023-03-07 09:43:31,047][155689] Stopping RolloutWorker_w17... [2023-03-07 09:43:31,047][155960] Stopping RolloutWorker_w22... [2023-03-07 09:43:31,047][155457] Stopping RolloutWorker_w4... [2023-03-07 09:43:31,047][156041] Stopping RolloutWorker_w29... [2023-03-07 09:43:31,047][155706] Stopping RolloutWorker_w20... [2023-03-07 09:43:31,047][155686] Stopping RolloutWorker_w11... [2023-03-07 09:43:31,047][156005] Loop rollout_proc23_evt_loop terminating... [2023-03-07 09:43:31,047][155674] Stopping RolloutWorker_w10... [2023-03-07 09:43:31,047][155681] Stopping RolloutWorker_w21... [2023-03-07 09:43:31,047][155455] Stopping RolloutWorker_w2... [2023-03-07 09:43:31,047][156038] Stopping RolloutWorker_w26... [2023-03-07 09:43:31,047][155960] Loop rollout_proc22_evt_loop terminating... [2023-03-07 09:43:31,047][155724] Stopping RolloutWorker_w13... [2023-03-07 09:43:31,047][155684] Stopping RolloutWorker_w7... [2023-03-07 09:43:31,047][155680] Stopping RolloutWorker_w15... [2023-03-07 09:43:31,047][155457] Loop rollout_proc4_evt_loop terminating... [2023-03-07 09:43:31,047][155688] Stopping RolloutWorker_w19... 
[2023-03-07 09:43:31,047][155689] Loop rollout_proc17_evt_loop terminating... [2023-03-07 09:43:31,047][155683] Stopping RolloutWorker_w16... [2023-03-07 09:43:31,047][155458] Loop rollout_proc5_evt_loop terminating... [2023-03-07 09:43:31,047][155679] Stopping RolloutWorker_w9... [2023-03-07 09:43:31,047][155706] Loop rollout_proc20_evt_loop terminating... [2023-03-07 09:43:31,047][155685] Stopping RolloutWorker_w8... [2023-03-07 09:43:31,047][155682] Stopping RolloutWorker_w18... [2023-03-07 09:43:31,047][155454] Stopping RolloutWorker_w1... [2023-03-07 09:43:31,047][156041] Loop rollout_proc29_evt_loop terminating... [2023-03-07 09:43:31,047][156074] Stopping RolloutWorker_w28... [2023-03-07 09:43:31,047][155686] Loop rollout_proc11_evt_loop terminating... [2023-03-07 09:43:31,047][155401] Stopping Batcher_0... [2023-03-07 09:43:31,047][155674] Loop rollout_proc10_evt_loop terminating... [2023-03-07 09:43:31,047][155681] Loop rollout_proc21_evt_loop terminating... [2023-03-07 09:43:31,048][155688] Loop rollout_proc19_evt_loop terminating... [2023-03-07 09:43:31,048][155684] Loop rollout_proc7_evt_loop terminating... [2023-03-07 09:43:31,047][156076] Stopping RolloutWorker_w31... [2023-03-07 09:43:31,047][155455] Loop rollout_proc2_evt_loop terminating... [2023-03-07 09:43:31,048][156038] Loop rollout_proc26_evt_loop terminating... [2023-03-07 09:43:31,048][155724] Loop rollout_proc13_evt_loop terminating... [2023-03-07 09:43:31,048][155680] Loop rollout_proc15_evt_loop terminating... [2023-03-07 09:43:31,047][156006] Stopping RolloutWorker_w24... [2023-03-07 09:43:31,048][155679] Loop rollout_proc9_evt_loop terminating... [2023-03-07 09:43:31,047][155675] Stopping RolloutWorker_w12... [2023-03-07 09:43:31,048][155682] Loop rollout_proc18_evt_loop terminating... [2023-03-07 09:43:31,048][156074] Loop rollout_proc28_evt_loop terminating... [2023-03-07 09:43:31,048][155454] Loop rollout_proc1_evt_loop terminating... 
[2023-03-07 09:43:31,048][155683] Loop rollout_proc16_evt_loop terminating... [2023-03-07 09:43:31,048][155687] Stopping RolloutWorker_w14... [2023-03-07 09:43:31,048][156076] Loop rollout_proc31_evt_loop terminating... [2023-03-07 09:43:31,048][155685] Loop rollout_proc8_evt_loop terminating... [2023-03-07 09:43:31,048][156075] Stopping RolloutWorker_w25... [2023-03-07 09:43:31,048][156006] Loop rollout_proc24_evt_loop terminating... [2023-03-07 09:43:31,048][155687] Loop rollout_proc14_evt_loop terminating... [2023-03-07 09:43:31,048][155401] Loop batcher_evt_loop terminating... [2023-03-07 09:43:31,048][156075] Loop rollout_proc25_evt_loop terminating... [2023-03-07 09:43:31,048][155675] Loop rollout_proc12_evt_loop terminating... [2023-03-07 09:43:31,048][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000097658_100001792.pth... [2023-03-07 09:43:31,047][155126] Component RolloutWorker_w23 stopped! [2023-03-07 09:43:31,048][155453] Stopping RolloutWorker_w0... [2023-03-07 09:43:31,048][156039] Stopping RolloutWorker_w27... [2023-03-07 09:43:31,049][155126] Component RolloutWorker_w17 stopped! [2023-03-07 09:43:31,049][156039] Loop rollout_proc27_evt_loop terminating... [2023-03-07 09:43:31,049][155453] Loop rollout_proc0_evt_loop terminating... [2023-03-07 09:43:31,049][155126] Component RolloutWorker_w5 stopped! [2023-03-07 09:43:31,049][155126] Component RolloutWorker_w29 stopped! [2023-03-07 09:43:31,049][155126] Component RolloutWorker_w10 stopped! [2023-03-07 09:43:31,050][155126] Component RolloutWorker_w11 stopped! [2023-03-07 09:43:31,050][155126] Component RolloutWorker_w22 stopped! [2023-03-07 09:43:31,050][155456] Stopping RolloutWorker_w3... [2023-03-07 09:43:31,050][155126] Component RolloutWorker_w4 stopped! [2023-03-07 09:43:31,050][155456] Loop rollout_proc3_evt_loop terminating... [2023-03-07 09:43:31,050][155126] Component RolloutWorker_w21 stopped! 
[2023-03-07 09:43:31,051][155126] Component RolloutWorker_w20 stopped! [2023-03-07 09:43:31,051][155126] Component RolloutWorker_w16 stopped! [2023-03-07 09:43:31,051][155126] Component RolloutWorker_w2 stopped! [2023-03-07 09:43:31,051][155126] Component Batcher_0 stopped! [2023-03-07 09:43:31,051][155126] Component RolloutWorker_w26 stopped! [2023-03-07 09:43:31,052][155126] Component RolloutWorker_w15 stopped! [2023-03-07 09:43:31,052][155126] Component RolloutWorker_w13 stopped! [2023-03-07 09:43:31,052][155126] Component RolloutWorker_w8 stopped! [2023-03-07 09:43:31,053][155126] Component RolloutWorker_w9 stopped! [2023-03-07 09:43:31,053][155126] Component RolloutWorker_w7 stopped! [2023-03-07 09:43:31,053][155126] Component RolloutWorker_w19 stopped! [2023-03-07 09:43:31,053][155126] Component RolloutWorker_w1 stopped! [2023-03-07 09:43:31,054][155126] Component RolloutWorker_w18 stopped! [2023-03-07 09:43:31,054][155126] Component RolloutWorker_w28 stopped! [2023-03-07 09:43:31,054][155126] Component RolloutWorker_w12 stopped! [2023-03-07 09:43:31,054][155126] Component RolloutWorker_w24 stopped! [2023-03-07 09:43:31,055][155126] Component RolloutWorker_w31 stopped! [2023-03-07 09:43:31,055][155126] Component RolloutWorker_w14 stopped! [2023-03-07 09:43:31,055][155126] Component RolloutWorker_w25 stopped! [2023-03-07 09:43:31,055][155126] Component RolloutWorker_w27 stopped! [2023-03-07 09:43:31,055][155126] Component RolloutWorker_w0 stopped! [2023-03-07 09:43:31,055][155126] Component RolloutWorker_w3 stopped! [2023-03-07 09:43:31,056][155126] Component RolloutWorker_w6 stopped! [2023-03-07 09:43:31,057][155673] Stopping RolloutWorker_w6... [2023-03-07 09:43:31,057][155673] Loop rollout_proc6_evt_loop terminating... [2023-03-07 09:43:31,071][156073] Stopping RolloutWorker_w30... [2023-03-07 09:43:31,072][156073] Loop rollout_proc30_evt_loop terminating... [2023-03-07 09:43:31,071][155126] Component RolloutWorker_w30 stopped! 
[2023-03-07 09:43:31,118][155452] Weights refcount: 2 0 [2023-03-07 09:43:31,121][155452] Stopping InferenceWorker_p0-w0... [2023-03-07 09:43:31,121][155452] Loop inference_proc0-0_evt_loop terminating... [2023-03-07 09:43:31,122][155126] Component InferenceWorker_p0-w0 stopped! [2023-03-07 09:43:31,162][155401] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000096101_98407424.pth [2023-03-07 09:43:31,171][155401] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-button-v2/checkpoint_p0/checkpoint_000097658_100001792.pth... [2023-03-07 09:43:31,261][155401] Stopping LearnerWorker_p0... [2023-03-07 09:43:31,262][155401] Loop learner_proc0_evt_loop terminating... [2023-03-07 09:43:31,261][155126] Component LearnerWorker_p0 stopped! [2023-03-07 09:43:31,262][155126] Waiting for process learner_proc0 to stop... [2023-03-07 09:43:32,412][155126] Waiting for process inference_proc0-0 to join... [2023-03-07 09:43:32,413][155126] Waiting for process rollout_proc0 to join... [2023-03-07 09:43:32,413][155126] Waiting for process rollout_proc1 to join... [2023-03-07 09:43:32,413][155126] Waiting for process rollout_proc2 to join... [2023-03-07 09:43:32,413][155126] Waiting for process rollout_proc3 to join... [2023-03-07 09:43:32,414][155126] Waiting for process rollout_proc4 to join... [2023-03-07 09:43:32,414][155126] Waiting for process rollout_proc5 to join... [2023-03-07 09:43:32,414][155126] Waiting for process rollout_proc6 to join... [2023-03-07 09:43:32,414][155126] Waiting for process rollout_proc7 to join... [2023-03-07 09:43:32,415][155126] Waiting for process rollout_proc8 to join... [2023-03-07 09:43:32,415][155126] Waiting for process rollout_proc9 to join... [2023-03-07 09:43:32,415][155126] Waiting for process rollout_proc10 to join... [2023-03-07 09:43:32,415][155126] Waiting for process rollout_proc11 to join... 
[2023-03-07 09:43:32,415][155126] Waiting for process rollout_proc12 to join... [2023-03-07 09:43:32,416][155126] Waiting for process rollout_proc13 to join... [2023-03-07 09:43:32,416][155126] Waiting for process rollout_proc14 to join... [2023-03-07 09:43:32,416][155126] Waiting for process rollout_proc15 to join... [2023-03-07 09:43:32,416][155126] Waiting for process rollout_proc16 to join... [2023-03-07 09:43:32,417][155126] Waiting for process rollout_proc17 to join... [2023-03-07 09:43:32,417][155126] Waiting for process rollout_proc18 to join... [2023-03-07 09:43:32,417][155126] Waiting for process rollout_proc19 to join... [2023-03-07 09:43:32,417][155126] Waiting for process rollout_proc20 to join... [2023-03-07 09:43:32,418][155126] Waiting for process rollout_proc21 to join... [2023-03-07 09:43:32,418][155126] Waiting for process rollout_proc22 to join... [2023-03-07 09:43:32,418][155126] Waiting for process rollout_proc23 to join... [2023-03-07 09:43:32,418][155126] Waiting for process rollout_proc24 to join... [2023-03-07 09:43:32,418][155126] Waiting for process rollout_proc25 to join... [2023-03-07 09:43:32,419][155126] Waiting for process rollout_proc26 to join... [2023-03-07 09:43:32,419][155126] Waiting for process rollout_proc27 to join... [2023-03-07 09:43:32,419][155126] Waiting for process rollout_proc28 to join... [2023-03-07 09:43:32,419][155126] Waiting for process rollout_proc29 to join... [2023-03-07 09:43:32,420][155126] Waiting for process rollout_proc30 to join... [2023-03-07 09:43:32,420][155126] Waiting for process rollout_proc31 to join... 
[2023-03-07 09:43:32,420][155126] Batcher 0 profile tree view:
batching: 855.9808, releasing_batches: 1.6545
[2023-03-07 09:43:32,420][155126] InferenceWorker_p0-w0 profile tree view:
wait_policy: 0.0001
  wait_policy_total: 232.5132
update_model: 134.7992
  weight_update: 0.0007
one_step: 0.0065
  handle_policy_step: 6922.1110
    deserialize: 207.5475, stack: 36.1919, obs_to_device_normalize: 1229.0658, forward: 3074.4022, send_messages: 1386.9794
    prepare_outputs: 716.4201
      to_cpu: 363.8341
[2023-03-07 09:43:32,420][155126] Learner 0 profile tree view:
misc: 0.4608, prepare_batch: 426.8559
train: 914.8080
  epoch_init: 0.3939, minibatch_init: 0.3915, losses_postprocess: 29.5881, kl_divergence: 35.7258, after_optimizer: 95.5071
  calculate_losses: 303.2087
    losses_init: 0.2252, forward_head: 16.8826, bptt_initial: 109.9870, tail: 60.8447, advantages_returns: 7.6228, losses: 28.5697
    bptt: 70.0652
      bptt_forward_core: 67.6402
  update: 427.3006
    clip: 55.4146
[2023-03-07 09:43:32,421][155126] RolloutWorker_w0 profile tree view:
wait_for_trajectories: 4.1003, enqueue_policy_requests: 183.2453, env_step: 2959.2879, overhead: 170.0398, complete_rollouts: 9.7847
save_policy_outputs: 235.0129
  split_output_tensors: 115.0067
[2023-03-07 09:43:32,421][155126] RolloutWorker_w31 profile tree view:
wait_for_trajectories: 4.2051, enqueue_policy_requests: 189.8305, env_step: 3017.5267, overhead: 172.4257, complete_rollouts: 9.9336
save_policy_outputs: 237.7731
  split_output_tensors: 115.6144
[2023-03-07 09:43:32,421][155126] Loop Runner_EvtLoop terminating...
[2023-03-07 09:43:32,421][155126] Runner profile tree view:
main_loop: 7680.9260
[2023-03-07 09:43:32,421][155126] Collected {0: 100001792}, FPS: 13019.5