[2023-03-07 16:12:28,404][231894] Saving configuration to /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/config.json... [2023-03-07 16:12:28,419][231894] Rollout worker 0 uses device cpu [2023-03-07 16:12:28,419][231894] Rollout worker 1 uses device cpu [2023-03-07 16:12:28,420][231894] Rollout worker 2 uses device cpu [2023-03-07 16:12:28,420][231894] Rollout worker 3 uses device cpu [2023-03-07 16:12:28,420][231894] Rollout worker 4 uses device cpu [2023-03-07 16:12:28,420][231894] Rollout worker 5 uses device cpu [2023-03-07 16:12:28,420][231894] Rollout worker 6 uses device cpu [2023-03-07 16:12:28,420][231894] Rollout worker 7 uses device cpu [2023-03-07 16:12:28,420][231894] Rollout worker 8 uses device cpu [2023-03-07 16:12:28,420][231894] Rollout worker 9 uses device cpu [2023-03-07 16:12:28,420][231894] Rollout worker 10 uses device cpu [2023-03-07 16:12:28,421][231894] Rollout worker 11 uses device cpu [2023-03-07 16:12:28,421][231894] Rollout worker 12 uses device cpu [2023-03-07 16:12:28,421][231894] Rollout worker 13 uses device cpu [2023-03-07 16:12:28,421][231894] Rollout worker 14 uses device cpu [2023-03-07 16:12:28,421][231894] Rollout worker 15 uses device cpu [2023-03-07 16:12:28,421][231894] Rollout worker 16 uses device cpu [2023-03-07 16:12:28,421][231894] Rollout worker 17 uses device cpu [2023-03-07 16:12:28,421][231894] Rollout worker 18 uses device cpu [2023-03-07 16:12:28,421][231894] Rollout worker 19 uses device cpu [2023-03-07 16:12:28,422][231894] Rollout worker 20 uses device cpu [2023-03-07 16:12:28,422][231894] Rollout worker 21 uses device cpu [2023-03-07 16:12:28,422][231894] Rollout worker 22 uses device cpu [2023-03-07 16:12:28,422][231894] Rollout worker 23 uses device cpu [2023-03-07 16:12:28,422][231894] Rollout worker 24 uses device cpu [2023-03-07 16:12:28,422][231894] Rollout worker 25 uses device cpu [2023-03-07 16:12:28,422][231894] Rollout worker 26 uses device cpu [2023-03-07 16:12:28,422][231894] Rollout worker 27 uses device cpu [2023-03-07 16:12:28,422][231894] Rollout worker 28 uses device cpu [2023-03-07 16:12:28,423][231894] Rollout worker 29 uses device cpu [2023-03-07 16:12:28,423][231894] Rollout worker 30 uses device cpu [2023-03-07 16:12:28,423][231894] Rollout worker 31 uses device cpu [2023-03-07 16:12:28,436][231894] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-03-07 16:12:28,436][231894] InferenceWorker_p0-w0: min num requests: 10 [2023-03-07 16:12:28,515][231894] Starting all processes... [2023-03-07 16:12:28,516][231894] Starting process learner_proc0 [2023-03-07 16:12:28,566][231894] Starting all processes... [2023-03-07 16:12:28,634][231894] Starting process inference_proc0-0 [2023-03-07 16:12:28,634][231894] Starting process rollout_proc0 [2023-03-07 16:12:28,634][231894] Starting process rollout_proc1 [2023-03-07 16:12:28,634][231894] Starting process rollout_proc2 [2023-03-07 16:12:28,634][231894] Starting process rollout_proc3 [2023-03-07 16:12:28,634][231894] Starting process rollout_proc4 [2023-03-07 16:12:28,634][231894] Starting process rollout_proc5 [2023-03-07 16:12:28,634][231894] Starting process rollout_proc6 [2023-03-07 16:12:28,634][231894] Starting process rollout_proc7 [2023-03-07 16:12:28,635][231894] Starting process rollout_proc8 [2023-03-07 16:12:28,635][231894] Starting process rollout_proc9 [2023-03-07 16:12:28,635][231894] Starting process rollout_proc10 [2023-03-07 16:12:28,635][231894] Starting process rollout_proc11 [2023-03-07 16:12:28,635][231894] Starting process rollout_proc12 [2023-03-07 16:12:28,635][231894] Starting process rollout_proc13 [2023-03-07 16:12:28,635][231894] Starting process rollout_proc14 [2023-03-07 16:12:28,646][231894] Starting process rollout_proc15 [2023-03-07 16:12:28,646][231894] Starting process rollout_proc16 [2023-03-07 16:12:28,652][231894] Starting process rollout_proc17 [2023-03-07 16:12:28,658][231894] Starting process rollout_proc18 [2023-03-07 16:12:28,665][231894] Starting process rollout_proc19 [2023-03-07 16:12:28,670][231894] Starting process rollout_proc20 [2023-03-07 16:12:28,760][231894] Starting process rollout_proc21 [2023-03-07 16:12:28,789][231894] Starting process rollout_proc22 [2023-03-07 16:12:28,798][231894] Starting process rollout_proc23 [2023-03-07 16:12:28,807][231894] Starting process rollout_proc24 [2023-03-07 16:12:28,807][231894] Starting process rollout_proc25 [2023-03-07 16:12:28,807][231894] Starting process rollout_proc26 [2023-03-07 16:12:28,812][231894] Starting process rollout_proc27 [2023-03-07 16:12:28,821][231894] Starting process rollout_proc28 [2023-03-07 16:12:28,822][231894] Starting process rollout_proc29 [2023-03-07 16:12:28,822][231894] Starting process rollout_proc30 [2023-03-07 16:12:28,822][231894] Starting process rollout_proc31 [2023-03-07 16:12:30,585][232173] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-03-07 16:12:30,585][232173] Set environment var CUDA_VISIBLE_DEVICES to '0' (GPU indices [0]) for learning process 0 [2023-03-07 16:12:30,602][232173] Num visible devices: 1 [2023-03-07 16:12:30,633][232173] WARNING! It is generally recommended to enable Fixed KL loss (https://arxiv.org/pdf/1707.06347.pdf) for continuous action tasks to avoid potential numerical issues. I.e. set --kl_loss_coeff=0.1 [2023-03-07 16:12:30,633][232173] Starting seed is not provided [2023-03-07 16:12:30,633][232173] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-03-07 16:12:30,633][232173] Initializing actor-critic model on device cuda:0 [2023-03-07 16:12:30,633][232173] RunningMeanStd input shape: (39,) [2023-03-07 16:12:30,634][232173] RunningMeanStd input shape: (1,) [2023-03-07 16:12:30,675][232228] Worker 3 uses CPU cores [3] [2023-03-07 16:12:30,702][232227] Worker 2 uses CPU cores [2] [2023-03-07 16:12:30,775][232173] Created Actor Critic model with architecture: [2023-03-07 16:12:30,775][232173] ActorCriticSharedWeights( (obs_normalizer): ObservationNormalizer( (running_mean_std): RunningMeanStdDictInPlace( (running_mean_std): ModuleDict( (obs): RunningMeanStdInPlace() ) ) ) (returns_normalizer): RecursiveScriptModule(original_name=RunningMeanStdInPlace) (encoder): MultiInputEncoder( (encoders): ModuleDict( (obs): MlpEncoder( (mlp_head): RecursiveScriptModule( original_name=Sequential (0): RecursiveScriptModule(original_name=Linear) (1): RecursiveScriptModule(original_name=ELU) (2): RecursiveScriptModule(original_name=Linear) (3): RecursiveScriptModule(original_name=ELU) ) ) ) ) (core): ModelCoreRNN( (core): GRU(512, 512) ) (decoder): MlpDecoder( (mlp): Identity() ) (critic_linear): Linear(in_features=512, out_features=1, bias=True) (action_parameterization): ActionParameterizationDefault( (distribution_linear): Linear(in_features=512, out_features=8, bias=True) ) ) [2023-03-07 16:12:30,875][232431] Worker 11 uses CPU cores [11] [2023-03-07 16:12:30,935][232463] Worker 21 uses CPU cores [21] [2023-03-07 16:12:31,131][232598] Worker 28 uses CPU cores [28] [2023-03-07 16:12:31,191][232425] Worker 12 uses CPU cores [12] [2023-03-07 16:12:31,209][232229] Worker 4 uses CPU cores [4] [2023-03-07 16:12:31,391][232391] Worker 19 uses CPU cores [19] [2023-03-07 16:12:31,510][232389] Worker 7 uses CPU cores [7] [2023-03-07 16:12:31,630][232355] Worker 5 uses CPU cores [5] [2023-03-07 16:12:31,646][232226] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-03-07 16:12:31,646][232226] Set environment var CUDA_VISIBLE_DEVICES to '0' (GPU indices [0]) for inference process 0 [2023-03-07 16:12:31,656][232226] Num visible devices: 1 [2023-03-07 16:12:31,827][232390] Worker 13 uses CPU cores [13] [2023-03-07 16:12:31,946][232755] Worker 31 uses CPU cores [31] [2023-03-07 16:12:32,095][232428] Worker 8 uses CPU cores [8] [2023-03-07 16:12:32,195][232501] Worker 26 uses CPU cores [26] [2023-03-07 16:12:32,262][232173] Using optimizer [2023-03-07 16:12:32,263][232173] No checkpoints found [2023-03-07 16:12:32,263][232173] Did not load from checkpoint, starting from scratch! [2023-03-07 16:12:32,263][232173] Initialized policy 0 weights for model version 0 [2023-03-07 16:12:32,276][232173] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-03-07 16:12:32,279][232173] LearnerWorker_p0 finished initialization! [2023-03-07 16:12:32,307][232495] Worker 22 uses CPU cores [22] [2023-03-07 16:12:32,308][232430] Worker 9 uses CPU cores [9] [2023-03-07 16:12:32,335][232226] RunningMeanStd input shape: (39,) [2023-03-07 16:12:32,335][232226] RunningMeanStd input shape: (1,) [2023-03-07 16:12:32,544][232429] Worker 15 uses CPU cores [15] [2023-03-07 16:12:32,558][232356] Worker 17 uses CPU cores [17] [2023-03-07 16:12:32,719][232427] Worker 20 uses CPU cores [20] [2023-03-07 16:12:32,767][232357] Worker 16 uses CPU cores [16] [2023-03-07 16:12:33,003][232411] Worker 10 uses CPU cores [10] [2023-03-07 16:12:33,059][232566] Worker 27 uses CPU cores [27] [2023-03-07 16:12:33,067][232500] Worker 25 uses CPU cores [25] [2023-03-07 16:12:33,143][232498] Worker 24 uses CPU cores [24] [2023-03-07 16:12:33,207][231894] Inference worker 0-0 is ready! [2023-03-07 16:12:33,208][231894] All inference workers are ready! Signal rollout workers to start! [2023-03-07 16:12:33,272][232224] Worker 0 uses CPU cores [0] [2023-03-07 16:12:33,483][232426] Worker 18 uses CPU cores [18] [2023-03-07 16:12:33,591][232225] Worker 1 uses CPU cores [1] [2023-03-07 16:12:33,679][232392] Worker 14 uses CPU cores [14] [2023-03-07 16:12:33,895][232565] Worker 30 uses CPU cores [30] [2023-03-07 16:12:33,951][232354] Worker 6 uses CPU cores [6] [2023-03-07 16:12:34,091][232692] Worker 29 uses CPU cores [29] [2023-03-07 16:12:34,226][232496] Worker 23 uses CPU cores [23] [2023-03-07 16:12:35,069][231894] Fps is (10 sec: nan, 60 sec: nan, 300 sec: nan). Total num frames: 0. Throughput: 0: nan. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0) [2023-03-07 16:12:35,344][232389] Decorrelating experience for 0 frames... [2023-03-07 16:12:35,351][232390] Decorrelating experience for 0 frames... [2023-03-07 16:12:35,427][232227] Decorrelating experience for 0 frames... [2023-03-07 16:12:35,434][232495] Decorrelating experience for 0 frames... [2023-03-07 16:12:35,436][232356] Decorrelating experience for 0 frames... [2023-03-07 16:12:35,449][232431] Decorrelating experience for 0 frames... [2023-03-07 16:12:35,454][232357] Decorrelating experience for 0 frames... [2023-03-07 16:12:35,481][232430] Decorrelating experience for 0 frames... [2023-03-07 16:12:35,513][232501] Decorrelating experience for 0 frames... [2023-03-07 16:12:35,544][232429] Decorrelating experience for 0 frames... [2023-03-07 16:12:35,572][232425] Decorrelating experience for 0 frames... [2023-03-07 16:12:35,606][232228] Decorrelating experience for 0 frames... [2023-03-07 16:12:35,613][232598] Decorrelating experience for 0 frames... [2023-03-07 16:12:35,618][232391] Decorrelating experience for 0 frames... [2023-03-07 16:12:35,619][232427] Decorrelating experience for 0 frames... [2023-03-07 16:12:35,620][232229] Decorrelating experience for 0 frames... [2023-03-07 16:12:35,631][232355] Decorrelating experience for 0 frames... [2023-03-07 16:12:35,633][232463] Decorrelating experience for 0 frames... [2023-03-07 16:12:35,644][232498] Decorrelating experience for 0 frames... [2023-03-07 16:12:35,653][232428] Decorrelating experience for 0 frames... [2023-03-07 16:12:35,673][232224] Decorrelating experience for 0 frames... [2023-03-07 16:12:35,717][232411] Decorrelating experience for 0 frames... [2023-03-07 16:12:35,726][232566] Decorrelating experience for 0 frames... [2023-03-07 16:12:35,831][232755] Decorrelating experience for 0 frames... [2023-03-07 16:12:35,875][232500] Decorrelating experience for 0 frames... [2023-03-07 16:12:35,956][232426] Decorrelating experience for 0 frames... [2023-03-07 16:12:36,073][232225] Decorrelating experience for 0 frames... [2023-03-07 16:12:36,276][232565] Decorrelating experience for 0 frames... [2023-03-07 16:12:36,397][232392] Decorrelating experience for 0 frames... [2023-03-07 16:12:36,542][232354] Decorrelating experience for 0 frames... [2023-03-07 16:12:36,850][232692] Decorrelating experience for 0 frames... [2023-03-07 16:12:36,902][232496] Decorrelating experience for 0 frames... [2023-03-07 16:12:37,644][232389] Decorrelating experience for 32 frames... [2023-03-07 16:12:37,673][232390] Decorrelating experience for 32 frames... [2023-03-07 16:12:37,697][232356] Decorrelating experience for 32 frames... [2023-03-07 16:12:37,704][232495] Decorrelating experience for 32 frames... [2023-03-07 16:12:37,711][232227] Decorrelating experience for 32 frames... [2023-03-07 16:12:37,732][232431] Decorrelating experience for 32 frames... [2023-03-07 16:12:37,736][232357] Decorrelating experience for 32 frames... [2023-03-07 16:12:37,784][232430] Decorrelating experience for 32 frames... [2023-03-07 16:12:37,873][232429] Decorrelating experience for 32 frames... [2023-03-07 16:12:37,874][232501] Decorrelating experience for 32 frames... [2023-03-07 16:12:37,896][232224] Decorrelating experience for 32 frames... [2023-03-07 16:12:37,922][232425] Decorrelating experience for 32 frames... [2023-03-07 16:12:37,931][232566] Decorrelating experience for 32 frames... [2023-03-07 16:12:37,958][232598] Decorrelating experience for 32 frames... [2023-03-07 16:12:37,984][232391] Decorrelating experience for 32 frames... [2023-03-07 16:12:37,995][232411] Decorrelating experience for 32 frames... [2023-03-07 16:12:38,002][232428] Decorrelating experience for 32 frames... [2023-03-07 16:12:38,007][232229] Decorrelating experience for 32 frames... [2023-03-07 16:12:38,016][232427] Decorrelating experience for 32 frames... [2023-03-07 16:12:38,018][232463] Decorrelating experience for 32 frames... [2023-03-07 16:12:38,024][232355] Decorrelating experience for 32 frames... [2023-03-07 16:12:38,030][232228] Decorrelating experience for 32 frames... [2023-03-07 16:12:38,056][232498] Decorrelating experience for 32 frames... [2023-03-07 16:12:38,063][232426] Decorrelating experience for 32 frames... [2023-03-07 16:12:38,063][232500] Decorrelating experience for 32 frames... [2023-03-07 16:12:38,070][232755] Decorrelating experience for 32 frames... [2023-03-07 16:12:38,152][232225] Decorrelating experience for 32 frames... [2023-03-07 16:12:38,434][232173] Signal inference workers to stop experience collection... [2023-03-07 16:12:38,438][232226] InferenceWorker_p0-w0: stopping experience collection [2023-03-07 16:12:38,517][232565] Decorrelating experience for 32 frames... [2023-03-07 16:12:38,555][232354] Decorrelating experience for 32 frames... [2023-03-07 16:12:38,599][232392] Decorrelating experience for 32 frames... [2023-03-07 16:12:38,679][232692] Decorrelating experience for 32 frames... [2023-03-07 16:12:38,731][232173] Signal inference workers to resume experience collection... [2023-03-07 16:12:38,731][232226] InferenceWorker_p0-w0: resuming experience collection [2023-03-07 16:12:38,741][232496] Decorrelating experience for 32 frames... [2023-03-07 16:12:39,908][232226] Updated weights for policy 0, policy_version 10 (0.0216) [2023-03-07 16:12:40,069][231894] Fps is (10 sec: 2457.6, 60 sec: 2457.6, 300 sec: 2457.6). Total num frames: 12288. Throughput: 0: 753.6. Samples: 3768. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2023-03-07 16:12:40,671][232226] Updated weights for policy 0, policy_version 20 (0.0007) [2023-03-07 16:12:41,500][232226] Updated weights for policy 0, policy_version 30 (0.0007) [2023-03-07 16:12:42,273][232226] Updated weights for policy 0, policy_version 40 (0.0006) [2023-03-07 16:12:43,042][232226] Updated weights for policy 0, policy_version 50 (0.0006) [2023-03-07 16:12:43,836][232226] Updated weights for policy 0, policy_version 60 (0.0006) [2023-03-07 16:12:44,631][232226] Updated weights for policy 0, policy_version 70 (0.0006) [2023-03-07 16:12:45,069][231894] Fps is (10 sec: 7680.0, 60 sec: 7680.0, 300 sec: 7680.0). Total num frames: 76800. Throughput: 0: 4158.2. Samples: 41582. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:12:45,069][231894] Avg episode reward: [(0, '187.355')] [2023-03-07 16:12:45,406][232226] Updated weights for policy 0, policy_version 80 (0.0005) [2023-03-07 16:12:46,180][232226] Updated weights for policy 0, policy_version 90 (0.0006) [2023-03-07 16:12:46,989][232226] Updated weights for policy 0, policy_version 100 (0.0006) [2023-03-07 16:12:47,757][232226] Updated weights for policy 0, policy_version 110 (0.0006) [2023-03-07 16:12:48,432][231894] Heartbeat connected on Batcher_0 [2023-03-07 16:12:48,434][231894] Heartbeat connected on LearnerWorker_p0 [2023-03-07 16:12:48,439][231894] Heartbeat connected on RolloutWorker_w0 [2023-03-07 16:12:48,440][231894] Heartbeat connected on InferenceWorker_p0-w0 [2023-03-07 16:12:48,441][231894] Heartbeat connected on RolloutWorker_w1 [2023-03-07 16:12:48,443][231894] Heartbeat connected on RolloutWorker_w2 [2023-03-07 16:12:48,444][231894] Heartbeat connected on RolloutWorker_w3 [2023-03-07 16:12:48,447][231894] Heartbeat connected on RolloutWorker_w4 [2023-03-07 16:12:48,449][231894] Heartbeat connected on RolloutWorker_w5 [2023-03-07 16:12:48,450][231894] Heartbeat connected on RolloutWorker_w6 [2023-03-07 16:12:48,454][231894] Heartbeat connected on RolloutWorker_w8 [2023-03-07 16:12:48,456][231894] Heartbeat connected on RolloutWorker_w9 [2023-03-07 16:12:48,458][231894] Heartbeat connected on RolloutWorker_w7 [2023-03-07 16:12:48,460][231894] Heartbeat connected on RolloutWorker_w10 [2023-03-07 16:12:48,478][231894] Heartbeat connected on RolloutWorker_w11 [2023-03-07 16:12:48,480][231894] Heartbeat connected on RolloutWorker_w12 [2023-03-07 16:12:48,482][231894] Heartbeat connected on RolloutWorker_w13 [2023-03-07 16:12:48,483][231894] Heartbeat connected on RolloutWorker_w14 [2023-03-07 16:12:48,485][231894] Heartbeat connected on RolloutWorker_w15 [2023-03-07 16:12:48,487][231894] Heartbeat connected on RolloutWorker_w16 [2023-03-07 16:12:48,489][231894] Heartbeat connected on RolloutWorker_w17 [2023-03-07 16:12:48,491][231894] Heartbeat connected on RolloutWorker_w18 [2023-03-07 16:12:48,492][231894] Heartbeat connected on RolloutWorker_w19 [2023-03-07 16:12:48,494][231894] Heartbeat connected on RolloutWorker_w20 [2023-03-07 16:12:48,496][231894] Heartbeat connected on RolloutWorker_w21 [2023-03-07 16:12:48,497][231894] Heartbeat connected on RolloutWorker_w22 [2023-03-07 16:12:48,501][231894] Heartbeat connected on RolloutWorker_w23 [2023-03-07 16:12:48,502][231894] Heartbeat connected on RolloutWorker_w24 [2023-03-07 16:12:48,504][231894] Heartbeat connected on RolloutWorker_w25 [2023-03-07 16:12:48,505][231894] Heartbeat connected on RolloutWorker_w26 [2023-03-07 16:12:48,507][231894] Heartbeat connected on RolloutWorker_w27 [2023-03-07 16:12:48,509][231894] Heartbeat connected on RolloutWorker_w28 [2023-03-07 16:12:48,510][231894] Heartbeat connected on RolloutWorker_w29 [2023-03-07 16:12:48,513][231894] Heartbeat connected on RolloutWorker_w30 [2023-03-07 16:12:48,514][231894] Heartbeat connected on RolloutWorker_w31 [2023-03-07 16:12:48,538][232226] Updated weights for policy 0, policy_version 120 (0.0007) [2023-03-07 16:12:49,341][232226] Updated weights for policy 0, policy_version 130 (0.0006) [2023-03-07 16:12:50,069][231894] Fps is (10 sec: 13004.8, 60 sec: 9489.1, 300 sec: 9489.1). Total num frames: 142336. Throughput: 0: 7990.2. Samples: 119852. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:12:50,069][231894] Avg episode reward: [(0, '189.375')] [2023-03-07 16:12:50,070][232173] Saving new best policy, reward=189.375! [2023-03-07 16:12:50,125][232226] Updated weights for policy 0, policy_version 140 (0.0006) [2023-03-07 16:12:50,905][232226] Updated weights for policy 0, policy_version 150 (0.0006) [2023-03-07 16:12:51,719][232226] Updated weights for policy 0, policy_version 160 (0.0007) [2023-03-07 16:12:52,503][232226] Updated weights for policy 0, policy_version 170 (0.0006) [2023-03-07 16:12:53,281][232226] Updated weights for policy 0, policy_version 180 (0.0006) [2023-03-07 16:12:54,121][232226] Updated weights for policy 0, policy_version 190 (0.0006) [2023-03-07 16:12:54,885][232226] Updated weights for policy 0, policy_version 200 (0.0006) [2023-03-07 16:12:55,069][231894] Fps is (10 sec: 13004.8, 60 sec: 10342.4, 300 sec: 10342.4). Total num frames: 206848. Throughput: 0: 9857.1. Samples: 197142. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:12:55,069][231894] Avg episode reward: [(0, '185.395')] [2023-03-07 16:12:55,672][232226] Updated weights for policy 0, policy_version 210 (0.0006) [2023-03-07 16:12:56,494][232226] Updated weights for policy 0, policy_version 220 (0.0006) [2023-03-07 16:12:57,284][232226] Updated weights for policy 0, policy_version 230 (0.0006) [2023-03-07 16:12:58,071][232226] Updated weights for policy 0, policy_version 240 (0.0006) [2023-03-07 16:12:58,878][232226] Updated weights for policy 0, policy_version 250 (0.0006) [2023-03-07 16:12:59,659][232226] Updated weights for policy 0, policy_version 260 (0.0006) [2023-03-07 16:13:00,071][231894] Fps is (10 sec: 12899.7, 60 sec: 10853.5, 300 sec: 10853.5). Total num frames: 271360. Throughput: 0: 9431.8. Samples: 235815. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 16:13:00,072][231894] Avg episode reward: [(0, '190.203')] [2023-03-07 16:13:00,073][232173] Saving new best policy, reward=190.203! [2023-03-07 16:13:00,457][232226] Updated weights for policy 0, policy_version 270 (0.0006) [2023-03-07 16:13:01,247][232226] Updated weights for policy 0, policy_version 280 (0.0006) [2023-03-07 16:13:02,055][232226] Updated weights for policy 0, policy_version 290 (0.0007) [2023-03-07 16:13:02,825][232226] Updated weights for policy 0, policy_version 300 (0.0006) [2023-03-07 16:13:03,612][232226] Updated weights for policy 0, policy_version 310 (0.0006) [2023-03-07 16:13:04,398][232226] Updated weights for policy 0, policy_version 320 (0.0006) [2023-03-07 16:13:05,069][231894] Fps is (10 sec: 12902.3, 60 sec: 11195.7, 300 sec: 11195.7). Total num frames: 335872. Throughput: 0: 10450.7. Samples: 313522. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:13:05,069][231894] Avg episode reward: [(0, '190.468')] [2023-03-07 16:13:05,073][232173] Saving new best policy, reward=190.468! [2023-03-07 16:13:05,173][232226] Updated weights for policy 0, policy_version 330 (0.0006) [2023-03-07 16:13:05,982][232226] Updated weights for policy 0, policy_version 340 (0.0006) [2023-03-07 16:13:06,770][232226] Updated weights for policy 0, policy_version 350 (0.0006) [2023-03-07 16:13:07,545][232226] Updated weights for policy 0, policy_version 360 (0.0007) [2023-03-07 16:13:08,327][232226] Updated weights for policy 0, policy_version 370 (0.0006) [2023-03-07 16:13:09,136][232226] Updated weights for policy 0, policy_version 380 (0.0006) [2023-03-07 16:13:09,922][232226] Updated weights for policy 0, policy_version 390 (0.0006) [2023-03-07 16:13:10,069][231894] Fps is (10 sec: 12905.0, 60 sec: 11439.5, 300 sec: 11439.5). Total num frames: 400384. Throughput: 0: 11178.4. Samples: 391245. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:13:10,069][231894] Avg episode reward: [(0, '194.024')] [2023-03-07 16:13:10,081][232173] Saving new best policy, reward=194.024! [2023-03-07 16:13:10,705][232226] Updated weights for policy 0, policy_version 400 (0.0006) [2023-03-07 16:13:11,535][232226] Updated weights for policy 0, policy_version 410 (0.0006) [2023-03-07 16:13:12,359][232226] Updated weights for policy 0, policy_version 420 (0.0006) [2023-03-07 16:13:13,142][232226] Updated weights for policy 0, policy_version 430 (0.0006) [2023-03-07 16:13:13,960][232226] Updated weights for policy 0, policy_version 440 (0.0007) [2023-03-07 16:13:14,750][232226] Updated weights for policy 0, policy_version 450 (0.0006) [2023-03-07 16:13:15,069][231894] Fps is (10 sec: 12902.4, 60 sec: 11622.4, 300 sec: 11622.4). Total num frames: 464896. Throughput: 0: 10738.4. Samples: 429537. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:13:15,070][231894] Avg episode reward: [(0, '197.088')] [2023-03-07 16:13:15,073][232173] Saving new best policy, reward=197.088! [2023-03-07 16:13:15,530][232226] Updated weights for policy 0, policy_version 460 (0.0006) [2023-03-07 16:13:16,342][232226] Updated weights for policy 0, policy_version 470 (0.0006) [2023-03-07 16:13:17,142][232226] Updated weights for policy 0, policy_version 480 (0.0006) [2023-03-07 16:13:17,920][232226] Updated weights for policy 0, policy_version 490 (0.0007) [2023-03-07 16:13:18,740][232226] Updated weights for policy 0, policy_version 500 (0.0006) [2023-03-07 16:13:19,517][232226] Updated weights for policy 0, policy_version 510 (0.0006) [2023-03-07 16:13:20,069][231894] Fps is (10 sec: 12902.5, 60 sec: 11764.6, 300 sec: 11764.6). Total num frames: 529408. Throughput: 0: 11263.5. Samples: 506857. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:13:20,069][231894] Avg episode reward: [(0, '192.785')] [2023-03-07 16:13:20,297][232226] Updated weights for policy 0, policy_version 520 (0.0006) [2023-03-07 16:13:21,111][232226] Updated weights for policy 0, policy_version 530 (0.0006) [2023-03-07 16:13:21,915][232226] Updated weights for policy 0, policy_version 540 (0.0007) [2023-03-07 16:13:22,697][232226] Updated weights for policy 0, policy_version 550 (0.0006) [2023-03-07 16:13:23,502][232226] Updated weights for policy 0, policy_version 560 (0.0006) [2023-03-07 16:13:24,283][232226] Updated weights for policy 0, policy_version 570 (0.0006) [2023-03-07 16:13:25,069][231894] Fps is (10 sec: 12800.0, 60 sec: 11857.9, 300 sec: 11857.9). Total num frames: 592896. Throughput: 0: 12893.0. Samples: 583952. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:13:25,069][231894] Avg episode reward: [(0, '197.419')] [2023-03-07 16:13:25,078][232173] Saving new best policy, reward=197.419! [2023-03-07 16:13:25,080][232226] Updated weights for policy 0, policy_version 580 (0.0006) [2023-03-07 16:13:25,890][232226] Updated weights for policy 0, policy_version 590 (0.0007) [2023-03-07 16:13:26,703][232226] Updated weights for policy 0, policy_version 600 (0.0006) [2023-03-07 16:13:27,498][232226] Updated weights for policy 0, policy_version 610 (0.0006) [2023-03-07 16:13:28,298][232226] Updated weights for policy 0, policy_version 620 (0.0006) [2023-03-07 16:13:29,101][232226] Updated weights for policy 0, policy_version 630 (0.0007) [2023-03-07 16:13:29,900][232226] Updated weights for policy 0, policy_version 640 (0.0007) [2023-03-07 16:13:30,069][231894] Fps is (10 sec: 12800.0, 60 sec: 11952.9, 300 sec: 11952.9). Total num frames: 657408. Throughput: 0: 12905.5. Samples: 622328. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:13:30,069][231894] Avg episode reward: [(0, '204.578')] [2023-03-07 16:13:30,070][232173] Saving new best policy, reward=204.578! [2023-03-07 16:13:30,699][232226] Updated weights for policy 0, policy_version 650 (0.0007) [2023-03-07 16:13:31,505][232226] Updated weights for policy 0, policy_version 660 (0.0006) [2023-03-07 16:13:32,286][232226] Updated weights for policy 0, policy_version 670 (0.0006) [2023-03-07 16:13:33,070][232226] Updated weights for policy 0, policy_version 680 (0.0006) [2023-03-07 16:13:33,884][232226] Updated weights for policy 0, policy_version 690 (0.0006) [2023-03-07 16:13:34,669][232226] Updated weights for policy 0, policy_version 700 (0.0007) [2023-03-07 16:13:35,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12032.0, 300 sec: 12032.0). Total num frames: 721920. Throughput: 0: 12876.2. Samples: 699280. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:13:35,069][231894] Avg episode reward: [(0, '198.072')] [2023-03-07 16:13:35,448][232226] Updated weights for policy 0, policy_version 710 (0.0007) [2023-03-07 16:13:36,266][232226] Updated weights for policy 0, policy_version 720 (0.0006) [2023-03-07 16:13:37,063][232226] Updated weights for policy 0, policy_version 730 (0.0006) [2023-03-07 16:13:37,856][232226] Updated weights for policy 0, policy_version 740 (0.0007) [2023-03-07 16:13:38,667][232226] Updated weights for policy 0, policy_version 750 (0.0007) [2023-03-07 16:13:39,453][232226] Updated weights for policy 0, policy_version 760 (0.0006) [2023-03-07 16:13:40,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12885.3, 300 sec: 12083.2). Total num frames: 785408. Throughput: 0: 12869.5. Samples: 776271. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:13:40,069][231894] Avg episode reward: [(0, '205.475')] [2023-03-07 16:13:40,070][232173] Saving new best policy, reward=205.475! [2023-03-07 16:13:40,253][232226] Updated weights for policy 0, policy_version 770 (0.0006) [2023-03-07 16:13:41,056][232226] Updated weights for policy 0, policy_version 780 (0.0008) [2023-03-07 16:13:41,859][232226] Updated weights for policy 0, policy_version 790 (0.0007) [2023-03-07 16:13:42,646][232226] Updated weights for policy 0, policy_version 800 (0.0006) [2023-03-07 16:13:43,436][232226] Updated weights for policy 0, policy_version 810 (0.0006) [2023-03-07 16:13:44,241][232226] Updated weights for policy 0, policy_version 820 (0.0006) [2023-03-07 16:13:45,023][232226] Updated weights for policy 0, policy_version 830 (0.0006) [2023-03-07 16:13:45,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12885.3, 300 sec: 12141.7). Total num frames: 849920. Throughput: 0: 12868.5. Samples: 814873. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:13:45,069][231894] Avg episode reward: [(0, '201.296')] [2023-03-07 16:13:45,839][232226] Updated weights for policy 0, policy_version 840 (0.0007) [2023-03-07 16:13:46,619][232226] Updated weights for policy 0, policy_version 850 (0.0007) [2023-03-07 16:13:47,411][232226] Updated weights for policy 0, policy_version 860 (0.0006) [2023-03-07 16:13:48,210][232226] Updated weights for policy 0, policy_version 870 (0.0007) [2023-03-07 16:13:49,009][232226] Updated weights for policy 0, policy_version 880 (0.0007) [2023-03-07 16:13:49,785][232226] Updated weights for policy 0, policy_version 890 (0.0006) [2023-03-07 16:13:50,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.2, 300 sec: 12192.4). Total num frames: 914432. Throughput: 0: 12859.7. Samples: 892208. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:13:50,069][231894] Avg episode reward: [(0, '204.357')] [2023-03-07 16:13:50,578][232226] Updated weights for policy 0, policy_version 900 (0.0006) [2023-03-07 16:13:51,371][232226] Updated weights for policy 0, policy_version 910 (0.0006) [2023-03-07 16:13:52,164][232226] Updated weights for policy 0, policy_version 920 (0.0006) [2023-03-07 16:13:52,964][232226] Updated weights for policy 0, policy_version 930 (0.0006) [2023-03-07 16:13:53,772][232226] Updated weights for policy 0, policy_version 940 (0.0006) [2023-03-07 16:13:54,551][232226] Updated weights for policy 0, policy_version 950 (0.0006) [2023-03-07 16:13:55,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12236.8). Total num frames: 978944. Throughput: 0: 12855.5. Samples: 969743. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:13:55,070][231894] Avg episode reward: [(0, '193.087')] [2023-03-07 16:13:55,348][232226] Updated weights for policy 0, policy_version 960 (0.0006) [2023-03-07 16:13:56,158][232226] Updated weights for policy 0, policy_version 970 (0.0007) [2023-03-07 16:13:56,946][232226] Updated weights for policy 0, policy_version 980 (0.0006) [2023-03-07 16:13:57,736][232226] Updated weights for policy 0, policy_version 990 (0.0006) [2023-03-07 16:13:58,536][232226] Updated weights for policy 0, policy_version 1000 (0.0008) [2023-03-07 16:13:59,317][232226] Updated weights for policy 0, policy_version 1010 (0.0006) [2023-03-07 16:14:00,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12868.7, 300 sec: 12276.0). Total num frames: 1043456. Throughput: 0: 12860.3. Samples: 1008249. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:14:00,069][231894] Avg episode reward: [(0, '190.432')] [2023-03-07 16:14:00,104][232226] Updated weights for policy 0, policy_version 1020 (0.0006) [2023-03-07 16:14:00,914][232226] Updated weights for policy 0, policy_version 1030 (0.0007) [2023-03-07 16:14:01,723][232226] Updated weights for policy 0, policy_version 1040 (0.0007) [2023-03-07 16:14:02,521][232226] Updated weights for policy 0, policy_version 1050 (0.0006) [2023-03-07 16:14:03,329][232226] Updated weights for policy 0, policy_version 1060 (0.0008) [2023-03-07 16:14:04,124][232226] Updated weights for policy 0, policy_version 1070 (0.0006) [2023-03-07 16:14:04,921][232226] Updated weights for policy 0, policy_version 1080 (0.0006) [2023-03-07 16:14:05,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12851.2, 300 sec: 12299.4). Total num frames: 1106944. Throughput: 0: 12853.9. Samples: 1085282. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:14:05,069][231894] Avg episode reward: [(0, '200.787')] [2023-03-07 16:14:05,727][232226] Updated weights for policy 0, policy_version 1090 (0.0006) [2023-03-07 16:14:06,522][232226] Updated weights for policy 0, policy_version 1100 (0.0006) [2023-03-07 16:14:07,306][232226] Updated weights for policy 0, policy_version 1110 (0.0006) [2023-03-07 16:14:08,116][232226] Updated weights for policy 0, policy_version 1120 (0.0006) [2023-03-07 16:14:08,908][232226] Updated weights for policy 0, policy_version 1130 (0.0005) [2023-03-07 16:14:09,685][232226] Updated weights for policy 0, policy_version 1140 (0.0006) [2023-03-07 16:14:10,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12331.1). Total num frames: 1171456. Throughput: 0: 12854.8. Samples: 1162418. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:14:10,069][231894] Avg episode reward: [(0, '200.066')] [2023-03-07 16:14:10,470][232226] Updated weights for policy 0, policy_version 1150 (0.0006) [2023-03-07 16:14:11,275][232226] Updated weights for policy 0, policy_version 1160 (0.0006) [2023-03-07 16:14:12,057][232226] Updated weights for policy 0, policy_version 1170 (0.0006) [2023-03-07 16:14:12,866][232226] Updated weights for policy 0, policy_version 1180 (0.0006) [2023-03-07 16:14:13,654][232226] Updated weights for policy 0, policy_version 1190 (0.0006) [2023-03-07 16:14:14,455][232226] Updated weights for policy 0, policy_version 1200 (0.0007) [2023-03-07 16:14:15,069][231894] Fps is (10 sec: 13004.8, 60 sec: 12868.3, 300 sec: 12369.9). Total num frames: 1236992. Throughput: 0: 12864.5. Samples: 1201230. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:14:15,069][231894] Avg episode reward: [(0, '194.772')] [2023-03-07 16:14:15,229][232226] Updated weights for policy 0, policy_version 1210 (0.0006) [2023-03-07 16:14:16,030][232226] Updated weights for policy 0, policy_version 1220 (0.0006) [2023-03-07 16:14:16,827][232226] Updated weights for policy 0, policy_version 1230 (0.0006) [2023-03-07 16:14:17,599][232226] Updated weights for policy 0, policy_version 1240 (0.0006) [2023-03-07 16:14:18,402][232226] Updated weights for policy 0, policy_version 1250 (0.0007) [2023-03-07 16:14:19,189][232226] Updated weights for policy 0, policy_version 1260 (0.0006) [2023-03-07 16:14:19,982][232226] Updated weights for policy 0, policy_version 1270 (0.0007) [2023-03-07 16:14:20,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12851.2, 300 sec: 12385.5). Total num frames: 1300480. Throughput: 0: 12876.5. Samples: 1278721. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 16:14:20,069][231894] Avg episode reward: [(0, '197.150')] [2023-03-07 16:14:20,782][232226] Updated weights for policy 0, policy_version 1280 (0.0006) [2023-03-07 16:14:21,586][232226] Updated weights for policy 0, policy_version 1290 (0.0006) [2023-03-07 16:14:22,388][232226] Updated weights for policy 0, policy_version 1300 (0.0007) [2023-03-07 16:14:23,179][232226] Updated weights for policy 0, policy_version 1310 (0.0006) [2023-03-07 16:14:23,977][232226] Updated weights for policy 0, policy_version 1320 (0.0007) [2023-03-07 16:14:24,743][232226] Updated weights for policy 0, policy_version 1330 (0.0006) [2023-03-07 16:14:25,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12418.3). Total num frames: 1366016. Throughput: 0: 12886.2. Samples: 1356149. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:14:25,069][231894] Avg episode reward: [(0, '193.876')] [2023-03-07 16:14:25,073][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000001334_1366016.pth... [2023-03-07 16:14:25,545][232226] Updated weights for policy 0, policy_version 1340 (0.0006) [2023-03-07 16:14:26,339][232226] Updated weights for policy 0, policy_version 1350 (0.0007) [2023-03-07 16:14:27,138][232226] Updated weights for policy 0, policy_version 1360 (0.0006) [2023-03-07 16:14:27,942][232226] Updated weights for policy 0, policy_version 1370 (0.0007) [2023-03-07 16:14:28,737][232226] Updated weights for policy 0, policy_version 1380 (0.0007) [2023-03-07 16:14:29,525][232226] Updated weights for policy 0, policy_version 1390 (0.0007) [2023-03-07 16:14:30,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12430.5). Total num frames: 1429504. Throughput: 0: 12891.1. Samples: 1394973. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:14:30,069][231894] Avg episode reward: [(0, '199.855')] [2023-03-07 16:14:30,323][232226] Updated weights for policy 0, policy_version 1400 (0.0007) [2023-03-07 16:14:31,123][232226] Updated weights for policy 0, policy_version 1410 (0.0005) [2023-03-07 16:14:31,911][232226] Updated weights for policy 0, policy_version 1420 (0.0006) [2023-03-07 16:14:32,715][232226] Updated weights for policy 0, policy_version 1430 (0.0007) [2023-03-07 16:14:33,505][232226] Updated weights for policy 0, policy_version 1440 (0.0006) [2023-03-07 16:14:34,296][232226] Updated weights for policy 0, policy_version 1450 (0.0006) [2023-03-07 16:14:35,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12868.3, 300 sec: 12450.1). Total num frames: 1494016. Throughput: 0: 12886.9. Samples: 1472116. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:14:35,069][231894] Avg episode reward: [(0, '194.707')] [2023-03-07 16:14:35,088][232226] Updated weights for policy 0, policy_version 1460 (0.0006) [2023-03-07 16:14:35,881][232226] Updated weights for policy 0, policy_version 1470 (0.0007) [2023-03-07 16:14:36,677][232226] Updated weights for policy 0, policy_version 1480 (0.0006) [2023-03-07 16:14:37,480][232226] Updated weights for policy 0, policy_version 1490 (0.0006) [2023-03-07 16:14:38,286][232226] Updated weights for policy 0, policy_version 1500 (0.0005) [2023-03-07 16:14:39,061][232226] Updated weights for policy 0, policy_version 1510 (0.0005) [2023-03-07 16:14:39,875][232226] Updated weights for policy 0, policy_version 1520 (0.0007) [2023-03-07 16:14:40,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12468.2). Total num frames: 1558528. Throughput: 0: 12881.8. Samples: 1549424. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:14:40,070][231894] Avg episode reward: [(0, '197.322')] [2023-03-07 16:14:40,658][232226] Updated weights for policy 0, policy_version 1530 (0.0007) [2023-03-07 16:14:41,443][232226] Updated weights for policy 0, policy_version 1540 (0.0006) [2023-03-07 16:14:42,241][232226] Updated weights for policy 0, policy_version 1550 (0.0007) [2023-03-07 16:14:43,038][232226] Updated weights for policy 0, policy_version 1560 (0.0006) [2023-03-07 16:14:43,833][232226] Updated weights for policy 0, policy_version 1570 (0.0007) [2023-03-07 16:14:44,647][232226] Updated weights for policy 0, policy_version 1580 (0.0007) [2023-03-07 16:14:45,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12484.9). Total num frames: 1623040. Throughput: 0: 12886.0. Samples: 1588117. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 16:14:45,069][231894] Avg episode reward: [(0, '205.860')] [2023-03-07 16:14:45,072][232173] Saving new best policy, reward=205.860! [2023-03-07 16:14:45,445][232226] Updated weights for policy 0, policy_version 1590 (0.0006) [2023-03-07 16:14:46,218][232226] Updated weights for policy 0, policy_version 1600 (0.0006) [2023-03-07 16:14:46,998][232226] Updated weights for policy 0, policy_version 1610 (0.0006) [2023-03-07 16:14:47,808][232226] Updated weights for policy 0, policy_version 1620 (0.0007) [2023-03-07 16:14:48,598][232226] Updated weights for policy 0, policy_version 1630 (0.0006) [2023-03-07 16:14:49,384][232226] Updated weights for policy 0, policy_version 1640 (0.0006) [2023-03-07 16:14:50,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12885.4, 300 sec: 12500.4). Total num frames: 1687552. Throughput: 0: 12890.1. Samples: 1665334. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:14:50,069][231894] Avg episode reward: [(0, '206.038')] [2023-03-07 16:14:50,070][232173] Saving new best policy, reward=206.038! [2023-03-07 16:14:50,170][232226] Updated weights for policy 0, policy_version 1650 (0.0006) [2023-03-07 16:14:50,965][232226] Updated weights for policy 0, policy_version 1660 (0.0006) [2023-03-07 16:14:51,772][232226] Updated weights for policy 0, policy_version 1670 (0.0006) [2023-03-07 16:14:52,555][232226] Updated weights for policy 0, policy_version 1680 (0.0006) [2023-03-07 16:14:53,366][232226] Updated weights for policy 0, policy_version 1690 (0.0007) [2023-03-07 16:14:54,155][232226] Updated weights for policy 0, policy_version 1700 (0.0006) [2023-03-07 16:14:54,958][232226] Updated weights for policy 0, policy_version 1710 (0.0006) [2023-03-07 16:14:55,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12514.7). Total num frames: 1752064. Throughput: 0: 12897.1. Samples: 1742787. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:14:55,069][231894] Avg episode reward: [(0, '206.550')] [2023-03-07 16:14:55,072][232173] Saving new best policy, reward=206.550! [2023-03-07 16:14:55,755][232226] Updated weights for policy 0, policy_version 1720 (0.0006) [2023-03-07 16:14:56,551][232226] Updated weights for policy 0, policy_version 1730 (0.0006) [2023-03-07 16:14:57,343][232226] Updated weights for policy 0, policy_version 1740 (0.0006) [2023-03-07 16:14:58,141][232226] Updated weights for policy 0, policy_version 1750 (0.0007) [2023-03-07 16:14:58,922][232226] Updated weights for policy 0, policy_version 1760 (0.0006) [2023-03-07 16:14:59,705][232226] Updated weights for policy 0, policy_version 1770 (0.0006) [2023-03-07 16:15:00,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12528.1). Total num frames: 1816576. Throughput: 0: 12894.1. Samples: 1781462. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:15:00,069][231894] Avg episode reward: [(0, '205.197')] [2023-03-07 16:15:00,497][232226] Updated weights for policy 0, policy_version 1780 (0.0006) [2023-03-07 16:15:01,307][232226] Updated weights for policy 0, policy_version 1790 (0.0006) [2023-03-07 16:15:02,109][232226] Updated weights for policy 0, policy_version 1800 (0.0007) [2023-03-07 16:15:02,902][232226] Updated weights for policy 0, policy_version 1810 (0.0007) [2023-03-07 16:15:03,701][232226] Updated weights for policy 0, policy_version 1820 (0.0007) [2023-03-07 16:15:04,498][232226] Updated weights for policy 0, policy_version 1830 (0.0006) [2023-03-07 16:15:05,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12540.6). Total num frames: 1881088. Throughput: 0: 12887.9. Samples: 1858676. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:15:05,069][231894] Avg episode reward: [(0, '203.233')] [2023-03-07 16:15:05,289][232226] Updated weights for policy 0, policy_version 1840 (0.0007) [2023-03-07 16:15:06,090][232226] Updated weights for policy 0, policy_version 1850 (0.0007) [2023-03-07 16:15:06,881][232226] Updated weights for policy 0, policy_version 1860 (0.0006) [2023-03-07 16:15:07,669][232226] Updated weights for policy 0, policy_version 1870 (0.0007) [2023-03-07 16:15:08,490][232226] Updated weights for policy 0, policy_version 1880 (0.0006) [2023-03-07 16:15:09,277][232226] Updated weights for policy 0, policy_version 1890 (0.0006) [2023-03-07 16:15:10,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12885.4, 300 sec: 12545.7). Total num frames: 1944576. Throughput: 0: 12880.9. Samples: 1935787. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:15:10,073][232226] Updated weights for policy 0, policy_version 1900 (0.0006) [2023-03-07 16:15:10,080][231894] Avg episode reward: [(0, '193.474')] [2023-03-07 16:15:10,884][232226] Updated weights for policy 0, policy_version 1910 (0.0006) [2023-03-07 16:15:11,674][232226] Updated weights for policy 0, policy_version 1920 (0.0007) [2023-03-07 16:15:12,463][232226] Updated weights for policy 0, policy_version 1930 (0.0007) [2023-03-07 16:15:13,265][232226] Updated weights for policy 0, policy_version 1940 (0.0007) [2023-03-07 16:15:14,051][232226] Updated weights for policy 0, policy_version 1950 (0.0006) [2023-03-07 16:15:14,863][232226] Updated weights for policy 0, policy_version 1960 (0.0007) [2023-03-07 16:15:15,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12556.8). Total num frames: 2009088. Throughput: 0: 12874.0. Samples: 1974304. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:15:15,080][231894] Avg episode reward: [(0, '199.511')] [2023-03-07 16:15:15,658][232226] Updated weights for policy 0, policy_version 1970 (0.0006) [2023-03-07 16:15:16,438][232226] Updated weights for policy 0, policy_version 1980 (0.0007) [2023-03-07 16:15:17,244][232226] Updated weights for policy 0, policy_version 1990 (0.0007) [2023-03-07 16:15:18,046][232226] Updated weights for policy 0, policy_version 2000 (0.0007) [2023-03-07 16:15:18,834][232226] Updated weights for policy 0, policy_version 2010 (0.0007) [2023-03-07 16:15:19,638][232226] Updated weights for policy 0, policy_version 2020 (0.0006) [2023-03-07 16:15:20,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12567.3). Total num frames: 2073600. Throughput: 0: 12874.0. Samples: 2051447. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:15:20,080][231894] Avg episode reward: [(0, '206.978')] [2023-03-07 16:15:20,081][232173] Saving new best policy, reward=206.978! [2023-03-07 16:15:20,426][232226] Updated weights for policy 0, policy_version 2030 (0.0007) [2023-03-07 16:15:21,224][232226] Updated weights for policy 0, policy_version 2040 (0.0006) [2023-03-07 16:15:22,015][232226] Updated weights for policy 0, policy_version 2050 (0.0007) [2023-03-07 16:15:22,820][232226] Updated weights for policy 0, policy_version 2060 (0.0006) [2023-03-07 16:15:23,639][232226] Updated weights for policy 0, policy_version 2070 (0.0006) [2023-03-07 16:15:24,423][232226] Updated weights for policy 0, policy_version 2080 (0.0006) [2023-03-07 16:15:25,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12577.1). Total num frames: 2138112. Throughput: 0: 12869.8. Samples: 2128561. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:15:25,080][231894] Avg episode reward: [(0, '202.542')] [2023-03-07 16:15:25,237][232226] Updated weights for policy 0, policy_version 2090 (0.0006) [2023-03-07 16:15:26,024][232226] Updated weights for policy 0, policy_version 2100 (0.0007) [2023-03-07 16:15:26,816][232226] Updated weights for policy 0, policy_version 2110 (0.0005) [2023-03-07 16:15:27,610][232226] Updated weights for policy 0, policy_version 2120 (0.0006) [2023-03-07 16:15:28,421][232226] Updated weights for policy 0, policy_version 2130 (0.0007) [2023-03-07 16:15:29,194][232226] Updated weights for policy 0, policy_version 2140 (0.0006) [2023-03-07 16:15:29,989][232226] Updated weights for policy 0, policy_version 2150 (0.0006) [2023-03-07 16:15:30,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12868.3, 300 sec: 12580.6). Total num frames: 2201600. Throughput: 0: 12865.5. Samples: 2167063. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:15:30,069][231894] Avg episode reward: [(0, '194.236')] [2023-03-07 16:15:30,790][232226] Updated weights for policy 0, policy_version 2160 (0.0006) [2023-03-07 16:15:31,570][232226] Updated weights for policy 0, policy_version 2170 (0.0006) [2023-03-07 16:15:32,386][232226] Updated weights for policy 0, policy_version 2180 (0.0007) [2023-03-07 16:15:33,198][232226] Updated weights for policy 0, policy_version 2190 (0.0006) [2023-03-07 16:15:33,976][232226] Updated weights for policy 0, policy_version 2200 (0.0006) [2023-03-07 16:15:34,778][232226] Updated weights for policy 0, policy_version 2210 (0.0006) [2023-03-07 16:15:35,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12868.3, 300 sec: 12589.5). Total num frames: 2266112. Throughput: 0: 12860.0. Samples: 2244033. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:15:35,069][231894] Avg episode reward: [(0, '201.207')] [2023-03-07 16:15:35,597][232226] Updated weights for policy 0, policy_version 2220 (0.0007) [2023-03-07 16:15:36,402][232226] Updated weights for policy 0, policy_version 2230 (0.0006) [2023-03-07 16:15:37,174][232226] Updated weights for policy 0, policy_version 2240 (0.0007) [2023-03-07 16:15:37,982][232226] Updated weights for policy 0, policy_version 2250 (0.0007) [2023-03-07 16:15:38,761][232226] Updated weights for policy 0, policy_version 2260 (0.0007) [2023-03-07 16:15:39,566][232226] Updated weights for policy 0, policy_version 2270 (0.0006) [2023-03-07 16:15:40,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12598.0). Total num frames: 2330624. Throughput: 0: 12855.9. Samples: 2321300. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:15:40,069][231894] Avg episode reward: [(0, '195.077')] [2023-03-07 16:15:40,366][232226] Updated weights for policy 0, policy_version 2280 (0.0006) [2023-03-07 16:15:41,153][232226] Updated weights for policy 0, policy_version 2290 (0.0007) [2023-03-07 16:15:41,957][232226] Updated weights for policy 0, policy_version 2300 (0.0006) [2023-03-07 16:15:42,754][232226] Updated weights for policy 0, policy_version 2310 (0.0006) [2023-03-07 16:15:43,551][232226] Updated weights for policy 0, policy_version 2320 (0.0006) [2023-03-07 16:15:44,371][232226] Updated weights for policy 0, policy_version 2330 (0.0006) [2023-03-07 16:15:45,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12600.6). Total num frames: 2394112. Throughput: 0: 12849.9. Samples: 2359706. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:15:45,069][231894] Avg episode reward: [(0, '201.504')] [2023-03-07 16:15:45,166][232226] Updated weights for policy 0, policy_version 2340 (0.0006) [2023-03-07 16:15:45,974][232226] Updated weights for policy 0, policy_version 2350 (0.0006) [2023-03-07 16:15:46,751][232226] Updated weights for policy 0, policy_version 2360 (0.0006) [2023-03-07 16:15:47,537][232226] Updated weights for policy 0, policy_version 2370 (0.0006) [2023-03-07 16:15:48,337][232226] Updated weights for policy 0, policy_version 2380 (0.0006) [2023-03-07 16:15:49,133][232226] Updated weights for policy 0, policy_version 2390 (0.0007) [2023-03-07 16:15:49,913][232226] Updated weights for policy 0, policy_version 2400 (0.0006) [2023-03-07 16:15:50,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12613.6). Total num frames: 2459648. Throughput: 0: 12849.5. Samples: 2436904. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:15:50,069][231894] Avg episode reward: [(0, '198.646')] [2023-03-07 16:15:50,704][232226] Updated weights for policy 0, policy_version 2410 (0.0006) [2023-03-07 16:15:51,502][232226] Updated weights for policy 0, policy_version 2420 (0.0006) [2023-03-07 16:15:52,291][232226] Updated weights for policy 0, policy_version 2430 (0.0006) [2023-03-07 16:15:53,109][232226] Updated weights for policy 0, policy_version 2440 (0.0006) [2023-03-07 16:15:53,900][232226] Updated weights for policy 0, policy_version 2450 (0.0006) [2023-03-07 16:15:54,689][232226] Updated weights for policy 0, policy_version 2460 (0.0006) [2023-03-07 16:15:55,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12851.2, 300 sec: 12615.7). Total num frames: 2523136. Throughput: 0: 12854.4. Samples: 2514235. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:15:55,069][231894] Avg episode reward: [(0, '197.206')] [2023-03-07 16:15:55,467][232226] Updated weights for policy 0, policy_version 2470 (0.0006) [2023-03-07 16:15:56,283][232226] Updated weights for policy 0, policy_version 2480 (0.0006) [2023-03-07 16:15:57,070][232226] Updated weights for policy 0, policy_version 2490 (0.0006) [2023-03-07 16:15:57,859][232226] Updated weights for policy 0, policy_version 2500 (0.0007) [2023-03-07 16:15:58,640][232226] Updated weights for policy 0, policy_version 2510 (0.0007) [2023-03-07 16:15:59,436][232226] Updated weights for policy 0, policy_version 2520 (0.0006) [2023-03-07 16:16:00,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12627.7). Total num frames: 2588672. Throughput: 0: 12858.8. Samples: 2552949. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:16:00,069][231894] Avg episode reward: [(0, '202.739')] [2023-03-07 16:16:00,223][232226] Updated weights for policy 0, policy_version 2530 (0.0007) [2023-03-07 16:16:01,012][232226] Updated weights for policy 0, policy_version 2540 (0.0006) [2023-03-07 16:16:01,805][232226] Updated weights for policy 0, policy_version 2550 (0.0006) [2023-03-07 16:16:02,606][232226] Updated weights for policy 0, policy_version 2560 (0.0006) [2023-03-07 16:16:03,401][232226] Updated weights for policy 0, policy_version 2570 (0.0006) [2023-03-07 16:16:04,198][232226] Updated weights for policy 0, policy_version 2580 (0.0006) [2023-03-07 16:16:04,986][232226] Updated weights for policy 0, policy_version 2590 (0.0006) [2023-03-07 16:16:05,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12851.2, 300 sec: 12629.3). Total num frames: 2652160. Throughput: 0: 12868.2. Samples: 2630519. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:16:05,070][231894] Avg episode reward: [(0, '198.447')] [2023-03-07 16:16:05,806][232226] Updated weights for policy 0, policy_version 2600 (0.0007) [2023-03-07 16:16:06,589][232226] Updated weights for policy 0, policy_version 2610 (0.0006) [2023-03-07 16:16:07,368][232226] Updated weights for policy 0, policy_version 2620 (0.0006) [2023-03-07 16:16:08,178][232226] Updated weights for policy 0, policy_version 2630 (0.0006) [2023-03-07 16:16:08,954][232226] Updated weights for policy 0, policy_version 2640 (0.0006) [2023-03-07 16:16:09,747][232226] Updated weights for policy 0, policy_version 2650 (0.0006) [2023-03-07 16:16:10,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12640.5). Total num frames: 2717696. Throughput: 0: 12873.0. Samples: 2707848. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:16:10,070][231894] Avg episode reward: [(0, '197.807')] [2023-03-07 16:16:10,566][232226] Updated weights for policy 0, policy_version 2660 (0.0006) [2023-03-07 16:16:11,333][232226] Updated weights for policy 0, policy_version 2670 (0.0006) [2023-03-07 16:16:12,162][232226] Updated weights for policy 0, policy_version 2680 (0.0006) [2023-03-07 16:16:12,959][232226] Updated weights for policy 0, policy_version 2690 (0.0006) [2023-03-07 16:16:13,722][232226] Updated weights for policy 0, policy_version 2700 (0.0006) [2023-03-07 16:16:14,541][232226] Updated weights for policy 0, policy_version 2710 (0.0006) [2023-03-07 16:16:15,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12868.3, 300 sec: 12641.7). Total num frames: 2781184. Throughput: 0: 12872.0. Samples: 2746304. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:16:15,069][231894] Avg episode reward: [(0, '205.005')] [2023-03-07 16:16:15,327][232226] Updated weights for policy 0, policy_version 2720 (0.0006) [2023-03-07 16:16:16,137][232226] Updated weights for policy 0, policy_version 2730 (0.0006) [2023-03-07 16:16:16,936][232226] Updated weights for policy 0, policy_version 2740 (0.0006) [2023-03-07 16:16:17,725][232226] Updated weights for policy 0, policy_version 2750 (0.0007) [2023-03-07 16:16:18,519][232226] Updated weights for policy 0, policy_version 2760 (0.0006) [2023-03-07 16:16:19,328][232226] Updated weights for policy 0, policy_version 2770 (0.0006) [2023-03-07 16:16:20,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12647.5). Total num frames: 2845696. Throughput: 0: 12880.1. Samples: 2823635. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:16:20,069][231894] Avg episode reward: [(0, '204.076')] [2023-03-07 16:16:20,119][232226] Updated weights for policy 0, policy_version 2780 (0.0007) [2023-03-07 16:16:20,904][232226] Updated weights for policy 0, policy_version 2790 (0.0007) [2023-03-07 16:16:21,694][232226] Updated weights for policy 0, policy_version 2800 (0.0006) [2023-03-07 16:16:22,499][232226] Updated weights for policy 0, policy_version 2810 (0.0006) [2023-03-07 16:16:23,286][232226] Updated weights for policy 0, policy_version 2820 (0.0006) [2023-03-07 16:16:24,095][232226] Updated weights for policy 0, policy_version 2830 (0.0006) [2023-03-07 16:16:24,902][232226] Updated weights for policy 0, policy_version 2840 (0.0007) [2023-03-07 16:16:25,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12653.1). Total num frames: 2910208. Throughput: 0: 12873.8. Samples: 2900624. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:16:25,069][231894] Avg episode reward: [(0, '204.533')] [2023-03-07 16:16:25,073][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000002842_2910208.pth... [2023-03-07 16:16:25,674][232226] Updated weights for policy 0, policy_version 2850 (0.0007) [2023-03-07 16:16:26,486][232226] Updated weights for policy 0, policy_version 2860 (0.0006) [2023-03-07 16:16:27,280][232226] Updated weights for policy 0, policy_version 2870 (0.0006) [2023-03-07 16:16:28,065][232226] Updated weights for policy 0, policy_version 2880 (0.0005) [2023-03-07 16:16:28,857][232226] Updated weights for policy 0, policy_version 2890 (0.0006) [2023-03-07 16:16:29,650][232226] Updated weights for policy 0, policy_version 2900 (0.0006) [2023-03-07 16:16:30,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12658.4). Total num frames: 2974720. Throughput: 0: 12879.9. Samples: 2939303. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:16:30,069][231894] Avg episode reward: [(0, '197.036')] [2023-03-07 16:16:30,437][232226] Updated weights for policy 0, policy_version 2910 (0.0007) [2023-03-07 16:16:31,236][232226] Updated weights for policy 0, policy_version 2920 (0.0007) [2023-03-07 16:16:32,027][232226] Updated weights for policy 0, policy_version 2930 (0.0007) [2023-03-07 16:16:32,800][232226] Updated weights for policy 0, policy_version 2940 (0.0005) [2023-03-07 16:16:33,602][232226] Updated weights for policy 0, policy_version 2950 (0.0006) [2023-03-07 16:16:34,403][232226] Updated weights for policy 0, policy_version 2960 (0.0006) [2023-03-07 16:16:35,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12663.5). Total num frames: 3039232. Throughput: 0: 12891.9. Samples: 3017042. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:16:35,070][231894] Avg episode reward: [(0, '196.095')] [2023-03-07 16:16:35,178][232226] Updated weights for policy 0, policy_version 2970 (0.0006) [2023-03-07 16:16:35,985][232226] Updated weights for policy 0, policy_version 2980 (0.0006) [2023-03-07 16:16:36,767][232226] Updated weights for policy 0, policy_version 2990 (0.0006) [2023-03-07 16:16:37,552][232226] Updated weights for policy 0, policy_version 3000 (0.0006) [2023-03-07 16:16:38,349][232226] Updated weights for policy 0, policy_version 3010 (0.0007) [2023-03-07 16:16:39,158][232226] Updated weights for policy 0, policy_version 3020 (0.0007) [2023-03-07 16:16:39,960][232226] Updated weights for policy 0, policy_version 3030 (0.0007) [2023-03-07 16:16:40,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12668.3). Total num frames: 3103744. Throughput: 0: 12893.1. Samples: 3094425. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:16:40,069][231894] Avg episode reward: [(0, '200.997')] [2023-03-07 16:16:40,746][232226] Updated weights for policy 0, policy_version 3040 (0.0006) [2023-03-07 16:16:41,545][232226] Updated weights for policy 0, policy_version 3050 (0.0006) [2023-03-07 16:16:42,353][232226] Updated weights for policy 0, policy_version 3060 (0.0007) [2023-03-07 16:16:43,160][232226] Updated weights for policy 0, policy_version 3070 (0.0007) [2023-03-07 16:16:43,949][232226] Updated weights for policy 0, policy_version 3080 (0.0006) [2023-03-07 16:16:44,726][232226] Updated weights for policy 0, policy_version 3090 (0.0007) [2023-03-07 16:16:45,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12673.0). Total num frames: 3168256. Throughput: 0: 12887.5. Samples: 3132889. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:16:45,069][231894] Avg episode reward: [(0, '200.584')] [2023-03-07 16:16:45,534][232226] Updated weights for policy 0, policy_version 3100 (0.0006) [2023-03-07 16:16:46,345][232226] Updated weights for policy 0, policy_version 3110 (0.0006) [2023-03-07 16:16:47,133][232226] Updated weights for policy 0, policy_version 3120 (0.0006) [2023-03-07 16:16:47,912][232226] Updated weights for policy 0, policy_version 3130 (0.0006) [2023-03-07 16:16:48,718][232226] Updated weights for policy 0, policy_version 3140 (0.0007) [2023-03-07 16:16:49,521][232226] Updated weights for policy 0, policy_version 3150 (0.0007) [2023-03-07 16:16:50,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12677.5). Total num frames: 3232768. Throughput: 0: 12880.4. Samples: 3210137. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:16:50,069][231894] Avg episode reward: [(0, '203.389')] [2023-03-07 16:16:50,308][232226] Updated weights for policy 0, policy_version 3160 (0.0006) [2023-03-07 16:16:51,104][232226] Updated weights for policy 0, policy_version 3170 (0.0006) [2023-03-07 16:16:51,911][232226] Updated weights for policy 0, policy_version 3180 (0.0006) [2023-03-07 16:16:52,695][232226] Updated weights for policy 0, policy_version 3190 (0.0006) [2023-03-07 16:16:53,484][232226] Updated weights for policy 0, policy_version 3200 (0.0006) [2023-03-07 16:16:54,306][232226] Updated weights for policy 0, policy_version 3210 (0.0006) [2023-03-07 16:16:55,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12885.3, 300 sec: 12677.9). Total num frames: 3296256. Throughput: 0: 12877.2. Samples: 3287321. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:16:55,069][231894] Avg episode reward: [(0, '197.866')] [2023-03-07 16:16:55,086][232226] Updated weights for policy 0, policy_version 3220 (0.0006) [2023-03-07 16:16:55,873][232226] Updated weights for policy 0, policy_version 3230 (0.0006) [2023-03-07 16:16:56,681][232226] Updated weights for policy 0, policy_version 3240 (0.0006) [2023-03-07 16:16:57,466][232226] Updated weights for policy 0, policy_version 3250 (0.0006) [2023-03-07 16:16:58,270][232226] Updated weights for policy 0, policy_version 3260 (0.0007) [2023-03-07 16:16:59,058][232226] Updated weights for policy 0, policy_version 3270 (0.0006) [2023-03-07 16:16:59,858][232226] Updated weights for policy 0, policy_version 3280 (0.0006) [2023-03-07 16:17:00,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12868.3, 300 sec: 12682.1). Total num frames: 3360768. Throughput: 0: 12882.7. Samples: 3326028. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 16:17:00,069][231894] Avg episode reward: [(0, '204.586')] [2023-03-07 16:17:00,657][232226] Updated weights for policy 0, policy_version 3290 (0.0007) [2023-03-07 16:17:01,466][232226] Updated weights for policy 0, policy_version 3300 (0.0007) [2023-03-07 16:17:02,243][232226] Updated weights for policy 0, policy_version 3310 (0.0005) [2023-03-07 16:17:03,039][232226] Updated weights for policy 0, policy_version 3320 (0.0006) [2023-03-07 16:17:03,841][232226] Updated weights for policy 0, policy_version 3330 (0.0006) [2023-03-07 16:17:04,623][232226] Updated weights for policy 0, policy_version 3340 (0.0007) [2023-03-07 16:17:05,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.4, 300 sec: 12686.2). Total num frames: 3425280. Throughput: 0: 12876.8. Samples: 3403093. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:17:05,069][231894] Avg episode reward: [(0, '194.568')] [2023-03-07 16:17:05,435][232226] Updated weights for policy 0, policy_version 3350 (0.0006) [2023-03-07 16:17:06,226][232226] Updated weights for policy 0, policy_version 3360 (0.0006) [2023-03-07 16:17:07,021][232226] Updated weights for policy 0, policy_version 3370 (0.0007) [2023-03-07 16:17:07,822][232226] Updated weights for policy 0, policy_version 3380 (0.0006) [2023-03-07 16:17:08,609][232226] Updated weights for policy 0, policy_version 3390 (0.0007) [2023-03-07 16:17:09,404][232226] Updated weights for policy 0, policy_version 3400 (0.0007) [2023-03-07 16:17:10,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12690.2). Total num frames: 3489792. Throughput: 0: 12883.2. Samples: 3480366. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:17:10,069][231894] Avg episode reward: [(0, '204.141')] [2023-03-07 16:17:10,184][232226] Updated weights for policy 0, policy_version 3410 (0.0006) [2023-03-07 16:17:10,974][232226] Updated weights for policy 0, policy_version 3420 (0.0007) [2023-03-07 16:17:11,781][232226] Updated weights for policy 0, policy_version 3430 (0.0006) [2023-03-07 16:17:12,574][232226] Updated weights for policy 0, policy_version 3440 (0.0006) [2023-03-07 16:17:13,374][232226] Updated weights for policy 0, policy_version 3450 (0.0006) [2023-03-07 16:17:14,133][232226] Updated weights for policy 0, policy_version 3460 (0.0006) [2023-03-07 16:17:14,944][232226] Updated weights for policy 0, policy_version 3470 (0.0007) [2023-03-07 16:17:15,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12693.9). Total num frames: 3554304. Throughput: 0: 12884.4. Samples: 3519100. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 16:17:15,069][231894] Avg episode reward: [(0, '197.835')] [2023-03-07 16:17:15,737][232226] Updated weights for policy 0, policy_version 3480 (0.0005) [2023-03-07 16:17:16,528][232226] Updated weights for policy 0, policy_version 3490 (0.0006) [2023-03-07 16:17:17,303][232226] Updated weights for policy 0, policy_version 3500 (0.0006) [2023-03-07 16:17:18,104][232226] Updated weights for policy 0, policy_version 3510 (0.0006) [2023-03-07 16:17:18,905][232226] Updated weights for policy 0, policy_version 3520 (0.0007) [2023-03-07 16:17:19,695][232226] Updated weights for policy 0, policy_version 3530 (0.0007) [2023-03-07 16:17:20,069][231894] Fps is (10 sec: 12902.2, 60 sec: 12885.3, 300 sec: 12697.6). Total num frames: 3618816. Throughput: 0: 12884.8. Samples: 3596858. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:17:20,069][231894] Avg episode reward: [(0, '206.254')] [2023-03-07 16:17:20,491][232226] Updated weights for policy 0, policy_version 3540 (0.0006) [2023-03-07 16:17:21,286][232226] Updated weights for policy 0, policy_version 3550 (0.0006) [2023-03-07 16:17:22,081][232226] Updated weights for policy 0, policy_version 3560 (0.0008) [2023-03-07 16:17:22,889][232226] Updated weights for policy 0, policy_version 3570 (0.0006) [2023-03-07 16:17:23,674][232226] Updated weights for policy 0, policy_version 3580 (0.0006) [2023-03-07 16:17:24,498][232226] Updated weights for policy 0, policy_version 3590 (0.0006) [2023-03-07 16:17:25,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12697.6). Total num frames: 3682304. Throughput: 0: 12881.3. Samples: 3674083. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:17:25,069][231894] Avg episode reward: [(0, '199.844')] [2023-03-07 16:17:25,308][232226] Updated weights for policy 0, policy_version 3600 (0.0007) [2023-03-07 16:17:26,092][232226] Updated weights for policy 0, policy_version 3610 (0.0006) [2023-03-07 16:17:26,882][232226] Updated weights for policy 0, policy_version 3620 (0.0005) [2023-03-07 16:17:27,683][232226] Updated weights for policy 0, policy_version 3630 (0.0006) [2023-03-07 16:17:28,474][232226] Updated weights for policy 0, policy_version 3640 (0.0008) [2023-03-07 16:17:29,277][232226] Updated weights for policy 0, policy_version 3650 (0.0007) [2023-03-07 16:17:30,062][232226] Updated weights for policy 0, policy_version 3660 (0.0006) [2023-03-07 16:17:30,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12885.3, 300 sec: 12704.5). Total num frames: 3747840. Throughput: 0: 12876.9. Samples: 3712347. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:17:30,069][231894] Avg episode reward: [(0, '191.318')] [2023-03-07 16:17:30,854][232226] Updated weights for policy 0, policy_version 3670 (0.0006) [2023-03-07 16:17:31,651][232226] Updated weights for policy 0, policy_version 3680 (0.0007) [2023-03-07 16:17:32,432][232226] Updated weights for policy 0, policy_version 3690 (0.0006) [2023-03-07 16:17:33,234][232226] Updated weights for policy 0, policy_version 3700 (0.0007) [2023-03-07 16:17:34,022][232226] Updated weights for policy 0, policy_version 3710 (0.0007) [2023-03-07 16:17:34,826][232226] Updated weights for policy 0, policy_version 3720 (0.0006) [2023-03-07 16:17:35,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12878.1). Total num frames: 3811328. Throughput: 0: 12878.5. Samples: 3789668. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:17:35,069][231894] Avg episode reward: [(0, '205.158')] [2023-03-07 16:17:35,625][232226] Updated weights for policy 0, policy_version 3730 (0.0007) [2023-03-07 16:17:36,433][232226] Updated weights for policy 0, policy_version 3740 (0.0006) [2023-03-07 16:17:37,201][232226] Updated weights for policy 0, policy_version 3750 (0.0007) [2023-03-07 16:17:38,016][232226] Updated weights for policy 0, policy_version 3760 (0.0007) [2023-03-07 16:17:38,829][232226] Updated weights for policy 0, policy_version 3770 (0.0007) [2023-03-07 16:17:39,617][232226] Updated weights for policy 0, policy_version 3780 (0.0007) [2023-03-07 16:17:40,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12878.1). Total num frames: 3875840. Throughput: 0: 12876.9. Samples: 3866780. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:17:40,069][231894] Avg episode reward: [(0, '196.726')] [2023-03-07 16:17:40,425][232226] Updated weights for policy 0, policy_version 3790 (0.0006) [2023-03-07 16:17:41,205][232226] Updated weights for policy 0, policy_version 3800 (0.0006) [2023-03-07 16:17:42,005][232226] Updated weights for policy 0, policy_version 3810 (0.0006) [2023-03-07 16:17:42,805][232226] Updated weights for policy 0, policy_version 3820 (0.0007) [2023-03-07 16:17:43,606][232226] Updated weights for policy 0, policy_version 3830 (0.0006) [2023-03-07 16:17:44,393][232226] Updated weights for policy 0, policy_version 3840 (0.0006) [2023-03-07 16:17:45,069][231894] Fps is (10 sec: 12902.2, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 3940352. Throughput: 0: 12872.8. Samples: 3905304. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:17:45,070][231894] Avg episode reward: [(0, '198.626')] [2023-03-07 16:17:45,196][232226] Updated weights for policy 0, policy_version 3850 (0.0007) [2023-03-07 16:17:45,995][232226] Updated weights for policy 0, policy_version 3860 (0.0006) [2023-03-07 16:17:46,762][232226] Updated weights for policy 0, policy_version 3870 (0.0007) [2023-03-07 16:17:47,557][232226] Updated weights for policy 0, policy_version 3880 (0.0006) [2023-03-07 16:17:48,368][232226] Updated weights for policy 0, policy_version 3890 (0.0007) [2023-03-07 16:17:49,153][232226] Updated weights for policy 0, policy_version 3900 (0.0007) [2023-03-07 16:17:49,957][232226] Updated weights for policy 0, policy_version 3910 (0.0006) [2023-03-07 16:17:50,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 4004864. Throughput: 0: 12875.2. Samples: 3982475. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:17:50,069][231894] Avg episode reward: [(0, '198.992')] [2023-03-07 16:17:50,746][232226] Updated weights for policy 0, policy_version 3920 (0.0006) [2023-03-07 16:17:51,557][232226] Updated weights for policy 0, policy_version 3930 (0.0006) [2023-03-07 16:17:52,337][232226] Updated weights for policy 0, policy_version 3940 (0.0006) [2023-03-07 16:17:53,149][232226] Updated weights for policy 0, policy_version 3950 (0.0006) [2023-03-07 16:17:53,945][232226] Updated weights for policy 0, policy_version 3960 (0.0007) [2023-03-07 16:17:54,735][232226] Updated weights for policy 0, policy_version 3970 (0.0007) [2023-03-07 16:17:55,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12874.7). Total num frames: 4069376. Throughput: 0: 12879.3. Samples: 4059937. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:17:55,069][231894] Avg episode reward: [(0, '204.558')] [2023-03-07 16:17:55,538][232226] Updated weights for policy 0, policy_version 3980 (0.0007) [2023-03-07 16:17:56,318][232226] Updated weights for policy 0, policy_version 3990 (0.0006) [2023-03-07 16:17:57,118][232226] Updated weights for policy 0, policy_version 4000 (0.0006) [2023-03-07 16:17:57,916][232226] Updated weights for policy 0, policy_version 4010 (0.0007) [2023-03-07 16:17:58,714][232226] Updated weights for policy 0, policy_version 4020 (0.0006) [2023-03-07 16:17:59,510][232226] Updated weights for policy 0, policy_version 4030 (0.0007) [2023-03-07 16:18:00,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 4132864. Throughput: 0: 12872.8. Samples: 4098374. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:18:00,069][231894] Avg episode reward: [(0, '209.241')] [2023-03-07 16:18:00,071][232173] Saving new best policy, reward=209.241! [2023-03-07 16:18:00,325][232226] Updated weights for policy 0, policy_version 4040 (0.0007) [2023-03-07 16:18:01,118][232226] Updated weights for policy 0, policy_version 4050 (0.0006) [2023-03-07 16:18:01,912][232226] Updated weights for policy 0, policy_version 4060 (0.0006) [2023-03-07 16:18:02,713][232226] Updated weights for policy 0, policy_version 4070 (0.0006) [2023-03-07 16:18:03,520][232226] Updated weights for policy 0, policy_version 4080 (0.0006) [2023-03-07 16:18:04,303][232226] Updated weights for policy 0, policy_version 4090 (0.0006) [2023-03-07 16:18:05,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 4197376. Throughput: 0: 12851.4. Samples: 4175172. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:18:05,070][231894] Avg episode reward: [(0, '204.795')] [2023-03-07 16:18:05,104][232226] Updated weights for policy 0, policy_version 4100 (0.0005) [2023-03-07 16:18:05,891][232226] Updated weights for policy 0, policy_version 4110 (0.0007) [2023-03-07 16:18:06,685][232226] Updated weights for policy 0, policy_version 4120 (0.0006) [2023-03-07 16:18:07,482][232226] Updated weights for policy 0, policy_version 4130 (0.0006) [2023-03-07 16:18:08,262][232226] Updated weights for policy 0, policy_version 4140 (0.0008) [2023-03-07 16:18:09,059][232226] Updated weights for policy 0, policy_version 4150 (0.0007) [2023-03-07 16:18:09,859][232226] Updated weights for policy 0, policy_version 4160 (0.0006) [2023-03-07 16:18:10,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.2, 300 sec: 12871.2). Total num frames: 4261888. Throughput: 0: 12858.9. Samples: 4252734. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:18:10,069][231894] Avg episode reward: [(0, '197.399')] [2023-03-07 16:18:10,638][232226] Updated weights for policy 0, policy_version 4170 (0.0007) [2023-03-07 16:18:11,418][232226] Updated weights for policy 0, policy_version 4180 (0.0006) [2023-03-07 16:18:12,217][232226] Updated weights for policy 0, policy_version 4190 (0.0006) [2023-03-07 16:18:13,013][232226] Updated weights for policy 0, policy_version 4200 (0.0007) [2023-03-07 16:18:13,813][232226] Updated weights for policy 0, policy_version 4210 (0.0007) [2023-03-07 16:18:14,604][232226] Updated weights for policy 0, policy_version 4220 (0.0007) [2023-03-07 16:18:15,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 4326400. Throughput: 0: 12872.3. Samples: 4291601. Policy #0 lag: (min: 0.0, avg: 1.2, max: 4.0) [2023-03-07 16:18:15,069][231894] Avg episode reward: [(0, '200.230')] [2023-03-07 16:18:15,409][232226] Updated weights for policy 0, policy_version 4230 (0.0006) [2023-03-07 16:18:16,206][232226] Updated weights for policy 0, policy_version 4240 (0.0007) [2023-03-07 16:18:17,010][232226] Updated weights for policy 0, policy_version 4250 (0.0006) [2023-03-07 16:18:17,794][232226] Updated weights for policy 0, policy_version 4260 (0.0007) [2023-03-07 16:18:18,602][232226] Updated weights for policy 0, policy_version 4270 (0.0007) [2023-03-07 16:18:19,395][232226] Updated weights for policy 0, policy_version 4280 (0.0007) [2023-03-07 16:18:20,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 4390912. Throughput: 0: 12866.2. Samples: 4368649. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:18:20,069][231894] Avg episode reward: [(0, '205.033')] [2023-03-07 16:18:20,194][232226] Updated weights for policy 0, policy_version 4290 (0.0006) [2023-03-07 16:18:20,987][232226] Updated weights for policy 0, policy_version 4300 (0.0006) [2023-03-07 16:18:21,788][232226] Updated weights for policy 0, policy_version 4310 (0.0006) [2023-03-07 16:18:22,587][232226] Updated weights for policy 0, policy_version 4320 (0.0006) [2023-03-07 16:18:23,382][232226] Updated weights for policy 0, policy_version 4330 (0.0006) [2023-03-07 16:18:24,182][232226] Updated weights for policy 0, policy_version 4340 (0.0007) [2023-03-07 16:18:24,966][232226] Updated weights for policy 0, policy_version 4350 (0.0006) [2023-03-07 16:18:25,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12874.6). Total num frames: 4455424. Throughput: 0: 12868.3. Samples: 4445855. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:18:25,069][231894] Avg episode reward: [(0, '210.527')] [2023-03-07 16:18:25,074][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000004351_4455424.pth... [2023-03-07 16:18:25,105][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000001334_1366016.pth [2023-03-07 16:18:25,108][232173] Saving new best policy, reward=210.527! [2023-03-07 16:18:25,787][232226] Updated weights for policy 0, policy_version 4360 (0.0006) [2023-03-07 16:18:26,571][232226] Updated weights for policy 0, policy_version 4370 (0.0006) [2023-03-07 16:18:27,382][232226] Updated weights for policy 0, policy_version 4380 (0.0006) [2023-03-07 16:18:28,173][232226] Updated weights for policy 0, policy_version 4390 (0.0006) [2023-03-07 16:18:28,957][232226] Updated weights for policy 0, policy_version 4400 (0.0006) [2023-03-07 16:18:29,768][232226] Updated weights for policy 0, policy_version 4410 (0.0007) [2023-03-07 16:18:30,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12871.2). Total num frames: 4518912. Throughput: 0: 12865.4. Samples: 4484248. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:18:30,069][231894] Avg episode reward: [(0, '202.887')] [2023-03-07 16:18:30,567][232226] Updated weights for policy 0, policy_version 4420 (0.0006) [2023-03-07 16:18:31,353][232226] Updated weights for policy 0, policy_version 4430 (0.0006) [2023-03-07 16:18:32,154][232226] Updated weights for policy 0, policy_version 4440 (0.0006) [2023-03-07 16:18:32,942][232226] Updated weights for policy 0, policy_version 4450 (0.0005) [2023-03-07 16:18:33,749][232226] Updated weights for policy 0, policy_version 4460 (0.0008) [2023-03-07 16:18:34,543][232226] Updated weights for policy 0, policy_version 4470 (0.0006) [2023-03-07 16:18:35,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 4583424. Throughput: 0: 12864.4. Samples: 4561372. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:18:35,069][231894] Avg episode reward: [(0, '203.654')] [2023-03-07 16:18:35,354][232226] Updated weights for policy 0, policy_version 4480 (0.0005) [2023-03-07 16:18:36,150][232226] Updated weights for policy 0, policy_version 4490 (0.0006) [2023-03-07 16:18:36,938][232226] Updated weights for policy 0, policy_version 4500 (0.0006) [2023-03-07 16:18:37,743][232226] Updated weights for policy 0, policy_version 4510 (0.0006) [2023-03-07 16:18:38,517][232226] Updated weights for policy 0, policy_version 4520 (0.0006) [2023-03-07 16:18:39,325][232226] Updated weights for policy 0, policy_version 4530 (0.0006) [2023-03-07 16:18:40,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.2, 300 sec: 12874.6). Total num frames: 4647936. Throughput: 0: 12856.3. Samples: 4638470. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:18:40,070][231894] Avg episode reward: [(0, '201.417')] [2023-03-07 16:18:40,130][232226] Updated weights for policy 0, policy_version 4540 (0.0006) [2023-03-07 16:18:40,897][232226] Updated weights for policy 0, policy_version 4550 (0.0006) [2023-03-07 16:18:41,732][232226] Updated weights for policy 0, policy_version 4560 (0.0006) [2023-03-07 16:18:42,513][232226] Updated weights for policy 0, policy_version 4570 (0.0007) [2023-03-07 16:18:43,308][232226] Updated weights for policy 0, policy_version 4580 (0.0007) [2023-03-07 16:18:44,105][232226] Updated weights for policy 0, policy_version 4590 (0.0006) [2023-03-07 16:18:44,891][232226] Updated weights for policy 0, policy_version 4600 (0.0006) [2023-03-07 16:18:45,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 4712448. Throughput: 0: 12858.9. Samples: 4677025. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:18:45,069][231894] Avg episode reward: [(0, '201.854')] [2023-03-07 16:18:45,699][232226] Updated weights for policy 0, policy_version 4610 (0.0006) [2023-03-07 16:18:46,482][232226] Updated weights for policy 0, policy_version 4620 (0.0006) [2023-03-07 16:18:47,267][232226] Updated weights for policy 0, policy_version 4630 (0.0006) [2023-03-07 16:18:48,086][232226] Updated weights for policy 0, policy_version 4640 (0.0006) [2023-03-07 16:18:48,881][232226] Updated weights for policy 0, policy_version 4650 (0.0006) [2023-03-07 16:18:49,670][232226] Updated weights for policy 0, policy_version 4660 (0.0006) [2023-03-07 16:18:50,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12871.2). Total num frames: 4775936. Throughput: 0: 12868.8. Samples: 4754266. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:18:50,069][231894] Avg episode reward: [(0, '195.484')] [2023-03-07 16:18:50,482][232226] Updated weights for policy 0, policy_version 4670 (0.0007) [2023-03-07 16:18:51,290][232226] Updated weights for policy 0, policy_version 4680 (0.0008) [2023-03-07 16:18:52,072][232226] Updated weights for policy 0, policy_version 4690 (0.0007) [2023-03-07 16:18:52,874][232226] Updated weights for policy 0, policy_version 4700 (0.0006) [2023-03-07 16:18:53,654][232226] Updated weights for policy 0, policy_version 4710 (0.0007) [2023-03-07 16:18:54,433][232226] Updated weights for policy 0, policy_version 4720 (0.0007) [2023-03-07 16:18:55,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12851.2, 300 sec: 12871.2). Total num frames: 4840448. Throughput: 0: 12862.2. Samples: 4831533. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:18:55,069][231894] Avg episode reward: [(0, '197.967')] [2023-03-07 16:18:55,249][232226] Updated weights for policy 0, policy_version 4730 (0.0006) [2023-03-07 16:18:56,037][232226] Updated weights for policy 0, policy_version 4740 (0.0006) [2023-03-07 16:18:56,817][232226] Updated weights for policy 0, policy_version 4750 (0.0007) [2023-03-07 16:18:57,625][232226] Updated weights for policy 0, policy_version 4760 (0.0006) [2023-03-07 16:18:58,427][232226] Updated weights for policy 0, policy_version 4770 (0.0006) [2023-03-07 16:18:59,224][232226] Updated weights for policy 0, policy_version 4780 (0.0006) [2023-03-07 16:19:00,004][232226] Updated weights for policy 0, policy_version 4790 (0.0007) [2023-03-07 16:19:00,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 4904960. Throughput: 0: 12859.5. Samples: 4870278. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 16:19:00,070][231894] Avg episode reward: [(0, '208.917')] [2023-03-07 16:19:00,802][232226] Updated weights for policy 0, policy_version 4800 (0.0007) [2023-03-07 16:19:01,579][232226] Updated weights for policy 0, policy_version 4810 (0.0006) [2023-03-07 16:19:02,372][232226] Updated weights for policy 0, policy_version 4820 (0.0006) [2023-03-07 16:19:03,177][232226] Updated weights for policy 0, policy_version 4830 (0.0006) [2023-03-07 16:19:03,959][232226] Updated weights for policy 0, policy_version 4840 (0.0006) [2023-03-07 16:19:04,760][232226] Updated weights for policy 0, policy_version 4850 (0.0007) [2023-03-07 16:19:05,069][231894] Fps is (10 sec: 12902.2, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 4969472. Throughput: 0: 12864.2. Samples: 4947540. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:19:05,070][231894] Avg episode reward: [(0, '198.546')] [2023-03-07 16:19:05,553][232226] Updated weights for policy 0, policy_version 4860 (0.0006) [2023-03-07 16:19:06,338][232226] Updated weights for policy 0, policy_version 4870 (0.0006) [2023-03-07 16:19:07,122][232226] Updated weights for policy 0, policy_version 4880 (0.0006) [2023-03-07 16:19:07,919][232226] Updated weights for policy 0, policy_version 4890 (0.0006) [2023-03-07 16:19:08,708][232226] Updated weights for policy 0, policy_version 4900 (0.0006) [2023-03-07 16:19:09,524][232226] Updated weights for policy 0, policy_version 4910 (0.0006) [2023-03-07 16:19:10,069][231894] Fps is (10 sec: 13004.8, 60 sec: 12885.3, 300 sec: 12874.6). Total num frames: 5035008. Throughput: 0: 12875.9. Samples: 5025270. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:19:10,069][231894] Avg episode reward: [(0, '202.730')] [2023-03-07 16:19:10,327][232226] Updated weights for policy 0, policy_version 4920 (0.0008) [2023-03-07 16:19:11,110][232226] Updated weights for policy 0, policy_version 4930 (0.0006) [2023-03-07 16:19:11,888][232226] Updated weights for policy 0, policy_version 4940 (0.0006) [2023-03-07 16:19:12,700][232226] Updated weights for policy 0, policy_version 4950 (0.0007) [2023-03-07 16:19:13,484][232226] Updated weights for policy 0, policy_version 4960 (0.0009) [2023-03-07 16:19:14,297][232226] Updated weights for policy 0, policy_version 4970 (0.0006) [2023-03-07 16:19:15,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 5098496. Throughput: 0: 12880.9. Samples: 5063889. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:19:15,069][231894] Avg episode reward: [(0, '203.858')] [2023-03-07 16:19:15,093][232226] Updated weights for policy 0, policy_version 4980 (0.0006) [2023-03-07 16:19:15,893][232226] Updated weights for policy 0, policy_version 4990 (0.0007) [2023-03-07 16:19:16,672][232226] Updated weights for policy 0, policy_version 5000 (0.0006) [2023-03-07 16:19:17,468][232226] Updated weights for policy 0, policy_version 5010 (0.0007) [2023-03-07 16:19:18,273][232226] Updated weights for policy 0, policy_version 5020 (0.0007) [2023-03-07 16:19:19,061][232226] Updated weights for policy 0, policy_version 5030 (0.0006) [2023-03-07 16:19:19,854][232226] Updated weights for policy 0, policy_version 5040 (0.0007) [2023-03-07 16:19:20,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 5163008. Throughput: 0: 12879.7. Samples: 5140957. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:19:20,069][231894] Avg episode reward: [(0, '198.851')] [2023-03-07 16:19:20,647][232226] Updated weights for policy 0, policy_version 5050 (0.0006) [2023-03-07 16:19:21,449][232226] Updated weights for policy 0, policy_version 5060 (0.0006) [2023-03-07 16:19:22,230][232226] Updated weights for policy 0, policy_version 5070 (0.0006) [2023-03-07 16:19:23,034][232226] Updated weights for policy 0, policy_version 5080 (0.0006) [2023-03-07 16:19:23,827][232226] Updated weights for policy 0, policy_version 5090 (0.0006) [2023-03-07 16:19:24,631][232226] Updated weights for policy 0, policy_version 5100 (0.0006) [2023-03-07 16:19:25,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 5227520. Throughput: 0: 12886.1. Samples: 5218344. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:19:25,070][231894] Avg episode reward: [(0, '195.118')] [2023-03-07 16:19:25,435][232226] Updated weights for policy 0, policy_version 5110 (0.0006) [2023-03-07 16:19:26,233][232226] Updated weights for policy 0, policy_version 5120 (0.0006) [2023-03-07 16:19:27,032][232226] Updated weights for policy 0, policy_version 5130 (0.0007) [2023-03-07 16:19:27,838][232226] Updated weights for policy 0, policy_version 5140 (0.0006) [2023-03-07 16:19:28,610][232226] Updated weights for policy 0, policy_version 5150 (0.0006) [2023-03-07 16:19:29,421][232226] Updated weights for policy 0, policy_version 5160 (0.0006) [2023-03-07 16:19:30,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12874.6). Total num frames: 5292032. Throughput: 0: 12881.7. Samples: 5256702. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:19:30,080][231894] Avg episode reward: [(0, '197.603')] [2023-03-07 16:19:30,202][232226] Updated weights for policy 0, policy_version 5170 (0.0006) [2023-03-07 16:19:30,972][232226] Updated weights for policy 0, policy_version 5180 (0.0006) [2023-03-07 16:19:31,774][232226] Updated weights for policy 0, policy_version 5190 (0.0006) [2023-03-07 16:19:32,559][232226] Updated weights for policy 0, policy_version 5200 (0.0006) [2023-03-07 16:19:33,361][232226] Updated weights for policy 0, policy_version 5210 (0.0006) [2023-03-07 16:19:34,145][232226] Updated weights for policy 0, policy_version 5220 (0.0006) [2023-03-07 16:19:34,950][232226] Updated weights for policy 0, policy_version 5230 (0.0006) [2023-03-07 16:19:35,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12874.6). Total num frames: 5356544. Throughput: 0: 12891.4. Samples: 5334377. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:19:35,080][231894] Avg episode reward: [(0, '200.178')] [2023-03-07 16:19:35,745][232226] Updated weights for policy 0, policy_version 5240 (0.0006) [2023-03-07 16:19:36,525][232226] Updated weights for policy 0, policy_version 5250 (0.0007) [2023-03-07 16:19:37,321][232226] Updated weights for policy 0, policy_version 5260 (0.0007) [2023-03-07 16:19:38,111][232226] Updated weights for policy 0, policy_version 5270 (0.0006) [2023-03-07 16:19:38,913][232226] Updated weights for policy 0, policy_version 5280 (0.0006) [2023-03-07 16:19:39,706][232226] Updated weights for policy 0, policy_version 5290 (0.0006) [2023-03-07 16:19:40,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.4, 300 sec: 12874.6). Total num frames: 5421056. Throughput: 0: 12896.9. Samples: 5411893. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:19:40,080][231894] Avg episode reward: [(0, '195.274')] [2023-03-07 16:19:40,486][232226] Updated weights for policy 0, policy_version 5300 (0.0006) [2023-03-07 16:19:41,277][232226] Updated weights for policy 0, policy_version 5310 (0.0006) [2023-03-07 16:19:42,070][232226] Updated weights for policy 0, policy_version 5320 (0.0006) [2023-03-07 16:19:42,869][232226] Updated weights for policy 0, policy_version 5330 (0.0007) [2023-03-07 16:19:43,645][232226] Updated weights for policy 0, policy_version 5340 (0.0005) [2023-03-07 16:19:44,443][232226] Updated weights for policy 0, policy_version 5350 (0.0007) [2023-03-07 16:19:45,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12874.6). Total num frames: 5485568. Throughput: 0: 12898.4. Samples: 5450708. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:19:45,080][231894] Avg episode reward: [(0, '184.555')] [2023-03-07 16:19:45,235][232226] Updated weights for policy 0, policy_version 5360 (0.0006) [2023-03-07 16:19:46,005][232226] Updated weights for policy 0, policy_version 5370 (0.0006) [2023-03-07 16:19:46,801][232226] Updated weights for policy 0, policy_version 5380 (0.0006) [2023-03-07 16:19:47,599][232226] Updated weights for policy 0, policy_version 5390 (0.0006) [2023-03-07 16:19:48,382][232226] Updated weights for policy 0, policy_version 5400 (0.0007) [2023-03-07 16:19:49,174][232226] Updated weights for policy 0, policy_version 5410 (0.0006) [2023-03-07 16:19:49,974][232226] Updated weights for policy 0, policy_version 5420 (0.0006) [2023-03-07 16:19:50,069][231894] Fps is (10 sec: 13004.6, 60 sec: 12919.5, 300 sec: 12878.1). Total num frames: 5551104. Throughput: 0: 12908.8. Samples: 5528436. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:19:50,080][231894] Avg episode reward: [(0, '194.969')] [2023-03-07 16:19:50,758][232226] Updated weights for policy 0, policy_version 5430 (0.0006) [2023-03-07 16:19:51,537][232226] Updated weights for policy 0, policy_version 5440 (0.0007) [2023-03-07 16:19:52,336][232226] Updated weights for policy 0, policy_version 5450 (0.0005) [2023-03-07 16:19:53,134][232226] Updated weights for policy 0, policy_version 5460 (0.0007) [2023-03-07 16:19:53,929][232226] Updated weights for policy 0, policy_version 5470 (0.0006) [2023-03-07 16:19:54,706][232226] Updated weights for policy 0, policy_version 5480 (0.0007) [2023-03-07 16:19:55,069][231894] Fps is (10 sec: 13004.8, 60 sec: 12919.5, 300 sec: 12878.1). Total num frames: 5615616. Throughput: 0: 12912.9. Samples: 5606352. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:19:55,069][231894] Avg episode reward: [(0, '194.058')] [2023-03-07 16:19:55,511][232226] Updated weights for policy 0, policy_version 5490 (0.0007) [2023-03-07 16:19:56,286][232226] Updated weights for policy 0, policy_version 5500 (0.0007) [2023-03-07 16:19:57,075][232226] Updated weights for policy 0, policy_version 5510 (0.0006) [2023-03-07 16:19:57,883][232226] Updated weights for policy 0, policy_version 5520 (0.0006) [2023-03-07 16:19:58,672][232226] Updated weights for policy 0, policy_version 5530 (0.0007) [2023-03-07 16:19:59,477][232226] Updated weights for policy 0, policy_version 5540 (0.0006) [2023-03-07 16:20:00,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12919.5, 300 sec: 12878.1). Total num frames: 5680128. Throughput: 0: 12916.4. Samples: 5645127. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:20:00,070][231894] Avg episode reward: [(0, '196.254')] [2023-03-07 16:20:00,270][232226] Updated weights for policy 0, policy_version 5550 (0.0006) [2023-03-07 16:20:01,061][232226] Updated weights for policy 0, policy_version 5560 (0.0006) [2023-03-07 16:20:01,850][232226] Updated weights for policy 0, policy_version 5570 (0.0006) [2023-03-07 16:20:02,649][232226] Updated weights for policy 0, policy_version 5580 (0.0006) [2023-03-07 16:20:03,434][232226] Updated weights for policy 0, policy_version 5590 (0.0007) [2023-03-07 16:20:04,242][232226] Updated weights for policy 0, policy_version 5600 (0.0006) [2023-03-07 16:20:05,061][232226] Updated weights for policy 0, policy_version 5610 (0.0006) [2023-03-07 16:20:05,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12919.5, 300 sec: 12881.6). Total num frames: 5744640. Throughput: 0: 12921.5. Samples: 5722424. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:20:05,072][231894] Avg episode reward: [(0, '199.808')] [2023-03-07 16:20:05,840][232226] Updated weights for policy 0, policy_version 5620 (0.0007) [2023-03-07 16:20:06,637][232226] Updated weights for policy 0, policy_version 5630 (0.0006) [2023-03-07 16:20:07,452][232226] Updated weights for policy 0, policy_version 5640 (0.0006) [2023-03-07 16:20:08,227][232226] Updated weights for policy 0, policy_version 5650 (0.0006) [2023-03-07 16:20:09,045][232226] Updated weights for policy 0, policy_version 5660 (0.0006) [2023-03-07 16:20:09,842][232226] Updated weights for policy 0, policy_version 5670 (0.0006) [2023-03-07 16:20:10,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12885.3, 300 sec: 12878.1). Total num frames: 5808128. Throughput: 0: 12910.9. Samples: 5799335. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:20:10,080][231894] Avg episode reward: [(0, '188.524')] [2023-03-07 16:20:10,650][232226] Updated weights for policy 0, policy_version 5680 (0.0007) [2023-03-07 16:20:11,447][232226] Updated weights for policy 0, policy_version 5690 (0.0007) [2023-03-07 16:20:12,229][232226] Updated weights for policy 0, policy_version 5700 (0.0007) [2023-03-07 16:20:13,029][232226] Updated weights for policy 0, policy_version 5710 (0.0007) [2023-03-07 16:20:13,821][232226] Updated weights for policy 0, policy_version 5720 (0.0007) [2023-03-07 16:20:14,602][232226] Updated weights for policy 0, policy_version 5730 (0.0006) [2023-03-07 16:20:15,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12902.4, 300 sec: 12878.1). Total num frames: 5872640. Throughput: 0: 12915.5. Samples: 5837899. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:20:15,080][231894] Avg episode reward: [(0, '194.974')] [2023-03-07 16:20:15,404][232226] Updated weights for policy 0, policy_version 5740 (0.0006) [2023-03-07 16:20:16,169][232226] Updated weights for policy 0, policy_version 5750 (0.0006) [2023-03-07 16:20:16,973][232226] Updated weights for policy 0, policy_version 5760 (0.0006) [2023-03-07 16:20:17,779][232226] Updated weights for policy 0, policy_version 5770 (0.0006) [2023-03-07 16:20:18,557][232226] Updated weights for policy 0, policy_version 5780 (0.0007) [2023-03-07 16:20:19,353][232226] Updated weights for policy 0, policy_version 5790 (0.0006) [2023-03-07 16:20:20,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12878.1). Total num frames: 5937152. Throughput: 0: 12915.3. Samples: 5915564. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:20:20,080][231894] Avg episode reward: [(0, '197.608')] [2023-03-07 16:20:20,158][232226] Updated weights for policy 0, policy_version 5800 (0.0006) [2023-03-07 16:20:20,946][232226] Updated weights for policy 0, policy_version 5810 (0.0007) [2023-03-07 16:20:21,754][232226] Updated weights for policy 0, policy_version 5820 (0.0006) [2023-03-07 16:20:22,558][232226] Updated weights for policy 0, policy_version 5830 (0.0006) [2023-03-07 16:20:23,344][232226] Updated weights for policy 0, policy_version 5840 (0.0006) [2023-03-07 16:20:24,149][232226] Updated weights for policy 0, policy_version 5850 (0.0007) [2023-03-07 16:20:24,939][232226] Updated weights for policy 0, policy_version 5860 (0.0006) [2023-03-07 16:20:25,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12881.6). Total num frames: 6001664. Throughput: 0: 12904.6. Samples: 5992604. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:20:25,080][231894] Avg episode reward: [(0, '198.041')] [2023-03-07 16:20:25,093][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000005862_6002688.pth... [2023-03-07 16:20:25,123][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000002842_2910208.pth [2023-03-07 16:20:25,750][232226] Updated weights for policy 0, policy_version 5870 (0.0007) [2023-03-07 16:20:26,515][232226] Updated weights for policy 0, policy_version 5880 (0.0007) [2023-03-07 16:20:27,325][232226] Updated weights for policy 0, policy_version 5890 (0.0007) [2023-03-07 16:20:28,120][232226] Updated weights for policy 0, policy_version 5900 (0.0005) [2023-03-07 16:20:28,917][232226] Updated weights for policy 0, policy_version 5910 (0.0006) [2023-03-07 16:20:29,709][232226] Updated weights for policy 0, policy_version 5920 (0.0006) [2023-03-07 16:20:30,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12881.6). Total num frames: 6066176. Throughput: 0: 12901.1. Samples: 6031257. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:20:30,080][231894] Avg episode reward: [(0, '194.498')] [2023-03-07 16:20:30,482][232226] Updated weights for policy 0, policy_version 5930 (0.0006) [2023-03-07 16:20:31,287][232226] Updated weights for policy 0, policy_version 5940 (0.0007) [2023-03-07 16:20:32,088][232226] Updated weights for policy 0, policy_version 5950 (0.0006) [2023-03-07 16:20:32,878][232226] Updated weights for policy 0, policy_version 5960 (0.0006) [2023-03-07 16:20:33,652][232226] Updated weights for policy 0, policy_version 5970 (0.0006) [2023-03-07 16:20:34,448][232226] Updated weights for policy 0, policy_version 5980 (0.0006) [2023-03-07 16:20:35,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12881.6). Total num frames: 6130688. Throughput: 0: 12892.4. Samples: 6108592. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:20:35,080][231894] Avg episode reward: [(0, '195.190')] [2023-03-07 16:20:35,249][232226] Updated weights for policy 0, policy_version 5990 (0.0006) [2023-03-07 16:20:36,026][232226] Updated weights for policy 0, policy_version 6000 (0.0006) [2023-03-07 16:20:36,843][232226] Updated weights for policy 0, policy_version 6010 (0.0006) [2023-03-07 16:20:37,629][232226] Updated weights for policy 0, policy_version 6020 (0.0007) [2023-03-07 16:20:38,426][232226] Updated weights for policy 0, policy_version 6030 (0.0007) [2023-03-07 16:20:39,210][232226] Updated weights for policy 0, policy_version 6040 (0.0006) [2023-03-07 16:20:40,017][232226] Updated weights for policy 0, policy_version 6050 (0.0006) [2023-03-07 16:20:40,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12885.0). Total num frames: 6195200. Throughput: 0: 12884.7. Samples: 6186164. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:20:40,080][231894] Avg episode reward: [(0, '198.626')] [2023-03-07 16:20:40,812][232226] Updated weights for policy 0, policy_version 6060 (0.0007) [2023-03-07 16:20:41,606][232226] Updated weights for policy 0, policy_version 6070 (0.0006) [2023-03-07 16:20:42,402][232226] Updated weights for policy 0, policy_version 6080 (0.0006) [2023-03-07 16:20:43,201][232226] Updated weights for policy 0, policy_version 6090 (0.0007) [2023-03-07 16:20:43,987][232226] Updated weights for policy 0, policy_version 6100 (0.0006) [2023-03-07 16:20:44,797][232226] Updated weights for policy 0, policy_version 6110 (0.0006) [2023-03-07 16:20:45,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12881.6). Total num frames: 6259712. Throughput: 0: 12881.0. Samples: 6224774. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:20:45,080][231894] Avg episode reward: [(0, '190.795')] [2023-03-07 16:20:45,595][232226] Updated weights for policy 0, policy_version 6120 (0.0006) [2023-03-07 16:20:46,389][232226] Updated weights for policy 0, policy_version 6130 (0.0006) [2023-03-07 16:20:47,201][232226] Updated weights for policy 0, policy_version 6140 (0.0006) [2023-03-07 16:20:47,975][232226] Updated weights for policy 0, policy_version 6150 (0.0006) [2023-03-07 16:20:48,772][232226] Updated weights for policy 0, policy_version 6160 (0.0006) [2023-03-07 16:20:49,559][232226] Updated weights for policy 0, policy_version 6170 (0.0006) [2023-03-07 16:20:50,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12885.0). Total num frames: 6324224. Throughput: 0: 12875.8. Samples: 6301837. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:20:50,069][231894] Avg episode reward: [(0, '199.405')] [2023-03-07 16:20:50,350][232226] Updated weights for policy 0, policy_version 6180 (0.0006) [2023-03-07 16:20:51,133][232226] Updated weights for policy 0, policy_version 6190 (0.0006) [2023-03-07 16:20:51,927][232226] Updated weights for policy 0, policy_version 6200 (0.0007) [2023-03-07 16:20:52,718][232226] Updated weights for policy 0, policy_version 6210 (0.0006) [2023-03-07 16:20:53,508][232226] Updated weights for policy 0, policy_version 6220 (0.0006) [2023-03-07 16:20:54,285][232226] Updated weights for policy 0, policy_version 6230 (0.0007) [2023-03-07 16:20:55,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 6388736. Throughput: 0: 12896.1. Samples: 6379658. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:20:55,069][231894] Avg episode reward: [(0, '192.861')] [2023-03-07 16:20:55,093][232226] Updated weights for policy 0, policy_version 6240 (0.0006) [2023-03-07 16:20:55,896][232226] Updated weights for policy 0, policy_version 6250 (0.0006) [2023-03-07 16:20:56,662][232226] Updated weights for policy 0, policy_version 6260 (0.0006) [2023-03-07 16:20:57,466][232226] Updated weights for policy 0, policy_version 6270 (0.0007) [2023-03-07 16:20:58,238][232226] Updated weights for policy 0, policy_version 6280 (0.0007) [2023-03-07 16:20:59,047][232226] Updated weights for policy 0, policy_version 6290 (0.0007) [2023-03-07 16:20:59,833][232226] Updated weights for policy 0, policy_version 6300 (0.0006) [2023-03-07 16:21:00,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12885.1). Total num frames: 6453248. Throughput: 0: 12898.8. Samples: 6418343. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:21:00,069][231894] Avg episode reward: [(0, '201.300')] [2023-03-07 16:21:00,645][232226] Updated weights for policy 0, policy_version 6310 (0.0006) [2023-03-07 16:21:01,446][232226] Updated weights for policy 0, policy_version 6320 (0.0007) [2023-03-07 16:21:02,229][232226] Updated weights for policy 0, policy_version 6330 (0.0005) [2023-03-07 16:21:03,031][232226] Updated weights for policy 0, policy_version 6340 (0.0006) [2023-03-07 16:21:03,817][232226] Updated weights for policy 0, policy_version 6350 (0.0007) [2023-03-07 16:21:04,618][232226] Updated weights for policy 0, policy_version 6360 (0.0006) [2023-03-07 16:21:05,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 6517760. Throughput: 0: 12891.0. Samples: 6495659. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:21:05,069][231894] Avg episode reward: [(0, '191.083')] [2023-03-07 16:21:05,395][232226] Updated weights for policy 0, policy_version 6370 (0.0006) [2023-03-07 16:21:06,206][232226] Updated weights for policy 0, policy_version 6380 (0.0007) [2023-03-07 16:21:07,006][232226] Updated weights for policy 0, policy_version 6390 (0.0007) [2023-03-07 16:21:07,789][232226] Updated weights for policy 0, policy_version 6400 (0.0006) [2023-03-07 16:21:08,586][232226] Updated weights for policy 0, policy_version 6410 (0.0006) [2023-03-07 16:21:09,357][232226] Updated weights for policy 0, policy_version 6420 (0.0006) [2023-03-07 16:21:10,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12885.0). Total num frames: 6582272. Throughput: 0: 12902.9. Samples: 6573233. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:21:10,069][231894] Avg episode reward: [(0, '194.731')] [2023-03-07 16:21:10,153][232226] Updated weights for policy 0, policy_version 6430 (0.0006) [2023-03-07 16:21:10,958][232226] Updated weights for policy 0, policy_version 6440 (0.0006) [2023-03-07 16:21:11,749][232226] Updated weights for policy 0, policy_version 6450 (0.0006) [2023-03-07 16:21:12,537][232226] Updated weights for policy 0, policy_version 6460 (0.0007) [2023-03-07 16:21:13,338][232226] Updated weights for policy 0, policy_version 6470 (0.0006) [2023-03-07 16:21:14,123][232226] Updated weights for policy 0, policy_version 6480 (0.0006) [2023-03-07 16:21:14,940][232226] Updated weights for policy 0, policy_version 6490 (0.0006) [2023-03-07 16:21:15,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12885.0). Total num frames: 6646784. Throughput: 0: 12906.0. Samples: 6612026. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:21:15,069][231894] Avg episode reward: [(0, '190.751')] [2023-03-07 16:21:15,736][232226] Updated weights for policy 0, policy_version 6500 (0.0006) [2023-03-07 16:21:16,516][232226] Updated weights for policy 0, policy_version 6510 (0.0006) [2023-03-07 16:21:17,313][232226] Updated weights for policy 0, policy_version 6520 (0.0006) [2023-03-07 16:21:18,107][232226] Updated weights for policy 0, policy_version 6530 (0.0006) [2023-03-07 16:21:18,896][232226] Updated weights for policy 0, policy_version 6540 (0.0006) [2023-03-07 16:21:19,702][232226] Updated weights for policy 0, policy_version 6550 (0.0006) [2023-03-07 16:21:20,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12885.0). Total num frames: 6711296. Throughput: 0: 12904.7. Samples: 6689302. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:21:20,069][231894] Avg episode reward: [(0, '196.276')] [2023-03-07 16:21:20,506][232226] Updated weights for policy 0, policy_version 6560 (0.0007) [2023-03-07 16:21:21,281][232226] Updated weights for policy 0, policy_version 6570 (0.0006) [2023-03-07 16:21:22,073][232226] Updated weights for policy 0, policy_version 6580 (0.0006) [2023-03-07 16:21:22,877][232226] Updated weights for policy 0, policy_version 6590 (0.0006) [2023-03-07 16:21:23,658][232226] Updated weights for policy 0, policy_version 6600 (0.0006) [2023-03-07 16:21:24,445][232226] Updated weights for policy 0, policy_version 6610 (0.0006) [2023-03-07 16:21:25,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12885.0). Total num frames: 6775808. Throughput: 0: 12903.6. Samples: 6766824. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:21:25,070][231894] Avg episode reward: [(0, '189.528')] [2023-03-07 16:21:25,250][232226] Updated weights for policy 0, policy_version 6620 (0.0007) [2023-03-07 16:21:26,041][232226] Updated weights for policy 0, policy_version 6630 (0.0006) [2023-03-07 16:21:26,829][232226] Updated weights for policy 0, policy_version 6640 (0.0006) [2023-03-07 16:21:27,640][232226] Updated weights for policy 0, policy_version 6650 (0.0006) [2023-03-07 16:21:28,406][232226] Updated weights for policy 0, policy_version 6660 (0.0006) [2023-03-07 16:21:29,206][232226] Updated weights for policy 0, policy_version 6670 (0.0006) [2023-03-07 16:21:30,007][232226] Updated weights for policy 0, policy_version 6680 (0.0006) [2023-03-07 16:21:30,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12885.0). Total num frames: 6840320. Throughput: 0: 12905.7. Samples: 6805528. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:21:30,069][231894] Avg episode reward: [(0, '187.125')] [2023-03-07 16:21:30,791][232226] Updated weights for policy 0, policy_version 6690 (0.0006) [2023-03-07 16:21:31,560][232226] Updated weights for policy 0, policy_version 6700 (0.0006) [2023-03-07 16:21:32,379][232226] Updated weights for policy 0, policy_version 6710 (0.0007) [2023-03-07 16:21:33,169][232226] Updated weights for policy 0, policy_version 6720 (0.0006) [2023-03-07 16:21:33,968][232226] Updated weights for policy 0, policy_version 6730 (0.0006) [2023-03-07 16:21:34,779][232226] Updated weights for policy 0, policy_version 6740 (0.0007) [2023-03-07 16:21:35,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12885.0). Total num frames: 6904832. Throughput: 0: 12914.5. Samples: 6882989. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:21:35,070][231894] Avg episode reward: [(0, '187.462')] [2023-03-07 16:21:35,558][232226] Updated weights for policy 0, policy_version 6750 (0.0006) [2023-03-07 16:21:36,353][232226] Updated weights for policy 0, policy_version 6760 (0.0007) [2023-03-07 16:21:37,156][232226] Updated weights for policy 0, policy_version 6770 (0.0006) [2023-03-07 16:21:37,935][232226] Updated weights for policy 0, policy_version 6780 (0.0007) [2023-03-07 16:21:38,754][232226] Updated weights for policy 0, policy_version 6790 (0.0007) [2023-03-07 16:21:39,557][232226] Updated weights for policy 0, policy_version 6800 (0.0006) [2023-03-07 16:21:40,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12885.0). Total num frames: 6969344. Throughput: 0: 12901.1. Samples: 6960207. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:21:40,069][231894] Avg episode reward: [(0, '194.286')] [2023-03-07 16:21:40,356][232226] Updated weights for policy 0, policy_version 6810 (0.0006) [2023-03-07 16:21:41,138][232226] Updated weights for policy 0, policy_version 6820 (0.0006) [2023-03-07 16:21:41,936][232226] Updated weights for policy 0, policy_version 6830 (0.0006) [2023-03-07 16:21:42,725][232226] Updated weights for policy 0, policy_version 6840 (0.0007) [2023-03-07 16:21:43,521][232226] Updated weights for policy 0, policy_version 6850 (0.0007) [2023-03-07 16:21:44,318][232226] Updated weights for policy 0, policy_version 6860 (0.0008) [2023-03-07 16:21:45,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12885.0). Total num frames: 7033856. Throughput: 0: 12897.3. Samples: 6998721. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:21:45,069][231894] Avg episode reward: [(0, '191.065')] [2023-03-07 16:21:45,097][232226] Updated weights for policy 0, policy_version 6870 (0.0006) [2023-03-07 16:21:45,882][232226] Updated weights for policy 0, policy_version 6880 (0.0008) [2023-03-07 16:21:46,681][232226] Updated weights for policy 0, policy_version 6890 (0.0007) [2023-03-07 16:21:47,472][232226] Updated weights for policy 0, policy_version 6900 (0.0006) [2023-03-07 16:21:48,251][232226] Updated weights for policy 0, policy_version 6910 (0.0006) [2023-03-07 16:21:49,043][232226] Updated weights for policy 0, policy_version 6920 (0.0006) [2023-03-07 16:21:49,829][232226] Updated weights for policy 0, policy_version 6930 (0.0006) [2023-03-07 16:21:50,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12888.5). Total num frames: 7098368. Throughput: 0: 12909.7. Samples: 7076593. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:21:50,069][231894] Avg episode reward: [(0, '195.944')] [2023-03-07 16:21:50,636][232226] Updated weights for policy 0, policy_version 6940 (0.0006) [2023-03-07 16:21:51,444][232226] Updated weights for policy 0, policy_version 6950 (0.0007) [2023-03-07 16:21:52,228][232226] Updated weights for policy 0, policy_version 6960 (0.0006) [2023-03-07 16:21:53,015][232226] Updated weights for policy 0, policy_version 6970 (0.0006) [2023-03-07 16:21:53,806][232226] Updated weights for policy 0, policy_version 6980 (0.0007) [2023-03-07 16:21:54,608][232226] Updated weights for policy 0, policy_version 6990 (0.0006) [2023-03-07 16:21:55,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12888.5). Total num frames: 7162880. Throughput: 0: 12904.3. Samples: 7153929. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:21:55,069][231894] Avg episode reward: [(0, '190.403')] [2023-03-07 16:21:55,401][232226] Updated weights for policy 0, policy_version 7000 (0.0006) [2023-03-07 16:21:56,193][232226] Updated weights for policy 0, policy_version 7010 (0.0006) [2023-03-07 16:21:56,995][232226] Updated weights for policy 0, policy_version 7020 (0.0006) [2023-03-07 16:21:57,792][232226] Updated weights for policy 0, policy_version 7030 (0.0007) [2023-03-07 16:21:58,589][232226] Updated weights for policy 0, policy_version 7040 (0.0006) [2023-03-07 16:21:59,371][232226] Updated weights for policy 0, policy_version 7050 (0.0006) [2023-03-07 16:22:00,069][231894] Fps is (10 sec: 12902.2, 60 sec: 12902.4, 300 sec: 12888.5). Total num frames: 7227392. Throughput: 0: 12903.9. Samples: 7192700. Policy #0 lag: (min: 0.0, avg: 1.1, max: 4.0) [2023-03-07 16:22:00,070][231894] Avg episode reward: [(0, '197.620')] [2023-03-07 16:22:00,167][232226] Updated weights for policy 0, policy_version 7060 (0.0006) [2023-03-07 16:22:00,959][232226] Updated weights for policy 0, policy_version 7070 (0.0006) [2023-03-07 16:22:01,754][232226] Updated weights for policy 0, policy_version 7080 (0.0006) [2023-03-07 16:22:02,546][232226] Updated weights for policy 0, policy_version 7090 (0.0007) [2023-03-07 16:22:03,339][232226] Updated weights for policy 0, policy_version 7100 (0.0006) [2023-03-07 16:22:04,134][232226] Updated weights for policy 0, policy_version 7110 (0.0005) [2023-03-07 16:22:04,921][232226] Updated weights for policy 0, policy_version 7120 (0.0006) [2023-03-07 16:22:05,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12888.5). Total num frames: 7291904. Throughput: 0: 12905.4. Samples: 7270045. Policy #0 lag: (min: 0.0, avg: 1.1, max: 4.0) [2023-03-07 16:22:05,069][231894] Avg episode reward: [(0, '196.646')] [2023-03-07 16:22:05,711][232226] Updated weights for policy 0, policy_version 7130 (0.0007) [2023-03-07 16:22:06,509][232226] Updated weights for policy 0, policy_version 7140 (0.0006) [2023-03-07 16:22:07,306][232226] Updated weights for policy 0, policy_version 7150 (0.0006) [2023-03-07 16:22:08,126][232226] Updated weights for policy 0, policy_version 7160 (0.0006) [2023-03-07 16:22:08,899][232226] Updated weights for policy 0, policy_version 7170 (0.0006) [2023-03-07 16:22:09,677][232226] Updated weights for policy 0, policy_version 7180 (0.0006) [2023-03-07 16:22:10,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12888.5). Total num frames: 7356416. Throughput: 0: 12901.7. Samples: 7347400. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:22:10,069][231894] Avg episode reward: [(0, '195.067')] [2023-03-07 16:22:10,485][232226] Updated weights for policy 0, policy_version 7190 (0.0007) [2023-03-07 16:22:11,269][232226] Updated weights for policy 0, policy_version 7200 (0.0007) [2023-03-07 16:22:12,061][232226] Updated weights for policy 0, policy_version 7210 (0.0006) [2023-03-07 16:22:12,849][232226] Updated weights for policy 0, policy_version 7220 (0.0006) [2023-03-07 16:22:13,644][232226] Updated weights for policy 0, policy_version 7230 (0.0006) [2023-03-07 16:22:14,442][232226] Updated weights for policy 0, policy_version 7240 (0.0005) [2023-03-07 16:22:15,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12888.5). Total num frames: 7420928. Throughput: 0: 12904.1. Samples: 7386212. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:22:15,069][231894] Avg episode reward: [(0, '198.632')] [2023-03-07 16:22:15,238][232226] Updated weights for policy 0, policy_version 7250 (0.0006) [2023-03-07 16:22:16,029][232226] Updated weights for policy 0, policy_version 7260 (0.0007) [2023-03-07 16:22:16,819][232226] Updated weights for policy 0, policy_version 7270 (0.0006) [2023-03-07 16:22:17,609][232226] Updated weights for policy 0, policy_version 7280 (0.0007) [2023-03-07 16:22:18,409][232226] Updated weights for policy 0, policy_version 7290 (0.0006) [2023-03-07 16:22:19,211][232226] Updated weights for policy 0, policy_version 7300 (0.0007) [2023-03-07 16:22:19,985][232226] Updated weights for policy 0, policy_version 7310 (0.0007) [2023-03-07 16:22:20,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12892.0). Total num frames: 7485440. Throughput: 0: 12905.0. Samples: 7463713. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:22:20,069][231894] Avg episode reward: [(0, '183.541')] [2023-03-07 16:22:20,762][232226] Updated weights for policy 0, policy_version 7320 (0.0006) [2023-03-07 16:22:21,584][232226] Updated weights for policy 0, policy_version 7330 (0.0005) [2023-03-07 16:22:22,380][232226] Updated weights for policy 0, policy_version 7340 (0.0007) [2023-03-07 16:22:23,171][232226] Updated weights for policy 0, policy_version 7350 (0.0006) [2023-03-07 16:22:23,943][232226] Updated weights for policy 0, policy_version 7360 (0.0007) [2023-03-07 16:22:24,750][232226] Updated weights for policy 0, policy_version 7370 (0.0006) [2023-03-07 16:22:25,069][231894] Fps is (10 sec: 13004.8, 60 sec: 12919.5, 300 sec: 12892.0). Total num frames: 7550976. Throughput: 0: 12912.3. Samples: 7541263. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:22:25,069][231894] Avg episode reward: [(0, '183.712')] [2023-03-07 16:22:25,073][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000007374_7550976.pth... [2023-03-07 16:22:25,102][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000004351_4455424.pth [2023-03-07 16:22:25,534][232226] Updated weights for policy 0, policy_version 7380 (0.0007) [2023-03-07 16:22:26,342][232226] Updated weights for policy 0, policy_version 7390 (0.0006) [2023-03-07 16:22:27,142][232226] Updated weights for policy 0, policy_version 7400 (0.0007) [2023-03-07 16:22:27,943][232226] Updated weights for policy 0, policy_version 7410 (0.0007) [2023-03-07 16:22:28,726][232226] Updated weights for policy 0, policy_version 7420 (0.0006) [2023-03-07 16:22:29,522][232226] Updated weights for policy 0, policy_version 7430 (0.0007) [2023-03-07 16:22:30,069][231894] Fps is (10 sec: 13004.7, 60 sec: 12919.4, 300 sec: 12895.4). Total num frames: 7615488. Throughput: 0: 12911.7. Samples: 7579746. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:22:30,069][231894] Avg episode reward: [(0, '190.353')] [2023-03-07 16:22:30,317][232226] Updated weights for policy 0, policy_version 7440 (0.0006) [2023-03-07 16:22:31,105][232226] Updated weights for policy 0, policy_version 7450 (0.0007) [2023-03-07 16:22:31,921][232226] Updated weights for policy 0, policy_version 7460 (0.0006) [2023-03-07 16:22:32,701][232226] Updated weights for policy 0, policy_version 7470 (0.0006) [2023-03-07 16:22:33,503][232226] Updated weights for policy 0, policy_version 7480 (0.0006) [2023-03-07 16:22:34,301][232226] Updated weights for policy 0, policy_version 7490 (0.0006) [2023-03-07 16:22:35,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12902.4, 300 sec: 12892.0). Total num frames: 7678976. Throughput: 0: 12900.0. Samples: 7657094. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:22:35,069][231894] Avg episode reward: [(0, '193.403')] [2023-03-07 16:22:35,078][232226] Updated weights for policy 0, policy_version 7500 (0.0007) [2023-03-07 16:22:35,870][232226] Updated weights for policy 0, policy_version 7510 (0.0007) [2023-03-07 16:22:36,662][232226] Updated weights for policy 0, policy_version 7520 (0.0007) [2023-03-07 16:22:37,466][232226] Updated weights for policy 0, policy_version 7530 (0.0007) [2023-03-07 16:22:38,250][232226] Updated weights for policy 0, policy_version 7540 (0.0006) [2023-03-07 16:22:39,062][232226] Updated weights for policy 0, policy_version 7550 (0.0006) [2023-03-07 16:22:39,827][232226] Updated weights for policy 0, policy_version 7560 (0.0006) [2023-03-07 16:22:40,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12902.4, 300 sec: 12892.0). Total num frames: 7743488. Throughput: 0: 12903.8. Samples: 7734597. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:22:40,069][231894] Avg episode reward: [(0, '189.700')] [2023-03-07 16:22:40,635][232226] Updated weights for policy 0, policy_version 7570 (0.0006) [2023-03-07 16:22:41,437][232226] Updated weights for policy 0, policy_version 7580 (0.0007) [2023-03-07 16:22:42,218][232226] Updated weights for policy 0, policy_version 7590 (0.0007) [2023-03-07 16:22:43,020][232226] Updated weights for policy 0, policy_version 7600 (0.0007) [2023-03-07 16:22:43,794][232226] Updated weights for policy 0, policy_version 7610 (0.0007) [2023-03-07 16:22:44,593][232226] Updated weights for policy 0, policy_version 7620 (0.0007) [2023-03-07 16:22:45,069][231894] Fps is (10 sec: 13004.8, 60 sec: 12919.5, 300 sec: 12895.5). Total num frames: 7809024. Throughput: 0: 12902.0. Samples: 7773290. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:22:45,069][231894] Avg episode reward: [(0, '190.142')] [2023-03-07 16:22:45,365][232226] Updated weights for policy 0, policy_version 7630 (0.0006) [2023-03-07 16:22:46,149][232226] Updated weights for policy 0, policy_version 7640 (0.0007) [2023-03-07 16:22:46,958][232226] Updated weights for policy 0, policy_version 7650 (0.0006) [2023-03-07 16:22:47,735][232226] Updated weights for policy 0, policy_version 7660 (0.0006) [2023-03-07 16:22:48,541][232226] Updated weights for policy 0, policy_version 7670 (0.0006) [2023-03-07 16:22:49,337][232226] Updated weights for policy 0, policy_version 7680 (0.0006) [2023-03-07 16:22:50,069][231894] Fps is (10 sec: 13004.8, 60 sec: 12919.5, 300 sec: 12895.5). Total num frames: 7873536. Throughput: 0: 12914.1. Samples: 7851181. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:22:50,069][231894] Avg episode reward: [(0, '192.139')] [2023-03-07 16:22:50,130][232226] Updated weights for policy 0, policy_version 7690 (0.0007) [2023-03-07 16:22:50,935][232226] Updated weights for policy 0, policy_version 7700 (0.0006) [2023-03-07 16:22:51,734][232226] Updated weights for policy 0, policy_version 7710 (0.0007) [2023-03-07 16:22:52,532][232226] Updated weights for policy 0, policy_version 7720 (0.0006) [2023-03-07 16:22:53,334][232226] Updated weights for policy 0, policy_version 7730 (0.0006) [2023-03-07 16:22:54,122][232226] Updated weights for policy 0, policy_version 7740 (0.0006) [2023-03-07 16:22:54,918][232226] Updated weights for policy 0, policy_version 7750 (0.0006) [2023-03-07 16:22:55,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12902.4, 300 sec: 12895.5). Total num frames: 7937024. Throughput: 0: 12904.4. Samples: 7928097. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:22:55,069][231894] Avg episode reward: [(0, '190.087')] [2023-03-07 16:22:55,720][232226] Updated weights for policy 0, policy_version 7760 (0.0007) [2023-03-07 16:22:56,511][232226] Updated weights for policy 0, policy_version 7770 (0.0006) [2023-03-07 16:22:57,311][232226] Updated weights for policy 0, policy_version 7780 (0.0007) [2023-03-07 16:22:58,107][232226] Updated weights for policy 0, policy_version 7790 (0.0006) [2023-03-07 16:22:58,906][232226] Updated weights for policy 0, policy_version 7800 (0.0006) [2023-03-07 16:22:59,702][232226] Updated weights for policy 0, policy_version 7810 (0.0006) [2023-03-07 16:23:00,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12902.4, 300 sec: 12895.5). Total num frames: 8001536. Throughput: 0: 12900.4. Samples: 7966732. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 16:23:00,070][231894] Avg episode reward: [(0, '188.179')] [2023-03-07 16:23:00,498][232226] Updated weights for policy 0, policy_version 7820 (0.0006) [2023-03-07 16:23:01,287][232226] Updated weights for policy 0, policy_version 7830 (0.0006) [2023-03-07 16:23:02,084][232226] Updated weights for policy 0, policy_version 7840 (0.0007) [2023-03-07 16:23:02,897][232226] Updated weights for policy 0, policy_version 7850 (0.0006) [2023-03-07 16:23:03,687][232226] Updated weights for policy 0, policy_version 7860 (0.0006) [2023-03-07 16:23:04,494][232226] Updated weights for policy 0, policy_version 7870 (0.0007) [2023-03-07 16:23:05,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12895.5). Total num frames: 8066048. Throughput: 0: 12893.4. Samples: 8043919. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:23:05,069][231894] Avg episode reward: [(0, '185.590')] [2023-03-07 16:23:05,282][232226] Updated weights for policy 0, policy_version 7880 (0.0007) [2023-03-07 16:23:06,079][232226] Updated weights for policy 0, policy_version 7890 (0.0007) [2023-03-07 16:23:06,900][232226] Updated weights for policy 0, policy_version 7900 (0.0007) [2023-03-07 16:23:07,689][232226] Updated weights for policy 0, policy_version 7910 (0.0006) [2023-03-07 16:23:08,459][232226] Updated weights for policy 0, policy_version 7920 (0.0006) [2023-03-07 16:23:09,253][232226] Updated weights for policy 0, policy_version 7930 (0.0007) [2023-03-07 16:23:10,034][232226] Updated weights for policy 0, policy_version 7940 (0.0005) [2023-03-07 16:23:10,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12895.5). Total num frames: 8130560. Throughput: 0: 12885.5. Samples: 8121111. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 16:23:10,069][231894] Avg episode reward: [(0, '186.446')] [2023-03-07 16:23:10,825][232226] Updated weights for policy 0, policy_version 7950 (0.0006) [2023-03-07 16:23:11,623][232226] Updated weights for policy 0, policy_version 7960 (0.0007) [2023-03-07 16:23:12,401][232226] Updated weights for policy 0, policy_version 7970 (0.0006) [2023-03-07 16:23:13,184][232226] Updated weights for policy 0, policy_version 7980 (0.0006) [2023-03-07 16:23:13,999][232226] Updated weights for policy 0, policy_version 7990 (0.0006) [2023-03-07 16:23:14,795][232226] Updated weights for policy 0, policy_version 8000 (0.0007) [2023-03-07 16:23:15,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12895.5). Total num frames: 8195072. Throughput: 0: 12896.5. Samples: 8160088. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 16:23:15,069][231894] Avg episode reward: [(0, '193.586')] [2023-03-07 16:23:15,570][232226] Updated weights for policy 0, policy_version 8010 (0.0006) [2023-03-07 16:23:16,385][232226] Updated weights for policy 0, policy_version 8020 (0.0006) [2023-03-07 16:23:17,150][232226] Updated weights for policy 0, policy_version 8030 (0.0006) [2023-03-07 16:23:17,970][232226] Updated weights for policy 0, policy_version 8040 (0.0006) [2023-03-07 16:23:18,749][232226] Updated weights for policy 0, policy_version 8050 (0.0006) [2023-03-07 16:23:19,548][232226] Updated weights for policy 0, policy_version 8060 (0.0006) [2023-03-07 16:23:20,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12895.5). Total num frames: 8259584. Throughput: 0: 12901.4. Samples: 8237660. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:23:20,070][231894] Avg episode reward: [(0, '195.393')] [2023-03-07 16:23:20,348][232226] Updated weights for policy 0, policy_version 8070 (0.0006) [2023-03-07 16:23:21,137][232226] Updated weights for policy 0, policy_version 8080 (0.0007) [2023-03-07 16:23:21,923][232226] Updated weights for policy 0, policy_version 8090 (0.0006) [2023-03-07 16:23:22,707][232226] Updated weights for policy 0, policy_version 8100 (0.0005) [2023-03-07 16:23:23,521][232226] Updated weights for policy 0, policy_version 8110 (0.0007) [2023-03-07 16:23:24,305][232226] Updated weights for policy 0, policy_version 8120 (0.0006) [2023-03-07 16:23:25,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12898.9). Total num frames: 8324096. Throughput: 0: 12897.7. Samples: 8314994. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:23:25,069][231894] Avg episode reward: [(0, '193.764')] [2023-03-07 16:23:25,091][232226] Updated weights for policy 0, policy_version 8130 (0.0006) [2023-03-07 16:23:25,897][232226] Updated weights for policy 0, policy_version 8140 (0.0006) [2023-03-07 16:23:26,670][232226] Updated weights for policy 0, policy_version 8150 (0.0006) [2023-03-07 16:23:27,460][232226] Updated weights for policy 0, policy_version 8160 (0.0007) [2023-03-07 16:23:28,255][232226] Updated weights for policy 0, policy_version 8170 (0.0005) [2023-03-07 16:23:29,041][232226] Updated weights for policy 0, policy_version 8180 (0.0008) [2023-03-07 16:23:29,809][232226] Updated weights for policy 0, policy_version 8190 (0.0006) [2023-03-07 16:23:30,069][231894] Fps is (10 sec: 13005.0, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 8389632. Throughput: 0: 12902.1. Samples: 8353882. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:23:30,069][231894] Avg episode reward: [(0, '197.108')] [2023-03-07 16:23:30,622][232226] Updated weights for policy 0, policy_version 8200 (0.0006) [2023-03-07 16:23:31,415][232226] Updated weights for policy 0, policy_version 8210 (0.0006) [2023-03-07 16:23:32,217][232226] Updated weights for policy 0, policy_version 8220 (0.0006) [2023-03-07 16:23:33,013][232226] Updated weights for policy 0, policy_version 8230 (0.0006) [2023-03-07 16:23:33,810][232226] Updated weights for policy 0, policy_version 8240 (0.0007) [2023-03-07 16:23:34,590][232226] Updated weights for policy 0, policy_version 8250 (0.0007) [2023-03-07 16:23:35,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 8453120. Throughput: 0: 12894.4. Samples: 8431428. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:23:35,069][231894] Avg episode reward: [(0, '187.993')] [2023-03-07 16:23:35,401][232226] Updated weights for policy 0, policy_version 8260 (0.0006) [2023-03-07 16:23:36,192][232226] Updated weights for policy 0, policy_version 8270 (0.0007) [2023-03-07 16:23:36,990][232226] Updated weights for policy 0, policy_version 8280 (0.0007) [2023-03-07 16:23:37,785][232226] Updated weights for policy 0, policy_version 8290 (0.0006) [2023-03-07 16:23:38,582][232226] Updated weights for policy 0, policy_version 8300 (0.0006) [2023-03-07 16:23:39,378][232226] Updated weights for policy 0, policy_version 8310 (0.0006) [2023-03-07 16:23:40,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 8517632. Throughput: 0: 12897.2. Samples: 8508472. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:23:40,069][231894] Avg episode reward: [(0, '189.730')] [2023-03-07 16:23:40,164][232226] Updated weights for policy 0, policy_version 8320 (0.0007) [2023-03-07 16:23:40,957][232226] Updated weights for policy 0, policy_version 8330 (0.0007) [2023-03-07 16:23:41,759][232226] Updated weights for policy 0, policy_version 8340 (0.0006) [2023-03-07 16:23:42,543][232226] Updated weights for policy 0, policy_version 8350 (0.0006) [2023-03-07 16:23:43,342][232226] Updated weights for policy 0, policy_version 8360 (0.0007) [2023-03-07 16:23:44,139][232226] Updated weights for policy 0, policy_version 8370 (0.0006) [2023-03-07 16:23:44,942][232226] Updated weights for policy 0, policy_version 8380 (0.0006) [2023-03-07 16:23:45,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12902.4). Total num frames: 8582144. Throughput: 0: 12901.4. Samples: 8547292. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:23:45,069][231894] Avg episode reward: [(0, '194.786')] [2023-03-07 16:23:45,725][232226] Updated weights for policy 0, policy_version 8390 (0.0006) [2023-03-07 16:23:46,535][232226] Updated weights for policy 0, policy_version 8400 (0.0006) [2023-03-07 16:23:47,324][232226] Updated weights for policy 0, policy_version 8410 (0.0006) [2023-03-07 16:23:48,113][232226] Updated weights for policy 0, policy_version 8420 (0.0006) [2023-03-07 16:23:48,907][232226] Updated weights for policy 0, policy_version 8430 (0.0006) [2023-03-07 16:23:49,691][232226] Updated weights for policy 0, policy_version 8440 (0.0006) [2023-03-07 16:23:50,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12902.4). Total num frames: 8646656. Throughput: 0: 12904.5. Samples: 8624623. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:23:50,069][231894] Avg episode reward: [(0, '192.101')] [2023-03-07 16:23:50,471][232226] Updated weights for policy 0, policy_version 8450 (0.0006) [2023-03-07 16:23:51,279][232226] Updated weights for policy 0, policy_version 8460 (0.0007) [2023-03-07 16:23:52,065][232226] Updated weights for policy 0, policy_version 8470 (0.0006) [2023-03-07 16:23:52,854][232226] Updated weights for policy 0, policy_version 8480 (0.0006) [2023-03-07 16:23:53,667][232226] Updated weights for policy 0, policy_version 8490 (0.0006) [2023-03-07 16:23:54,458][232226] Updated weights for policy 0, policy_version 8500 (0.0007) [2023-03-07 16:23:55,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 8711168. Throughput: 0: 12912.6. Samples: 8702181. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:23:55,070][231894] Avg episode reward: [(0, '186.046')] [2023-03-07 16:23:55,255][232226] Updated weights for policy 0, policy_version 8510 (0.0006) [2023-03-07 16:23:56,049][232226] Updated weights for policy 0, policy_version 8520 (0.0006) [2023-03-07 16:23:56,871][232226] Updated weights for policy 0, policy_version 8530 (0.0007) [2023-03-07 16:23:57,637][232226] Updated weights for policy 0, policy_version 8540 (0.0007) [2023-03-07 16:23:58,449][232226] Updated weights for policy 0, policy_version 8550 (0.0006) [2023-03-07 16:23:59,218][232226] Updated weights for policy 0, policy_version 8560 (0.0006) [2023-03-07 16:24:00,015][232226] Updated weights for policy 0, policy_version 8570 (0.0006) [2023-03-07 16:24:00,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 8775680. Throughput: 0: 12905.2. Samples: 8740822. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:24:00,070][231894] Avg episode reward: [(0, '189.376')] [2023-03-07 16:24:00,832][232226] Updated weights for policy 0, policy_version 8580 (0.0006) [2023-03-07 16:24:01,617][232226] Updated weights for policy 0, policy_version 8590 (0.0006) [2023-03-07 16:24:02,408][232226] Updated weights for policy 0, policy_version 8600 (0.0007) [2023-03-07 16:24:03,189][232226] Updated weights for policy 0, policy_version 8610 (0.0007) [2023-03-07 16:24:03,997][232226] Updated weights for policy 0, policy_version 8620 (0.0006) [2023-03-07 16:24:04,773][232226] Updated weights for policy 0, policy_version 8630 (0.0006) [2023-03-07 16:24:05,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 8840192. Throughput: 0: 12897.1. Samples: 8818029. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:24:05,069][231894] Avg episode reward: [(0, '186.517')] [2023-03-07 16:24:05,568][232226] Updated weights for policy 0, policy_version 8640 (0.0007) [2023-03-07 16:24:06,363][232226] Updated weights for policy 0, policy_version 8650 (0.0006) [2023-03-07 16:24:07,149][232226] Updated weights for policy 0, policy_version 8660 (0.0007) [2023-03-07 16:24:07,967][232226] Updated weights for policy 0, policy_version 8670 (0.0007) [2023-03-07 16:24:08,759][232226] Updated weights for policy 0, policy_version 8680 (0.0006) [2023-03-07 16:24:09,562][232226] Updated weights for policy 0, policy_version 8690 (0.0006) [2023-03-07 16:24:10,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 8904704. Throughput: 0: 12897.8. Samples: 8895397. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:24:10,069][231894] Avg episode reward: [(0, '196.089')] [2023-03-07 16:24:10,351][232226] Updated weights for policy 0, policy_version 8700 (0.0006) [2023-03-07 16:24:11,145][232226] Updated weights for policy 0, policy_version 8710 (0.0006) [2023-03-07 16:24:11,936][232226] Updated weights for policy 0, policy_version 8720 (0.0006) [2023-03-07 16:24:12,725][232226] Updated weights for policy 0, policy_version 8730 (0.0006) [2023-03-07 16:24:13,529][232226] Updated weights for policy 0, policy_version 8740 (0.0006) [2023-03-07 16:24:14,319][232226] Updated weights for policy 0, policy_version 8750 (0.0007) [2023-03-07 16:24:15,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 8969216. Throughput: 0: 12893.5. Samples: 8934090. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:24:15,069][231894] Avg episode reward: [(0, '188.970')] [2023-03-07 16:24:15,111][232226] Updated weights for policy 0, policy_version 8760 (0.0006) [2023-03-07 16:24:15,919][232226] Updated weights for policy 0, policy_version 8770 (0.0006) [2023-03-07 16:24:16,729][232226] Updated weights for policy 0, policy_version 8780 (0.0006) [2023-03-07 16:24:17,529][232226] Updated weights for policy 0, policy_version 8790 (0.0007) [2023-03-07 16:24:18,339][232226] Updated weights for policy 0, policy_version 8800 (0.0006) [2023-03-07 16:24:19,138][232226] Updated weights for policy 0, policy_version 8810 (0.0005) [2023-03-07 16:24:19,902][232226] Updated weights for policy 0, policy_version 8820 (0.0005) [2023-03-07 16:24:20,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 9033728. Throughput: 0: 12881.1. Samples: 9011078. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:24:20,069][231894] Avg episode reward: [(0, '185.226')] [2023-03-07 16:24:20,688][232226] Updated weights for policy 0, policy_version 8830 (0.0007) [2023-03-07 16:24:21,489][232226] Updated weights for policy 0, policy_version 8840 (0.0007) [2023-03-07 16:24:22,281][232226] Updated weights for policy 0, policy_version 8850 (0.0006) [2023-03-07 16:24:23,065][232226] Updated weights for policy 0, policy_version 8860 (0.0006) [2023-03-07 16:24:23,866][232226] Updated weights for policy 0, policy_version 8870 (0.0006) [2023-03-07 16:24:24,669][232226] Updated weights for policy 0, policy_version 8880 (0.0006) [2023-03-07 16:24:25,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 9098240. Throughput: 0: 12891.8. Samples: 9088603. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:24:25,069][231894] Avg episode reward: [(0, '189.019')] [2023-03-07 16:24:25,074][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000008885_9098240.pth... [2023-03-07 16:24:25,104][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000005862_6002688.pth [2023-03-07 16:24:25,469][232226] Updated weights for policy 0, policy_version 8890 (0.0007) [2023-03-07 16:24:26,267][232226] Updated weights for policy 0, policy_version 8900 (0.0007) [2023-03-07 16:24:27,049][232226] Updated weights for policy 0, policy_version 8910 (0.0006) [2023-03-07 16:24:27,854][232226] Updated weights for policy 0, policy_version 8920 (0.0006) [2023-03-07 16:24:28,662][232226] Updated weights for policy 0, policy_version 8930 (0.0007) [2023-03-07 16:24:29,449][232226] Updated weights for policy 0, policy_version 8940 (0.0006) [2023-03-07 16:24:30,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12898.9). Total num frames: 9161728. Throughput: 0: 12883.8. Samples: 9127062. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:24:30,069][231894] Avg episode reward: [(0, '191.999')] [2023-03-07 16:24:30,252][232226] Updated weights for policy 0, policy_version 8950 (0.0007) [2023-03-07 16:24:31,058][232226] Updated weights for policy 0, policy_version 8960 (0.0006) [2023-03-07 16:24:31,860][232226] Updated weights for policy 0, policy_version 8970 (0.0006) [2023-03-07 16:24:32,630][232226] Updated weights for policy 0, policy_version 8980 (0.0006) [2023-03-07 16:24:33,431][232226] Updated weights for policy 0, policy_version 8990 (0.0006) [2023-03-07 16:24:34,210][232226] Updated weights for policy 0, policy_version 9000 (0.0006) [2023-03-07 16:24:35,014][232226] Updated weights for policy 0, policy_version 9010 (0.0006) [2023-03-07 16:24:35,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12885.3, 300 sec: 12898.9). Total num frames: 9226240. Throughput: 0: 12880.1. Samples: 9204229. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:24:35,069][231894] Avg episode reward: [(0, '194.922')] [2023-03-07 16:24:35,804][232226] Updated weights for policy 0, policy_version 9020 (0.0007) [2023-03-07 16:24:36,596][232226] Updated weights for policy 0, policy_version 9030 (0.0007) [2023-03-07 16:24:37,402][232226] Updated weights for policy 0, policy_version 9040 (0.0007) [2023-03-07 16:24:38,190][232226] Updated weights for policy 0, policy_version 9050 (0.0006) [2023-03-07 16:24:38,983][232226] Updated weights for policy 0, policy_version 9060 (0.0006) [2023-03-07 16:24:39,797][232226] Updated weights for policy 0, policy_version 9070 (0.0006) [2023-03-07 16:24:40,069][231894] Fps is (10 sec: 12902.2, 60 sec: 12885.3, 300 sec: 12898.9). Total num frames: 9290752. Throughput: 0: 12877.8. Samples: 9281682. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:24:40,069][231894] Avg episode reward: [(0, '189.735')] [2023-03-07 16:24:40,581][232226] Updated weights for policy 0, policy_version 9080 (0.0006) [2023-03-07 16:24:41,368][232226] Updated weights for policy 0, policy_version 9090 (0.0006) [2023-03-07 16:24:42,151][232226] Updated weights for policy 0, policy_version 9100 (0.0006) [2023-03-07 16:24:42,943][232226] Updated weights for policy 0, policy_version 9110 (0.0006) [2023-03-07 16:24:43,753][232226] Updated weights for policy 0, policy_version 9120 (0.0007) [2023-03-07 16:24:44,526][232226] Updated weights for policy 0, policy_version 9130 (0.0007) [2023-03-07 16:24:45,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.4, 300 sec: 12895.5). Total num frames: 9355264. Throughput: 0: 12880.2. Samples: 9320429. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:24:45,069][231894] Avg episode reward: [(0, '191.014')] [2023-03-07 16:24:45,322][232226] Updated weights for policy 0, policy_version 9140 (0.0006) [2023-03-07 16:24:46,114][232226] Updated weights for policy 0, policy_version 9150 (0.0006) [2023-03-07 16:24:46,921][232226] Updated weights for policy 0, policy_version 9160 (0.0007) [2023-03-07 16:24:47,715][232226] Updated weights for policy 0, policy_version 9170 (0.0006) [2023-03-07 16:24:48,518][232226] Updated weights for policy 0, policy_version 9180 (0.0007) [2023-03-07 16:24:49,286][232226] Updated weights for policy 0, policy_version 9190 (0.0007) [2023-03-07 16:24:50,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12895.5). Total num frames: 9419776. Throughput: 0: 12880.4. Samples: 9397647. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:24:50,069][231894] Avg episode reward: [(0, '191.483')] [2023-03-07 16:24:50,091][232226] Updated weights for policy 0, policy_version 9200 (0.0006) [2023-03-07 16:24:50,883][232226] Updated weights for policy 0, policy_version 9210 (0.0007) [2023-03-07 16:24:51,677][232226] Updated weights for policy 0, policy_version 9220 (0.0007) [2023-03-07 16:24:52,478][232226] Updated weights for policy 0, policy_version 9230 (0.0007) [2023-03-07 16:24:53,251][232226] Updated weights for policy 0, policy_version 9240 (0.0007) [2023-03-07 16:24:54,045][232226] Updated weights for policy 0, policy_version 9250 (0.0007) [2023-03-07 16:24:54,837][232226] Updated weights for policy 0, policy_version 9260 (0.0008) [2023-03-07 16:24:55,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.4, 300 sec: 12895.5). Total num frames: 9484288. Throughput: 0: 12888.6. Samples: 9475382. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:24:55,069][231894] Avg episode reward: [(0, '187.852')] [2023-03-07 16:24:55,623][232226] Updated weights for policy 0, policy_version 9270 (0.0006) [2023-03-07 16:24:56,429][232226] Updated weights for policy 0, policy_version 9280 (0.0006) [2023-03-07 16:24:57,206][232226] Updated weights for policy 0, policy_version 9290 (0.0006) [2023-03-07 16:24:58,001][232226] Updated weights for policy 0, policy_version 9300 (0.0006) [2023-03-07 16:24:58,782][232226] Updated weights for policy 0, policy_version 9310 (0.0006) [2023-03-07 16:24:59,563][232226] Updated weights for policy 0, policy_version 9320 (0.0006) [2023-03-07 16:25:00,069][231894] Fps is (10 sec: 13004.8, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 9549824. Throughput: 0: 12894.2. Samples: 9514331. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:25:00,069][231894] Avg episode reward: [(0, '185.624')] [2023-03-07 16:25:00,373][232226] Updated weights for policy 0, policy_version 9330 (0.0006) [2023-03-07 16:25:01,155][232226] Updated weights for policy 0, policy_version 9340 (0.0006) [2023-03-07 16:25:01,936][232226] Updated weights for policy 0, policy_version 9350 (0.0006) [2023-03-07 16:25:02,732][232226] Updated weights for policy 0, policy_version 9360 (0.0006) [2023-03-07 16:25:03,550][232226] Updated weights for policy 0, policy_version 9370 (0.0006) [2023-03-07 16:25:04,330][232226] Updated weights for policy 0, policy_version 9380 (0.0007) [2023-03-07 16:25:05,069][231894] Fps is (10 sec: 13004.7, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 9614336. Throughput: 0: 12908.2. Samples: 9591948. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:25:05,069][231894] Avg episode reward: [(0, '193.331')] [2023-03-07 16:25:05,130][232226] Updated weights for policy 0, policy_version 9390 (0.0005) [2023-03-07 16:25:05,923][232226] Updated weights for policy 0, policy_version 9400 (0.0006) [2023-03-07 16:25:06,705][232226] Updated weights for policy 0, policy_version 9410 (0.0006) [2023-03-07 16:25:07,504][232226] Updated weights for policy 0, policy_version 9420 (0.0007) [2023-03-07 16:25:08,302][232226] Updated weights for policy 0, policy_version 9430 (0.0006) [2023-03-07 16:25:09,064][232226] Updated weights for policy 0, policy_version 9440 (0.0006) [2023-03-07 16:25:09,877][232226] Updated weights for policy 0, policy_version 9450 (0.0007) [2023-03-07 16:25:10,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 9678848. Throughput: 0: 12913.7. Samples: 9669721. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:25:10,069][231894] Avg episode reward: [(0, '196.001')] [2023-03-07 16:25:10,677][232226] Updated weights for policy 0, policy_version 9460 (0.0007) [2023-03-07 16:25:11,455][232226] Updated weights for policy 0, policy_version 9470 (0.0006) [2023-03-07 16:25:12,247][232226] Updated weights for policy 0, policy_version 9480 (0.0005) [2023-03-07 16:25:13,028][232226] Updated weights for policy 0, policy_version 9490 (0.0006) [2023-03-07 16:25:13,821][232226] Updated weights for policy 0, policy_version 9500 (0.0006) [2023-03-07 16:25:14,606][232226] Updated weights for policy 0, policy_version 9510 (0.0006) [2023-03-07 16:25:15,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 9743360. Throughput: 0: 12918.9. Samples: 9708413. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:25:15,069][231894] Avg episode reward: [(0, '194.125')] [2023-03-07 16:25:15,397][232226] Updated weights for policy 0, policy_version 9520 (0.0006) [2023-03-07 16:25:16,188][232226] Updated weights for policy 0, policy_version 9530 (0.0006) [2023-03-07 16:25:16,985][232226] Updated weights for policy 0, policy_version 9540 (0.0005) [2023-03-07 16:25:17,769][232226] Updated weights for policy 0, policy_version 9550 (0.0007) [2023-03-07 16:25:18,554][232226] Updated weights for policy 0, policy_version 9560 (0.0006) [2023-03-07 16:25:19,334][232226] Updated weights for policy 0, policy_version 9570 (0.0005) [2023-03-07 16:25:20,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 9807872. Throughput: 0: 12929.3. Samples: 9786049. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:25:20,069][231894] Avg episode reward: [(0, '184.525')] [2023-03-07 16:25:20,146][232226] Updated weights for policy 0, policy_version 9580 (0.0006) [2023-03-07 16:25:20,916][232226] Updated weights for policy 0, policy_version 9590 (0.0007) [2023-03-07 16:25:21,738][232226] Updated weights for policy 0, policy_version 9600 (0.0006) [2023-03-07 16:25:22,491][232226] Updated weights for policy 0, policy_version 9610 (0.0006) [2023-03-07 16:25:23,304][232226] Updated weights for policy 0, policy_version 9620 (0.0007) [2023-03-07 16:25:24,112][232226] Updated weights for policy 0, policy_version 9630 (0.0007) [2023-03-07 16:25:24,886][232226] Updated weights for policy 0, policy_version 9640 (0.0006) [2023-03-07 16:25:25,069][231894] Fps is (10 sec: 13004.7, 60 sec: 12919.5, 300 sec: 12905.9). Total num frames: 9873408. Throughput: 0: 12937.9. Samples: 9863885. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:25:25,069][231894] Avg episode reward: [(0, '187.143')] [2023-03-07 16:25:25,687][232226] Updated weights for policy 0, policy_version 9650 (0.0006) [2023-03-07 16:25:26,471][232226] Updated weights for policy 0, policy_version 9660 (0.0006) [2023-03-07 16:25:27,255][232226] Updated weights for policy 0, policy_version 9670 (0.0006) [2023-03-07 16:25:28,054][232226] Updated weights for policy 0, policy_version 9680 (0.0006) [2023-03-07 16:25:28,837][232226] Updated weights for policy 0, policy_version 9690 (0.0007) [2023-03-07 16:25:29,624][232226] Updated weights for policy 0, policy_version 9700 (0.0007) [2023-03-07 16:25:30,069][231894] Fps is (10 sec: 13004.8, 60 sec: 12936.5, 300 sec: 12905.9). Total num frames: 9937920. Throughput: 0: 12941.3. Samples: 9902787. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:25:30,069][231894] Avg episode reward: [(0, '188.601')] [2023-03-07 16:25:30,419][232226] Updated weights for policy 0, policy_version 9710 (0.0006) [2023-03-07 16:25:31,226][232226] Updated weights for policy 0, policy_version 9720 (0.0007) [2023-03-07 16:25:32,014][232226] Updated weights for policy 0, policy_version 9730 (0.0006) [2023-03-07 16:25:32,823][232226] Updated weights for policy 0, policy_version 9740 (0.0007) [2023-03-07 16:25:33,600][232226] Updated weights for policy 0, policy_version 9750 (0.0006) [2023-03-07 16:25:34,412][232226] Updated weights for policy 0, policy_version 9760 (0.0007) [2023-03-07 16:25:35,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12936.5, 300 sec: 12905.9). Total num frames: 10002432. Throughput: 0: 12944.0. Samples: 9980127. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:25:35,069][231894] Avg episode reward: [(0, '192.544')] [2023-03-07 16:25:35,217][232226] Updated weights for policy 0, policy_version 9770 (0.0007) [2023-03-07 16:25:35,990][232226] Updated weights for policy 0, policy_version 9780 (0.0007) [2023-03-07 16:25:36,794][232226] Updated weights for policy 0, policy_version 9790 (0.0006) [2023-03-07 16:25:37,585][232226] Updated weights for policy 0, policy_version 9800 (0.0006) [2023-03-07 16:25:38,365][232226] Updated weights for policy 0, policy_version 9810 (0.0006) [2023-03-07 16:25:39,163][232226] Updated weights for policy 0, policy_version 9820 (0.0006) [2023-03-07 16:25:39,949][232226] Updated weights for policy 0, policy_version 9830 (0.0007) [2023-03-07 16:25:40,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12936.6, 300 sec: 12905.9). Total num frames: 10066944. Throughput: 0: 12937.5. Samples: 10057571. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:25:40,069][231894] Avg episode reward: [(0, '197.727')] [2023-03-07 16:25:40,742][232226] Updated weights for policy 0, policy_version 9840 (0.0006) [2023-03-07 16:25:41,526][232226] Updated weights for policy 0, policy_version 9850 (0.0006) [2023-03-07 16:25:42,331][232226] Updated weights for policy 0, policy_version 9860 (0.0006) [2023-03-07 16:25:43,112][232226] Updated weights for policy 0, policy_version 9870 (0.0006) [2023-03-07 16:25:43,913][232226] Updated weights for policy 0, policy_version 9880 (0.0006) [2023-03-07 16:25:44,713][232226] Updated weights for policy 0, policy_version 9890 (0.0007) [2023-03-07 16:25:45,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12936.5, 300 sec: 12905.9). Total num frames: 10131456. Throughput: 0: 12936.5. Samples: 10096473. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:25:45,070][231894] Avg episode reward: [(0, '192.603')] [2023-03-07 16:25:45,508][232226] Updated weights for policy 0, policy_version 9900 (0.0006) [2023-03-07 16:25:46,295][232226] Updated weights for policy 0, policy_version 9910 (0.0007) [2023-03-07 16:25:47,086][232226] Updated weights for policy 0, policy_version 9920 (0.0007) [2023-03-07 16:25:47,878][232226] Updated weights for policy 0, policy_version 9930 (0.0006) [2023-03-07 16:25:48,666][232226] Updated weights for policy 0, policy_version 9940 (0.0006) [2023-03-07 16:25:49,476][232226] Updated weights for policy 0, policy_version 9950 (0.0006) [2023-03-07 16:25:50,069][231894] Fps is (10 sec: 12902.1, 60 sec: 12936.5, 300 sec: 12905.9). Total num frames: 10195968. Throughput: 0: 12929.9. Samples: 10173793. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:25:50,070][231894] Avg episode reward: [(0, '197.760')] [2023-03-07 16:25:50,268][232226] Updated weights for policy 0, policy_version 9960 (0.0007) [2023-03-07 16:25:51,041][232226] Updated weights for policy 0, policy_version 9970 (0.0006) [2023-03-07 16:25:51,839][232226] Updated weights for policy 0, policy_version 9980 (0.0007) [2023-03-07 16:25:52,637][232226] Updated weights for policy 0, policy_version 9990 (0.0007) [2023-03-07 16:25:53,417][232226] Updated weights for policy 0, policy_version 10000 (0.0006) [2023-03-07 16:25:54,218][232226] Updated weights for policy 0, policy_version 10010 (0.0006) [2023-03-07 16:25:55,009][232226] Updated weights for policy 0, policy_version 10020 (0.0007) [2023-03-07 16:25:55,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12936.5, 300 sec: 12905.9). Total num frames: 10260480. Throughput: 0: 12925.7. Samples: 10251377. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:25:55,069][231894] Avg episode reward: [(0, '182.205')] [2023-03-07 16:25:55,805][232226] Updated weights for policy 0, policy_version 10030 (0.0006) [2023-03-07 16:25:56,595][232226] Updated weights for policy 0, policy_version 10040 (0.0006) [2023-03-07 16:25:57,397][232226] Updated weights for policy 0, policy_version 10050 (0.0006) [2023-03-07 16:25:58,205][232226] Updated weights for policy 0, policy_version 10060 (0.0007) [2023-03-07 16:25:58,989][232226] Updated weights for policy 0, policy_version 10070 (0.0006) [2023-03-07 16:25:59,779][232226] Updated weights for policy 0, policy_version 10080 (0.0006) [2023-03-07 16:26:00,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12919.5, 300 sec: 12905.9). Total num frames: 10324992. Throughput: 0: 12925.8. Samples: 10290073. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:26:00,069][231894] Avg episode reward: [(0, '190.152')] [2023-03-07 16:26:00,568][232226] Updated weights for policy 0, policy_version 10090 (0.0008) [2023-03-07 16:26:01,361][232226] Updated weights for policy 0, policy_version 10100 (0.0006) [2023-03-07 16:26:02,141][232226] Updated weights for policy 0, policy_version 10110 (0.0006) [2023-03-07 16:26:02,940][232226] Updated weights for policy 0, policy_version 10120 (0.0006) [2023-03-07 16:26:03,733][232226] Updated weights for policy 0, policy_version 10130 (0.0006) [2023-03-07 16:26:04,526][232226] Updated weights for policy 0, policy_version 10140 (0.0006) [2023-03-07 16:26:05,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12919.5, 300 sec: 12905.9). Total num frames: 10389504. Throughput: 0: 12923.5. Samples: 10367605. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:26:05,070][231894] Avg episode reward: [(0, '193.442')] [2023-03-07 16:26:05,322][232226] Updated weights for policy 0, policy_version 10150 (0.0006) [2023-03-07 16:26:06,114][232226] Updated weights for policy 0, policy_version 10160 (0.0007) [2023-03-07 16:26:06,919][232226] Updated weights for policy 0, policy_version 10170 (0.0006) [2023-03-07 16:26:07,700][232226] Updated weights for policy 0, policy_version 10180 (0.0006) [2023-03-07 16:26:08,503][232226] Updated weights for policy 0, policy_version 10190 (0.0006) [2023-03-07 16:26:09,294][232226] Updated weights for policy 0, policy_version 10200 (0.0006) [2023-03-07 16:26:10,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12919.5, 300 sec: 12905.9). Total num frames: 10454016. Throughput: 0: 12914.5. Samples: 10445036. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:26:10,069][231894] Avg episode reward: [(0, '193.693')] [2023-03-07 16:26:10,085][232226] Updated weights for policy 0, policy_version 10210 (0.0006) [2023-03-07 16:26:10,871][232226] Updated weights for policy 0, policy_version 10220 (0.0006) [2023-03-07 16:26:11,660][232226] Updated weights for policy 0, policy_version 10230 (0.0006) [2023-03-07 16:26:12,453][232226] Updated weights for policy 0, policy_version 10240 (0.0006) [2023-03-07 16:26:13,260][232226] Updated weights for policy 0, policy_version 10250 (0.0006) [2023-03-07 16:26:14,065][232226] Updated weights for policy 0, policy_version 10260 (0.0006) [2023-03-07 16:26:14,865][232226] Updated weights for policy 0, policy_version 10270 (0.0006) [2023-03-07 16:26:15,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12919.5, 300 sec: 12905.9). Total num frames: 10518528. Throughput: 0: 12912.5. Samples: 10483848. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:26:15,070][231894] Avg episode reward: [(0, '184.422')] [2023-03-07 16:26:15,654][232226] Updated weights for policy 0, policy_version 10280 (0.0007) [2023-03-07 16:26:16,444][232226] Updated weights for policy 0, policy_version 10290 (0.0006) [2023-03-07 16:26:17,257][232226] Updated weights for policy 0, policy_version 10300 (0.0007) [2023-03-07 16:26:18,048][232226] Updated weights for policy 0, policy_version 10310 (0.0006) [2023-03-07 16:26:18,837][232226] Updated weights for policy 0, policy_version 10320 (0.0006) [2023-03-07 16:26:19,633][232226] Updated weights for policy 0, policy_version 10330 (0.0006) [2023-03-07 16:26:20,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12919.5, 300 sec: 12905.9). Total num frames: 10583040. Throughput: 0: 12902.6. Samples: 10560742. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:26:20,070][231894] Avg episode reward: [(0, '192.206')] [2023-03-07 16:26:20,430][232226] Updated weights for policy 0, policy_version 10340 (0.0008) [2023-03-07 16:26:21,225][232226] Updated weights for policy 0, policy_version 10350 (0.0006) [2023-03-07 16:26:22,034][232226] Updated weights for policy 0, policy_version 10360 (0.0007) [2023-03-07 16:26:22,832][232226] Updated weights for policy 0, policy_version 10370 (0.0006) [2023-03-07 16:26:23,601][232226] Updated weights for policy 0, policy_version 10380 (0.0006) [2023-03-07 16:26:24,401][232226] Updated weights for policy 0, policy_version 10390 (0.0007) [2023-03-07 16:26:25,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12905.9). Total num frames: 10647552. Throughput: 0: 12900.9. Samples: 10638112. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:26:25,069][231894] Avg episode reward: [(0, '188.827')] [2023-03-07 16:26:25,073][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000010398_10647552.pth... [2023-03-07 16:26:25,104][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000007374_7550976.pth [2023-03-07 16:26:25,188][232226] Updated weights for policy 0, policy_version 10400 (0.0007) [2023-03-07 16:26:25,982][232226] Updated weights for policy 0, policy_version 10410 (0.0006) [2023-03-07 16:26:26,788][232226] Updated weights for policy 0, policy_version 10420 (0.0007) [2023-03-07 16:26:27,564][232226] Updated weights for policy 0, policy_version 10430 (0.0007) [2023-03-07 16:26:28,349][232226] Updated weights for policy 0, policy_version 10440 (0.0005) [2023-03-07 16:26:29,134][232226] Updated weights for policy 0, policy_version 10450 (0.0007) [2023-03-07 16:26:29,931][232226] Updated weights for policy 0, policy_version 10460 (0.0006) [2023-03-07 16:26:30,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12902.4, 300 sec: 12905.9). Total num frames: 10712064. Throughput: 0: 12898.7. Samples: 10676912. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:26:30,069][231894] Avg episode reward: [(0, '192.987')] [2023-03-07 16:26:30,745][232226] Updated weights for policy 0, policy_version 10470 (0.0007) [2023-03-07 16:26:31,525][232226] Updated weights for policy 0, policy_version 10480 (0.0006) [2023-03-07 16:26:32,319][232226] Updated weights for policy 0, policy_version 10490 (0.0007) [2023-03-07 16:26:33,105][232226] Updated weights for policy 0, policy_version 10500 (0.0006) [2023-03-07 16:26:33,905][232226] Updated weights for policy 0, policy_version 10510 (0.0007) [2023-03-07 16:26:34,688][232226] Updated weights for policy 0, policy_version 10520 (0.0006) [2023-03-07 16:26:35,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12905.9). Total num frames: 10776576. Throughput: 0: 12906.8. Samples: 10754597. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:26:35,069][231894] Avg episode reward: [(0, '189.940')] [2023-03-07 16:26:35,479][232226] Updated weights for policy 0, policy_version 10530 (0.0006) [2023-03-07 16:26:36,300][232226] Updated weights for policy 0, policy_version 10540 (0.0006) [2023-03-07 16:26:37,071][232226] Updated weights for policy 0, policy_version 10550 (0.0007) [2023-03-07 16:26:37,862][232226] Updated weights for policy 0, policy_version 10560 (0.0006) [2023-03-07 16:26:38,686][232226] Updated weights for policy 0, policy_version 10570 (0.0006) [2023-03-07 16:26:39,469][232226] Updated weights for policy 0, policy_version 10580 (0.0006) [2023-03-07 16:26:40,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12905.9). Total num frames: 10841088. Throughput: 0: 12898.3. Samples: 10831801. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:26:40,069][231894] Avg episode reward: [(0, '195.424')] [2023-03-07 16:26:40,270][232226] Updated weights for policy 0, policy_version 10590 (0.0006) [2023-03-07 16:26:41,072][232226] Updated weights for policy 0, policy_version 10600 (0.0008) [2023-03-07 16:26:41,860][232226] Updated weights for policy 0, policy_version 10610 (0.0007) [2023-03-07 16:26:42,645][232226] Updated weights for policy 0, policy_version 10620 (0.0006) [2023-03-07 16:26:43,463][232226] Updated weights for policy 0, policy_version 10630 (0.0006) [2023-03-07 16:26:44,237][232226] Updated weights for policy 0, policy_version 10640 (0.0006) [2023-03-07 16:26:45,014][232226] Updated weights for policy 0, policy_version 10650 (0.0006) [2023-03-07 16:26:45,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12905.9). Total num frames: 10905600. Throughput: 0: 12896.8. Samples: 10870426. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:26:45,069][231894] Avg episode reward: [(0, '188.212')] [2023-03-07 16:26:45,813][232226] Updated weights for policy 0, policy_version 10660 (0.0006) [2023-03-07 16:26:46,614][232226] Updated weights for policy 0, policy_version 10670 (0.0006) [2023-03-07 16:26:47,391][232226] Updated weights for policy 0, policy_version 10680 (0.0007) [2023-03-07 16:26:48,213][232226] Updated weights for policy 0, policy_version 10690 (0.0006) [2023-03-07 16:26:49,006][232226] Updated weights for policy 0, policy_version 10700 (0.0006) [2023-03-07 16:26:49,802][232226] Updated weights for policy 0, policy_version 10710 (0.0006) [2023-03-07 16:26:50,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12905.9). Total num frames: 10970112. Throughput: 0: 12894.1. Samples: 10947837. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:26:50,069][231894] Avg episode reward: [(0, '184.747')] [2023-03-07 16:26:50,598][232226] Updated weights for policy 0, policy_version 10720 (0.0006) [2023-03-07 16:26:51,410][232226] Updated weights for policy 0, policy_version 10730 (0.0007) [2023-03-07 16:26:52,193][232226] Updated weights for policy 0, policy_version 10740 (0.0007) [2023-03-07 16:26:52,961][232226] Updated weights for policy 0, policy_version 10750 (0.0006) [2023-03-07 16:26:53,782][232226] Updated weights for policy 0, policy_version 10760 (0.0006) [2023-03-07 16:26:54,568][232226] Updated weights for policy 0, policy_version 10770 (0.0007) [2023-03-07 16:26:55,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12905.9). Total num frames: 11034624. Throughput: 0: 12888.8. Samples: 11025031. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:26:55,069][231894] Avg episode reward: [(0, '186.039')] [2023-03-07 16:26:55,338][232226] Updated weights for policy 0, policy_version 10780 (0.0006) [2023-03-07 16:26:56,166][232226] Updated weights for policy 0, policy_version 10790 (0.0005) [2023-03-07 16:26:56,946][232226] Updated weights for policy 0, policy_version 10800 (0.0006) [2023-03-07 16:26:57,745][232226] Updated weights for policy 0, policy_version 10810 (0.0007) [2023-03-07 16:26:58,544][232226] Updated weights for policy 0, policy_version 10820 (0.0007) [2023-03-07 16:26:59,338][232226] Updated weights for policy 0, policy_version 10830 (0.0007) [2023-03-07 16:27:00,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12885.3, 300 sec: 12902.4). Total num frames: 11098112. Throughput: 0: 12888.0. Samples: 11063805. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:27:00,069][231894] Avg episode reward: [(0, '196.254')] [2023-03-07 16:27:00,135][232226] Updated weights for policy 0, policy_version 10840 (0.0006) [2023-03-07 16:27:00,945][232226] Updated weights for policy 0, policy_version 10850 (0.0006) [2023-03-07 16:27:01,729][232226] Updated weights for policy 0, policy_version 10860 (0.0007) [2023-03-07 16:27:02,515][232226] Updated weights for policy 0, policy_version 10870 (0.0006) [2023-03-07 16:27:03,316][232226] Updated weights for policy 0, policy_version 10880 (0.0007) [2023-03-07 16:27:04,084][232226] Updated weights for policy 0, policy_version 10890 (0.0006) [2023-03-07 16:27:04,883][232226] Updated weights for policy 0, policy_version 10900 (0.0007) [2023-03-07 16:27:05,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12905.9). Total num frames: 11163648. Throughput: 0: 12895.9. Samples: 11141054. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:27:05,069][231894] Avg episode reward: [(0, '194.605')] [2023-03-07 16:27:05,674][232226] Updated weights for policy 0, policy_version 10910 (0.0007) [2023-03-07 16:27:06,478][232226] Updated weights for policy 0, policy_version 10920 (0.0006) [2023-03-07 16:27:07,276][232226] Updated weights for policy 0, policy_version 10930 (0.0007) [2023-03-07 16:27:08,074][232226] Updated weights for policy 0, policy_version 10940 (0.0006) [2023-03-07 16:27:08,878][232226] Updated weights for policy 0, policy_version 10950 (0.0006) [2023-03-07 16:27:09,674][232226] Updated weights for policy 0, policy_version 10960 (0.0006) [2023-03-07 16:27:10,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12902.4). Total num frames: 11227136. Throughput: 0: 12891.2. Samples: 11218217. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:27:10,069][231894] Avg episode reward: [(0, '192.697')] [2023-03-07 16:27:10,479][232226] Updated weights for policy 0, policy_version 10970 (0.0006) [2023-03-07 16:27:11,305][232226] Updated weights for policy 0, policy_version 10980 (0.0007) [2023-03-07 16:27:12,074][232226] Updated weights for policy 0, policy_version 10990 (0.0006) [2023-03-07 16:27:12,869][232226] Updated weights for policy 0, policy_version 11000 (0.0006) [2023-03-07 16:27:13,653][232226] Updated weights for policy 0, policy_version 11010 (0.0006) [2023-03-07 16:27:14,432][232226] Updated weights for policy 0, policy_version 11020 (0.0006) [2023-03-07 16:27:15,069][231894] Fps is (10 sec: 12799.8, 60 sec: 12885.3, 300 sec: 12902.4). Total num frames: 11291648. Throughput: 0: 12890.1. Samples: 11256967. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:27:15,070][231894] Avg episode reward: [(0, '182.494')] [2023-03-07 16:27:15,233][232226] Updated weights for policy 0, policy_version 11030 (0.0007) [2023-03-07 16:27:16,027][232226] Updated weights for policy 0, policy_version 11040 (0.0006) [2023-03-07 16:27:16,826][232226] Updated weights for policy 0, policy_version 11050 (0.0006) [2023-03-07 16:27:17,625][232226] Updated weights for policy 0, policy_version 11060 (0.0006) [2023-03-07 16:27:18,424][232226] Updated weights for policy 0, policy_version 11070 (0.0006) [2023-03-07 16:27:19,211][232226] Updated weights for policy 0, policy_version 11080 (0.0006) [2023-03-07 16:27:19,994][232226] Updated weights for policy 0, policy_version 11090 (0.0006) [2023-03-07 16:27:20,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.4, 300 sec: 12898.9). Total num frames: 11356160. Throughput: 0: 12877.0. Samples: 11334061. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:27:20,069][231894] Avg episode reward: [(0, '195.813')] [2023-03-07 16:27:20,794][232226] Updated weights for policy 0, policy_version 11100 (0.0007) [2023-03-07 16:27:21,587][232226] Updated weights for policy 0, policy_version 11110 (0.0006) [2023-03-07 16:27:22,384][232226] Updated weights for policy 0, policy_version 11120 (0.0007) [2023-03-07 16:27:23,178][232226] Updated weights for policy 0, policy_version 11130 (0.0006) [2023-03-07 16:27:23,987][232226] Updated weights for policy 0, policy_version 11140 (0.0006) [2023-03-07 16:27:24,774][232226] Updated weights for policy 0, policy_version 11150 (0.0006) [2023-03-07 16:27:25,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12885.3, 300 sec: 12898.9). Total num frames: 11420672. Throughput: 0: 12884.7. Samples: 11411612. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:27:25,069][231894] Avg episode reward: [(0, '184.631')] [2023-03-07 16:27:25,574][232226] Updated weights for policy 0, policy_version 11160 (0.0007) [2023-03-07 16:27:26,377][232226] Updated weights for policy 0, policy_version 11170 (0.0006) [2023-03-07 16:27:27,168][232226] Updated weights for policy 0, policy_version 11180 (0.0006) [2023-03-07 16:27:27,972][232226] Updated weights for policy 0, policy_version 11190 (0.0006) [2023-03-07 16:27:28,778][232226] Updated weights for policy 0, policy_version 11200 (0.0006) [2023-03-07 16:27:29,570][232226] Updated weights for policy 0, policy_version 11210 (0.0006) [2023-03-07 16:27:30,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12902.4). Total num frames: 11485184. Throughput: 0: 12880.0. Samples: 11450028. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:27:30,069][231894] Avg episode reward: [(0, '192.504')] [2023-03-07 16:27:30,369][232226] Updated weights for policy 0, policy_version 11220 (0.0006) [2023-03-07 16:27:31,160][232226] Updated weights for policy 0, policy_version 11230 (0.0006) [2023-03-07 16:27:31,942][232226] Updated weights for policy 0, policy_version 11240 (0.0006) [2023-03-07 16:27:32,740][232226] Updated weights for policy 0, policy_version 11250 (0.0006) [2023-03-07 16:27:33,526][232226] Updated weights for policy 0, policy_version 11260 (0.0007) [2023-03-07 16:27:34,314][232226] Updated weights for policy 0, policy_version 11270 (0.0006) [2023-03-07 16:27:35,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.4, 300 sec: 12902.4). Total num frames: 11549696. Throughput: 0: 12881.4. Samples: 11527502. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:27:35,069][231894] Avg episode reward: [(0, '190.847')] [2023-03-07 16:27:35,115][232226] Updated weights for policy 0, policy_version 11280 (0.0007) [2023-03-07 16:27:35,902][232226] Updated weights for policy 0, policy_version 11290 (0.0006) [2023-03-07 16:27:36,693][232226] Updated weights for policy 0, policy_version 11300 (0.0006) [2023-03-07 16:27:37,505][232226] Updated weights for policy 0, policy_version 11310 (0.0008) [2023-03-07 16:27:38,277][232226] Updated weights for policy 0, policy_version 11320 (0.0006) [2023-03-07 16:27:39,080][232226] Updated weights for policy 0, policy_version 11330 (0.0006) [2023-03-07 16:27:39,868][232226] Updated weights for policy 0, policy_version 11340 (0.0007) [2023-03-07 16:27:40,069][231894] Fps is (10 sec: 12902.1, 60 sec: 12885.3, 300 sec: 12898.9). Total num frames: 11614208. Throughput: 0: 12886.6. Samples: 11604928. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:27:40,070][231894] Avg episode reward: [(0, '195.690')] [2023-03-07 16:27:40,674][232226] Updated weights for policy 0, policy_version 11350 (0.0006) [2023-03-07 16:27:41,445][232226] Updated weights for policy 0, policy_version 11360 (0.0006) [2023-03-07 16:27:42,239][232226] Updated weights for policy 0, policy_version 11370 (0.0008) [2023-03-07 16:27:43,056][232226] Updated weights for policy 0, policy_version 11380 (0.0007) [2023-03-07 16:27:43,842][232226] Updated weights for policy 0, policy_version 11390 (0.0005) [2023-03-07 16:27:44,632][232226] Updated weights for policy 0, policy_version 11400 (0.0006) [2023-03-07 16:27:45,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12898.9). Total num frames: 11678720. Throughput: 0: 12886.9. Samples: 11643718. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:27:45,069][231894] Avg episode reward: [(0, '192.170')] [2023-03-07 16:27:45,441][232226] Updated weights for policy 0, policy_version 11410 (0.0006) [2023-03-07 16:27:46,225][232226] Updated weights for policy 0, policy_version 11420 (0.0006) [2023-03-07 16:27:47,009][232226] Updated weights for policy 0, policy_version 11430 (0.0006) [2023-03-07 16:27:47,818][232226] Updated weights for policy 0, policy_version 11440 (0.0006) [2023-03-07 16:27:48,596][232226] Updated weights for policy 0, policy_version 11450 (0.0006) [2023-03-07 16:27:49,396][232226] Updated weights for policy 0, policy_version 11460 (0.0007) [2023-03-07 16:27:50,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12902.4). Total num frames: 11743232. Throughput: 0: 12885.8. Samples: 11720918. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:27:50,069][231894] Avg episode reward: [(0, '185.744')] [2023-03-07 16:27:50,202][232226] Updated weights for policy 0, policy_version 11470 (0.0006) [2023-03-07 16:27:51,003][232226] Updated weights for policy 0, policy_version 11480 (0.0006) [2023-03-07 16:27:51,789][232226] Updated weights for policy 0, policy_version 11490 (0.0006) [2023-03-07 16:27:52,582][232226] Updated weights for policy 0, policy_version 11500 (0.0006) [2023-03-07 16:27:53,400][232226] Updated weights for policy 0, policy_version 11510 (0.0007) [2023-03-07 16:27:54,180][232226] Updated weights for policy 0, policy_version 11520 (0.0007) [2023-03-07 16:27:54,962][232226] Updated weights for policy 0, policy_version 11530 (0.0007) [2023-03-07 16:27:55,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.4, 300 sec: 12902.4). Total num frames: 11807744. Throughput: 0: 12885.7. Samples: 11798071. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:27:55,069][231894] Avg episode reward: [(0, '190.250')] [2023-03-07 16:27:55,774][232226] Updated weights for policy 0, policy_version 11540 (0.0007) [2023-03-07 16:27:56,569][232226] Updated weights for policy 0, policy_version 11550 (0.0006) [2023-03-07 16:27:57,345][232226] Updated weights for policy 0, policy_version 11560 (0.0006) [2023-03-07 16:27:58,166][232226] Updated weights for policy 0, policy_version 11570 (0.0006) [2023-03-07 16:27:58,953][232226] Updated weights for policy 0, policy_version 11580 (0.0006) [2023-03-07 16:27:59,744][232226] Updated weights for policy 0, policy_version 11590 (0.0007) [2023-03-07 16:28:00,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 11872256. Throughput: 0: 12885.1. Samples: 11836793. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:28:00,080][231894] Avg episode reward: [(0, '190.498')] [2023-03-07 16:28:00,566][232226] Updated weights for policy 0, policy_version 11600 (0.0006) [2023-03-07 16:28:01,321][232226] Updated weights for policy 0, policy_version 11610 (0.0006) [2023-03-07 16:28:02,102][232226] Updated weights for policy 0, policy_version 11620 (0.0007) [2023-03-07 16:28:02,905][232226] Updated weights for policy 0, policy_version 11630 (0.0008) [2023-03-07 16:28:03,685][232226] Updated weights for policy 0, policy_version 11640 (0.0006) [2023-03-07 16:28:04,473][232226] Updated weights for policy 0, policy_version 11650 (0.0007) [2023-03-07 16:28:05,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12902.4). Total num frames: 11936768. Throughput: 0: 12898.1. Samples: 11914475. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:28:05,080][231894] Avg episode reward: [(0, '191.210')] [2023-03-07 16:28:05,277][232226] Updated weights for policy 0, policy_version 11660 (0.0007) [2023-03-07 16:28:06,055][232226] Updated weights for policy 0, policy_version 11670 (0.0007) [2023-03-07 16:28:06,840][232226] Updated weights for policy 0, policy_version 11680 (0.0006) [2023-03-07 16:28:07,637][232226] Updated weights for policy 0, policy_version 11690 (0.0006) [2023-03-07 16:28:08,441][232226] Updated weights for policy 0, policy_version 11700 (0.0006) [2023-03-07 16:28:09,219][232226] Updated weights for policy 0, policy_version 11710 (0.0006) [2023-03-07 16:28:10,023][232226] Updated weights for policy 0, policy_version 11720 (0.0006) [2023-03-07 16:28:10,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 12001280. Throughput: 0: 12901.6. Samples: 11992185. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:28:10,080][231894] Avg episode reward: [(0, '188.725')] [2023-03-07 16:28:10,811][232226] Updated weights for policy 0, policy_version 11730 (0.0006) [2023-03-07 16:28:11,598][232226] Updated weights for policy 0, policy_version 11740 (0.0006) [2023-03-07 16:28:12,407][232226] Updated weights for policy 0, policy_version 11750 (0.0006) [2023-03-07 16:28:13,202][232226] Updated weights for policy 0, policy_version 11760 (0.0006) [2023-03-07 16:28:13,982][232226] Updated weights for policy 0, policy_version 11770 (0.0006) [2023-03-07 16:28:14,798][232226] Updated weights for policy 0, policy_version 11780 (0.0007) [2023-03-07 16:28:15,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 12065792. Throughput: 0: 12907.6. Samples: 12030872. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:28:15,080][231894] Avg episode reward: [(0, '195.667')] [2023-03-07 16:28:15,574][232226] Updated weights for policy 0, policy_version 11790 (0.0006) [2023-03-07 16:28:16,381][232226] Updated weights for policy 0, policy_version 11800 (0.0006) [2023-03-07 16:28:17,156][232226] Updated weights for policy 0, policy_version 11810 (0.0007) [2023-03-07 16:28:17,950][232226] Updated weights for policy 0, policy_version 11820 (0.0006) [2023-03-07 16:28:18,734][232226] Updated weights for policy 0, policy_version 11830 (0.0006) [2023-03-07 16:28:19,537][232226] Updated weights for policy 0, policy_version 11840 (0.0006) [2023-03-07 16:28:20,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 12130304. Throughput: 0: 12907.9. Samples: 12108359. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:28:20,080][231894] Avg episode reward: [(0, '194.134')] [2023-03-07 16:28:20,326][232226] Updated weights for policy 0, policy_version 11850 (0.0007) [2023-03-07 16:28:21,134][232226] Updated weights for policy 0, policy_version 11860 (0.0007) [2023-03-07 16:28:21,932][232226] Updated weights for policy 0, policy_version 11870 (0.0006) [2023-03-07 16:28:22,713][232226] Updated weights for policy 0, policy_version 11880 (0.0006) [2023-03-07 16:28:23,510][232226] Updated weights for policy 0, policy_version 11890 (0.0006) [2023-03-07 16:28:24,285][232226] Updated weights for policy 0, policy_version 11900 (0.0006) [2023-03-07 16:28:25,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 12194816. Throughput: 0: 12908.4. Samples: 12185806. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:28:25,078][232226] Updated weights for policy 0, policy_version 11910 (0.0006) [2023-03-07 16:28:25,080][231894] Avg episode reward: [(0, '187.616')] [2023-03-07 16:28:25,084][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000011910_12195840.pth... [2023-03-07 16:28:25,113][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000008885_9098240.pth [2023-03-07 16:28:25,879][232226] Updated weights for policy 0, policy_version 11920 (0.0006) [2023-03-07 16:28:26,667][232226] Updated weights for policy 0, policy_version 11930 (0.0006) [2023-03-07 16:28:27,484][232226] Updated weights for policy 0, policy_version 11940 (0.0006) [2023-03-07 16:28:28,255][232226] Updated weights for policy 0, policy_version 11950 (0.0007) [2023-03-07 16:28:29,041][232226] Updated weights for policy 0, policy_version 11960 (0.0007) [2023-03-07 16:28:29,832][232226] Updated weights for policy 0, policy_version 11970 (0.0006) [2023-03-07 16:28:30,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 12259328. Throughput: 0: 12907.4. Samples: 12224551. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:28:30,080][231894] Avg episode reward: [(0, '189.286')] [2023-03-07 16:28:30,630][232226] Updated weights for policy 0, policy_version 11980 (0.0006) [2023-03-07 16:28:31,425][232226] Updated weights for policy 0, policy_version 11990 (0.0007) [2023-03-07 16:28:32,225][232226] Updated weights for policy 0, policy_version 12000 (0.0007) [2023-03-07 16:28:33,004][232226] Updated weights for policy 0, policy_version 12010 (0.0006) [2023-03-07 16:28:33,790][232226] Updated weights for policy 0, policy_version 12020 (0.0006) [2023-03-07 16:28:34,605][232226] Updated weights for policy 0, policy_version 12030 (0.0006) [2023-03-07 16:28:35,069][231894] Fps is (10 sec: 13004.9, 60 sec: 12919.5, 300 sec: 12905.9). Total num frames: 12324864. Throughput: 0: 12917.3. Samples: 12302196. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:28:35,080][231894] Avg episode reward: [(0, '194.686')] [2023-03-07 16:28:35,367][232226] Updated weights for policy 0, policy_version 12040 (0.0007) [2023-03-07 16:28:36,171][232226] Updated weights for policy 0, policy_version 12050 (0.0007) [2023-03-07 16:28:36,970][232226] Updated weights for policy 0, policy_version 12060 (0.0006) [2023-03-07 16:28:37,748][232226] Updated weights for policy 0, policy_version 12070 (0.0006) [2023-03-07 16:28:38,547][232226] Updated weights for policy 0, policy_version 12080 (0.0006) [2023-03-07 16:28:39,339][232226] Updated weights for policy 0, policy_version 12090 (0.0006) [2023-03-07 16:28:40,069][231894] Fps is (10 sec: 13004.9, 60 sec: 12919.5, 300 sec: 12905.9). Total num frames: 12389376. Throughput: 0: 12925.1. Samples: 12379702. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:28:40,080][231894] Avg episode reward: [(0, '189.600')] [2023-03-07 16:28:40,140][232226] Updated weights for policy 0, policy_version 12100 (0.0006) [2023-03-07 16:28:40,943][232226] Updated weights for policy 0, policy_version 12110 (0.0006) [2023-03-07 16:28:41,735][232226] Updated weights for policy 0, policy_version 12120 (0.0006) [2023-03-07 16:28:42,523][232226] Updated weights for policy 0, policy_version 12130 (0.0006) [2023-03-07 16:28:43,306][232226] Updated weights for policy 0, policy_version 12140 (0.0007) [2023-03-07 16:28:44,118][232226] Updated weights for policy 0, policy_version 12150 (0.0007) [2023-03-07 16:28:44,925][232226] Updated weights for policy 0, policy_version 12160 (0.0007) [2023-03-07 16:28:45,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 12452864. Throughput: 0: 12923.2. Samples: 12418339. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:28:45,080][231894] Avg episode reward: [(0, '190.577')] [2023-03-07 16:28:45,711][232226] Updated weights for policy 0, policy_version 12170 (0.0007) [2023-03-07 16:28:46,492][232226] Updated weights for policy 0, policy_version 12180 (0.0006) [2023-03-07 16:28:47,286][232226] Updated weights for policy 0, policy_version 12190 (0.0006) [2023-03-07 16:28:48,079][232226] Updated weights for policy 0, policy_version 12200 (0.0007) [2023-03-07 16:28:48,874][232226] Updated weights for policy 0, policy_version 12210 (0.0006) [2023-03-07 16:28:49,677][232226] Updated weights for policy 0, policy_version 12220 (0.0007) [2023-03-07 16:28:50,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 12517376. Throughput: 0: 12917.2. Samples: 12495748. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:28:50,069][231894] Avg episode reward: [(0, '186.652')] [2023-03-07 16:28:50,465][232226] Updated weights for policy 0, policy_version 12230 (0.0006) [2023-03-07 16:28:51,258][232226] Updated weights for policy 0, policy_version 12240 (0.0006) [2023-03-07 16:28:52,047][232226] Updated weights for policy 0, policy_version 12250 (0.0007) [2023-03-07 16:28:52,851][232226] Updated weights for policy 0, policy_version 12260 (0.0007) [2023-03-07 16:28:53,644][232226] Updated weights for policy 0, policy_version 12270 (0.0006) [2023-03-07 16:28:54,408][232226] Updated weights for policy 0, policy_version 12280 (0.0007) [2023-03-07 16:28:55,069][231894] Fps is (10 sec: 13004.8, 60 sec: 12919.5, 300 sec: 12905.9). Total num frames: 12582912. Throughput: 0: 12913.5. Samples: 12573293. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:28:55,080][231894] Avg episode reward: [(0, '186.490')] [2023-03-07 16:28:55,219][232226] Updated weights for policy 0, policy_version 12290 (0.0007) [2023-03-07 16:28:56,007][232226] Updated weights for policy 0, policy_version 12300 (0.0006) [2023-03-07 16:28:56,804][232226] Updated weights for policy 0, policy_version 12310 (0.0006) [2023-03-07 16:28:57,585][232226] Updated weights for policy 0, policy_version 12320 (0.0006) [2023-03-07 16:28:58,361][232226] Updated weights for policy 0, policy_version 12330 (0.0007) [2023-03-07 16:28:59,156][232226] Updated weights for policy 0, policy_version 12340 (0.0007) [2023-03-07 16:28:59,961][232226] Updated weights for policy 0, policy_version 12350 (0.0006) [2023-03-07 16:29:00,069][231894] Fps is (10 sec: 13004.8, 60 sec: 12919.4, 300 sec: 12905.9). Total num frames: 12647424. Throughput: 0: 12913.6. Samples: 12611986. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 16:29:00,080][231894] Avg episode reward: [(0, '196.576')] [2023-03-07 16:29:00,767][232226] Updated weights for policy 0, policy_version 12360 (0.0006) [2023-03-07 16:29:01,565][232226] Updated weights for policy 0, policy_version 12370 (0.0007) [2023-03-07 16:29:02,363][232226] Updated weights for policy 0, policy_version 12380 (0.0007) [2023-03-07 16:29:03,155][232226] Updated weights for policy 0, policy_version 12390 (0.0006) [2023-03-07 16:29:03,947][232226] Updated weights for policy 0, policy_version 12400 (0.0006) [2023-03-07 16:29:04,724][232226] Updated weights for policy 0, policy_version 12410 (0.0006) [2023-03-07 16:29:05,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12919.5, 300 sec: 12905.9). Total num frames: 12711936. Throughput: 0: 12914.2. Samples: 12689497. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 16:29:05,080][231894] Avg episode reward: [(0, '194.515')] [2023-03-07 16:29:05,532][232226] Updated weights for policy 0, policy_version 12420 (0.0007) [2023-03-07 16:29:06,333][232226] Updated weights for policy 0, policy_version 12430 (0.0007) [2023-03-07 16:29:07,126][232226] Updated weights for policy 0, policy_version 12440 (0.0006) [2023-03-07 16:29:07,929][232226] Updated weights for policy 0, policy_version 12450 (0.0007) [2023-03-07 16:29:08,725][232226] Updated weights for policy 0, policy_version 12460 (0.0006) [2023-03-07 16:29:09,520][232226] Updated weights for policy 0, policy_version 12470 (0.0007) [2023-03-07 16:29:10,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12919.5, 300 sec: 12905.9). Total num frames: 12776448. Throughput: 0: 12904.8. Samples: 12766522. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:29:10,069][231894] Avg episode reward: [(0, '188.820')] [2023-03-07 16:29:10,328][232226] Updated weights for policy 0, policy_version 12480 (0.0006) [2023-03-07 16:29:11,116][232226] Updated weights for policy 0, policy_version 12490 (0.0006) [2023-03-07 16:29:11,912][232226] Updated weights for policy 0, policy_version 12500 (0.0006) [2023-03-07 16:29:12,705][232226] Updated weights for policy 0, policy_version 12510 (0.0006) [2023-03-07 16:29:13,489][232226] Updated weights for policy 0, policy_version 12520 (0.0006) [2023-03-07 16:29:14,297][232226] Updated weights for policy 0, policy_version 12530 (0.0007) [2023-03-07 16:29:15,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 12839936. Throughput: 0: 12904.4. Samples: 12805248. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:29:15,069][231894] Avg episode reward: [(0, '192.546')] [2023-03-07 16:29:15,076][232226] Updated weights for policy 0, policy_version 12540 (0.0006) [2023-03-07 16:29:15,867][232226] Updated weights for policy 0, policy_version 12550 (0.0006) [2023-03-07 16:29:16,665][232226] Updated weights for policy 0, policy_version 12560 (0.0006) [2023-03-07 16:29:17,450][232226] Updated weights for policy 0, policy_version 12570 (0.0006) [2023-03-07 16:29:18,231][232226] Updated weights for policy 0, policy_version 12580 (0.0007) [2023-03-07 16:29:19,046][232226] Updated weights for policy 0, policy_version 12590 (0.0006) [2023-03-07 16:29:19,815][232226] Updated weights for policy 0, policy_version 12600 (0.0006) [2023-03-07 16:29:20,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12919.5, 300 sec: 12905.9). Total num frames: 12905472. Throughput: 0: 12902.0. Samples: 12882785. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:29:20,070][231894] Avg episode reward: [(0, '191.855')] [2023-03-07 16:29:20,616][232226] Updated weights for policy 0, policy_version 12610 (0.0006) [2023-03-07 16:29:21,402][232226] Updated weights for policy 0, policy_version 12620 (0.0006) [2023-03-07 16:29:22,197][232226] Updated weights for policy 0, policy_version 12630 (0.0007) [2023-03-07 16:29:22,993][232226] Updated weights for policy 0, policy_version 12640 (0.0006) [2023-03-07 16:29:23,784][232226] Updated weights for policy 0, policy_version 12650 (0.0007) [2023-03-07 16:29:24,574][232226] Updated weights for policy 0, policy_version 12660 (0.0006) [2023-03-07 16:29:25,069][231894] Fps is (10 sec: 13004.9, 60 sec: 12919.5, 300 sec: 12909.3). Total num frames: 12969984. Throughput: 0: 12903.1. Samples: 12960343. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:29:25,069][231894] Avg episode reward: [(0, '193.658')] [2023-03-07 16:29:25,361][232226] Updated weights for policy 0, policy_version 12670 (0.0006) [2023-03-07 16:29:26,174][232226] Updated weights for policy 0, policy_version 12680 (0.0006) [2023-03-07 16:29:26,958][232226] Updated weights for policy 0, policy_version 12690 (0.0006) [2023-03-07 16:29:27,771][232226] Updated weights for policy 0, policy_version 12700 (0.0006) [2023-03-07 16:29:28,558][232226] Updated weights for policy 0, policy_version 12710 (0.0006) [2023-03-07 16:29:29,360][232226] Updated weights for policy 0, policy_version 12720 (0.0006) [2023-03-07 16:29:30,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12919.5, 300 sec: 12909.3). Total num frames: 13034496. Throughput: 0: 12904.3. Samples: 12999033. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:29:30,069][231894] Avg episode reward: [(0, '192.653')] [2023-03-07 16:29:30,135][232226] Updated weights for policy 0, policy_version 12730 (0.0007) [2023-03-07 16:29:30,932][232226] Updated weights for policy 0, policy_version 12740 (0.0007) [2023-03-07 16:29:31,748][232226] Updated weights for policy 0, policy_version 12750 (0.0007) [2023-03-07 16:29:32,522][232226] Updated weights for policy 0, policy_version 12760 (0.0005) [2023-03-07 16:29:33,310][232226] Updated weights for policy 0, policy_version 12770 (0.0006) [2023-03-07 16:29:34,101][232226] Updated weights for policy 0, policy_version 12780 (0.0006) [2023-03-07 16:29:34,886][232226] Updated weights for policy 0, policy_version 12790 (0.0006) [2023-03-07 16:29:35,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12909.3). Total num frames: 13099008. Throughput: 0: 12903.7. Samples: 13076417. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:29:35,069][231894] Avg episode reward: [(0, '201.302')] [2023-03-07 16:29:35,693][232226] Updated weights for policy 0, policy_version 12800 (0.0006) [2023-03-07 16:29:36,511][232226] Updated weights for policy 0, policy_version 12810 (0.0006) [2023-03-07 16:29:37,292][232226] Updated weights for policy 0, policy_version 12820 (0.0007) [2023-03-07 16:29:38,097][232226] Updated weights for policy 0, policy_version 12830 (0.0006) [2023-03-07 16:29:38,880][232226] Updated weights for policy 0, policy_version 12840 (0.0006) [2023-03-07 16:29:39,674][232226] Updated weights for policy 0, policy_version 12850 (0.0006) [2023-03-07 16:29:40,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12909.3). Total num frames: 13163520. Throughput: 0: 12898.9. Samples: 13153745. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:29:40,069][231894] Avg episode reward: [(0, '189.244')] [2023-03-07 16:29:40,481][232226] Updated weights for policy 0, policy_version 12860 (0.0006) [2023-03-07 16:29:41,268][232226] Updated weights for policy 0, policy_version 12870 (0.0006) [2023-03-07 16:29:42,053][232226] Updated weights for policy 0, policy_version 12880 (0.0006) [2023-03-07 16:29:42,837][232226] Updated weights for policy 0, policy_version 12890 (0.0007) [2023-03-07 16:29:43,617][232226] Updated weights for policy 0, policy_version 12900 (0.0006) [2023-03-07 16:29:44,418][232226] Updated weights for policy 0, policy_version 12910 (0.0007) [2023-03-07 16:29:45,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12919.5, 300 sec: 12909.3). Total num frames: 13228032. Throughput: 0: 12897.3. Samples: 13192366. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:29:45,069][231894] Avg episode reward: [(0, '194.885')] [2023-03-07 16:29:45,209][232226] Updated weights for policy 0, policy_version 12920 (0.0006) [2023-03-07 16:29:45,989][232226] Updated weights for policy 0, policy_version 12930 (0.0006) [2023-03-07 16:29:46,790][232226] Updated weights for policy 0, policy_version 12940 (0.0007) [2023-03-07 16:29:47,581][232226] Updated weights for policy 0, policy_version 12950 (0.0007) [2023-03-07 16:29:48,379][232226] Updated weights for policy 0, policy_version 12960 (0.0006) [2023-03-07 16:29:49,180][232226] Updated weights for policy 0, policy_version 12970 (0.0006) [2023-03-07 16:29:49,994][232226] Updated weights for policy 0, policy_version 12980 (0.0006) [2023-03-07 16:29:50,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12919.5, 300 sec: 12909.3). Total num frames: 13292544. Throughput: 0: 12902.4. Samples: 13270105. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:29:50,069][231894] Avg episode reward: [(0, '195.189')] [2023-03-07 16:29:50,795][232226] Updated weights for policy 0, policy_version 12990 (0.0007) [2023-03-07 16:29:51,593][232226] Updated weights for policy 0, policy_version 13000 (0.0006) [2023-03-07 16:29:52,366][232226] Updated weights for policy 0, policy_version 13010 (0.0006) [2023-03-07 16:29:53,190][232226] Updated weights for policy 0, policy_version 13020 (0.0006) [2023-03-07 16:29:53,966][232226] Updated weights for policy 0, policy_version 13030 (0.0006) [2023-03-07 16:29:54,765][232226] Updated weights for policy 0, policy_version 13040 (0.0006) [2023-03-07 16:29:55,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12885.3, 300 sec: 12902.4). Total num frames: 13356032. Throughput: 0: 12902.9. Samples: 13347152. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:29:55,069][231894] Avg episode reward: [(0, '196.215')] [2023-03-07 16:29:55,558][232226] Updated weights for policy 0, policy_version 13050 (0.0006) [2023-03-07 16:29:56,361][232226] Updated weights for policy 0, policy_version 13060 (0.0006) [2023-03-07 16:29:57,151][232226] Updated weights for policy 0, policy_version 13070 (0.0006) [2023-03-07 16:29:57,950][232226] Updated weights for policy 0, policy_version 13080 (0.0006) [2023-03-07 16:29:58,738][232226] Updated weights for policy 0, policy_version 13090 (0.0006) [2023-03-07 16:29:59,520][232226] Updated weights for policy 0, policy_version 13100 (0.0007) [2023-03-07 16:30:00,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12905.9). Total num frames: 13421568. Throughput: 0: 12899.9. Samples: 13385744. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:30:00,070][231894] Avg episode reward: [(0, '191.677')] [2023-03-07 16:30:00,330][232226] Updated weights for policy 0, policy_version 13110 (0.0007) [2023-03-07 16:30:01,147][232226] Updated weights for policy 0, policy_version 13120 (0.0006) [2023-03-07 16:30:01,935][232226] Updated weights for policy 0, policy_version 13130 (0.0006) [2023-03-07 16:30:02,720][232226] Updated weights for policy 0, policy_version 13140 (0.0007) [2023-03-07 16:30:03,523][232226] Updated weights for policy 0, policy_version 13150 (0.0006) [2023-03-07 16:30:04,330][232226] Updated weights for policy 0, policy_version 13160 (0.0006) [2023-03-07 16:30:05,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12902.4). Total num frames: 13485056. Throughput: 0: 12891.4. Samples: 13462896. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:30:05,069][231894] Avg episode reward: [(0, '190.363')] [2023-03-07 16:30:05,117][232226] Updated weights for policy 0, policy_version 13170 (0.0006) [2023-03-07 16:30:05,914][232226] Updated weights for policy 0, policy_version 13180 (0.0006) [2023-03-07 16:30:06,707][232226] Updated weights for policy 0, policy_version 13190 (0.0007) [2023-03-07 16:30:07,484][232226] Updated weights for policy 0, policy_version 13200 (0.0006) [2023-03-07 16:30:08,270][232226] Updated weights for policy 0, policy_version 13210 (0.0006) [2023-03-07 16:30:09,076][232226] Updated weights for policy 0, policy_version 13220 (0.0007) [2023-03-07 16:30:09,858][232226] Updated weights for policy 0, policy_version 13230 (0.0006) [2023-03-07 16:30:10,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12885.3, 300 sec: 12902.4). Total num frames: 13549568. Throughput: 0: 12885.2. Samples: 13540178. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:30:10,069][231894] Avg episode reward: [(0, '188.212')] [2023-03-07 16:30:10,661][232226] Updated weights for policy 0, policy_version 13240 (0.0006) [2023-03-07 16:30:11,466][232226] Updated weights for policy 0, policy_version 13250 (0.0006) [2023-03-07 16:30:12,258][232226] Updated weights for policy 0, policy_version 13260 (0.0006) [2023-03-07 16:30:13,025][232226] Updated weights for policy 0, policy_version 13270 (0.0007) [2023-03-07 16:30:13,837][232226] Updated weights for policy 0, policy_version 13280 (0.0007) [2023-03-07 16:30:14,633][232226] Updated weights for policy 0, policy_version 13290 (0.0007) [2023-03-07 16:30:15,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 13614080. Throughput: 0: 12888.0. Samples: 13578992. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:30:15,069][231894] Avg episode reward: [(0, '195.762')] [2023-03-07 16:30:15,409][232226] Updated weights for policy 0, policy_version 13300 (0.0006) [2023-03-07 16:30:16,222][232226] Updated weights for policy 0, policy_version 13310 (0.0006) [2023-03-07 16:30:17,012][232226] Updated weights for policy 0, policy_version 13320 (0.0008) [2023-03-07 16:30:17,805][232226] Updated weights for policy 0, policy_version 13330 (0.0006) [2023-03-07 16:30:18,592][232226] Updated weights for policy 0, policy_version 13340 (0.0007) [2023-03-07 16:30:19,388][232226] Updated weights for policy 0, policy_version 13350 (0.0006) [2023-03-07 16:30:20,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12898.9). Total num frames: 13678592. Throughput: 0: 12889.3. Samples: 13656435. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:30:20,070][231894] Avg episode reward: [(0, '185.625')] [2023-03-07 16:30:20,183][232226] Updated weights for policy 0, policy_version 13360 (0.0007) [2023-03-07 16:30:20,964][232226] Updated weights for policy 0, policy_version 13370 (0.0007) [2023-03-07 16:30:21,770][232226] Updated weights for policy 0, policy_version 13380 (0.0007) [2023-03-07 16:30:22,550][232226] Updated weights for policy 0, policy_version 13390 (0.0006) [2023-03-07 16:30:23,362][232226] Updated weights for policy 0, policy_version 13400 (0.0006) [2023-03-07 16:30:24,137][232226] Updated weights for policy 0, policy_version 13410 (0.0006) [2023-03-07 16:30:24,938][232226] Updated weights for policy 0, policy_version 13420 (0.0006) [2023-03-07 16:30:25,069][231894] Fps is (10 sec: 12902.2, 60 sec: 12885.3, 300 sec: 12898.9). Total num frames: 13743104. Throughput: 0: 12893.9. Samples: 13733971. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:30:25,070][231894] Avg episode reward: [(0, '192.649')] [2023-03-07 16:30:25,073][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000013421_13743104.pth... [2023-03-07 16:30:25,103][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000010398_10647552.pth [2023-03-07 16:30:25,713][232226] Updated weights for policy 0, policy_version 13430 (0.0006) [2023-03-07 16:30:26,518][232226] Updated weights for policy 0, policy_version 13440 (0.0006) [2023-03-07 16:30:27,302][232226] Updated weights for policy 0, policy_version 13450 (0.0006) [2023-03-07 16:30:28,112][232226] Updated weights for policy 0, policy_version 13460 (0.0007) [2023-03-07 16:30:28,910][232226] Updated weights for policy 0, policy_version 13470 (0.0006) [2023-03-07 16:30:29,698][232226] Updated weights for policy 0, policy_version 13480 (0.0006) [2023-03-07 16:30:30,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12898.9). Total num frames: 13807616. Throughput: 0: 12897.3. Samples: 13772746. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:30:30,069][231894] Avg episode reward: [(0, '195.113')] [2023-03-07 16:30:30,510][232226] Updated weights for policy 0, policy_version 13490 (0.0006) [2023-03-07 16:30:31,308][232226] Updated weights for policy 0, policy_version 13500 (0.0006) [2023-03-07 16:30:32,109][232226] Updated weights for policy 0, policy_version 13510 (0.0006) [2023-03-07 16:30:32,890][232226] Updated weights for policy 0, policy_version 13520 (0.0007) [2023-03-07 16:30:33,696][232226] Updated weights for policy 0, policy_version 13530 (0.0006) [2023-03-07 16:30:34,472][232226] Updated weights for policy 0, policy_version 13540 (0.0006) [2023-03-07 16:30:35,069][231894] Fps is (10 sec: 12902.7, 60 sec: 12885.4, 300 sec: 12898.9). Total num frames: 13872128. Throughput: 0: 12880.3. Samples: 13849717. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:30:35,069][231894] Avg episode reward: [(0, '190.975')] [2023-03-07 16:30:35,285][232226] Updated weights for policy 0, policy_version 13550 (0.0006) [2023-03-07 16:30:36,050][232226] Updated weights for policy 0, policy_version 13560 (0.0007) [2023-03-07 16:30:36,860][232226] Updated weights for policy 0, policy_version 13570 (0.0006) [2023-03-07 16:30:37,650][232226] Updated weights for policy 0, policy_version 13580 (0.0006) [2023-03-07 16:30:38,431][232226] Updated weights for policy 0, policy_version 13590 (0.0006) [2023-03-07 16:30:39,247][232226] Updated weights for policy 0, policy_version 13600 (0.0006) [2023-03-07 16:30:40,034][232226] Updated weights for policy 0, policy_version 13610 (0.0007) [2023-03-07 16:30:40,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.4, 300 sec: 12898.9). Total num frames: 13936640. Throughput: 0: 12892.2. Samples: 13927301. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:30:40,069][231894] Avg episode reward: [(0, '193.004')] [2023-03-07 16:30:40,833][232226] Updated weights for policy 0, policy_version 13620 (0.0006) [2023-03-07 16:30:41,640][232226] Updated weights for policy 0, policy_version 13630 (0.0006) [2023-03-07 16:30:42,420][232226] Updated weights for policy 0, policy_version 13640 (0.0006) [2023-03-07 16:30:43,219][232226] Updated weights for policy 0, policy_version 13650 (0.0006) [2023-03-07 16:30:44,026][232226] Updated weights for policy 0, policy_version 13660 (0.0006) [2023-03-07 16:30:44,810][232226] Updated weights for policy 0, policy_version 13670 (0.0007) [2023-03-07 16:30:45,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12898.9). Total num frames: 14001152. Throughput: 0: 12893.6. Samples: 13965954. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:30:45,069][231894] Avg episode reward: [(0, '193.685')] [2023-03-07 16:30:45,598][232226] Updated weights for policy 0, policy_version 13680 (0.0006) [2023-03-07 16:30:46,383][232226] Updated weights for policy 0, policy_version 13690 (0.0007) [2023-03-07 16:30:47,188][232226] Updated weights for policy 0, policy_version 13700 (0.0007) [2023-03-07 16:30:47,977][232226] Updated weights for policy 0, policy_version 13710 (0.0007) [2023-03-07 16:30:48,788][232226] Updated weights for policy 0, policy_version 13720 (0.0006) [2023-03-07 16:30:49,553][232226] Updated weights for policy 0, policy_version 13730 (0.0006) [2023-03-07 16:30:50,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12898.9). Total num frames: 14065664. Throughput: 0: 12897.9. Samples: 14043301. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:30:50,069][231894] Avg episode reward: [(0, '182.244')] [2023-03-07 16:30:50,351][232226] Updated weights for policy 0, policy_version 13740 (0.0006) [2023-03-07 16:30:51,153][232226] Updated weights for policy 0, policy_version 13750 (0.0006) [2023-03-07 16:30:51,945][232226] Updated weights for policy 0, policy_version 13760 (0.0007) [2023-03-07 16:30:52,754][232226] Updated weights for policy 0, policy_version 13770 (0.0006) [2023-03-07 16:30:53,543][232226] Updated weights for policy 0, policy_version 13780 (0.0007) [2023-03-07 16:30:54,336][232226] Updated weights for policy 0, policy_version 13790 (0.0007) [2023-03-07 16:30:55,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12885.4, 300 sec: 12895.5). Total num frames: 14129152. Throughput: 0: 12896.5. Samples: 14120520. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:30:55,069][231894] Avg episode reward: [(0, '183.945')] [2023-03-07 16:30:55,141][232226] Updated weights for policy 0, policy_version 13800 (0.0006) [2023-03-07 16:30:55,939][232226] Updated weights for policy 0, policy_version 13810 (0.0007) [2023-03-07 16:30:56,741][232226] Updated weights for policy 0, policy_version 13820 (0.0006) [2023-03-07 16:30:57,538][232226] Updated weights for policy 0, policy_version 13830 (0.0006) [2023-03-07 16:30:58,328][232226] Updated weights for policy 0, policy_version 13840 (0.0006) [2023-03-07 16:30:59,128][232226] Updated weights for policy 0, policy_version 13850 (0.0007) [2023-03-07 16:30:59,913][232226] Updated weights for policy 0, policy_version 13860 (0.0006) [2023-03-07 16:31:00,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12895.5). Total num frames: 14193664. Throughput: 0: 12887.2. Samples: 14158915. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:31:00,069][231894] Avg episode reward: [(0, '190.467')] [2023-03-07 16:31:00,704][232226] Updated weights for policy 0, policy_version 13870 (0.0006) [2023-03-07 16:31:01,502][232226] Updated weights for policy 0, policy_version 13880 (0.0006) [2023-03-07 16:31:02,279][232226] Updated weights for policy 0, policy_version 13890 (0.0006) [2023-03-07 16:31:03,089][232226] Updated weights for policy 0, policy_version 13900 (0.0006) [2023-03-07 16:31:03,869][232226] Updated weights for policy 0, policy_version 13910 (0.0006) [2023-03-07 16:31:04,658][232226] Updated weights for policy 0, policy_version 13920 (0.0006) [2023-03-07 16:31:05,069][231894] Fps is (10 sec: 13004.8, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 14259200. Throughput: 0: 12891.5. Samples: 14236551. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:31:05,069][231894] Avg episode reward: [(0, '193.852')] [2023-03-07 16:31:05,466][232226] Updated weights for policy 0, policy_version 13930 (0.0006) [2023-03-07 16:31:06,254][232226] Updated weights for policy 0, policy_version 13940 (0.0006) [2023-03-07 16:31:07,037][232226] Updated weights for policy 0, policy_version 13950 (0.0006) [2023-03-07 16:31:07,844][232226] Updated weights for policy 0, policy_version 13960 (0.0007) [2023-03-07 16:31:08,639][232226] Updated weights for policy 0, policy_version 13970 (0.0006) [2023-03-07 16:31:09,413][232226] Updated weights for policy 0, policy_version 13980 (0.0007) [2023-03-07 16:31:10,069][231894] Fps is (10 sec: 13004.7, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 14323712. Throughput: 0: 12890.8. Samples: 14314056. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:31:10,069][231894] Avg episode reward: [(0, '193.763')] [2023-03-07 16:31:10,219][232226] Updated weights for policy 0, policy_version 13990 (0.0006) [2023-03-07 16:31:11,018][232226] Updated weights for policy 0, policy_version 14000 (0.0007) [2023-03-07 16:31:11,809][232226] Updated weights for policy 0, policy_version 14010 (0.0006) [2023-03-07 16:31:12,616][232226] Updated weights for policy 0, policy_version 14020 (0.0007) [2023-03-07 16:31:13,400][232226] Updated weights for policy 0, policy_version 14030 (0.0006) [2023-03-07 16:31:14,178][232226] Updated weights for policy 0, policy_version 14040 (0.0007) [2023-03-07 16:31:14,989][232226] Updated weights for policy 0, policy_version 14050 (0.0008) [2023-03-07 16:31:15,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 14388224. Throughput: 0: 12889.5. Samples: 14352772. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:31:15,069][231894] Avg episode reward: [(0, '192.401')] [2023-03-07 16:31:15,784][232226] Updated weights for policy 0, policy_version 14060 (0.0007) [2023-03-07 16:31:16,561][232226] Updated weights for policy 0, policy_version 14070 (0.0006) [2023-03-07 16:31:17,374][232226] Updated weights for policy 0, policy_version 14080 (0.0007) [2023-03-07 16:31:18,157][232226] Updated weights for policy 0, policy_version 14090 (0.0006) [2023-03-07 16:31:18,944][232226] Updated weights for policy 0, policy_version 14100 (0.0006) [2023-03-07 16:31:19,743][232226] Updated weights for policy 0, policy_version 14110 (0.0007) [2023-03-07 16:31:20,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 14452736. Throughput: 0: 12896.9. Samples: 14430080. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:31:20,069][231894] Avg episode reward: [(0, '194.567')] [2023-03-07 16:31:20,554][232226] Updated weights for policy 0, policy_version 14120 (0.0006) [2023-03-07 16:31:21,349][232226] Updated weights for policy 0, policy_version 14130 (0.0006) [2023-03-07 16:31:22,161][232226] Updated weights for policy 0, policy_version 14140 (0.0006) [2023-03-07 16:31:22,941][232226] Updated weights for policy 0, policy_version 14150 (0.0007) [2023-03-07 16:31:23,736][232226] Updated weights for policy 0, policy_version 14160 (0.0007) [2023-03-07 16:31:24,509][232226] Updated weights for policy 0, policy_version 14170 (0.0006) [2023-03-07 16:31:25,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12885.4, 300 sec: 12895.5). Total num frames: 14516224. Throughput: 0: 12890.3. Samples: 14507363. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:31:25,069][231894] Avg episode reward: [(0, '196.824')] [2023-03-07 16:31:25,322][232226] Updated weights for policy 0, policy_version 14180 (0.0006) [2023-03-07 16:31:26,119][232226] Updated weights for policy 0, policy_version 14190 (0.0006) [2023-03-07 16:31:26,930][232226] Updated weights for policy 0, policy_version 14200 (0.0007) [2023-03-07 16:31:27,705][232226] Updated weights for policy 0, policy_version 14210 (0.0006) [2023-03-07 16:31:28,491][232226] Updated weights for policy 0, policy_version 14220 (0.0006) [2023-03-07 16:31:29,294][232226] Updated weights for policy 0, policy_version 14230 (0.0007) [2023-03-07 16:31:30,069][231894] Fps is (10 sec: 12799.8, 60 sec: 12885.3, 300 sec: 12895.5). Total num frames: 14580736. Throughput: 0: 12884.2. Samples: 14545744. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:31:30,069][231894] Avg episode reward: [(0, '189.217')] [2023-03-07 16:31:30,087][232226] Updated weights for policy 0, policy_version 14240 (0.0006) [2023-03-07 16:31:30,881][232226] Updated weights for policy 0, policy_version 14250 (0.0006) [2023-03-07 16:31:31,653][232226] Updated weights for policy 0, policy_version 14260 (0.0006) [2023-03-07 16:31:32,464][232226] Updated weights for policy 0, policy_version 14270 (0.0007) [2023-03-07 16:31:33,254][232226] Updated weights for policy 0, policy_version 14280 (0.0007) [2023-03-07 16:31:34,052][232226] Updated weights for policy 0, policy_version 14290 (0.0007) [2023-03-07 16:31:34,851][232226] Updated weights for policy 0, policy_version 14300 (0.0006) [2023-03-07 16:31:35,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12895.5). Total num frames: 14645248. Throughput: 0: 12894.2. Samples: 14623538. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:31:35,069][231894] Avg episode reward: [(0, '196.746')] [2023-03-07 16:31:35,646][232226] Updated weights for policy 0, policy_version 14310 (0.0007) [2023-03-07 16:31:36,425][232226] Updated weights for policy 0, policy_version 14320 (0.0007) [2023-03-07 16:31:37,234][232226] Updated weights for policy 0, policy_version 14330 (0.0006) [2023-03-07 16:31:38,025][232226] Updated weights for policy 0, policy_version 14340 (0.0006) [2023-03-07 16:31:38,803][232226] Updated weights for policy 0, policy_version 14350 (0.0006) [2023-03-07 16:31:39,592][232226] Updated weights for policy 0, policy_version 14360 (0.0006) [2023-03-07 16:31:40,069][231894] Fps is (10 sec: 13005.0, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 14710784. Throughput: 0: 12896.7. Samples: 14700874. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:31:40,069][231894] Avg episode reward: [(0, '190.861')] [2023-03-07 16:31:40,376][232226] Updated weights for policy 0, policy_version 14370 (0.0006) [2023-03-07 16:31:41,180][232226] Updated weights for policy 0, policy_version 14380 (0.0007) [2023-03-07 16:31:41,974][232226] Updated weights for policy 0, policy_version 14390 (0.0006) [2023-03-07 16:31:42,750][232226] Updated weights for policy 0, policy_version 14400 (0.0007) [2023-03-07 16:31:43,561][232226] Updated weights for policy 0, policy_version 14410 (0.0006) [2023-03-07 16:31:44,365][232226] Updated weights for policy 0, policy_version 14420 (0.0006) [2023-03-07 16:31:45,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12895.5). Total num frames: 14774272. Throughput: 0: 12907.7. Samples: 14739764. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:31:45,070][231894] Avg episode reward: [(0, '186.803')] [2023-03-07 16:31:45,159][232226] Updated weights for policy 0, policy_version 14430 (0.0007) [2023-03-07 16:31:45,936][232226] Updated weights for policy 0, policy_version 14440 (0.0007) [2023-03-07 16:31:46,734][232226] Updated weights for policy 0, policy_version 14450 (0.0006) [2023-03-07 16:31:47,521][232226] Updated weights for policy 0, policy_version 14460 (0.0006) [2023-03-07 16:31:48,337][232226] Updated weights for policy 0, policy_version 14470 (0.0006) [2023-03-07 16:31:49,145][232226] Updated weights for policy 0, policy_version 14480 (0.0005) [2023-03-07 16:31:49,947][232226] Updated weights for policy 0, policy_version 14490 (0.0006) [2023-03-07 16:31:50,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12885.4, 300 sec: 12895.5). Total num frames: 14838784. Throughput: 0: 12902.0. Samples: 14817142. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:31:50,069][231894] Avg episode reward: [(0, '188.826')] [2023-03-07 16:31:50,720][232226] Updated weights for policy 0, policy_version 14500 (0.0007) [2023-03-07 16:31:51,531][232226] Updated weights for policy 0, policy_version 14510 (0.0006) [2023-03-07 16:31:52,303][232226] Updated weights for policy 0, policy_version 14520 (0.0006) [2023-03-07 16:31:53,101][232226] Updated weights for policy 0, policy_version 14530 (0.0006) [2023-03-07 16:31:53,897][232226] Updated weights for policy 0, policy_version 14540 (0.0006) [2023-03-07 16:31:54,703][232226] Updated weights for policy 0, policy_version 14550 (0.0007) [2023-03-07 16:31:55,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 14903296. Throughput: 0: 12895.0. Samples: 14894331. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:31:55,069][231894] Avg episode reward: [(0, '188.688')] [2023-03-07 16:31:55,481][232226] Updated weights for policy 0, policy_version 14560 (0.0006) [2023-03-07 16:31:56,302][232226] Updated weights for policy 0, policy_version 14570 (0.0007) [2023-03-07 16:31:57,089][232226] Updated weights for policy 0, policy_version 14580 (0.0006) [2023-03-07 16:31:57,870][232226] Updated weights for policy 0, policy_version 14590 (0.0006) [2023-03-07 16:31:58,678][232226] Updated weights for policy 0, policy_version 14600 (0.0007) [2023-03-07 16:31:59,466][232226] Updated weights for policy 0, policy_version 14610 (0.0007) [2023-03-07 16:32:00,069][231894] Fps is (10 sec: 12902.2, 60 sec: 12902.4, 300 sec: 12895.5). Total num frames: 14967808. Throughput: 0: 12891.4. Samples: 14932883. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:32:00,069][231894] Avg episode reward: [(0, '183.411')] [2023-03-07 16:32:00,260][232226] Updated weights for policy 0, policy_version 14620 (0.0006) [2023-03-07 16:32:01,051][232226] Updated weights for policy 0, policy_version 14630 (0.0007) [2023-03-07 16:32:01,840][232226] Updated weights for policy 0, policy_version 14640 (0.0006) [2023-03-07 16:32:02,645][232226] Updated weights for policy 0, policy_version 14650 (0.0007) [2023-03-07 16:32:03,420][232226] Updated weights for policy 0, policy_version 14660 (0.0006) [2023-03-07 16:32:04,206][232226] Updated weights for policy 0, policy_version 14670 (0.0006) [2023-03-07 16:32:04,994][232226] Updated weights for policy 0, policy_version 14680 (0.0006) [2023-03-07 16:32:05,069][231894] Fps is (10 sec: 13004.8, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 15033344. Throughput: 0: 12897.7. Samples: 15010475. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 16:32:05,069][231894] Avg episode reward: [(0, '191.007')] [2023-03-07 16:32:05,787][232226] Updated weights for policy 0, policy_version 14690 (0.0006) [2023-03-07 16:32:06,588][232226] Updated weights for policy 0, policy_version 14700 (0.0006) [2023-03-07 16:32:07,399][232226] Updated weights for policy 0, policy_version 14710 (0.0006) [2023-03-07 16:32:08,169][232226] Updated weights for policy 0, policy_version 14720 (0.0007) [2023-03-07 16:32:08,985][232226] Updated weights for policy 0, policy_version 14730 (0.0006) [2023-03-07 16:32:09,784][232226] Updated weights for policy 0, policy_version 14740 (0.0007) [2023-03-07 16:32:10,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12898.9). Total num frames: 15096832. Throughput: 0: 12894.5. Samples: 15087617. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 16:32:10,069][231894] Avg episode reward: [(0, '194.357')] [2023-03-07 16:32:10,570][232226] Updated weights for policy 0, policy_version 14750 (0.0006) [2023-03-07 16:32:11,366][232226] Updated weights for policy 0, policy_version 14760 (0.0006) [2023-03-07 16:32:12,173][232226] Updated weights for policy 0, policy_version 14770 (0.0006) [2023-03-07 16:32:12,936][232226] Updated weights for policy 0, policy_version 14780 (0.0006) [2023-03-07 16:32:13,746][232226] Updated weights for policy 0, policy_version 14790 (0.0006) [2023-03-07 16:32:14,538][232226] Updated weights for policy 0, policy_version 14800 (0.0006) [2023-03-07 16:32:15,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12885.3, 300 sec: 12898.9). Total num frames: 15161344. Throughput: 0: 12904.4. Samples: 15126441. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:32:15,069][231894] Avg episode reward: [(0, '193.409')] [2023-03-07 16:32:15,323][232226] Updated weights for policy 0, policy_version 14810 (0.0006) [2023-03-07 16:32:16,112][232226] Updated weights for policy 0, policy_version 14820 (0.0007) [2023-03-07 16:32:16,923][232226] Updated weights for policy 0, policy_version 14830 (0.0006) [2023-03-07 16:32:17,729][232226] Updated weights for policy 0, policy_version 14840 (0.0006) [2023-03-07 16:32:18,522][232226] Updated weights for policy 0, policy_version 14850 (0.0007) [2023-03-07 16:32:19,321][232226] Updated weights for policy 0, policy_version 14860 (0.0006) [2023-03-07 16:32:20,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12898.9). Total num frames: 15225856. Throughput: 0: 12891.0. Samples: 15203634. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:32:20,069][231894] Avg episode reward: [(0, '192.873')] [2023-03-07 16:32:20,105][232226] Updated weights for policy 0, policy_version 14870 (0.0007) [2023-03-07 16:32:20,889][232226] Updated weights for policy 0, policy_version 14880 (0.0007) [2023-03-07 16:32:21,690][232226] Updated weights for policy 0, policy_version 14890 (0.0007) [2023-03-07 16:32:22,481][232226] Updated weights for policy 0, policy_version 14900 (0.0006) [2023-03-07 16:32:23,272][232226] Updated weights for policy 0, policy_version 14910 (0.0008) [2023-03-07 16:32:24,077][232226] Updated weights for policy 0, policy_version 14920 (0.0007) [2023-03-07 16:32:24,873][232226] Updated weights for policy 0, policy_version 14930 (0.0006) [2023-03-07 16:32:25,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 15290368. Throughput: 0: 12893.0. Samples: 15281058. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:32:25,069][231894] Avg episode reward: [(0, '194.920')] [2023-03-07 16:32:25,073][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000014932_15290368.pth... [2023-03-07 16:32:25,103][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000011910_12195840.pth [2023-03-07 16:32:25,653][232226] Updated weights for policy 0, policy_version 14940 (0.0006) [2023-03-07 16:32:26,449][232226] Updated weights for policy 0, policy_version 14950 (0.0007) [2023-03-07 16:32:27,239][232226] Updated weights for policy 0, policy_version 14960 (0.0007) [2023-03-07 16:32:28,021][232226] Updated weights for policy 0, policy_version 14970 (0.0007) [2023-03-07 16:32:28,824][232226] Updated weights for policy 0, policy_version 14980 (0.0007) [2023-03-07 16:32:29,623][232226] Updated weights for policy 0, policy_version 14990 (0.0007) [2023-03-07 16:32:30,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 15354880. Throughput: 0: 12892.1. Samples: 15319907. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:32:30,069][231894] Avg episode reward: [(0, '190.976')] [2023-03-07 16:32:30,421][232226] Updated weights for policy 0, policy_version 15000 (0.0007) [2023-03-07 16:32:31,201][232226] Updated weights for policy 0, policy_version 15010 (0.0007) [2023-03-07 16:32:32,007][232226] Updated weights for policy 0, policy_version 15020 (0.0006) [2023-03-07 16:32:32,794][232226] Updated weights for policy 0, policy_version 15030 (0.0007) [2023-03-07 16:32:33,581][232226] Updated weights for policy 0, policy_version 15040 (0.0007) [2023-03-07 16:32:34,394][232226] Updated weights for policy 0, policy_version 15050 (0.0007) [2023-03-07 16:32:35,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 15419392. Throughput: 0: 12894.9. Samples: 15397416. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:32:35,069][231894] Avg episode reward: [(0, '192.716')] [2023-03-07 16:32:35,184][232226] Updated weights for policy 0, policy_version 15060 (0.0007) [2023-03-07 16:32:35,942][232226] Updated weights for policy 0, policy_version 15070 (0.0006) [2023-03-07 16:32:36,773][232226] Updated weights for policy 0, policy_version 15080 (0.0006) [2023-03-07 16:32:37,546][232226] Updated weights for policy 0, policy_version 15090 (0.0006) [2023-03-07 16:32:38,334][232226] Updated weights for policy 0, policy_version 15100 (0.0006) [2023-03-07 16:32:39,134][232226] Updated weights for policy 0, policy_version 15110 (0.0005) [2023-03-07 16:32:39,934][232226] Updated weights for policy 0, policy_version 15120 (0.0006) [2023-03-07 16:32:40,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12898.9). Total num frames: 15483904. Throughput: 0: 12900.2. Samples: 15474842. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 16:32:40,070][231894] Avg episode reward: [(0, '195.928')] [2023-03-07 16:32:40,736][232226] Updated weights for policy 0, policy_version 15130 (0.0006) [2023-03-07 16:32:41,520][232226] Updated weights for policy 0, policy_version 15140 (0.0006) [2023-03-07 16:32:42,312][232226] Updated weights for policy 0, policy_version 15150 (0.0006) [2023-03-07 16:32:43,101][232226] Updated weights for policy 0, policy_version 15160 (0.0007) [2023-03-07 16:32:43,888][232226] Updated weights for policy 0, policy_version 15170 (0.0006) [2023-03-07 16:32:44,685][232226] Updated weights for policy 0, policy_version 15180 (0.0007) [2023-03-07 16:32:45,069][231894] Fps is (10 sec: 13004.7, 60 sec: 12919.5, 300 sec: 12902.4). Total num frames: 15549440. Throughput: 0: 12905.5. Samples: 15513631. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 16:32:45,069][231894] Avg episode reward: [(0, '191.447')] [2023-03-07 16:32:45,461][232226] Updated weights for policy 0, policy_version 15190 (0.0006) [2023-03-07 16:32:46,277][232226] Updated weights for policy 0, policy_version 15200 (0.0007) [2023-03-07 16:32:47,067][232226] Updated weights for policy 0, policy_version 15210 (0.0006) [2023-03-07 16:32:47,861][232226] Updated weights for policy 0, policy_version 15220 (0.0006) [2023-03-07 16:32:48,665][232226] Updated weights for policy 0, policy_version 15230 (0.0007) [2023-03-07 16:32:49,451][232226] Updated weights for policy 0, policy_version 15240 (0.0006) [2023-03-07 16:32:50,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 15612928. Throughput: 0: 12901.0. Samples: 15591021. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:32:50,069][231894] Avg episode reward: [(0, '188.504')] [2023-03-07 16:32:50,232][232226] Updated weights for policy 0, policy_version 15250 (0.0007) [2023-03-07 16:32:51,021][232226] Updated weights for policy 0, policy_version 15260 (0.0006) [2023-03-07 16:32:51,808][232226] Updated weights for policy 0, policy_version 15270 (0.0006) [2023-03-07 16:32:52,604][232226] Updated weights for policy 0, policy_version 15280 (0.0006) [2023-03-07 16:32:53,407][232226] Updated weights for policy 0, policy_version 15290 (0.0007) [2023-03-07 16:32:54,187][232226] Updated weights for policy 0, policy_version 15300 (0.0007) [2023-03-07 16:32:54,983][232226] Updated weights for policy 0, policy_version 15310 (0.0006) [2023-03-07 16:32:55,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12919.4, 300 sec: 12902.4). Total num frames: 15678464. Throughput: 0: 12912.4. Samples: 15668675. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:32:55,069][231894] Avg episode reward: [(0, '187.957')] [2023-03-07 16:32:55,764][232226] Updated weights for policy 0, policy_version 15320 (0.0006) [2023-03-07 16:32:56,554][232226] Updated weights for policy 0, policy_version 15330 (0.0007) [2023-03-07 16:32:57,346][232226] Updated weights for policy 0, policy_version 15340 (0.0006) [2023-03-07 16:32:58,159][232226] Updated weights for policy 0, policy_version 15350 (0.0006) [2023-03-07 16:32:58,937][232226] Updated weights for policy 0, policy_version 15360 (0.0007) [2023-03-07 16:32:59,739][232226] Updated weights for policy 0, policy_version 15370 (0.0006) [2023-03-07 16:33:00,069][231894] Fps is (10 sec: 13005.0, 60 sec: 12919.5, 300 sec: 12902.4). Total num frames: 15742976. Throughput: 0: 12912.6. Samples: 15707508. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:33:00,069][231894] Avg episode reward: [(0, '191.747')] [2023-03-07 16:33:00,543][232226] Updated weights for policy 0, policy_version 15380 (0.0007) [2023-03-07 16:33:01,332][232226] Updated weights for policy 0, policy_version 15390 (0.0007) [2023-03-07 16:33:02,134][232226] Updated weights for policy 0, policy_version 15400 (0.0007) [2023-03-07 16:33:02,929][232226] Updated weights for policy 0, policy_version 15410 (0.0006) [2023-03-07 16:33:03,718][232226] Updated weights for policy 0, policy_version 15420 (0.0007) [2023-03-07 16:33:04,520][232226] Updated weights for policy 0, policy_version 15430 (0.0006) [2023-03-07 16:33:05,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 15807488. Throughput: 0: 12916.2. Samples: 15784863. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:33:05,069][231894] Avg episode reward: [(0, '194.224')] [2023-03-07 16:33:05,309][232226] Updated weights for policy 0, policy_version 15440 (0.0006) [2023-03-07 16:33:06,103][232226] Updated weights for policy 0, policy_version 15450 (0.0006) [2023-03-07 16:33:06,888][232226] Updated weights for policy 0, policy_version 15460 (0.0007) [2023-03-07 16:33:07,682][232226] Updated weights for policy 0, policy_version 15470 (0.0006) [2023-03-07 16:33:08,473][232226] Updated weights for policy 0, policy_version 15480 (0.0006) [2023-03-07 16:33:09,285][232226] Updated weights for policy 0, policy_version 15490 (0.0007) [2023-03-07 16:33:10,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12919.5, 300 sec: 12902.4). Total num frames: 15872000. Throughput: 0: 12915.8. Samples: 15862269. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:33:10,069][231894] Avg episode reward: [(0, '192.445')] [2023-03-07 16:33:10,071][232226] Updated weights for policy 0, policy_version 15500 (0.0006) [2023-03-07 16:33:10,859][232226] Updated weights for policy 0, policy_version 15510 (0.0006) [2023-03-07 16:33:11,666][232226] Updated weights for policy 0, policy_version 15520 (0.0007) [2023-03-07 16:33:12,443][232226] Updated weights for policy 0, policy_version 15530 (0.0006) [2023-03-07 16:33:13,251][232226] Updated weights for policy 0, policy_version 15540 (0.0007) [2023-03-07 16:33:14,032][232226] Updated weights for policy 0, policy_version 15550 (0.0006) [2023-03-07 16:33:14,827][232226] Updated weights for policy 0, policy_version 15560 (0.0006) [2023-03-07 16:33:15,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12919.5, 300 sec: 12902.4). Total num frames: 15936512. Throughput: 0: 12913.2. Samples: 15900999. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:33:15,069][231894] Avg episode reward: [(0, '193.946')] [2023-03-07 16:33:15,640][232226] Updated weights for policy 0, policy_version 15570 (0.0007) [2023-03-07 16:33:16,438][232226] Updated weights for policy 0, policy_version 15580 (0.0006) [2023-03-07 16:33:17,213][232226] Updated weights for policy 0, policy_version 15590 (0.0006) [2023-03-07 16:33:18,021][232226] Updated weights for policy 0, policy_version 15600 (0.0006) [2023-03-07 16:33:18,811][232226] Updated weights for policy 0, policy_version 15610 (0.0006) [2023-03-07 16:33:19,585][232226] Updated weights for policy 0, policy_version 15620 (0.0006) [2023-03-07 16:33:20,069][231894] Fps is (10 sec: 12902.2, 60 sec: 12919.5, 300 sec: 12902.4). Total num frames: 16001024. Throughput: 0: 12904.4. Samples: 15978115. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:33:20,069][231894] Avg episode reward: [(0, '193.597')] [2023-03-07 16:33:20,384][232226] Updated weights for policy 0, policy_version 15630 (0.0007) [2023-03-07 16:33:21,179][232226] Updated weights for policy 0, policy_version 15640 (0.0006) [2023-03-07 16:33:21,979][232226] Updated weights for policy 0, policy_version 15650 (0.0007) [2023-03-07 16:33:22,776][232226] Updated weights for policy 0, policy_version 15660 (0.0007) [2023-03-07 16:33:23,583][232226] Updated weights for policy 0, policy_version 15670 (0.0007) [2023-03-07 16:33:24,369][232226] Updated weights for policy 0, policy_version 15680 (0.0006) [2023-03-07 16:33:25,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 16064512. Throughput: 0: 12904.1. Samples: 16055527. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:33:25,070][231894] Avg episode reward: [(0, '194.003')] [2023-03-07 16:33:25,173][232226] Updated weights for policy 0, policy_version 15690 (0.0007) [2023-03-07 16:33:25,968][232226] Updated weights for policy 0, policy_version 15700 (0.0006) [2023-03-07 16:33:26,761][232226] Updated weights for policy 0, policy_version 15710 (0.0006) [2023-03-07 16:33:27,537][232226] Updated weights for policy 0, policy_version 15720 (0.0006) [2023-03-07 16:33:28,339][232226] Updated weights for policy 0, policy_version 15730 (0.0007) [2023-03-07 16:33:29,121][232226] Updated weights for policy 0, policy_version 15740 (0.0006) [2023-03-07 16:33:29,939][232226] Updated weights for policy 0, policy_version 15750 (0.0006) [2023-03-07 16:33:30,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12902.4, 300 sec: 12895.5). Total num frames: 16129024. Throughput: 0: 12901.7. Samples: 16094207. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:33:30,069][231894] Avg episode reward: [(0, '192.191')] [2023-03-07 16:33:30,730][232226] Updated weights for policy 0, policy_version 15760 (0.0006) [2023-03-07 16:33:31,507][232226] Updated weights for policy 0, policy_version 15770 (0.0006) [2023-03-07 16:33:32,322][232226] Updated weights for policy 0, policy_version 15780 (0.0007) [2023-03-07 16:33:33,085][232226] Updated weights for policy 0, policy_version 15790 (0.0006) [2023-03-07 16:33:33,909][232226] Updated weights for policy 0, policy_version 15800 (0.0006) [2023-03-07 16:33:34,702][232226] Updated weights for policy 0, policy_version 15810 (0.0006) [2023-03-07 16:33:35,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12902.4, 300 sec: 12895.5). Total num frames: 16193536. Throughput: 0: 12901.3. Samples: 16171578. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 16:33:35,069][231894] Avg episode reward: [(0, '195.231')] [2023-03-07 16:33:35,520][232226] Updated weights for policy 0, policy_version 15820 (0.0005) [2023-03-07 16:33:36,313][232226] Updated weights for policy 0, policy_version 15830 (0.0006) [2023-03-07 16:33:37,087][232226] Updated weights for policy 0, policy_version 15840 (0.0006) [2023-03-07 16:33:37,893][232226] Updated weights for policy 0, policy_version 15850 (0.0006) [2023-03-07 16:33:38,682][232226] Updated weights for policy 0, policy_version 15860 (0.0007) [2023-03-07 16:33:39,466][232226] Updated weights for policy 0, policy_version 15870 (0.0006) [2023-03-07 16:33:40,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 16258048. Throughput: 0: 12892.8. Samples: 16248848. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 16:33:40,069][231894] Avg episode reward: [(0, '193.262')] [2023-03-07 16:33:40,269][232226] Updated weights for policy 0, policy_version 15880 (0.0007) [2023-03-07 16:33:41,064][232226] Updated weights for policy 0, policy_version 15890 (0.0007) [2023-03-07 16:33:41,837][232226] Updated weights for policy 0, policy_version 15900 (0.0006) [2023-03-07 16:33:42,663][232226] Updated weights for policy 0, policy_version 15910 (0.0007) [2023-03-07 16:33:43,453][232226] Updated weights for policy 0, policy_version 15920 (0.0007) [2023-03-07 16:33:44,226][232226] Updated weights for policy 0, policy_version 15930 (0.0006) [2023-03-07 16:33:45,022][232226] Updated weights for policy 0, policy_version 15940 (0.0006) [2023-03-07 16:33:45,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12898.9). Total num frames: 16322560. Throughput: 0: 12887.6. Samples: 16287451. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:33:45,069][231894] Avg episode reward: [(0, '197.104')] [2023-03-07 16:33:45,817][232226] Updated weights for policy 0, policy_version 15950 (0.0006) [2023-03-07 16:33:46,603][232226] Updated weights for policy 0, policy_version 15960 (0.0006) [2023-03-07 16:33:47,410][232226] Updated weights for policy 0, policy_version 15970 (0.0006) [2023-03-07 16:33:48,209][232226] Updated weights for policy 0, policy_version 15980 (0.0006) [2023-03-07 16:33:49,009][232226] Updated weights for policy 0, policy_version 15990 (0.0006) [2023-03-07 16:33:49,807][232226] Updated weights for policy 0, policy_version 16000 (0.0007) [2023-03-07 16:33:50,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12895.5). Total num frames: 16387072. Throughput: 0: 12888.2. Samples: 16364835. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:33:50,069][231894] Avg episode reward: [(0, '194.103')] [2023-03-07 16:33:50,597][232226] Updated weights for policy 0, policy_version 16010 (0.0006) [2023-03-07 16:33:51,395][232226] Updated weights for policy 0, policy_version 16020 (0.0006) [2023-03-07 16:33:52,175][232226] Updated weights for policy 0, policy_version 16030 (0.0006) [2023-03-07 16:33:52,986][232226] Updated weights for policy 0, policy_version 16040 (0.0006) [2023-03-07 16:33:53,773][232226] Updated weights for policy 0, policy_version 16050 (0.0006) [2023-03-07 16:33:54,566][232226] Updated weights for policy 0, policy_version 16060 (0.0006) [2023-03-07 16:33:55,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12895.5). Total num frames: 16451584. Throughput: 0: 12889.0. Samples: 16442275. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:33:55,069][231894] Avg episode reward: [(0, '193.908')] [2023-03-07 16:33:55,370][232226] Updated weights for policy 0, policy_version 16070 (0.0006) [2023-03-07 16:33:56,155][232226] Updated weights for policy 0, policy_version 16080 (0.0006) [2023-03-07 16:33:56,946][232226] Updated weights for policy 0, policy_version 16090 (0.0006) [2023-03-07 16:33:57,745][232226] Updated weights for policy 0, policy_version 16100 (0.0006) [2023-03-07 16:33:58,526][232226] Updated weights for policy 0, policy_version 16110 (0.0006) [2023-03-07 16:33:59,314][232226] Updated weights for policy 0, policy_version 16120 (0.0006) [2023-03-07 16:34:00,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12895.5). Total num frames: 16516096. Throughput: 0: 12886.0. Samples: 16480869. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:34:00,069][231894] Avg episode reward: [(0, '191.648')] [2023-03-07 16:34:00,122][232226] Updated weights for policy 0, policy_version 16130 (0.0006) [2023-03-07 16:34:00,921][232226] Updated weights for policy 0, policy_version 16140 (0.0006) [2023-03-07 16:34:01,716][232226] Updated weights for policy 0, policy_version 16150 (0.0006) [2023-03-07 16:34:02,508][232226] Updated weights for policy 0, policy_version 16160 (0.0006) [2023-03-07 16:34:03,314][232226] Updated weights for policy 0, policy_version 16170 (0.0006) [2023-03-07 16:34:04,099][232226] Updated weights for policy 0, policy_version 16180 (0.0007) [2023-03-07 16:34:04,890][232226] Updated weights for policy 0, policy_version 16190 (0.0006) [2023-03-07 16:34:05,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12895.5). Total num frames: 16580608. Throughput: 0: 12888.9. Samples: 16558113. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:34:05,069][231894] Avg episode reward: [(0, '184.542')] [2023-03-07 16:34:05,696][232226] Updated weights for policy 0, policy_version 16200 (0.0007) [2023-03-07 16:34:06,474][232226] Updated weights for policy 0, policy_version 16210 (0.0006) [2023-03-07 16:34:07,281][232226] Updated weights for policy 0, policy_version 16220 (0.0006) [2023-03-07 16:34:08,067][232226] Updated weights for policy 0, policy_version 16230 (0.0006) [2023-03-07 16:34:08,846][232226] Updated weights for policy 0, policy_version 16240 (0.0006) [2023-03-07 16:34:09,636][232226] Updated weights for policy 0, policy_version 16250 (0.0006) [2023-03-07 16:34:10,069][231894] Fps is (10 sec: 12902.2, 60 sec: 12885.3, 300 sec: 12898.9). Total num frames: 16645120. Throughput: 0: 12893.9. Samples: 16635751. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:34:10,070][231894] Avg episode reward: [(0, '191.318')] [2023-03-07 16:34:10,447][232226] Updated weights for policy 0, policy_version 16260 (0.0007) [2023-03-07 16:34:11,234][232226] Updated weights for policy 0, policy_version 16270 (0.0006) [2023-03-07 16:34:12,023][232226] Updated weights for policy 0, policy_version 16280 (0.0006) [2023-03-07 16:34:12,818][232226] Updated weights for policy 0, policy_version 16290 (0.0007) [2023-03-07 16:34:13,595][232226] Updated weights for policy 0, policy_version 16300 (0.0006) [2023-03-07 16:34:14,389][232226] Updated weights for policy 0, policy_version 16310 (0.0007) [2023-03-07 16:34:15,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12895.5). Total num frames: 16709632. Throughput: 0: 12895.0. Samples: 16674482. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:34:15,069][231894] Avg episode reward: [(0, '190.786')] [2023-03-07 16:34:15,208][232226] Updated weights for policy 0, policy_version 16320 (0.0006) [2023-03-07 16:34:15,997][232226] Updated weights for policy 0, policy_version 16330 (0.0006) [2023-03-07 16:34:16,781][232226] Updated weights for policy 0, policy_version 16340 (0.0006) [2023-03-07 16:34:17,577][232226] Updated weights for policy 0, policy_version 16350 (0.0006) [2023-03-07 16:34:18,358][232226] Updated weights for policy 0, policy_version 16360 (0.0006) [2023-03-07 16:34:19,160][232226] Updated weights for policy 0, policy_version 16370 (0.0006) [2023-03-07 16:34:19,961][232226] Updated weights for policy 0, policy_version 16380 (0.0006) [2023-03-07 16:34:20,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12895.5). Total num frames: 16774144. Throughput: 0: 12895.3. Samples: 16751869. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:34:20,069][231894] Avg episode reward: [(0, '189.708')] [2023-03-07 16:34:20,748][232226] Updated weights for policy 0, policy_version 16390 (0.0006) [2023-03-07 16:34:21,542][232226] Updated weights for policy 0, policy_version 16400 (0.0006) [2023-03-07 16:34:22,324][232226] Updated weights for policy 0, policy_version 16410 (0.0006) [2023-03-07 16:34:23,111][232226] Updated weights for policy 0, policy_version 16420 (0.0006) [2023-03-07 16:34:23,899][232226] Updated weights for policy 0, policy_version 16430 (0.0007) [2023-03-07 16:34:24,711][232226] Updated weights for policy 0, policy_version 16440 (0.0006) [2023-03-07 16:34:25,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12895.5). Total num frames: 16838656. Throughput: 0: 12902.1. Samples: 16829443. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:34:25,069][231894] Avg episode reward: [(0, '191.666')] [2023-03-07 16:34:25,072][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000016444_16838656.pth... [2023-03-07 16:34:25,103][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000013421_13743104.pth [2023-03-07 16:34:25,505][232226] Updated weights for policy 0, policy_version 16450 (0.0006) [2023-03-07 16:34:26,301][232226] Updated weights for policy 0, policy_version 16460 (0.0006) [2023-03-07 16:34:27,110][232226] Updated weights for policy 0, policy_version 16470 (0.0006) [2023-03-07 16:34:27,921][232226] Updated weights for policy 0, policy_version 16480 (0.0006) [2023-03-07 16:34:28,705][232226] Updated weights for policy 0, policy_version 16490 (0.0006) [2023-03-07 16:34:29,501][232226] Updated weights for policy 0, policy_version 16500 (0.0006) [2023-03-07 16:34:30,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12895.5). Total num frames: 16903168. Throughput: 0: 12899.1. Samples: 16867909. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:34:30,069][231894] Avg episode reward: [(0, '190.780')] [2023-03-07 16:34:30,297][232226] Updated weights for policy 0, policy_version 16510 (0.0007) [2023-03-07 16:34:31,094][232226] Updated weights for policy 0, policy_version 16520 (0.0007) [2023-03-07 16:34:31,885][232226] Updated weights for policy 0, policy_version 16530 (0.0006) [2023-03-07 16:34:32,682][232226] Updated weights for policy 0, policy_version 16540 (0.0007) [2023-03-07 16:34:33,466][232226] Updated weights for policy 0, policy_version 16550 (0.0007) [2023-03-07 16:34:34,280][232226] Updated weights for policy 0, policy_version 16560 (0.0007) [2023-03-07 16:34:35,058][232226] Updated weights for policy 0, policy_version 16570 (0.0006) [2023-03-07 16:34:35,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12895.5). Total num frames: 16967680. Throughput: 0: 12897.2. Samples: 16945208. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:34:35,070][231894] Avg episode reward: [(0, '193.672')] [2023-03-07 16:34:35,875][232226] Updated weights for policy 0, policy_version 16580 (0.0006) [2023-03-07 16:34:36,664][232226] Updated weights for policy 0, policy_version 16590 (0.0007) [2023-03-07 16:34:37,453][232226] Updated weights for policy 0, policy_version 16600 (0.0006) [2023-03-07 16:34:38,237][232226] Updated weights for policy 0, policy_version 16610 (0.0006) [2023-03-07 16:34:39,051][232226] Updated weights for policy 0, policy_version 16620 (0.0006) [2023-03-07 16:34:39,843][232226] Updated weights for policy 0, policy_version 16630 (0.0007) [2023-03-07 16:34:40,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12885.3, 300 sec: 12892.0). Total num frames: 17031168. Throughput: 0: 12886.4. Samples: 17022164. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:34:40,069][231894] Avg episode reward: [(0, '194.019')] [2023-03-07 16:34:40,633][232226] Updated weights for policy 0, policy_version 16640 (0.0006) [2023-03-07 16:34:41,423][232226] Updated weights for policy 0, policy_version 16650 (0.0006) [2023-03-07 16:34:42,224][232226] Updated weights for policy 0, policy_version 16660 (0.0006) [2023-03-07 16:34:43,019][232226] Updated weights for policy 0, policy_version 16670 (0.0006) [2023-03-07 16:34:43,802][232226] Updated weights for policy 0, policy_version 16680 (0.0006) [2023-03-07 16:34:44,610][232226] Updated weights for policy 0, policy_version 16690 (0.0006) [2023-03-07 16:34:45,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12895.5). Total num frames: 17096704. Throughput: 0: 12891.4. Samples: 17060980. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:34:45,070][231894] Avg episode reward: [(0, '192.295')] [2023-03-07 16:34:45,386][232226] Updated weights for policy 0, policy_version 16700 (0.0006) [2023-03-07 16:34:46,174][232226] Updated weights for policy 0, policy_version 16710 (0.0006) [2023-03-07 16:34:46,998][232226] Updated weights for policy 0, policy_version 16720 (0.0006) [2023-03-07 16:34:47,783][232226] Updated weights for policy 0, policy_version 16730 (0.0006) [2023-03-07 16:34:48,585][232226] Updated weights for policy 0, policy_version 16740 (0.0006) [2023-03-07 16:34:49,369][232226] Updated weights for policy 0, policy_version 16750 (0.0006) [2023-03-07 16:34:50,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12895.5). Total num frames: 17160192. Throughput: 0: 12890.6. Samples: 17138190. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:34:50,069][231894] Avg episode reward: [(0, '196.135')] [2023-03-07 16:34:50,161][232226] Updated weights for policy 0, policy_version 16760 (0.0007) [2023-03-07 16:34:50,941][232226] Updated weights for policy 0, policy_version 16770 (0.0007) [2023-03-07 16:34:51,738][232226] Updated weights for policy 0, policy_version 16780 (0.0006) [2023-03-07 16:34:52,541][232226] Updated weights for policy 0, policy_version 16790 (0.0006) [2023-03-07 16:34:53,324][232226] Updated weights for policy 0, policy_version 16800 (0.0006) [2023-03-07 16:34:54,109][232226] Updated weights for policy 0, policy_version 16810 (0.0007) [2023-03-07 16:34:54,918][232226] Updated weights for policy 0, policy_version 16820 (0.0006) [2023-03-07 16:34:55,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12885.3, 300 sec: 12892.0). Total num frames: 17224704. Throughput: 0: 12891.5. Samples: 17215867. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:34:55,069][231894] Avg episode reward: [(0, '190.662')] [2023-03-07 16:34:55,705][232226] Updated weights for policy 0, policy_version 16830 (0.0006) [2023-03-07 16:34:56,503][232226] Updated weights for policy 0, policy_version 16840 (0.0006) [2023-03-07 16:34:57,300][232226] Updated weights for policy 0, policy_version 16850 (0.0007) [2023-03-07 16:34:58,089][232226] Updated weights for policy 0, policy_version 16860 (0.0007) [2023-03-07 16:34:58,872][232226] Updated weights for policy 0, policy_version 16870 (0.0006) [2023-03-07 16:34:59,683][232226] Updated weights for policy 0, policy_version 16880 (0.0007) [2023-03-07 16:35:00,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12895.5). Total num frames: 17289216. Throughput: 0: 12888.9. Samples: 17254483. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:35:00,070][231894] Avg episode reward: [(0, '189.005')] [2023-03-07 16:35:00,477][232226] Updated weights for policy 0, policy_version 16890 (0.0006) [2023-03-07 16:35:01,265][232226] Updated weights for policy 0, policy_version 16900 (0.0006) [2023-03-07 16:35:02,077][232226] Updated weights for policy 0, policy_version 16910 (0.0006) [2023-03-07 16:35:02,893][232226] Updated weights for policy 0, policy_version 16920 (0.0006) [2023-03-07 16:35:03,674][232226] Updated weights for policy 0, policy_version 16930 (0.0006) [2023-03-07 16:35:04,475][232226] Updated weights for policy 0, policy_version 16940 (0.0007) [2023-03-07 16:35:05,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12895.5). Total num frames: 17353728. Throughput: 0: 12881.2. Samples: 17331522. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:35:05,069][231894] Avg episode reward: [(0, '189.723')] [2023-03-07 16:35:05,264][232226] Updated weights for policy 0, policy_version 16950 (0.0007) [2023-03-07 16:35:06,059][232226] Updated weights for policy 0, policy_version 16960 (0.0007) [2023-03-07 16:35:06,872][232226] Updated weights for policy 0, policy_version 16970 (0.0007) [2023-03-07 16:35:07,673][232226] Updated weights for policy 0, policy_version 16980 (0.0006) [2023-03-07 16:35:08,474][232226] Updated weights for policy 0, policy_version 16990 (0.0006) [2023-03-07 16:35:09,278][232226] Updated weights for policy 0, policy_version 17000 (0.0006) [2023-03-07 16:35:10,061][232226] Updated weights for policy 0, policy_version 17010 (0.0007) [2023-03-07 16:35:10,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.4, 300 sec: 12895.5). Total num frames: 17418240. Throughput: 0: 12870.4. Samples: 17408610. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:35:10,069][231894] Avg episode reward: [(0, '197.560')] [2023-03-07 16:35:10,837][232226] Updated weights for policy 0, policy_version 17020 (0.0005) [2023-03-07 16:35:11,644][232226] Updated weights for policy 0, policy_version 17030 (0.0007) [2023-03-07 16:35:12,444][232226] Updated weights for policy 0, policy_version 17040 (0.0006) [2023-03-07 16:35:13,228][232226] Updated weights for policy 0, policy_version 17050 (0.0007) [2023-03-07 16:35:14,025][232226] Updated weights for policy 0, policy_version 17060 (0.0006) [2023-03-07 16:35:14,825][232226] Updated weights for policy 0, policy_version 17070 (0.0007) [2023-03-07 16:35:15,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12895.5). Total num frames: 17482752. Throughput: 0: 12874.7. Samples: 17447270. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:35:15,069][231894] Avg episode reward: [(0, '184.074')] [2023-03-07 16:35:15,638][232226] Updated weights for policy 0, policy_version 17080 (0.0007) [2023-03-07 16:35:16,407][232226] Updated weights for policy 0, policy_version 17090 (0.0007) [2023-03-07 16:35:17,222][232226] Updated weights for policy 0, policy_version 17100 (0.0007) [2023-03-07 16:35:18,025][232226] Updated weights for policy 0, policy_version 17110 (0.0006) [2023-03-07 16:35:18,801][232226] Updated weights for policy 0, policy_version 17120 (0.0006) [2023-03-07 16:35:19,601][232226] Updated weights for policy 0, policy_version 17130 (0.0005) [2023-03-07 16:35:20,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12892.0). Total num frames: 17546240. Throughput: 0: 12871.2. Samples: 17524413. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:35:20,069][231894] Avg episode reward: [(0, '193.650')] [2023-03-07 16:35:20,393][232226] Updated weights for policy 0, policy_version 17140 (0.0006) [2023-03-07 16:35:21,194][232226] Updated weights for policy 0, policy_version 17150 (0.0006) [2023-03-07 16:35:21,982][232226] Updated weights for policy 0, policy_version 17160 (0.0007) [2023-03-07 16:35:22,777][232226] Updated weights for policy 0, policy_version 17170 (0.0006) [2023-03-07 16:35:23,574][232226] Updated weights for policy 0, policy_version 17180 (0.0006) [2023-03-07 16:35:24,365][232226] Updated weights for policy 0, policy_version 17190 (0.0006) [2023-03-07 16:35:25,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12892.0). Total num frames: 17610752. Throughput: 0: 12878.1. Samples: 17601678. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:35:25,069][231894] Avg episode reward: [(0, '193.584')] [2023-03-07 16:35:25,158][232226] Updated weights for policy 0, policy_version 17200 (0.0006) [2023-03-07 16:35:25,927][232226] Updated weights for policy 0, policy_version 17210 (0.0006) [2023-03-07 16:35:26,746][232226] Updated weights for policy 0, policy_version 17220 (0.0007) [2023-03-07 16:35:27,522][232226] Updated weights for policy 0, policy_version 17230 (0.0006) [2023-03-07 16:35:28,320][232226] Updated weights for policy 0, policy_version 17240 (0.0006) [2023-03-07 16:35:29,132][232226] Updated weights for policy 0, policy_version 17250 (0.0007) [2023-03-07 16:35:29,922][232226] Updated weights for policy 0, policy_version 17260 (0.0007) [2023-03-07 16:35:30,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12892.0). Total num frames: 17675264. Throughput: 0: 12879.4. Samples: 17640553. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:35:30,069][231894] Avg episode reward: [(0, '197.255')] [2023-03-07 16:35:30,730][232226] Updated weights for policy 0, policy_version 17270 (0.0006) [2023-03-07 16:35:31,537][232226] Updated weights for policy 0, policy_version 17280 (0.0006) [2023-03-07 16:35:32,327][232226] Updated weights for policy 0, policy_version 17290 (0.0007) [2023-03-07 16:35:33,105][232226] Updated weights for policy 0, policy_version 17300 (0.0006) [2023-03-07 16:35:33,905][232226] Updated weights for policy 0, policy_version 17310 (0.0006) [2023-03-07 16:35:34,693][232226] Updated weights for policy 0, policy_version 17320 (0.0007) [2023-03-07 16:35:35,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12892.0). Total num frames: 17739776. Throughput: 0: 12882.0. Samples: 17717882. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:35:35,069][231894] Avg episode reward: [(0, '195.485')] [2023-03-07 16:35:35,470][232226] Updated weights for policy 0, policy_version 17330 (0.0006) [2023-03-07 16:35:36,280][232226] Updated weights for policy 0, policy_version 17340 (0.0007) [2023-03-07 16:35:37,064][232226] Updated weights for policy 0, policy_version 17350 (0.0006) [2023-03-07 16:35:37,851][232226] Updated weights for policy 0, policy_version 17360 (0.0007) [2023-03-07 16:35:38,648][232226] Updated weights for policy 0, policy_version 17370 (0.0005) [2023-03-07 16:35:39,446][232226] Updated weights for policy 0, policy_version 17380 (0.0005) [2023-03-07 16:35:40,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12892.0). Total num frames: 17804288. Throughput: 0: 12877.4. Samples: 17795352. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:35:40,069][231894] Avg episode reward: [(0, '189.786')] [2023-03-07 16:35:40,238][232226] Updated weights for policy 0, policy_version 17390 (0.0006) [2023-03-07 16:35:41,033][232226] Updated weights for policy 0, policy_version 17400 (0.0006) [2023-03-07 16:35:41,806][232226] Updated weights for policy 0, policy_version 17410 (0.0006) [2023-03-07 16:35:42,607][232226] Updated weights for policy 0, policy_version 17420 (0.0006) [2023-03-07 16:35:43,370][232226] Updated weights for policy 0, policy_version 17430 (0.0006) [2023-03-07 16:35:44,173][232226] Updated weights for policy 0, policy_version 17440 (0.0006) [2023-03-07 16:35:44,970][232226] Updated weights for policy 0, policy_version 17450 (0.0006) [2023-03-07 16:35:45,069][231894] Fps is (10 sec: 13004.8, 60 sec: 12885.3, 300 sec: 12895.5). Total num frames: 17869824. Throughput: 0: 12885.0. Samples: 17834307. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:35:45,069][231894] Avg episode reward: [(0, '194.851')] [2023-03-07 16:35:45,769][232226] Updated weights for policy 0, policy_version 17460 (0.0006) [2023-03-07 16:35:46,550][232226] Updated weights for policy 0, policy_version 17470 (0.0007) [2023-03-07 16:35:47,354][232226] Updated weights for policy 0, policy_version 17480 (0.0006) [2023-03-07 16:35:48,133][232226] Updated weights for policy 0, policy_version 17490 (0.0007) [2023-03-07 16:35:48,940][232226] Updated weights for policy 0, policy_version 17500 (0.0007) [2023-03-07 16:35:49,721][232226] Updated weights for policy 0, policy_version 17510 (0.0006) [2023-03-07 16:35:50,069][231894] Fps is (10 sec: 13004.9, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 17934336. Throughput: 0: 12898.2. Samples: 17911940. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:35:50,069][231894] Avg episode reward: [(0, '194.676')] [2023-03-07 16:35:50,518][232226] Updated weights for policy 0, policy_version 17520 (0.0006) [2023-03-07 16:35:51,313][232226] Updated weights for policy 0, policy_version 17530 (0.0006) [2023-03-07 16:35:52,109][232226] Updated weights for policy 0, policy_version 17540 (0.0006) [2023-03-07 16:35:52,893][232226] Updated weights for policy 0, policy_version 17550 (0.0007) [2023-03-07 16:35:53,697][232226] Updated weights for policy 0, policy_version 17560 (0.0006) [2023-03-07 16:35:54,478][232226] Updated weights for policy 0, policy_version 17570 (0.0006) [2023-03-07 16:35:55,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 17998848. Throughput: 0: 12906.6. Samples: 17989405. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:35:55,069][231894] Avg episode reward: [(0, '194.698')] [2023-03-07 16:35:55,289][232226] Updated weights for policy 0, policy_version 17580 (0.0006) [2023-03-07 16:35:56,086][232226] Updated weights for policy 0, policy_version 17590 (0.0007) [2023-03-07 16:35:56,861][232226] Updated weights for policy 0, policy_version 17600 (0.0006) [2023-03-07 16:35:57,651][232226] Updated weights for policy 0, policy_version 17610 (0.0006) [2023-03-07 16:35:58,431][232226] Updated weights for policy 0, policy_version 17620 (0.0006) [2023-03-07 16:35:59,214][232226] Updated weights for policy 0, policy_version 17630 (0.0006) [2023-03-07 16:36:00,021][232226] Updated weights for policy 0, policy_version 17640 (0.0006) [2023-03-07 16:36:00,069][231894] Fps is (10 sec: 12902.2, 60 sec: 12902.4, 300 sec: 12895.5). Total num frames: 18063360. Throughput: 0: 12909.4. Samples: 18028195. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:36:00,069][231894] Avg episode reward: [(0, '192.929')] [2023-03-07 16:36:00,811][232226] Updated weights for policy 0, policy_version 17650 (0.0006) [2023-03-07 16:36:01,627][232226] Updated weights for policy 0, policy_version 17660 (0.0006) [2023-03-07 16:36:02,413][232226] Updated weights for policy 0, policy_version 17670 (0.0006) [2023-03-07 16:36:03,215][232226] Updated weights for policy 0, policy_version 17680 (0.0007) [2023-03-07 16:36:04,011][232226] Updated weights for policy 0, policy_version 17690 (0.0007) [2023-03-07 16:36:04,803][232226] Updated weights for policy 0, policy_version 17700 (0.0007) [2023-03-07 16:36:05,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12895.5). Total num frames: 18127872. Throughput: 0: 12915.5. Samples: 18105612. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:36:05,069][231894] Avg episode reward: [(0, '182.220')] [2023-03-07 16:36:05,573][232226] Updated weights for policy 0, policy_version 17710 (0.0007) [2023-03-07 16:36:06,375][232226] Updated weights for policy 0, policy_version 17720 (0.0007) [2023-03-07 16:36:07,169][232226] Updated weights for policy 0, policy_version 17730 (0.0006) [2023-03-07 16:36:07,970][232226] Updated weights for policy 0, policy_version 17740 (0.0006) [2023-03-07 16:36:08,788][232226] Updated weights for policy 0, policy_version 17750 (0.0007) [2023-03-07 16:36:09,581][232226] Updated weights for policy 0, policy_version 17760 (0.0007) [2023-03-07 16:36:10,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12902.4, 300 sec: 12895.5). Total num frames: 18192384. Throughput: 0: 12913.9. Samples: 18182804. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:36:10,069][231894] Avg episode reward: [(0, '196.972')] [2023-03-07 16:36:10,379][232226] Updated weights for policy 0, policy_version 17770 (0.0006) [2023-03-07 16:36:11,184][232226] Updated weights for policy 0, policy_version 17780 (0.0005) [2023-03-07 16:36:11,984][232226] Updated weights for policy 0, policy_version 17790 (0.0006) [2023-03-07 16:36:12,772][232226] Updated weights for policy 0, policy_version 17800 (0.0007) [2023-03-07 16:36:13,572][232226] Updated weights for policy 0, policy_version 17810 (0.0006) [2023-03-07 16:36:14,352][232226] Updated weights for policy 0, policy_version 17820 (0.0006) [2023-03-07 16:36:15,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12895.5). Total num frames: 18256896. Throughput: 0: 12903.0. Samples: 18221188. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:36:15,070][231894] Avg episode reward: [(0, '187.323')] [2023-03-07 16:36:15,143][232226] Updated weights for policy 0, policy_version 17830 (0.0007) [2023-03-07 16:36:15,950][232226] Updated weights for policy 0, policy_version 17840 (0.0007) [2023-03-07 16:36:16,747][232226] Updated weights for policy 0, policy_version 17850 (0.0006) [2023-03-07 16:36:17,530][232226] Updated weights for policy 0, policy_version 17860 (0.0007) [2023-03-07 16:36:18,344][232226] Updated weights for policy 0, policy_version 17870 (0.0007) [2023-03-07 16:36:19,132][232226] Updated weights for policy 0, policy_version 17880 (0.0007) [2023-03-07 16:36:19,915][232226] Updated weights for policy 0, policy_version 17890 (0.0006) [2023-03-07 16:36:20,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12919.5, 300 sec: 12898.9). Total num frames: 18321408. Throughput: 0: 12902.7. Samples: 18298500. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:36:20,069][231894] Avg episode reward: [(0, '195.397')] [2023-03-07 16:36:20,713][232226] Updated weights for policy 0, policy_version 17900 (0.0007) [2023-03-07 16:36:21,515][232226] Updated weights for policy 0, policy_version 17910 (0.0006) [2023-03-07 16:36:22,274][232226] Updated weights for policy 0, policy_version 17920 (0.0007) [2023-03-07 16:36:23,078][232226] Updated weights for policy 0, policy_version 17930 (0.0006) [2023-03-07 16:36:23,865][232226] Updated weights for policy 0, policy_version 17940 (0.0006) [2023-03-07 16:36:24,660][232226] Updated weights for policy 0, policy_version 17950 (0.0007) [2023-03-07 16:36:25,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12919.4, 300 sec: 12898.9). Total num frames: 18385920. Throughput: 0: 12906.7. Samples: 18376156. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:36:25,069][231894] Avg episode reward: [(0, '191.386')] [2023-03-07 16:36:25,074][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000017955_18385920.pth... [2023-03-07 16:36:25,104][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000014932_15290368.pth [2023-03-07 16:36:25,422][232226] Updated weights for policy 0, policy_version 17960 (0.0007) [2023-03-07 16:36:26,218][232226] Updated weights for policy 0, policy_version 17970 (0.0006) [2023-03-07 16:36:27,037][232226] Updated weights for policy 0, policy_version 17980 (0.0007) [2023-03-07 16:36:27,823][232226] Updated weights for policy 0, policy_version 17990 (0.0007) [2023-03-07 16:36:28,630][232226] Updated weights for policy 0, policy_version 18000 (0.0006) [2023-03-07 16:36:29,427][232226] Updated weights for policy 0, policy_version 18010 (0.0006) [2023-03-07 16:36:30,069][231894] Fps is (10 sec: 12902.2, 60 sec: 12919.5, 300 sec: 12898.9). Total num frames: 18450432. Throughput: 0: 12906.8. Samples: 18415112. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:36:30,070][231894] Avg episode reward: [(0, '188.851')] [2023-03-07 16:36:30,179][232226] Updated weights for policy 0, policy_version 18020 (0.0006) [2023-03-07 16:36:31,000][232226] Updated weights for policy 0, policy_version 18030 (0.0006) [2023-03-07 16:36:31,791][232226] Updated weights for policy 0, policy_version 18040 (0.0006) [2023-03-07 16:36:32,575][232226] Updated weights for policy 0, policy_version 18050 (0.0006) [2023-03-07 16:36:33,365][232226] Updated weights for policy 0, policy_version 18060 (0.0006) [2023-03-07 16:36:34,170][232226] Updated weights for policy 0, policy_version 18070 (0.0005) [2023-03-07 16:36:34,947][232226] Updated weights for policy 0, policy_version 18080 (0.0006) [2023-03-07 16:36:35,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12919.5, 300 sec: 12895.5). Total num frames: 18514944. Throughput: 0: 12905.3. Samples: 18492678. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:36:35,069][231894] Avg episode reward: [(0, '199.721')] [2023-03-07 16:36:35,751][232226] Updated weights for policy 0, policy_version 18090 (0.0006) [2023-03-07 16:36:36,549][232226] Updated weights for policy 0, policy_version 18100 (0.0006) [2023-03-07 16:36:37,320][232226] Updated weights for policy 0, policy_version 18110 (0.0006) [2023-03-07 16:36:38,116][232226] Updated weights for policy 0, policy_version 18120 (0.0007) [2023-03-07 16:36:38,924][232226] Updated weights for policy 0, policy_version 18130 (0.0007) [2023-03-07 16:36:39,714][232226] Updated weights for policy 0, policy_version 18140 (0.0006) [2023-03-07 16:36:40,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12919.5, 300 sec: 12898.9). Total num frames: 18579456. Throughput: 0: 12909.0. Samples: 18570310. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:36:40,070][231894] Avg episode reward: [(0, '187.472')] [2023-03-07 16:36:40,500][232226] Updated weights for policy 0, policy_version 18150 (0.0007) [2023-03-07 16:36:41,306][232226] Updated weights for policy 0, policy_version 18160 (0.0006) [2023-03-07 16:36:42,111][232226] Updated weights for policy 0, policy_version 18170 (0.0006) [2023-03-07 16:36:42,891][232226] Updated weights for policy 0, policy_version 18180 (0.0006) [2023-03-07 16:36:43,692][232226] Updated weights for policy 0, policy_version 18190 (0.0006) [2023-03-07 16:36:44,478][232226] Updated weights for policy 0, policy_version 18200 (0.0006) [2023-03-07 16:36:45,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 18643968. Throughput: 0: 12901.6. Samples: 18608764. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:36:45,069][231894] Avg episode reward: [(0, '192.982')] [2023-03-07 16:36:45,262][232226] Updated weights for policy 0, policy_version 18210 (0.0006) [2023-03-07 16:36:46,053][232226] Updated weights for policy 0, policy_version 18220 (0.0006) [2023-03-07 16:36:46,854][232226] Updated weights for policy 0, policy_version 18230 (0.0007) [2023-03-07 16:36:47,645][232226] Updated weights for policy 0, policy_version 18240 (0.0007) [2023-03-07 16:36:48,454][232226] Updated weights for policy 0, policy_version 18250 (0.0006) [2023-03-07 16:36:49,246][232226] Updated weights for policy 0, policy_version 18260 (0.0006) [2023-03-07 16:36:50,062][232226] Updated weights for policy 0, policy_version 18270 (0.0007) [2023-03-07 16:36:50,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 18708480. Throughput: 0: 12903.7. Samples: 18686275. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:36:50,069][231894] Avg episode reward: [(0, '190.803')] [2023-03-07 16:36:50,866][232226] Updated weights for policy 0, policy_version 18280 (0.0006) [2023-03-07 16:36:51,637][232226] Updated weights for policy 0, policy_version 18290 (0.0006) [2023-03-07 16:36:52,422][232226] Updated weights for policy 0, policy_version 18300 (0.0006) [2023-03-07 16:36:53,241][232226] Updated weights for policy 0, policy_version 18310 (0.0007) [2023-03-07 16:36:54,049][232226] Updated weights for policy 0, policy_version 18320 (0.0006) [2023-03-07 16:36:54,825][232226] Updated weights for policy 0, policy_version 18330 (0.0006) [2023-03-07 16:36:55,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 18772992. Throughput: 0: 12896.3. Samples: 18763137. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:36:55,069][231894] Avg episode reward: [(0, '195.309')] [2023-03-07 16:36:55,623][232226] Updated weights for policy 0, policy_version 18340 (0.0005) [2023-03-07 16:36:56,410][232226] Updated weights for policy 0, policy_version 18350 (0.0006) [2023-03-07 16:36:57,205][232226] Updated weights for policy 0, policy_version 18360 (0.0007) [2023-03-07 16:36:57,998][232226] Updated weights for policy 0, policy_version 18370 (0.0006) [2023-03-07 16:36:58,794][232226] Updated weights for policy 0, policy_version 18380 (0.0006) [2023-03-07 16:36:59,585][232226] Updated weights for policy 0, policy_version 18390 (0.0006) [2023-03-07 16:37:00,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12895.5). Total num frames: 18837504. Throughput: 0: 12905.2. Samples: 18801919. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:37:00,069][231894] Avg episode reward: [(0, '188.965')] [2023-03-07 16:37:00,375][232226] Updated weights for policy 0, policy_version 18400 (0.0006) [2023-03-07 16:37:01,168][232226] Updated weights for policy 0, policy_version 18410 (0.0006) [2023-03-07 16:37:01,956][232226] Updated weights for policy 0, policy_version 18420 (0.0007) [2023-03-07 16:37:02,730][232226] Updated weights for policy 0, policy_version 18430 (0.0006) [2023-03-07 16:37:03,522][232226] Updated weights for policy 0, policy_version 18440 (0.0006) [2023-03-07 16:37:04,339][232226] Updated weights for policy 0, policy_version 18450 (0.0006) [2023-03-07 16:37:05,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 18902016. Throughput: 0: 12908.8. Samples: 18879398. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:37:05,069][231894] Avg episode reward: [(0, '193.065')] [2023-03-07 16:37:05,132][232226] Updated weights for policy 0, policy_version 18460 (0.0005) [2023-03-07 16:37:05,927][232226] Updated weights for policy 0, policy_version 18470 (0.0006) [2023-03-07 16:37:06,725][232226] Updated weights for policy 0, policy_version 18480 (0.0006) [2023-03-07 16:37:07,519][232226] Updated weights for policy 0, policy_version 18490 (0.0007) [2023-03-07 16:37:08,305][232226] Updated weights for policy 0, policy_version 18500 (0.0006) [2023-03-07 16:37:09,102][232226] Updated weights for policy 0, policy_version 18510 (0.0006) [2023-03-07 16:37:09,899][232226] Updated weights for policy 0, policy_version 18520 (0.0006) [2023-03-07 16:37:10,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 18966528. Throughput: 0: 12905.0. Samples: 18956879. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:37:10,069][231894] Avg episode reward: [(0, '196.945')] [2023-03-07 16:37:10,707][232226] Updated weights for policy 0, policy_version 18530 (0.0007) [2023-03-07 16:37:11,497][232226] Updated weights for policy 0, policy_version 18540 (0.0006) [2023-03-07 16:37:12,298][232226] Updated weights for policy 0, policy_version 18550 (0.0007) [2023-03-07 16:37:13,077][232226] Updated weights for policy 0, policy_version 18560 (0.0006) [2023-03-07 16:37:13,870][232226] Updated weights for policy 0, policy_version 18570 (0.0006) [2023-03-07 16:37:14,662][232226] Updated weights for policy 0, policy_version 18580 (0.0007) [2023-03-07 16:37:15,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 19031040. Throughput: 0: 12893.6. Samples: 18995324. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:37:15,069][231894] Avg episode reward: [(0, '194.408')] [2023-03-07 16:37:15,457][232226] Updated weights for policy 0, policy_version 18590 (0.0006) [2023-03-07 16:37:16,251][232226] Updated weights for policy 0, policy_version 18600 (0.0006) [2023-03-07 16:37:17,031][232226] Updated weights for policy 0, policy_version 18610 (0.0006) [2023-03-07 16:37:17,823][232226] Updated weights for policy 0, policy_version 18620 (0.0006) [2023-03-07 16:37:18,611][232226] Updated weights for policy 0, policy_version 18630 (0.0006) [2023-03-07 16:37:19,418][232226] Updated weights for policy 0, policy_version 18640 (0.0006) [2023-03-07 16:37:20,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 19095552. Throughput: 0: 12899.5. Samples: 19073152. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:37:20,069][231894] Avg episode reward: [(0, '192.302')] [2023-03-07 16:37:20,182][232226] Updated weights for policy 0, policy_version 18650 (0.0007) [2023-03-07 16:37:20,974][232226] Updated weights for policy 0, policy_version 18660 (0.0006) [2023-03-07 16:37:21,766][232226] Updated weights for policy 0, policy_version 18670 (0.0007) [2023-03-07 16:37:22,570][232226] Updated weights for policy 0, policy_version 18680 (0.0006) [2023-03-07 16:37:23,366][232226] Updated weights for policy 0, policy_version 18690 (0.0006) [2023-03-07 16:37:24,155][232226] Updated weights for policy 0, policy_version 18700 (0.0006) [2023-03-07 16:37:24,945][232226] Updated weights for policy 0, policy_version 18710 (0.0007) [2023-03-07 16:37:25,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 19160064. Throughput: 0: 12900.2. Samples: 19150820. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:37:25,069][231894] Avg episode reward: [(0, '186.753')] [2023-03-07 16:37:25,739][232226] Updated weights for policy 0, policy_version 18720 (0.0006) [2023-03-07 16:37:26,530][232226] Updated weights for policy 0, policy_version 18730 (0.0006) [2023-03-07 16:37:27,305][232226] Updated weights for policy 0, policy_version 18740 (0.0007) [2023-03-07 16:37:28,113][232226] Updated weights for policy 0, policy_version 18750 (0.0007) [2023-03-07 16:37:28,902][232226] Updated weights for policy 0, policy_version 18760 (0.0007) [2023-03-07 16:37:29,686][232226] Updated weights for policy 0, policy_version 18770 (0.0006) [2023-03-07 16:37:30,069][231894] Fps is (10 sec: 12902.2, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 19224576. Throughput: 0: 12910.7. Samples: 19189746. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:37:30,070][231894] Avg episode reward: [(0, '194.898')] [2023-03-07 16:37:30,493][232226] Updated weights for policy 0, policy_version 18780 (0.0005) [2023-03-07 16:37:31,266][232226] Updated weights for policy 0, policy_version 18790 (0.0007) [2023-03-07 16:37:32,067][232226] Updated weights for policy 0, policy_version 18800 (0.0006) [2023-03-07 16:37:32,846][232226] Updated weights for policy 0, policy_version 18810 (0.0007) [2023-03-07 16:37:33,637][232226] Updated weights for policy 0, policy_version 18820 (0.0007) [2023-03-07 16:37:34,430][232226] Updated weights for policy 0, policy_version 18830 (0.0006) [2023-03-07 16:37:35,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 19289088. Throughput: 0: 12910.8. Samples: 19267262. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 16:37:35,069][231894] Avg episode reward: [(0, '197.898')] [2023-03-07 16:37:35,243][232226] Updated weights for policy 0, policy_version 18840 (0.0006) [2023-03-07 16:37:36,026][232226] Updated weights for policy 0, policy_version 18850 (0.0007) [2023-03-07 16:37:36,814][232226] Updated weights for policy 0, policy_version 18860 (0.0006) [2023-03-07 16:37:37,617][232226] Updated weights for policy 0, policy_version 18870 (0.0007) [2023-03-07 16:37:38,408][232226] Updated weights for policy 0, policy_version 18880 (0.0006) [2023-03-07 16:37:39,183][232226] Updated weights for policy 0, policy_version 18890 (0.0006) [2023-03-07 16:37:39,991][232226] Updated weights for policy 0, policy_version 18900 (0.0006) [2023-03-07 16:37:40,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12902.4, 300 sec: 12895.5). Total num frames: 19353600. Throughput: 0: 12925.3. Samples: 19344775. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 16:37:40,069][231894] Avg episode reward: [(0, '193.123')] [2023-03-07 16:37:40,777][232226] Updated weights for policy 0, policy_version 18910 (0.0006) [2023-03-07 16:37:41,579][232226] Updated weights for policy 0, policy_version 18920 (0.0006) [2023-03-07 16:37:42,378][232226] Updated weights for policy 0, policy_version 18930 (0.0006) [2023-03-07 16:37:43,183][232226] Updated weights for policy 0, policy_version 18940 (0.0006) [2023-03-07 16:37:43,974][232226] Updated weights for policy 0, policy_version 18950 (0.0006) [2023-03-07 16:37:44,788][232226] Updated weights for policy 0, policy_version 18960 (0.0007) [2023-03-07 16:37:45,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 19418112. Throughput: 0: 12919.7. Samples: 19383309. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 16:37:45,070][231894] Avg episode reward: [(0, '196.602')] [2023-03-07 16:37:45,577][232226] Updated weights for policy 0, policy_version 18970 (0.0006) [2023-03-07 16:37:46,362][232226] Updated weights for policy 0, policy_version 18980 (0.0005) [2023-03-07 16:37:47,162][232226] Updated weights for policy 0, policy_version 18990 (0.0006) [2023-03-07 16:37:47,952][232226] Updated weights for policy 0, policy_version 19000 (0.0006) [2023-03-07 16:37:48,739][232226] Updated weights for policy 0, policy_version 19010 (0.0007) [2023-03-07 16:37:49,537][232226] Updated weights for policy 0, policy_version 19020 (0.0007) [2023-03-07 16:37:50,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12895.5). Total num frames: 19482624. Throughput: 0: 12915.1. Samples: 19460577. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:37:50,069][231894] Avg episode reward: [(0, '195.268')] [2023-03-07 16:37:50,329][232226] Updated weights for policy 0, policy_version 19030 (0.0006) [2023-03-07 16:37:51,106][232226] Updated weights for policy 0, policy_version 19040 (0.0006) [2023-03-07 16:37:51,890][232226] Updated weights for policy 0, policy_version 19050 (0.0006) [2023-03-07 16:37:52,694][232226] Updated weights for policy 0, policy_version 19060 (0.0007) [2023-03-07 16:37:53,498][232226] Updated weights for policy 0, policy_version 19070 (0.0006) [2023-03-07 16:37:54,285][232226] Updated weights for policy 0, policy_version 19080 (0.0006) [2023-03-07 16:37:55,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12895.5). Total num frames: 19547136. Throughput: 0: 12920.4. Samples: 19538296. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:37:55,072][232226] Updated weights for policy 0, policy_version 19090 (0.0006) [2023-03-07 16:37:55,072][231894] Avg episode reward: [(0, '192.844')] [2023-03-07 16:37:55,883][232226] Updated weights for policy 0, policy_version 19100 (0.0006) [2023-03-07 16:37:56,669][232226] Updated weights for policy 0, policy_version 19110 (0.0006) [2023-03-07 16:37:57,467][232226] Updated weights for policy 0, policy_version 19120 (0.0005) [2023-03-07 16:37:58,261][232226] Updated weights for policy 0, policy_version 19130 (0.0005) [2023-03-07 16:37:59,045][232226] Updated weights for policy 0, policy_version 19140 (0.0007) [2023-03-07 16:37:59,834][232226] Updated weights for policy 0, policy_version 19150 (0.0006) [2023-03-07 16:38:00,069][231894] Fps is (10 sec: 13004.8, 60 sec: 12919.5, 300 sec: 12898.9). Total num frames: 19612672. Throughput: 0: 12922.3. Samples: 19576826. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:38:00,069][231894] Avg episode reward: [(0, '190.936')] [2023-03-07 16:38:00,625][232226] Updated weights for policy 0, policy_version 19160 (0.0006) [2023-03-07 16:38:01,411][232226] Updated weights for policy 0, policy_version 19170 (0.0006) [2023-03-07 16:38:02,205][232226] Updated weights for policy 0, policy_version 19180 (0.0006) [2023-03-07 16:38:02,998][232226] Updated weights for policy 0, policy_version 19190 (0.0006) [2023-03-07 16:38:03,781][232226] Updated weights for policy 0, policy_version 19200 (0.0007) [2023-03-07 16:38:04,585][232226] Updated weights for policy 0, policy_version 19210 (0.0006) [2023-03-07 16:38:05,069][231894] Fps is (10 sec: 13004.8, 60 sec: 12919.5, 300 sec: 12898.9). Total num frames: 19677184. Throughput: 0: 12922.7. Samples: 19654673. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:38:05,069][231894] Avg episode reward: [(0, '193.933')] [2023-03-07 16:38:05,391][232226] Updated weights for policy 0, policy_version 19220 (0.0006) [2023-03-07 16:38:06,172][232226] Updated weights for policy 0, policy_version 19230 (0.0006) [2023-03-07 16:38:06,967][232226] Updated weights for policy 0, policy_version 19240 (0.0006) [2023-03-07 16:38:07,741][232226] Updated weights for policy 0, policy_version 19250 (0.0005) [2023-03-07 16:38:08,536][232226] Updated weights for policy 0, policy_version 19260 (0.0007) [2023-03-07 16:38:09,336][232226] Updated weights for policy 0, policy_version 19270 (0.0006) [2023-03-07 16:38:10,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12919.5, 300 sec: 12898.9). Total num frames: 19741696. Throughput: 0: 12917.7. Samples: 19732116. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:38:10,069][231894] Avg episode reward: [(0, '194.979')] [2023-03-07 16:38:10,127][232226] Updated weights for policy 0, policy_version 19280 (0.0006) [2023-03-07 16:38:10,941][232226] Updated weights for policy 0, policy_version 19290 (0.0006) [2023-03-07 16:38:11,733][232226] Updated weights for policy 0, policy_version 19300 (0.0006) [2023-03-07 16:38:12,510][232226] Updated weights for policy 0, policy_version 19310 (0.0006) [2023-03-07 16:38:13,304][232226] Updated weights for policy 0, policy_version 19320 (0.0007) [2023-03-07 16:38:14,095][232226] Updated weights for policy 0, policy_version 19330 (0.0006) [2023-03-07 16:38:14,879][232226] Updated weights for policy 0, policy_version 19340 (0.0006) [2023-03-07 16:38:15,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12919.5, 300 sec: 12898.9). Total num frames: 19806208. Throughput: 0: 12911.7. Samples: 19770774. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:38:15,069][231894] Avg episode reward: [(0, '193.659')] [2023-03-07 16:38:15,686][232226] Updated weights for policy 0, policy_version 19350 (0.0006) [2023-03-07 16:38:16,475][232226] Updated weights for policy 0, policy_version 19360 (0.0006) [2023-03-07 16:38:17,266][232226] Updated weights for policy 0, policy_version 19370 (0.0006) [2023-03-07 16:38:18,054][232226] Updated weights for policy 0, policy_version 19380 (0.0006) [2023-03-07 16:38:18,846][232226] Updated weights for policy 0, policy_version 19390 (0.0006) [2023-03-07 16:38:19,657][232226] Updated weights for policy 0, policy_version 19400 (0.0006) [2023-03-07 16:38:20,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12919.5, 300 sec: 12902.4). Total num frames: 19870720. Throughput: 0: 12914.7. Samples: 19848424. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:38:20,069][231894] Avg episode reward: [(0, '182.582')] [2023-03-07 16:38:20,438][232226] Updated weights for policy 0, policy_version 19410 (0.0007) [2023-03-07 16:38:21,217][232226] Updated weights for policy 0, policy_version 19420 (0.0006) [2023-03-07 16:38:22,029][232226] Updated weights for policy 0, policy_version 19430 (0.0006) [2023-03-07 16:38:22,810][232226] Updated weights for policy 0, policy_version 19440 (0.0006) [2023-03-07 16:38:23,604][232226] Updated weights for policy 0, policy_version 19450 (0.0006) [2023-03-07 16:38:24,421][232226] Updated weights for policy 0, policy_version 19460 (0.0006) [2023-03-07 16:38:25,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12919.5, 300 sec: 12902.4). Total num frames: 19935232. Throughput: 0: 12907.6. Samples: 19925619. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:38:25,069][231894] Avg episode reward: [(0, '192.869')] [2023-03-07 16:38:25,073][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000019468_19935232.pth... [2023-03-07 16:38:25,102][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000016444_16838656.pth [2023-03-07 16:38:25,217][232226] Updated weights for policy 0, policy_version 19470 (0.0006) [2023-03-07 16:38:26,021][232226] Updated weights for policy 0, policy_version 19480 (0.0006) [2023-03-07 16:38:26,832][232226] Updated weights for policy 0, policy_version 19490 (0.0007) [2023-03-07 16:38:27,621][232226] Updated weights for policy 0, policy_version 19500 (0.0006) [2023-03-07 16:38:28,428][232226] Updated weights for policy 0, policy_version 19510 (0.0006) [2023-03-07 16:38:29,218][232226] Updated weights for policy 0, policy_version 19520 (0.0007) [2023-03-07 16:38:30,021][232226] Updated weights for policy 0, policy_version 19530 (0.0006) [2023-03-07 16:38:30,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 19998720. Throughput: 0: 12905.3. Samples: 19964047. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:38:30,070][231894] Avg episode reward: [(0, '200.893')] [2023-03-07 16:38:30,809][232226] Updated weights for policy 0, policy_version 19540 (0.0006) [2023-03-07 16:38:31,611][232226] Updated weights for policy 0, policy_version 19550 (0.0006) [2023-03-07 16:38:32,393][232226] Updated weights for policy 0, policy_version 19560 (0.0007) [2023-03-07 16:38:33,169][232226] Updated weights for policy 0, policy_version 19570 (0.0006) [2023-03-07 16:38:33,980][232226] Updated weights for policy 0, policy_version 19580 (0.0006) [2023-03-07 16:38:34,771][232226] Updated weights for policy 0, policy_version 19590 (0.0006) [2023-03-07 16:38:35,069][231894] Fps is (10 sec: 12800.2, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 20063232. Throughput: 0: 12904.6. Samples: 20041285. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:38:35,069][231894] Avg episode reward: [(0, '195.911')] [2023-03-07 16:38:35,569][232226] Updated weights for policy 0, policy_version 19600 (0.0006) [2023-03-07 16:38:36,362][232226] Updated weights for policy 0, policy_version 19610 (0.0006) [2023-03-07 16:38:37,161][232226] Updated weights for policy 0, policy_version 19620 (0.0006) [2023-03-07 16:38:37,953][232226] Updated weights for policy 0, policy_version 19630 (0.0006) [2023-03-07 16:38:38,746][232226] Updated weights for policy 0, policy_version 19640 (0.0006) [2023-03-07 16:38:39,542][232226] Updated weights for policy 0, policy_version 19650 (0.0006) [2023-03-07 16:38:40,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 20127744. Throughput: 0: 12896.1. Samples: 20118618. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:38:40,069][231894] Avg episode reward: [(0, '190.299')] [2023-03-07 16:38:40,333][232226] Updated weights for policy 0, policy_version 19660 (0.0007) [2023-03-07 16:38:41,118][232226] Updated weights for policy 0, policy_version 19670 (0.0006) [2023-03-07 16:38:41,914][232226] Updated weights for policy 0, policy_version 19680 (0.0006) [2023-03-07 16:38:42,711][232226] Updated weights for policy 0, policy_version 19690 (0.0006) [2023-03-07 16:38:43,509][232226] Updated weights for policy 0, policy_version 19700 (0.0006) [2023-03-07 16:38:44,289][232226] Updated weights for policy 0, policy_version 19710 (0.0006) [2023-03-07 16:38:45,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 20192256. Throughput: 0: 12900.4. Samples: 20157346. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:38:45,069][231894] Avg episode reward: [(0, '179.934')] [2023-03-07 16:38:45,098][232226] Updated weights for policy 0, policy_version 19720 (0.0005) [2023-03-07 16:38:45,870][232226] Updated weights for policy 0, policy_version 19730 (0.0006) [2023-03-07 16:38:46,679][232226] Updated weights for policy 0, policy_version 19740 (0.0006) [2023-03-07 16:38:47,467][232226] Updated weights for policy 0, policy_version 19750 (0.0006) [2023-03-07 16:38:48,261][232226] Updated weights for policy 0, policy_version 19760 (0.0007) [2023-03-07 16:38:49,047][232226] Updated weights for policy 0, policy_version 19770 (0.0006) [2023-03-07 16:38:49,831][232226] Updated weights for policy 0, policy_version 19780 (0.0006) [2023-03-07 16:38:50,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 20256768. Throughput: 0: 12892.7. Samples: 20234845. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:38:50,069][231894] Avg episode reward: [(0, '192.368')] [2023-03-07 16:38:50,622][232226] Updated weights for policy 0, policy_version 19790 (0.0005) [2023-03-07 16:38:51,401][232226] Updated weights for policy 0, policy_version 19800 (0.0006) [2023-03-07 16:38:52,210][232226] Updated weights for policy 0, policy_version 19810 (0.0007) [2023-03-07 16:38:52,989][232226] Updated weights for policy 0, policy_version 19820 (0.0007) [2023-03-07 16:38:53,770][232226] Updated weights for policy 0, policy_version 19830 (0.0006) [2023-03-07 16:38:54,574][232226] Updated weights for policy 0, policy_version 19840 (0.0007) [2023-03-07 16:38:55,069][231894] Fps is (10 sec: 13004.8, 60 sec: 12919.5, 300 sec: 12902.4). Total num frames: 20322304. Throughput: 0: 12904.7. Samples: 20312828. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:38:55,069][231894] Avg episode reward: [(0, '188.550')] [2023-03-07 16:38:55,370][232226] Updated weights for policy 0, policy_version 19850 (0.0006) [2023-03-07 16:38:56,162][232226] Updated weights for policy 0, policy_version 19860 (0.0006) [2023-03-07 16:38:56,966][232226] Updated weights for policy 0, policy_version 19870 (0.0007) [2023-03-07 16:38:57,750][232226] Updated weights for policy 0, policy_version 19880 (0.0007) [2023-03-07 16:38:58,536][232226] Updated weights for policy 0, policy_version 19890 (0.0007) [2023-03-07 16:38:59,332][232226] Updated weights for policy 0, policy_version 19900 (0.0006) [2023-03-07 16:39:00,069][231894] Fps is (10 sec: 13004.6, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 20386816. Throughput: 0: 12903.7. Samples: 20351443. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:39:00,070][231894] Avg episode reward: [(0, '195.058')] [2023-03-07 16:39:00,116][232226] Updated weights for policy 0, policy_version 19910 (0.0006) [2023-03-07 16:39:00,936][232226] Updated weights for policy 0, policy_version 19920 (0.0006) [2023-03-07 16:39:01,722][232226] Updated weights for policy 0, policy_version 19930 (0.0006) [2023-03-07 16:39:02,518][232226] Updated weights for policy 0, policy_version 19940 (0.0007) [2023-03-07 16:39:03,325][232226] Updated weights for policy 0, policy_version 19950 (0.0006) [2023-03-07 16:39:04,115][232226] Updated weights for policy 0, policy_version 19960 (0.0006) [2023-03-07 16:39:04,911][232226] Updated weights for policy 0, policy_version 19970 (0.0006) [2023-03-07 16:39:05,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 20451328. Throughput: 0: 12898.1. Samples: 20428839. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:39:05,069][231894] Avg episode reward: [(0, '192.797')] [2023-03-07 16:39:05,698][232226] Updated weights for policy 0, policy_version 19980 (0.0006) [2023-03-07 16:39:06,480][232226] Updated weights for policy 0, policy_version 19990 (0.0007) [2023-03-07 16:39:07,283][232226] Updated weights for policy 0, policy_version 20000 (0.0007) [2023-03-07 16:39:08,075][232226] Updated weights for policy 0, policy_version 20010 (0.0006) [2023-03-07 16:39:08,867][232226] Updated weights for policy 0, policy_version 20020 (0.0006) [2023-03-07 16:39:09,637][232226] Updated weights for policy 0, policy_version 20030 (0.0006) [2023-03-07 16:39:10,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 20515840. Throughput: 0: 12901.4. Samples: 20506182. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:39:10,069][231894] Avg episode reward: [(0, '190.069')] [2023-03-07 16:39:10,443][232226] Updated weights for policy 0, policy_version 20040 (0.0006) [2023-03-07 16:39:11,234][232226] Updated weights for policy 0, policy_version 20050 (0.0006) [2023-03-07 16:39:12,021][232226] Updated weights for policy 0, policy_version 20060 (0.0007) [2023-03-07 16:39:12,809][232226] Updated weights for policy 0, policy_version 20070 (0.0006) [2023-03-07 16:39:13,583][232226] Updated weights for policy 0, policy_version 20080 (0.0006) [2023-03-07 16:39:14,390][232226] Updated weights for policy 0, policy_version 20090 (0.0006) [2023-03-07 16:39:15,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 20580352. Throughput: 0: 12917.0. Samples: 20545311. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:39:15,069][231894] Avg episode reward: [(0, '190.446')] [2023-03-07 16:39:15,179][232226] Updated weights for policy 0, policy_version 20100 (0.0007) [2023-03-07 16:39:15,957][232226] Updated weights for policy 0, policy_version 20110 (0.0007) [2023-03-07 16:39:16,763][232226] Updated weights for policy 0, policy_version 20120 (0.0007) [2023-03-07 16:39:17,565][232226] Updated weights for policy 0, policy_version 20130 (0.0006) [2023-03-07 16:39:18,348][232226] Updated weights for policy 0, policy_version 20140 (0.0006) [2023-03-07 16:39:19,130][232226] Updated weights for policy 0, policy_version 20150 (0.0007) [2023-03-07 16:39:19,937][232226] Updated weights for policy 0, policy_version 20160 (0.0007) [2023-03-07 16:39:20,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 20644864. Throughput: 0: 12923.8. Samples: 20622858. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:39:20,070][231894] Avg episode reward: [(0, '189.199')] [2023-03-07 16:39:20,736][232226] Updated weights for policy 0, policy_version 20170 (0.0006) [2023-03-07 16:39:21,537][232226] Updated weights for policy 0, policy_version 20180 (0.0006) [2023-03-07 16:39:22,329][232226] Updated weights for policy 0, policy_version 20190 (0.0007) [2023-03-07 16:39:23,121][232226] Updated weights for policy 0, policy_version 20200 (0.0005) [2023-03-07 16:39:23,912][232226] Updated weights for policy 0, policy_version 20210 (0.0006) [2023-03-07 16:39:24,702][232226] Updated weights for policy 0, policy_version 20220 (0.0006) [2023-03-07 16:39:25,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 20709376. Throughput: 0: 12921.6. Samples: 20700090. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:39:25,069][231894] Avg episode reward: [(0, '188.379')] [2023-03-07 16:39:25,495][232226] Updated weights for policy 0, policy_version 20230 (0.0007) [2023-03-07 16:39:26,277][232226] Updated weights for policy 0, policy_version 20240 (0.0007) [2023-03-07 16:39:27,070][232226] Updated weights for policy 0, policy_version 20250 (0.0007) [2023-03-07 16:39:27,873][232226] Updated weights for policy 0, policy_version 20260 (0.0006) [2023-03-07 16:39:28,650][232226] Updated weights for policy 0, policy_version 20270 (0.0006) [2023-03-07 16:39:29,441][232226] Updated weights for policy 0, policy_version 20280 (0.0006) [2023-03-07 16:39:30,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12919.5, 300 sec: 12902.4). Total num frames: 20773888. Throughput: 0: 12925.3. Samples: 20738983. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:39:30,069][231894] Avg episode reward: [(0, '194.435')] [2023-03-07 16:39:30,238][232226] Updated weights for policy 0, policy_version 20290 (0.0006) [2023-03-07 16:39:31,018][232226] Updated weights for policy 0, policy_version 20300 (0.0006) [2023-03-07 16:39:31,819][232226] Updated weights for policy 0, policy_version 20310 (0.0007) [2023-03-07 16:39:32,609][232226] Updated weights for policy 0, policy_version 20320 (0.0006) [2023-03-07 16:39:33,394][232226] Updated weights for policy 0, policy_version 20330 (0.0006) [2023-03-07 16:39:34,185][232226] Updated weights for policy 0, policy_version 20340 (0.0006) [2023-03-07 16:39:34,993][232226] Updated weights for policy 0, policy_version 20350 (0.0006) [2023-03-07 16:39:35,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12919.4, 300 sec: 12905.9). Total num frames: 20838400. Throughput: 0: 12930.4. Samples: 20816716. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:39:35,069][231894] Avg episode reward: [(0, '200.335')] [2023-03-07 16:39:35,789][232226] Updated weights for policy 0, policy_version 20360 (0.0006) [2023-03-07 16:39:36,582][232226] Updated weights for policy 0, policy_version 20370 (0.0007) [2023-03-07 16:39:37,368][232226] Updated weights for policy 0, policy_version 20380 (0.0006) [2023-03-07 16:39:38,164][232226] Updated weights for policy 0, policy_version 20390 (0.0005) [2023-03-07 16:39:38,950][232226] Updated weights for policy 0, policy_version 20400 (0.0005) [2023-03-07 16:39:39,765][232226] Updated weights for policy 0, policy_version 20410 (0.0006) [2023-03-07 16:39:40,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12919.5, 300 sec: 12902.4). Total num frames: 20902912. Throughput: 0: 12918.4. Samples: 20894158. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:39:40,069][231894] Avg episode reward: [(0, '198.013')] [2023-03-07 16:39:40,548][232226] Updated weights for policy 0, policy_version 20420 (0.0007) [2023-03-07 16:39:41,350][232226] Updated weights for policy 0, policy_version 20430 (0.0006) [2023-03-07 16:39:42,145][232226] Updated weights for policy 0, policy_version 20440 (0.0006) [2023-03-07 16:39:42,968][232226] Updated weights for policy 0, policy_version 20450 (0.0007) [2023-03-07 16:39:43,756][232226] Updated weights for policy 0, policy_version 20460 (0.0006) [2023-03-07 16:39:44,549][232226] Updated weights for policy 0, policy_version 20470 (0.0006) [2023-03-07 16:39:45,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12919.5, 300 sec: 12905.9). Total num frames: 20967424. Throughput: 0: 12912.0. Samples: 20932479. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:39:45,069][231894] Avg episode reward: [(0, '192.230')] [2023-03-07 16:39:45,353][232226] Updated weights for policy 0, policy_version 20480 (0.0007) [2023-03-07 16:39:46,134][232226] Updated weights for policy 0, policy_version 20490 (0.0006) [2023-03-07 16:39:46,934][232226] Updated weights for policy 0, policy_version 20500 (0.0007) [2023-03-07 16:39:47,715][232226] Updated weights for policy 0, policy_version 20510 (0.0006) [2023-03-07 16:39:48,519][232226] Updated weights for policy 0, policy_version 20520 (0.0006) [2023-03-07 16:39:49,304][232226] Updated weights for policy 0, policy_version 20530 (0.0006) [2023-03-07 16:39:50,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12919.5, 300 sec: 12905.9). Total num frames: 21031936. Throughput: 0: 12911.8. Samples: 21009871. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 16:39:50,069][231894] Avg episode reward: [(0, '191.266')] [2023-03-07 16:39:50,087][232226] Updated weights for policy 0, policy_version 20540 (0.0007) [2023-03-07 16:39:50,895][232226] Updated weights for policy 0, policy_version 20550 (0.0006) [2023-03-07 16:39:51,692][232226] Updated weights for policy 0, policy_version 20560 (0.0006) [2023-03-07 16:39:52,464][232226] Updated weights for policy 0, policy_version 20570 (0.0005) [2023-03-07 16:39:53,252][232226] Updated weights for policy 0, policy_version 20580 (0.0006) [2023-03-07 16:39:54,089][232226] Updated weights for policy 0, policy_version 20590 (0.0006) [2023-03-07 16:39:54,869][232226] Updated weights for policy 0, policy_version 20600 (0.0006) [2023-03-07 16:39:55,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12905.9). Total num frames: 21096448. Throughput: 0: 12916.9. Samples: 21087444. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 16:39:55,069][231894] Avg episode reward: [(0, '194.047')] [2023-03-07 16:39:55,666][232226] Updated weights for policy 0, policy_version 20610 (0.0007) [2023-03-07 16:39:56,457][232226] Updated weights for policy 0, policy_version 20620 (0.0007) [2023-03-07 16:39:57,254][232226] Updated weights for policy 0, policy_version 20630 (0.0007) [2023-03-07 16:39:58,038][232226] Updated weights for policy 0, policy_version 20640 (0.0006) [2023-03-07 16:39:58,832][232226] Updated weights for policy 0, policy_version 20650 (0.0007) [2023-03-07 16:39:59,629][232226] Updated weights for policy 0, policy_version 20660 (0.0006) [2023-03-07 16:40:00,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12905.9). Total num frames: 21160960. Throughput: 0: 12903.3. Samples: 21125961. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 16:40:00,069][231894] Avg episode reward: [(0, '192.688')] [2023-03-07 16:40:00,445][232226] Updated weights for policy 0, policy_version 20670 (0.0006) [2023-03-07 16:40:01,242][232226] Updated weights for policy 0, policy_version 20680 (0.0007) [2023-03-07 16:40:02,041][232226] Updated weights for policy 0, policy_version 20690 (0.0006) [2023-03-07 16:40:02,834][232226] Updated weights for policy 0, policy_version 20700 (0.0007) [2023-03-07 16:40:03,630][232226] Updated weights for policy 0, policy_version 20710 (0.0006) [2023-03-07 16:40:04,428][232226] Updated weights for policy 0, policy_version 20720 (0.0007) [2023-03-07 16:40:05,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12905.9). Total num frames: 21225472. Throughput: 0: 12891.4. Samples: 21202968. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:40:05,069][231894] Avg episode reward: [(0, '191.841')] [2023-03-07 16:40:05,200][232226] Updated weights for policy 0, policy_version 20730 (0.0006) [2023-03-07 16:40:06,001][232226] Updated weights for policy 0, policy_version 20740 (0.0006) [2023-03-07 16:40:06,803][232226] Updated weights for policy 0, policy_version 20750 (0.0007) [2023-03-07 16:40:07,593][232226] Updated weights for policy 0, policy_version 20760 (0.0006) [2023-03-07 16:40:08,376][232226] Updated weights for policy 0, policy_version 20770 (0.0006) [2023-03-07 16:40:09,196][232226] Updated weights for policy 0, policy_version 20780 (0.0007) [2023-03-07 16:40:10,006][232226] Updated weights for policy 0, policy_version 20790 (0.0007) [2023-03-07 16:40:10,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12905.9). Total num frames: 21289984. Throughput: 0: 12891.3. Samples: 21280200. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:40:10,069][231894] Avg episode reward: [(0, '191.798')] [2023-03-07 16:40:10,788][232226] Updated weights for policy 0, policy_version 20800 (0.0007) [2023-03-07 16:40:11,574][232226] Updated weights for policy 0, policy_version 20810 (0.0006) [2023-03-07 16:40:12,385][232226] Updated weights for policy 0, policy_version 20820 (0.0007) [2023-03-07 16:40:13,154][232226] Updated weights for policy 0, policy_version 20830 (0.0006) [2023-03-07 16:40:13,942][232226] Updated weights for policy 0, policy_version 20840 (0.0006) [2023-03-07 16:40:14,741][232226] Updated weights for policy 0, policy_version 20850 (0.0007) [2023-03-07 16:40:15,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12885.3, 300 sec: 12905.9). Total num frames: 21353472. Throughput: 0: 12885.7. Samples: 21318837. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:40:15,069][231894] Avg episode reward: [(0, '195.131')] [2023-03-07 16:40:15,541][232226] Updated weights for policy 0, policy_version 20860 (0.0006) [2023-03-07 16:40:16,343][232226] Updated weights for policy 0, policy_version 20870 (0.0006) [2023-03-07 16:40:17,134][232226] Updated weights for policy 0, policy_version 20880 (0.0006) [2023-03-07 16:40:17,936][232226] Updated weights for policy 0, policy_version 20890 (0.0006) [2023-03-07 16:40:18,716][232226] Updated weights for policy 0, policy_version 20900 (0.0007) [2023-03-07 16:40:19,514][232226] Updated weights for policy 0, policy_version 20910 (0.0006) [2023-03-07 16:40:20,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12885.3, 300 sec: 12905.9). Total num frames: 21417984. Throughput: 0: 12877.8. Samples: 21396219. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:40:20,069][231894] Avg episode reward: [(0, '196.532')] [2023-03-07 16:40:20,306][232226] Updated weights for policy 0, policy_version 20920 (0.0007) [2023-03-07 16:40:21,085][232226] Updated weights for policy 0, policy_version 20930 (0.0006) [2023-03-07 16:40:21,897][232226] Updated weights for policy 0, policy_version 20940 (0.0006) [2023-03-07 16:40:22,689][232226] Updated weights for policy 0, policy_version 20950 (0.0006) [2023-03-07 16:40:23,487][232226] Updated weights for policy 0, policy_version 20960 (0.0006) [2023-03-07 16:40:24,263][232226] Updated weights for policy 0, policy_version 20970 (0.0006) [2023-03-07 16:40:25,060][232226] Updated weights for policy 0, policy_version 20980 (0.0006) [2023-03-07 16:40:25,069][231894] Fps is (10 sec: 13004.6, 60 sec: 12902.4, 300 sec: 12909.3). Total num frames: 21483520. Throughput: 0: 12883.6. Samples: 21473921. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:40:25,070][231894] Avg episode reward: [(0, '192.061')] [2023-03-07 16:40:25,073][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000020980_21483520.pth... [2023-03-07 16:40:25,103][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000017955_18385920.pth [2023-03-07 16:40:25,830][232226] Updated weights for policy 0, policy_version 20990 (0.0006) [2023-03-07 16:40:26,643][232226] Updated weights for policy 0, policy_version 21000 (0.0006) [2023-03-07 16:40:27,439][232226] Updated weights for policy 0, policy_version 21010 (0.0006) [2023-03-07 16:40:28,232][232226] Updated weights for policy 0, policy_version 21020 (0.0006) [2023-03-07 16:40:29,029][232226] Updated weights for policy 0, policy_version 21030 (0.0006) [2023-03-07 16:40:29,805][232226] Updated weights for policy 0, policy_version 21040 (0.0006) [2023-03-07 16:40:30,069][231894] Fps is (10 sec: 13004.9, 60 sec: 12902.4, 300 sec: 12909.3). Total num frames: 21548032. Throughput: 0: 12892.3. Samples: 21512632. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:40:30,069][231894] Avg episode reward: [(0, '185.245')] [2023-03-07 16:40:30,610][232226] Updated weights for policy 0, policy_version 21050 (0.0006) [2023-03-07 16:40:31,402][232226] Updated weights for policy 0, policy_version 21060 (0.0006) [2023-03-07 16:40:32,177][232226] Updated weights for policy 0, policy_version 21070 (0.0006) [2023-03-07 16:40:32,984][232226] Updated weights for policy 0, policy_version 21080 (0.0007) [2023-03-07 16:40:33,769][232226] Updated weights for policy 0, policy_version 21090 (0.0006) [2023-03-07 16:40:34,573][232226] Updated weights for policy 0, policy_version 21100 (0.0006) [2023-03-07 16:40:35,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12902.4, 300 sec: 12909.3). Total num frames: 21612544. Throughput: 0: 12897.9. Samples: 21590278. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:40:35,069][231894] Avg episode reward: [(0, '185.570')] [2023-03-07 16:40:35,362][232226] Updated weights for policy 0, policy_version 21110 (0.0006) [2023-03-07 16:40:36,139][232226] Updated weights for policy 0, policy_version 21120 (0.0006) [2023-03-07 16:40:36,954][232226] Updated weights for policy 0, policy_version 21130 (0.0006) [2023-03-07 16:40:37,745][232226] Updated weights for policy 0, policy_version 21140 (0.0006) [2023-03-07 16:40:38,529][232226] Updated weights for policy 0, policy_version 21150 (0.0007) [2023-03-07 16:40:39,323][232226] Updated weights for policy 0, policy_version 21160 (0.0007) [2023-03-07 16:40:40,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12905.9). Total num frames: 21677056. Throughput: 0: 12892.3. Samples: 21667598. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:40:40,069][231894] Avg episode reward: [(0, '189.427')] [2023-03-07 16:40:40,106][232226] Updated weights for policy 0, policy_version 21170 (0.0006) [2023-03-07 16:40:40,906][232226] Updated weights for policy 0, policy_version 21180 (0.0006) [2023-03-07 16:40:41,701][232226] Updated weights for policy 0, policy_version 21190 (0.0007) [2023-03-07 16:40:42,498][232226] Updated weights for policy 0, policy_version 21200 (0.0006) [2023-03-07 16:40:43,308][232226] Updated weights for policy 0, policy_version 21210 (0.0007) [2023-03-07 16:40:44,090][232226] Updated weights for policy 0, policy_version 21220 (0.0006) [2023-03-07 16:40:44,908][232226] Updated weights for policy 0, policy_version 21230 (0.0007) [2023-03-07 16:40:45,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12885.3, 300 sec: 12902.4). Total num frames: 21740544. Throughput: 0: 12898.1. Samples: 21706378. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:40:45,069][231894] Avg episode reward: [(0, '184.036')] [2023-03-07 16:40:45,694][232226] Updated weights for policy 0, policy_version 21240 (0.0006) [2023-03-07 16:40:46,478][232226] Updated weights for policy 0, policy_version 21250 (0.0006) [2023-03-07 16:40:47,281][232226] Updated weights for policy 0, policy_version 21260 (0.0006) [2023-03-07 16:40:48,072][232226] Updated weights for policy 0, policy_version 21270 (0.0006) [2023-03-07 16:40:48,853][232226] Updated weights for policy 0, policy_version 21280 (0.0006) [2023-03-07 16:40:49,654][232226] Updated weights for policy 0, policy_version 21290 (0.0007) [2023-03-07 16:40:50,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12905.9). Total num frames: 21806080. Throughput: 0: 12903.5. Samples: 21783625. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:40:50,069][231894] Avg episode reward: [(0, '194.629')] [2023-03-07 16:40:50,445][232226] Updated weights for policy 0, policy_version 21300 (0.0006) [2023-03-07 16:40:51,258][232226] Updated weights for policy 0, policy_version 21310 (0.0007) [2023-03-07 16:40:52,033][232226] Updated weights for policy 0, policy_version 21320 (0.0006) [2023-03-07 16:40:52,845][232226] Updated weights for policy 0, policy_version 21330 (0.0007) [2023-03-07 16:40:53,626][232226] Updated weights for policy 0, policy_version 21340 (0.0006) [2023-03-07 16:40:54,421][232226] Updated weights for policy 0, policy_version 21350 (0.0006) [2023-03-07 16:40:55,069][231894] Fps is (10 sec: 13004.7, 60 sec: 12902.4, 300 sec: 12905.9). Total num frames: 21870592. Throughput: 0: 12905.5. Samples: 21860947. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:40:55,069][231894] Avg episode reward: [(0, '191.314')] [2023-03-07 16:40:55,211][232226] Updated weights for policy 0, policy_version 21360 (0.0006) [2023-03-07 16:40:55,996][232226] Updated weights for policy 0, policy_version 21370 (0.0006) [2023-03-07 16:40:56,803][232226] Updated weights for policy 0, policy_version 21380 (0.0007) [2023-03-07 16:40:57,591][232226] Updated weights for policy 0, policy_version 21390 (0.0006) [2023-03-07 16:40:58,384][232226] Updated weights for policy 0, policy_version 21400 (0.0007) [2023-03-07 16:40:59,205][232226] Updated weights for policy 0, policy_version 21410 (0.0007) [2023-03-07 16:41:00,014][232226] Updated weights for policy 0, policy_version 21420 (0.0006) [2023-03-07 16:41:00,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12885.3, 300 sec: 12902.4). Total num frames: 21934080. Throughput: 0: 12907.3. Samples: 21899665. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:41:00,070][231894] Avg episode reward: [(0, '189.760')] [2023-03-07 16:41:00,792][232226] Updated weights for policy 0, policy_version 21430 (0.0006) [2023-03-07 16:41:01,578][232226] Updated weights for policy 0, policy_version 21440 (0.0006) [2023-03-07 16:41:02,381][232226] Updated weights for policy 0, policy_version 21450 (0.0007) [2023-03-07 16:41:03,163][232226] Updated weights for policy 0, policy_version 21460 (0.0006) [2023-03-07 16:41:03,963][232226] Updated weights for policy 0, policy_version 21470 (0.0006) [2023-03-07 16:41:04,752][232226] Updated weights for policy 0, policy_version 21480 (0.0006) [2023-03-07 16:41:05,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12885.3, 300 sec: 12902.4). Total num frames: 21998592. Throughput: 0: 12902.9. Samples: 21976851. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:41:05,070][231894] Avg episode reward: [(0, '183.820')] [2023-03-07 16:41:05,557][232226] Updated weights for policy 0, policy_version 21490 (0.0007) [2023-03-07 16:41:06,350][232226] Updated weights for policy 0, policy_version 21500 (0.0006) [2023-03-07 16:41:07,157][232226] Updated weights for policy 0, policy_version 21510 (0.0007) [2023-03-07 16:41:07,948][232226] Updated weights for policy 0, policy_version 21520 (0.0006) [2023-03-07 16:41:08,737][232226] Updated weights for policy 0, policy_version 21530 (0.0006) [2023-03-07 16:41:09,534][232226] Updated weights for policy 0, policy_version 21540 (0.0006) [2023-03-07 16:41:10,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12885.3, 300 sec: 12902.4). Total num frames: 22063104. Throughput: 0: 12890.4. Samples: 22053986. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:41:10,069][231894] Avg episode reward: [(0, '196.462')] [2023-03-07 16:41:10,329][232226] Updated weights for policy 0, policy_version 21550 (0.0006) [2023-03-07 16:41:11,129][232226] Updated weights for policy 0, policy_version 21560 (0.0007) [2023-03-07 16:41:11,921][232226] Updated weights for policy 0, policy_version 21570 (0.0006) [2023-03-07 16:41:12,735][232226] Updated weights for policy 0, policy_version 21580 (0.0006) [2023-03-07 16:41:13,528][232226] Updated weights for policy 0, policy_version 21590 (0.0007) [2023-03-07 16:41:14,325][232226] Updated weights for policy 0, policy_version 21600 (0.0006) [2023-03-07 16:41:15,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 22127616. Throughput: 0: 12890.3. Samples: 22092694. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 16:41:15,069][231894] Avg episode reward: [(0, '195.673')] [2023-03-07 16:41:15,107][232226] Updated weights for policy 0, policy_version 21610 (0.0006) [2023-03-07 16:41:15,914][232226] Updated weights for policy 0, policy_version 21620 (0.0006) [2023-03-07 16:41:16,685][232226] Updated weights for policy 0, policy_version 21630 (0.0007) [2023-03-07 16:41:17,487][232226] Updated weights for policy 0, policy_version 21640 (0.0006) [2023-03-07 16:41:18,280][232226] Updated weights for policy 0, policy_version 21650 (0.0006) [2023-03-07 16:41:19,084][232226] Updated weights for policy 0, policy_version 21660 (0.0007) [2023-03-07 16:41:19,913][232226] Updated weights for policy 0, policy_version 21670 (0.0006) [2023-03-07 16:41:20,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 22192128. Throughput: 0: 12880.1. Samples: 22169884. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 16:41:20,070][231894] Avg episode reward: [(0, '191.257')] [2023-03-07 16:41:20,707][232226] Updated weights for policy 0, policy_version 21680 (0.0008) [2023-03-07 16:41:21,482][232226] Updated weights for policy 0, policy_version 21690 (0.0007) [2023-03-07 16:41:22,293][232226] Updated weights for policy 0, policy_version 21700 (0.0006) [2023-03-07 16:41:23,081][232226] Updated weights for policy 0, policy_version 21710 (0.0006) [2023-03-07 16:41:23,879][232226] Updated weights for policy 0, policy_version 21720 (0.0007) [2023-03-07 16:41:24,671][232226] Updated weights for policy 0, policy_version 21730 (0.0006) [2023-03-07 16:41:25,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.4, 300 sec: 12902.4). Total num frames: 22256640. Throughput: 0: 12871.9. Samples: 22246835. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 16:41:25,069][231894] Avg episode reward: [(0, '191.900')] [2023-03-07 16:41:25,487][232226] Updated weights for policy 0, policy_version 21740 (0.0006) [2023-03-07 16:41:26,281][232226] Updated weights for policy 0, policy_version 21750 (0.0006) [2023-03-07 16:41:27,078][232226] Updated weights for policy 0, policy_version 21760 (0.0007) [2023-03-07 16:41:27,867][232226] Updated weights for policy 0, policy_version 21770 (0.0006) [2023-03-07 16:41:28,648][232226] Updated weights for policy 0, policy_version 21780 (0.0006) [2023-03-07 16:41:29,427][232226] Updated weights for policy 0, policy_version 21790 (0.0006) [2023-03-07 16:41:30,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12902.4). Total num frames: 22321152. Throughput: 0: 12863.6. Samples: 22285241. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:41:30,069][231894] Avg episode reward: [(0, '192.414')] [2023-03-07 16:41:30,212][232226] Updated weights for policy 0, policy_version 21800 (0.0007) [2023-03-07 16:41:31,006][232226] Updated weights for policy 0, policy_version 21810 (0.0007) [2023-03-07 16:41:31,797][232226] Updated weights for policy 0, policy_version 21820 (0.0006) [2023-03-07 16:41:32,586][232226] Updated weights for policy 0, policy_version 21830 (0.0007) [2023-03-07 16:41:33,389][232226] Updated weights for policy 0, policy_version 21840 (0.0006) [2023-03-07 16:41:34,174][232226] Updated weights for policy 0, policy_version 21850 (0.0006) [2023-03-07 16:41:34,963][232226] Updated weights for policy 0, policy_version 21860 (0.0006) [2023-03-07 16:41:35,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12902.4). Total num frames: 22385664. Throughput: 0: 12881.0. Samples: 22363269. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:41:35,069][231894] Avg episode reward: [(0, '192.366')] [2023-03-07 16:41:35,748][232226] Updated weights for policy 0, policy_version 21870 (0.0007) [2023-03-07 16:41:36,568][232226] Updated weights for policy 0, policy_version 21880 (0.0006) [2023-03-07 16:41:37,349][232226] Updated weights for policy 0, policy_version 21890 (0.0006) [2023-03-07 16:41:38,136][232226] Updated weights for policy 0, policy_version 21900 (0.0007) [2023-03-07 16:41:38,932][232226] Updated weights for policy 0, policy_version 21910 (0.0006) [2023-03-07 16:41:39,730][232226] Updated weights for policy 0, policy_version 21920 (0.0007) [2023-03-07 16:41:40,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12902.4). Total num frames: 22450176. Throughput: 0: 12886.9. Samples: 22440855. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:41:40,069][231894] Avg episode reward: [(0, '192.880')] [2023-03-07 16:41:40,519][232226] Updated weights for policy 0, policy_version 21930 (0.0006) [2023-03-07 16:41:41,296][232226] Updated weights for policy 0, policy_version 21940 (0.0007) [2023-03-07 16:41:42,091][232226] Updated weights for policy 0, policy_version 21950 (0.0006) [2023-03-07 16:41:42,893][232226] Updated weights for policy 0, policy_version 21960 (0.0006) [2023-03-07 16:41:43,692][232226] Updated weights for policy 0, policy_version 21970 (0.0007) [2023-03-07 16:41:44,483][232226] Updated weights for policy 0, policy_version 21980 (0.0006) [2023-03-07 16:41:45,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 22514688. Throughput: 0: 12888.1. Samples: 22479627. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:41:45,069][231894] Avg episode reward: [(0, '195.999')] [2023-03-07 16:41:45,273][232226] Updated weights for policy 0, policy_version 21990 (0.0006) [2023-03-07 16:41:46,069][232226] Updated weights for policy 0, policy_version 22000 (0.0006) [2023-03-07 16:41:46,853][232226] Updated weights for policy 0, policy_version 22010 (0.0008) [2023-03-07 16:41:47,630][232226] Updated weights for policy 0, policy_version 22020 (0.0006) [2023-03-07 16:41:48,437][232226] Updated weights for policy 0, policy_version 22030 (0.0006) [2023-03-07 16:41:49,235][232226] Updated weights for policy 0, policy_version 22040 (0.0006) [2023-03-07 16:41:50,006][232226] Updated weights for policy 0, policy_version 22050 (0.0006) [2023-03-07 16:41:50,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12902.4). Total num frames: 22579200. Throughput: 0: 12893.8. Samples: 22557071. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:41:50,070][231894] Avg episode reward: [(0, '192.014')] [2023-03-07 16:41:50,814][232226] Updated weights for policy 0, policy_version 22060 (0.0006) [2023-03-07 16:41:51,597][232226] Updated weights for policy 0, policy_version 22070 (0.0006) [2023-03-07 16:41:52,393][232226] Updated weights for policy 0, policy_version 22080 (0.0006) [2023-03-07 16:41:53,183][232226] Updated weights for policy 0, policy_version 22090 (0.0006) [2023-03-07 16:41:53,995][232226] Updated weights for policy 0, policy_version 22100 (0.0006) [2023-03-07 16:41:54,779][232226] Updated weights for policy 0, policy_version 22110 (0.0006) [2023-03-07 16:41:55,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12902.4). Total num frames: 22643712. Throughput: 0: 12900.2. Samples: 22634496. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:41:55,069][231894] Avg episode reward: [(0, '191.257')] [2023-03-07 16:41:55,570][232226] Updated weights for policy 0, policy_version 22120 (0.0006) [2023-03-07 16:41:56,367][232226] Updated weights for policy 0, policy_version 22130 (0.0007) [2023-03-07 16:41:57,166][232226] Updated weights for policy 0, policy_version 22140 (0.0007) [2023-03-07 16:41:57,957][232226] Updated weights for policy 0, policy_version 22150 (0.0006) [2023-03-07 16:41:58,756][232226] Updated weights for policy 0, policy_version 22160 (0.0007) [2023-03-07 16:41:59,553][232226] Updated weights for policy 0, policy_version 22170 (0.0006) [2023-03-07 16:42:00,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 22708224. Throughput: 0: 12896.3. Samples: 22673027. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:42:00,069][231894] Avg episode reward: [(0, '199.302')] [2023-03-07 16:42:00,345][232226] Updated weights for policy 0, policy_version 22180 (0.0006) [2023-03-07 16:42:01,137][232226] Updated weights for policy 0, policy_version 22190 (0.0007) [2023-03-07 16:42:01,918][232226] Updated weights for policy 0, policy_version 22200 (0.0007) [2023-03-07 16:42:02,720][232226] Updated weights for policy 0, policy_version 22210 (0.0007) [2023-03-07 16:42:03,520][232226] Updated weights for policy 0, policy_version 22220 (0.0006) [2023-03-07 16:42:04,303][232226] Updated weights for policy 0, policy_version 22230 (0.0007) [2023-03-07 16:42:05,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 22772736. Throughput: 0: 12904.4. Samples: 22750584. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:42:05,070][231894] Avg episode reward: [(0, '194.343')] [2023-03-07 16:42:05,097][232226] Updated weights for policy 0, policy_version 22240 (0.0006) [2023-03-07 16:42:05,897][232226] Updated weights for policy 0, policy_version 22250 (0.0006) [2023-03-07 16:42:06,687][232226] Updated weights for policy 0, policy_version 22260 (0.0006) [2023-03-07 16:42:07,485][232226] Updated weights for policy 0, policy_version 22270 (0.0008) [2023-03-07 16:42:08,283][232226] Updated weights for policy 0, policy_version 22280 (0.0006) [2023-03-07 16:42:09,075][232226] Updated weights for policy 0, policy_version 22290 (0.0006) [2023-03-07 16:42:09,861][232226] Updated weights for policy 0, policy_version 22300 (0.0006) [2023-03-07 16:42:10,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 22837248. Throughput: 0: 12916.0. Samples: 22828052. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:42:10,069][231894] Avg episode reward: [(0, '188.732')] [2023-03-07 16:42:10,646][232226] Updated weights for policy 0, policy_version 22310 (0.0006) [2023-03-07 16:42:11,452][232226] Updated weights for policy 0, policy_version 22320 (0.0005) [2023-03-07 16:42:12,229][232226] Updated weights for policy 0, policy_version 22330 (0.0006) [2023-03-07 16:42:13,026][232226] Updated weights for policy 0, policy_version 22340 (0.0006) [2023-03-07 16:42:13,822][232226] Updated weights for policy 0, policy_version 22350 (0.0006) [2023-03-07 16:42:14,624][232226] Updated weights for policy 0, policy_version 22360 (0.0006) [2023-03-07 16:42:15,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 22901760. Throughput: 0: 12925.4. Samples: 22866885. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:42:15,070][231894] Avg episode reward: [(0, '189.980')] [2023-03-07 16:42:15,410][232226] Updated weights for policy 0, policy_version 22370 (0.0006) [2023-03-07 16:42:16,218][232226] Updated weights for policy 0, policy_version 22380 (0.0007) [2023-03-07 16:42:17,026][232226] Updated weights for policy 0, policy_version 22390 (0.0006) [2023-03-07 16:42:17,823][232226] Updated weights for policy 0, policy_version 22400 (0.0007) [2023-03-07 16:42:18,619][232226] Updated weights for policy 0, policy_version 22410 (0.0006) [2023-03-07 16:42:19,395][232226] Updated weights for policy 0, policy_version 22420 (0.0005) [2023-03-07 16:42:20,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 22966272. Throughput: 0: 12903.4. Samples: 22943923. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:42:20,069][231894] Avg episode reward: [(0, '187.866')] [2023-03-07 16:42:20,184][232226] Updated weights for policy 0, policy_version 22430 (0.0006) [2023-03-07 16:42:20,984][232226] Updated weights for policy 0, policy_version 22440 (0.0007) [2023-03-07 16:42:21,762][232226] Updated weights for policy 0, policy_version 22450 (0.0006) [2023-03-07 16:42:22,571][232226] Updated weights for policy 0, policy_version 22460 (0.0006) [2023-03-07 16:42:23,351][232226] Updated weights for policy 0, policy_version 22470 (0.0007) [2023-03-07 16:42:24,142][232226] Updated weights for policy 0, policy_version 22480 (0.0006) [2023-03-07 16:42:24,934][232226] Updated weights for policy 0, policy_version 22490 (0.0006) [2023-03-07 16:42:25,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 23030784. Throughput: 0: 12906.9. Samples: 23021664. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:42:25,069][231894] Avg episode reward: [(0, '191.591')] [2023-03-07 16:42:25,074][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000022491_23030784.pth... [2023-03-07 16:42:25,103][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000019468_19935232.pth [2023-03-07 16:42:25,726][232226] Updated weights for policy 0, policy_version 22500 (0.0007) [2023-03-07 16:42:26,517][232226] Updated weights for policy 0, policy_version 22510 (0.0006) [2023-03-07 16:42:27,315][232226] Updated weights for policy 0, policy_version 22520 (0.0007) [2023-03-07 16:42:28,102][232226] Updated weights for policy 0, policy_version 22530 (0.0006) [2023-03-07 16:42:28,888][232226] Updated weights for policy 0, policy_version 22540 (0.0007) [2023-03-07 16:42:29,669][232226] Updated weights for policy 0, policy_version 22550 (0.0006) [2023-03-07 16:42:30,069][231894] Fps is (10 sec: 13004.8, 60 sec: 12919.5, 300 sec: 12905.9). Total num frames: 23096320. Throughput: 0: 12909.4. Samples: 23060552. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:42:30,069][231894] Avg episode reward: [(0, '195.892')] [2023-03-07 16:42:30,475][232226] Updated weights for policy 0, policy_version 22560 (0.0007) [2023-03-07 16:42:31,261][232226] Updated weights for policy 0, policy_version 22570 (0.0006) [2023-03-07 16:42:32,065][232226] Updated weights for policy 0, policy_version 22580 (0.0006) [2023-03-07 16:42:32,848][232226] Updated weights for policy 0, policy_version 22590 (0.0006) [2023-03-07 16:42:33,646][232226] Updated weights for policy 0, policy_version 22600 (0.0006) [2023-03-07 16:42:34,442][232226] Updated weights for policy 0, policy_version 22610 (0.0006) [2023-03-07 16:42:35,069][231894] Fps is (10 sec: 13004.9, 60 sec: 12919.5, 300 sec: 12905.9). Total num frames: 23160832. Throughput: 0: 12911.5. Samples: 23138088. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:42:35,069][231894] Avg episode reward: [(0, '192.628')] [2023-03-07 16:42:35,209][232226] Updated weights for policy 0, policy_version 22620 (0.0006) [2023-03-07 16:42:36,005][232226] Updated weights for policy 0, policy_version 22630 (0.0007) [2023-03-07 16:42:36,819][232226] Updated weights for policy 0, policy_version 22640 (0.0007) [2023-03-07 16:42:37,606][232226] Updated weights for policy 0, policy_version 22650 (0.0007) [2023-03-07 16:42:38,414][232226] Updated weights for policy 0, policy_version 22660 (0.0006) [2023-03-07 16:42:39,205][232226] Updated weights for policy 0, policy_version 22670 (0.0006) [2023-03-07 16:42:40,018][232226] Updated weights for policy 0, policy_version 22680 (0.0006) [2023-03-07 16:42:40,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 23224320. Throughput: 0: 12907.7. Samples: 23215340. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:42:40,069][231894] Avg episode reward: [(0, '196.126')] [2023-03-07 16:42:40,787][232226] Updated weights for policy 0, policy_version 22690 (0.0007) [2023-03-07 16:42:41,594][232226] Updated weights for policy 0, policy_version 22700 (0.0007) [2023-03-07 16:42:42,393][232226] Updated weights for policy 0, policy_version 22710 (0.0006) [2023-03-07 16:42:43,181][232226] Updated weights for policy 0, policy_version 22720 (0.0006) [2023-03-07 16:42:43,966][232226] Updated weights for policy 0, policy_version 22730 (0.0007) [2023-03-07 16:42:44,757][232226] Updated weights for policy 0, policy_version 22740 (0.0007) [2023-03-07 16:42:45,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12919.5, 300 sec: 12905.9). Total num frames: 23289856. Throughput: 0: 12911.7. Samples: 23254056. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:42:45,070][231894] Avg episode reward: [(0, '186.840')] [2023-03-07 16:42:45,554][232226] Updated weights for policy 0, policy_version 22750 (0.0006) [2023-03-07 16:42:46,340][232226] Updated weights for policy 0, policy_version 22760 (0.0005) [2023-03-07 16:42:47,136][232226] Updated weights for policy 0, policy_version 22770 (0.0006) [2023-03-07 16:42:47,932][232226] Updated weights for policy 0, policy_version 22780 (0.0007) [2023-03-07 16:42:48,713][232226] Updated weights for policy 0, policy_version 22790 (0.0006) [2023-03-07 16:42:49,514][232226] Updated weights for policy 0, policy_version 22800 (0.0006) [2023-03-07 16:42:50,069][231894] Fps is (10 sec: 13004.9, 60 sec: 12919.5, 300 sec: 12905.9). Total num frames: 23354368. Throughput: 0: 12914.2. Samples: 23331723. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:42:50,069][231894] Avg episode reward: [(0, '192.648')] [2023-03-07 16:42:50,287][232226] Updated weights for policy 0, policy_version 22810 (0.0006) [2023-03-07 16:42:51,096][232226] Updated weights for policy 0, policy_version 22820 (0.0006) [2023-03-07 16:42:51,882][232226] Updated weights for policy 0, policy_version 22830 (0.0007) [2023-03-07 16:42:52,675][232226] Updated weights for policy 0, policy_version 22840 (0.0006) [2023-03-07 16:42:53,469][232226] Updated weights for policy 0, policy_version 22850 (0.0006) [2023-03-07 16:42:54,258][232226] Updated weights for policy 0, policy_version 22860 (0.0007) [2023-03-07 16:42:55,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 23417856. Throughput: 0: 12914.9. Samples: 23409225. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:42:55,069][231894] Avg episode reward: [(0, '188.268')] [2023-03-07 16:42:55,070][232226] Updated weights for policy 0, policy_version 22870 (0.0006) [2023-03-07 16:42:55,844][232226] Updated weights for policy 0, policy_version 22880 (0.0006) [2023-03-07 16:42:56,640][232226] Updated weights for policy 0, policy_version 22890 (0.0007) [2023-03-07 16:42:57,426][232226] Updated weights for policy 0, policy_version 22900 (0.0007) [2023-03-07 16:42:58,211][232226] Updated weights for policy 0, policy_version 22910 (0.0006) [2023-03-07 16:42:59,003][232226] Updated weights for policy 0, policy_version 22920 (0.0007) [2023-03-07 16:42:59,801][232226] Updated weights for policy 0, policy_version 22930 (0.0007) [2023-03-07 16:43:00,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12919.4, 300 sec: 12902.4). Total num frames: 23483392. Throughput: 0: 12911.2. Samples: 23447889. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:43:00,070][231894] Avg episode reward: [(0, '189.733')] [2023-03-07 16:43:00,605][232226] Updated weights for policy 0, policy_version 22940 (0.0006) [2023-03-07 16:43:01,410][232226] Updated weights for policy 0, policy_version 22950 (0.0007) [2023-03-07 16:43:02,185][232226] Updated weights for policy 0, policy_version 22960 (0.0008) [2023-03-07 16:43:02,998][232226] Updated weights for policy 0, policy_version 22970 (0.0007) [2023-03-07 16:43:03,794][232226] Updated weights for policy 0, policy_version 22980 (0.0006) [2023-03-07 16:43:04,597][232226] Updated weights for policy 0, policy_version 22990 (0.0007) [2023-03-07 16:43:05,069][231894] Fps is (10 sec: 13004.8, 60 sec: 12919.5, 300 sec: 12902.4). Total num frames: 23547904. Throughput: 0: 12917.6. Samples: 23525215. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:43:05,069][231894] Avg episode reward: [(0, '198.592')] [2023-03-07 16:43:05,382][232226] Updated weights for policy 0, policy_version 23000 (0.0008) [2023-03-07 16:43:06,180][232226] Updated weights for policy 0, policy_version 23010 (0.0007) [2023-03-07 16:43:06,965][232226] Updated weights for policy 0, policy_version 23020 (0.0007) [2023-03-07 16:43:07,770][232226] Updated weights for policy 0, policy_version 23030 (0.0006) [2023-03-07 16:43:08,549][232226] Updated weights for policy 0, policy_version 23040 (0.0006) [2023-03-07 16:43:09,354][232226] Updated weights for policy 0, policy_version 23050 (0.0007) [2023-03-07 16:43:10,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12919.5, 300 sec: 12902.4). Total num frames: 23612416. Throughput: 0: 12909.6. Samples: 23602596. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:43:10,069][231894] Avg episode reward: [(0, '185.023')] [2023-03-07 16:43:10,128][232226] Updated weights for policy 0, policy_version 23060 (0.0006) [2023-03-07 16:43:10,903][232226] Updated weights for policy 0, policy_version 23070 (0.0007) [2023-03-07 16:43:11,693][232226] Updated weights for policy 0, policy_version 23080 (0.0006) [2023-03-07 16:43:12,505][232226] Updated weights for policy 0, policy_version 23090 (0.0006) [2023-03-07 16:43:13,292][232226] Updated weights for policy 0, policy_version 23100 (0.0006) [2023-03-07 16:43:14,077][232226] Updated weights for policy 0, policy_version 23110 (0.0006) [2023-03-07 16:43:14,861][232226] Updated weights for policy 0, policy_version 23120 (0.0006) [2023-03-07 16:43:15,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12919.5, 300 sec: 12902.4). Total num frames: 23676928. Throughput: 0: 12910.2. Samples: 23641511. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:43:15,070][231894] Avg episode reward: [(0, '192.064')] [2023-03-07 16:43:15,662][232226] Updated weights for policy 0, policy_version 23130 (0.0006) [2023-03-07 16:43:16,460][232226] Updated weights for policy 0, policy_version 23140 (0.0007) [2023-03-07 16:43:17,258][232226] Updated weights for policy 0, policy_version 23150 (0.0006) [2023-03-07 16:43:18,046][232226] Updated weights for policy 0, policy_version 23160 (0.0006) [2023-03-07 16:43:18,864][232226] Updated weights for policy 0, policy_version 23170 (0.0006) [2023-03-07 16:43:19,620][232226] Updated weights for policy 0, policy_version 23180 (0.0006) [2023-03-07 16:43:20,069][231894] Fps is (10 sec: 12902.2, 60 sec: 12919.5, 300 sec: 12902.4). Total num frames: 23741440. Throughput: 0: 12912.9. Samples: 23719172. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:43:20,070][231894] Avg episode reward: [(0, '193.271')] [2023-03-07 16:43:20,416][232226] Updated weights for policy 0, policy_version 23190 (0.0007) [2023-03-07 16:43:21,219][232226] Updated weights for policy 0, policy_version 23200 (0.0006) [2023-03-07 16:43:22,012][232226] Updated weights for policy 0, policy_version 23210 (0.0006) [2023-03-07 16:43:22,801][232226] Updated weights for policy 0, policy_version 23220 (0.0006) [2023-03-07 16:43:23,586][232226] Updated weights for policy 0, policy_version 23230 (0.0006) [2023-03-07 16:43:24,364][232226] Updated weights for policy 0, policy_version 23240 (0.0006) [2023-03-07 16:43:25,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12919.4, 300 sec: 12905.9). Total num frames: 23805952. Throughput: 0: 12925.1. Samples: 23796973. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:43:25,070][231894] Avg episode reward: [(0, '190.497')] [2023-03-07 16:43:25,175][232226] Updated weights for policy 0, policy_version 23250 (0.0006) [2023-03-07 16:43:25,954][232226] Updated weights for policy 0, policy_version 23260 (0.0006) [2023-03-07 16:43:26,746][232226] Updated weights for policy 0, policy_version 23270 (0.0006) [2023-03-07 16:43:27,537][232226] Updated weights for policy 0, policy_version 23280 (0.0006) [2023-03-07 16:43:28,335][232226] Updated weights for policy 0, policy_version 23290 (0.0006) [2023-03-07 16:43:29,121][232226] Updated weights for policy 0, policy_version 23300 (0.0006) [2023-03-07 16:43:29,914][232226] Updated weights for policy 0, policy_version 23310 (0.0006) [2023-03-07 16:43:30,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12905.9). Total num frames: 23870464. Throughput: 0: 12924.7. Samples: 23835665. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:43:30,069][231894] Avg episode reward: [(0, '194.844')] [2023-03-07 16:43:30,696][232226] Updated weights for policy 0, policy_version 23320 (0.0007) [2023-03-07 16:43:31,487][232226] Updated weights for policy 0, policy_version 23330 (0.0006) [2023-03-07 16:43:32,286][232226] Updated weights for policy 0, policy_version 23340 (0.0006) [2023-03-07 16:43:33,099][232226] Updated weights for policy 0, policy_version 23350 (0.0007) [2023-03-07 16:43:33,877][232226] Updated weights for policy 0, policy_version 23360 (0.0006) [2023-03-07 16:43:34,675][232226] Updated weights for policy 0, policy_version 23370 (0.0007) [2023-03-07 16:43:35,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12902.4, 300 sec: 12905.9). Total num frames: 23934976. Throughput: 0: 12919.5. Samples: 23913100. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:43:35,069][231894] Avg episode reward: [(0, '192.148')] [2023-03-07 16:43:35,477][232226] Updated weights for policy 0, policy_version 23380 (0.0006) [2023-03-07 16:43:36,274][232226] Updated weights for policy 0, policy_version 23390 (0.0006) [2023-03-07 16:43:37,053][232226] Updated weights for policy 0, policy_version 23400 (0.0007) [2023-03-07 16:43:37,858][232226] Updated weights for policy 0, policy_version 23410 (0.0006) [2023-03-07 16:43:38,650][232226] Updated weights for policy 0, policy_version 23420 (0.0005) [2023-03-07 16:43:39,428][232226] Updated weights for policy 0, policy_version 23430 (0.0006) [2023-03-07 16:43:40,069][231894] Fps is (10 sec: 13004.8, 60 sec: 12936.5, 300 sec: 12909.3). Total num frames: 24000512. Throughput: 0: 12919.9. Samples: 23990618. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:43:40,069][231894] Avg episode reward: [(0, '193.028')] [2023-03-07 16:43:40,222][232226] Updated weights for policy 0, policy_version 23440 (0.0007) [2023-03-07 16:43:41,019][232226] Updated weights for policy 0, policy_version 23450 (0.0007) [2023-03-07 16:43:41,806][232226] Updated weights for policy 0, policy_version 23460 (0.0006) [2023-03-07 16:43:42,600][232226] Updated weights for policy 0, policy_version 23470 (0.0006) [2023-03-07 16:43:43,392][232226] Updated weights for policy 0, policy_version 23480 (0.0006) [2023-03-07 16:43:44,189][232226] Updated weights for policy 0, policy_version 23490 (0.0007) [2023-03-07 16:43:44,981][232226] Updated weights for policy 0, policy_version 23500 (0.0007) [2023-03-07 16:43:45,069][231894] Fps is (10 sec: 13004.7, 60 sec: 12919.5, 300 sec: 12909.3). Total num frames: 24065024. Throughput: 0: 12926.4. Samples: 24029578. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:43:45,069][231894] Avg episode reward: [(0, '188.528')] [2023-03-07 16:43:45,786][232226] Updated weights for policy 0, policy_version 23510 (0.0007) [2023-03-07 16:43:46,589][232226] Updated weights for policy 0, policy_version 23520 (0.0006) [2023-03-07 16:43:47,378][232226] Updated weights for policy 0, policy_version 23530 (0.0006) [2023-03-07 16:43:48,151][232226] Updated weights for policy 0, policy_version 23540 (0.0007) [2023-03-07 16:43:48,948][232226] Updated weights for policy 0, policy_version 23550 (0.0007) [2023-03-07 16:43:49,745][232226] Updated weights for policy 0, policy_version 23560 (0.0006) [2023-03-07 16:43:50,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12919.4, 300 sec: 12905.9). Total num frames: 24129536. Throughput: 0: 12928.7. Samples: 24107009. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:43:50,070][231894] Avg episode reward: [(0, '186.459')] [2023-03-07 16:43:50,521][232226] Updated weights for policy 0, policy_version 23570 (0.0007) [2023-03-07 16:43:51,322][232226] Updated weights for policy 0, policy_version 23580 (0.0007) [2023-03-07 16:43:52,117][232226] Updated weights for policy 0, policy_version 23590 (0.0006) [2023-03-07 16:43:52,893][232226] Updated weights for policy 0, policy_version 23600 (0.0005) [2023-03-07 16:43:53,686][232226] Updated weights for policy 0, policy_version 23610 (0.0007) [2023-03-07 16:43:54,477][232226] Updated weights for policy 0, policy_version 23620 (0.0006) [2023-03-07 16:43:55,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12936.5, 300 sec: 12905.9). Total num frames: 24194048. Throughput: 0: 12935.9. Samples: 24184711. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:43:55,069][231894] Avg episode reward: [(0, '194.732')] [2023-03-07 16:43:55,278][232226] Updated weights for policy 0, policy_version 23630 (0.0007) [2023-03-07 16:43:56,062][232226] Updated weights for policy 0, policy_version 23640 (0.0007) [2023-03-07 16:43:56,852][232226] Updated weights for policy 0, policy_version 23650 (0.0007) [2023-03-07 16:43:57,648][232226] Updated weights for policy 0, policy_version 23660 (0.0007) [2023-03-07 16:43:58,459][232226] Updated weights for policy 0, policy_version 23670 (0.0006) [2023-03-07 16:43:59,250][232226] Updated weights for policy 0, policy_version 23680 (0.0007) [2023-03-07 16:44:00,033][232226] Updated weights for policy 0, policy_version 23690 (0.0006) [2023-03-07 16:44:00,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12919.5, 300 sec: 12905.9). Total num frames: 24258560. Throughput: 0: 12933.1. Samples: 24223501. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:44:00,069][231894] Avg episode reward: [(0, '199.714')] [2023-03-07 16:44:00,825][232226] Updated weights for policy 0, policy_version 23700 (0.0006) [2023-03-07 16:44:01,614][232226] Updated weights for policy 0, policy_version 23710 (0.0006) [2023-03-07 16:44:02,402][232226] Updated weights for policy 0, policy_version 23720 (0.0007) [2023-03-07 16:44:03,205][232226] Updated weights for policy 0, policy_version 23730 (0.0006) [2023-03-07 16:44:04,011][232226] Updated weights for policy 0, policy_version 23740 (0.0006) [2023-03-07 16:44:04,808][232226] Updated weights for policy 0, policy_version 23750 (0.0006) [2023-03-07 16:44:05,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12919.4, 300 sec: 12905.9). Total num frames: 24323072. Throughput: 0: 12924.3. Samples: 24300765. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:44:05,069][231894] Avg episode reward: [(0, '196.767')] [2023-03-07 16:44:05,604][232226] Updated weights for policy 0, policy_version 23760 (0.0006) [2023-03-07 16:44:06,412][232226] Updated weights for policy 0, policy_version 23770 (0.0006) [2023-03-07 16:44:07,197][232226] Updated weights for policy 0, policy_version 23780 (0.0006) [2023-03-07 16:44:07,989][232226] Updated weights for policy 0, policy_version 23790 (0.0006) [2023-03-07 16:44:08,794][232226] Updated weights for policy 0, policy_version 23800 (0.0006) [2023-03-07 16:44:09,577][232226] Updated weights for policy 0, policy_version 23810 (0.0007) [2023-03-07 16:44:10,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12919.4, 300 sec: 12905.9). Total num frames: 24387584. Throughput: 0: 12911.9. Samples: 24378010. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:44:10,069][231894] Avg episode reward: [(0, '187.839')] [2023-03-07 16:44:10,365][232226] Updated weights for policy 0, policy_version 23820 (0.0006) [2023-03-07 16:44:11,167][232226] Updated weights for policy 0, policy_version 23830 (0.0007) [2023-03-07 16:44:11,962][232226] Updated weights for policy 0, policy_version 23840 (0.0005) [2023-03-07 16:44:12,762][232226] Updated weights for policy 0, policy_version 23850 (0.0006) [2023-03-07 16:44:13,556][232226] Updated weights for policy 0, policy_version 23860 (0.0006) [2023-03-07 16:44:14,338][232226] Updated weights for policy 0, policy_version 23870 (0.0006) [2023-03-07 16:44:15,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12919.5, 300 sec: 12905.9). Total num frames: 24452096. Throughput: 0: 12911.2. Samples: 24416671. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:44:15,070][231894] Avg episode reward: [(0, '194.383')] [2023-03-07 16:44:15,139][232226] Updated weights for policy 0, policy_version 23880 (0.0007) [2023-03-07 16:44:15,909][232226] Updated weights for policy 0, policy_version 23890 (0.0006) [2023-03-07 16:44:16,725][232226] Updated weights for policy 0, policy_version 23900 (0.0006) [2023-03-07 16:44:17,501][232226] Updated weights for policy 0, policy_version 23910 (0.0006) [2023-03-07 16:44:18,291][232226] Updated weights for policy 0, policy_version 23920 (0.0007) [2023-03-07 16:44:19,104][232226] Updated weights for policy 0, policy_version 23930 (0.0006) [2023-03-07 16:44:19,898][232226] Updated weights for policy 0, policy_version 23940 (0.0006) [2023-03-07 16:44:20,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12919.5, 300 sec: 12905.9). Total num frames: 24516608. Throughput: 0: 12912.7. Samples: 24494174. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:44:20,069][231894] Avg episode reward: [(0, '191.006')] [2023-03-07 16:44:20,678][232226] Updated weights for policy 0, policy_version 23950 (0.0006) [2023-03-07 16:44:21,486][232226] Updated weights for policy 0, policy_version 23960 (0.0006) [2023-03-07 16:44:22,282][232226] Updated weights for policy 0, policy_version 23970 (0.0006) [2023-03-07 16:44:23,088][232226] Updated weights for policy 0, policy_version 23980 (0.0006) [2023-03-07 16:44:23,885][232226] Updated weights for policy 0, policy_version 23990 (0.0006) [2023-03-07 16:44:24,674][232226] Updated weights for policy 0, policy_version 24000 (0.0006) [2023-03-07 16:44:25,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12919.5, 300 sec: 12905.9). Total num frames: 24581120. Throughput: 0: 12905.3. Samples: 24571359. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:44:25,069][231894] Avg episode reward: [(0, '185.962')] [2023-03-07 16:44:25,075][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000024005_24581120.pth... [2023-03-07 16:44:25,105][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000020980_21483520.pth [2023-03-07 16:44:25,449][232226] Updated weights for policy 0, policy_version 24010 (0.0007) [2023-03-07 16:44:26,256][232226] Updated weights for policy 0, policy_version 24020 (0.0006) [2023-03-07 16:44:27,067][232226] Updated weights for policy 0, policy_version 24030 (0.0006) [2023-03-07 16:44:27,846][232226] Updated weights for policy 0, policy_version 24040 (0.0006) [2023-03-07 16:44:28,642][232226] Updated weights for policy 0, policy_version 24050 (0.0006) [2023-03-07 16:44:29,442][232226] Updated weights for policy 0, policy_version 24060 (0.0006) [2023-03-07 16:44:30,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 24644608. Throughput: 0: 12898.6. Samples: 24610012. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:44:30,069][231894] Avg episode reward: [(0, '196.461')] [2023-03-07 16:44:30,229][232226] Updated weights for policy 0, policy_version 24070 (0.0006) [2023-03-07 16:44:31,025][232226] Updated weights for policy 0, policy_version 24080 (0.0007) [2023-03-07 16:44:31,828][232226] Updated weights for policy 0, policy_version 24090 (0.0006) [2023-03-07 16:44:32,611][232226] Updated weights for policy 0, policy_version 24100 (0.0005) [2023-03-07 16:44:33,405][232226] Updated weights for policy 0, policy_version 24110 (0.0006) [2023-03-07 16:44:34,190][232226] Updated weights for policy 0, policy_version 24120 (0.0007) [2023-03-07 16:44:35,012][232226] Updated weights for policy 0, policy_version 24130 (0.0007) [2023-03-07 16:44:35,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12919.5, 300 sec: 12905.9). Total num frames: 24710144. Throughput: 0: 12898.5. Samples: 24687440. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:44:35,069][231894] Avg episode reward: [(0, '194.061')] [2023-03-07 16:44:35,795][232226] Updated weights for policy 0, policy_version 24140 (0.0007) [2023-03-07 16:44:36,575][232226] Updated weights for policy 0, policy_version 24150 (0.0006) [2023-03-07 16:44:37,375][232226] Updated weights for policy 0, policy_version 24160 (0.0006) [2023-03-07 16:44:38,169][232226] Updated weights for policy 0, policy_version 24170 (0.0006) [2023-03-07 16:44:38,958][232226] Updated weights for policy 0, policy_version 24180 (0.0006) [2023-03-07 16:44:39,758][232226] Updated weights for policy 0, policy_version 24190 (0.0006) [2023-03-07 16:44:40,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12902.4). Total num frames: 24773632. Throughput: 0: 12893.6. Samples: 24764922. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:44:40,069][231894] Avg episode reward: [(0, '187.505')] [2023-03-07 16:44:40,541][232226] Updated weights for policy 0, policy_version 24200 (0.0006) [2023-03-07 16:44:41,329][232226] Updated weights for policy 0, policy_version 24210 (0.0007) [2023-03-07 16:44:42,141][232226] Updated weights for policy 0, policy_version 24220 (0.0006) [2023-03-07 16:44:42,924][232226] Updated weights for policy 0, policy_version 24230 (0.0006) [2023-03-07 16:44:43,710][232226] Updated weights for policy 0, policy_version 24240 (0.0006) [2023-03-07 16:44:44,485][232226] Updated weights for policy 0, policy_version 24250 (0.0006) [2023-03-07 16:44:45,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12905.9). Total num frames: 24839168. Throughput: 0: 12888.5. Samples: 24803484. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:44:45,070][231894] Avg episode reward: [(0, '190.248')] [2023-03-07 16:44:45,279][232226] Updated weights for policy 0, policy_version 24260 (0.0007) [2023-03-07 16:44:46,080][232226] Updated weights for policy 0, policy_version 24270 (0.0006) [2023-03-07 16:44:46,869][232226] Updated weights for policy 0, policy_version 24280 (0.0006) [2023-03-07 16:44:47,677][232226] Updated weights for policy 0, policy_version 24290 (0.0006) [2023-03-07 16:44:48,473][232226] Updated weights for policy 0, policy_version 24300 (0.0006) [2023-03-07 16:44:49,276][232226] Updated weights for policy 0, policy_version 24310 (0.0007) [2023-03-07 16:44:50,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.4, 300 sec: 12902.4). Total num frames: 24902656. Throughput: 0: 12896.5. Samples: 24881105. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:44:50,069][231894] Avg episode reward: [(0, '187.690')] [2023-03-07 16:44:50,072][232226] Updated weights for policy 0, policy_version 24320 (0.0006) [2023-03-07 16:44:50,859][232226] Updated weights for policy 0, policy_version 24330 (0.0006) [2023-03-07 16:44:51,649][232226] Updated weights for policy 0, policy_version 24340 (0.0007) [2023-03-07 16:44:52,427][232226] Updated weights for policy 0, policy_version 24350 (0.0006) [2023-03-07 16:44:53,230][232226] Updated weights for policy 0, policy_version 24360 (0.0007) [2023-03-07 16:44:54,015][232226] Updated weights for policy 0, policy_version 24370 (0.0007) [2023-03-07 16:44:54,810][232226] Updated weights for policy 0, policy_version 24380 (0.0007) [2023-03-07 16:44:55,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12905.9). Total num frames: 24968192. Throughput: 0: 12902.3. Samples: 24958611. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:44:55,069][231894] Avg episode reward: [(0, '196.428')] [2023-03-07 16:44:55,596][232226] Updated weights for policy 0, policy_version 24390 (0.0007) [2023-03-07 16:44:56,377][232226] Updated weights for policy 0, policy_version 24400 (0.0006) [2023-03-07 16:44:57,183][232226] Updated weights for policy 0, policy_version 24410 (0.0007) [2023-03-07 16:44:57,970][232226] Updated weights for policy 0, policy_version 24420 (0.0006) [2023-03-07 16:44:58,762][232226] Updated weights for policy 0, policy_version 24430 (0.0006) [2023-03-07 16:44:59,586][232226] Updated weights for policy 0, policy_version 24440 (0.0006) [2023-03-07 16:45:00,069][231894] Fps is (10 sec: 13004.7, 60 sec: 12902.4, 300 sec: 12905.9). Total num frames: 25032704. Throughput: 0: 12906.7. Samples: 24997472. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:45:00,069][231894] Avg episode reward: [(0, '192.463')] [2023-03-07 16:45:00,372][232226] Updated weights for policy 0, policy_version 24450 (0.0007) [2023-03-07 16:45:01,151][232226] Updated weights for policy 0, policy_version 24460 (0.0006) [2023-03-07 16:45:01,940][232226] Updated weights for policy 0, policy_version 24470 (0.0007) [2023-03-07 16:45:02,751][232226] Updated weights for policy 0, policy_version 24480 (0.0006) [2023-03-07 16:45:03,535][232226] Updated weights for policy 0, policy_version 24490 (0.0007) [2023-03-07 16:45:04,335][232226] Updated weights for policy 0, policy_version 24500 (0.0006) [2023-03-07 16:45:05,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12905.9). Total num frames: 25097216. Throughput: 0: 12902.0. Samples: 25074765. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:45:05,069][231894] Avg episode reward: [(0, '192.926')] [2023-03-07 16:45:05,141][232226] Updated weights for policy 0, policy_version 24510 (0.0006) [2023-03-07 16:45:05,932][232226] Updated weights for policy 0, policy_version 24520 (0.0007) [2023-03-07 16:45:06,726][232226] Updated weights for policy 0, policy_version 24530 (0.0007) [2023-03-07 16:45:07,520][232226] Updated weights for policy 0, policy_version 24540 (0.0007) [2023-03-07 16:45:08,324][232226] Updated weights for policy 0, policy_version 24550 (0.0006) [2023-03-07 16:45:09,115][232226] Updated weights for policy 0, policy_version 24560 (0.0007) [2023-03-07 16:45:09,900][232226] Updated weights for policy 0, policy_version 24570 (0.0006) [2023-03-07 16:45:10,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12909.3). Total num frames: 25161728. Throughput: 0: 12900.4. Samples: 25151874. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:45:10,069][231894] Avg episode reward: [(0, '192.089')] [2023-03-07 16:45:10,707][232226] Updated weights for policy 0, policy_version 24580 (0.0006) [2023-03-07 16:45:11,496][232226] Updated weights for policy 0, policy_version 24590 (0.0007) [2023-03-07 16:45:12,278][232226] Updated weights for policy 0, policy_version 24600 (0.0006) [2023-03-07 16:45:13,066][232226] Updated weights for policy 0, policy_version 24610 (0.0007) [2023-03-07 16:45:13,869][232226] Updated weights for policy 0, policy_version 24620 (0.0007) [2023-03-07 16:45:14,634][232226] Updated weights for policy 0, policy_version 24630 (0.0006) [2023-03-07 16:45:15,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12909.3). Total num frames: 25226240. Throughput: 0: 12907.0. Samples: 25190829. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:45:15,069][231894] Avg episode reward: [(0, '193.595')] [2023-03-07 16:45:15,429][232226] Updated weights for policy 0, policy_version 24640 (0.0006) [2023-03-07 16:45:16,236][232226] Updated weights for policy 0, policy_version 24650 (0.0007) [2023-03-07 16:45:17,026][232226] Updated weights for policy 0, policy_version 24660 (0.0006) [2023-03-07 16:45:17,834][232226] Updated weights for policy 0, policy_version 24670 (0.0006) [2023-03-07 16:45:18,616][232226] Updated weights for policy 0, policy_version 24680 (0.0006) [2023-03-07 16:45:19,430][232226] Updated weights for policy 0, policy_version 24690 (0.0006) [2023-03-07 16:45:20,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12905.9). Total num frames: 25290752. Throughput: 0: 12905.2. Samples: 25268174. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:45:20,069][231894] Avg episode reward: [(0, '201.119')] [2023-03-07 16:45:20,206][232226] Updated weights for policy 0, policy_version 24700 (0.0006) [2023-03-07 16:45:20,973][232226] Updated weights for policy 0, policy_version 24710 (0.0007) [2023-03-07 16:45:21,788][232226] Updated weights for policy 0, policy_version 24720 (0.0005) [2023-03-07 16:45:22,573][232226] Updated weights for policy 0, policy_version 24730 (0.0007) [2023-03-07 16:45:23,373][232226] Updated weights for policy 0, policy_version 24740 (0.0007) [2023-03-07 16:45:24,165][232226] Updated weights for policy 0, policy_version 24750 (0.0006) [2023-03-07 16:45:24,956][232226] Updated weights for policy 0, policy_version 24760 (0.0007) [2023-03-07 16:45:25,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12905.9). Total num frames: 25355264. Throughput: 0: 12909.7. Samples: 25345860. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:45:25,069][231894] Avg episode reward: [(0, '194.997')] [2023-03-07 16:45:25,737][232226] Updated weights for policy 0, policy_version 24770 (0.0006) [2023-03-07 16:45:26,545][232226] Updated weights for policy 0, policy_version 24780 (0.0006) [2023-03-07 16:45:27,321][232226] Updated weights for policy 0, policy_version 24790 (0.0006) [2023-03-07 16:45:28,126][232226] Updated weights for policy 0, policy_version 24800 (0.0006) [2023-03-07 16:45:28,943][232226] Updated weights for policy 0, policy_version 24810 (0.0006) [2023-03-07 16:45:29,727][232226] Updated weights for policy 0, policy_version 24820 (0.0006) [2023-03-07 16:45:30,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12919.5, 300 sec: 12905.9). Total num frames: 25419776. Throughput: 0: 12916.6. Samples: 25384730. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:45:30,069][231894] Avg episode reward: [(0, '193.558')] [2023-03-07 16:45:30,526][232226] Updated weights for policy 0, policy_version 24830 (0.0006) [2023-03-07 16:45:31,342][232226] Updated weights for policy 0, policy_version 24840 (0.0008) [2023-03-07 16:45:32,118][232226] Updated weights for policy 0, policy_version 24850 (0.0006) [2023-03-07 16:45:32,917][232226] Updated weights for policy 0, policy_version 24860 (0.0006) [2023-03-07 16:45:33,726][232226] Updated weights for policy 0, policy_version 24870 (0.0006) [2023-03-07 16:45:34,507][232226] Updated weights for policy 0, policy_version 24880 (0.0006) [2023-03-07 16:45:35,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12905.9). Total num frames: 25484288. Throughput: 0: 12902.8. Samples: 25461730. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:45:35,069][231894] Avg episode reward: [(0, '185.592')] [2023-03-07 16:45:35,295][232226] Updated weights for policy 0, policy_version 24890 (0.0006) [2023-03-07 16:45:36,080][232226] Updated weights for policy 0, policy_version 24900 (0.0007) [2023-03-07 16:45:36,870][232226] Updated weights for policy 0, policy_version 24910 (0.0006) [2023-03-07 16:45:37,653][232226] Updated weights for policy 0, policy_version 24920 (0.0007) [2023-03-07 16:45:38,446][232226] Updated weights for policy 0, policy_version 24930 (0.0006) [2023-03-07 16:45:39,272][232226] Updated weights for policy 0, policy_version 24940 (0.0006) [2023-03-07 16:45:40,042][232226] Updated weights for policy 0, policy_version 24950 (0.0006) [2023-03-07 16:45:40,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12919.5, 300 sec: 12909.3). Total num frames: 25548800. Throughput: 0: 12902.5. Samples: 25539224. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:45:40,069][231894] Avg episode reward: [(0, '195.974')] [2023-03-07 16:45:40,855][232226] Updated weights for policy 0, policy_version 24960 (0.0006) [2023-03-07 16:45:41,656][232226] Updated weights for policy 0, policy_version 24970 (0.0006) [2023-03-07 16:45:42,449][232226] Updated weights for policy 0, policy_version 24980 (0.0006) [2023-03-07 16:45:43,240][232226] Updated weights for policy 0, policy_version 24990 (0.0006) [2023-03-07 16:45:44,024][232226] Updated weights for policy 0, policy_version 25000 (0.0006) [2023-03-07 16:45:44,819][232226] Updated weights for policy 0, policy_version 25010 (0.0007) [2023-03-07 16:45:45,069][231894] Fps is (10 sec: 12799.8, 60 sec: 12885.3, 300 sec: 12902.4). Total num frames: 25612288. Throughput: 0: 12897.4. Samples: 25577857. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:45:45,069][231894] Avg episode reward: [(0, '188.127')] [2023-03-07 16:45:45,622][232226] Updated weights for policy 0, policy_version 25020 (0.0007) [2023-03-07 16:45:46,427][232226] Updated weights for policy 0, policy_version 25030 (0.0006) [2023-03-07 16:45:47,224][232226] Updated weights for policy 0, policy_version 25040 (0.0006) [2023-03-07 16:45:48,018][232226] Updated weights for policy 0, policy_version 25050 (0.0006) [2023-03-07 16:45:48,810][232226] Updated weights for policy 0, policy_version 25060 (0.0006) [2023-03-07 16:45:49,602][232226] Updated weights for policy 0, policy_version 25070 (0.0007) [2023-03-07 16:45:50,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12919.5, 300 sec: 12905.9). Total num frames: 25677824. Throughput: 0: 12892.5. Samples: 25654925. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:45:50,069][231894] Avg episode reward: [(0, '195.441')] [2023-03-07 16:45:50,382][232226] Updated weights for policy 0, policy_version 25080 (0.0007) [2023-03-07 16:45:51,180][232226] Updated weights for policy 0, policy_version 25090 (0.0006) [2023-03-07 16:45:51,986][232226] Updated weights for policy 0, policy_version 25100 (0.0007) [2023-03-07 16:45:52,768][232226] Updated weights for policy 0, policy_version 25110 (0.0006) [2023-03-07 16:45:53,565][232226] Updated weights for policy 0, policy_version 25120 (0.0006) [2023-03-07 16:45:54,363][232226] Updated weights for policy 0, policy_version 25130 (0.0007) [2023-03-07 16:45:55,069][231894] Fps is (10 sec: 12902.7, 60 sec: 12885.4, 300 sec: 12905.9). Total num frames: 25741312. Throughput: 0: 12901.0. Samples: 25732418. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:45:55,069][231894] Avg episode reward: [(0, '190.407')] [2023-03-07 16:45:55,154][232226] Updated weights for policy 0, policy_version 25140 (0.0006) [2023-03-07 16:45:55,967][232226] Updated weights for policy 0, policy_version 25150 (0.0007) [2023-03-07 16:45:56,748][232226] Updated weights for policy 0, policy_version 25160 (0.0007) [2023-03-07 16:45:57,530][232226] Updated weights for policy 0, policy_version 25170 (0.0006) [2023-03-07 16:45:58,327][232226] Updated weights for policy 0, policy_version 25180 (0.0007) [2023-03-07 16:45:59,114][232226] Updated weights for policy 0, policy_version 25190 (0.0006) [2023-03-07 16:45:59,919][232226] Updated weights for policy 0, policy_version 25200 (0.0006) [2023-03-07 16:46:00,069][231894] Fps is (10 sec: 12902.2, 60 sec: 12902.4, 300 sec: 12909.3). Total num frames: 25806848. Throughput: 0: 12895.8. Samples: 25771139. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 16:46:00,069][231894] Avg episode reward: [(0, '197.125')] [2023-03-07 16:46:00,706][232226] Updated weights for policy 0, policy_version 25210 (0.0006) [2023-03-07 16:46:01,486][232226] Updated weights for policy 0, policy_version 25220 (0.0006) [2023-03-07 16:46:02,289][232226] Updated weights for policy 0, policy_version 25230 (0.0006) [2023-03-07 16:46:03,082][232226] Updated weights for policy 0, policy_version 25240 (0.0006) [2023-03-07 16:46:03,882][232226] Updated weights for policy 0, policy_version 25250 (0.0006) [2023-03-07 16:46:04,665][232226] Updated weights for policy 0, policy_version 25260 (0.0006) [2023-03-07 16:46:05,069][231894] Fps is (10 sec: 13004.6, 60 sec: 12902.4, 300 sec: 12909.3). Total num frames: 25871360. Throughput: 0: 12903.2. Samples: 25848821. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 16:46:05,070][231894] Avg episode reward: [(0, '190.244')] [2023-03-07 16:46:05,456][232226] Updated weights for policy 0, policy_version 25270 (0.0007) [2023-03-07 16:46:06,246][232226] Updated weights for policy 0, policy_version 25280 (0.0006) [2023-03-07 16:46:07,054][232226] Updated weights for policy 0, policy_version 25290 (0.0006) [2023-03-07 16:46:07,853][232226] Updated weights for policy 0, policy_version 25300 (0.0006) [2023-03-07 16:46:08,634][232226] Updated weights for policy 0, policy_version 25310 (0.0005) [2023-03-07 16:46:09,437][232226] Updated weights for policy 0, policy_version 25320 (0.0006) [2023-03-07 16:46:10,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12909.3). Total num frames: 25935872. Throughput: 0: 12895.5. Samples: 25926157. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 16:46:10,069][231894] Avg episode reward: [(0, '194.632')] [2023-03-07 16:46:10,214][232226] Updated weights for policy 0, policy_version 25330 (0.0006) [2023-03-07 16:46:11,030][232226] Updated weights for policy 0, policy_version 25340 (0.0006) [2023-03-07 16:46:11,828][232226] Updated weights for policy 0, policy_version 25350 (0.0007) [2023-03-07 16:46:12,611][232226] Updated weights for policy 0, policy_version 25360 (0.0006) [2023-03-07 16:46:13,414][232226] Updated weights for policy 0, policy_version 25370 (0.0006) [2023-03-07 16:46:14,198][232226] Updated weights for policy 0, policy_version 25380 (0.0006) [2023-03-07 16:46:14,985][232226] Updated weights for policy 0, policy_version 25390 (0.0006) [2023-03-07 16:46:15,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12885.3, 300 sec: 12905.9). Total num frames: 25999360. Throughput: 0: 12887.2. Samples: 25964655. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:46:15,080][231894] Avg episode reward: [(0, '194.794')] [2023-03-07 16:46:15,791][232226] Updated weights for policy 0, policy_version 25400 (0.0006) [2023-03-07 16:46:16,587][232226] Updated weights for policy 0, policy_version 25410 (0.0006) [2023-03-07 16:46:17,378][232226] Updated weights for policy 0, policy_version 25420 (0.0008) [2023-03-07 16:46:18,178][232226] Updated weights for policy 0, policy_version 25430 (0.0007) [2023-03-07 16:46:18,977][232226] Updated weights for policy 0, policy_version 25440 (0.0006) [2023-03-07 16:46:19,767][232226] Updated weights for policy 0, policy_version 25450 (0.0006) [2023-03-07 16:46:20,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12885.3, 300 sec: 12905.9). Total num frames: 26063872. Throughput: 0: 12893.3. Samples: 26041927. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:46:20,069][231894] Avg episode reward: [(0, '190.993')] [2023-03-07 16:46:20,551][232226] Updated weights for policy 0, policy_version 25460 (0.0006) [2023-03-07 16:46:21,346][232226] Updated weights for policy 0, policy_version 25470 (0.0007) [2023-03-07 16:46:22,149][232226] Updated weights for policy 0, policy_version 25480 (0.0007) [2023-03-07 16:46:22,927][232226] Updated weights for policy 0, policy_version 25490 (0.0006) [2023-03-07 16:46:23,710][232226] Updated weights for policy 0, policy_version 25500 (0.0006) [2023-03-07 16:46:24,514][232226] Updated weights for policy 0, policy_version 25510 (0.0006) [2023-03-07 16:46:25,069][231894] Fps is (10 sec: 13004.9, 60 sec: 12902.4, 300 sec: 12909.3). Total num frames: 26129408. Throughput: 0: 12896.8. Samples: 26119578. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:46:25,069][231894] Avg episode reward: [(0, '193.253')] [2023-03-07 16:46:25,074][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000025517_26129408.pth... [2023-03-07 16:46:25,104][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000022491_23030784.pth [2023-03-07 16:46:25,292][232226] Updated weights for policy 0, policy_version 25520 (0.0006) [2023-03-07 16:46:26,086][232226] Updated weights for policy 0, policy_version 25530 (0.0006) [2023-03-07 16:46:26,878][232226] Updated weights for policy 0, policy_version 25540 (0.0006) [2023-03-07 16:46:27,676][232226] Updated weights for policy 0, policy_version 25550 (0.0006) [2023-03-07 16:46:28,486][232226] Updated weights for policy 0, policy_version 25560 (0.0007) [2023-03-07 16:46:29,265][232226] Updated weights for policy 0, policy_version 25570 (0.0007) [2023-03-07 16:46:30,050][232226] Updated weights for policy 0, policy_version 25580 (0.0006) [2023-03-07 16:46:30,069][231894] Fps is (10 sec: 13004.8, 60 sec: 12902.4, 300 sec: 12909.3). Total num frames: 26193920. Throughput: 0: 12900.7. Samples: 26158385. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:46:30,069][231894] Avg episode reward: [(0, '193.442')] [2023-03-07 16:46:30,826][232226] Updated weights for policy 0, policy_version 25590 (0.0006) [2023-03-07 16:46:31,633][232226] Updated weights for policy 0, policy_version 25600 (0.0006) [2023-03-07 16:46:32,410][232226] Updated weights for policy 0, policy_version 25610 (0.0006) [2023-03-07 16:46:33,185][232226] Updated weights for policy 0, policy_version 25620 (0.0007) [2023-03-07 16:46:33,983][232226] Updated weights for policy 0, policy_version 25630 (0.0006) [2023-03-07 16:46:34,757][232226] Updated weights for policy 0, policy_version 25640 (0.0006) [2023-03-07 16:46:35,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12909.3). Total num frames: 26258432. Throughput: 0: 12923.0. Samples: 26236464. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:46:35,070][231894] Avg episode reward: [(0, '189.061')] [2023-03-07 16:46:35,555][232226] Updated weights for policy 0, policy_version 25650 (0.0007) [2023-03-07 16:46:36,342][232226] Updated weights for policy 0, policy_version 25660 (0.0006) [2023-03-07 16:46:37,146][232226] Updated weights for policy 0, policy_version 25670 (0.0006) [2023-03-07 16:46:37,921][232226] Updated weights for policy 0, policy_version 25680 (0.0007) [2023-03-07 16:46:38,706][232226] Updated weights for policy 0, policy_version 25690 (0.0006) [2023-03-07 16:46:39,498][232226] Updated weights for policy 0, policy_version 25700 (0.0006) [2023-03-07 16:46:40,069][231894] Fps is (10 sec: 13004.8, 60 sec: 12919.5, 300 sec: 12912.8). Total num frames: 26323968. Throughput: 0: 12933.4. Samples: 26314423. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:46:40,069][231894] Avg episode reward: [(0, '195.880')] [2023-03-07 16:46:40,272][232226] Updated weights for policy 0, policy_version 25710 (0.0006) [2023-03-07 16:46:41,078][232226] Updated weights for policy 0, policy_version 25720 (0.0006) [2023-03-07 16:46:41,866][232226] Updated weights for policy 0, policy_version 25730 (0.0006) [2023-03-07 16:46:42,647][232226] Updated weights for policy 0, policy_version 25740 (0.0006) [2023-03-07 16:46:43,450][232226] Updated weights for policy 0, policy_version 25750 (0.0006) [2023-03-07 16:46:44,249][232226] Updated weights for policy 0, policy_version 25760 (0.0007) [2023-03-07 16:46:45,041][232226] Updated weights for policy 0, policy_version 25770 (0.0006) [2023-03-07 16:46:45,069][231894] Fps is (10 sec: 13004.9, 60 sec: 12936.6, 300 sec: 12912.8). Total num frames: 26388480. Throughput: 0: 12935.2. Samples: 26353222. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:46:45,069][231894] Avg episode reward: [(0, '193.528')] [2023-03-07 16:46:45,837][232226] Updated weights for policy 0, policy_version 25780 (0.0006) [2023-03-07 16:46:46,628][232226] Updated weights for policy 0, policy_version 25790 (0.0006) [2023-03-07 16:46:47,434][232226] Updated weights for policy 0, policy_version 25800 (0.0007) [2023-03-07 16:46:48,215][232226] Updated weights for policy 0, policy_version 25810 (0.0007) [2023-03-07 16:46:49,008][232226] Updated weights for policy 0, policy_version 25820 (0.0006) [2023-03-07 16:46:49,811][232226] Updated weights for policy 0, policy_version 25830 (0.0007) [2023-03-07 16:46:50,069][231894] Fps is (10 sec: 12902.2, 60 sec: 12919.4, 300 sec: 12912.8). Total num frames: 26452992. Throughput: 0: 12930.1. Samples: 26430677. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:46:50,069][231894] Avg episode reward: [(0, '192.051')] [2023-03-07 16:46:50,606][232226] Updated weights for policy 0, policy_version 25840 (0.0006) [2023-03-07 16:46:51,398][232226] Updated weights for policy 0, policy_version 25850 (0.0006) [2023-03-07 16:46:52,167][232226] Updated weights for policy 0, policy_version 25860 (0.0007) [2023-03-07 16:46:52,978][232226] Updated weights for policy 0, policy_version 25870 (0.0006) [2023-03-07 16:46:53,777][232226] Updated weights for policy 0, policy_version 25880 (0.0006) [2023-03-07 16:46:54,570][232226] Updated weights for policy 0, policy_version 25890 (0.0006) [2023-03-07 16:46:55,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12936.5, 300 sec: 12912.8). Total num frames: 26517504. Throughput: 0: 12933.0. Samples: 26508143. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:46:55,069][231894] Avg episode reward: [(0, '186.412')] [2023-03-07 16:46:55,342][232226] Updated weights for policy 0, policy_version 25900 (0.0006) [2023-03-07 16:46:56,149][232226] Updated weights for policy 0, policy_version 25910 (0.0005) [2023-03-07 16:46:56,933][232226] Updated weights for policy 0, policy_version 25920 (0.0006) [2023-03-07 16:46:57,725][232226] Updated weights for policy 0, policy_version 25930 (0.0006) [2023-03-07 16:46:58,542][232226] Updated weights for policy 0, policy_version 25940 (0.0007) [2023-03-07 16:46:59,334][232226] Updated weights for policy 0, policy_version 25950 (0.0006) [2023-03-07 16:47:00,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12919.5, 300 sec: 12912.8). Total num frames: 26582016. Throughput: 0: 12938.0. Samples: 26546865. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:47:00,069][231894] Avg episode reward: [(0, '188.660')] [2023-03-07 16:47:00,142][232226] Updated weights for policy 0, policy_version 25960 (0.0006) [2023-03-07 16:47:00,909][232226] Updated weights for policy 0, policy_version 25970 (0.0006) [2023-03-07 16:47:01,735][232226] Updated weights for policy 0, policy_version 25980 (0.0006) [2023-03-07 16:47:02,508][232226] Updated weights for policy 0, policy_version 25990 (0.0006) [2023-03-07 16:47:03,297][232226] Updated weights for policy 0, policy_version 26000 (0.0006) [2023-03-07 16:47:04,103][232226] Updated weights for policy 0, policy_version 26010 (0.0006) [2023-03-07 16:47:04,891][232226] Updated weights for policy 0, policy_version 26020 (0.0007) [2023-03-07 16:47:05,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12919.5, 300 sec: 12912.8). Total num frames: 26646528. Throughput: 0: 12937.1. Samples: 26624098. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:47:05,069][231894] Avg episode reward: [(0, '191.180')] [2023-03-07 16:47:05,675][232226] Updated weights for policy 0, policy_version 26030 (0.0007) [2023-03-07 16:47:06,489][232226] Updated weights for policy 0, policy_version 26040 (0.0006) [2023-03-07 16:47:07,272][232226] Updated weights for policy 0, policy_version 26050 (0.0007) [2023-03-07 16:47:08,052][232226] Updated weights for policy 0, policy_version 26060 (0.0006) [2023-03-07 16:47:08,860][232226] Updated weights for policy 0, policy_version 26070 (0.0006) [2023-03-07 16:47:09,642][232226] Updated weights for policy 0, policy_version 26080 (0.0006) [2023-03-07 16:47:10,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12919.5, 300 sec: 12912.8). Total num frames: 26711040. Throughput: 0: 12934.8. Samples: 26701643. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:47:10,069][231894] Avg episode reward: [(0, '185.064')] [2023-03-07 16:47:10,419][232226] Updated weights for policy 0, policy_version 26090 (0.0006) [2023-03-07 16:47:11,253][232226] Updated weights for policy 0, policy_version 26100 (0.0007) [2023-03-07 16:47:12,041][232226] Updated weights for policy 0, policy_version 26110 (0.0007) [2023-03-07 16:47:12,826][232226] Updated weights for policy 0, policy_version 26120 (0.0006) [2023-03-07 16:47:13,641][232226] Updated weights for policy 0, policy_version 26130 (0.0007) [2023-03-07 16:47:14,428][232226] Updated weights for policy 0, policy_version 26140 (0.0008) [2023-03-07 16:47:15,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12919.5, 300 sec: 12909.3). Total num frames: 26774528. Throughput: 0: 12927.8. Samples: 26740135. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:47:15,069][231894] Avg episode reward: [(0, '198.486')] [2023-03-07 16:47:15,229][232226] Updated weights for policy 0, policy_version 26150 (0.0006) [2023-03-07 16:47:16,022][232226] Updated weights for policy 0, policy_version 26160 (0.0006) [2023-03-07 16:47:16,812][232226] Updated weights for policy 0, policy_version 26170 (0.0006) [2023-03-07 16:47:17,607][232226] Updated weights for policy 0, policy_version 26180 (0.0006) [2023-03-07 16:47:18,388][232226] Updated weights for policy 0, policy_version 26190 (0.0006) [2023-03-07 16:47:19,185][232226] Updated weights for policy 0, policy_version 26200 (0.0006) [2023-03-07 16:47:19,977][232226] Updated weights for policy 0, policy_version 26210 (0.0006) [2023-03-07 16:47:20,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12936.5, 300 sec: 12912.8). Total num frames: 26840064. Throughput: 0: 12914.5. Samples: 26817616. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:47:20,069][231894] Avg episode reward: [(0, '189.107')] [2023-03-07 16:47:20,762][232226] Updated weights for policy 0, policy_version 26220 (0.0007) [2023-03-07 16:47:21,560][232226] Updated weights for policy 0, policy_version 26230 (0.0006) [2023-03-07 16:47:22,355][232226] Updated weights for policy 0, policy_version 26240 (0.0007) [2023-03-07 16:47:23,143][232226] Updated weights for policy 0, policy_version 26250 (0.0007) [2023-03-07 16:47:23,950][232226] Updated weights for policy 0, policy_version 26260 (0.0006) [2023-03-07 16:47:24,734][232226] Updated weights for policy 0, policy_version 26270 (0.0006) [2023-03-07 16:47:25,069][231894] Fps is (10 sec: 13004.7, 60 sec: 12919.5, 300 sec: 12909.3). Total num frames: 26904576. Throughput: 0: 12900.1. Samples: 26894929. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:47:25,070][231894] Avg episode reward: [(0, '189.149')] [2023-03-07 16:47:25,536][232226] Updated weights for policy 0, policy_version 26280 (0.0007) [2023-03-07 16:47:26,328][232226] Updated weights for policy 0, policy_version 26290 (0.0006) [2023-03-07 16:47:27,109][232226] Updated weights for policy 0, policy_version 26300 (0.0006) [2023-03-07 16:47:27,930][232226] Updated weights for policy 0, policy_version 26310 (0.0006) [2023-03-07 16:47:28,714][232226] Updated weights for policy 0, policy_version 26320 (0.0007) [2023-03-07 16:47:29,533][232226] Updated weights for policy 0, policy_version 26330 (0.0007) [2023-03-07 16:47:30,069][231894] Fps is (10 sec: 12799.8, 60 sec: 12902.4, 300 sec: 12905.9). Total num frames: 26968064. Throughput: 0: 12898.5. Samples: 26933657. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:47:30,070][231894] Avg episode reward: [(0, '192.387')] [2023-03-07 16:47:30,328][232226] Updated weights for policy 0, policy_version 26340 (0.0006) [2023-03-07 16:47:31,138][232226] Updated weights for policy 0, policy_version 26350 (0.0006) [2023-03-07 16:47:31,914][232226] Updated weights for policy 0, policy_version 26360 (0.0006) [2023-03-07 16:47:32,704][232226] Updated weights for policy 0, policy_version 26370 (0.0007) [2023-03-07 16:47:33,508][232226] Updated weights for policy 0, policy_version 26380 (0.0006) [2023-03-07 16:47:34,293][232226] Updated weights for policy 0, policy_version 26390 (0.0006) [2023-03-07 16:47:35,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12902.4, 300 sec: 12909.3). Total num frames: 27032576. Throughput: 0: 12886.7. Samples: 27010579. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:47:35,080][231894] Avg episode reward: [(0, '197.700')] [2023-03-07 16:47:35,088][232226] Updated weights for policy 0, policy_version 26400 (0.0006) [2023-03-07 16:47:35,877][232226] Updated weights for policy 0, policy_version 26410 (0.0007) [2023-03-07 16:47:36,672][232226] Updated weights for policy 0, policy_version 26420 (0.0006) [2023-03-07 16:47:37,472][232226] Updated weights for policy 0, policy_version 26430 (0.0006) [2023-03-07 16:47:38,277][232226] Updated weights for policy 0, policy_version 26440 (0.0006) [2023-03-07 16:47:39,057][232226] Updated weights for policy 0, policy_version 26450 (0.0007) [2023-03-07 16:47:39,843][232226] Updated weights for policy 0, policy_version 26460 (0.0007) [2023-03-07 16:47:40,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12905.9). Total num frames: 27097088. Throughput: 0: 12887.4. Samples: 27088078. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:47:40,080][231894] Avg episode reward: [(0, '187.148')] [2023-03-07 16:47:40,644][232226] Updated weights for policy 0, policy_version 26470 (0.0006) [2023-03-07 16:47:41,434][232226] Updated weights for policy 0, policy_version 26480 (0.0007) [2023-03-07 16:47:42,217][232226] Updated weights for policy 0, policy_version 26490 (0.0006) [2023-03-07 16:47:43,018][232226] Updated weights for policy 0, policy_version 26500 (0.0006) [2023-03-07 16:47:43,821][232226] Updated weights for policy 0, policy_version 26510 (0.0006) [2023-03-07 16:47:44,602][232226] Updated weights for policy 0, policy_version 26520 (0.0007) [2023-03-07 16:47:45,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12905.9). Total num frames: 27161600. Throughput: 0: 12889.0. Samples: 27126869. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:47:45,080][231894] Avg episode reward: [(0, '188.563')] [2023-03-07 16:47:45,398][232226] Updated weights for policy 0, policy_version 26530 (0.0007) [2023-03-07 16:47:46,193][232226] Updated weights for policy 0, policy_version 26540 (0.0006) [2023-03-07 16:47:46,978][232226] Updated weights for policy 0, policy_version 26550 (0.0006) [2023-03-07 16:47:47,755][232226] Updated weights for policy 0, policy_version 26560 (0.0007) [2023-03-07 16:47:48,539][232226] Updated weights for policy 0, policy_version 26570 (0.0007) [2023-03-07 16:47:49,351][232226] Updated weights for policy 0, policy_version 26580 (0.0007) [2023-03-07 16:47:50,069][231894] Fps is (10 sec: 13004.8, 60 sec: 12902.4, 300 sec: 12912.8). Total num frames: 27227136. Throughput: 0: 12900.5. Samples: 27204621. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:47:50,070][231894] Avg episode reward: [(0, '195.614')] [2023-03-07 16:47:50,141][232226] Updated weights for policy 0, policy_version 26590 (0.0006) [2023-03-07 16:47:50,937][232226] Updated weights for policy 0, policy_version 26600 (0.0006) [2023-03-07 16:47:51,730][232226] Updated weights for policy 0, policy_version 26610 (0.0006) [2023-03-07 16:47:52,516][232226] Updated weights for policy 0, policy_version 26620 (0.0006) [2023-03-07 16:47:53,301][232226] Updated weights for policy 0, policy_version 26630 (0.0007) [2023-03-07 16:47:54,090][232226] Updated weights for policy 0, policy_version 26640 (0.0007) [2023-03-07 16:47:54,884][232226] Updated weights for policy 0, policy_version 26650 (0.0007) [2023-03-07 16:47:55,069][231894] Fps is (10 sec: 13004.8, 60 sec: 12902.4, 300 sec: 12909.3). Total num frames: 27291648. Throughput: 0: 12898.4. Samples: 27282068. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:47:55,069][231894] Avg episode reward: [(0, '184.836')] [2023-03-07 16:47:55,696][232226] Updated weights for policy 0, policy_version 26660 (0.0006) [2023-03-07 16:47:56,483][232226] Updated weights for policy 0, policy_version 26670 (0.0008) [2023-03-07 16:47:57,296][232226] Updated weights for policy 0, policy_version 26680 (0.0006) [2023-03-07 16:47:58,086][232226] Updated weights for policy 0, policy_version 26690 (0.0006) [2023-03-07 16:47:58,866][232226] Updated weights for policy 0, policy_version 26700 (0.0006) [2023-03-07 16:47:59,658][232226] Updated weights for policy 0, policy_version 26710 (0.0005) [2023-03-07 16:48:00,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12909.3). Total num frames: 27356160. Throughput: 0: 12898.1. Samples: 27320548. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:48:00,070][231894] Avg episode reward: [(0, '193.598')] [2023-03-07 16:48:00,457][232226] Updated weights for policy 0, policy_version 26720 (0.0006) [2023-03-07 16:48:01,250][232226] Updated weights for policy 0, policy_version 26730 (0.0006) [2023-03-07 16:48:02,033][232226] Updated weights for policy 0, policy_version 26740 (0.0006) [2023-03-07 16:48:02,831][232226] Updated weights for policy 0, policy_version 26750 (0.0006) [2023-03-07 16:48:03,613][232226] Updated weights for policy 0, policy_version 26760 (0.0006) [2023-03-07 16:48:04,406][232226] Updated weights for policy 0, policy_version 26770 (0.0006) [2023-03-07 16:48:05,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12909.3). Total num frames: 27420672. Throughput: 0: 12900.8. Samples: 27398153. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:48:05,069][231894] Avg episode reward: [(0, '191.142')] [2023-03-07 16:48:05,222][232226] Updated weights for policy 0, policy_version 26780 (0.0006) [2023-03-07 16:48:06,013][232226] Updated weights for policy 0, policy_version 26790 (0.0006) [2023-03-07 16:48:06,821][232226] Updated weights for policy 0, policy_version 26800 (0.0007) [2023-03-07 16:48:07,611][232226] Updated weights for policy 0, policy_version 26810 (0.0007) [2023-03-07 16:48:08,411][232226] Updated weights for policy 0, policy_version 26820 (0.0006) [2023-03-07 16:48:09,201][232226] Updated weights for policy 0, policy_version 26830 (0.0007) [2023-03-07 16:48:09,979][232226] Updated weights for policy 0, policy_version 26840 (0.0007) [2023-03-07 16:48:10,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12909.3). Total num frames: 27485184. Throughput: 0: 12899.5. Samples: 27475407. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:48:10,069][231894] Avg episode reward: [(0, '185.594')] [2023-03-07 16:48:10,773][232226] Updated weights for policy 0, policy_version 26850 (0.0006) [2023-03-07 16:48:11,561][232226] Updated weights for policy 0, policy_version 26860 (0.0006) [2023-03-07 16:48:12,362][232226] Updated weights for policy 0, policy_version 26870 (0.0007) [2023-03-07 16:48:13,154][232226] Updated weights for policy 0, policy_version 26880 (0.0006) [2023-03-07 16:48:13,941][232226] Updated weights for policy 0, policy_version 26890 (0.0006) [2023-03-07 16:48:14,755][232226] Updated weights for policy 0, policy_version 26900 (0.0006) [2023-03-07 16:48:15,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12902.4, 300 sec: 12905.9). Total num frames: 27548672. Throughput: 0: 12902.9. Samples: 27514285. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:48:15,069][231894] Avg episode reward: [(0, '191.940')] [2023-03-07 16:48:15,546][232226] Updated weights for policy 0, policy_version 26910 (0.0006) [2023-03-07 16:48:16,336][232226] Updated weights for policy 0, policy_version 26920 (0.0007) [2023-03-07 16:48:17,132][232226] Updated weights for policy 0, policy_version 26930 (0.0006) [2023-03-07 16:48:17,926][232226] Updated weights for policy 0, policy_version 26940 (0.0007) [2023-03-07 16:48:18,725][232226] Updated weights for policy 0, policy_version 26950 (0.0007) [2023-03-07 16:48:19,525][232226] Updated weights for policy 0, policy_version 26960 (0.0006) [2023-03-07 16:48:20,069][231894] Fps is (10 sec: 12800.2, 60 sec: 12885.3, 300 sec: 12905.9). Total num frames: 27613184. Throughput: 0: 12908.9. Samples: 27591477. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:48:20,069][231894] Avg episode reward: [(0, '191.609')] [2023-03-07 16:48:20,328][232226] Updated weights for policy 0, policy_version 26970 (0.0006) [2023-03-07 16:48:21,129][232226] Updated weights for policy 0, policy_version 26980 (0.0006) [2023-03-07 16:48:21,912][232226] Updated weights for policy 0, policy_version 26990 (0.0006) [2023-03-07 16:48:22,716][232226] Updated weights for policy 0, policy_version 27000 (0.0006) [2023-03-07 16:48:23,492][232226] Updated weights for policy 0, policy_version 27010 (0.0007) [2023-03-07 16:48:24,290][232226] Updated weights for policy 0, policy_version 27020 (0.0007) [2023-03-07 16:48:25,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12905.9). Total num frames: 27677696. Throughput: 0: 12907.1. Samples: 27668894. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:48:25,069][231894] Avg episode reward: [(0, '186.482')] [2023-03-07 16:48:25,076][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000027030_27678720.pth... [2023-03-07 16:48:25,078][232226] Updated weights for policy 0, policy_version 27030 (0.0006) [2023-03-07 16:48:25,107][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000024005_24581120.pth [2023-03-07 16:48:25,868][232226] Updated weights for policy 0, policy_version 27040 (0.0006) [2023-03-07 16:48:26,664][232226] Updated weights for policy 0, policy_version 27050 (0.0006) [2023-03-07 16:48:27,454][232226] Updated weights for policy 0, policy_version 27060 (0.0006) [2023-03-07 16:48:28,248][232226] Updated weights for policy 0, policy_version 27070 (0.0007) [2023-03-07 16:48:29,042][232226] Updated weights for policy 0, policy_version 27080 (0.0006) [2023-03-07 16:48:29,832][232226] Updated weights for policy 0, policy_version 27090 (0.0006) [2023-03-07 16:48:30,069][231894] Fps is (10 sec: 13004.8, 60 sec: 12919.5, 300 sec: 12909.3). Total num frames: 27743232. Throughput: 0: 12904.7. Samples: 27707580. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:48:30,069][231894] Avg episode reward: [(0, '189.801')] [2023-03-07 16:48:30,605][232226] Updated weights for policy 0, policy_version 27100 (0.0007) [2023-03-07 16:48:31,398][232226] Updated weights for policy 0, policy_version 27110 (0.0006) [2023-03-07 16:48:32,206][232226] Updated weights for policy 0, policy_version 27120 (0.0006) [2023-03-07 16:48:32,993][232226] Updated weights for policy 0, policy_version 27130 (0.0006) [2023-03-07 16:48:33,795][232226] Updated weights for policy 0, policy_version 27140 (0.0006) [2023-03-07 16:48:34,589][232226] Updated weights for policy 0, policy_version 27150 (0.0006) [2023-03-07 16:48:35,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 27806720. Throughput: 0: 12902.8. Samples: 27785248. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:48:35,070][231894] Avg episode reward: [(0, '193.682')] [2023-03-07 16:48:35,390][232226] Updated weights for policy 0, policy_version 27160 (0.0006) [2023-03-07 16:48:36,163][232226] Updated weights for policy 0, policy_version 27170 (0.0006) [2023-03-07 16:48:36,962][232226] Updated weights for policy 0, policy_version 27180 (0.0006) [2023-03-07 16:48:37,745][232226] Updated weights for policy 0, policy_version 27190 (0.0006) [2023-03-07 16:48:38,541][232226] Updated weights for policy 0, policy_version 27200 (0.0006) [2023-03-07 16:48:39,335][232226] Updated weights for policy 0, policy_version 27210 (0.0007) [2023-03-07 16:48:40,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 27871232. Throughput: 0: 12901.1. Samples: 27862619. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:48:40,069][231894] Avg episode reward: [(0, '186.119')] [2023-03-07 16:48:40,156][232226] Updated weights for policy 0, policy_version 27220 (0.0007) [2023-03-07 16:48:40,920][232226] Updated weights for policy 0, policy_version 27230 (0.0006) [2023-03-07 16:48:41,705][232226] Updated weights for policy 0, policy_version 27240 (0.0006) [2023-03-07 16:48:42,504][232226] Updated weights for policy 0, policy_version 27250 (0.0006) [2023-03-07 16:48:43,281][232226] Updated weights for policy 0, policy_version 27260 (0.0007) [2023-03-07 16:48:44,083][232226] Updated weights for policy 0, policy_version 27270 (0.0007) [2023-03-07 16:48:44,877][232226] Updated weights for policy 0, policy_version 27280 (0.0006) [2023-03-07 16:48:45,069][231894] Fps is (10 sec: 13004.8, 60 sec: 12919.5, 300 sec: 12905.9). Total num frames: 27936768. Throughput: 0: 12907.0. Samples: 27901362. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:48:45,069][231894] Avg episode reward: [(0, '187.338')] [2023-03-07 16:48:45,669][232226] Updated weights for policy 0, policy_version 27290 (0.0007) [2023-03-07 16:48:46,461][232226] Updated weights for policy 0, policy_version 27300 (0.0007) [2023-03-07 16:48:47,262][232226] Updated weights for policy 0, policy_version 27310 (0.0006) [2023-03-07 16:48:48,054][232226] Updated weights for policy 0, policy_version 27320 (0.0006) [2023-03-07 16:48:48,863][232226] Updated weights for policy 0, policy_version 27330 (0.0007) [2023-03-07 16:48:49,646][232226] Updated weights for policy 0, policy_version 27340 (0.0006) [2023-03-07 16:48:50,069][231894] Fps is (10 sec: 13004.7, 60 sec: 12902.4, 300 sec: 12905.9). Total num frames: 28001280. Throughput: 0: 12907.9. Samples: 27979007. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:48:50,070][231894] Avg episode reward: [(0, '192.389')] [2023-03-07 16:48:50,430][232226] Updated weights for policy 0, policy_version 27350 (0.0007) [2023-03-07 16:48:51,214][232226] Updated weights for policy 0, policy_version 27360 (0.0007) [2023-03-07 16:48:52,003][232226] Updated weights for policy 0, policy_version 27370 (0.0006) [2023-03-07 16:48:52,806][232226] Updated weights for policy 0, policy_version 27380 (0.0006) [2023-03-07 16:48:53,605][232226] Updated weights for policy 0, policy_version 27390 (0.0007) [2023-03-07 16:48:54,394][232226] Updated weights for policy 0, policy_version 27400 (0.0006) [2023-03-07 16:48:55,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12905.9). Total num frames: 28065792. Throughput: 0: 12916.5. Samples: 28056649. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:48:55,070][231894] Avg episode reward: [(0, '197.018')] [2023-03-07 16:48:55,208][232226] Updated weights for policy 0, policy_version 27410 (0.0006) [2023-03-07 16:48:55,983][232226] Updated weights for policy 0, policy_version 27420 (0.0006) [2023-03-07 16:48:56,785][232226] Updated weights for policy 0, policy_version 27430 (0.0006) [2023-03-07 16:48:57,569][232226] Updated weights for policy 0, policy_version 27440 (0.0006) [2023-03-07 16:48:58,364][232226] Updated weights for policy 0, policy_version 27450 (0.0007) [2023-03-07 16:48:59,149][232226] Updated weights for policy 0, policy_version 27460 (0.0007) [2023-03-07 16:48:59,946][232226] Updated weights for policy 0, policy_version 27470 (0.0006) [2023-03-07 16:49:00,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12905.9). Total num frames: 28130304. Throughput: 0: 12911.6. Samples: 28095306. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:49:00,069][231894] Avg episode reward: [(0, '192.412')] [2023-03-07 16:49:00,749][232226] Updated weights for policy 0, policy_version 27480 (0.0006) [2023-03-07 16:49:01,536][232226] Updated weights for policy 0, policy_version 27490 (0.0007) [2023-03-07 16:49:02,337][232226] Updated weights for policy 0, policy_version 27500 (0.0007) [2023-03-07 16:49:03,134][232226] Updated weights for policy 0, policy_version 27510 (0.0006) [2023-03-07 16:49:03,913][232226] Updated weights for policy 0, policy_version 27520 (0.0007) [2023-03-07 16:49:04,725][232226] Updated weights for policy 0, policy_version 27530 (0.0006) [2023-03-07 16:49:05,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12905.9). Total num frames: 28194816. Throughput: 0: 12910.2. Samples: 28172438. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:49:05,070][231894] Avg episode reward: [(0, '196.526')] [2023-03-07 16:49:05,512][232226] Updated weights for policy 0, policy_version 27540 (0.0006) [2023-03-07 16:49:06,306][232226] Updated weights for policy 0, policy_version 27550 (0.0006) [2023-03-07 16:49:07,104][232226] Updated weights for policy 0, policy_version 27560 (0.0007) [2023-03-07 16:49:07,905][232226] Updated weights for policy 0, policy_version 27570 (0.0006) [2023-03-07 16:49:08,705][232226] Updated weights for policy 0, policy_version 27580 (0.0006) [2023-03-07 16:49:09,502][232226] Updated weights for policy 0, policy_version 27590 (0.0007) [2023-03-07 16:49:10,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12905.9). Total num frames: 28259328. Throughput: 0: 12907.2. Samples: 28249718. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:49:10,080][231894] Avg episode reward: [(0, '189.270')] [2023-03-07 16:49:10,302][232226] Updated weights for policy 0, policy_version 27600 (0.0006) [2023-03-07 16:49:11,085][232226] Updated weights for policy 0, policy_version 27610 (0.0006) [2023-03-07 16:49:11,878][232226] Updated weights for policy 0, policy_version 27620 (0.0007) [2023-03-07 16:49:12,674][232226] Updated weights for policy 0, policy_version 27630 (0.0006) [2023-03-07 16:49:13,454][232226] Updated weights for policy 0, policy_version 27640 (0.0006) [2023-03-07 16:49:14,238][232226] Updated weights for policy 0, policy_version 27650 (0.0006) [2023-03-07 16:49:15,043][232226] Updated weights for policy 0, policy_version 27660 (0.0007) [2023-03-07 16:49:15,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12919.5, 300 sec: 12905.9). Total num frames: 28323840. Throughput: 0: 12906.9. Samples: 28288392. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:49:15,080][231894] Avg episode reward: [(0, '188.130')] [2023-03-07 16:49:15,822][232226] Updated weights for policy 0, policy_version 27670 (0.0007) [2023-03-07 16:49:16,622][232226] Updated weights for policy 0, policy_version 27680 (0.0006) [2023-03-07 16:49:17,425][232226] Updated weights for policy 0, policy_version 27690 (0.0006) [2023-03-07 16:49:18,210][232226] Updated weights for policy 0, policy_version 27700 (0.0006) [2023-03-07 16:49:19,023][232226] Updated weights for policy 0, policy_version 27710 (0.0006) [2023-03-07 16:49:19,798][232226] Updated weights for policy 0, policy_version 27720 (0.0006) [2023-03-07 16:49:20,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12919.4, 300 sec: 12905.9). Total num frames: 28388352. Throughput: 0: 12907.1. Samples: 28366069. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:49:20,070][231894] Avg episode reward: [(0, '193.349')] [2023-03-07 16:49:20,593][232226] Updated weights for policy 0, policy_version 27730 (0.0006) [2023-03-07 16:49:21,379][232226] Updated weights for policy 0, policy_version 27740 (0.0006) [2023-03-07 16:49:22,168][232226] Updated weights for policy 0, policy_version 27750 (0.0006) [2023-03-07 16:49:22,966][232226] Updated weights for policy 0, policy_version 27760 (0.0005) [2023-03-07 16:49:23,765][232226] Updated weights for policy 0, policy_version 27770 (0.0006) [2023-03-07 16:49:24,570][232226] Updated weights for policy 0, policy_version 27780 (0.0006) [2023-03-07 16:49:25,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12919.5, 300 sec: 12909.3). Total num frames: 28452864. Throughput: 0: 12908.5. Samples: 28443503. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:49:25,069][231894] Avg episode reward: [(0, '193.595')] [2023-03-07 16:49:25,396][232226] Updated weights for policy 0, policy_version 27790 (0.0007) [2023-03-07 16:49:26,181][232226] Updated weights for policy 0, policy_version 27800 (0.0006) [2023-03-07 16:49:26,974][232226] Updated weights for policy 0, policy_version 27810 (0.0007) [2023-03-07 16:49:27,781][232226] Updated weights for policy 0, policy_version 27820 (0.0006) [2023-03-07 16:49:28,579][232226] Updated weights for policy 0, policy_version 27830 (0.0007) [2023-03-07 16:49:29,370][232226] Updated weights for policy 0, policy_version 27840 (0.0006) [2023-03-07 16:49:30,069][231894] Fps is (10 sec: 12800.2, 60 sec: 12885.3, 300 sec: 12902.4). Total num frames: 28516352. Throughput: 0: 12899.2. Samples: 28481827. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:49:30,069][231894] Avg episode reward: [(0, '195.564')] [2023-03-07 16:49:30,185][232226] Updated weights for policy 0, policy_version 27850 (0.0006) [2023-03-07 16:49:30,977][232226] Updated weights for policy 0, policy_version 27860 (0.0006) [2023-03-07 16:49:31,769][232226] Updated weights for policy 0, policy_version 27870 (0.0006) [2023-03-07 16:49:32,562][232226] Updated weights for policy 0, policy_version 27880 (0.0006) [2023-03-07 16:49:33,360][232226] Updated weights for policy 0, policy_version 27890 (0.0006) [2023-03-07 16:49:34,155][232226] Updated weights for policy 0, policy_version 27900 (0.0007) [2023-03-07 16:49:34,948][232226] Updated weights for policy 0, policy_version 27910 (0.0007) [2023-03-07 16:49:35,069][231894] Fps is (10 sec: 12799.8, 60 sec: 12902.4, 300 sec: 12905.9). Total num frames: 28580864. Throughput: 0: 12882.0. Samples: 28558698. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:49:35,069][231894] Avg episode reward: [(0, '190.786')] [2023-03-07 16:49:35,749][232226] Updated weights for policy 0, policy_version 27920 (0.0006) [2023-03-07 16:49:36,549][232226] Updated weights for policy 0, policy_version 27930 (0.0007) [2023-03-07 16:49:37,337][232226] Updated weights for policy 0, policy_version 27940 (0.0006) [2023-03-07 16:49:38,141][232226] Updated weights for policy 0, policy_version 27950 (0.0006) [2023-03-07 16:49:38,922][232226] Updated weights for policy 0, policy_version 27960 (0.0006) [2023-03-07 16:49:39,739][232226] Updated weights for policy 0, policy_version 27970 (0.0006) [2023-03-07 16:49:40,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 28645376. Throughput: 0: 12872.2. Samples: 28635896. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:49:40,069][231894] Avg episode reward: [(0, '188.254')] [2023-03-07 16:49:40,519][232226] Updated weights for policy 0, policy_version 27980 (0.0006) [2023-03-07 16:49:41,317][232226] Updated weights for policy 0, policy_version 27990 (0.0007) [2023-03-07 16:49:42,102][232226] Updated weights for policy 0, policy_version 28000 (0.0006) [2023-03-07 16:49:42,904][232226] Updated weights for policy 0, policy_version 28010 (0.0007) [2023-03-07 16:49:43,674][232226] Updated weights for policy 0, policy_version 28020 (0.0007) [2023-03-07 16:49:44,498][232226] Updated weights for policy 0, policy_version 28030 (0.0007) [2023-03-07 16:49:45,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12905.9). Total num frames: 28709888. Throughput: 0: 12874.2. Samples: 28674644. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 16:49:45,069][231894] Avg episode reward: [(0, '184.711')] [2023-03-07 16:49:45,275][232226] Updated weights for policy 0, policy_version 28040 (0.0006) [2023-03-07 16:49:46,048][232226] Updated weights for policy 0, policy_version 28050 (0.0006) [2023-03-07 16:49:46,859][232226] Updated weights for policy 0, policy_version 28060 (0.0006) [2023-03-07 16:49:47,648][232226] Updated weights for policy 0, policy_version 28070 (0.0006) [2023-03-07 16:49:48,455][232226] Updated weights for policy 0, policy_version 28080 (0.0006) [2023-03-07 16:49:49,246][232226] Updated weights for policy 0, policy_version 28090 (0.0006) [2023-03-07 16:49:50,048][232226] Updated weights for policy 0, policy_version 28100 (0.0006) [2023-03-07 16:49:50,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12902.4). Total num frames: 28774400. Throughput: 0: 12882.8. Samples: 28752162. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 16:49:50,069][231894] Avg episode reward: [(0, '193.214')] [2023-03-07 16:49:50,846][232226] Updated weights for policy 0, policy_version 28110 (0.0006) [2023-03-07 16:49:51,650][232226] Updated weights for policy 0, policy_version 28120 (0.0007) [2023-03-07 16:49:52,441][232226] Updated weights for policy 0, policy_version 28130 (0.0008) [2023-03-07 16:49:53,232][232226] Updated weights for policy 0, policy_version 28140 (0.0006) [2023-03-07 16:49:54,053][232226] Updated weights for policy 0, policy_version 28150 (0.0006) [2023-03-07 16:49:54,837][232226] Updated weights for policy 0, policy_version 28160 (0.0006) [2023-03-07 16:49:55,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.4, 300 sec: 12902.4). Total num frames: 28838912. Throughput: 0: 12874.6. Samples: 28829077. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 16:49:55,069][231894] Avg episode reward: [(0, '186.136')] [2023-03-07 16:49:55,632][232226] Updated weights for policy 0, policy_version 28170 (0.0006) [2023-03-07 16:49:56,430][232226] Updated weights for policy 0, policy_version 28180 (0.0006) [2023-03-07 16:49:57,201][232226] Updated weights for policy 0, policy_version 28190 (0.0006) [2023-03-07 16:49:57,996][232226] Updated weights for policy 0, policy_version 28200 (0.0006) [2023-03-07 16:49:58,781][232226] Updated weights for policy 0, policy_version 28210 (0.0006) [2023-03-07 16:49:59,573][232226] Updated weights for policy 0, policy_version 28220 (0.0007) [2023-03-07 16:50:00,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12902.4). Total num frames: 28903424. Throughput: 0: 12879.0. Samples: 28867946. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 16:50:00,069][231894] Avg episode reward: [(0, '191.157')] [2023-03-07 16:50:00,373][232226] Updated weights for policy 0, policy_version 28230 (0.0006) [2023-03-07 16:50:01,169][232226] Updated weights for policy 0, policy_version 28240 (0.0008) [2023-03-07 16:50:01,957][232226] Updated weights for policy 0, policy_version 28250 (0.0006) [2023-03-07 16:50:02,755][232226] Updated weights for policy 0, policy_version 28260 (0.0006) [2023-03-07 16:50:03,553][232226] Updated weights for policy 0, policy_version 28270 (0.0006) [2023-03-07 16:50:04,329][232226] Updated weights for policy 0, policy_version 28280 (0.0007) [2023-03-07 16:50:05,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12902.4). Total num frames: 28967936. Throughput: 0: 12873.8. Samples: 28945389. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:50:05,070][231894] Avg episode reward: [(0, '190.096')] [2023-03-07 16:50:05,126][232226] Updated weights for policy 0, policy_version 28290 (0.0005) [2023-03-07 16:50:05,919][232226] Updated weights for policy 0, policy_version 28300 (0.0006) [2023-03-07 16:50:06,705][232226] Updated weights for policy 0, policy_version 28310 (0.0006) [2023-03-07 16:50:07,479][232226] Updated weights for policy 0, policy_version 28320 (0.0006) [2023-03-07 16:50:08,299][232226] Updated weights for policy 0, policy_version 28330 (0.0006) [2023-03-07 16:50:09,071][232226] Updated weights for policy 0, policy_version 28340 (0.0006) [2023-03-07 16:50:09,857][232226] Updated weights for policy 0, policy_version 28350 (0.0006) [2023-03-07 16:50:10,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12902.4). Total num frames: 29032448. Throughput: 0: 12881.8. Samples: 29023185. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:50:10,069][231894] Avg episode reward: [(0, '191.079')] [2023-03-07 16:50:10,676][232226] Updated weights for policy 0, policy_version 28360 (0.0006) [2023-03-07 16:50:11,461][232226] Updated weights for policy 0, policy_version 28370 (0.0006) [2023-03-07 16:50:12,245][232226] Updated weights for policy 0, policy_version 28380 (0.0006) [2023-03-07 16:50:13,030][232226] Updated weights for policy 0, policy_version 28390 (0.0006) [2023-03-07 16:50:13,834][232226] Updated weights for policy 0, policy_version 28400 (0.0006) [2023-03-07 16:50:14,637][232226] Updated weights for policy 0, policy_version 28410 (0.0006) [2023-03-07 16:50:15,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12902.4). Total num frames: 29096960. Throughput: 0: 12891.6. Samples: 29061951. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:50:15,069][231894] Avg episode reward: [(0, '187.204')] [2023-03-07 16:50:15,439][232226] Updated weights for policy 0, policy_version 28420 (0.0006) [2023-03-07 16:50:16,239][232226] Updated weights for policy 0, policy_version 28430 (0.0006) [2023-03-07 16:50:17,042][232226] Updated weights for policy 0, policy_version 28440 (0.0007) [2023-03-07 16:50:17,822][232226] Updated weights for policy 0, policy_version 28450 (0.0007) [2023-03-07 16:50:18,610][232226] Updated weights for policy 0, policy_version 28460 (0.0006) [2023-03-07 16:50:19,381][232226] Updated weights for policy 0, policy_version 28470 (0.0006) [2023-03-07 16:50:20,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.4, 300 sec: 12902.4). Total num frames: 29161472. Throughput: 0: 12899.0. Samples: 29139151. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:50:20,069][231894] Avg episode reward: [(0, '184.518')] [2023-03-07 16:50:20,181][232226] Updated weights for policy 0, policy_version 28480 (0.0007) [2023-03-07 16:50:20,974][232226] Updated weights for policy 0, policy_version 28490 (0.0006) [2023-03-07 16:50:21,770][232226] Updated weights for policy 0, policy_version 28500 (0.0008) [2023-03-07 16:50:22,573][232226] Updated weights for policy 0, policy_version 28510 (0.0006) [2023-03-07 16:50:23,384][232226] Updated weights for policy 0, policy_version 28520 (0.0006) [2023-03-07 16:50:24,160][232226] Updated weights for policy 0, policy_version 28530 (0.0006) [2023-03-07 16:50:24,964][232226] Updated weights for policy 0, policy_version 28540 (0.0007) [2023-03-07 16:50:25,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12902.4). Total num frames: 29225984. Throughput: 0: 12905.0. Samples: 29216623. Policy #0 lag: (min: 0.0, avg: 0.9, max: 2.0) [2023-03-07 16:50:25,070][231894] Avg episode reward: [(0, '195.015')] [2023-03-07 16:50:25,073][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000028541_29225984.pth... [2023-03-07 16:50:25,104][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000025517_26129408.pth [2023-03-07 16:50:25,766][232226] Updated weights for policy 0, policy_version 28550 (0.0007) [2023-03-07 16:50:26,567][232226] Updated weights for policy 0, policy_version 28560 (0.0006) [2023-03-07 16:50:27,346][232226] Updated weights for policy 0, policy_version 28570 (0.0007) [2023-03-07 16:50:28,143][232226] Updated weights for policy 0, policy_version 28580 (0.0006) [2023-03-07 16:50:28,943][232226] Updated weights for policy 0, policy_version 28590 (0.0006) [2023-03-07 16:50:29,726][232226] Updated weights for policy 0, policy_version 28600 (0.0006) [2023-03-07 16:50:30,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 29290496. Throughput: 0: 12900.6. Samples: 29255172. Policy #0 lag: (min: 0.0, avg: 0.9, max: 2.0) [2023-03-07 16:50:30,069][231894] Avg episode reward: [(0, '188.691')] [2023-03-07 16:50:30,537][232226] Updated weights for policy 0, policy_version 28610 (0.0007) [2023-03-07 16:50:31,321][232226] Updated weights for policy 0, policy_version 28620 (0.0006) [2023-03-07 16:50:32,117][232226] Updated weights for policy 0, policy_version 28630 (0.0006) [2023-03-07 16:50:32,894][232226] Updated weights for policy 0, policy_version 28640 (0.0006) [2023-03-07 16:50:33,697][232226] Updated weights for policy 0, policy_version 28650 (0.0006) [2023-03-07 16:50:34,515][232226] Updated weights for policy 0, policy_version 28660 (0.0007) [2023-03-07 16:50:35,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 29355008. Throughput: 0: 12898.0. Samples: 29332571. Policy #0 lag: (min: 0.0, avg: 0.9, max: 2.0) [2023-03-07 16:50:35,080][231894] Avg episode reward: [(0, '192.906')] [2023-03-07 16:50:35,306][232226] Updated weights for policy 0, policy_version 28670 (0.0006) [2023-03-07 16:50:36,078][232226] Updated weights for policy 0, policy_version 28680 (0.0006) [2023-03-07 16:50:36,877][232226] Updated weights for policy 0, policy_version 28690 (0.0006) [2023-03-07 16:50:37,646][232226] Updated weights for policy 0, policy_version 28700 (0.0006) [2023-03-07 16:50:38,458][232226] Updated weights for policy 0, policy_version 28710 (0.0007) [2023-03-07 16:50:39,257][232226] Updated weights for policy 0, policy_version 28720 (0.0006) [2023-03-07 16:50:40,064][232226] Updated weights for policy 0, policy_version 28730 (0.0007) [2023-03-07 16:50:40,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12905.9). Total num frames: 29419520. Throughput: 0: 12910.8. Samples: 29410064. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:50:40,080][231894] Avg episode reward: [(0, '194.743')] [2023-03-07 16:50:40,857][232226] Updated weights for policy 0, policy_version 28740 (0.0007) [2023-03-07 16:50:41,644][232226] Updated weights for policy 0, policy_version 28750 (0.0006) [2023-03-07 16:50:42,437][232226] Updated weights for policy 0, policy_version 28760 (0.0006) [2023-03-07 16:50:43,214][232226] Updated weights for policy 0, policy_version 28770 (0.0006) [2023-03-07 16:50:44,001][232226] Updated weights for policy 0, policy_version 28780 (0.0007) [2023-03-07 16:50:44,817][232226] Updated weights for policy 0, policy_version 28790 (0.0005) [2023-03-07 16:50:45,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 29484032. Throughput: 0: 12904.8. Samples: 29448664. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:50:45,080][231894] Avg episode reward: [(0, '189.166')] [2023-03-07 16:50:45,605][232226] Updated weights for policy 0, policy_version 28800 (0.0007) [2023-03-07 16:50:46,399][232226] Updated weights for policy 0, policy_version 28810 (0.0006) [2023-03-07 16:50:47,206][232226] Updated weights for policy 0, policy_version 28820 (0.0006) [2023-03-07 16:50:47,989][232226] Updated weights for policy 0, policy_version 28830 (0.0006) [2023-03-07 16:50:48,764][232226] Updated weights for policy 0, policy_version 28840 (0.0007) [2023-03-07 16:50:49,593][232226] Updated weights for policy 0, policy_version 28850 (0.0007) [2023-03-07 16:50:50,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12902.4, 300 sec: 12905.9). Total num frames: 29548544. Throughput: 0: 12903.7. Samples: 29526054. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:50:50,069][231894] Avg episode reward: [(0, '190.481')] [2023-03-07 16:50:50,371][232226] Updated weights for policy 0, policy_version 28860 (0.0006) [2023-03-07 16:50:51,160][232226] Updated weights for policy 0, policy_version 28870 (0.0006) [2023-03-07 16:50:51,955][232226] Updated weights for policy 0, policy_version 28880 (0.0006) [2023-03-07 16:50:52,770][232226] Updated weights for policy 0, policy_version 28890 (0.0007) [2023-03-07 16:50:53,553][232226] Updated weights for policy 0, policy_version 28900 (0.0006) [2023-03-07 16:50:54,355][232226] Updated weights for policy 0, policy_version 28910 (0.0007) [2023-03-07 16:50:55,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 29613056. Throughput: 0: 12892.2. Samples: 29603334. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:50:55,070][231894] Avg episode reward: [(0, '196.060')] [2023-03-07 16:50:55,149][232226] Updated weights for policy 0, policy_version 28920 (0.0006) [2023-03-07 16:50:55,946][232226] Updated weights for policy 0, policy_version 28930 (0.0006) [2023-03-07 16:50:56,743][232226] Updated weights for policy 0, policy_version 28940 (0.0006) [2023-03-07 16:50:57,527][232226] Updated weights for policy 0, policy_version 28950 (0.0007) [2023-03-07 16:50:58,331][232226] Updated weights for policy 0, policy_version 28960 (0.0007) [2023-03-07 16:50:59,145][232226] Updated weights for policy 0, policy_version 28970 (0.0006) [2023-03-07 16:50:59,919][232226] Updated weights for policy 0, policy_version 28980 (0.0007) [2023-03-07 16:51:00,069][231894] Fps is (10 sec: 12799.8, 60 sec: 12885.3, 300 sec: 12898.9). Total num frames: 29676544. Throughput: 0: 12888.4. Samples: 29641932. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:51:00,070][231894] Avg episode reward: [(0, '196.996')] [2023-03-07 16:51:00,719][232226] Updated weights for policy 0, policy_version 28990 (0.0006) [2023-03-07 16:51:01,499][232226] Updated weights for policy 0, policy_version 29000 (0.0007) [2023-03-07 16:51:02,303][232226] Updated weights for policy 0, policy_version 29010 (0.0006) [2023-03-07 16:51:03,078][232226] Updated weights for policy 0, policy_version 29020 (0.0007) [2023-03-07 16:51:03,863][232226] Updated weights for policy 0, policy_version 29030 (0.0006) [2023-03-07 16:51:04,670][232226] Updated weights for policy 0, policy_version 29040 (0.0006) [2023-03-07 16:51:05,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 29742080. Throughput: 0: 12894.1. Samples: 29719386. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:51:05,069][231894] Avg episode reward: [(0, '192.092')] [2023-03-07 16:51:05,461][232226] Updated weights for policy 0, policy_version 29050 (0.0006) [2023-03-07 16:51:06,252][232226] Updated weights for policy 0, policy_version 29060 (0.0007) [2023-03-07 16:51:07,053][232226] Updated weights for policy 0, policy_version 29070 (0.0006) [2023-03-07 16:51:07,829][232226] Updated weights for policy 0, policy_version 29080 (0.0006) [2023-03-07 16:51:08,613][232226] Updated weights for policy 0, policy_version 29090 (0.0006) [2023-03-07 16:51:09,409][232226] Updated weights for policy 0, policy_version 29100 (0.0005) [2023-03-07 16:51:10,069][231894] Fps is (10 sec: 13004.9, 60 sec: 12902.4, 300 sec: 12905.9). Total num frames: 29806592. Throughput: 0: 12899.3. Samples: 29797093. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:51:10,069][231894] Avg episode reward: [(0, '192.251')] [2023-03-07 16:51:10,205][232226] Updated weights for policy 0, policy_version 29110 (0.0007) [2023-03-07 16:51:10,986][232226] Updated weights for policy 0, policy_version 29120 (0.0007) [2023-03-07 16:51:11,774][232226] Updated weights for policy 0, policy_version 29130 (0.0006) [2023-03-07 16:51:12,574][232226] Updated weights for policy 0, policy_version 29140 (0.0007) [2023-03-07 16:51:13,370][232226] Updated weights for policy 0, policy_version 29150 (0.0006) [2023-03-07 16:51:14,161][232226] Updated weights for policy 0, policy_version 29160 (0.0005) [2023-03-07 16:51:14,949][232226] Updated weights for policy 0, policy_version 29170 (0.0005) [2023-03-07 16:51:15,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12905.9). Total num frames: 29871104. Throughput: 0: 12906.5. Samples: 29835966. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:51:15,070][231894] Avg episode reward: [(0, '193.984')] [2023-03-07 16:51:15,734][232226] Updated weights for policy 0, policy_version 29180 (0.0006) [2023-03-07 16:51:16,549][232226] Updated weights for policy 0, policy_version 29190 (0.0007) [2023-03-07 16:51:17,340][232226] Updated weights for policy 0, policy_version 29200 (0.0006) [2023-03-07 16:51:18,125][232226] Updated weights for policy 0, policy_version 29210 (0.0006) [2023-03-07 16:51:18,946][232226] Updated weights for policy 0, policy_version 29220 (0.0006) [2023-03-07 16:51:19,735][232226] Updated weights for policy 0, policy_version 29230 (0.0007) [2023-03-07 16:51:20,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 29935616. Throughput: 0: 12906.1. Samples: 29913343. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:51:20,069][231894] Avg episode reward: [(0, '192.771')] [2023-03-07 16:51:20,515][232226] Updated weights for policy 0, policy_version 29240 (0.0006) [2023-03-07 16:51:21,327][232226] Updated weights for policy 0, policy_version 29250 (0.0007) [2023-03-07 16:51:22,124][232226] Updated weights for policy 0, policy_version 29260 (0.0007) [2023-03-07 16:51:22,895][232226] Updated weights for policy 0, policy_version 29270 (0.0007) [2023-03-07 16:51:23,691][232226] Updated weights for policy 0, policy_version 29280 (0.0006) [2023-03-07 16:51:24,501][232226] Updated weights for policy 0, policy_version 29290 (0.0006) [2023-03-07 16:51:25,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 30000128. Throughput: 0: 12897.4. Samples: 29990447. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:51:25,069][231894] Avg episode reward: [(0, '189.766')] [2023-03-07 16:51:25,297][232226] Updated weights for policy 0, policy_version 29300 (0.0006) [2023-03-07 16:51:26,081][232226] Updated weights for policy 0, policy_version 29310 (0.0007) [2023-03-07 16:51:26,868][232226] Updated weights for policy 0, policy_version 29320 (0.0006) [2023-03-07 16:51:27,653][232226] Updated weights for policy 0, policy_version 29330 (0.0006) [2023-03-07 16:51:28,458][232226] Updated weights for policy 0, policy_version 29340 (0.0005) [2023-03-07 16:51:29,275][232226] Updated weights for policy 0, policy_version 29350 (0.0006) [2023-03-07 16:51:30,069][232226] Updated weights for policy 0, policy_version 29360 (0.0006) [2023-03-07 16:51:30,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 30064640. Throughput: 0: 12902.1. Samples: 30029260. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:51:30,080][231894] Avg episode reward: [(0, '196.111')] [2023-03-07 16:51:30,874][232226] Updated weights for policy 0, policy_version 29370 (0.0006) [2023-03-07 16:51:31,676][232226] Updated weights for policy 0, policy_version 29380 (0.0007) [2023-03-07 16:51:32,455][232226] Updated weights for policy 0, policy_version 29390 (0.0007) [2023-03-07 16:51:33,261][232226] Updated weights for policy 0, policy_version 29400 (0.0007) [2023-03-07 16:51:34,057][232226] Updated weights for policy 0, policy_version 29410 (0.0007) [2023-03-07 16:51:34,849][232226] Updated weights for policy 0, policy_version 29420 (0.0006) [2023-03-07 16:51:35,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12885.3, 300 sec: 12895.5). Total num frames: 30128128. Throughput: 0: 12894.3. Samples: 30106300. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:51:35,080][231894] Avg episode reward: [(0, '192.628')] [2023-03-07 16:51:35,644][232226] Updated weights for policy 0, policy_version 29430 (0.0006) [2023-03-07 16:51:36,455][232226] Updated weights for policy 0, policy_version 29440 (0.0006) [2023-03-07 16:51:37,245][232226] Updated weights for policy 0, policy_version 29450 (0.0006) [2023-03-07 16:51:38,031][232226] Updated weights for policy 0, policy_version 29460 (0.0005) [2023-03-07 16:51:38,814][232226] Updated weights for policy 0, policy_version 29470 (0.0006) [2023-03-07 16:51:39,600][232226] Updated weights for policy 0, policy_version 29480 (0.0006) [2023-03-07 16:51:40,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12885.4, 300 sec: 12895.5). Total num frames: 30192640. Throughput: 0: 12894.7. Samples: 30183595. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:51:40,080][231894] Avg episode reward: [(0, '198.110')] [2023-03-07 16:51:40,391][232226] Updated weights for policy 0, policy_version 29490 (0.0007) [2023-03-07 16:51:41,176][232226] Updated weights for policy 0, policy_version 29500 (0.0006) [2023-03-07 16:51:41,966][232226] Updated weights for policy 0, policy_version 29510 (0.0006) [2023-03-07 16:51:42,760][232226] Updated weights for policy 0, policy_version 29520 (0.0006) [2023-03-07 16:51:43,547][232226] Updated weights for policy 0, policy_version 29530 (0.0006) [2023-03-07 16:51:44,352][232226] Updated weights for policy 0, policy_version 29540 (0.0006) [2023-03-07 16:51:45,069][231894] Fps is (10 sec: 13004.9, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 30258176. Throughput: 0: 12904.9. Samples: 30222650. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:51:45,080][231894] Avg episode reward: [(0, '191.574')] [2023-03-07 16:51:45,158][232226] Updated weights for policy 0, policy_version 29550 (0.0006) [2023-03-07 16:51:45,946][232226] Updated weights for policy 0, policy_version 29560 (0.0005) [2023-03-07 16:51:46,762][232226] Updated weights for policy 0, policy_version 29570 (0.0006) [2023-03-07 16:51:47,547][232226] Updated weights for policy 0, policy_version 29580 (0.0007) [2023-03-07 16:51:48,334][232226] Updated weights for policy 0, policy_version 29590 (0.0006) [2023-03-07 16:51:49,126][232226] Updated weights for policy 0, policy_version 29600 (0.0007) [2023-03-07 16:51:49,918][232226] Updated weights for policy 0, policy_version 29610 (0.0006) [2023-03-07 16:51:50,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12895.5). Total num frames: 30321664. Throughput: 0: 12900.1. Samples: 30299890. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:51:50,080][231894] Avg episode reward: [(0, '192.300')] [2023-03-07 16:51:50,712][232226] Updated weights for policy 0, policy_version 29620 (0.0006) [2023-03-07 16:51:51,510][232226] Updated weights for policy 0, policy_version 29630 (0.0006) [2023-03-07 16:51:52,301][232226] Updated weights for policy 0, policy_version 29640 (0.0007) [2023-03-07 16:51:53,079][232226] Updated weights for policy 0, policy_version 29650 (0.0006) [2023-03-07 16:51:53,854][232226] Updated weights for policy 0, policy_version 29660 (0.0006) [2023-03-07 16:51:54,661][232226] Updated weights for policy 0, policy_version 29670 (0.0006) [2023-03-07 16:51:55,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 30387200. Throughput: 0: 12898.0. Samples: 30377502. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:51:55,080][231894] Avg episode reward: [(0, '189.272')] [2023-03-07 16:51:55,448][232226] Updated weights for policy 0, policy_version 29680 (0.0006) [2023-03-07 16:51:56,255][232226] Updated weights for policy 0, policy_version 29690 (0.0006) [2023-03-07 16:51:57,039][232226] Updated weights for policy 0, policy_version 29700 (0.0006) [2023-03-07 16:51:57,823][232226] Updated weights for policy 0, policy_version 29710 (0.0007) [2023-03-07 16:51:58,625][232226] Updated weights for policy 0, policy_version 29720 (0.0006) [2023-03-07 16:51:59,420][232226] Updated weights for policy 0, policy_version 29730 (0.0006) [2023-03-07 16:52:00,069][231894] Fps is (10 sec: 13004.7, 60 sec: 12919.5, 300 sec: 12898.9). Total num frames: 30451712. Throughput: 0: 12895.0. Samples: 30416241. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:52:00,080][231894] Avg episode reward: [(0, '193.978')] [2023-03-07 16:52:00,224][232226] Updated weights for policy 0, policy_version 29740 (0.0006) [2023-03-07 16:52:01,020][232226] Updated weights for policy 0, policy_version 29750 (0.0006) [2023-03-07 16:52:01,802][232226] Updated weights for policy 0, policy_version 29760 (0.0006) [2023-03-07 16:52:02,600][232226] Updated weights for policy 0, policy_version 29770 (0.0007) [2023-03-07 16:52:03,378][232226] Updated weights for policy 0, policy_version 29780 (0.0006) [2023-03-07 16:52:04,172][232226] Updated weights for policy 0, policy_version 29790 (0.0006) [2023-03-07 16:52:04,963][232226] Updated weights for policy 0, policy_version 29800 (0.0006) [2023-03-07 16:52:05,069][231894] Fps is (10 sec: 12902.2, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 30516224. Throughput: 0: 12899.9. Samples: 30493838. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:52:05,080][231894] Avg episode reward: [(0, '189.977')] [2023-03-07 16:52:05,745][232226] Updated weights for policy 0, policy_version 29810 (0.0006) [2023-03-07 16:52:06,549][232226] Updated weights for policy 0, policy_version 29820 (0.0007) [2023-03-07 16:52:07,329][232226] Updated weights for policy 0, policy_version 29830 (0.0006) [2023-03-07 16:52:08,124][232226] Updated weights for policy 0, policy_version 29840 (0.0006) [2023-03-07 16:52:08,924][232226] Updated weights for policy 0, policy_version 29850 (0.0006) [2023-03-07 16:52:09,709][232226] Updated weights for policy 0, policy_version 29860 (0.0006) [2023-03-07 16:52:10,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 30580736. Throughput: 0: 12908.6. Samples: 30571332. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:52:10,080][231894] Avg episode reward: [(0, '193.946')] [2023-03-07 16:52:10,505][232226] Updated weights for policy 0, policy_version 29870 (0.0006) [2023-03-07 16:52:11,300][232226] Updated weights for policy 0, policy_version 29880 (0.0006) [2023-03-07 16:52:12,086][232226] Updated weights for policy 0, policy_version 29890 (0.0008) [2023-03-07 16:52:12,872][232226] Updated weights for policy 0, policy_version 29900 (0.0007) [2023-03-07 16:52:13,667][232226] Updated weights for policy 0, policy_version 29910 (0.0006) [2023-03-07 16:52:14,457][232226] Updated weights for policy 0, policy_version 29920 (0.0006) [2023-03-07 16:52:15,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 30645248. Throughput: 0: 12910.6. Samples: 30610235. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:52:15,080][231894] Avg episode reward: [(0, '190.329')] [2023-03-07 16:52:15,248][232226] Updated weights for policy 0, policy_version 29930 (0.0008) [2023-03-07 16:52:16,044][232226] Updated weights for policy 0, policy_version 29940 (0.0006) [2023-03-07 16:52:16,842][232226] Updated weights for policy 0, policy_version 29950 (0.0005) [2023-03-07 16:52:17,633][232226] Updated weights for policy 0, policy_version 29960 (0.0006) [2023-03-07 16:52:18,450][232226] Updated weights for policy 0, policy_version 29970 (0.0007) [2023-03-07 16:52:19,235][232226] Updated weights for policy 0, policy_version 29980 (0.0007) [2023-03-07 16:52:20,030][232226] Updated weights for policy 0, policy_version 29990 (0.0006) [2023-03-07 16:52:20,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 30709760. Throughput: 0: 12918.3. Samples: 30687622. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:52:20,069][231894] Avg episode reward: [(0, '185.799')] [2023-03-07 16:52:20,828][232226] Updated weights for policy 0, policy_version 30000 (0.0006) [2023-03-07 16:52:21,634][232226] Updated weights for policy 0, policy_version 30010 (0.0005) [2023-03-07 16:52:22,422][232226] Updated weights for policy 0, policy_version 30020 (0.0006) [2023-03-07 16:52:23,204][232226] Updated weights for policy 0, policy_version 30030 (0.0007) [2023-03-07 16:52:24,006][232226] Updated weights for policy 0, policy_version 30040 (0.0006) [2023-03-07 16:52:24,807][232226] Updated weights for policy 0, policy_version 30050 (0.0006) [2023-03-07 16:52:25,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 30774272. Throughput: 0: 12915.0. Samples: 30764773. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:52:25,069][231894] Avg episode reward: [(0, '191.666')] [2023-03-07 16:52:25,072][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000030053_30774272.pth... [2023-03-07 16:52:25,103][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000027030_27678720.pth [2023-03-07 16:52:25,582][232226] Updated weights for policy 0, policy_version 30060 (0.0007) [2023-03-07 16:52:26,375][232226] Updated weights for policy 0, policy_version 30070 (0.0007) [2023-03-07 16:52:27,183][232226] Updated weights for policy 0, policy_version 30080 (0.0006) [2023-03-07 16:52:27,953][232226] Updated weights for policy 0, policy_version 30090 (0.0005) [2023-03-07 16:52:28,758][232226] Updated weights for policy 0, policy_version 30100 (0.0007) [2023-03-07 16:52:29,564][232226] Updated weights for policy 0, policy_version 30110 (0.0006) [2023-03-07 16:52:30,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 30838784. Throughput: 0: 12910.6. Samples: 30803626. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:52:30,069][231894] Avg episode reward: [(0, '191.893')] [2023-03-07 16:52:30,351][232226] Updated weights for policy 0, policy_version 30120 (0.0006) [2023-03-07 16:52:31,148][232226] Updated weights for policy 0, policy_version 30130 (0.0007) [2023-03-07 16:52:31,938][232226] Updated weights for policy 0, policy_version 30140 (0.0007) [2023-03-07 16:52:32,734][232226] Updated weights for policy 0, policy_version 30150 (0.0007) [2023-03-07 16:52:33,519][232226] Updated weights for policy 0, policy_version 30160 (0.0006) [2023-03-07 16:52:34,303][232226] Updated weights for policy 0, policy_version 30170 (0.0007) [2023-03-07 16:52:35,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12919.5, 300 sec: 12902.4). Total num frames: 30903296. Throughput: 0: 12917.9. Samples: 30881197. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:52:35,069][231894] Avg episode reward: [(0, '191.282')] [2023-03-07 16:52:35,088][232226] Updated weights for policy 0, policy_version 30180 (0.0007) [2023-03-07 16:52:35,879][232226] Updated weights for policy 0, policy_version 30190 (0.0006) [2023-03-07 16:52:36,672][232226] Updated weights for policy 0, policy_version 30200 (0.0007) [2023-03-07 16:52:37,469][232226] Updated weights for policy 0, policy_version 30210 (0.0006) [2023-03-07 16:52:38,243][232226] Updated weights for policy 0, policy_version 30220 (0.0006) [2023-03-07 16:52:39,033][232226] Updated weights for policy 0, policy_version 30230 (0.0006) [2023-03-07 16:52:39,821][232226] Updated weights for policy 0, policy_version 30240 (0.0006) [2023-03-07 16:52:40,069][231894] Fps is (10 sec: 13004.8, 60 sec: 12936.5, 300 sec: 12905.9). Total num frames: 30968832. Throughput: 0: 12923.7. Samples: 30959067. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:52:40,069][231894] Avg episode reward: [(0, '194.246')] [2023-03-07 16:52:40,628][232226] Updated weights for policy 0, policy_version 30250 (0.0007) [2023-03-07 16:52:41,401][232226] Updated weights for policy 0, policy_version 30260 (0.0006) [2023-03-07 16:52:42,190][232226] Updated weights for policy 0, policy_version 30270 (0.0006) [2023-03-07 16:52:42,993][232226] Updated weights for policy 0, policy_version 30280 (0.0006) [2023-03-07 16:52:43,797][232226] Updated weights for policy 0, policy_version 30290 (0.0006) [2023-03-07 16:52:44,580][232226] Updated weights for policy 0, policy_version 30300 (0.0006) [2023-03-07 16:52:45,069][231894] Fps is (10 sec: 13004.7, 60 sec: 12919.4, 300 sec: 12902.4). Total num frames: 31033344. Throughput: 0: 12922.9. Samples: 30997773. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:52:45,070][231894] Avg episode reward: [(0, '199.315')] [2023-03-07 16:52:45,379][232226] Updated weights for policy 0, policy_version 30310 (0.0007) [2023-03-07 16:52:46,173][232226] Updated weights for policy 0, policy_version 30320 (0.0006) [2023-03-07 16:52:46,948][232226] Updated weights for policy 0, policy_version 30330 (0.0006) [2023-03-07 16:52:47,758][232226] Updated weights for policy 0, policy_version 30340 (0.0007) [2023-03-07 16:52:48,530][232226] Updated weights for policy 0, policy_version 30350 (0.0007) [2023-03-07 16:52:49,337][232226] Updated weights for policy 0, policy_version 30360 (0.0006) [2023-03-07 16:52:50,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12936.5, 300 sec: 12902.4). Total num frames: 31097856. Throughput: 0: 12925.3. Samples: 31075475. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:52:50,069][231894] Avg episode reward: [(0, '194.799')] [2023-03-07 16:52:50,143][232226] Updated weights for policy 0, policy_version 30370 (0.0006) [2023-03-07 16:52:50,928][232226] Updated weights for policy 0, policy_version 30380 (0.0008) [2023-03-07 16:52:51,722][232226] Updated weights for policy 0, policy_version 30390 (0.0006) [2023-03-07 16:52:52,533][232226] Updated weights for policy 0, policy_version 30400 (0.0006) [2023-03-07 16:52:53,314][232226] Updated weights for policy 0, policy_version 30410 (0.0006) [2023-03-07 16:52:54,118][232226] Updated weights for policy 0, policy_version 30420 (0.0006) [2023-03-07 16:52:54,914][232226] Updated weights for policy 0, policy_version 30430 (0.0006) [2023-03-07 16:52:55,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12919.5, 300 sec: 12902.4). Total num frames: 31162368. Throughput: 0: 12914.5. Samples: 31152483. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:52:55,070][231894] Avg episode reward: [(0, '199.293')] [2023-03-07 16:52:55,714][232226] Updated weights for policy 0, policy_version 30440 (0.0006) [2023-03-07 16:52:56,518][232226] Updated weights for policy 0, policy_version 30450 (0.0006) [2023-03-07 16:52:57,307][232226] Updated weights for policy 0, policy_version 30460 (0.0006) [2023-03-07 16:52:58,098][232226] Updated weights for policy 0, policy_version 30470 (0.0007) [2023-03-07 16:52:58,889][232226] Updated weights for policy 0, policy_version 30480 (0.0006) [2023-03-07 16:52:59,687][232226] Updated weights for policy 0, policy_version 30490 (0.0007) [2023-03-07 16:53:00,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 31225856. Throughput: 0: 12907.6. Samples: 31191075. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:53:00,069][231894] Avg episode reward: [(0, '199.446')] [2023-03-07 16:53:00,483][232226] Updated weights for policy 0, policy_version 30500 (0.0006) [2023-03-07 16:53:01,276][232226] Updated weights for policy 0, policy_version 30510 (0.0006) [2023-03-07 16:53:02,061][232226] Updated weights for policy 0, policy_version 30520 (0.0006) [2023-03-07 16:53:02,870][232226] Updated weights for policy 0, policy_version 30530 (0.0006) [2023-03-07 16:53:03,666][232226] Updated weights for policy 0, policy_version 30540 (0.0006) [2023-03-07 16:53:04,456][232226] Updated weights for policy 0, policy_version 30550 (0.0007) [2023-03-07 16:53:05,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 31290368. Throughput: 0: 12903.2. Samples: 31268267. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:53:05,069][231894] Avg episode reward: [(0, '189.450')] [2023-03-07 16:53:05,242][232226] Updated weights for policy 0, policy_version 30560 (0.0006) [2023-03-07 16:53:06,048][232226] Updated weights for policy 0, policy_version 30570 (0.0007) [2023-03-07 16:53:06,823][232226] Updated weights for policy 0, policy_version 30580 (0.0006) [2023-03-07 16:53:07,630][232226] Updated weights for policy 0, policy_version 30590 (0.0006) [2023-03-07 16:53:08,432][232226] Updated weights for policy 0, policy_version 30600 (0.0006) [2023-03-07 16:53:09,224][232226] Updated weights for policy 0, policy_version 30610 (0.0007) [2023-03-07 16:53:10,016][232226] Updated weights for policy 0, policy_version 30620 (0.0006) [2023-03-07 16:53:10,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 31354880. Throughput: 0: 12911.3. Samples: 31345780. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:53:10,069][231894] Avg episode reward: [(0, '192.532')] [2023-03-07 16:53:10,815][232226] Updated weights for policy 0, policy_version 30630 (0.0006) [2023-03-07 16:53:11,620][232226] Updated weights for policy 0, policy_version 30640 (0.0006) [2023-03-07 16:53:12,418][232226] Updated weights for policy 0, policy_version 30650 (0.0006) [2023-03-07 16:53:13,208][232226] Updated weights for policy 0, policy_version 30660 (0.0006) [2023-03-07 16:53:13,994][232226] Updated weights for policy 0, policy_version 30670 (0.0006) [2023-03-07 16:53:14,786][232226] Updated weights for policy 0, policy_version 30680 (0.0006) [2023-03-07 16:53:15,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 31419392. Throughput: 0: 12907.2. Samples: 31384452. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:53:15,069][231894] Avg episode reward: [(0, '196.295')] [2023-03-07 16:53:15,562][232226] Updated weights for policy 0, policy_version 30690 (0.0007) [2023-03-07 16:53:16,362][232226] Updated weights for policy 0, policy_version 30700 (0.0006) [2023-03-07 16:53:17,160][232226] Updated weights for policy 0, policy_version 30710 (0.0007) [2023-03-07 16:53:17,957][232226] Updated weights for policy 0, policy_version 30720 (0.0007) [2023-03-07 16:53:18,752][232226] Updated weights for policy 0, policy_version 30730 (0.0007) [2023-03-07 16:53:19,534][232226] Updated weights for policy 0, policy_version 30740 (0.0007) [2023-03-07 16:53:20,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 31483904. Throughput: 0: 12901.9. Samples: 31461784. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:53:20,069][231894] Avg episode reward: [(0, '185.524')] [2023-03-07 16:53:20,338][232226] Updated weights for policy 0, policy_version 30750 (0.0006) [2023-03-07 16:53:21,115][232226] Updated weights for policy 0, policy_version 30760 (0.0006) [2023-03-07 16:53:21,916][232226] Updated weights for policy 0, policy_version 30770 (0.0007) [2023-03-07 16:53:22,720][232226] Updated weights for policy 0, policy_version 30780 (0.0005) [2023-03-07 16:53:23,507][232226] Updated weights for policy 0, policy_version 30790 (0.0007) [2023-03-07 16:53:24,296][232226] Updated weights for policy 0, policy_version 30800 (0.0006) [2023-03-07 16:53:25,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 31548416. Throughput: 0: 12895.0. Samples: 31539344. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:53:25,069][231894] Avg episode reward: [(0, '189.985')] [2023-03-07 16:53:25,097][232226] Updated weights for policy 0, policy_version 30810 (0.0006) [2023-03-07 16:53:25,866][232226] Updated weights for policy 0, policy_version 30820 (0.0007) [2023-03-07 16:53:26,658][232226] Updated weights for policy 0, policy_version 30830 (0.0006) [2023-03-07 16:53:27,455][232226] Updated weights for policy 0, policy_version 30840 (0.0006) [2023-03-07 16:53:28,246][232226] Updated weights for policy 0, policy_version 30850 (0.0006) [2023-03-07 16:53:29,053][232226] Updated weights for policy 0, policy_version 30860 (0.0006) [2023-03-07 16:53:29,846][232226] Updated weights for policy 0, policy_version 30870 (0.0006) [2023-03-07 16:53:30,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 31612928. Throughput: 0: 12898.7. Samples: 31578212. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:53:30,069][231894] Avg episode reward: [(0, '186.292')] [2023-03-07 16:53:30,629][232226] Updated weights for policy 0, policy_version 30880 (0.0006) [2023-03-07 16:53:31,444][232226] Updated weights for policy 0, policy_version 30890 (0.0006) [2023-03-07 16:53:32,235][232226] Updated weights for policy 0, policy_version 30900 (0.0007) [2023-03-07 16:53:33,032][232226] Updated weights for policy 0, policy_version 30910 (0.0006) [2023-03-07 16:53:33,822][232226] Updated weights for policy 0, policy_version 30920 (0.0006) [2023-03-07 16:53:34,640][232226] Updated weights for policy 0, policy_version 30930 (0.0006) [2023-03-07 16:53:35,069][231894] Fps is (10 sec: 12902.2, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 31677440. Throughput: 0: 12889.7. Samples: 31655515. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:53:35,070][231894] Avg episode reward: [(0, '187.602')] [2023-03-07 16:53:35,420][232226] Updated weights for policy 0, policy_version 30940 (0.0007) [2023-03-07 16:53:36,200][232226] Updated weights for policy 0, policy_version 30950 (0.0006) [2023-03-07 16:53:37,001][232226] Updated weights for policy 0, policy_version 30960 (0.0006) [2023-03-07 16:53:37,782][232226] Updated weights for policy 0, policy_version 30970 (0.0007) [2023-03-07 16:53:38,601][232226] Updated weights for policy 0, policy_version 30980 (0.0006) [2023-03-07 16:53:39,385][232226] Updated weights for policy 0, policy_version 30990 (0.0006) [2023-03-07 16:53:40,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12898.9). Total num frames: 31741952. Throughput: 0: 12896.1. Samples: 31732807. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:53:40,069][231894] Avg episode reward: [(0, '197.697')] [2023-03-07 16:53:40,179][232226] Updated weights for policy 0, policy_version 31000 (0.0006) [2023-03-07 16:53:40,992][232226] Updated weights for policy 0, policy_version 31010 (0.0007) [2023-03-07 16:53:41,776][232226] Updated weights for policy 0, policy_version 31020 (0.0006) [2023-03-07 16:53:42,574][232226] Updated weights for policy 0, policy_version 31030 (0.0006) [2023-03-07 16:53:43,374][232226] Updated weights for policy 0, policy_version 31040 (0.0007) [2023-03-07 16:53:44,142][232226] Updated weights for policy 0, policy_version 31050 (0.0007) [2023-03-07 16:53:44,944][232226] Updated weights for policy 0, policy_version 31060 (0.0007) [2023-03-07 16:53:45,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12898.9). Total num frames: 31806464. Throughput: 0: 12895.5. Samples: 31771374. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:53:45,070][231894] Avg episode reward: [(0, '195.122')] [2023-03-07 16:53:45,720][232226] Updated weights for policy 0, policy_version 31070 (0.0006) [2023-03-07 16:53:46,521][232226] Updated weights for policy 0, policy_version 31080 (0.0007) [2023-03-07 16:53:47,326][232226] Updated weights for policy 0, policy_version 31090 (0.0006) [2023-03-07 16:53:48,146][232226] Updated weights for policy 0, policy_version 31100 (0.0006) [2023-03-07 16:53:48,929][232226] Updated weights for policy 0, policy_version 31110 (0.0006) [2023-03-07 16:53:49,747][232226] Updated weights for policy 0, policy_version 31120 (0.0006) [2023-03-07 16:53:50,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12898.9). Total num frames: 31870976. Throughput: 0: 12897.6. Samples: 31848660. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:53:50,069][231894] Avg episode reward: [(0, '191.581')] [2023-03-07 16:53:50,532][232226] Updated weights for policy 0, policy_version 31130 (0.0007) [2023-03-07 16:53:51,316][232226] Updated weights for policy 0, policy_version 31140 (0.0007) [2023-03-07 16:53:52,127][232226] Updated weights for policy 0, policy_version 31150 (0.0007) [2023-03-07 16:53:52,908][232226] Updated weights for policy 0, policy_version 31160 (0.0007) [2023-03-07 16:53:53,691][232226] Updated weights for policy 0, policy_version 31170 (0.0006) [2023-03-07 16:53:54,495][232226] Updated weights for policy 0, policy_version 31180 (0.0007) [2023-03-07 16:53:55,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12898.9). Total num frames: 31935488. Throughput: 0: 12891.4. Samples: 31925893. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:53:55,069][231894] Avg episode reward: [(0, '192.054')] [2023-03-07 16:53:55,292][232226] Updated weights for policy 0, policy_version 31190 (0.0005) [2023-03-07 16:53:56,082][232226] Updated weights for policy 0, policy_version 31200 (0.0007) [2023-03-07 16:53:56,889][232226] Updated weights for policy 0, policy_version 31210 (0.0006) [2023-03-07 16:53:57,687][232226] Updated weights for policy 0, policy_version 31220 (0.0006) [2023-03-07 16:53:58,485][232226] Updated weights for policy 0, policy_version 31230 (0.0006) [2023-03-07 16:53:59,270][232226] Updated weights for policy 0, policy_version 31240 (0.0006) [2023-03-07 16:54:00,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12885.3, 300 sec: 12895.5). Total num frames: 31998976. Throughput: 0: 12889.7. Samples: 31964490. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:54:00,069][231894] Avg episode reward: [(0, '194.307')] [2023-03-07 16:54:00,095][232226] Updated weights for policy 0, policy_version 31250 (0.0006) [2023-03-07 16:54:00,890][232226] Updated weights for policy 0, policy_version 31260 (0.0006) [2023-03-07 16:54:01,669][232226] Updated weights for policy 0, policy_version 31270 (0.0007) [2023-03-07 16:54:02,476][232226] Updated weights for policy 0, policy_version 31280 (0.0006) [2023-03-07 16:54:03,262][232226] Updated weights for policy 0, policy_version 31290 (0.0006) [2023-03-07 16:54:04,052][232226] Updated weights for policy 0, policy_version 31300 (0.0006) [2023-03-07 16:54:04,854][232226] Updated weights for policy 0, policy_version 31310 (0.0007) [2023-03-07 16:54:05,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12885.3, 300 sec: 12895.5). Total num frames: 32063488. Throughput: 0: 12885.6. Samples: 32041636. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:54:05,069][231894] Avg episode reward: [(0, '194.080')] [2023-03-07 16:54:05,653][232226] Updated weights for policy 0, policy_version 31320 (0.0006) [2023-03-07 16:54:06,446][232226] Updated weights for policy 0, policy_version 31330 (0.0007) [2023-03-07 16:54:07,233][232226] Updated weights for policy 0, policy_version 31340 (0.0006) [2023-03-07 16:54:08,023][232226] Updated weights for policy 0, policy_version 31350 (0.0006) [2023-03-07 16:54:08,823][232226] Updated weights for policy 0, policy_version 31360 (0.0005) [2023-03-07 16:54:09,610][232226] Updated weights for policy 0, policy_version 31370 (0.0006) [2023-03-07 16:54:10,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12895.5). Total num frames: 32128000. Throughput: 0: 12882.8. Samples: 32119071. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:54:10,070][231894] Avg episode reward: [(0, '193.025')] [2023-03-07 16:54:10,417][232226] Updated weights for policy 0, policy_version 31380 (0.0006) [2023-03-07 16:54:11,186][232226] Updated weights for policy 0, policy_version 31390 (0.0007) [2023-03-07 16:54:11,986][232226] Updated weights for policy 0, policy_version 31400 (0.0006) [2023-03-07 16:54:12,791][232226] Updated weights for policy 0, policy_version 31410 (0.0007) [2023-03-07 16:54:13,600][232226] Updated weights for policy 0, policy_version 31420 (0.0007) [2023-03-07 16:54:14,387][232226] Updated weights for policy 0, policy_version 31430 (0.0006) [2023-03-07 16:54:15,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12895.5). Total num frames: 32192512. Throughput: 0: 12875.5. Samples: 32157612. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:54:15,069][231894] Avg episode reward: [(0, '191.306')] [2023-03-07 16:54:15,205][232226] Updated weights for policy 0, policy_version 31440 (0.0007) [2023-03-07 16:54:16,001][232226] Updated weights for policy 0, policy_version 31450 (0.0006) [2023-03-07 16:54:16,790][232226] Updated weights for policy 0, policy_version 31460 (0.0007) [2023-03-07 16:54:17,580][232226] Updated weights for policy 0, policy_version 31470 (0.0006) [2023-03-07 16:54:18,392][232226] Updated weights for policy 0, policy_version 31480 (0.0007) [2023-03-07 16:54:19,174][232226] Updated weights for policy 0, policy_version 31490 (0.0006) [2023-03-07 16:54:19,970][232226] Updated weights for policy 0, policy_version 31500 (0.0006) [2023-03-07 16:54:20,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12895.5). Total num frames: 32257024. Throughput: 0: 12868.6. Samples: 32234600. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:54:20,069][231894] Avg episode reward: [(0, '197.030')] [2023-03-07 16:54:20,750][232226] Updated weights for policy 0, policy_version 31510 (0.0006) [2023-03-07 16:54:21,533][232226] Updated weights for policy 0, policy_version 31520 (0.0006) [2023-03-07 16:54:22,348][232226] Updated weights for policy 0, policy_version 31530 (0.0007) [2023-03-07 16:54:23,130][232226] Updated weights for policy 0, policy_version 31540 (0.0007) [2023-03-07 16:54:23,910][232226] Updated weights for policy 0, policy_version 31550 (0.0006) [2023-03-07 16:54:24,722][232226] Updated weights for policy 0, policy_version 31560 (0.0006) [2023-03-07 16:54:25,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12898.9). Total num frames: 32321536. Throughput: 0: 12875.4. Samples: 32312201. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:54:25,069][231894] Avg episode reward: [(0, '192.612')] [2023-03-07 16:54:25,073][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000031564_32321536.pth... [2023-03-07 16:54:25,105][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000028541_29225984.pth [2023-03-07 16:54:25,506][232226] Updated weights for policy 0, policy_version 31570 (0.0006) [2023-03-07 16:54:26,313][232226] Updated weights for policy 0, policy_version 31580 (0.0006) [2023-03-07 16:54:27,105][232226] Updated weights for policy 0, policy_version 31590 (0.0007) [2023-03-07 16:54:27,916][232226] Updated weights for policy 0, policy_version 31600 (0.0006) [2023-03-07 16:54:28,683][232226] Updated weights for policy 0, policy_version 31610 (0.0007) [2023-03-07 16:54:29,472][232226] Updated weights for policy 0, policy_version 31620 (0.0006) [2023-03-07 16:54:30,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12898.9). Total num frames: 32386048. Throughput: 0: 12873.5. Samples: 32350681. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:54:30,069][231894] Avg episode reward: [(0, '195.421')] [2023-03-07 16:54:30,263][232226] Updated weights for policy 0, policy_version 31630 (0.0007) [2023-03-07 16:54:31,048][232226] Updated weights for policy 0, policy_version 31640 (0.0006) [2023-03-07 16:54:31,846][232226] Updated weights for policy 0, policy_version 31650 (0.0006) [2023-03-07 16:54:32,645][232226] Updated weights for policy 0, policy_version 31660 (0.0007) [2023-03-07 16:54:33,405][232226] Updated weights for policy 0, policy_version 31670 (0.0007) [2023-03-07 16:54:34,227][232226] Updated weights for policy 0, policy_version 31680 (0.0006) [2023-03-07 16:54:35,007][232226] Updated weights for policy 0, policy_version 31690 (0.0006) [2023-03-07 16:54:35,069][231894] Fps is (10 sec: 12902.2, 60 sec: 12885.3, 300 sec: 12898.9). Total num frames: 32450560. Throughput: 0: 12885.9. Samples: 32428526. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:54:35,069][231894] Avg episode reward: [(0, '192.231')] [2023-03-07 16:54:35,800][232226] Updated weights for policy 0, policy_version 31700 (0.0007) [2023-03-07 16:54:36,613][232226] Updated weights for policy 0, policy_version 31710 (0.0006) [2023-03-07 16:54:37,413][232226] Updated weights for policy 0, policy_version 31720 (0.0008) [2023-03-07 16:54:38,204][232226] Updated weights for policy 0, policy_version 31730 (0.0006) [2023-03-07 16:54:38,994][232226] Updated weights for policy 0, policy_version 31740 (0.0006) [2023-03-07 16:54:39,782][232226] Updated weights for policy 0, policy_version 31750 (0.0006) [2023-03-07 16:54:40,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12898.9). Total num frames: 32515072. Throughput: 0: 12887.5. Samples: 32505830. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:54:40,069][231894] Avg episode reward: [(0, '195.288')] [2023-03-07 16:54:40,594][232226] Updated weights for policy 0, policy_version 31760 (0.0006) [2023-03-07 16:54:41,371][232226] Updated weights for policy 0, policy_version 31770 (0.0006) [2023-03-07 16:54:42,162][232226] Updated weights for policy 0, policy_version 31780 (0.0006) [2023-03-07 16:54:42,961][232226] Updated weights for policy 0, policy_version 31790 (0.0007) [2023-03-07 16:54:43,758][232226] Updated weights for policy 0, policy_version 31800 (0.0006) [2023-03-07 16:54:44,558][232226] Updated weights for policy 0, policy_version 31810 (0.0006) [2023-03-07 16:54:45,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.4, 300 sec: 12898.9). Total num frames: 32579584. Throughput: 0: 12889.4. Samples: 32544510. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:54:45,069][231894] Avg episode reward: [(0, '186.447')] [2023-03-07 16:54:45,355][232226] Updated weights for policy 0, policy_version 31820 (0.0007) [2023-03-07 16:54:46,142][232226] Updated weights for policy 0, policy_version 31830 (0.0006) [2023-03-07 16:54:46,950][232226] Updated weights for policy 0, policy_version 31840 (0.0006) [2023-03-07 16:54:47,733][232226] Updated weights for policy 0, policy_version 31850 (0.0007) [2023-03-07 16:54:48,542][232226] Updated weights for policy 0, policy_version 31860 (0.0006) [2023-03-07 16:54:49,333][232226] Updated weights for policy 0, policy_version 31870 (0.0007) [2023-03-07 16:54:50,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12898.9). Total num frames: 32644096. Throughput: 0: 12895.9. Samples: 32621952. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:54:50,069][231894] Avg episode reward: [(0, '198.176')] [2023-03-07 16:54:50,127][232226] Updated weights for policy 0, policy_version 31880 (0.0006) [2023-03-07 16:54:50,924][232226] Updated weights for policy 0, policy_version 31890 (0.0006) [2023-03-07 16:54:51,720][232226] Updated weights for policy 0, policy_version 31900 (0.0006) [2023-03-07 16:54:52,522][232226] Updated weights for policy 0, policy_version 31910 (0.0006) [2023-03-07 16:54:53,299][232226] Updated weights for policy 0, policy_version 31920 (0.0007) [2023-03-07 16:54:54,096][232226] Updated weights for policy 0, policy_version 31930 (0.0007) [2023-03-07 16:54:54,887][232226] Updated weights for policy 0, policy_version 31940 (0.0006) [2023-03-07 16:54:55,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12898.9). Total num frames: 32708608. Throughput: 0: 12891.3. Samples: 32699181. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:54:55,069][231894] Avg episode reward: [(0, '196.138')] [2023-03-07 16:54:55,674][232226] Updated weights for policy 0, policy_version 31950 (0.0007) [2023-03-07 16:54:56,469][232226] Updated weights for policy 0, policy_version 31960 (0.0006) [2023-03-07 16:54:57,256][232226] Updated weights for policy 0, policy_version 31970 (0.0006) [2023-03-07 16:54:58,055][232226] Updated weights for policy 0, policy_version 31980 (0.0007) [2023-03-07 16:54:58,847][232226] Updated weights for policy 0, policy_version 31990 (0.0007) [2023-03-07 16:54:59,648][232226] Updated weights for policy 0, policy_version 32000 (0.0007) [2023-03-07 16:55:00,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 32773120. Throughput: 0: 12897.0. Samples: 32737976. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:55:00,070][231894] Avg episode reward: [(0, '193.282')] [2023-03-07 16:55:00,448][232226] Updated weights for policy 0, policy_version 32010 (0.0007) [2023-03-07 16:55:01,238][232226] Updated weights for policy 0, policy_version 32020 (0.0006) [2023-03-07 16:55:02,039][232226] Updated weights for policy 0, policy_version 32030 (0.0007) [2023-03-07 16:55:02,824][232226] Updated weights for policy 0, policy_version 32040 (0.0007) [2023-03-07 16:55:03,624][232226] Updated weights for policy 0, policy_version 32050 (0.0007) [2023-03-07 16:55:04,421][232226] Updated weights for policy 0, policy_version 32060 (0.0007) [2023-03-07 16:55:05,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 32837632. Throughput: 0: 12902.2. Samples: 32815198. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:55:05,069][231894] Avg episode reward: [(0, '185.528')] [2023-03-07 16:55:05,201][232226] Updated weights for policy 0, policy_version 32070 (0.0006) [2023-03-07 16:55:06,029][232226] Updated weights for policy 0, policy_version 32080 (0.0006) [2023-03-07 16:55:06,825][232226] Updated weights for policy 0, policy_version 32090 (0.0007) [2023-03-07 16:55:07,585][232226] Updated weights for policy 0, policy_version 32100 (0.0007) [2023-03-07 16:55:08,402][232226] Updated weights for policy 0, policy_version 32110 (0.0007) [2023-03-07 16:55:09,179][232226] Updated weights for policy 0, policy_version 32120 (0.0006) [2023-03-07 16:55:09,979][232226] Updated weights for policy 0, policy_version 32130 (0.0007) [2023-03-07 16:55:10,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 32902144. Throughput: 0: 12896.2. Samples: 32892532. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:55:10,069][231894] Avg episode reward: [(0, '186.506')] [2023-03-07 16:55:10,792][232226] Updated weights for policy 0, policy_version 32140 (0.0006) [2023-03-07 16:55:11,573][232226] Updated weights for policy 0, policy_version 32150 (0.0005) [2023-03-07 16:55:12,368][232226] Updated weights for policy 0, policy_version 32160 (0.0006) [2023-03-07 16:55:13,156][232226] Updated weights for policy 0, policy_version 32170 (0.0006) [2023-03-07 16:55:13,930][232226] Updated weights for policy 0, policy_version 32180 (0.0006) [2023-03-07 16:55:14,734][232226] Updated weights for policy 0, policy_version 32190 (0.0006) [2023-03-07 16:55:15,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 32966656. Throughput: 0: 12901.1. Samples: 32931229. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:55:15,080][231894] Avg episode reward: [(0, '190.858')] [2023-03-07 16:55:15,522][232226] Updated weights for policy 0, policy_version 32200 (0.0007) [2023-03-07 16:55:16,315][232226] Updated weights for policy 0, policy_version 32210 (0.0007) [2023-03-07 16:55:17,119][232226] Updated weights for policy 0, policy_version 32220 (0.0006) [2023-03-07 16:55:17,923][232226] Updated weights for policy 0, policy_version 32230 (0.0006) [2023-03-07 16:55:18,701][232226] Updated weights for policy 0, policy_version 32240 (0.0006) [2023-03-07 16:55:19,501][232226] Updated weights for policy 0, policy_version 32250 (0.0007) [2023-03-07 16:55:20,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 33031168. Throughput: 0: 12892.1. Samples: 33008671. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:55:20,069][231894] Avg episode reward: [(0, '196.549')] [2023-03-07 16:55:20,307][232226] Updated weights for policy 0, policy_version 32260 (0.0007) [2023-03-07 16:55:21,079][232226] Updated weights for policy 0, policy_version 32270 (0.0006) [2023-03-07 16:55:21,878][232226] Updated weights for policy 0, policy_version 32280 (0.0006) [2023-03-07 16:55:22,683][232226] Updated weights for policy 0, policy_version 32290 (0.0006) [2023-03-07 16:55:23,469][232226] Updated weights for policy 0, policy_version 32300 (0.0007) [2023-03-07 16:55:24,276][232226] Updated weights for policy 0, policy_version 32310 (0.0007) [2023-03-07 16:55:25,058][232226] Updated weights for policy 0, policy_version 32320 (0.0006) [2023-03-07 16:55:25,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 33095680. Throughput: 0: 12891.9. Samples: 33085966. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:55:25,069][231894] Avg episode reward: [(0, '191.405')] [2023-03-07 16:55:25,854][232226] Updated weights for policy 0, policy_version 32330 (0.0006) [2023-03-07 16:55:26,666][232226] Updated weights for policy 0, policy_version 32340 (0.0005) [2023-03-07 16:55:27,444][232226] Updated weights for policy 0, policy_version 32350 (0.0007) [2023-03-07 16:55:28,249][232226] Updated weights for policy 0, policy_version 32360 (0.0006) [2023-03-07 16:55:29,050][232226] Updated weights for policy 0, policy_version 32370 (0.0006) [2023-03-07 16:55:29,845][232226] Updated weights for policy 0, policy_version 32380 (0.0007) [2023-03-07 16:55:30,069][231894] Fps is (10 sec: 12799.8, 60 sec: 12885.3, 300 sec: 12895.5). Total num frames: 33159168. Throughput: 0: 12888.6. Samples: 33124499. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:55:30,070][231894] Avg episode reward: [(0, '193.739')] [2023-03-07 16:55:30,627][232226] Updated weights for policy 0, policy_version 32390 (0.0007) [2023-03-07 16:55:31,412][232226] Updated weights for policy 0, policy_version 32400 (0.0007) [2023-03-07 16:55:32,202][232226] Updated weights for policy 0, policy_version 32410 (0.0006) [2023-03-07 16:55:33,005][232226] Updated weights for policy 0, policy_version 32420 (0.0006) [2023-03-07 16:55:33,777][232226] Updated weights for policy 0, policy_version 32430 (0.0006) [2023-03-07 16:55:34,577][232226] Updated weights for policy 0, policy_version 32440 (0.0007) [2023-03-07 16:55:35,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 33224704. Throughput: 0: 12892.4. Samples: 33202110. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:55:35,069][231894] Avg episode reward: [(0, '196.720')] [2023-03-07 16:55:35,365][232226] Updated weights for policy 0, policy_version 32450 (0.0007) [2023-03-07 16:55:36,164][232226] Updated weights for policy 0, policy_version 32460 (0.0007) [2023-03-07 16:55:36,973][232226] Updated weights for policy 0, policy_version 32470 (0.0007) [2023-03-07 16:55:37,764][232226] Updated weights for policy 0, policy_version 32480 (0.0007) [2023-03-07 16:55:38,557][232226] Updated weights for policy 0, policy_version 32490 (0.0007) [2023-03-07 16:55:39,348][232226] Updated weights for policy 0, policy_version 32500 (0.0006) [2023-03-07 16:55:40,069][231894] Fps is (10 sec: 13004.9, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 33289216. Throughput: 0: 12896.0. Samples: 33279503. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:55:40,069][231894] Avg episode reward: [(0, '188.026')] [2023-03-07 16:55:40,136][232226] Updated weights for policy 0, policy_version 32510 (0.0007) [2023-03-07 16:55:40,933][232226] Updated weights for policy 0, policy_version 32520 (0.0007) [2023-03-07 16:55:41,734][232226] Updated weights for policy 0, policy_version 32530 (0.0006) [2023-03-07 16:55:42,529][232226] Updated weights for policy 0, policy_version 32540 (0.0007) [2023-03-07 16:55:43,317][232226] Updated weights for policy 0, policy_version 32550 (0.0006) [2023-03-07 16:55:44,119][232226] Updated weights for policy 0, policy_version 32560 (0.0006) [2023-03-07 16:55:44,925][232226] Updated weights for policy 0, policy_version 32570 (0.0006) [2023-03-07 16:55:45,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12885.3, 300 sec: 12895.5). Total num frames: 33352704. Throughput: 0: 12890.2. Samples: 33318037. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:55:45,070][231894] Avg episode reward: [(0, '179.471')] [2023-03-07 16:55:45,709][232226] Updated weights for policy 0, policy_version 32580 (0.0006) [2023-03-07 16:55:46,504][232226] Updated weights for policy 0, policy_version 32590 (0.0005) [2023-03-07 16:55:47,289][232226] Updated weights for policy 0, policy_version 32600 (0.0006) [2023-03-07 16:55:48,087][232226] Updated weights for policy 0, policy_version 32610 (0.0007) [2023-03-07 16:55:48,854][232226] Updated weights for policy 0, policy_version 32620 (0.0007) [2023-03-07 16:55:49,661][232226] Updated weights for policy 0, policy_version 32630 (0.0005) [2023-03-07 16:55:50,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 33418240. Throughput: 0: 12899.6. Samples: 33395677. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:55:50,069][231894] Avg episode reward: [(0, '187.859')] [2023-03-07 16:55:50,447][232226] Updated weights for policy 0, policy_version 32640 (0.0006) [2023-03-07 16:55:51,249][232226] Updated weights for policy 0, policy_version 32650 (0.0006) [2023-03-07 16:55:52,035][232226] Updated weights for policy 0, policy_version 32660 (0.0005) [2023-03-07 16:55:52,849][232226] Updated weights for policy 0, policy_version 32670 (0.0006) [2023-03-07 16:55:53,630][232226] Updated weights for policy 0, policy_version 32680 (0.0006) [2023-03-07 16:55:54,437][232226] Updated weights for policy 0, policy_version 32690 (0.0007) [2023-03-07 16:55:55,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12885.4, 300 sec: 12898.9). Total num frames: 33481728. Throughput: 0: 12900.5. Samples: 33473056. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:55:55,069][231894] Avg episode reward: [(0, '193.311')] [2023-03-07 16:55:55,233][232226] Updated weights for policy 0, policy_version 32700 (0.0006) [2023-03-07 16:55:56,021][232226] Updated weights for policy 0, policy_version 32710 (0.0007) [2023-03-07 16:55:56,835][232226] Updated weights for policy 0, policy_version 32720 (0.0006) [2023-03-07 16:55:57,620][232226] Updated weights for policy 0, policy_version 32730 (0.0007) [2023-03-07 16:55:58,408][232226] Updated weights for policy 0, policy_version 32740 (0.0006) [2023-03-07 16:55:59,198][232226] Updated weights for policy 0, policy_version 32750 (0.0008) [2023-03-07 16:55:59,978][232226] Updated weights for policy 0, policy_version 32760 (0.0007) [2023-03-07 16:56:00,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 33547264. Throughput: 0: 12896.0. Samples: 33511548. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:56:00,069][231894] Avg episode reward: [(0, '193.827')] [2023-03-07 16:56:00,766][232226] Updated weights for policy 0, policy_version 32770 (0.0006) [2023-03-07 16:56:01,561][232226] Updated weights for policy 0, policy_version 32780 (0.0007) [2023-03-07 16:56:02,352][232226] Updated weights for policy 0, policy_version 32790 (0.0006) [2023-03-07 16:56:03,160][232226] Updated weights for policy 0, policy_version 32800 (0.0007) [2023-03-07 16:56:03,973][232226] Updated weights for policy 0, policy_version 32810 (0.0006) [2023-03-07 16:56:04,749][232226] Updated weights for policy 0, policy_version 32820 (0.0007) [2023-03-07 16:56:05,069][231894] Fps is (10 sec: 12902.2, 60 sec: 12885.3, 300 sec: 12895.5). Total num frames: 33610752. Throughput: 0: 12900.3. Samples: 33589188. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:56:05,070][231894] Avg episode reward: [(0, '192.365')] [2023-03-07 16:56:05,550][232226] Updated weights for policy 0, policy_version 32830 (0.0006) [2023-03-07 16:56:06,335][232226] Updated weights for policy 0, policy_version 32840 (0.0007) [2023-03-07 16:56:07,128][232226] Updated weights for policy 0, policy_version 32850 (0.0007) [2023-03-07 16:56:07,930][232226] Updated weights for policy 0, policy_version 32860 (0.0006) [2023-03-07 16:56:08,732][232226] Updated weights for policy 0, policy_version 32870 (0.0007) [2023-03-07 16:56:09,512][232226] Updated weights for policy 0, policy_version 32880 (0.0005) [2023-03-07 16:56:10,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12885.3, 300 sec: 12895.5). Total num frames: 33675264. Throughput: 0: 12897.3. Samples: 33666344. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:56:10,069][231894] Avg episode reward: [(0, '192.577')] [2023-03-07 16:56:10,317][232226] Updated weights for policy 0, policy_version 32890 (0.0006) [2023-03-07 16:56:11,097][232226] Updated weights for policy 0, policy_version 32900 (0.0006) [2023-03-07 16:56:11,905][232226] Updated weights for policy 0, policy_version 32910 (0.0006) [2023-03-07 16:56:12,693][232226] Updated weights for policy 0, policy_version 32920 (0.0006) [2023-03-07 16:56:13,464][232226] Updated weights for policy 0, policy_version 32930 (0.0007) [2023-03-07 16:56:14,270][232226] Updated weights for policy 0, policy_version 32940 (0.0007) [2023-03-07 16:56:15,054][232226] Updated weights for policy 0, policy_version 32950 (0.0006) [2023-03-07 16:56:15,069][231894] Fps is (10 sec: 13004.9, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 33740800. Throughput: 0: 12902.8. Samples: 33705123. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:56:15,069][231894] Avg episode reward: [(0, '190.620')] [2023-03-07 16:56:15,847][232226] Updated weights for policy 0, policy_version 32960 (0.0006) [2023-03-07 16:56:16,639][232226] Updated weights for policy 0, policy_version 32970 (0.0007) [2023-03-07 16:56:17,435][232226] Updated weights for policy 0, policy_version 32980 (0.0006) [2023-03-07 16:56:18,222][232226] Updated weights for policy 0, policy_version 32990 (0.0006) [2023-03-07 16:56:19,024][232226] Updated weights for policy 0, policy_version 33000 (0.0007) [2023-03-07 16:56:19,825][232226] Updated weights for policy 0, policy_version 33010 (0.0007) [2023-03-07 16:56:20,069][231894] Fps is (10 sec: 13004.9, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 33805312. Throughput: 0: 12905.4. Samples: 33782851. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:56:20,069][231894] Avg episode reward: [(0, '190.107')] [2023-03-07 16:56:20,620][232226] Updated weights for policy 0, policy_version 33020 (0.0006) [2023-03-07 16:56:21,432][232226] Updated weights for policy 0, policy_version 33030 (0.0006) [2023-03-07 16:56:22,190][232226] Updated weights for policy 0, policy_version 33040 (0.0006) [2023-03-07 16:56:22,987][232226] Updated weights for policy 0, policy_version 33050 (0.0006) [2023-03-07 16:56:23,789][232226] Updated weights for policy 0, policy_version 33060 (0.0006) [2023-03-07 16:56:24,554][232226] Updated weights for policy 0, policy_version 33070 (0.0005) [2023-03-07 16:56:25,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 33869824. Throughput: 0: 12905.5. Samples: 33860251. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:56:25,070][231894] Avg episode reward: [(0, '198.686')] [2023-03-07 16:56:25,074][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000033076_33869824.pth... [2023-03-07 16:56:25,105][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000030053_30774272.pth [2023-03-07 16:56:25,353][232226] Updated weights for policy 0, policy_version 33080 (0.0007) [2023-03-07 16:56:26,149][232226] Updated weights for policy 0, policy_version 33090 (0.0007) [2023-03-07 16:56:26,938][232226] Updated weights for policy 0, policy_version 33100 (0.0006) [2023-03-07 16:56:27,736][232226] Updated weights for policy 0, policy_version 33110 (0.0006) [2023-03-07 16:56:28,534][232226] Updated weights for policy 0, policy_version 33120 (0.0008) [2023-03-07 16:56:29,315][232226] Updated weights for policy 0, policy_version 33130 (0.0007) [2023-03-07 16:56:30,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12919.5, 300 sec: 12902.4). Total num frames: 33934336. Throughput: 0: 12916.6. Samples: 33899280. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:56:30,069][231894] Avg episode reward: [(0, '195.595')] [2023-03-07 16:56:30,105][232226] Updated weights for policy 0, policy_version 33140 (0.0007) [2023-03-07 16:56:30,909][232226] Updated weights for policy 0, policy_version 33150 (0.0006) [2023-03-07 16:56:31,701][232226] Updated weights for policy 0, policy_version 33160 (0.0007) [2023-03-07 16:56:32,501][232226] Updated weights for policy 0, policy_version 33170 (0.0007) [2023-03-07 16:56:33,282][232226] Updated weights for policy 0, policy_version 33180 (0.0006) [2023-03-07 16:56:34,093][232226] Updated weights for policy 0, policy_version 33190 (0.0006) [2023-03-07 16:56:34,871][232226] Updated weights for policy 0, policy_version 33200 (0.0006) [2023-03-07 16:56:35,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 33998848. Throughput: 0: 12909.6. Samples: 33976612. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:56:35,069][231894] Avg episode reward: [(0, '192.484')] [2023-03-07 16:56:35,684][232226] Updated weights for policy 0, policy_version 33210 (0.0006) [2023-03-07 16:56:36,486][232226] Updated weights for policy 0, policy_version 33220 (0.0007) [2023-03-07 16:56:37,286][232226] Updated weights for policy 0, policy_version 33230 (0.0006) [2023-03-07 16:56:38,068][232226] Updated weights for policy 0, policy_version 33240 (0.0007) [2023-03-07 16:56:38,869][232226] Updated weights for policy 0, policy_version 33250 (0.0006) [2023-03-07 16:56:39,655][232226] Updated weights for policy 0, policy_version 33260 (0.0006) [2023-03-07 16:56:40,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 34063360. Throughput: 0: 12905.4. Samples: 34053802. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:56:40,080][231894] Avg episode reward: [(0, '185.745')] [2023-03-07 16:56:40,452][232226] Updated weights for policy 0, policy_version 33270 (0.0007) [2023-03-07 16:56:41,248][232226] Updated weights for policy 0, policy_version 33280 (0.0008) [2023-03-07 16:56:42,049][232226] Updated weights for policy 0, policy_version 33290 (0.0006) [2023-03-07 16:56:42,829][232226] Updated weights for policy 0, policy_version 33300 (0.0007) [2023-03-07 16:56:43,617][232226] Updated weights for policy 0, policy_version 33310 (0.0006) [2023-03-07 16:56:44,422][232226] Updated weights for policy 0, policy_version 33320 (0.0007) [2023-03-07 16:56:45,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12919.5, 300 sec: 12902.4). Total num frames: 34127872. Throughput: 0: 12910.2. Samples: 34092509. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:56:45,080][231894] Avg episode reward: [(0, '186.196')] [2023-03-07 16:56:45,214][232226] Updated weights for policy 0, policy_version 33330 (0.0007) [2023-03-07 16:56:45,986][232226] Updated weights for policy 0, policy_version 33340 (0.0006) [2023-03-07 16:56:46,786][232226] Updated weights for policy 0, policy_version 33350 (0.0006) [2023-03-07 16:56:47,578][232226] Updated weights for policy 0, policy_version 33360 (0.0007) [2023-03-07 16:56:48,379][232226] Updated weights for policy 0, policy_version 33370 (0.0006) [2023-03-07 16:56:49,174][232226] Updated weights for policy 0, policy_version 33380 (0.0007) [2023-03-07 16:56:49,969][232226] Updated weights for policy 0, policy_version 33390 (0.0007) [2023-03-07 16:56:50,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12885.3, 300 sec: 12895.5). Total num frames: 34191360. Throughput: 0: 12905.1. Samples: 34169917. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:56:50,080][231894] Avg episode reward: [(0, '191.444')] [2023-03-07 16:56:50,783][232226] Updated weights for policy 0, policy_version 33400 (0.0008) [2023-03-07 16:56:51,561][232226] Updated weights for policy 0, policy_version 33410 (0.0005) [2023-03-07 16:56:52,357][232226] Updated weights for policy 0, policy_version 33420 (0.0007) [2023-03-07 16:56:53,143][232226] Updated weights for policy 0, policy_version 33430 (0.0006) [2023-03-07 16:56:53,955][232226] Updated weights for policy 0, policy_version 33440 (0.0007) [2023-03-07 16:56:54,733][232226] Updated weights for policy 0, policy_version 33450 (0.0006) [2023-03-07 16:56:55,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12919.5, 300 sec: 12898.9). Total num frames: 34256896. Throughput: 0: 12906.6. Samples: 34247139. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:56:55,080][231894] Avg episode reward: [(0, '198.215')] [2023-03-07 16:56:55,522][232226] Updated weights for policy 0, policy_version 33460 (0.0007) [2023-03-07 16:56:56,327][232226] Updated weights for policy 0, policy_version 33470 (0.0006) [2023-03-07 16:56:57,130][232226] Updated weights for policy 0, policy_version 33480 (0.0007) [2023-03-07 16:56:57,918][232226] Updated weights for policy 0, policy_version 33490 (0.0007) [2023-03-07 16:56:58,721][232226] Updated weights for policy 0, policy_version 33500 (0.0007) [2023-03-07 16:56:59,516][232226] Updated weights for policy 0, policy_version 33510 (0.0005) [2023-03-07 16:57:00,069][231894] Fps is (10 sec: 13004.8, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 34321408. Throughput: 0: 12904.6. Samples: 34285830. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:57:00,069][231894] Avg episode reward: [(0, '189.367')] [2023-03-07 16:57:00,314][232226] Updated weights for policy 0, policy_version 33520 (0.0007) [2023-03-07 16:57:01,096][232226] Updated weights for policy 0, policy_version 33530 (0.0007) [2023-03-07 16:57:01,883][232226] Updated weights for policy 0, policy_version 33540 (0.0006) [2023-03-07 16:57:02,688][232226] Updated weights for policy 0, policy_version 33550 (0.0006) [2023-03-07 16:57:03,483][232226] Updated weights for policy 0, policy_version 33560 (0.0006) [2023-03-07 16:57:04,268][232226] Updated weights for policy 0, policy_version 33570 (0.0006) [2023-03-07 16:57:05,069][231894] Fps is (10 sec: 12799.8, 60 sec: 12902.4, 300 sec: 12895.5). Total num frames: 34384896. Throughput: 0: 12895.9. Samples: 34363170. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:57:05,070][231894] Avg episode reward: [(0, '194.712')] [2023-03-07 16:57:05,084][232226] Updated weights for policy 0, policy_version 33580 (0.0005) [2023-03-07 16:57:05,878][232226] Updated weights for policy 0, policy_version 33590 (0.0007) [2023-03-07 16:57:06,661][232226] Updated weights for policy 0, policy_version 33600 (0.0006) [2023-03-07 16:57:07,444][232226] Updated weights for policy 0, policy_version 33610 (0.0007) [2023-03-07 16:57:08,258][232226] Updated weights for policy 0, policy_version 33620 (0.0007) [2023-03-07 16:57:09,057][232226] Updated weights for policy 0, policy_version 33630 (0.0007) [2023-03-07 16:57:09,848][232226] Updated weights for policy 0, policy_version 33640 (0.0006) [2023-03-07 16:57:10,069][231894] Fps is (10 sec: 12902.2, 60 sec: 12919.5, 300 sec: 12898.9). Total num frames: 34450432. Throughput: 0: 12896.7. Samples: 34440604. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:57:10,070][231894] Avg episode reward: [(0, '196.543')] [2023-03-07 16:57:10,650][232226] Updated weights for policy 0, policy_version 33650 (0.0006) [2023-03-07 16:57:11,458][232226] Updated weights for policy 0, policy_version 33660 (0.0006) [2023-03-07 16:57:12,246][232226] Updated weights for policy 0, policy_version 33670 (0.0007) [2023-03-07 16:57:13,030][232226] Updated weights for policy 0, policy_version 33680 (0.0005) [2023-03-07 16:57:13,836][232226] Updated weights for policy 0, policy_version 33690 (0.0007) [2023-03-07 16:57:14,618][232226] Updated weights for policy 0, policy_version 33700 (0.0006) [2023-03-07 16:57:15,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12895.5). Total num frames: 34513920. Throughput: 0: 12884.0. Samples: 34479061. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:57:15,069][231894] Avg episode reward: [(0, '200.610')] [2023-03-07 16:57:15,433][232226] Updated weights for policy 0, policy_version 33710 (0.0006) [2023-03-07 16:57:16,225][232226] Updated weights for policy 0, policy_version 33720 (0.0007) [2023-03-07 16:57:17,029][232226] Updated weights for policy 0, policy_version 33730 (0.0006) [2023-03-07 16:57:17,813][232226] Updated weights for policy 0, policy_version 33740 (0.0007) [2023-03-07 16:57:18,606][232226] Updated weights for policy 0, policy_version 33750 (0.0006) [2023-03-07 16:57:19,398][232226] Updated weights for policy 0, policy_version 33760 (0.0007) [2023-03-07 16:57:20,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12885.3, 300 sec: 12895.5). Total num frames: 34578432. Throughput: 0: 12880.5. Samples: 34556235. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:57:20,069][231894] Avg episode reward: [(0, '189.609')] [2023-03-07 16:57:20,183][232226] Updated weights for policy 0, policy_version 33770 (0.0006) [2023-03-07 16:57:21,000][232226] Updated weights for policy 0, policy_version 33780 (0.0006) [2023-03-07 16:57:21,781][232226] Updated weights for policy 0, policy_version 33790 (0.0006) [2023-03-07 16:57:22,588][232226] Updated weights for policy 0, policy_version 33800 (0.0006) [2023-03-07 16:57:23,365][232226] Updated weights for policy 0, policy_version 33810 (0.0005) [2023-03-07 16:57:24,171][232226] Updated weights for policy 0, policy_version 33820 (0.0006) [2023-03-07 16:57:24,970][232226] Updated weights for policy 0, policy_version 33830 (0.0006) [2023-03-07 16:57:25,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12895.5). Total num frames: 34642944. Throughput: 0: 12881.4. Samples: 34633465. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:57:25,069][231894] Avg episode reward: [(0, '192.392')] [2023-03-07 16:57:25,761][232226] Updated weights for policy 0, policy_version 33840 (0.0007) [2023-03-07 16:57:26,548][232226] Updated weights for policy 0, policy_version 33850 (0.0006) [2023-03-07 16:57:27,338][232226] Updated weights for policy 0, policy_version 33860 (0.0006) [2023-03-07 16:57:28,138][232226] Updated weights for policy 0, policy_version 33870 (0.0008) [2023-03-07 16:57:28,924][232226] Updated weights for policy 0, policy_version 33880 (0.0006) [2023-03-07 16:57:29,722][232226] Updated weights for policy 0, policy_version 33890 (0.0006) [2023-03-07 16:57:30,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12895.5). Total num frames: 34707456. Throughput: 0: 12882.7. Samples: 34672230. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:57:30,069][231894] Avg episode reward: [(0, '191.527')] [2023-03-07 16:57:30,513][232226] Updated weights for policy 0, policy_version 33900 (0.0006) [2023-03-07 16:57:31,320][232226] Updated weights for policy 0, policy_version 33910 (0.0006) [2023-03-07 16:57:32,113][232226] Updated weights for policy 0, policy_version 33920 (0.0006) [2023-03-07 16:57:32,908][232226] Updated weights for policy 0, policy_version 33930 (0.0006) [2023-03-07 16:57:33,702][232226] Updated weights for policy 0, policy_version 33940 (0.0006) [2023-03-07 16:57:34,502][232226] Updated weights for policy 0, policy_version 33950 (0.0006) [2023-03-07 16:57:35,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12885.4, 300 sec: 12892.0). Total num frames: 34771968. Throughput: 0: 12880.1. Samples: 34749521. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:57:35,069][231894] Avg episode reward: [(0, '196.007')] [2023-03-07 16:57:35,296][232226] Updated weights for policy 0, policy_version 33960 (0.0006) [2023-03-07 16:57:36,083][232226] Updated weights for policy 0, policy_version 33970 (0.0007) [2023-03-07 16:57:36,902][232226] Updated weights for policy 0, policy_version 33980 (0.0007) [2023-03-07 16:57:37,678][232226] Updated weights for policy 0, policy_version 33990 (0.0006) [2023-03-07 16:57:38,481][232226] Updated weights for policy 0, policy_version 34000 (0.0006) [2023-03-07 16:57:39,281][232226] Updated weights for policy 0, policy_version 34010 (0.0007) [2023-03-07 16:57:40,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12888.5). Total num frames: 34835456. Throughput: 0: 12876.9. Samples: 34826598. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:57:40,069][231894] Avg episode reward: [(0, '189.824')] [2023-03-07 16:57:40,076][232226] Updated weights for policy 0, policy_version 34020 (0.0007) [2023-03-07 16:57:40,878][232226] Updated weights for policy 0, policy_version 34030 (0.0006) [2023-03-07 16:57:41,677][232226] Updated weights for policy 0, policy_version 34040 (0.0006) [2023-03-07 16:57:42,461][232226] Updated weights for policy 0, policy_version 34050 (0.0007) [2023-03-07 16:57:43,262][232226] Updated weights for policy 0, policy_version 34060 (0.0006) [2023-03-07 16:57:44,075][232226] Updated weights for policy 0, policy_version 34070 (0.0006) [2023-03-07 16:57:44,867][232226] Updated weights for policy 0, policy_version 34080 (0.0006) [2023-03-07 16:57:45,069][231894] Fps is (10 sec: 12799.8, 60 sec: 12868.3, 300 sec: 12888.5). Total num frames: 34899968. Throughput: 0: 12872.8. Samples: 34865106. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:57:45,069][231894] Avg episode reward: [(0, '188.760')] [2023-03-07 16:57:45,652][232226] Updated weights for policy 0, policy_version 34090 (0.0006) [2023-03-07 16:57:46,457][232226] Updated weights for policy 0, policy_version 34100 (0.0006) [2023-03-07 16:57:47,265][232226] Updated weights for policy 0, policy_version 34110 (0.0006) [2023-03-07 16:57:48,054][232226] Updated weights for policy 0, policy_version 34120 (0.0007) [2023-03-07 16:57:48,844][232226] Updated weights for policy 0, policy_version 34130 (0.0006) [2023-03-07 16:57:49,651][232226] Updated weights for policy 0, policy_version 34140 (0.0007) [2023-03-07 16:57:50,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12888.5). Total num frames: 34964480. Throughput: 0: 12864.6. Samples: 34942077. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:57:50,069][231894] Avg episode reward: [(0, '197.384')] [2023-03-07 16:57:50,451][232226] Updated weights for policy 0, policy_version 34150 (0.0007) [2023-03-07 16:57:51,252][232226] Updated weights for policy 0, policy_version 34160 (0.0006) [2023-03-07 16:57:52,048][232226] Updated weights for policy 0, policy_version 34170 (0.0007) [2023-03-07 16:57:52,859][232226] Updated weights for policy 0, policy_version 34180 (0.0006) [2023-03-07 16:57:53,658][232226] Updated weights for policy 0, policy_version 34190 (0.0007) [2023-03-07 16:57:54,450][232226] Updated weights for policy 0, policy_version 34200 (0.0006) [2023-03-07 16:57:55,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12851.2, 300 sec: 12888.5). Total num frames: 35027968. Throughput: 0: 12853.6. Samples: 35019017. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:57:55,069][231894] Avg episode reward: [(0, '189.569')] [2023-03-07 16:57:55,243][232226] Updated weights for policy 0, policy_version 34210 (0.0007) [2023-03-07 16:57:56,029][232226] Updated weights for policy 0, policy_version 34220 (0.0007) [2023-03-07 16:57:56,817][232226] Updated weights for policy 0, policy_version 34230 (0.0006) [2023-03-07 16:57:57,601][232226] Updated weights for policy 0, policy_version 34240 (0.0007) [2023-03-07 16:57:58,380][232226] Updated weights for policy 0, policy_version 34250 (0.0006) [2023-03-07 16:57:59,191][232226] Updated weights for policy 0, policy_version 34260 (0.0006) [2023-03-07 16:57:59,981][232226] Updated weights for policy 0, policy_version 34270 (0.0006) [2023-03-07 16:58:00,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12851.2, 300 sec: 12888.5). Total num frames: 35092480. Throughput: 0: 12862.0. Samples: 35057849. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:58:00,069][231894] Avg episode reward: [(0, '186.329')] [2023-03-07 16:58:00,778][232226] Updated weights for policy 0, policy_version 34280 (0.0007) [2023-03-07 16:58:01,576][232226] Updated weights for policy 0, policy_version 34290 (0.0006) [2023-03-07 16:58:02,370][232226] Updated weights for policy 0, policy_version 34300 (0.0007) [2023-03-07 16:58:03,185][232226] Updated weights for policy 0, policy_version 34310 (0.0006) [2023-03-07 16:58:03,953][232226] Updated weights for policy 0, policy_version 34320 (0.0006) [2023-03-07 16:58:04,758][232226] Updated weights for policy 0, policy_version 34330 (0.0006) [2023-03-07 16:58:05,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12888.5). Total num frames: 35156992. Throughput: 0: 12869.3. Samples: 35135355. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:58:05,069][231894] Avg episode reward: [(0, '192.058')] [2023-03-07 16:58:05,538][232226] Updated weights for policy 0, policy_version 34340 (0.0007) [2023-03-07 16:58:06,328][232226] Updated weights for policy 0, policy_version 34350 (0.0007) [2023-03-07 16:58:07,144][232226] Updated weights for policy 0, policy_version 34360 (0.0008) [2023-03-07 16:58:07,945][232226] Updated weights for policy 0, policy_version 34370 (0.0006) [2023-03-07 16:58:08,740][232226] Updated weights for policy 0, policy_version 34380 (0.0006) [2023-03-07 16:58:09,528][232226] Updated weights for policy 0, policy_version 34390 (0.0006) [2023-03-07 16:58:10,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12888.5). Total num frames: 35221504. Throughput: 0: 12867.0. Samples: 35212479. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:58:10,069][231894] Avg episode reward: [(0, '189.636')] [2023-03-07 16:58:10,339][232226] Updated weights for policy 0, policy_version 34400 (0.0006) [2023-03-07 16:58:11,133][232226] Updated weights for policy 0, policy_version 34410 (0.0006) [2023-03-07 16:58:11,937][232226] Updated weights for policy 0, policy_version 34420 (0.0006) [2023-03-07 16:58:12,712][232226] Updated weights for policy 0, policy_version 34430 (0.0007) [2023-03-07 16:58:13,519][232226] Updated weights for policy 0, policy_version 34440 (0.0007) [2023-03-07 16:58:14,304][232226] Updated weights for policy 0, policy_version 34450 (0.0006) [2023-03-07 16:58:15,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12888.5). Total num frames: 35286016. Throughput: 0: 12863.5. Samples: 35251086. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:58:15,069][231894] Avg episode reward: [(0, '189.532')] [2023-03-07 16:58:15,086][232226] Updated weights for policy 0, policy_version 34460 (0.0007) [2023-03-07 16:58:15,889][232226] Updated weights for policy 0, policy_version 34470 (0.0006) [2023-03-07 16:58:16,665][232226] Updated weights for policy 0, policy_version 34480 (0.0006) [2023-03-07 16:58:17,486][232226] Updated weights for policy 0, policy_version 34490 (0.0006) [2023-03-07 16:58:18,281][232226] Updated weights for policy 0, policy_version 34500 (0.0006) [2023-03-07 16:58:19,064][232226] Updated weights for policy 0, policy_version 34510 (0.0007) [2023-03-07 16:58:19,863][232226] Updated weights for policy 0, policy_version 34520 (0.0007) [2023-03-07 16:58:20,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12888.5). Total num frames: 35350528. Throughput: 0: 12862.6. Samples: 35328337. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:58:20,070][231894] Avg episode reward: [(0, '193.997')] [2023-03-07 16:58:20,642][232226] Updated weights for policy 0, policy_version 34530 (0.0007) [2023-03-07 16:58:21,441][232226] Updated weights for policy 0, policy_version 34540 (0.0007) [2023-03-07 16:58:22,236][232226] Updated weights for policy 0, policy_version 34550 (0.0006) [2023-03-07 16:58:23,035][232226] Updated weights for policy 0, policy_version 34560 (0.0006) [2023-03-07 16:58:23,816][232226] Updated weights for policy 0, policy_version 34570 (0.0006) [2023-03-07 16:58:24,621][232226] Updated weights for policy 0, policy_version 34580 (0.0006) [2023-03-07 16:58:25,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12888.5). Total num frames: 35415040. Throughput: 0: 12880.2. Samples: 35406208. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:58:25,069][231894] Avg episode reward: [(0, '200.120')] [2023-03-07 16:58:25,082][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000034586_35416064.pth... [2023-03-07 16:58:25,112][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000031564_32321536.pth [2023-03-07 16:58:25,405][232226] Updated weights for policy 0, policy_version 34590 (0.0007) [2023-03-07 16:58:26,203][232226] Updated weights for policy 0, policy_version 34600 (0.0007) [2023-03-07 16:58:26,979][232226] Updated weights for policy 0, policy_version 34610 (0.0006) [2023-03-07 16:58:27,769][232226] Updated weights for policy 0, policy_version 34620 (0.0007) [2023-03-07 16:58:28,557][232226] Updated weights for policy 0, policy_version 34630 (0.0006) [2023-03-07 16:58:29,342][232226] Updated weights for policy 0, policy_version 34640 (0.0006) [2023-03-07 16:58:30,069][231894] Fps is (10 sec: 13004.9, 60 sec: 12885.3, 300 sec: 12892.0). Total num frames: 35480576. Throughput: 0: 12884.4. Samples: 35444901. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:58:30,069][231894] Avg episode reward: [(0, '190.629')] [2023-03-07 16:58:30,149][232226] Updated weights for policy 0, policy_version 34650 (0.0006) [2023-03-07 16:58:30,927][232226] Updated weights for policy 0, policy_version 34660 (0.0006) [2023-03-07 16:58:31,715][232226] Updated weights for policy 0, policy_version 34670 (0.0006) [2023-03-07 16:58:32,518][232226] Updated weights for policy 0, policy_version 34680 (0.0007) [2023-03-07 16:58:33,322][232226] Updated weights for policy 0, policy_version 34690 (0.0006) [2023-03-07 16:58:34,129][232226] Updated weights for policy 0, policy_version 34700 (0.0006) [2023-03-07 16:58:34,914][232226] Updated weights for policy 0, policy_version 34710 (0.0006) [2023-03-07 16:58:35,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12888.5). Total num frames: 35544064. Throughput: 0: 12898.2. Samples: 35522497. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:58:35,069][231894] Avg episode reward: [(0, '194.954')] [2023-03-07 16:58:35,700][232226] Updated weights for policy 0, policy_version 34720 (0.0006) [2023-03-07 16:58:36,502][232226] Updated weights for policy 0, policy_version 34730 (0.0006) [2023-03-07 16:58:37,280][232226] Updated weights for policy 0, policy_version 34740 (0.0007) [2023-03-07 16:58:38,078][232226] Updated weights for policy 0, policy_version 34750 (0.0006) [2023-03-07 16:58:38,892][232226] Updated weights for policy 0, policy_version 34760 (0.0006) [2023-03-07 16:58:39,679][232226] Updated weights for policy 0, policy_version 34770 (0.0006) [2023-03-07 16:58:40,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12885.3, 300 sec: 12888.5). Total num frames: 35608576. Throughput: 0: 12902.6. Samples: 35599634. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:58:40,069][231894] Avg episode reward: [(0, '188.340')] [2023-03-07 16:58:40,482][232226] Updated weights for policy 0, policy_version 34780 (0.0007) [2023-03-07 16:58:41,278][232226] Updated weights for policy 0, policy_version 34790 (0.0007) [2023-03-07 16:58:42,070][232226] Updated weights for policy 0, policy_version 34800 (0.0006) [2023-03-07 16:58:42,861][232226] Updated weights for policy 0, policy_version 34810 (0.0006) [2023-03-07 16:58:43,661][232226] Updated weights for policy 0, policy_version 34820 (0.0006) [2023-03-07 16:58:44,450][232226] Updated weights for policy 0, policy_version 34830 (0.0006) [2023-03-07 16:58:45,069][231894] Fps is (10 sec: 12902.1, 60 sec: 12885.3, 300 sec: 12888.5). Total num frames: 35673088. Throughput: 0: 12897.1. Samples: 35638220. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:58:45,070][231894] Avg episode reward: [(0, '192.166')] [2023-03-07 16:58:45,259][232226] Updated weights for policy 0, policy_version 34840 (0.0006) [2023-03-07 16:58:46,042][232226] Updated weights for policy 0, policy_version 34850 (0.0006) [2023-03-07 16:58:46,837][232226] Updated weights for policy 0, policy_version 34860 (0.0007) [2023-03-07 16:58:47,632][232226] Updated weights for policy 0, policy_version 34870 (0.0006) [2023-03-07 16:58:48,429][232226] Updated weights for policy 0, policy_version 34880 (0.0006) [2023-03-07 16:58:49,242][232226] Updated weights for policy 0, policy_version 34890 (0.0006) [2023-03-07 16:58:50,040][232226] Updated weights for policy 0, policy_version 34900 (0.0006) [2023-03-07 16:58:50,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12888.5). Total num frames: 35737600. Throughput: 0: 12891.0. Samples: 35715452. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 16:58:50,069][231894] Avg episode reward: [(0, '198.529')] [2023-03-07 16:58:50,847][232226] Updated weights for policy 0, policy_version 34910 (0.0007) [2023-03-07 16:58:51,662][232226] Updated weights for policy 0, policy_version 34920 (0.0007) [2023-03-07 16:58:52,448][232226] Updated weights for policy 0, policy_version 34930 (0.0007) [2023-03-07 16:58:53,232][232226] Updated weights for policy 0, policy_version 34940 (0.0006) [2023-03-07 16:58:54,033][232226] Updated weights for policy 0, policy_version 34950 (0.0006) [2023-03-07 16:58:54,831][232226] Updated weights for policy 0, policy_version 34960 (0.0007) [2023-03-07 16:58:55,069][231894] Fps is (10 sec: 12800.2, 60 sec: 12885.3, 300 sec: 12888.5). Total num frames: 35801088. Throughput: 0: 12884.9. Samples: 35792299. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:58:55,069][231894] Avg episode reward: [(0, '191.733')] [2023-03-07 16:58:55,627][232226] Updated weights for policy 0, policy_version 34970 (0.0006) [2023-03-07 16:58:56,426][232226] Updated weights for policy 0, policy_version 34980 (0.0006) [2023-03-07 16:58:57,222][232226] Updated weights for policy 0, policy_version 34990 (0.0007) [2023-03-07 16:58:58,013][232226] Updated weights for policy 0, policy_version 35000 (0.0006) [2023-03-07 16:58:58,823][232226] Updated weights for policy 0, policy_version 35010 (0.0006) [2023-03-07 16:58:59,611][232226] Updated weights for policy 0, policy_version 35020 (0.0006) [2023-03-07 16:59:00,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12885.3, 300 sec: 12888.5). Total num frames: 35865600. Throughput: 0: 12884.1. Samples: 35830870. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:59:00,069][231894] Avg episode reward: [(0, '193.790')] [2023-03-07 16:59:00,412][232226] Updated weights for policy 0, policy_version 35030 (0.0007) [2023-03-07 16:59:01,206][232226] Updated weights for policy 0, policy_version 35040 (0.0007) [2023-03-07 16:59:01,993][232226] Updated weights for policy 0, policy_version 35050 (0.0006) [2023-03-07 16:59:02,790][232226] Updated weights for policy 0, policy_version 35060 (0.0006) [2023-03-07 16:59:03,592][232226] Updated weights for policy 0, policy_version 35070 (0.0006) [2023-03-07 16:59:04,385][232226] Updated weights for policy 0, policy_version 35080 (0.0006) [2023-03-07 16:59:05,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12888.5). Total num frames: 35930112. Throughput: 0: 12883.2. Samples: 35908083. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:59:05,069][231894] Avg episode reward: [(0, '205.052')] [2023-03-07 16:59:05,192][232226] Updated weights for policy 0, policy_version 35090 (0.0007) [2023-03-07 16:59:05,973][232226] Updated weights for policy 0, policy_version 35100 (0.0006) [2023-03-07 16:59:06,786][232226] Updated weights for policy 0, policy_version 35110 (0.0006) [2023-03-07 16:59:07,577][232226] Updated weights for policy 0, policy_version 35120 (0.0006) [2023-03-07 16:59:08,362][232226] Updated weights for policy 0, policy_version 35130 (0.0006) [2023-03-07 16:59:09,151][232226] Updated weights for policy 0, policy_version 35140 (0.0007) [2023-03-07 16:59:09,952][232226] Updated weights for policy 0, policy_version 35150 (0.0006) [2023-03-07 16:59:10,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12888.5). Total num frames: 35994624. Throughput: 0: 12872.2. Samples: 35985458. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 16:59:10,069][231894] Avg episode reward: [(0, '194.284')] [2023-03-07 16:59:10,727][232226] Updated weights for policy 0, policy_version 35160 (0.0006) [2023-03-07 16:59:11,513][232226] Updated weights for policy 0, policy_version 35170 (0.0006) [2023-03-07 16:59:12,305][232226] Updated weights for policy 0, policy_version 35180 (0.0007) [2023-03-07 16:59:13,097][232226] Updated weights for policy 0, policy_version 35190 (0.0006) [2023-03-07 16:59:13,865][232226] Updated weights for policy 0, policy_version 35200 (0.0007) [2023-03-07 16:59:14,674][232226] Updated weights for policy 0, policy_version 35210 (0.0007) [2023-03-07 16:59:15,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12888.5). Total num frames: 36059136. Throughput: 0: 12875.8. Samples: 36024315. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:59:15,069][231894] Avg episode reward: [(0, '186.425')] [2023-03-07 16:59:15,464][232226] Updated weights for policy 0, policy_version 35220 (0.0007) [2023-03-07 16:59:16,256][232226] Updated weights for policy 0, policy_version 35230 (0.0006) [2023-03-07 16:59:17,053][232226] Updated weights for policy 0, policy_version 35240 (0.0006) [2023-03-07 16:59:17,843][232226] Updated weights for policy 0, policy_version 35250 (0.0006) [2023-03-07 16:59:18,627][232226] Updated weights for policy 0, policy_version 35260 (0.0007) [2023-03-07 16:59:19,422][232226] Updated weights for policy 0, policy_version 35270 (0.0006) [2023-03-07 16:59:20,069][231894] Fps is (10 sec: 13004.7, 60 sec: 12902.4, 300 sec: 12892.0). Total num frames: 36124672. Throughput: 0: 12880.7. Samples: 36102132. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:59:20,069][231894] Avg episode reward: [(0, '198.396')] [2023-03-07 16:59:20,205][232226] Updated weights for policy 0, policy_version 35280 (0.0007) [2023-03-07 16:59:21,002][232226] Updated weights for policy 0, policy_version 35290 (0.0006) [2023-03-07 16:59:21,792][232226] Updated weights for policy 0, policy_version 35300 (0.0008) [2023-03-07 16:59:22,585][232226] Updated weights for policy 0, policy_version 35310 (0.0006) [2023-03-07 16:59:23,373][232226] Updated weights for policy 0, policy_version 35320 (0.0006) [2023-03-07 16:59:24,186][232226] Updated weights for policy 0, policy_version 35330 (0.0006) [2023-03-07 16:59:24,994][232226] Updated weights for policy 0, policy_version 35340 (0.0008) [2023-03-07 16:59:25,069][231894] Fps is (10 sec: 13004.8, 60 sec: 12902.4, 300 sec: 12892.0). Total num frames: 36189184. Throughput: 0: 12884.3. Samples: 36179426. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:59:25,069][231894] Avg episode reward: [(0, '197.591')] [2023-03-07 16:59:25,780][232226] Updated weights for policy 0, policy_version 35350 (0.0006) [2023-03-07 16:59:26,582][232226] Updated weights for policy 0, policy_version 35360 (0.0007) [2023-03-07 16:59:27,378][232226] Updated weights for policy 0, policy_version 35370 (0.0006) [2023-03-07 16:59:28,173][232226] Updated weights for policy 0, policy_version 35380 (0.0006) [2023-03-07 16:59:28,968][232226] Updated weights for policy 0, policy_version 35390 (0.0006) [2023-03-07 16:59:29,761][232226] Updated weights for policy 0, policy_version 35400 (0.0006) [2023-03-07 16:59:30,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12868.3, 300 sec: 12888.5). Total num frames: 36252672. Throughput: 0: 12884.4. Samples: 36218018. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:59:30,069][231894] Avg episode reward: [(0, '192.000')] [2023-03-07 16:59:30,557][232226] Updated weights for policy 0, policy_version 35410 (0.0007) [2023-03-07 16:59:31,352][232226] Updated weights for policy 0, policy_version 35420 (0.0005) [2023-03-07 16:59:32,141][232226] Updated weights for policy 0, policy_version 35430 (0.0006) [2023-03-07 16:59:32,938][232226] Updated weights for policy 0, policy_version 35440 (0.0006) [2023-03-07 16:59:33,729][232226] Updated weights for policy 0, policy_version 35450 (0.0006) [2023-03-07 16:59:34,522][232226] Updated weights for policy 0, policy_version 35460 (0.0006) [2023-03-07 16:59:35,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12885.3, 300 sec: 12888.5). Total num frames: 36317184. Throughput: 0: 12888.3. Samples: 36295424. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:59:35,069][231894] Avg episode reward: [(0, '185.027')] [2023-03-07 16:59:35,327][232226] Updated weights for policy 0, policy_version 35470 (0.0007) [2023-03-07 16:59:36,099][232226] Updated weights for policy 0, policy_version 35480 (0.0006) [2023-03-07 16:59:36,892][232226] Updated weights for policy 0, policy_version 35490 (0.0006) [2023-03-07 16:59:37,681][232226] Updated weights for policy 0, policy_version 35500 (0.0007) [2023-03-07 16:59:38,488][232226] Updated weights for policy 0, policy_version 35510 (0.0006) [2023-03-07 16:59:39,292][232226] Updated weights for policy 0, policy_version 35520 (0.0005) [2023-03-07 16:59:40,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12888.5). Total num frames: 36381696. Throughput: 0: 12898.0. Samples: 36372708. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:59:40,069][231894] Avg episode reward: [(0, '193.818')] [2023-03-07 16:59:40,077][232226] Updated weights for policy 0, policy_version 35530 (0.0006) [2023-03-07 16:59:40,883][232226] Updated weights for policy 0, policy_version 35540 (0.0006) [2023-03-07 16:59:41,662][232226] Updated weights for policy 0, policy_version 35550 (0.0005) [2023-03-07 16:59:42,462][232226] Updated weights for policy 0, policy_version 35560 (0.0007) [2023-03-07 16:59:43,244][232226] Updated weights for policy 0, policy_version 35570 (0.0006) [2023-03-07 16:59:44,042][232226] Updated weights for policy 0, policy_version 35580 (0.0006) [2023-03-07 16:59:44,832][232226] Updated weights for policy 0, policy_version 35590 (0.0006) [2023-03-07 16:59:45,069][231894] Fps is (10 sec: 13004.6, 60 sec: 12902.4, 300 sec: 12892.0). Total num frames: 36447232. Throughput: 0: 12906.1. Samples: 36411643. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:59:45,069][231894] Avg episode reward: [(0, '197.110')] [2023-03-07 16:59:45,612][232226] Updated weights for policy 0, policy_version 35600 (0.0006) [2023-03-07 16:59:46,416][232226] Updated weights for policy 0, policy_version 35610 (0.0007) [2023-03-07 16:59:47,191][232226] Updated weights for policy 0, policy_version 35620 (0.0005) [2023-03-07 16:59:48,004][232226] Updated weights for policy 0, policy_version 35630 (0.0006) [2023-03-07 16:59:48,791][232226] Updated weights for policy 0, policy_version 35640 (0.0006) [2023-03-07 16:59:49,590][232226] Updated weights for policy 0, policy_version 35650 (0.0006) [2023-03-07 16:59:50,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12888.5). Total num frames: 36510720. Throughput: 0: 12912.7. Samples: 36489154. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:59:50,069][231894] Avg episode reward: [(0, '197.464')] [2023-03-07 16:59:50,400][232226] Updated weights for policy 0, policy_version 35660 (0.0007) [2023-03-07 16:59:51,190][232226] Updated weights for policy 0, policy_version 35670 (0.0007) [2023-03-07 16:59:51,979][232226] Updated weights for policy 0, policy_version 35680 (0.0007) [2023-03-07 16:59:52,800][232226] Updated weights for policy 0, policy_version 35690 (0.0007) [2023-03-07 16:59:53,596][232226] Updated weights for policy 0, policy_version 35700 (0.0006) [2023-03-07 16:59:54,390][232226] Updated weights for policy 0, policy_version 35710 (0.0006) [2023-03-07 16:59:55,069][231894] Fps is (10 sec: 12800.2, 60 sec: 12902.4, 300 sec: 12888.5). Total num frames: 36575232. Throughput: 0: 12900.7. Samples: 36565989. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 16:59:55,069][231894] Avg episode reward: [(0, '189.699')] [2023-03-07 16:59:55,200][232226] Updated weights for policy 0, policy_version 35720 (0.0007) [2023-03-07 16:59:55,986][232226] Updated weights for policy 0, policy_version 35730 (0.0007) [2023-03-07 16:59:56,785][232226] Updated weights for policy 0, policy_version 35740 (0.0006) [2023-03-07 16:59:57,588][232226] Updated weights for policy 0, policy_version 35750 (0.0007) [2023-03-07 16:59:58,375][232226] Updated weights for policy 0, policy_version 35760 (0.0006) [2023-03-07 16:59:59,166][232226] Updated weights for policy 0, policy_version 35770 (0.0006) [2023-03-07 16:59:59,982][232226] Updated weights for policy 0, policy_version 35780 (0.0006) [2023-03-07 17:00:00,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12888.5). Total num frames: 36639744. Throughput: 0: 12890.5. Samples: 36604386. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:00:00,069][231894] Avg episode reward: [(0, '192.709')] [2023-03-07 17:00:00,786][232226] Updated weights for policy 0, policy_version 35790 (0.0006) [2023-03-07 17:00:01,580][232226] Updated weights for policy 0, policy_version 35800 (0.0006) [2023-03-07 17:00:02,366][232226] Updated weights for policy 0, policy_version 35810 (0.0006) [2023-03-07 17:00:03,157][232226] Updated weights for policy 0, policy_version 35820 (0.0006) [2023-03-07 17:00:03,987][232226] Updated weights for policy 0, policy_version 35830 (0.0006) [2023-03-07 17:00:04,761][232226] Updated weights for policy 0, policy_version 35840 (0.0006) [2023-03-07 17:00:05,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12888.5). Total num frames: 36704256. Throughput: 0: 12874.3. Samples: 36681474. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:00:05,069][231894] Avg episode reward: [(0, '194.972')] [2023-03-07 17:00:05,561][232226] Updated weights for policy 0, policy_version 35850 (0.0007) [2023-03-07 17:00:06,357][232226] Updated weights for policy 0, policy_version 35860 (0.0006) [2023-03-07 17:00:07,152][232226] Updated weights for policy 0, policy_version 35870 (0.0007) [2023-03-07 17:00:07,941][232226] Updated weights for policy 0, policy_version 35880 (0.0007) [2023-03-07 17:00:08,718][232226] Updated weights for policy 0, policy_version 35890 (0.0006) [2023-03-07 17:00:09,514][232226] Updated weights for policy 0, policy_version 35900 (0.0007) [2023-03-07 17:00:10,069][231894] Fps is (10 sec: 12799.8, 60 sec: 12885.3, 300 sec: 12885.0). Total num frames: 36767744. Throughput: 0: 12874.1. Samples: 36758760. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:00:10,069][231894] Avg episode reward: [(0, '197.692')] [2023-03-07 17:00:10,309][232226] Updated weights for policy 0, policy_version 35910 (0.0006) [2023-03-07 17:00:11,113][232226] Updated weights for policy 0, policy_version 35920 (0.0006) [2023-03-07 17:00:11,900][232226] Updated weights for policy 0, policy_version 35930 (0.0006) [2023-03-07 17:00:12,689][232226] Updated weights for policy 0, policy_version 35940 (0.0006) [2023-03-07 17:00:13,485][232226] Updated weights for policy 0, policy_version 35950 (0.0006) [2023-03-07 17:00:14,284][232226] Updated weights for policy 0, policy_version 35960 (0.0006) [2023-03-07 17:00:15,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12885.3, 300 sec: 12885.0). Total num frames: 36832256. Throughput: 0: 12876.9. Samples: 36797477. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:00:15,069][231894] Avg episode reward: [(0, '194.864')] [2023-03-07 17:00:15,099][232226] Updated weights for policy 0, policy_version 35970 (0.0006) [2023-03-07 17:00:15,881][232226] Updated weights for policy 0, policy_version 35980 (0.0007) [2023-03-07 17:00:16,679][232226] Updated weights for policy 0, policy_version 35990 (0.0006) [2023-03-07 17:00:17,474][232226] Updated weights for policy 0, policy_version 36000 (0.0006) [2023-03-07 17:00:18,269][232226] Updated weights for policy 0, policy_version 36010 (0.0007) [2023-03-07 17:00:19,051][232226] Updated weights for policy 0, policy_version 36020 (0.0006) [2023-03-07 17:00:19,850][232226] Updated weights for policy 0, policy_version 36030 (0.0006) [2023-03-07 17:00:20,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12868.3, 300 sec: 12885.0). Total num frames: 36896768. Throughput: 0: 12871.9. Samples: 36874660. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:00:20,069][231894] Avg episode reward: [(0, '190.005')] [2023-03-07 17:00:20,646][232226] Updated weights for policy 0, policy_version 36040 (0.0006) [2023-03-07 17:00:21,446][232226] Updated weights for policy 0, policy_version 36050 (0.0006) [2023-03-07 17:00:22,235][232226] Updated weights for policy 0, policy_version 36060 (0.0006) [2023-03-07 17:00:23,038][232226] Updated weights for policy 0, policy_version 36070 (0.0006) [2023-03-07 17:00:23,834][232226] Updated weights for policy 0, policy_version 36080 (0.0006) [2023-03-07 17:00:24,617][232226] Updated weights for policy 0, policy_version 36090 (0.0006) [2023-03-07 17:00:25,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12888.5). Total num frames: 36961280. Throughput: 0: 12877.3. Samples: 36952187. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:00:25,069][231894] Avg episode reward: [(0, '194.739')] [2023-03-07 17:00:25,074][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000036095_36961280.pth... [2023-03-07 17:00:25,107][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000033076_33869824.pth [2023-03-07 17:00:25,413][232226] Updated weights for policy 0, policy_version 36100 (0.0006) [2023-03-07 17:00:26,216][232226] Updated weights for policy 0, policy_version 36110 (0.0007) [2023-03-07 17:00:27,009][232226] Updated weights for policy 0, policy_version 36120 (0.0006) [2023-03-07 17:00:27,785][232226] Updated weights for policy 0, policy_version 36130 (0.0007) [2023-03-07 17:00:28,578][232226] Updated weights for policy 0, policy_version 36140 (0.0006) [2023-03-07 17:00:29,376][232226] Updated weights for policy 0, policy_version 36150 (0.0006) [2023-03-07 17:00:30,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12885.0). Total num frames: 37025792. Throughput: 0: 12872.6. Samples: 36990907. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:00:30,069][231894] Avg episode reward: [(0, '187.784')] [2023-03-07 17:00:30,169][232226] Updated weights for policy 0, policy_version 36160 (0.0007) [2023-03-07 17:00:30,964][232226] Updated weights for policy 0, policy_version 36170 (0.0006) [2023-03-07 17:00:31,750][232226] Updated weights for policy 0, policy_version 36180 (0.0006) [2023-03-07 17:00:32,556][232226] Updated weights for policy 0, policy_version 36190 (0.0006) [2023-03-07 17:00:33,349][232226] Updated weights for policy 0, policy_version 36200 (0.0006) [2023-03-07 17:00:34,131][232226] Updated weights for policy 0, policy_version 36210 (0.0007) [2023-03-07 17:00:34,935][232226] Updated weights for policy 0, policy_version 36220 (0.0006) [2023-03-07 17:00:35,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12885.0). Total num frames: 37090304. Throughput: 0: 12872.1. Samples: 37068398. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:00:35,069][231894] Avg episode reward: [(0, '197.304')] [2023-03-07 17:00:35,729][232226] Updated weights for policy 0, policy_version 36230 (0.0006) [2023-03-07 17:00:36,529][232226] Updated weights for policy 0, policy_version 36240 (0.0007) [2023-03-07 17:00:37,329][232226] Updated weights for policy 0, policy_version 36250 (0.0006) [2023-03-07 17:00:38,117][232226] Updated weights for policy 0, policy_version 36260 (0.0006) [2023-03-07 17:00:38,898][232226] Updated weights for policy 0, policy_version 36270 (0.0006) [2023-03-07 17:00:39,133][232173] KL-divergence is very high: 187.3551 [2023-03-07 17:00:39,718][232226] Updated weights for policy 0, policy_version 36280 (0.0006) [2023-03-07 17:00:40,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.4, 300 sec: 12888.5). Total num frames: 37154816. Throughput: 0: 12883.8. Samples: 37145762. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:00:40,069][231894] Avg episode reward: [(0, '198.700')] [2023-03-07 17:00:40,493][232226] Updated weights for policy 0, policy_version 36290 (0.0006) [2023-03-07 17:00:41,280][232226] Updated weights for policy 0, policy_version 36300 (0.0005) [2023-03-07 17:00:42,077][232226] Updated weights for policy 0, policy_version 36310 (0.0006) [2023-03-07 17:00:42,876][232226] Updated weights for policy 0, policy_version 36320 (0.0006) [2023-03-07 17:00:43,665][232226] Updated weights for policy 0, policy_version 36330 (0.0006) [2023-03-07 17:00:44,469][232226] Updated weights for policy 0, policy_version 36340 (0.0006) [2023-03-07 17:00:45,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12885.0). Total num frames: 37219328. Throughput: 0: 12890.2. Samples: 37184446. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:00:45,070][231894] Avg episode reward: [(0, '193.393')] [2023-03-07 17:00:45,261][232226] Updated weights for policy 0, policy_version 36350 (0.0006) [2023-03-07 17:00:46,061][232226] Updated weights for policy 0, policy_version 36360 (0.0006) [2023-03-07 17:00:46,839][232226] Updated weights for policy 0, policy_version 36370 (0.0007) [2023-03-07 17:00:47,634][232226] Updated weights for policy 0, policy_version 36380 (0.0007) [2023-03-07 17:00:48,424][232226] Updated weights for policy 0, policy_version 36390 (0.0006) [2023-03-07 17:00:49,219][232226] Updated weights for policy 0, policy_version 36400 (0.0007) [2023-03-07 17:00:50,010][232226] Updated weights for policy 0, policy_version 36410 (0.0007) [2023-03-07 17:00:50,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12888.5). Total num frames: 37283840. Throughput: 0: 12897.3. Samples: 37261852. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:00:50,069][231894] Avg episode reward: [(0, '186.336')] [2023-03-07 17:00:50,799][232226] Updated weights for policy 0, policy_version 36420 (0.0007) [2023-03-07 17:00:51,610][232226] Updated weights for policy 0, policy_version 36430 (0.0007) [2023-03-07 17:00:52,398][232226] Updated weights for policy 0, policy_version 36440 (0.0006) [2023-03-07 17:00:53,191][232226] Updated weights for policy 0, policy_version 36450 (0.0006) [2023-03-07 17:00:53,999][232226] Updated weights for policy 0, policy_version 36460 (0.0006) [2023-03-07 17:00:54,799][232226] Updated weights for policy 0, policy_version 36470 (0.0007) [2023-03-07 17:00:55,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12885.0). Total num frames: 37348352. Throughput: 0: 12895.3. Samples: 37339049. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:00:55,069][231894] Avg episode reward: [(0, '196.234')] [2023-03-07 17:00:55,577][232226] Updated weights for policy 0, policy_version 36480 (0.0007) [2023-03-07 17:00:56,381][232226] Updated weights for policy 0, policy_version 36490 (0.0006) [2023-03-07 17:00:57,164][232226] Updated weights for policy 0, policy_version 36500 (0.0007) [2023-03-07 17:00:57,964][232226] Updated weights for policy 0, policy_version 36510 (0.0006) [2023-03-07 17:00:58,761][232226] Updated weights for policy 0, policy_version 36520 (0.0007) [2023-03-07 17:00:59,559][232226] Updated weights for policy 0, policy_version 36530 (0.0006) [2023-03-07 17:01:00,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12888.5). Total num frames: 37412864. Throughput: 0: 12898.6. Samples: 37377914. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:01:00,069][231894] Avg episode reward: [(0, '194.317')] [2023-03-07 17:01:00,350][232226] Updated weights for policy 0, policy_version 36540 (0.0007) [2023-03-07 17:01:01,123][232226] Updated weights for policy 0, policy_version 36550 (0.0006) [2023-03-07 17:01:01,918][232226] Updated weights for policy 0, policy_version 36560 (0.0008) [2023-03-07 17:01:02,727][232226] Updated weights for policy 0, policy_version 36570 (0.0006) [2023-03-07 17:01:03,506][232226] Updated weights for policy 0, policy_version 36580 (0.0006) [2023-03-07 17:01:04,317][232226] Updated weights for policy 0, policy_version 36590 (0.0007) [2023-03-07 17:01:05,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12888.5). Total num frames: 37477376. Throughput: 0: 12902.3. Samples: 37455264. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:01:05,069][231894] Avg episode reward: [(0, '195.557')] [2023-03-07 17:01:05,115][232226] Updated weights for policy 0, policy_version 36600 (0.0007) [2023-03-07 17:01:05,890][232226] Updated weights for policy 0, policy_version 36610 (0.0006) [2023-03-07 17:01:06,700][232226] Updated weights for policy 0, policy_version 36620 (0.0006) [2023-03-07 17:01:07,493][232226] Updated weights for policy 0, policy_version 36630 (0.0006) [2023-03-07 17:01:08,278][232226] Updated weights for policy 0, policy_version 36640 (0.0005) [2023-03-07 17:01:09,076][232226] Updated weights for policy 0, policy_version 36650 (0.0007) [2023-03-07 17:01:09,900][232226] Updated weights for policy 0, policy_version 36660 (0.0006) [2023-03-07 17:01:10,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12885.0). Total num frames: 37541888. Throughput: 0: 12892.2. Samples: 37532335. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:01:10,069][231894] Avg episode reward: [(0, '200.770')] [2023-03-07 17:01:10,687][232226] Updated weights for policy 0, policy_version 36670 (0.0006) [2023-03-07 17:01:11,487][232226] Updated weights for policy 0, policy_version 36680 (0.0006) [2023-03-07 17:01:12,294][232226] Updated weights for policy 0, policy_version 36690 (0.0006) [2023-03-07 17:01:13,073][232226] Updated weights for policy 0, policy_version 36700 (0.0006) [2023-03-07 17:01:13,872][232226] Updated weights for policy 0, policy_version 36710 (0.0007) [2023-03-07 17:01:14,668][232226] Updated weights for policy 0, policy_version 36720 (0.0006) [2023-03-07 17:01:15,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 37605376. Throughput: 0: 12885.9. Samples: 37570774. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:01:15,069][231894] Avg episode reward: [(0, '191.533')] [2023-03-07 17:01:15,453][232226] Updated weights for policy 0, policy_version 36730 (0.0007) [2023-03-07 17:01:16,262][232226] Updated weights for policy 0, policy_version 36740 (0.0006) [2023-03-07 17:01:17,070][232226] Updated weights for policy 0, policy_version 36750 (0.0006) [2023-03-07 17:01:17,862][232226] Updated weights for policy 0, policy_version 36760 (0.0007) [2023-03-07 17:01:18,629][232226] Updated weights for policy 0, policy_version 36770 (0.0007) [2023-03-07 17:01:19,462][232226] Updated weights for policy 0, policy_version 36780 (0.0006) [2023-03-07 17:01:20,069][231894] Fps is (10 sec: 12799.3, 60 sec: 12885.2, 300 sec: 12881.6). Total num frames: 37669888. Throughput: 0: 12883.2. Samples: 37648150. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:01:20,070][231894] Avg episode reward: [(0, '182.865')] [2023-03-07 17:01:20,253][232226] Updated weights for policy 0, policy_version 36790 (0.0006) [2023-03-07 17:01:21,039][232226] Updated weights for policy 0, policy_version 36800 (0.0006) [2023-03-07 17:01:21,863][232226] Updated weights for policy 0, policy_version 36810 (0.0006) [2023-03-07 17:01:22,642][232226] Updated weights for policy 0, policy_version 36820 (0.0005) [2023-03-07 17:01:23,437][232226] Updated weights for policy 0, policy_version 36830 (0.0006) [2023-03-07 17:01:24,252][232226] Updated weights for policy 0, policy_version 36840 (0.0007) [2023-03-07 17:01:25,038][232226] Updated weights for policy 0, policy_version 36850 (0.0006) [2023-03-07 17:01:25,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.4, 300 sec: 12881.6). Total num frames: 37734400. Throughput: 0: 12874.3. Samples: 37725105. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:01:25,069][231894] Avg episode reward: [(0, '190.128')] [2023-03-07 17:01:25,843][232226] Updated weights for policy 0, policy_version 36860 (0.0007) [2023-03-07 17:01:26,640][232226] Updated weights for policy 0, policy_version 36870 (0.0006) [2023-03-07 17:01:27,419][232226] Updated weights for policy 0, policy_version 36880 (0.0006) [2023-03-07 17:01:28,225][232226] Updated weights for policy 0, policy_version 36890 (0.0007) [2023-03-07 17:01:29,006][232226] Updated weights for policy 0, policy_version 36900 (0.0006) [2023-03-07 17:01:29,798][232226] Updated weights for policy 0, policy_version 36910 (0.0006) [2023-03-07 17:01:30,069][231894] Fps is (10 sec: 12903.0, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 37798912. Throughput: 0: 12873.9. Samples: 37763771. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:01:30,070][231894] Avg episode reward: [(0, '188.565')] [2023-03-07 17:01:30,613][232226] Updated weights for policy 0, policy_version 36920 (0.0007) [2023-03-07 17:01:31,406][232226] Updated weights for policy 0, policy_version 36930 (0.0006) [2023-03-07 17:01:32,202][232226] Updated weights for policy 0, policy_version 36940 (0.0006) [2023-03-07 17:01:33,016][232226] Updated weights for policy 0, policy_version 36950 (0.0007) [2023-03-07 17:01:33,803][232226] Updated weights for policy 0, policy_version 36960 (0.0005) [2023-03-07 17:01:34,598][232226] Updated weights for policy 0, policy_version 36970 (0.0007) [2023-03-07 17:01:35,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 37863424. Throughput: 0: 12864.2. Samples: 37840741. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:01:35,069][231894] Avg episode reward: [(0, '195.647')] [2023-03-07 17:01:35,393][232226] Updated weights for policy 0, policy_version 36980 (0.0006) [2023-03-07 17:01:36,186][232226] Updated weights for policy 0, policy_version 36990 (0.0006) [2023-03-07 17:01:36,990][232226] Updated weights for policy 0, policy_version 37000 (0.0006) [2023-03-07 17:01:37,783][232226] Updated weights for policy 0, policy_version 37010 (0.0006) [2023-03-07 17:01:38,588][232226] Updated weights for policy 0, policy_version 37020 (0.0007) [2023-03-07 17:01:39,385][232226] Updated weights for policy 0, policy_version 37030 (0.0006) [2023-03-07 17:01:40,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12868.2, 300 sec: 12878.1). Total num frames: 37926912. Throughput: 0: 12862.6. Samples: 37917868. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:01:40,070][231894] Avg episode reward: [(0, '194.870')] [2023-03-07 17:01:40,167][232226] Updated weights for policy 0, policy_version 37040 (0.0006) [2023-03-07 17:01:40,937][232226] Updated weights for policy 0, policy_version 37050 (0.0006) [2023-03-07 17:01:41,758][232226] Updated weights for policy 0, policy_version 37060 (0.0006) [2023-03-07 17:01:42,543][232226] Updated weights for policy 0, policy_version 37070 (0.0006) [2023-03-07 17:01:43,334][232226] Updated weights for policy 0, policy_version 37080 (0.0006) [2023-03-07 17:01:44,127][232226] Updated weights for policy 0, policy_version 37090 (0.0006) [2023-03-07 17:01:44,920][232226] Updated weights for policy 0, policy_version 37100 (0.0006) [2023-03-07 17:01:45,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12881.6). Total num frames: 37991424. Throughput: 0: 12862.0. Samples: 37956706. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:01:45,069][231894] Avg episode reward: [(0, '189.525')] [2023-03-07 17:01:45,707][232226] Updated weights for policy 0, policy_version 37110 (0.0007) [2023-03-07 17:01:46,497][232226] Updated weights for policy 0, policy_version 37120 (0.0006) [2023-03-07 17:01:47,290][232226] Updated weights for policy 0, policy_version 37130 (0.0006) [2023-03-07 17:01:48,102][232226] Updated weights for policy 0, policy_version 37140 (0.0006) [2023-03-07 17:01:48,894][232226] Updated weights for policy 0, policy_version 37150 (0.0006) [2023-03-07 17:01:49,685][232226] Updated weights for policy 0, policy_version 37160 (0.0006) [2023-03-07 17:01:50,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12878.1). Total num frames: 38055936. Throughput: 0: 12861.4. Samples: 38034025. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:01:50,069][231894] Avg episode reward: [(0, '193.169')] [2023-03-07 17:01:50,477][232226] Updated weights for policy 0, policy_version 37170 (0.0006) [2023-03-07 17:01:51,260][232226] Updated weights for policy 0, policy_version 37180 (0.0006) [2023-03-07 17:01:52,061][232226] Updated weights for policy 0, policy_version 37190 (0.0006) [2023-03-07 17:01:52,841][232226] Updated weights for policy 0, policy_version 37200 (0.0006) [2023-03-07 17:01:53,649][232226] Updated weights for policy 0, policy_version 37210 (0.0007) [2023-03-07 17:01:54,438][232226] Updated weights for policy 0, policy_version 37220 (0.0006) [2023-03-07 17:01:55,069][231894] Fps is (10 sec: 13004.9, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 38121472. Throughput: 0: 12873.0. Samples: 38111617. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:01:55,069][231894] Avg episode reward: [(0, '188.491')] [2023-03-07 17:01:55,226][232226] Updated weights for policy 0, policy_version 37230 (0.0007) [2023-03-07 17:01:56,030][232226] Updated weights for policy 0, policy_version 37240 (0.0006) [2023-03-07 17:01:56,851][232226] Updated weights for policy 0, policy_version 37250 (0.0007) [2023-03-07 17:01:57,642][232226] Updated weights for policy 0, policy_version 37260 (0.0006) [2023-03-07 17:01:58,413][232226] Updated weights for policy 0, policy_version 37270 (0.0007) [2023-03-07 17:01:59,214][232226] Updated weights for policy 0, policy_version 37280 (0.0006) [2023-03-07 17:02:00,002][232226] Updated weights for policy 0, policy_version 37290 (0.0006) [2023-03-07 17:02:00,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12881.6). Total num frames: 38184960. Throughput: 0: 12875.4. Samples: 38150167. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:02:00,069][231894] Avg episode reward: [(0, '188.758')] [2023-03-07 17:02:00,795][232226] Updated weights for policy 0, policy_version 37300 (0.0007) [2023-03-07 17:02:01,595][232226] Updated weights for policy 0, policy_version 37310 (0.0006) [2023-03-07 17:02:02,372][232226] Updated weights for policy 0, policy_version 37320 (0.0006) [2023-03-07 17:02:03,185][232226] Updated weights for policy 0, policy_version 37330 (0.0006) [2023-03-07 17:02:03,966][232226] Updated weights for policy 0, policy_version 37340 (0.0006) [2023-03-07 17:02:04,752][232226] Updated weights for policy 0, policy_version 37350 (0.0007) [2023-03-07 17:02:05,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12868.3, 300 sec: 12878.1). Total num frames: 38249472. Throughput: 0: 12879.1. Samples: 38227700. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:02:05,069][231894] Avg episode reward: [(0, '188.101')] [2023-03-07 17:02:05,561][232226] Updated weights for policy 0, policy_version 37360 (0.0007) [2023-03-07 17:02:06,355][232226] Updated weights for policy 0, policy_version 37370 (0.0006) [2023-03-07 17:02:07,134][232226] Updated weights for policy 0, policy_version 37380 (0.0007) [2023-03-07 17:02:07,918][232226] Updated weights for policy 0, policy_version 37390 (0.0006) [2023-03-07 17:02:08,709][232226] Updated weights for policy 0, policy_version 37400 (0.0006) [2023-03-07 17:02:09,478][232226] Updated weights for policy 0, policy_version 37410 (0.0006) [2023-03-07 17:02:10,069][231894] Fps is (10 sec: 13004.9, 60 sec: 12885.4, 300 sec: 12885.0). Total num frames: 38315008. Throughput: 0: 12897.0. Samples: 38305472. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:02:10,069][231894] Avg episode reward: [(0, '194.609')] [2023-03-07 17:02:10,285][232226] Updated weights for policy 0, policy_version 37420 (0.0007) [2023-03-07 17:02:11,079][232226] Updated weights for policy 0, policy_version 37430 (0.0007) [2023-03-07 17:02:11,870][232226] Updated weights for policy 0, policy_version 37440 (0.0007) [2023-03-07 17:02:12,670][232226] Updated weights for policy 0, policy_version 37450 (0.0007) [2023-03-07 17:02:13,473][232226] Updated weights for policy 0, policy_version 37460 (0.0005) [2023-03-07 17:02:14,269][232226] Updated weights for policy 0, policy_version 37470 (0.0006) [2023-03-07 17:02:15,055][232226] Updated weights for policy 0, policy_version 37480 (0.0006) [2023-03-07 17:02:15,069][231894] Fps is (10 sec: 13004.9, 60 sec: 12902.4, 300 sec: 12885.0). Total num frames: 38379520. Throughput: 0: 12899.2. Samples: 38344235. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:02:15,069][231894] Avg episode reward: [(0, '194.295')] [2023-03-07 17:02:15,868][232226] Updated weights for policy 0, policy_version 37490 (0.0007) [2023-03-07 17:02:16,642][232226] Updated weights for policy 0, policy_version 37500 (0.0007) [2023-03-07 17:02:17,433][232226] Updated weights for policy 0, policy_version 37510 (0.0007) [2023-03-07 17:02:18,223][232226] Updated weights for policy 0, policy_version 37520 (0.0006) [2023-03-07 17:02:19,027][232226] Updated weights for policy 0, policy_version 37530 (0.0008) [2023-03-07 17:02:19,823][232226] Updated weights for policy 0, policy_version 37540 (0.0006) [2023-03-07 17:02:20,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.5, 300 sec: 12885.0). Total num frames: 38444032. Throughput: 0: 12904.8. Samples: 38421459. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:02:20,069][231894] Avg episode reward: [(0, '186.198')] [2023-03-07 17:02:20,605][232226] Updated weights for policy 0, policy_version 37550 (0.0006) [2023-03-07 17:02:21,421][232226] Updated weights for policy 0, policy_version 37560 (0.0006) [2023-03-07 17:02:22,226][232226] Updated weights for policy 0, policy_version 37570 (0.0006) [2023-03-07 17:02:23,011][232226] Updated weights for policy 0, policy_version 37580 (0.0007) [2023-03-07 17:02:23,821][232226] Updated weights for policy 0, policy_version 37590 (0.0007) [2023-03-07 17:02:24,616][232226] Updated weights for policy 0, policy_version 37600 (0.0006) [2023-03-07 17:02:25,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 38507520. Throughput: 0: 12902.1. Samples: 38498461. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:02:25,069][231894] Avg episode reward: [(0, '191.383')] [2023-03-07 17:02:25,088][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000037606_38508544.pth... [2023-03-07 17:02:25,117][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000034586_35416064.pth [2023-03-07 17:02:25,402][232226] Updated weights for policy 0, policy_version 37610 (0.0007) [2023-03-07 17:02:26,219][232226] Updated weights for policy 0, policy_version 37620 (0.0007) [2023-03-07 17:02:27,014][232226] Updated weights for policy 0, policy_version 37630 (0.0006) [2023-03-07 17:02:27,805][232226] Updated weights for policy 0, policy_version 37640 (0.0006) [2023-03-07 17:02:28,601][232226] Updated weights for policy 0, policy_version 37650 (0.0006) [2023-03-07 17:02:29,395][232226] Updated weights for policy 0, policy_version 37660 (0.0006) [2023-03-07 17:02:30,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12885.4, 300 sec: 12881.6). Total num frames: 38572032. Throughput: 0: 12891.8. Samples: 38536835. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:02:30,069][231894] Avg episode reward: [(0, '194.903')] [2023-03-07 17:02:30,201][232226] Updated weights for policy 0, policy_version 37670 (0.0007) [2023-03-07 17:02:30,988][232226] Updated weights for policy 0, policy_version 37680 (0.0006) [2023-03-07 17:02:31,779][232226] Updated weights for policy 0, policy_version 37690 (0.0007) [2023-03-07 17:02:32,582][232226] Updated weights for policy 0, policy_version 37700 (0.0006) [2023-03-07 17:02:33,368][232226] Updated weights for policy 0, policy_version 37710 (0.0008) [2023-03-07 17:02:34,169][232226] Updated weights for policy 0, policy_version 37720 (0.0007) [2023-03-07 17:02:34,982][232226] Updated weights for policy 0, policy_version 37730 (0.0007) [2023-03-07 17:02:35,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12885.0). Total num frames: 38636544. Throughput: 0: 12892.6. Samples: 38614195. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:02:35,070][231894] Avg episode reward: [(0, '192.003')] [2023-03-07 17:02:35,783][232226] Updated weights for policy 0, policy_version 37740 (0.0006) [2023-03-07 17:02:36,574][232226] Updated weights for policy 0, policy_version 37750 (0.0007) [2023-03-07 17:02:37,359][232226] Updated weights for policy 0, policy_version 37760 (0.0006) [2023-03-07 17:02:38,174][232226] Updated weights for policy 0, policy_version 37770 (0.0007) [2023-03-07 17:02:38,962][232226] Updated weights for policy 0, policy_version 37780 (0.0006) [2023-03-07 17:02:39,757][232226] Updated weights for policy 0, policy_version 37790 (0.0006) [2023-03-07 17:02:40,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12885.4, 300 sec: 12881.6). Total num frames: 38700032. Throughput: 0: 12877.1. Samples: 38691086. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:02:40,069][231894] Avg episode reward: [(0, '196.082')] [2023-03-07 17:02:40,554][232226] Updated weights for policy 0, policy_version 37800 (0.0006) [2023-03-07 17:02:41,353][232226] Updated weights for policy 0, policy_version 37810 (0.0006) [2023-03-07 17:02:42,152][232226] Updated weights for policy 0, policy_version 37820 (0.0006) [2023-03-07 17:02:42,950][232226] Updated weights for policy 0, policy_version 37830 (0.0007) [2023-03-07 17:02:43,721][232226] Updated weights for policy 0, policy_version 37840 (0.0007) [2023-03-07 17:02:44,522][232226] Updated weights for policy 0, policy_version 37850 (0.0006) [2023-03-07 17:02:45,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 38764544. Throughput: 0: 12878.6. Samples: 38729707. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:02:45,069][231894] Avg episode reward: [(0, '200.325')] [2023-03-07 17:02:45,327][232226] Updated weights for policy 0, policy_version 37860 (0.0006) [2023-03-07 17:02:46,112][232226] Updated weights for policy 0, policy_version 37870 (0.0007) [2023-03-07 17:02:46,925][232226] Updated weights for policy 0, policy_version 37880 (0.0006) [2023-03-07 17:02:47,717][232226] Updated weights for policy 0, policy_version 37890 (0.0006) [2023-03-07 17:02:48,525][232226] Updated weights for policy 0, policy_version 37900 (0.0007) [2023-03-07 17:02:49,317][232226] Updated weights for policy 0, policy_version 37910 (0.0006) [2023-03-07 17:02:50,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12885.0). Total num frames: 38829056. Throughput: 0: 12870.4. Samples: 38806865. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:02:50,069][231894] Avg episode reward: [(0, '185.430')] [2023-03-07 17:02:50,106][232226] Updated weights for policy 0, policy_version 37920 (0.0006) [2023-03-07 17:02:50,905][232226] Updated weights for policy 0, policy_version 37930 (0.0007) [2023-03-07 17:02:51,709][232226] Updated weights for policy 0, policy_version 37940 (0.0006) [2023-03-07 17:02:52,501][232226] Updated weights for policy 0, policy_version 37950 (0.0007) [2023-03-07 17:02:53,275][232226] Updated weights for policy 0, policy_version 37960 (0.0007) [2023-03-07 17:02:54,090][232226] Updated weights for policy 0, policy_version 37970 (0.0007) [2023-03-07 17:02:54,883][232226] Updated weights for policy 0, policy_version 37980 (0.0006) [2023-03-07 17:02:55,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12885.0). Total num frames: 38893568. Throughput: 0: 12859.6. Samples: 38884155. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:02:55,069][231894] Avg episode reward: [(0, '196.531')] [2023-03-07 17:02:55,685][232226] Updated weights for policy 0, policy_version 37990 (0.0006) [2023-03-07 17:02:56,462][232226] Updated weights for policy 0, policy_version 38000 (0.0006) [2023-03-07 17:02:57,251][232226] Updated weights for policy 0, policy_version 38010 (0.0006) [2023-03-07 17:02:58,029][232226] Updated weights for policy 0, policy_version 38020 (0.0007) [2023-03-07 17:02:58,843][232226] Updated weights for policy 0, policy_version 38030 (0.0005) [2023-03-07 17:02:59,634][232226] Updated weights for policy 0, policy_version 38040 (0.0007) [2023-03-07 17:03:00,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12885.0). Total num frames: 38958080. Throughput: 0: 12859.8. Samples: 38922925. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:03:00,069][231894] Avg episode reward: [(0, '190.708')] [2023-03-07 17:03:00,428][232226] Updated weights for policy 0, policy_version 38050 (0.0006) [2023-03-07 17:03:01,202][232226] Updated weights for policy 0, policy_version 38060 (0.0006) [2023-03-07 17:03:02,010][232226] Updated weights for policy 0, policy_version 38070 (0.0006) [2023-03-07 17:03:02,814][232226] Updated weights for policy 0, policy_version 38080 (0.0006) [2023-03-07 17:03:03,612][232226] Updated weights for policy 0, policy_version 38090 (0.0006) [2023-03-07 17:03:04,386][232226] Updated weights for policy 0, policy_version 38100 (0.0007) [2023-03-07 17:03:05,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12885.0). Total num frames: 39022592. Throughput: 0: 12866.5. Samples: 39000451. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:03:05,069][231894] Avg episode reward: [(0, '196.274')] [2023-03-07 17:03:05,183][232226] Updated weights for policy 0, policy_version 38110 (0.0006) [2023-03-07 17:03:05,966][232226] Updated weights for policy 0, policy_version 38120 (0.0006) [2023-03-07 17:03:06,757][232226] Updated weights for policy 0, policy_version 38130 (0.0005) [2023-03-07 17:03:07,550][232226] Updated weights for policy 0, policy_version 38140 (0.0007) [2023-03-07 17:03:08,341][232226] Updated weights for policy 0, policy_version 38150 (0.0006) [2023-03-07 17:03:09,141][232226] Updated weights for policy 0, policy_version 38160 (0.0007) [2023-03-07 17:03:09,935][232226] Updated weights for policy 0, policy_version 38170 (0.0007) [2023-03-07 17:03:10,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.2, 300 sec: 12885.0). Total num frames: 39087104. Throughput: 0: 12879.7. Samples: 39078047. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:03:10,069][231894] Avg episode reward: [(0, '186.511')] [2023-03-07 17:03:10,730][232226] Updated weights for policy 0, policy_version 38180 (0.0007) [2023-03-07 17:03:11,531][232226] Updated weights for policy 0, policy_version 38190 (0.0007) [2023-03-07 17:03:12,327][232226] Updated weights for policy 0, policy_version 38200 (0.0006) [2023-03-07 17:03:13,123][232226] Updated weights for policy 0, policy_version 38210 (0.0006) [2023-03-07 17:03:13,906][232226] Updated weights for policy 0, policy_version 38220 (0.0006) [2023-03-07 17:03:14,696][232226] Updated weights for policy 0, policy_version 38230 (0.0006) [2023-03-07 17:03:15,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12885.0). Total num frames: 39151616. Throughput: 0: 12883.9. Samples: 39116610. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:03:15,069][231894] Avg episode reward: [(0, '195.573')] [2023-03-07 17:03:15,485][232226] Updated weights for policy 0, policy_version 38240 (0.0007) [2023-03-07 17:03:16,285][232226] Updated weights for policy 0, policy_version 38250 (0.0007) [2023-03-07 17:03:17,083][232226] Updated weights for policy 0, policy_version 38260 (0.0008) [2023-03-07 17:03:17,893][232226] Updated weights for policy 0, policy_version 38270 (0.0007) [2023-03-07 17:03:18,677][232226] Updated weights for policy 0, policy_version 38280 (0.0006) [2023-03-07 17:03:19,467][232226] Updated weights for policy 0, policy_version 38290 (0.0006) [2023-03-07 17:03:20,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12885.0). Total num frames: 39216128. Throughput: 0: 12884.3. Samples: 39193987. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:03:20,069][231894] Avg episode reward: [(0, '193.905')] [2023-03-07 17:03:20,268][232226] Updated weights for policy 0, policy_version 38300 (0.0006) [2023-03-07 17:03:21,042][232226] Updated weights for policy 0, policy_version 38310 (0.0006) [2023-03-07 17:03:21,841][232226] Updated weights for policy 0, policy_version 38320 (0.0006) [2023-03-07 17:03:22,626][232226] Updated weights for policy 0, policy_version 38330 (0.0007) [2023-03-07 17:03:23,411][232226] Updated weights for policy 0, policy_version 38340 (0.0006) [2023-03-07 17:03:24,196][232226] Updated weights for policy 0, policy_version 38350 (0.0007) [2023-03-07 17:03:24,987][232226] Updated weights for policy 0, policy_version 38360 (0.0006) [2023-03-07 17:03:25,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 39280640. Throughput: 0: 12903.8. Samples: 39271758. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:03:25,070][231894] Avg episode reward: [(0, '186.925')] [2023-03-07 17:03:25,782][232226] Updated weights for policy 0, policy_version 38370 (0.0006) [2023-03-07 17:03:26,577][232226] Updated weights for policy 0, policy_version 38380 (0.0007) [2023-03-07 17:03:27,363][232226] Updated weights for policy 0, policy_version 38390 (0.0006) [2023-03-07 17:03:28,181][232226] Updated weights for policy 0, policy_version 38400 (0.0007) [2023-03-07 17:03:28,974][232226] Updated weights for policy 0, policy_version 38410 (0.0006) [2023-03-07 17:03:29,741][232226] Updated weights for policy 0, policy_version 38420 (0.0006) [2023-03-07 17:03:30,069][231894] Fps is (10 sec: 13004.8, 60 sec: 12902.4, 300 sec: 12888.5). Total num frames: 39346176. Throughput: 0: 12907.9. Samples: 39310561. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:03:30,069][231894] Avg episode reward: [(0, '191.561')] [2023-03-07 17:03:30,558][232226] Updated weights for policy 0, policy_version 38430 (0.0007) [2023-03-07 17:03:31,333][232226] Updated weights for policy 0, policy_version 38440 (0.0006) [2023-03-07 17:03:32,138][232226] Updated weights for policy 0, policy_version 38450 (0.0006) [2023-03-07 17:03:32,911][232226] Updated weights for policy 0, policy_version 38460 (0.0007) [2023-03-07 17:03:33,710][232226] Updated weights for policy 0, policy_version 38470 (0.0005) [2023-03-07 17:03:34,482][232226] Updated weights for policy 0, policy_version 38480 (0.0007) [2023-03-07 17:03:35,069][231894] Fps is (10 sec: 13004.8, 60 sec: 12902.4, 300 sec: 12888.5). Total num frames: 39410688. Throughput: 0: 12915.8. Samples: 39388077. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:03:35,069][231894] Avg episode reward: [(0, '192.757')] [2023-03-07 17:03:35,275][232226] Updated weights for policy 0, policy_version 38490 (0.0006) [2023-03-07 17:03:36,067][232226] Updated weights for policy 0, policy_version 38500 (0.0007) [2023-03-07 17:03:36,845][232226] Updated weights for policy 0, policy_version 38510 (0.0006) [2023-03-07 17:03:37,650][232226] Updated weights for policy 0, policy_version 38520 (0.0006) [2023-03-07 17:03:38,457][232226] Updated weights for policy 0, policy_version 38530 (0.0005) [2023-03-07 17:03:39,242][232226] Updated weights for policy 0, policy_version 38540 (0.0007) [2023-03-07 17:03:40,031][232226] Updated weights for policy 0, policy_version 38550 (0.0007) [2023-03-07 17:03:40,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12919.5, 300 sec: 12888.5). Total num frames: 39475200. Throughput: 0: 12926.4. Samples: 39465843. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:03:40,069][231894] Avg episode reward: [(0, '191.475')] [2023-03-07 17:03:40,827][232226] Updated weights for policy 0, policy_version 38560 (0.0006) [2023-03-07 17:03:41,609][232226] Updated weights for policy 0, policy_version 38570 (0.0008) [2023-03-07 17:03:42,412][232226] Updated weights for policy 0, policy_version 38580 (0.0006) [2023-03-07 17:03:43,202][232226] Updated weights for policy 0, policy_version 38590 (0.0006) [2023-03-07 17:03:43,977][232226] Updated weights for policy 0, policy_version 38600 (0.0006) [2023-03-07 17:03:44,793][232226] Updated weights for policy 0, policy_version 38610 (0.0006) [2023-03-07 17:03:45,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12919.5, 300 sec: 12888.5). Total num frames: 39539712. Throughput: 0: 12926.3. Samples: 39504607. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:03:45,069][231894] Avg episode reward: [(0, '187.062')] [2023-03-07 17:03:45,607][232226] Updated weights for policy 0, policy_version 38620 (0.0006) [2023-03-07 17:03:46,389][232226] Updated weights for policy 0, policy_version 38630 (0.0007) [2023-03-07 17:03:47,200][232226] Updated weights for policy 0, policy_version 38640 (0.0006) [2023-03-07 17:03:48,002][232226] Updated weights for policy 0, policy_version 38650 (0.0006) [2023-03-07 17:03:48,771][232226] Updated weights for policy 0, policy_version 38660 (0.0005) [2023-03-07 17:03:49,576][232226] Updated weights for policy 0, policy_version 38670 (0.0007) [2023-03-07 17:03:50,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12919.5, 300 sec: 12892.0). Total num frames: 39604224. Throughput: 0: 12916.2. Samples: 39581682. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:03:50,080][231894] Avg episode reward: [(0, '193.027')] [2023-03-07 17:03:50,365][232226] Updated weights for policy 0, policy_version 38680 (0.0006) [2023-03-07 17:03:51,142][232226] Updated weights for policy 0, policy_version 38690 (0.0007) [2023-03-07 17:03:51,934][232226] Updated weights for policy 0, policy_version 38700 (0.0006) [2023-03-07 17:03:52,718][232226] Updated weights for policy 0, policy_version 38710 (0.0006) [2023-03-07 17:03:53,509][232226] Updated weights for policy 0, policy_version 38720 (0.0006) [2023-03-07 17:03:54,301][232226] Updated weights for policy 0, policy_version 38730 (0.0007) [2023-03-07 17:03:55,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12919.5, 300 sec: 12892.0). Total num frames: 39668736. Throughput: 0: 12923.5. Samples: 39659603. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:03:55,080][231894] Avg episode reward: [(0, '195.708')] [2023-03-07 17:03:55,108][232226] Updated weights for policy 0, policy_version 38740 (0.0006) [2023-03-07 17:03:55,894][232226] Updated weights for policy 0, policy_version 38750 (0.0006) [2023-03-07 17:03:56,690][232226] Updated weights for policy 0, policy_version 38760 (0.0006) [2023-03-07 17:03:57,502][232226] Updated weights for policy 0, policy_version 38770 (0.0007) [2023-03-07 17:03:58,300][232226] Updated weights for policy 0, policy_version 38780 (0.0006) [2023-03-07 17:03:59,102][232226] Updated weights for policy 0, policy_version 38790 (0.0006) [2023-03-07 17:03:59,884][232226] Updated weights for policy 0, policy_version 38800 (0.0007) [2023-03-07 17:04:00,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12919.5, 300 sec: 12892.0). Total num frames: 39733248. Throughput: 0: 12917.2. Samples: 39697887. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 17:04:00,080][231894] Avg episode reward: [(0, '195.619')] [2023-03-07 17:04:00,693][232226] Updated weights for policy 0, policy_version 38810 (0.0007) [2023-03-07 17:04:01,486][232226] Updated weights for policy 0, policy_version 38820 (0.0007) [2023-03-07 17:04:02,288][232226] Updated weights for policy 0, policy_version 38830 (0.0006) [2023-03-07 17:04:03,084][232226] Updated weights for policy 0, policy_version 38840 (0.0006) [2023-03-07 17:04:03,885][232226] Updated weights for policy 0, policy_version 38850 (0.0006) [2023-03-07 17:04:04,696][232226] Updated weights for policy 0, policy_version 38860 (0.0006) [2023-03-07 17:04:05,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12902.4, 300 sec: 12888.5). Total num frames: 39796736. Throughput: 0: 12913.8. Samples: 39775110. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 17:04:05,078][231894] Avg episode reward: [(0, '189.940')] [2023-03-07 17:04:05,489][232226] Updated weights for policy 0, policy_version 38870 (0.0007) [2023-03-07 17:04:06,259][232226] Updated weights for policy 0, policy_version 38880 (0.0006) [2023-03-07 17:04:07,050][232226] Updated weights for policy 0, policy_version 38890 (0.0006) [2023-03-07 17:04:07,845][232226] Updated weights for policy 0, policy_version 38900 (0.0006) [2023-03-07 17:04:08,639][232226] Updated weights for policy 0, policy_version 38910 (0.0006) [2023-03-07 17:04:09,442][232226] Updated weights for policy 0, policy_version 38920 (0.0006) [2023-03-07 17:04:10,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12919.5, 300 sec: 12892.0). Total num frames: 39862272. Throughput: 0: 12902.2. Samples: 39852356. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 17:04:10,080][231894] Avg episode reward: [(0, '197.976')] [2023-03-07 17:04:10,236][232226] Updated weights for policy 0, policy_version 38930 (0.0006) [2023-03-07 17:04:11,029][232226] Updated weights for policy 0, policy_version 38940 (0.0006) [2023-03-07 17:04:11,840][232226] Updated weights for policy 0, policy_version 38950 (0.0006) [2023-03-07 17:04:12,613][232226] Updated weights for policy 0, policy_version 38960 (0.0006) [2023-03-07 17:04:13,431][232226] Updated weights for policy 0, policy_version 38970 (0.0007) [2023-03-07 17:04:14,233][232226] Updated weights for policy 0, policy_version 38980 (0.0006) [2023-03-07 17:04:15,002][232226] Updated weights for policy 0, policy_version 38990 (0.0006) [2023-03-07 17:04:15,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12885.0). Total num frames: 39925760. Throughput: 0: 12898.6. Samples: 39891000. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 17:04:15,080][231894] Avg episode reward: [(0, '192.997')] [2023-03-07 17:04:15,806][232226] Updated weights for policy 0, policy_version 39000 (0.0006) [2023-03-07 17:04:16,610][232226] Updated weights for policy 0, policy_version 39010 (0.0007) [2023-03-07 17:04:17,407][232226] Updated weights for policy 0, policy_version 39020 (0.0007) [2023-03-07 17:04:18,192][232226] Updated weights for policy 0, policy_version 39030 (0.0006) [2023-03-07 17:04:18,994][232226] Updated weights for policy 0, policy_version 39040 (0.0006) [2023-03-07 17:04:19,789][232226] Updated weights for policy 0, policy_version 39050 (0.0006) [2023-03-07 17:04:20,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12902.4, 300 sec: 12885.0). Total num frames: 39990272. Throughput: 0: 12892.5. Samples: 39968237. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 17:04:20,080][231894] Avg episode reward: [(0, '189.916')] [2023-03-07 17:04:20,574][232226] Updated weights for policy 0, policy_version 39060 (0.0007) [2023-03-07 17:04:21,385][232226] Updated weights for policy 0, policy_version 39070 (0.0007) [2023-03-07 17:04:22,168][232226] Updated weights for policy 0, policy_version 39080 (0.0006) [2023-03-07 17:04:22,953][232226] Updated weights for policy 0, policy_version 39090 (0.0006) [2023-03-07 17:04:23,749][232226] Updated weights for policy 0, policy_version 39100 (0.0006) [2023-03-07 17:04:24,559][232226] Updated weights for policy 0, policy_version 39110 (0.0007) [2023-03-07 17:04:25,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12888.5). Total num frames: 40054784. Throughput: 0: 12880.9. Samples: 40045487. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:04:25,069][231894] Avg episode reward: [(0, '192.420')] [2023-03-07 17:04:25,073][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000039116_40054784.pth... [2023-03-07 17:04:25,104][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000036095_36961280.pth [2023-03-07 17:04:25,367][232226] Updated weights for policy 0, policy_version 39120 (0.0007) [2023-03-07 17:04:26,155][232226] Updated weights for policy 0, policy_version 39130 (0.0007) [2023-03-07 17:04:26,945][232226] Updated weights for policy 0, policy_version 39140 (0.0006) [2023-03-07 17:04:27,729][232226] Updated weights for policy 0, policy_version 39150 (0.0006) [2023-03-07 17:04:28,527][232226] Updated weights for policy 0, policy_version 39160 (0.0007) [2023-03-07 17:04:29,319][232226] Updated weights for policy 0, policy_version 39170 (0.0006) [2023-03-07 17:04:30,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12888.5). Total num frames: 40119296. Throughput: 0: 12875.1. Samples: 40083989. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:04:30,080][231894] Avg episode reward: [(0, '187.706')] [2023-03-07 17:04:30,118][232226] Updated weights for policy 0, policy_version 39180 (0.0006) [2023-03-07 17:04:30,923][232226] Updated weights for policy 0, policy_version 39190 (0.0006) [2023-03-07 17:04:31,723][232226] Updated weights for policy 0, policy_version 39200 (0.0007) [2023-03-07 17:04:32,512][232226] Updated weights for policy 0, policy_version 39210 (0.0006) [2023-03-07 17:04:33,318][232226] Updated weights for policy 0, policy_version 39220 (0.0006) [2023-03-07 17:04:34,106][232226] Updated weights for policy 0, policy_version 39230 (0.0007) [2023-03-07 17:04:34,903][232226] Updated weights for policy 0, policy_version 39240 (0.0006) [2023-03-07 17:04:35,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12885.0). Total num frames: 40182784. Throughput: 0: 12879.6. Samples: 40161263. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:04:35,080][231894] Avg episode reward: [(0, '192.534')] [2023-03-07 17:04:35,702][232226] Updated weights for policy 0, policy_version 39250 (0.0006) [2023-03-07 17:04:36,508][232226] Updated weights for policy 0, policy_version 39260 (0.0006) [2023-03-07 17:04:37,280][232226] Updated weights for policy 0, policy_version 39270 (0.0006) [2023-03-07 17:04:38,062][232226] Updated weights for policy 0, policy_version 39280 (0.0007) [2023-03-07 17:04:38,875][232226] Updated weights for policy 0, policy_version 39290 (0.0007) [2023-03-07 17:04:39,662][232226] Updated weights for policy 0, policy_version 39300 (0.0007) [2023-03-07 17:04:40,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12885.0). Total num frames: 40248320. Throughput: 0: 12866.0. Samples: 40238575. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:04:40,080][231894] Avg episode reward: [(0, '192.907')] [2023-03-07 17:04:40,449][232226] Updated weights for policy 0, policy_version 39310 (0.0007) [2023-03-07 17:04:41,249][232226] Updated weights for policy 0, policy_version 39320 (0.0006) [2023-03-07 17:04:42,033][232226] Updated weights for policy 0, policy_version 39330 (0.0006) [2023-03-07 17:04:42,822][232226] Updated weights for policy 0, policy_version 39340 (0.0006) [2023-03-07 17:04:43,628][232226] Updated weights for policy 0, policy_version 39350 (0.0006) [2023-03-07 17:04:44,412][232226] Updated weights for policy 0, policy_version 39360 (0.0006) [2023-03-07 17:04:45,069][231894] Fps is (10 sec: 13004.8, 60 sec: 12885.3, 300 sec: 12888.5). Total num frames: 40312832. Throughput: 0: 12879.2. Samples: 40277449. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:04:45,080][231894] Avg episode reward: [(0, '187.153')] [2023-03-07 17:04:45,190][232226] Updated weights for policy 0, policy_version 39370 (0.0006) [2023-03-07 17:04:45,993][232226] Updated weights for policy 0, policy_version 39380 (0.0006) [2023-03-07 17:04:46,785][232226] Updated weights for policy 0, policy_version 39390 (0.0006) [2023-03-07 17:04:47,591][232226] Updated weights for policy 0, policy_version 39400 (0.0006) [2023-03-07 17:04:48,376][232226] Updated weights for policy 0, policy_version 39410 (0.0006) [2023-03-07 17:04:49,192][232226] Updated weights for policy 0, policy_version 39420 (0.0006) [2023-03-07 17:04:49,964][232226] Updated weights for policy 0, policy_version 39430 (0.0007) [2023-03-07 17:04:50,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12888.5). Total num frames: 40377344. Throughput: 0: 12884.4. Samples: 40354906. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:04:50,080][231894] Avg episode reward: [(0, '185.405')] [2023-03-07 17:04:50,755][232226] Updated weights for policy 0, policy_version 39440 (0.0007) [2023-03-07 17:04:51,554][232226] Updated weights for policy 0, policy_version 39450 (0.0006) [2023-03-07 17:04:52,350][232226] Updated weights for policy 0, policy_version 39460 (0.0006) [2023-03-07 17:04:53,133][232226] Updated weights for policy 0, policy_version 39470 (0.0006) [2023-03-07 17:04:53,944][232226] Updated weights for policy 0, policy_version 39480 (0.0007) [2023-03-07 17:04:54,741][232226] Updated weights for policy 0, policy_version 39490 (0.0006) [2023-03-07 17:04:55,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12888.5). Total num frames: 40441856. Throughput: 0: 12884.4. Samples: 40432155. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:04:55,080][231894] Avg episode reward: [(0, '188.724')] [2023-03-07 17:04:55,529][232226] Updated weights for policy 0, policy_version 39500 (0.0006) [2023-03-07 17:04:56,323][232226] Updated weights for policy 0, policy_version 39510 (0.0006) [2023-03-07 17:04:57,101][232226] Updated weights for policy 0, policy_version 39520 (0.0008) [2023-03-07 17:04:57,897][232226] Updated weights for policy 0, policy_version 39530 (0.0006) [2023-03-07 17:04:58,678][232226] Updated weights for policy 0, policy_version 39540 (0.0007) [2023-03-07 17:04:59,491][232226] Updated weights for policy 0, policy_version 39550 (0.0006) [2023-03-07 17:05:00,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12888.5). Total num frames: 40506368. Throughput: 0: 12890.3. Samples: 40471066. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:05:00,069][231894] Avg episode reward: [(0, '190.716')] [2023-03-07 17:05:00,293][232226] Updated weights for policy 0, policy_version 39560 (0.0006) [2023-03-07 17:05:01,078][232226] Updated weights for policy 0, policy_version 39570 (0.0007) [2023-03-07 17:05:01,865][232226] Updated weights for policy 0, policy_version 39580 (0.0007) [2023-03-07 17:05:02,641][232226] Updated weights for policy 0, policy_version 39590 (0.0006) [2023-03-07 17:05:03,450][232226] Updated weights for policy 0, policy_version 39600 (0.0007) [2023-03-07 17:05:04,223][232226] Updated weights for policy 0, policy_version 39610 (0.0005) [2023-03-07 17:05:05,017][232226] Updated weights for policy 0, policy_version 39620 (0.0007) [2023-03-07 17:05:05,069][231894] Fps is (10 sec: 12902.2, 60 sec: 12902.4, 300 sec: 12892.0). Total num frames: 40570880. Throughput: 0: 12896.7. Samples: 40548592. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:05:05,069][231894] Avg episode reward: [(0, '190.229')] [2023-03-07 17:05:05,805][232226] Updated weights for policy 0, policy_version 39630 (0.0007) [2023-03-07 17:05:06,598][232226] Updated weights for policy 0, policy_version 39640 (0.0007) [2023-03-07 17:05:07,401][232226] Updated weights for policy 0, policy_version 39650 (0.0006) [2023-03-07 17:05:08,190][232226] Updated weights for policy 0, policy_version 39660 (0.0006) [2023-03-07 17:05:08,986][232226] Updated weights for policy 0, policy_version 39670 (0.0007) [2023-03-07 17:05:09,789][232226] Updated weights for policy 0, policy_version 39680 (0.0006) [2023-03-07 17:05:10,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12892.0). Total num frames: 40635392. Throughput: 0: 12905.2. Samples: 40626222. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:05:10,069][231894] Avg episode reward: [(0, '185.630')] [2023-03-07 17:05:10,561][232226] Updated weights for policy 0, policy_version 39690 (0.0006) [2023-03-07 17:05:11,361][232226] Updated weights for policy 0, policy_version 39700 (0.0006) [2023-03-07 17:05:12,159][232226] Updated weights for policy 0, policy_version 39710 (0.0007) [2023-03-07 17:05:12,957][232226] Updated weights for policy 0, policy_version 39720 (0.0006) [2023-03-07 17:05:13,765][232226] Updated weights for policy 0, policy_version 39730 (0.0006) [2023-03-07 17:05:14,535][232226] Updated weights for policy 0, policy_version 39740 (0.0006) [2023-03-07 17:05:15,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12902.4, 300 sec: 12892.0). Total num frames: 40699904. Throughput: 0: 12913.7. Samples: 40665103. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:05:15,069][231894] Avg episode reward: [(0, '192.495')] [2023-03-07 17:05:15,330][232226] Updated weights for policy 0, policy_version 39750 (0.0006) [2023-03-07 17:05:16,121][232226] Updated weights for policy 0, policy_version 39760 (0.0006) [2023-03-07 17:05:16,905][232226] Updated weights for policy 0, policy_version 39770 (0.0006) [2023-03-07 17:05:17,706][232226] Updated weights for policy 0, policy_version 39780 (0.0006) [2023-03-07 17:05:18,514][232226] Updated weights for policy 0, policy_version 39790 (0.0007) [2023-03-07 17:05:19,306][232226] Updated weights for policy 0, policy_version 39800 (0.0006) [2023-03-07 17:05:20,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12892.0). Total num frames: 40764416. Throughput: 0: 12910.6. Samples: 40742242. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:05:20,070][231894] Avg episode reward: [(0, '195.804')] [2023-03-07 17:05:20,098][232226] Updated weights for policy 0, policy_version 39810 (0.0007) [2023-03-07 17:05:20,902][232226] Updated weights for policy 0, policy_version 39820 (0.0006) [2023-03-07 17:05:21,711][232226] Updated weights for policy 0, policy_version 39830 (0.0007) [2023-03-07 17:05:22,506][232226] Updated weights for policy 0, policy_version 39840 (0.0006) [2023-03-07 17:05:23,300][232226] Updated weights for policy 0, policy_version 39850 (0.0006) [2023-03-07 17:05:24,103][232226] Updated weights for policy 0, policy_version 39860 (0.0006) [2023-03-07 17:05:24,890][232226] Updated weights for policy 0, policy_version 39870 (0.0007) [2023-03-07 17:05:25,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12892.0). Total num frames: 40828928. Throughput: 0: 12904.0. Samples: 40819254. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:05:25,070][231894] Avg episode reward: [(0, '191.153')] [2023-03-07 17:05:25,693][232226] Updated weights for policy 0, policy_version 39880 (0.0006) [2023-03-07 17:05:26,488][232226] Updated weights for policy 0, policy_version 39890 (0.0006) [2023-03-07 17:05:27,263][232226] Updated weights for policy 0, policy_version 39900 (0.0007) [2023-03-07 17:05:28,076][232226] Updated weights for policy 0, policy_version 39910 (0.0006) [2023-03-07 17:05:28,873][232226] Updated weights for policy 0, policy_version 39920 (0.0007) [2023-03-07 17:05:29,653][232226] Updated weights for policy 0, policy_version 39930 (0.0007) [2023-03-07 17:05:30,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12892.0). Total num frames: 40893440. Throughput: 0: 12903.3. Samples: 40858099. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:05:30,069][231894] Avg episode reward: [(0, '193.982')] [2023-03-07 17:05:30,451][232226] Updated weights for policy 0, policy_version 39940 (0.0007) [2023-03-07 17:05:31,243][232226] Updated weights for policy 0, policy_version 39950 (0.0006) [2023-03-07 17:05:32,036][232226] Updated weights for policy 0, policy_version 39960 (0.0007) [2023-03-07 17:05:32,838][232226] Updated weights for policy 0, policy_version 39970 (0.0006) [2023-03-07 17:05:33,627][232226] Updated weights for policy 0, policy_version 39980 (0.0006) [2023-03-07 17:05:34,410][232226] Updated weights for policy 0, policy_version 39990 (0.0006) [2023-03-07 17:05:35,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12919.5, 300 sec: 12892.0). Total num frames: 40957952. Throughput: 0: 12899.1. Samples: 40935365. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:05:35,070][231894] Avg episode reward: [(0, '190.391')] [2023-03-07 17:05:35,207][232226] Updated weights for policy 0, policy_version 40000 (0.0006) [2023-03-07 17:05:36,005][232226] Updated weights for policy 0, policy_version 40010 (0.0007) [2023-03-07 17:05:36,800][232226] Updated weights for policy 0, policy_version 40020 (0.0006) [2023-03-07 17:05:37,597][232226] Updated weights for policy 0, policy_version 40030 (0.0007) [2023-03-07 17:05:38,414][232226] Updated weights for policy 0, policy_version 40040 (0.0006) [2023-03-07 17:05:39,199][232226] Updated weights for policy 0, policy_version 40050 (0.0005) [2023-03-07 17:05:40,007][232226] Updated weights for policy 0, policy_version 40060 (0.0006) [2023-03-07 17:05:40,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12885.3, 300 sec: 12888.5). Total num frames: 41021440. Throughput: 0: 12896.5. Samples: 41012498. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:05:40,070][231894] Avg episode reward: [(0, '197.208')] [2023-03-07 17:05:40,797][232226] Updated weights for policy 0, policy_version 40070 (0.0006) [2023-03-07 17:05:41,617][232226] Updated weights for policy 0, policy_version 40080 (0.0006) [2023-03-07 17:05:42,403][232226] Updated weights for policy 0, policy_version 40090 (0.0006) [2023-03-07 17:05:43,209][232226] Updated weights for policy 0, policy_version 40100 (0.0006) [2023-03-07 17:05:44,003][232226] Updated weights for policy 0, policy_version 40110 (0.0006) [2023-03-07 17:05:44,788][232226] Updated weights for policy 0, policy_version 40120 (0.0007) [2023-03-07 17:05:45,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12885.3, 300 sec: 12888.5). Total num frames: 41085952. Throughput: 0: 12884.6. Samples: 41050875. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:05:45,070][231894] Avg episode reward: [(0, '194.309')] [2023-03-07 17:05:45,586][232226] Updated weights for policy 0, policy_version 40130 (0.0006) [2023-03-07 17:05:46,395][232226] Updated weights for policy 0, policy_version 40140 (0.0006) [2023-03-07 17:05:47,171][232226] Updated weights for policy 0, policy_version 40150 (0.0007) [2023-03-07 17:05:47,991][232226] Updated weights for policy 0, policy_version 40160 (0.0006) [2023-03-07 17:05:48,778][232226] Updated weights for policy 0, policy_version 40170 (0.0006) [2023-03-07 17:05:49,572][232226] Updated weights for policy 0, policy_version 40180 (0.0006) [2023-03-07 17:05:50,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12888.5). Total num frames: 41150464. Throughput: 0: 12875.6. Samples: 41127991. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:05:50,069][231894] Avg episode reward: [(0, '202.059')] [2023-03-07 17:05:50,377][232226] Updated weights for policy 0, policy_version 40190 (0.0006) [2023-03-07 17:05:51,166][232226] Updated weights for policy 0, policy_version 40200 (0.0006) [2023-03-07 17:05:51,959][232226] Updated weights for policy 0, policy_version 40210 (0.0006) [2023-03-07 17:05:52,764][232226] Updated weights for policy 0, policy_version 40220 (0.0006) [2023-03-07 17:05:53,548][232226] Updated weights for policy 0, policy_version 40230 (0.0006) [2023-03-07 17:05:54,352][232226] Updated weights for policy 0, policy_version 40240 (0.0007) [2023-03-07 17:05:55,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12888.5). Total num frames: 41214976. Throughput: 0: 12867.4. Samples: 41205256. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:05:55,069][231894] Avg episode reward: [(0, '187.309')] [2023-03-07 17:05:55,139][232226] Updated weights for policy 0, policy_version 40250 (0.0006) [2023-03-07 17:05:55,935][232226] Updated weights for policy 0, policy_version 40260 (0.0007) [2023-03-07 17:05:56,734][232226] Updated weights for policy 0, policy_version 40270 (0.0007) [2023-03-07 17:05:57,522][232226] Updated weights for policy 0, policy_version 40280 (0.0006) [2023-03-07 17:05:58,332][232226] Updated weights for policy 0, policy_version 40290 (0.0006) [2023-03-07 17:05:59,129][232226] Updated weights for policy 0, policy_version 40300 (0.0006) [2023-03-07 17:05:59,897][232226] Updated weights for policy 0, policy_version 40310 (0.0006) [2023-03-07 17:06:00,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12888.5). Total num frames: 41279488. Throughput: 0: 12863.4. Samples: 41243957. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:06:00,069][231894] Avg episode reward: [(0, '194.667')] [2023-03-07 17:06:00,708][232226] Updated weights for policy 0, policy_version 40320 (0.0007) [2023-03-07 17:06:01,495][232226] Updated weights for policy 0, policy_version 40330 (0.0006) [2023-03-07 17:06:02,285][232226] Updated weights for policy 0, policy_version 40340 (0.0006) [2023-03-07 17:06:03,082][232226] Updated weights for policy 0, policy_version 40350 (0.0006) [2023-03-07 17:06:03,863][232226] Updated weights for policy 0, policy_version 40360 (0.0006) [2023-03-07 17:06:04,648][232226] Updated weights for policy 0, policy_version 40370 (0.0006) [2023-03-07 17:06:05,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.4, 300 sec: 12888.5). Total num frames: 41344000. Throughput: 0: 12869.8. Samples: 41321383. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:06:05,069][231894] Avg episode reward: [(0, '192.106')] [2023-03-07 17:06:05,457][232226] Updated weights for policy 0, policy_version 40380 (0.0006) [2023-03-07 17:06:06,253][232226] Updated weights for policy 0, policy_version 40390 (0.0006) [2023-03-07 17:06:07,055][232226] Updated weights for policy 0, policy_version 40400 (0.0007) [2023-03-07 17:06:07,837][232226] Updated weights for policy 0, policy_version 40410 (0.0007) [2023-03-07 17:06:08,629][232226] Updated weights for policy 0, policy_version 40420 (0.0006) [2023-03-07 17:06:09,405][232226] Updated weights for policy 0, policy_version 40430 (0.0006) [2023-03-07 17:06:10,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12892.0). Total num frames: 41408512. Throughput: 0: 12879.5. Samples: 41398832. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:06:10,069][231894] Avg episode reward: [(0, '190.438')] [2023-03-07 17:06:10,203][232226] Updated weights for policy 0, policy_version 40440 (0.0006) [2023-03-07 17:06:10,999][232226] Updated weights for policy 0, policy_version 40450 (0.0007) [2023-03-07 17:06:11,789][232226] Updated weights for policy 0, policy_version 40460 (0.0006) [2023-03-07 17:06:12,589][232226] Updated weights for policy 0, policy_version 40470 (0.0006) [2023-03-07 17:06:13,382][232226] Updated weights for policy 0, policy_version 40480 (0.0007) [2023-03-07 17:06:14,173][232226] Updated weights for policy 0, policy_version 40490 (0.0006) [2023-03-07 17:06:14,975][232226] Updated weights for policy 0, policy_version 40500 (0.0006) [2023-03-07 17:06:15,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12892.0). Total num frames: 41473024. Throughput: 0: 12878.8. Samples: 41437647. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:06:15,069][231894] Avg episode reward: [(0, '198.268')] [2023-03-07 17:06:15,778][232226] Updated weights for policy 0, policy_version 40510 (0.0006) [2023-03-07 17:06:16,563][232226] Updated weights for policy 0, policy_version 40520 (0.0007) [2023-03-07 17:06:17,354][232226] Updated weights for policy 0, policy_version 40530 (0.0006) [2023-03-07 17:06:18,162][232226] Updated weights for policy 0, policy_version 40540 (0.0005) [2023-03-07 17:06:18,954][232226] Updated weights for policy 0, policy_version 40550 (0.0006) [2023-03-07 17:06:19,737][232226] Updated weights for policy 0, policy_version 40560 (0.0007) [2023-03-07 17:06:20,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12868.3, 300 sec: 12888.5). Total num frames: 41536512. Throughput: 0: 12880.7. Samples: 41514996. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:06:20,080][231894] Avg episode reward: [(0, '195.369')] [2023-03-07 17:06:20,548][232226] Updated weights for policy 0, policy_version 40570 (0.0006) [2023-03-07 17:06:21,360][232226] Updated weights for policy 0, policy_version 40580 (0.0006) [2023-03-07 17:06:22,132][232226] Updated weights for policy 0, policy_version 40590 (0.0007) [2023-03-07 17:06:22,925][232226] Updated weights for policy 0, policy_version 40600 (0.0007) [2023-03-07 17:06:23,741][232226] Updated weights for policy 0, policy_version 40610 (0.0006) [2023-03-07 17:06:24,541][232226] Updated weights for policy 0, policy_version 40620 (0.0007) [2023-03-07 17:06:25,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12888.5). Total num frames: 41601024. Throughput: 0: 12875.5. Samples: 41591897. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:06:25,080][231894] Avg episode reward: [(0, '191.649')] [2023-03-07 17:06:25,085][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000040626_41601024.pth... [2023-03-07 17:06:25,119][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000037606_38508544.pth [2023-03-07 17:06:25,336][232226] Updated weights for policy 0, policy_version 40630 (0.0006) [2023-03-07 17:06:26,129][232226] Updated weights for policy 0, policy_version 40640 (0.0006) [2023-03-07 17:06:26,926][232226] Updated weights for policy 0, policy_version 40650 (0.0007) [2023-03-07 17:06:27,717][232226] Updated weights for policy 0, policy_version 40660 (0.0007) [2023-03-07 17:06:28,507][232226] Updated weights for policy 0, policy_version 40670 (0.0006) [2023-03-07 17:06:29,305][232226] Updated weights for policy 0, policy_version 40680 (0.0006) [2023-03-07 17:06:30,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12888.5). Total num frames: 41665536. Throughput: 0: 12882.0. Samples: 41630565. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:06:30,080][231894] Avg episode reward: [(0, '188.586')] [2023-03-07 17:06:30,090][232226] Updated weights for policy 0, policy_version 40690 (0.0006) [2023-03-07 17:06:30,890][232226] Updated weights for policy 0, policy_version 40700 (0.0006) [2023-03-07 17:06:31,685][232226] Updated weights for policy 0, policy_version 40710 (0.0007) [2023-03-07 17:06:32,489][232226] Updated weights for policy 0, policy_version 40720 (0.0006) [2023-03-07 17:06:33,286][232226] Updated weights for policy 0, policy_version 40730 (0.0006) [2023-03-07 17:06:34,058][232226] Updated weights for policy 0, policy_version 40740 (0.0006) [2023-03-07 17:06:34,859][232226] Updated weights for policy 0, policy_version 40750 (0.0007) [2023-03-07 17:06:35,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12892.0). Total num frames: 41730048. Throughput: 0: 12887.9. Samples: 41707949. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:06:35,080][231894] Avg episode reward: [(0, '195.589')] [2023-03-07 17:06:35,661][232226] Updated weights for policy 0, policy_version 40760 (0.0007) [2023-03-07 17:06:36,453][232226] Updated weights for policy 0, policy_version 40770 (0.0006) [2023-03-07 17:06:37,241][232226] Updated weights for policy 0, policy_version 40780 (0.0006) [2023-03-07 17:06:38,026][232226] Updated weights for policy 0, policy_version 40790 (0.0007) [2023-03-07 17:06:38,830][232226] Updated weights for policy 0, policy_version 40800 (0.0007) [2023-03-07 17:06:39,619][232226] Updated weights for policy 0, policy_version 40810 (0.0007) [2023-03-07 17:06:40,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12892.0). Total num frames: 41794560. Throughput: 0: 12890.8. Samples: 41785343. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:06:40,070][231894] Avg episode reward: [(0, '187.955')] [2023-03-07 17:06:40,412][232226] Updated weights for policy 0, policy_version 40820 (0.0007) [2023-03-07 17:06:41,204][232226] Updated weights for policy 0, policy_version 40830 (0.0007) [2023-03-07 17:06:42,002][232226] Updated weights for policy 0, policy_version 40840 (0.0006) [2023-03-07 17:06:42,791][232226] Updated weights for policy 0, policy_version 40850 (0.0006) [2023-03-07 17:06:43,578][232226] Updated weights for policy 0, policy_version 40860 (0.0006) [2023-03-07 17:06:44,370][232226] Updated weights for policy 0, policy_version 40870 (0.0006) [2023-03-07 17:06:45,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12892.0). Total num frames: 41859072. Throughput: 0: 12893.0. Samples: 41824143. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:06:45,069][231894] Avg episode reward: [(0, '192.550')] [2023-03-07 17:06:45,191][232226] Updated weights for policy 0, policy_version 40880 (0.0006) [2023-03-07 17:06:46,005][232226] Updated weights for policy 0, policy_version 40890 (0.0006) [2023-03-07 17:06:46,793][232226] Updated weights for policy 0, policy_version 40900 (0.0006) [2023-03-07 17:06:47,604][232226] Updated weights for policy 0, policy_version 40910 (0.0006) [2023-03-07 17:06:48,402][232226] Updated weights for policy 0, policy_version 40920 (0.0007) [2023-03-07 17:06:49,193][232226] Updated weights for policy 0, policy_version 40930 (0.0006) [2023-03-07 17:06:50,006][232226] Updated weights for policy 0, policy_version 40940 (0.0006) [2023-03-07 17:06:50,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12868.3, 300 sec: 12885.0). Total num frames: 41922560. Throughput: 0: 12881.0. Samples: 41901026. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:06:50,069][231894] Avg episode reward: [(0, '181.672')] [2023-03-07 17:06:50,799][232226] Updated weights for policy 0, policy_version 40950 (0.0006) [2023-03-07 17:06:51,594][232226] Updated weights for policy 0, policy_version 40960 (0.0006) [2023-03-07 17:06:52,388][232226] Updated weights for policy 0, policy_version 40970 (0.0006) [2023-03-07 17:06:53,174][232226] Updated weights for policy 0, policy_version 40980 (0.0007) [2023-03-07 17:06:53,977][232226] Updated weights for policy 0, policy_version 40990 (0.0006) [2023-03-07 17:06:54,775][232226] Updated weights for policy 0, policy_version 41000 (0.0006) [2023-03-07 17:06:55,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12868.3, 300 sec: 12888.5). Total num frames: 41987072. Throughput: 0: 12872.3. Samples: 41978085. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:06:55,069][231894] Avg episode reward: [(0, '198.141')] [2023-03-07 17:06:55,562][232226] Updated weights for policy 0, policy_version 41010 (0.0006) [2023-03-07 17:06:56,373][232226] Updated weights for policy 0, policy_version 41020 (0.0006) [2023-03-07 17:06:57,171][232226] Updated weights for policy 0, policy_version 41030 (0.0007) [2023-03-07 17:06:57,965][232226] Updated weights for policy 0, policy_version 41040 (0.0006) [2023-03-07 17:06:58,743][232226] Updated weights for policy 0, policy_version 41050 (0.0006) [2023-03-07 17:06:59,547][232226] Updated weights for policy 0, policy_version 41060 (0.0006) [2023-03-07 17:07:00,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12888.5). Total num frames: 42051584. Throughput: 0: 12869.4. Samples: 42016768. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:07:00,069][231894] Avg episode reward: [(0, '192.517')] [2023-03-07 17:07:00,357][232226] Updated weights for policy 0, policy_version 41070 (0.0007) [2023-03-07 17:07:01,124][232226] Updated weights for policy 0, policy_version 41080 (0.0006) [2023-03-07 17:07:01,942][232226] Updated weights for policy 0, policy_version 41090 (0.0007) [2023-03-07 17:07:02,719][232226] Updated weights for policy 0, policy_version 41100 (0.0006) [2023-03-07 17:07:03,509][232226] Updated weights for policy 0, policy_version 41110 (0.0005) [2023-03-07 17:07:04,310][232226] Updated weights for policy 0, policy_version 41120 (0.0007) [2023-03-07 17:07:05,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12885.0). Total num frames: 42116096. Throughput: 0: 12866.9. Samples: 42094007. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:07:05,069][231894] Avg episode reward: [(0, '192.218')] [2023-03-07 17:07:05,115][232226] Updated weights for policy 0, policy_version 41130 (0.0006) [2023-03-07 17:07:05,896][232226] Updated weights for policy 0, policy_version 41140 (0.0007) [2023-03-07 17:07:06,697][232226] Updated weights for policy 0, policy_version 41150 (0.0006) [2023-03-07 17:07:07,491][232226] Updated weights for policy 0, policy_version 41160 (0.0006) [2023-03-07 17:07:08,291][232226] Updated weights for policy 0, policy_version 41170 (0.0006) [2023-03-07 17:07:09,086][232226] Updated weights for policy 0, policy_version 41180 (0.0006) [2023-03-07 17:07:09,908][232226] Updated weights for policy 0, policy_version 41190 (0.0007) [2023-03-07 17:07:10,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12885.0). Total num frames: 42180608. Throughput: 0: 12871.5. Samples: 42171116. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:07:10,069][231894] Avg episode reward: [(0, '192.079')] [2023-03-07 17:07:10,690][232226] Updated weights for policy 0, policy_version 41200 (0.0006) [2023-03-07 17:07:11,477][232226] Updated weights for policy 0, policy_version 41210 (0.0007) [2023-03-07 17:07:12,269][232226] Updated weights for policy 0, policy_version 41220 (0.0006) [2023-03-07 17:07:13,074][232226] Updated weights for policy 0, policy_version 41230 (0.0006) [2023-03-07 17:07:13,863][232226] Updated weights for policy 0, policy_version 41240 (0.0007) [2023-03-07 17:07:14,654][232226] Updated weights for policy 0, policy_version 41250 (0.0006) [2023-03-07 17:07:15,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12885.0). Total num frames: 42245120. Throughput: 0: 12870.4. Samples: 42209731. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:07:15,069][231894] Avg episode reward: [(0, '194.938')] [2023-03-07 17:07:15,457][232226] Updated weights for policy 0, policy_version 41260 (0.0006) [2023-03-07 17:07:16,259][232226] Updated weights for policy 0, policy_version 41270 (0.0006) [2023-03-07 17:07:17,034][232226] Updated weights for policy 0, policy_version 41280 (0.0006) [2023-03-07 17:07:17,842][232226] Updated weights for policy 0, policy_version 41290 (0.0006) [2023-03-07 17:07:18,625][232226] Updated weights for policy 0, policy_version 41300 (0.0006) [2023-03-07 17:07:19,413][232226] Updated weights for policy 0, policy_version 41310 (0.0006) [2023-03-07 17:07:20,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12888.5). Total num frames: 42309632. Throughput: 0: 12872.5. Samples: 42287210. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:07:20,069][231894] Avg episode reward: [(0, '196.760')] [2023-03-07 17:07:20,214][232226] Updated weights for policy 0, policy_version 41320 (0.0006) [2023-03-07 17:07:21,007][232226] Updated weights for policy 0, policy_version 41330 (0.0006) [2023-03-07 17:07:21,806][232226] Updated weights for policy 0, policy_version 41340 (0.0007) [2023-03-07 17:07:22,594][232226] Updated weights for policy 0, policy_version 41350 (0.0006) [2023-03-07 17:07:23,393][232226] Updated weights for policy 0, policy_version 41360 (0.0007) [2023-03-07 17:07:24,197][232226] Updated weights for policy 0, policy_version 41370 (0.0006) [2023-03-07 17:07:24,972][232226] Updated weights for policy 0, policy_version 41380 (0.0006) [2023-03-07 17:07:25,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12888.5). Total num frames: 42374144. Throughput: 0: 12870.3. Samples: 42364505. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:07:25,069][231894] Avg episode reward: [(0, '188.731')] [2023-03-07 17:07:25,790][232226] Updated weights for policy 0, policy_version 41390 (0.0006) [2023-03-07 17:07:26,577][232226] Updated weights for policy 0, policy_version 41400 (0.0006) [2023-03-07 17:07:27,380][232226] Updated weights for policy 0, policy_version 41410 (0.0007) [2023-03-07 17:07:28,181][232226] Updated weights for policy 0, policy_version 41420 (0.0006) [2023-03-07 17:07:28,974][232226] Updated weights for policy 0, policy_version 41430 (0.0006) [2023-03-07 17:07:29,782][232226] Updated weights for policy 0, policy_version 41440 (0.0007) [2023-03-07 17:07:30,069][231894] Fps is (10 sec: 12800.2, 60 sec: 12868.3, 300 sec: 12885.1). Total num frames: 42437632. Throughput: 0: 12862.0. Samples: 42402933. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:07:30,069][231894] Avg episode reward: [(0, '191.637')] [2023-03-07 17:07:30,579][232226] Updated weights for policy 0, policy_version 41450 (0.0006) [2023-03-07 17:07:31,368][232226] Updated weights for policy 0, policy_version 41460 (0.0006) [2023-03-07 17:07:32,169][232226] Updated weights for policy 0, policy_version 41470 (0.0006) [2023-03-07 17:07:32,961][232226] Updated weights for policy 0, policy_version 41480 (0.0006) [2023-03-07 17:07:33,765][232226] Updated weights for policy 0, policy_version 41490 (0.0006) [2023-03-07 17:07:34,579][232226] Updated weights for policy 0, policy_version 41500 (0.0006) [2023-03-07 17:07:35,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12888.5). Total num frames: 42502144. Throughput: 0: 12865.4. Samples: 42479971. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:07:35,069][231894] Avg episode reward: [(0, '189.037')] [2023-03-07 17:07:35,372][232226] Updated weights for policy 0, policy_version 41510 (0.0007) [2023-03-07 17:07:36,174][232226] Updated weights for policy 0, policy_version 41520 (0.0006) [2023-03-07 17:07:36,960][232226] Updated weights for policy 0, policy_version 41530 (0.0006) [2023-03-07 17:07:37,754][232226] Updated weights for policy 0, policy_version 41540 (0.0006) [2023-03-07 17:07:38,563][232226] Updated weights for policy 0, policy_version 41550 (0.0006) [2023-03-07 17:07:39,352][232226] Updated weights for policy 0, policy_version 41560 (0.0006) [2023-03-07 17:07:40,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12851.2, 300 sec: 12885.0). Total num frames: 42565632. Throughput: 0: 12861.8. Samples: 42556868. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:07:40,069][231894] Avg episode reward: [(0, '183.863')] [2023-03-07 17:07:40,161][232226] Updated weights for policy 0, policy_version 41570 (0.0006) [2023-03-07 17:07:40,956][232226] Updated weights for policy 0, policy_version 41580 (0.0007) [2023-03-07 17:07:41,735][232226] Updated weights for policy 0, policy_version 41590 (0.0006) [2023-03-07 17:07:42,537][232226] Updated weights for policy 0, policy_version 41600 (0.0006) [2023-03-07 17:07:43,321][232226] Updated weights for policy 0, policy_version 41610 (0.0006) [2023-03-07 17:07:44,119][232226] Updated weights for policy 0, policy_version 41620 (0.0007) [2023-03-07 17:07:44,929][232226] Updated weights for policy 0, policy_version 41630 (0.0006) [2023-03-07 17:07:45,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12885.0). Total num frames: 42630144. Throughput: 0: 12863.3. Samples: 42595618. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:07:45,069][231894] Avg episode reward: [(0, '189.446')] [2023-03-07 17:07:45,718][232226] Updated weights for policy 0, policy_version 41640 (0.0007) [2023-03-07 17:07:46,526][232226] Updated weights for policy 0, policy_version 41650 (0.0007) [2023-03-07 17:07:47,329][232226] Updated weights for policy 0, policy_version 41660 (0.0005) [2023-03-07 17:07:48,108][232226] Updated weights for policy 0, policy_version 41670 (0.0007) [2023-03-07 17:07:48,901][232226] Updated weights for policy 0, policy_version 41680 (0.0006) [2023-03-07 17:07:49,700][232226] Updated weights for policy 0, policy_version 41690 (0.0006) [2023-03-07 17:07:50,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12885.0). Total num frames: 42694656. Throughput: 0: 12855.7. Samples: 42672512. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:07:50,069][231894] Avg episode reward: [(0, '194.203')] [2023-03-07 17:07:50,479][232226] Updated weights for policy 0, policy_version 41700 (0.0006) [2023-03-07 17:07:51,298][232226] Updated weights for policy 0, policy_version 41710 (0.0006) [2023-03-07 17:07:52,088][232226] Updated weights for policy 0, policy_version 41720 (0.0006) [2023-03-07 17:07:52,889][232226] Updated weights for policy 0, policy_version 41730 (0.0006) [2023-03-07 17:07:53,695][232226] Updated weights for policy 0, policy_version 41740 (0.0007) [2023-03-07 17:07:54,482][232226] Updated weights for policy 0, policy_version 41750 (0.0006) [2023-03-07 17:07:55,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12885.0). Total num frames: 42759168. Throughput: 0: 12859.1. Samples: 42749776. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:07:55,069][231894] Avg episode reward: [(0, '195.590')] [2023-03-07 17:07:55,278][232226] Updated weights for policy 0, policy_version 41760 (0.0007) [2023-03-07 17:07:56,070][232226] Updated weights for policy 0, policy_version 41770 (0.0006) [2023-03-07 17:07:56,881][232226] Updated weights for policy 0, policy_version 41780 (0.0006) [2023-03-07 17:07:57,661][232226] Updated weights for policy 0, policy_version 41790 (0.0007) [2023-03-07 17:07:58,462][232226] Updated weights for policy 0, policy_version 41800 (0.0007) [2023-03-07 17:07:59,265][232226] Updated weights for policy 0, policy_version 41810 (0.0006) [2023-03-07 17:08:00,058][232226] Updated weights for policy 0, policy_version 41820 (0.0006) [2023-03-07 17:08:00,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12885.0). Total num frames: 42823680. Throughput: 0: 12859.2. Samples: 42788396. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:08:00,069][231894] Avg episode reward: [(0, '194.849')] [2023-03-07 17:08:00,858][232226] Updated weights for policy 0, policy_version 41830 (0.0006) [2023-03-07 17:08:01,649][232226] Updated weights for policy 0, policy_version 41840 (0.0006) [2023-03-07 17:08:02,465][232226] Updated weights for policy 0, policy_version 41850 (0.0006) [2023-03-07 17:08:03,259][232226] Updated weights for policy 0, policy_version 41860 (0.0007) [2023-03-07 17:08:04,053][232226] Updated weights for policy 0, policy_version 41870 (0.0007) [2023-03-07 17:08:04,832][232226] Updated weights for policy 0, policy_version 41880 (0.0007) [2023-03-07 17:08:05,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12885.0). Total num frames: 42888192. Throughput: 0: 12846.7. Samples: 42865310. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:08:05,080][231894] Avg episode reward: [(0, '187.004')] [2023-03-07 17:08:05,629][232226] Updated weights for policy 0, policy_version 41890 (0.0006) [2023-03-07 17:08:06,411][232226] Updated weights for policy 0, policy_version 41900 (0.0006) [2023-03-07 17:08:07,218][232226] Updated weights for policy 0, policy_version 41910 (0.0006) [2023-03-07 17:08:08,012][232226] Updated weights for policy 0, policy_version 41920 (0.0006) [2023-03-07 17:08:08,788][232226] Updated weights for policy 0, policy_version 41930 (0.0006) [2023-03-07 17:08:09,581][232226] Updated weights for policy 0, policy_version 41940 (0.0006) [2023-03-07 17:08:10,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12885.0). Total num frames: 42952704. Throughput: 0: 12855.8. Samples: 42943015. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:08:10,080][231894] Avg episode reward: [(0, '191.332')] [2023-03-07 17:08:10,380][232226] Updated weights for policy 0, policy_version 41950 (0.0006) [2023-03-07 17:08:11,155][232226] Updated weights for policy 0, policy_version 41960 (0.0006) [2023-03-07 17:08:11,945][232226] Updated weights for policy 0, policy_version 41970 (0.0008) [2023-03-07 17:08:12,755][232226] Updated weights for policy 0, policy_version 41980 (0.0006) [2023-03-07 17:08:13,546][232226] Updated weights for policy 0, policy_version 41990 (0.0006) [2023-03-07 17:08:14,341][232226] Updated weights for policy 0, policy_version 42000 (0.0006) [2023-03-07 17:08:15,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12885.0). Total num frames: 43017216. Throughput: 0: 12863.5. Samples: 42981789. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:08:15,080][231894] Avg episode reward: [(0, '189.807')] [2023-03-07 17:08:15,146][232226] Updated weights for policy 0, policy_version 42010 (0.0007) [2023-03-07 17:08:15,927][232226] Updated weights for policy 0, policy_version 42020 (0.0005) [2023-03-07 17:08:16,730][232226] Updated weights for policy 0, policy_version 42030 (0.0006) [2023-03-07 17:08:17,505][232226] Updated weights for policy 0, policy_version 42040 (0.0006) [2023-03-07 17:08:18,304][232226] Updated weights for policy 0, policy_version 42050 (0.0006) [2023-03-07 17:08:19,092][232226] Updated weights for policy 0, policy_version 42060 (0.0006) [2023-03-07 17:08:19,877][232226] Updated weights for policy 0, policy_version 42070 (0.0006) [2023-03-07 17:08:20,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12885.0). Total num frames: 43081728. Throughput: 0: 12872.2. Samples: 43059222. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:08:20,070][231894] Avg episode reward: [(0, '180.690')] [2023-03-07 17:08:20,693][232226] Updated weights for policy 0, policy_version 42080 (0.0006) [2023-03-07 17:08:21,470][232226] Updated weights for policy 0, policy_version 42090 (0.0007) [2023-03-07 17:08:22,289][232226] Updated weights for policy 0, policy_version 42100 (0.0006) [2023-03-07 17:08:23,078][232226] Updated weights for policy 0, policy_version 42110 (0.0006) [2023-03-07 17:08:23,872][232226] Updated weights for policy 0, policy_version 42120 (0.0006) [2023-03-07 17:08:24,658][232226] Updated weights for policy 0, policy_version 42130 (0.0006) [2023-03-07 17:08:25,069][231894] Fps is (10 sec: 12902.2, 60 sec: 12868.2, 300 sec: 12881.6). Total num frames: 43146240. Throughput: 0: 12884.2. Samples: 43136657. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:08:25,070][231894] Avg episode reward: [(0, '196.233')] [2023-03-07 17:08:25,074][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000042135_43146240.pth... [2023-03-07 17:08:25,105][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000039116_40054784.pth [2023-03-07 17:08:25,452][232226] Updated weights for policy 0, policy_version 42140 (0.0006) [2023-03-07 17:08:26,229][232226] Updated weights for policy 0, policy_version 42150 (0.0005) [2023-03-07 17:08:27,025][232226] Updated weights for policy 0, policy_version 42160 (0.0007) [2023-03-07 17:08:27,807][232226] Updated weights for policy 0, policy_version 42170 (0.0007) [2023-03-07 17:08:28,612][232226] Updated weights for policy 0, policy_version 42180 (0.0006) [2023-03-07 17:08:29,386][232226] Updated weights for policy 0, policy_version 42190 (0.0005) [2023-03-07 17:08:30,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 43210752. Throughput: 0: 12885.9. Samples: 43175485. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:08:30,069][231894] Avg episode reward: [(0, '196.608')] [2023-03-07 17:08:30,196][232226] Updated weights for policy 0, policy_version 42200 (0.0006) [2023-03-07 17:08:30,998][232226] Updated weights for policy 0, policy_version 42210 (0.0007) [2023-03-07 17:08:31,785][232226] Updated weights for policy 0, policy_version 42220 (0.0006) [2023-03-07 17:08:32,586][232226] Updated weights for policy 0, policy_version 42230 (0.0007) [2023-03-07 17:08:33,389][232226] Updated weights for policy 0, policy_version 42240 (0.0007) [2023-03-07 17:08:34,181][232226] Updated weights for policy 0, policy_version 42250 (0.0007) [2023-03-07 17:08:34,983][232226] Updated weights for policy 0, policy_version 42260 (0.0007) [2023-03-07 17:08:35,069][231894] Fps is (10 sec: 12800.2, 60 sec: 12868.3, 300 sec: 12878.1). Total num frames: 43274240. Throughput: 0: 12892.2. Samples: 43252663. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:08:35,069][231894] Avg episode reward: [(0, '197.638')] [2023-03-07 17:08:35,790][232226] Updated weights for policy 0, policy_version 42270 (0.0006) [2023-03-07 17:08:36,585][232226] Updated weights for policy 0, policy_version 42280 (0.0006) [2023-03-07 17:08:37,373][232226] Updated weights for policy 0, policy_version 42290 (0.0006) [2023-03-07 17:08:38,176][232226] Updated weights for policy 0, policy_version 42300 (0.0006) [2023-03-07 17:08:38,964][232226] Updated weights for policy 0, policy_version 42310 (0.0006) [2023-03-07 17:08:39,755][232226] Updated weights for policy 0, policy_version 42320 (0.0006) [2023-03-07 17:08:40,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12885.4, 300 sec: 12878.1). Total num frames: 43338752. Throughput: 0: 12890.1. Samples: 43329832. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:08:40,069][231894] Avg episode reward: [(0, '197.937')] [2023-03-07 17:08:40,545][232226] Updated weights for policy 0, policy_version 42330 (0.0007) [2023-03-07 17:08:41,348][232226] Updated weights for policy 0, policy_version 42340 (0.0007) [2023-03-07 17:08:42,136][232226] Updated weights for policy 0, policy_version 42350 (0.0006) [2023-03-07 17:08:42,925][232226] Updated weights for policy 0, policy_version 42360 (0.0006) [2023-03-07 17:08:43,737][232226] Updated weights for policy 0, policy_version 42370 (0.0006) [2023-03-07 17:08:44,522][232226] Updated weights for policy 0, policy_version 42380 (0.0006) [2023-03-07 17:08:45,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12878.1). Total num frames: 43403264. Throughput: 0: 12893.7. Samples: 43368610. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:08:45,069][231894] Avg episode reward: [(0, '191.961')] [2023-03-07 17:08:45,313][232226] Updated weights for policy 0, policy_version 42390 (0.0007) [2023-03-07 17:08:46,134][232226] Updated weights for policy 0, policy_version 42400 (0.0006) [2023-03-07 17:08:46,917][232226] Updated weights for policy 0, policy_version 42410 (0.0006) [2023-03-07 17:08:47,701][232226] Updated weights for policy 0, policy_version 42420 (0.0006) [2023-03-07 17:08:48,499][232226] Updated weights for policy 0, policy_version 42430 (0.0006) [2023-03-07 17:08:49,309][232226] Updated weights for policy 0, policy_version 42440 (0.0006) [2023-03-07 17:08:50,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12878.1). Total num frames: 43467776. Throughput: 0: 12897.7. Samples: 43445707. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:08:50,069][231894] Avg episode reward: [(0, '190.083')] [2023-03-07 17:08:50,094][232226] Updated weights for policy 0, policy_version 42450 (0.0006) [2023-03-07 17:08:50,882][232226] Updated weights for policy 0, policy_version 42460 (0.0007) [2023-03-07 17:08:51,684][232226] Updated weights for policy 0, policy_version 42470 (0.0006) [2023-03-07 17:08:52,454][232226] Updated weights for policy 0, policy_version 42480 (0.0006) [2023-03-07 17:08:53,266][232226] Updated weights for policy 0, policy_version 42490 (0.0006) [2023-03-07 17:08:54,067][232226] Updated weights for policy 0, policy_version 42500 (0.0007) [2023-03-07 17:08:54,850][232226] Updated weights for policy 0, policy_version 42510 (0.0007) [2023-03-07 17:08:55,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12878.1). Total num frames: 43532288. Throughput: 0: 12896.4. Samples: 43523353. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:08:55,069][231894] Avg episode reward: [(0, '194.228')] [2023-03-07 17:08:55,647][232226] Updated weights for policy 0, policy_version 42520 (0.0007) [2023-03-07 17:08:56,450][232226] Updated weights for policy 0, policy_version 42530 (0.0006) [2023-03-07 17:08:57,227][232226] Updated weights for policy 0, policy_version 42540 (0.0007) [2023-03-07 17:08:58,031][232226] Updated weights for policy 0, policy_version 42550 (0.0007) [2023-03-07 17:08:58,838][232226] Updated weights for policy 0, policy_version 42560 (0.0007) [2023-03-07 17:08:59,621][232226] Updated weights for policy 0, policy_version 42570 (0.0006) [2023-03-07 17:09:00,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 43596800. Throughput: 0: 12895.9. Samples: 43562104. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:09:00,069][231894] Avg episode reward: [(0, '190.530')] [2023-03-07 17:09:00,423][232226] Updated weights for policy 0, policy_version 42580 (0.0007) [2023-03-07 17:09:01,223][232226] Updated weights for policy 0, policy_version 42590 (0.0006) [2023-03-07 17:09:01,998][232226] Updated weights for policy 0, policy_version 42600 (0.0006) [2023-03-07 17:09:02,791][232226] Updated weights for policy 0, policy_version 42610 (0.0006) [2023-03-07 17:09:03,591][232226] Updated weights for policy 0, policy_version 42620 (0.0006) [2023-03-07 17:09:04,377][232226] Updated weights for policy 0, policy_version 42630 (0.0006) [2023-03-07 17:09:05,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12878.1). Total num frames: 43661312. Throughput: 0: 12889.1. Samples: 43639232. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:09:05,069][231894] Avg episode reward: [(0, '195.721')] [2023-03-07 17:09:05,173][232226] Updated weights for policy 0, policy_version 42640 (0.0006) [2023-03-07 17:09:05,958][232226] Updated weights for policy 0, policy_version 42650 (0.0008) [2023-03-07 17:09:06,765][232226] Updated weights for policy 0, policy_version 42660 (0.0006) [2023-03-07 17:09:07,560][232226] Updated weights for policy 0, policy_version 42670 (0.0006) [2023-03-07 17:09:08,354][232226] Updated weights for policy 0, policy_version 42680 (0.0006) [2023-03-07 17:09:09,141][232226] Updated weights for policy 0, policy_version 42690 (0.0006) [2023-03-07 17:09:09,941][232226] Updated weights for policy 0, policy_version 42700 (0.0007) [2023-03-07 17:09:10,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 43725824. Throughput: 0: 12890.6. Samples: 43716731. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:09:10,069][231894] Avg episode reward: [(0, '192.354')] [2023-03-07 17:09:10,728][232226] Updated weights for policy 0, policy_version 42710 (0.0006) [2023-03-07 17:09:11,517][232226] Updated weights for policy 0, policy_version 42720 (0.0007) [2023-03-07 17:09:12,317][232226] Updated weights for policy 0, policy_version 42730 (0.0006) [2023-03-07 17:09:13,116][232226] Updated weights for policy 0, policy_version 42740 (0.0006) [2023-03-07 17:09:13,912][232226] Updated weights for policy 0, policy_version 42750 (0.0006) [2023-03-07 17:09:14,705][232226] Updated weights for policy 0, policy_version 42760 (0.0006) [2023-03-07 17:09:15,069][231894] Fps is (10 sec: 12902.2, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 43790336. Throughput: 0: 12885.1. Samples: 43755317. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:09:15,070][231894] Avg episode reward: [(0, '190.410')] [2023-03-07 17:09:15,516][232226] Updated weights for policy 0, policy_version 42770 (0.0006) [2023-03-07 17:09:16,306][232226] Updated weights for policy 0, policy_version 42780 (0.0007) [2023-03-07 17:09:17,099][232226] Updated weights for policy 0, policy_version 42790 (0.0007) [2023-03-07 17:09:17,891][232226] Updated weights for policy 0, policy_version 42800 (0.0006) [2023-03-07 17:09:18,684][232226] Updated weights for policy 0, policy_version 42810 (0.0006) [2023-03-07 17:09:19,482][232226] Updated weights for policy 0, policy_version 42820 (0.0007) [2023-03-07 17:09:20,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 43854848. Throughput: 0: 12887.3. Samples: 43832592. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:09:20,069][231894] Avg episode reward: [(0, '193.948')] [2023-03-07 17:09:20,278][232226] Updated weights for policy 0, policy_version 42830 (0.0007) [2023-03-07 17:09:21,063][232226] Updated weights for policy 0, policy_version 42840 (0.0007) [2023-03-07 17:09:21,861][232226] Updated weights for policy 0, policy_version 42850 (0.0007) [2023-03-07 17:09:22,654][232226] Updated weights for policy 0, policy_version 42860 (0.0006) [2023-03-07 17:09:23,448][232226] Updated weights for policy 0, policy_version 42870 (0.0006) [2023-03-07 17:09:24,246][232226] Updated weights for policy 0, policy_version 42880 (0.0006) [2023-03-07 17:09:25,038][232226] Updated weights for policy 0, policy_version 42890 (0.0006) [2023-03-07 17:09:25,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12885.4, 300 sec: 12881.6). Total num frames: 43919360. Throughput: 0: 12893.3. Samples: 43910029. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:09:25,069][231894] Avg episode reward: [(0, '191.075')] [2023-03-07 17:09:25,835][232226] Updated weights for policy 0, policy_version 42900 (0.0006) [2023-03-07 17:09:26,624][232226] Updated weights for policy 0, policy_version 42910 (0.0007) [2023-03-07 17:09:27,424][232226] Updated weights for policy 0, policy_version 42920 (0.0006) [2023-03-07 17:09:28,210][232226] Updated weights for policy 0, policy_version 42930 (0.0007) [2023-03-07 17:09:28,992][232226] Updated weights for policy 0, policy_version 42940 (0.0008) [2023-03-07 17:09:29,809][232226] Updated weights for policy 0, policy_version 42950 (0.0007) [2023-03-07 17:09:30,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12885.0). Total num frames: 43983872. Throughput: 0: 12889.4. Samples: 43948633. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:09:30,069][231894] Avg episode reward: [(0, '192.031')] [2023-03-07 17:09:30,607][232226] Updated weights for policy 0, policy_version 42960 (0.0007) [2023-03-07 17:09:31,398][232226] Updated weights for policy 0, policy_version 42970 (0.0005) [2023-03-07 17:09:32,181][232226] Updated weights for policy 0, policy_version 42980 (0.0006) [2023-03-07 17:09:32,978][232226] Updated weights for policy 0, policy_version 42990 (0.0006) [2023-03-07 17:09:33,790][232226] Updated weights for policy 0, policy_version 43000 (0.0006) [2023-03-07 17:09:34,603][232226] Updated weights for policy 0, policy_version 43010 (0.0007) [2023-03-07 17:09:35,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12881.6). Total num frames: 44048384. Throughput: 0: 12891.7. Samples: 44025836. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:09:35,069][231894] Avg episode reward: [(0, '195.749')] [2023-03-07 17:09:35,390][232226] Updated weights for policy 0, policy_version 43020 (0.0007) [2023-03-07 17:09:36,193][232226] Updated weights for policy 0, policy_version 43030 (0.0008) [2023-03-07 17:09:36,965][232226] Updated weights for policy 0, policy_version 43040 (0.0006) [2023-03-07 17:09:37,783][232226] Updated weights for policy 0, policy_version 43050 (0.0006) [2023-03-07 17:09:38,586][232226] Updated weights for policy 0, policy_version 43060 (0.0006) [2023-03-07 17:09:39,355][232226] Updated weights for policy 0, policy_version 43070 (0.0007) [2023-03-07 17:09:40,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12885.3, 300 sec: 12878.1). Total num frames: 44111872. Throughput: 0: 12882.1. Samples: 44103049. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:09:40,069][231894] Avg episode reward: [(0, '190.045')] [2023-03-07 17:09:40,176][232226] Updated weights for policy 0, policy_version 43080 (0.0007) [2023-03-07 17:09:40,972][232226] Updated weights for policy 0, policy_version 43090 (0.0007) [2023-03-07 17:09:41,770][232226] Updated weights for policy 0, policy_version 43100 (0.0006) [2023-03-07 17:09:42,576][232226] Updated weights for policy 0, policy_version 43110 (0.0008) [2023-03-07 17:09:43,373][232226] Updated weights for policy 0, policy_version 43120 (0.0006) [2023-03-07 17:09:44,166][232226] Updated weights for policy 0, policy_version 43130 (0.0006) [2023-03-07 17:09:44,962][232226] Updated weights for policy 0, policy_version 43140 (0.0007) [2023-03-07 17:09:45,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12885.3, 300 sec: 12878.1). Total num frames: 44176384. Throughput: 0: 12873.0. Samples: 44141390. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:09:45,069][231894] Avg episode reward: [(0, '199.186')] [2023-03-07 17:09:45,761][232226] Updated weights for policy 0, policy_version 43150 (0.0006) [2023-03-07 17:09:46,557][232226] Updated weights for policy 0, policy_version 43160 (0.0007) [2023-03-07 17:09:47,349][232226] Updated weights for policy 0, policy_version 43170 (0.0006) [2023-03-07 17:09:48,145][232226] Updated weights for policy 0, policy_version 43180 (0.0007) [2023-03-07 17:09:48,942][232226] Updated weights for policy 0, policy_version 43190 (0.0006) [2023-03-07 17:09:49,736][232226] Updated weights for policy 0, policy_version 43200 (0.0007) [2023-03-07 17:09:50,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12878.1). Total num frames: 44240896. Throughput: 0: 12871.4. Samples: 44218444. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:09:50,069][231894] Avg episode reward: [(0, '191.719')] [2023-03-07 17:09:50,534][232226] Updated weights for policy 0, policy_version 43210 (0.0007) [2023-03-07 17:09:51,337][232226] Updated weights for policy 0, policy_version 43220 (0.0006) [2023-03-07 17:09:52,125][232226] Updated weights for policy 0, policy_version 43230 (0.0006) [2023-03-07 17:09:52,935][232226] Updated weights for policy 0, policy_version 43240 (0.0006) [2023-03-07 17:09:53,719][232226] Updated weights for policy 0, policy_version 43250 (0.0006) [2023-03-07 17:09:54,511][232226] Updated weights for policy 0, policy_version 43260 (0.0006) [2023-03-07 17:09:55,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 44304384. Throughput: 0: 12865.9. Samples: 44295697. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:09:55,069][231894] Avg episode reward: [(0, '194.897')] [2023-03-07 17:09:55,311][232226] Updated weights for policy 0, policy_version 43270 (0.0007) [2023-03-07 17:09:56,099][232226] Updated weights for policy 0, policy_version 43280 (0.0007) [2023-03-07 17:09:56,885][232226] Updated weights for policy 0, policy_version 43290 (0.0006) [2023-03-07 17:09:57,675][232226] Updated weights for policy 0, policy_version 43300 (0.0007) [2023-03-07 17:09:58,468][232226] Updated weights for policy 0, policy_version 43310 (0.0006) [2023-03-07 17:09:59,248][232226] Updated weights for policy 0, policy_version 43320 (0.0006) [2023-03-07 17:10:00,066][232226] Updated weights for policy 0, policy_version 43330 (0.0007) [2023-03-07 17:10:00,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12878.1). Total num frames: 44369920. Throughput: 0: 12868.8. Samples: 44334409. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:10:00,069][231894] Avg episode reward: [(0, '194.217')] [2023-03-07 17:10:00,858][232226] Updated weights for policy 0, policy_version 43340 (0.0006) [2023-03-07 17:10:01,644][232226] Updated weights for policy 0, policy_version 43350 (0.0006) [2023-03-07 17:10:02,444][232226] Updated weights for policy 0, policy_version 43360 (0.0007) [2023-03-07 17:10:03,226][232226] Updated weights for policy 0, policy_version 43370 (0.0007) [2023-03-07 17:10:04,023][232226] Updated weights for policy 0, policy_version 43380 (0.0007) [2023-03-07 17:10:04,829][232226] Updated weights for policy 0, policy_version 43390 (0.0007) [2023-03-07 17:10:05,069][231894] Fps is (10 sec: 13004.8, 60 sec: 12885.3, 300 sec: 12878.1). Total num frames: 44434432. Throughput: 0: 12872.0. Samples: 44411833. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:10:05,069][231894] Avg episode reward: [(0, '195.104')] [2023-03-07 17:10:05,612][232226] Updated weights for policy 0, policy_version 43400 (0.0007) [2023-03-07 17:10:06,390][232226] Updated weights for policy 0, policy_version 43410 (0.0006) [2023-03-07 17:10:07,195][232226] Updated weights for policy 0, policy_version 43420 (0.0006) [2023-03-07 17:10:07,996][232226] Updated weights for policy 0, policy_version 43430 (0.0007) [2023-03-07 17:10:08,774][232226] Updated weights for policy 0, policy_version 43440 (0.0006) [2023-03-07 17:10:09,582][232226] Updated weights for policy 0, policy_version 43450 (0.0007) [2023-03-07 17:10:10,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12878.1). Total num frames: 44498944. Throughput: 0: 12874.9. Samples: 44489398. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:10:10,069][231894] Avg episode reward: [(0, '197.636')] [2023-03-07 17:10:10,375][232226] Updated weights for policy 0, policy_version 43460 (0.0006) [2023-03-07 17:10:11,165][232226] Updated weights for policy 0, policy_version 43470 (0.0005) [2023-03-07 17:10:11,962][232226] Updated weights for policy 0, policy_version 43480 (0.0006) [2023-03-07 17:10:12,750][232226] Updated weights for policy 0, policy_version 43490 (0.0006) [2023-03-07 17:10:13,551][232226] Updated weights for policy 0, policy_version 43500 (0.0007) [2023-03-07 17:10:14,342][232226] Updated weights for policy 0, policy_version 43510 (0.0006) [2023-03-07 17:10:15,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.4, 300 sec: 12878.1). Total num frames: 44563456. Throughput: 0: 12873.6. Samples: 44527943. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:10:15,069][231894] Avg episode reward: [(0, '196.108')] [2023-03-07 17:10:15,152][232226] Updated weights for policy 0, policy_version 43520 (0.0006) [2023-03-07 17:10:15,941][232226] Updated weights for policy 0, policy_version 43530 (0.0007) [2023-03-07 17:10:16,726][232226] Updated weights for policy 0, policy_version 43540 (0.0006) [2023-03-07 17:10:17,527][232226] Updated weights for policy 0, policy_version 43550 (0.0006) [2023-03-07 17:10:18,319][232226] Updated weights for policy 0, policy_version 43560 (0.0006) [2023-03-07 17:10:19,096][232226] Updated weights for policy 0, policy_version 43570 (0.0006) [2023-03-07 17:10:19,891][232226] Updated weights for policy 0, policy_version 43580 (0.0006) [2023-03-07 17:10:20,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12878.1). Total num frames: 44627968. Throughput: 0: 12880.6. Samples: 44605462. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:10:20,069][231894] Avg episode reward: [(0, '193.686')] [2023-03-07 17:10:20,689][232226] Updated weights for policy 0, policy_version 43590 (0.0006) [2023-03-07 17:10:21,489][232226] Updated weights for policy 0, policy_version 43600 (0.0007) [2023-03-07 17:10:22,277][232226] Updated weights for policy 0, policy_version 43610 (0.0006) [2023-03-07 17:10:23,064][232226] Updated weights for policy 0, policy_version 43620 (0.0007) [2023-03-07 17:10:23,869][232226] Updated weights for policy 0, policy_version 43630 (0.0006) [2023-03-07 17:10:24,677][232226] Updated weights for policy 0, policy_version 43640 (0.0007) [2023-03-07 17:10:25,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 44691456. Throughput: 0: 12883.0. Samples: 44682783. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:10:25,069][231894] Avg episode reward: [(0, '194.731')] [2023-03-07 17:10:25,077][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000043645_44692480.pth... [2023-03-07 17:10:25,106][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000040626_41601024.pth [2023-03-07 17:10:25,485][232226] Updated weights for policy 0, policy_version 43650 (0.0007) [2023-03-07 17:10:26,286][232226] Updated weights for policy 0, policy_version 43660 (0.0007) [2023-03-07 17:10:27,085][232226] Updated weights for policy 0, policy_version 43670 (0.0007) [2023-03-07 17:10:27,881][232226] Updated weights for policy 0, policy_version 43680 (0.0006) [2023-03-07 17:10:28,684][232226] Updated weights for policy 0, policy_version 43690 (0.0006) [2023-03-07 17:10:29,494][232226] Updated weights for policy 0, policy_version 43700 (0.0007) [2023-03-07 17:10:30,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 44755968. Throughput: 0: 12878.8. Samples: 44720937. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:10:30,069][231894] Avg episode reward: [(0, '195.318')] [2023-03-07 17:10:30,286][232226] Updated weights for policy 0, policy_version 43710 (0.0006) [2023-03-07 17:10:31,067][232226] Updated weights for policy 0, policy_version 43720 (0.0005) [2023-03-07 17:10:31,868][232226] Updated weights for policy 0, policy_version 43730 (0.0006) [2023-03-07 17:10:32,668][232226] Updated weights for policy 0, policy_version 43740 (0.0006) [2023-03-07 17:10:33,456][232226] Updated weights for policy 0, policy_version 43750 (0.0007) [2023-03-07 17:10:34,233][232226] Updated weights for policy 0, policy_version 43760 (0.0006) [2023-03-07 17:10:35,032][232226] Updated weights for policy 0, policy_version 43770 (0.0006) [2023-03-07 17:10:35,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12878.1). Total num frames: 44820480. Throughput: 0: 12881.7. Samples: 44798119. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:10:35,069][231894] Avg episode reward: [(0, '201.340')] [2023-03-07 17:10:35,838][232226] Updated weights for policy 0, policy_version 43780 (0.0006) [2023-03-07 17:10:36,624][232226] Updated weights for policy 0, policy_version 43790 (0.0006) [2023-03-07 17:10:37,430][232226] Updated weights for policy 0, policy_version 43800 (0.0006) [2023-03-07 17:10:38,226][232226] Updated weights for policy 0, policy_version 43810 (0.0007) [2023-03-07 17:10:39,029][232226] Updated weights for policy 0, policy_version 43820 (0.0006) [2023-03-07 17:10:39,846][232226] Updated weights for policy 0, policy_version 43830 (0.0006) [2023-03-07 17:10:40,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 44883968. Throughput: 0: 12880.1. Samples: 44875299. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:10:40,069][231894] Avg episode reward: [(0, '182.003')] [2023-03-07 17:10:40,625][232226] Updated weights for policy 0, policy_version 43840 (0.0006) [2023-03-07 17:10:41,418][232226] Updated weights for policy 0, policy_version 43850 (0.0006) [2023-03-07 17:10:42,217][232226] Updated weights for policy 0, policy_version 43860 (0.0006) [2023-03-07 17:10:43,028][232226] Updated weights for policy 0, policy_version 43870 (0.0005) [2023-03-07 17:10:43,808][232226] Updated weights for policy 0, policy_version 43880 (0.0006) [2023-03-07 17:10:44,588][232226] Updated weights for policy 0, policy_version 43890 (0.0007) [2023-03-07 17:10:45,069][231894] Fps is (10 sec: 12902.2, 60 sec: 12885.3, 300 sec: 12878.1). Total num frames: 44949504. Throughput: 0: 12877.4. Samples: 44913894. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:10:45,070][231894] Avg episode reward: [(0, '189.511')] [2023-03-07 17:10:45,374][232226] Updated weights for policy 0, policy_version 43900 (0.0006) [2023-03-07 17:10:46,190][232226] Updated weights for policy 0, policy_version 43910 (0.0007) [2023-03-07 17:10:46,963][232226] Updated weights for policy 0, policy_version 43920 (0.0006) [2023-03-07 17:10:47,762][232226] Updated weights for policy 0, policy_version 43930 (0.0006) [2023-03-07 17:10:48,587][232226] Updated weights for policy 0, policy_version 43940 (0.0007) [2023-03-07 17:10:49,357][232226] Updated weights for policy 0, policy_version 43950 (0.0007) [2023-03-07 17:10:50,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 45012992. Throughput: 0: 12875.4. Samples: 44991225. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:10:50,070][231894] Avg episode reward: [(0, '190.430')] [2023-03-07 17:10:50,153][232226] Updated weights for policy 0, policy_version 43960 (0.0006) [2023-03-07 17:10:50,960][232226] Updated weights for policy 0, policy_version 43970 (0.0007) [2023-03-07 17:10:51,746][232226] Updated weights for policy 0, policy_version 43980 (0.0006) [2023-03-07 17:10:52,538][232226] Updated weights for policy 0, policy_version 43990 (0.0008) [2023-03-07 17:10:53,321][232226] Updated weights for policy 0, policy_version 44000 (0.0006) [2023-03-07 17:10:54,108][232226] Updated weights for policy 0, policy_version 44010 (0.0007) [2023-03-07 17:10:54,911][232226] Updated weights for policy 0, policy_version 44020 (0.0006) [2023-03-07 17:10:55,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12902.4, 300 sec: 12878.1). Total num frames: 45078528. Throughput: 0: 12875.7. Samples: 45068804. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:10:55,069][231894] Avg episode reward: [(0, '193.887')] [2023-03-07 17:10:55,691][232226] Updated weights for policy 0, policy_version 44030 (0.0006) [2023-03-07 17:10:56,497][232226] Updated weights for policy 0, policy_version 44040 (0.0007) [2023-03-07 17:10:57,282][232226] Updated weights for policy 0, policy_version 44050 (0.0007) [2023-03-07 17:10:58,076][232226] Updated weights for policy 0, policy_version 44060 (0.0006) [2023-03-07 17:10:58,894][232226] Updated weights for policy 0, policy_version 44070 (0.0007) [2023-03-07 17:10:59,682][232226] Updated weights for policy 0, policy_version 44080 (0.0006) [2023-03-07 17:11:00,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 45142016. Throughput: 0: 12876.4. Samples: 45107381. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:11:00,069][231894] Avg episode reward: [(0, '187.368')] [2023-03-07 17:11:00,470][232226] Updated weights for policy 0, policy_version 44090 (0.0006) [2023-03-07 17:11:01,286][232226] Updated weights for policy 0, policy_version 44100 (0.0006) [2023-03-07 17:11:02,067][232226] Updated weights for policy 0, policy_version 44110 (0.0006) [2023-03-07 17:11:02,885][232226] Updated weights for policy 0, policy_version 44120 (0.0006) [2023-03-07 17:11:03,673][232226] Updated weights for policy 0, policy_version 44130 (0.0007) [2023-03-07 17:11:04,469][232226] Updated weights for policy 0, policy_version 44140 (0.0006) [2023-03-07 17:11:05,069][231894] Fps is (10 sec: 12799.8, 60 sec: 12868.2, 300 sec: 12874.6). Total num frames: 45206528. Throughput: 0: 12865.5. Samples: 45184412. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:11:05,070][231894] Avg episode reward: [(0, '198.321')] [2023-03-07 17:11:05,258][232226] Updated weights for policy 0, policy_version 44150 (0.0007) [2023-03-07 17:11:06,047][232226] Updated weights for policy 0, policy_version 44160 (0.0007) [2023-03-07 17:11:06,853][232226] Updated weights for policy 0, policy_version 44170 (0.0007) [2023-03-07 17:11:07,642][232226] Updated weights for policy 0, policy_version 44180 (0.0006) [2023-03-07 17:11:08,450][232226] Updated weights for policy 0, policy_version 44190 (0.0006) [2023-03-07 17:11:09,232][232226] Updated weights for policy 0, policy_version 44200 (0.0007) [2023-03-07 17:11:10,051][232226] Updated weights for policy 0, policy_version 44210 (0.0007) [2023-03-07 17:11:10,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 45271040. Throughput: 0: 12864.6. Samples: 45261692. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:11:10,069][231894] Avg episode reward: [(0, '189.339')] [2023-03-07 17:11:10,854][232226] Updated weights for policy 0, policy_version 44220 (0.0006) [2023-03-07 17:11:11,634][232226] Updated weights for policy 0, policy_version 44230 (0.0006) [2023-03-07 17:11:12,427][232226] Updated weights for policy 0, policy_version 44240 (0.0006) [2023-03-07 17:11:13,222][232226] Updated weights for policy 0, policy_version 44250 (0.0006) [2023-03-07 17:11:14,012][232226] Updated weights for policy 0, policy_version 44260 (0.0007) [2023-03-07 17:11:14,803][232226] Updated weights for policy 0, policy_version 44270 (0.0006) [2023-03-07 17:11:15,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12878.1). Total num frames: 45335552. Throughput: 0: 12871.6. Samples: 45300159. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:11:15,069][231894] Avg episode reward: [(0, '196.314')] [2023-03-07 17:11:15,607][232226] Updated weights for policy 0, policy_version 44280 (0.0007) [2023-03-07 17:11:16,389][232226] Updated weights for policy 0, policy_version 44290 (0.0005) [2023-03-07 17:11:17,187][232226] Updated weights for policy 0, policy_version 44300 (0.0007) [2023-03-07 17:11:17,992][232226] Updated weights for policy 0, policy_version 44310 (0.0006) [2023-03-07 17:11:18,792][232226] Updated weights for policy 0, policy_version 44320 (0.0006) [2023-03-07 17:11:19,581][232226] Updated weights for policy 0, policy_version 44330 (0.0007) [2023-03-07 17:11:20,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12878.1). Total num frames: 45400064. Throughput: 0: 12874.3. Samples: 45377461. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:11:20,069][231894] Avg episode reward: [(0, '191.886')] [2023-03-07 17:11:20,370][232226] Updated weights for policy 0, policy_version 44340 (0.0006) [2023-03-07 17:11:21,178][232226] Updated weights for policy 0, policy_version 44350 (0.0006) [2023-03-07 17:11:21,970][232226] Updated weights for policy 0, policy_version 44360 (0.0007) [2023-03-07 17:11:22,759][232226] Updated weights for policy 0, policy_version 44370 (0.0006) [2023-03-07 17:11:23,557][232226] Updated weights for policy 0, policy_version 44380 (0.0006) [2023-03-07 17:11:24,324][232226] Updated weights for policy 0, policy_version 44390 (0.0006) [2023-03-07 17:11:25,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12878.1). Total num frames: 45464576. Throughput: 0: 12884.4. Samples: 45455099. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:11:25,069][231894] Avg episode reward: [(0, '182.804')] [2023-03-07 17:11:25,138][232226] Updated weights for policy 0, policy_version 44400 (0.0006) [2023-03-07 17:11:25,923][232226] Updated weights for policy 0, policy_version 44410 (0.0006) [2023-03-07 17:11:26,695][232226] Updated weights for policy 0, policy_version 44420 (0.0006) [2023-03-07 17:11:27,506][232226] Updated weights for policy 0, policy_version 44430 (0.0006) [2023-03-07 17:11:28,298][232226] Updated weights for policy 0, policy_version 44440 (0.0006) [2023-03-07 17:11:29,090][232226] Updated weights for policy 0, policy_version 44450 (0.0006) [2023-03-07 17:11:29,878][232226] Updated weights for policy 0, policy_version 44460 (0.0006) [2023-03-07 17:11:30,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12878.1). Total num frames: 45529088. Throughput: 0: 12887.1. Samples: 45493814. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:11:30,069][231894] Avg episode reward: [(0, '195.450')] [2023-03-07 17:11:30,684][232226] Updated weights for policy 0, policy_version 44470 (0.0007) [2023-03-07 17:11:31,469][232226] Updated weights for policy 0, policy_version 44480 (0.0006) [2023-03-07 17:11:32,272][232226] Updated weights for policy 0, policy_version 44490 (0.0007) [2023-03-07 17:11:33,062][232226] Updated weights for policy 0, policy_version 44500 (0.0006) [2023-03-07 17:11:33,872][232226] Updated weights for policy 0, policy_version 44510 (0.0006) [2023-03-07 17:11:34,650][232226] Updated weights for policy 0, policy_version 44520 (0.0006) [2023-03-07 17:11:35,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12878.1). Total num frames: 45593600. Throughput: 0: 12887.4. Samples: 45571160. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:11:35,069][231894] Avg episode reward: [(0, '195.562')] [2023-03-07 17:11:35,441][232226] Updated weights for policy 0, policy_version 44530 (0.0008) [2023-03-07 17:11:36,245][232226] Updated weights for policy 0, policy_version 44540 (0.0007) [2023-03-07 17:11:37,019][232226] Updated weights for policy 0, policy_version 44550 (0.0006) [2023-03-07 17:11:37,812][232226] Updated weights for policy 0, policy_version 44560 (0.0006) [2023-03-07 17:11:38,610][232226] Updated weights for policy 0, policy_version 44570 (0.0006) [2023-03-07 17:11:39,376][232226] Updated weights for policy 0, policy_version 44580 (0.0006) [2023-03-07 17:11:40,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12878.1). Total num frames: 45658112. Throughput: 0: 12894.3. Samples: 45649046. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:11:40,069][231894] Avg episode reward: [(0, '195.257')] [2023-03-07 17:11:40,158][232226] Updated weights for policy 0, policy_version 44590 (0.0007) [2023-03-07 17:11:40,974][232226] Updated weights for policy 0, policy_version 44600 (0.0007) [2023-03-07 17:11:41,759][232226] Updated weights for policy 0, policy_version 44610 (0.0006) [2023-03-07 17:11:42,546][232226] Updated weights for policy 0, policy_version 44620 (0.0006) [2023-03-07 17:11:43,339][232226] Updated weights for policy 0, policy_version 44630 (0.0006) [2023-03-07 17:11:44,147][232226] Updated weights for policy 0, policy_version 44640 (0.0006) [2023-03-07 17:11:44,916][232226] Updated weights for policy 0, policy_version 44650 (0.0006) [2023-03-07 17:11:45,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 45722624. Throughput: 0: 12897.9. Samples: 45687788. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:11:45,069][231894] Avg episode reward: [(0, '193.190')] [2023-03-07 17:11:45,737][232226] Updated weights for policy 0, policy_version 44660 (0.0007) [2023-03-07 17:11:46,537][232226] Updated weights for policy 0, policy_version 44670 (0.0007) [2023-03-07 17:11:47,341][232226] Updated weights for policy 0, policy_version 44680 (0.0006) [2023-03-07 17:11:48,135][232226] Updated weights for policy 0, policy_version 44690 (0.0006) [2023-03-07 17:11:48,922][232226] Updated weights for policy 0, policy_version 44700 (0.0008) [2023-03-07 17:11:49,729][232226] Updated weights for policy 0, policy_version 44710 (0.0008) [2023-03-07 17:11:50,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12881.6). Total num frames: 45787136. Throughput: 0: 12899.1. Samples: 45764869. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:11:50,069][231894] Avg episode reward: [(0, '196.590')] [2023-03-07 17:11:50,524][232226] Updated weights for policy 0, policy_version 44720 (0.0006) [2023-03-07 17:11:51,315][232226] Updated weights for policy 0, policy_version 44730 (0.0006) [2023-03-07 17:11:52,116][232226] Updated weights for policy 0, policy_version 44740 (0.0006) [2023-03-07 17:11:52,905][232226] Updated weights for policy 0, policy_version 44750 (0.0007) [2023-03-07 17:11:53,675][232226] Updated weights for policy 0, policy_version 44760 (0.0007) [2023-03-07 17:11:54,456][232226] Updated weights for policy 0, policy_version 44770 (0.0006) [2023-03-07 17:11:55,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 45851648. Throughput: 0: 12905.6. Samples: 45842444. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:11:55,069][231894] Avg episode reward: [(0, '193.229')] [2023-03-07 17:11:55,262][232226] Updated weights for policy 0, policy_version 44780 (0.0006) [2023-03-07 17:11:56,042][232226] Updated weights for policy 0, policy_version 44790 (0.0006) [2023-03-07 17:11:56,818][232226] Updated weights for policy 0, policy_version 44800 (0.0006) [2023-03-07 17:11:57,642][232226] Updated weights for policy 0, policy_version 44810 (0.0006) [2023-03-07 17:11:58,436][232226] Updated weights for policy 0, policy_version 44820 (0.0006) [2023-03-07 17:11:59,220][232226] Updated weights for policy 0, policy_version 44830 (0.0007) [2023-03-07 17:12:00,014][232226] Updated weights for policy 0, policy_version 44840 (0.0007) [2023-03-07 17:12:00,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12881.6). Total num frames: 45916160. Throughput: 0: 12914.4. Samples: 45881307. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:12:00,069][231894] Avg episode reward: [(0, '198.447')] [2023-03-07 17:12:00,831][232226] Updated weights for policy 0, policy_version 44850 (0.0007) [2023-03-07 17:12:01,601][232226] Updated weights for policy 0, policy_version 44860 (0.0007) [2023-03-07 17:12:02,380][232226] Updated weights for policy 0, policy_version 44870 (0.0007) [2023-03-07 17:12:03,201][232226] Updated weights for policy 0, policy_version 44880 (0.0006) [2023-03-07 17:12:03,982][232226] Updated weights for policy 0, policy_version 44890 (0.0006) [2023-03-07 17:12:04,763][232226] Updated weights for policy 0, policy_version 44900 (0.0006) [2023-03-07 17:12:05,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12881.6). Total num frames: 45980672. Throughput: 0: 12917.7. Samples: 45958760. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:12:05,069][231894] Avg episode reward: [(0, '202.260')] [2023-03-07 17:12:05,547][232226] Updated weights for policy 0, policy_version 44910 (0.0007) [2023-03-07 17:12:06,348][232226] Updated weights for policy 0, policy_version 44920 (0.0007) [2023-03-07 17:12:07,133][232226] Updated weights for policy 0, policy_version 44930 (0.0007) [2023-03-07 17:12:07,935][232226] Updated weights for policy 0, policy_version 44940 (0.0006) [2023-03-07 17:12:08,709][232226] Updated weights for policy 0, policy_version 44950 (0.0007) [2023-03-07 17:12:09,529][232226] Updated weights for policy 0, policy_version 44960 (0.0006) [2023-03-07 17:12:10,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12881.6). Total num frames: 46045184. Throughput: 0: 12915.6. Samples: 46036301. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:12:10,069][231894] Avg episode reward: [(0, '191.245')] [2023-03-07 17:12:10,304][232226] Updated weights for policy 0, policy_version 44970 (0.0007) [2023-03-07 17:12:11,094][232226] Updated weights for policy 0, policy_version 44980 (0.0006) [2023-03-07 17:12:11,893][232226] Updated weights for policy 0, policy_version 44990 (0.0006) [2023-03-07 17:12:12,711][232226] Updated weights for policy 0, policy_version 45000 (0.0006) [2023-03-07 17:12:13,502][232226] Updated weights for policy 0, policy_version 45010 (0.0006) [2023-03-07 17:12:14,289][232226] Updated weights for policy 0, policy_version 45020 (0.0005) [2023-03-07 17:12:15,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12881.6). Total num frames: 46109696. Throughput: 0: 12916.5. Samples: 46075057. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:12:15,069][231894] Avg episode reward: [(0, '194.446')] [2023-03-07 17:12:15,098][232226] Updated weights for policy 0, policy_version 45030 (0.0006) [2023-03-07 17:12:15,898][232226] Updated weights for policy 0, policy_version 45040 (0.0005) [2023-03-07 17:12:16,691][232226] Updated weights for policy 0, policy_version 45050 (0.0006) [2023-03-07 17:12:17,489][232226] Updated weights for policy 0, policy_version 45060 (0.0006) [2023-03-07 17:12:18,279][232226] Updated weights for policy 0, policy_version 45070 (0.0006) [2023-03-07 17:12:19,062][232226] Updated weights for policy 0, policy_version 45080 (0.0006) [2023-03-07 17:12:19,866][232226] Updated weights for policy 0, policy_version 45090 (0.0006) [2023-03-07 17:12:20,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12881.6). Total num frames: 46174208. Throughput: 0: 12909.9. Samples: 46152109. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:12:20,070][231894] Avg episode reward: [(0, '187.454')] [2023-03-07 17:12:20,653][232226] Updated weights for policy 0, policy_version 45100 (0.0006) [2023-03-07 17:12:21,442][232226] Updated weights for policy 0, policy_version 45110 (0.0006) [2023-03-07 17:12:22,228][232226] Updated weights for policy 0, policy_version 45120 (0.0006) [2023-03-07 17:12:23,041][232226] Updated weights for policy 0, policy_version 45130 (0.0006) [2023-03-07 17:12:23,829][232226] Updated weights for policy 0, policy_version 45140 (0.0006) [2023-03-07 17:12:24,630][232226] Updated weights for policy 0, policy_version 45150 (0.0007) [2023-03-07 17:12:25,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12885.0). Total num frames: 46238720. Throughput: 0: 12898.8. Samples: 46229493. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:12:25,069][231894] Avg episode reward: [(0, '192.617')] [2023-03-07 17:12:25,074][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000045155_46238720.pth... [2023-03-07 17:12:25,104][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000042135_43146240.pth [2023-03-07 17:12:25,431][232226] Updated weights for policy 0, policy_version 45160 (0.0006) [2023-03-07 17:12:26,221][232226] Updated weights for policy 0, policy_version 45170 (0.0006) [2023-03-07 17:12:27,018][232226] Updated weights for policy 0, policy_version 45180 (0.0007) [2023-03-07 17:12:27,822][232226] Updated weights for policy 0, policy_version 45190 (0.0006) [2023-03-07 17:12:28,611][232226] Updated weights for policy 0, policy_version 45200 (0.0006) [2023-03-07 17:12:29,426][232226] Updated weights for policy 0, policy_version 45210 (0.0007) [2023-03-07 17:12:30,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12885.0). Total num frames: 46303232. Throughput: 0: 12893.9. Samples: 46268011. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:12:30,069][231894] Avg episode reward: [(0, '193.427')] [2023-03-07 17:12:30,224][232226] Updated weights for policy 0, policy_version 45220 (0.0006) [2023-03-07 17:12:31,012][232226] Updated weights for policy 0, policy_version 45230 (0.0006) [2023-03-07 17:12:31,794][232226] Updated weights for policy 0, policy_version 45240 (0.0006) [2023-03-07 17:12:32,621][232226] Updated weights for policy 0, policy_version 45250 (0.0006) [2023-03-07 17:12:33,391][232226] Updated weights for policy 0, policy_version 45260 (0.0007) [2023-03-07 17:12:34,195][232226] Updated weights for policy 0, policy_version 45270 (0.0006) [2023-03-07 17:12:35,002][232226] Updated weights for policy 0, policy_version 45280 (0.0006) [2023-03-07 17:12:35,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12902.4, 300 sec: 12888.5). Total num frames: 46367744. Throughput: 0: 12897.5. Samples: 46345257. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:12:35,069][231894] Avg episode reward: [(0, '199.059')] [2023-03-07 17:12:35,807][232226] Updated weights for policy 0, policy_version 45290 (0.0006) [2023-03-07 17:12:36,605][232226] Updated weights for policy 0, policy_version 45300 (0.0006) [2023-03-07 17:12:37,390][232226] Updated weights for policy 0, policy_version 45310 (0.0007) [2023-03-07 17:12:38,173][232226] Updated weights for policy 0, policy_version 45320 (0.0006) [2023-03-07 17:12:38,968][232226] Updated weights for policy 0, policy_version 45330 (0.0007) [2023-03-07 17:12:39,767][232226] Updated weights for policy 0, policy_version 45340 (0.0006) [2023-03-07 17:12:40,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12885.3, 300 sec: 12885.0). Total num frames: 46431232. Throughput: 0: 12886.1. Samples: 46422321. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:12:40,069][231894] Avg episode reward: [(0, '190.246')] [2023-03-07 17:12:40,561][232226] Updated weights for policy 0, policy_version 45350 (0.0007) [2023-03-07 17:12:41,351][232226] Updated weights for policy 0, policy_version 45360 (0.0007) [2023-03-07 17:12:42,128][232226] Updated weights for policy 0, policy_version 45370 (0.0007) [2023-03-07 17:12:42,923][232226] Updated weights for policy 0, policy_version 45380 (0.0006) [2023-03-07 17:12:43,737][232226] Updated weights for policy 0, policy_version 45390 (0.0006) [2023-03-07 17:12:44,523][232226] Updated weights for policy 0, policy_version 45400 (0.0006) [2023-03-07 17:12:45,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12885.3, 300 sec: 12885.0). Total num frames: 46495744. Throughput: 0: 12886.0. Samples: 46461176. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:12:45,069][231894] Avg episode reward: [(0, '198.145')] [2023-03-07 17:12:45,334][232226] Updated weights for policy 0, policy_version 45410 (0.0006) [2023-03-07 17:12:46,124][232226] Updated weights for policy 0, policy_version 45420 (0.0006) [2023-03-07 17:12:46,914][232226] Updated weights for policy 0, policy_version 45430 (0.0006) [2023-03-07 17:12:47,717][232226] Updated weights for policy 0, policy_version 45440 (0.0006) [2023-03-07 17:12:48,515][232226] Updated weights for policy 0, policy_version 45450 (0.0007) [2023-03-07 17:12:49,317][232226] Updated weights for policy 0, policy_version 45460 (0.0007) [2023-03-07 17:12:50,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12885.0). Total num frames: 46560256. Throughput: 0: 12873.0. Samples: 46538046. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:12:50,070][231894] Avg episode reward: [(0, '195.720')] [2023-03-07 17:12:50,145][232226] Updated weights for policy 0, policy_version 45470 (0.0006) [2023-03-07 17:12:50,930][232226] Updated weights for policy 0, policy_version 45480 (0.0007) [2023-03-07 17:12:51,735][232226] Updated weights for policy 0, policy_version 45490 (0.0006) [2023-03-07 17:12:52,532][232226] Updated weights for policy 0, policy_version 45500 (0.0007) [2023-03-07 17:12:53,328][232226] Updated weights for policy 0, policy_version 45510 (0.0006) [2023-03-07 17:12:54,129][232226] Updated weights for policy 0, policy_version 45520 (0.0006) [2023-03-07 17:12:54,917][232226] Updated weights for policy 0, policy_version 45530 (0.0006) [2023-03-07 17:12:55,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.2, 300 sec: 12881.6). Total num frames: 46623744. Throughput: 0: 12857.1. Samples: 46614871. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:12:55,069][231894] Avg episode reward: [(0, '184.412')] [2023-03-07 17:12:55,704][232226] Updated weights for policy 0, policy_version 45540 (0.0007) [2023-03-07 17:12:56,496][232226] Updated weights for policy 0, policy_version 45550 (0.0007) [2023-03-07 17:12:57,291][232226] Updated weights for policy 0, policy_version 45560 (0.0006) [2023-03-07 17:12:58,086][232226] Updated weights for policy 0, policy_version 45570 (0.0006) [2023-03-07 17:12:58,899][232226] Updated weights for policy 0, policy_version 45580 (0.0006) [2023-03-07 17:12:59,679][232226] Updated weights for policy 0, policy_version 45590 (0.0006) [2023-03-07 17:13:00,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12868.3, 300 sec: 12881.6). Total num frames: 46688256. Throughput: 0: 12858.0. Samples: 46653665. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:13:00,080][231894] Avg episode reward: [(0, '194.630')] [2023-03-07 17:13:00,458][232226] Updated weights for policy 0, policy_version 45600 (0.0006) [2023-03-07 17:13:01,262][232226] Updated weights for policy 0, policy_version 45610 (0.0007) [2023-03-07 17:13:02,060][232226] Updated weights for policy 0, policy_version 45620 (0.0006) [2023-03-07 17:13:02,850][232226] Updated weights for policy 0, policy_version 45630 (0.0006) [2023-03-07 17:13:03,653][232226] Updated weights for policy 0, policy_version 45640 (0.0006) [2023-03-07 17:13:04,429][232226] Updated weights for policy 0, policy_version 45650 (0.0006) [2023-03-07 17:13:05,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12881.6). Total num frames: 46752768. Throughput: 0: 12864.6. Samples: 46731016. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:13:05,080][231894] Avg episode reward: [(0, '196.997')] [2023-03-07 17:13:05,253][232226] Updated weights for policy 0, policy_version 45660 (0.0007) [2023-03-07 17:13:06,019][232226] Updated weights for policy 0, policy_version 45670 (0.0006) [2023-03-07 17:13:06,810][232226] Updated weights for policy 0, policy_version 45680 (0.0007) [2023-03-07 17:13:07,618][232226] Updated weights for policy 0, policy_version 45690 (0.0008) [2023-03-07 17:13:08,397][232226] Updated weights for policy 0, policy_version 45700 (0.0006) [2023-03-07 17:13:09,213][232226] Updated weights for policy 0, policy_version 45710 (0.0006) [2023-03-07 17:13:10,009][232226] Updated weights for policy 0, policy_version 45720 (0.0006) [2023-03-07 17:13:10,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12881.6). Total num frames: 46817280. Throughput: 0: 12861.2. Samples: 46808244. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:13:10,080][231894] Avg episode reward: [(0, '185.982')] [2023-03-07 17:13:10,800][232226] Updated weights for policy 0, policy_version 45730 (0.0007) [2023-03-07 17:13:11,590][232226] Updated weights for policy 0, policy_version 45740 (0.0006) [2023-03-07 17:13:12,374][232226] Updated weights for policy 0, policy_version 45750 (0.0006) [2023-03-07 17:13:13,182][232226] Updated weights for policy 0, policy_version 45760 (0.0007) [2023-03-07 17:13:13,999][232226] Updated weights for policy 0, policy_version 45770 (0.0007) [2023-03-07 17:13:14,779][232226] Updated weights for policy 0, policy_version 45780 (0.0006) [2023-03-07 17:13:15,069][231894] Fps is (10 sec: 12902.2, 60 sec: 12868.3, 300 sec: 12881.6). Total num frames: 46881792. Throughput: 0: 12869.3. Samples: 46847130. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:13:15,080][231894] Avg episode reward: [(0, '188.566')] [2023-03-07 17:13:15,573][232226] Updated weights for policy 0, policy_version 45790 (0.0006) [2023-03-07 17:13:16,385][232226] Updated weights for policy 0, policy_version 45800 (0.0007) [2023-03-07 17:13:17,182][232226] Updated weights for policy 0, policy_version 45810 (0.0007) [2023-03-07 17:13:17,995][232226] Updated weights for policy 0, policy_version 45820 (0.0006) [2023-03-07 17:13:18,773][232226] Updated weights for policy 0, policy_version 45830 (0.0007) [2023-03-07 17:13:19,551][232226] Updated weights for policy 0, policy_version 45840 (0.0006) [2023-03-07 17:13:20,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12881.6). Total num frames: 46946304. Throughput: 0: 12857.6. Samples: 46923847. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:13:20,069][231894] Avg episode reward: [(0, '197.457')] [2023-03-07 17:13:20,353][232226] Updated weights for policy 0, policy_version 45850 (0.0007) [2023-03-07 17:13:21,157][232226] Updated weights for policy 0, policy_version 45860 (0.0006) [2023-03-07 17:13:21,949][232226] Updated weights for policy 0, policy_version 45870 (0.0006) [2023-03-07 17:13:22,745][232226] Updated weights for policy 0, policy_version 45880 (0.0006) [2023-03-07 17:13:23,543][232226] Updated weights for policy 0, policy_version 45890 (0.0007) [2023-03-07 17:13:24,329][232226] Updated weights for policy 0, policy_version 45900 (0.0006) [2023-03-07 17:13:25,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12881.6). Total num frames: 47010816. Throughput: 0: 12866.0. Samples: 47001291. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:13:25,070][231894] Avg episode reward: [(0, '186.844')] [2023-03-07 17:13:25,134][232226] Updated weights for policy 0, policy_version 45910 (0.0006) [2023-03-07 17:13:25,924][232226] Updated weights for policy 0, policy_version 45920 (0.0006) [2023-03-07 17:13:26,725][232226] Updated weights for policy 0, policy_version 45930 (0.0006) [2023-03-07 17:13:27,502][232226] Updated weights for policy 0, policy_version 45940 (0.0006) [2023-03-07 17:13:28,294][232226] Updated weights for policy 0, policy_version 45950 (0.0006) [2023-03-07 17:13:29,094][232226] Updated weights for policy 0, policy_version 45960 (0.0007) [2023-03-07 17:13:29,894][232226] Updated weights for policy 0, policy_version 45970 (0.0006) [2023-03-07 17:13:30,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12885.0). Total num frames: 47075328. Throughput: 0: 12864.6. Samples: 47040081. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:13:30,069][231894] Avg episode reward: [(0, '191.659')] [2023-03-07 17:13:30,703][232226] Updated weights for policy 0, policy_version 45980 (0.0007) [2023-03-07 17:13:31,482][232226] Updated weights for policy 0, policy_version 45990 (0.0006) [2023-03-07 17:13:32,271][232226] Updated weights for policy 0, policy_version 46000 (0.0006) [2023-03-07 17:13:33,081][232226] Updated weights for policy 0, policy_version 46010 (0.0006) [2023-03-07 17:13:33,846][232226] Updated weights for policy 0, policy_version 46020 (0.0007) [2023-03-07 17:13:34,642][232226] Updated weights for policy 0, policy_version 46030 (0.0006) [2023-03-07 17:13:35,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12885.0). Total num frames: 47139840. Throughput: 0: 12871.4. Samples: 47117258. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:13:35,069][231894] Avg episode reward: [(0, '191.235')] [2023-03-07 17:13:35,456][232226] Updated weights for policy 0, policy_version 46040 (0.0006) [2023-03-07 17:13:36,239][232226] Updated weights for policy 0, policy_version 46050 (0.0006) [2023-03-07 17:13:37,054][232226] Updated weights for policy 0, policy_version 46060 (0.0006) [2023-03-07 17:13:37,849][232226] Updated weights for policy 0, policy_version 46070 (0.0005) [2023-03-07 17:13:38,644][232226] Updated weights for policy 0, policy_version 46080 (0.0008) [2023-03-07 17:13:39,437][232226] Updated weights for policy 0, policy_version 46090 (0.0006) [2023-03-07 17:13:40,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12885.4, 300 sec: 12885.0). Total num frames: 47204352. Throughput: 0: 12878.5. Samples: 47194400. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:13:40,069][231894] Avg episode reward: [(0, '200.628')] [2023-03-07 17:13:40,234][232226] Updated weights for policy 0, policy_version 46100 (0.0006) [2023-03-07 17:13:41,056][232226] Updated weights for policy 0, policy_version 46110 (0.0006) [2023-03-07 17:13:41,852][232226] Updated weights for policy 0, policy_version 46120 (0.0006) [2023-03-07 17:13:42,628][232226] Updated weights for policy 0, policy_version 46130 (0.0006) [2023-03-07 17:13:43,441][232226] Updated weights for policy 0, policy_version 46140 (0.0007) [2023-03-07 17:13:44,226][232226] Updated weights for policy 0, policy_version 46150 (0.0006) [2023-03-07 17:13:45,022][232226] Updated weights for policy 0, policy_version 46160 (0.0007) [2023-03-07 17:13:45,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12881.6). Total num frames: 47267840. Throughput: 0: 12872.6. Samples: 47232932. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:13:45,069][231894] Avg episode reward: [(0, '192.055')] [2023-03-07 17:13:45,821][232226] Updated weights for policy 0, policy_version 46170 (0.0006) [2023-03-07 17:13:46,611][232226] Updated weights for policy 0, policy_version 46180 (0.0006) [2023-03-07 17:13:47,416][232226] Updated weights for policy 0, policy_version 46190 (0.0006) [2023-03-07 17:13:48,230][232226] Updated weights for policy 0, policy_version 46200 (0.0006) [2023-03-07 17:13:49,009][232226] Updated weights for policy 0, policy_version 46210 (0.0006) [2023-03-07 17:13:49,788][232226] Updated weights for policy 0, policy_version 46220 (0.0006) [2023-03-07 17:13:50,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12868.3, 300 sec: 12881.6). Total num frames: 47332352. Throughput: 0: 12865.9. Samples: 47309983. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:13:50,069][231894] Avg episode reward: [(0, '190.594')] [2023-03-07 17:13:50,598][232226] Updated weights for policy 0, policy_version 46230 (0.0006) [2023-03-07 17:13:51,390][232226] Updated weights for policy 0, policy_version 46240 (0.0006) [2023-03-07 17:13:52,193][232226] Updated weights for policy 0, policy_version 46250 (0.0006) [2023-03-07 17:13:52,979][232226] Updated weights for policy 0, policy_version 46260 (0.0007) [2023-03-07 17:13:53,776][232226] Updated weights for policy 0, policy_version 46270 (0.0006) [2023-03-07 17:13:54,557][232226] Updated weights for policy 0, policy_version 46280 (0.0006) [2023-03-07 17:13:55,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 47396864. Throughput: 0: 12871.6. Samples: 47387464. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:13:55,069][231894] Avg episode reward: [(0, '200.526')] [2023-03-07 17:13:55,356][232226] Updated weights for policy 0, policy_version 46290 (0.0006) [2023-03-07 17:13:56,165][232226] Updated weights for policy 0, policy_version 46300 (0.0007) [2023-03-07 17:13:56,942][232226] Updated weights for policy 0, policy_version 46310 (0.0006) [2023-03-07 17:13:57,732][232226] Updated weights for policy 0, policy_version 46320 (0.0006) [2023-03-07 17:13:58,545][232226] Updated weights for policy 0, policy_version 46330 (0.0006) [2023-03-07 17:13:59,330][232226] Updated weights for policy 0, policy_version 46340 (0.0007) [2023-03-07 17:14:00,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 47461376. Throughput: 0: 12867.6. Samples: 47426172. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:14:00,069][231894] Avg episode reward: [(0, '195.314')] [2023-03-07 17:14:00,119][232226] Updated weights for policy 0, policy_version 46350 (0.0006) [2023-03-07 17:14:00,906][232226] Updated weights for policy 0, policy_version 46360 (0.0007) [2023-03-07 17:14:01,702][232226] Updated weights for policy 0, policy_version 46370 (0.0007) [2023-03-07 17:14:02,477][232226] Updated weights for policy 0, policy_version 46380 (0.0007) [2023-03-07 17:14:03,302][232226] Updated weights for policy 0, policy_version 46390 (0.0006) [2023-03-07 17:14:04,090][232226] Updated weights for policy 0, policy_version 46400 (0.0007) [2023-03-07 17:14:04,890][232226] Updated weights for policy 0, policy_version 46410 (0.0007) [2023-03-07 17:14:05,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 47525888. Throughput: 0: 12885.3. Samples: 47503688. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:14:05,069][231894] Avg episode reward: [(0, '187.066')] [2023-03-07 17:14:05,671][232226] Updated weights for policy 0, policy_version 46420 (0.0008) [2023-03-07 17:14:06,476][232226] Updated weights for policy 0, policy_version 46430 (0.0006) [2023-03-07 17:14:07,257][232226] Updated weights for policy 0, policy_version 46440 (0.0006) [2023-03-07 17:14:08,053][232226] Updated weights for policy 0, policy_version 46450 (0.0006) [2023-03-07 17:14:08,827][232226] Updated weights for policy 0, policy_version 46460 (0.0006) [2023-03-07 17:14:09,644][232226] Updated weights for policy 0, policy_version 46470 (0.0006) [2023-03-07 17:14:10,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 47590400. Throughput: 0: 12883.4. Samples: 47581046. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:14:10,069][231894] Avg episode reward: [(0, '193.774')] [2023-03-07 17:14:10,433][232226] Updated weights for policy 0, policy_version 46480 (0.0006) [2023-03-07 17:14:11,221][232226] Updated weights for policy 0, policy_version 46490 (0.0006) [2023-03-07 17:14:12,029][232226] Updated weights for policy 0, policy_version 46500 (0.0008) [2023-03-07 17:14:12,835][232226] Updated weights for policy 0, policy_version 46510 (0.0006) [2023-03-07 17:14:13,639][232226] Updated weights for policy 0, policy_version 46520 (0.0007) [2023-03-07 17:14:14,423][232226] Updated weights for policy 0, policy_version 46530 (0.0007) [2023-03-07 17:14:15,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 47654912. Throughput: 0: 12876.9. Samples: 47619539. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:14:15,069][231894] Avg episode reward: [(0, '192.198')] [2023-03-07 17:14:15,230][232226] Updated weights for policy 0, policy_version 46540 (0.0007) [2023-03-07 17:14:16,024][232226] Updated weights for policy 0, policy_version 46550 (0.0006) [2023-03-07 17:14:16,827][232226] Updated weights for policy 0, policy_version 46560 (0.0007) [2023-03-07 17:14:17,615][232226] Updated weights for policy 0, policy_version 46570 (0.0006) [2023-03-07 17:14:18,411][232226] Updated weights for policy 0, policy_version 46580 (0.0006) [2023-03-07 17:14:19,214][232226] Updated weights for policy 0, policy_version 46590 (0.0006) [2023-03-07 17:14:20,009][232226] Updated weights for policy 0, policy_version 46600 (0.0006) [2023-03-07 17:14:20,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12878.1). Total num frames: 47718400. Throughput: 0: 12878.3. Samples: 47696783. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:14:20,069][231894] Avg episode reward: [(0, '187.117')] [2023-03-07 17:14:20,809][232226] Updated weights for policy 0, policy_version 46610 (0.0006) [2023-03-07 17:14:21,606][232226] Updated weights for policy 0, policy_version 46620 (0.0006) [2023-03-07 17:14:22,398][232226] Updated weights for policy 0, policy_version 46630 (0.0008) [2023-03-07 17:14:23,193][232226] Updated weights for policy 0, policy_version 46640 (0.0007) [2023-03-07 17:14:24,002][232226] Updated weights for policy 0, policy_version 46650 (0.0006) [2023-03-07 17:14:24,779][232226] Updated weights for policy 0, policy_version 46660 (0.0006) [2023-03-07 17:14:25,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12878.1). Total num frames: 47782912. Throughput: 0: 12873.6. Samples: 47773716. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:14:25,069][231894] Avg episode reward: [(0, '193.570')] [2023-03-07 17:14:25,074][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000046663_47782912.pth... [2023-03-07 17:14:25,106][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000043645_44692480.pth [2023-03-07 17:14:25,578][232226] Updated weights for policy 0, policy_version 46670 (0.0006) [2023-03-07 17:14:26,376][232226] Updated weights for policy 0, policy_version 46680 (0.0006) [2023-03-07 17:14:27,155][232226] Updated weights for policy 0, policy_version 46690 (0.0007) [2023-03-07 17:14:27,951][232226] Updated weights for policy 0, policy_version 46700 (0.0006) [2023-03-07 17:14:28,745][232226] Updated weights for policy 0, policy_version 46710 (0.0007) [2023-03-07 17:14:29,544][232226] Updated weights for policy 0, policy_version 46720 (0.0007) [2023-03-07 17:14:30,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12868.3, 300 sec: 12878.1). Total num frames: 47847424. Throughput: 0: 12880.6. Samples: 47812556. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:14:30,069][231894] Avg episode reward: [(0, '193.747')] [2023-03-07 17:14:30,340][232226] Updated weights for policy 0, policy_version 46730 (0.0007) [2023-03-07 17:14:31,123][232226] Updated weights for policy 0, policy_version 46740 (0.0007) [2023-03-07 17:14:31,917][232226] Updated weights for policy 0, policy_version 46750 (0.0007) [2023-03-07 17:14:32,719][232226] Updated weights for policy 0, policy_version 46760 (0.0006) [2023-03-07 17:14:33,505][232226] Updated weights for policy 0, policy_version 46770 (0.0006) [2023-03-07 17:14:34,316][232226] Updated weights for policy 0, policy_version 46780 (0.0006) [2023-03-07 17:14:35,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12881.6). Total num frames: 47911936. Throughput: 0: 12885.9. Samples: 47889848. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:14:35,069][231894] Avg episode reward: [(0, '188.374')] [2023-03-07 17:14:35,104][232226] Updated weights for policy 0, policy_version 46790 (0.0006) [2023-03-07 17:14:35,882][232226] Updated weights for policy 0, policy_version 46800 (0.0007) [2023-03-07 17:14:36,702][232226] Updated weights for policy 0, policy_version 46810 (0.0007) [2023-03-07 17:14:37,490][232226] Updated weights for policy 0, policy_version 46820 (0.0007) [2023-03-07 17:14:38,269][232226] Updated weights for policy 0, policy_version 46830 (0.0007) [2023-03-07 17:14:39,077][232226] Updated weights for policy 0, policy_version 46840 (0.0005) [2023-03-07 17:14:39,867][232226] Updated weights for policy 0, policy_version 46850 (0.0005) [2023-03-07 17:14:40,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.2, 300 sec: 12881.6). Total num frames: 47976448. Throughput: 0: 12882.5. Samples: 47967175. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:14:40,069][231894] Avg episode reward: [(0, '193.792')] [2023-03-07 17:14:40,649][232226] Updated weights for policy 0, policy_version 46860 (0.0007) [2023-03-07 17:14:41,450][232226] Updated weights for policy 0, policy_version 46870 (0.0006) [2023-03-07 17:14:42,262][232226] Updated weights for policy 0, policy_version 46880 (0.0006) [2023-03-07 17:14:43,052][232226] Updated weights for policy 0, policy_version 46890 (0.0007) [2023-03-07 17:14:43,846][232226] Updated weights for policy 0, policy_version 46900 (0.0006) [2023-03-07 17:14:44,656][232226] Updated weights for policy 0, policy_version 46910 (0.0006) [2023-03-07 17:14:45,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 48040960. Throughput: 0: 12877.8. Samples: 48005673. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:14:45,069][231894] Avg episode reward: [(0, '194.323')] [2023-03-07 17:14:45,439][232226] Updated weights for policy 0, policy_version 46920 (0.0007) [2023-03-07 17:14:46,245][232226] Updated weights for policy 0, policy_version 46930 (0.0006) [2023-03-07 17:14:47,049][232226] Updated weights for policy 0, policy_version 46940 (0.0007) [2023-03-07 17:14:47,831][232226] Updated weights for policy 0, policy_version 46950 (0.0007) [2023-03-07 17:14:48,648][232226] Updated weights for policy 0, policy_version 46960 (0.0008) [2023-03-07 17:14:49,472][232226] Updated weights for policy 0, policy_version 46970 (0.0008) [2023-03-07 17:14:50,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12868.3, 300 sec: 12881.6). Total num frames: 48104448. Throughput: 0: 12869.1. Samples: 48082795. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:14:50,069][231894] Avg episode reward: [(0, '191.456')] [2023-03-07 17:14:50,247][232226] Updated weights for policy 0, policy_version 46980 (0.0006) [2023-03-07 17:14:51,025][232226] Updated weights for policy 0, policy_version 46990 (0.0007) [2023-03-07 17:14:51,830][232226] Updated weights for policy 0, policy_version 47000 (0.0007) [2023-03-07 17:14:52,631][232226] Updated weights for policy 0, policy_version 47010 (0.0006) [2023-03-07 17:14:53,438][232226] Updated weights for policy 0, policy_version 47020 (0.0006) [2023-03-07 17:14:54,225][232226] Updated weights for policy 0, policy_version 47030 (0.0007) [2023-03-07 17:14:55,021][232226] Updated weights for policy 0, policy_version 47040 (0.0006) [2023-03-07 17:14:55,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12878.1). Total num frames: 48168960. Throughput: 0: 12860.8. Samples: 48159781. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:14:55,069][231894] Avg episode reward: [(0, '189.384')] [2023-03-07 17:14:55,816][232226] Updated weights for policy 0, policy_version 47050 (0.0006) [2023-03-07 17:14:56,603][232226] Updated weights for policy 0, policy_version 47060 (0.0007) [2023-03-07 17:14:57,396][232226] Updated weights for policy 0, policy_version 47070 (0.0006) [2023-03-07 17:14:58,174][232226] Updated weights for policy 0, policy_version 47080 (0.0006) [2023-03-07 17:14:58,985][232226] Updated weights for policy 0, policy_version 47090 (0.0006) [2023-03-07 17:14:59,777][232226] Updated weights for policy 0, policy_version 47100 (0.0006) [2023-03-07 17:15:00,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12878.1). Total num frames: 48233472. Throughput: 0: 12868.2. Samples: 48198610. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:15:00,069][231894] Avg episode reward: [(0, '190.011')] [2023-03-07 17:15:00,578][232226] Updated weights for policy 0, policy_version 47110 (0.0007) [2023-03-07 17:15:01,379][232226] Updated weights for policy 0, policy_version 47120 (0.0007) [2023-03-07 17:15:02,190][232226] Updated weights for policy 0, policy_version 47130 (0.0005) [2023-03-07 17:15:02,982][232226] Updated weights for policy 0, policy_version 47140 (0.0007) [2023-03-07 17:15:03,771][232226] Updated weights for policy 0, policy_version 47150 (0.0006) [2023-03-07 17:15:04,584][232226] Updated weights for policy 0, policy_version 47160 (0.0007) [2023-03-07 17:15:05,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12878.1). Total num frames: 48297984. Throughput: 0: 12864.0. Samples: 48275664. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:15:05,070][231894] Avg episode reward: [(0, '191.887')] [2023-03-07 17:15:05,371][232226] Updated weights for policy 0, policy_version 47170 (0.0006) [2023-03-07 17:15:06,146][232226] Updated weights for policy 0, policy_version 47180 (0.0007) [2023-03-07 17:15:06,969][232226] Updated weights for policy 0, policy_version 47190 (0.0006) [2023-03-07 17:15:07,764][232226] Updated weights for policy 0, policy_version 47200 (0.0007) [2023-03-07 17:15:08,563][232226] Updated weights for policy 0, policy_version 47210 (0.0006) [2023-03-07 17:15:09,359][232226] Updated weights for policy 0, policy_version 47220 (0.0007) [2023-03-07 17:15:10,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12874.6). Total num frames: 48361472. Throughput: 0: 12863.5. Samples: 48352574. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:15:10,069][231894] Avg episode reward: [(0, '194.919')] [2023-03-07 17:15:10,150][232226] Updated weights for policy 0, policy_version 47230 (0.0005) [2023-03-07 17:15:10,946][232226] Updated weights for policy 0, policy_version 47240 (0.0006) [2023-03-07 17:15:11,753][232226] Updated weights for policy 0, policy_version 47250 (0.0005) [2023-03-07 17:15:12,527][232226] Updated weights for policy 0, policy_version 47260 (0.0006) [2023-03-07 17:15:13,314][232226] Updated weights for policy 0, policy_version 47270 (0.0006) [2023-03-07 17:15:14,121][232226] Updated weights for policy 0, policy_version 47280 (0.0006) [2023-03-07 17:15:14,902][232226] Updated weights for policy 0, policy_version 47290 (0.0007) [2023-03-07 17:15:15,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12878.1). Total num frames: 48427008. Throughput: 0: 12862.4. Samples: 48391365. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:15:15,069][231894] Avg episode reward: [(0, '193.686')] [2023-03-07 17:15:15,730][232226] Updated weights for policy 0, policy_version 47300 (0.0006) [2023-03-07 17:15:16,527][232226] Updated weights for policy 0, policy_version 47310 (0.0006) [2023-03-07 17:15:17,314][232226] Updated weights for policy 0, policy_version 47320 (0.0006) [2023-03-07 17:15:18,114][232226] Updated weights for policy 0, policy_version 47330 (0.0006) [2023-03-07 17:15:18,921][232226] Updated weights for policy 0, policy_version 47340 (0.0006) [2023-03-07 17:15:19,702][232226] Updated weights for policy 0, policy_version 47350 (0.0006) [2023-03-07 17:15:20,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12878.1). Total num frames: 48490496. Throughput: 0: 12856.9. Samples: 48468408. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:15:20,069][231894] Avg episode reward: [(0, '194.601')] [2023-03-07 17:15:20,482][232226] Updated weights for policy 0, policy_version 47360 (0.0006) [2023-03-07 17:15:21,279][232226] Updated weights for policy 0, policy_version 47370 (0.0007) [2023-03-07 17:15:22,062][232226] Updated weights for policy 0, policy_version 47380 (0.0006) [2023-03-07 17:15:22,851][232226] Updated weights for policy 0, policy_version 47390 (0.0006) [2023-03-07 17:15:23,638][232226] Updated weights for policy 0, policy_version 47400 (0.0006) [2023-03-07 17:15:24,436][232226] Updated weights for policy 0, policy_version 47410 (0.0007) [2023-03-07 17:15:25,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12868.3, 300 sec: 12878.1). Total num frames: 48555008. Throughput: 0: 12866.3. Samples: 48546160. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:15:25,069][231894] Avg episode reward: [(0, '193.706')] [2023-03-07 17:15:25,230][232226] Updated weights for policy 0, policy_version 47420 (0.0006) [2023-03-07 17:15:26,047][232226] Updated weights for policy 0, policy_version 47430 (0.0006) [2023-03-07 17:15:26,815][232226] Updated weights for policy 0, policy_version 47440 (0.0006) [2023-03-07 17:15:27,629][232226] Updated weights for policy 0, policy_version 47450 (0.0007) [2023-03-07 17:15:28,416][232226] Updated weights for policy 0, policy_version 47460 (0.0007) [2023-03-07 17:15:29,194][232226] Updated weights for policy 0, policy_version 47470 (0.0006) [2023-03-07 17:15:30,013][232226] Updated weights for policy 0, policy_version 47480 (0.0006) [2023-03-07 17:15:30,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.2, 300 sec: 12878.1). Total num frames: 48619520. Throughput: 0: 12869.9. Samples: 48584821. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:15:30,069][231894] Avg episode reward: [(0, '192.935')] [2023-03-07 17:15:30,796][232226] Updated weights for policy 0, policy_version 47490 (0.0006) [2023-03-07 17:15:31,592][232226] Updated weights for policy 0, policy_version 47500 (0.0006) [2023-03-07 17:15:32,376][232226] Updated weights for policy 0, policy_version 47510 (0.0007) [2023-03-07 17:15:33,160][232226] Updated weights for policy 0, policy_version 47520 (0.0006) [2023-03-07 17:15:33,970][232226] Updated weights for policy 0, policy_version 47530 (0.0007) [2023-03-07 17:15:34,762][232226] Updated weights for policy 0, policy_version 47540 (0.0006) [2023-03-07 17:15:35,069][231894] Fps is (10 sec: 13004.9, 60 sec: 12885.4, 300 sec: 12885.0). Total num frames: 48685056. Throughput: 0: 12875.7. Samples: 48662200. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:15:35,069][231894] Avg episode reward: [(0, '193.645')] [2023-03-07 17:15:35,546][232226] Updated weights for policy 0, policy_version 47550 (0.0006) [2023-03-07 17:15:36,365][232226] Updated weights for policy 0, policy_version 47560 (0.0006) [2023-03-07 17:15:37,145][232226] Updated weights for policy 0, policy_version 47570 (0.0006) [2023-03-07 17:15:37,926][232226] Updated weights for policy 0, policy_version 47580 (0.0007) [2023-03-07 17:15:38,713][232226] Updated weights for policy 0, policy_version 47590 (0.0006) [2023-03-07 17:15:39,493][232226] Updated weights for policy 0, policy_version 47600 (0.0006) [2023-03-07 17:15:40,069][231894] Fps is (10 sec: 13004.8, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 48749568. Throughput: 0: 12893.7. Samples: 48739999. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:15:40,069][231894] Avg episode reward: [(0, '194.093')] [2023-03-07 17:15:40,282][232226] Updated weights for policy 0, policy_version 47610 (0.0006) [2023-03-07 17:15:41,073][232226] Updated weights for policy 0, policy_version 47620 (0.0006) [2023-03-07 17:15:41,870][232226] Updated weights for policy 0, policy_version 47630 (0.0007) [2023-03-07 17:15:42,677][232226] Updated weights for policy 0, policy_version 47640 (0.0006) [2023-03-07 17:15:43,453][232226] Updated weights for policy 0, policy_version 47650 (0.0007) [2023-03-07 17:15:44,244][232226] Updated weights for policy 0, policy_version 47660 (0.0006) [2023-03-07 17:15:45,045][232226] Updated weights for policy 0, policy_version 47670 (0.0006) [2023-03-07 17:15:45,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12885.0). Total num frames: 48814080. Throughput: 0: 12891.6. Samples: 48778733. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:15:45,069][231894] Avg episode reward: [(0, '195.331')] [2023-03-07 17:15:45,850][232226] Updated weights for policy 0, policy_version 47680 (0.0006) [2023-03-07 17:15:46,625][232226] Updated weights for policy 0, policy_version 47690 (0.0006) [2023-03-07 17:15:47,409][232226] Updated weights for policy 0, policy_version 47700 (0.0006) [2023-03-07 17:15:48,216][232226] Updated weights for policy 0, policy_version 47710 (0.0006) [2023-03-07 17:15:49,029][232226] Updated weights for policy 0, policy_version 47720 (0.0006) [2023-03-07 17:15:49,814][232226] Updated weights for policy 0, policy_version 47730 (0.0007) [2023-03-07 17:15:50,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12881.6). Total num frames: 48878592. Throughput: 0: 12901.6. Samples: 48856235. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:15:50,070][231894] Avg episode reward: [(0, '190.444')] [2023-03-07 17:15:50,624][232226] Updated weights for policy 0, policy_version 47740 (0.0007) [2023-03-07 17:15:51,412][232226] Updated weights for policy 0, policy_version 47750 (0.0006) [2023-03-07 17:15:52,219][232226] Updated weights for policy 0, policy_version 47760 (0.0006) [2023-03-07 17:15:53,020][232226] Updated weights for policy 0, policy_version 47770 (0.0006) [2023-03-07 17:15:53,803][232226] Updated weights for policy 0, policy_version 47780 (0.0006) [2023-03-07 17:15:54,596][232226] Updated weights for policy 0, policy_version 47790 (0.0007) [2023-03-07 17:15:55,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 48942080. Throughput: 0: 12904.8. Samples: 48933291. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:15:55,069][231894] Avg episode reward: [(0, '188.960')] [2023-03-07 17:15:55,397][232226] Updated weights for policy 0, policy_version 47800 (0.0006) [2023-03-07 17:15:56,173][232226] Updated weights for policy 0, policy_version 47810 (0.0006) [2023-03-07 17:15:56,968][232226] Updated weights for policy 0, policy_version 47820 (0.0006) [2023-03-07 17:15:57,753][232226] Updated weights for policy 0, policy_version 47830 (0.0007) [2023-03-07 17:15:58,561][232226] Updated weights for policy 0, policy_version 47840 (0.0006) [2023-03-07 17:15:59,339][232226] Updated weights for policy 0, policy_version 47850 (0.0006) [2023-03-07 17:16:00,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12902.4, 300 sec: 12885.1). Total num frames: 49007616. Throughput: 0: 12906.8. Samples: 48972171. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:16:00,069][231894] Avg episode reward: [(0, '188.422')] [2023-03-07 17:16:00,137][232226] Updated weights for policy 0, policy_version 47860 (0.0006) [2023-03-07 17:16:00,925][232226] Updated weights for policy 0, policy_version 47870 (0.0006) [2023-03-07 17:16:01,716][232226] Updated weights for policy 0, policy_version 47880 (0.0006) [2023-03-07 17:16:02,504][232226] Updated weights for policy 0, policy_version 47890 (0.0007) [2023-03-07 17:16:03,322][232226] Updated weights for policy 0, policy_version 47900 (0.0006) [2023-03-07 17:16:04,124][232226] Updated weights for policy 0, policy_version 47910 (0.0006) [2023-03-07 17:16:04,890][232226] Updated weights for policy 0, policy_version 47920 (0.0006) [2023-03-07 17:16:05,069][231894] Fps is (10 sec: 13004.8, 60 sec: 12902.4, 300 sec: 12885.0). Total num frames: 49072128. Throughput: 0: 12915.7. Samples: 49049612. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:16:05,069][231894] Avg episode reward: [(0, '189.689')] [2023-03-07 17:16:05,703][232226] Updated weights for policy 0, policy_version 47930 (0.0006) [2023-03-07 17:16:06,501][232226] Updated weights for policy 0, policy_version 47940 (0.0006) [2023-03-07 17:16:07,274][232226] Updated weights for policy 0, policy_version 47950 (0.0006) [2023-03-07 17:16:08,099][232226] Updated weights for policy 0, policy_version 47960 (0.0006) [2023-03-07 17:16:08,898][232226] Updated weights for policy 0, policy_version 47970 (0.0006) [2023-03-07 17:16:09,675][232226] Updated weights for policy 0, policy_version 47980 (0.0007) [2023-03-07 17:16:10,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12919.5, 300 sec: 12885.0). Total num frames: 49136640. Throughput: 0: 12904.1. Samples: 49126843. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:16:10,069][231894] Avg episode reward: [(0, '193.257')] [2023-03-07 17:16:10,476][232226] Updated weights for policy 0, policy_version 47990 (0.0006) [2023-03-07 17:16:11,278][232226] Updated weights for policy 0, policy_version 48000 (0.0006) [2023-03-07 17:16:12,076][232226] Updated weights for policy 0, policy_version 48010 (0.0006) [2023-03-07 17:16:12,859][232226] Updated weights for policy 0, policy_version 48020 (0.0007) [2023-03-07 17:16:13,663][232226] Updated weights for policy 0, policy_version 48030 (0.0006) [2023-03-07 17:16:14,457][232226] Updated weights for policy 0, policy_version 48040 (0.0007) [2023-03-07 17:16:15,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 49200128. Throughput: 0: 12899.0. Samples: 49165278. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:16:15,069][231894] Avg episode reward: [(0, '191.075')] [2023-03-07 17:16:15,249][232226] Updated weights for policy 0, policy_version 48050 (0.0007) [2023-03-07 17:16:16,047][232226] Updated weights for policy 0, policy_version 48060 (0.0006) [2023-03-07 17:16:16,841][232226] Updated weights for policy 0, policy_version 48070 (0.0005) [2023-03-07 17:16:17,636][232226] Updated weights for policy 0, policy_version 48080 (0.0007) [2023-03-07 17:16:18,426][232226] Updated weights for policy 0, policy_version 48090 (0.0007) [2023-03-07 17:16:19,233][232226] Updated weights for policy 0, policy_version 48100 (0.0007) [2023-03-07 17:16:20,030][232226] Updated weights for policy 0, policy_version 48110 (0.0006) [2023-03-07 17:16:20,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12902.4, 300 sec: 12881.6). Total num frames: 49264640. Throughput: 0: 12896.1. Samples: 49242526. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:16:20,069][231894] Avg episode reward: [(0, '189.030')] [2023-03-07 17:16:20,833][232226] Updated weights for policy 0, policy_version 48120 (0.0007) [2023-03-07 17:16:21,635][232226] Updated weights for policy 0, policy_version 48130 (0.0006) [2023-03-07 17:16:22,413][232226] Updated weights for policy 0, policy_version 48140 (0.0006) [2023-03-07 17:16:23,220][232226] Updated weights for policy 0, policy_version 48150 (0.0006) [2023-03-07 17:16:24,017][232226] Updated weights for policy 0, policy_version 48160 (0.0006) [2023-03-07 17:16:24,810][232226] Updated weights for policy 0, policy_version 48170 (0.0007) [2023-03-07 17:16:25,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12881.6). Total num frames: 49329152. Throughput: 0: 12880.5. Samples: 49319623. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:16:25,069][231894] Avg episode reward: [(0, '193.936')] [2023-03-07 17:16:25,073][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000048173_49329152.pth... [2023-03-07 17:16:25,102][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000045155_46238720.pth [2023-03-07 17:16:25,600][232226] Updated weights for policy 0, policy_version 48180 (0.0006) [2023-03-07 17:16:26,404][232226] Updated weights for policy 0, policy_version 48190 (0.0007) [2023-03-07 17:16:27,182][232226] Updated weights for policy 0, policy_version 48200 (0.0006) [2023-03-07 17:16:27,987][232226] Updated weights for policy 0, policy_version 48210 (0.0006) [2023-03-07 17:16:28,777][232226] Updated weights for policy 0, policy_version 48220 (0.0007) [2023-03-07 17:16:29,583][232226] Updated weights for policy 0, policy_version 48230 (0.0006) [2023-03-07 17:16:30,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12881.6). Total num frames: 49393664. Throughput: 0: 12880.3. Samples: 49358348. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:16:30,069][231894] Avg episode reward: [(0, '188.143')] [2023-03-07 17:16:30,361][232226] Updated weights for policy 0, policy_version 48240 (0.0006) [2023-03-07 17:16:31,159][232226] Updated weights for policy 0, policy_version 48250 (0.0006) [2023-03-07 17:16:31,944][232226] Updated weights for policy 0, policy_version 48260 (0.0006) [2023-03-07 17:16:32,721][232226] Updated weights for policy 0, policy_version 48270 (0.0006) [2023-03-07 17:16:33,524][232226] Updated weights for policy 0, policy_version 48280 (0.0006) [2023-03-07 17:16:34,320][232226] Updated weights for policy 0, policy_version 48290 (0.0006) [2023-03-07 17:16:35,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 49458176. Throughput: 0: 12882.6. Samples: 49435952. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:16:35,070][231894] Avg episode reward: [(0, '196.756')] [2023-03-07 17:16:35,101][232226] Updated weights for policy 0, policy_version 48300 (0.0006) [2023-03-07 17:16:35,905][232226] Updated weights for policy 0, policy_version 48310 (0.0006) [2023-03-07 17:16:36,702][232226] Updated weights for policy 0, policy_version 48320 (0.0005) [2023-03-07 17:16:37,470][232226] Updated weights for policy 0, policy_version 48330 (0.0007) [2023-03-07 17:16:38,268][232226] Updated weights for policy 0, policy_version 48340 (0.0006) [2023-03-07 17:16:39,066][232226] Updated weights for policy 0, policy_version 48350 (0.0007) [2023-03-07 17:16:39,850][232226] Updated weights for policy 0, policy_version 48360 (0.0007) [2023-03-07 17:16:40,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 49522688. Throughput: 0: 12896.3. Samples: 49513626. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:16:40,080][231894] Avg episode reward: [(0, '188.834')] [2023-03-07 17:16:40,669][232226] Updated weights for policy 0, policy_version 48370 (0.0007) [2023-03-07 17:16:41,450][232226] Updated weights for policy 0, policy_version 48380 (0.0006) [2023-03-07 17:16:42,220][232226] Updated weights for policy 0, policy_version 48390 (0.0007) [2023-03-07 17:16:43,022][232226] Updated weights for policy 0, policy_version 48400 (0.0006) [2023-03-07 17:16:43,838][232226] Updated weights for policy 0, policy_version 48410 (0.0007) [2023-03-07 17:16:44,631][232226] Updated weights for policy 0, policy_version 48420 (0.0007) [2023-03-07 17:16:45,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 49587200. Throughput: 0: 12892.1. Samples: 49552314. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:16:45,080][231894] Avg episode reward: [(0, '194.002')] [2023-03-07 17:16:45,440][232226] Updated weights for policy 0, policy_version 48430 (0.0006) [2023-03-07 17:16:46,247][232226] Updated weights for policy 0, policy_version 48440 (0.0007) [2023-03-07 17:16:47,025][232226] Updated weights for policy 0, policy_version 48450 (0.0006) [2023-03-07 17:16:47,815][232226] Updated weights for policy 0, policy_version 48460 (0.0007) [2023-03-07 17:16:48,618][232226] Updated weights for policy 0, policy_version 48470 (0.0006) [2023-03-07 17:16:49,395][232226] Updated weights for policy 0, policy_version 48480 (0.0006) [2023-03-07 17:16:50,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 49651712. Throughput: 0: 12885.8. Samples: 49629472. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:16:50,080][231894] Avg episode reward: [(0, '186.208')] [2023-03-07 17:16:50,172][232226] Updated weights for policy 0, policy_version 48490 (0.0007) [2023-03-07 17:16:50,968][232226] Updated weights for policy 0, policy_version 48500 (0.0006) [2023-03-07 17:16:51,769][232226] Updated weights for policy 0, policy_version 48510 (0.0006) [2023-03-07 17:16:52,575][232226] Updated weights for policy 0, policy_version 48520 (0.0007) [2023-03-07 17:16:53,371][232226] Updated weights for policy 0, policy_version 48530 (0.0006) [2023-03-07 17:16:54,174][232226] Updated weights for policy 0, policy_version 48540 (0.0006) [2023-03-07 17:16:54,965][232226] Updated weights for policy 0, policy_version 48550 (0.0006) [2023-03-07 17:16:55,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12881.6). Total num frames: 49716224. Throughput: 0: 12890.7. Samples: 49706924. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:16:55,080][231894] Avg episode reward: [(0, '192.180')] [2023-03-07 17:16:55,759][232226] Updated weights for policy 0, policy_version 48560 (0.0006) [2023-03-07 17:16:56,569][232226] Updated weights for policy 0, policy_version 48570 (0.0006) [2023-03-07 17:16:57,356][232226] Updated weights for policy 0, policy_version 48580 (0.0006) [2023-03-07 17:16:58,142][232226] Updated weights for policy 0, policy_version 48590 (0.0008) [2023-03-07 17:16:58,941][232226] Updated weights for policy 0, policy_version 48600 (0.0006) [2023-03-07 17:16:59,715][232226] Updated weights for policy 0, policy_version 48610 (0.0006) [2023-03-07 17:17:00,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 49780736. Throughput: 0: 12890.0. Samples: 49745325. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:17:00,069][231894] Avg episode reward: [(0, '187.808')] [2023-03-07 17:17:00,521][232226] Updated weights for policy 0, policy_version 48620 (0.0006) [2023-03-07 17:17:01,326][232226] Updated weights for policy 0, policy_version 48630 (0.0006) [2023-03-07 17:17:02,121][232226] Updated weights for policy 0, policy_version 48640 (0.0006) [2023-03-07 17:17:02,906][232226] Updated weights for policy 0, policy_version 48650 (0.0007) [2023-03-07 17:17:03,696][232226] Updated weights for policy 0, policy_version 48660 (0.0006) [2023-03-07 17:17:04,486][232226] Updated weights for policy 0, policy_version 48670 (0.0007) [2023-03-07 17:17:05,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 49845248. Throughput: 0: 12896.2. Samples: 49822856. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:17:05,069][231894] Avg episode reward: [(0, '193.804')] [2023-03-07 17:17:05,291][232226] Updated weights for policy 0, policy_version 48680 (0.0006) [2023-03-07 17:17:06,082][232226] Updated weights for policy 0, policy_version 48690 (0.0006) [2023-03-07 17:17:06,890][232226] Updated weights for policy 0, policy_version 48700 (0.0006) [2023-03-07 17:17:07,713][232226] Updated weights for policy 0, policy_version 48710 (0.0006) [2023-03-07 17:17:08,490][232226] Updated weights for policy 0, policy_version 48720 (0.0006) [2023-03-07 17:17:09,293][232226] Updated weights for policy 0, policy_version 48730 (0.0007) [2023-03-07 17:17:10,066][232226] Updated weights for policy 0, policy_version 48740 (0.0006) [2023-03-07 17:17:10,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 49909760. Throughput: 0: 12893.0. Samples: 49899807. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:17:10,069][231894] Avg episode reward: [(0, '184.208')] [2023-03-07 17:17:10,873][232226] Updated weights for policy 0, policy_version 48750 (0.0007) [2023-03-07 17:17:11,660][232226] Updated weights for policy 0, policy_version 48760 (0.0005) [2023-03-07 17:17:12,457][232226] Updated weights for policy 0, policy_version 48770 (0.0007) [2023-03-07 17:17:13,250][232226] Updated weights for policy 0, policy_version 48780 (0.0007) [2023-03-07 17:17:14,040][232226] Updated weights for policy 0, policy_version 48790 (0.0006) [2023-03-07 17:17:14,846][232226] Updated weights for policy 0, policy_version 48800 (0.0007) [2023-03-07 17:17:15,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12885.4, 300 sec: 12878.1). Total num frames: 49973248. Throughput: 0: 12893.1. Samples: 49938536. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:17:15,069][231894] Avg episode reward: [(0, '189.768')] [2023-03-07 17:17:15,642][232226] Updated weights for policy 0, policy_version 48810 (0.0006) [2023-03-07 17:17:16,437][232226] Updated weights for policy 0, policy_version 48820 (0.0006) [2023-03-07 17:17:17,236][232226] Updated weights for policy 0, policy_version 48830 (0.0006) [2023-03-07 17:17:18,033][232226] Updated weights for policy 0, policy_version 48840 (0.0006) [2023-03-07 17:17:18,829][232226] Updated weights for policy 0, policy_version 48850 (0.0007) [2023-03-07 17:17:19,622][232226] Updated weights for policy 0, policy_version 48860 (0.0007) [2023-03-07 17:17:20,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12885.3, 300 sec: 12878.1). Total num frames: 50037760. Throughput: 0: 12880.1. Samples: 50015554. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:17:20,070][231894] Avg episode reward: [(0, '192.122')] [2023-03-07 17:17:20,417][232226] Updated weights for policy 0, policy_version 48870 (0.0006) [2023-03-07 17:17:21,202][232226] Updated weights for policy 0, policy_version 48880 (0.0006) [2023-03-07 17:17:21,993][232226] Updated weights for policy 0, policy_version 48890 (0.0006) [2023-03-07 17:17:22,790][232226] Updated weights for policy 0, policy_version 48900 (0.0006) [2023-03-07 17:17:23,570][232226] Updated weights for policy 0, policy_version 48910 (0.0006) [2023-03-07 17:17:24,373][232226] Updated weights for policy 0, policy_version 48920 (0.0005) [2023-03-07 17:17:25,069][231894] Fps is (10 sec: 12902.2, 60 sec: 12885.3, 300 sec: 12878.1). Total num frames: 50102272. Throughput: 0: 12880.8. Samples: 50093261. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:17:25,069][231894] Avg episode reward: [(0, '188.443')] [2023-03-07 17:17:25,169][232226] Updated weights for policy 0, policy_version 48930 (0.0007) [2023-03-07 17:17:25,953][232226] Updated weights for policy 0, policy_version 48940 (0.0006) [2023-03-07 17:17:26,748][232226] Updated weights for policy 0, policy_version 48950 (0.0006) [2023-03-07 17:17:27,548][232226] Updated weights for policy 0, policy_version 48960 (0.0007) [2023-03-07 17:17:28,341][232226] Updated weights for policy 0, policy_version 48970 (0.0006) [2023-03-07 17:17:29,141][232226] Updated weights for policy 0, policy_version 48980 (0.0006) [2023-03-07 17:17:29,949][232226] Updated weights for policy 0, policy_version 48990 (0.0006) [2023-03-07 17:17:30,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12878.1). Total num frames: 50166784. Throughput: 0: 12882.1. Samples: 50132009. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:17:30,069][231894] Avg episode reward: [(0, '190.578')] [2023-03-07 17:17:30,717][232226] Updated weights for policy 0, policy_version 49000 (0.0006) [2023-03-07 17:17:31,497][232226] Updated weights for policy 0, policy_version 49010 (0.0006) [2023-03-07 17:17:32,309][232226] Updated weights for policy 0, policy_version 49020 (0.0006) [2023-03-07 17:17:33,107][232226] Updated weights for policy 0, policy_version 49030 (0.0006) [2023-03-07 17:17:33,900][232226] Updated weights for policy 0, policy_version 49040 (0.0007) [2023-03-07 17:17:34,686][232226] Updated weights for policy 0, policy_version 49050 (0.0006) [2023-03-07 17:17:35,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.4, 300 sec: 12881.6). Total num frames: 50231296. Throughput: 0: 12886.1. Samples: 50209344. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:17:35,069][231894] Avg episode reward: [(0, '186.970')] [2023-03-07 17:17:35,488][232226] Updated weights for policy 0, policy_version 49060 (0.0007) [2023-03-07 17:17:36,282][232226] Updated weights for policy 0, policy_version 49070 (0.0008) [2023-03-07 17:17:37,058][232226] Updated weights for policy 0, policy_version 49080 (0.0006) [2023-03-07 17:17:37,863][232226] Updated weights for policy 0, policy_version 49090 (0.0007) [2023-03-07 17:17:38,658][232226] Updated weights for policy 0, policy_version 49100 (0.0006) [2023-03-07 17:17:39,459][232226] Updated weights for policy 0, policy_version 49110 (0.0006) [2023-03-07 17:17:40,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 50295808. Throughput: 0: 12884.7. Samples: 50286735. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:17:40,070][231894] Avg episode reward: [(0, '189.104')] [2023-03-07 17:17:40,250][232226] Updated weights for policy 0, policy_version 49120 (0.0006) [2023-03-07 17:17:41,042][232226] Updated weights for policy 0, policy_version 49130 (0.0007) [2023-03-07 17:17:41,838][232226] Updated weights for policy 0, policy_version 49140 (0.0006) [2023-03-07 17:17:42,625][232226] Updated weights for policy 0, policy_version 49150 (0.0007) [2023-03-07 17:17:43,431][232226] Updated weights for policy 0, policy_version 49160 (0.0006) [2023-03-07 17:17:44,228][232226] Updated weights for policy 0, policy_version 49170 (0.0007) [2023-03-07 17:17:45,008][232226] Updated weights for policy 0, policy_version 49180 (0.0006) [2023-03-07 17:17:45,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 50360320. Throughput: 0: 12893.3. Samples: 50325526. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:17:45,069][231894] Avg episode reward: [(0, '190.531')] [2023-03-07 17:17:45,815][232226] Updated weights for policy 0, policy_version 49190 (0.0008) [2023-03-07 17:17:46,620][232226] Updated weights for policy 0, policy_version 49200 (0.0007) [2023-03-07 17:17:47,406][232226] Updated weights for policy 0, policy_version 49210 (0.0006) [2023-03-07 17:17:48,200][232226] Updated weights for policy 0, policy_version 49220 (0.0008) [2023-03-07 17:17:48,994][232226] Updated weights for policy 0, policy_version 49230 (0.0006) [2023-03-07 17:17:49,783][232226] Updated weights for policy 0, policy_version 49240 (0.0006) [2023-03-07 17:17:50,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12885.0). Total num frames: 50424832. Throughput: 0: 12886.8. Samples: 50402765. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:17:50,069][231894] Avg episode reward: [(0, '189.653')] [2023-03-07 17:17:50,570][232226] Updated weights for policy 0, policy_version 49250 (0.0007) [2023-03-07 17:17:51,376][232226] Updated weights for policy 0, policy_version 49260 (0.0006) [2023-03-07 17:17:52,166][232226] Updated weights for policy 0, policy_version 49270 (0.0006) [2023-03-07 17:17:52,946][232226] Updated weights for policy 0, policy_version 49280 (0.0007) [2023-03-07 17:17:53,745][232226] Updated weights for policy 0, policy_version 49290 (0.0006) [2023-03-07 17:17:54,564][232226] Updated weights for policy 0, policy_version 49300 (0.0007) [2023-03-07 17:17:55,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12885.0). Total num frames: 50489344. Throughput: 0: 12896.4. Samples: 50480145. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:17:55,080][231894] Avg episode reward: [(0, '191.255')] [2023-03-07 17:17:55,364][232226] Updated weights for policy 0, policy_version 49310 (0.0006) [2023-03-07 17:17:56,147][232226] Updated weights for policy 0, policy_version 49320 (0.0007) [2023-03-07 17:17:56,938][232226] Updated weights for policy 0, policy_version 49330 (0.0006) [2023-03-07 17:17:57,735][232226] Updated weights for policy 0, policy_version 49340 (0.0005) [2023-03-07 17:17:58,522][232226] Updated weights for policy 0, policy_version 49350 (0.0006) [2023-03-07 17:17:59,331][232226] Updated weights for policy 0, policy_version 49360 (0.0006) [2023-03-07 17:18:00,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12885.0). Total num frames: 50553856. Throughput: 0: 12893.5. Samples: 50518743. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:18:00,080][231894] Avg episode reward: [(0, '185.881')] [2023-03-07 17:18:00,148][232226] Updated weights for policy 0, policy_version 49370 (0.0007) [2023-03-07 17:18:00,946][232226] Updated weights for policy 0, policy_version 49380 (0.0006) [2023-03-07 17:18:01,750][232226] Updated weights for policy 0, policy_version 49390 (0.0007) [2023-03-07 17:18:02,535][232226] Updated weights for policy 0, policy_version 49400 (0.0007) [2023-03-07 17:18:03,327][232226] Updated weights for policy 0, policy_version 49410 (0.0006) [2023-03-07 17:18:04,125][232226] Updated weights for policy 0, policy_version 49420 (0.0007) [2023-03-07 17:18:04,918][232226] Updated weights for policy 0, policy_version 49430 (0.0006) [2023-03-07 17:18:05,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12881.6). Total num frames: 50617344. Throughput: 0: 12888.1. Samples: 50595520. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:18:05,080][231894] Avg episode reward: [(0, '189.166')] [2023-03-07 17:18:05,712][232226] Updated weights for policy 0, policy_version 49440 (0.0006) [2023-03-07 17:18:06,508][232226] Updated weights for policy 0, policy_version 49450 (0.0007) [2023-03-07 17:18:07,300][232226] Updated weights for policy 0, policy_version 49460 (0.0006) [2023-03-07 17:18:08,097][232226] Updated weights for policy 0, policy_version 49470 (0.0006) [2023-03-07 17:18:08,884][232226] Updated weights for policy 0, policy_version 49480 (0.0006) [2023-03-07 17:18:09,676][232226] Updated weights for policy 0, policy_version 49490 (0.0006) [2023-03-07 17:18:10,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12885.3, 300 sec: 12885.1). Total num frames: 50682880. Throughput: 0: 12884.0. Samples: 50673038. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:18:10,080][231894] Avg episode reward: [(0, '197.881')] [2023-03-07 17:18:10,461][232226] Updated weights for policy 0, policy_version 49500 (0.0007) [2023-03-07 17:18:11,261][232226] Updated weights for policy 0, policy_version 49510 (0.0006) [2023-03-07 17:18:12,054][232226] Updated weights for policy 0, policy_version 49520 (0.0007) [2023-03-07 17:18:12,853][232226] Updated weights for policy 0, policy_version 49530 (0.0006) [2023-03-07 17:18:13,666][232226] Updated weights for policy 0, policy_version 49540 (0.0007) [2023-03-07 17:18:14,464][232226] Updated weights for policy 0, policy_version 49550 (0.0007) [2023-03-07 17:18:15,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 50746368. Throughput: 0: 12884.7. Samples: 50711823. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:18:15,080][231894] Avg episode reward: [(0, '198.621')] [2023-03-07 17:18:15,234][232226] Updated weights for policy 0, policy_version 49560 (0.0006) [2023-03-07 17:18:16,031][232226] Updated weights for policy 0, policy_version 49570 (0.0006) [2023-03-07 17:18:16,826][232226] Updated weights for policy 0, policy_version 49580 (0.0006) [2023-03-07 17:18:17,624][232226] Updated weights for policy 0, policy_version 49590 (0.0006) [2023-03-07 17:18:18,414][232226] Updated weights for policy 0, policy_version 49600 (0.0007) [2023-03-07 17:18:19,208][232226] Updated weights for policy 0, policy_version 49610 (0.0006) [2023-03-07 17:18:20,008][232226] Updated weights for policy 0, policy_version 49620 (0.0006) [2023-03-07 17:18:20,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 50810880. Throughput: 0: 12881.7. Samples: 50789021. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:18:20,069][231894] Avg episode reward: [(0, '188.337')] [2023-03-07 17:18:20,790][232226] Updated weights for policy 0, policy_version 49630 (0.0006) [2023-03-07 17:18:21,598][232226] Updated weights for policy 0, policy_version 49640 (0.0006) [2023-03-07 17:18:22,396][232226] Updated weights for policy 0, policy_version 49650 (0.0006) [2023-03-07 17:18:23,198][232226] Updated weights for policy 0, policy_version 49660 (0.0006) [2023-03-07 17:18:23,986][232226] Updated weights for policy 0, policy_version 49670 (0.0006) [2023-03-07 17:18:24,772][232226] Updated weights for policy 0, policy_version 49680 (0.0006) [2023-03-07 17:18:25,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 50875392. Throughput: 0: 12877.2. Samples: 50866210. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:18:25,070][231894] Avg episode reward: [(0, '193.714')] [2023-03-07 17:18:25,083][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000049684_50876416.pth... [2023-03-07 17:18:25,112][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000046663_47782912.pth [2023-03-07 17:18:25,568][232226] Updated weights for policy 0, policy_version 49690 (0.0006) [2023-03-07 17:18:26,385][232226] Updated weights for policy 0, policy_version 49700 (0.0007) [2023-03-07 17:18:27,183][232226] Updated weights for policy 0, policy_version 49710 (0.0005) [2023-03-07 17:18:27,984][232226] Updated weights for policy 0, policy_version 49720 (0.0007) [2023-03-07 17:18:28,765][232226] Updated weights for policy 0, policy_version 49730 (0.0006) [2023-03-07 17:18:29,550][232226] Updated weights for policy 0, policy_version 49740 (0.0006) [2023-03-07 17:18:30,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 50939904. Throughput: 0: 12869.3. Samples: 50904647. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:18:30,070][231894] Avg episode reward: [(0, '184.716')] [2023-03-07 17:18:30,360][232226] Updated weights for policy 0, policy_version 49750 (0.0007) [2023-03-07 17:18:31,141][232226] Updated weights for policy 0, policy_version 49760 (0.0005) [2023-03-07 17:18:31,961][232226] Updated weights for policy 0, policy_version 49770 (0.0006) [2023-03-07 17:18:32,753][232226] Updated weights for policy 0, policy_version 49780 (0.0008) [2023-03-07 17:18:33,553][232226] Updated weights for policy 0, policy_version 49790 (0.0006) [2023-03-07 17:18:34,338][232226] Updated weights for policy 0, policy_version 49800 (0.0006) [2023-03-07 17:18:35,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 51004416. Throughput: 0: 12873.4. Samples: 50982066. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:18:35,069][231894] Avg episode reward: [(0, '197.814')] [2023-03-07 17:18:35,140][232226] Updated weights for policy 0, policy_version 49810 (0.0007) [2023-03-07 17:18:35,926][232226] Updated weights for policy 0, policy_version 49820 (0.0007) [2023-03-07 17:18:36,724][232226] Updated weights for policy 0, policy_version 49830 (0.0007) [2023-03-07 17:18:37,525][232226] Updated weights for policy 0, policy_version 49840 (0.0006) [2023-03-07 17:18:38,317][232226] Updated weights for policy 0, policy_version 49850 (0.0007) [2023-03-07 17:18:39,128][232226] Updated weights for policy 0, policy_version 49860 (0.0006) [2023-03-07 17:18:39,934][232226] Updated weights for policy 0, policy_version 49870 (0.0006) [2023-03-07 17:18:40,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12881.6). Total num frames: 51067904. Throughput: 0: 12865.1. Samples: 51059076. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:18:40,069][231894] Avg episode reward: [(0, '196.416')] [2023-03-07 17:18:40,698][232226] Updated weights for policy 0, policy_version 49880 (0.0006) [2023-03-07 17:18:41,514][232226] Updated weights for policy 0, policy_version 49890 (0.0008) [2023-03-07 17:18:42,305][232226] Updated weights for policy 0, policy_version 49900 (0.0007) [2023-03-07 17:18:43,087][232226] Updated weights for policy 0, policy_version 49910 (0.0007) [2023-03-07 17:18:43,908][232226] Updated weights for policy 0, policy_version 49920 (0.0006) [2023-03-07 17:18:44,684][232226] Updated weights for policy 0, policy_version 49930 (0.0006) [2023-03-07 17:18:45,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12881.6). Total num frames: 51132416. Throughput: 0: 12863.8. Samples: 51097615. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 17:18:45,069][231894] Avg episode reward: [(0, '183.869')] [2023-03-07 17:18:45,493][232226] Updated weights for policy 0, policy_version 49940 (0.0006) [2023-03-07 17:18:46,283][232226] Updated weights for policy 0, policy_version 49950 (0.0006) [2023-03-07 17:18:47,075][232226] Updated weights for policy 0, policy_version 49960 (0.0006) [2023-03-07 17:18:47,882][232226] Updated weights for policy 0, policy_version 49970 (0.0006) [2023-03-07 17:18:48,674][232226] Updated weights for policy 0, policy_version 49980 (0.0006) [2023-03-07 17:18:49,463][232226] Updated weights for policy 0, policy_version 49990 (0.0007) [2023-03-07 17:18:50,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12881.6). Total num frames: 51196928. Throughput: 0: 12870.0. Samples: 51174670. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 17:18:50,069][231894] Avg episode reward: [(0, '198.451')] [2023-03-07 17:18:50,254][232226] Updated weights for policy 0, policy_version 50000 (0.0007) [2023-03-07 17:18:51,037][232226] Updated weights for policy 0, policy_version 50010 (0.0006) [2023-03-07 17:18:51,848][232226] Updated weights for policy 0, policy_version 50020 (0.0007) [2023-03-07 17:18:52,634][232226] Updated weights for policy 0, policy_version 50030 (0.0006) [2023-03-07 17:18:53,430][232226] Updated weights for policy 0, policy_version 50040 (0.0007) [2023-03-07 17:18:54,226][232226] Updated weights for policy 0, policy_version 50050 (0.0006) [2023-03-07 17:18:55,018][232226] Updated weights for policy 0, policy_version 50060 (0.0006) [2023-03-07 17:18:55,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12881.6). Total num frames: 51261440. Throughput: 0: 12874.4. Samples: 51252389. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 17:18:55,069][231894] Avg episode reward: [(0, '190.025')] [2023-03-07 17:18:55,810][232226] Updated weights for policy 0, policy_version 50070 (0.0007) [2023-03-07 17:18:56,607][232226] Updated weights for policy 0, policy_version 50080 (0.0006) [2023-03-07 17:18:57,395][232226] Updated weights for policy 0, policy_version 50090 (0.0006) [2023-03-07 17:18:58,197][232226] Updated weights for policy 0, policy_version 50100 (0.0007) [2023-03-07 17:18:58,988][232226] Updated weights for policy 0, policy_version 50110 (0.0005) [2023-03-07 17:18:59,794][232226] Updated weights for policy 0, policy_version 50120 (0.0006) [2023-03-07 17:19:00,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12881.6). Total num frames: 51325952. Throughput: 0: 12868.5. Samples: 51290904. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 17:19:00,070][231894] Avg episode reward: [(0, '196.725')] [2023-03-07 17:19:00,578][232226] Updated weights for policy 0, policy_version 50130 (0.0006) [2023-03-07 17:19:01,363][232226] Updated weights for policy 0, policy_version 50140 (0.0006) [2023-03-07 17:19:02,154][232226] Updated weights for policy 0, policy_version 50150 (0.0006) [2023-03-07 17:19:02,960][232226] Updated weights for policy 0, policy_version 50160 (0.0007) [2023-03-07 17:19:03,749][232226] Updated weights for policy 0, policy_version 50170 (0.0007) [2023-03-07 17:19:04,555][232226] Updated weights for policy 0, policy_version 50180 (0.0005) [2023-03-07 17:19:05,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 51390464. Throughput: 0: 12871.9. Samples: 51368258. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 17:19:05,069][231894] Avg episode reward: [(0, '189.838')] [2023-03-07 17:19:05,345][232226] Updated weights for policy 0, policy_version 50190 (0.0007) [2023-03-07 17:19:06,150][232226] Updated weights for policy 0, policy_version 50200 (0.0007) [2023-03-07 17:19:06,946][232226] Updated weights for policy 0, policy_version 50210 (0.0006) [2023-03-07 17:19:07,744][232226] Updated weights for policy 0, policy_version 50220 (0.0006) [2023-03-07 17:19:08,540][232226] Updated weights for policy 0, policy_version 50230 (0.0007) [2023-03-07 17:19:09,332][232226] Updated weights for policy 0, policy_version 50240 (0.0006) [2023-03-07 17:19:10,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.2, 300 sec: 12881.6). Total num frames: 51454976. Throughput: 0: 12871.3. Samples: 51445417. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 17:19:10,069][231894] Avg episode reward: [(0, '186.868')] [2023-03-07 17:19:10,133][232226] Updated weights for policy 0, policy_version 50250 (0.0006) [2023-03-07 17:19:10,948][232226] Updated weights for policy 0, policy_version 50260 (0.0006) [2023-03-07 17:19:11,740][232226] Updated weights for policy 0, policy_version 50270 (0.0006) [2023-03-07 17:19:12,533][232226] Updated weights for policy 0, policy_version 50280 (0.0006) [2023-03-07 17:19:13,317][232226] Updated weights for policy 0, policy_version 50290 (0.0007) [2023-03-07 17:19:14,123][232226] Updated weights for policy 0, policy_version 50300 (0.0007) [2023-03-07 17:19:14,901][232226] Updated weights for policy 0, policy_version 50310 (0.0006) [2023-03-07 17:19:15,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12885.0). Total num frames: 51519488. Throughput: 0: 12871.7. Samples: 51483873. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 17:19:15,069][231894] Avg episode reward: [(0, '192.242')] [2023-03-07 17:19:15,705][232226] Updated weights for policy 0, policy_version 50320 (0.0006) [2023-03-07 17:19:16,485][232226] Updated weights for policy 0, policy_version 50330 (0.0007) [2023-03-07 17:19:17,260][232226] Updated weights for policy 0, policy_version 50340 (0.0006) [2023-03-07 17:19:18,061][232226] Updated weights for policy 0, policy_version 50350 (0.0006) [2023-03-07 17:19:18,848][232226] Updated weights for policy 0, policy_version 50360 (0.0007) [2023-03-07 17:19:19,654][232226] Updated weights for policy 0, policy_version 50370 (0.0006) [2023-03-07 17:19:20,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12885.0). Total num frames: 51584000. Throughput: 0: 12876.7. Samples: 51561520. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:19:20,069][231894] Avg episode reward: [(0, '189.387')] [2023-03-07 17:19:20,431][232226] Updated weights for policy 0, policy_version 50380 (0.0005) [2023-03-07 17:19:21,221][232226] Updated weights for policy 0, policy_version 50390 (0.0006) [2023-03-07 17:19:22,030][232226] Updated weights for policy 0, policy_version 50400 (0.0006) [2023-03-07 17:19:22,809][232226] Updated weights for policy 0, policy_version 50410 (0.0007) [2023-03-07 17:19:23,609][232226] Updated weights for policy 0, policy_version 50420 (0.0006) [2023-03-07 17:19:24,402][232226] Updated weights for policy 0, policy_version 50430 (0.0007) [2023-03-07 17:19:25,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12885.4, 300 sec: 12885.0). Total num frames: 51648512. Throughput: 0: 12891.4. Samples: 51639187. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:19:25,069][231894] Avg episode reward: [(0, '190.319')] [2023-03-07 17:19:25,200][232226] Updated weights for policy 0, policy_version 50440 (0.0005) [2023-03-07 17:19:25,982][232226] Updated weights for policy 0, policy_version 50450 (0.0006) [2023-03-07 17:19:26,809][232226] Updated weights for policy 0, policy_version 50460 (0.0006) [2023-03-07 17:19:27,581][232226] Updated weights for policy 0, policy_version 50470 (0.0006) [2023-03-07 17:19:28,380][232226] Updated weights for policy 0, policy_version 50480 (0.0007) [2023-03-07 17:19:29,174][232226] Updated weights for policy 0, policy_version 50490 (0.0007) [2023-03-07 17:19:29,956][232226] Updated weights for policy 0, policy_version 50500 (0.0006) [2023-03-07 17:19:30,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.4, 300 sec: 12885.0). Total num frames: 51713024. Throughput: 0: 12890.8. Samples: 51677701. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:19:30,069][231894] Avg episode reward: [(0, '188.057')] [2023-03-07 17:19:30,728][232226] Updated weights for policy 0, policy_version 50510 (0.0007) [2023-03-07 17:19:31,518][232226] Updated weights for policy 0, policy_version 50520 (0.0006) [2023-03-07 17:19:32,299][232226] Updated weights for policy 0, policy_version 50530 (0.0006) [2023-03-07 17:19:33,097][232226] Updated weights for policy 0, policy_version 50540 (0.0007) [2023-03-07 17:19:33,883][232226] Updated weights for policy 0, policy_version 50550 (0.0007) [2023-03-07 17:19:34,682][232226] Updated weights for policy 0, policy_version 50560 (0.0006) [2023-03-07 17:19:35,069][231894] Fps is (10 sec: 12902.2, 60 sec: 12885.3, 300 sec: 12885.0). Total num frames: 51777536. Throughput: 0: 12913.1. Samples: 51755759. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:19:35,070][231894] Avg episode reward: [(0, '188.643')] [2023-03-07 17:19:35,474][232226] Updated weights for policy 0, policy_version 50570 (0.0007) [2023-03-07 17:19:36,257][232226] Updated weights for policy 0, policy_version 50580 (0.0006) [2023-03-07 17:19:37,061][232226] Updated weights for policy 0, policy_version 50590 (0.0007) [2023-03-07 17:19:37,858][232226] Updated weights for policy 0, policy_version 50600 (0.0007) [2023-03-07 17:19:38,649][232226] Updated weights for policy 0, policy_version 50610 (0.0006) [2023-03-07 17:19:39,453][232226] Updated weights for policy 0, policy_version 50620 (0.0006) [2023-03-07 17:19:40,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12885.0). Total num frames: 51842048. Throughput: 0: 12904.6. Samples: 51833096. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:19:40,070][231894] Avg episode reward: [(0, '194.475')] [2023-03-07 17:19:40,235][232226] Updated weights for policy 0, policy_version 50630 (0.0006) [2023-03-07 17:19:41,038][232226] Updated weights for policy 0, policy_version 50640 (0.0006) [2023-03-07 17:19:41,184][232173] KL-divergence is very high: 489.4575 [2023-03-07 17:19:41,819][232173] KL-divergence is very high: 426.5414 [2023-03-07 17:19:41,826][232226] Updated weights for policy 0, policy_version 50650 (0.0006) [2023-03-07 17:19:42,629][232226] Updated weights for policy 0, policy_version 50660 (0.0006) [2023-03-07 17:19:43,425][232226] Updated weights for policy 0, policy_version 50670 (0.0007) [2023-03-07 17:19:44,206][232226] Updated weights for policy 0, policy_version 50680 (0.0006) [2023-03-07 17:19:44,982][232226] Updated weights for policy 0, policy_version 50690 (0.0006) [2023-03-07 17:19:45,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12902.4, 300 sec: 12888.5). Total num frames: 51906560. Throughput: 0: 12910.9. Samples: 51871893. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:19:45,069][231894] Avg episode reward: [(0, '196.169')] [2023-03-07 17:19:45,775][232226] Updated weights for policy 0, policy_version 50700 (0.0007) [2023-03-07 17:19:46,573][232226] Updated weights for policy 0, policy_version 50710 (0.0006) [2023-03-07 17:19:47,342][232226] Updated weights for policy 0, policy_version 50720 (0.0006) [2023-03-07 17:19:48,159][232226] Updated weights for policy 0, policy_version 50730 (0.0006) [2023-03-07 17:19:48,947][232226] Updated weights for policy 0, policy_version 50740 (0.0006) [2023-03-07 17:19:49,737][232226] Updated weights for policy 0, policy_version 50750 (0.0006) [2023-03-07 17:19:50,069][231894] Fps is (10 sec: 13004.9, 60 sec: 12919.5, 300 sec: 12892.0). Total num frames: 51972096. Throughput: 0: 12917.0. Samples: 51949522. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:19:50,069][231894] Avg episode reward: [(0, '200.819')] [2023-03-07 17:19:50,535][232226] Updated weights for policy 0, policy_version 50760 (0.0006) [2023-03-07 17:19:51,318][232226] Updated weights for policy 0, policy_version 50770 (0.0007) [2023-03-07 17:19:52,111][232226] Updated weights for policy 0, policy_version 50780 (0.0006) [2023-03-07 17:19:52,913][232226] Updated weights for policy 0, policy_version 50790 (0.0007) [2023-03-07 17:19:53,703][232226] Updated weights for policy 0, policy_version 50800 (0.0006) [2023-03-07 17:19:54,510][232226] Updated weights for policy 0, policy_version 50810 (0.0007) [2023-03-07 17:19:55,069][231894] Fps is (10 sec: 13004.8, 60 sec: 12919.5, 300 sec: 12892.0). Total num frames: 52036608. Throughput: 0: 12923.1. Samples: 52026954. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:19:55,069][231894] Avg episode reward: [(0, '197.691')] [2023-03-07 17:19:55,289][232226] Updated weights for policy 0, policy_version 50820 (0.0006) [2023-03-07 17:19:56,089][232226] Updated weights for policy 0, policy_version 50830 (0.0007) [2023-03-07 17:19:56,874][232226] Updated weights for policy 0, policy_version 50840 (0.0006) [2023-03-07 17:19:57,665][232226] Updated weights for policy 0, policy_version 50850 (0.0007) [2023-03-07 17:19:58,459][232226] Updated weights for policy 0, policy_version 50860 (0.0006) [2023-03-07 17:19:59,259][232226] Updated weights for policy 0, policy_version 50870 (0.0006) [2023-03-07 17:20:00,035][232226] Updated weights for policy 0, policy_version 50880 (0.0008) [2023-03-07 17:20:00,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12919.5, 300 sec: 12892.0). Total num frames: 52101120. Throughput: 0: 12929.3. Samples: 52065692. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:20:00,069][231894] Avg episode reward: [(0, '201.343')] [2023-03-07 17:20:00,832][232226] Updated weights for policy 0, policy_version 50890 (0.0006) [2023-03-07 17:20:01,627][232226] Updated weights for policy 0, policy_version 50900 (0.0008) [2023-03-07 17:20:02,432][232226] Updated weights for policy 0, policy_version 50910 (0.0007) [2023-03-07 17:20:03,218][232226] Updated weights for policy 0, policy_version 50920 (0.0007) [2023-03-07 17:20:04,002][232226] Updated weights for policy 0, policy_version 50930 (0.0007) [2023-03-07 17:20:04,798][232226] Updated weights for policy 0, policy_version 50940 (0.0006) [2023-03-07 17:20:05,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12919.5, 300 sec: 12895.5). Total num frames: 52165632. Throughput: 0: 12929.7. Samples: 52143357. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:20:05,069][231894] Avg episode reward: [(0, '198.489')] [2023-03-07 17:20:05,589][232226] Updated weights for policy 0, policy_version 50950 (0.0007) [2023-03-07 17:20:06,368][232226] Updated weights for policy 0, policy_version 50960 (0.0006) [2023-03-07 17:20:07,171][232226] Updated weights for policy 0, policy_version 50970 (0.0006) [2023-03-07 17:20:07,962][232226] Updated weights for policy 0, policy_version 50980 (0.0006) [2023-03-07 17:20:08,741][232226] Updated weights for policy 0, policy_version 50990 (0.0006) [2023-03-07 17:20:09,526][232226] Updated weights for policy 0, policy_version 51000 (0.0007) [2023-03-07 17:20:10,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12919.5, 300 sec: 12892.0). Total num frames: 52230144. Throughput: 0: 12930.1. Samples: 52221043. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:20:10,069][231894] Avg episode reward: [(0, '193.988')] [2023-03-07 17:20:10,338][232226] Updated weights for policy 0, policy_version 51010 (0.0006) [2023-03-07 17:20:11,132][232226] Updated weights for policy 0, policy_version 51020 (0.0007) [2023-03-07 17:20:11,922][232226] Updated weights for policy 0, policy_version 51030 (0.0006) [2023-03-07 17:20:12,709][232226] Updated weights for policy 0, policy_version 51040 (0.0006) [2023-03-07 17:20:13,518][232226] Updated weights for policy 0, policy_version 51050 (0.0007) [2023-03-07 17:20:14,301][232226] Updated weights for policy 0, policy_version 51060 (0.0007) [2023-03-07 17:20:15,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12919.5, 300 sec: 12895.5). Total num frames: 52294656. Throughput: 0: 12933.5. Samples: 52259708. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:20:15,070][231894] Avg episode reward: [(0, '197.636')] [2023-03-07 17:20:15,110][232226] Updated weights for policy 0, policy_version 51070 (0.0006) [2023-03-07 17:20:15,878][232226] Updated weights for policy 0, policy_version 51080 (0.0006) [2023-03-07 17:20:16,680][232226] Updated weights for policy 0, policy_version 51090 (0.0006) [2023-03-07 17:20:17,473][232226] Updated weights for policy 0, policy_version 51100 (0.0006) [2023-03-07 17:20:18,254][232226] Updated weights for policy 0, policy_version 51110 (0.0007) [2023-03-07 17:20:19,053][232226] Updated weights for policy 0, policy_version 51120 (0.0006) [2023-03-07 17:20:19,851][232226] Updated weights for policy 0, policy_version 51130 (0.0008) [2023-03-07 17:20:20,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12919.5, 300 sec: 12895.5). Total num frames: 52359168. Throughput: 0: 12921.4. Samples: 52337224. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:20:20,069][231894] Avg episode reward: [(0, '195.074')] [2023-03-07 17:20:20,634][232226] Updated weights for policy 0, policy_version 51140 (0.0006) [2023-03-07 17:20:21,449][232226] Updated weights for policy 0, policy_version 51150 (0.0006) [2023-03-07 17:20:22,232][232226] Updated weights for policy 0, policy_version 51160 (0.0006) [2023-03-07 17:20:23,022][232226] Updated weights for policy 0, policy_version 51170 (0.0006) [2023-03-07 17:20:23,834][232226] Updated weights for policy 0, policy_version 51180 (0.0006) [2023-03-07 17:20:24,636][232226] Updated weights for policy 0, policy_version 51190 (0.0006) [2023-03-07 17:20:25,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12919.5, 300 sec: 12895.5). Total num frames: 52423680. Throughput: 0: 12921.0. Samples: 52414541. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:20:25,069][231894] Avg episode reward: [(0, '185.412')] [2023-03-07 17:20:25,074][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000051195_52423680.pth... [2023-03-07 17:20:25,104][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000048173_49329152.pth [2023-03-07 17:20:25,424][232226] Updated weights for policy 0, policy_version 51200 (0.0006) [2023-03-07 17:20:26,227][232226] Updated weights for policy 0, policy_version 51210 (0.0007) [2023-03-07 17:20:27,030][232226] Updated weights for policy 0, policy_version 51220 (0.0007) [2023-03-07 17:20:27,822][232226] Updated weights for policy 0, policy_version 51230 (0.0006) [2023-03-07 17:20:28,617][232226] Updated weights for policy 0, policy_version 51240 (0.0007) [2023-03-07 17:20:29,408][232226] Updated weights for policy 0, policy_version 51250 (0.0006) [2023-03-07 17:20:30,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12919.5, 300 sec: 12892.0). Total num frames: 52488192. Throughput: 0: 12913.0. Samples: 52452979. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:20:30,069][231894] Avg episode reward: [(0, '194.103')] [2023-03-07 17:20:30,206][232226] Updated weights for policy 0, policy_version 51260 (0.0007) [2023-03-07 17:20:30,997][232226] Updated weights for policy 0, policy_version 51270 (0.0007) [2023-03-07 17:20:31,790][232226] Updated weights for policy 0, policy_version 51280 (0.0007) [2023-03-07 17:20:32,603][232226] Updated weights for policy 0, policy_version 51290 (0.0006) [2023-03-07 17:20:33,413][232226] Updated weights for policy 0, policy_version 51300 (0.0006) [2023-03-07 17:20:34,182][232226] Updated weights for policy 0, policy_version 51310 (0.0006) [2023-03-07 17:20:34,989][232226] Updated weights for policy 0, policy_version 51320 (0.0006) [2023-03-07 17:20:35,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12919.5, 300 sec: 12892.0). Total num frames: 52552704. Throughput: 0: 12902.2. Samples: 52530122. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:20:35,069][231894] Avg episode reward: [(0, '195.654')] [2023-03-07 17:20:35,770][232226] Updated weights for policy 0, policy_version 51330 (0.0007) [2023-03-07 17:20:36,581][232226] Updated weights for policy 0, policy_version 51340 (0.0006) [2023-03-07 17:20:37,365][232226] Updated weights for policy 0, policy_version 51350 (0.0006) [2023-03-07 17:20:38,158][232226] Updated weights for policy 0, policy_version 51360 (0.0006) [2023-03-07 17:20:38,960][232226] Updated weights for policy 0, policy_version 51370 (0.0007) [2023-03-07 17:20:39,777][232226] Updated weights for policy 0, policy_version 51380 (0.0007) [2023-03-07 17:20:40,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12902.4, 300 sec: 12888.5). Total num frames: 52616192. Throughput: 0: 12896.1. Samples: 52607277. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:20:40,069][231894] Avg episode reward: [(0, '190.832')] [2023-03-07 17:20:40,564][232226] Updated weights for policy 0, policy_version 51390 (0.0006) [2023-03-07 17:20:41,354][232226] Updated weights for policy 0, policy_version 51400 (0.0006) [2023-03-07 17:20:42,145][232226] Updated weights for policy 0, policy_version 51410 (0.0007) [2023-03-07 17:20:42,954][232226] Updated weights for policy 0, policy_version 51420 (0.0007) [2023-03-07 17:20:43,741][232226] Updated weights for policy 0, policy_version 51430 (0.0006) [2023-03-07 17:20:44,527][232226] Updated weights for policy 0, policy_version 51440 (0.0007) [2023-03-07 17:20:45,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12902.4, 300 sec: 12888.5). Total num frames: 52680704. Throughput: 0: 12895.5. Samples: 52645990. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:20:45,069][231894] Avg episode reward: [(0, '198.338')] [2023-03-07 17:20:45,312][232226] Updated weights for policy 0, policy_version 51450 (0.0006) [2023-03-07 17:20:46,113][232226] Updated weights for policy 0, policy_version 51460 (0.0006) [2023-03-07 17:20:46,906][232226] Updated weights for policy 0, policy_version 51470 (0.0006) [2023-03-07 17:20:47,693][232226] Updated weights for policy 0, policy_version 51480 (0.0006) [2023-03-07 17:20:48,486][232226] Updated weights for policy 0, policy_version 51490 (0.0006) [2023-03-07 17:20:49,275][232226] Updated weights for policy 0, policy_version 51500 (0.0007) [2023-03-07 17:20:50,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12892.0). Total num frames: 52745216. Throughput: 0: 12891.9. Samples: 52723491. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:20:50,069][231894] Avg episode reward: [(0, '191.648')] [2023-03-07 17:20:50,073][232226] Updated weights for policy 0, policy_version 51510 (0.0007) [2023-03-07 17:20:50,846][232226] Updated weights for policy 0, policy_version 51520 (0.0006) [2023-03-07 17:20:51,654][232226] Updated weights for policy 0, policy_version 51530 (0.0006) [2023-03-07 17:20:52,438][232226] Updated weights for policy 0, policy_version 51540 (0.0006) [2023-03-07 17:20:53,249][232226] Updated weights for policy 0, policy_version 51550 (0.0006) [2023-03-07 17:20:54,058][232226] Updated weights for policy 0, policy_version 51560 (0.0006) [2023-03-07 17:20:54,849][232226] Updated weights for policy 0, policy_version 51570 (0.0007) [2023-03-07 17:20:55,069][231894] Fps is (10 sec: 12902.2, 60 sec: 12885.3, 300 sec: 12888.5). Total num frames: 52809728. Throughput: 0: 12879.6. Samples: 52800626. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:20:55,070][231894] Avg episode reward: [(0, '192.913')] [2023-03-07 17:20:55,671][232226] Updated weights for policy 0, policy_version 51580 (0.0006) [2023-03-07 17:20:56,454][232226] Updated weights for policy 0, policy_version 51590 (0.0006) [2023-03-07 17:20:57,263][232226] Updated weights for policy 0, policy_version 51600 (0.0006) [2023-03-07 17:20:58,052][232226] Updated weights for policy 0, policy_version 51610 (0.0007) [2023-03-07 17:20:58,853][232226] Updated weights for policy 0, policy_version 51620 (0.0006) [2023-03-07 17:20:59,648][232226] Updated weights for policy 0, policy_version 51630 (0.0007) [2023-03-07 17:21:00,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12888.5). Total num frames: 52874240. Throughput: 0: 12877.0. Samples: 52839172. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:21:00,069][231894] Avg episode reward: [(0, '193.397')] [2023-03-07 17:21:00,447][232226] Updated weights for policy 0, policy_version 51640 (0.0006) [2023-03-07 17:21:01,250][232226] Updated weights for policy 0, policy_version 51650 (0.0007) [2023-03-07 17:21:02,018][232226] Updated weights for policy 0, policy_version 51660 (0.0006) [2023-03-07 17:21:02,821][232226] Updated weights for policy 0, policy_version 51670 (0.0007) [2023-03-07 17:21:03,617][232226] Updated weights for policy 0, policy_version 51680 (0.0007) [2023-03-07 17:21:04,435][232226] Updated weights for policy 0, policy_version 51690 (0.0006) [2023-03-07 17:21:05,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12888.5). Total num frames: 52938752. Throughput: 0: 12866.6. Samples: 52916218. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:21:05,070][231894] Avg episode reward: [(0, '201.320')] [2023-03-07 17:21:05,217][232226] Updated weights for policy 0, policy_version 51700 (0.0006) [2023-03-07 17:21:06,007][232226] Updated weights for policy 0, policy_version 51710 (0.0006) [2023-03-07 17:21:06,803][232226] Updated weights for policy 0, policy_version 51720 (0.0006) [2023-03-07 17:21:07,598][232226] Updated weights for policy 0, policy_version 51730 (0.0006) [2023-03-07 17:21:08,383][232226] Updated weights for policy 0, policy_version 51740 (0.0006) [2023-03-07 17:21:09,178][232226] Updated weights for policy 0, policy_version 51750 (0.0007) [2023-03-07 17:21:09,985][232226] Updated weights for policy 0, policy_version 51760 (0.0006) [2023-03-07 17:21:10,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12892.0). Total num frames: 53003264. Throughput: 0: 12868.1. Samples: 52993607. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:21:10,070][231894] Avg episode reward: [(0, '195.658')] [2023-03-07 17:21:10,760][232226] Updated weights for policy 0, policy_version 51770 (0.0007) [2023-03-07 17:21:11,558][232226] Updated weights for policy 0, policy_version 51780 (0.0007) [2023-03-07 17:21:12,342][232226] Updated weights for policy 0, policy_version 51790 (0.0007) [2023-03-07 17:21:13,150][232226] Updated weights for policy 0, policy_version 51800 (0.0006) [2023-03-07 17:21:13,954][232226] Updated weights for policy 0, policy_version 51810 (0.0006) [2023-03-07 17:21:14,750][232226] Updated weights for policy 0, policy_version 51820 (0.0006) [2023-03-07 17:21:15,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12888.5). Total num frames: 53066752. Throughput: 0: 12873.3. Samples: 53032277. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:21:15,069][231894] Avg episode reward: [(0, '204.828')] [2023-03-07 17:21:15,539][232226] Updated weights for policy 0, policy_version 51830 (0.0007) [2023-03-07 17:21:16,327][232226] Updated weights for policy 0, policy_version 51840 (0.0007) [2023-03-07 17:21:17,127][232226] Updated weights for policy 0, policy_version 51850 (0.0007) [2023-03-07 17:21:17,923][232226] Updated weights for policy 0, policy_version 51860 (0.0007) [2023-03-07 17:21:18,712][232226] Updated weights for policy 0, policy_version 51870 (0.0007) [2023-03-07 17:21:19,511][232226] Updated weights for policy 0, policy_version 51880 (0.0006) [2023-03-07 17:21:20,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12885.4, 300 sec: 12892.0). Total num frames: 53132288. Throughput: 0: 12878.0. Samples: 53109631. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:21:20,069][231894] Avg episode reward: [(0, '199.469')] [2023-03-07 17:21:20,306][232226] Updated weights for policy 0, policy_version 51890 (0.0006) [2023-03-07 17:21:21,102][232226] Updated weights for policy 0, policy_version 51900 (0.0007) [2023-03-07 17:21:21,906][232226] Updated weights for policy 0, policy_version 51910 (0.0006) [2023-03-07 17:21:22,705][232226] Updated weights for policy 0, policy_version 51920 (0.0007) [2023-03-07 17:21:23,511][232226] Updated weights for policy 0, policy_version 51930 (0.0007) [2023-03-07 17:21:24,286][232226] Updated weights for policy 0, policy_version 51940 (0.0007) [2023-03-07 17:21:25,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12888.5). Total num frames: 53195776. Throughput: 0: 12878.9. Samples: 53186829. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:21:25,080][231894] Avg episode reward: [(0, '192.548')] [2023-03-07 17:21:25,082][232226] Updated weights for policy 0, policy_version 51950 (0.0006) [2023-03-07 17:21:25,872][232226] Updated weights for policy 0, policy_version 51960 (0.0007) [2023-03-07 17:21:26,693][232226] Updated weights for policy 0, policy_version 51970 (0.0006) [2023-03-07 17:21:27,485][232226] Updated weights for policy 0, policy_version 51980 (0.0007) [2023-03-07 17:21:28,281][232226] Updated weights for policy 0, policy_version 51990 (0.0006) [2023-03-07 17:21:29,078][232226] Updated weights for policy 0, policy_version 52000 (0.0006) [2023-03-07 17:21:29,895][232226] Updated weights for policy 0, policy_version 52010 (0.0006) [2023-03-07 17:21:30,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12888.5). Total num frames: 53260288. Throughput: 0: 12874.6. Samples: 53225347. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:21:30,080][231894] Avg episode reward: [(0, '191.939')] [2023-03-07 17:21:30,677][232226] Updated weights for policy 0, policy_version 52020 (0.0006) [2023-03-07 17:21:31,460][232226] Updated weights for policy 0, policy_version 52030 (0.0007) [2023-03-07 17:21:32,247][232226] Updated weights for policy 0, policy_version 52040 (0.0007) [2023-03-07 17:21:33,059][232226] Updated weights for policy 0, policy_version 52050 (0.0006) [2023-03-07 17:21:33,842][232226] Updated weights for policy 0, policy_version 52060 (0.0006) [2023-03-07 17:21:34,651][232226] Updated weights for policy 0, policy_version 52070 (0.0006) [2023-03-07 17:21:35,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12888.5). Total num frames: 53324800. Throughput: 0: 12867.4. Samples: 53302524. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:21:35,080][231894] Avg episode reward: [(0, '198.088')] [2023-03-07 17:21:35,440][232226] Updated weights for policy 0, policy_version 52080 (0.0006) [2023-03-07 17:21:36,222][232226] Updated weights for policy 0, policy_version 52090 (0.0007) [2023-03-07 17:21:37,022][232226] Updated weights for policy 0, policy_version 52100 (0.0007) [2023-03-07 17:21:37,802][232226] Updated weights for policy 0, policy_version 52110 (0.0007) [2023-03-07 17:21:38,599][232226] Updated weights for policy 0, policy_version 52120 (0.0006) [2023-03-07 17:21:39,397][232226] Updated weights for policy 0, policy_version 52130 (0.0006) [2023-03-07 17:21:40,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12888.5). Total num frames: 53389312. Throughput: 0: 12875.9. Samples: 53380039. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:21:40,080][231894] Avg episode reward: [(0, '191.544')] [2023-03-07 17:21:40,171][232226] Updated weights for policy 0, policy_version 52140 (0.0006) [2023-03-07 17:21:40,976][232226] Updated weights for policy 0, policy_version 52150 (0.0008) [2023-03-07 17:21:41,776][232226] Updated weights for policy 0, policy_version 52160 (0.0006) [2023-03-07 17:21:42,558][232226] Updated weights for policy 0, policy_version 52170 (0.0006) [2023-03-07 17:21:43,349][232226] Updated weights for policy 0, policy_version 52180 (0.0006) [2023-03-07 17:21:44,141][232226] Updated weights for policy 0, policy_version 52190 (0.0007) [2023-03-07 17:21:44,933][232226] Updated weights for policy 0, policy_version 52200 (0.0006) [2023-03-07 17:21:45,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12888.5). Total num frames: 53453824. Throughput: 0: 12880.1. Samples: 53418777. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:21:45,069][231894] Avg episode reward: [(0, '193.094')] [2023-03-07 17:21:45,710][232226] Updated weights for policy 0, policy_version 52210 (0.0006) [2023-03-07 17:21:46,527][232226] Updated weights for policy 0, policy_version 52220 (0.0006) [2023-03-07 17:21:47,319][232226] Updated weights for policy 0, policy_version 52230 (0.0006) [2023-03-07 17:21:48,095][232226] Updated weights for policy 0, policy_version 52240 (0.0006) [2023-03-07 17:21:48,913][232226] Updated weights for policy 0, policy_version 52250 (0.0007) [2023-03-07 17:21:49,696][232226] Updated weights for policy 0, policy_version 52260 (0.0007) [2023-03-07 17:21:50,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12888.5). Total num frames: 53518336. Throughput: 0: 12893.9. Samples: 53496444. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:21:50,069][231894] Avg episode reward: [(0, '194.510')] [2023-03-07 17:21:50,501][232226] Updated weights for policy 0, policy_version 52270 (0.0006) [2023-03-07 17:21:51,285][232226] Updated weights for policy 0, policy_version 52280 (0.0006) [2023-03-07 17:21:52,088][232226] Updated weights for policy 0, policy_version 52290 (0.0006) [2023-03-07 17:21:52,862][232226] Updated weights for policy 0, policy_version 52300 (0.0006) [2023-03-07 17:21:53,647][232226] Updated weights for policy 0, policy_version 52310 (0.0006) [2023-03-07 17:21:54,453][232226] Updated weights for policy 0, policy_version 52320 (0.0006) [2023-03-07 17:21:55,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.4, 300 sec: 12888.5). Total num frames: 53582848. Throughput: 0: 12896.7. Samples: 53573959. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:21:55,069][231894] Avg episode reward: [(0, '197.515')] [2023-03-07 17:21:55,231][232226] Updated weights for policy 0, policy_version 52330 (0.0006) [2023-03-07 17:21:56,028][232226] Updated weights for policy 0, policy_version 52340 (0.0006) [2023-03-07 17:21:56,809][232226] Updated weights for policy 0, policy_version 52350 (0.0006) [2023-03-07 17:21:57,617][232226] Updated weights for policy 0, policy_version 52360 (0.0007) [2023-03-07 17:21:58,401][232226] Updated weights for policy 0, policy_version 52370 (0.0006) [2023-03-07 17:21:59,213][232226] Updated weights for policy 0, policy_version 52380 (0.0007) [2023-03-07 17:22:00,018][232226] Updated weights for policy 0, policy_version 52390 (0.0006) [2023-03-07 17:22:00,069][231894] Fps is (10 sec: 13004.8, 60 sec: 12902.4, 300 sec: 12892.0). Total num frames: 53648384. Throughput: 0: 12902.0. Samples: 53612866. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:22:00,069][231894] Avg episode reward: [(0, '188.165')] [2023-03-07 17:22:00,789][232226] Updated weights for policy 0, policy_version 52400 (0.0006) [2023-03-07 17:22:01,601][232226] Updated weights for policy 0, policy_version 52410 (0.0007) [2023-03-07 17:22:02,385][232226] Updated weights for policy 0, policy_version 52420 (0.0006) [2023-03-07 17:22:03,196][232226] Updated weights for policy 0, policy_version 52430 (0.0006) [2023-03-07 17:22:03,989][232226] Updated weights for policy 0, policy_version 52440 (0.0006) [2023-03-07 17:22:04,779][232226] Updated weights for policy 0, policy_version 52450 (0.0006) [2023-03-07 17:22:05,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12888.5). Total num frames: 53711872. Throughput: 0: 12892.9. Samples: 53689812. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:22:05,069][231894] Avg episode reward: [(0, '193.204')] [2023-03-07 17:22:05,553][232226] Updated weights for policy 0, policy_version 52460 (0.0006) [2023-03-07 17:22:06,344][232226] Updated weights for policy 0, policy_version 52470 (0.0006) [2023-03-07 17:22:07,129][232226] Updated weights for policy 0, policy_version 52480 (0.0008) [2023-03-07 17:22:07,913][232226] Updated weights for policy 0, policy_version 52490 (0.0007) [2023-03-07 17:22:08,706][232226] Updated weights for policy 0, policy_version 52500 (0.0006) [2023-03-07 17:22:09,498][232226] Updated weights for policy 0, policy_version 52510 (0.0006) [2023-03-07 17:22:10,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12895.5). Total num frames: 53777408. Throughput: 0: 12908.8. Samples: 53767726. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:22:10,069][231894] Avg episode reward: [(0, '200.342')] [2023-03-07 17:22:10,290][232226] Updated weights for policy 0, policy_version 52520 (0.0006) [2023-03-07 17:22:11,079][232226] Updated weights for policy 0, policy_version 52530 (0.0006) [2023-03-07 17:22:11,870][232226] Updated weights for policy 0, policy_version 52540 (0.0006) [2023-03-07 17:22:12,680][232226] Updated weights for policy 0, policy_version 52550 (0.0006) [2023-03-07 17:22:13,452][232226] Updated weights for policy 0, policy_version 52560 (0.0005) [2023-03-07 17:22:14,242][232226] Updated weights for policy 0, policy_version 52570 (0.0006) [2023-03-07 17:22:15,056][232226] Updated weights for policy 0, policy_version 52580 (0.0006) [2023-03-07 17:22:15,069][231894] Fps is (10 sec: 13004.9, 60 sec: 12919.5, 300 sec: 12895.5). Total num frames: 53841920. Throughput: 0: 12913.6. Samples: 53806458. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:22:15,069][231894] Avg episode reward: [(0, '193.897')] [2023-03-07 17:22:15,821][232226] Updated weights for policy 0, policy_version 52590 (0.0006) [2023-03-07 17:22:16,620][232226] Updated weights for policy 0, policy_version 52600 (0.0006) [2023-03-07 17:22:17,424][232226] Updated weights for policy 0, policy_version 52610 (0.0006) [2023-03-07 17:22:18,205][232226] Updated weights for policy 0, policy_version 52620 (0.0005) [2023-03-07 17:22:19,009][232226] Updated weights for policy 0, policy_version 52630 (0.0006) [2023-03-07 17:22:19,799][232226] Updated weights for policy 0, policy_version 52640 (0.0007) [2023-03-07 17:22:20,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12895.5). Total num frames: 53906432. Throughput: 0: 12922.8. Samples: 53884048. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:22:20,069][231894] Avg episode reward: [(0, '192.506')] [2023-03-07 17:22:20,588][232226] Updated weights for policy 0, policy_version 52650 (0.0006) [2023-03-07 17:22:21,390][232226] Updated weights for policy 0, policy_version 52660 (0.0006) [2023-03-07 17:22:22,179][232226] Updated weights for policy 0, policy_version 52670 (0.0008) [2023-03-07 17:22:22,980][232226] Updated weights for policy 0, policy_version 52680 (0.0006) [2023-03-07 17:22:23,771][232226] Updated weights for policy 0, policy_version 52690 (0.0006) [2023-03-07 17:22:24,566][232226] Updated weights for policy 0, policy_version 52700 (0.0006) [2023-03-07 17:22:25,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12919.4, 300 sec: 12895.5). Total num frames: 53970944. Throughput: 0: 12916.9. Samples: 53961299. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:22:25,069][231894] Avg episode reward: [(0, '187.715')] [2023-03-07 17:22:25,072][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000052706_53970944.pth... [2023-03-07 17:22:25,103][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000049684_50876416.pth [2023-03-07 17:22:25,366][232226] Updated weights for policy 0, policy_version 52710 (0.0007) [2023-03-07 17:22:26,150][232226] Updated weights for policy 0, policy_version 52720 (0.0007) [2023-03-07 17:22:26,944][232226] Updated weights for policy 0, policy_version 52730 (0.0006) [2023-03-07 17:22:27,734][232226] Updated weights for policy 0, policy_version 52740 (0.0006) [2023-03-07 17:22:28,534][232226] Updated weights for policy 0, policy_version 52750 (0.0006) [2023-03-07 17:22:29,318][232226] Updated weights for policy 0, policy_version 52760 (0.0006) [2023-03-07 17:22:30,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12919.4, 300 sec: 12895.5). Total num frames: 54035456. Throughput: 0: 12918.0. Samples: 54000086. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:22:30,069][231894] Avg episode reward: [(0, '194.651')] [2023-03-07 17:22:30,108][232226] Updated weights for policy 0, policy_version 52770 (0.0006) [2023-03-07 17:22:30,890][232226] Updated weights for policy 0, policy_version 52780 (0.0007) [2023-03-07 17:22:31,719][232226] Updated weights for policy 0, policy_version 52790 (0.0006) [2023-03-07 17:22:32,505][232226] Updated weights for policy 0, policy_version 52800 (0.0006) [2023-03-07 17:22:33,281][232226] Updated weights for policy 0, policy_version 52810 (0.0006) [2023-03-07 17:22:34,086][232226] Updated weights for policy 0, policy_version 52820 (0.0006) [2023-03-07 17:22:34,870][232226] Updated weights for policy 0, policy_version 52830 (0.0007) [2023-03-07 17:22:35,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12919.5, 300 sec: 12895.5). Total num frames: 54099968. Throughput: 0: 12918.2. Samples: 54077762. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:22:35,069][231894] Avg episode reward: [(0, '190.420')] [2023-03-07 17:22:35,657][232226] Updated weights for policy 0, policy_version 52840 (0.0006) [2023-03-07 17:22:36,452][232226] Updated weights for policy 0, policy_version 52850 (0.0007) [2023-03-07 17:22:37,247][232226] Updated weights for policy 0, policy_version 52860 (0.0005) [2023-03-07 17:22:38,062][232226] Updated weights for policy 0, policy_version 52870 (0.0006) [2023-03-07 17:22:38,845][232226] Updated weights for policy 0, policy_version 52880 (0.0006) [2023-03-07 17:22:39,634][232226] Updated weights for policy 0, policy_version 52890 (0.0006) [2023-03-07 17:22:40,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12919.5, 300 sec: 12895.5). Total num frames: 54164480. Throughput: 0: 12915.0. Samples: 54155133. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:22:40,069][231894] Avg episode reward: [(0, '193.771')] [2023-03-07 17:22:40,450][232226] Updated weights for policy 0, policy_version 52900 (0.0006) [2023-03-07 17:22:41,218][232226] Updated weights for policy 0, policy_version 52910 (0.0005) [2023-03-07 17:22:42,012][232226] Updated weights for policy 0, policy_version 52920 (0.0006) [2023-03-07 17:22:42,809][232226] Updated weights for policy 0, policy_version 52930 (0.0007) [2023-03-07 17:22:43,617][232226] Updated weights for policy 0, policy_version 52940 (0.0006) [2023-03-07 17:22:44,417][232226] Updated weights for policy 0, policy_version 52950 (0.0006) [2023-03-07 17:22:45,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12919.5, 300 sec: 12895.5). Total num frames: 54228992. Throughput: 0: 12908.8. Samples: 54193763. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:22:45,069][231894] Avg episode reward: [(0, '191.277')] [2023-03-07 17:22:45,207][232226] Updated weights for policy 0, policy_version 52960 (0.0007) [2023-03-07 17:22:46,002][232226] Updated weights for policy 0, policy_version 52970 (0.0006) [2023-03-07 17:22:46,787][232226] Updated weights for policy 0, policy_version 52980 (0.0006) [2023-03-07 17:22:47,578][232226] Updated weights for policy 0, policy_version 52990 (0.0007) [2023-03-07 17:22:48,364][232226] Updated weights for policy 0, policy_version 53000 (0.0006) [2023-03-07 17:22:49,168][232226] Updated weights for policy 0, policy_version 53010 (0.0006) [2023-03-07 17:22:49,967][232226] Updated weights for policy 0, policy_version 53020 (0.0006) [2023-03-07 17:22:50,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12919.5, 300 sec: 12895.5). Total num frames: 54293504. Throughput: 0: 12920.9. Samples: 54271252. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:22:50,069][231894] Avg episode reward: [(0, '189.208')] [2023-03-07 17:22:50,739][232226] Updated weights for policy 0, policy_version 53030 (0.0006) [2023-03-07 17:22:51,535][232226] Updated weights for policy 0, policy_version 53040 (0.0006) [2023-03-07 17:22:52,344][232226] Updated weights for policy 0, policy_version 53050 (0.0007) [2023-03-07 17:22:53,142][232226] Updated weights for policy 0, policy_version 53060 (0.0006) [2023-03-07 17:22:53,921][232226] Updated weights for policy 0, policy_version 53070 (0.0006) [2023-03-07 17:22:54,717][232226] Updated weights for policy 0, policy_version 53080 (0.0006) [2023-03-07 17:22:55,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12919.5, 300 sec: 12895.5). Total num frames: 54358016. Throughput: 0: 12909.2. Samples: 54348642. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:22:55,069][231894] Avg episode reward: [(0, '193.248')] [2023-03-07 17:22:55,512][232226] Updated weights for policy 0, policy_version 53090 (0.0007) [2023-03-07 17:22:56,309][232226] Updated weights for policy 0, policy_version 53100 (0.0006) [2023-03-07 17:22:57,114][232226] Updated weights for policy 0, policy_version 53110 (0.0006) [2023-03-07 17:22:57,892][232226] Updated weights for policy 0, policy_version 53120 (0.0006) [2023-03-07 17:22:58,680][232226] Updated weights for policy 0, policy_version 53130 (0.0007) [2023-03-07 17:22:59,482][232226] Updated weights for policy 0, policy_version 53140 (0.0006) [2023-03-07 17:23:00,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 54422528. Throughput: 0: 12906.2. Samples: 54387239. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:23:00,080][231894] Avg episode reward: [(0, '194.019')] [2023-03-07 17:23:00,267][232226] Updated weights for policy 0, policy_version 53150 (0.0006) [2023-03-07 17:23:01,048][232226] Updated weights for policy 0, policy_version 53160 (0.0006) [2023-03-07 17:23:01,841][232226] Updated weights for policy 0, policy_version 53170 (0.0006) [2023-03-07 17:23:02,641][232226] Updated weights for policy 0, policy_version 53180 (0.0006) [2023-03-07 17:23:03,420][232226] Updated weights for policy 0, policy_version 53190 (0.0007) [2023-03-07 17:23:04,223][232226] Updated weights for policy 0, policy_version 53200 (0.0006) [2023-03-07 17:23:05,022][232226] Updated weights for policy 0, policy_version 53210 (0.0007) [2023-03-07 17:23:05,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12919.5, 300 sec: 12895.5). Total num frames: 54487040. Throughput: 0: 12908.7. Samples: 54464941. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:23:05,080][231894] Avg episode reward: [(0, '193.384')] [2023-03-07 17:23:05,816][232226] Updated weights for policy 0, policy_version 53220 (0.0006) [2023-03-07 17:23:06,603][232226] Updated weights for policy 0, policy_version 53230 (0.0006) [2023-03-07 17:23:07,398][232226] Updated weights for policy 0, policy_version 53240 (0.0006) [2023-03-07 17:23:08,195][232226] Updated weights for policy 0, policy_version 53250 (0.0006) [2023-03-07 17:23:08,987][232226] Updated weights for policy 0, policy_version 53260 (0.0006) [2023-03-07 17:23:09,771][232226] Updated weights for policy 0, policy_version 53270 (0.0006) [2023-03-07 17:23:10,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 54551552. Throughput: 0: 12915.2. Samples: 54542483. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:23:10,080][231894] Avg episode reward: [(0, '188.425')] [2023-03-07 17:23:10,585][232226] Updated weights for policy 0, policy_version 53280 (0.0006) [2023-03-07 17:23:11,376][232226] Updated weights for policy 0, policy_version 53290 (0.0006) [2023-03-07 17:23:12,171][232226] Updated weights for policy 0, policy_version 53300 (0.0005) [2023-03-07 17:23:12,969][232226] Updated weights for policy 0, policy_version 53310 (0.0006) [2023-03-07 17:23:13,759][232226] Updated weights for policy 0, policy_version 53320 (0.0006) [2023-03-07 17:23:14,560][232226] Updated weights for policy 0, policy_version 53330 (0.0006) [2023-03-07 17:23:15,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 54616064. Throughput: 0: 12908.9. Samples: 54580988. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:23:15,080][231894] Avg episode reward: [(0, '191.659')] [2023-03-07 17:23:15,344][232226] Updated weights for policy 0, policy_version 53340 (0.0006) [2023-03-07 17:23:16,134][232226] Updated weights for policy 0, policy_version 53350 (0.0006) [2023-03-07 17:23:16,935][232226] Updated weights for policy 0, policy_version 53360 (0.0006) [2023-03-07 17:23:17,711][232226] Updated weights for policy 0, policy_version 53370 (0.0006) [2023-03-07 17:23:18,516][232226] Updated weights for policy 0, policy_version 53380 (0.0006) [2023-03-07 17:23:19,313][232226] Updated weights for policy 0, policy_version 53390 (0.0006) [2023-03-07 17:23:20,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 54680576. Throughput: 0: 12905.7. Samples: 54658519. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:23:20,080][231894] Avg episode reward: [(0, '195.695')] [2023-03-07 17:23:20,092][232226] Updated weights for policy 0, policy_version 53400 (0.0007) [2023-03-07 17:23:20,889][232226] Updated weights for policy 0, policy_version 53410 (0.0006) [2023-03-07 17:23:21,668][232226] Updated weights for policy 0, policy_version 53420 (0.0005) [2023-03-07 17:23:22,476][232226] Updated weights for policy 0, policy_version 53430 (0.0006) [2023-03-07 17:23:23,262][232226] Updated weights for policy 0, policy_version 53440 (0.0007) [2023-03-07 17:23:24,064][232226] Updated weights for policy 0, policy_version 53450 (0.0006) [2023-03-07 17:23:24,849][232226] Updated weights for policy 0, policy_version 53460 (0.0006) [2023-03-07 17:23:25,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 54745088. Throughput: 0: 12909.1. Samples: 54736042. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:23:25,080][231894] Avg episode reward: [(0, '193.790')] [2023-03-07 17:23:25,639][232226] Updated weights for policy 0, policy_version 53470 (0.0006) [2023-03-07 17:23:26,442][232226] Updated weights for policy 0, policy_version 53480 (0.0006) [2023-03-07 17:23:27,227][232226] Updated weights for policy 0, policy_version 53490 (0.0006) [2023-03-07 17:23:28,009][232226] Updated weights for policy 0, policy_version 53500 (0.0006) [2023-03-07 17:23:28,810][232226] Updated weights for policy 0, policy_version 53510 (0.0006) [2023-03-07 17:23:29,622][232226] Updated weights for policy 0, policy_version 53520 (0.0007) [2023-03-07 17:23:30,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 54809600. Throughput: 0: 12911.4. Samples: 54774777. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:23:30,069][231894] Avg episode reward: [(0, '198.157')] [2023-03-07 17:23:30,412][232226] Updated weights for policy 0, policy_version 53530 (0.0007) [2023-03-07 17:23:31,213][232226] Updated weights for policy 0, policy_version 53540 (0.0008) [2023-03-07 17:23:32,001][232226] Updated weights for policy 0, policy_version 53550 (0.0007) [2023-03-07 17:23:32,788][232226] Updated weights for policy 0, policy_version 53560 (0.0005) [2023-03-07 17:23:33,579][232226] Updated weights for policy 0, policy_version 53570 (0.0007) [2023-03-07 17:23:34,373][232226] Updated weights for policy 0, policy_version 53580 (0.0006) [2023-03-07 17:23:35,069][231894] Fps is (10 sec: 12902.2, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 54874112. Throughput: 0: 12909.3. Samples: 54852173. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:23:35,070][231894] Avg episode reward: [(0, '199.994')] [2023-03-07 17:23:35,171][232226] Updated weights for policy 0, policy_version 53590 (0.0006) [2023-03-07 17:23:35,982][232226] Updated weights for policy 0, policy_version 53600 (0.0006) [2023-03-07 17:23:36,775][232226] Updated weights for policy 0, policy_version 53610 (0.0007) [2023-03-07 17:23:37,559][232226] Updated weights for policy 0, policy_version 53620 (0.0006) [2023-03-07 17:23:38,369][232226] Updated weights for policy 0, policy_version 53630 (0.0006) [2023-03-07 17:23:39,160][232226] Updated weights for policy 0, policy_version 53640 (0.0006) [2023-03-07 17:23:39,970][232226] Updated weights for policy 0, policy_version 53650 (0.0006) [2023-03-07 17:23:40,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 54938624. Throughput: 0: 12905.1. Samples: 54929373. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:23:40,069][231894] Avg episode reward: [(0, '196.550')] [2023-03-07 17:23:40,757][232226] Updated weights for policy 0, policy_version 53660 (0.0006) [2023-03-07 17:23:41,557][232226] Updated weights for policy 0, policy_version 53670 (0.0006) [2023-03-07 17:23:42,350][232226] Updated weights for policy 0, policy_version 53680 (0.0006) [2023-03-07 17:23:43,130][232226] Updated weights for policy 0, policy_version 53690 (0.0006) [2023-03-07 17:23:43,927][232226] Updated weights for policy 0, policy_version 53700 (0.0006) [2023-03-07 17:23:44,719][232226] Updated weights for policy 0, policy_version 53710 (0.0006) [2023-03-07 17:23:45,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 55003136. Throughput: 0: 12901.7. Samples: 54967819. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:23:45,069][231894] Avg episode reward: [(0, '201.202')] [2023-03-07 17:23:45,501][232226] Updated weights for policy 0, policy_version 53720 (0.0007) [2023-03-07 17:23:46,304][232226] Updated weights for policy 0, policy_version 53730 (0.0007) [2023-03-07 17:23:47,117][232226] Updated weights for policy 0, policy_version 53740 (0.0006) [2023-03-07 17:23:47,899][232226] Updated weights for policy 0, policy_version 53750 (0.0007) [2023-03-07 17:23:48,693][232226] Updated weights for policy 0, policy_version 53760 (0.0006) [2023-03-07 17:23:49,496][232226] Updated weights for policy 0, policy_version 53770 (0.0007) [2023-03-07 17:23:50,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 55067648. Throughput: 0: 12897.3. Samples: 55045318. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:23:50,069][231894] Avg episode reward: [(0, '187.181')] [2023-03-07 17:23:50,274][232226] Updated weights for policy 0, policy_version 53780 (0.0006) [2023-03-07 17:23:51,059][232226] Updated weights for policy 0, policy_version 53790 (0.0006) [2023-03-07 17:23:51,873][232226] Updated weights for policy 0, policy_version 53800 (0.0007) [2023-03-07 17:23:52,658][232226] Updated weights for policy 0, policy_version 53810 (0.0006) [2023-03-07 17:23:53,429][232226] Updated weights for policy 0, policy_version 53820 (0.0006) [2023-03-07 17:23:54,249][232226] Updated weights for policy 0, policy_version 53830 (0.0006) [2023-03-07 17:23:55,055][232226] Updated weights for policy 0, policy_version 53840 (0.0006) [2023-03-07 17:23:55,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 55132160. Throughput: 0: 12895.0. Samples: 55122760. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:23:55,069][231894] Avg episode reward: [(0, '196.430')] [2023-03-07 17:23:55,841][232226] Updated weights for policy 0, policy_version 53850 (0.0006) [2023-03-07 17:23:56,635][232226] Updated weights for policy 0, policy_version 53860 (0.0006) [2023-03-07 17:23:57,432][232226] Updated weights for policy 0, policy_version 53870 (0.0007) [2023-03-07 17:23:58,222][232226] Updated weights for policy 0, policy_version 53880 (0.0006) [2023-03-07 17:23:59,008][232226] Updated weights for policy 0, policy_version 53890 (0.0006) [2023-03-07 17:23:59,803][232226] Updated weights for policy 0, policy_version 53900 (0.0006) [2023-03-07 17:24:00,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 55196672. Throughput: 0: 12896.5. Samples: 55161332. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:24:00,069][231894] Avg episode reward: [(0, '195.745')] [2023-03-07 17:24:00,587][232226] Updated weights for policy 0, policy_version 53910 (0.0006) [2023-03-07 17:24:01,381][232226] Updated weights for policy 0, policy_version 53920 (0.0007) [2023-03-07 17:24:02,168][232226] Updated weights for policy 0, policy_version 53930 (0.0007) [2023-03-07 17:24:02,961][232226] Updated weights for policy 0, policy_version 53940 (0.0007) [2023-03-07 17:24:03,764][232226] Updated weights for policy 0, policy_version 53950 (0.0006) [2023-03-07 17:24:04,550][232226] Updated weights for policy 0, policy_version 53960 (0.0006) [2023-03-07 17:24:05,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 55261184. Throughput: 0: 12899.6. Samples: 55239001. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:24:05,069][231894] Avg episode reward: [(0, '195.412')] [2023-03-07 17:24:05,353][232226] Updated weights for policy 0, policy_version 53970 (0.0006) [2023-03-07 17:24:06,159][232226] Updated weights for policy 0, policy_version 53980 (0.0006) [2023-03-07 17:24:06,946][232226] Updated weights for policy 0, policy_version 53990 (0.0006) [2023-03-07 17:24:07,725][232226] Updated weights for policy 0, policy_version 54000 (0.0006) [2023-03-07 17:24:08,510][232226] Updated weights for policy 0, policy_version 54010 (0.0006) [2023-03-07 17:24:09,301][232226] Updated weights for policy 0, policy_version 54020 (0.0006) [2023-03-07 17:24:10,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 55325696. Throughput: 0: 12898.0. Samples: 55316452. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:24:10,069][231894] Avg episode reward: [(0, '192.782')] [2023-03-07 17:24:10,089][232226] Updated weights for policy 0, policy_version 54030 (0.0006) [2023-03-07 17:24:10,892][232226] Updated weights for policy 0, policy_version 54040 (0.0006) [2023-03-07 17:24:11,706][232226] Updated weights for policy 0, policy_version 54050 (0.0006) [2023-03-07 17:24:12,517][232226] Updated weights for policy 0, policy_version 54060 (0.0007) [2023-03-07 17:24:13,290][232226] Updated weights for policy 0, policy_version 54070 (0.0007) [2023-03-07 17:24:14,093][232226] Updated weights for policy 0, policy_version 54080 (0.0007) [2023-03-07 17:24:14,896][232226] Updated weights for policy 0, policy_version 54090 (0.0006) [2023-03-07 17:24:15,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 55390208. Throughput: 0: 12894.0. Samples: 55355009. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:24:15,069][231894] Avg episode reward: [(0, '201.689')] [2023-03-07 17:24:15,683][232226] Updated weights for policy 0, policy_version 54100 (0.0006) [2023-03-07 17:24:16,507][232226] Updated weights for policy 0, policy_version 54110 (0.0006) [2023-03-07 17:24:17,288][232226] Updated weights for policy 0, policy_version 54120 (0.0006) [2023-03-07 17:24:18,077][232226] Updated weights for policy 0, policy_version 54130 (0.0006) [2023-03-07 17:24:18,877][232226] Updated weights for policy 0, policy_version 54140 (0.0006) [2023-03-07 17:24:19,662][232226] Updated weights for policy 0, policy_version 54150 (0.0006) [2023-03-07 17:24:20,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 55454720. Throughput: 0: 12888.9. Samples: 55432175. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:24:20,070][231894] Avg episode reward: [(0, '193.384')] [2023-03-07 17:24:20,438][232226] Updated weights for policy 0, policy_version 54160 (0.0007) [2023-03-07 17:24:21,236][232226] Updated weights for policy 0, policy_version 54170 (0.0005) [2023-03-07 17:24:22,029][232226] Updated weights for policy 0, policy_version 54180 (0.0006) [2023-03-07 17:24:22,809][232226] Updated weights for policy 0, policy_version 54190 (0.0007) [2023-03-07 17:24:23,609][232226] Updated weights for policy 0, policy_version 54200 (0.0006) [2023-03-07 17:24:24,407][232226] Updated weights for policy 0, policy_version 54210 (0.0007) [2023-03-07 17:24:25,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 55519232. Throughput: 0: 12899.4. Samples: 55509845. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:24:25,069][231894] Avg episode reward: [(0, '192.826')] [2023-03-07 17:24:25,073][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000054218_55519232.pth... [2023-03-07 17:24:25,156][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000051195_52423680.pth [2023-03-07 17:24:25,199][232226] Updated weights for policy 0, policy_version 54220 (0.0007) [2023-03-07 17:24:25,994][232226] Updated weights for policy 0, policy_version 54230 (0.0006) [2023-03-07 17:24:26,788][232226] Updated weights for policy 0, policy_version 54240 (0.0006) [2023-03-07 17:24:27,590][232226] Updated weights for policy 0, policy_version 54250 (0.0007) [2023-03-07 17:24:28,407][232226] Updated weights for policy 0, policy_version 54260 (0.0007) [2023-03-07 17:24:29,200][232226] Updated weights for policy 0, policy_version 54270 (0.0006) [2023-03-07 17:24:29,983][232226] Updated weights for policy 0, policy_version 54280 (0.0007) [2023-03-07 17:24:30,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 55583744. Throughput: 0: 12899.4. Samples: 55548290. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:24:30,069][231894] Avg episode reward: [(0, '193.019')] [2023-03-07 17:24:30,790][232226] Updated weights for policy 0, policy_version 54290 (0.0006) [2023-03-07 17:24:31,590][232226] Updated weights for policy 0, policy_version 54300 (0.0007) [2023-03-07 17:24:32,377][232226] Updated weights for policy 0, policy_version 54310 (0.0006) [2023-03-07 17:24:33,169][232226] Updated weights for policy 0, policy_version 54320 (0.0006) [2023-03-07 17:24:33,974][232226] Updated weights for policy 0, policy_version 54330 (0.0006) [2023-03-07 17:24:34,763][232226] Updated weights for policy 0, policy_version 54340 (0.0006) [2023-03-07 17:24:35,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12885.3, 300 sec: 12898.9). Total num frames: 55647232. Throughput: 0: 12892.1. Samples: 55625461. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:24:35,070][231894] Avg episode reward: [(0, '202.758')] [2023-03-07 17:24:35,561][232226] Updated weights for policy 0, policy_version 54350 (0.0006) [2023-03-07 17:24:36,367][232226] Updated weights for policy 0, policy_version 54360 (0.0006) [2023-03-07 17:24:37,161][232226] Updated weights for policy 0, policy_version 54370 (0.0006) [2023-03-07 17:24:37,934][232226] Updated weights for policy 0, policy_version 54380 (0.0006) [2023-03-07 17:24:38,731][232226] Updated weights for policy 0, policy_version 54390 (0.0006) [2023-03-07 17:24:39,516][232226] Updated weights for policy 0, policy_version 54400 (0.0006) [2023-03-07 17:24:40,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12885.3, 300 sec: 12898.9). Total num frames: 55711744. Throughput: 0: 12893.3. Samples: 55702960. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:24:40,069][231894] Avg episode reward: [(0, '196.200')] [2023-03-07 17:24:40,301][232226] Updated weights for policy 0, policy_version 54410 (0.0006) [2023-03-07 17:24:41,103][232226] Updated weights for policy 0, policy_version 54420 (0.0006) [2023-03-07 17:24:41,897][232226] Updated weights for policy 0, policy_version 54430 (0.0006) [2023-03-07 17:24:42,681][232226] Updated weights for policy 0, policy_version 54440 (0.0006) [2023-03-07 17:24:43,490][232226] Updated weights for policy 0, policy_version 54450 (0.0006) [2023-03-07 17:24:44,288][232226] Updated weights for policy 0, policy_version 54460 (0.0006) [2023-03-07 17:24:45,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.4, 300 sec: 12895.5). Total num frames: 55776256. Throughput: 0: 12895.1. Samples: 55741611. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:24:45,069][231894] Avg episode reward: [(0, '190.627')] [2023-03-07 17:24:45,090][232226] Updated weights for policy 0, policy_version 54470 (0.0007) [2023-03-07 17:24:45,868][232226] Updated weights for policy 0, policy_version 54480 (0.0007) [2023-03-07 17:24:46,650][232226] Updated weights for policy 0, policy_version 54490 (0.0006) [2023-03-07 17:24:47,452][232226] Updated weights for policy 0, policy_version 54500 (0.0006) [2023-03-07 17:24:48,248][232226] Updated weights for policy 0, policy_version 54510 (0.0007) [2023-03-07 17:24:49,036][232226] Updated weights for policy 0, policy_version 54520 (0.0006) [2023-03-07 17:24:49,812][232226] Updated weights for policy 0, policy_version 54530 (0.0006) [2023-03-07 17:24:50,069][231894] Fps is (10 sec: 13004.8, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 55841792. Throughput: 0: 12890.0. Samples: 55819049. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:24:50,069][231894] Avg episode reward: [(0, '190.065')] [2023-03-07 17:24:50,630][232226] Updated weights for policy 0, policy_version 54540 (0.0006) [2023-03-07 17:24:51,411][232226] Updated weights for policy 0, policy_version 54550 (0.0006) [2023-03-07 17:24:52,208][232226] Updated weights for policy 0, policy_version 54560 (0.0006) [2023-03-07 17:24:52,985][232226] Updated weights for policy 0, policy_version 54570 (0.0006) [2023-03-07 17:24:53,778][232226] Updated weights for policy 0, policy_version 54580 (0.0006) [2023-03-07 17:24:54,555][232226] Updated weights for policy 0, policy_version 54590 (0.0006) [2023-03-07 17:24:55,069][231894] Fps is (10 sec: 13004.6, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 55906304. Throughput: 0: 12899.7. Samples: 55896938. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:24:55,070][231894] Avg episode reward: [(0, '192.605')] [2023-03-07 17:24:55,346][232226] Updated weights for policy 0, policy_version 54600 (0.0007) [2023-03-07 17:24:56,137][232226] Updated weights for policy 0, policy_version 54610 (0.0006) [2023-03-07 17:24:56,940][232226] Updated weights for policy 0, policy_version 54620 (0.0007) [2023-03-07 17:24:57,714][232226] Updated weights for policy 0, policy_version 54630 (0.0006) [2023-03-07 17:24:58,512][232226] Updated weights for policy 0, policy_version 54640 (0.0008) [2023-03-07 17:24:59,306][232226] Updated weights for policy 0, policy_version 54650 (0.0006) [2023-03-07 17:25:00,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 55970816. Throughput: 0: 12907.3. Samples: 55935837. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:25:00,070][231894] Avg episode reward: [(0, '193.804')] [2023-03-07 17:25:00,092][232226] Updated weights for policy 0, policy_version 54660 (0.0007) [2023-03-07 17:25:00,902][232226] Updated weights for policy 0, policy_version 54670 (0.0006) [2023-03-07 17:25:01,699][232226] Updated weights for policy 0, policy_version 54680 (0.0006) [2023-03-07 17:25:02,486][232226] Updated weights for policy 0, policy_version 54690 (0.0006) [2023-03-07 17:25:03,287][232226] Updated weights for policy 0, policy_version 54700 (0.0006) [2023-03-07 17:25:04,070][232226] Updated weights for policy 0, policy_version 54710 (0.0007) [2023-03-07 17:25:04,859][232226] Updated weights for policy 0, policy_version 54720 (0.0006) [2023-03-07 17:25:05,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 56035328. Throughput: 0: 12910.0. Samples: 56013124. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:25:05,069][231894] Avg episode reward: [(0, '193.460')] [2023-03-07 17:25:05,641][232226] Updated weights for policy 0, policy_version 54730 (0.0006) [2023-03-07 17:25:06,435][232226] Updated weights for policy 0, policy_version 54740 (0.0007) [2023-03-07 17:25:07,235][232226] Updated weights for policy 0, policy_version 54750 (0.0006) [2023-03-07 17:25:08,013][232226] Updated weights for policy 0, policy_version 54760 (0.0006) [2023-03-07 17:25:08,810][232226] Updated weights for policy 0, policy_version 54770 (0.0006) [2023-03-07 17:25:09,594][232226] Updated weights for policy 0, policy_version 54780 (0.0006) [2023-03-07 17:25:10,069][231894] Fps is (10 sec: 13004.9, 60 sec: 12919.5, 300 sec: 12902.4). Total num frames: 56100864. Throughput: 0: 12912.1. Samples: 56090889. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:25:10,069][231894] Avg episode reward: [(0, '191.507')] [2023-03-07 17:25:10,379][232226] Updated weights for policy 0, policy_version 54790 (0.0006) [2023-03-07 17:25:11,190][232226] Updated weights for policy 0, policy_version 54800 (0.0006) [2023-03-07 17:25:11,978][232226] Updated weights for policy 0, policy_version 54810 (0.0006) [2023-03-07 17:25:12,771][232226] Updated weights for policy 0, policy_version 54820 (0.0007) [2023-03-07 17:25:13,566][232226] Updated weights for policy 0, policy_version 54830 (0.0006) [2023-03-07 17:25:14,370][232226] Updated weights for policy 0, policy_version 54840 (0.0006) [2023-03-07 17:25:15,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 56164352. Throughput: 0: 12920.5. Samples: 56129712. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:25:15,069][231894] Avg episode reward: [(0, '199.480')] [2023-03-07 17:25:15,155][232226] Updated weights for policy 0, policy_version 54850 (0.0006) [2023-03-07 17:25:15,958][232226] Updated weights for policy 0, policy_version 54860 (0.0007) [2023-03-07 17:25:16,751][232226] Updated weights for policy 0, policy_version 54870 (0.0006) [2023-03-07 17:25:17,529][232226] Updated weights for policy 0, policy_version 54880 (0.0007) [2023-03-07 17:25:18,322][232226] Updated weights for policy 0, policy_version 54890 (0.0006) [2023-03-07 17:25:19,109][232226] Updated weights for policy 0, policy_version 54900 (0.0006) [2023-03-07 17:25:19,900][232226] Updated weights for policy 0, policy_version 54910 (0.0007) [2023-03-07 17:25:20,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12919.5, 300 sec: 12902.4). Total num frames: 56229888. Throughput: 0: 12930.4. Samples: 56207326. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:25:20,069][231894] Avg episode reward: [(0, '199.227')] [2023-03-07 17:25:20,695][232226] Updated weights for policy 0, policy_version 54920 (0.0006) [2023-03-07 17:25:21,488][232226] Updated weights for policy 0, policy_version 54930 (0.0006) [2023-03-07 17:25:22,279][232226] Updated weights for policy 0, policy_version 54940 (0.0006) [2023-03-07 17:25:23,062][232226] Updated weights for policy 0, policy_version 54950 (0.0007) [2023-03-07 17:25:23,850][232226] Updated weights for policy 0, policy_version 54960 (0.0005) [2023-03-07 17:25:24,650][232226] Updated weights for policy 0, policy_version 54970 (0.0007) [2023-03-07 17:25:25,069][231894] Fps is (10 sec: 13004.8, 60 sec: 12919.5, 300 sec: 12902.4). Total num frames: 56294400. Throughput: 0: 12931.4. Samples: 56284876. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:25:25,069][231894] Avg episode reward: [(0, '191.253')] [2023-03-07 17:25:25,439][232226] Updated weights for policy 0, policy_version 54980 (0.0006) [2023-03-07 17:25:26,257][232226] Updated weights for policy 0, policy_version 54990 (0.0006) [2023-03-07 17:25:27,049][232226] Updated weights for policy 0, policy_version 55000 (0.0006) [2023-03-07 17:25:27,840][232226] Updated weights for policy 0, policy_version 55010 (0.0006) [2023-03-07 17:25:28,637][232226] Updated weights for policy 0, policy_version 55020 (0.0006) [2023-03-07 17:25:29,438][232226] Updated weights for policy 0, policy_version 55030 (0.0006) [2023-03-07 17:25:30,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12919.5, 300 sec: 12902.4). Total num frames: 56358912. Throughput: 0: 12928.6. Samples: 56323400. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:25:30,069][231894] Avg episode reward: [(0, '192.055')] [2023-03-07 17:25:30,237][232226] Updated weights for policy 0, policy_version 55040 (0.0006) [2023-03-07 17:25:31,024][232226] Updated weights for policy 0, policy_version 55050 (0.0006) [2023-03-07 17:25:31,841][232226] Updated weights for policy 0, policy_version 55060 (0.0007) [2023-03-07 17:25:32,617][232226] Updated weights for policy 0, policy_version 55070 (0.0006) [2023-03-07 17:25:33,411][232226] Updated weights for policy 0, policy_version 55080 (0.0007) [2023-03-07 17:25:34,210][232226] Updated weights for policy 0, policy_version 55090 (0.0007) [2023-03-07 17:25:35,017][232226] Updated weights for policy 0, policy_version 55100 (0.0006) [2023-03-07 17:25:35,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12919.5, 300 sec: 12902.4). Total num frames: 56422400. Throughput: 0: 12923.6. Samples: 56400610. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:25:35,069][231894] Avg episode reward: [(0, '187.762')] [2023-03-07 17:25:35,809][232226] Updated weights for policy 0, policy_version 55110 (0.0006) [2023-03-07 17:25:36,615][232226] Updated weights for policy 0, policy_version 55120 (0.0007) [2023-03-07 17:25:37,400][232226] Updated weights for policy 0, policy_version 55130 (0.0007) [2023-03-07 17:25:38,218][232226] Updated weights for policy 0, policy_version 55140 (0.0007) [2023-03-07 17:25:39,014][232226] Updated weights for policy 0, policy_version 55150 (0.0006) [2023-03-07 17:25:39,813][232226] Updated weights for policy 0, policy_version 55160 (0.0007) [2023-03-07 17:25:40,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12919.5, 300 sec: 12902.4). Total num frames: 56486912. Throughput: 0: 12900.1. Samples: 56477440. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:25:40,070][231894] Avg episode reward: [(0, '195.238')] [2023-03-07 17:25:40,617][232226] Updated weights for policy 0, policy_version 55170 (0.0007) [2023-03-07 17:25:41,413][232226] Updated weights for policy 0, policy_version 55180 (0.0007) [2023-03-07 17:25:42,198][232226] Updated weights for policy 0, policy_version 55190 (0.0006) [2023-03-07 17:25:43,013][232226] Updated weights for policy 0, policy_version 55200 (0.0006) [2023-03-07 17:25:43,789][232226] Updated weights for policy 0, policy_version 55210 (0.0008) [2023-03-07 17:25:44,590][232226] Updated weights for policy 0, policy_version 55220 (0.0006) [2023-03-07 17:25:45,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12919.5, 300 sec: 12902.4). Total num frames: 56551424. Throughput: 0: 12887.8. Samples: 56515785. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:25:45,069][231894] Avg episode reward: [(0, '194.047')] [2023-03-07 17:25:45,379][232226] Updated weights for policy 0, policy_version 55230 (0.0007) [2023-03-07 17:25:46,166][232226] Updated weights for policy 0, policy_version 55240 (0.0007) [2023-03-07 17:25:46,996][232226] Updated weights for policy 0, policy_version 55250 (0.0006) [2023-03-07 17:25:47,801][232226] Updated weights for policy 0, policy_version 55260 (0.0007) [2023-03-07 17:25:48,579][232226] Updated weights for policy 0, policy_version 55270 (0.0006) [2023-03-07 17:25:49,373][232226] Updated weights for policy 0, policy_version 55280 (0.0007) [2023-03-07 17:25:50,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12885.3, 300 sec: 12898.9). Total num frames: 56614912. Throughput: 0: 12885.3. Samples: 56592963. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:25:50,069][231894] Avg episode reward: [(0, '191.778')] [2023-03-07 17:25:50,165][232226] Updated weights for policy 0, policy_version 55290 (0.0006) [2023-03-07 17:25:50,961][232226] Updated weights for policy 0, policy_version 55300 (0.0006) [2023-03-07 17:25:51,762][232226] Updated weights for policy 0, policy_version 55310 (0.0006) [2023-03-07 17:25:52,562][232226] Updated weights for policy 0, policy_version 55320 (0.0007) [2023-03-07 17:25:53,352][232226] Updated weights for policy 0, policy_version 55330 (0.0006) [2023-03-07 17:25:54,161][232226] Updated weights for policy 0, policy_version 55340 (0.0007) [2023-03-07 17:25:54,943][232226] Updated weights for policy 0, policy_version 55350 (0.0006) [2023-03-07 17:25:55,069][231894] Fps is (10 sec: 12799.8, 60 sec: 12885.3, 300 sec: 12898.9). Total num frames: 56679424. Throughput: 0: 12871.6. Samples: 56670115. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:25:55,070][231894] Avg episode reward: [(0, '193.802')] [2023-03-07 17:25:55,738][232226] Updated weights for policy 0, policy_version 55360 (0.0006) [2023-03-07 17:25:56,532][232226] Updated weights for policy 0, policy_version 55370 (0.0006) [2023-03-07 17:25:57,337][232226] Updated weights for policy 0, policy_version 55380 (0.0006) [2023-03-07 17:25:58,149][232226] Updated weights for policy 0, policy_version 55390 (0.0006) [2023-03-07 17:25:58,925][232226] Updated weights for policy 0, policy_version 55400 (0.0006) [2023-03-07 17:25:59,723][232226] Updated weights for policy 0, policy_version 55410 (0.0006) [2023-03-07 17:26:00,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12898.9). Total num frames: 56743936. Throughput: 0: 12868.8. Samples: 56708808. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:26:00,069][231894] Avg episode reward: [(0, '199.154')] [2023-03-07 17:26:00,490][232226] Updated weights for policy 0, policy_version 55420 (0.0006) [2023-03-07 17:26:01,314][232226] Updated weights for policy 0, policy_version 55430 (0.0006) [2023-03-07 17:26:02,102][232226] Updated weights for policy 0, policy_version 55440 (0.0007) [2023-03-07 17:26:02,890][232226] Updated weights for policy 0, policy_version 55450 (0.0007) [2023-03-07 17:26:03,698][232226] Updated weights for policy 0, policy_version 55460 (0.0006) [2023-03-07 17:26:04,498][232226] Updated weights for policy 0, policy_version 55470 (0.0007) [2023-03-07 17:26:05,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12885.3, 300 sec: 12898.9). Total num frames: 56808448. Throughput: 0: 12863.0. Samples: 56786161. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:26:05,069][231894] Avg episode reward: [(0, '188.506')] [2023-03-07 17:26:05,284][232226] Updated weights for policy 0, policy_version 55480 (0.0007) [2023-03-07 17:26:06,073][232226] Updated weights for policy 0, policy_version 55490 (0.0006) [2023-03-07 17:26:06,913][232226] Updated weights for policy 0, policy_version 55500 (0.0007) [2023-03-07 17:26:07,683][232226] Updated weights for policy 0, policy_version 55510 (0.0008) [2023-03-07 17:26:08,478][232226] Updated weights for policy 0, policy_version 55520 (0.0007) [2023-03-07 17:26:09,269][232226] Updated weights for policy 0, policy_version 55530 (0.0006) [2023-03-07 17:26:10,062][232226] Updated weights for policy 0, policy_version 55540 (0.0007) [2023-03-07 17:26:10,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12902.4). Total num frames: 56872960. Throughput: 0: 12852.5. Samples: 56863239. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:26:10,069][231894] Avg episode reward: [(0, '187.116')] [2023-03-07 17:26:10,843][232226] Updated weights for policy 0, policy_version 55550 (0.0006) [2023-03-07 17:26:11,645][232226] Updated weights for policy 0, policy_version 55560 (0.0006) [2023-03-07 17:26:12,442][232226] Updated weights for policy 0, policy_version 55570 (0.0006) [2023-03-07 17:26:13,248][232226] Updated weights for policy 0, policy_version 55580 (0.0006) [2023-03-07 17:26:14,026][232226] Updated weights for policy 0, policy_version 55590 (0.0006) [2023-03-07 17:26:14,808][232226] Updated weights for policy 0, policy_version 55600 (0.0007) [2023-03-07 17:26:15,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12898.9). Total num frames: 56937472. Throughput: 0: 12858.0. Samples: 56902011. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:26:15,069][231894] Avg episode reward: [(0, '200.747')] [2023-03-07 17:26:15,605][232226] Updated weights for policy 0, policy_version 55610 (0.0007) [2023-03-07 17:26:16,389][232226] Updated weights for policy 0, policy_version 55620 (0.0006) [2023-03-07 17:26:17,184][232226] Updated weights for policy 0, policy_version 55630 (0.0006) [2023-03-07 17:26:17,993][232226] Updated weights for policy 0, policy_version 55640 (0.0006) [2023-03-07 17:26:18,784][232226] Updated weights for policy 0, policy_version 55650 (0.0006) [2023-03-07 17:26:19,597][232226] Updated weights for policy 0, policy_version 55660 (0.0007) [2023-03-07 17:26:20,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12898.9). Total num frames: 57000960. Throughput: 0: 12863.0. Samples: 56979445. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:26:20,069][231894] Avg episode reward: [(0, '195.245')] [2023-03-07 17:26:20,398][232226] Updated weights for policy 0, policy_version 55670 (0.0006) [2023-03-07 17:26:21,188][232226] Updated weights for policy 0, policy_version 55680 (0.0008) [2023-03-07 17:26:21,970][232226] Updated weights for policy 0, policy_version 55690 (0.0006) [2023-03-07 17:26:22,781][232226] Updated weights for policy 0, policy_version 55700 (0.0007) [2023-03-07 17:26:23,572][232226] Updated weights for policy 0, policy_version 55710 (0.0006) [2023-03-07 17:26:24,352][232226] Updated weights for policy 0, policy_version 55720 (0.0006) [2023-03-07 17:26:25,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12851.2, 300 sec: 12898.9). Total num frames: 57065472. Throughput: 0: 12869.4. Samples: 57056563. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:26:25,070][231894] Avg episode reward: [(0, '199.716')] [2023-03-07 17:26:25,075][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000055729_57066496.pth... [2023-03-07 17:26:25,104][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000052706_53970944.pth [2023-03-07 17:26:25,155][232226] Updated weights for policy 0, policy_version 55730 (0.0006) [2023-03-07 17:26:25,950][232226] Updated weights for policy 0, policy_version 55740 (0.0006) [2023-03-07 17:26:26,745][232226] Updated weights for policy 0, policy_version 55750 (0.0006) [2023-03-07 17:26:27,542][232226] Updated weights for policy 0, policy_version 55760 (0.0006) [2023-03-07 17:26:28,340][232226] Updated weights for policy 0, policy_version 55770 (0.0007) [2023-03-07 17:26:29,121][232226] Updated weights for policy 0, policy_version 55780 (0.0006) [2023-03-07 17:26:29,921][232226] Updated weights for policy 0, policy_version 55790 (0.0006) [2023-03-07 17:26:30,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12898.9). Total num frames: 57129984. Throughput: 0: 12876.7. Samples: 57095236. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:26:30,070][231894] Avg episode reward: [(0, '192.465')] [2023-03-07 17:26:30,696][232226] Updated weights for policy 0, policy_version 55800 (0.0005) [2023-03-07 17:26:31,488][232226] Updated weights for policy 0, policy_version 55810 (0.0006) [2023-03-07 17:26:32,294][232226] Updated weights for policy 0, policy_version 55820 (0.0007) [2023-03-07 17:26:33,089][232226] Updated weights for policy 0, policy_version 55830 (0.0007) [2023-03-07 17:26:33,869][232226] Updated weights for policy 0, policy_version 55840 (0.0006) [2023-03-07 17:26:34,648][232226] Updated weights for policy 0, policy_version 55850 (0.0006) [2023-03-07 17:26:35,069][231894] Fps is (10 sec: 13004.8, 60 sec: 12885.3, 300 sec: 12902.4). Total num frames: 57195520. Throughput: 0: 12889.6. Samples: 57172997. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:26:35,069][231894] Avg episode reward: [(0, '192.152')] [2023-03-07 17:26:35,437][232226] Updated weights for policy 0, policy_version 55860 (0.0006) [2023-03-07 17:26:36,203][232226] Updated weights for policy 0, policy_version 55870 (0.0007) [2023-03-07 17:26:37,026][232226] Updated weights for policy 0, policy_version 55880 (0.0007) [2023-03-07 17:26:37,805][232226] Updated weights for policy 0, policy_version 55890 (0.0006) [2023-03-07 17:26:38,606][232226] Updated weights for policy 0, policy_version 55900 (0.0006) [2023-03-07 17:26:39,385][232226] Updated weights for policy 0, policy_version 55910 (0.0007) [2023-03-07 17:26:40,069][231894] Fps is (10 sec: 13004.9, 60 sec: 12885.3, 300 sec: 12902.4). Total num frames: 57260032. Throughput: 0: 12903.0. Samples: 57250747. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:26:40,069][231894] Avg episode reward: [(0, '190.030')] [2023-03-07 17:26:40,192][232226] Updated weights for policy 0, policy_version 55920 (0.0006) [2023-03-07 17:26:41,005][232226] Updated weights for policy 0, policy_version 55930 (0.0007) [2023-03-07 17:26:41,788][232226] Updated weights for policy 0, policy_version 55940 (0.0006) [2023-03-07 17:26:42,584][232226] Updated weights for policy 0, policy_version 55950 (0.0006) [2023-03-07 17:26:43,378][232226] Updated weights for policy 0, policy_version 55960 (0.0006) [2023-03-07 17:26:44,162][232226] Updated weights for policy 0, policy_version 55970 (0.0007) [2023-03-07 17:26:44,941][232226] Updated weights for policy 0, policy_version 55980 (0.0006) [2023-03-07 17:26:45,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12902.4). Total num frames: 57324544. Throughput: 0: 12899.0. Samples: 57289262. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:26:45,069][231894] Avg episode reward: [(0, '193.243')] [2023-03-07 17:26:45,737][232226] Updated weights for policy 0, policy_version 55990 (0.0007) [2023-03-07 17:26:46,542][232226] Updated weights for policy 0, policy_version 56000 (0.0006) [2023-03-07 17:26:47,335][232226] Updated weights for policy 0, policy_version 56010 (0.0006) [2023-03-07 17:26:48,127][232226] Updated weights for policy 0, policy_version 56020 (0.0006) [2023-03-07 17:26:48,925][232226] Updated weights for policy 0, policy_version 56030 (0.0006) [2023-03-07 17:26:49,713][232226] Updated weights for policy 0, policy_version 56040 (0.0006) [2023-03-07 17:26:50,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 57389056. Throughput: 0: 12902.9. Samples: 57366792. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:26:50,069][231894] Avg episode reward: [(0, '196.031')] [2023-03-07 17:26:50,490][232226] Updated weights for policy 0, policy_version 56050 (0.0006) [2023-03-07 17:26:51,309][232226] Updated weights for policy 0, policy_version 56060 (0.0007) [2023-03-07 17:26:52,125][232226] Updated weights for policy 0, policy_version 56070 (0.0006) [2023-03-07 17:26:52,922][232226] Updated weights for policy 0, policy_version 56080 (0.0007) [2023-03-07 17:26:53,713][232226] Updated weights for policy 0, policy_version 56090 (0.0007) [2023-03-07 17:26:54,532][232226] Updated weights for policy 0, policy_version 56100 (0.0006) [2023-03-07 17:26:55,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 57453568. Throughput: 0: 12899.0. Samples: 57443695. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:26:55,069][231894] Avg episode reward: [(0, '199.417')] [2023-03-07 17:26:55,316][232226] Updated weights for policy 0, policy_version 56110 (0.0006) [2023-03-07 17:26:56,085][232226] Updated weights for policy 0, policy_version 56120 (0.0006) [2023-03-07 17:26:56,893][232226] Updated weights for policy 0, policy_version 56130 (0.0006) [2023-03-07 17:26:57,689][232226] Updated weights for policy 0, policy_version 56140 (0.0007) [2023-03-07 17:26:58,495][232226] Updated weights for policy 0, policy_version 56150 (0.0006) [2023-03-07 17:26:59,286][232226] Updated weights for policy 0, policy_version 56160 (0.0006) [2023-03-07 17:27:00,053][232226] Updated weights for policy 0, policy_version 56170 (0.0007) [2023-03-07 17:27:00,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 57518080. Throughput: 0: 12904.1. Samples: 57482695. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:27:00,069][231894] Avg episode reward: [(0, '199.098')] [2023-03-07 17:27:00,858][232226] Updated weights for policy 0, policy_version 56180 (0.0007) [2023-03-07 17:27:01,653][232226] Updated weights for policy 0, policy_version 56190 (0.0007) [2023-03-07 17:27:02,433][232226] Updated weights for policy 0, policy_version 56200 (0.0006) [2023-03-07 17:27:03,233][232226] Updated weights for policy 0, policy_version 56210 (0.0007) [2023-03-07 17:27:04,018][232226] Updated weights for policy 0, policy_version 56220 (0.0007) [2023-03-07 17:27:04,821][232226] Updated weights for policy 0, policy_version 56230 (0.0006) [2023-03-07 17:27:05,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 57582592. Throughput: 0: 12903.9. Samples: 57560121. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:27:05,069][231894] Avg episode reward: [(0, '196.750')] [2023-03-07 17:27:05,606][232226] Updated weights for policy 0, policy_version 56240 (0.0007) [2023-03-07 17:27:06,399][232226] Updated weights for policy 0, policy_version 56250 (0.0007) [2023-03-07 17:27:07,213][232226] Updated weights for policy 0, policy_version 56260 (0.0006) [2023-03-07 17:27:08,016][232226] Updated weights for policy 0, policy_version 56270 (0.0006) [2023-03-07 17:27:08,803][232226] Updated weights for policy 0, policy_version 56280 (0.0007) [2023-03-07 17:27:09,626][232226] Updated weights for policy 0, policy_version 56290 (0.0006) [2023-03-07 17:27:10,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12885.3, 300 sec: 12895.5). Total num frames: 57646080. Throughput: 0: 12899.4. Samples: 57637035. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:27:10,069][231894] Avg episode reward: [(0, '187.297')] [2023-03-07 17:27:10,418][232226] Updated weights for policy 0, policy_version 56300 (0.0007) [2023-03-07 17:27:11,210][232226] Updated weights for policy 0, policy_version 56310 (0.0007) [2023-03-07 17:27:12,016][232226] Updated weights for policy 0, policy_version 56320 (0.0006) [2023-03-07 17:27:12,794][232226] Updated weights for policy 0, policy_version 56330 (0.0006) [2023-03-07 17:27:13,602][232226] Updated weights for policy 0, policy_version 56340 (0.0006) [2023-03-07 17:27:14,373][232226] Updated weights for policy 0, policy_version 56350 (0.0006) [2023-03-07 17:27:15,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12902.4, 300 sec: 12898.9). Total num frames: 57711616. Throughput: 0: 12898.7. Samples: 57675675. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:27:15,069][231894] Avg episode reward: [(0, '188.432')] [2023-03-07 17:27:15,173][232226] Updated weights for policy 0, policy_version 56360 (0.0006) [2023-03-07 17:27:15,953][232226] Updated weights for policy 0, policy_version 56370 (0.0006) [2023-03-07 17:27:16,779][232226] Updated weights for policy 0, policy_version 56380 (0.0006) [2023-03-07 17:27:17,580][232226] Updated weights for policy 0, policy_version 56390 (0.0007) [2023-03-07 17:27:18,359][232226] Updated weights for policy 0, policy_version 56400 (0.0007) [2023-03-07 17:27:19,162][232226] Updated weights for policy 0, policy_version 56410 (0.0006) [2023-03-07 17:27:19,938][232226] Updated weights for policy 0, policy_version 56420 (0.0007) [2023-03-07 17:27:20,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12895.5). Total num frames: 57775104. Throughput: 0: 12883.7. Samples: 57752764. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:27:20,070][231894] Avg episode reward: [(0, '188.283')] [2023-03-07 17:27:20,728][232226] Updated weights for policy 0, policy_version 56430 (0.0006) [2023-03-07 17:27:21,530][232226] Updated weights for policy 0, policy_version 56440 (0.0007) [2023-03-07 17:27:22,328][232226] Updated weights for policy 0, policy_version 56450 (0.0006) [2023-03-07 17:27:23,110][232226] Updated weights for policy 0, policy_version 56460 (0.0006) [2023-03-07 17:27:23,900][232226] Updated weights for policy 0, policy_version 56470 (0.0007) [2023-03-07 17:27:24,707][232226] Updated weights for policy 0, policy_version 56480 (0.0007) [2023-03-07 17:27:25,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12902.4, 300 sec: 12895.5). Total num frames: 57839616. Throughput: 0: 12883.7. Samples: 57830514. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:27:25,069][231894] Avg episode reward: [(0, '196.463')] [2023-03-07 17:27:25,503][232226] Updated weights for policy 0, policy_version 56490 (0.0006) [2023-03-07 17:27:26,295][232226] Updated weights for policy 0, policy_version 56500 (0.0007) [2023-03-07 17:27:27,097][232226] Updated weights for policy 0, policy_version 56510 (0.0006) [2023-03-07 17:27:27,887][232226] Updated weights for policy 0, policy_version 56520 (0.0006) [2023-03-07 17:27:28,677][232226] Updated weights for policy 0, policy_version 56530 (0.0006) [2023-03-07 17:27:29,462][232226] Updated weights for policy 0, policy_version 56540 (0.0006) [2023-03-07 17:27:30,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12895.5). Total num frames: 57904128. Throughput: 0: 12881.9. Samples: 57868947. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:27:30,069][231894] Avg episode reward: [(0, '197.051')] [2023-03-07 17:27:30,266][232226] Updated weights for policy 0, policy_version 56550 (0.0006) [2023-03-07 17:27:31,056][232226] Updated weights for policy 0, policy_version 56560 (0.0006) [2023-03-07 17:27:31,848][232226] Updated weights for policy 0, policy_version 56570 (0.0007) [2023-03-07 17:27:32,649][232226] Updated weights for policy 0, policy_version 56580 (0.0006) [2023-03-07 17:27:33,440][232226] Updated weights for policy 0, policy_version 56590 (0.0006) [2023-03-07 17:27:34,253][232226] Updated weights for policy 0, policy_version 56600 (0.0007) [2023-03-07 17:27:35,063][232226] Updated weights for policy 0, policy_version 56610 (0.0006) [2023-03-07 17:27:35,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.4, 300 sec: 12895.5). Total num frames: 57968640. Throughput: 0: 12879.4. Samples: 57946364. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:27:35,070][231894] Avg episode reward: [(0, '192.768')] [2023-03-07 17:27:35,864][232226] Updated weights for policy 0, policy_version 56620 (0.0006) [2023-03-07 17:27:36,664][232226] Updated weights for policy 0, policy_version 56630 (0.0006) [2023-03-07 17:27:37,468][232226] Updated weights for policy 0, policy_version 56640 (0.0007) [2023-03-07 17:27:38,257][232226] Updated weights for policy 0, policy_version 56650 (0.0006) [2023-03-07 17:27:39,058][232226] Updated weights for policy 0, policy_version 56660 (0.0005) [2023-03-07 17:27:39,858][232226] Updated weights for policy 0, policy_version 56670 (0.0006) [2023-03-07 17:27:40,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12868.3, 300 sec: 12892.0). Total num frames: 58032128. Throughput: 0: 12876.5. Samples: 58023138. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:27:40,069][231894] Avg episode reward: [(0, '181.431')] [2023-03-07 17:27:40,659][232226] Updated weights for policy 0, policy_version 56680 (0.0005) [2023-03-07 17:27:41,450][232226] Updated weights for policy 0, policy_version 56690 (0.0006) [2023-03-07 17:27:42,226][232226] Updated weights for policy 0, policy_version 56700 (0.0006) [2023-03-07 17:27:43,025][232226] Updated weights for policy 0, policy_version 56710 (0.0006) [2023-03-07 17:27:43,832][232226] Updated weights for policy 0, policy_version 56720 (0.0006) [2023-03-07 17:27:44,605][232226] Updated weights for policy 0, policy_version 56730 (0.0007) [2023-03-07 17:27:45,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12868.2, 300 sec: 12892.0). Total num frames: 58096640. Throughput: 0: 12873.1. Samples: 58061987. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:27:45,070][231894] Avg episode reward: [(0, '189.136')] [2023-03-07 17:27:45,414][232226] Updated weights for policy 0, policy_version 56740 (0.0006) [2023-03-07 17:27:46,195][232226] Updated weights for policy 0, policy_version 56750 (0.0006) [2023-03-07 17:27:46,981][232226] Updated weights for policy 0, policy_version 56760 (0.0007) [2023-03-07 17:27:47,786][232226] Updated weights for policy 0, policy_version 56770 (0.0007) [2023-03-07 17:27:48,603][232226] Updated weights for policy 0, policy_version 56780 (0.0007) [2023-03-07 17:27:49,402][232226] Updated weights for policy 0, policy_version 56790 (0.0007) [2023-03-07 17:27:50,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12892.0). Total num frames: 58161152. Throughput: 0: 12869.2. Samples: 58139232. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:27:50,069][231894] Avg episode reward: [(0, '195.012')] [2023-03-07 17:27:50,185][232226] Updated weights for policy 0, policy_version 56800 (0.0007) [2023-03-07 17:27:50,966][232226] Updated weights for policy 0, policy_version 56810 (0.0006) [2023-03-07 17:27:51,779][232226] Updated weights for policy 0, policy_version 56820 (0.0006) [2023-03-07 17:27:52,565][232226] Updated weights for policy 0, policy_version 56830 (0.0007) [2023-03-07 17:27:53,354][232226] Updated weights for policy 0, policy_version 56840 (0.0006) [2023-03-07 17:27:54,154][232226] Updated weights for policy 0, policy_version 56850 (0.0007) [2023-03-07 17:27:54,950][232226] Updated weights for policy 0, policy_version 56860 (0.0006) [2023-03-07 17:27:55,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12868.3, 300 sec: 12892.0). Total num frames: 58225664. Throughput: 0: 12875.3. Samples: 58216422. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:27:55,069][231894] Avg episode reward: [(0, '193.134')] [2023-03-07 17:27:55,744][232226] Updated weights for policy 0, policy_version 56870 (0.0006) [2023-03-07 17:27:56,538][232226] Updated weights for policy 0, policy_version 56880 (0.0007) [2023-03-07 17:27:57,345][232226] Updated weights for policy 0, policy_version 56890 (0.0007) [2023-03-07 17:27:58,126][232226] Updated weights for policy 0, policy_version 56900 (0.0006) [2023-03-07 17:27:58,909][232226] Updated weights for policy 0, policy_version 56910 (0.0007) [2023-03-07 17:27:59,709][232226] Updated weights for policy 0, policy_version 56920 (0.0007) [2023-03-07 17:28:00,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12892.0). Total num frames: 58290176. Throughput: 0: 12874.6. Samples: 58255034. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:28:00,069][231894] Avg episode reward: [(0, '195.977')] [2023-03-07 17:28:00,498][232226] Updated weights for policy 0, policy_version 56930 (0.0006) [2023-03-07 17:28:01,301][232226] Updated weights for policy 0, policy_version 56940 (0.0006) [2023-03-07 17:28:02,102][232226] Updated weights for policy 0, policy_version 56950 (0.0007) [2023-03-07 17:28:02,887][232226] Updated weights for policy 0, policy_version 56960 (0.0007) [2023-03-07 17:28:03,688][232226] Updated weights for policy 0, policy_version 56970 (0.0007) [2023-03-07 17:28:04,495][232226] Updated weights for policy 0, policy_version 56980 (0.0006) [2023-03-07 17:28:05,069][231894] Fps is (10 sec: 12902.2, 60 sec: 12868.3, 300 sec: 12892.0). Total num frames: 58354688. Throughput: 0: 12879.7. Samples: 58332351. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:28:05,070][231894] Avg episode reward: [(0, '189.842')] [2023-03-07 17:28:05,297][232226] Updated weights for policy 0, policy_version 56990 (0.0007) [2023-03-07 17:28:06,076][232226] Updated weights for policy 0, policy_version 57000 (0.0006) [2023-03-07 17:28:06,873][232226] Updated weights for policy 0, policy_version 57010 (0.0007) [2023-03-07 17:28:07,652][232226] Updated weights for policy 0, policy_version 57020 (0.0006) [2023-03-07 17:28:08,448][232226] Updated weights for policy 0, policy_version 57030 (0.0006) [2023-03-07 17:28:09,237][232226] Updated weights for policy 0, policy_version 57040 (0.0007) [2023-03-07 17:28:10,041][232226] Updated weights for policy 0, policy_version 57050 (0.0006) [2023-03-07 17:28:10,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12892.0). Total num frames: 58419200. Throughput: 0: 12872.8. Samples: 58409791. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:28:10,069][231894] Avg episode reward: [(0, '194.185')] [2023-03-07 17:28:10,837][232226] Updated weights for policy 0, policy_version 57060 (0.0007) [2023-03-07 17:28:11,638][232226] Updated weights for policy 0, policy_version 57070 (0.0006) [2023-03-07 17:28:12,421][232226] Updated weights for policy 0, policy_version 57080 (0.0006) [2023-03-07 17:28:13,230][232226] Updated weights for policy 0, policy_version 57090 (0.0006) [2023-03-07 17:28:14,021][232226] Updated weights for policy 0, policy_version 57100 (0.0006) [2023-03-07 17:28:14,828][232226] Updated weights for policy 0, policy_version 57110 (0.0007) [2023-03-07 17:28:15,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12868.3, 300 sec: 12892.0). Total num frames: 58483712. Throughput: 0: 12877.9. Samples: 58448453. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:28:15,069][231894] Avg episode reward: [(0, '194.245')] [2023-03-07 17:28:15,619][232226] Updated weights for policy 0, policy_version 57120 (0.0006) [2023-03-07 17:28:16,413][232226] Updated weights for policy 0, policy_version 57130 (0.0007) [2023-03-07 17:28:17,210][232226] Updated weights for policy 0, policy_version 57140 (0.0007) [2023-03-07 17:28:18,038][232226] Updated weights for policy 0, policy_version 57150 (0.0007) [2023-03-07 17:28:18,816][232226] Updated weights for policy 0, policy_version 57160 (0.0006) [2023-03-07 17:28:19,607][232226] Updated weights for policy 0, policy_version 57170 (0.0007) [2023-03-07 17:28:20,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12888.5). Total num frames: 58547200. Throughput: 0: 12865.4. Samples: 58525306. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:28:20,069][231894] Avg episode reward: [(0, '196.639')] [2023-03-07 17:28:20,417][232226] Updated weights for policy 0, policy_version 57180 (0.0007) [2023-03-07 17:28:21,194][232226] Updated weights for policy 0, policy_version 57190 (0.0006) [2023-03-07 17:28:22,001][232226] Updated weights for policy 0, policy_version 57200 (0.0006) [2023-03-07 17:28:22,799][232226] Updated weights for policy 0, policy_version 57210 (0.0006) [2023-03-07 17:28:23,592][232226] Updated weights for policy 0, policy_version 57220 (0.0008) [2023-03-07 17:28:24,367][232226] Updated weights for policy 0, policy_version 57230 (0.0008) [2023-03-07 17:28:25,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12868.3, 300 sec: 12888.5). Total num frames: 58611712. Throughput: 0: 12878.6. Samples: 58602677. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:28:25,069][231894] Avg episode reward: [(0, '195.488')] [2023-03-07 17:28:25,077][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000057239_58612736.pth... [2023-03-07 17:28:25,105][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000054218_55519232.pth [2023-03-07 17:28:25,161][232226] Updated weights for policy 0, policy_version 57240 (0.0006) [2023-03-07 17:28:25,960][232226] Updated weights for policy 0, policy_version 57250 (0.0006) [2023-03-07 17:28:26,736][232226] Updated weights for policy 0, policy_version 57260 (0.0006) [2023-03-07 17:28:27,565][232226] Updated weights for policy 0, policy_version 57270 (0.0006) [2023-03-07 17:28:28,341][232226] Updated weights for policy 0, policy_version 57280 (0.0006) [2023-03-07 17:28:29,139][232226] Updated weights for policy 0, policy_version 57290 (0.0006) [2023-03-07 17:28:29,950][232226] Updated weights for policy 0, policy_version 57300 (0.0006) [2023-03-07 17:28:30,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12888.5). Total num frames: 58676224. Throughput: 0: 12879.3. Samples: 58641552. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:28:30,080][231894] Avg episode reward: [(0, '195.182')] [2023-03-07 17:28:30,741][232226] Updated weights for policy 0, policy_version 57310 (0.0006) [2023-03-07 17:28:31,534][232226] Updated weights for policy 0, policy_version 57320 (0.0006) [2023-03-07 17:28:32,314][232226] Updated weights for policy 0, policy_version 57330 (0.0007) [2023-03-07 17:28:33,096][232226] Updated weights for policy 0, policy_version 57340 (0.0006) [2023-03-07 17:28:33,910][232226] Updated weights for policy 0, policy_version 57350 (0.0007) [2023-03-07 17:28:34,684][232226] Updated weights for policy 0, policy_version 57360 (0.0006) [2023-03-07 17:28:35,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12888.5). Total num frames: 58740736. Throughput: 0: 12878.8. Samples: 58718777. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:28:35,080][231894] Avg episode reward: [(0, '192.709')] [2023-03-07 17:28:35,478][232226] Updated weights for policy 0, policy_version 57370 (0.0006) [2023-03-07 17:28:36,282][232226] Updated weights for policy 0, policy_version 57380 (0.0006) [2023-03-07 17:28:37,079][232226] Updated weights for policy 0, policy_version 57390 (0.0007) [2023-03-07 17:28:37,867][232226] Updated weights for policy 0, policy_version 57400 (0.0006) [2023-03-07 17:28:38,661][232226] Updated weights for policy 0, policy_version 57410 (0.0006) [2023-03-07 17:28:39,458][232226] Updated weights for policy 0, policy_version 57420 (0.0006) [2023-03-07 17:28:40,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12888.5). Total num frames: 58805248. Throughput: 0: 12883.5. Samples: 58796181. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:28:40,080][231894] Avg episode reward: [(0, '190.084')] [2023-03-07 17:28:40,251][232226] Updated weights for policy 0, policy_version 57430 (0.0006) [2023-03-07 17:28:41,042][232226] Updated weights for policy 0, policy_version 57440 (0.0006) [2023-03-07 17:28:41,832][232226] Updated weights for policy 0, policy_version 57450 (0.0006) [2023-03-07 17:28:42,644][232226] Updated weights for policy 0, policy_version 57460 (0.0006) [2023-03-07 17:28:43,438][232226] Updated weights for policy 0, policy_version 57470 (0.0006) [2023-03-07 17:28:44,219][232226] Updated weights for policy 0, policy_version 57480 (0.0006) [2023-03-07 17:28:45,026][232226] Updated weights for policy 0, policy_version 57490 (0.0006) [2023-03-07 17:28:45,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.4, 300 sec: 12888.5). Total num frames: 58869760. Throughput: 0: 12883.2. Samples: 58834778. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:28:45,080][231894] Avg episode reward: [(0, '194.409')] [2023-03-07 17:28:45,838][232226] Updated weights for policy 0, policy_version 57500 (0.0006) [2023-03-07 17:28:46,606][232226] Updated weights for policy 0, policy_version 57510 (0.0006) [2023-03-07 17:28:47,424][232226] Updated weights for policy 0, policy_version 57520 (0.0006) [2023-03-07 17:28:48,226][232226] Updated weights for policy 0, policy_version 57530 (0.0008) [2023-03-07 17:28:48,989][232226] Updated weights for policy 0, policy_version 57540 (0.0007) [2023-03-07 17:28:49,802][232226] Updated weights for policy 0, policy_version 57550 (0.0006) [2023-03-07 17:28:50,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12888.5). Total num frames: 58934272. Throughput: 0: 12880.6. Samples: 58911977. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:28:50,069][231894] Avg episode reward: [(0, '188.069')] [2023-03-07 17:28:50,617][232226] Updated weights for policy 0, policy_version 57560 (0.0006) [2023-03-07 17:28:51,408][232226] Updated weights for policy 0, policy_version 57570 (0.0007) [2023-03-07 17:28:52,182][232226] Updated weights for policy 0, policy_version 57580 (0.0006) [2023-03-07 17:28:52,992][232226] Updated weights for policy 0, policy_version 57590 (0.0007) [2023-03-07 17:28:53,772][232226] Updated weights for policy 0, policy_version 57600 (0.0006) [2023-03-07 17:28:54,567][232226] Updated weights for policy 0, policy_version 57610 (0.0008) [2023-03-07 17:28:55,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12888.5). Total num frames: 58998784. Throughput: 0: 12882.1. Samples: 58989485. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:28:55,069][231894] Avg episode reward: [(0, '195.820')] [2023-03-07 17:28:55,366][232226] Updated weights for policy 0, policy_version 57620 (0.0007) [2023-03-07 17:28:56,152][232226] Updated weights for policy 0, policy_version 57630 (0.0006) [2023-03-07 17:28:56,949][232226] Updated weights for policy 0, policy_version 57640 (0.0007) [2023-03-07 17:28:57,745][232226] Updated weights for policy 0, policy_version 57650 (0.0007) [2023-03-07 17:28:58,550][232226] Updated weights for policy 0, policy_version 57660 (0.0006) [2023-03-07 17:28:59,346][232226] Updated weights for policy 0, policy_version 57670 (0.0006) [2023-03-07 17:29:00,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12888.5). Total num frames: 59063296. Throughput: 0: 12877.2. Samples: 59027930. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:29:00,069][231894] Avg episode reward: [(0, '185.731')] [2023-03-07 17:29:00,138][232226] Updated weights for policy 0, policy_version 57680 (0.0006) [2023-03-07 17:29:00,950][232226] Updated weights for policy 0, policy_version 57690 (0.0006) [2023-03-07 17:29:01,747][232226] Updated weights for policy 0, policy_version 57700 (0.0006) [2023-03-07 17:29:02,534][232226] Updated weights for policy 0, policy_version 57710 (0.0007) [2023-03-07 17:29:03,322][232226] Updated weights for policy 0, policy_version 57720 (0.0006) [2023-03-07 17:29:04,118][232226] Updated weights for policy 0, policy_version 57730 (0.0006) [2023-03-07 17:29:04,908][232226] Updated weights for policy 0, policy_version 57740 (0.0006) [2023-03-07 17:29:05,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12888.5). Total num frames: 59127808. Throughput: 0: 12887.1. Samples: 59105225. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:29:05,069][231894] Avg episode reward: [(0, '191.343')] [2023-03-07 17:29:05,699][232226] Updated weights for policy 0, policy_version 57750 (0.0006) [2023-03-07 17:29:06,486][232226] Updated weights for policy 0, policy_version 57760 (0.0006) [2023-03-07 17:29:07,279][232226] Updated weights for policy 0, policy_version 57770 (0.0006) [2023-03-07 17:29:08,074][232226] Updated weights for policy 0, policy_version 57780 (0.0008) [2023-03-07 17:29:08,868][232226] Updated weights for policy 0, policy_version 57790 (0.0006) [2023-03-07 17:29:09,653][232226] Updated weights for policy 0, policy_version 57800 (0.0006) [2023-03-07 17:29:10,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12888.5). Total num frames: 59192320. Throughput: 0: 12889.6. Samples: 59182706. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:29:10,069][231894] Avg episode reward: [(0, '197.501')] [2023-03-07 17:29:10,466][232226] Updated weights for policy 0, policy_version 57810 (0.0006) [2023-03-07 17:29:11,268][232226] Updated weights for policy 0, policy_version 57820 (0.0007) [2023-03-07 17:29:12,058][232226] Updated weights for policy 0, policy_version 57830 (0.0006) [2023-03-07 17:29:12,859][232226] Updated weights for policy 0, policy_version 57840 (0.0006) [2023-03-07 17:29:13,656][232226] Updated weights for policy 0, policy_version 57850 (0.0006) [2023-03-07 17:29:14,428][232226] Updated weights for policy 0, policy_version 57860 (0.0006) [2023-03-07 17:29:15,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.2, 300 sec: 12885.0). Total num frames: 59255808. Throughput: 0: 12878.5. Samples: 59221083. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:29:15,069][231894] Avg episode reward: [(0, '194.099')] [2023-03-07 17:29:15,235][232226] Updated weights for policy 0, policy_version 57870 (0.0007) [2023-03-07 17:29:16,021][232226] Updated weights for policy 0, policy_version 57880 (0.0006) [2023-03-07 17:29:16,817][232226] Updated weights for policy 0, policy_version 57890 (0.0006) [2023-03-07 17:29:17,613][232226] Updated weights for policy 0, policy_version 57900 (0.0007) [2023-03-07 17:29:18,420][232226] Updated weights for policy 0, policy_version 57910 (0.0006) [2023-03-07 17:29:19,206][232226] Updated weights for policy 0, policy_version 57920 (0.0006) [2023-03-07 17:29:19,999][232226] Updated weights for policy 0, policy_version 57930 (0.0006) [2023-03-07 17:29:20,069][231894] Fps is (10 sec: 12799.8, 60 sec: 12885.3, 300 sec: 12885.0). Total num frames: 59320320. Throughput: 0: 12881.8. Samples: 59298459. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:29:20,069][231894] Avg episode reward: [(0, '197.354')] [2023-03-07 17:29:20,806][232226] Updated weights for policy 0, policy_version 57940 (0.0006) [2023-03-07 17:29:21,608][232226] Updated weights for policy 0, policy_version 57950 (0.0006) [2023-03-07 17:29:22,384][232226] Updated weights for policy 0, policy_version 57960 (0.0006) [2023-03-07 17:29:23,178][232226] Updated weights for policy 0, policy_version 57970 (0.0006) [2023-03-07 17:29:23,980][232226] Updated weights for policy 0, policy_version 57980 (0.0007) [2023-03-07 17:29:24,775][232226] Updated weights for policy 0, policy_version 57990 (0.0006) [2023-03-07 17:29:25,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12885.4, 300 sec: 12885.0). Total num frames: 59384832. Throughput: 0: 12880.5. Samples: 59375802. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:29:25,069][231894] Avg episode reward: [(0, '190.037')] [2023-03-07 17:29:25,589][232226] Updated weights for policy 0, policy_version 58000 (0.0007) [2023-03-07 17:29:26,360][232226] Updated weights for policy 0, policy_version 58010 (0.0006) [2023-03-07 17:29:27,154][232226] Updated weights for policy 0, policy_version 58020 (0.0005) [2023-03-07 17:29:27,967][232226] Updated weights for policy 0, policy_version 58030 (0.0007) [2023-03-07 17:29:28,744][232226] Updated weights for policy 0, policy_version 58040 (0.0007) [2023-03-07 17:29:29,546][232226] Updated weights for policy 0, policy_version 58050 (0.0007) [2023-03-07 17:29:30,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12885.3, 300 sec: 12888.5). Total num frames: 59449344. Throughput: 0: 12883.5. Samples: 59414533. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:29:30,069][231894] Avg episode reward: [(0, '198.253')] [2023-03-07 17:29:30,337][232226] Updated weights for policy 0, policy_version 58060 (0.0006) [2023-03-07 17:29:31,133][232226] Updated weights for policy 0, policy_version 58070 (0.0006) [2023-03-07 17:29:31,938][232226] Updated weights for policy 0, policy_version 58080 (0.0006) [2023-03-07 17:29:32,742][232226] Updated weights for policy 0, policy_version 58090 (0.0006) [2023-03-07 17:29:33,535][232226] Updated weights for policy 0, policy_version 58100 (0.0006) [2023-03-07 17:29:34,329][232226] Updated weights for policy 0, policy_version 58110 (0.0007) [2023-03-07 17:29:35,069][231894] Fps is (10 sec: 12902.2, 60 sec: 12885.3, 300 sec: 12888.5). Total num frames: 59513856. Throughput: 0: 12878.7. Samples: 59491519. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:29:35,069][231894] Avg episode reward: [(0, '195.970')] [2023-03-07 17:29:35,113][232226] Updated weights for policy 0, policy_version 58120 (0.0007) [2023-03-07 17:29:35,905][232226] Updated weights for policy 0, policy_version 58130 (0.0006) [2023-03-07 17:29:36,697][232226] Updated weights for policy 0, policy_version 58140 (0.0007) [2023-03-07 17:29:37,517][232226] Updated weights for policy 0, policy_version 58150 (0.0006) [2023-03-07 17:29:38,297][232226] Updated weights for policy 0, policy_version 58160 (0.0006) [2023-03-07 17:29:39,098][232226] Updated weights for policy 0, policy_version 58170 (0.0007) [2023-03-07 17:29:39,877][232226] Updated weights for policy 0, policy_version 58180 (0.0006) [2023-03-07 17:29:40,069][231894] Fps is (10 sec: 12902.2, 60 sec: 12885.3, 300 sec: 12888.5). Total num frames: 59578368. Throughput: 0: 12876.2. Samples: 59568912. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:29:40,069][231894] Avg episode reward: [(0, '195.371')] [2023-03-07 17:29:40,706][232226] Updated weights for policy 0, policy_version 58190 (0.0008) [2023-03-07 17:29:41,477][232226] Updated weights for policy 0, policy_version 58200 (0.0006) [2023-03-07 17:29:42,274][232226] Updated weights for policy 0, policy_version 58210 (0.0007) [2023-03-07 17:29:43,057][232226] Updated weights for policy 0, policy_version 58220 (0.0006) [2023-03-07 17:29:43,855][232226] Updated weights for policy 0, policy_version 58230 (0.0008) [2023-03-07 17:29:44,637][232226] Updated weights for policy 0, policy_version 58240 (0.0006) [2023-03-07 17:29:45,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12885.0). Total num frames: 59642880. Throughput: 0: 12881.0. Samples: 59607576. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:29:45,069][231894] Avg episode reward: [(0, '186.934')] [2023-03-07 17:29:45,460][232226] Updated weights for policy 0, policy_version 58250 (0.0006) [2023-03-07 17:29:46,237][232226] Updated weights for policy 0, policy_version 58260 (0.0007) [2023-03-07 17:29:47,034][232226] Updated weights for policy 0, policy_version 58270 (0.0005) [2023-03-07 17:29:47,839][232226] Updated weights for policy 0, policy_version 58280 (0.0006) [2023-03-07 17:29:48,653][232226] Updated weights for policy 0, policy_version 58290 (0.0006) [2023-03-07 17:29:49,444][232226] Updated weights for policy 0, policy_version 58300 (0.0007) [2023-03-07 17:29:50,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12868.3, 300 sec: 12881.6). Total num frames: 59706368. Throughput: 0: 12878.4. Samples: 59684752. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:29:50,069][231894] Avg episode reward: [(0, '181.984')] [2023-03-07 17:29:50,230][232226] Updated weights for policy 0, policy_version 58310 (0.0006) [2023-03-07 17:29:51,039][232226] Updated weights for policy 0, policy_version 58320 (0.0006) [2023-03-07 17:29:51,823][232226] Updated weights for policy 0, policy_version 58330 (0.0006) [2023-03-07 17:29:52,613][232226] Updated weights for policy 0, policy_version 58340 (0.0007) [2023-03-07 17:29:53,414][232226] Updated weights for policy 0, policy_version 58350 (0.0007) [2023-03-07 17:29:54,204][232226] Updated weights for policy 0, policy_version 58360 (0.0006) [2023-03-07 17:29:54,988][232226] Updated weights for policy 0, policy_version 58370 (0.0006) [2023-03-07 17:29:55,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12868.3, 300 sec: 12881.6). Total num frames: 59770880. Throughput: 0: 12874.2. Samples: 59762047. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:29:55,069][231894] Avg episode reward: [(0, '194.165')] [2023-03-07 17:29:55,794][232226] Updated weights for policy 0, policy_version 58380 (0.0006) [2023-03-07 17:29:56,580][232226] Updated weights for policy 0, policy_version 58390 (0.0006) [2023-03-07 17:29:57,384][232226] Updated weights for policy 0, policy_version 58400 (0.0007) [2023-03-07 17:29:58,173][232226] Updated weights for policy 0, policy_version 58410 (0.0006) [2023-03-07 17:29:58,958][232226] Updated weights for policy 0, policy_version 58420 (0.0007) [2023-03-07 17:29:59,761][232226] Updated weights for policy 0, policy_version 58430 (0.0006) [2023-03-07 17:30:00,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12881.6). Total num frames: 59835392. Throughput: 0: 12881.5. Samples: 59800751. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:30:00,069][231894] Avg episode reward: [(0, '196.634')] [2023-03-07 17:30:00,555][232226] Updated weights for policy 0, policy_version 58440 (0.0006) [2023-03-07 17:30:01,350][232226] Updated weights for policy 0, policy_version 58450 (0.0006) [2023-03-07 17:30:02,147][232226] Updated weights for policy 0, policy_version 58460 (0.0006) [2023-03-07 17:30:02,934][232226] Updated weights for policy 0, policy_version 58470 (0.0006) [2023-03-07 17:30:03,740][232226] Updated weights for policy 0, policy_version 58480 (0.0006) [2023-03-07 17:30:04,538][232226] Updated weights for policy 0, policy_version 58490 (0.0006) [2023-03-07 17:30:05,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12878.1). Total num frames: 59899904. Throughput: 0: 12882.9. Samples: 59878191. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:30:05,080][231894] Avg episode reward: [(0, '194.825')] [2023-03-07 17:30:05,312][232226] Updated weights for policy 0, policy_version 58500 (0.0006) [2023-03-07 17:30:06,100][232226] Updated weights for policy 0, policy_version 58510 (0.0007) [2023-03-07 17:30:06,891][232226] Updated weights for policy 0, policy_version 58520 (0.0006) [2023-03-07 17:30:07,699][232226] Updated weights for policy 0, policy_version 58530 (0.0006) [2023-03-07 17:30:08,470][232226] Updated weights for policy 0, policy_version 58540 (0.0006) [2023-03-07 17:30:09,277][232226] Updated weights for policy 0, policy_version 58550 (0.0006) [2023-03-07 17:30:10,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.2, 300 sec: 12881.6). Total num frames: 59964416. Throughput: 0: 12885.7. Samples: 59955662. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:30:10,080][231894] Avg episode reward: [(0, '192.871')] [2023-03-07 17:30:10,083][232226] Updated weights for policy 0, policy_version 58560 (0.0006) [2023-03-07 17:30:10,857][232226] Updated weights for policy 0, policy_version 58570 (0.0006) [2023-03-07 17:30:11,676][232226] Updated weights for policy 0, policy_version 58580 (0.0006) [2023-03-07 17:30:12,486][232226] Updated weights for policy 0, policy_version 58590 (0.0006) [2023-03-07 17:30:13,273][232226] Updated weights for policy 0, policy_version 58600 (0.0007) [2023-03-07 17:30:14,054][232226] Updated weights for policy 0, policy_version 58610 (0.0006) [2023-03-07 17:30:14,843][232226] Updated weights for policy 0, policy_version 58620 (0.0006) [2023-03-07 17:30:15,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12878.1). Total num frames: 60028928. Throughput: 0: 12878.4. Samples: 59994062. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:30:15,080][231894] Avg episode reward: [(0, '198.136')] [2023-03-07 17:30:15,651][232226] Updated weights for policy 0, policy_version 58630 (0.0006) [2023-03-07 17:30:16,446][232226] Updated weights for policy 0, policy_version 58640 (0.0006) [2023-03-07 17:30:17,250][232226] Updated weights for policy 0, policy_version 58650 (0.0006) [2023-03-07 17:30:18,051][232226] Updated weights for policy 0, policy_version 58660 (0.0006) [2023-03-07 17:30:18,841][232226] Updated weights for policy 0, policy_version 58670 (0.0007) [2023-03-07 17:30:19,646][232226] Updated weights for policy 0, policy_version 58680 (0.0007) [2023-03-07 17:30:20,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12878.1). Total num frames: 60093440. Throughput: 0: 12883.8. Samples: 60071289. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:30:20,080][231894] Avg episode reward: [(0, '191.057')] [2023-03-07 17:30:20,433][232226] Updated weights for policy 0, policy_version 58690 (0.0006) [2023-03-07 17:30:21,225][232226] Updated weights for policy 0, policy_version 58700 (0.0006) [2023-03-07 17:30:22,015][232226] Updated weights for policy 0, policy_version 58710 (0.0006) [2023-03-07 17:30:22,799][232226] Updated weights for policy 0, policy_version 58720 (0.0006) [2023-03-07 17:30:23,605][232226] Updated weights for policy 0, policy_version 58730 (0.0007) [2023-03-07 17:30:24,405][232226] Updated weights for policy 0, policy_version 58740 (0.0007) [2023-03-07 17:30:25,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12878.1). Total num frames: 60157952. Throughput: 0: 12882.2. Samples: 60148611. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:30:25,080][231894] Avg episode reward: [(0, '184.833')] [2023-03-07 17:30:25,084][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000058748_60157952.pth... [2023-03-07 17:30:25,113][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000055729_57066496.pth [2023-03-07 17:30:25,203][232226] Updated weights for policy 0, policy_version 58750 (0.0006) [2023-03-07 17:30:26,010][232226] Updated weights for policy 0, policy_version 58760 (0.0007) [2023-03-07 17:30:26,793][232226] Updated weights for policy 0, policy_version 58770 (0.0007) [2023-03-07 17:30:27,595][232226] Updated weights for policy 0, policy_version 58780 (0.0006) [2023-03-07 17:30:28,380][232226] Updated weights for policy 0, policy_version 58790 (0.0006) [2023-03-07 17:30:29,171][232226] Updated weights for policy 0, policy_version 58800 (0.0007) [2023-03-07 17:30:29,963][232226] Updated weights for policy 0, policy_version 58810 (0.0006) [2023-03-07 17:30:30,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 60222464. Throughput: 0: 12878.0. Samples: 60187088. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:30:30,069][231894] Avg episode reward: [(0, '197.978')] [2023-03-07 17:30:30,759][232226] Updated weights for policy 0, policy_version 58820 (0.0006) [2023-03-07 17:30:31,573][232226] Updated weights for policy 0, policy_version 58830 (0.0006) [2023-03-07 17:30:32,362][232226] Updated weights for policy 0, policy_version 58840 (0.0007) [2023-03-07 17:30:33,166][232226] Updated weights for policy 0, policy_version 58850 (0.0007) [2023-03-07 17:30:33,952][232226] Updated weights for policy 0, policy_version 58860 (0.0006) [2023-03-07 17:30:34,754][232226] Updated weights for policy 0, policy_version 58870 (0.0006) [2023-03-07 17:30:35,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12868.3, 300 sec: 12878.1). Total num frames: 60285952. Throughput: 0: 12879.3. Samples: 60264322. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:30:35,069][231894] Avg episode reward: [(0, '192.102')] [2023-03-07 17:30:35,546][232226] Updated weights for policy 0, policy_version 58880 (0.0006) [2023-03-07 17:30:36,344][232226] Updated weights for policy 0, policy_version 58890 (0.0006) [2023-03-07 17:30:37,137][232226] Updated weights for policy 0, policy_version 58900 (0.0006) [2023-03-07 17:30:37,922][232226] Updated weights for policy 0, policy_version 58910 (0.0007) [2023-03-07 17:30:38,729][232226] Updated weights for policy 0, policy_version 58920 (0.0006) [2023-03-07 17:30:39,527][232226] Updated weights for policy 0, policy_version 58930 (0.0006) [2023-03-07 17:30:40,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12868.3, 300 sec: 12878.1). Total num frames: 60350464. Throughput: 0: 12878.2. Samples: 60341564. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:30:40,069][231894] Avg episode reward: [(0, '194.495')] [2023-03-07 17:30:40,322][232226] Updated weights for policy 0, policy_version 58940 (0.0006) [2023-03-07 17:30:41,123][232226] Updated weights for policy 0, policy_version 58950 (0.0006) [2023-03-07 17:30:41,914][232226] Updated weights for policy 0, policy_version 58960 (0.0006) [2023-03-07 17:30:42,705][232226] Updated weights for policy 0, policy_version 58970 (0.0006) [2023-03-07 17:30:43,521][232226] Updated weights for policy 0, policy_version 58980 (0.0007) [2023-03-07 17:30:44,299][232226] Updated weights for policy 0, policy_version 58990 (0.0006) [2023-03-07 17:30:45,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12881.6). Total num frames: 60414976. Throughput: 0: 12873.9. Samples: 60380078. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:30:45,069][231894] Avg episode reward: [(0, '193.712')] [2023-03-07 17:30:45,098][232226] Updated weights for policy 0, policy_version 59000 (0.0007) [2023-03-07 17:30:45,896][232226] Updated weights for policy 0, policy_version 59010 (0.0007) [2023-03-07 17:30:46,662][232226] Updated weights for policy 0, policy_version 59020 (0.0007) [2023-03-07 17:30:47,486][232226] Updated weights for policy 0, policy_version 59030 (0.0007) [2023-03-07 17:30:48,287][232226] Updated weights for policy 0, policy_version 59040 (0.0006) [2023-03-07 17:30:49,065][232226] Updated weights for policy 0, policy_version 59050 (0.0006) [2023-03-07 17:30:49,871][232226] Updated weights for policy 0, policy_version 59060 (0.0006) [2023-03-07 17:30:50,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 60479488. Throughput: 0: 12868.9. Samples: 60457294. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:30:50,069][231894] Avg episode reward: [(0, '190.587')] [2023-03-07 17:30:50,671][232226] Updated weights for policy 0, policy_version 59070 (0.0007) [2023-03-07 17:30:51,475][232226] Updated weights for policy 0, policy_version 59080 (0.0007) [2023-03-07 17:30:52,262][232226] Updated weights for policy 0, policy_version 59090 (0.0006) [2023-03-07 17:30:53,062][232226] Updated weights for policy 0, policy_version 59100 (0.0006) [2023-03-07 17:30:53,837][232226] Updated weights for policy 0, policy_version 59110 (0.0008) [2023-03-07 17:30:54,625][232226] Updated weights for policy 0, policy_version 59120 (0.0006) [2023-03-07 17:30:55,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 60544000. Throughput: 0: 12868.8. Samples: 60534756. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:30:55,069][231894] Avg episode reward: [(0, '195.461')] [2023-03-07 17:30:55,427][232226] Updated weights for policy 0, policy_version 59130 (0.0006) [2023-03-07 17:30:56,226][232226] Updated weights for policy 0, policy_version 59140 (0.0006) [2023-03-07 17:30:57,038][232226] Updated weights for policy 0, policy_version 59150 (0.0008) [2023-03-07 17:30:57,832][232226] Updated weights for policy 0, policy_version 59160 (0.0006) [2023-03-07 17:30:58,620][232226] Updated weights for policy 0, policy_version 59170 (0.0007) [2023-03-07 17:30:59,425][232226] Updated weights for policy 0, policy_version 59180 (0.0006) [2023-03-07 17:31:00,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 60608512. Throughput: 0: 12870.3. Samples: 60573223. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:31:00,069][231894] Avg episode reward: [(0, '196.727')] [2023-03-07 17:31:00,228][232226] Updated weights for policy 0, policy_version 59190 (0.0006) [2023-03-07 17:31:01,002][232226] Updated weights for policy 0, policy_version 59200 (0.0007) [2023-03-07 17:31:01,790][232226] Updated weights for policy 0, policy_version 59210 (0.0006) [2023-03-07 17:31:02,586][232226] Updated weights for policy 0, policy_version 59220 (0.0005) [2023-03-07 17:31:03,381][232226] Updated weights for policy 0, policy_version 59230 (0.0006) [2023-03-07 17:31:04,175][232226] Updated weights for policy 0, policy_version 59240 (0.0006) [2023-03-07 17:31:04,955][232226] Updated weights for policy 0, policy_version 59250 (0.0006) [2023-03-07 17:31:05,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 60673024. Throughput: 0: 12871.8. Samples: 60650520. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:31:05,069][231894] Avg episode reward: [(0, '186.989')] [2023-03-07 17:31:05,733][232226] Updated weights for policy 0, policy_version 59260 (0.0006) [2023-03-07 17:31:06,533][232226] Updated weights for policy 0, policy_version 59270 (0.0006) [2023-03-07 17:31:07,325][232226] Updated weights for policy 0, policy_version 59280 (0.0007) [2023-03-07 17:31:08,120][232226] Updated weights for policy 0, policy_version 59290 (0.0007) [2023-03-07 17:31:08,915][232226] Updated weights for policy 0, policy_version 59300 (0.0007) [2023-03-07 17:31:09,698][232226] Updated weights for policy 0, policy_version 59310 (0.0007) [2023-03-07 17:31:10,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 60737536. Throughput: 0: 12885.1. Samples: 60728439. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:31:10,070][231894] Avg episode reward: [(0, '188.219')] [2023-03-07 17:31:10,486][232226] Updated weights for policy 0, policy_version 59320 (0.0006) [2023-03-07 17:31:11,266][232226] Updated weights for policy 0, policy_version 59330 (0.0006) [2023-03-07 17:31:12,080][232226] Updated weights for policy 0, policy_version 59340 (0.0007) [2023-03-07 17:31:12,875][232226] Updated weights for policy 0, policy_version 59350 (0.0006) [2023-03-07 17:31:13,668][232226] Updated weights for policy 0, policy_version 59360 (0.0006) [2023-03-07 17:31:14,465][232226] Updated weights for policy 0, policy_version 59370 (0.0007) [2023-03-07 17:31:15,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12885.0). Total num frames: 60802048. Throughput: 0: 12893.0. Samples: 60767271. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:31:15,069][231894] Avg episode reward: [(0, '195.255')] [2023-03-07 17:31:15,254][232226] Updated weights for policy 0, policy_version 59380 (0.0006) [2023-03-07 17:31:16,061][232226] Updated weights for policy 0, policy_version 59390 (0.0006) [2023-03-07 17:31:16,844][232226] Updated weights for policy 0, policy_version 59400 (0.0006) [2023-03-07 17:31:17,654][232226] Updated weights for policy 0, policy_version 59410 (0.0006) [2023-03-07 17:31:18,425][232226] Updated weights for policy 0, policy_version 59420 (0.0006) [2023-03-07 17:31:19,228][232226] Updated weights for policy 0, policy_version 59430 (0.0006) [2023-03-07 17:31:20,022][232226] Updated weights for policy 0, policy_version 59440 (0.0005) [2023-03-07 17:31:20,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12885.0). Total num frames: 60866560. Throughput: 0: 12891.5. Samples: 60844441. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:31:20,069][231894] Avg episode reward: [(0, '195.676')] [2023-03-07 17:31:20,817][232226] Updated weights for policy 0, policy_version 59450 (0.0006) [2023-03-07 17:31:21,614][232226] Updated weights for policy 0, policy_version 59460 (0.0006) [2023-03-07 17:31:22,424][232226] Updated weights for policy 0, policy_version 59470 (0.0006) [2023-03-07 17:31:23,211][232226] Updated weights for policy 0, policy_version 59480 (0.0006) [2023-03-07 17:31:23,990][232226] Updated weights for policy 0, policy_version 59490 (0.0006) [2023-03-07 17:31:24,803][232226] Updated weights for policy 0, policy_version 59500 (0.0007) [2023-03-07 17:31:25,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.4, 300 sec: 12885.0). Total num frames: 60931072. Throughput: 0: 12892.9. Samples: 60921744. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:31:25,069][231894] Avg episode reward: [(0, '189.967')] [2023-03-07 17:31:25,594][232226] Updated weights for policy 0, policy_version 59510 (0.0006) [2023-03-07 17:31:26,394][232226] Updated weights for policy 0, policy_version 59520 (0.0006) [2023-03-07 17:31:27,181][232226] Updated weights for policy 0, policy_version 59530 (0.0006) [2023-03-07 17:31:27,980][232226] Updated weights for policy 0, policy_version 59540 (0.0006) [2023-03-07 17:31:28,771][232226] Updated weights for policy 0, policy_version 59550 (0.0006) [2023-03-07 17:31:29,578][232226] Updated weights for policy 0, policy_version 59560 (0.0007) [2023-03-07 17:31:30,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 60995584. Throughput: 0: 12893.2. Samples: 60960272. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:31:30,069][231894] Avg episode reward: [(0, '191.174')] [2023-03-07 17:31:30,370][232226] Updated weights for policy 0, policy_version 59570 (0.0007) [2023-03-07 17:31:31,150][232226] Updated weights for policy 0, policy_version 59580 (0.0006) [2023-03-07 17:31:31,942][232226] Updated weights for policy 0, policy_version 59590 (0.0005) [2023-03-07 17:31:32,746][232226] Updated weights for policy 0, policy_version 59600 (0.0006) [2023-03-07 17:31:33,523][232226] Updated weights for policy 0, policy_version 59610 (0.0006) [2023-03-07 17:31:34,329][232226] Updated weights for policy 0, policy_version 59620 (0.0007) [2023-03-07 17:31:35,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12881.6). Total num frames: 61060096. Throughput: 0: 12899.2. Samples: 61037756. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:31:35,069][231894] Avg episode reward: [(0, '198.303')] [2023-03-07 17:31:35,127][232226] Updated weights for policy 0, policy_version 59630 (0.0006) [2023-03-07 17:31:35,916][232226] Updated weights for policy 0, policy_version 59640 (0.0008) [2023-03-07 17:31:36,710][232226] Updated weights for policy 0, policy_version 59650 (0.0006) [2023-03-07 17:31:37,508][232226] Updated weights for policy 0, policy_version 59660 (0.0007) [2023-03-07 17:31:38,293][232226] Updated weights for policy 0, policy_version 59670 (0.0006) [2023-03-07 17:31:39,098][232226] Updated weights for policy 0, policy_version 59680 (0.0006) [2023-03-07 17:31:39,909][232226] Updated weights for policy 0, policy_version 59690 (0.0006) [2023-03-07 17:31:40,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12885.3, 300 sec: 12878.1). Total num frames: 61123584. Throughput: 0: 12893.6. Samples: 61114969. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:31:40,069][231894] Avg episode reward: [(0, '191.538')] [2023-03-07 17:31:40,714][232226] Updated weights for policy 0, policy_version 59700 (0.0006) [2023-03-07 17:31:41,533][232226] Updated weights for policy 0, policy_version 59710 (0.0006) [2023-03-07 17:31:42,307][232226] Updated weights for policy 0, policy_version 59720 (0.0007) [2023-03-07 17:31:43,086][232226] Updated weights for policy 0, policy_version 59730 (0.0006) [2023-03-07 17:31:43,890][232226] Updated weights for policy 0, policy_version 59740 (0.0006) [2023-03-07 17:31:44,673][232226] Updated weights for policy 0, policy_version 59750 (0.0006) [2023-03-07 17:31:45,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12885.3, 300 sec: 12878.1). Total num frames: 61188096. Throughput: 0: 12891.1. Samples: 61153323. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:31:45,070][231894] Avg episode reward: [(0, '202.265')] [2023-03-07 17:31:45,474][232226] Updated weights for policy 0, policy_version 59760 (0.0008) [2023-03-07 17:31:46,263][232226] Updated weights for policy 0, policy_version 59770 (0.0006) [2023-03-07 17:31:47,066][232226] Updated weights for policy 0, policy_version 59780 (0.0006) [2023-03-07 17:31:47,859][232226] Updated weights for policy 0, policy_version 59790 (0.0006) [2023-03-07 17:31:48,645][232226] Updated weights for policy 0, policy_version 59800 (0.0006) [2023-03-07 17:31:49,434][232226] Updated weights for policy 0, policy_version 59810 (0.0006) [2023-03-07 17:31:50,069][231894] Fps is (10 sec: 13004.7, 60 sec: 12902.4, 300 sec: 12881.6). Total num frames: 61253632. Throughput: 0: 12895.2. Samples: 61230806. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:31:50,080][231894] Avg episode reward: [(0, '194.937')] [2023-03-07 17:31:50,229][232226] Updated weights for policy 0, policy_version 59820 (0.0006) [2023-03-07 17:31:51,010][232226] Updated weights for policy 0, policy_version 59830 (0.0006) [2023-03-07 17:31:51,803][232226] Updated weights for policy 0, policy_version 59840 (0.0007) [2023-03-07 17:31:52,594][232226] Updated weights for policy 0, policy_version 59850 (0.0006) [2023-03-07 17:31:53,387][232226] Updated weights for policy 0, policy_version 59860 (0.0006) [2023-03-07 17:31:54,193][232226] Updated weights for policy 0, policy_version 59870 (0.0006) [2023-03-07 17:31:54,966][232226] Updated weights for policy 0, policy_version 59880 (0.0006) [2023-03-07 17:31:55,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12878.1). Total num frames: 61317120. Throughput: 0: 12889.8. Samples: 61308477. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:31:55,080][231894] Avg episode reward: [(0, '198.632')] [2023-03-07 17:31:55,761][232226] Updated weights for policy 0, policy_version 59890 (0.0007) [2023-03-07 17:31:56,573][232226] Updated weights for policy 0, policy_version 59900 (0.0006) [2023-03-07 17:31:57,377][232226] Updated weights for policy 0, policy_version 59910 (0.0007) [2023-03-07 17:31:58,153][232226] Updated weights for policy 0, policy_version 59920 (0.0007) [2023-03-07 17:31:58,943][232226] Updated weights for policy 0, policy_version 59930 (0.0006) [2023-03-07 17:31:59,741][232226] Updated weights for policy 0, policy_version 59940 (0.0007) [2023-03-07 17:32:00,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12881.6). Total num frames: 61382656. Throughput: 0: 12883.6. Samples: 61347034. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:32:00,080][231894] Avg episode reward: [(0, '190.906')] [2023-03-07 17:32:00,526][232226] Updated weights for policy 0, policy_version 59950 (0.0006) [2023-03-07 17:32:01,319][232226] Updated weights for policy 0, policy_version 59960 (0.0006) [2023-03-07 17:32:02,117][232226] Updated weights for policy 0, policy_version 59970 (0.0007) [2023-03-07 17:32:02,905][232226] Updated weights for policy 0, policy_version 59980 (0.0007) [2023-03-07 17:32:03,705][232226] Updated weights for policy 0, policy_version 59990 (0.0006) [2023-03-07 17:32:04,502][232226] Updated weights for policy 0, policy_version 60000 (0.0006) [2023-03-07 17:32:05,069][231894] Fps is (10 sec: 13004.7, 60 sec: 12902.4, 300 sec: 12885.0). Total num frames: 61447168. Throughput: 0: 12895.2. Samples: 61424727. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:32:05,080][231894] Avg episode reward: [(0, '191.299')] [2023-03-07 17:32:05,281][232226] Updated weights for policy 0, policy_version 60010 (0.0006) [2023-03-07 17:32:06,075][232226] Updated weights for policy 0, policy_version 60020 (0.0006) [2023-03-07 17:32:06,858][232226] Updated weights for policy 0, policy_version 60030 (0.0007) [2023-03-07 17:32:07,661][232226] Updated weights for policy 0, policy_version 60040 (0.0006) [2023-03-07 17:32:08,450][232226] Updated weights for policy 0, policy_version 60050 (0.0006) [2023-03-07 17:32:09,248][232226] Updated weights for policy 0, policy_version 60060 (0.0006) [2023-03-07 17:32:10,037][232226] Updated weights for policy 0, policy_version 60070 (0.0006) [2023-03-07 17:32:10,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12881.6). Total num frames: 61511680. Throughput: 0: 12899.1. Samples: 61502203. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:32:10,080][231894] Avg episode reward: [(0, '193.516')] [2023-03-07 17:32:10,839][232226] Updated weights for policy 0, policy_version 60080 (0.0006) [2023-03-07 17:32:11,621][232226] Updated weights for policy 0, policy_version 60090 (0.0007) [2023-03-07 17:32:12,407][232226] Updated weights for policy 0, policy_version 60100 (0.0006) [2023-03-07 17:32:13,205][232226] Updated weights for policy 0, policy_version 60110 (0.0006) [2023-03-07 17:32:14,021][232226] Updated weights for policy 0, policy_version 60120 (0.0006) [2023-03-07 17:32:14,815][232226] Updated weights for policy 0, policy_version 60130 (0.0006) [2023-03-07 17:32:15,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12885.0). Total num frames: 61576192. Throughput: 0: 12907.2. Samples: 61541096. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:32:15,080][231894] Avg episode reward: [(0, '188.582')] [2023-03-07 17:32:15,621][232226] Updated weights for policy 0, policy_version 60140 (0.0006) [2023-03-07 17:32:16,403][232226] Updated weights for policy 0, policy_version 60150 (0.0006) [2023-03-07 17:32:17,226][232226] Updated weights for policy 0, policy_version 60160 (0.0005) [2023-03-07 17:32:18,005][232226] Updated weights for policy 0, policy_version 60170 (0.0006) [2023-03-07 17:32:18,786][232226] Updated weights for policy 0, policy_version 60180 (0.0006) [2023-03-07 17:32:19,599][232226] Updated weights for policy 0, policy_version 60190 (0.0006) [2023-03-07 17:32:20,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12902.4, 300 sec: 12885.0). Total num frames: 61640704. Throughput: 0: 12897.0. Samples: 61618119. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:32:20,069][231894] Avg episode reward: [(0, '197.631')] [2023-03-07 17:32:20,389][232226] Updated weights for policy 0, policy_version 60200 (0.0007) [2023-03-07 17:32:21,190][232226] Updated weights for policy 0, policy_version 60210 (0.0007) [2023-03-07 17:32:22,000][232226] Updated weights for policy 0, policy_version 60220 (0.0007) [2023-03-07 17:32:22,806][232226] Updated weights for policy 0, policy_version 60230 (0.0008) [2023-03-07 17:32:23,580][232226] Updated weights for policy 0, policy_version 60240 (0.0006) [2023-03-07 17:32:24,378][232226] Updated weights for policy 0, policy_version 60250 (0.0006) [2023-03-07 17:32:25,069][231894] Fps is (10 sec: 12799.8, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 61704192. Throughput: 0: 12893.7. Samples: 61695188. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:32:25,070][231894] Avg episode reward: [(0, '193.245')] [2023-03-07 17:32:25,081][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000060259_61705216.pth... [2023-03-07 17:32:25,110][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000057239_58612736.pth [2023-03-07 17:32:25,170][232226] Updated weights for policy 0, policy_version 60260 (0.0006) [2023-03-07 17:32:25,953][232226] Updated weights for policy 0, policy_version 60270 (0.0006) [2023-03-07 17:32:26,752][232226] Updated weights for policy 0, policy_version 60280 (0.0007) [2023-03-07 17:32:27,549][232226] Updated weights for policy 0, policy_version 60290 (0.0006) [2023-03-07 17:32:28,346][232226] Updated weights for policy 0, policy_version 60300 (0.0006) [2023-03-07 17:32:29,139][232226] Updated weights for policy 0, policy_version 60310 (0.0007) [2023-03-07 17:32:29,940][232226] Updated weights for policy 0, policy_version 60320 (0.0006) [2023-03-07 17:32:30,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 61768704. Throughput: 0: 12902.7. Samples: 61733942. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:32:30,069][231894] Avg episode reward: [(0, '190.382')] [2023-03-07 17:32:30,731][232226] Updated weights for policy 0, policy_version 60330 (0.0006) [2023-03-07 17:32:31,517][232226] Updated weights for policy 0, policy_version 60340 (0.0006) [2023-03-07 17:32:32,317][232226] Updated weights for policy 0, policy_version 60350 (0.0006) [2023-03-07 17:32:33,138][232226] Updated weights for policy 0, policy_version 60360 (0.0007) [2023-03-07 17:32:33,914][232226] Updated weights for policy 0, policy_version 60370 (0.0007) [2023-03-07 17:32:34,721][232226] Updated weights for policy 0, policy_version 60380 (0.0006) [2023-03-07 17:32:35,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12885.3, 300 sec: 12885.0). Total num frames: 61833216. Throughput: 0: 12892.6. Samples: 61810974. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:32:35,069][231894] Avg episode reward: [(0, '183.716')] [2023-03-07 17:32:35,521][232226] Updated weights for policy 0, policy_version 60390 (0.0007) [2023-03-07 17:32:36,314][232226] Updated weights for policy 0, policy_version 60400 (0.0006) [2023-03-07 17:32:37,098][232226] Updated weights for policy 0, policy_version 60410 (0.0007) [2023-03-07 17:32:37,891][232226] Updated weights for policy 0, policy_version 60420 (0.0006) [2023-03-07 17:32:38,688][232226] Updated weights for policy 0, policy_version 60430 (0.0006) [2023-03-07 17:32:39,490][232226] Updated weights for policy 0, policy_version 60440 (0.0006) [2023-03-07 17:32:40,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12885.0). Total num frames: 61897728. Throughput: 0: 12888.2. Samples: 61888445. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:32:40,069][231894] Avg episode reward: [(0, '190.931')] [2023-03-07 17:32:40,277][232226] Updated weights for policy 0, policy_version 60450 (0.0006) [2023-03-07 17:32:41,063][232226] Updated weights for policy 0, policy_version 60460 (0.0007) [2023-03-07 17:32:41,865][232226] Updated weights for policy 0, policy_version 60470 (0.0007) [2023-03-07 17:32:42,650][232226] Updated weights for policy 0, policy_version 60480 (0.0007) [2023-03-07 17:32:43,449][232226] Updated weights for policy 0, policy_version 60490 (0.0006) [2023-03-07 17:32:44,244][232226] Updated weights for policy 0, policy_version 60500 (0.0007) [2023-03-07 17:32:45,040][232226] Updated weights for policy 0, policy_version 60510 (0.0006) [2023-03-07 17:32:45,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12885.0). Total num frames: 61962240. Throughput: 0: 12888.2. Samples: 61927004. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:32:45,069][231894] Avg episode reward: [(0, '194.779')] [2023-03-07 17:32:45,827][232226] Updated weights for policy 0, policy_version 60520 (0.0006) [2023-03-07 17:32:46,622][232226] Updated weights for policy 0, policy_version 60530 (0.0006) [2023-03-07 17:32:47,449][232226] Updated weights for policy 0, policy_version 60540 (0.0006) [2023-03-07 17:32:48,220][232226] Updated weights for policy 0, policy_version 60550 (0.0006) [2023-03-07 17:32:49,026][232226] Updated weights for policy 0, policy_version 60560 (0.0006) [2023-03-07 17:32:49,826][232226] Updated weights for policy 0, policy_version 60570 (0.0007) [2023-03-07 17:32:50,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.4, 300 sec: 12885.0). Total num frames: 62026752. Throughput: 0: 12879.0. Samples: 62004280. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:32:50,069][231894] Avg episode reward: [(0, '193.209')] [2023-03-07 17:32:50,628][232226] Updated weights for policy 0, policy_version 60580 (0.0007) [2023-03-07 17:32:51,391][232226] Updated weights for policy 0, policy_version 60590 (0.0006) [2023-03-07 17:32:52,191][232226] Updated weights for policy 0, policy_version 60600 (0.0006) [2023-03-07 17:32:52,997][232226] Updated weights for policy 0, policy_version 60610 (0.0007) [2023-03-07 17:32:53,795][232226] Updated weights for policy 0, policy_version 60620 (0.0006) [2023-03-07 17:32:54,612][232226] Updated weights for policy 0, policy_version 60630 (0.0006) [2023-03-07 17:32:55,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 62090240. Throughput: 0: 12869.8. Samples: 62081342. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:32:55,069][231894] Avg episode reward: [(0, '193.945')] [2023-03-07 17:32:55,395][232226] Updated weights for policy 0, policy_version 60640 (0.0005) [2023-03-07 17:32:56,188][232226] Updated weights for policy 0, policy_version 60650 (0.0006) [2023-03-07 17:32:56,999][232226] Updated weights for policy 0, policy_version 60660 (0.0006) [2023-03-07 17:32:57,777][232226] Updated weights for policy 0, policy_version 60670 (0.0006) [2023-03-07 17:32:58,559][232226] Updated weights for policy 0, policy_version 60680 (0.0007) [2023-03-07 17:32:59,363][232226] Updated weights for policy 0, policy_version 60690 (0.0007) [2023-03-07 17:33:00,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12881.6). Total num frames: 62154752. Throughput: 0: 12866.8. Samples: 62120099. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:33:00,069][231894] Avg episode reward: [(0, '196.061')] [2023-03-07 17:33:00,172][232226] Updated weights for policy 0, policy_version 60700 (0.0007) [2023-03-07 17:33:00,952][232226] Updated weights for policy 0, policy_version 60710 (0.0006) [2023-03-07 17:33:01,748][232226] Updated weights for policy 0, policy_version 60720 (0.0007) [2023-03-07 17:33:02,555][232226] Updated weights for policy 0, policy_version 60730 (0.0007) [2023-03-07 17:33:03,350][232226] Updated weights for policy 0, policy_version 60740 (0.0007) [2023-03-07 17:33:04,141][232226] Updated weights for policy 0, policy_version 60750 (0.0007) [2023-03-07 17:33:04,920][232226] Updated weights for policy 0, policy_version 60760 (0.0007) [2023-03-07 17:33:05,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12881.6). Total num frames: 62219264. Throughput: 0: 12872.2. Samples: 62197369. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:33:05,069][231894] Avg episode reward: [(0, '188.817')] [2023-03-07 17:33:05,716][232226] Updated weights for policy 0, policy_version 60770 (0.0007) [2023-03-07 17:33:06,512][232226] Updated weights for policy 0, policy_version 60780 (0.0007) [2023-03-07 17:33:07,314][232226] Updated weights for policy 0, policy_version 60790 (0.0006) [2023-03-07 17:33:08,098][232226] Updated weights for policy 0, policy_version 60800 (0.0007) [2023-03-07 17:33:08,883][232226] Updated weights for policy 0, policy_version 60810 (0.0006) [2023-03-07 17:33:09,684][232226] Updated weights for policy 0, policy_version 60820 (0.0006) [2023-03-07 17:33:10,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12881.6). Total num frames: 62283776. Throughput: 0: 12880.0. Samples: 62274784. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:33:10,069][231894] Avg episode reward: [(0, '195.828')] [2023-03-07 17:33:10,472][232226] Updated weights for policy 0, policy_version 60830 (0.0005) [2023-03-07 17:33:11,268][232226] Updated weights for policy 0, policy_version 60840 (0.0006) [2023-03-07 17:33:12,051][232226] Updated weights for policy 0, policy_version 60850 (0.0007) [2023-03-07 17:33:12,846][232226] Updated weights for policy 0, policy_version 60860 (0.0006) [2023-03-07 17:33:13,669][232226] Updated weights for policy 0, policy_version 60870 (0.0007) [2023-03-07 17:33:14,457][232226] Updated weights for policy 0, policy_version 60880 (0.0007) [2023-03-07 17:33:15,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12885.0). Total num frames: 62348288. Throughput: 0: 12880.9. Samples: 62313582. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:33:15,069][231894] Avg episode reward: [(0, '191.216')] [2023-03-07 17:33:15,258][232226] Updated weights for policy 0, policy_version 60890 (0.0007) [2023-03-07 17:33:16,047][232226] Updated weights for policy 0, policy_version 60900 (0.0006) [2023-03-07 17:33:16,842][232226] Updated weights for policy 0, policy_version 60910 (0.0006) [2023-03-07 17:33:17,631][232226] Updated weights for policy 0, policy_version 60920 (0.0007) [2023-03-07 17:33:18,437][232226] Updated weights for policy 0, policy_version 60930 (0.0006) [2023-03-07 17:33:19,241][232226] Updated weights for policy 0, policy_version 60940 (0.0007) [2023-03-07 17:33:20,025][232226] Updated weights for policy 0, policy_version 60950 (0.0006) [2023-03-07 17:33:20,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12885.0). Total num frames: 62412800. Throughput: 0: 12879.7. Samples: 62390560. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:33:20,069][231894] Avg episode reward: [(0, '191.705')] [2023-03-07 17:33:20,835][232226] Updated weights for policy 0, policy_version 60960 (0.0006) [2023-03-07 17:33:21,625][232226] Updated weights for policy 0, policy_version 60970 (0.0006) [2023-03-07 17:33:22,391][232226] Updated weights for policy 0, policy_version 60980 (0.0006) [2023-03-07 17:33:23,216][232226] Updated weights for policy 0, policy_version 60990 (0.0006) [2023-03-07 17:33:24,037][232226] Updated weights for policy 0, policy_version 61000 (0.0007) [2023-03-07 17:33:24,816][232226] Updated weights for policy 0, policy_version 61010 (0.0007) [2023-03-07 17:33:25,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12868.3, 300 sec: 12881.6). Total num frames: 62476288. Throughput: 0: 12869.9. Samples: 62467593. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:33:25,069][231894] Avg episode reward: [(0, '197.284')] [2023-03-07 17:33:25,613][232226] Updated weights for policy 0, policy_version 61020 (0.0006) [2023-03-07 17:33:26,418][232226] Updated weights for policy 0, policy_version 61030 (0.0006) [2023-03-07 17:33:27,233][232226] Updated weights for policy 0, policy_version 61040 (0.0007) [2023-03-07 17:33:28,022][232226] Updated weights for policy 0, policy_version 61050 (0.0007) [2023-03-07 17:33:28,795][232226] Updated weights for policy 0, policy_version 61060 (0.0006) [2023-03-07 17:33:29,630][232226] Updated weights for policy 0, policy_version 61070 (0.0006) [2023-03-07 17:33:30,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12881.6). Total num frames: 62540800. Throughput: 0: 12867.4. Samples: 62506038. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:33:30,069][231894] Avg episode reward: [(0, '189.743')] [2023-03-07 17:33:30,442][232226] Updated weights for policy 0, policy_version 61080 (0.0006) [2023-03-07 17:33:31,229][232226] Updated weights for policy 0, policy_version 61090 (0.0006) [2023-03-07 17:33:32,025][232226] Updated weights for policy 0, policy_version 61100 (0.0006) [2023-03-07 17:33:32,811][232226] Updated weights for policy 0, policy_version 61110 (0.0007) [2023-03-07 17:33:33,622][232226] Updated weights for policy 0, policy_version 61120 (0.0007) [2023-03-07 17:33:34,416][232226] Updated weights for policy 0, policy_version 61130 (0.0007) [2023-03-07 17:33:35,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12881.6). Total num frames: 62605312. Throughput: 0: 12860.3. Samples: 62582993. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:33:35,069][231894] Avg episode reward: [(0, '187.225')] [2023-03-07 17:33:35,219][232226] Updated weights for policy 0, policy_version 61140 (0.0006) [2023-03-07 17:33:35,997][232226] Updated weights for policy 0, policy_version 61150 (0.0006) [2023-03-07 17:33:36,807][232226] Updated weights for policy 0, policy_version 61160 (0.0006) [2023-03-07 17:33:37,600][232226] Updated weights for policy 0, policy_version 61170 (0.0007) [2023-03-07 17:33:38,373][232226] Updated weights for policy 0, policy_version 61180 (0.0006) [2023-03-07 17:33:39,170][232226] Updated weights for policy 0, policy_version 61190 (0.0006) [2023-03-07 17:33:39,968][232226] Updated weights for policy 0, policy_version 61200 (0.0007) [2023-03-07 17:33:40,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12881.6). Total num frames: 62669824. Throughput: 0: 12866.9. Samples: 62660354. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:33:40,069][231894] Avg episode reward: [(0, '197.686')] [2023-03-07 17:33:40,751][232226] Updated weights for policy 0, policy_version 61210 (0.0006) [2023-03-07 17:33:41,537][232226] Updated weights for policy 0, policy_version 61220 (0.0007) [2023-03-07 17:33:42,337][232226] Updated weights for policy 0, policy_version 61230 (0.0006) [2023-03-07 17:33:43,123][232226] Updated weights for policy 0, policy_version 61240 (0.0006) [2023-03-07 17:33:43,892][232226] Updated weights for policy 0, policy_version 61250 (0.0006) [2023-03-07 17:33:44,715][232226] Updated weights for policy 0, policy_version 61260 (0.0006) [2023-03-07 17:33:45,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12881.6). Total num frames: 62734336. Throughput: 0: 12866.4. Samples: 62699086. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:33:45,070][231894] Avg episode reward: [(0, '187.911')] [2023-03-07 17:33:45,501][232226] Updated weights for policy 0, policy_version 61270 (0.0006) [2023-03-07 17:33:46,288][232226] Updated weights for policy 0, policy_version 61280 (0.0006) [2023-03-07 17:33:47,081][232226] Updated weights for policy 0, policy_version 61290 (0.0006) [2023-03-07 17:33:47,872][232226] Updated weights for policy 0, policy_version 61300 (0.0007) [2023-03-07 17:33:48,682][232226] Updated weights for policy 0, policy_version 61310 (0.0007) [2023-03-07 17:33:49,475][232226] Updated weights for policy 0, policy_version 61320 (0.0007) [2023-03-07 17:33:50,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12881.6). Total num frames: 62798848. Throughput: 0: 12871.5. Samples: 62776587. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:33:50,069][231894] Avg episode reward: [(0, '188.996')] [2023-03-07 17:33:50,271][232226] Updated weights for policy 0, policy_version 61330 (0.0006) [2023-03-07 17:33:51,072][232226] Updated weights for policy 0, policy_version 61340 (0.0006) [2023-03-07 17:33:51,884][232226] Updated weights for policy 0, policy_version 61350 (0.0007) [2023-03-07 17:33:52,661][232226] Updated weights for policy 0, policy_version 61360 (0.0006) [2023-03-07 17:33:53,456][232226] Updated weights for policy 0, policy_version 61370 (0.0007) [2023-03-07 17:33:54,260][232226] Updated weights for policy 0, policy_version 61380 (0.0007) [2023-03-07 17:33:55,053][232226] Updated weights for policy 0, policy_version 61390 (0.0006) [2023-03-07 17:33:55,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 62863360. Throughput: 0: 12867.1. Samples: 62853801. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:33:55,070][231894] Avg episode reward: [(0, '189.544')] [2023-03-07 17:33:55,836][232226] Updated weights for policy 0, policy_version 61400 (0.0007) [2023-03-07 17:33:56,640][232226] Updated weights for policy 0, policy_version 61410 (0.0007) [2023-03-07 17:33:57,423][232226] Updated weights for policy 0, policy_version 61420 (0.0006) [2023-03-07 17:33:58,221][232226] Updated weights for policy 0, policy_version 61430 (0.0007) [2023-03-07 17:33:59,035][232226] Updated weights for policy 0, policy_version 61440 (0.0007) [2023-03-07 17:33:59,810][232226] Updated weights for policy 0, policy_version 61450 (0.0007) [2023-03-07 17:34:00,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 62927872. Throughput: 0: 12865.3. Samples: 62892522. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:34:00,070][231894] Avg episode reward: [(0, '196.553')] [2023-03-07 17:34:00,604][232226] Updated weights for policy 0, policy_version 61460 (0.0006) [2023-03-07 17:34:01,389][232226] Updated weights for policy 0, policy_version 61470 (0.0006) [2023-03-07 17:34:02,170][232226] Updated weights for policy 0, policy_version 61480 (0.0006) [2023-03-07 17:34:02,973][232226] Updated weights for policy 0, policy_version 61490 (0.0007) [2023-03-07 17:34:03,765][232226] Updated weights for policy 0, policy_version 61500 (0.0006) [2023-03-07 17:34:04,565][232226] Updated weights for policy 0, policy_version 61510 (0.0008) [2023-03-07 17:34:05,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 62992384. Throughput: 0: 12877.4. Samples: 62970046. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:34:05,069][231894] Avg episode reward: [(0, '186.816')] [2023-03-07 17:34:05,358][232226] Updated weights for policy 0, policy_version 61520 (0.0006) [2023-03-07 17:34:06,150][232226] Updated weights for policy 0, policy_version 61530 (0.0007) [2023-03-07 17:34:06,969][232226] Updated weights for policy 0, policy_version 61540 (0.0008) [2023-03-07 17:34:07,762][232226] Updated weights for policy 0, policy_version 61550 (0.0006) [2023-03-07 17:34:08,558][232226] Updated weights for policy 0, policy_version 61560 (0.0006) [2023-03-07 17:34:09,355][232226] Updated weights for policy 0, policy_version 61570 (0.0006) [2023-03-07 17:34:10,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12885.4, 300 sec: 12885.0). Total num frames: 63056896. Throughput: 0: 12879.4. Samples: 63047167. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:34:10,069][231894] Avg episode reward: [(0, '191.302')] [2023-03-07 17:34:10,133][232226] Updated weights for policy 0, policy_version 61580 (0.0006) [2023-03-07 17:34:10,930][232226] Updated weights for policy 0, policy_version 61590 (0.0006) [2023-03-07 17:34:11,730][232226] Updated weights for policy 0, policy_version 61600 (0.0007) [2023-03-07 17:34:12,523][232226] Updated weights for policy 0, policy_version 61610 (0.0006) [2023-03-07 17:34:13,328][232226] Updated weights for policy 0, policy_version 61620 (0.0007) [2023-03-07 17:34:14,122][232226] Updated weights for policy 0, policy_version 61630 (0.0007) [2023-03-07 17:34:14,927][232226] Updated weights for policy 0, policy_version 61640 (0.0006) [2023-03-07 17:34:15,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12868.2, 300 sec: 12881.6). Total num frames: 63120384. Throughput: 0: 12886.3. Samples: 63085922. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 17:34:15,070][231894] Avg episode reward: [(0, '186.491')] [2023-03-07 17:34:15,724][232226] Updated weights for policy 0, policy_version 61650 (0.0006) [2023-03-07 17:34:16,513][232226] Updated weights for policy 0, policy_version 61660 (0.0007) [2023-03-07 17:34:17,326][232226] Updated weights for policy 0, policy_version 61670 (0.0006) [2023-03-07 17:34:18,122][232226] Updated weights for policy 0, policy_version 61680 (0.0006) [2023-03-07 17:34:18,928][232226] Updated weights for policy 0, policy_version 61690 (0.0008) [2023-03-07 17:34:19,713][232226] Updated weights for policy 0, policy_version 61700 (0.0006) [2023-03-07 17:34:20,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12868.2, 300 sec: 12881.6). Total num frames: 63184896. Throughput: 0: 12884.7. Samples: 63162803. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 17:34:20,070][231894] Avg episode reward: [(0, '197.426')] [2023-03-07 17:34:20,502][232226] Updated weights for policy 0, policy_version 61710 (0.0007) [2023-03-07 17:34:21,304][232226] Updated weights for policy 0, policy_version 61720 (0.0006) [2023-03-07 17:34:22,099][232226] Updated weights for policy 0, policy_version 61730 (0.0007) [2023-03-07 17:34:22,893][232226] Updated weights for policy 0, policy_version 61740 (0.0007) [2023-03-07 17:34:23,673][232226] Updated weights for policy 0, policy_version 61750 (0.0006) [2023-03-07 17:34:24,462][232226] Updated weights for policy 0, policy_version 61760 (0.0007) [2023-03-07 17:34:25,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 63249408. Throughput: 0: 12884.8. Samples: 63240171. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 17:34:25,070][231894] Avg episode reward: [(0, '199.026')] [2023-03-07 17:34:25,074][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000061767_63249408.pth... [2023-03-07 17:34:25,102][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000058748_60157952.pth [2023-03-07 17:34:25,257][232226] Updated weights for policy 0, policy_version 61770 (0.0006) [2023-03-07 17:34:26,057][232226] Updated weights for policy 0, policy_version 61780 (0.0006) [2023-03-07 17:34:26,865][232226] Updated weights for policy 0, policy_version 61790 (0.0006) [2023-03-07 17:34:27,665][232226] Updated weights for policy 0, policy_version 61800 (0.0006) [2023-03-07 17:34:28,442][232226] Updated weights for policy 0, policy_version 61810 (0.0007) [2023-03-07 17:34:29,241][232226] Updated weights for policy 0, policy_version 61820 (0.0007) [2023-03-07 17:34:30,019][232226] Updated weights for policy 0, policy_version 61830 (0.0007) [2023-03-07 17:34:30,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 63313920. Throughput: 0: 12880.0. Samples: 63278684. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 17:34:30,069][231894] Avg episode reward: [(0, '197.924')] [2023-03-07 17:34:30,834][232226] Updated weights for policy 0, policy_version 61840 (0.0006) [2023-03-07 17:34:31,628][232226] Updated weights for policy 0, policy_version 61850 (0.0006) [2023-03-07 17:34:32,413][232226] Updated weights for policy 0, policy_version 61860 (0.0007) [2023-03-07 17:34:33,214][232226] Updated weights for policy 0, policy_version 61870 (0.0006) [2023-03-07 17:34:34,004][232226] Updated weights for policy 0, policy_version 61880 (0.0006) [2023-03-07 17:34:34,786][232226] Updated weights for policy 0, policy_version 61890 (0.0007) [2023-03-07 17:34:35,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 63378432. Throughput: 0: 12880.1. Samples: 63356194. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 17:34:35,069][231894] Avg episode reward: [(0, '192.731')] [2023-03-07 17:34:35,593][232226] Updated weights for policy 0, policy_version 61900 (0.0006) [2023-03-07 17:34:36,357][232226] Updated weights for policy 0, policy_version 61910 (0.0007) [2023-03-07 17:34:37,174][232226] Updated weights for policy 0, policy_version 61920 (0.0007) [2023-03-07 17:34:37,987][232226] Updated weights for policy 0, policy_version 61930 (0.0007) [2023-03-07 17:34:38,777][232226] Updated weights for policy 0, policy_version 61940 (0.0007) [2023-03-07 17:34:39,584][232226] Updated weights for policy 0, policy_version 61950 (0.0006) [2023-03-07 17:34:40,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 63442944. Throughput: 0: 12879.4. Samples: 63433374. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 17:34:40,069][231894] Avg episode reward: [(0, '193.572')] [2023-03-07 17:34:40,367][232226] Updated weights for policy 0, policy_version 61960 (0.0006) [2023-03-07 17:34:41,151][232226] Updated weights for policy 0, policy_version 61970 (0.0007) [2023-03-07 17:34:41,949][232226] Updated weights for policy 0, policy_version 61980 (0.0007) [2023-03-07 17:34:42,743][232226] Updated weights for policy 0, policy_version 61990 (0.0005) [2023-03-07 17:34:43,535][232226] Updated weights for policy 0, policy_version 62000 (0.0006) [2023-03-07 17:34:44,329][232226] Updated weights for policy 0, policy_version 62010 (0.0006) [2023-03-07 17:34:45,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12885.4, 300 sec: 12885.0). Total num frames: 63507456. Throughput: 0: 12883.9. Samples: 63472296. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 17:34:45,069][231894] Avg episode reward: [(0, '189.922')] [2023-03-07 17:34:45,109][232226] Updated weights for policy 0, policy_version 62020 (0.0006) [2023-03-07 17:34:45,909][232226] Updated weights for policy 0, policy_version 62030 (0.0006) [2023-03-07 17:34:46,700][232226] Updated weights for policy 0, policy_version 62040 (0.0006) [2023-03-07 17:34:47,511][232226] Updated weights for policy 0, policy_version 62050 (0.0006) [2023-03-07 17:34:48,294][232226] Updated weights for policy 0, policy_version 62060 (0.0006) [2023-03-07 17:34:49,083][232226] Updated weights for policy 0, policy_version 62070 (0.0006) [2023-03-07 17:34:49,857][232226] Updated weights for policy 0, policy_version 62080 (0.0007) [2023-03-07 17:34:50,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12885.0). Total num frames: 63571968. Throughput: 0: 12882.1. Samples: 63549739. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 17:34:50,069][231894] Avg episode reward: [(0, '190.594')] [2023-03-07 17:34:50,645][232226] Updated weights for policy 0, policy_version 62090 (0.0006) [2023-03-07 17:34:51,438][232226] Updated weights for policy 0, policy_version 62100 (0.0006) [2023-03-07 17:34:52,226][232226] Updated weights for policy 0, policy_version 62110 (0.0006) [2023-03-07 17:34:52,998][232226] Updated weights for policy 0, policy_version 62120 (0.0006) [2023-03-07 17:34:53,797][232226] Updated weights for policy 0, policy_version 62130 (0.0006) [2023-03-07 17:34:54,594][232226] Updated weights for policy 0, policy_version 62140 (0.0006) [2023-03-07 17:34:55,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.4, 300 sec: 12885.0). Total num frames: 63636480. Throughput: 0: 12897.4. Samples: 63627548. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:34:55,069][231894] Avg episode reward: [(0, '197.753')] [2023-03-07 17:34:55,377][232226] Updated weights for policy 0, policy_version 62150 (0.0006) [2023-03-07 17:34:56,201][232226] Updated weights for policy 0, policy_version 62160 (0.0007) [2023-03-07 17:34:56,988][232226] Updated weights for policy 0, policy_version 62170 (0.0006) [2023-03-07 17:34:57,785][232226] Updated weights for policy 0, policy_version 62180 (0.0007) [2023-03-07 17:34:58,586][232226] Updated weights for policy 0, policy_version 62190 (0.0007) [2023-03-07 17:34:59,389][232226] Updated weights for policy 0, policy_version 62200 (0.0006) [2023-03-07 17:35:00,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12885.0). Total num frames: 63700992. Throughput: 0: 12892.5. Samples: 63666084. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:35:00,069][231894] Avg episode reward: [(0, '196.372')] [2023-03-07 17:35:00,157][232226] Updated weights for policy 0, policy_version 62210 (0.0007) [2023-03-07 17:35:00,962][232226] Updated weights for policy 0, policy_version 62220 (0.0006) [2023-03-07 17:35:01,775][232226] Updated weights for policy 0, policy_version 62230 (0.0006) [2023-03-07 17:35:02,564][232226] Updated weights for policy 0, policy_version 62240 (0.0006) [2023-03-07 17:35:03,377][232226] Updated weights for policy 0, policy_version 62250 (0.0005) [2023-03-07 17:35:04,178][232226] Updated weights for policy 0, policy_version 62260 (0.0006) [2023-03-07 17:35:04,964][232226] Updated weights for policy 0, policy_version 62270 (0.0006) [2023-03-07 17:35:05,069][231894] Fps is (10 sec: 12902.2, 60 sec: 12885.3, 300 sec: 12885.0). Total num frames: 63765504. Throughput: 0: 12899.4. Samples: 63743278. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:35:05,069][231894] Avg episode reward: [(0, '199.285')] [2023-03-07 17:35:05,759][232226] Updated weights for policy 0, policy_version 62280 (0.0006) [2023-03-07 17:35:06,557][232226] Updated weights for policy 0, policy_version 62290 (0.0006) [2023-03-07 17:35:07,350][232226] Updated weights for policy 0, policy_version 62300 (0.0006) [2023-03-07 17:35:08,171][232226] Updated weights for policy 0, policy_version 62310 (0.0007) [2023-03-07 17:35:08,965][232226] Updated weights for policy 0, policy_version 62320 (0.0006) [2023-03-07 17:35:09,753][232226] Updated weights for policy 0, policy_version 62330 (0.0006) [2023-03-07 17:35:10,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12881.6). Total num frames: 63828992. Throughput: 0: 12890.1. Samples: 63820226. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:35:10,069][231894] Avg episode reward: [(0, '187.528')] [2023-03-07 17:35:10,553][232226] Updated weights for policy 0, policy_version 62340 (0.0006) [2023-03-07 17:35:11,348][232226] Updated weights for policy 0, policy_version 62350 (0.0006) [2023-03-07 17:35:12,149][232226] Updated weights for policy 0, policy_version 62360 (0.0007) [2023-03-07 17:35:12,948][232226] Updated weights for policy 0, policy_version 62370 (0.0006) [2023-03-07 17:35:13,737][232226] Updated weights for policy 0, policy_version 62380 (0.0006) [2023-03-07 17:35:14,554][232226] Updated weights for policy 0, policy_version 62390 (0.0007) [2023-03-07 17:35:15,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 63893504. Throughput: 0: 12892.3. Samples: 63858837. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:35:15,070][231894] Avg episode reward: [(0, '196.126')] [2023-03-07 17:35:15,338][232226] Updated weights for policy 0, policy_version 62400 (0.0007) [2023-03-07 17:35:16,135][232226] Updated weights for policy 0, policy_version 62410 (0.0007) [2023-03-07 17:35:16,937][232226] Updated weights for policy 0, policy_version 62420 (0.0006) [2023-03-07 17:35:17,712][232226] Updated weights for policy 0, policy_version 62430 (0.0007) [2023-03-07 17:35:18,502][232226] Updated weights for policy 0, policy_version 62440 (0.0006) [2023-03-07 17:35:19,296][232226] Updated weights for policy 0, policy_version 62450 (0.0006) [2023-03-07 17:35:20,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.4, 300 sec: 12881.6). Total num frames: 63958016. Throughput: 0: 12886.8. Samples: 63936100. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:35:20,069][231894] Avg episode reward: [(0, '196.233')] [2023-03-07 17:35:20,088][232226] Updated weights for policy 0, policy_version 62460 (0.0008) [2023-03-07 17:35:20,877][232226] Updated weights for policy 0, policy_version 62470 (0.0007) [2023-03-07 17:35:21,672][232226] Updated weights for policy 0, policy_version 62480 (0.0007) [2023-03-07 17:35:22,454][232226] Updated weights for policy 0, policy_version 62490 (0.0007) [2023-03-07 17:35:23,253][232226] Updated weights for policy 0, policy_version 62500 (0.0007) [2023-03-07 17:35:24,066][232226] Updated weights for policy 0, policy_version 62510 (0.0006) [2023-03-07 17:35:24,847][232226] Updated weights for policy 0, policy_version 62520 (0.0006) [2023-03-07 17:35:25,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12885.4, 300 sec: 12881.6). Total num frames: 64022528. Throughput: 0: 12893.0. Samples: 64013558. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:35:25,069][231894] Avg episode reward: [(0, '193.206')] [2023-03-07 17:35:25,635][232226] Updated weights for policy 0, policy_version 62530 (0.0006) [2023-03-07 17:35:26,450][232226] Updated weights for policy 0, policy_version 62540 (0.0007) [2023-03-07 17:35:27,228][232226] Updated weights for policy 0, policy_version 62550 (0.0006) [2023-03-07 17:35:28,017][232226] Updated weights for policy 0, policy_version 62560 (0.0006) [2023-03-07 17:35:28,835][232226] Updated weights for policy 0, policy_version 62570 (0.0006) [2023-03-07 17:35:29,616][232226] Updated weights for policy 0, policy_version 62580 (0.0006) [2023-03-07 17:35:30,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12885.0). Total num frames: 64087040. Throughput: 0: 12888.0. Samples: 64052257. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:35:30,069][231894] Avg episode reward: [(0, '196.383')] [2023-03-07 17:35:30,408][232226] Updated weights for policy 0, policy_version 62590 (0.0007) [2023-03-07 17:35:31,194][232226] Updated weights for policy 0, policy_version 62600 (0.0006) [2023-03-07 17:35:32,017][232226] Updated weights for policy 0, policy_version 62610 (0.0006) [2023-03-07 17:35:32,808][232226] Updated weights for policy 0, policy_version 62620 (0.0006) [2023-03-07 17:35:33,622][232226] Updated weights for policy 0, policy_version 62630 (0.0007) [2023-03-07 17:35:34,425][232226] Updated weights for policy 0, policy_version 62640 (0.0006) [2023-03-07 17:35:35,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12885.0). Total num frames: 64151552. Throughput: 0: 12881.4. Samples: 64129404. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:35:35,069][231894] Avg episode reward: [(0, '189.119')] [2023-03-07 17:35:35,198][232226] Updated weights for policy 0, policy_version 62650 (0.0006) [2023-03-07 17:35:35,998][232226] Updated weights for policy 0, policy_version 62660 (0.0006) [2023-03-07 17:35:36,788][232226] Updated weights for policy 0, policy_version 62670 (0.0006) [2023-03-07 17:35:37,581][232226] Updated weights for policy 0, policy_version 62680 (0.0006) [2023-03-07 17:35:38,387][232226] Updated weights for policy 0, policy_version 62690 (0.0006) [2023-03-07 17:35:39,163][232226] Updated weights for policy 0, policy_version 62700 (0.0007) [2023-03-07 17:35:39,975][232226] Updated weights for policy 0, policy_version 62710 (0.0006) [2023-03-07 17:35:40,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.4, 300 sec: 12885.0). Total num frames: 64216064. Throughput: 0: 12868.3. Samples: 64206624. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:35:40,069][231894] Avg episode reward: [(0, '192.148')] [2023-03-07 17:35:40,758][232226] Updated weights for policy 0, policy_version 62720 (0.0006) [2023-03-07 17:35:41,534][232226] Updated weights for policy 0, policy_version 62730 (0.0007) [2023-03-07 17:35:42,325][232226] Updated weights for policy 0, policy_version 62740 (0.0006) [2023-03-07 17:35:43,114][232226] Updated weights for policy 0, policy_version 62750 (0.0006) [2023-03-07 17:35:43,906][232226] Updated weights for policy 0, policy_version 62760 (0.0006) [2023-03-07 17:35:44,699][232226] Updated weights for policy 0, policy_version 62770 (0.0006) [2023-03-07 17:35:45,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12885.0). Total num frames: 64280576. Throughput: 0: 12877.3. Samples: 64245561. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:35:45,069][231894] Avg episode reward: [(0, '194.138')] [2023-03-07 17:35:45,502][232226] Updated weights for policy 0, policy_version 62780 (0.0006) [2023-03-07 17:35:46,318][232226] Updated weights for policy 0, policy_version 62790 (0.0006) [2023-03-07 17:35:47,091][232226] Updated weights for policy 0, policy_version 62800 (0.0007) [2023-03-07 17:35:47,892][232226] Updated weights for policy 0, policy_version 62810 (0.0006) [2023-03-07 17:35:48,690][232226] Updated weights for policy 0, policy_version 62820 (0.0006) [2023-03-07 17:35:49,476][232226] Updated weights for policy 0, policy_version 62830 (0.0006) [2023-03-07 17:35:50,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12885.0). Total num frames: 64345088. Throughput: 0: 12877.9. Samples: 64322783. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:35:50,069][231894] Avg episode reward: [(0, '198.119')] [2023-03-07 17:35:50,290][232226] Updated weights for policy 0, policy_version 62840 (0.0006) [2023-03-07 17:35:51,084][232226] Updated weights for policy 0, policy_version 62850 (0.0006) [2023-03-07 17:35:51,882][232226] Updated weights for policy 0, policy_version 62860 (0.0006) [2023-03-07 17:35:52,668][232226] Updated weights for policy 0, policy_version 62870 (0.0007) [2023-03-07 17:35:53,468][232226] Updated weights for policy 0, policy_version 62880 (0.0006) [2023-03-07 17:35:54,268][232226] Updated weights for policy 0, policy_version 62890 (0.0007) [2023-03-07 17:35:55,065][232226] Updated weights for policy 0, policy_version 62900 (0.0006) [2023-03-07 17:35:55,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12885.0). Total num frames: 64409600. Throughput: 0: 12883.3. Samples: 64399974. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:35:55,069][231894] Avg episode reward: [(0, '196.689')] [2023-03-07 17:35:55,852][232226] Updated weights for policy 0, policy_version 62910 (0.0007) [2023-03-07 17:35:56,654][232226] Updated weights for policy 0, policy_version 62920 (0.0006) [2023-03-07 17:35:57,446][232226] Updated weights for policy 0, policy_version 62930 (0.0007) [2023-03-07 17:35:58,257][232226] Updated weights for policy 0, policy_version 62940 (0.0006) [2023-03-07 17:35:59,030][232226] Updated weights for policy 0, policy_version 62950 (0.0006) [2023-03-07 17:35:59,828][232226] Updated weights for policy 0, policy_version 62960 (0.0006) [2023-03-07 17:36:00,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12868.3, 300 sec: 12881.6). Total num frames: 64473088. Throughput: 0: 12881.6. Samples: 64438508. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:36:00,069][231894] Avg episode reward: [(0, '190.273')] [2023-03-07 17:36:00,643][232226] Updated weights for policy 0, policy_version 62970 (0.0006) [2023-03-07 17:36:01,415][232226] Updated weights for policy 0, policy_version 62980 (0.0006) [2023-03-07 17:36:02,203][232226] Updated weights for policy 0, policy_version 62990 (0.0006) [2023-03-07 17:36:02,995][232226] Updated weights for policy 0, policy_version 63000 (0.0006) [2023-03-07 17:36:03,785][232226] Updated weights for policy 0, policy_version 63010 (0.0007) [2023-03-07 17:36:04,591][232226] Updated weights for policy 0, policy_version 63020 (0.0006) [2023-03-07 17:36:05,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.4, 300 sec: 12885.1). Total num frames: 64538624. Throughput: 0: 12887.8. Samples: 64516050. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:36:05,069][231894] Avg episode reward: [(0, '193.507')] [2023-03-07 17:36:05,387][232226] Updated weights for policy 0, policy_version 63030 (0.0006) [2023-03-07 17:36:06,184][232226] Updated weights for policy 0, policy_version 63040 (0.0007) [2023-03-07 17:36:06,974][232226] Updated weights for policy 0, policy_version 63050 (0.0006) [2023-03-07 17:36:07,771][232226] Updated weights for policy 0, policy_version 63060 (0.0006) [2023-03-07 17:36:08,559][232226] Updated weights for policy 0, policy_version 63070 (0.0006) [2023-03-07 17:36:09,349][232226] Updated weights for policy 0, policy_version 63080 (0.0006) [2023-03-07 17:36:10,069][231894] Fps is (10 sec: 13004.8, 60 sec: 12902.4, 300 sec: 12885.0). Total num frames: 64603136. Throughput: 0: 12884.9. Samples: 64593380. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:36:10,069][231894] Avg episode reward: [(0, '198.199')] [2023-03-07 17:36:10,138][232226] Updated weights for policy 0, policy_version 63090 (0.0006) [2023-03-07 17:36:10,934][232226] Updated weights for policy 0, policy_version 63100 (0.0007) [2023-03-07 17:36:11,729][232226] Updated weights for policy 0, policy_version 63110 (0.0007) [2023-03-07 17:36:12,521][232226] Updated weights for policy 0, policy_version 63120 (0.0006) [2023-03-07 17:36:13,314][232226] Updated weights for policy 0, policy_version 63130 (0.0006) [2023-03-07 17:36:14,111][232226] Updated weights for policy 0, policy_version 63140 (0.0006) [2023-03-07 17:36:14,914][232226] Updated weights for policy 0, policy_version 63150 (0.0006) [2023-03-07 17:36:15,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12885.4, 300 sec: 12881.6). Total num frames: 64666624. Throughput: 0: 12885.5. Samples: 64632105. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:36:15,069][231894] Avg episode reward: [(0, '189.347')] [2023-03-07 17:36:15,713][232226] Updated weights for policy 0, policy_version 63160 (0.0005) [2023-03-07 17:36:16,507][232226] Updated weights for policy 0, policy_version 63170 (0.0007) [2023-03-07 17:36:17,310][232226] Updated weights for policy 0, policy_version 63180 (0.0006) [2023-03-07 17:36:18,096][232226] Updated weights for policy 0, policy_version 63190 (0.0006) [2023-03-07 17:36:18,893][232226] Updated weights for policy 0, policy_version 63200 (0.0006) [2023-03-07 17:36:19,691][232226] Updated weights for policy 0, policy_version 63210 (0.0006) [2023-03-07 17:36:20,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 64731136. Throughput: 0: 12886.3. Samples: 64709289. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:36:20,070][231894] Avg episode reward: [(0, '194.694')] [2023-03-07 17:36:20,486][232226] Updated weights for policy 0, policy_version 63220 (0.0006) [2023-03-07 17:36:21,259][232226] Updated weights for policy 0, policy_version 63230 (0.0006) [2023-03-07 17:36:22,054][232226] Updated weights for policy 0, policy_version 63240 (0.0006) [2023-03-07 17:36:22,845][232226] Updated weights for policy 0, policy_version 63250 (0.0007) [2023-03-07 17:36:23,649][232226] Updated weights for policy 0, policy_version 63260 (0.0006) [2023-03-07 17:36:24,437][232226] Updated weights for policy 0, policy_version 63270 (0.0007) [2023-03-07 17:36:25,069][231894] Fps is (10 sec: 12902.2, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 64795648. Throughput: 0: 12890.0. Samples: 64786677. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:36:25,069][231894] Avg episode reward: [(0, '182.672')] [2023-03-07 17:36:25,084][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000063278_64796672.pth... [2023-03-07 17:36:25,113][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000060259_61705216.pth [2023-03-07 17:36:25,236][232226] Updated weights for policy 0, policy_version 63280 (0.0006) [2023-03-07 17:36:26,043][232226] Updated weights for policy 0, policy_version 63290 (0.0007) [2023-03-07 17:36:26,844][232226] Updated weights for policy 0, policy_version 63300 (0.0006) [2023-03-07 17:36:27,635][232226] Updated weights for policy 0, policy_version 63310 (0.0007) [2023-03-07 17:36:28,425][232226] Updated weights for policy 0, policy_version 63320 (0.0007) [2023-03-07 17:36:29,221][232226] Updated weights for policy 0, policy_version 63330 (0.0007) [2023-03-07 17:36:30,001][232226] Updated weights for policy 0, policy_version 63340 (0.0006) [2023-03-07 17:36:30,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12881.6). Total num frames: 64860160. Throughput: 0: 12883.1. Samples: 64825301. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:36:30,069][231894] Avg episode reward: [(0, '194.438')] [2023-03-07 17:36:30,800][232226] Updated weights for policy 0, policy_version 63350 (0.0007) [2023-03-07 17:36:31,599][232226] Updated weights for policy 0, policy_version 63360 (0.0007) [2023-03-07 17:36:32,392][232226] Updated weights for policy 0, policy_version 63370 (0.0006) [2023-03-07 17:36:33,178][232226] Updated weights for policy 0, policy_version 63380 (0.0007) [2023-03-07 17:36:34,002][232226] Updated weights for policy 0, policy_version 63390 (0.0007) [2023-03-07 17:36:34,800][232226] Updated weights for policy 0, policy_version 63400 (0.0007) [2023-03-07 17:36:35,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12885.0). Total num frames: 64924672. Throughput: 0: 12886.7. Samples: 64902687. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:36:35,070][231894] Avg episode reward: [(0, '182.680')] [2023-03-07 17:36:35,598][232226] Updated weights for policy 0, policy_version 63410 (0.0006) [2023-03-07 17:36:36,394][232226] Updated weights for policy 0, policy_version 63420 (0.0006) [2023-03-07 17:36:37,198][232226] Updated weights for policy 0, policy_version 63430 (0.0007) [2023-03-07 17:36:38,003][232226] Updated weights for policy 0, policy_version 63440 (0.0006) [2023-03-07 17:36:38,790][232226] Updated weights for policy 0, policy_version 63450 (0.0006) [2023-03-07 17:36:39,580][232226] Updated weights for policy 0, policy_version 63460 (0.0006) [2023-03-07 17:36:40,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12885.1). Total num frames: 64989184. Throughput: 0: 12882.1. Samples: 64979668. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:36:40,080][231894] Avg episode reward: [(0, '200.651')] [2023-03-07 17:36:40,378][232226] Updated weights for policy 0, policy_version 63470 (0.0006) [2023-03-07 17:36:41,161][232226] Updated weights for policy 0, policy_version 63480 (0.0006) [2023-03-07 17:36:41,956][232226] Updated weights for policy 0, policy_version 63490 (0.0006) [2023-03-07 17:36:42,760][232226] Updated weights for policy 0, policy_version 63500 (0.0007) [2023-03-07 17:36:43,555][232226] Updated weights for policy 0, policy_version 63510 (0.0006) [2023-03-07 17:36:44,353][232226] Updated weights for policy 0, policy_version 63520 (0.0005) [2023-03-07 17:36:45,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12868.3, 300 sec: 12878.1). Total num frames: 65052672. Throughput: 0: 12883.3. Samples: 65018256. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:36:45,080][231894] Avg episode reward: [(0, '194.560')] [2023-03-07 17:36:45,142][232226] Updated weights for policy 0, policy_version 63530 (0.0006) [2023-03-07 17:36:45,922][232226] Updated weights for policy 0, policy_version 63540 (0.0006) [2023-03-07 17:36:46,728][232226] Updated weights for policy 0, policy_version 63550 (0.0006) [2023-03-07 17:36:47,547][232226] Updated weights for policy 0, policy_version 63560 (0.0007) [2023-03-07 17:36:48,325][232226] Updated weights for policy 0, policy_version 63570 (0.0007) [2023-03-07 17:36:49,131][232226] Updated weights for policy 0, policy_version 63580 (0.0006) [2023-03-07 17:36:49,933][232226] Updated weights for policy 0, policy_version 63590 (0.0006) [2023-03-07 17:36:50,069][231894] Fps is (10 sec: 12799.8, 60 sec: 12868.2, 300 sec: 12881.6). Total num frames: 65117184. Throughput: 0: 12874.8. Samples: 65095418. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:36:50,080][231894] Avg episode reward: [(0, '191.175')] [2023-03-07 17:36:50,735][232226] Updated weights for policy 0, policy_version 63600 (0.0007) [2023-03-07 17:36:51,539][232226] Updated weights for policy 0, policy_version 63610 (0.0006) [2023-03-07 17:36:52,331][232226] Updated weights for policy 0, policy_version 63620 (0.0006) [2023-03-07 17:36:53,119][232226] Updated weights for policy 0, policy_version 63630 (0.0006) [2023-03-07 17:36:53,931][232226] Updated weights for policy 0, policy_version 63640 (0.0006) [2023-03-07 17:36:54,734][232226] Updated weights for policy 0, policy_version 63650 (0.0006) [2023-03-07 17:36:55,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12878.1). Total num frames: 65181696. Throughput: 0: 12867.0. Samples: 65172394. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:36:55,080][231894] Avg episode reward: [(0, '187.152')] [2023-03-07 17:36:55,524][232226] Updated weights for policy 0, policy_version 63660 (0.0005) [2023-03-07 17:36:56,313][232226] Updated weights for policy 0, policy_version 63670 (0.0006) [2023-03-07 17:36:57,122][232226] Updated weights for policy 0, policy_version 63680 (0.0006) [2023-03-07 17:36:57,914][232226] Updated weights for policy 0, policy_version 63690 (0.0005) [2023-03-07 17:36:58,700][232226] Updated weights for policy 0, policy_version 63700 (0.0007) [2023-03-07 17:36:59,521][232226] Updated weights for policy 0, policy_version 63710 (0.0006) [2023-03-07 17:37:00,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12878.1). Total num frames: 65246208. Throughput: 0: 12863.1. Samples: 65210943. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:37:00,080][231894] Avg episode reward: [(0, '187.165')] [2023-03-07 17:37:00,302][232226] Updated weights for policy 0, policy_version 63720 (0.0007) [2023-03-07 17:37:01,102][232226] Updated weights for policy 0, policy_version 63730 (0.0007) [2023-03-07 17:37:01,882][232226] Updated weights for policy 0, policy_version 63740 (0.0006) [2023-03-07 17:37:02,690][232226] Updated weights for policy 0, policy_version 63750 (0.0006) [2023-03-07 17:37:03,497][232226] Updated weights for policy 0, policy_version 63760 (0.0008) [2023-03-07 17:37:04,271][232226] Updated weights for policy 0, policy_version 63770 (0.0006) [2023-03-07 17:37:05,061][232226] Updated weights for policy 0, policy_version 63780 (0.0007) [2023-03-07 17:37:05,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.2, 300 sec: 12878.1). Total num frames: 65310720. Throughput: 0: 12856.1. Samples: 65287812. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:37:05,080][231894] Avg episode reward: [(0, '187.362')] [2023-03-07 17:37:05,882][232226] Updated weights for policy 0, policy_version 63790 (0.0007) [2023-03-07 17:37:06,673][232226] Updated weights for policy 0, policy_version 63800 (0.0006) [2023-03-07 17:37:07,469][232226] Updated weights for policy 0, policy_version 63810 (0.0006) [2023-03-07 17:37:08,282][232226] Updated weights for policy 0, policy_version 63820 (0.0008) [2023-03-07 17:37:09,062][232226] Updated weights for policy 0, policy_version 63830 (0.0006) [2023-03-07 17:37:09,867][232226] Updated weights for policy 0, policy_version 63840 (0.0007) [2023-03-07 17:37:10,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12874.6). Total num frames: 65374208. Throughput: 0: 12851.4. Samples: 65364990. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:37:10,080][231894] Avg episode reward: [(0, '191.206')] [2023-03-07 17:37:10,652][232226] Updated weights for policy 0, policy_version 63850 (0.0006) [2023-03-07 17:37:11,452][232226] Updated weights for policy 0, policy_version 63860 (0.0006) [2023-03-07 17:37:12,264][232226] Updated weights for policy 0, policy_version 63870 (0.0007) [2023-03-07 17:37:13,052][232226] Updated weights for policy 0, policy_version 63880 (0.0007) [2023-03-07 17:37:13,858][232226] Updated weights for policy 0, policy_version 63890 (0.0007) [2023-03-07 17:37:14,657][232226] Updated weights for policy 0, policy_version 63900 (0.0006) [2023-03-07 17:37:15,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12868.2, 300 sec: 12874.6). Total num frames: 65438720. Throughput: 0: 12849.9. Samples: 65403550. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:37:15,080][231894] Avg episode reward: [(0, '188.273')] [2023-03-07 17:37:15,462][232226] Updated weights for policy 0, policy_version 63910 (0.0006) [2023-03-07 17:37:16,273][232226] Updated weights for policy 0, policy_version 63920 (0.0006) [2023-03-07 17:37:17,038][232226] Updated weights for policy 0, policy_version 63930 (0.0006) [2023-03-07 17:37:17,841][232226] Updated weights for policy 0, policy_version 63940 (0.0006) [2023-03-07 17:37:18,634][232226] Updated weights for policy 0, policy_version 63950 (0.0006) [2023-03-07 17:37:19,409][232226] Updated weights for policy 0, policy_version 63960 (0.0006) [2023-03-07 17:37:20,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12878.1). Total num frames: 65503232. Throughput: 0: 12844.4. Samples: 65480681. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:37:20,080][231894] Avg episode reward: [(0, '193.836')] [2023-03-07 17:37:20,221][232226] Updated weights for policy 0, policy_version 63970 (0.0006) [2023-03-07 17:37:21,003][232226] Updated weights for policy 0, policy_version 63980 (0.0006) [2023-03-07 17:37:21,823][232226] Updated weights for policy 0, policy_version 63990 (0.0008) [2023-03-07 17:37:22,614][232226] Updated weights for policy 0, policy_version 64000 (0.0006) [2023-03-07 17:37:23,425][232226] Updated weights for policy 0, policy_version 64010 (0.0006) [2023-03-07 17:37:24,207][232226] Updated weights for policy 0, policy_version 64020 (0.0006) [2023-03-07 17:37:25,004][232226] Updated weights for policy 0, policy_version 64030 (0.0006) [2023-03-07 17:37:25,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12868.3, 300 sec: 12878.1). Total num frames: 65567744. Throughput: 0: 12848.0. Samples: 65557830. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:37:25,080][231894] Avg episode reward: [(0, '196.854')] [2023-03-07 17:37:25,800][232226] Updated weights for policy 0, policy_version 64040 (0.0007) [2023-03-07 17:37:26,591][232226] Updated weights for policy 0, policy_version 64050 (0.0006) [2023-03-07 17:37:27,389][232226] Updated weights for policy 0, policy_version 64060 (0.0006) [2023-03-07 17:37:28,190][232226] Updated weights for policy 0, policy_version 64070 (0.0007) [2023-03-07 17:37:28,993][232226] Updated weights for policy 0, policy_version 64080 (0.0007) [2023-03-07 17:37:29,786][232226] Updated weights for policy 0, policy_version 64090 (0.0006) [2023-03-07 17:37:30,069][231894] Fps is (10 sec: 12799.8, 60 sec: 12851.2, 300 sec: 12874.6). Total num frames: 65631232. Throughput: 0: 12847.1. Samples: 65596379. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:37:30,080][231894] Avg episode reward: [(0, '193.581')] [2023-03-07 17:37:30,565][232226] Updated weights for policy 0, policy_version 64100 (0.0007) [2023-03-07 17:37:31,369][232226] Updated weights for policy 0, policy_version 64110 (0.0007) [2023-03-07 17:37:32,177][232226] Updated weights for policy 0, policy_version 64120 (0.0007) [2023-03-07 17:37:32,969][232226] Updated weights for policy 0, policy_version 64130 (0.0006) [2023-03-07 17:37:33,752][232226] Updated weights for policy 0, policy_version 64140 (0.0006) [2023-03-07 17:37:34,559][232226] Updated weights for policy 0, policy_version 64150 (0.0006) [2023-03-07 17:37:35,069][231894] Fps is (10 sec: 12799.8, 60 sec: 12851.2, 300 sec: 12874.6). Total num frames: 65695744. Throughput: 0: 12847.1. Samples: 65673538. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:37:35,080][231894] Avg episode reward: [(0, '192.310')] [2023-03-07 17:37:35,363][232226] Updated weights for policy 0, policy_version 64160 (0.0006) [2023-03-07 17:37:36,147][232226] Updated weights for policy 0, policy_version 64170 (0.0006) [2023-03-07 17:37:36,931][232226] Updated weights for policy 0, policy_version 64180 (0.0006) [2023-03-07 17:37:37,734][232226] Updated weights for policy 0, policy_version 64190 (0.0006) [2023-03-07 17:37:38,533][232226] Updated weights for policy 0, policy_version 64200 (0.0007) [2023-03-07 17:37:39,337][232226] Updated weights for policy 0, policy_version 64210 (0.0007) [2023-03-07 17:37:40,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12834.1, 300 sec: 12871.2). Total num frames: 65759232. Throughput: 0: 12851.1. Samples: 65750694. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:37:40,069][231894] Avg episode reward: [(0, '189.207')] [2023-03-07 17:37:40,138][232226] Updated weights for policy 0, policy_version 64220 (0.0007) [2023-03-07 17:37:40,953][232226] Updated weights for policy 0, policy_version 64230 (0.0006) [2023-03-07 17:37:41,738][232226] Updated weights for policy 0, policy_version 64240 (0.0007) [2023-03-07 17:37:42,546][232226] Updated weights for policy 0, policy_version 64250 (0.0006) [2023-03-07 17:37:43,338][232226] Updated weights for policy 0, policy_version 64260 (0.0007) [2023-03-07 17:37:44,157][232226] Updated weights for policy 0, policy_version 64270 (0.0006) [2023-03-07 17:37:44,942][232226] Updated weights for policy 0, policy_version 64280 (0.0007) [2023-03-07 17:37:45,069][231894] Fps is (10 sec: 12800.2, 60 sec: 12851.2, 300 sec: 12871.2). Total num frames: 65823744. Throughput: 0: 12846.6. Samples: 65789040. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:37:45,069][231894] Avg episode reward: [(0, '196.712')] [2023-03-07 17:37:45,754][232226] Updated weights for policy 0, policy_version 64290 (0.0006) [2023-03-07 17:37:46,537][232226] Updated weights for policy 0, policy_version 64300 (0.0007) [2023-03-07 17:37:47,355][232226] Updated weights for policy 0, policy_version 64310 (0.0006) [2023-03-07 17:37:48,142][232226] Updated weights for policy 0, policy_version 64320 (0.0007) [2023-03-07 17:37:48,923][232226] Updated weights for policy 0, policy_version 64330 (0.0007) [2023-03-07 17:37:49,744][232226] Updated weights for policy 0, policy_version 64340 (0.0007) [2023-03-07 17:37:50,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12851.2, 300 sec: 12874.6). Total num frames: 65888256. Throughput: 0: 12845.1. Samples: 65865842. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:37:50,069][231894] Avg episode reward: [(0, '193.403')] [2023-03-07 17:37:50,524][232226] Updated weights for policy 0, policy_version 64350 (0.0006) [2023-03-07 17:37:51,328][232226] Updated weights for policy 0, policy_version 64360 (0.0006) [2023-03-07 17:37:52,114][232226] Updated weights for policy 0, policy_version 64370 (0.0006) [2023-03-07 17:37:52,918][232226] Updated weights for policy 0, policy_version 64380 (0.0006) [2023-03-07 17:37:53,708][232226] Updated weights for policy 0, policy_version 64390 (0.0007) [2023-03-07 17:37:54,527][232226] Updated weights for policy 0, policy_version 64400 (0.0006) [2023-03-07 17:37:55,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12874.6). Total num frames: 65952768. Throughput: 0: 12844.9. Samples: 65943009. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:37:55,069][231894] Avg episode reward: [(0, '188.512')] [2023-03-07 17:37:55,328][232226] Updated weights for policy 0, policy_version 64410 (0.0007) [2023-03-07 17:37:56,101][232226] Updated weights for policy 0, policy_version 64420 (0.0006) [2023-03-07 17:37:56,902][232226] Updated weights for policy 0, policy_version 64430 (0.0006) [2023-03-07 17:37:57,677][232226] Updated weights for policy 0, policy_version 64440 (0.0006) [2023-03-07 17:37:58,497][232226] Updated weights for policy 0, policy_version 64450 (0.0007) [2023-03-07 17:37:59,269][232226] Updated weights for policy 0, policy_version 64460 (0.0006) [2023-03-07 17:38:00,062][232226] Updated weights for policy 0, policy_version 64470 (0.0006) [2023-03-07 17:38:00,069][231894] Fps is (10 sec: 12902.2, 60 sec: 12851.2, 300 sec: 12874.6). Total num frames: 66017280. Throughput: 0: 12846.4. Samples: 65981638. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:38:00,069][231894] Avg episode reward: [(0, '192.702')] [2023-03-07 17:38:00,872][232226] Updated weights for policy 0, policy_version 64480 (0.0007) [2023-03-07 17:38:01,663][232226] Updated weights for policy 0, policy_version 64490 (0.0006) [2023-03-07 17:38:02,462][232226] Updated weights for policy 0, policy_version 64500 (0.0006) [2023-03-07 17:38:03,257][232226] Updated weights for policy 0, policy_version 64510 (0.0006) [2023-03-07 17:38:04,069][232226] Updated weights for policy 0, policy_version 64520 (0.0006) [2023-03-07 17:38:04,862][232226] Updated weights for policy 0, policy_version 64530 (0.0006) [2023-03-07 17:38:05,069][231894] Fps is (10 sec: 12799.8, 60 sec: 12834.1, 300 sec: 12871.2). Total num frames: 66080768. Throughput: 0: 12849.1. Samples: 66058891. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:38:05,070][231894] Avg episode reward: [(0, '190.988')] [2023-03-07 17:38:05,654][232226] Updated weights for policy 0, policy_version 64540 (0.0006) [2023-03-07 17:38:06,441][232226] Updated weights for policy 0, policy_version 64550 (0.0006) [2023-03-07 17:38:07,242][232226] Updated weights for policy 0, policy_version 64560 (0.0006) [2023-03-07 17:38:08,021][232226] Updated weights for policy 0, policy_version 64570 (0.0007) [2023-03-07 17:38:08,807][232226] Updated weights for policy 0, policy_version 64580 (0.0007) [2023-03-07 17:38:09,606][232226] Updated weights for policy 0, policy_version 64590 (0.0006) [2023-03-07 17:38:10,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12871.2). Total num frames: 66145280. Throughput: 0: 12854.2. Samples: 66136272. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:38:10,069][231894] Avg episode reward: [(0, '194.757')] [2023-03-07 17:38:10,389][232226] Updated weights for policy 0, policy_version 64600 (0.0007) [2023-03-07 17:38:11,182][232226] Updated weights for policy 0, policy_version 64610 (0.0006) [2023-03-07 17:38:11,970][232226] Updated weights for policy 0, policy_version 64620 (0.0006) [2023-03-07 17:38:12,768][232226] Updated weights for policy 0, policy_version 64630 (0.0006) [2023-03-07 17:38:13,565][232226] Updated weights for policy 0, policy_version 64640 (0.0007) [2023-03-07 17:38:14,361][232226] Updated weights for policy 0, policy_version 64650 (0.0006) [2023-03-07 17:38:15,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12871.2). Total num frames: 66209792. Throughput: 0: 12862.4. Samples: 66175186. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:38:15,069][231894] Avg episode reward: [(0, '198.257')] [2023-03-07 17:38:15,148][232226] Updated weights for policy 0, policy_version 64660 (0.0006) [2023-03-07 17:38:15,944][232226] Updated weights for policy 0, policy_version 64670 (0.0006) [2023-03-07 17:38:16,738][232226] Updated weights for policy 0, policy_version 64680 (0.0006) [2023-03-07 17:38:17,545][232226] Updated weights for policy 0, policy_version 64690 (0.0007) [2023-03-07 17:38:18,320][232226] Updated weights for policy 0, policy_version 64700 (0.0006) [2023-03-07 17:38:19,118][232226] Updated weights for policy 0, policy_version 64710 (0.0007) [2023-03-07 17:38:19,923][232226] Updated weights for policy 0, policy_version 64720 (0.0008) [2023-03-07 17:38:20,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12874.6). Total num frames: 66274304. Throughput: 0: 12870.6. Samples: 66252713. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:38:20,070][231894] Avg episode reward: [(0, '197.943')] [2023-03-07 17:38:20,705][232226] Updated weights for policy 0, policy_version 64730 (0.0006) [2023-03-07 17:38:21,504][232226] Updated weights for policy 0, policy_version 64740 (0.0006) [2023-03-07 17:38:22,284][232226] Updated weights for policy 0, policy_version 64750 (0.0006) [2023-03-07 17:38:23,095][232226] Updated weights for policy 0, policy_version 64760 (0.0006) [2023-03-07 17:38:23,894][232226] Updated weights for policy 0, policy_version 64770 (0.0007) [2023-03-07 17:38:24,683][232226] Updated weights for policy 0, policy_version 64780 (0.0007) [2023-03-07 17:38:25,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12851.2, 300 sec: 12874.6). Total num frames: 66338816. Throughput: 0: 12872.7. Samples: 66329969. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:38:25,070][231894] Avg episode reward: [(0, '193.684')] [2023-03-07 17:38:25,077][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000064785_66339840.pth... [2023-03-07 17:38:25,107][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000061767_63249408.pth [2023-03-07 17:38:25,482][232226] Updated weights for policy 0, policy_version 64790 (0.0006) [2023-03-07 17:38:26,281][232226] Updated weights for policy 0, policy_version 64800 (0.0006) [2023-03-07 17:38:27,056][232226] Updated weights for policy 0, policy_version 64810 (0.0006) [2023-03-07 17:38:27,841][232226] Updated weights for policy 0, policy_version 64820 (0.0006) [2023-03-07 17:38:28,635][232226] Updated weights for policy 0, policy_version 64830 (0.0006) [2023-03-07 17:38:29,462][232226] Updated weights for policy 0, policy_version 64840 (0.0006) [2023-03-07 17:38:30,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 66403328. Throughput: 0: 12879.9. Samples: 66368637. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:38:30,069][231894] Avg episode reward: [(0, '194.301')] [2023-03-07 17:38:30,249][232226] Updated weights for policy 0, policy_version 64850 (0.0006) [2023-03-07 17:38:31,026][232226] Updated weights for policy 0, policy_version 64860 (0.0006) [2023-03-07 17:38:31,847][232226] Updated weights for policy 0, policy_version 64870 (0.0006) [2023-03-07 17:38:32,626][232226] Updated weights for policy 0, policy_version 64880 (0.0006) [2023-03-07 17:38:33,427][232226] Updated weights for policy 0, policy_version 64890 (0.0007) [2023-03-07 17:38:34,221][232226] Updated weights for policy 0, policy_version 64900 (0.0007) [2023-03-07 17:38:35,041][232226] Updated weights for policy 0, policy_version 64910 (0.0006) [2023-03-07 17:38:35,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 66467840. Throughput: 0: 12887.2. Samples: 66445765. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:38:35,069][231894] Avg episode reward: [(0, '195.457')] [2023-03-07 17:38:35,819][232226] Updated weights for policy 0, policy_version 64920 (0.0006) [2023-03-07 17:38:36,638][232226] Updated weights for policy 0, policy_version 64930 (0.0006) [2023-03-07 17:38:37,425][232226] Updated weights for policy 0, policy_version 64940 (0.0006) [2023-03-07 17:38:38,213][232226] Updated weights for policy 0, policy_version 64950 (0.0007) [2023-03-07 17:38:39,007][232226] Updated weights for policy 0, policy_version 64960 (0.0006) [2023-03-07 17:38:39,799][232226] Updated weights for policy 0, policy_version 64970 (0.0006) [2023-03-07 17:38:40,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12874.6). Total num frames: 66532352. Throughput: 0: 12887.3. Samples: 66522938. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:38:40,070][231894] Avg episode reward: [(0, '192.763')] [2023-03-07 17:38:40,586][232226] Updated weights for policy 0, policy_version 64980 (0.0007) [2023-03-07 17:38:41,373][232226] Updated weights for policy 0, policy_version 64990 (0.0007) [2023-03-07 17:38:42,201][232226] Updated weights for policy 0, policy_version 65000 (0.0007) [2023-03-07 17:38:42,991][232226] Updated weights for policy 0, policy_version 65010 (0.0007) [2023-03-07 17:38:43,785][232226] Updated weights for policy 0, policy_version 65020 (0.0006) [2023-03-07 17:38:44,582][232226] Updated weights for policy 0, policy_version 65030 (0.0006) [2023-03-07 17:38:45,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12874.6). Total num frames: 66596864. Throughput: 0: 12885.9. Samples: 66561502. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:38:45,069][231894] Avg episode reward: [(0, '193.528')] [2023-03-07 17:38:45,366][232226] Updated weights for policy 0, policy_version 65040 (0.0006) [2023-03-07 17:38:46,148][232226] Updated weights for policy 0, policy_version 65050 (0.0006) [2023-03-07 17:38:46,953][232226] Updated weights for policy 0, policy_version 65060 (0.0006) [2023-03-07 17:38:47,759][232226] Updated weights for policy 0, policy_version 65070 (0.0006) [2023-03-07 17:38:48,569][232226] Updated weights for policy 0, policy_version 65080 (0.0006) [2023-03-07 17:38:49,362][232226] Updated weights for policy 0, policy_version 65090 (0.0006) [2023-03-07 17:38:50,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12874.6). Total num frames: 66661376. Throughput: 0: 12885.2. Samples: 66638723. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:38:50,069][231894] Avg episode reward: [(0, '186.613')] [2023-03-07 17:38:50,161][232226] Updated weights for policy 0, policy_version 65100 (0.0006) [2023-03-07 17:38:50,942][232226] Updated weights for policy 0, policy_version 65110 (0.0006) [2023-03-07 17:38:51,741][232226] Updated weights for policy 0, policy_version 65120 (0.0007) [2023-03-07 17:38:52,536][232226] Updated weights for policy 0, policy_version 65130 (0.0007) [2023-03-07 17:38:53,313][232226] Updated weights for policy 0, policy_version 65140 (0.0007) [2023-03-07 17:38:54,112][232226] Updated weights for policy 0, policy_version 65150 (0.0006) [2023-03-07 17:38:54,928][232226] Updated weights for policy 0, policy_version 65160 (0.0007) [2023-03-07 17:38:55,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 66724864. Throughput: 0: 12886.7. Samples: 66716173. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:38:55,069][231894] Avg episode reward: [(0, '200.206')] [2023-03-07 17:38:55,706][232226] Updated weights for policy 0, policy_version 65170 (0.0006) [2023-03-07 17:38:56,488][232226] Updated weights for policy 0, policy_version 65180 (0.0006) [2023-03-07 17:38:57,290][232226] Updated weights for policy 0, policy_version 65190 (0.0007) [2023-03-07 17:38:58,073][232226] Updated weights for policy 0, policy_version 65200 (0.0007) [2023-03-07 17:38:58,890][232226] Updated weights for policy 0, policy_version 65210 (0.0006) [2023-03-07 17:38:59,677][232226] Updated weights for policy 0, policy_version 65220 (0.0007) [2023-03-07 17:39:00,069][231894] Fps is (10 sec: 12799.6, 60 sec: 12868.2, 300 sec: 12871.1). Total num frames: 66789376. Throughput: 0: 12880.1. Samples: 66754794. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:39:00,070][231894] Avg episode reward: [(0, '184.430')] [2023-03-07 17:39:00,483][232226] Updated weights for policy 0, policy_version 65230 (0.0006) [2023-03-07 17:39:01,290][232226] Updated weights for policy 0, policy_version 65240 (0.0006) [2023-03-07 17:39:02,075][232226] Updated weights for policy 0, policy_version 65250 (0.0007) [2023-03-07 17:39:02,879][232226] Updated weights for policy 0, policy_version 65260 (0.0007) [2023-03-07 17:39:03,680][232226] Updated weights for policy 0, policy_version 65270 (0.0007) [2023-03-07 17:39:04,469][232226] Updated weights for policy 0, policy_version 65280 (0.0007) [2023-03-07 17:39:05,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12871.2). Total num frames: 66853888. Throughput: 0: 12869.1. Samples: 66831824. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:39:05,069][231894] Avg episode reward: [(0, '188.387')] [2023-03-07 17:39:05,259][232226] Updated weights for policy 0, policy_version 65290 (0.0006) [2023-03-07 17:39:06,070][232226] Updated weights for policy 0, policy_version 65300 (0.0006) [2023-03-07 17:39:06,865][232226] Updated weights for policy 0, policy_version 65310 (0.0007) [2023-03-07 17:39:07,646][232226] Updated weights for policy 0, policy_version 65320 (0.0006) [2023-03-07 17:39:08,443][232226] Updated weights for policy 0, policy_version 65330 (0.0007) [2023-03-07 17:39:09,246][232226] Updated weights for policy 0, policy_version 65340 (0.0006) [2023-03-07 17:39:10,031][232226] Updated weights for policy 0, policy_version 65350 (0.0006) [2023-03-07 17:39:10,069][231894] Fps is (10 sec: 12902.7, 60 sec: 12885.3, 300 sec: 12874.6). Total num frames: 66918400. Throughput: 0: 12863.9. Samples: 66908846. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:39:10,070][231894] Avg episode reward: [(0, '192.857')] [2023-03-07 17:39:10,831][232226] Updated weights for policy 0, policy_version 65360 (0.0007) [2023-03-07 17:39:11,635][232226] Updated weights for policy 0, policy_version 65370 (0.0006) [2023-03-07 17:39:12,438][232226] Updated weights for policy 0, policy_version 65380 (0.0006) [2023-03-07 17:39:13,234][232226] Updated weights for policy 0, policy_version 65390 (0.0006) [2023-03-07 17:39:14,037][232226] Updated weights for policy 0, policy_version 65400 (0.0006) [2023-03-07 17:39:14,828][232226] Updated weights for policy 0, policy_version 65410 (0.0007) [2023-03-07 17:39:15,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12874.6). Total num frames: 66982912. Throughput: 0: 12862.5. Samples: 66947450. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:39:15,069][231894] Avg episode reward: [(0, '189.112')] [2023-03-07 17:39:15,633][232226] Updated weights for policy 0, policy_version 65420 (0.0006) [2023-03-07 17:39:16,418][232226] Updated weights for policy 0, policy_version 65430 (0.0005) [2023-03-07 17:39:17,219][232226] Updated weights for policy 0, policy_version 65440 (0.0006) [2023-03-07 17:39:18,010][232226] Updated weights for policy 0, policy_version 65450 (0.0007) [2023-03-07 17:39:18,785][232226] Updated weights for policy 0, policy_version 65460 (0.0006) [2023-03-07 17:39:19,607][232226] Updated weights for policy 0, policy_version 65470 (0.0007) [2023-03-07 17:39:20,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 67046400. Throughput: 0: 12863.8. Samples: 67024639. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:39:20,070][231894] Avg episode reward: [(0, '196.234')] [2023-03-07 17:39:20,403][232226] Updated weights for policy 0, policy_version 65480 (0.0007) [2023-03-07 17:39:21,196][232226] Updated weights for policy 0, policy_version 65490 (0.0007) [2023-03-07 17:39:21,993][232226] Updated weights for policy 0, policy_version 65500 (0.0006) [2023-03-07 17:39:22,809][232226] Updated weights for policy 0, policy_version 65510 (0.0006) [2023-03-07 17:39:23,604][232226] Updated weights for policy 0, policy_version 65520 (0.0006) [2023-03-07 17:39:24,398][232226] Updated weights for policy 0, policy_version 65530 (0.0007) [2023-03-07 17:39:25,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 67110912. Throughput: 0: 12859.7. Samples: 67101626. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:39:25,069][231894] Avg episode reward: [(0, '192.719')] [2023-03-07 17:39:25,187][232226] Updated weights for policy 0, policy_version 65540 (0.0006) [2023-03-07 17:39:26,002][232226] Updated weights for policy 0, policy_version 65550 (0.0007) [2023-03-07 17:39:26,795][232226] Updated weights for policy 0, policy_version 65560 (0.0007) [2023-03-07 17:39:27,578][232226] Updated weights for policy 0, policy_version 65570 (0.0006) [2023-03-07 17:39:28,381][232226] Updated weights for policy 0, policy_version 65580 (0.0006) [2023-03-07 17:39:29,149][232226] Updated weights for policy 0, policy_version 65590 (0.0007) [2023-03-07 17:39:29,943][232226] Updated weights for policy 0, policy_version 65600 (0.0006) [2023-03-07 17:39:30,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 67175424. Throughput: 0: 12859.3. Samples: 67140171. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:39:30,069][231894] Avg episode reward: [(0, '190.614')] [2023-03-07 17:39:30,757][232226] Updated weights for policy 0, policy_version 65610 (0.0006) [2023-03-07 17:39:31,562][232226] Updated weights for policy 0, policy_version 65620 (0.0006) [2023-03-07 17:39:32,344][232226] Updated weights for policy 0, policy_version 65630 (0.0006) [2023-03-07 17:39:33,148][232226] Updated weights for policy 0, policy_version 65640 (0.0007) [2023-03-07 17:39:33,915][232226] Updated weights for policy 0, policy_version 65650 (0.0006) [2023-03-07 17:39:34,721][232226] Updated weights for policy 0, policy_version 65660 (0.0006) [2023-03-07 17:39:35,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 67239936. Throughput: 0: 12864.2. Samples: 67217610. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:39:35,069][231894] Avg episode reward: [(0, '193.318')] [2023-03-07 17:39:35,518][232226] Updated weights for policy 0, policy_version 65670 (0.0006) [2023-03-07 17:39:36,314][232226] Updated weights for policy 0, policy_version 65680 (0.0007) [2023-03-07 17:39:37,112][232226] Updated weights for policy 0, policy_version 65690 (0.0006) [2023-03-07 17:39:37,924][232226] Updated weights for policy 0, policy_version 65700 (0.0005) [2023-03-07 17:39:38,710][232226] Updated weights for policy 0, policy_version 65710 (0.0006) [2023-03-07 17:39:39,500][232226] Updated weights for policy 0, policy_version 65720 (0.0006) [2023-03-07 17:39:40,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 67304448. Throughput: 0: 12856.0. Samples: 67294695. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:39:40,069][231894] Avg episode reward: [(0, '192.812')] [2023-03-07 17:39:40,305][232226] Updated weights for policy 0, policy_version 65730 (0.0008) [2023-03-07 17:39:41,099][232226] Updated weights for policy 0, policy_version 65740 (0.0006) [2023-03-07 17:39:41,893][232226] Updated weights for policy 0, policy_version 65750 (0.0006) [2023-03-07 17:39:42,673][232226] Updated weights for policy 0, policy_version 65760 (0.0007) [2023-03-07 17:39:43,499][232226] Updated weights for policy 0, policy_version 65770 (0.0007) [2023-03-07 17:39:44,274][232226] Updated weights for policy 0, policy_version 65780 (0.0006) [2023-03-07 17:39:45,057][232226] Updated weights for policy 0, policy_version 65790 (0.0006) [2023-03-07 17:39:45,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 67368960. Throughput: 0: 12862.0. Samples: 67333579. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:39:45,069][231894] Avg episode reward: [(0, '191.247')] [2023-03-07 17:39:45,862][232226] Updated weights for policy 0, policy_version 65800 (0.0006) [2023-03-07 17:39:46,657][232226] Updated weights for policy 0, policy_version 65810 (0.0006) [2023-03-07 17:39:47,462][232226] Updated weights for policy 0, policy_version 65820 (0.0006) [2023-03-07 17:39:48,259][232226] Updated weights for policy 0, policy_version 65830 (0.0006) [2023-03-07 17:39:49,058][232226] Updated weights for policy 0, policy_version 65840 (0.0006) [2023-03-07 17:39:49,863][232226] Updated weights for policy 0, policy_version 65850 (0.0007) [2023-03-07 17:39:50,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12867.7). Total num frames: 67432448. Throughput: 0: 12860.9. Samples: 67410562. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:39:50,069][231894] Avg episode reward: [(0, '189.684')] [2023-03-07 17:39:50,671][232226] Updated weights for policy 0, policy_version 65860 (0.0006) [2023-03-07 17:39:51,431][232226] Updated weights for policy 0, policy_version 65870 (0.0006) [2023-03-07 17:39:52,222][232226] Updated weights for policy 0, policy_version 65880 (0.0006) [2023-03-07 17:39:53,034][232226] Updated weights for policy 0, policy_version 65890 (0.0007) [2023-03-07 17:39:53,807][232226] Updated weights for policy 0, policy_version 65900 (0.0006) [2023-03-07 17:39:54,596][232226] Updated weights for policy 0, policy_version 65910 (0.0006) [2023-03-07 17:39:55,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12868.3, 300 sec: 12867.7). Total num frames: 67496960. Throughput: 0: 12871.2. Samples: 67488050. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:39:55,069][231894] Avg episode reward: [(0, '195.779')] [2023-03-07 17:39:55,392][232226] Updated weights for policy 0, policy_version 65920 (0.0006) [2023-03-07 17:39:56,178][232226] Updated weights for policy 0, policy_version 65930 (0.0006) [2023-03-07 17:39:56,967][232226] Updated weights for policy 0, policy_version 65940 (0.0006) [2023-03-07 17:39:57,773][232226] Updated weights for policy 0, policy_version 65950 (0.0006) [2023-03-07 17:39:58,559][232226] Updated weights for policy 0, policy_version 65960 (0.0006) [2023-03-07 17:39:59,351][232226] Updated weights for policy 0, policy_version 65970 (0.0006) [2023-03-07 17:40:00,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12867.7). Total num frames: 67561472. Throughput: 0: 12878.2. Samples: 67526966. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:40:00,069][231894] Avg episode reward: [(0, '187.321')] [2023-03-07 17:40:00,157][232226] Updated weights for policy 0, policy_version 65980 (0.0006) [2023-03-07 17:40:00,938][232226] Updated weights for policy 0, policy_version 65990 (0.0006) [2023-03-07 17:40:01,714][232226] Updated weights for policy 0, policy_version 66000 (0.0007) [2023-03-07 17:40:02,525][232226] Updated weights for policy 0, policy_version 66010 (0.0007) [2023-03-07 17:40:03,312][232226] Updated weights for policy 0, policy_version 66020 (0.0006) [2023-03-07 17:40:04,092][232226] Updated weights for policy 0, policy_version 66030 (0.0006) [2023-03-07 17:40:04,899][232226] Updated weights for policy 0, policy_version 66040 (0.0006) [2023-03-07 17:40:05,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 67625984. Throughput: 0: 12880.6. Samples: 67604266. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:40:05,069][231894] Avg episode reward: [(0, '194.148')] [2023-03-07 17:40:05,686][232226] Updated weights for policy 0, policy_version 66050 (0.0006) [2023-03-07 17:40:06,483][232226] Updated weights for policy 0, policy_version 66060 (0.0006) [2023-03-07 17:40:07,273][232226] Updated weights for policy 0, policy_version 66070 (0.0007) [2023-03-07 17:40:08,063][232226] Updated weights for policy 0, policy_version 66080 (0.0007) [2023-03-07 17:40:08,857][232226] Updated weights for policy 0, policy_version 66090 (0.0005) [2023-03-07 17:40:09,653][232226] Updated weights for policy 0, policy_version 66100 (0.0006) [2023-03-07 17:40:10,069][231894] Fps is (10 sec: 13004.8, 60 sec: 12885.4, 300 sec: 12874.6). Total num frames: 67691520. Throughput: 0: 12895.4. Samples: 67681916. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 17:40:10,069][231894] Avg episode reward: [(0, '192.804')] [2023-03-07 17:40:10,448][232226] Updated weights for policy 0, policy_version 66110 (0.0006) [2023-03-07 17:40:11,231][232226] Updated weights for policy 0, policy_version 66120 (0.0006) [2023-03-07 17:40:12,023][232226] Updated weights for policy 0, policy_version 66130 (0.0006) [2023-03-07 17:40:12,814][232226] Updated weights for policy 0, policy_version 66140 (0.0006) [2023-03-07 17:40:13,594][232226] Updated weights for policy 0, policy_version 66150 (0.0006) [2023-03-07 17:40:14,389][232226] Updated weights for policy 0, policy_version 66160 (0.0006) [2023-03-07 17:40:15,069][231894] Fps is (10 sec: 13004.7, 60 sec: 12885.4, 300 sec: 12874.6). Total num frames: 67756032. Throughput: 0: 12903.8. Samples: 67720844. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 17:40:15,069][231894] Avg episode reward: [(0, '192.937')] [2023-03-07 17:40:15,192][232226] Updated weights for policy 0, policy_version 66170 (0.0006) [2023-03-07 17:40:15,975][232226] Updated weights for policy 0, policy_version 66180 (0.0007) [2023-03-07 17:40:16,754][232226] Updated weights for policy 0, policy_version 66190 (0.0006) [2023-03-07 17:40:17,558][232226] Updated weights for policy 0, policy_version 66200 (0.0006) [2023-03-07 17:40:18,365][232226] Updated weights for policy 0, policy_version 66210 (0.0006) [2023-03-07 17:40:19,136][232226] Updated weights for policy 0, policy_version 66220 (0.0007) [2023-03-07 17:40:19,939][232226] Updated weights for policy 0, policy_version 66230 (0.0006) [2023-03-07 17:40:20,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12874.6). Total num frames: 67820544. Throughput: 0: 12911.1. Samples: 67798611. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 17:40:20,069][231894] Avg episode reward: [(0, '188.559')] [2023-03-07 17:40:20,738][232226] Updated weights for policy 0, policy_version 66240 (0.0006) [2023-03-07 17:40:21,522][232226] Updated weights for policy 0, policy_version 66250 (0.0006) [2023-03-07 17:40:22,319][232226] Updated weights for policy 0, policy_version 66260 (0.0006) [2023-03-07 17:40:23,126][232226] Updated weights for policy 0, policy_version 66270 (0.0006) [2023-03-07 17:40:23,915][232226] Updated weights for policy 0, policy_version 66280 (0.0006) [2023-03-07 17:40:24,704][232226] Updated weights for policy 0, policy_version 66290 (0.0006) [2023-03-07 17:40:25,069][231894] Fps is (10 sec: 12902.2, 60 sec: 12902.4, 300 sec: 12874.6). Total num frames: 67885056. Throughput: 0: 12915.4. Samples: 67875889. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 17:40:25,070][231894] Avg episode reward: [(0, '197.619')] [2023-03-07 17:40:25,074][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000066294_67885056.pth... [2023-03-07 17:40:25,107][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000063278_64796672.pth [2023-03-07 17:40:25,492][232226] Updated weights for policy 0, policy_version 66300 (0.0007) [2023-03-07 17:40:26,270][232226] Updated weights for policy 0, policy_version 66310 (0.0006) [2023-03-07 17:40:27,075][232226] Updated weights for policy 0, policy_version 66320 (0.0007) [2023-03-07 17:40:27,873][232226] Updated weights for policy 0, policy_version 66330 (0.0006) [2023-03-07 17:40:28,669][232226] Updated weights for policy 0, policy_version 66340 (0.0006) [2023-03-07 17:40:29,451][232226] Updated weights for policy 0, policy_version 66350 (0.0007) [2023-03-07 17:40:30,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12874.6). Total num frames: 67949568. Throughput: 0: 12911.6. Samples: 67914601. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 17:40:30,069][231894] Avg episode reward: [(0, '191.047')] [2023-03-07 17:40:30,273][232226] Updated weights for policy 0, policy_version 66360 (0.0006) [2023-03-07 17:40:31,049][232226] Updated weights for policy 0, policy_version 66370 (0.0007) [2023-03-07 17:40:31,829][232226] Updated weights for policy 0, policy_version 66380 (0.0006) [2023-03-07 17:40:32,642][232226] Updated weights for policy 0, policy_version 66390 (0.0006) [2023-03-07 17:40:33,442][232226] Updated weights for policy 0, policy_version 66400 (0.0006) [2023-03-07 17:40:34,246][232226] Updated weights for policy 0, policy_version 66410 (0.0006) [2023-03-07 17:40:35,029][232226] Updated weights for policy 0, policy_version 66420 (0.0006) [2023-03-07 17:40:35,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12874.6). Total num frames: 68014080. Throughput: 0: 12919.9. Samples: 67991961. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 17:40:35,070][231894] Avg episode reward: [(0, '192.942')] [2023-03-07 17:40:35,803][232226] Updated weights for policy 0, policy_version 66430 (0.0006) [2023-03-07 17:40:36,596][232226] Updated weights for policy 0, policy_version 66440 (0.0006) [2023-03-07 17:40:37,397][232226] Updated weights for policy 0, policy_version 66450 (0.0006) [2023-03-07 17:40:38,204][232226] Updated weights for policy 0, policy_version 66460 (0.0007) [2023-03-07 17:40:38,999][232226] Updated weights for policy 0, policy_version 66470 (0.0006) [2023-03-07 17:40:39,805][232226] Updated weights for policy 0, policy_version 66480 (0.0006) [2023-03-07 17:40:40,069][231894] Fps is (10 sec: 12902.2, 60 sec: 12902.4, 300 sec: 12874.6). Total num frames: 68078592. Throughput: 0: 12917.3. Samples: 68069328. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 17:40:40,069][231894] Avg episode reward: [(0, '197.604')] [2023-03-07 17:40:40,597][232226] Updated weights for policy 0, policy_version 66490 (0.0006) [2023-03-07 17:40:41,376][232226] Updated weights for policy 0, policy_version 66500 (0.0007) [2023-03-07 17:40:42,179][232226] Updated weights for policy 0, policy_version 66510 (0.0007) [2023-03-07 17:40:42,985][232226] Updated weights for policy 0, policy_version 66520 (0.0006) [2023-03-07 17:40:43,788][232226] Updated weights for policy 0, policy_version 66530 (0.0007) [2023-03-07 17:40:44,572][232226] Updated weights for policy 0, policy_version 66540 (0.0006) [2023-03-07 17:40:45,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12902.4, 300 sec: 12874.6). Total num frames: 68143104. Throughput: 0: 12908.8. Samples: 68107862. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 17:40:45,069][231894] Avg episode reward: [(0, '194.251')] [2023-03-07 17:40:45,365][232226] Updated weights for policy 0, policy_version 66550 (0.0007) [2023-03-07 17:40:46,163][232226] Updated weights for policy 0, policy_version 66560 (0.0006) [2023-03-07 17:40:46,945][232226] Updated weights for policy 0, policy_version 66570 (0.0006) [2023-03-07 17:40:47,740][232226] Updated weights for policy 0, policy_version 66580 (0.0006) [2023-03-07 17:40:48,534][232226] Updated weights for policy 0, policy_version 66590 (0.0006) [2023-03-07 17:40:49,327][232226] Updated weights for policy 0, policy_version 66600 (0.0007) [2023-03-07 17:40:50,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12919.5, 300 sec: 12874.6). Total num frames: 68207616. Throughput: 0: 12908.6. Samples: 68185155. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:40:50,070][231894] Avg episode reward: [(0, '195.451')] [2023-03-07 17:40:50,122][232226] Updated weights for policy 0, policy_version 66610 (0.0006) [2023-03-07 17:40:50,924][232226] Updated weights for policy 0, policy_version 66620 (0.0007) [2023-03-07 17:40:51,726][232226] Updated weights for policy 0, policy_version 66630 (0.0007) [2023-03-07 17:40:52,521][232226] Updated weights for policy 0, policy_version 66640 (0.0006) [2023-03-07 17:40:53,321][232226] Updated weights for policy 0, policy_version 66650 (0.0007) [2023-03-07 17:40:54,102][232226] Updated weights for policy 0, policy_version 66660 (0.0006) [2023-03-07 17:40:54,891][232226] Updated weights for policy 0, policy_version 66670 (0.0006) [2023-03-07 17:40:55,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12919.4, 300 sec: 12878.1). Total num frames: 68272128. Throughput: 0: 12903.5. Samples: 68262575. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:40:55,070][231894] Avg episode reward: [(0, '189.208')] [2023-03-07 17:40:55,683][232226] Updated weights for policy 0, policy_version 66680 (0.0007) [2023-03-07 17:40:56,483][232226] Updated weights for policy 0, policy_version 66690 (0.0006) [2023-03-07 17:40:57,282][232226] Updated weights for policy 0, policy_version 66700 (0.0006) [2023-03-07 17:40:58,071][232226] Updated weights for policy 0, policy_version 66710 (0.0006) [2023-03-07 17:40:58,858][232226] Updated weights for policy 0, policy_version 66720 (0.0006) [2023-03-07 17:40:59,659][232226] Updated weights for policy 0, policy_version 66730 (0.0006) [2023-03-07 17:41:00,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12919.4, 300 sec: 12874.6). Total num frames: 68336640. Throughput: 0: 12897.5. Samples: 68301234. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:41:00,070][231894] Avg episode reward: [(0, '181.628')] [2023-03-07 17:41:00,455][232226] Updated weights for policy 0, policy_version 66740 (0.0006) [2023-03-07 17:41:01,240][232226] Updated weights for policy 0, policy_version 66750 (0.0006) [2023-03-07 17:41:02,037][232226] Updated weights for policy 0, policy_version 66760 (0.0006) [2023-03-07 17:41:02,834][232226] Updated weights for policy 0, policy_version 66770 (0.0007) [2023-03-07 17:41:03,620][232226] Updated weights for policy 0, policy_version 66780 (0.0006) [2023-03-07 17:41:04,439][232226] Updated weights for policy 0, policy_version 66790 (0.0007) [2023-03-07 17:41:05,069][231894] Fps is (10 sec: 12800.2, 60 sec: 12902.4, 300 sec: 12871.2). Total num frames: 68400128. Throughput: 0: 12888.3. Samples: 68378583. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:41:05,069][231894] Avg episode reward: [(0, '195.972')] [2023-03-07 17:41:05,244][232226] Updated weights for policy 0, policy_version 66800 (0.0006) [2023-03-07 17:41:06,030][232226] Updated weights for policy 0, policy_version 66810 (0.0006) [2023-03-07 17:41:06,834][232226] Updated weights for policy 0, policy_version 66820 (0.0006) [2023-03-07 17:41:07,640][232226] Updated weights for policy 0, policy_version 66830 (0.0006) [2023-03-07 17:41:08,434][232226] Updated weights for policy 0, policy_version 66840 (0.0006) [2023-03-07 17:41:09,226][232226] Updated weights for policy 0, policy_version 66850 (0.0006) [2023-03-07 17:41:10,028][232226] Updated weights for policy 0, policy_version 66860 (0.0006) [2023-03-07 17:41:10,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12885.3, 300 sec: 12874.6). Total num frames: 68464640. Throughput: 0: 12880.1. Samples: 68455492. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:41:10,069][231894] Avg episode reward: [(0, '197.207')] [2023-03-07 17:41:10,827][232226] Updated weights for policy 0, policy_version 66870 (0.0006) [2023-03-07 17:41:11,643][232226] Updated weights for policy 0, policy_version 66880 (0.0007) [2023-03-07 17:41:12,443][232226] Updated weights for policy 0, policy_version 66890 (0.0006) [2023-03-07 17:41:13,235][232226] Updated weights for policy 0, policy_version 66900 (0.0006) [2023-03-07 17:41:14,029][232226] Updated weights for policy 0, policy_version 66910 (0.0007) [2023-03-07 17:41:14,826][232226] Updated weights for policy 0, policy_version 66920 (0.0006) [2023-03-07 17:41:15,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 68528128. Throughput: 0: 12868.2. Samples: 68493671. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:41:15,069][231894] Avg episode reward: [(0, '188.277')] [2023-03-07 17:41:15,635][232226] Updated weights for policy 0, policy_version 66930 (0.0007) [2023-03-07 17:41:16,439][232226] Updated weights for policy 0, policy_version 66940 (0.0007) [2023-03-07 17:41:17,250][232226] Updated weights for policy 0, policy_version 66950 (0.0006) [2023-03-07 17:41:18,034][232226] Updated weights for policy 0, policy_version 66960 (0.0006) [2023-03-07 17:41:18,827][232226] Updated weights for policy 0, policy_version 66970 (0.0007) [2023-03-07 17:41:19,625][232226] Updated weights for policy 0, policy_version 66980 (0.0006) [2023-03-07 17:41:20,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 68592640. Throughput: 0: 12857.3. Samples: 68570538. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:41:20,069][231894] Avg episode reward: [(0, '186.255')] [2023-03-07 17:41:20,441][232226] Updated weights for policy 0, policy_version 66990 (0.0006) [2023-03-07 17:41:21,234][232226] Updated weights for policy 0, policy_version 67000 (0.0008) [2023-03-07 17:41:22,035][232226] Updated weights for policy 0, policy_version 67010 (0.0006) [2023-03-07 17:41:22,843][232226] Updated weights for policy 0, policy_version 67020 (0.0006) [2023-03-07 17:41:23,614][232226] Updated weights for policy 0, policy_version 67030 (0.0006) [2023-03-07 17:41:24,414][232226] Updated weights for policy 0, policy_version 67040 (0.0006) [2023-03-07 17:41:25,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 68657152. Throughput: 0: 12854.0. Samples: 68647757. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:41:25,069][231894] Avg episode reward: [(0, '192.747')] [2023-03-07 17:41:25,210][232226] Updated weights for policy 0, policy_version 67050 (0.0006) [2023-03-07 17:41:26,006][232226] Updated weights for policy 0, policy_version 67060 (0.0006) [2023-03-07 17:41:26,793][232226] Updated weights for policy 0, policy_version 67070 (0.0006) [2023-03-07 17:41:27,592][232226] Updated weights for policy 0, policy_version 67080 (0.0006) [2023-03-07 17:41:28,345][232226] Updated weights for policy 0, policy_version 67090 (0.0006) [2023-03-07 17:41:29,161][232226] Updated weights for policy 0, policy_version 67100 (0.0006) [2023-03-07 17:41:29,961][232226] Updated weights for policy 0, policy_version 67110 (0.0006) [2023-03-07 17:41:30,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 68721664. Throughput: 0: 12859.0. Samples: 68686519. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:41:30,069][231894] Avg episode reward: [(0, '194.754')] [2023-03-07 17:41:30,766][232226] Updated weights for policy 0, policy_version 67120 (0.0006) [2023-03-07 17:41:31,563][232226] Updated weights for policy 0, policy_version 67130 (0.0007) [2023-03-07 17:41:32,364][232226] Updated weights for policy 0, policy_version 67140 (0.0006) [2023-03-07 17:41:33,166][232226] Updated weights for policy 0, policy_version 67150 (0.0006) [2023-03-07 17:41:33,954][232226] Updated weights for policy 0, policy_version 67160 (0.0006) [2023-03-07 17:41:34,761][232226] Updated weights for policy 0, policy_version 67170 (0.0007) [2023-03-07 17:41:35,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12867.7). Total num frames: 68785152. Throughput: 0: 12851.6. Samples: 68763475. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:41:35,069][231894] Avg episode reward: [(0, '196.236')] [2023-03-07 17:41:35,556][232226] Updated weights for policy 0, policy_version 67180 (0.0007) [2023-03-07 17:41:36,353][232226] Updated weights for policy 0, policy_version 67190 (0.0006) [2023-03-07 17:41:37,145][232226] Updated weights for policy 0, policy_version 67200 (0.0006) [2023-03-07 17:41:37,948][232226] Updated weights for policy 0, policy_version 67210 (0.0007) [2023-03-07 17:41:38,739][232226] Updated weights for policy 0, policy_version 67220 (0.0007) [2023-03-07 17:41:39,550][232226] Updated weights for policy 0, policy_version 67230 (0.0006) [2023-03-07 17:41:40,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12871.2). Total num frames: 68849664. Throughput: 0: 12841.4. Samples: 68840439. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:41:40,070][231894] Avg episode reward: [(0, '193.912')] [2023-03-07 17:41:40,338][232226] Updated weights for policy 0, policy_version 67240 (0.0007) [2023-03-07 17:41:41,133][232226] Updated weights for policy 0, policy_version 67250 (0.0006) [2023-03-07 17:41:41,926][232226] Updated weights for policy 0, policy_version 67260 (0.0006) [2023-03-07 17:41:42,740][232226] Updated weights for policy 0, policy_version 67270 (0.0008) [2023-03-07 17:41:43,520][232226] Updated weights for policy 0, policy_version 67280 (0.0007) [2023-03-07 17:41:44,317][232226] Updated weights for policy 0, policy_version 67290 (0.0006) [2023-03-07 17:41:45,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12871.2). Total num frames: 68914176. Throughput: 0: 12838.7. Samples: 68878973. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:41:45,069][231894] Avg episode reward: [(0, '194.524')] [2023-03-07 17:41:45,118][232226] Updated weights for policy 0, policy_version 67300 (0.0006) [2023-03-07 17:41:45,909][232226] Updated weights for policy 0, policy_version 67310 (0.0007) [2023-03-07 17:41:46,709][232226] Updated weights for policy 0, policy_version 67320 (0.0006) [2023-03-07 17:41:47,494][232226] Updated weights for policy 0, policy_version 67330 (0.0007) [2023-03-07 17:41:48,286][232226] Updated weights for policy 0, policy_version 67340 (0.0007) [2023-03-07 17:41:49,103][232226] Updated weights for policy 0, policy_version 67350 (0.0007) [2023-03-07 17:41:49,889][232226] Updated weights for policy 0, policy_version 67360 (0.0007) [2023-03-07 17:41:50,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12871.2). Total num frames: 68978688. Throughput: 0: 12839.0. Samples: 68956340. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:41:50,069][231894] Avg episode reward: [(0, '188.131')] [2023-03-07 17:41:50,705][232226] Updated weights for policy 0, policy_version 67370 (0.0006) [2023-03-07 17:41:51,493][232226] Updated weights for policy 0, policy_version 67380 (0.0006) [2023-03-07 17:41:52,270][232226] Updated weights for policy 0, policy_version 67390 (0.0006) [2023-03-07 17:41:53,076][232226] Updated weights for policy 0, policy_version 67400 (0.0006) [2023-03-07 17:41:53,860][232226] Updated weights for policy 0, policy_version 67410 (0.0007) [2023-03-07 17:41:54,683][232226] Updated weights for policy 0, policy_version 67420 (0.0006) [2023-03-07 17:41:55,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12834.2, 300 sec: 12867.7). Total num frames: 69042176. Throughput: 0: 12842.4. Samples: 69033397. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:41:55,069][231894] Avg episode reward: [(0, '195.351')] [2023-03-07 17:41:55,472][232226] Updated weights for policy 0, policy_version 67430 (0.0006) [2023-03-07 17:41:56,259][232226] Updated weights for policy 0, policy_version 67440 (0.0006) [2023-03-07 17:41:57,086][232226] Updated weights for policy 0, policy_version 67450 (0.0006) [2023-03-07 17:41:57,870][232226] Updated weights for policy 0, policy_version 67460 (0.0007) [2023-03-07 17:41:58,654][232226] Updated weights for policy 0, policy_version 67470 (0.0006) [2023-03-07 17:41:59,457][232226] Updated weights for policy 0, policy_version 67480 (0.0006) [2023-03-07 17:42:00,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12867.7). Total num frames: 69106688. Throughput: 0: 12850.0. Samples: 69071922. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:42:00,069][231894] Avg episode reward: [(0, '193.092')] [2023-03-07 17:42:00,242][232226] Updated weights for policy 0, policy_version 67490 (0.0006) [2023-03-07 17:42:01,022][232226] Updated weights for policy 0, policy_version 67500 (0.0006) [2023-03-07 17:42:01,827][232226] Updated weights for policy 0, policy_version 67510 (0.0006) [2023-03-07 17:42:02,624][232226] Updated weights for policy 0, policy_version 67520 (0.0006) [2023-03-07 17:42:03,431][232226] Updated weights for policy 0, policy_version 67530 (0.0006) [2023-03-07 17:42:04,212][232226] Updated weights for policy 0, policy_version 67540 (0.0007) [2023-03-07 17:42:04,997][232226] Updated weights for policy 0, policy_version 67550 (0.0006) [2023-03-07 17:42:05,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12851.2, 300 sec: 12871.2). Total num frames: 69171200. Throughput: 0: 12859.2. Samples: 69149203. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:42:05,069][231894] Avg episode reward: [(0, '190.769')] [2023-03-07 17:42:05,793][232226] Updated weights for policy 0, policy_version 67560 (0.0006) [2023-03-07 17:42:06,592][232226] Updated weights for policy 0, policy_version 67570 (0.0007) [2023-03-07 17:42:07,377][232226] Updated weights for policy 0, policy_version 67580 (0.0007) [2023-03-07 17:42:08,173][232226] Updated weights for policy 0, policy_version 67590 (0.0006) [2023-03-07 17:42:08,965][232226] Updated weights for policy 0, policy_version 67600 (0.0007) [2023-03-07 17:42:09,764][232226] Updated weights for policy 0, policy_version 67610 (0.0007) [2023-03-07 17:42:10,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12871.2). Total num frames: 69235712. Throughput: 0: 12870.0. Samples: 69226905. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:42:10,069][231894] Avg episode reward: [(0, '199.581')] [2023-03-07 17:42:10,562][232226] Updated weights for policy 0, policy_version 67620 (0.0006) [2023-03-07 17:42:11,349][232226] Updated weights for policy 0, policy_version 67630 (0.0006) [2023-03-07 17:42:12,147][232226] Updated weights for policy 0, policy_version 67640 (0.0007) [2023-03-07 17:42:12,956][232226] Updated weights for policy 0, policy_version 67650 (0.0007) [2023-03-07 17:42:13,747][232226] Updated weights for policy 0, policy_version 67660 (0.0006) [2023-03-07 17:42:14,543][232226] Updated weights for policy 0, policy_version 67670 (0.0007) [2023-03-07 17:42:15,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 69300224. Throughput: 0: 12864.5. Samples: 69265422. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:42:15,069][231894] Avg episode reward: [(0, '195.025')] [2023-03-07 17:42:15,339][232226] Updated weights for policy 0, policy_version 67680 (0.0005) [2023-03-07 17:42:16,144][232226] Updated weights for policy 0, policy_version 67690 (0.0007) [2023-03-07 17:42:16,926][232226] Updated weights for policy 0, policy_version 67700 (0.0007) [2023-03-07 17:42:17,745][232226] Updated weights for policy 0, policy_version 67710 (0.0006) [2023-03-07 17:42:18,534][232226] Updated weights for policy 0, policy_version 67720 (0.0005) [2023-03-07 17:42:19,326][232226] Updated weights for policy 0, policy_version 67730 (0.0006) [2023-03-07 17:42:20,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 69364736. Throughput: 0: 12866.0. Samples: 69342445. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:42:20,069][231894] Avg episode reward: [(0, '192.773')] [2023-03-07 17:42:20,124][232226] Updated weights for policy 0, policy_version 67740 (0.0005) [2023-03-07 17:42:20,920][232226] Updated weights for policy 0, policy_version 67750 (0.0006) [2023-03-07 17:42:21,710][232226] Updated weights for policy 0, policy_version 67760 (0.0007) [2023-03-07 17:42:22,489][232226] Updated weights for policy 0, policy_version 67770 (0.0007) [2023-03-07 17:42:23,297][232226] Updated weights for policy 0, policy_version 67780 (0.0007) [2023-03-07 17:42:24,100][232226] Updated weights for policy 0, policy_version 67790 (0.0007) [2023-03-07 17:42:24,898][232226] Updated weights for policy 0, policy_version 67800 (0.0008) [2023-03-07 17:42:25,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.2, 300 sec: 12874.6). Total num frames: 69429248. Throughput: 0: 12872.8. Samples: 69419717. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:42:25,069][231894] Avg episode reward: [(0, '194.526')] [2023-03-07 17:42:25,073][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000067802_69429248.pth... [2023-03-07 17:42:25,102][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000064785_66339840.pth [2023-03-07 17:42:25,694][232226] Updated weights for policy 0, policy_version 67810 (0.0006) [2023-03-07 17:42:26,501][232226] Updated weights for policy 0, policy_version 67820 (0.0006) [2023-03-07 17:42:27,287][232226] Updated weights for policy 0, policy_version 67830 (0.0006) [2023-03-07 17:42:28,077][232226] Updated weights for policy 0, policy_version 67840 (0.0006) [2023-03-07 17:42:28,882][232226] Updated weights for policy 0, policy_version 67850 (0.0006) [2023-03-07 17:42:29,672][232226] Updated weights for policy 0, policy_version 67860 (0.0006) [2023-03-07 17:42:30,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 69493760. Throughput: 0: 12869.1. Samples: 69458084. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:42:30,069][231894] Avg episode reward: [(0, '188.609')] [2023-03-07 17:42:30,485][232226] Updated weights for policy 0, policy_version 67870 (0.0006) [2023-03-07 17:42:31,288][232226] Updated weights for policy 0, policy_version 67880 (0.0007) [2023-03-07 17:42:32,082][232226] Updated weights for policy 0, policy_version 67890 (0.0007) [2023-03-07 17:42:32,875][232226] Updated weights for policy 0, policy_version 67900 (0.0006) [2023-03-07 17:42:33,662][232226] Updated weights for policy 0, policy_version 67910 (0.0006) [2023-03-07 17:42:34,469][232226] Updated weights for policy 0, policy_version 67920 (0.0007) [2023-03-07 17:42:35,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 69557248. Throughput: 0: 12867.7. Samples: 69535385. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:42:35,069][231894] Avg episode reward: [(0, '196.643')] [2023-03-07 17:42:35,249][232226] Updated weights for policy 0, policy_version 67930 (0.0006) [2023-03-07 17:42:36,049][232226] Updated weights for policy 0, policy_version 67940 (0.0006) [2023-03-07 17:42:36,833][232226] Updated weights for policy 0, policy_version 67950 (0.0006) [2023-03-07 17:42:37,649][232226] Updated weights for policy 0, policy_version 67960 (0.0006) [2023-03-07 17:42:38,433][232226] Updated weights for policy 0, policy_version 67970 (0.0007) [2023-03-07 17:42:39,220][232226] Updated weights for policy 0, policy_version 67980 (0.0007) [2023-03-07 17:42:40,026][232226] Updated weights for policy 0, policy_version 67990 (0.0006) [2023-03-07 17:42:40,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 69621760. Throughput: 0: 12869.8. Samples: 69612541. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:42:40,069][231894] Avg episode reward: [(0, '191.579')] [2023-03-07 17:42:40,816][232226] Updated weights for policy 0, policy_version 68000 (0.0006) [2023-03-07 17:42:41,605][232226] Updated weights for policy 0, policy_version 68010 (0.0006) [2023-03-07 17:42:42,425][232226] Updated weights for policy 0, policy_version 68020 (0.0006) [2023-03-07 17:42:43,208][232226] Updated weights for policy 0, policy_version 68030 (0.0006) [2023-03-07 17:42:44,008][232226] Updated weights for policy 0, policy_version 68040 (0.0006) [2023-03-07 17:42:44,805][232226] Updated weights for policy 0, policy_version 68050 (0.0007) [2023-03-07 17:42:45,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 69686272. Throughput: 0: 12870.6. Samples: 69651098. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:42:45,069][231894] Avg episode reward: [(0, '191.604')] [2023-03-07 17:42:45,611][232226] Updated weights for policy 0, policy_version 68060 (0.0006) [2023-03-07 17:42:46,403][232226] Updated weights for policy 0, policy_version 68070 (0.0006) [2023-03-07 17:42:47,211][232226] Updated weights for policy 0, policy_version 68080 (0.0006) [2023-03-07 17:42:48,005][232226] Updated weights for policy 0, policy_version 68090 (0.0006) [2023-03-07 17:42:48,819][232226] Updated weights for policy 0, policy_version 68100 (0.0007) [2023-03-07 17:42:49,618][232226] Updated weights for policy 0, policy_version 68110 (0.0006) [2023-03-07 17:42:50,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12871.2). Total num frames: 69749760. Throughput: 0: 12865.4. Samples: 69728147. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:42:50,070][231894] Avg episode reward: [(0, '190.959')] [2023-03-07 17:42:50,408][232226] Updated weights for policy 0, policy_version 68120 (0.0007) [2023-03-07 17:42:51,201][232226] Updated weights for policy 0, policy_version 68130 (0.0007) [2023-03-07 17:42:52,020][232226] Updated weights for policy 0, policy_version 68140 (0.0006) [2023-03-07 17:42:52,810][232226] Updated weights for policy 0, policy_version 68150 (0.0006) [2023-03-07 17:42:53,622][232226] Updated weights for policy 0, policy_version 68160 (0.0006) [2023-03-07 17:42:54,401][232226] Updated weights for policy 0, policy_version 68170 (0.0006) [2023-03-07 17:42:55,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.2, 300 sec: 12871.2). Total num frames: 69814272. Throughput: 0: 12843.0. Samples: 69804841. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:42:55,080][231894] Avg episode reward: [(0, '199.121')] [2023-03-07 17:42:55,214][232226] Updated weights for policy 0, policy_version 68180 (0.0006) [2023-03-07 17:42:56,005][232226] Updated weights for policy 0, policy_version 68190 (0.0007) [2023-03-07 17:42:56,804][232226] Updated weights for policy 0, policy_version 68200 (0.0006) [2023-03-07 17:42:57,595][232226] Updated weights for policy 0, policy_version 68210 (0.0007) [2023-03-07 17:42:58,406][232226] Updated weights for policy 0, policy_version 68220 (0.0006) [2023-03-07 17:42:59,182][232226] Updated weights for policy 0, policy_version 68230 (0.0006) [2023-03-07 17:42:59,970][232226] Updated weights for policy 0, policy_version 68240 (0.0006) [2023-03-07 17:43:00,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 69878784. Throughput: 0: 12842.2. Samples: 69843324. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:43:00,070][231894] Avg episode reward: [(0, '198.291')] [2023-03-07 17:43:00,764][232226] Updated weights for policy 0, policy_version 68250 (0.0008) [2023-03-07 17:43:01,566][232226] Updated weights for policy 0, policy_version 68260 (0.0006) [2023-03-07 17:43:02,347][232226] Updated weights for policy 0, policy_version 68270 (0.0006) [2023-03-07 17:43:03,134][232226] Updated weights for policy 0, policy_version 68280 (0.0007) [2023-03-07 17:43:03,938][232226] Updated weights for policy 0, policy_version 68290 (0.0006) [2023-03-07 17:43:04,718][232226] Updated weights for policy 0, policy_version 68300 (0.0005) [2023-03-07 17:43:05,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 69943296. Throughput: 0: 12854.2. Samples: 69920883. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:43:05,069][231894] Avg episode reward: [(0, '184.942')] [2023-03-07 17:43:05,502][232226] Updated weights for policy 0, policy_version 68310 (0.0006) [2023-03-07 17:43:06,314][232226] Updated weights for policy 0, policy_version 68320 (0.0006) [2023-03-07 17:43:07,121][232226] Updated weights for policy 0, policy_version 68330 (0.0006) [2023-03-07 17:43:07,902][232226] Updated weights for policy 0, policy_version 68340 (0.0006) [2023-03-07 17:43:08,690][232226] Updated weights for policy 0, policy_version 68350 (0.0006) [2023-03-07 17:43:09,501][232226] Updated weights for policy 0, policy_version 68360 (0.0007) [2023-03-07 17:43:10,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12851.2, 300 sec: 12871.2). Total num frames: 70006784. Throughput: 0: 12856.1. Samples: 69998241. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:43:10,069][231894] Avg episode reward: [(0, '187.590')] [2023-03-07 17:43:10,297][232226] Updated weights for policy 0, policy_version 68370 (0.0006) [2023-03-07 17:43:11,099][232226] Updated weights for policy 0, policy_version 68380 (0.0006) [2023-03-07 17:43:11,878][232226] Updated weights for policy 0, policy_version 68390 (0.0006) [2023-03-07 17:43:12,674][232226] Updated weights for policy 0, policy_version 68400 (0.0006) [2023-03-07 17:43:13,457][232226] Updated weights for policy 0, policy_version 68410 (0.0005) [2023-03-07 17:43:14,269][232226] Updated weights for policy 0, policy_version 68420 (0.0006) [2023-03-07 17:43:15,064][232226] Updated weights for policy 0, policy_version 68430 (0.0007) [2023-03-07 17:43:15,069][231894] Fps is (10 sec: 12902.2, 60 sec: 12868.2, 300 sec: 12874.6). Total num frames: 70072320. Throughput: 0: 12857.9. Samples: 70036690. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:43:15,069][231894] Avg episode reward: [(0, '195.986')] [2023-03-07 17:43:15,855][232226] Updated weights for policy 0, policy_version 68440 (0.0005) [2023-03-07 17:43:16,661][232226] Updated weights for policy 0, policy_version 68450 (0.0006) [2023-03-07 17:43:17,472][232226] Updated weights for policy 0, policy_version 68460 (0.0006) [2023-03-07 17:43:18,268][232226] Updated weights for policy 0, policy_version 68470 (0.0007) [2023-03-07 17:43:19,063][232226] Updated weights for policy 0, policy_version 68480 (0.0006) [2023-03-07 17:43:19,844][232226] Updated weights for policy 0, policy_version 68490 (0.0006) [2023-03-07 17:43:20,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12851.2, 300 sec: 12871.2). Total num frames: 70135808. Throughput: 0: 12854.9. Samples: 70113853. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:43:20,069][231894] Avg episode reward: [(0, '198.030')] [2023-03-07 17:43:20,650][232226] Updated weights for policy 0, policy_version 68500 (0.0006) [2023-03-07 17:43:21,433][232226] Updated weights for policy 0, policy_version 68510 (0.0006) [2023-03-07 17:43:22,230][232226] Updated weights for policy 0, policy_version 68520 (0.0006) [2023-03-07 17:43:23,044][232226] Updated weights for policy 0, policy_version 68530 (0.0006) [2023-03-07 17:43:23,817][232226] Updated weights for policy 0, policy_version 68540 (0.0006) [2023-03-07 17:43:24,618][232226] Updated weights for policy 0, policy_version 68550 (0.0006) [2023-03-07 17:43:25,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12871.2). Total num frames: 70200320. Throughput: 0: 12861.4. Samples: 70191305. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:43:25,070][231894] Avg episode reward: [(0, '193.589')] [2023-03-07 17:43:25,387][232226] Updated weights for policy 0, policy_version 68560 (0.0006) [2023-03-07 17:43:26,173][232226] Updated weights for policy 0, policy_version 68570 (0.0006) [2023-03-07 17:43:26,981][232226] Updated weights for policy 0, policy_version 68580 (0.0006) [2023-03-07 17:43:27,758][232226] Updated weights for policy 0, policy_version 68590 (0.0006) [2023-03-07 17:43:28,543][232226] Updated weights for policy 0, policy_version 68600 (0.0006) [2023-03-07 17:43:29,332][232226] Updated weights for policy 0, policy_version 68610 (0.0006) [2023-03-07 17:43:30,069][231894] Fps is (10 sec: 13004.7, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 70265856. Throughput: 0: 12869.3. Samples: 70230217. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:43:30,069][231894] Avg episode reward: [(0, '194.602')] [2023-03-07 17:43:30,125][232226] Updated weights for policy 0, policy_version 68620 (0.0007) [2023-03-07 17:43:30,935][232226] Updated weights for policy 0, policy_version 68630 (0.0007) [2023-03-07 17:43:31,747][232226] Updated weights for policy 0, policy_version 68640 (0.0006) [2023-03-07 17:43:32,548][232226] Updated weights for policy 0, policy_version 68650 (0.0007) [2023-03-07 17:43:33,334][232226] Updated weights for policy 0, policy_version 68660 (0.0006) [2023-03-07 17:43:34,095][232226] Updated weights for policy 0, policy_version 68670 (0.0006) [2023-03-07 17:43:34,902][232226] Updated weights for policy 0, policy_version 68680 (0.0006) [2023-03-07 17:43:35,069][231894] Fps is (10 sec: 13004.9, 60 sec: 12885.3, 300 sec: 12874.6). Total num frames: 70330368. Throughput: 0: 12875.6. Samples: 70307549. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:43:35,069][231894] Avg episode reward: [(0, '189.337')] [2023-03-07 17:43:35,673][232226] Updated weights for policy 0, policy_version 68690 (0.0006) [2023-03-07 17:43:36,474][232226] Updated weights for policy 0, policy_version 68700 (0.0006) [2023-03-07 17:43:37,273][232226] Updated weights for policy 0, policy_version 68710 (0.0006) [2023-03-07 17:43:38,061][232226] Updated weights for policy 0, policy_version 68720 (0.0006) [2023-03-07 17:43:38,854][232226] Updated weights for policy 0, policy_version 68730 (0.0006) [2023-03-07 17:43:39,632][232226] Updated weights for policy 0, policy_version 68740 (0.0006) [2023-03-07 17:43:40,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12874.6). Total num frames: 70394880. Throughput: 0: 12905.2. Samples: 70385577. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:43:40,070][231894] Avg episode reward: [(0, '192.382')] [2023-03-07 17:43:40,430][232226] Updated weights for policy 0, policy_version 68750 (0.0007) [2023-03-07 17:43:41,227][232226] Updated weights for policy 0, policy_version 68760 (0.0006) [2023-03-07 17:43:42,015][232226] Updated weights for policy 0, policy_version 68770 (0.0006) [2023-03-07 17:43:42,813][232226] Updated weights for policy 0, policy_version 68780 (0.0006) [2023-03-07 17:43:43,607][232226] Updated weights for policy 0, policy_version 68790 (0.0006) [2023-03-07 17:43:44,399][232226] Updated weights for policy 0, policy_version 68800 (0.0007) [2023-03-07 17:43:45,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12874.6). Total num frames: 70459392. Throughput: 0: 12904.8. Samples: 70424039. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:43:45,069][231894] Avg episode reward: [(0, '187.799')] [2023-03-07 17:43:45,185][232226] Updated weights for policy 0, policy_version 68810 (0.0006) [2023-03-07 17:43:45,991][232226] Updated weights for policy 0, policy_version 68820 (0.0007) [2023-03-07 17:43:46,781][232226] Updated weights for policy 0, policy_version 68830 (0.0006) [2023-03-07 17:43:47,574][232226] Updated weights for policy 0, policy_version 68840 (0.0006) [2023-03-07 17:43:48,374][232226] Updated weights for policy 0, policy_version 68850 (0.0006) [2023-03-07 17:43:49,150][232226] Updated weights for policy 0, policy_version 68860 (0.0006) [2023-03-07 17:43:49,957][232226] Updated weights for policy 0, policy_version 68870 (0.0007) [2023-03-07 17:43:50,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12878.1). Total num frames: 70523904. Throughput: 0: 12907.1. Samples: 70501702. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:43:50,069][231894] Avg episode reward: [(0, '194.447')] [2023-03-07 17:43:50,741][232226] Updated weights for policy 0, policy_version 68880 (0.0007) [2023-03-07 17:43:51,534][232226] Updated weights for policy 0, policy_version 68890 (0.0006) [2023-03-07 17:43:52,330][232226] Updated weights for policy 0, policy_version 68900 (0.0006) [2023-03-07 17:43:53,137][232226] Updated weights for policy 0, policy_version 68910 (0.0006) [2023-03-07 17:43:53,915][232226] Updated weights for policy 0, policy_version 68920 (0.0007) [2023-03-07 17:43:54,718][232226] Updated weights for policy 0, policy_version 68930 (0.0007) [2023-03-07 17:43:55,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12878.1). Total num frames: 70588416. Throughput: 0: 12905.6. Samples: 70578993. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:43:55,069][231894] Avg episode reward: [(0, '190.793')] [2023-03-07 17:43:55,501][232226] Updated weights for policy 0, policy_version 68940 (0.0006) [2023-03-07 17:43:56,309][232226] Updated weights for policy 0, policy_version 68950 (0.0006) [2023-03-07 17:43:57,115][232226] Updated weights for policy 0, policy_version 68960 (0.0005) [2023-03-07 17:43:57,906][232226] Updated weights for policy 0, policy_version 68970 (0.0006) [2023-03-07 17:43:58,704][232226] Updated weights for policy 0, policy_version 68980 (0.0008) [2023-03-07 17:43:59,508][232226] Updated weights for policy 0, policy_version 68990 (0.0006) [2023-03-07 17:44:00,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12878.1). Total num frames: 70652928. Throughput: 0: 12908.4. Samples: 70617569. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:44:00,069][231894] Avg episode reward: [(0, '197.550')] [2023-03-07 17:44:00,296][232226] Updated weights for policy 0, policy_version 69000 (0.0007) [2023-03-07 17:44:01,126][232226] Updated weights for policy 0, policy_version 69010 (0.0006) [2023-03-07 17:44:01,913][232226] Updated weights for policy 0, policy_version 69020 (0.0006) [2023-03-07 17:44:02,696][232226] Updated weights for policy 0, policy_version 69030 (0.0006) [2023-03-07 17:44:03,493][232226] Updated weights for policy 0, policy_version 69040 (0.0006) [2023-03-07 17:44:04,297][232226] Updated weights for policy 0, policy_version 69050 (0.0006) [2023-03-07 17:44:05,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12885.3, 300 sec: 12874.6). Total num frames: 70716416. Throughput: 0: 12907.9. Samples: 70694708. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:44:05,069][231894] Avg episode reward: [(0, '193.821')] [2023-03-07 17:44:05,074][232226] Updated weights for policy 0, policy_version 69060 (0.0006) [2023-03-07 17:44:05,882][232226] Updated weights for policy 0, policy_version 69070 (0.0007) [2023-03-07 17:44:06,665][232226] Updated weights for policy 0, policy_version 69080 (0.0007) [2023-03-07 17:44:07,473][232226] Updated weights for policy 0, policy_version 69090 (0.0007) [2023-03-07 17:44:08,256][232226] Updated weights for policy 0, policy_version 69100 (0.0007) [2023-03-07 17:44:09,042][232226] Updated weights for policy 0, policy_version 69110 (0.0006) [2023-03-07 17:44:09,845][232226] Updated weights for policy 0, policy_version 69120 (0.0007) [2023-03-07 17:44:10,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12902.4, 300 sec: 12874.6). Total num frames: 70780928. Throughput: 0: 12904.8. Samples: 70772019. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:44:10,070][231894] Avg episode reward: [(0, '187.086')] [2023-03-07 17:44:10,647][232226] Updated weights for policy 0, policy_version 69130 (0.0007) [2023-03-07 17:44:11,445][232226] Updated weights for policy 0, policy_version 69140 (0.0006) [2023-03-07 17:44:12,236][232226] Updated weights for policy 0, policy_version 69150 (0.0006) [2023-03-07 17:44:13,013][232226] Updated weights for policy 0, policy_version 69160 (0.0007) [2023-03-07 17:44:13,828][232226] Updated weights for policy 0, policy_version 69170 (0.0006) [2023-03-07 17:44:14,613][232226] Updated weights for policy 0, policy_version 69180 (0.0006) [2023-03-07 17:44:15,069][231894] Fps is (10 sec: 12902.2, 60 sec: 12885.3, 300 sec: 12878.1). Total num frames: 70845440. Throughput: 0: 12894.3. Samples: 70810464. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:44:15,070][231894] Avg episode reward: [(0, '191.454')] [2023-03-07 17:44:15,397][232226] Updated weights for policy 0, policy_version 69190 (0.0006) [2023-03-07 17:44:16,198][232226] Updated weights for policy 0, policy_version 69200 (0.0006) [2023-03-07 17:44:16,996][232226] Updated weights for policy 0, policy_version 69210 (0.0006) [2023-03-07 17:44:17,787][232226] Updated weights for policy 0, policy_version 69220 (0.0006) [2023-03-07 17:44:18,593][232226] Updated weights for policy 0, policy_version 69230 (0.0006) [2023-03-07 17:44:19,390][232226] Updated weights for policy 0, policy_version 69240 (0.0006) [2023-03-07 17:44:20,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12878.1). Total num frames: 70909952. Throughput: 0: 12894.8. Samples: 70887815. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:44:20,069][231894] Avg episode reward: [(0, '194.441')] [2023-03-07 17:44:20,168][232226] Updated weights for policy 0, policy_version 69250 (0.0006) [2023-03-07 17:44:20,977][232226] Updated weights for policy 0, policy_version 69260 (0.0007) [2023-03-07 17:44:21,774][232226] Updated weights for policy 0, policy_version 69270 (0.0007) [2023-03-07 17:44:22,569][232226] Updated weights for policy 0, policy_version 69280 (0.0007) [2023-03-07 17:44:23,370][232226] Updated weights for policy 0, policy_version 69290 (0.0007) [2023-03-07 17:44:24,156][232226] Updated weights for policy 0, policy_version 69300 (0.0006) [2023-03-07 17:44:24,946][232226] Updated weights for policy 0, policy_version 69310 (0.0006) [2023-03-07 17:44:25,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12902.4, 300 sec: 12878.1). Total num frames: 70974464. Throughput: 0: 12881.5. Samples: 70965242. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:44:25,069][231894] Avg episode reward: [(0, '196.733')] [2023-03-07 17:44:25,074][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000069311_70974464.pth... [2023-03-07 17:44:25,104][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000066294_67885056.pth [2023-03-07 17:44:25,754][232226] Updated weights for policy 0, policy_version 69320 (0.0006) [2023-03-07 17:44:26,538][232226] Updated weights for policy 0, policy_version 69330 (0.0006) [2023-03-07 17:44:27,334][232226] Updated weights for policy 0, policy_version 69340 (0.0006) [2023-03-07 17:44:28,122][232226] Updated weights for policy 0, policy_version 69350 (0.0007) [2023-03-07 17:44:28,910][232226] Updated weights for policy 0, policy_version 69360 (0.0007) [2023-03-07 17:44:29,717][232226] Updated weights for policy 0, policy_version 69370 (0.0006) [2023-03-07 17:44:30,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12878.1). Total num frames: 71038976. Throughput: 0: 12885.5. Samples: 71003889. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:44:30,069][231894] Avg episode reward: [(0, '192.167')] [2023-03-07 17:44:30,506][232226] Updated weights for policy 0, policy_version 69380 (0.0006) [2023-03-07 17:44:31,308][232226] Updated weights for policy 0, policy_version 69390 (0.0007) [2023-03-07 17:44:32,103][232226] Updated weights for policy 0, policy_version 69400 (0.0006) [2023-03-07 17:44:32,895][232226] Updated weights for policy 0, policy_version 69410 (0.0006) [2023-03-07 17:44:33,695][232226] Updated weights for policy 0, policy_version 69420 (0.0006) [2023-03-07 17:44:34,490][232226] Updated weights for policy 0, policy_version 69430 (0.0006) [2023-03-07 17:44:35,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12878.1). Total num frames: 71103488. Throughput: 0: 12875.8. Samples: 71081113. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:44:35,069][231894] Avg episode reward: [(0, '188.117')] [2023-03-07 17:44:35,281][232226] Updated weights for policy 0, policy_version 69440 (0.0006) [2023-03-07 17:44:36,080][232226] Updated weights for policy 0, policy_version 69450 (0.0007) [2023-03-07 17:44:36,884][232226] Updated weights for policy 0, policy_version 69460 (0.0006) [2023-03-07 17:44:37,677][232226] Updated weights for policy 0, policy_version 69470 (0.0006) [2023-03-07 17:44:38,483][232226] Updated weights for policy 0, policy_version 69480 (0.0006) [2023-03-07 17:44:39,286][232226] Updated weights for policy 0, policy_version 69490 (0.0006) [2023-03-07 17:44:40,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 71166976. Throughput: 0: 12868.1. Samples: 71158056. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:44:40,069][231894] Avg episode reward: [(0, '191.885')] [2023-03-07 17:44:40,093][232226] Updated weights for policy 0, policy_version 69500 (0.0007) [2023-03-07 17:44:40,895][232226] Updated weights for policy 0, policy_version 69510 (0.0007) [2023-03-07 17:44:41,704][232226] Updated weights for policy 0, policy_version 69520 (0.0006) [2023-03-07 17:44:42,493][232226] Updated weights for policy 0, policy_version 69530 (0.0006) [2023-03-07 17:44:43,289][232226] Updated weights for policy 0, policy_version 69540 (0.0006) [2023-03-07 17:44:44,072][232226] Updated weights for policy 0, policy_version 69550 (0.0006) [2023-03-07 17:44:44,873][232226] Updated weights for policy 0, policy_version 69560 (0.0006) [2023-03-07 17:44:45,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12868.2, 300 sec: 12878.1). Total num frames: 71231488. Throughput: 0: 12862.7. Samples: 71196393. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:44:45,069][231894] Avg episode reward: [(0, '193.200')] [2023-03-07 17:44:45,681][232226] Updated weights for policy 0, policy_version 69570 (0.0006) [2023-03-07 17:44:46,489][232226] Updated weights for policy 0, policy_version 69580 (0.0006) [2023-03-07 17:44:47,270][232226] Updated weights for policy 0, policy_version 69590 (0.0006) [2023-03-07 17:44:48,067][232226] Updated weights for policy 0, policy_version 69600 (0.0007) [2023-03-07 17:44:48,877][232226] Updated weights for policy 0, policy_version 69610 (0.0006) [2023-03-07 17:44:49,663][232226] Updated weights for policy 0, policy_version 69620 (0.0007) [2023-03-07 17:44:50,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12851.2, 300 sec: 12874.6). Total num frames: 71294976. Throughput: 0: 12860.6. Samples: 71273434. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:44:50,069][231894] Avg episode reward: [(0, '192.729')] [2023-03-07 17:44:50,458][232226] Updated weights for policy 0, policy_version 69630 (0.0006) [2023-03-07 17:44:51,265][232226] Updated weights for policy 0, policy_version 69640 (0.0007) [2023-03-07 17:44:52,062][232226] Updated weights for policy 0, policy_version 69650 (0.0006) [2023-03-07 17:44:52,850][232226] Updated weights for policy 0, policy_version 69660 (0.0007) [2023-03-07 17:44:53,632][232226] Updated weights for policy 0, policy_version 69670 (0.0007) [2023-03-07 17:44:54,424][232226] Updated weights for policy 0, policy_version 69680 (0.0007) [2023-03-07 17:44:55,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12878.1). Total num frames: 71360512. Throughput: 0: 12863.1. Samples: 71350859. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:44:55,069][231894] Avg episode reward: [(0, '206.892')] [2023-03-07 17:44:55,206][232226] Updated weights for policy 0, policy_version 69690 (0.0007) [2023-03-07 17:44:56,004][232226] Updated weights for policy 0, policy_version 69700 (0.0006) [2023-03-07 17:44:56,814][232226] Updated weights for policy 0, policy_version 69710 (0.0006) [2023-03-07 17:44:57,611][232226] Updated weights for policy 0, policy_version 69720 (0.0007) [2023-03-07 17:44:58,393][232226] Updated weights for policy 0, policy_version 69730 (0.0007) [2023-03-07 17:44:59,186][232226] Updated weights for policy 0, policy_version 69740 (0.0006) [2023-03-07 17:44:59,990][232226] Updated weights for policy 0, policy_version 69750 (0.0006) [2023-03-07 17:45:00,069][231894] Fps is (10 sec: 13004.9, 60 sec: 12868.3, 300 sec: 12878.1). Total num frames: 71425024. Throughput: 0: 12866.3. Samples: 71389444. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:45:00,069][231894] Avg episode reward: [(0, '194.426')] [2023-03-07 17:45:00,794][232226] Updated weights for policy 0, policy_version 69760 (0.0006) [2023-03-07 17:45:01,569][232226] Updated weights for policy 0, policy_version 69770 (0.0007) [2023-03-07 17:45:02,371][232226] Updated weights for policy 0, policy_version 69780 (0.0006) [2023-03-07 17:45:03,199][232226] Updated weights for policy 0, policy_version 69790 (0.0006) [2023-03-07 17:45:03,970][232226] Updated weights for policy 0, policy_version 69800 (0.0006) [2023-03-07 17:45:04,765][232226] Updated weights for policy 0, policy_version 69810 (0.0006) [2023-03-07 17:45:05,069][231894] Fps is (10 sec: 12799.8, 60 sec: 12868.2, 300 sec: 12871.2). Total num frames: 71488512. Throughput: 0: 12862.1. Samples: 71466612. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:45:05,070][231894] Avg episode reward: [(0, '186.308')] [2023-03-07 17:45:05,574][232226] Updated weights for policy 0, policy_version 69820 (0.0006) [2023-03-07 17:45:06,367][232226] Updated weights for policy 0, policy_version 69830 (0.0006) [2023-03-07 17:45:07,153][232226] Updated weights for policy 0, policy_version 69840 (0.0006) [2023-03-07 17:45:07,945][232226] Updated weights for policy 0, policy_version 69850 (0.0006) [2023-03-07 17:45:08,729][232226] Updated weights for policy 0, policy_version 69860 (0.0008) [2023-03-07 17:45:09,535][232226] Updated weights for policy 0, policy_version 69870 (0.0006) [2023-03-07 17:45:10,069][231894] Fps is (10 sec: 12799.8, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 71553024. Throughput: 0: 12861.3. Samples: 71544002. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:45:10,070][231894] Avg episode reward: [(0, '184.859')] [2023-03-07 17:45:10,331][232226] Updated weights for policy 0, policy_version 69880 (0.0007) [2023-03-07 17:45:11,124][232226] Updated weights for policy 0, policy_version 69890 (0.0006) [2023-03-07 17:45:11,910][232226] Updated weights for policy 0, policy_version 69900 (0.0007) [2023-03-07 17:45:12,709][232226] Updated weights for policy 0, policy_version 69910 (0.0007) [2023-03-07 17:45:13,510][232226] Updated weights for policy 0, policy_version 69920 (0.0006) [2023-03-07 17:45:14,326][232226] Updated weights for policy 0, policy_version 69930 (0.0007) [2023-03-07 17:45:15,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 71617536. Throughput: 0: 12862.6. Samples: 71582705. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:45:15,069][231894] Avg episode reward: [(0, '193.264')] [2023-03-07 17:45:15,117][232226] Updated weights for policy 0, policy_version 69940 (0.0007) [2023-03-07 17:45:15,925][232226] Updated weights for policy 0, policy_version 69950 (0.0006) [2023-03-07 17:45:16,723][232226] Updated weights for policy 0, policy_version 69960 (0.0006) [2023-03-07 17:45:17,501][232226] Updated weights for policy 0, policy_version 69970 (0.0006) [2023-03-07 17:45:18,305][232226] Updated weights for policy 0, policy_version 69980 (0.0007) [2023-03-07 17:45:19,113][232226] Updated weights for policy 0, policy_version 69990 (0.0007) [2023-03-07 17:45:19,908][232226] Updated weights for policy 0, policy_version 70000 (0.0007) [2023-03-07 17:45:20,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 71682048. Throughput: 0: 12855.7. Samples: 71659620. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:45:20,070][231894] Avg episode reward: [(0, '189.129')] [2023-03-07 17:45:20,698][232226] Updated weights for policy 0, policy_version 70010 (0.0006) [2023-03-07 17:45:21,484][232226] Updated weights for policy 0, policy_version 70020 (0.0006) [2023-03-07 17:45:22,267][232226] Updated weights for policy 0, policy_version 70030 (0.0006) [2023-03-07 17:45:23,073][232226] Updated weights for policy 0, policy_version 70040 (0.0006) [2023-03-07 17:45:23,876][232226] Updated weights for policy 0, policy_version 70050 (0.0006) [2023-03-07 17:45:24,669][232226] Updated weights for policy 0, policy_version 70060 (0.0006) [2023-03-07 17:45:25,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 71746560. Throughput: 0: 12860.0. Samples: 71736757. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:45:25,069][231894] Avg episode reward: [(0, '194.093')] [2023-03-07 17:45:25,461][232226] Updated weights for policy 0, policy_version 70070 (0.0006) [2023-03-07 17:45:26,265][232226] Updated weights for policy 0, policy_version 70080 (0.0007) [2023-03-07 17:45:27,073][232226] Updated weights for policy 0, policy_version 70090 (0.0006) [2023-03-07 17:45:27,872][232226] Updated weights for policy 0, policy_version 70100 (0.0006) [2023-03-07 17:45:28,689][232226] Updated weights for policy 0, policy_version 70110 (0.0007) [2023-03-07 17:45:29,493][232226] Updated weights for policy 0, policy_version 70120 (0.0006) [2023-03-07 17:45:30,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12867.7). Total num frames: 71810048. Throughput: 0: 12863.7. Samples: 71775261. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:45:30,069][231894] Avg episode reward: [(0, '192.318')] [2023-03-07 17:45:30,294][232226] Updated weights for policy 0, policy_version 70130 (0.0006) [2023-03-07 17:45:31,093][232226] Updated weights for policy 0, policy_version 70140 (0.0006) [2023-03-07 17:45:31,872][232226] Updated weights for policy 0, policy_version 70150 (0.0007) [2023-03-07 17:45:32,664][232226] Updated weights for policy 0, policy_version 70160 (0.0006) [2023-03-07 17:45:33,450][232226] Updated weights for policy 0, policy_version 70170 (0.0006) [2023-03-07 17:45:34,251][232226] Updated weights for policy 0, policy_version 70180 (0.0006) [2023-03-07 17:45:35,054][232226] Updated weights for policy 0, policy_version 70190 (0.0006) [2023-03-07 17:45:35,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12851.2, 300 sec: 12867.7). Total num frames: 71874560. Throughput: 0: 12861.3. Samples: 71852193. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:45:35,069][231894] Avg episode reward: [(0, '195.401')] [2023-03-07 17:45:35,860][232226] Updated weights for policy 0, policy_version 70200 (0.0007) [2023-03-07 17:45:36,642][232226] Updated weights for policy 0, policy_version 70210 (0.0006) [2023-03-07 17:45:37,429][232226] Updated weights for policy 0, policy_version 70220 (0.0006) [2023-03-07 17:45:38,219][232226] Updated weights for policy 0, policy_version 70230 (0.0006) [2023-03-07 17:45:39,044][232226] Updated weights for policy 0, policy_version 70240 (0.0006) [2023-03-07 17:45:39,831][232226] Updated weights for policy 0, policy_version 70250 (0.0007) [2023-03-07 17:45:40,071][231894] Fps is (10 sec: 12899.2, 60 sec: 12867.7, 300 sec: 12867.6). Total num frames: 71939072. Throughput: 0: 12854.2. Samples: 71929330. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:45:40,072][231894] Avg episode reward: [(0, '201.847')] [2023-03-07 17:45:40,632][232226] Updated weights for policy 0, policy_version 70260 (0.0006) [2023-03-07 17:45:41,442][232226] Updated weights for policy 0, policy_version 70270 (0.0007) [2023-03-07 17:45:42,231][232226] Updated weights for policy 0, policy_version 70280 (0.0006) [2023-03-07 17:45:43,009][232226] Updated weights for policy 0, policy_version 70290 (0.0007) [2023-03-07 17:45:43,812][232226] Updated weights for policy 0, policy_version 70300 (0.0006) [2023-03-07 17:45:44,578][232226] Updated weights for policy 0, policy_version 70310 (0.0006) [2023-03-07 17:45:45,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12851.2, 300 sec: 12864.2). Total num frames: 72002560. Throughput: 0: 12853.9. Samples: 71967871. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:45:45,069][231894] Avg episode reward: [(0, '197.375')] [2023-03-07 17:45:45,388][232226] Updated weights for policy 0, policy_version 70320 (0.0006) [2023-03-07 17:45:46,193][232226] Updated weights for policy 0, policy_version 70330 (0.0007) [2023-03-07 17:45:46,989][232226] Updated weights for policy 0, policy_version 70340 (0.0007) [2023-03-07 17:45:47,778][232226] Updated weights for policy 0, policy_version 70350 (0.0006) [2023-03-07 17:45:48,579][232226] Updated weights for policy 0, policy_version 70360 (0.0006) [2023-03-07 17:45:49,383][232226] Updated weights for policy 0, policy_version 70370 (0.0007) [2023-03-07 17:45:50,069][231894] Fps is (10 sec: 12803.2, 60 sec: 12868.3, 300 sec: 12864.2). Total num frames: 72067072. Throughput: 0: 12854.9. Samples: 72045081. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:45:50,070][231894] Avg episode reward: [(0, '192.391')] [2023-03-07 17:45:50,196][232226] Updated weights for policy 0, policy_version 70380 (0.0006) [2023-03-07 17:45:51,006][232226] Updated weights for policy 0, policy_version 70390 (0.0006) [2023-03-07 17:45:51,801][232226] Updated weights for policy 0, policy_version 70400 (0.0006) [2023-03-07 17:45:52,595][232226] Updated weights for policy 0, policy_version 70410 (0.0006) [2023-03-07 17:45:53,382][232226] Updated weights for policy 0, policy_version 70420 (0.0006) [2023-03-07 17:45:54,189][232226] Updated weights for policy 0, policy_version 70430 (0.0008) [2023-03-07 17:45:54,969][232226] Updated weights for policy 0, policy_version 70440 (0.0006) [2023-03-07 17:45:55,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12864.2). Total num frames: 72131584. Throughput: 0: 12842.0. Samples: 72121893. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:45:55,069][231894] Avg episode reward: [(0, '188.747')] [2023-03-07 17:45:55,771][232226] Updated weights for policy 0, policy_version 70450 (0.0006) [2023-03-07 17:45:56,564][232226] Updated weights for policy 0, policy_version 70460 (0.0006) [2023-03-07 17:45:57,347][232226] Updated weights for policy 0, policy_version 70470 (0.0006) [2023-03-07 17:45:58,153][232226] Updated weights for policy 0, policy_version 70480 (0.0006) [2023-03-07 17:45:58,934][232226] Updated weights for policy 0, policy_version 70490 (0.0006) [2023-03-07 17:45:59,730][232226] Updated weights for policy 0, policy_version 70500 (0.0008) [2023-03-07 17:46:00,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12867.7). Total num frames: 72196096. Throughput: 0: 12843.6. Samples: 72160666. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:46:00,069][231894] Avg episode reward: [(0, '188.552')] [2023-03-07 17:46:00,536][232226] Updated weights for policy 0, policy_version 70510 (0.0006) [2023-03-07 17:46:01,314][232226] Updated weights for policy 0, policy_version 70520 (0.0006) [2023-03-07 17:46:02,123][232226] Updated weights for policy 0, policy_version 70530 (0.0007) [2023-03-07 17:46:02,926][232226] Updated weights for policy 0, policy_version 70540 (0.0006) [2023-03-07 17:46:03,706][232226] Updated weights for policy 0, policy_version 70550 (0.0007) [2023-03-07 17:46:04,515][232226] Updated weights for policy 0, policy_version 70560 (0.0006) [2023-03-07 17:46:05,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12867.7). Total num frames: 72260608. Throughput: 0: 12852.8. Samples: 72237995. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:46:05,069][231894] Avg episode reward: [(0, '183.624')] [2023-03-07 17:46:05,296][232226] Updated weights for policy 0, policy_version 70570 (0.0006) [2023-03-07 17:46:06,096][232226] Updated weights for policy 0, policy_version 70580 (0.0006) [2023-03-07 17:46:06,885][232226] Updated weights for policy 0, policy_version 70590 (0.0008) [2023-03-07 17:46:07,662][232226] Updated weights for policy 0, policy_version 70600 (0.0008) [2023-03-07 17:46:08,449][232226] Updated weights for policy 0, policy_version 70610 (0.0006) [2023-03-07 17:46:09,238][232226] Updated weights for policy 0, policy_version 70620 (0.0006) [2023-03-07 17:46:10,038][232226] Updated weights for policy 0, policy_version 70630 (0.0006) [2023-03-07 17:46:10,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 72325120. Throughput: 0: 12867.3. Samples: 72315788. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:46:10,069][231894] Avg episode reward: [(0, '200.094')] [2023-03-07 17:46:10,827][232226] Updated weights for policy 0, policy_version 70640 (0.0006) [2023-03-07 17:46:11,629][232226] Updated weights for policy 0, policy_version 70650 (0.0007) [2023-03-07 17:46:12,410][232226] Updated weights for policy 0, policy_version 70660 (0.0007) [2023-03-07 17:46:13,247][232226] Updated weights for policy 0, policy_version 70670 (0.0006) [2023-03-07 17:46:14,025][232226] Updated weights for policy 0, policy_version 70680 (0.0006) [2023-03-07 17:46:14,803][232226] Updated weights for policy 0, policy_version 70690 (0.0006) [2023-03-07 17:46:15,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 72389632. Throughput: 0: 12869.1. Samples: 72354368. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:46:15,069][231894] Avg episode reward: [(0, '193.414')] [2023-03-07 17:46:15,637][232226] Updated weights for policy 0, policy_version 70700 (0.0006) [2023-03-07 17:46:16,422][232226] Updated weights for policy 0, policy_version 70710 (0.0006) [2023-03-07 17:46:17,229][232226] Updated weights for policy 0, policy_version 70720 (0.0006) [2023-03-07 17:46:18,031][232226] Updated weights for policy 0, policy_version 70730 (0.0006) [2023-03-07 17:46:18,810][232226] Updated weights for policy 0, policy_version 70740 (0.0007) [2023-03-07 17:46:19,614][232226] Updated weights for policy 0, policy_version 70750 (0.0007) [2023-03-07 17:46:20,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12867.7). Total num frames: 72453120. Throughput: 0: 12868.2. Samples: 72431264. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:46:20,069][231894] Avg episode reward: [(0, '200.866')] [2023-03-07 17:46:20,407][232226] Updated weights for policy 0, policy_version 70760 (0.0006) [2023-03-07 17:46:21,200][232226] Updated weights for policy 0, policy_version 70770 (0.0007) [2023-03-07 17:46:22,005][232226] Updated weights for policy 0, policy_version 70780 (0.0006) [2023-03-07 17:46:22,803][232226] Updated weights for policy 0, policy_version 70790 (0.0007) [2023-03-07 17:46:23,598][232226] Updated weights for policy 0, policy_version 70800 (0.0006) [2023-03-07 17:46:24,382][232226] Updated weights for policy 0, policy_version 70810 (0.0006) [2023-03-07 17:46:25,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12867.7). Total num frames: 72517632. Throughput: 0: 12870.8. Samples: 72508484. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:46:25,069][231894] Avg episode reward: [(0, '189.615')] [2023-03-07 17:46:25,073][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000070818_72517632.pth... [2023-03-07 17:46:25,104][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000067802_69429248.pth [2023-03-07 17:46:25,181][232226] Updated weights for policy 0, policy_version 70820 (0.0008) [2023-03-07 17:46:25,989][232226] Updated weights for policy 0, policy_version 70830 (0.0007) [2023-03-07 17:46:26,780][232226] Updated weights for policy 0, policy_version 70840 (0.0006) [2023-03-07 17:46:27,590][232226] Updated weights for policy 0, policy_version 70850 (0.0006) [2023-03-07 17:46:28,374][232226] Updated weights for policy 0, policy_version 70860 (0.0006) [2023-03-07 17:46:29,168][232226] Updated weights for policy 0, policy_version 70870 (0.0007) [2023-03-07 17:46:29,974][232226] Updated weights for policy 0, policy_version 70880 (0.0006) [2023-03-07 17:46:30,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 72582144. Throughput: 0: 12872.9. Samples: 72547151. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:46:30,069][231894] Avg episode reward: [(0, '189.271')] [2023-03-07 17:46:30,755][232226] Updated weights for policy 0, policy_version 70890 (0.0007) [2023-03-07 17:46:31,561][232226] Updated weights for policy 0, policy_version 70900 (0.0007) [2023-03-07 17:46:32,347][232226] Updated weights for policy 0, policy_version 70910 (0.0006) [2023-03-07 17:46:33,158][232226] Updated weights for policy 0, policy_version 70920 (0.0006) [2023-03-07 17:46:33,948][232226] Updated weights for policy 0, policy_version 70930 (0.0006) [2023-03-07 17:46:34,746][232226] Updated weights for policy 0, policy_version 70940 (0.0008) [2023-03-07 17:46:35,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 72646656. Throughput: 0: 12865.0. Samples: 72624008. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:46:35,069][231894] Avg episode reward: [(0, '193.598')] [2023-03-07 17:46:35,545][232226] Updated weights for policy 0, policy_version 70950 (0.0007) [2023-03-07 17:46:36,341][232226] Updated weights for policy 0, policy_version 70960 (0.0006) [2023-03-07 17:46:37,137][232226] Updated weights for policy 0, policy_version 70970 (0.0006) [2023-03-07 17:46:37,941][232226] Updated weights for policy 0, policy_version 70980 (0.0006) [2023-03-07 17:46:38,742][232226] Updated weights for policy 0, policy_version 70990 (0.0006) [2023-03-07 17:46:39,530][232226] Updated weights for policy 0, policy_version 71000 (0.0007) [2023-03-07 17:46:40,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12851.7, 300 sec: 12867.7). Total num frames: 72710144. Throughput: 0: 12869.5. Samples: 72701020. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:46:40,069][231894] Avg episode reward: [(0, '190.986')] [2023-03-07 17:46:40,325][232226] Updated weights for policy 0, policy_version 71010 (0.0006) [2023-03-07 17:46:41,123][232226] Updated weights for policy 0, policy_version 71020 (0.0006) [2023-03-07 17:46:41,917][232226] Updated weights for policy 0, policy_version 71030 (0.0007) [2023-03-07 17:46:42,712][232226] Updated weights for policy 0, policy_version 71040 (0.0006) [2023-03-07 17:46:43,521][232226] Updated weights for policy 0, policy_version 71050 (0.0006) [2023-03-07 17:46:44,312][232226] Updated weights for policy 0, policy_version 71060 (0.0007) [2023-03-07 17:46:45,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12868.2, 300 sec: 12867.7). Total num frames: 72774656. Throughput: 0: 12871.3. Samples: 72739875. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:46:45,070][231894] Avg episode reward: [(0, '195.960')] [2023-03-07 17:46:45,121][232226] Updated weights for policy 0, policy_version 71070 (0.0007) [2023-03-07 17:46:45,900][232226] Updated weights for policy 0, policy_version 71080 (0.0006) [2023-03-07 17:46:46,700][232226] Updated weights for policy 0, policy_version 71090 (0.0006) [2023-03-07 17:46:47,514][232226] Updated weights for policy 0, policy_version 71100 (0.0006) [2023-03-07 17:46:48,302][232226] Updated weights for policy 0, policy_version 71110 (0.0006) [2023-03-07 17:46:49,081][232226] Updated weights for policy 0, policy_version 71120 (0.0006) [2023-03-07 17:46:49,899][232226] Updated weights for policy 0, policy_version 71130 (0.0006) [2023-03-07 17:46:50,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12867.7). Total num frames: 72838144. Throughput: 0: 12861.9. Samples: 72816779. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:46:50,069][231894] Avg episode reward: [(0, '194.414')] [2023-03-07 17:46:50,698][232226] Updated weights for policy 0, policy_version 71140 (0.0006) [2023-03-07 17:46:51,495][232226] Updated weights for policy 0, policy_version 71150 (0.0006) [2023-03-07 17:46:52,300][232226] Updated weights for policy 0, policy_version 71160 (0.0006) [2023-03-07 17:46:53,093][232226] Updated weights for policy 0, policy_version 71170 (0.0008) [2023-03-07 17:46:53,894][232226] Updated weights for policy 0, policy_version 71180 (0.0006) [2023-03-07 17:46:54,678][232226] Updated weights for policy 0, policy_version 71190 (0.0007) [2023-03-07 17:46:55,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12851.2, 300 sec: 12867.7). Total num frames: 72902656. Throughput: 0: 12841.3. Samples: 72893647. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:46:55,069][231894] Avg episode reward: [(0, '198.430')] [2023-03-07 17:46:55,486][232226] Updated weights for policy 0, policy_version 71200 (0.0006) [2023-03-07 17:46:56,288][232226] Updated weights for policy 0, policy_version 71210 (0.0007) [2023-03-07 17:46:57,084][232226] Updated weights for policy 0, policy_version 71220 (0.0007) [2023-03-07 17:46:57,901][232226] Updated weights for policy 0, policy_version 71230 (0.0006) [2023-03-07 17:46:58,690][232226] Updated weights for policy 0, policy_version 71240 (0.0006) [2023-03-07 17:46:59,472][232226] Updated weights for policy 0, policy_version 71250 (0.0006) [2023-03-07 17:47:00,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12867.7). Total num frames: 72967168. Throughput: 0: 12838.0. Samples: 72932075. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:47:00,069][231894] Avg episode reward: [(0, '194.937')] [2023-03-07 17:47:00,287][232226] Updated weights for policy 0, policy_version 71260 (0.0007) [2023-03-07 17:47:01,089][232226] Updated weights for policy 0, policy_version 71270 (0.0006) [2023-03-07 17:47:01,872][232226] Updated weights for policy 0, policy_version 71280 (0.0007) [2023-03-07 17:47:02,679][232226] Updated weights for policy 0, policy_version 71290 (0.0006) [2023-03-07 17:47:03,477][232226] Updated weights for policy 0, policy_version 71300 (0.0006) [2023-03-07 17:47:04,253][232226] Updated weights for policy 0, policy_version 71310 (0.0006) [2023-03-07 17:47:05,058][232226] Updated weights for policy 0, policy_version 71320 (0.0007) [2023-03-07 17:47:05,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12851.2, 300 sec: 12867.7). Total num frames: 73031680. Throughput: 0: 12841.8. Samples: 73009146. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:47:05,070][231894] Avg episode reward: [(0, '190.404')] [2023-03-07 17:47:05,841][232226] Updated weights for policy 0, policy_version 71330 (0.0006) [2023-03-07 17:47:06,653][232226] Updated weights for policy 0, policy_version 71340 (0.0007) [2023-03-07 17:47:07,441][232226] Updated weights for policy 0, policy_version 71350 (0.0007) [2023-03-07 17:47:08,225][232226] Updated weights for policy 0, policy_version 71360 (0.0006) [2023-03-07 17:47:09,022][232226] Updated weights for policy 0, policy_version 71370 (0.0007) [2023-03-07 17:47:09,844][232226] Updated weights for policy 0, policy_version 71380 (0.0008) [2023-03-07 17:47:10,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12834.2, 300 sec: 12864.2). Total num frames: 73095168. Throughput: 0: 12840.2. Samples: 73086294. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:47:10,069][231894] Avg episode reward: [(0, '201.544')] [2023-03-07 17:47:10,635][232226] Updated weights for policy 0, policy_version 71390 (0.0006) [2023-03-07 17:47:11,457][232226] Updated weights for policy 0, policy_version 71400 (0.0007) [2023-03-07 17:47:12,271][232226] Updated weights for policy 0, policy_version 71410 (0.0007) [2023-03-07 17:47:13,073][232226] Updated weights for policy 0, policy_version 71420 (0.0008) [2023-03-07 17:47:13,859][232226] Updated weights for policy 0, policy_version 71430 (0.0006) [2023-03-07 17:47:14,636][232226] Updated weights for policy 0, policy_version 71440 (0.0006) [2023-03-07 17:47:15,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12834.1, 300 sec: 12864.2). Total num frames: 73159680. Throughput: 0: 12830.8. Samples: 73124537. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:47:15,069][231894] Avg episode reward: [(0, '188.021')] [2023-03-07 17:47:15,442][232226] Updated weights for policy 0, policy_version 71450 (0.0006) [2023-03-07 17:47:16,238][232226] Updated weights for policy 0, policy_version 71460 (0.0006) [2023-03-07 17:47:17,027][232226] Updated weights for policy 0, policy_version 71470 (0.0006) [2023-03-07 17:47:17,815][232226] Updated weights for policy 0, policy_version 71480 (0.0007) [2023-03-07 17:47:18,601][232226] Updated weights for policy 0, policy_version 71490 (0.0007) [2023-03-07 17:47:19,406][232226] Updated weights for policy 0, policy_version 71500 (0.0007) [2023-03-07 17:47:20,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12864.2). Total num frames: 73224192. Throughput: 0: 12840.6. Samples: 73201835. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:47:20,069][231894] Avg episode reward: [(0, '190.395')] [2023-03-07 17:47:20,216][232226] Updated weights for policy 0, policy_version 71510 (0.0006) [2023-03-07 17:47:21,016][232226] Updated weights for policy 0, policy_version 71520 (0.0007) [2023-03-07 17:47:21,811][232226] Updated weights for policy 0, policy_version 71530 (0.0006) [2023-03-07 17:47:22,586][232226] Updated weights for policy 0, policy_version 71540 (0.0006) [2023-03-07 17:47:23,392][232226] Updated weights for policy 0, policy_version 71550 (0.0007) [2023-03-07 17:47:24,186][232226] Updated weights for policy 0, policy_version 71560 (0.0006) [2023-03-07 17:47:24,971][232226] Updated weights for policy 0, policy_version 71570 (0.0006) [2023-03-07 17:47:25,069][231894] Fps is (10 sec: 12902.2, 60 sec: 12851.2, 300 sec: 12864.2). Total num frames: 73288704. Throughput: 0: 12849.5. Samples: 73279248. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:47:25,069][231894] Avg episode reward: [(0, '190.804')] [2023-03-07 17:47:25,761][232226] Updated weights for policy 0, policy_version 71580 (0.0007) [2023-03-07 17:47:26,558][232226] Updated weights for policy 0, policy_version 71590 (0.0006) [2023-03-07 17:47:27,335][232226] Updated weights for policy 0, policy_version 71600 (0.0007) [2023-03-07 17:47:28,149][232226] Updated weights for policy 0, policy_version 71610 (0.0007) [2023-03-07 17:47:28,915][232226] Updated weights for policy 0, policy_version 71620 (0.0006) [2023-03-07 17:47:29,709][232226] Updated weights for policy 0, policy_version 71630 (0.0006) [2023-03-07 17:47:30,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12867.7). Total num frames: 73353216. Throughput: 0: 12844.8. Samples: 73317888. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:47:30,069][231894] Avg episode reward: [(0, '189.564')] [2023-03-07 17:47:30,509][232226] Updated weights for policy 0, policy_version 71640 (0.0007) [2023-03-07 17:47:31,303][232226] Updated weights for policy 0, policy_version 71650 (0.0006) [2023-03-07 17:47:32,093][232226] Updated weights for policy 0, policy_version 71660 (0.0007) [2023-03-07 17:47:32,905][232226] Updated weights for policy 0, policy_version 71670 (0.0006) [2023-03-07 17:47:33,682][232226] Updated weights for policy 0, policy_version 71680 (0.0006) [2023-03-07 17:47:34,505][232226] Updated weights for policy 0, policy_version 71690 (0.0006) [2023-03-07 17:47:35,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12851.2, 300 sec: 12867.7). Total num frames: 73417728. Throughput: 0: 12858.4. Samples: 73395405. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:47:35,069][231894] Avg episode reward: [(0, '196.346')] [2023-03-07 17:47:35,312][232226] Updated weights for policy 0, policy_version 71700 (0.0007) [2023-03-07 17:47:36,102][232226] Updated weights for policy 0, policy_version 71710 (0.0006) [2023-03-07 17:47:36,898][232226] Updated weights for policy 0, policy_version 71720 (0.0006) [2023-03-07 17:47:37,699][232226] Updated weights for policy 0, policy_version 71730 (0.0006) [2023-03-07 17:47:38,517][232226] Updated weights for policy 0, policy_version 71740 (0.0006) [2023-03-07 17:47:39,319][232226] Updated weights for policy 0, policy_version 71750 (0.0007) [2023-03-07 17:47:40,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12864.2). Total num frames: 73481216. Throughput: 0: 12852.9. Samples: 73472029. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:47:40,069][231894] Avg episode reward: [(0, '189.434')] [2023-03-07 17:47:40,113][232226] Updated weights for policy 0, policy_version 71760 (0.0007) [2023-03-07 17:47:40,884][232226] Updated weights for policy 0, policy_version 71770 (0.0007) [2023-03-07 17:47:41,706][232226] Updated weights for policy 0, policy_version 71780 (0.0007) [2023-03-07 17:47:42,490][232226] Updated weights for policy 0, policy_version 71790 (0.0006) [2023-03-07 17:47:43,292][232226] Updated weights for policy 0, policy_version 71800 (0.0006) [2023-03-07 17:47:44,083][232226] Updated weights for policy 0, policy_version 71810 (0.0006) [2023-03-07 17:47:44,879][232226] Updated weights for policy 0, policy_version 71820 (0.0007) [2023-03-07 17:47:45,069][231894] Fps is (10 sec: 12799.8, 60 sec: 12851.2, 300 sec: 12867.7). Total num frames: 73545728. Throughput: 0: 12855.8. Samples: 73510587. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:47:45,069][231894] Avg episode reward: [(0, '184.827')] [2023-03-07 17:47:45,685][232226] Updated weights for policy 0, policy_version 71830 (0.0007) [2023-03-07 17:47:46,456][232226] Updated weights for policy 0, policy_version 71840 (0.0007) [2023-03-07 17:47:47,258][232226] Updated weights for policy 0, policy_version 71850 (0.0006) [2023-03-07 17:47:48,062][232226] Updated weights for policy 0, policy_version 71860 (0.0005) [2023-03-07 17:47:48,860][232226] Updated weights for policy 0, policy_version 71870 (0.0006) [2023-03-07 17:47:49,650][232226] Updated weights for policy 0, policy_version 71880 (0.0006) [2023-03-07 17:47:50,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12867.7). Total num frames: 73610240. Throughput: 0: 12862.8. Samples: 73587973. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:47:50,069][231894] Avg episode reward: [(0, '194.059')] [2023-03-07 17:47:50,448][232226] Updated weights for policy 0, policy_version 71890 (0.0007) [2023-03-07 17:47:51,253][232226] Updated weights for policy 0, policy_version 71900 (0.0007) [2023-03-07 17:47:52,035][232226] Updated weights for policy 0, policy_version 71910 (0.0007) [2023-03-07 17:47:52,834][232226] Updated weights for policy 0, policy_version 71920 (0.0007) [2023-03-07 17:47:53,635][232226] Updated weights for policy 0, policy_version 71930 (0.0006) [2023-03-07 17:47:54,426][232226] Updated weights for policy 0, policy_version 71940 (0.0006) [2023-03-07 17:47:55,069][231894] Fps is (10 sec: 12800.2, 60 sec: 12851.2, 300 sec: 12864.2). Total num frames: 73673728. Throughput: 0: 12856.9. Samples: 73664855. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 17:47:55,070][231894] Avg episode reward: [(0, '195.699')] [2023-03-07 17:47:55,235][232226] Updated weights for policy 0, policy_version 71950 (0.0007) [2023-03-07 17:47:56,037][232226] Updated weights for policy 0, policy_version 71960 (0.0007) [2023-03-07 17:47:56,847][232226] Updated weights for policy 0, policy_version 71970 (0.0006) [2023-03-07 17:47:57,650][232226] Updated weights for policy 0, policy_version 71980 (0.0006) [2023-03-07 17:47:58,423][232226] Updated weights for policy 0, policy_version 71990 (0.0006) [2023-03-07 17:47:59,221][232226] Updated weights for policy 0, policy_version 72000 (0.0007) [2023-03-07 17:48:00,020][232226] Updated weights for policy 0, policy_version 72010 (0.0006) [2023-03-07 17:48:00,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12864.2). Total num frames: 73738240. Throughput: 0: 12860.4. Samples: 73703254. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 17:48:00,069][231894] Avg episode reward: [(0, '188.091')] [2023-03-07 17:48:00,803][232226] Updated weights for policy 0, policy_version 72020 (0.0007) [2023-03-07 17:48:01,602][232226] Updated weights for policy 0, policy_version 72030 (0.0006) [2023-03-07 17:48:02,400][232226] Updated weights for policy 0, policy_version 72040 (0.0006) [2023-03-07 17:48:03,217][232226] Updated weights for policy 0, policy_version 72050 (0.0007) [2023-03-07 17:48:03,998][232226] Updated weights for policy 0, policy_version 72060 (0.0007) [2023-03-07 17:48:04,801][232226] Updated weights for policy 0, policy_version 72070 (0.0006) [2023-03-07 17:48:05,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12867.7). Total num frames: 73802752. Throughput: 0: 12856.2. Samples: 73780364. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 17:48:05,069][231894] Avg episode reward: [(0, '193.957')] [2023-03-07 17:48:05,585][232226] Updated weights for policy 0, policy_version 72080 (0.0007) [2023-03-07 17:48:06,385][232226] Updated weights for policy 0, policy_version 72090 (0.0006) [2023-03-07 17:48:07,188][232226] Updated weights for policy 0, policy_version 72100 (0.0006) [2023-03-07 17:48:07,966][232226] Updated weights for policy 0, policy_version 72110 (0.0006) [2023-03-07 17:48:08,773][232226] Updated weights for policy 0, policy_version 72120 (0.0006) [2023-03-07 17:48:09,562][232226] Updated weights for policy 0, policy_version 72130 (0.0006) [2023-03-07 17:48:10,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12864.2). Total num frames: 73867264. Throughput: 0: 12857.3. Samples: 73857827. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 17:48:10,070][231894] Avg episode reward: [(0, '191.446')] [2023-03-07 17:48:10,358][232226] Updated weights for policy 0, policy_version 72140 (0.0005) [2023-03-07 17:48:11,129][232226] Updated weights for policy 0, policy_version 72150 (0.0006) [2023-03-07 17:48:11,933][232226] Updated weights for policy 0, policy_version 72160 (0.0007) [2023-03-07 17:48:12,718][232226] Updated weights for policy 0, policy_version 72170 (0.0007) [2023-03-07 17:48:13,516][232226] Updated weights for policy 0, policy_version 72180 (0.0007) [2023-03-07 17:48:14,322][232226] Updated weights for policy 0, policy_version 72190 (0.0006) [2023-03-07 17:48:15,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12867.7). Total num frames: 73931776. Throughput: 0: 12861.6. Samples: 73896659. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 17:48:15,069][231894] Avg episode reward: [(0, '194.602')] [2023-03-07 17:48:15,107][232226] Updated weights for policy 0, policy_version 72200 (0.0006) [2023-03-07 17:48:15,904][232226] Updated weights for policy 0, policy_version 72210 (0.0006) [2023-03-07 17:48:16,714][232226] Updated weights for policy 0, policy_version 72220 (0.0006) [2023-03-07 17:48:17,489][232226] Updated weights for policy 0, policy_version 72230 (0.0006) [2023-03-07 17:48:18,290][232226] Updated weights for policy 0, policy_version 72240 (0.0006) [2023-03-07 17:48:19,085][232226] Updated weights for policy 0, policy_version 72250 (0.0007) [2023-03-07 17:48:19,891][232226] Updated weights for policy 0, policy_version 72260 (0.0007) [2023-03-07 17:48:20,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12867.7). Total num frames: 73996288. Throughput: 0: 12859.0. Samples: 73974059. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 17:48:20,069][231894] Avg episode reward: [(0, '188.773')] [2023-03-07 17:48:20,677][232226] Updated weights for policy 0, policy_version 72270 (0.0006) [2023-03-07 17:48:21,462][232226] Updated weights for policy 0, policy_version 72280 (0.0007) [2023-03-07 17:48:22,269][232226] Updated weights for policy 0, policy_version 72290 (0.0006) [2023-03-07 17:48:23,064][232226] Updated weights for policy 0, policy_version 72300 (0.0007) [2023-03-07 17:48:23,853][232226] Updated weights for policy 0, policy_version 72310 (0.0006) [2023-03-07 17:48:24,666][232226] Updated weights for policy 0, policy_version 72320 (0.0006) [2023-03-07 17:48:25,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12860.7). Total num frames: 74059776. Throughput: 0: 12866.9. Samples: 74051041. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 17:48:25,080][231894] Avg episode reward: [(0, '193.996')] [2023-03-07 17:48:25,085][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000072325_74060800.pth... [2023-03-07 17:48:25,117][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000069311_70974464.pth [2023-03-07 17:48:25,447][232226] Updated weights for policy 0, policy_version 72330 (0.0007) [2023-03-07 17:48:26,243][232226] Updated weights for policy 0, policy_version 72340 (0.0006) [2023-03-07 17:48:27,057][232226] Updated weights for policy 0, policy_version 72350 (0.0006) [2023-03-07 17:48:27,842][232226] Updated weights for policy 0, policy_version 72360 (0.0008) [2023-03-07 17:48:28,639][232226] Updated weights for policy 0, policy_version 72370 (0.0006) [2023-03-07 17:48:29,428][232226] Updated weights for policy 0, policy_version 72380 (0.0006) [2023-03-07 17:48:30,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12860.7). Total num frames: 74124288. Throughput: 0: 12867.6. Samples: 74089626. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 17:48:30,069][231894] Avg episode reward: [(0, '198.226')] [2023-03-07 17:48:30,238][232226] Updated weights for policy 0, policy_version 72390 (0.0007) [2023-03-07 17:48:31,061][232226] Updated weights for policy 0, policy_version 72400 (0.0006) [2023-03-07 17:48:31,860][232226] Updated weights for policy 0, policy_version 72410 (0.0006) [2023-03-07 17:48:32,632][232226] Updated weights for policy 0, policy_version 72420 (0.0007) [2023-03-07 17:48:33,442][232226] Updated weights for policy 0, policy_version 72430 (0.0007) [2023-03-07 17:48:34,212][232226] Updated weights for policy 0, policy_version 72440 (0.0006) [2023-03-07 17:48:35,000][232226] Updated weights for policy 0, policy_version 72450 (0.0006) [2023-03-07 17:48:35,069][231894] Fps is (10 sec: 13004.8, 60 sec: 12868.2, 300 sec: 12864.2). Total num frames: 74189824. Throughput: 0: 12864.5. Samples: 74166876. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:48:35,070][231894] Avg episode reward: [(0, '193.737')] [2023-03-07 17:48:35,794][232226] Updated weights for policy 0, policy_version 72460 (0.0006) [2023-03-07 17:48:36,581][232226] Updated weights for policy 0, policy_version 72470 (0.0006) [2023-03-07 17:48:37,377][232226] Updated weights for policy 0, policy_version 72480 (0.0007) [2023-03-07 17:48:38,173][232226] Updated weights for policy 0, policy_version 72490 (0.0007) [2023-03-07 17:48:38,953][232226] Updated weights for policy 0, policy_version 72500 (0.0007) [2023-03-07 17:48:39,746][232226] Updated weights for policy 0, policy_version 72510 (0.0006) [2023-03-07 17:48:40,069][231894] Fps is (10 sec: 13004.7, 60 sec: 12885.3, 300 sec: 12864.2). Total num frames: 74254336. Throughput: 0: 12880.7. Samples: 74244485. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:48:40,069][231894] Avg episode reward: [(0, '195.663')] [2023-03-07 17:48:40,537][232226] Updated weights for policy 0, policy_version 72520 (0.0007) [2023-03-07 17:48:41,339][232226] Updated weights for policy 0, policy_version 72530 (0.0006) [2023-03-07 17:48:42,125][232226] Updated weights for policy 0, policy_version 72540 (0.0006) [2023-03-07 17:48:42,930][232226] Updated weights for policy 0, policy_version 72550 (0.0006) [2023-03-07 17:48:43,715][232226] Updated weights for policy 0, policy_version 72560 (0.0006) [2023-03-07 17:48:44,522][232226] Updated weights for policy 0, policy_version 72570 (0.0006) [2023-03-07 17:48:45,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12860.7). Total num frames: 74317824. Throughput: 0: 12889.4. Samples: 74283278. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:48:45,070][231894] Avg episode reward: [(0, '190.893')] [2023-03-07 17:48:45,310][232226] Updated weights for policy 0, policy_version 72580 (0.0006) [2023-03-07 17:48:46,105][232226] Updated weights for policy 0, policy_version 72590 (0.0006) [2023-03-07 17:48:46,918][232226] Updated weights for policy 0, policy_version 72600 (0.0006) [2023-03-07 17:48:47,698][232226] Updated weights for policy 0, policy_version 72610 (0.0006) [2023-03-07 17:48:48,489][232226] Updated weights for policy 0, policy_version 72620 (0.0006) [2023-03-07 17:48:49,297][232226] Updated weights for policy 0, policy_version 72630 (0.0006) [2023-03-07 17:48:50,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12868.3, 300 sec: 12860.7). Total num frames: 74382336. Throughput: 0: 12889.1. Samples: 74360375. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:48:50,069][231894] Avg episode reward: [(0, '190.672')] [2023-03-07 17:48:50,074][232226] Updated weights for policy 0, policy_version 72640 (0.0006) [2023-03-07 17:48:50,880][232226] Updated weights for policy 0, policy_version 72650 (0.0007) [2023-03-07 17:48:51,705][232226] Updated weights for policy 0, policy_version 72660 (0.0006) [2023-03-07 17:48:52,492][232226] Updated weights for policy 0, policy_version 72670 (0.0007) [2023-03-07 17:48:53,289][232226] Updated weights for policy 0, policy_version 72680 (0.0007) [2023-03-07 17:48:54,094][232226] Updated weights for policy 0, policy_version 72690 (0.0007) [2023-03-07 17:48:54,895][232226] Updated weights for policy 0, policy_version 72700 (0.0007) [2023-03-07 17:48:55,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12885.3, 300 sec: 12860.7). Total num frames: 74446848. Throughput: 0: 12877.6. Samples: 74437321. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:48:55,069][231894] Avg episode reward: [(0, '195.293')] [2023-03-07 17:48:55,679][232226] Updated weights for policy 0, policy_version 72710 (0.0006) [2023-03-07 17:48:56,477][232226] Updated weights for policy 0, policy_version 72720 (0.0006) [2023-03-07 17:48:57,276][232226] Updated weights for policy 0, policy_version 72730 (0.0005) [2023-03-07 17:48:58,076][232226] Updated weights for policy 0, policy_version 72740 (0.0006) [2023-03-07 17:48:58,869][232226] Updated weights for policy 0, policy_version 72750 (0.0005) [2023-03-07 17:48:59,674][232226] Updated weights for policy 0, policy_version 72760 (0.0006) [2023-03-07 17:49:00,069][231894] Fps is (10 sec: 12902.2, 60 sec: 12885.3, 300 sec: 12864.2). Total num frames: 74511360. Throughput: 0: 12872.9. Samples: 74475941. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:49:00,070][231894] Avg episode reward: [(0, '196.107')] [2023-03-07 17:49:00,456][232226] Updated weights for policy 0, policy_version 72770 (0.0006) [2023-03-07 17:49:01,247][232226] Updated weights for policy 0, policy_version 72780 (0.0006) [2023-03-07 17:49:02,051][232226] Updated weights for policy 0, policy_version 72790 (0.0006) [2023-03-07 17:49:02,842][232226] Updated weights for policy 0, policy_version 72800 (0.0006) [2023-03-07 17:49:03,634][232226] Updated weights for policy 0, policy_version 72810 (0.0005) [2023-03-07 17:49:04,447][232226] Updated weights for policy 0, policy_version 72820 (0.0006) [2023-03-07 17:49:05,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12868.3, 300 sec: 12860.7). Total num frames: 74574848. Throughput: 0: 12871.3. Samples: 74553269. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:49:05,069][231894] Avg episode reward: [(0, '200.482')] [2023-03-07 17:49:05,253][232226] Updated weights for policy 0, policy_version 72830 (0.0006) [2023-03-07 17:49:06,042][232226] Updated weights for policy 0, policy_version 72840 (0.0005) [2023-03-07 17:49:06,830][232226] Updated weights for policy 0, policy_version 72850 (0.0007) [2023-03-07 17:49:07,606][232226] Updated weights for policy 0, policy_version 72860 (0.0007) [2023-03-07 17:49:08,387][232226] Updated weights for policy 0, policy_version 72870 (0.0006) [2023-03-07 17:49:09,204][232226] Updated weights for policy 0, policy_version 72880 (0.0007) [2023-03-07 17:49:10,005][232226] Updated weights for policy 0, policy_version 72890 (0.0006) [2023-03-07 17:49:10,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.2, 300 sec: 12860.7). Total num frames: 74639360. Throughput: 0: 12874.1. Samples: 74630378. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:49:10,069][231894] Avg episode reward: [(0, '196.919')] [2023-03-07 17:49:10,799][232226] Updated weights for policy 0, policy_version 72900 (0.0006) [2023-03-07 17:49:11,599][232226] Updated weights for policy 0, policy_version 72910 (0.0006) [2023-03-07 17:49:12,410][232226] Updated weights for policy 0, policy_version 72920 (0.0006) [2023-03-07 17:49:13,190][232226] Updated weights for policy 0, policy_version 72930 (0.0006) [2023-03-07 17:49:13,990][232226] Updated weights for policy 0, policy_version 72940 (0.0007) [2023-03-07 17:49:14,786][232226] Updated weights for policy 0, policy_version 72950 (0.0007) [2023-03-07 17:49:15,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12860.7). Total num frames: 74703872. Throughput: 0: 12875.5. Samples: 74669026. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:49:15,069][231894] Avg episode reward: [(0, '192.189')] [2023-03-07 17:49:15,590][232226] Updated weights for policy 0, policy_version 72960 (0.0006) [2023-03-07 17:49:16,374][232226] Updated weights for policy 0, policy_version 72970 (0.0006) [2023-03-07 17:49:17,173][232226] Updated weights for policy 0, policy_version 72980 (0.0006) [2023-03-07 17:49:17,969][232226] Updated weights for policy 0, policy_version 72990 (0.0005) [2023-03-07 17:49:18,755][232226] Updated weights for policy 0, policy_version 73000 (0.0005) [2023-03-07 17:49:19,548][232226] Updated weights for policy 0, policy_version 73010 (0.0007) [2023-03-07 17:49:20,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12860.7). Total num frames: 74768384. Throughput: 0: 12874.7. Samples: 74746239. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:49:20,069][231894] Avg episode reward: [(0, '191.154')] [2023-03-07 17:49:20,344][232226] Updated weights for policy 0, policy_version 73020 (0.0007) [2023-03-07 17:49:21,128][232226] Updated weights for policy 0, policy_version 73030 (0.0008) [2023-03-07 17:49:21,937][232226] Updated weights for policy 0, policy_version 73040 (0.0006) [2023-03-07 17:49:22,720][232226] Updated weights for policy 0, policy_version 73050 (0.0006) [2023-03-07 17:49:23,520][232226] Updated weights for policy 0, policy_version 73060 (0.0007) [2023-03-07 17:49:24,309][232226] Updated weights for policy 0, policy_version 73070 (0.0006) [2023-03-07 17:49:25,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12860.7). Total num frames: 74832896. Throughput: 0: 12871.4. Samples: 74823698. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:49:25,069][231894] Avg episode reward: [(0, '200.063')] [2023-03-07 17:49:25,098][232226] Updated weights for policy 0, policy_version 73080 (0.0006) [2023-03-07 17:49:25,881][232226] Updated weights for policy 0, policy_version 73090 (0.0006) [2023-03-07 17:49:26,690][232226] Updated weights for policy 0, policy_version 73100 (0.0006) [2023-03-07 17:49:27,467][232226] Updated weights for policy 0, policy_version 73110 (0.0007) [2023-03-07 17:49:28,258][232226] Updated weights for policy 0, policy_version 73120 (0.0007) [2023-03-07 17:49:29,046][232226] Updated weights for policy 0, policy_version 73130 (0.0006) [2023-03-07 17:49:29,835][232226] Updated weights for policy 0, policy_version 73140 (0.0006) [2023-03-07 17:49:30,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12860.7). Total num frames: 74897408. Throughput: 0: 12872.3. Samples: 74862531. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:49:30,070][231894] Avg episode reward: [(0, '198.368')] [2023-03-07 17:49:30,642][232226] Updated weights for policy 0, policy_version 73150 (0.0006) [2023-03-07 17:49:31,428][232226] Updated weights for policy 0, policy_version 73160 (0.0006) [2023-03-07 17:49:32,233][232226] Updated weights for policy 0, policy_version 73170 (0.0006) [2023-03-07 17:49:33,034][232226] Updated weights for policy 0, policy_version 73180 (0.0007) [2023-03-07 17:49:33,831][232226] Updated weights for policy 0, policy_version 73190 (0.0006) [2023-03-07 17:49:34,624][232226] Updated weights for policy 0, policy_version 73200 (0.0006) [2023-03-07 17:49:35,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12864.2). Total num frames: 74961920. Throughput: 0: 12879.5. Samples: 74939955. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:49:35,069][231894] Avg episode reward: [(0, '194.792')] [2023-03-07 17:49:35,422][232226] Updated weights for policy 0, policy_version 73210 (0.0006) [2023-03-07 17:49:36,238][232226] Updated weights for policy 0, policy_version 73220 (0.0006) [2023-03-07 17:49:36,997][232226] Updated weights for policy 0, policy_version 73230 (0.0007) [2023-03-07 17:49:37,805][232226] Updated weights for policy 0, policy_version 73240 (0.0006) [2023-03-07 17:49:38,600][232226] Updated weights for policy 0, policy_version 73250 (0.0006) [2023-03-07 17:49:39,385][232226] Updated weights for policy 0, policy_version 73260 (0.0006) [2023-03-07 17:49:40,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12864.2). Total num frames: 75026432. Throughput: 0: 12888.4. Samples: 75017298. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:49:40,069][231894] Avg episode reward: [(0, '183.195')] [2023-03-07 17:49:40,189][232226] Updated weights for policy 0, policy_version 73270 (0.0006) [2023-03-07 17:49:40,991][232226] Updated weights for policy 0, policy_version 73280 (0.0007) [2023-03-07 17:49:41,781][232226] Updated weights for policy 0, policy_version 73290 (0.0007) [2023-03-07 17:49:42,575][232226] Updated weights for policy 0, policy_version 73300 (0.0007) [2023-03-07 17:49:43,364][232226] Updated weights for policy 0, policy_version 73310 (0.0006) [2023-03-07 17:49:44,153][232226] Updated weights for policy 0, policy_version 73320 (0.0007) [2023-03-07 17:49:44,974][232226] Updated weights for policy 0, policy_version 73330 (0.0007) [2023-03-07 17:49:45,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.4, 300 sec: 12867.7). Total num frames: 75090944. Throughput: 0: 12886.2. Samples: 75055819. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:49:45,069][231894] Avg episode reward: [(0, '199.195')] [2023-03-07 17:49:45,762][232226] Updated weights for policy 0, policy_version 73340 (0.0006) [2023-03-07 17:49:46,567][232226] Updated weights for policy 0, policy_version 73350 (0.0005) [2023-03-07 17:49:47,351][232226] Updated weights for policy 0, policy_version 73360 (0.0006) [2023-03-07 17:49:48,142][232226] Updated weights for policy 0, policy_version 73370 (0.0007) [2023-03-07 17:49:48,937][232226] Updated weights for policy 0, policy_version 73380 (0.0006) [2023-03-07 17:49:49,721][232226] Updated weights for policy 0, policy_version 73390 (0.0006) [2023-03-07 17:49:50,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12864.2). Total num frames: 75155456. Throughput: 0: 12883.9. Samples: 75133044. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:49:50,069][231894] Avg episode reward: [(0, '193.718')] [2023-03-07 17:49:50,518][232226] Updated weights for policy 0, policy_version 73400 (0.0008) [2023-03-07 17:49:51,301][232226] Updated weights for policy 0, policy_version 73410 (0.0006) [2023-03-07 17:49:52,093][232226] Updated weights for policy 0, policy_version 73420 (0.0006) [2023-03-07 17:49:52,884][232226] Updated weights for policy 0, policy_version 73430 (0.0005) [2023-03-07 17:49:53,672][232226] Updated weights for policy 0, policy_version 73440 (0.0008) [2023-03-07 17:49:54,482][232226] Updated weights for policy 0, policy_version 73450 (0.0006) [2023-03-07 17:49:55,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12864.2). Total num frames: 75219968. Throughput: 0: 12893.2. Samples: 75210571. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:49:55,069][231894] Avg episode reward: [(0, '193.152')] [2023-03-07 17:49:55,272][232226] Updated weights for policy 0, policy_version 73460 (0.0006) [2023-03-07 17:49:56,057][232226] Updated weights for policy 0, policy_version 73470 (0.0006) [2023-03-07 17:49:56,855][232226] Updated weights for policy 0, policy_version 73480 (0.0007) [2023-03-07 17:49:57,658][232226] Updated weights for policy 0, policy_version 73490 (0.0006) [2023-03-07 17:49:58,468][232226] Updated weights for policy 0, policy_version 73500 (0.0007) [2023-03-07 17:49:59,266][232226] Updated weights for policy 0, policy_version 73510 (0.0006) [2023-03-07 17:50:00,050][232226] Updated weights for policy 0, policy_version 73520 (0.0006) [2023-03-07 17:50:00,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12867.7). Total num frames: 75284480. Throughput: 0: 12895.0. Samples: 75249303. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:50:00,069][231894] Avg episode reward: [(0, '200.347')] [2023-03-07 17:50:00,843][232226] Updated weights for policy 0, policy_version 73530 (0.0007) [2023-03-07 17:50:01,645][232226] Updated weights for policy 0, policy_version 73540 (0.0007) [2023-03-07 17:50:02,431][232226] Updated weights for policy 0, policy_version 73550 (0.0006) [2023-03-07 17:50:03,236][232226] Updated weights for policy 0, policy_version 73560 (0.0008) [2023-03-07 17:50:04,026][232226] Updated weights for policy 0, policy_version 73570 (0.0006) [2023-03-07 17:50:04,821][232226] Updated weights for policy 0, policy_version 73580 (0.0006) [2023-03-07 17:50:05,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12867.7). Total num frames: 75348992. Throughput: 0: 12892.0. Samples: 75326380. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:50:05,070][231894] Avg episode reward: [(0, '188.096')] [2023-03-07 17:50:05,626][232226] Updated weights for policy 0, policy_version 73590 (0.0006) [2023-03-07 17:50:06,427][232226] Updated weights for policy 0, policy_version 73600 (0.0007) [2023-03-07 17:50:07,211][232226] Updated weights for policy 0, policy_version 73610 (0.0007) [2023-03-07 17:50:08,010][232226] Updated weights for policy 0, policy_version 73620 (0.0006) [2023-03-07 17:50:08,794][232226] Updated weights for policy 0, policy_version 73630 (0.0007) [2023-03-07 17:50:09,573][232226] Updated weights for policy 0, policy_version 73640 (0.0007) [2023-03-07 17:50:10,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12867.7). Total num frames: 75413504. Throughput: 0: 12892.0. Samples: 75403837. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:50:10,069][231894] Avg episode reward: [(0, '195.805')] [2023-03-07 17:50:10,385][232226] Updated weights for policy 0, policy_version 73650 (0.0007) [2023-03-07 17:50:11,173][232226] Updated weights for policy 0, policy_version 73660 (0.0006) [2023-03-07 17:50:11,957][232226] Updated weights for policy 0, policy_version 73670 (0.0006) [2023-03-07 17:50:12,748][232226] Updated weights for policy 0, policy_version 73680 (0.0006) [2023-03-07 17:50:13,556][232226] Updated weights for policy 0, policy_version 73690 (0.0006) [2023-03-07 17:50:14,347][232226] Updated weights for policy 0, policy_version 73700 (0.0006) [2023-03-07 17:50:15,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12867.7). Total num frames: 75478016. Throughput: 0: 12888.8. Samples: 75442527. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:50:15,069][231894] Avg episode reward: [(0, '196.987')] [2023-03-07 17:50:15,150][232226] Updated weights for policy 0, policy_version 73710 (0.0006) [2023-03-07 17:50:15,962][232226] Updated weights for policy 0, policy_version 73720 (0.0005) [2023-03-07 17:50:16,765][232226] Updated weights for policy 0, policy_version 73730 (0.0006) [2023-03-07 17:50:17,562][232226] Updated weights for policy 0, policy_version 73740 (0.0006) [2023-03-07 17:50:18,365][232226] Updated weights for policy 0, policy_version 73750 (0.0006) [2023-03-07 17:50:19,156][232226] Updated weights for policy 0, policy_version 73760 (0.0006) [2023-03-07 17:50:19,957][232226] Updated weights for policy 0, policy_version 73770 (0.0006) [2023-03-07 17:50:20,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12885.3, 300 sec: 12864.2). Total num frames: 75541504. Throughput: 0: 12877.6. Samples: 75519448. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:50:20,069][231894] Avg episode reward: [(0, '202.954')] [2023-03-07 17:50:20,762][232226] Updated weights for policy 0, policy_version 73780 (0.0006) [2023-03-07 17:50:21,553][232226] Updated weights for policy 0, policy_version 73790 (0.0006) [2023-03-07 17:50:22,332][232226] Updated weights for policy 0, policy_version 73800 (0.0006) [2023-03-07 17:50:23,132][232226] Updated weights for policy 0, policy_version 73810 (0.0007) [2023-03-07 17:50:23,928][232226] Updated weights for policy 0, policy_version 73820 (0.0006) [2023-03-07 17:50:24,731][232226] Updated weights for policy 0, policy_version 73830 (0.0005) [2023-03-07 17:50:25,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12885.3, 300 sec: 12867.7). Total num frames: 75606016. Throughput: 0: 12872.4. Samples: 75596556. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:50:25,069][231894] Avg episode reward: [(0, '195.678')] [2023-03-07 17:50:25,073][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000073834_75606016.pth... [2023-03-07 17:50:25,103][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000070818_72517632.pth [2023-03-07 17:50:25,533][232226] Updated weights for policy 0, policy_version 73840 (0.0007) [2023-03-07 17:50:26,327][232226] Updated weights for policy 0, policy_version 73850 (0.0006) [2023-03-07 17:50:27,113][232226] Updated weights for policy 0, policy_version 73860 (0.0006) [2023-03-07 17:50:27,922][232226] Updated weights for policy 0, policy_version 73870 (0.0007) [2023-03-07 17:50:28,708][232226] Updated weights for policy 0, policy_version 73880 (0.0006) [2023-03-07 17:50:29,502][232226] Updated weights for policy 0, policy_version 73890 (0.0007) [2023-03-07 17:50:30,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12885.4, 300 sec: 12867.7). Total num frames: 75670528. Throughput: 0: 12872.6. Samples: 75635084. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:50:30,069][231894] Avg episode reward: [(0, '197.049')] [2023-03-07 17:50:30,309][232226] Updated weights for policy 0, policy_version 73900 (0.0006) [2023-03-07 17:50:31,099][232226] Updated weights for policy 0, policy_version 73910 (0.0006) [2023-03-07 17:50:31,907][232226] Updated weights for policy 0, policy_version 73920 (0.0006) [2023-03-07 17:50:32,718][232226] Updated weights for policy 0, policy_version 73930 (0.0006) [2023-03-07 17:50:33,523][232226] Updated weights for policy 0, policy_version 73940 (0.0007) [2023-03-07 17:50:34,308][232226] Updated weights for policy 0, policy_version 73950 (0.0006) [2023-03-07 17:50:35,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12864.3). Total num frames: 75734016. Throughput: 0: 12865.3. Samples: 75711983. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:50:35,069][231894] Avg episode reward: [(0, '201.502')] [2023-03-07 17:50:35,106][232226] Updated weights for policy 0, policy_version 73960 (0.0006) [2023-03-07 17:50:35,907][232226] Updated weights for policy 0, policy_version 73970 (0.0006) [2023-03-07 17:50:36,697][232226] Updated weights for policy 0, policy_version 73980 (0.0006) [2023-03-07 17:50:37,503][232226] Updated weights for policy 0, policy_version 73990 (0.0007) [2023-03-07 17:50:38,281][232226] Updated weights for policy 0, policy_version 74000 (0.0007) [2023-03-07 17:50:39,087][232226] Updated weights for policy 0, policy_version 74010 (0.0006) [2023-03-07 17:50:39,877][232226] Updated weights for policy 0, policy_version 74020 (0.0006) [2023-03-07 17:50:40,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12868.3, 300 sec: 12867.7). Total num frames: 75798528. Throughput: 0: 12855.8. Samples: 75789082. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:50:40,069][231894] Avg episode reward: [(0, '184.489')] [2023-03-07 17:50:40,666][232226] Updated weights for policy 0, policy_version 74030 (0.0006) [2023-03-07 17:50:41,450][232226] Updated weights for policy 0, policy_version 74040 (0.0006) [2023-03-07 17:50:42,255][232226] Updated weights for policy 0, policy_version 74050 (0.0006) [2023-03-07 17:50:43,048][232226] Updated weights for policy 0, policy_version 74060 (0.0007) [2023-03-07 17:50:43,854][232226] Updated weights for policy 0, policy_version 74070 (0.0007) [2023-03-07 17:50:44,649][232226] Updated weights for policy 0, policy_version 74080 (0.0007) [2023-03-07 17:50:45,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12867.7). Total num frames: 75863040. Throughput: 0: 12859.1. Samples: 75827963. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:50:45,069][231894] Avg episode reward: [(0, '192.778')] [2023-03-07 17:50:45,441][232226] Updated weights for policy 0, policy_version 74090 (0.0007) [2023-03-07 17:50:46,254][232226] Updated weights for policy 0, policy_version 74100 (0.0006) [2023-03-07 17:50:47,045][232226] Updated weights for policy 0, policy_version 74110 (0.0006) [2023-03-07 17:50:47,849][232226] Updated weights for policy 0, policy_version 74120 (0.0007) [2023-03-07 17:50:48,638][232226] Updated weights for policy 0, policy_version 74130 (0.0006) [2023-03-07 17:50:49,441][232226] Updated weights for policy 0, policy_version 74140 (0.0007) [2023-03-07 17:50:50,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12851.2, 300 sec: 12864.2). Total num frames: 75926528. Throughput: 0: 12858.3. Samples: 75905001. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:50:50,069][231894] Avg episode reward: [(0, '191.722')] [2023-03-07 17:50:50,240][232226] Updated weights for policy 0, policy_version 74150 (0.0006) [2023-03-07 17:50:51,044][232226] Updated weights for policy 0, policy_version 74160 (0.0007) [2023-03-07 17:50:51,824][232226] Updated weights for policy 0, policy_version 74170 (0.0007) [2023-03-07 17:50:52,636][232226] Updated weights for policy 0, policy_version 74180 (0.0006) [2023-03-07 17:50:53,420][232226] Updated weights for policy 0, policy_version 74190 (0.0007) [2023-03-07 17:50:54,210][232226] Updated weights for policy 0, policy_version 74200 (0.0007) [2023-03-07 17:50:54,998][232226] Updated weights for policy 0, policy_version 74210 (0.0006) [2023-03-07 17:50:55,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12867.7). Total num frames: 75992064. Throughput: 0: 12850.7. Samples: 75982118. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:50:55,069][231894] Avg episode reward: [(0, '198.950')] [2023-03-07 17:50:55,786][232226] Updated weights for policy 0, policy_version 74220 (0.0006) [2023-03-07 17:50:56,586][232226] Updated weights for policy 0, policy_version 74230 (0.0007) [2023-03-07 17:50:57,373][232226] Updated weights for policy 0, policy_version 74240 (0.0006) [2023-03-07 17:50:58,189][232226] Updated weights for policy 0, policy_version 74250 (0.0006) [2023-03-07 17:50:58,993][232226] Updated weights for policy 0, policy_version 74260 (0.0006) [2023-03-07 17:50:59,774][232226] Updated weights for policy 0, policy_version 74270 (0.0006) [2023-03-07 17:51:00,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12851.2, 300 sec: 12864.2). Total num frames: 76055552. Throughput: 0: 12851.6. Samples: 76020849. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:51:00,069][231894] Avg episode reward: [(0, '192.675')] [2023-03-07 17:51:00,581][232226] Updated weights for policy 0, policy_version 74280 (0.0006) [2023-03-07 17:51:01,361][232226] Updated weights for policy 0, policy_version 74290 (0.0006) [2023-03-07 17:51:02,153][232226] Updated weights for policy 0, policy_version 74300 (0.0007) [2023-03-07 17:51:02,956][232226] Updated weights for policy 0, policy_version 74310 (0.0008) [2023-03-07 17:51:03,759][232226] Updated weights for policy 0, policy_version 74320 (0.0007) [2023-03-07 17:51:04,556][232226] Updated weights for policy 0, policy_version 74330 (0.0006) [2023-03-07 17:51:05,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12851.2, 300 sec: 12864.2). Total num frames: 76120064. Throughput: 0: 12851.3. Samples: 76097755. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:51:05,069][231894] Avg episode reward: [(0, '192.985')] [2023-03-07 17:51:05,344][232226] Updated weights for policy 0, policy_version 74340 (0.0006) [2023-03-07 17:51:06,135][232226] Updated weights for policy 0, policy_version 74350 (0.0007) [2023-03-07 17:51:06,923][232226] Updated weights for policy 0, policy_version 74360 (0.0006) [2023-03-07 17:51:07,725][232226] Updated weights for policy 0, policy_version 74370 (0.0006) [2023-03-07 17:51:08,543][232226] Updated weights for policy 0, policy_version 74380 (0.0006) [2023-03-07 17:51:09,332][232226] Updated weights for policy 0, policy_version 74390 (0.0007) [2023-03-07 17:51:10,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12851.2, 300 sec: 12864.2). Total num frames: 76184576. Throughput: 0: 12856.5. Samples: 76175099. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:51:10,069][231894] Avg episode reward: [(0, '194.942')] [2023-03-07 17:51:10,115][232226] Updated weights for policy 0, policy_version 74400 (0.0007) [2023-03-07 17:51:10,918][232226] Updated weights for policy 0, policy_version 74410 (0.0006) [2023-03-07 17:51:11,693][232226] Updated weights for policy 0, policy_version 74420 (0.0006) [2023-03-07 17:51:12,505][232226] Updated weights for policy 0, policy_version 74430 (0.0006) [2023-03-07 17:51:13,312][232226] Updated weights for policy 0, policy_version 74440 (0.0007) [2023-03-07 17:51:14,099][232226] Updated weights for policy 0, policy_version 74450 (0.0007) [2023-03-07 17:51:14,896][232226] Updated weights for policy 0, policy_version 74460 (0.0007) [2023-03-07 17:51:15,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12867.7). Total num frames: 76249088. Throughput: 0: 12858.3. Samples: 76213711. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:51:15,069][231894] Avg episode reward: [(0, '194.240')] [2023-03-07 17:51:15,700][232226] Updated weights for policy 0, policy_version 74470 (0.0007) [2023-03-07 17:51:16,510][232226] Updated weights for policy 0, policy_version 74480 (0.0007) [2023-03-07 17:51:17,299][232226] Updated weights for policy 0, policy_version 74490 (0.0006) [2023-03-07 17:51:18,095][232226] Updated weights for policy 0, policy_version 74500 (0.0006) [2023-03-07 17:51:18,885][232226] Updated weights for policy 0, policy_version 74510 (0.0006) [2023-03-07 17:51:19,694][232226] Updated weights for policy 0, policy_version 74520 (0.0006) [2023-03-07 17:51:20,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12864.2). Total num frames: 76312576. Throughput: 0: 12860.2. Samples: 76290690. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:51:20,069][231894] Avg episode reward: [(0, '188.333')] [2023-03-07 17:51:20,525][232226] Updated weights for policy 0, policy_version 74530 (0.0006) [2023-03-07 17:51:21,309][232226] Updated weights for policy 0, policy_version 74540 (0.0006) [2023-03-07 17:51:22,090][232226] Updated weights for policy 0, policy_version 74550 (0.0006) [2023-03-07 17:51:22,918][232226] Updated weights for policy 0, policy_version 74560 (0.0006) [2023-03-07 17:51:23,719][232226] Updated weights for policy 0, policy_version 74570 (0.0006) [2023-03-07 17:51:24,505][232226] Updated weights for policy 0, policy_version 74580 (0.0006) [2023-03-07 17:51:25,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12864.2). Total num frames: 76377088. Throughput: 0: 12851.4. Samples: 76367394. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:51:25,069][231894] Avg episode reward: [(0, '189.891')] [2023-03-07 17:51:25,330][232226] Updated weights for policy 0, policy_version 74590 (0.0007) [2023-03-07 17:51:26,113][232226] Updated weights for policy 0, policy_version 74600 (0.0007) [2023-03-07 17:51:26,928][232226] Updated weights for policy 0, policy_version 74610 (0.0006) [2023-03-07 17:51:27,711][232226] Updated weights for policy 0, policy_version 74620 (0.0006) [2023-03-07 17:51:28,497][232226] Updated weights for policy 0, policy_version 74630 (0.0006) [2023-03-07 17:51:29,281][232226] Updated weights for policy 0, policy_version 74640 (0.0007) [2023-03-07 17:51:30,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12834.1, 300 sec: 12860.7). Total num frames: 76440576. Throughput: 0: 12841.8. Samples: 76405844. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:51:30,080][231894] Avg episode reward: [(0, '194.702')] [2023-03-07 17:51:30,089][232226] Updated weights for policy 0, policy_version 74650 (0.0006) [2023-03-07 17:51:30,883][232226] Updated weights for policy 0, policy_version 74660 (0.0006) [2023-03-07 17:51:31,666][232226] Updated weights for policy 0, policy_version 74670 (0.0006) [2023-03-07 17:51:32,457][232226] Updated weights for policy 0, policy_version 74680 (0.0006) [2023-03-07 17:51:33,262][232226] Updated weights for policy 0, policy_version 74690 (0.0007) [2023-03-07 17:51:34,059][232226] Updated weights for policy 0, policy_version 74700 (0.0007) [2023-03-07 17:51:34,863][232226] Updated weights for policy 0, policy_version 74710 (0.0006) [2023-03-07 17:51:35,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12851.2, 300 sec: 12864.2). Total num frames: 76505088. Throughput: 0: 12849.4. Samples: 76483227. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 17:51:35,080][231894] Avg episode reward: [(0, '192.090')] [2023-03-07 17:51:35,635][232226] Updated weights for policy 0, policy_version 74720 (0.0006) [2023-03-07 17:51:36,435][232226] Updated weights for policy 0, policy_version 74730 (0.0007) [2023-03-07 17:51:37,227][232226] Updated weights for policy 0, policy_version 74740 (0.0006) [2023-03-07 17:51:38,021][232226] Updated weights for policy 0, policy_version 74750 (0.0006) [2023-03-07 17:51:38,805][232226] Updated weights for policy 0, policy_version 74760 (0.0007) [2023-03-07 17:51:39,614][232226] Updated weights for policy 0, policy_version 74770 (0.0006) [2023-03-07 17:51:40,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12864.2). Total num frames: 76569600. Throughput: 0: 12857.7. Samples: 76560717. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 17:51:40,069][231894] Avg episode reward: [(0, '201.465')] [2023-03-07 17:51:40,388][232226] Updated weights for policy 0, policy_version 74780 (0.0006) [2023-03-07 17:51:41,177][232226] Updated weights for policy 0, policy_version 74790 (0.0006) [2023-03-07 17:51:41,974][232226] Updated weights for policy 0, policy_version 74800 (0.0007) [2023-03-07 17:51:42,763][232226] Updated weights for policy 0, policy_version 74810 (0.0006) [2023-03-07 17:51:43,560][232226] Updated weights for policy 0, policy_version 74820 (0.0006) [2023-03-07 17:51:44,353][232226] Updated weights for policy 0, policy_version 74830 (0.0006) [2023-03-07 17:51:45,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12867.7). Total num frames: 76634112. Throughput: 0: 12858.5. Samples: 76599481. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 17:51:45,080][231894] Avg episode reward: [(0, '187.096')] [2023-03-07 17:51:45,168][232226] Updated weights for policy 0, policy_version 74840 (0.0006) [2023-03-07 17:51:45,982][232226] Updated weights for policy 0, policy_version 74850 (0.0007) [2023-03-07 17:51:46,770][232226] Updated weights for policy 0, policy_version 74860 (0.0006) [2023-03-07 17:51:47,558][232226] Updated weights for policy 0, policy_version 74870 (0.0007) [2023-03-07 17:51:48,339][232226] Updated weights for policy 0, policy_version 74880 (0.0006) [2023-03-07 17:51:49,138][232226] Updated weights for policy 0, policy_version 74890 (0.0006) [2023-03-07 17:51:49,938][232226] Updated weights for policy 0, policy_version 74900 (0.0006) [2023-03-07 17:51:50,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12867.7). Total num frames: 76698624. Throughput: 0: 12864.0. Samples: 76676633. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 17:51:50,080][231894] Avg episode reward: [(0, '195.881')] [2023-03-07 17:51:50,741][232226] Updated weights for policy 0, policy_version 74910 (0.0006) [2023-03-07 17:51:51,518][232226] Updated weights for policy 0, policy_version 74920 (0.0006) [2023-03-07 17:51:52,321][232226] Updated weights for policy 0, policy_version 74930 (0.0006) [2023-03-07 17:51:53,118][232226] Updated weights for policy 0, policy_version 74940 (0.0008) [2023-03-07 17:51:53,902][232226] Updated weights for policy 0, policy_version 74950 (0.0006) [2023-03-07 17:51:54,705][232226] Updated weights for policy 0, policy_version 74960 (0.0006) [2023-03-07 17:51:55,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12867.7). Total num frames: 76763136. Throughput: 0: 12866.6. Samples: 76754097. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 17:51:55,080][231894] Avg episode reward: [(0, '194.409')] [2023-03-07 17:51:55,504][232226] Updated weights for policy 0, policy_version 74970 (0.0006) [2023-03-07 17:51:56,297][232226] Updated weights for policy 0, policy_version 74980 (0.0007) [2023-03-07 17:51:57,105][232226] Updated weights for policy 0, policy_version 74990 (0.0007) [2023-03-07 17:51:57,880][232226] Updated weights for policy 0, policy_version 75000 (0.0006) [2023-03-07 17:51:58,698][232226] Updated weights for policy 0, policy_version 75010 (0.0006) [2023-03-07 17:51:59,474][232226] Updated weights for policy 0, policy_version 75020 (0.0006) [2023-03-07 17:52:00,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12867.7). Total num frames: 76827648. Throughput: 0: 12863.3. Samples: 76792561. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 17:52:00,080][231894] Avg episode reward: [(0, '191.263')] [2023-03-07 17:52:00,271][232226] Updated weights for policy 0, policy_version 75030 (0.0006) [2023-03-07 17:52:01,042][232226] Updated weights for policy 0, policy_version 75040 (0.0006) [2023-03-07 17:52:01,840][232226] Updated weights for policy 0, policy_version 75050 (0.0006) [2023-03-07 17:52:02,648][232226] Updated weights for policy 0, policy_version 75060 (0.0006) [2023-03-07 17:52:03,441][232226] Updated weights for policy 0, policy_version 75070 (0.0006) [2023-03-07 17:52:04,236][232226] Updated weights for policy 0, policy_version 75080 (0.0006) [2023-03-07 17:52:05,031][232226] Updated weights for policy 0, policy_version 75090 (0.0006) [2023-03-07 17:52:05,069][231894] Fps is (10 sec: 12902.2, 60 sec: 12868.2, 300 sec: 12871.2). Total num frames: 76892160. Throughput: 0: 12873.2. Samples: 76869986. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 17:52:05,070][231894] Avg episode reward: [(0, '194.776')] [2023-03-07 17:52:05,844][232226] Updated weights for policy 0, policy_version 75100 (0.0006) [2023-03-07 17:52:06,618][232226] Updated weights for policy 0, policy_version 75110 (0.0007) [2023-03-07 17:52:07,416][232226] Updated weights for policy 0, policy_version 75120 (0.0006) [2023-03-07 17:52:08,208][232226] Updated weights for policy 0, policy_version 75130 (0.0006) [2023-03-07 17:52:09,014][232226] Updated weights for policy 0, policy_version 75140 (0.0006) [2023-03-07 17:52:09,806][232226] Updated weights for policy 0, policy_version 75150 (0.0006) [2023-03-07 17:52:10,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 76956672. Throughput: 0: 12883.5. Samples: 76947151. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 17:52:10,069][231894] Avg episode reward: [(0, '196.201')] [2023-03-07 17:52:10,607][232226] Updated weights for policy 0, policy_version 75160 (0.0007) [2023-03-07 17:52:11,390][232226] Updated weights for policy 0, policy_version 75170 (0.0007) [2023-03-07 17:52:12,186][232226] Updated weights for policy 0, policy_version 75180 (0.0006) [2023-03-07 17:52:12,983][232226] Updated weights for policy 0, policy_version 75190 (0.0006) [2023-03-07 17:52:13,779][232226] Updated weights for policy 0, policy_version 75200 (0.0007) [2023-03-07 17:52:14,566][232226] Updated weights for policy 0, policy_version 75210 (0.0006) [2023-03-07 17:52:15,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 77021184. Throughput: 0: 12889.9. Samples: 76985891. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 17:52:15,069][231894] Avg episode reward: [(0, '188.321')] [2023-03-07 17:52:15,350][232226] Updated weights for policy 0, policy_version 75220 (0.0007) [2023-03-07 17:52:16,166][232226] Updated weights for policy 0, policy_version 75230 (0.0006) [2023-03-07 17:52:16,949][232226] Updated weights for policy 0, policy_version 75240 (0.0006) [2023-03-07 17:52:17,751][232226] Updated weights for policy 0, policy_version 75250 (0.0006) [2023-03-07 17:52:18,528][232226] Updated weights for policy 0, policy_version 75260 (0.0006) [2023-03-07 17:52:19,317][232226] Updated weights for policy 0, policy_version 75270 (0.0005) [2023-03-07 17:52:20,069][231894] Fps is (10 sec: 12902.2, 60 sec: 12885.3, 300 sec: 12871.2). Total num frames: 77085696. Throughput: 0: 12891.6. Samples: 77063349. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:52:20,070][231894] Avg episode reward: [(0, '194.710')] [2023-03-07 17:52:20,125][232226] Updated weights for policy 0, policy_version 75280 (0.0007) [2023-03-07 17:52:20,923][232226] Updated weights for policy 0, policy_version 75290 (0.0006) [2023-03-07 17:52:21,714][232226] Updated weights for policy 0, policy_version 75300 (0.0007) [2023-03-07 17:52:22,519][232226] Updated weights for policy 0, policy_version 75310 (0.0007) [2023-03-07 17:52:23,310][232226] Updated weights for policy 0, policy_version 75320 (0.0007) [2023-03-07 17:52:24,101][232226] Updated weights for policy 0, policy_version 75330 (0.0006) [2023-03-07 17:52:24,905][232226] Updated weights for policy 0, policy_version 75340 (0.0007) [2023-03-07 17:52:25,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12871.2). Total num frames: 77150208. Throughput: 0: 12881.6. Samples: 77140387. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:52:25,069][231894] Avg episode reward: [(0, '191.624')] [2023-03-07 17:52:25,072][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000075342_77150208.pth... [2023-03-07 17:52:25,104][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000072325_74060800.pth [2023-03-07 17:52:25,705][232226] Updated weights for policy 0, policy_version 75350 (0.0006) [2023-03-07 17:52:26,503][232226] Updated weights for policy 0, policy_version 75360 (0.0006) [2023-03-07 17:52:27,309][232226] Updated weights for policy 0, policy_version 75370 (0.0007) [2023-03-07 17:52:28,095][232226] Updated weights for policy 0, policy_version 75380 (0.0006) [2023-03-07 17:52:28,899][232226] Updated weights for policy 0, policy_version 75390 (0.0006) [2023-03-07 17:52:29,696][232226] Updated weights for policy 0, policy_version 75400 (0.0006) [2023-03-07 17:52:30,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12885.3, 300 sec: 12867.7). Total num frames: 77213696. Throughput: 0: 12877.4. Samples: 77178962. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:52:30,069][231894] Avg episode reward: [(0, '195.643')] [2023-03-07 17:52:30,479][232226] Updated weights for policy 0, policy_version 75410 (0.0007) [2023-03-07 17:52:31,269][232226] Updated weights for policy 0, policy_version 75420 (0.0006) [2023-03-07 17:52:32,076][232226] Updated weights for policy 0, policy_version 75430 (0.0006) [2023-03-07 17:52:32,861][232226] Updated weights for policy 0, policy_version 75440 (0.0007) [2023-03-07 17:52:33,659][232226] Updated weights for policy 0, policy_version 75450 (0.0006) [2023-03-07 17:52:34,457][232226] Updated weights for policy 0, policy_version 75460 (0.0006) [2023-03-07 17:52:35,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12885.3, 300 sec: 12871.2). Total num frames: 77278208. Throughput: 0: 12881.2. Samples: 77256290. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:52:35,069][231894] Avg episode reward: [(0, '200.854')] [2023-03-07 17:52:35,236][232226] Updated weights for policy 0, policy_version 75470 (0.0006) [2023-03-07 17:52:36,038][232226] Updated weights for policy 0, policy_version 75480 (0.0006) [2023-03-07 17:52:36,842][232226] Updated weights for policy 0, policy_version 75490 (0.0005) [2023-03-07 17:52:37,640][232226] Updated weights for policy 0, policy_version 75500 (0.0006) [2023-03-07 17:52:38,425][232226] Updated weights for policy 0, policy_version 75510 (0.0006) [2023-03-07 17:52:39,218][232226] Updated weights for policy 0, policy_version 75520 (0.0006) [2023-03-07 17:52:40,006][232226] Updated weights for policy 0, policy_version 75530 (0.0006) [2023-03-07 17:52:40,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12871.2). Total num frames: 77342720. Throughput: 0: 12877.1. Samples: 77333565. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:52:40,069][231894] Avg episode reward: [(0, '194.742')] [2023-03-07 17:52:40,810][232226] Updated weights for policy 0, policy_version 75540 (0.0007) [2023-03-07 17:52:41,616][232226] Updated weights for policy 0, policy_version 75550 (0.0007) [2023-03-07 17:52:42,416][232226] Updated weights for policy 0, policy_version 75560 (0.0006) [2023-03-07 17:52:43,241][232226] Updated weights for policy 0, policy_version 75570 (0.0006) [2023-03-07 17:52:44,019][232226] Updated weights for policy 0, policy_version 75580 (0.0007) [2023-03-07 17:52:44,818][232226] Updated weights for policy 0, policy_version 75590 (0.0006) [2023-03-07 17:52:45,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12871.2). Total num frames: 77407232. Throughput: 0: 12878.2. Samples: 77372078. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:52:45,069][231894] Avg episode reward: [(0, '198.389')] [2023-03-07 17:52:45,617][232226] Updated weights for policy 0, policy_version 75600 (0.0006) [2023-03-07 17:52:46,407][232226] Updated weights for policy 0, policy_version 75610 (0.0007) [2023-03-07 17:52:47,194][232226] Updated weights for policy 0, policy_version 75620 (0.0006) [2023-03-07 17:52:48,011][232226] Updated weights for policy 0, policy_version 75630 (0.0006) [2023-03-07 17:52:48,805][232226] Updated weights for policy 0, policy_version 75640 (0.0006) [2023-03-07 17:52:49,591][232226] Updated weights for policy 0, policy_version 75650 (0.0007) [2023-03-07 17:52:50,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12874.6). Total num frames: 77471744. Throughput: 0: 12869.1. Samples: 77449095. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:52:50,069][231894] Avg episode reward: [(0, '194.913')] [2023-03-07 17:52:50,380][232226] Updated weights for policy 0, policy_version 75660 (0.0006) [2023-03-07 17:52:51,188][232226] Updated weights for policy 0, policy_version 75670 (0.0006) [2023-03-07 17:52:51,989][232226] Updated weights for policy 0, policy_version 75680 (0.0007) [2023-03-07 17:52:52,778][232226] Updated weights for policy 0, policy_version 75690 (0.0006) [2023-03-07 17:52:53,570][232226] Updated weights for policy 0, policy_version 75700 (0.0006) [2023-03-07 17:52:54,353][232226] Updated weights for policy 0, policy_version 75710 (0.0006) [2023-03-07 17:52:55,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12868.2, 300 sec: 12871.2). Total num frames: 77535232. Throughput: 0: 12871.6. Samples: 77526375. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:52:55,070][231894] Avg episode reward: [(0, '196.321')] [2023-03-07 17:52:55,161][232226] Updated weights for policy 0, policy_version 75720 (0.0006) [2023-03-07 17:52:55,978][232226] Updated weights for policy 0, policy_version 75730 (0.0006) [2023-03-07 17:52:56,765][232226] Updated weights for policy 0, policy_version 75740 (0.0007) [2023-03-07 17:52:57,538][232226] Updated weights for policy 0, policy_version 75750 (0.0006) [2023-03-07 17:52:58,336][232226] Updated weights for policy 0, policy_version 75760 (0.0006) [2023-03-07 17:52:59,124][232226] Updated weights for policy 0, policy_version 75770 (0.0006) [2023-03-07 17:52:59,886][232226] Updated weights for policy 0, policy_version 75780 (0.0006) [2023-03-07 17:53:00,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12874.6). Total num frames: 77600768. Throughput: 0: 12866.4. Samples: 77564879. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:53:00,069][231894] Avg episode reward: [(0, '189.690')] [2023-03-07 17:53:00,703][232226] Updated weights for policy 0, policy_version 75790 (0.0008) [2023-03-07 17:53:01,519][232226] Updated weights for policy 0, policy_version 75800 (0.0007) [2023-03-07 17:53:02,300][232226] Updated weights for policy 0, policy_version 75810 (0.0007) [2023-03-07 17:53:03,089][232226] Updated weights for policy 0, policy_version 75820 (0.0006) [2023-03-07 17:53:03,889][232226] Updated weights for policy 0, policy_version 75830 (0.0007) [2023-03-07 17:53:04,680][232226] Updated weights for policy 0, policy_version 75840 (0.0006) [2023-03-07 17:53:05,069][231894] Fps is (10 sec: 13005.0, 60 sec: 12885.4, 300 sec: 12874.6). Total num frames: 77665280. Throughput: 0: 12872.9. Samples: 77642628. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:53:05,069][231894] Avg episode reward: [(0, '198.235')] [2023-03-07 17:53:05,469][232226] Updated weights for policy 0, policy_version 75850 (0.0006) [2023-03-07 17:53:06,273][232226] Updated weights for policy 0, policy_version 75860 (0.0006) [2023-03-07 17:53:07,063][232226] Updated weights for policy 0, policy_version 75870 (0.0006) [2023-03-07 17:53:07,865][232226] Updated weights for policy 0, policy_version 75880 (0.0006) [2023-03-07 17:53:08,655][232226] Updated weights for policy 0, policy_version 75890 (0.0007) [2023-03-07 17:53:09,429][232226] Updated weights for policy 0, policy_version 75900 (0.0006) [2023-03-07 17:53:10,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12874.6). Total num frames: 77729792. Throughput: 0: 12875.9. Samples: 77719804. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:53:10,069][231894] Avg episode reward: [(0, '196.683')] [2023-03-07 17:53:10,223][232226] Updated weights for policy 0, policy_version 75910 (0.0007) [2023-03-07 17:53:11,022][232226] Updated weights for policy 0, policy_version 75920 (0.0006) [2023-03-07 17:53:11,814][232226] Updated weights for policy 0, policy_version 75930 (0.0006) [2023-03-07 17:53:12,619][232226] Updated weights for policy 0, policy_version 75940 (0.0006) [2023-03-07 17:53:13,417][232226] Updated weights for policy 0, policy_version 75950 (0.0007) [2023-03-07 17:53:14,211][232226] Updated weights for policy 0, policy_version 75960 (0.0006) [2023-03-07 17:53:15,018][232226] Updated weights for policy 0, policy_version 75970 (0.0007) [2023-03-07 17:53:15,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 77793280. Throughput: 0: 12877.2. Samples: 77758435. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:53:15,069][231894] Avg episode reward: [(0, '199.691')] [2023-03-07 17:53:15,816][232226] Updated weights for policy 0, policy_version 75980 (0.0006) [2023-03-07 17:53:16,622][232226] Updated weights for policy 0, policy_version 75990 (0.0006) [2023-03-07 17:53:17,417][232226] Updated weights for policy 0, policy_version 76000 (0.0007) [2023-03-07 17:53:18,205][232226] Updated weights for policy 0, policy_version 76010 (0.0006) [2023-03-07 17:53:19,005][232226] Updated weights for policy 0, policy_version 76020 (0.0007) [2023-03-07 17:53:19,807][232226] Updated weights for policy 0, policy_version 76030 (0.0006) [2023-03-07 17:53:20,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 77857792. Throughput: 0: 12875.3. Samples: 77835678. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:53:20,070][231894] Avg episode reward: [(0, '194.163')] [2023-03-07 17:53:20,590][232226] Updated weights for policy 0, policy_version 76040 (0.0007) [2023-03-07 17:53:21,398][232226] Updated weights for policy 0, policy_version 76050 (0.0006) [2023-03-07 17:53:22,189][232226] Updated weights for policy 0, policy_version 76060 (0.0007) [2023-03-07 17:53:22,978][232226] Updated weights for policy 0, policy_version 76070 (0.0006) [2023-03-07 17:53:23,786][232226] Updated weights for policy 0, policy_version 76080 (0.0006) [2023-03-07 17:53:24,584][232226] Updated weights for policy 0, policy_version 76090 (0.0007) [2023-03-07 17:53:25,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 77922304. Throughput: 0: 12871.0. Samples: 77912760. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:53:25,069][231894] Avg episode reward: [(0, '181.785')] [2023-03-07 17:53:25,370][232226] Updated weights for policy 0, policy_version 76100 (0.0006) [2023-03-07 17:53:26,185][232226] Updated weights for policy 0, policy_version 76110 (0.0006) [2023-03-07 17:53:26,959][232226] Updated weights for policy 0, policy_version 76120 (0.0007) [2023-03-07 17:53:27,771][232226] Updated weights for policy 0, policy_version 76130 (0.0006) [2023-03-07 17:53:28,561][232226] Updated weights for policy 0, policy_version 76140 (0.0006) [2023-03-07 17:53:29,361][232226] Updated weights for policy 0, policy_version 76150 (0.0007) [2023-03-07 17:53:30,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12871.2). Total num frames: 77986816. Throughput: 0: 12870.4. Samples: 77951245. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:53:30,069][231894] Avg episode reward: [(0, '185.983')] [2023-03-07 17:53:30,159][232226] Updated weights for policy 0, policy_version 76160 (0.0006) [2023-03-07 17:53:30,946][232226] Updated weights for policy 0, policy_version 76170 (0.0006) [2023-03-07 17:53:31,763][232226] Updated weights for policy 0, policy_version 76180 (0.0007) [2023-03-07 17:53:32,562][232226] Updated weights for policy 0, policy_version 76190 (0.0007) [2023-03-07 17:53:33,357][232226] Updated weights for policy 0, policy_version 76200 (0.0006) [2023-03-07 17:53:34,146][232226] Updated weights for policy 0, policy_version 76210 (0.0006) [2023-03-07 17:53:34,931][232226] Updated weights for policy 0, policy_version 76220 (0.0006) [2023-03-07 17:53:35,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12868.3, 300 sec: 12867.7). Total num frames: 78050304. Throughput: 0: 12871.2. Samples: 78028297. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:53:35,069][231894] Avg episode reward: [(0, '194.063')] [2023-03-07 17:53:35,727][232226] Updated weights for policy 0, policy_version 76230 (0.0006) [2023-03-07 17:53:36,524][232226] Updated weights for policy 0, policy_version 76240 (0.0006) [2023-03-07 17:53:37,310][232226] Updated weights for policy 0, policy_version 76250 (0.0006) [2023-03-07 17:53:38,100][232226] Updated weights for policy 0, policy_version 76260 (0.0006) [2023-03-07 17:53:38,903][232226] Updated weights for policy 0, policy_version 76270 (0.0007) [2023-03-07 17:53:39,694][232226] Updated weights for policy 0, policy_version 76280 (0.0006) [2023-03-07 17:53:40,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 78114816. Throughput: 0: 12875.8. Samples: 78105786. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:53:40,069][231894] Avg episode reward: [(0, '194.704')] [2023-03-07 17:53:40,489][232226] Updated weights for policy 0, policy_version 76290 (0.0005) [2023-03-07 17:53:41,300][232226] Updated weights for policy 0, policy_version 76300 (0.0006) [2023-03-07 17:53:42,070][232226] Updated weights for policy 0, policy_version 76310 (0.0007) [2023-03-07 17:53:42,883][232226] Updated weights for policy 0, policy_version 76320 (0.0007) [2023-03-07 17:53:43,692][232226] Updated weights for policy 0, policy_version 76330 (0.0007) [2023-03-07 17:53:44,490][232226] Updated weights for policy 0, policy_version 76340 (0.0005) [2023-03-07 17:53:45,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 78179328. Throughput: 0: 12878.8. Samples: 78144425. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:53:45,069][231894] Avg episode reward: [(0, '195.706')] [2023-03-07 17:53:45,286][232226] Updated weights for policy 0, policy_version 76350 (0.0006) [2023-03-07 17:53:46,087][232226] Updated weights for policy 0, policy_version 76360 (0.0006) [2023-03-07 17:53:46,866][232226] Updated weights for policy 0, policy_version 76370 (0.0006) [2023-03-07 17:53:47,653][232226] Updated weights for policy 0, policy_version 76380 (0.0006) [2023-03-07 17:53:48,469][232226] Updated weights for policy 0, policy_version 76390 (0.0006) [2023-03-07 17:53:49,248][232226] Updated weights for policy 0, policy_version 76400 (0.0007) [2023-03-07 17:53:50,025][232226] Updated weights for policy 0, policy_version 76410 (0.0006) [2023-03-07 17:53:50,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 78243840. Throughput: 0: 12861.1. Samples: 78221379. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 17:53:50,069][231894] Avg episode reward: [(0, '195.548')] [2023-03-07 17:53:50,841][232226] Updated weights for policy 0, policy_version 76420 (0.0006) [2023-03-07 17:53:51,623][232226] Updated weights for policy 0, policy_version 76430 (0.0006) [2023-03-07 17:53:52,410][232226] Updated weights for policy 0, policy_version 76440 (0.0006) [2023-03-07 17:53:53,215][232226] Updated weights for policy 0, policy_version 76450 (0.0006) [2023-03-07 17:53:54,001][232226] Updated weights for policy 0, policy_version 76460 (0.0007) [2023-03-07 17:53:54,780][232226] Updated weights for policy 0, policy_version 76470 (0.0007) [2023-03-07 17:53:55,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.4, 300 sec: 12871.2). Total num frames: 78308352. Throughput: 0: 12871.6. Samples: 78299024. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 17:53:55,069][231894] Avg episode reward: [(0, '199.157')] [2023-03-07 17:53:55,586][232226] Updated weights for policy 0, policy_version 76480 (0.0006) [2023-03-07 17:53:56,371][232226] Updated weights for policy 0, policy_version 76490 (0.0006) [2023-03-07 17:53:57,165][232226] Updated weights for policy 0, policy_version 76500 (0.0006) [2023-03-07 17:53:57,973][232226] Updated weights for policy 0, policy_version 76510 (0.0007) [2023-03-07 17:53:58,771][232226] Updated weights for policy 0, policy_version 76520 (0.0007) [2023-03-07 17:53:59,553][232226] Updated weights for policy 0, policy_version 76530 (0.0007) [2023-03-07 17:54:00,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 78372864. Throughput: 0: 12875.2. Samples: 78337816. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 17:54:00,069][231894] Avg episode reward: [(0, '190.430')] [2023-03-07 17:54:00,367][232226] Updated weights for policy 0, policy_version 76540 (0.0007) [2023-03-07 17:54:01,142][232226] Updated weights for policy 0, policy_version 76550 (0.0007) [2023-03-07 17:54:01,934][232226] Updated weights for policy 0, policy_version 76560 (0.0006) [2023-03-07 17:54:02,736][232226] Updated weights for policy 0, policy_version 76570 (0.0006) [2023-03-07 17:54:03,533][232226] Updated weights for policy 0, policy_version 76580 (0.0006) [2023-03-07 17:54:04,320][232226] Updated weights for policy 0, policy_version 76590 (0.0007) [2023-03-07 17:54:05,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 78437376. Throughput: 0: 12874.1. Samples: 78415011. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 17:54:05,069][231894] Avg episode reward: [(0, '186.339')] [2023-03-07 17:54:05,128][232226] Updated weights for policy 0, policy_version 76600 (0.0006) [2023-03-07 17:54:05,918][232226] Updated weights for policy 0, policy_version 76610 (0.0005) [2023-03-07 17:54:06,717][232226] Updated weights for policy 0, policy_version 76620 (0.0006) [2023-03-07 17:54:07,517][232226] Updated weights for policy 0, policy_version 76630 (0.0006) [2023-03-07 17:54:08,294][232226] Updated weights for policy 0, policy_version 76640 (0.0006) [2023-03-07 17:54:09,086][232226] Updated weights for policy 0, policy_version 76650 (0.0007) [2023-03-07 17:54:09,888][232226] Updated weights for policy 0, policy_version 76660 (0.0006) [2023-03-07 17:54:10,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 78501888. Throughput: 0: 12880.7. Samples: 78492394. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 17:54:10,070][231894] Avg episode reward: [(0, '195.921')] [2023-03-07 17:54:10,682][232226] Updated weights for policy 0, policy_version 76670 (0.0007) [2023-03-07 17:54:11,468][232226] Updated weights for policy 0, policy_version 76680 (0.0008) [2023-03-07 17:54:12,262][232226] Updated weights for policy 0, policy_version 76690 (0.0007) [2023-03-07 17:54:13,058][232226] Updated weights for policy 0, policy_version 76700 (0.0006) [2023-03-07 17:54:13,841][232226] Updated weights for policy 0, policy_version 76710 (0.0006) [2023-03-07 17:54:14,624][232226] Updated weights for policy 0, policy_version 76720 (0.0006) [2023-03-07 17:54:15,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12874.6). Total num frames: 78566400. Throughput: 0: 12885.4. Samples: 78531090. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 17:54:15,069][231894] Avg episode reward: [(0, '205.210')] [2023-03-07 17:54:15,426][232226] Updated weights for policy 0, policy_version 76730 (0.0006) [2023-03-07 17:54:16,222][232226] Updated weights for policy 0, policy_version 76740 (0.0006) [2023-03-07 17:54:17,026][232226] Updated weights for policy 0, policy_version 76750 (0.0006) [2023-03-07 17:54:17,816][232226] Updated weights for policy 0, policy_version 76760 (0.0008) [2023-03-07 17:54:18,605][232226] Updated weights for policy 0, policy_version 76770 (0.0006) [2023-03-07 17:54:19,395][232226] Updated weights for policy 0, policy_version 76780 (0.0006) [2023-03-07 17:54:20,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.4, 300 sec: 12874.6). Total num frames: 78630912. Throughput: 0: 12893.5. Samples: 78608504. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 17:54:20,069][231894] Avg episode reward: [(0, '196.337')] [2023-03-07 17:54:20,174][232226] Updated weights for policy 0, policy_version 76790 (0.0006) [2023-03-07 17:54:20,978][232226] Updated weights for policy 0, policy_version 76800 (0.0006) [2023-03-07 17:54:21,773][232226] Updated weights for policy 0, policy_version 76810 (0.0007) [2023-03-07 17:54:22,560][232226] Updated weights for policy 0, policy_version 76820 (0.0006) [2023-03-07 17:54:23,350][232226] Updated weights for policy 0, policy_version 76830 (0.0006) [2023-03-07 17:54:24,151][232226] Updated weights for policy 0, policy_version 76840 (0.0006) [2023-03-07 17:54:24,932][232226] Updated weights for policy 0, policy_version 76850 (0.0006) [2023-03-07 17:54:25,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12874.6). Total num frames: 78695424. Throughput: 0: 12898.9. Samples: 78686237. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 17:54:25,070][231894] Avg episode reward: [(0, '203.270')] [2023-03-07 17:54:25,074][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000076851_78695424.pth... [2023-03-07 17:54:25,104][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000073834_75606016.pth [2023-03-07 17:54:25,753][232226] Updated weights for policy 0, policy_version 76860 (0.0007) [2023-03-07 17:54:26,553][232226] Updated weights for policy 0, policy_version 76870 (0.0007) [2023-03-07 17:54:27,332][232226] Updated weights for policy 0, policy_version 76880 (0.0007) [2023-03-07 17:54:28,143][232226] Updated weights for policy 0, policy_version 76890 (0.0007) [2023-03-07 17:54:28,941][232226] Updated weights for policy 0, policy_version 76900 (0.0006) [2023-03-07 17:54:29,730][232226] Updated weights for policy 0, policy_version 76910 (0.0006) [2023-03-07 17:54:30,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12874.6). Total num frames: 78759936. Throughput: 0: 12892.9. Samples: 78724606. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 17:54:30,069][231894] Avg episode reward: [(0, '196.990')] [2023-03-07 17:54:30,544][232226] Updated weights for policy 0, policy_version 76920 (0.0007) [2023-03-07 17:54:31,320][232226] Updated weights for policy 0, policy_version 76930 (0.0007) [2023-03-07 17:54:32,115][232226] Updated weights for policy 0, policy_version 76940 (0.0007) [2023-03-07 17:54:32,919][232226] Updated weights for policy 0, policy_version 76950 (0.0007) [2023-03-07 17:54:33,713][232226] Updated weights for policy 0, policy_version 76960 (0.0006) [2023-03-07 17:54:34,523][232226] Updated weights for policy 0, policy_version 76970 (0.0006) [2023-03-07 17:54:35,069][231894] Fps is (10 sec: 12800.2, 60 sec: 12885.3, 300 sec: 12871.2). Total num frames: 78823424. Throughput: 0: 12897.7. Samples: 78801773. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 17:54:35,069][231894] Avg episode reward: [(0, '193.664')] [2023-03-07 17:54:35,317][232226] Updated weights for policy 0, policy_version 76980 (0.0006) [2023-03-07 17:54:36,128][232226] Updated weights for policy 0, policy_version 76990 (0.0006) [2023-03-07 17:54:36,932][232226] Updated weights for policy 0, policy_version 77000 (0.0007) [2023-03-07 17:54:37,720][232226] Updated weights for policy 0, policy_version 77010 (0.0007) [2023-03-07 17:54:38,515][232226] Updated weights for policy 0, policy_version 77020 (0.0006) [2023-03-07 17:54:39,304][232226] Updated weights for policy 0, policy_version 77030 (0.0006) [2023-03-07 17:54:40,069][231894] Fps is (10 sec: 12800.2, 60 sec: 12885.3, 300 sec: 12871.2). Total num frames: 78887936. Throughput: 0: 12884.0. Samples: 78878802. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:54:40,069][231894] Avg episode reward: [(0, '189.321')] [2023-03-07 17:54:40,097][232226] Updated weights for policy 0, policy_version 77040 (0.0006) [2023-03-07 17:54:40,914][232226] Updated weights for policy 0, policy_version 77050 (0.0006) [2023-03-07 17:54:41,679][232226] Updated weights for policy 0, policy_version 77060 (0.0007) [2023-03-07 17:54:42,471][232226] Updated weights for policy 0, policy_version 77070 (0.0007) [2023-03-07 17:54:43,261][232226] Updated weights for policy 0, policy_version 77080 (0.0007) [2023-03-07 17:54:44,077][232226] Updated weights for policy 0, policy_version 77090 (0.0006) [2023-03-07 17:54:44,868][232226] Updated weights for policy 0, policy_version 77100 (0.0006) [2023-03-07 17:54:45,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12871.2). Total num frames: 78952448. Throughput: 0: 12879.8. Samples: 78917407. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:54:45,069][231894] Avg episode reward: [(0, '197.129')] [2023-03-07 17:54:45,657][232226] Updated weights for policy 0, policy_version 77110 (0.0007) [2023-03-07 17:54:46,454][232226] Updated weights for policy 0, policy_version 77120 (0.0007) [2023-03-07 17:54:47,234][232226] Updated weights for policy 0, policy_version 77130 (0.0006) [2023-03-07 17:54:48,025][232226] Updated weights for policy 0, policy_version 77140 (0.0006) [2023-03-07 17:54:48,809][232226] Updated weights for policy 0, policy_version 77150 (0.0007) [2023-03-07 17:54:49,612][232226] Updated weights for policy 0, policy_version 77160 (0.0007) [2023-03-07 17:54:50,069][231894] Fps is (10 sec: 12902.2, 60 sec: 12885.3, 300 sec: 12871.2). Total num frames: 79016960. Throughput: 0: 12890.3. Samples: 78995075. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:54:50,069][231894] Avg episode reward: [(0, '185.904')] [2023-03-07 17:54:50,398][232226] Updated weights for policy 0, policy_version 77170 (0.0007) [2023-03-07 17:54:51,192][232226] Updated weights for policy 0, policy_version 77180 (0.0006) [2023-03-07 17:54:51,989][232226] Updated weights for policy 0, policy_version 77190 (0.0006) [2023-03-07 17:54:52,774][232226] Updated weights for policy 0, policy_version 77200 (0.0006) [2023-03-07 17:54:53,570][232226] Updated weights for policy 0, policy_version 77210 (0.0007) [2023-03-07 17:54:54,379][232226] Updated weights for policy 0, policy_version 77220 (0.0006) [2023-03-07 17:54:55,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12871.2). Total num frames: 79081472. Throughput: 0: 12889.8. Samples: 79072432. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:54:55,069][231894] Avg episode reward: [(0, '203.179')] [2023-03-07 17:54:55,165][232226] Updated weights for policy 0, policy_version 77230 (0.0006) [2023-03-07 17:54:55,959][232226] Updated weights for policy 0, policy_version 77240 (0.0007) [2023-03-07 17:54:56,756][232226] Updated weights for policy 0, policy_version 77250 (0.0006) [2023-03-07 17:54:57,544][232226] Updated weights for policy 0, policy_version 77260 (0.0006) [2023-03-07 17:54:58,342][232226] Updated weights for policy 0, policy_version 77270 (0.0006) [2023-03-07 17:54:59,118][232226] Updated weights for policy 0, policy_version 77280 (0.0006) [2023-03-07 17:54:59,938][232226] Updated weights for policy 0, policy_version 77290 (0.0006) [2023-03-07 17:55:00,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12871.2). Total num frames: 79145984. Throughput: 0: 12889.3. Samples: 79111109. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:55:00,069][231894] Avg episode reward: [(0, '197.762')] [2023-03-07 17:55:00,730][232226] Updated weights for policy 0, policy_version 77300 (0.0006) [2023-03-07 17:55:01,522][232226] Updated weights for policy 0, policy_version 77310 (0.0006) [2023-03-07 17:55:02,333][232226] Updated weights for policy 0, policy_version 77320 (0.0007) [2023-03-07 17:55:03,124][232226] Updated weights for policy 0, policy_version 77330 (0.0006) [2023-03-07 17:55:03,913][232226] Updated weights for policy 0, policy_version 77340 (0.0007) [2023-03-07 17:55:04,725][232226] Updated weights for policy 0, policy_version 77350 (0.0007) [2023-03-07 17:55:05,069][231894] Fps is (10 sec: 12902.2, 60 sec: 12885.3, 300 sec: 12871.2). Total num frames: 79210496. Throughput: 0: 12888.4. Samples: 79188486. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:55:05,070][231894] Avg episode reward: [(0, '189.801')] [2023-03-07 17:55:05,526][232226] Updated weights for policy 0, policy_version 77360 (0.0006) [2023-03-07 17:55:06,310][232226] Updated weights for policy 0, policy_version 77370 (0.0006) [2023-03-07 17:55:07,100][232226] Updated weights for policy 0, policy_version 77380 (0.0006) [2023-03-07 17:55:07,898][232226] Updated weights for policy 0, policy_version 77390 (0.0007) [2023-03-07 17:55:08,699][232226] Updated weights for policy 0, policy_version 77400 (0.0006) [2023-03-07 17:55:09,510][232226] Updated weights for policy 0, policy_version 77410 (0.0006) [2023-03-07 17:55:10,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12871.2). Total num frames: 79275008. Throughput: 0: 12872.3. Samples: 79265489. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:55:10,069][231894] Avg episode reward: [(0, '192.877')] [2023-03-07 17:55:10,276][232226] Updated weights for policy 0, policy_version 77420 (0.0007) [2023-03-07 17:55:11,059][232226] Updated weights for policy 0, policy_version 77430 (0.0007) [2023-03-07 17:55:11,864][232226] Updated weights for policy 0, policy_version 77440 (0.0007) [2023-03-07 17:55:12,656][232226] Updated weights for policy 0, policy_version 77450 (0.0006) [2023-03-07 17:55:13,453][232226] Updated weights for policy 0, policy_version 77460 (0.0006) [2023-03-07 17:55:14,237][232226] Updated weights for policy 0, policy_version 77470 (0.0007) [2023-03-07 17:55:15,052][232226] Updated weights for policy 0, policy_version 77480 (0.0006) [2023-03-07 17:55:15,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12874.6). Total num frames: 79339520. Throughput: 0: 12883.4. Samples: 79304361. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:55:15,070][231894] Avg episode reward: [(0, '198.021')] [2023-03-07 17:55:15,835][232226] Updated weights for policy 0, policy_version 77490 (0.0007) [2023-03-07 17:55:16,615][232226] Updated weights for policy 0, policy_version 77500 (0.0006) [2023-03-07 17:55:17,420][232226] Updated weights for policy 0, policy_version 77510 (0.0006) [2023-03-07 17:55:18,207][232226] Updated weights for policy 0, policy_version 77520 (0.0006) [2023-03-07 17:55:19,011][232226] Updated weights for policy 0, policy_version 77530 (0.0006) [2023-03-07 17:55:19,795][232226] Updated weights for policy 0, policy_version 77540 (0.0007) [2023-03-07 17:55:20,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12874.6). Total num frames: 79404032. Throughput: 0: 12885.5. Samples: 79381622. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:55:20,069][231894] Avg episode reward: [(0, '192.725')] [2023-03-07 17:55:20,593][232226] Updated weights for policy 0, policy_version 77550 (0.0007) [2023-03-07 17:55:21,379][232226] Updated weights for policy 0, policy_version 77560 (0.0006) [2023-03-07 17:55:22,171][232226] Updated weights for policy 0, policy_version 77570 (0.0006) [2023-03-07 17:55:22,980][232226] Updated weights for policy 0, policy_version 77580 (0.0007) [2023-03-07 17:55:23,757][232226] Updated weights for policy 0, policy_version 77590 (0.0007) [2023-03-07 17:55:24,558][232226] Updated weights for policy 0, policy_version 77600 (0.0006) [2023-03-07 17:55:25,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12885.4, 300 sec: 12874.6). Total num frames: 79468544. Throughput: 0: 12898.2. Samples: 79459223. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:55:25,069][231894] Avg episode reward: [(0, '196.094')] [2023-03-07 17:55:25,350][232226] Updated weights for policy 0, policy_version 77610 (0.0006) [2023-03-07 17:55:26,153][232226] Updated weights for policy 0, policy_version 77620 (0.0006) [2023-03-07 17:55:26,959][232226] Updated weights for policy 0, policy_version 77630 (0.0007) [2023-03-07 17:55:27,766][232226] Updated weights for policy 0, policy_version 77640 (0.0007) [2023-03-07 17:55:28,561][232226] Updated weights for policy 0, policy_version 77650 (0.0007) [2023-03-07 17:55:29,356][232226] Updated weights for policy 0, policy_version 77660 (0.0007) [2023-03-07 17:55:30,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 79532032. Throughput: 0: 12891.0. Samples: 79497500. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:55:30,069][231894] Avg episode reward: [(0, '197.281')] [2023-03-07 17:55:30,144][232226] Updated weights for policy 0, policy_version 77670 (0.0006) [2023-03-07 17:55:30,934][232226] Updated weights for policy 0, policy_version 77680 (0.0005) [2023-03-07 17:55:31,728][232226] Updated weights for policy 0, policy_version 77690 (0.0006) [2023-03-07 17:55:32,537][232226] Updated weights for policy 0, policy_version 77700 (0.0006) [2023-03-07 17:55:33,325][232226] Updated weights for policy 0, policy_version 77710 (0.0006) [2023-03-07 17:55:34,133][232226] Updated weights for policy 0, policy_version 77720 (0.0007) [2023-03-07 17:55:34,915][232226] Updated weights for policy 0, policy_version 77730 (0.0007) [2023-03-07 17:55:35,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12885.3, 300 sec: 12874.6). Total num frames: 79596544. Throughput: 0: 12883.2. Samples: 79574820. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:55:35,069][231894] Avg episode reward: [(0, '192.440')] [2023-03-07 17:55:35,711][232226] Updated weights for policy 0, policy_version 77740 (0.0007) [2023-03-07 17:55:36,493][232226] Updated weights for policy 0, policy_version 77750 (0.0007) [2023-03-07 17:55:37,304][232226] Updated weights for policy 0, policy_version 77760 (0.0006) [2023-03-07 17:55:38,101][232226] Updated weights for policy 0, policy_version 77770 (0.0006) [2023-03-07 17:55:38,894][232226] Updated weights for policy 0, policy_version 77780 (0.0007) [2023-03-07 17:55:39,693][232226] Updated weights for policy 0, policy_version 77790 (0.0007) [2023-03-07 17:55:40,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12874.6). Total num frames: 79661056. Throughput: 0: 12881.4. Samples: 79652095. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:55:40,069][231894] Avg episode reward: [(0, '197.255')] [2023-03-07 17:55:40,485][232226] Updated weights for policy 0, policy_version 77800 (0.0006) [2023-03-07 17:55:41,267][232226] Updated weights for policy 0, policy_version 77810 (0.0007) [2023-03-07 17:55:42,045][232226] Updated weights for policy 0, policy_version 77820 (0.0006) [2023-03-07 17:55:42,847][232226] Updated weights for policy 0, policy_version 77830 (0.0006) [2023-03-07 17:55:43,637][232226] Updated weights for policy 0, policy_version 77840 (0.0006) [2023-03-07 17:55:44,424][232226] Updated weights for policy 0, policy_version 77850 (0.0006) [2023-03-07 17:55:45,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12878.1). Total num frames: 79725568. Throughput: 0: 12885.7. Samples: 79690967. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:55:45,069][231894] Avg episode reward: [(0, '191.120')] [2023-03-07 17:55:45,240][232226] Updated weights for policy 0, policy_version 77860 (0.0006) [2023-03-07 17:55:46,030][232226] Updated weights for policy 0, policy_version 77870 (0.0006) [2023-03-07 17:55:46,832][232226] Updated weights for policy 0, policy_version 77880 (0.0007) [2023-03-07 17:55:47,655][232226] Updated weights for policy 0, policy_version 77890 (0.0007) [2023-03-07 17:55:48,432][232226] Updated weights for policy 0, policy_version 77900 (0.0007) [2023-03-07 17:55:49,252][232226] Updated weights for policy 0, policy_version 77910 (0.0007) [2023-03-07 17:55:50,049][232226] Updated weights for policy 0, policy_version 77920 (0.0006) [2023-03-07 17:55:50,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12874.6). Total num frames: 79790080. Throughput: 0: 12879.6. Samples: 79768065. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:55:50,069][231894] Avg episode reward: [(0, '191.559')] [2023-03-07 17:55:50,842][232226] Updated weights for policy 0, policy_version 77930 (0.0006) [2023-03-07 17:55:51,636][232226] Updated weights for policy 0, policy_version 77940 (0.0007) [2023-03-07 17:55:52,429][232226] Updated weights for policy 0, policy_version 77950 (0.0006) [2023-03-07 17:55:53,239][232226] Updated weights for policy 0, policy_version 77960 (0.0006) [2023-03-07 17:55:54,038][232226] Updated weights for policy 0, policy_version 77970 (0.0007) [2023-03-07 17:55:54,834][232226] Updated weights for policy 0, policy_version 77980 (0.0006) [2023-03-07 17:55:55,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 79853568. Throughput: 0: 12871.6. Samples: 79844710. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:55:55,069][231894] Avg episode reward: [(0, '191.971')] [2023-03-07 17:55:55,630][232226] Updated weights for policy 0, policy_version 77990 (0.0007) [2023-03-07 17:55:56,430][232226] Updated weights for policy 0, policy_version 78000 (0.0006) [2023-03-07 17:55:57,225][232226] Updated weights for policy 0, policy_version 78010 (0.0007) [2023-03-07 17:55:58,005][232226] Updated weights for policy 0, policy_version 78020 (0.0007) [2023-03-07 17:55:58,800][232226] Updated weights for policy 0, policy_version 78030 (0.0006) [2023-03-07 17:55:59,615][232226] Updated weights for policy 0, policy_version 78040 (0.0007) [2023-03-07 17:56:00,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 79918080. Throughput: 0: 12865.9. Samples: 79883326. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:56:00,069][231894] Avg episode reward: [(0, '195.188')] [2023-03-07 17:56:00,392][232226] Updated weights for policy 0, policy_version 78050 (0.0007) [2023-03-07 17:56:01,196][232226] Updated weights for policy 0, policy_version 78060 (0.0006) [2023-03-07 17:56:02,013][232226] Updated weights for policy 0, policy_version 78070 (0.0006) [2023-03-07 17:56:02,807][232226] Updated weights for policy 0, policy_version 78080 (0.0006) [2023-03-07 17:56:03,603][232226] Updated weights for policy 0, policy_version 78090 (0.0006) [2023-03-07 17:56:04,402][232226] Updated weights for policy 0, policy_version 78100 (0.0006) [2023-03-07 17:56:05,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 79982592. Throughput: 0: 12865.3. Samples: 79960564. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:56:05,070][231894] Avg episode reward: [(0, '186.390')] [2023-03-07 17:56:05,187][232226] Updated weights for policy 0, policy_version 78110 (0.0006) [2023-03-07 17:56:05,968][232226] Updated weights for policy 0, policy_version 78120 (0.0005) [2023-03-07 17:56:06,765][232226] Updated weights for policy 0, policy_version 78130 (0.0006) [2023-03-07 17:56:07,564][232226] Updated weights for policy 0, policy_version 78140 (0.0006) [2023-03-07 17:56:08,353][232226] Updated weights for policy 0, policy_version 78150 (0.0007) [2023-03-07 17:56:09,154][232226] Updated weights for policy 0, policy_version 78160 (0.0006) [2023-03-07 17:56:09,942][232226] Updated weights for policy 0, policy_version 78170 (0.0006) [2023-03-07 17:56:10,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 80047104. Throughput: 0: 12858.4. Samples: 80037854. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:56:10,070][231894] Avg episode reward: [(0, '198.712')] [2023-03-07 17:56:10,746][232226] Updated weights for policy 0, policy_version 78180 (0.0007) [2023-03-07 17:56:11,538][232226] Updated weights for policy 0, policy_version 78190 (0.0006) [2023-03-07 17:56:12,345][232226] Updated weights for policy 0, policy_version 78200 (0.0007) [2023-03-07 17:56:13,123][232226] Updated weights for policy 0, policy_version 78210 (0.0007) [2023-03-07 17:56:13,929][232226] Updated weights for policy 0, policy_version 78220 (0.0006) [2023-03-07 17:56:14,734][232226] Updated weights for policy 0, policy_version 78230 (0.0006) [2023-03-07 17:56:15,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12878.1). Total num frames: 80111616. Throughput: 0: 12861.9. Samples: 80076287. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:56:15,070][231894] Avg episode reward: [(0, '192.691')] [2023-03-07 17:56:15,519][232226] Updated weights for policy 0, policy_version 78240 (0.0006) [2023-03-07 17:56:16,304][232226] Updated weights for policy 0, policy_version 78250 (0.0006) [2023-03-07 17:56:17,138][232226] Updated weights for policy 0, policy_version 78260 (0.0006) [2023-03-07 17:56:17,929][232226] Updated weights for policy 0, policy_version 78270 (0.0006) [2023-03-07 17:56:18,720][232226] Updated weights for policy 0, policy_version 78280 (0.0007) [2023-03-07 17:56:19,538][232226] Updated weights for policy 0, policy_version 78290 (0.0006) [2023-03-07 17:56:20,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12878.1). Total num frames: 80176128. Throughput: 0: 12858.6. Samples: 80153459. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:56:20,069][231894] Avg episode reward: [(0, '192.446')] [2023-03-07 17:56:20,323][232226] Updated weights for policy 0, policy_version 78300 (0.0006) [2023-03-07 17:56:21,114][232226] Updated weights for policy 0, policy_version 78310 (0.0007) [2023-03-07 17:56:21,930][232226] Updated weights for policy 0, policy_version 78320 (0.0006) [2023-03-07 17:56:22,734][232226] Updated weights for policy 0, policy_version 78330 (0.0007) [2023-03-07 17:56:23,525][232226] Updated weights for policy 0, policy_version 78340 (0.0006) [2023-03-07 17:56:24,334][232226] Updated weights for policy 0, policy_version 78350 (0.0006) [2023-03-07 17:56:25,069][231894] Fps is (10 sec: 12800.2, 60 sec: 12851.2, 300 sec: 12878.1). Total num frames: 80239616. Throughput: 0: 12850.2. Samples: 80230356. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:56:25,070][231894] Avg episode reward: [(0, '193.217')] [2023-03-07 17:56:25,073][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000078359_80239616.pth... [2023-03-07 17:56:25,105][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000075342_77150208.pth [2023-03-07 17:56:25,130][232226] Updated weights for policy 0, policy_version 78360 (0.0006) [2023-03-07 17:56:25,892][232226] Updated weights for policy 0, policy_version 78370 (0.0006) [2023-03-07 17:56:26,685][232226] Updated weights for policy 0, policy_version 78380 (0.0007) [2023-03-07 17:56:27,488][232226] Updated weights for policy 0, policy_version 78390 (0.0007) [2023-03-07 17:56:28,287][232226] Updated weights for policy 0, policy_version 78400 (0.0006) [2023-03-07 17:56:29,074][232226] Updated weights for policy 0, policy_version 78410 (0.0006) [2023-03-07 17:56:29,876][232226] Updated weights for policy 0, policy_version 78420 (0.0006) [2023-03-07 17:56:30,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12868.2, 300 sec: 12878.1). Total num frames: 80304128. Throughput: 0: 12847.0. Samples: 80269083. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:56:30,069][231894] Avg episode reward: [(0, '199.881')] [2023-03-07 17:56:30,667][232226] Updated weights for policy 0, policy_version 78430 (0.0005) [2023-03-07 17:56:31,464][232226] Updated weights for policy 0, policy_version 78440 (0.0006) [2023-03-07 17:56:32,251][232226] Updated weights for policy 0, policy_version 78450 (0.0006) [2023-03-07 17:56:33,045][232226] Updated weights for policy 0, policy_version 78460 (0.0008) [2023-03-07 17:56:33,850][232226] Updated weights for policy 0, policy_version 78470 (0.0007) [2023-03-07 17:56:34,648][232226] Updated weights for policy 0, policy_version 78480 (0.0006) [2023-03-07 17:56:35,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12878.1). Total num frames: 80368640. Throughput: 0: 12851.5. Samples: 80346382. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:56:35,069][231894] Avg episode reward: [(0, '192.894')] [2023-03-07 17:56:35,429][232226] Updated weights for policy 0, policy_version 78490 (0.0006) [2023-03-07 17:56:36,232][232226] Updated weights for policy 0, policy_version 78500 (0.0006) [2023-03-07 17:56:37,031][232226] Updated weights for policy 0, policy_version 78510 (0.0007) [2023-03-07 17:56:37,814][232226] Updated weights for policy 0, policy_version 78520 (0.0006) [2023-03-07 17:56:38,618][232226] Updated weights for policy 0, policy_version 78530 (0.0006) [2023-03-07 17:56:39,418][232226] Updated weights for policy 0, policy_version 78540 (0.0007) [2023-03-07 17:56:40,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12878.1). Total num frames: 80433152. Throughput: 0: 12861.3. Samples: 80423467. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:56:40,069][231894] Avg episode reward: [(0, '190.760')] [2023-03-07 17:56:40,200][232226] Updated weights for policy 0, policy_version 78550 (0.0007) [2023-03-07 17:56:41,002][232226] Updated weights for policy 0, policy_version 78560 (0.0006) [2023-03-07 17:56:41,787][232226] Updated weights for policy 0, policy_version 78570 (0.0005) [2023-03-07 17:56:42,586][232226] Updated weights for policy 0, policy_version 78580 (0.0006) [2023-03-07 17:56:43,393][232226] Updated weights for policy 0, policy_version 78590 (0.0006) [2023-03-07 17:56:44,169][232226] Updated weights for policy 0, policy_version 78600 (0.0006) [2023-03-07 17:56:44,988][232226] Updated weights for policy 0, policy_version 78610 (0.0006) [2023-03-07 17:56:45,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12874.6). Total num frames: 80496640. Throughput: 0: 12866.1. Samples: 80462302. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:56:45,069][231894] Avg episode reward: [(0, '191.203')] [2023-03-07 17:56:45,782][232226] Updated weights for policy 0, policy_version 78620 (0.0007) [2023-03-07 17:56:46,580][232226] Updated weights for policy 0, policy_version 78630 (0.0007) [2023-03-07 17:56:47,364][232226] Updated weights for policy 0, policy_version 78640 (0.0006) [2023-03-07 17:56:48,174][232226] Updated weights for policy 0, policy_version 78650 (0.0006) [2023-03-07 17:56:48,968][232226] Updated weights for policy 0, policy_version 78660 (0.0007) [2023-03-07 17:56:49,758][232226] Updated weights for policy 0, policy_version 78670 (0.0007) [2023-03-07 17:56:50,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12874.6). Total num frames: 80561152. Throughput: 0: 12862.2. Samples: 80539363. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:56:50,080][231894] Avg episode reward: [(0, '197.632')] [2023-03-07 17:56:50,547][232226] Updated weights for policy 0, policy_version 78680 (0.0006) [2023-03-07 17:56:51,342][232226] Updated weights for policy 0, policy_version 78690 (0.0006) [2023-03-07 17:56:52,129][232226] Updated weights for policy 0, policy_version 78700 (0.0007) [2023-03-07 17:56:52,904][232226] Updated weights for policy 0, policy_version 78710 (0.0007) [2023-03-07 17:56:53,693][232226] Updated weights for policy 0, policy_version 78720 (0.0007) [2023-03-07 17:56:54,486][232226] Updated weights for policy 0, policy_version 78730 (0.0006) [2023-03-07 17:56:55,069][231894] Fps is (10 sec: 13004.6, 60 sec: 12885.3, 300 sec: 12878.1). Total num frames: 80626688. Throughput: 0: 12871.8. Samples: 80617086. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:56:55,080][231894] Avg episode reward: [(0, '191.904')] [2023-03-07 17:56:55,283][232226] Updated weights for policy 0, policy_version 78740 (0.0006) [2023-03-07 17:56:56,070][232226] Updated weights for policy 0, policy_version 78750 (0.0007) [2023-03-07 17:56:56,885][232226] Updated weights for policy 0, policy_version 78760 (0.0006) [2023-03-07 17:56:57,687][232226] Updated weights for policy 0, policy_version 78770 (0.0006) [2023-03-07 17:56:58,475][232226] Updated weights for policy 0, policy_version 78780 (0.0006) [2023-03-07 17:56:59,290][232226] Updated weights for policy 0, policy_version 78790 (0.0006) [2023-03-07 17:57:00,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 80690176. Throughput: 0: 12876.2. Samples: 80655716. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:57:00,080][231894] Avg episode reward: [(0, '186.017')] [2023-03-07 17:57:00,088][232226] Updated weights for policy 0, policy_version 78800 (0.0007) [2023-03-07 17:57:00,888][232226] Updated weights for policy 0, policy_version 78810 (0.0007) [2023-03-07 17:57:01,686][232226] Updated weights for policy 0, policy_version 78820 (0.0007) [2023-03-07 17:57:02,457][232226] Updated weights for policy 0, policy_version 78830 (0.0007) [2023-03-07 17:57:03,253][232226] Updated weights for policy 0, policy_version 78840 (0.0006) [2023-03-07 17:57:04,054][232226] Updated weights for policy 0, policy_version 78850 (0.0006) [2023-03-07 17:57:04,854][232226] Updated weights for policy 0, policy_version 78860 (0.0006) [2023-03-07 17:57:05,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 80754688. Throughput: 0: 12878.4. Samples: 80732987. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:57:05,080][231894] Avg episode reward: [(0, '201.283')] [2023-03-07 17:57:05,651][232226] Updated weights for policy 0, policy_version 78870 (0.0008) [2023-03-07 17:57:06,461][232226] Updated weights for policy 0, policy_version 78880 (0.0006) [2023-03-07 17:57:07,236][232226] Updated weights for policy 0, policy_version 78890 (0.0006) [2023-03-07 17:57:08,032][232226] Updated weights for policy 0, policy_version 78900 (0.0007) [2023-03-07 17:57:08,843][232226] Updated weights for policy 0, policy_version 78910 (0.0006) [2023-03-07 17:57:09,632][232226] Updated weights for policy 0, policy_version 78920 (0.0007) [2023-03-07 17:57:10,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 80819200. Throughput: 0: 12880.9. Samples: 80809999. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:57:10,080][231894] Avg episode reward: [(0, '205.917')] [2023-03-07 17:57:10,418][232226] Updated weights for policy 0, policy_version 78930 (0.0006) [2023-03-07 17:57:11,211][232226] Updated weights for policy 0, policy_version 78940 (0.0006) [2023-03-07 17:57:12,001][232226] Updated weights for policy 0, policy_version 78950 (0.0007) [2023-03-07 17:57:12,789][232226] Updated weights for policy 0, policy_version 78960 (0.0007) [2023-03-07 17:57:13,605][232226] Updated weights for policy 0, policy_version 78970 (0.0006) [2023-03-07 17:57:14,408][232226] Updated weights for policy 0, policy_version 78980 (0.0006) [2023-03-07 17:57:15,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 80883712. Throughput: 0: 12882.4. Samples: 80848789. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:57:15,080][231894] Avg episode reward: [(0, '189.122')] [2023-03-07 17:57:15,183][232226] Updated weights for policy 0, policy_version 78990 (0.0007) [2023-03-07 17:57:15,996][232226] Updated weights for policy 0, policy_version 79000 (0.0006) [2023-03-07 17:57:16,797][232226] Updated weights for policy 0, policy_version 79010 (0.0007) [2023-03-07 17:57:17,582][232226] Updated weights for policy 0, policy_version 79020 (0.0006) [2023-03-07 17:57:18,392][232226] Updated weights for policy 0, policy_version 79030 (0.0006) [2023-03-07 17:57:19,184][232226] Updated weights for policy 0, policy_version 79040 (0.0006) [2023-03-07 17:57:19,973][232226] Updated weights for policy 0, policy_version 79050 (0.0006) [2023-03-07 17:57:20,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 80948224. Throughput: 0: 12876.2. Samples: 80925811. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:57:20,080][231894] Avg episode reward: [(0, '192.587')] [2023-03-07 17:57:20,775][232226] Updated weights for policy 0, policy_version 79060 (0.0006) [2023-03-07 17:57:21,581][232226] Updated weights for policy 0, policy_version 79070 (0.0006) [2023-03-07 17:57:22,366][232226] Updated weights for policy 0, policy_version 79080 (0.0006) [2023-03-07 17:57:23,150][232226] Updated weights for policy 0, policy_version 79090 (0.0006) [2023-03-07 17:57:23,953][232226] Updated weights for policy 0, policy_version 79100 (0.0006) [2023-03-07 17:57:24,748][232226] Updated weights for policy 0, policy_version 79110 (0.0006) [2023-03-07 17:57:25,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12878.1). Total num frames: 81012736. Throughput: 0: 12878.5. Samples: 81003001. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:57:25,080][231894] Avg episode reward: [(0, '190.121')] [2023-03-07 17:57:25,554][232226] Updated weights for policy 0, policy_version 79120 (0.0007) [2023-03-07 17:57:26,337][232226] Updated weights for policy 0, policy_version 79130 (0.0006) [2023-03-07 17:57:27,124][232226] Updated weights for policy 0, policy_version 79140 (0.0007) [2023-03-07 17:57:27,911][232226] Updated weights for policy 0, policy_version 79150 (0.0006) [2023-03-07 17:57:28,719][232226] Updated weights for policy 0, policy_version 79160 (0.0006) [2023-03-07 17:57:29,523][232226] Updated weights for policy 0, policy_version 79170 (0.0006) [2023-03-07 17:57:30,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 81076224. Throughput: 0: 12874.7. Samples: 81041664. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:57:30,080][231894] Avg episode reward: [(0, '199.294')] [2023-03-07 17:57:30,324][232226] Updated weights for policy 0, policy_version 79180 (0.0007) [2023-03-07 17:57:31,125][232226] Updated weights for policy 0, policy_version 79190 (0.0006) [2023-03-07 17:57:31,923][232226] Updated weights for policy 0, policy_version 79200 (0.0006) [2023-03-07 17:57:32,722][232226] Updated weights for policy 0, policy_version 79210 (0.0007) [2023-03-07 17:57:33,525][232226] Updated weights for policy 0, policy_version 79220 (0.0007) [2023-03-07 17:57:34,314][232226] Updated weights for policy 0, policy_version 79230 (0.0007) [2023-03-07 17:57:35,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 81140736. Throughput: 0: 12873.5. Samples: 81118671. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:57:35,080][231894] Avg episode reward: [(0, '198.337')] [2023-03-07 17:57:35,113][232226] Updated weights for policy 0, policy_version 79240 (0.0006) [2023-03-07 17:57:35,914][232226] Updated weights for policy 0, policy_version 79250 (0.0006) [2023-03-07 17:57:36,716][232226] Updated weights for policy 0, policy_version 79260 (0.0006) [2023-03-07 17:57:37,490][232226] Updated weights for policy 0, policy_version 79270 (0.0006) [2023-03-07 17:57:38,294][232226] Updated weights for policy 0, policy_version 79280 (0.0006) [2023-03-07 17:57:39,074][232226] Updated weights for policy 0, policy_version 79290 (0.0006) [2023-03-07 17:57:39,850][232226] Updated weights for policy 0, policy_version 79300 (0.0006) [2023-03-07 17:57:40,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 81205248. Throughput: 0: 12865.3. Samples: 81196023. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:57:40,069][231894] Avg episode reward: [(0, '197.159')] [2023-03-07 17:57:40,652][232226] Updated weights for policy 0, policy_version 79310 (0.0006) [2023-03-07 17:57:41,460][232226] Updated weights for policy 0, policy_version 79320 (0.0007) [2023-03-07 17:57:42,246][232226] Updated weights for policy 0, policy_version 79330 (0.0007) [2023-03-07 17:57:43,031][232226] Updated weights for policy 0, policy_version 79340 (0.0005) [2023-03-07 17:57:43,847][232226] Updated weights for policy 0, policy_version 79350 (0.0006) [2023-03-07 17:57:44,630][232226] Updated weights for policy 0, policy_version 79360 (0.0005) [2023-03-07 17:57:45,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12874.6). Total num frames: 81269760. Throughput: 0: 12866.7. Samples: 81234715. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 17:57:45,069][231894] Avg episode reward: [(0, '190.001')] [2023-03-07 17:57:45,421][232226] Updated weights for policy 0, policy_version 79370 (0.0006) [2023-03-07 17:57:46,234][232226] Updated weights for policy 0, policy_version 79380 (0.0006) [2023-03-07 17:57:47,018][232226] Updated weights for policy 0, policy_version 79390 (0.0006) [2023-03-07 17:57:47,817][232226] Updated weights for policy 0, policy_version 79400 (0.0007) [2023-03-07 17:57:48,623][232226] Updated weights for policy 0, policy_version 79410 (0.0006) [2023-03-07 17:57:49,405][232226] Updated weights for policy 0, policy_version 79420 (0.0006) [2023-03-07 17:57:50,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12878.1). Total num frames: 81334272. Throughput: 0: 12865.3. Samples: 81311924. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:57:50,069][231894] Avg episode reward: [(0, '190.761')] [2023-03-07 17:57:50,213][232226] Updated weights for policy 0, policy_version 79430 (0.0006) [2023-03-07 17:57:51,006][232226] Updated weights for policy 0, policy_version 79440 (0.0006) [2023-03-07 17:57:51,785][232226] Updated weights for policy 0, policy_version 79450 (0.0006) [2023-03-07 17:57:52,581][232226] Updated weights for policy 0, policy_version 79460 (0.0006) [2023-03-07 17:57:53,366][232226] Updated weights for policy 0, policy_version 79470 (0.0006) [2023-03-07 17:57:54,156][232226] Updated weights for policy 0, policy_version 79480 (0.0007) [2023-03-07 17:57:54,952][232226] Updated weights for policy 0, policy_version 79490 (0.0007) [2023-03-07 17:57:55,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 81398784. Throughput: 0: 12876.7. Samples: 81389451. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:57:55,069][231894] Avg episode reward: [(0, '198.127')] [2023-03-07 17:57:55,743][232226] Updated weights for policy 0, policy_version 79500 (0.0008) [2023-03-07 17:57:56,545][232226] Updated weights for policy 0, policy_version 79510 (0.0006) [2023-03-07 17:57:57,334][232226] Updated weights for policy 0, policy_version 79520 (0.0006) [2023-03-07 17:57:58,148][232226] Updated weights for policy 0, policy_version 79530 (0.0006) [2023-03-07 17:57:58,936][232226] Updated weights for policy 0, policy_version 79540 (0.0006) [2023-03-07 17:57:59,727][232226] Updated weights for policy 0, policy_version 79550 (0.0006) [2023-03-07 17:58:00,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12874.6). Total num frames: 81463296. Throughput: 0: 12874.7. Samples: 81428150. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:58:00,069][231894] Avg episode reward: [(0, '193.372')] [2023-03-07 17:58:00,551][232226] Updated weights for policy 0, policy_version 79560 (0.0006) [2023-03-07 17:58:01,346][232226] Updated weights for policy 0, policy_version 79570 (0.0006) [2023-03-07 17:58:02,149][232226] Updated weights for policy 0, policy_version 79580 (0.0006) [2023-03-07 17:58:02,932][232226] Updated weights for policy 0, policy_version 79590 (0.0006) [2023-03-07 17:58:03,722][232226] Updated weights for policy 0, policy_version 79600 (0.0007) [2023-03-07 17:58:04,525][232226] Updated weights for policy 0, policy_version 79610 (0.0006) [2023-03-07 17:58:05,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 81526784. Throughput: 0: 12871.3. Samples: 81505020. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:58:05,070][231894] Avg episode reward: [(0, '195.006')] [2023-03-07 17:58:05,298][232226] Updated weights for policy 0, policy_version 79620 (0.0006) [2023-03-07 17:58:06,085][232226] Updated weights for policy 0, policy_version 79630 (0.0006) [2023-03-07 17:58:06,887][232226] Updated weights for policy 0, policy_version 79640 (0.0006) [2023-03-07 17:58:07,702][232226] Updated weights for policy 0, policy_version 79650 (0.0006) [2023-03-07 17:58:08,484][232226] Updated weights for policy 0, policy_version 79660 (0.0006) [2023-03-07 17:58:09,296][232226] Updated weights for policy 0, policy_version 79670 (0.0006) [2023-03-07 17:58:10,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 81591296. Throughput: 0: 12872.7. Samples: 81582271. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:58:10,069][231894] Avg episode reward: [(0, '190.310')] [2023-03-07 17:58:10,081][232226] Updated weights for policy 0, policy_version 79680 (0.0007) [2023-03-07 17:58:10,890][232226] Updated weights for policy 0, policy_version 79690 (0.0006) [2023-03-07 17:58:11,673][232226] Updated weights for policy 0, policy_version 79700 (0.0006) [2023-03-07 17:58:12,472][232226] Updated weights for policy 0, policy_version 79710 (0.0006) [2023-03-07 17:58:13,280][232226] Updated weights for policy 0, policy_version 79720 (0.0006) [2023-03-07 17:58:14,070][232226] Updated weights for policy 0, policy_version 79730 (0.0007) [2023-03-07 17:58:14,861][232226] Updated weights for policy 0, policy_version 79740 (0.0006) [2023-03-07 17:58:15,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 81655808. Throughput: 0: 12869.8. Samples: 81620804. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:58:15,069][231894] Avg episode reward: [(0, '195.020')] [2023-03-07 17:58:15,659][232226] Updated weights for policy 0, policy_version 79750 (0.0006) [2023-03-07 17:58:16,475][232226] Updated weights for policy 0, policy_version 79760 (0.0007) [2023-03-07 17:58:17,252][232226] Updated weights for policy 0, policy_version 79770 (0.0006) [2023-03-07 17:58:18,038][232226] Updated weights for policy 0, policy_version 79780 (0.0007) [2023-03-07 17:58:18,829][232226] Updated weights for policy 0, policy_version 79790 (0.0006) [2023-03-07 17:58:19,610][232226] Updated weights for policy 0, policy_version 79800 (0.0006) [2023-03-07 17:58:20,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 81720320. Throughput: 0: 12875.9. Samples: 81698087. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:58:20,069][231894] Avg episode reward: [(0, '195.476')] [2023-03-07 17:58:20,401][232226] Updated weights for policy 0, policy_version 79810 (0.0007) [2023-03-07 17:58:21,205][232226] Updated weights for policy 0, policy_version 79820 (0.0007) [2023-03-07 17:58:21,987][232226] Updated weights for policy 0, policy_version 79830 (0.0007) [2023-03-07 17:58:22,773][232226] Updated weights for policy 0, policy_version 79840 (0.0006) [2023-03-07 17:58:23,560][232226] Updated weights for policy 0, policy_version 79850 (0.0006) [2023-03-07 17:58:24,345][232226] Updated weights for policy 0, policy_version 79860 (0.0006) [2023-03-07 17:58:25,069][231894] Fps is (10 sec: 13004.8, 60 sec: 12885.4, 300 sec: 12878.1). Total num frames: 81785856. Throughput: 0: 12893.2. Samples: 81776217. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:58:25,069][231894] Avg episode reward: [(0, '185.215')] [2023-03-07 17:58:25,073][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000079869_81785856.pth... [2023-03-07 17:58:25,102][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000076851_78695424.pth [2023-03-07 17:58:25,165][232226] Updated weights for policy 0, policy_version 79870 (0.0006) [2023-03-07 17:58:25,941][232226] Updated weights for policy 0, policy_version 79880 (0.0007) [2023-03-07 17:58:26,728][232226] Updated weights for policy 0, policy_version 79890 (0.0006) [2023-03-07 17:58:27,536][232226] Updated weights for policy 0, policy_version 79900 (0.0006) [2023-03-07 17:58:28,335][232226] Updated weights for policy 0, policy_version 79910 (0.0006) [2023-03-07 17:58:29,125][232226] Updated weights for policy 0, policy_version 79920 (0.0007) [2023-03-07 17:58:29,926][232226] Updated weights for policy 0, policy_version 79930 (0.0006) [2023-03-07 17:58:30,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12878.1). Total num frames: 81849344. Throughput: 0: 12889.7. Samples: 81814752. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:58:30,069][231894] Avg episode reward: [(0, '201.683')] [2023-03-07 17:58:30,719][232226] Updated weights for policy 0, policy_version 79940 (0.0007) [2023-03-07 17:58:31,510][232226] Updated weights for policy 0, policy_version 79950 (0.0006) [2023-03-07 17:58:32,305][232226] Updated weights for policy 0, policy_version 79960 (0.0006) [2023-03-07 17:58:33,088][232226] Updated weights for policy 0, policy_version 79970 (0.0007) [2023-03-07 17:58:33,883][232226] Updated weights for policy 0, policy_version 79980 (0.0006) [2023-03-07 17:58:34,669][232226] Updated weights for policy 0, policy_version 79990 (0.0006) [2023-03-07 17:58:35,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12885.3, 300 sec: 12878.1). Total num frames: 81913856. Throughput: 0: 12892.4. Samples: 81892082. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:58:35,069][231894] Avg episode reward: [(0, '193.374')] [2023-03-07 17:58:35,465][232226] Updated weights for policy 0, policy_version 80000 (0.0006) [2023-03-07 17:58:36,272][232226] Updated weights for policy 0, policy_version 80010 (0.0006) [2023-03-07 17:58:37,053][232226] Updated weights for policy 0, policy_version 80020 (0.0006) [2023-03-07 17:58:37,857][232226] Updated weights for policy 0, policy_version 80030 (0.0006) [2023-03-07 17:58:38,642][232226] Updated weights for policy 0, policy_version 80040 (0.0005) [2023-03-07 17:58:39,436][232226] Updated weights for policy 0, policy_version 80050 (0.0006) [2023-03-07 17:58:40,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12878.1). Total num frames: 81978368. Throughput: 0: 12893.2. Samples: 81969645. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:58:40,070][231894] Avg episode reward: [(0, '189.128')] [2023-03-07 17:58:40,241][232226] Updated weights for policy 0, policy_version 80060 (0.0006) [2023-03-07 17:58:41,051][232226] Updated weights for policy 0, policy_version 80070 (0.0007) [2023-03-07 17:58:41,833][232226] Updated weights for policy 0, policy_version 80080 (0.0007) [2023-03-07 17:58:42,642][232226] Updated weights for policy 0, policy_version 80090 (0.0006) [2023-03-07 17:58:43,435][232226] Updated weights for policy 0, policy_version 80100 (0.0007) [2023-03-07 17:58:44,219][232226] Updated weights for policy 0, policy_version 80110 (0.0006) [2023-03-07 17:58:45,027][232226] Updated weights for policy 0, policy_version 80120 (0.0007) [2023-03-07 17:58:45,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12878.1). Total num frames: 82042880. Throughput: 0: 12880.1. Samples: 82007753. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:58:45,070][231894] Avg episode reward: [(0, '195.906')] [2023-03-07 17:58:45,832][232226] Updated weights for policy 0, policy_version 80130 (0.0007) [2023-03-07 17:58:46,630][232226] Updated weights for policy 0, policy_version 80140 (0.0007) [2023-03-07 17:58:47,431][232226] Updated weights for policy 0, policy_version 80150 (0.0006) [2023-03-07 17:58:48,217][232226] Updated weights for policy 0, policy_version 80160 (0.0007) [2023-03-07 17:58:49,025][232226] Updated weights for policy 0, policy_version 80170 (0.0006) [2023-03-07 17:58:49,819][232226] Updated weights for policy 0, policy_version 80180 (0.0006) [2023-03-07 17:58:50,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12878.1). Total num frames: 82107392. Throughput: 0: 12886.6. Samples: 82084915. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:58:50,070][231894] Avg episode reward: [(0, '198.339')] [2023-03-07 17:58:50,641][232226] Updated weights for policy 0, policy_version 80190 (0.0007) [2023-03-07 17:58:51,433][232226] Updated weights for policy 0, policy_version 80200 (0.0007) [2023-03-07 17:58:52,219][232226] Updated weights for policy 0, policy_version 80210 (0.0006) [2023-03-07 17:58:53,023][232226] Updated weights for policy 0, policy_version 80220 (0.0006) [2023-03-07 17:58:53,827][232226] Updated weights for policy 0, policy_version 80230 (0.0006) [2023-03-07 17:58:54,625][232226] Updated weights for policy 0, policy_version 80240 (0.0006) [2023-03-07 17:58:55,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 82170880. Throughput: 0: 12877.8. Samples: 82161774. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:58:55,069][231894] Avg episode reward: [(0, '197.241')] [2023-03-07 17:58:55,409][232226] Updated weights for policy 0, policy_version 80250 (0.0006) [2023-03-07 17:58:56,201][232226] Updated weights for policy 0, policy_version 80260 (0.0006) [2023-03-07 17:58:57,000][232226] Updated weights for policy 0, policy_version 80270 (0.0007) [2023-03-07 17:58:57,784][232226] Updated weights for policy 0, policy_version 80280 (0.0006) [2023-03-07 17:58:58,571][232226] Updated weights for policy 0, policy_version 80290 (0.0006) [2023-03-07 17:58:59,375][232226] Updated weights for policy 0, policy_version 80300 (0.0006) [2023-03-07 17:59:00,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 82235392. Throughput: 0: 12884.2. Samples: 82200595. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:59:00,069][231894] Avg episode reward: [(0, '197.860')] [2023-03-07 17:59:00,158][232226] Updated weights for policy 0, policy_version 80310 (0.0006) [2023-03-07 17:59:00,938][232226] Updated weights for policy 0, policy_version 80320 (0.0008) [2023-03-07 17:59:01,743][232226] Updated weights for policy 0, policy_version 80330 (0.0007) [2023-03-07 17:59:02,550][232226] Updated weights for policy 0, policy_version 80340 (0.0007) [2023-03-07 17:59:03,349][232226] Updated weights for policy 0, policy_version 80350 (0.0007) [2023-03-07 17:59:04,148][232226] Updated weights for policy 0, policy_version 80360 (0.0006) [2023-03-07 17:59:04,962][232226] Updated weights for policy 0, policy_version 80370 (0.0006) [2023-03-07 17:59:05,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12874.6). Total num frames: 82299904. Throughput: 0: 12880.6. Samples: 82277716. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:59:05,070][231894] Avg episode reward: [(0, '198.924')] [2023-03-07 17:59:05,763][232226] Updated weights for policy 0, policy_version 80380 (0.0006) [2023-03-07 17:59:06,556][232226] Updated weights for policy 0, policy_version 80390 (0.0006) [2023-03-07 17:59:07,360][232226] Updated weights for policy 0, policy_version 80400 (0.0007) [2023-03-07 17:59:08,147][232226] Updated weights for policy 0, policy_version 80410 (0.0006) [2023-03-07 17:59:08,949][232226] Updated weights for policy 0, policy_version 80420 (0.0006) [2023-03-07 17:59:09,742][232226] Updated weights for policy 0, policy_version 80430 (0.0006) [2023-03-07 17:59:10,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12874.6). Total num frames: 82364416. Throughput: 0: 12856.2. Samples: 82354746. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:59:10,069][231894] Avg episode reward: [(0, '198.751')] [2023-03-07 17:59:10,529][232226] Updated weights for policy 0, policy_version 80440 (0.0005) [2023-03-07 17:59:11,343][232226] Updated weights for policy 0, policy_version 80450 (0.0007) [2023-03-07 17:59:12,127][232226] Updated weights for policy 0, policy_version 80460 (0.0007) [2023-03-07 17:59:12,926][232226] Updated weights for policy 0, policy_version 80470 (0.0007) [2023-03-07 17:59:13,725][232226] Updated weights for policy 0, policy_version 80480 (0.0007) [2023-03-07 17:59:14,508][232226] Updated weights for policy 0, policy_version 80490 (0.0007) [2023-03-07 17:59:15,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12874.6). Total num frames: 82428928. Throughput: 0: 12856.9. Samples: 82393312. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:59:15,069][231894] Avg episode reward: [(0, '189.553')] [2023-03-07 17:59:15,297][232226] Updated weights for policy 0, policy_version 80500 (0.0007) [2023-03-07 17:59:16,099][232226] Updated weights for policy 0, policy_version 80510 (0.0006) [2023-03-07 17:59:16,896][232226] Updated weights for policy 0, policy_version 80520 (0.0006) [2023-03-07 17:59:17,691][232226] Updated weights for policy 0, policy_version 80530 (0.0006) [2023-03-07 17:59:18,489][232226] Updated weights for policy 0, policy_version 80540 (0.0006) [2023-03-07 17:59:19,294][232226] Updated weights for policy 0, policy_version 80550 (0.0006) [2023-03-07 17:59:20,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 82492416. Throughput: 0: 12858.1. Samples: 82470697. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:59:20,069][231894] Avg episode reward: [(0, '191.760')] [2023-03-07 17:59:20,089][232226] Updated weights for policy 0, policy_version 80560 (0.0006) [2023-03-07 17:59:20,877][232226] Updated weights for policy 0, policy_version 80570 (0.0006) [2023-03-07 17:59:21,674][232226] Updated weights for policy 0, policy_version 80580 (0.0006) [2023-03-07 17:59:22,454][232226] Updated weights for policy 0, policy_version 80590 (0.0006) [2023-03-07 17:59:23,272][232226] Updated weights for policy 0, policy_version 80600 (0.0006) [2023-03-07 17:59:24,070][232226] Updated weights for policy 0, policy_version 80610 (0.0006) [2023-03-07 17:59:24,875][232226] Updated weights for policy 0, policy_version 80620 (0.0007) [2023-03-07 17:59:25,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12851.2, 300 sec: 12871.2). Total num frames: 82556928. Throughput: 0: 12842.7. Samples: 82547567. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:59:25,069][231894] Avg episode reward: [(0, '196.599')] [2023-03-07 17:59:25,660][232226] Updated weights for policy 0, policy_version 80630 (0.0007) [2023-03-07 17:59:26,464][232226] Updated weights for policy 0, policy_version 80640 (0.0007) [2023-03-07 17:59:27,265][232226] Updated weights for policy 0, policy_version 80650 (0.0006) [2023-03-07 17:59:28,053][232226] Updated weights for policy 0, policy_version 80660 (0.0006) [2023-03-07 17:59:28,860][232226] Updated weights for policy 0, policy_version 80670 (0.0006) [2023-03-07 17:59:29,650][232226] Updated weights for policy 0, policy_version 80680 (0.0006) [2023-03-07 17:59:30,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 82621440. Throughput: 0: 12853.6. Samples: 82586166. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:59:30,069][231894] Avg episode reward: [(0, '196.674')] [2023-03-07 17:59:30,425][232226] Updated weights for policy 0, policy_version 80690 (0.0007) [2023-03-07 17:59:31,226][232226] Updated weights for policy 0, policy_version 80700 (0.0006) [2023-03-07 17:59:32,009][232226] Updated weights for policy 0, policy_version 80710 (0.0007) [2023-03-07 17:59:32,800][232226] Updated weights for policy 0, policy_version 80720 (0.0006) [2023-03-07 17:59:33,597][232226] Updated weights for policy 0, policy_version 80730 (0.0007) [2023-03-07 17:59:34,394][232226] Updated weights for policy 0, policy_version 80740 (0.0006) [2023-03-07 17:59:35,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 82685952. Throughput: 0: 12864.1. Samples: 82663799. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:59:35,069][231894] Avg episode reward: [(0, '201.007')] [2023-03-07 17:59:35,179][232226] Updated weights for policy 0, policy_version 80750 (0.0007) [2023-03-07 17:59:35,971][232226] Updated weights for policy 0, policy_version 80760 (0.0006) [2023-03-07 17:59:36,783][232226] Updated weights for policy 0, policy_version 80770 (0.0006) [2023-03-07 17:59:37,562][232226] Updated weights for policy 0, policy_version 80780 (0.0006) [2023-03-07 17:59:38,357][232226] Updated weights for policy 0, policy_version 80790 (0.0007) [2023-03-07 17:59:39,163][232226] Updated weights for policy 0, policy_version 80800 (0.0006) [2023-03-07 17:59:39,957][232226] Updated weights for policy 0, policy_version 80810 (0.0005) [2023-03-07 17:59:40,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 82750464. Throughput: 0: 12876.4. Samples: 82741212. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:59:40,069][231894] Avg episode reward: [(0, '191.918')] [2023-03-07 17:59:40,754][232226] Updated weights for policy 0, policy_version 80820 (0.0006) [2023-03-07 17:59:41,544][232226] Updated weights for policy 0, policy_version 80830 (0.0006) [2023-03-07 17:59:42,344][232226] Updated weights for policy 0, policy_version 80840 (0.0007) [2023-03-07 17:59:43,153][232226] Updated weights for policy 0, policy_version 80850 (0.0006) [2023-03-07 17:59:43,945][232226] Updated weights for policy 0, policy_version 80860 (0.0007) [2023-03-07 17:59:44,729][232226] Updated weights for policy 0, policy_version 80870 (0.0007) [2023-03-07 17:59:45,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 82814976. Throughput: 0: 12868.6. Samples: 82779682. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:59:45,069][231894] Avg episode reward: [(0, '199.598')] [2023-03-07 17:59:45,522][232226] Updated weights for policy 0, policy_version 80880 (0.0006) [2023-03-07 17:59:46,330][232226] Updated weights for policy 0, policy_version 80890 (0.0006) [2023-03-07 17:59:47,134][232226] Updated weights for policy 0, policy_version 80900 (0.0006) [2023-03-07 17:59:47,932][232226] Updated weights for policy 0, policy_version 80910 (0.0007) [2023-03-07 17:59:48,717][232226] Updated weights for policy 0, policy_version 80920 (0.0007) [2023-03-07 17:59:49,513][232226] Updated weights for policy 0, policy_version 80930 (0.0006) [2023-03-07 17:59:50,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 82879488. Throughput: 0: 12869.4. Samples: 82856838. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:59:50,069][231894] Avg episode reward: [(0, '202.528')] [2023-03-07 17:59:50,312][232226] Updated weights for policy 0, policy_version 80940 (0.0006) [2023-03-07 17:59:51,117][232226] Updated weights for policy 0, policy_version 80950 (0.0007) [2023-03-07 17:59:51,909][232226] Updated weights for policy 0, policy_version 80960 (0.0006) [2023-03-07 17:59:52,699][232226] Updated weights for policy 0, policy_version 80970 (0.0006) [2023-03-07 17:59:53,493][232226] Updated weights for policy 0, policy_version 80980 (0.0006) [2023-03-07 17:59:54,309][232226] Updated weights for policy 0, policy_version 80990 (0.0006) [2023-03-07 17:59:55,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 82942976. Throughput: 0: 12871.6. Samples: 82933969. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 17:59:55,069][231894] Avg episode reward: [(0, '201.281')] [2023-03-07 17:59:55,097][232226] Updated weights for policy 0, policy_version 81000 (0.0006) [2023-03-07 17:59:55,907][232226] Updated weights for policy 0, policy_version 81010 (0.0006) [2023-03-07 17:59:56,707][232226] Updated weights for policy 0, policy_version 81020 (0.0006) [2023-03-07 17:59:57,478][232226] Updated weights for policy 0, policy_version 81030 (0.0006) [2023-03-07 17:59:58,262][232226] Updated weights for policy 0, policy_version 81040 (0.0006) [2023-03-07 17:59:59,072][232226] Updated weights for policy 0, policy_version 81050 (0.0007) [2023-03-07 17:59:59,866][232226] Updated weights for policy 0, policy_version 81060 (0.0006) [2023-03-07 18:00:00,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 83007488. Throughput: 0: 12872.0. Samples: 82972554. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:00:00,070][231894] Avg episode reward: [(0, '198.919')] [2023-03-07 18:00:00,666][232226] Updated weights for policy 0, policy_version 81070 (0.0007) [2023-03-07 18:00:01,465][232226] Updated weights for policy 0, policy_version 81080 (0.0007) [2023-03-07 18:00:02,250][232226] Updated weights for policy 0, policy_version 81090 (0.0006) [2023-03-07 18:00:03,048][232226] Updated weights for policy 0, policy_version 81100 (0.0007) [2023-03-07 18:00:03,821][232226] Updated weights for policy 0, policy_version 81110 (0.0007) [2023-03-07 18:00:04,617][232226] Updated weights for policy 0, policy_version 81120 (0.0006) [2023-03-07 18:00:05,069][231894] Fps is (10 sec: 12902.2, 60 sec: 12868.2, 300 sec: 12871.2). Total num frames: 83072000. Throughput: 0: 12868.9. Samples: 83049798. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:00:05,069][231894] Avg episode reward: [(0, '202.745')] [2023-03-07 18:00:05,409][232226] Updated weights for policy 0, policy_version 81130 (0.0006) [2023-03-07 18:00:06,211][232226] Updated weights for policy 0, policy_version 81140 (0.0006) [2023-03-07 18:00:07,004][232226] Updated weights for policy 0, policy_version 81150 (0.0007) [2023-03-07 18:00:07,799][232226] Updated weights for policy 0, policy_version 81160 (0.0006) [2023-03-07 18:00:08,596][232226] Updated weights for policy 0, policy_version 81170 (0.0007) [2023-03-07 18:00:09,386][232226] Updated weights for policy 0, policy_version 81180 (0.0007) [2023-03-07 18:00:10,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.2, 300 sec: 12871.2). Total num frames: 83136512. Throughput: 0: 12883.2. Samples: 83127311. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:00:10,069][231894] Avg episode reward: [(0, '191.119')] [2023-03-07 18:00:10,201][232226] Updated weights for policy 0, policy_version 81190 (0.0007) [2023-03-07 18:00:10,982][232226] Updated weights for policy 0, policy_version 81200 (0.0007) [2023-03-07 18:00:11,770][232226] Updated weights for policy 0, policy_version 81210 (0.0006) [2023-03-07 18:00:12,548][232226] Updated weights for policy 0, policy_version 81220 (0.0006) [2023-03-07 18:00:13,346][232226] Updated weights for policy 0, policy_version 81230 (0.0007) [2023-03-07 18:00:14,134][232226] Updated weights for policy 0, policy_version 81240 (0.0006) [2023-03-07 18:00:14,954][232226] Updated weights for policy 0, policy_version 81250 (0.0006) [2023-03-07 18:00:15,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12868.2, 300 sec: 12871.2). Total num frames: 83201024. Throughput: 0: 12890.4. Samples: 83166234. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:00:15,070][231894] Avg episode reward: [(0, '184.904')] [2023-03-07 18:00:15,745][232226] Updated weights for policy 0, policy_version 81260 (0.0006) [2023-03-07 18:00:16,545][232226] Updated weights for policy 0, policy_version 81270 (0.0006) [2023-03-07 18:00:17,339][232226] Updated weights for policy 0, policy_version 81280 (0.0006) [2023-03-07 18:00:18,134][232226] Updated weights for policy 0, policy_version 81290 (0.0007) [2023-03-07 18:00:18,929][232226] Updated weights for policy 0, policy_version 81300 (0.0006) [2023-03-07 18:00:19,718][232226] Updated weights for policy 0, policy_version 81310 (0.0006) [2023-03-07 18:00:20,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12871.2). Total num frames: 83265536. Throughput: 0: 12875.7. Samples: 83243207. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 18:00:20,069][231894] Avg episode reward: [(0, '201.044')] [2023-03-07 18:00:20,511][232226] Updated weights for policy 0, policy_version 81320 (0.0006) [2023-03-07 18:00:21,313][232226] Updated weights for policy 0, policy_version 81330 (0.0008) [2023-03-07 18:00:22,109][232226] Updated weights for policy 0, policy_version 81340 (0.0006) [2023-03-07 18:00:22,909][232226] Updated weights for policy 0, policy_version 81350 (0.0006) [2023-03-07 18:00:23,711][232226] Updated weights for policy 0, policy_version 81360 (0.0006) [2023-03-07 18:00:24,505][232226] Updated weights for policy 0, policy_version 81370 (0.0006) [2023-03-07 18:00:25,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12874.6). Total num frames: 83330048. Throughput: 0: 12870.2. Samples: 83320372. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 18:00:25,069][231894] Avg episode reward: [(0, '194.139')] [2023-03-07 18:00:25,073][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000081377_83330048.pth... [2023-03-07 18:00:25,103][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000078359_80239616.pth [2023-03-07 18:00:25,298][232226] Updated weights for policy 0, policy_version 81380 (0.0006) [2023-03-07 18:00:26,103][232226] Updated weights for policy 0, policy_version 81390 (0.0006) [2023-03-07 18:00:26,885][232226] Updated weights for policy 0, policy_version 81400 (0.0006) [2023-03-07 18:00:27,698][232226] Updated weights for policy 0, policy_version 81410 (0.0007) [2023-03-07 18:00:28,483][232226] Updated weights for policy 0, policy_version 81420 (0.0006) [2023-03-07 18:00:29,291][232226] Updated weights for policy 0, policy_version 81430 (0.0007) [2023-03-07 18:00:30,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12868.2, 300 sec: 12871.2). Total num frames: 83393536. Throughput: 0: 12874.4. Samples: 83359029. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 18:00:30,070][231894] Avg episode reward: [(0, '198.592')] [2023-03-07 18:00:30,084][232226] Updated weights for policy 0, policy_version 81440 (0.0006) [2023-03-07 18:00:30,870][232226] Updated weights for policy 0, policy_version 81450 (0.0005) [2023-03-07 18:00:31,659][232226] Updated weights for policy 0, policy_version 81460 (0.0006) [2023-03-07 18:00:32,449][232226] Updated weights for policy 0, policy_version 81470 (0.0007) [2023-03-07 18:00:33,247][232226] Updated weights for policy 0, policy_version 81480 (0.0007) [2023-03-07 18:00:34,046][232226] Updated weights for policy 0, policy_version 81490 (0.0006) [2023-03-07 18:00:34,842][232226] Updated weights for policy 0, policy_version 81500 (0.0007) [2023-03-07 18:00:35,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 83458048. Throughput: 0: 12878.1. Samples: 83436352. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 18:00:35,069][231894] Avg episode reward: [(0, '190.873')] [2023-03-07 18:00:35,629][232226] Updated weights for policy 0, policy_version 81510 (0.0006) [2023-03-07 18:00:36,437][232226] Updated weights for policy 0, policy_version 81520 (0.0007) [2023-03-07 18:00:37,225][232226] Updated weights for policy 0, policy_version 81530 (0.0006) [2023-03-07 18:00:38,002][232226] Updated weights for policy 0, policy_version 81540 (0.0007) [2023-03-07 18:00:38,824][232226] Updated weights for policy 0, policy_version 81550 (0.0007) [2023-03-07 18:00:39,610][232226] Updated weights for policy 0, policy_version 81560 (0.0006) [2023-03-07 18:00:40,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 83522560. Throughput: 0: 12878.6. Samples: 83513507. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 18:00:40,069][231894] Avg episode reward: [(0, '197.640')] [2023-03-07 18:00:40,404][232226] Updated weights for policy 0, policy_version 81570 (0.0006) [2023-03-07 18:00:41,218][232226] Updated weights for policy 0, policy_version 81580 (0.0006) [2023-03-07 18:00:42,014][232226] Updated weights for policy 0, policy_version 81590 (0.0007) [2023-03-07 18:00:42,794][232226] Updated weights for policy 0, policy_version 81600 (0.0007) [2023-03-07 18:00:43,600][232226] Updated weights for policy 0, policy_version 81610 (0.0006) [2023-03-07 18:00:44,395][232226] Updated weights for policy 0, policy_version 81620 (0.0006) [2023-03-07 18:00:45,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 83587072. Throughput: 0: 12882.1. Samples: 83552251. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 18:00:45,070][231894] Avg episode reward: [(0, '200.686')] [2023-03-07 18:00:45,191][232226] Updated weights for policy 0, policy_version 81630 (0.0007) [2023-03-07 18:00:45,985][232226] Updated weights for policy 0, policy_version 81640 (0.0006) [2023-03-07 18:00:46,767][232226] Updated weights for policy 0, policy_version 81650 (0.0007) [2023-03-07 18:00:47,576][232226] Updated weights for policy 0, policy_version 81660 (0.0007) [2023-03-07 18:00:48,361][232226] Updated weights for policy 0, policy_version 81670 (0.0007) [2023-03-07 18:00:49,164][232226] Updated weights for policy 0, policy_version 81680 (0.0007) [2023-03-07 18:00:49,981][232226] Updated weights for policy 0, policy_version 81690 (0.0006) [2023-03-07 18:00:50,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 83651584. Throughput: 0: 12879.8. Samples: 83629388. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 18:00:50,069][231894] Avg episode reward: [(0, '197.536')] [2023-03-07 18:00:50,773][232226] Updated weights for policy 0, policy_version 81700 (0.0007) [2023-03-07 18:00:51,585][232226] Updated weights for policy 0, policy_version 81710 (0.0006) [2023-03-07 18:00:52,365][232226] Updated weights for policy 0, policy_version 81720 (0.0006) [2023-03-07 18:00:53,173][232226] Updated weights for policy 0, policy_version 81730 (0.0006) [2023-03-07 18:00:53,969][232226] Updated weights for policy 0, policy_version 81740 (0.0006) [2023-03-07 18:00:54,751][232226] Updated weights for policy 0, policy_version 81750 (0.0006) [2023-03-07 18:00:55,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12874.6). Total num frames: 83716096. Throughput: 0: 12863.5. Samples: 83706169. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 18:00:55,069][231894] Avg episode reward: [(0, '195.620')] [2023-03-07 18:00:55,561][232226] Updated weights for policy 0, policy_version 81760 (0.0006) [2023-03-07 18:00:56,354][232226] Updated weights for policy 0, policy_version 81770 (0.0006) [2023-03-07 18:00:57,151][232226] Updated weights for policy 0, policy_version 81780 (0.0006) [2023-03-07 18:00:57,964][232226] Updated weights for policy 0, policy_version 81790 (0.0006) [2023-03-07 18:00:58,754][232226] Updated weights for policy 0, policy_version 81800 (0.0006) [2023-03-07 18:00:59,560][232226] Updated weights for policy 0, policy_version 81810 (0.0006) [2023-03-07 18:01:00,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 83779584. Throughput: 0: 12857.2. Samples: 83744806. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 18:01:00,069][231894] Avg episode reward: [(0, '203.121')] [2023-03-07 18:01:00,343][232226] Updated weights for policy 0, policy_version 81820 (0.0006) [2023-03-07 18:01:01,153][232226] Updated weights for policy 0, policy_version 81830 (0.0007) [2023-03-07 18:01:01,934][232226] Updated weights for policy 0, policy_version 81840 (0.0006) [2023-03-07 18:01:02,714][232226] Updated weights for policy 0, policy_version 81850 (0.0006) [2023-03-07 18:01:03,520][232226] Updated weights for policy 0, policy_version 81860 (0.0007) [2023-03-07 18:01:04,306][232226] Updated weights for policy 0, policy_version 81870 (0.0006) [2023-03-07 18:01:05,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 83844096. Throughput: 0: 12861.2. Samples: 83821963. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 18:01:05,069][231894] Avg episode reward: [(0, '195.865')] [2023-03-07 18:01:05,116][232226] Updated weights for policy 0, policy_version 81880 (0.0006) [2023-03-07 18:01:05,914][232226] Updated weights for policy 0, policy_version 81890 (0.0006) [2023-03-07 18:01:06,710][232226] Updated weights for policy 0, policy_version 81900 (0.0007) [2023-03-07 18:01:07,508][232226] Updated weights for policy 0, policy_version 81910 (0.0006) [2023-03-07 18:01:08,293][232226] Updated weights for policy 0, policy_version 81920 (0.0007) [2023-03-07 18:01:09,105][232226] Updated weights for policy 0, policy_version 81930 (0.0007) [2023-03-07 18:01:09,901][232226] Updated weights for policy 0, policy_version 81940 (0.0006) [2023-03-07 18:01:10,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 83908608. Throughput: 0: 12857.4. Samples: 83898955. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 18:01:10,069][231894] Avg episode reward: [(0, '198.693')] [2023-03-07 18:01:10,694][232226] Updated weights for policy 0, policy_version 81950 (0.0007) [2023-03-07 18:01:11,494][232226] Updated weights for policy 0, policy_version 81960 (0.0006) [2023-03-07 18:01:12,281][232226] Updated weights for policy 0, policy_version 81970 (0.0006) [2023-03-07 18:01:13,065][232226] Updated weights for policy 0, policy_version 81980 (0.0007) [2023-03-07 18:01:13,849][232226] Updated weights for policy 0, policy_version 81990 (0.0006) [2023-03-07 18:01:14,642][232226] Updated weights for policy 0, policy_version 82000 (0.0006) [2023-03-07 18:01:15,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 83973120. Throughput: 0: 12861.9. Samples: 83937814. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 18:01:15,069][231894] Avg episode reward: [(0, '195.733')] [2023-03-07 18:01:15,459][232226] Updated weights for policy 0, policy_version 82010 (0.0006) [2023-03-07 18:01:16,251][232226] Updated weights for policy 0, policy_version 82020 (0.0006) [2023-03-07 18:01:17,036][232226] Updated weights for policy 0, policy_version 82030 (0.0007) [2023-03-07 18:01:17,852][232226] Updated weights for policy 0, policy_version 82040 (0.0007) [2023-03-07 18:01:18,629][232226] Updated weights for policy 0, policy_version 82050 (0.0007) [2023-03-07 18:01:19,427][232226] Updated weights for policy 0, policy_version 82060 (0.0007) [2023-03-07 18:01:20,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 84037632. Throughput: 0: 12857.1. Samples: 84014921. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 18:01:20,069][231894] Avg episode reward: [(0, '198.770')] [2023-03-07 18:01:20,226][232226] Updated weights for policy 0, policy_version 82070 (0.0006) [2023-03-07 18:01:21,029][232226] Updated weights for policy 0, policy_version 82080 (0.0006) [2023-03-07 18:01:21,816][232226] Updated weights for policy 0, policy_version 82090 (0.0007) [2023-03-07 18:01:22,615][232226] Updated weights for policy 0, policy_version 82100 (0.0007) [2023-03-07 18:01:23,400][232226] Updated weights for policy 0, policy_version 82110 (0.0006) [2023-03-07 18:01:24,188][232226] Updated weights for policy 0, policy_version 82120 (0.0006) [2023-03-07 18:01:24,992][232226] Updated weights for policy 0, policy_version 82130 (0.0007) [2023-03-07 18:01:25,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12871.2). Total num frames: 84101120. Throughput: 0: 12860.6. Samples: 84092237. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 18:01:25,069][231894] Avg episode reward: [(0, '199.966')] [2023-03-07 18:01:25,805][232226] Updated weights for policy 0, policy_version 82140 (0.0005) [2023-03-07 18:01:26,588][232226] Updated weights for policy 0, policy_version 82150 (0.0006) [2023-03-07 18:01:27,389][232226] Updated weights for policy 0, policy_version 82160 (0.0007) [2023-03-07 18:01:28,182][232226] Updated weights for policy 0, policy_version 82170 (0.0007) [2023-03-07 18:01:28,998][232226] Updated weights for policy 0, policy_version 82180 (0.0006) [2023-03-07 18:01:29,793][232226] Updated weights for policy 0, policy_version 82190 (0.0008) [2023-03-07 18:01:30,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 84165632. Throughput: 0: 12858.9. Samples: 84130899. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 18:01:30,069][231894] Avg episode reward: [(0, '196.698')] [2023-03-07 18:01:30,589][232226] Updated weights for policy 0, policy_version 82200 (0.0007) [2023-03-07 18:01:31,389][232226] Updated weights for policy 0, policy_version 82210 (0.0007) [2023-03-07 18:01:32,178][232226] Updated weights for policy 0, policy_version 82220 (0.0007) [2023-03-07 18:01:32,981][232226] Updated weights for policy 0, policy_version 82230 (0.0007) [2023-03-07 18:01:33,782][232226] Updated weights for policy 0, policy_version 82240 (0.0007) [2023-03-07 18:01:34,578][232226] Updated weights for policy 0, policy_version 82250 (0.0007) [2023-03-07 18:01:35,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 84230144. Throughput: 0: 12853.4. Samples: 84207794. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 18:01:35,069][231894] Avg episode reward: [(0, '195.575')] [2023-03-07 18:01:35,368][232226] Updated weights for policy 0, policy_version 82260 (0.0007) [2023-03-07 18:01:36,157][232226] Updated weights for policy 0, policy_version 82270 (0.0006) [2023-03-07 18:01:36,950][232226] Updated weights for policy 0, policy_version 82280 (0.0007) [2023-03-07 18:01:37,749][232226] Updated weights for policy 0, policy_version 82290 (0.0006) [2023-03-07 18:01:38,569][232226] Updated weights for policy 0, policy_version 82300 (0.0007) [2023-03-07 18:01:39,358][232226] Updated weights for policy 0, policy_version 82310 (0.0006) [2023-03-07 18:01:40,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 84294656. Throughput: 0: 12860.6. Samples: 84284896. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 18:01:40,069][231894] Avg episode reward: [(0, '204.281')] [2023-03-07 18:01:40,159][232226] Updated weights for policy 0, policy_version 82320 (0.0007) [2023-03-07 18:01:40,966][232226] Updated weights for policy 0, policy_version 82330 (0.0006) [2023-03-07 18:01:41,767][232226] Updated weights for policy 0, policy_version 82340 (0.0005) [2023-03-07 18:01:42,558][232226] Updated weights for policy 0, policy_version 82350 (0.0006) [2023-03-07 18:01:43,381][232226] Updated weights for policy 0, policy_version 82360 (0.0006) [2023-03-07 18:01:44,140][232226] Updated weights for policy 0, policy_version 82370 (0.0006) [2023-03-07 18:01:44,950][232226] Updated weights for policy 0, policy_version 82380 (0.0007) [2023-03-07 18:01:45,069][231894] Fps is (10 sec: 12800.2, 60 sec: 12851.2, 300 sec: 12871.2). Total num frames: 84358144. Throughput: 0: 12856.1. Samples: 84323329. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 18:01:45,069][231894] Avg episode reward: [(0, '195.948')] [2023-03-07 18:01:45,746][232226] Updated weights for policy 0, policy_version 82390 (0.0006) [2023-03-07 18:01:46,547][232226] Updated weights for policy 0, policy_version 82400 (0.0006) [2023-03-07 18:01:47,341][232226] Updated weights for policy 0, policy_version 82410 (0.0006) [2023-03-07 18:01:48,142][232226] Updated weights for policy 0, policy_version 82420 (0.0005) [2023-03-07 18:01:48,934][232226] Updated weights for policy 0, policy_version 82430 (0.0006) [2023-03-07 18:01:49,735][232226] Updated weights for policy 0, policy_version 82440 (0.0007) [2023-03-07 18:01:50,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12851.2, 300 sec: 12867.7). Total num frames: 84422656. Throughput: 0: 12850.6. Samples: 84400241. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 18:01:50,069][231894] Avg episode reward: [(0, '191.542')] [2023-03-07 18:01:50,536][232226] Updated weights for policy 0, policy_version 82450 (0.0007) [2023-03-07 18:01:51,321][232226] Updated weights for policy 0, policy_version 82460 (0.0006) [2023-03-07 18:01:52,112][232226] Updated weights for policy 0, policy_version 82470 (0.0007) [2023-03-07 18:01:52,913][232226] Updated weights for policy 0, policy_version 82480 (0.0006) [2023-03-07 18:01:53,699][232226] Updated weights for policy 0, policy_version 82490 (0.0007) [2023-03-07 18:01:54,496][232226] Updated weights for policy 0, policy_version 82500 (0.0006) [2023-03-07 18:01:55,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12851.2, 300 sec: 12871.2). Total num frames: 84487168. Throughput: 0: 12860.3. Samples: 84477669. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:01:55,069][231894] Avg episode reward: [(0, '192.470')] [2023-03-07 18:01:55,272][232226] Updated weights for policy 0, policy_version 82510 (0.0006) [2023-03-07 18:01:56,076][232226] Updated weights for policy 0, policy_version 82520 (0.0007) [2023-03-07 18:01:56,868][232226] Updated weights for policy 0, policy_version 82530 (0.0006) [2023-03-07 18:01:57,669][232226] Updated weights for policy 0, policy_version 82540 (0.0006) [2023-03-07 18:01:58,445][232226] Updated weights for policy 0, policy_version 82550 (0.0006) [2023-03-07 18:01:59,241][232226] Updated weights for policy 0, policy_version 82560 (0.0006) [2023-03-07 18:02:00,033][232226] Updated weights for policy 0, policy_version 82570 (0.0006) [2023-03-07 18:02:00,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 84551680. Throughput: 0: 12858.1. Samples: 84516429. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:02:00,069][231894] Avg episode reward: [(0, '195.907')] [2023-03-07 18:02:00,835][232226] Updated weights for policy 0, policy_version 82580 (0.0006) [2023-03-07 18:02:01,620][232226] Updated weights for policy 0, policy_version 82590 (0.0007) [2023-03-07 18:02:02,413][232226] Updated weights for policy 0, policy_version 82600 (0.0006) [2023-03-07 18:02:03,188][232226] Updated weights for policy 0, policy_version 82610 (0.0007) [2023-03-07 18:02:03,986][232226] Updated weights for policy 0, policy_version 82620 (0.0006) [2023-03-07 18:02:04,795][232226] Updated weights for policy 0, policy_version 82630 (0.0007) [2023-03-07 18:02:05,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 84616192. Throughput: 0: 12872.0. Samples: 84594164. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:02:05,069][231894] Avg episode reward: [(0, '192.222')] [2023-03-07 18:02:05,581][232226] Updated weights for policy 0, policy_version 82640 (0.0007) [2023-03-07 18:02:06,378][232226] Updated weights for policy 0, policy_version 82650 (0.0006) [2023-03-07 18:02:07,186][232226] Updated weights for policy 0, policy_version 82660 (0.0007) [2023-03-07 18:02:07,988][232226] Updated weights for policy 0, policy_version 82670 (0.0006) [2023-03-07 18:02:08,786][232226] Updated weights for policy 0, policy_version 82680 (0.0006) [2023-03-07 18:02:09,577][232226] Updated weights for policy 0, policy_version 82690 (0.0006) [2023-03-07 18:02:10,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 84680704. Throughput: 0: 12865.2. Samples: 84671173. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:02:10,069][231894] Avg episode reward: [(0, '189.585')] [2023-03-07 18:02:10,394][232226] Updated weights for policy 0, policy_version 82700 (0.0006) [2023-03-07 18:02:11,182][232226] Updated weights for policy 0, policy_version 82710 (0.0006) [2023-03-07 18:02:11,961][232226] Updated weights for policy 0, policy_version 82720 (0.0006) [2023-03-07 18:02:12,770][232226] Updated weights for policy 0, policy_version 82730 (0.0006) [2023-03-07 18:02:13,569][232226] Updated weights for policy 0, policy_version 82740 (0.0006) [2023-03-07 18:02:14,358][232226] Updated weights for policy 0, policy_version 82750 (0.0006) [2023-03-07 18:02:15,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12867.7). Total num frames: 84744192. Throughput: 0: 12861.3. Samples: 84709660. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:02:15,069][231894] Avg episode reward: [(0, '194.907')] [2023-03-07 18:02:15,157][232226] Updated weights for policy 0, policy_version 82760 (0.0006) [2023-03-07 18:02:15,972][232226] Updated weights for policy 0, policy_version 82770 (0.0007) [2023-03-07 18:02:16,751][232226] Updated weights for policy 0, policy_version 82780 (0.0006) [2023-03-07 18:02:17,537][232226] Updated weights for policy 0, policy_version 82790 (0.0005) [2023-03-07 18:02:18,353][232226] Updated weights for policy 0, policy_version 82800 (0.0006) [2023-03-07 18:02:19,146][232226] Updated weights for policy 0, policy_version 82810 (0.0006) [2023-03-07 18:02:19,949][232226] Updated weights for policy 0, policy_version 82820 (0.0007) [2023-03-07 18:02:20,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12851.2, 300 sec: 12867.7). Total num frames: 84808704. Throughput: 0: 12868.1. Samples: 84786859. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:02:20,069][231894] Avg episode reward: [(0, '186.142')] [2023-03-07 18:02:20,726][232226] Updated weights for policy 0, policy_version 82830 (0.0006) [2023-03-07 18:02:21,541][232226] Updated weights for policy 0, policy_version 82840 (0.0007) [2023-03-07 18:02:22,316][232226] Updated weights for policy 0, policy_version 82850 (0.0006) [2023-03-07 18:02:23,125][232226] Updated weights for policy 0, policy_version 82860 (0.0006) [2023-03-07 18:02:23,909][232226] Updated weights for policy 0, policy_version 82870 (0.0006) [2023-03-07 18:02:24,704][232226] Updated weights for policy 0, policy_version 82880 (0.0008) [2023-03-07 18:02:25,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 84873216. Throughput: 0: 12868.2. Samples: 84863964. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:02:25,069][231894] Avg episode reward: [(0, '196.936')] [2023-03-07 18:02:25,072][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000082884_84873216.pth... [2023-03-07 18:02:25,104][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000079869_81785856.pth [2023-03-07 18:02:25,508][232226] Updated weights for policy 0, policy_version 82890 (0.0007) [2023-03-07 18:02:26,309][232226] Updated weights for policy 0, policy_version 82900 (0.0007) [2023-03-07 18:02:27,102][232226] Updated weights for policy 0, policy_version 82910 (0.0006) [2023-03-07 18:02:27,907][232226] Updated weights for policy 0, policy_version 82920 (0.0006) [2023-03-07 18:02:28,710][232226] Updated weights for policy 0, policy_version 82930 (0.0006) [2023-03-07 18:02:29,511][232226] Updated weights for policy 0, policy_version 82940 (0.0006) [2023-03-07 18:02:30,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12867.7). Total num frames: 84936704. Throughput: 0: 12869.4. Samples: 84902454. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:02:30,069][231894] Avg episode reward: [(0, '196.642')] [2023-03-07 18:02:30,306][232226] Updated weights for policy 0, policy_version 82950 (0.0007) [2023-03-07 18:02:31,094][232226] Updated weights for policy 0, policy_version 82960 (0.0006) [2023-03-07 18:02:31,878][232226] Updated weights for policy 0, policy_version 82970 (0.0007) [2023-03-07 18:02:32,686][232226] Updated weights for policy 0, policy_version 82980 (0.0006) [2023-03-07 18:02:33,471][232226] Updated weights for policy 0, policy_version 82990 (0.0006) [2023-03-07 18:02:34,281][232226] Updated weights for policy 0, policy_version 83000 (0.0006) [2023-03-07 18:02:35,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12867.7). Total num frames: 85001216. Throughput: 0: 12874.6. Samples: 84979597. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:02:35,069][231894] Avg episode reward: [(0, '193.805')] [2023-03-07 18:02:35,082][232226] Updated weights for policy 0, policy_version 83010 (0.0006) [2023-03-07 18:02:35,866][232226] Updated weights for policy 0, policy_version 83020 (0.0006) [2023-03-07 18:02:36,652][232226] Updated weights for policy 0, policy_version 83030 (0.0006) [2023-03-07 18:02:37,456][232226] Updated weights for policy 0, policy_version 83040 (0.0006) [2023-03-07 18:02:38,229][232226] Updated weights for policy 0, policy_version 83050 (0.0007) [2023-03-07 18:02:39,044][232226] Updated weights for policy 0, policy_version 83060 (0.0007) [2023-03-07 18:02:39,863][232226] Updated weights for policy 0, policy_version 83070 (0.0006) [2023-03-07 18:02:40,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12867.7). Total num frames: 85065728. Throughput: 0: 12868.2. Samples: 85056740. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:02:40,069][231894] Avg episode reward: [(0, '190.297')] [2023-03-07 18:02:40,639][232226] Updated weights for policy 0, policy_version 83080 (0.0006) [2023-03-07 18:02:41,437][232226] Updated weights for policy 0, policy_version 83090 (0.0005) [2023-03-07 18:02:42,238][232226] Updated weights for policy 0, policy_version 83100 (0.0006) [2023-03-07 18:02:43,028][232226] Updated weights for policy 0, policy_version 83110 (0.0006) [2023-03-07 18:02:43,829][232226] Updated weights for policy 0, policy_version 83120 (0.0006) [2023-03-07 18:02:44,629][232226] Updated weights for policy 0, policy_version 83130 (0.0006) [2023-03-07 18:02:45,069][231894] Fps is (10 sec: 12902.2, 60 sec: 12868.2, 300 sec: 12867.7). Total num frames: 85130240. Throughput: 0: 12865.3. Samples: 85095368. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:02:45,069][231894] Avg episode reward: [(0, '196.641')] [2023-03-07 18:02:45,420][232226] Updated weights for policy 0, policy_version 83140 (0.0007) [2023-03-07 18:02:46,193][232226] Updated weights for policy 0, policy_version 83150 (0.0006) [2023-03-07 18:02:46,991][232226] Updated weights for policy 0, policy_version 83160 (0.0006) [2023-03-07 18:02:47,782][232226] Updated weights for policy 0, policy_version 83170 (0.0007) [2023-03-07 18:02:48,577][232226] Updated weights for policy 0, policy_version 83180 (0.0007) [2023-03-07 18:02:49,378][232226] Updated weights for policy 0, policy_version 83190 (0.0006) [2023-03-07 18:02:50,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12867.7). Total num frames: 85194752. Throughput: 0: 12858.1. Samples: 85172779. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:02:50,080][231894] Avg episode reward: [(0, '195.468')] [2023-03-07 18:02:50,180][232226] Updated weights for policy 0, policy_version 83200 (0.0006) [2023-03-07 18:02:50,958][232226] Updated weights for policy 0, policy_version 83210 (0.0005) [2023-03-07 18:02:51,760][232226] Updated weights for policy 0, policy_version 83220 (0.0007) [2023-03-07 18:02:52,557][232226] Updated weights for policy 0, policy_version 83230 (0.0007) [2023-03-07 18:02:53,341][232226] Updated weights for policy 0, policy_version 83240 (0.0007) [2023-03-07 18:02:54,146][232226] Updated weights for policy 0, policy_version 83250 (0.0006) [2023-03-07 18:02:54,962][232226] Updated weights for policy 0, policy_version 83260 (0.0006) [2023-03-07 18:02:55,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.2, 300 sec: 12867.7). Total num frames: 85259264. Throughput: 0: 12866.6. Samples: 85250172. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:02:55,080][231894] Avg episode reward: [(0, '196.396')] [2023-03-07 18:02:55,765][232226] Updated weights for policy 0, policy_version 83270 (0.0006) [2023-03-07 18:02:56,570][232226] Updated weights for policy 0, policy_version 83280 (0.0007) [2023-03-07 18:02:57,362][232226] Updated weights for policy 0, policy_version 83290 (0.0006) [2023-03-07 18:02:58,147][232226] Updated weights for policy 0, policy_version 83300 (0.0007) [2023-03-07 18:02:58,927][232226] Updated weights for policy 0, policy_version 83310 (0.0007) [2023-03-07 18:02:59,734][232226] Updated weights for policy 0, policy_version 83320 (0.0007) [2023-03-07 18:03:00,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 85323776. Throughput: 0: 12859.8. Samples: 85288351. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:03:00,080][231894] Avg episode reward: [(0, '196.350')] [2023-03-07 18:03:00,514][232226] Updated weights for policy 0, policy_version 83330 (0.0006) [2023-03-07 18:03:01,303][232226] Updated weights for policy 0, policy_version 83340 (0.0007) [2023-03-07 18:03:02,119][232226] Updated weights for policy 0, policy_version 83350 (0.0007) [2023-03-07 18:03:02,909][232226] Updated weights for policy 0, policy_version 83360 (0.0008) [2023-03-07 18:03:03,693][232226] Updated weights for policy 0, policy_version 83370 (0.0006) [2023-03-07 18:03:04,498][232226] Updated weights for policy 0, policy_version 83380 (0.0007) [2023-03-07 18:03:05,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 85388288. Throughput: 0: 12866.3. Samples: 85365844. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:03:05,080][231894] Avg episode reward: [(0, '188.588')] [2023-03-07 18:03:05,282][232226] Updated weights for policy 0, policy_version 83390 (0.0007) [2023-03-07 18:03:06,083][232226] Updated weights for policy 0, policy_version 83400 (0.0008) [2023-03-07 18:03:06,881][232226] Updated weights for policy 0, policy_version 83410 (0.0006) [2023-03-07 18:03:07,689][232226] Updated weights for policy 0, policy_version 83420 (0.0006) [2023-03-07 18:03:08,492][232226] Updated weights for policy 0, policy_version 83430 (0.0006) [2023-03-07 18:03:09,308][232226] Updated weights for policy 0, policy_version 83440 (0.0006) [2023-03-07 18:03:10,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12867.7). Total num frames: 85451776. Throughput: 0: 12861.3. Samples: 85442723. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:03:10,069][231894] Avg episode reward: [(0, '199.179')] [2023-03-07 18:03:10,097][232226] Updated weights for policy 0, policy_version 83450 (0.0006) [2023-03-07 18:03:10,895][232226] Updated weights for policy 0, policy_version 83460 (0.0006) [2023-03-07 18:03:11,676][232226] Updated weights for policy 0, policy_version 83470 (0.0006) [2023-03-07 18:03:12,494][232226] Updated weights for policy 0, policy_version 83480 (0.0007) [2023-03-07 18:03:13,266][232226] Updated weights for policy 0, policy_version 83490 (0.0007) [2023-03-07 18:03:14,072][232226] Updated weights for policy 0, policy_version 83500 (0.0006) [2023-03-07 18:03:14,869][232226] Updated weights for policy 0, policy_version 83510 (0.0006) [2023-03-07 18:03:15,069][231894] Fps is (10 sec: 12799.8, 60 sec: 12868.2, 300 sec: 12867.7). Total num frames: 85516288. Throughput: 0: 12866.2. Samples: 85481434. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:03:15,070][231894] Avg episode reward: [(0, '196.192')] [2023-03-07 18:03:15,663][232226] Updated weights for policy 0, policy_version 83520 (0.0006) [2023-03-07 18:03:16,458][232226] Updated weights for policy 0, policy_version 83530 (0.0007) [2023-03-07 18:03:17,250][232226] Updated weights for policy 0, policy_version 83540 (0.0007) [2023-03-07 18:03:18,053][232226] Updated weights for policy 0, policy_version 83550 (0.0008) [2023-03-07 18:03:18,846][232226] Updated weights for policy 0, policy_version 83560 (0.0006) [2023-03-07 18:03:19,633][232226] Updated weights for policy 0, policy_version 83570 (0.0006) [2023-03-07 18:03:20,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.2, 300 sec: 12864.2). Total num frames: 85580800. Throughput: 0: 12867.0. Samples: 85558615. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:03:20,070][231894] Avg episode reward: [(0, '201.938')] [2023-03-07 18:03:20,436][232226] Updated weights for policy 0, policy_version 83580 (0.0007) [2023-03-07 18:03:21,228][232226] Updated weights for policy 0, policy_version 83590 (0.0006) [2023-03-07 18:03:22,026][232226] Updated weights for policy 0, policy_version 83600 (0.0007) [2023-03-07 18:03:22,800][232226] Updated weights for policy 0, policy_version 83610 (0.0006) [2023-03-07 18:03:23,603][232226] Updated weights for policy 0, policy_version 83620 (0.0006) [2023-03-07 18:03:24,401][232226] Updated weights for policy 0, policy_version 83630 (0.0007) [2023-03-07 18:03:25,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12867.7). Total num frames: 85645312. Throughput: 0: 12873.8. Samples: 85636060. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:03:25,069][231894] Avg episode reward: [(0, '191.739')] [2023-03-07 18:03:25,187][232226] Updated weights for policy 0, policy_version 83640 (0.0007) [2023-03-07 18:03:25,989][232226] Updated weights for policy 0, policy_version 83650 (0.0006) [2023-03-07 18:03:26,789][232226] Updated weights for policy 0, policy_version 83660 (0.0006) [2023-03-07 18:03:27,591][232226] Updated weights for policy 0, policy_version 83670 (0.0007) [2023-03-07 18:03:28,382][232226] Updated weights for policy 0, policy_version 83680 (0.0006) [2023-03-07 18:03:29,182][232226] Updated weights for policy 0, policy_version 83690 (0.0006) [2023-03-07 18:03:29,983][232226] Updated weights for policy 0, policy_version 83700 (0.0007) [2023-03-07 18:03:30,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12868.3, 300 sec: 12864.2). Total num frames: 85708800. Throughput: 0: 12869.9. Samples: 85674512. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:03:30,069][231894] Avg episode reward: [(0, '192.506')] [2023-03-07 18:03:30,781][232226] Updated weights for policy 0, policy_version 83710 (0.0006) [2023-03-07 18:03:31,561][232226] Updated weights for policy 0, policy_version 83720 (0.0007) [2023-03-07 18:03:32,377][232226] Updated weights for policy 0, policy_version 83730 (0.0006) [2023-03-07 18:03:33,170][232226] Updated weights for policy 0, policy_version 83740 (0.0006) [2023-03-07 18:03:33,971][232226] Updated weights for policy 0, policy_version 83750 (0.0006) [2023-03-07 18:03:34,763][232226] Updated weights for policy 0, policy_version 83760 (0.0006) [2023-03-07 18:03:35,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12867.7). Total num frames: 85774336. Throughput: 0: 12860.3. Samples: 85751491. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:03:35,070][231894] Avg episode reward: [(0, '190.233')] [2023-03-07 18:03:35,548][232226] Updated weights for policy 0, policy_version 83770 (0.0006) [2023-03-07 18:03:36,363][232226] Updated weights for policy 0, policy_version 83780 (0.0007) [2023-03-07 18:03:37,157][232226] Updated weights for policy 0, policy_version 83790 (0.0007) [2023-03-07 18:03:37,930][232226] Updated weights for policy 0, policy_version 83800 (0.0007) [2023-03-07 18:03:38,726][232226] Updated weights for policy 0, policy_version 83810 (0.0007) [2023-03-07 18:03:39,518][232226] Updated weights for policy 0, policy_version 83820 (0.0006) [2023-03-07 18:03:40,069][231894] Fps is (10 sec: 13004.7, 60 sec: 12885.3, 300 sec: 12867.7). Total num frames: 85838848. Throughput: 0: 12859.2. Samples: 85828835. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:03:40,069][231894] Avg episode reward: [(0, '189.699')] [2023-03-07 18:03:40,306][232226] Updated weights for policy 0, policy_version 83830 (0.0006) [2023-03-07 18:03:41,108][232226] Updated weights for policy 0, policy_version 83840 (0.0007) [2023-03-07 18:03:41,936][232226] Updated weights for policy 0, policy_version 83850 (0.0006) [2023-03-07 18:03:42,701][232226] Updated weights for policy 0, policy_version 83860 (0.0006) [2023-03-07 18:03:43,494][232226] Updated weights for policy 0, policy_version 83870 (0.0006) [2023-03-07 18:03:44,319][232226] Updated weights for policy 0, policy_version 83880 (0.0008) [2023-03-07 18:03:45,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12864.2). Total num frames: 85902336. Throughput: 0: 12868.4. Samples: 85867429. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:03:45,069][231894] Avg episode reward: [(0, '191.861')] [2023-03-07 18:03:45,086][232226] Updated weights for policy 0, policy_version 83890 (0.0006) [2023-03-07 18:03:45,884][232226] Updated weights for policy 0, policy_version 83900 (0.0007) [2023-03-07 18:03:46,677][232226] Updated weights for policy 0, policy_version 83910 (0.0007) [2023-03-07 18:03:47,447][232226] Updated weights for policy 0, policy_version 83920 (0.0006) [2023-03-07 18:03:48,266][232226] Updated weights for policy 0, policy_version 83930 (0.0006) [2023-03-07 18:03:49,047][232226] Updated weights for policy 0, policy_version 83940 (0.0006) [2023-03-07 18:03:49,846][232226] Updated weights for policy 0, policy_version 83950 (0.0006) [2023-03-07 18:03:50,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12868.3, 300 sec: 12867.7). Total num frames: 85966848. Throughput: 0: 12869.4. Samples: 85944967. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:03:50,069][231894] Avg episode reward: [(0, '181.820')] [2023-03-07 18:03:50,640][232226] Updated weights for policy 0, policy_version 83960 (0.0006) [2023-03-07 18:03:51,410][232226] Updated weights for policy 0, policy_version 83970 (0.0006) [2023-03-07 18:03:52,213][232226] Updated weights for policy 0, policy_version 83980 (0.0007) [2023-03-07 18:03:53,007][232226] Updated weights for policy 0, policy_version 83990 (0.0006) [2023-03-07 18:03:53,813][232226] Updated weights for policy 0, policy_version 84000 (0.0006) [2023-03-07 18:03:54,601][232226] Updated weights for policy 0, policy_version 84010 (0.0005) [2023-03-07 18:03:55,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12867.7). Total num frames: 86031360. Throughput: 0: 12881.8. Samples: 86022405. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:03:55,069][231894] Avg episode reward: [(0, '201.649')] [2023-03-07 18:03:55,399][232226] Updated weights for policy 0, policy_version 84020 (0.0007) [2023-03-07 18:03:56,198][232226] Updated weights for policy 0, policy_version 84030 (0.0008) [2023-03-07 18:03:56,984][232226] Updated weights for policy 0, policy_version 84040 (0.0007) [2023-03-07 18:03:57,775][232226] Updated weights for policy 0, policy_version 84050 (0.0007) [2023-03-07 18:03:58,573][232226] Updated weights for policy 0, policy_version 84060 (0.0006) [2023-03-07 18:03:59,374][232226] Updated weights for policy 0, policy_version 84070 (0.0006) [2023-03-07 18:04:00,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12867.7). Total num frames: 86095872. Throughput: 0: 12884.8. Samples: 86061250. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:04:00,069][231894] Avg episode reward: [(0, '191.333')] [2023-03-07 18:04:00,163][232226] Updated weights for policy 0, policy_version 84080 (0.0008) [2023-03-07 18:04:00,957][232226] Updated weights for policy 0, policy_version 84090 (0.0007) [2023-03-07 18:04:01,760][232226] Updated weights for policy 0, policy_version 84100 (0.0007) [2023-03-07 18:04:02,558][232226] Updated weights for policy 0, policy_version 84110 (0.0007) [2023-03-07 18:04:03,361][232226] Updated weights for policy 0, policy_version 84120 (0.0006) [2023-03-07 18:04:04,162][232226] Updated weights for policy 0, policy_version 84130 (0.0006) [2023-03-07 18:04:04,934][232226] Updated weights for policy 0, policy_version 84140 (0.0006) [2023-03-07 18:04:05,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12868.3, 300 sec: 12867.7). Total num frames: 86160384. Throughput: 0: 12885.7. Samples: 86138469. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:04:05,069][231894] Avg episode reward: [(0, '198.335')] [2023-03-07 18:04:05,732][232226] Updated weights for policy 0, policy_version 84150 (0.0006) [2023-03-07 18:04:06,523][232226] Updated weights for policy 0, policy_version 84160 (0.0006) [2023-03-07 18:04:07,314][232226] Updated weights for policy 0, policy_version 84170 (0.0007) [2023-03-07 18:04:08,120][232226] Updated weights for policy 0, policy_version 84180 (0.0007) [2023-03-07 18:04:08,924][232226] Updated weights for policy 0, policy_version 84190 (0.0006) [2023-03-07 18:04:09,726][232226] Updated weights for policy 0, policy_version 84200 (0.0007) [2023-03-07 18:04:10,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12867.7). Total num frames: 86224896. Throughput: 0: 12877.1. Samples: 86215530. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:04:10,080][231894] Avg episode reward: [(0, '205.253')] [2023-03-07 18:04:10,498][232226] Updated weights for policy 0, policy_version 84210 (0.0007) [2023-03-07 18:04:11,293][232226] Updated weights for policy 0, policy_version 84220 (0.0006) [2023-03-07 18:04:12,099][232226] Updated weights for policy 0, policy_version 84230 (0.0006) [2023-03-07 18:04:12,865][232226] Updated weights for policy 0, policy_version 84240 (0.0007) [2023-03-07 18:04:13,678][232226] Updated weights for policy 0, policy_version 84250 (0.0006) [2023-03-07 18:04:14,461][232226] Updated weights for policy 0, policy_version 84260 (0.0006) [2023-03-07 18:04:15,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.4, 300 sec: 12871.2). Total num frames: 86289408. Throughput: 0: 12883.1. Samples: 86254253. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:04:15,080][231894] Avg episode reward: [(0, '196.777')] [2023-03-07 18:04:15,264][232226] Updated weights for policy 0, policy_version 84270 (0.0007) [2023-03-07 18:04:16,061][232226] Updated weights for policy 0, policy_version 84280 (0.0006) [2023-03-07 18:04:16,850][232226] Updated weights for policy 0, policy_version 84290 (0.0006) [2023-03-07 18:04:17,656][232226] Updated weights for policy 0, policy_version 84300 (0.0006) [2023-03-07 18:04:18,428][232226] Updated weights for policy 0, policy_version 84310 (0.0007) [2023-03-07 18:04:19,247][232226] Updated weights for policy 0, policy_version 84320 (0.0006) [2023-03-07 18:04:20,037][232226] Updated weights for policy 0, policy_version 84330 (0.0006) [2023-03-07 18:04:20,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.4, 300 sec: 12871.2). Total num frames: 86353920. Throughput: 0: 12894.5. Samples: 86331741. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:04:20,080][231894] Avg episode reward: [(0, '197.656')] [2023-03-07 18:04:20,814][232226] Updated weights for policy 0, policy_version 84340 (0.0006) [2023-03-07 18:04:21,637][232226] Updated weights for policy 0, policy_version 84350 (0.0006) [2023-03-07 18:04:22,414][232226] Updated weights for policy 0, policy_version 84360 (0.0006) [2023-03-07 18:04:23,194][232226] Updated weights for policy 0, policy_version 84370 (0.0007) [2023-03-07 18:04:23,987][232226] Updated weights for policy 0, policy_version 84380 (0.0006) [2023-03-07 18:04:24,787][232226] Updated weights for policy 0, policy_version 84390 (0.0006) [2023-03-07 18:04:25,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12871.2). Total num frames: 86418432. Throughput: 0: 12897.5. Samples: 86409219. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:04:25,080][231894] Avg episode reward: [(0, '189.257')] [2023-03-07 18:04:25,083][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000084393_86418432.pth... [2023-03-07 18:04:25,113][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000081377_83330048.pth [2023-03-07 18:04:25,588][232226] Updated weights for policy 0, policy_version 84400 (0.0006) [2023-03-07 18:04:26,385][232226] Updated weights for policy 0, policy_version 84410 (0.0006) [2023-03-07 18:04:27,190][232226] Updated weights for policy 0, policy_version 84420 (0.0006) [2023-03-07 18:04:28,000][232226] Updated weights for policy 0, policy_version 84430 (0.0007) [2023-03-07 18:04:28,794][232226] Updated weights for policy 0, policy_version 84440 (0.0007) [2023-03-07 18:04:29,574][232226] Updated weights for policy 0, policy_version 84450 (0.0006) [2023-03-07 18:04:30,069][231894] Fps is (10 sec: 12902.2, 60 sec: 12902.4, 300 sec: 12871.2). Total num frames: 86482944. Throughput: 0: 12892.8. Samples: 86447608. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:04:30,080][231894] Avg episode reward: [(0, '187.724')] [2023-03-07 18:04:30,364][232226] Updated weights for policy 0, policy_version 84460 (0.0007) [2023-03-07 18:04:31,182][232226] Updated weights for policy 0, policy_version 84470 (0.0006) [2023-03-07 18:04:31,960][232226] Updated weights for policy 0, policy_version 84480 (0.0007) [2023-03-07 18:04:32,742][232226] Updated weights for policy 0, policy_version 84490 (0.0007) [2023-03-07 18:04:33,551][232226] Updated weights for policy 0, policy_version 84500 (0.0006) [2023-03-07 18:04:34,348][232226] Updated weights for policy 0, policy_version 84510 (0.0007) [2023-03-07 18:04:35,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12871.2). Total num frames: 86547456. Throughput: 0: 12884.1. Samples: 86524752. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:04:35,080][231894] Avg episode reward: [(0, '203.846')] [2023-03-07 18:04:35,136][232226] Updated weights for policy 0, policy_version 84520 (0.0007) [2023-03-07 18:04:35,931][232226] Updated weights for policy 0, policy_version 84530 (0.0006) [2023-03-07 18:04:36,718][232226] Updated weights for policy 0, policy_version 84540 (0.0007) [2023-03-07 18:04:37,530][232226] Updated weights for policy 0, policy_version 84550 (0.0007) [2023-03-07 18:04:38,324][232226] Updated weights for policy 0, policy_version 84560 (0.0006) [2023-03-07 18:04:39,114][232226] Updated weights for policy 0, policy_version 84570 (0.0006) [2023-03-07 18:04:39,898][232226] Updated weights for policy 0, policy_version 84580 (0.0007) [2023-03-07 18:04:40,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12868.3, 300 sec: 12867.7). Total num frames: 86610944. Throughput: 0: 12883.8. Samples: 86602175. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:04:40,080][231894] Avg episode reward: [(0, '199.730')] [2023-03-07 18:04:40,718][232226] Updated weights for policy 0, policy_version 84590 (0.0006) [2023-03-07 18:04:41,502][232226] Updated weights for policy 0, policy_version 84600 (0.0006) [2023-03-07 18:04:42,289][232226] Updated weights for policy 0, policy_version 84610 (0.0007) [2023-03-07 18:04:43,088][232226] Updated weights for policy 0, policy_version 84620 (0.0006) [2023-03-07 18:04:43,884][232226] Updated weights for policy 0, policy_version 84630 (0.0007) [2023-03-07 18:04:44,673][232226] Updated weights for policy 0, policy_version 84640 (0.0005) [2023-03-07 18:04:45,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12902.4, 300 sec: 12871.2). Total num frames: 86676480. Throughput: 0: 12880.9. Samples: 86640892. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:04:45,080][231894] Avg episode reward: [(0, '188.247')] [2023-03-07 18:04:45,478][232226] Updated weights for policy 0, policy_version 84650 (0.0006) [2023-03-07 18:04:46,277][232226] Updated weights for policy 0, policy_version 84660 (0.0006) [2023-03-07 18:04:47,045][232226] Updated weights for policy 0, policy_version 84670 (0.0006) [2023-03-07 18:04:47,834][232226] Updated weights for policy 0, policy_version 84680 (0.0008) [2023-03-07 18:04:48,633][232226] Updated weights for policy 0, policy_version 84690 (0.0006) [2023-03-07 18:04:49,417][232226] Updated weights for policy 0, policy_version 84700 (0.0006) [2023-03-07 18:04:50,069][231894] Fps is (10 sec: 13004.9, 60 sec: 12902.4, 300 sec: 12874.6). Total num frames: 86740992. Throughput: 0: 12887.3. Samples: 86718397. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:04:50,080][231894] Avg episode reward: [(0, '191.687')] [2023-03-07 18:04:50,229][232226] Updated weights for policy 0, policy_version 84710 (0.0007) [2023-03-07 18:04:51,023][232226] Updated weights for policy 0, policy_version 84720 (0.0007) [2023-03-07 18:04:51,804][232226] Updated weights for policy 0, policy_version 84730 (0.0006) [2023-03-07 18:04:52,602][232226] Updated weights for policy 0, policy_version 84740 (0.0006) [2023-03-07 18:04:53,405][232226] Updated weights for policy 0, policy_version 84750 (0.0006) [2023-03-07 18:04:54,188][232226] Updated weights for policy 0, policy_version 84760 (0.0007) [2023-03-07 18:04:54,998][232226] Updated weights for policy 0, policy_version 84770 (0.0006) [2023-03-07 18:04:55,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12885.3, 300 sec: 12871.2). Total num frames: 86804480. Throughput: 0: 12893.2. Samples: 86795725. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:04:55,080][231894] Avg episode reward: [(0, '194.168')] [2023-03-07 18:04:55,774][232226] Updated weights for policy 0, policy_version 84780 (0.0006) [2023-03-07 18:04:56,572][232226] Updated weights for policy 0, policy_version 84790 (0.0006) [2023-03-07 18:04:57,364][232226] Updated weights for policy 0, policy_version 84800 (0.0006) [2023-03-07 18:04:58,153][232226] Updated weights for policy 0, policy_version 84810 (0.0006) [2023-03-07 18:04:58,949][232226] Updated weights for policy 0, policy_version 84820 (0.0006) [2023-03-07 18:04:59,746][232226] Updated weights for policy 0, policy_version 84830 (0.0006) [2023-03-07 18:05:00,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12885.3, 300 sec: 12871.2). Total num frames: 86868992. Throughput: 0: 12894.5. Samples: 86834505. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:05:00,069][231894] Avg episode reward: [(0, '190.874')] [2023-03-07 18:05:00,551][232226] Updated weights for policy 0, policy_version 84840 (0.0007) [2023-03-07 18:05:01,335][232226] Updated weights for policy 0, policy_version 84850 (0.0006) [2023-03-07 18:05:02,134][232226] Updated weights for policy 0, policy_version 84860 (0.0007) [2023-03-07 18:05:02,924][232226] Updated weights for policy 0, policy_version 84870 (0.0006) [2023-03-07 18:05:03,728][232226] Updated weights for policy 0, policy_version 84880 (0.0007) [2023-03-07 18:05:04,527][232226] Updated weights for policy 0, policy_version 84890 (0.0006) [2023-03-07 18:05:05,069][231894] Fps is (10 sec: 13004.8, 60 sec: 12902.4, 300 sec: 12874.6). Total num frames: 86934528. Throughput: 0: 12890.9. Samples: 86911832. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:05:05,069][231894] Avg episode reward: [(0, '200.472')] [2023-03-07 18:05:05,309][232226] Updated weights for policy 0, policy_version 84900 (0.0006) [2023-03-07 18:05:06,105][232226] Updated weights for policy 0, policy_version 84910 (0.0006) [2023-03-07 18:05:06,882][232226] Updated weights for policy 0, policy_version 84920 (0.0008) [2023-03-07 18:05:07,677][232226] Updated weights for policy 0, policy_version 84930 (0.0006) [2023-03-07 18:05:08,466][232226] Updated weights for policy 0, policy_version 84940 (0.0007) [2023-03-07 18:05:09,255][232226] Updated weights for policy 0, policy_version 84950 (0.0006) [2023-03-07 18:05:10,068][232226] Updated weights for policy 0, policy_version 84960 (0.0006) [2023-03-07 18:05:10,069][231894] Fps is (10 sec: 13004.9, 60 sec: 12902.4, 300 sec: 12874.6). Total num frames: 86999040. Throughput: 0: 12891.9. Samples: 86989355. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:05:10,069][231894] Avg episode reward: [(0, '196.185')] [2023-03-07 18:05:10,868][232226] Updated weights for policy 0, policy_version 84970 (0.0007) [2023-03-07 18:05:11,663][232226] Updated weights for policy 0, policy_version 84980 (0.0006) [2023-03-07 18:05:12,455][232226] Updated weights for policy 0, policy_version 84990 (0.0006) [2023-03-07 18:05:13,250][232226] Updated weights for policy 0, policy_version 85000 (0.0006) [2023-03-07 18:05:14,044][232226] Updated weights for policy 0, policy_version 85010 (0.0006) [2023-03-07 18:05:14,838][232226] Updated weights for policy 0, policy_version 85020 (0.0007) [2023-03-07 18:05:15,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12885.3, 300 sec: 12871.2). Total num frames: 87062528. Throughput: 0: 12896.7. Samples: 87027958. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:05:15,069][231894] Avg episode reward: [(0, '197.336')] [2023-03-07 18:05:15,646][232226] Updated weights for policy 0, policy_version 85030 (0.0006) [2023-03-07 18:05:16,441][232226] Updated weights for policy 0, policy_version 85040 (0.0006) [2023-03-07 18:05:17,218][232226] Updated weights for policy 0, policy_version 85050 (0.0006) [2023-03-07 18:05:18,009][232226] Updated weights for policy 0, policy_version 85060 (0.0007) [2023-03-07 18:05:18,801][232226] Updated weights for policy 0, policy_version 85070 (0.0006) [2023-03-07 18:05:19,599][232226] Updated weights for policy 0, policy_version 85080 (0.0007) [2023-03-07 18:05:20,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12885.3, 300 sec: 12871.2). Total num frames: 87127040. Throughput: 0: 12896.3. Samples: 87105087. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:05:20,069][231894] Avg episode reward: [(0, '191.321')] [2023-03-07 18:05:20,394][232226] Updated weights for policy 0, policy_version 85090 (0.0006) [2023-03-07 18:05:21,191][232226] Updated weights for policy 0, policy_version 85100 (0.0006) [2023-03-07 18:05:21,990][232226] Updated weights for policy 0, policy_version 85110 (0.0007) [2023-03-07 18:05:22,790][232226] Updated weights for policy 0, policy_version 85120 (0.0006) [2023-03-07 18:05:23,580][232226] Updated weights for policy 0, policy_version 85130 (0.0006) [2023-03-07 18:05:24,386][232226] Updated weights for policy 0, policy_version 85140 (0.0006) [2023-03-07 18:05:25,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12874.6). Total num frames: 87191552. Throughput: 0: 12896.9. Samples: 87182536. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:05:25,069][231894] Avg episode reward: [(0, '191.469')] [2023-03-07 18:05:25,197][232226] Updated weights for policy 0, policy_version 85150 (0.0006) [2023-03-07 18:05:25,985][232226] Updated weights for policy 0, policy_version 85160 (0.0007) [2023-03-07 18:05:26,770][232226] Updated weights for policy 0, policy_version 85170 (0.0006) [2023-03-07 18:05:27,566][232226] Updated weights for policy 0, policy_version 85180 (0.0006) [2023-03-07 18:05:28,358][232226] Updated weights for policy 0, policy_version 85190 (0.0006) [2023-03-07 18:05:29,153][232226] Updated weights for policy 0, policy_version 85200 (0.0007) [2023-03-07 18:05:29,966][232226] Updated weights for policy 0, policy_version 85210 (0.0006) [2023-03-07 18:05:30,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12874.6). Total num frames: 87256064. Throughput: 0: 12892.5. Samples: 87221056. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:05:30,070][231894] Avg episode reward: [(0, '193.532')] [2023-03-07 18:05:30,755][232226] Updated weights for policy 0, policy_version 85220 (0.0006) [2023-03-07 18:05:31,547][232226] Updated weights for policy 0, policy_version 85230 (0.0006) [2023-03-07 18:05:32,340][232226] Updated weights for policy 0, policy_version 85240 (0.0006) [2023-03-07 18:05:33,131][232226] Updated weights for policy 0, policy_version 85250 (0.0006) [2023-03-07 18:05:33,918][232226] Updated weights for policy 0, policy_version 85260 (0.0006) [2023-03-07 18:05:34,718][232226] Updated weights for policy 0, policy_version 85270 (0.0007) [2023-03-07 18:05:35,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12874.6). Total num frames: 87320576. Throughput: 0: 12885.8. Samples: 87298260. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:05:35,069][231894] Avg episode reward: [(0, '191.385')] [2023-03-07 18:05:35,493][232226] Updated weights for policy 0, policy_version 85280 (0.0007) [2023-03-07 18:05:36,269][232226] Updated weights for policy 0, policy_version 85290 (0.0006) [2023-03-07 18:05:37,081][232226] Updated weights for policy 0, policy_version 85300 (0.0006) [2023-03-07 18:05:37,871][232226] Updated weights for policy 0, policy_version 85310 (0.0007) [2023-03-07 18:05:38,666][232226] Updated weights for policy 0, policy_version 85320 (0.0006) [2023-03-07 18:05:39,447][232226] Updated weights for policy 0, policy_version 85330 (0.0006) [2023-03-07 18:05:40,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12874.6). Total num frames: 87385088. Throughput: 0: 12896.5. Samples: 87376065. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:05:40,069][231894] Avg episode reward: [(0, '196.860')] [2023-03-07 18:05:40,242][232226] Updated weights for policy 0, policy_version 85340 (0.0006) [2023-03-07 18:05:41,043][232226] Updated weights for policy 0, policy_version 85350 (0.0006) [2023-03-07 18:05:41,848][232226] Updated weights for policy 0, policy_version 85360 (0.0007) [2023-03-07 18:05:42,651][232226] Updated weights for policy 0, policy_version 85370 (0.0007) [2023-03-07 18:05:43,431][232226] Updated weights for policy 0, policy_version 85380 (0.0006) [2023-03-07 18:05:44,241][232226] Updated weights for policy 0, policy_version 85390 (0.0006) [2023-03-07 18:05:45,032][232226] Updated weights for policy 0, policy_version 85400 (0.0006) [2023-03-07 18:05:45,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12874.6). Total num frames: 87449600. Throughput: 0: 12887.7. Samples: 87414454. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:05:45,069][231894] Avg episode reward: [(0, '199.404')] [2023-03-07 18:05:45,825][232226] Updated weights for policy 0, policy_version 85410 (0.0006) [2023-03-07 18:05:46,614][232226] Updated weights for policy 0, policy_version 85420 (0.0006) [2023-03-07 18:05:47,430][232226] Updated weights for policy 0, policy_version 85430 (0.0006) [2023-03-07 18:05:48,225][232226] Updated weights for policy 0, policy_version 85440 (0.0007) [2023-03-07 18:05:49,011][232226] Updated weights for policy 0, policy_version 85450 (0.0006) [2023-03-07 18:05:49,806][232226] Updated weights for policy 0, policy_version 85460 (0.0006) [2023-03-07 18:05:50,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12874.6). Total num frames: 87514112. Throughput: 0: 12885.8. Samples: 87491693. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:05:50,069][231894] Avg episode reward: [(0, '194.249')] [2023-03-07 18:05:50,606][232226] Updated weights for policy 0, policy_version 85470 (0.0006) [2023-03-07 18:05:51,400][232226] Updated weights for policy 0, policy_version 85480 (0.0007) [2023-03-07 18:05:52,202][232226] Updated weights for policy 0, policy_version 85490 (0.0007) [2023-03-07 18:05:52,984][232226] Updated weights for policy 0, policy_version 85500 (0.0007) [2023-03-07 18:05:53,765][232226] Updated weights for policy 0, policy_version 85510 (0.0007) [2023-03-07 18:05:54,570][232226] Updated weights for policy 0, policy_version 85520 (0.0007) [2023-03-07 18:05:55,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12902.4, 300 sec: 12878.1). Total num frames: 87578624. Throughput: 0: 12887.3. Samples: 87569286. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:05:55,069][231894] Avg episode reward: [(0, '195.601')] [2023-03-07 18:05:55,364][232226] Updated weights for policy 0, policy_version 85530 (0.0007) [2023-03-07 18:05:56,153][232226] Updated weights for policy 0, policy_version 85540 (0.0006) [2023-03-07 18:05:56,934][232226] Updated weights for policy 0, policy_version 85550 (0.0006) [2023-03-07 18:05:57,760][232226] Updated weights for policy 0, policy_version 85560 (0.0006) [2023-03-07 18:05:58,553][232226] Updated weights for policy 0, policy_version 85570 (0.0007) [2023-03-07 18:05:59,337][232226] Updated weights for policy 0, policy_version 85580 (0.0007) [2023-03-07 18:06:00,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12902.4, 300 sec: 12878.1). Total num frames: 87643136. Throughput: 0: 12889.2. Samples: 87607970. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:06:00,069][231894] Avg episode reward: [(0, '197.155')] [2023-03-07 18:06:00,156][232226] Updated weights for policy 0, policy_version 85590 (0.0006) [2023-03-07 18:06:00,936][232226] Updated weights for policy 0, policy_version 85600 (0.0006) [2023-03-07 18:06:01,736][232226] Updated weights for policy 0, policy_version 85610 (0.0006) [2023-03-07 18:06:02,529][232226] Updated weights for policy 0, policy_version 85620 (0.0006) [2023-03-07 18:06:03,331][232226] Updated weights for policy 0, policy_version 85630 (0.0007) [2023-03-07 18:06:04,132][232226] Updated weights for policy 0, policy_version 85640 (0.0007) [2023-03-07 18:06:04,932][232226] Updated weights for policy 0, policy_version 85650 (0.0006) [2023-03-07 18:06:05,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 87706624. Throughput: 0: 12887.0. Samples: 87685003. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:06:05,069][231894] Avg episode reward: [(0, '196.675')] [2023-03-07 18:06:05,722][232226] Updated weights for policy 0, policy_version 85660 (0.0006) [2023-03-07 18:06:06,502][232226] Updated weights for policy 0, policy_version 85670 (0.0007) [2023-03-07 18:06:07,317][232226] Updated weights for policy 0, policy_version 85680 (0.0007) [2023-03-07 18:06:08,111][232226] Updated weights for policy 0, policy_version 85690 (0.0007) [2023-03-07 18:06:08,913][232226] Updated weights for policy 0, policy_version 85700 (0.0007) [2023-03-07 18:06:09,717][232226] Updated weights for policy 0, policy_version 85710 (0.0006) [2023-03-07 18:06:10,069][231894] Fps is (10 sec: 12799.8, 60 sec: 12868.2, 300 sec: 12874.6). Total num frames: 87771136. Throughput: 0: 12876.2. Samples: 87761965. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:06:10,069][231894] Avg episode reward: [(0, '181.931')] [2023-03-07 18:06:10,516][232226] Updated weights for policy 0, policy_version 85720 (0.0006) [2023-03-07 18:06:11,328][232226] Updated weights for policy 0, policy_version 85730 (0.0007) [2023-03-07 18:06:12,105][232226] Updated weights for policy 0, policy_version 85740 (0.0006) [2023-03-07 18:06:12,925][232226] Updated weights for policy 0, policy_version 85750 (0.0006) [2023-03-07 18:06:13,719][232226] Updated weights for policy 0, policy_version 85760 (0.0006) [2023-03-07 18:06:14,509][232226] Updated weights for policy 0, policy_version 85770 (0.0006) [2023-03-07 18:06:15,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12874.6). Total num frames: 87835648. Throughput: 0: 12875.0. Samples: 87800430. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:06:15,070][231894] Avg episode reward: [(0, '193.832')] [2023-03-07 18:06:15,303][232226] Updated weights for policy 0, policy_version 85780 (0.0006) [2023-03-07 18:06:16,101][232226] Updated weights for policy 0, policy_version 85790 (0.0006) [2023-03-07 18:06:16,894][232226] Updated weights for policy 0, policy_version 85800 (0.0006) [2023-03-07 18:06:17,710][232226] Updated weights for policy 0, policy_version 85810 (0.0007) [2023-03-07 18:06:18,506][232226] Updated weights for policy 0, policy_version 85820 (0.0007) [2023-03-07 18:06:19,304][232226] Updated weights for policy 0, policy_version 85830 (0.0007) [2023-03-07 18:06:20,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 87899136. Throughput: 0: 12862.3. Samples: 87877064. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:06:20,069][231894] Avg episode reward: [(0, '198.317')] [2023-03-07 18:06:20,101][232226] Updated weights for policy 0, policy_version 85840 (0.0006) [2023-03-07 18:06:20,896][232226] Updated weights for policy 0, policy_version 85850 (0.0006) [2023-03-07 18:06:21,699][232226] Updated weights for policy 0, policy_version 85860 (0.0006) [2023-03-07 18:06:22,485][232226] Updated weights for policy 0, policy_version 85870 (0.0007) [2023-03-07 18:06:23,274][232226] Updated weights for policy 0, policy_version 85880 (0.0006) [2023-03-07 18:06:24,069][232226] Updated weights for policy 0, policy_version 85890 (0.0007) [2023-03-07 18:06:24,865][232226] Updated weights for policy 0, policy_version 85900 (0.0006) [2023-03-07 18:06:25,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 87963648. Throughput: 0: 12851.1. Samples: 87954363. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:06:25,069][231894] Avg episode reward: [(0, '191.507')] [2023-03-07 18:06:25,074][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000085902_87963648.pth... [2023-03-07 18:06:25,104][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000082884_84873216.pth [2023-03-07 18:06:25,654][232226] Updated weights for policy 0, policy_version 85910 (0.0008) [2023-03-07 18:06:26,463][232226] Updated weights for policy 0, policy_version 85920 (0.0006) [2023-03-07 18:06:27,249][232226] Updated weights for policy 0, policy_version 85930 (0.0006) [2023-03-07 18:06:28,048][232226] Updated weights for policy 0, policy_version 85940 (0.0006) [2023-03-07 18:06:28,865][232226] Updated weights for policy 0, policy_version 85950 (0.0006) [2023-03-07 18:06:29,649][232226] Updated weights for policy 0, policy_version 85960 (0.0006) [2023-03-07 18:06:30,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 88028160. Throughput: 0: 12857.0. Samples: 87993018. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:06:30,080][231894] Avg episode reward: [(0, '192.526')] [2023-03-07 18:06:30,456][232226] Updated weights for policy 0, policy_version 85970 (0.0006) [2023-03-07 18:06:31,233][232226] Updated weights for policy 0, policy_version 85980 (0.0006) [2023-03-07 18:06:32,029][232226] Updated weights for policy 0, policy_version 85990 (0.0006) [2023-03-07 18:06:32,835][232226] Updated weights for policy 0, policy_version 86000 (0.0007) [2023-03-07 18:06:33,617][232226] Updated weights for policy 0, policy_version 86010 (0.0007) [2023-03-07 18:06:34,420][232226] Updated weights for policy 0, policy_version 86020 (0.0007) [2023-03-07 18:06:35,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12871.2). Total num frames: 88091648. Throughput: 0: 12854.6. Samples: 88070149. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:06:35,080][231894] Avg episode reward: [(0, '195.451')] [2023-03-07 18:06:35,248][232226] Updated weights for policy 0, policy_version 86030 (0.0007) [2023-03-07 18:06:36,027][232226] Updated weights for policy 0, policy_version 86040 (0.0006) [2023-03-07 18:06:36,810][232226] Updated weights for policy 0, policy_version 86050 (0.0007) [2023-03-07 18:06:37,604][232226] Updated weights for policy 0, policy_version 86060 (0.0007) [2023-03-07 18:06:38,391][232226] Updated weights for policy 0, policy_version 86070 (0.0006) [2023-03-07 18:06:39,193][232226] Updated weights for policy 0, policy_version 86080 (0.0006) [2023-03-07 18:06:39,993][232226] Updated weights for policy 0, policy_version 86090 (0.0006) [2023-03-07 18:06:40,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12851.2, 300 sec: 12874.6). Total num frames: 88156160. Throughput: 0: 12847.1. Samples: 88147405. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:06:40,080][231894] Avg episode reward: [(0, '199.279')] [2023-03-07 18:06:40,795][232226] Updated weights for policy 0, policy_version 86100 (0.0007) [2023-03-07 18:06:41,581][232226] Updated weights for policy 0, policy_version 86110 (0.0006) [2023-03-07 18:06:42,395][232226] Updated weights for policy 0, policy_version 86120 (0.0006) [2023-03-07 18:06:43,201][232226] Updated weights for policy 0, policy_version 86130 (0.0006) [2023-03-07 18:06:43,996][232226] Updated weights for policy 0, policy_version 86140 (0.0006) [2023-03-07 18:06:44,797][232226] Updated weights for policy 0, policy_version 86150 (0.0006) [2023-03-07 18:06:45,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12874.6). Total num frames: 88220672. Throughput: 0: 12846.1. Samples: 88186045. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:06:45,080][231894] Avg episode reward: [(0, '194.300')] [2023-03-07 18:06:45,600][232226] Updated weights for policy 0, policy_version 86160 (0.0006) [2023-03-07 18:06:46,378][232226] Updated weights for policy 0, policy_version 86170 (0.0006) [2023-03-07 18:06:47,179][232226] Updated weights for policy 0, policy_version 86180 (0.0006) [2023-03-07 18:06:47,955][232226] Updated weights for policy 0, policy_version 86190 (0.0006) [2023-03-07 18:06:48,765][232226] Updated weights for policy 0, policy_version 86200 (0.0007) [2023-03-07 18:06:49,556][232226] Updated weights for policy 0, policy_version 86210 (0.0007) [2023-03-07 18:06:50,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12874.6). Total num frames: 88285184. Throughput: 0: 12845.9. Samples: 88263067. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:06:50,080][231894] Avg episode reward: [(0, '193.511')] [2023-03-07 18:06:50,348][232226] Updated weights for policy 0, policy_version 86220 (0.0006) [2023-03-07 18:06:51,141][232226] Updated weights for policy 0, policy_version 86230 (0.0006) [2023-03-07 18:06:51,933][232226] Updated weights for policy 0, policy_version 86240 (0.0006) [2023-03-07 18:06:52,724][232226] Updated weights for policy 0, policy_version 86250 (0.0006) [2023-03-07 18:06:53,505][232226] Updated weights for policy 0, policy_version 86260 (0.0007) [2023-03-07 18:06:54,307][232226] Updated weights for policy 0, policy_version 86270 (0.0007) [2023-03-07 18:06:55,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12874.6). Total num frames: 88349696. Throughput: 0: 12854.3. Samples: 88340407. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:06:55,080][231894] Avg episode reward: [(0, '199.502')] [2023-03-07 18:06:55,113][232226] Updated weights for policy 0, policy_version 86280 (0.0006) [2023-03-07 18:06:55,903][232226] Updated weights for policy 0, policy_version 86290 (0.0007) [2023-03-07 18:06:56,706][232226] Updated weights for policy 0, policy_version 86300 (0.0008) [2023-03-07 18:06:57,502][232226] Updated weights for policy 0, policy_version 86310 (0.0006) [2023-03-07 18:06:58,313][232226] Updated weights for policy 0, policy_version 86320 (0.0006) [2023-03-07 18:06:59,094][232226] Updated weights for policy 0, policy_version 86330 (0.0006) [2023-03-07 18:06:59,897][232226] Updated weights for policy 0, policy_version 86340 (0.0006) [2023-03-07 18:07:00,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12851.2, 300 sec: 12874.6). Total num frames: 88414208. Throughput: 0: 12856.0. Samples: 88378948. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:07:00,069][231894] Avg episode reward: [(0, '195.843')] [2023-03-07 18:07:00,694][232226] Updated weights for policy 0, policy_version 86350 (0.0006) [2023-03-07 18:07:01,480][232226] Updated weights for policy 0, policy_version 86360 (0.0006) [2023-03-07 18:07:02,283][232226] Updated weights for policy 0, policy_version 86370 (0.0006) [2023-03-07 18:07:03,084][232226] Updated weights for policy 0, policy_version 86380 (0.0006) [2023-03-07 18:07:03,899][232226] Updated weights for policy 0, policy_version 86390 (0.0006) [2023-03-07 18:07:04,678][232226] Updated weights for policy 0, policy_version 86400 (0.0006) [2023-03-07 18:07:05,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 88478720. Throughput: 0: 12866.1. Samples: 88456040. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:07:05,070][231894] Avg episode reward: [(0, '197.798')] [2023-03-07 18:07:05,473][232226] Updated weights for policy 0, policy_version 86410 (0.0007) [2023-03-07 18:07:06,258][232226] Updated weights for policy 0, policy_version 86420 (0.0006) [2023-03-07 18:07:07,051][232226] Updated weights for policy 0, policy_version 86430 (0.0006) [2023-03-07 18:07:07,862][232226] Updated weights for policy 0, policy_version 86440 (0.0007) [2023-03-07 18:07:08,645][232226] Updated weights for policy 0, policy_version 86450 (0.0006) [2023-03-07 18:07:09,459][232226] Updated weights for policy 0, policy_version 86460 (0.0007) [2023-03-07 18:07:10,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12851.2, 300 sec: 12874.6). Total num frames: 88542208. Throughput: 0: 12864.4. Samples: 88533260. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:07:10,070][231894] Avg episode reward: [(0, '189.848')] [2023-03-07 18:07:10,246][232226] Updated weights for policy 0, policy_version 86470 (0.0006) [2023-03-07 18:07:11,028][232226] Updated weights for policy 0, policy_version 86480 (0.0007) [2023-03-07 18:07:11,832][232226] Updated weights for policy 0, policy_version 86490 (0.0006) [2023-03-07 18:07:12,622][232226] Updated weights for policy 0, policy_version 86500 (0.0006) [2023-03-07 18:07:13,413][232226] Updated weights for policy 0, policy_version 86510 (0.0006) [2023-03-07 18:07:14,205][232226] Updated weights for policy 0, policy_version 86520 (0.0006) [2023-03-07 18:07:15,018][232226] Updated weights for policy 0, policy_version 86530 (0.0007) [2023-03-07 18:07:15,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12851.2, 300 sec: 12874.6). Total num frames: 88606720. Throughput: 0: 12863.6. Samples: 88571882. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:07:15,069][231894] Avg episode reward: [(0, '199.956')] [2023-03-07 18:07:15,811][232226] Updated weights for policy 0, policy_version 86540 (0.0007) [2023-03-07 18:07:16,593][232226] Updated weights for policy 0, policy_version 86550 (0.0007) [2023-03-07 18:07:17,405][232226] Updated weights for policy 0, policy_version 86560 (0.0006) [2023-03-07 18:07:18,184][232226] Updated weights for policy 0, policy_version 86570 (0.0006) [2023-03-07 18:07:18,985][232226] Updated weights for policy 0, policy_version 86580 (0.0007) [2023-03-07 18:07:19,778][232226] Updated weights for policy 0, policy_version 86590 (0.0007) [2023-03-07 18:07:20,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 88671232. Throughput: 0: 12869.6. Samples: 88649279. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:07:20,069][231894] Avg episode reward: [(0, '197.067')] [2023-03-07 18:07:20,562][232226] Updated weights for policy 0, policy_version 86600 (0.0007) [2023-03-07 18:07:21,369][232226] Updated weights for policy 0, policy_version 86610 (0.0006) [2023-03-07 18:07:22,164][232226] Updated weights for policy 0, policy_version 86620 (0.0006) [2023-03-07 18:07:22,972][232226] Updated weights for policy 0, policy_version 86630 (0.0006) [2023-03-07 18:07:23,761][232226] Updated weights for policy 0, policy_version 86640 (0.0006) [2023-03-07 18:07:24,575][232226] Updated weights for policy 0, policy_version 86650 (0.0006) [2023-03-07 18:07:25,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12878.1). Total num frames: 88735744. Throughput: 0: 12866.6. Samples: 88726403. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:07:25,069][231894] Avg episode reward: [(0, '193.787')] [2023-03-07 18:07:25,345][232226] Updated weights for policy 0, policy_version 86660 (0.0007) [2023-03-07 18:07:26,162][232226] Updated weights for policy 0, policy_version 86670 (0.0007) [2023-03-07 18:07:26,948][232226] Updated weights for policy 0, policy_version 86680 (0.0006) [2023-03-07 18:07:27,754][232226] Updated weights for policy 0, policy_version 86690 (0.0006) [2023-03-07 18:07:28,545][232226] Updated weights for policy 0, policy_version 86700 (0.0007) [2023-03-07 18:07:29,346][232226] Updated weights for policy 0, policy_version 86710 (0.0006) [2023-03-07 18:07:30,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12878.1). Total num frames: 88800256. Throughput: 0: 12863.6. Samples: 88764908. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:07:30,069][231894] Avg episode reward: [(0, '201.176')] [2023-03-07 18:07:30,140][232226] Updated weights for policy 0, policy_version 86720 (0.0007) [2023-03-07 18:07:30,937][232226] Updated weights for policy 0, policy_version 86730 (0.0007) [2023-03-07 18:07:31,736][232226] Updated weights for policy 0, policy_version 86740 (0.0006) [2023-03-07 18:07:32,533][232226] Updated weights for policy 0, policy_version 86750 (0.0007) [2023-03-07 18:07:33,353][232226] Updated weights for policy 0, policy_version 86760 (0.0006) [2023-03-07 18:07:34,153][232226] Updated weights for policy 0, policy_version 86770 (0.0006) [2023-03-07 18:07:34,945][232226] Updated weights for policy 0, policy_version 86780 (0.0007) [2023-03-07 18:07:35,069][231894] Fps is (10 sec: 12799.8, 60 sec: 12868.2, 300 sec: 12874.6). Total num frames: 88863744. Throughput: 0: 12862.2. Samples: 88841868. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:07:35,070][231894] Avg episode reward: [(0, '201.532')] [2023-03-07 18:07:35,750][232226] Updated weights for policy 0, policy_version 86790 (0.0006) [2023-03-07 18:07:36,535][232226] Updated weights for policy 0, policy_version 86800 (0.0006) [2023-03-07 18:07:37,326][232226] Updated weights for policy 0, policy_version 86810 (0.0007) [2023-03-07 18:07:38,126][232226] Updated weights for policy 0, policy_version 86820 (0.0007) [2023-03-07 18:07:38,926][232226] Updated weights for policy 0, policy_version 86830 (0.0006) [2023-03-07 18:07:39,722][232226] Updated weights for policy 0, policy_version 86840 (0.0007) [2023-03-07 18:07:40,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 88928256. Throughput: 0: 12855.7. Samples: 88918911. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:07:40,069][231894] Avg episode reward: [(0, '197.197')] [2023-03-07 18:07:40,522][232226] Updated weights for policy 0, policy_version 86850 (0.0006) [2023-03-07 18:07:41,319][232226] Updated weights for policy 0, policy_version 86860 (0.0005) [2023-03-07 18:07:42,107][232226] Updated weights for policy 0, policy_version 86870 (0.0006) [2023-03-07 18:07:42,910][232226] Updated weights for policy 0, policy_version 86880 (0.0006) [2023-03-07 18:07:43,718][232226] Updated weights for policy 0, policy_version 86890 (0.0006) [2023-03-07 18:07:44,498][232226] Updated weights for policy 0, policy_version 86900 (0.0005) [2023-03-07 18:07:45,069][231894] Fps is (10 sec: 12902.7, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 88992768. Throughput: 0: 12856.3. Samples: 88957480. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:07:45,069][231894] Avg episode reward: [(0, '194.031')] [2023-03-07 18:07:45,294][232226] Updated weights for policy 0, policy_version 86910 (0.0006) [2023-03-07 18:07:46,074][232226] Updated weights for policy 0, policy_version 86920 (0.0007) [2023-03-07 18:07:46,873][232226] Updated weights for policy 0, policy_version 86930 (0.0006) [2023-03-07 18:07:47,666][232226] Updated weights for policy 0, policy_version 86940 (0.0007) [2023-03-07 18:07:48,467][232226] Updated weights for policy 0, policy_version 86950 (0.0006) [2023-03-07 18:07:49,267][232226] Updated weights for policy 0, policy_version 86960 (0.0008) [2023-03-07 18:07:50,061][232226] Updated weights for policy 0, policy_version 86970 (0.0006) [2023-03-07 18:07:50,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 89057280. Throughput: 0: 12860.6. Samples: 89034767. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:07:50,069][231894] Avg episode reward: [(0, '195.250')] [2023-03-07 18:07:50,854][232226] Updated weights for policy 0, policy_version 86980 (0.0007) [2023-03-07 18:07:51,656][232226] Updated weights for policy 0, policy_version 86990 (0.0007) [2023-03-07 18:07:52,451][232226] Updated weights for policy 0, policy_version 87000 (0.0006) [2023-03-07 18:07:53,261][232226] Updated weights for policy 0, policy_version 87010 (0.0007) [2023-03-07 18:07:54,043][232226] Updated weights for policy 0, policy_version 87020 (0.0007) [2023-03-07 18:07:54,840][232226] Updated weights for policy 0, policy_version 87030 (0.0007) [2023-03-07 18:07:55,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12851.2, 300 sec: 12871.2). Total num frames: 89120768. Throughput: 0: 12857.3. Samples: 89111840. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:07:55,069][231894] Avg episode reward: [(0, '194.308')] [2023-03-07 18:07:55,637][232226] Updated weights for policy 0, policy_version 87040 (0.0007) [2023-03-07 18:07:56,429][232226] Updated weights for policy 0, policy_version 87050 (0.0007) [2023-03-07 18:07:57,228][232226] Updated weights for policy 0, policy_version 87060 (0.0007) [2023-03-07 18:07:58,010][232226] Updated weights for policy 0, policy_version 87070 (0.0007) [2023-03-07 18:07:58,821][232226] Updated weights for policy 0, policy_version 87080 (0.0008) [2023-03-07 18:07:59,614][232226] Updated weights for policy 0, policy_version 87090 (0.0006) [2023-03-07 18:08:00,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12871.2). Total num frames: 89185280. Throughput: 0: 12859.2. Samples: 89150547. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:08:00,069][231894] Avg episode reward: [(0, '202.305')] [2023-03-07 18:08:00,409][232226] Updated weights for policy 0, policy_version 87100 (0.0007) [2023-03-07 18:08:01,214][232226] Updated weights for policy 0, policy_version 87110 (0.0006) [2023-03-07 18:08:01,998][232226] Updated weights for policy 0, policy_version 87120 (0.0007) [2023-03-07 18:08:02,794][232226] Updated weights for policy 0, policy_version 87130 (0.0007) [2023-03-07 18:08:03,590][232226] Updated weights for policy 0, policy_version 87140 (0.0006) [2023-03-07 18:08:04,399][232226] Updated weights for policy 0, policy_version 87150 (0.0006) [2023-03-07 18:08:05,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12851.2, 300 sec: 12874.6). Total num frames: 89249792. Throughput: 0: 12853.6. Samples: 89227691. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:08:05,069][231894] Avg episode reward: [(0, '193.192')] [2023-03-07 18:08:05,190][232226] Updated weights for policy 0, policy_version 87160 (0.0007) [2023-03-07 18:08:05,992][232226] Updated weights for policy 0, policy_version 87170 (0.0007) [2023-03-07 18:08:06,779][232226] Updated weights for policy 0, policy_version 87180 (0.0006) [2023-03-07 18:08:07,573][232226] Updated weights for policy 0, policy_version 87190 (0.0006) [2023-03-07 18:08:08,358][232226] Updated weights for policy 0, policy_version 87200 (0.0006) [2023-03-07 18:08:09,157][232226] Updated weights for policy 0, policy_version 87210 (0.0007) [2023-03-07 18:08:09,954][232226] Updated weights for policy 0, policy_version 87220 (0.0006) [2023-03-07 18:08:10,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 89314304. Throughput: 0: 12857.2. Samples: 89304976. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:08:10,069][231894] Avg episode reward: [(0, '194.594')] [2023-03-07 18:08:10,731][232226] Updated weights for policy 0, policy_version 87230 (0.0006) [2023-03-07 18:08:11,532][232226] Updated weights for policy 0, policy_version 87240 (0.0006) [2023-03-07 18:08:12,329][232226] Updated weights for policy 0, policy_version 87250 (0.0007) [2023-03-07 18:08:13,125][232226] Updated weights for policy 0, policy_version 87260 (0.0006) [2023-03-07 18:08:13,937][232226] Updated weights for policy 0, policy_version 87270 (0.0006) [2023-03-07 18:08:14,729][232226] Updated weights for policy 0, policy_version 87280 (0.0006) [2023-03-07 18:08:15,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 89378816. Throughput: 0: 12863.8. Samples: 89343781. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:08:15,069][231894] Avg episode reward: [(0, '195.228')] [2023-03-07 18:08:15,519][232226] Updated weights for policy 0, policy_version 87290 (0.0006) [2023-03-07 18:08:16,307][232226] Updated weights for policy 0, policy_version 87300 (0.0006) [2023-03-07 18:08:17,092][232226] Updated weights for policy 0, policy_version 87310 (0.0007) [2023-03-07 18:08:17,897][232226] Updated weights for policy 0, policy_version 87320 (0.0006) [2023-03-07 18:08:18,677][232226] Updated weights for policy 0, policy_version 87330 (0.0006) [2023-03-07 18:08:19,474][232226] Updated weights for policy 0, policy_version 87340 (0.0007) [2023-03-07 18:08:20,069][231894] Fps is (10 sec: 12902.2, 60 sec: 12868.2, 300 sec: 12874.6). Total num frames: 89443328. Throughput: 0: 12871.0. Samples: 89421060. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:08:20,070][231894] Avg episode reward: [(0, '199.027')] [2023-03-07 18:08:20,270][232226] Updated weights for policy 0, policy_version 87350 (0.0006) [2023-03-07 18:08:21,053][232226] Updated weights for policy 0, policy_version 87360 (0.0007) [2023-03-07 18:08:21,874][232226] Updated weights for policy 0, policy_version 87370 (0.0007) [2023-03-07 18:08:22,672][232226] Updated weights for policy 0, policy_version 87380 (0.0006) [2023-03-07 18:08:23,461][232226] Updated weights for policy 0, policy_version 87390 (0.0008) [2023-03-07 18:08:24,270][232226] Updated weights for policy 0, policy_version 87400 (0.0006) [2023-03-07 18:08:25,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12874.6). Total num frames: 89506816. Throughput: 0: 12872.3. Samples: 89498165. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:08:25,069][231894] Avg episode reward: [(0, '200.737')] [2023-03-07 18:08:25,072][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000087410_89507840.pth... [2023-03-07 18:08:25,073][232226] Updated weights for policy 0, policy_version 87410 (0.0007) [2023-03-07 18:08:25,104][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000084393_86418432.pth [2023-03-07 18:08:25,881][232226] Updated weights for policy 0, policy_version 87420 (0.0006) [2023-03-07 18:08:26,666][232226] Updated weights for policy 0, policy_version 87430 (0.0007) [2023-03-07 18:08:27,472][232226] Updated weights for policy 0, policy_version 87440 (0.0007) [2023-03-07 18:08:28,281][232226] Updated weights for policy 0, policy_version 87450 (0.0007) [2023-03-07 18:08:29,066][232226] Updated weights for policy 0, policy_version 87460 (0.0006) [2023-03-07 18:08:29,867][232226] Updated weights for policy 0, policy_version 87470 (0.0006) [2023-03-07 18:08:30,069][231894] Fps is (10 sec: 12800.2, 60 sec: 12851.2, 300 sec: 12871.2). Total num frames: 89571328. Throughput: 0: 12868.6. Samples: 89536566. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:08:30,069][231894] Avg episode reward: [(0, '194.335')] [2023-03-07 18:08:30,654][232226] Updated weights for policy 0, policy_version 87480 (0.0006) [2023-03-07 18:08:31,454][232226] Updated weights for policy 0, policy_version 87490 (0.0006) [2023-03-07 18:08:32,265][232226] Updated weights for policy 0, policy_version 87500 (0.0006) [2023-03-07 18:08:33,053][232226] Updated weights for policy 0, policy_version 87510 (0.0006) [2023-03-07 18:08:33,860][232226] Updated weights for policy 0, policy_version 87520 (0.0007) [2023-03-07 18:08:34,660][232226] Updated weights for policy 0, policy_version 87530 (0.0007) [2023-03-07 18:08:35,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 89635840. Throughput: 0: 12862.8. Samples: 89613593. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:08:35,069][231894] Avg episode reward: [(0, '197.821')] [2023-03-07 18:08:35,441][232226] Updated weights for policy 0, policy_version 87540 (0.0006) [2023-03-07 18:08:36,216][232226] Updated weights for policy 0, policy_version 87550 (0.0006) [2023-03-07 18:08:37,017][232226] Updated weights for policy 0, policy_version 87560 (0.0006) [2023-03-07 18:08:37,817][232226] Updated weights for policy 0, policy_version 87570 (0.0006) [2023-03-07 18:08:38,610][232226] Updated weights for policy 0, policy_version 87580 (0.0007) [2023-03-07 18:08:39,398][232226] Updated weights for policy 0, policy_version 87590 (0.0006) [2023-03-07 18:08:40,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 89700352. Throughput: 0: 12871.1. Samples: 89691040. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:08:40,069][231894] Avg episode reward: [(0, '194.238')] [2023-03-07 18:08:40,198][232226] Updated weights for policy 0, policy_version 87600 (0.0006) [2023-03-07 18:08:41,001][232226] Updated weights for policy 0, policy_version 87610 (0.0007) [2023-03-07 18:08:41,799][232226] Updated weights for policy 0, policy_version 87620 (0.0006) [2023-03-07 18:08:42,605][232226] Updated weights for policy 0, policy_version 87630 (0.0006) [2023-03-07 18:08:43,389][232226] Updated weights for policy 0, policy_version 87640 (0.0007) [2023-03-07 18:08:44,186][232226] Updated weights for policy 0, policy_version 87650 (0.0006) [2023-03-07 18:08:44,990][232226] Updated weights for policy 0, policy_version 87660 (0.0006) [2023-03-07 18:08:45,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12874.6). Total num frames: 89764864. Throughput: 0: 12864.8. Samples: 89729462. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:08:45,069][231894] Avg episode reward: [(0, '197.774')] [2023-03-07 18:08:45,777][232226] Updated weights for policy 0, policy_version 87670 (0.0007) [2023-03-07 18:08:46,567][232226] Updated weights for policy 0, policy_version 87680 (0.0006) [2023-03-07 18:08:47,391][232226] Updated weights for policy 0, policy_version 87690 (0.0007) [2023-03-07 18:08:48,190][232226] Updated weights for policy 0, policy_version 87700 (0.0006) [2023-03-07 18:08:48,986][232226] Updated weights for policy 0, policy_version 87710 (0.0006) [2023-03-07 18:08:49,795][232226] Updated weights for policy 0, policy_version 87720 (0.0006) [2023-03-07 18:08:50,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12871.2). Total num frames: 89828352. Throughput: 0: 12856.3. Samples: 89806223. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 18:08:50,069][231894] Avg episode reward: [(0, '196.057')] [2023-03-07 18:08:50,583][232226] Updated weights for policy 0, policy_version 87730 (0.0007) [2023-03-07 18:08:51,398][232226] Updated weights for policy 0, policy_version 87740 (0.0006) [2023-03-07 18:08:52,184][232226] Updated weights for policy 0, policy_version 87750 (0.0006) [2023-03-07 18:08:52,978][232226] Updated weights for policy 0, policy_version 87760 (0.0006) [2023-03-07 18:08:53,772][232226] Updated weights for policy 0, policy_version 87770 (0.0008) [2023-03-07 18:08:54,577][232226] Updated weights for policy 0, policy_version 87780 (0.0007) [2023-03-07 18:08:55,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 89892864. Throughput: 0: 12846.7. Samples: 89883076. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 18:08:55,069][231894] Avg episode reward: [(0, '195.289')] [2023-03-07 18:08:55,409][232226] Updated weights for policy 0, policy_version 87790 (0.0007) [2023-03-07 18:08:56,193][232226] Updated weights for policy 0, policy_version 87800 (0.0006) [2023-03-07 18:08:57,005][232226] Updated weights for policy 0, policy_version 87810 (0.0006) [2023-03-07 18:08:57,803][232226] Updated weights for policy 0, policy_version 87820 (0.0007) [2023-03-07 18:08:58,599][232226] Updated weights for policy 0, policy_version 87830 (0.0006) [2023-03-07 18:08:59,389][232226] Updated weights for policy 0, policy_version 87840 (0.0007) [2023-03-07 18:09:00,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12867.7). Total num frames: 89956352. Throughput: 0: 12837.8. Samples: 89921482. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 18:09:00,069][231894] Avg episode reward: [(0, '190.377')] [2023-03-07 18:09:00,176][232226] Updated weights for policy 0, policy_version 87850 (0.0006) [2023-03-07 18:09:01,005][232226] Updated weights for policy 0, policy_version 87860 (0.0008) [2023-03-07 18:09:01,792][232226] Updated weights for policy 0, policy_version 87870 (0.0007) [2023-03-07 18:09:02,578][232226] Updated weights for policy 0, policy_version 87880 (0.0006) [2023-03-07 18:09:03,389][232226] Updated weights for policy 0, policy_version 87890 (0.0006) [2023-03-07 18:09:04,173][232226] Updated weights for policy 0, policy_version 87900 (0.0006) [2023-03-07 18:09:04,971][232226] Updated weights for policy 0, policy_version 87910 (0.0006) [2023-03-07 18:09:05,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12851.2, 300 sec: 12867.7). Total num frames: 90020864. Throughput: 0: 12831.5. Samples: 89998479. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 18:09:05,070][231894] Avg episode reward: [(0, '199.710')] [2023-03-07 18:09:05,761][232226] Updated weights for policy 0, policy_version 87920 (0.0006) [2023-03-07 18:09:06,556][232226] Updated weights for policy 0, policy_version 87930 (0.0008) [2023-03-07 18:09:07,355][232226] Updated weights for policy 0, policy_version 87940 (0.0007) [2023-03-07 18:09:08,170][232226] Updated weights for policy 0, policy_version 87950 (0.0006) [2023-03-07 18:09:08,949][232226] Updated weights for policy 0, policy_version 87960 (0.0006) [2023-03-07 18:09:09,763][232226] Updated weights for policy 0, policy_version 87970 (0.0006) [2023-03-07 18:09:10,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12864.2). Total num frames: 90084352. Throughput: 0: 12833.4. Samples: 90075669. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 18:09:10,069][231894] Avg episode reward: [(0, '197.968')] [2023-03-07 18:09:10,550][232226] Updated weights for policy 0, policy_version 87980 (0.0006) [2023-03-07 18:09:11,343][232226] Updated weights for policy 0, policy_version 87990 (0.0006) [2023-03-07 18:09:12,134][232226] Updated weights for policy 0, policy_version 88000 (0.0006) [2023-03-07 18:09:12,949][232226] Updated weights for policy 0, policy_version 88010 (0.0006) [2023-03-07 18:09:13,737][232226] Updated weights for policy 0, policy_version 88020 (0.0005) [2023-03-07 18:09:14,542][232226] Updated weights for policy 0, policy_version 88030 (0.0006) [2023-03-07 18:09:15,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12834.1, 300 sec: 12864.2). Total num frames: 90148864. Throughput: 0: 12834.5. Samples: 90114118. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 18:09:15,069][231894] Avg episode reward: [(0, '190.854')] [2023-03-07 18:09:15,349][232226] Updated weights for policy 0, policy_version 88040 (0.0007) [2023-03-07 18:09:16,142][232226] Updated weights for policy 0, policy_version 88050 (0.0006) [2023-03-07 18:09:16,936][232226] Updated weights for policy 0, policy_version 88060 (0.0007) [2023-03-07 18:09:17,725][232226] Updated weights for policy 0, policy_version 88070 (0.0006) [2023-03-07 18:09:18,518][232226] Updated weights for policy 0, policy_version 88080 (0.0007) [2023-03-07 18:09:19,318][232226] Updated weights for policy 0, policy_version 88090 (0.0006) [2023-03-07 18:09:20,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12834.1, 300 sec: 12864.2). Total num frames: 90213376. Throughput: 0: 12834.5. Samples: 90191145. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 18:09:20,070][231894] Avg episode reward: [(0, '188.443')] [2023-03-07 18:09:20,092][232226] Updated weights for policy 0, policy_version 88100 (0.0007) [2023-03-07 18:09:20,902][232226] Updated weights for policy 0, policy_version 88110 (0.0006) [2023-03-07 18:09:21,702][232226] Updated weights for policy 0, policy_version 88120 (0.0006) [2023-03-07 18:09:22,491][232226] Updated weights for policy 0, policy_version 88130 (0.0007) [2023-03-07 18:09:23,280][232226] Updated weights for policy 0, policy_version 88140 (0.0006) [2023-03-07 18:09:24,066][232226] Updated weights for policy 0, policy_version 88150 (0.0006) [2023-03-07 18:09:24,874][232226] Updated weights for policy 0, policy_version 88160 (0.0007) [2023-03-07 18:09:25,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12864.2). Total num frames: 90277888. Throughput: 0: 12836.5. Samples: 90268681. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 18:09:25,069][231894] Avg episode reward: [(0, '198.900')] [2023-03-07 18:09:25,677][232226] Updated weights for policy 0, policy_version 88170 (0.0007) [2023-03-07 18:09:26,460][232226] Updated weights for policy 0, policy_version 88180 (0.0007) [2023-03-07 18:09:27,265][232226] Updated weights for policy 0, policy_version 88190 (0.0006) [2023-03-07 18:09:28,070][232226] Updated weights for policy 0, policy_version 88200 (0.0007) [2023-03-07 18:09:28,867][232226] Updated weights for policy 0, policy_version 88210 (0.0007) [2023-03-07 18:09:29,689][232226] Updated weights for policy 0, policy_version 88220 (0.0007) [2023-03-07 18:09:30,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12834.1, 300 sec: 12860.7). Total num frames: 90341376. Throughput: 0: 12838.3. Samples: 90307186. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 18:09:30,069][231894] Avg episode reward: [(0, '200.513')] [2023-03-07 18:09:30,474][232226] Updated weights for policy 0, policy_version 88230 (0.0006) [2023-03-07 18:09:31,271][232226] Updated weights for policy 0, policy_version 88240 (0.0007) [2023-03-07 18:09:32,060][232226] Updated weights for policy 0, policy_version 88250 (0.0006) [2023-03-07 18:09:32,856][232226] Updated weights for policy 0, policy_version 88260 (0.0007) [2023-03-07 18:09:33,643][232226] Updated weights for policy 0, policy_version 88270 (0.0006) [2023-03-07 18:09:34,454][232226] Updated weights for policy 0, policy_version 88280 (0.0006) [2023-03-07 18:09:35,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12834.1, 300 sec: 12864.2). Total num frames: 90405888. Throughput: 0: 12841.9. Samples: 90384109. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 18:09:35,070][231894] Avg episode reward: [(0, '189.794')] [2023-03-07 18:09:35,245][232226] Updated weights for policy 0, policy_version 88290 (0.0006) [2023-03-07 18:09:36,032][232226] Updated weights for policy 0, policy_version 88300 (0.0008) [2023-03-07 18:09:36,849][232226] Updated weights for policy 0, policy_version 88310 (0.0007) [2023-03-07 18:09:37,626][232226] Updated weights for policy 0, policy_version 88320 (0.0007) [2023-03-07 18:09:38,442][232226] Updated weights for policy 0, policy_version 88330 (0.0006) [2023-03-07 18:09:39,250][232226] Updated weights for policy 0, policy_version 88340 (0.0006) [2023-03-07 18:09:40,056][232226] Updated weights for policy 0, policy_version 88350 (0.0006) [2023-03-07 18:09:40,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12834.1, 300 sec: 12860.7). Total num frames: 90470400. Throughput: 0: 12843.7. Samples: 90461044. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 18:09:40,070][231894] Avg episode reward: [(0, '190.937')] [2023-03-07 18:09:40,853][232226] Updated weights for policy 0, policy_version 88360 (0.0007) [2023-03-07 18:09:41,636][232226] Updated weights for policy 0, policy_version 88370 (0.0006) [2023-03-07 18:09:42,445][232226] Updated weights for policy 0, policy_version 88380 (0.0006) [2023-03-07 18:09:43,235][232226] Updated weights for policy 0, policy_version 88390 (0.0006) [2023-03-07 18:09:44,038][232226] Updated weights for policy 0, policy_version 88400 (0.0006) [2023-03-07 18:09:44,838][232226] Updated weights for policy 0, policy_version 88410 (0.0006) [2023-03-07 18:09:45,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12857.3). Total num frames: 90533888. Throughput: 0: 12840.2. Samples: 90499293. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:09:45,069][231894] Avg episode reward: [(0, '192.695')] [2023-03-07 18:09:45,631][232226] Updated weights for policy 0, policy_version 88420 (0.0006) [2023-03-07 18:09:46,426][232226] Updated weights for policy 0, policy_version 88430 (0.0006) [2023-03-07 18:09:47,229][232226] Updated weights for policy 0, policy_version 88440 (0.0006) [2023-03-07 18:09:48,017][232226] Updated weights for policy 0, policy_version 88450 (0.0007) [2023-03-07 18:09:48,823][232226] Updated weights for policy 0, policy_version 88460 (0.0006) [2023-03-07 18:09:49,630][232226] Updated weights for policy 0, policy_version 88470 (0.0006) [2023-03-07 18:09:50,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12834.1, 300 sec: 12860.7). Total num frames: 90598400. Throughput: 0: 12842.3. Samples: 90576379. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:09:50,069][231894] Avg episode reward: [(0, '198.808')] [2023-03-07 18:09:50,418][232226] Updated weights for policy 0, policy_version 88480 (0.0005) [2023-03-07 18:09:51,226][232226] Updated weights for policy 0, policy_version 88490 (0.0007) [2023-03-07 18:09:52,028][232226] Updated weights for policy 0, policy_version 88500 (0.0007) [2023-03-07 18:09:52,817][232226] Updated weights for policy 0, policy_version 88510 (0.0007) [2023-03-07 18:09:53,604][232226] Updated weights for policy 0, policy_version 88520 (0.0006) [2023-03-07 18:09:54,400][232226] Updated weights for policy 0, policy_version 88530 (0.0006) [2023-03-07 18:09:55,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12834.2, 300 sec: 12860.7). Total num frames: 90662912. Throughput: 0: 12841.9. Samples: 90653552. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:09:55,069][231894] Avg episode reward: [(0, '192.344')] [2023-03-07 18:09:55,204][232226] Updated weights for policy 0, policy_version 88540 (0.0006) [2023-03-07 18:09:55,985][232226] Updated weights for policy 0, policy_version 88550 (0.0007) [2023-03-07 18:09:56,797][232226] Updated weights for policy 0, policy_version 88560 (0.0006) [2023-03-07 18:09:57,587][232226] Updated weights for policy 0, policy_version 88570 (0.0006) [2023-03-07 18:09:58,372][232226] Updated weights for policy 0, policy_version 88580 (0.0007) [2023-03-07 18:09:59,199][232226] Updated weights for policy 0, policy_version 88590 (0.0006) [2023-03-07 18:10:00,022][232226] Updated weights for policy 0, policy_version 88600 (0.0006) [2023-03-07 18:10:00,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12853.8). Total num frames: 90726400. Throughput: 0: 12839.9. Samples: 90691913. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:10:00,069][231894] Avg episode reward: [(0, '194.934')] [2023-03-07 18:10:00,845][232226] Updated weights for policy 0, policy_version 88610 (0.0005) [2023-03-07 18:10:01,689][232226] Updated weights for policy 0, policy_version 88620 (0.0006) [2023-03-07 18:10:02,509][232226] Updated weights for policy 0, policy_version 88630 (0.0007) [2023-03-07 18:10:03,351][232226] Updated weights for policy 0, policy_version 88640 (0.0007) [2023-03-07 18:10:04,165][232226] Updated weights for policy 0, policy_version 88650 (0.0006) [2023-03-07 18:10:04,975][232226] Updated weights for policy 0, policy_version 88660 (0.0006) [2023-03-07 18:10:05,069][231894] Fps is (10 sec: 12595.1, 60 sec: 12800.0, 300 sec: 12846.9). Total num frames: 90788864. Throughput: 0: 12793.5. Samples: 90766852. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:10:05,069][231894] Avg episode reward: [(0, '189.483')] [2023-03-07 18:10:05,758][232226] Updated weights for policy 0, policy_version 88670 (0.0006) [2023-03-07 18:10:06,550][232226] Updated weights for policy 0, policy_version 88680 (0.0006) [2023-03-07 18:10:07,342][232226] Updated weights for policy 0, policy_version 88690 (0.0006) [2023-03-07 18:10:08,138][232226] Updated weights for policy 0, policy_version 88700 (0.0007) [2023-03-07 18:10:08,925][232226] Updated weights for policy 0, policy_version 88710 (0.0006) [2023-03-07 18:10:09,729][232226] Updated weights for policy 0, policy_version 88720 (0.0007) [2023-03-07 18:10:10,069][231894] Fps is (10 sec: 12697.6, 60 sec: 12817.1, 300 sec: 12850.3). Total num frames: 90853376. Throughput: 0: 12779.7. Samples: 90843765. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:10:10,069][231894] Avg episode reward: [(0, '188.659')] [2023-03-07 18:10:10,553][232226] Updated weights for policy 0, policy_version 88730 (0.0008) [2023-03-07 18:10:11,381][232226] Updated weights for policy 0, policy_version 88740 (0.0006) [2023-03-07 18:10:12,210][232226] Updated weights for policy 0, policy_version 88750 (0.0006) [2023-03-07 18:10:13,027][232226] Updated weights for policy 0, policy_version 88760 (0.0007) [2023-03-07 18:10:13,871][232226] Updated weights for policy 0, policy_version 88770 (0.0006) [2023-03-07 18:10:14,725][232226] Updated weights for policy 0, policy_version 88780 (0.0006) [2023-03-07 18:10:15,069][231894] Fps is (10 sec: 12595.1, 60 sec: 12765.9, 300 sec: 12839.9). Total num frames: 90914816. Throughput: 0: 12757.4. Samples: 90881268. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:10:15,069][231894] Avg episode reward: [(0, '192.959')] [2023-03-07 18:10:15,582][232226] Updated weights for policy 0, policy_version 88790 (0.0006) [2023-03-07 18:10:16,438][232226] Updated weights for policy 0, policy_version 88800 (0.0006) [2023-03-07 18:10:17,297][232226] Updated weights for policy 0, policy_version 88810 (0.0006) [2023-03-07 18:10:18,143][232226] Updated weights for policy 0, policy_version 88820 (0.0006) [2023-03-07 18:10:18,987][232226] Updated weights for policy 0, policy_version 88830 (0.0007) [2023-03-07 18:10:19,857][232226] Updated weights for policy 0, policy_version 88840 (0.0006) [2023-03-07 18:10:20,069][231894] Fps is (10 sec: 12083.3, 60 sec: 12680.6, 300 sec: 12822.6). Total num frames: 90974208. Throughput: 0: 12658.6. Samples: 90953743. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:10:20,069][231894] Avg episode reward: [(0, '191.032')] [2023-03-07 18:10:20,703][232226] Updated weights for policy 0, policy_version 88850 (0.0006) [2023-03-07 18:10:21,558][232226] Updated weights for policy 0, policy_version 88860 (0.0006) [2023-03-07 18:10:22,387][232226] Updated weights for policy 0, policy_version 88870 (0.0006) [2023-03-07 18:10:23,243][232226] Updated weights for policy 0, policy_version 88880 (0.0006) [2023-03-07 18:10:24,078][232226] Updated weights for policy 0, policy_version 88890 (0.0006) [2023-03-07 18:10:24,926][232226] Updated weights for policy 0, policy_version 88900 (0.0007) [2023-03-07 18:10:25,069][231894] Fps is (10 sec: 11980.7, 60 sec: 12612.3, 300 sec: 12808.7). Total num frames: 91034624. Throughput: 0: 12560.6. Samples: 91026271. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:10:25,070][231894] Avg episode reward: [(0, '196.510')] [2023-03-07 18:10:25,089][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000088902_91035648.pth... [2023-03-07 18:10:25,119][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000085902_87963648.pth [2023-03-07 18:10:25,780][232226] Updated weights for policy 0, policy_version 88910 (0.0006) [2023-03-07 18:10:26,616][232226] Updated weights for policy 0, policy_version 88920 (0.0006) [2023-03-07 18:10:27,490][232226] Updated weights for policy 0, policy_version 88930 (0.0006) [2023-03-07 18:10:28,345][232226] Updated weights for policy 0, policy_version 88940 (0.0006) [2023-03-07 18:10:29,197][232226] Updated weights for policy 0, policy_version 88950 (0.0007) [2023-03-07 18:10:30,044][232226] Updated weights for policy 0, policy_version 88960 (0.0007) [2023-03-07 18:10:30,069][231894] Fps is (10 sec: 12083.0, 60 sec: 12561.1, 300 sec: 12794.8). Total num frames: 91095040. Throughput: 0: 12510.2. Samples: 91062253. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:10:30,070][231894] Avg episode reward: [(0, '193.826')] [2023-03-07 18:10:30,897][232226] Updated weights for policy 0, policy_version 88970 (0.0006) [2023-03-07 18:10:31,745][232226] Updated weights for policy 0, policy_version 88980 (0.0006) [2023-03-07 18:10:32,586][232226] Updated weights for policy 0, policy_version 88990 (0.0006) [2023-03-07 18:10:33,431][232226] Updated weights for policy 0, policy_version 89000 (0.0007) [2023-03-07 18:10:34,269][232226] Updated weights for policy 0, policy_version 89010 (0.0006) [2023-03-07 18:10:35,069][231894] Fps is (10 sec: 12083.2, 60 sec: 12492.8, 300 sec: 12780.9). Total num frames: 91155456. Throughput: 0: 12410.8. Samples: 91134868. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:10:35,080][231894] Avg episode reward: [(0, '197.903')] [2023-03-07 18:10:35,094][232226] Updated weights for policy 0, policy_version 89020 (0.0007) [2023-03-07 18:10:35,949][232226] Updated weights for policy 0, policy_version 89030 (0.0007) [2023-03-07 18:10:36,813][232226] Updated weights for policy 0, policy_version 89040 (0.0006) [2023-03-07 18:10:37,614][232226] Updated weights for policy 0, policy_version 89050 (0.0006) [2023-03-07 18:10:38,405][232226] Updated weights for policy 0, policy_version 89060 (0.0006) [2023-03-07 18:10:39,235][232226] Updated weights for policy 0, policy_version 89070 (0.0006) [2023-03-07 18:10:40,048][232226] Updated weights for policy 0, policy_version 89080 (0.0007) [2023-03-07 18:10:40,069][231894] Fps is (10 sec: 12288.1, 60 sec: 12458.7, 300 sec: 12774.0). Total num frames: 91217920. Throughput: 0: 12340.5. Samples: 91208875. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:10:40,069][231894] Avg episode reward: [(0, '201.321')] [2023-03-07 18:10:40,835][232226] Updated weights for policy 0, policy_version 89090 (0.0006) [2023-03-07 18:10:41,639][232226] Updated weights for policy 0, policy_version 89100 (0.0006) [2023-03-07 18:10:42,449][232226] Updated weights for policy 0, policy_version 89110 (0.0007) [2023-03-07 18:10:43,253][232226] Updated weights for policy 0, policy_version 89120 (0.0007) [2023-03-07 18:10:44,086][232226] Updated weights for policy 0, policy_version 89130 (0.0007) [2023-03-07 18:10:44,883][232226] Updated weights for policy 0, policy_version 89140 (0.0006) [2023-03-07 18:10:45,069][231894] Fps is (10 sec: 12595.3, 60 sec: 12458.7, 300 sec: 12770.5). Total num frames: 91281408. Throughput: 0: 12330.6. Samples: 91246788. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:10:45,069][231894] Avg episode reward: [(0, '192.691')] [2023-03-07 18:10:45,665][232226] Updated weights for policy 0, policy_version 89150 (0.0006) [2023-03-07 18:10:46,538][232226] Updated weights for policy 0, policy_version 89160 (0.0007) [2023-03-07 18:10:47,459][232226] Updated weights for policy 0, policy_version 89170 (0.0006) [2023-03-07 18:10:48,363][232226] Updated weights for policy 0, policy_version 89180 (0.0007) [2023-03-07 18:10:49,250][232226] Updated weights for policy 0, policy_version 89190 (0.0007) [2023-03-07 18:10:50,069][231894] Fps is (10 sec: 12185.5, 60 sec: 12356.2, 300 sec: 12749.7). Total num frames: 91339776. Throughput: 0: 12291.3. Samples: 91319961. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:10:50,070][231894] Avg episode reward: [(0, '194.597')] [2023-03-07 18:10:50,159][232226] Updated weights for policy 0, policy_version 89200 (0.0007) [2023-03-07 18:10:50,955][232226] Updated weights for policy 0, policy_version 89210 (0.0007) [2023-03-07 18:10:51,757][232226] Updated weights for policy 0, policy_version 89220 (0.0006) [2023-03-07 18:10:52,522][232226] Updated weights for policy 0, policy_version 89230 (0.0007) [2023-03-07 18:10:53,302][232226] Updated weights for policy 0, policy_version 89240 (0.0007) [2023-03-07 18:10:54,081][232226] Updated weights for policy 0, policy_version 89250 (0.0006) [2023-03-07 18:10:54,881][232226] Updated weights for policy 0, policy_version 89260 (0.0006) [2023-03-07 18:10:55,069][231894] Fps is (10 sec: 12287.8, 60 sec: 12356.2, 300 sec: 12749.7). Total num frames: 91404288. Throughput: 0: 12244.7. Samples: 91394778. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:10:55,070][231894] Avg episode reward: [(0, '194.481')] [2023-03-07 18:10:55,642][232226] Updated weights for policy 0, policy_version 89270 (0.0007) [2023-03-07 18:10:56,439][232226] Updated weights for policy 0, policy_version 89280 (0.0006) [2023-03-07 18:10:57,229][232226] Updated weights for policy 0, policy_version 89290 (0.0006) [2023-03-07 18:10:58,001][232226] Updated weights for policy 0, policy_version 89300 (0.0006) [2023-03-07 18:10:58,798][232226] Updated weights for policy 0, policy_version 89310 (0.0007) [2023-03-07 18:10:59,606][232226] Updated weights for policy 0, policy_version 89320 (0.0007) [2023-03-07 18:11:00,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12373.3, 300 sec: 12753.1). Total num frames: 91468800. Throughput: 0: 12281.8. Samples: 91433948. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:11:00,069][231894] Avg episode reward: [(0, '190.643')] [2023-03-07 18:11:00,401][232226] Updated weights for policy 0, policy_version 89330 (0.0006) [2023-03-07 18:11:01,207][232226] Updated weights for policy 0, policy_version 89340 (0.0006) [2023-03-07 18:11:02,006][232226] Updated weights for policy 0, policy_version 89350 (0.0007) [2023-03-07 18:11:02,797][232226] Updated weights for policy 0, policy_version 89360 (0.0006) [2023-03-07 18:11:03,598][232226] Updated weights for policy 0, policy_version 89370 (0.0006) [2023-03-07 18:11:04,396][232226] Updated weights for policy 0, policy_version 89380 (0.0006) [2023-03-07 18:11:05,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12407.5, 300 sec: 12753.1). Total num frames: 91533312. Throughput: 0: 12388.5. Samples: 91511225. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:11:05,069][231894] Avg episode reward: [(0, '191.410')] [2023-03-07 18:11:05,171][232226] Updated weights for policy 0, policy_version 89390 (0.0006) [2023-03-07 18:11:05,983][232226] Updated weights for policy 0, policy_version 89400 (0.0006) [2023-03-07 18:11:06,773][232226] Updated weights for policy 0, policy_version 89410 (0.0006) [2023-03-07 18:11:07,566][232226] Updated weights for policy 0, policy_version 89420 (0.0006) [2023-03-07 18:11:08,355][232226] Updated weights for policy 0, policy_version 89430 (0.0006) [2023-03-07 18:11:09,154][232226] Updated weights for policy 0, policy_version 89440 (0.0007) [2023-03-07 18:11:09,947][232226] Updated weights for policy 0, policy_version 89450 (0.0006) [2023-03-07 18:11:10,069][231894] Fps is (10 sec: 12902.2, 60 sec: 12407.4, 300 sec: 12753.1). Total num frames: 91597824. Throughput: 0: 12496.1. Samples: 91588595. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:11:10,069][231894] Avg episode reward: [(0, '191.067')] [2023-03-07 18:11:10,742][232226] Updated weights for policy 0, policy_version 89460 (0.0007) [2023-03-07 18:11:11,538][232226] Updated weights for policy 0, policy_version 89470 (0.0006) [2023-03-07 18:11:12,334][232226] Updated weights for policy 0, policy_version 89480 (0.0006) [2023-03-07 18:11:13,114][232226] Updated weights for policy 0, policy_version 89490 (0.0006) [2023-03-07 18:11:13,907][232226] Updated weights for policy 0, policy_version 89500 (0.0007) [2023-03-07 18:11:14,697][232226] Updated weights for policy 0, policy_version 89510 (0.0007) [2023-03-07 18:11:15,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12458.7, 300 sec: 12756.6). Total num frames: 91662336. Throughput: 0: 12552.7. Samples: 91627121. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:11:15,069][231894] Avg episode reward: [(0, '196.558')] [2023-03-07 18:11:15,489][232226] Updated weights for policy 0, policy_version 89520 (0.0006) [2023-03-07 18:11:16,304][232226] Updated weights for policy 0, policy_version 89530 (0.0006) [2023-03-07 18:11:17,094][232226] Updated weights for policy 0, policy_version 89540 (0.0007) [2023-03-07 18:11:17,871][232226] Updated weights for policy 0, policy_version 89550 (0.0006) [2023-03-07 18:11:18,680][232226] Updated weights for policy 0, policy_version 89560 (0.0006) [2023-03-07 18:11:19,469][232226] Updated weights for policy 0, policy_version 89570 (0.0006) [2023-03-07 18:11:20,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12544.0, 300 sec: 12756.6). Total num frames: 91726848. Throughput: 0: 12664.6. Samples: 91704773. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:11:20,080][231894] Avg episode reward: [(0, '188.978')] [2023-03-07 18:11:20,258][232226] Updated weights for policy 0, policy_version 89580 (0.0006) [2023-03-07 18:11:21,050][232226] Updated weights for policy 0, policy_version 89590 (0.0006) [2023-03-07 18:11:21,850][232226] Updated weights for policy 0, policy_version 89600 (0.0007) [2023-03-07 18:11:22,632][232226] Updated weights for policy 0, policy_version 89610 (0.0006) [2023-03-07 18:11:23,439][232226] Updated weights for policy 0, policy_version 89620 (0.0007) [2023-03-07 18:11:24,230][232226] Updated weights for policy 0, policy_version 89630 (0.0006) [2023-03-07 18:11:25,029][232226] Updated weights for policy 0, policy_version 89640 (0.0006) [2023-03-07 18:11:25,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12612.3, 300 sec: 12756.6). Total num frames: 91791360. Throughput: 0: 12740.1. Samples: 91782177. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:11:25,080][231894] Avg episode reward: [(0, '194.556')] [2023-03-07 18:11:25,828][232226] Updated weights for policy 0, policy_version 89650 (0.0009) [2023-03-07 18:11:26,617][232226] Updated weights for policy 0, policy_version 89660 (0.0006) [2023-03-07 18:11:27,400][232226] Updated weights for policy 0, policy_version 89670 (0.0006) [2023-03-07 18:11:28,194][232226] Updated weights for policy 0, policy_version 89680 (0.0006) [2023-03-07 18:11:28,979][232226] Updated weights for policy 0, policy_version 89690 (0.0006) [2023-03-07 18:11:29,790][232226] Updated weights for policy 0, policy_version 89700 (0.0006) [2023-03-07 18:11:30,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12680.6, 300 sec: 12760.1). Total num frames: 91855872. Throughput: 0: 12756.0. Samples: 91820807. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:11:30,080][231894] Avg episode reward: [(0, '191.647')] [2023-03-07 18:11:30,581][232226] Updated weights for policy 0, policy_version 89710 (0.0006) [2023-03-07 18:11:31,374][232226] Updated weights for policy 0, policy_version 89720 (0.0007) [2023-03-07 18:11:32,177][232226] Updated weights for policy 0, policy_version 89730 (0.0006) [2023-03-07 18:11:32,977][232226] Updated weights for policy 0, policy_version 89740 (0.0007) [2023-03-07 18:11:33,769][232226] Updated weights for policy 0, policy_version 89750 (0.0006) [2023-03-07 18:11:34,568][232226] Updated weights for policy 0, policy_version 89760 (0.0006) [2023-03-07 18:11:35,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12748.8, 300 sec: 12760.1). Total num frames: 91920384. Throughput: 0: 12846.9. Samples: 91898070. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:11:35,080][231894] Avg episode reward: [(0, '196.001')] [2023-03-07 18:11:35,346][232226] Updated weights for policy 0, policy_version 89770 (0.0006) [2023-03-07 18:11:36,139][232226] Updated weights for policy 0, policy_version 89780 (0.0006) [2023-03-07 18:11:36,932][232226] Updated weights for policy 0, policy_version 89790 (0.0008) [2023-03-07 18:11:37,726][232226] Updated weights for policy 0, policy_version 89800 (0.0006) [2023-03-07 18:11:38,524][232226] Updated weights for policy 0, policy_version 89810 (0.0006) [2023-03-07 18:11:39,326][232226] Updated weights for policy 0, policy_version 89820 (0.0006) [2023-03-07 18:11:40,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12782.9, 300 sec: 12760.1). Total num frames: 91984896. Throughput: 0: 12906.6. Samples: 91975573. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:11:40,080][231894] Avg episode reward: [(0, '196.715')] [2023-03-07 18:11:40,122][232226] Updated weights for policy 0, policy_version 89830 (0.0006) [2023-03-07 18:11:40,921][232226] Updated weights for policy 0, policy_version 89840 (0.0007) [2023-03-07 18:11:41,763][232226] Updated weights for policy 0, policy_version 89850 (0.0006) [2023-03-07 18:11:42,565][232226] Updated weights for policy 0, policy_version 89860 (0.0006) [2023-03-07 18:11:43,406][232226] Updated weights for policy 0, policy_version 89870 (0.0006) [2023-03-07 18:11:44,245][232226] Updated weights for policy 0, policy_version 89880 (0.0006) [2023-03-07 18:11:45,069][231894] Fps is (10 sec: 12595.3, 60 sec: 12748.8, 300 sec: 12749.7). Total num frames: 92046336. Throughput: 0: 12874.0. Samples: 92013280. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:11:45,080][231894] Avg episode reward: [(0, '201.288')] [2023-03-07 18:11:45,092][232226] Updated weights for policy 0, policy_version 89890 (0.0006) [2023-03-07 18:11:45,971][232226] Updated weights for policy 0, policy_version 89900 (0.0007) [2023-03-07 18:11:46,824][232226] Updated weights for policy 0, policy_version 89910 (0.0007) [2023-03-07 18:11:47,679][232226] Updated weights for policy 0, policy_version 89920 (0.0006) [2023-03-07 18:11:48,540][232226] Updated weights for policy 0, policy_version 89930 (0.0007) [2023-03-07 18:11:49,386][232226] Updated weights for policy 0, policy_version 89940 (0.0006) [2023-03-07 18:11:50,069][231894] Fps is (10 sec: 12185.5, 60 sec: 12782.9, 300 sec: 12735.8). Total num frames: 92106752. Throughput: 0: 12770.6. Samples: 92085904. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:11:50,080][231894] Avg episode reward: [(0, '189.568')] [2023-03-07 18:11:50,222][232226] Updated weights for policy 0, policy_version 89950 (0.0007) [2023-03-07 18:11:51,065][232226] Updated weights for policy 0, policy_version 89960 (0.0006) [2023-03-07 18:11:51,888][232226] Updated weights for policy 0, policy_version 89970 (0.0007) [2023-03-07 18:11:52,757][232226] Updated weights for policy 0, policy_version 89980 (0.0007) [2023-03-07 18:11:53,569][232226] Updated weights for policy 0, policy_version 89990 (0.0007) [2023-03-07 18:11:54,417][232226] Updated weights for policy 0, policy_version 90000 (0.0006) [2023-03-07 18:11:55,069][231894] Fps is (10 sec: 12083.2, 60 sec: 12714.7, 300 sec: 12721.9). Total num frames: 92167168. Throughput: 0: 12671.9. Samples: 92158830. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:11:55,080][231894] Avg episode reward: [(0, '188.547')] [2023-03-07 18:11:55,261][232226] Updated weights for policy 0, policy_version 90010 (0.0006) [2023-03-07 18:11:56,116][232226] Updated weights for policy 0, policy_version 90020 (0.0007) [2023-03-07 18:11:56,980][232226] Updated weights for policy 0, policy_version 90030 (0.0006) [2023-03-07 18:11:57,819][232226] Updated weights for policy 0, policy_version 90040 (0.0006) [2023-03-07 18:11:58,642][232226] Updated weights for policy 0, policy_version 90050 (0.0006) [2023-03-07 18:11:59,521][232226] Updated weights for policy 0, policy_version 90060 (0.0006) [2023-03-07 18:12:00,069][231894] Fps is (10 sec: 12083.2, 60 sec: 12646.4, 300 sec: 12708.0). Total num frames: 92227584. Throughput: 0: 12615.8. Samples: 92194833. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:12:00,080][231894] Avg episode reward: [(0, '192.037')] [2023-03-07 18:12:00,396][232226] Updated weights for policy 0, policy_version 90070 (0.0006) [2023-03-07 18:12:01,203][232226] Updated weights for policy 0, policy_version 90080 (0.0006) [2023-03-07 18:12:02,074][232226] Updated weights for policy 0, policy_version 90090 (0.0006) [2023-03-07 18:12:02,934][232226] Updated weights for policy 0, policy_version 90100 (0.0007) [2023-03-07 18:12:03,771][232226] Updated weights for policy 0, policy_version 90110 (0.0007) [2023-03-07 18:12:04,613][232226] Updated weights for policy 0, policy_version 90120 (0.0006) [2023-03-07 18:12:05,069][231894] Fps is (10 sec: 12083.3, 60 sec: 12578.1, 300 sec: 12697.6). Total num frames: 92288000. Throughput: 0: 12498.2. Samples: 92267192. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:12:05,080][231894] Avg episode reward: [(0, '180.421')] [2023-03-07 18:12:05,445][232226] Updated weights for policy 0, policy_version 90130 (0.0006) [2023-03-07 18:12:06,300][232226] Updated weights for policy 0, policy_version 90140 (0.0006) [2023-03-07 18:12:07,138][232226] Updated weights for policy 0, policy_version 90150 (0.0009) [2023-03-07 18:12:07,965][232226] Updated weights for policy 0, policy_version 90160 (0.0007) [2023-03-07 18:12:08,765][232226] Updated weights for policy 0, policy_version 90170 (0.0007) [2023-03-07 18:12:09,555][232226] Updated weights for policy 0, policy_version 90180 (0.0006) [2023-03-07 18:12:10,069][231894] Fps is (10 sec: 12288.1, 60 sec: 12544.0, 300 sec: 12690.7). Total num frames: 92350464. Throughput: 0: 12421.1. Samples: 92341128. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:12:10,080][231894] Avg episode reward: [(0, '185.585')] [2023-03-07 18:12:10,378][232226] Updated weights for policy 0, policy_version 90190 (0.0007) [2023-03-07 18:12:11,162][232226] Updated weights for policy 0, policy_version 90200 (0.0006) [2023-03-07 18:12:11,969][232226] Updated weights for policy 0, policy_version 90210 (0.0006) [2023-03-07 18:12:12,766][232226] Updated weights for policy 0, policy_version 90220 (0.0008) [2023-03-07 18:12:13,582][232226] Updated weights for policy 0, policy_version 90230 (0.0007) [2023-03-07 18:12:14,384][232226] Updated weights for policy 0, policy_version 90240 (0.0007) [2023-03-07 18:12:15,069][231894] Fps is (10 sec: 12595.1, 60 sec: 12526.9, 300 sec: 12687.2). Total num frames: 92413952. Throughput: 0: 12413.7. Samples: 92379426. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:12:15,080][231894] Avg episode reward: [(0, '197.913')] [2023-03-07 18:12:15,183][232226] Updated weights for policy 0, policy_version 90250 (0.0006) [2023-03-07 18:12:15,990][232226] Updated weights for policy 0, policy_version 90260 (0.0007) [2023-03-07 18:12:16,834][232226] Updated weights for policy 0, policy_version 90270 (0.0006) [2023-03-07 18:12:17,746][232226] Updated weights for policy 0, policy_version 90280 (0.0007) [2023-03-07 18:12:18,645][232226] Updated weights for policy 0, policy_version 90290 (0.0007) [2023-03-07 18:12:19,533][232226] Updated weights for policy 0, policy_version 90300 (0.0007) [2023-03-07 18:12:20,069][231894] Fps is (10 sec: 12185.4, 60 sec: 12424.5, 300 sec: 12666.4). Total num frames: 92472320. Throughput: 0: 12339.5. Samples: 92453350. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:12:20,081][231894] Avg episode reward: [(0, '196.636')] [2023-03-07 18:12:20,445][232226] Updated weights for policy 0, policy_version 90310 (0.0007) [2023-03-07 18:12:21,265][232226] Updated weights for policy 0, policy_version 90320 (0.0007) [2023-03-07 18:12:22,045][232226] Updated weights for policy 0, policy_version 90330 (0.0006) [2023-03-07 18:12:22,837][232226] Updated weights for policy 0, policy_version 90340 (0.0006) [2023-03-07 18:12:23,604][232226] Updated weights for policy 0, policy_version 90350 (0.0006) [2023-03-07 18:12:24,408][232226] Updated weights for policy 0, policy_version 90360 (0.0007) [2023-03-07 18:12:25,069][231894] Fps is (10 sec: 12288.0, 60 sec: 12424.5, 300 sec: 12666.4). Total num frames: 92536832. Throughput: 0: 12263.2. Samples: 92527418. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:12:25,080][231894] Avg episode reward: [(0, '195.045')] [2023-03-07 18:12:25,085][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000090368_92536832.pth... [2023-03-07 18:12:25,117][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000087410_89507840.pth [2023-03-07 18:12:25,212][232226] Updated weights for policy 0, policy_version 90370 (0.0006) [2023-03-07 18:12:25,983][232226] Updated weights for policy 0, policy_version 90380 (0.0006) [2023-03-07 18:12:26,777][232226] Updated weights for policy 0, policy_version 90390 (0.0007) [2023-03-07 18:12:27,577][232226] Updated weights for policy 0, policy_version 90400 (0.0007) [2023-03-07 18:12:28,383][232226] Updated weights for policy 0, policy_version 90410 (0.0007) [2023-03-07 18:12:29,172][232226] Updated weights for policy 0, policy_version 90420 (0.0007) [2023-03-07 18:12:29,964][232226] Updated weights for policy 0, policy_version 90430 (0.0007) [2023-03-07 18:12:30,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12424.5, 300 sec: 12669.8). Total num frames: 92601344. Throughput: 0: 12289.9. Samples: 92566327. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:12:30,069][231894] Avg episode reward: [(0, '193.408')] [2023-03-07 18:12:30,758][232226] Updated weights for policy 0, policy_version 90440 (0.0007) [2023-03-07 18:12:31,544][232226] Updated weights for policy 0, policy_version 90450 (0.0006) [2023-03-07 18:12:32,338][232226] Updated weights for policy 0, policy_version 90460 (0.0007) [2023-03-07 18:12:33,125][232226] Updated weights for policy 0, policy_version 90470 (0.0006) [2023-03-07 18:12:33,927][232226] Updated weights for policy 0, policy_version 90480 (0.0006) [2023-03-07 18:12:34,710][232226] Updated weights for policy 0, policy_version 90490 (0.0007) [2023-03-07 18:12:35,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12424.5, 300 sec: 12669.8). Total num frames: 92665856. Throughput: 0: 12391.5. Samples: 92643521. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:12:35,070][231894] Avg episode reward: [(0, '198.033')] [2023-03-07 18:12:35,529][232226] Updated weights for policy 0, policy_version 90500 (0.0006) [2023-03-07 18:12:36,325][232226] Updated weights for policy 0, policy_version 90510 (0.0006) [2023-03-07 18:12:37,118][232226] Updated weights for policy 0, policy_version 90520 (0.0007) [2023-03-07 18:12:37,916][232226] Updated weights for policy 0, policy_version 90530 (0.0006) [2023-03-07 18:12:38,698][232226] Updated weights for policy 0, policy_version 90540 (0.0007) [2023-03-07 18:12:39,503][232226] Updated weights for policy 0, policy_version 90550 (0.0007) [2023-03-07 18:12:40,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12424.5, 300 sec: 12669.8). Total num frames: 92730368. Throughput: 0: 12486.7. Samples: 92720733. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:12:40,070][231894] Avg episode reward: [(0, '195.394')] [2023-03-07 18:12:40,300][232226] Updated weights for policy 0, policy_version 90560 (0.0007) [2023-03-07 18:12:41,086][232226] Updated weights for policy 0, policy_version 90570 (0.0007) [2023-03-07 18:12:41,878][232226] Updated weights for policy 0, policy_version 90580 (0.0006) [2023-03-07 18:12:42,689][232226] Updated weights for policy 0, policy_version 90590 (0.0006) [2023-03-07 18:12:43,470][232226] Updated weights for policy 0, policy_version 90600 (0.0006) [2023-03-07 18:12:44,280][232226] Updated weights for policy 0, policy_version 90610 (0.0006) [2023-03-07 18:12:45,062][232226] Updated weights for policy 0, policy_version 90620 (0.0007) [2023-03-07 18:12:45,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12475.7, 300 sec: 12669.8). Total num frames: 92794880. Throughput: 0: 12545.0. Samples: 92759356. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:12:45,069][231894] Avg episode reward: [(0, '190.879')] [2023-03-07 18:12:45,858][232226] Updated weights for policy 0, policy_version 90630 (0.0006) [2023-03-07 18:12:46,657][232226] Updated weights for policy 0, policy_version 90640 (0.0006) [2023-03-07 18:12:47,445][232226] Updated weights for policy 0, policy_version 90650 (0.0006) [2023-03-07 18:12:48,265][232226] Updated weights for policy 0, policy_version 90660 (0.0006) [2023-03-07 18:12:49,055][232226] Updated weights for policy 0, policy_version 90670 (0.0006) [2023-03-07 18:12:49,850][232226] Updated weights for policy 0, policy_version 90680 (0.0006) [2023-03-07 18:12:50,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12526.9, 300 sec: 12669.8). Total num frames: 92858368. Throughput: 0: 12651.9. Samples: 92836527. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:12:50,069][231894] Avg episode reward: [(0, '192.142')] [2023-03-07 18:12:50,648][232226] Updated weights for policy 0, policy_version 90690 (0.0006) [2023-03-07 18:12:51,434][232226] Updated weights for policy 0, policy_version 90700 (0.0007) [2023-03-07 18:12:52,222][232226] Updated weights for policy 0, policy_version 90710 (0.0006) [2023-03-07 18:12:53,040][232226] Updated weights for policy 0, policy_version 90720 (0.0007) [2023-03-07 18:12:53,842][232226] Updated weights for policy 0, policy_version 90730 (0.0006) [2023-03-07 18:12:54,630][232226] Updated weights for policy 0, policy_version 90740 (0.0007) [2023-03-07 18:12:55,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12595.2, 300 sec: 12669.8). Total num frames: 92922880. Throughput: 0: 12719.3. Samples: 92913495. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:12:55,069][231894] Avg episode reward: [(0, '197.342')] [2023-03-07 18:12:55,447][232226] Updated weights for policy 0, policy_version 90750 (0.0006) [2023-03-07 18:12:56,246][232226] Updated weights for policy 0, policy_version 90760 (0.0006) [2023-03-07 18:12:57,028][232226] Updated weights for policy 0, policy_version 90770 (0.0007) [2023-03-07 18:12:57,850][232226] Updated weights for policy 0, policy_version 90780 (0.0006) [2023-03-07 18:12:58,643][232226] Updated weights for policy 0, policy_version 90790 (0.0007) [2023-03-07 18:12:59,456][232226] Updated weights for policy 0, policy_version 90800 (0.0007) [2023-03-07 18:13:00,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12646.4, 300 sec: 12666.4). Total num frames: 92986368. Throughput: 0: 12720.4. Samples: 92951845. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:13:00,070][231894] Avg episode reward: [(0, '200.028')] [2023-03-07 18:13:00,245][232226] Updated weights for policy 0, policy_version 90810 (0.0006) [2023-03-07 18:13:01,044][232226] Updated weights for policy 0, policy_version 90820 (0.0006) [2023-03-07 18:13:01,853][232226] Updated weights for policy 0, policy_version 90830 (0.0006) [2023-03-07 18:13:02,645][232226] Updated weights for policy 0, policy_version 90840 (0.0007) [2023-03-07 18:13:03,425][232226] Updated weights for policy 0, policy_version 90850 (0.0006) [2023-03-07 18:13:04,233][232226] Updated weights for policy 0, policy_version 90860 (0.0006) [2023-03-07 18:13:05,040][232226] Updated weights for policy 0, policy_version 90870 (0.0007) [2023-03-07 18:13:05,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12714.7, 300 sec: 12666.4). Total num frames: 93050880. Throughput: 0: 12786.5. Samples: 93028741. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:13:05,069][231894] Avg episode reward: [(0, '191.674')] [2023-03-07 18:13:05,847][232226] Updated weights for policy 0, policy_version 90880 (0.0007) [2023-03-07 18:13:06,665][232226] Updated weights for policy 0, policy_version 90890 (0.0006) [2023-03-07 18:13:07,451][232226] Updated weights for policy 0, policy_version 90900 (0.0007) [2023-03-07 18:13:08,242][232226] Updated weights for policy 0, policy_version 90910 (0.0006) [2023-03-07 18:13:09,029][232226] Updated weights for policy 0, policy_version 90920 (0.0006) [2023-03-07 18:13:09,835][232226] Updated weights for policy 0, policy_version 90930 (0.0006) [2023-03-07 18:13:10,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12748.8, 300 sec: 12666.4). Total num frames: 93115392. Throughput: 0: 12848.5. Samples: 93105600. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:13:10,069][231894] Avg episode reward: [(0, '197.846')] [2023-03-07 18:13:10,633][232226] Updated weights for policy 0, policy_version 90940 (0.0006) [2023-03-07 18:13:11,426][232226] Updated weights for policy 0, policy_version 90950 (0.0006) [2023-03-07 18:13:12,203][232226] Updated weights for policy 0, policy_version 90960 (0.0006) [2023-03-07 18:13:13,006][232226] Updated weights for policy 0, policy_version 90970 (0.0006) [2023-03-07 18:13:13,799][232226] Updated weights for policy 0, policy_version 90980 (0.0006) [2023-03-07 18:13:14,601][232226] Updated weights for policy 0, policy_version 90990 (0.0007) [2023-03-07 18:13:15,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12748.8, 300 sec: 12662.9). Total num frames: 93178880. Throughput: 0: 12845.4. Samples: 93144369. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:13:15,069][231894] Avg episode reward: [(0, '197.967')] [2023-03-07 18:13:15,390][232226] Updated weights for policy 0, policy_version 91000 (0.0006) [2023-03-07 18:13:16,190][232226] Updated weights for policy 0, policy_version 91010 (0.0006) [2023-03-07 18:13:16,982][232226] Updated weights for policy 0, policy_version 91020 (0.0007) [2023-03-07 18:13:17,791][232226] Updated weights for policy 0, policy_version 91030 (0.0006) [2023-03-07 18:13:18,574][232226] Updated weights for policy 0, policy_version 91040 (0.0006) [2023-03-07 18:13:19,379][232226] Updated weights for policy 0, policy_version 91050 (0.0006) [2023-03-07 18:13:20,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12666.4). Total num frames: 93243392. Throughput: 0: 12840.4. Samples: 93221337. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 18:13:20,069][231894] Avg episode reward: [(0, '193.698')] [2023-03-07 18:13:20,188][232226] Updated weights for policy 0, policy_version 91060 (0.0007) [2023-03-07 18:13:20,974][232226] Updated weights for policy 0, policy_version 91070 (0.0006) [2023-03-07 18:13:21,788][232226] Updated weights for policy 0, policy_version 91080 (0.0006) [2023-03-07 18:13:22,596][232226] Updated weights for policy 0, policy_version 91090 (0.0006) [2023-03-07 18:13:23,398][232226] Updated weights for policy 0, policy_version 91100 (0.0007) [2023-03-07 18:13:24,198][232226] Updated weights for policy 0, policy_version 91110 (0.0006) [2023-03-07 18:13:25,002][232226] Updated weights for policy 0, policy_version 91120 (0.0006) [2023-03-07 18:13:25,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12851.2, 300 sec: 12666.4). Total num frames: 93307904. Throughput: 0: 12831.4. Samples: 93298145. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 18:13:25,069][231894] Avg episode reward: [(0, '194.351')] [2023-03-07 18:13:25,792][232226] Updated weights for policy 0, policy_version 91130 (0.0006) [2023-03-07 18:13:26,598][232226] Updated weights for policy 0, policy_version 91140 (0.0007) [2023-03-07 18:13:27,408][232226] Updated weights for policy 0, policy_version 91150 (0.0006) [2023-03-07 18:13:28,196][232226] Updated weights for policy 0, policy_version 91160 (0.0006) [2023-03-07 18:13:28,981][232226] Updated weights for policy 0, policy_version 91170 (0.0006) [2023-03-07 18:13:29,786][232226] Updated weights for policy 0, policy_version 91180 (0.0007) [2023-03-07 18:13:30,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12834.2, 300 sec: 12662.9). Total num frames: 93371392. Throughput: 0: 12827.2. Samples: 93336579. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 18:13:30,069][231894] Avg episode reward: [(0, '198.250')] [2023-03-07 18:13:30,599][232226] Updated weights for policy 0, policy_version 91190 (0.0007) [2023-03-07 18:13:31,396][232226] Updated weights for policy 0, policy_version 91200 (0.0006) [2023-03-07 18:13:32,174][232226] Updated weights for policy 0, policy_version 91210 (0.0007) [2023-03-07 18:13:32,996][232226] Updated weights for policy 0, policy_version 91220 (0.0006) [2023-03-07 18:13:33,785][232226] Updated weights for policy 0, policy_version 91230 (0.0007) [2023-03-07 18:13:34,577][232226] Updated weights for policy 0, policy_version 91240 (0.0006) [2023-03-07 18:13:35,069][231894] Fps is (10 sec: 12697.6, 60 sec: 12817.1, 300 sec: 12659.4). Total num frames: 93434880. Throughput: 0: 12822.0. Samples: 93413515. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 18:13:35,069][231894] Avg episode reward: [(0, '197.662')] [2023-03-07 18:13:35,384][232226] Updated weights for policy 0, policy_version 91250 (0.0006) [2023-03-07 18:13:36,173][232226] Updated weights for policy 0, policy_version 91260 (0.0006) [2023-03-07 18:13:36,976][232226] Updated weights for policy 0, policy_version 91270 (0.0006) [2023-03-07 18:13:37,770][232226] Updated weights for policy 0, policy_version 91280 (0.0006) [2023-03-07 18:13:38,568][232226] Updated weights for policy 0, policy_version 91290 (0.0006) [2023-03-07 18:13:39,361][232226] Updated weights for policy 0, policy_version 91300 (0.0006) [2023-03-07 18:13:40,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12817.1, 300 sec: 12659.4). Total num frames: 93499392. Throughput: 0: 12823.6. Samples: 93490558. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 18:13:40,069][231894] Avg episode reward: [(0, '197.232')] [2023-03-07 18:13:40,168][232226] Updated weights for policy 0, policy_version 91310 (0.0007) [2023-03-07 18:13:40,957][232226] Updated weights for policy 0, policy_version 91320 (0.0006) [2023-03-07 18:13:41,726][232226] Updated weights for policy 0, policy_version 91330 (0.0006) [2023-03-07 18:13:42,536][232226] Updated weights for policy 0, policy_version 91340 (0.0006) [2023-03-07 18:13:43,332][232226] Updated weights for policy 0, policy_version 91350 (0.0006) [2023-03-07 18:13:44,127][232226] Updated weights for policy 0, policy_version 91360 (0.0006) [2023-03-07 18:13:44,949][232226] Updated weights for policy 0, policy_version 91370 (0.0007) [2023-03-07 18:13:45,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12817.1, 300 sec: 12662.9). Total num frames: 93563904. Throughput: 0: 12833.5. Samples: 93529351. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 18:13:45,069][231894] Avg episode reward: [(0, '194.203')] [2023-03-07 18:13:45,738][232226] Updated weights for policy 0, policy_version 91380 (0.0005) [2023-03-07 18:13:46,534][232226] Updated weights for policy 0, policy_version 91390 (0.0007) [2023-03-07 18:13:47,327][232226] Updated weights for policy 0, policy_version 91400 (0.0007) [2023-03-07 18:13:48,121][232226] Updated weights for policy 0, policy_version 91410 (0.0006) [2023-03-07 18:13:48,916][232226] Updated weights for policy 0, policy_version 91420 (0.0006) [2023-03-07 18:13:49,714][232226] Updated weights for policy 0, policy_version 91430 (0.0006) [2023-03-07 18:13:50,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12834.1, 300 sec: 12662.9). Total num frames: 93628416. Throughput: 0: 12833.6. Samples: 93606253. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 18:13:50,069][231894] Avg episode reward: [(0, '193.085')] [2023-03-07 18:13:50,500][232226] Updated weights for policy 0, policy_version 91440 (0.0006) [2023-03-07 18:13:51,294][232226] Updated weights for policy 0, policy_version 91450 (0.0006) [2023-03-07 18:13:52,078][232226] Updated weights for policy 0, policy_version 91460 (0.0006) [2023-03-07 18:13:52,889][232226] Updated weights for policy 0, policy_version 91470 (0.0007) [2023-03-07 18:13:53,665][232226] Updated weights for policy 0, policy_version 91480 (0.0007) [2023-03-07 18:13:54,455][232226] Updated weights for policy 0, policy_version 91490 (0.0006) [2023-03-07 18:13:55,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12834.1, 300 sec: 12666.4). Total num frames: 93692928. Throughput: 0: 12850.0. Samples: 93683853. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 18:13:55,069][231894] Avg episode reward: [(0, '200.224')] [2023-03-07 18:13:55,249][232226] Updated weights for policy 0, policy_version 91500 (0.0007) [2023-03-07 18:13:56,035][232226] Updated weights for policy 0, policy_version 91510 (0.0006) [2023-03-07 18:13:56,848][232226] Updated weights for policy 0, policy_version 91520 (0.0007) [2023-03-07 18:13:57,649][232226] Updated weights for policy 0, policy_version 91530 (0.0007) [2023-03-07 18:13:58,433][232226] Updated weights for policy 0, policy_version 91540 (0.0007) [2023-03-07 18:13:59,243][232226] Updated weights for policy 0, policy_version 91550 (0.0007) [2023-03-07 18:14:00,042][232226] Updated weights for policy 0, policy_version 91560 (0.0006) [2023-03-07 18:14:00,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12666.4). Total num frames: 93757440. Throughput: 0: 12847.9. Samples: 93722522. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 18:14:00,080][231894] Avg episode reward: [(0, '195.283')] [2023-03-07 18:14:00,837][232226] Updated weights for policy 0, policy_version 91570 (0.0007) [2023-03-07 18:14:01,627][232226] Updated weights for policy 0, policy_version 91580 (0.0006) [2023-03-07 18:14:02,443][232226] Updated weights for policy 0, policy_version 91590 (0.0006) [2023-03-07 18:14:03,240][232226] Updated weights for policy 0, policy_version 91600 (0.0006) [2023-03-07 18:14:04,026][232226] Updated weights for policy 0, policy_version 91610 (0.0007) [2023-03-07 18:14:04,826][232226] Updated weights for policy 0, policy_version 91620 (0.0007) [2023-03-07 18:14:05,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12834.1, 300 sec: 12666.4). Total num frames: 93820928. Throughput: 0: 12844.0. Samples: 93799318. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 18:14:05,070][231894] Avg episode reward: [(0, '189.797')] [2023-03-07 18:14:05,620][232226] Updated weights for policy 0, policy_version 91630 (0.0006) [2023-03-07 18:14:06,420][232226] Updated weights for policy 0, policy_version 91640 (0.0006) [2023-03-07 18:14:07,229][232226] Updated weights for policy 0, policy_version 91650 (0.0006) [2023-03-07 18:14:08,025][232226] Updated weights for policy 0, policy_version 91660 (0.0008) [2023-03-07 18:14:08,838][232226] Updated weights for policy 0, policy_version 91670 (0.0007) [2023-03-07 18:14:09,623][232226] Updated weights for policy 0, policy_version 91680 (0.0006) [2023-03-07 18:14:10,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12666.4). Total num frames: 93885440. Throughput: 0: 12847.9. Samples: 93876301. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 18:14:10,069][231894] Avg episode reward: [(0, '191.655')] [2023-03-07 18:14:10,419][232226] Updated weights for policy 0, policy_version 91690 (0.0006) [2023-03-07 18:14:11,213][232226] Updated weights for policy 0, policy_version 91700 (0.0006) [2023-03-07 18:14:12,000][232226] Updated weights for policy 0, policy_version 91710 (0.0006) [2023-03-07 18:14:12,799][232226] Updated weights for policy 0, policy_version 91720 (0.0006) [2023-03-07 18:14:13,603][232226] Updated weights for policy 0, policy_version 91730 (0.0006) [2023-03-07 18:14:14,384][232226] Updated weights for policy 0, policy_version 91740 (0.0006) [2023-03-07 18:14:15,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12851.2, 300 sec: 12666.4). Total num frames: 93949952. Throughput: 0: 12852.5. Samples: 93914940. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:14:15,069][231894] Avg episode reward: [(0, '195.855')] [2023-03-07 18:14:15,189][232226] Updated weights for policy 0, policy_version 91750 (0.0006) [2023-03-07 18:14:15,984][232226] Updated weights for policy 0, policy_version 91760 (0.0007) [2023-03-07 18:14:16,781][232226] Updated weights for policy 0, policy_version 91770 (0.0006) [2023-03-07 18:14:17,588][232226] Updated weights for policy 0, policy_version 91780 (0.0007) [2023-03-07 18:14:18,405][232226] Updated weights for policy 0, policy_version 91790 (0.0005) [2023-03-07 18:14:19,193][232226] Updated weights for policy 0, policy_version 91800 (0.0006) [2023-03-07 18:14:20,005][232226] Updated weights for policy 0, policy_version 91810 (0.0006) [2023-03-07 18:14:20,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12662.9). Total num frames: 94013440. Throughput: 0: 12851.7. Samples: 93991842. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:14:20,069][231894] Avg episode reward: [(0, '201.622')] [2023-03-07 18:14:20,811][232226] Updated weights for policy 0, policy_version 91820 (0.0008) [2023-03-07 18:14:21,615][232226] Updated weights for policy 0, policy_version 91830 (0.0006) [2023-03-07 18:14:22,410][232226] Updated weights for policy 0, policy_version 91840 (0.0006) [2023-03-07 18:14:23,214][232226] Updated weights for policy 0, policy_version 91850 (0.0006) [2023-03-07 18:14:24,002][232226] Updated weights for policy 0, policy_version 91860 (0.0007) [2023-03-07 18:14:24,803][232226] Updated weights for policy 0, policy_version 91870 (0.0007) [2023-03-07 18:14:25,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12834.1, 300 sec: 12666.4). Total num frames: 94077952. Throughput: 0: 12842.8. Samples: 94068486. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:14:25,069][231894] Avg episode reward: [(0, '191.970')] [2023-03-07 18:14:25,074][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000091873_94077952.pth... [2023-03-07 18:14:25,107][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000088902_91035648.pth [2023-03-07 18:14:25,615][232226] Updated weights for policy 0, policy_version 91880 (0.0005) [2023-03-07 18:14:26,408][232226] Updated weights for policy 0, policy_version 91890 (0.0007) [2023-03-07 18:14:27,206][232226] Updated weights for policy 0, policy_version 91900 (0.0007) [2023-03-07 18:14:28,003][232226] Updated weights for policy 0, policy_version 91910 (0.0007) [2023-03-07 18:14:28,798][232226] Updated weights for policy 0, policy_version 91920 (0.0006) [2023-03-07 18:14:29,585][232226] Updated weights for policy 0, policy_version 91930 (0.0007) [2023-03-07 18:14:30,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12851.2, 300 sec: 12666.4). Total num frames: 94142464. Throughput: 0: 12836.7. Samples: 94107004. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:14:30,069][231894] Avg episode reward: [(0, '193.874')] [2023-03-07 18:14:30,397][232226] Updated weights for policy 0, policy_version 91940 (0.0007) [2023-03-07 18:14:31,193][232226] Updated weights for policy 0, policy_version 91950 (0.0005) [2023-03-07 18:14:31,974][232226] Updated weights for policy 0, policy_version 91960 (0.0006) [2023-03-07 18:14:32,771][232226] Updated weights for policy 0, policy_version 91970 (0.0006) [2023-03-07 18:14:33,575][232226] Updated weights for policy 0, policy_version 91980 (0.0006) [2023-03-07 18:14:34,374][232226] Updated weights for policy 0, policy_version 91990 (0.0006) [2023-03-07 18:14:35,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12851.2, 300 sec: 12662.9). Total num frames: 94205952. Throughput: 0: 12842.3. Samples: 94184157. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:14:35,069][231894] Avg episode reward: [(0, '199.736')] [2023-03-07 18:14:35,177][232226] Updated weights for policy 0, policy_version 92000 (0.0007) [2023-03-07 18:14:35,966][232226] Updated weights for policy 0, policy_version 92010 (0.0006) [2023-03-07 18:14:36,770][232226] Updated weights for policy 0, policy_version 92020 (0.0006) [2023-03-07 18:14:37,553][232226] Updated weights for policy 0, policy_version 92030 (0.0006) [2023-03-07 18:14:38,353][232226] Updated weights for policy 0, policy_version 92040 (0.0006) [2023-03-07 18:14:39,158][232226] Updated weights for policy 0, policy_version 92050 (0.0006) [2023-03-07 18:14:39,943][232226] Updated weights for policy 0, policy_version 92060 (0.0006) [2023-03-07 18:14:40,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12851.2, 300 sec: 12666.4). Total num frames: 94270464. Throughput: 0: 12830.1. Samples: 94261207. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:14:40,069][231894] Avg episode reward: [(0, '194.749')] [2023-03-07 18:14:40,735][232226] Updated weights for policy 0, policy_version 92070 (0.0006) [2023-03-07 18:14:41,531][232226] Updated weights for policy 0, policy_version 92080 (0.0007) [2023-03-07 18:14:42,329][232226] Updated weights for policy 0, policy_version 92090 (0.0006) [2023-03-07 18:14:43,124][232226] Updated weights for policy 0, policy_version 92100 (0.0006) [2023-03-07 18:14:43,925][232226] Updated weights for policy 0, policy_version 92110 (0.0008) [2023-03-07 18:14:44,711][232226] Updated weights for policy 0, policy_version 92120 (0.0006) [2023-03-07 18:14:45,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12851.2, 300 sec: 12666.4). Total num frames: 94334976. Throughput: 0: 12830.5. Samples: 94299897. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:14:45,069][231894] Avg episode reward: [(0, '193.592')] [2023-03-07 18:14:45,529][232226] Updated weights for policy 0, policy_version 92130 (0.0005) [2023-03-07 18:14:46,319][232226] Updated weights for policy 0, policy_version 92140 (0.0007) [2023-03-07 18:14:47,143][232226] Updated weights for policy 0, policy_version 92150 (0.0007) [2023-03-07 18:14:47,919][232226] Updated weights for policy 0, policy_version 92160 (0.0006) [2023-03-07 18:14:48,718][232226] Updated weights for policy 0, policy_version 92170 (0.0006) [2023-03-07 18:14:49,514][232226] Updated weights for policy 0, policy_version 92180 (0.0006) [2023-03-07 18:14:50,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12662.9). Total num frames: 94398464. Throughput: 0: 12833.8. Samples: 94376835. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:14:50,069][231894] Avg episode reward: [(0, '192.647')] [2023-03-07 18:14:50,322][232226] Updated weights for policy 0, policy_version 92190 (0.0006) [2023-03-07 18:14:51,115][232226] Updated weights for policy 0, policy_version 92200 (0.0007) [2023-03-07 18:14:51,912][232226] Updated weights for policy 0, policy_version 92210 (0.0006) [2023-03-07 18:14:52,707][232226] Updated weights for policy 0, policy_version 92220 (0.0006) [2023-03-07 18:14:53,510][232226] Updated weights for policy 0, policy_version 92230 (0.0007) [2023-03-07 18:14:54,298][232226] Updated weights for policy 0, policy_version 92240 (0.0006) [2023-03-07 18:14:55,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12834.1, 300 sec: 12666.4). Total num frames: 94462976. Throughput: 0: 12832.3. Samples: 94453755. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:14:55,070][231894] Avg episode reward: [(0, '196.104')] [2023-03-07 18:14:55,085][232226] Updated weights for policy 0, policy_version 92250 (0.0006) [2023-03-07 18:14:55,891][232226] Updated weights for policy 0, policy_version 92260 (0.0006) [2023-03-07 18:14:56,686][232226] Updated weights for policy 0, policy_version 92270 (0.0007) [2023-03-07 18:14:57,490][232226] Updated weights for policy 0, policy_version 92280 (0.0006) [2023-03-07 18:14:58,287][232226] Updated weights for policy 0, policy_version 92290 (0.0006) [2023-03-07 18:14:59,098][232226] Updated weights for policy 0, policy_version 92300 (0.0006) [2023-03-07 18:14:59,875][232226] Updated weights for policy 0, policy_version 92310 (0.0006) [2023-03-07 18:15:00,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12834.1, 300 sec: 12673.3). Total num frames: 94527488. Throughput: 0: 12834.2. Samples: 94492482. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:15:00,069][231894] Avg episode reward: [(0, '200.302')] [2023-03-07 18:15:00,682][232226] Updated weights for policy 0, policy_version 92320 (0.0007) [2023-03-07 18:15:01,468][232226] Updated weights for policy 0, policy_version 92330 (0.0007) [2023-03-07 18:15:02,260][232226] Updated weights for policy 0, policy_version 92340 (0.0006) [2023-03-07 18:15:03,051][232226] Updated weights for policy 0, policy_version 92350 (0.0006) [2023-03-07 18:15:03,847][232226] Updated weights for policy 0, policy_version 92360 (0.0005) [2023-03-07 18:15:04,633][232226] Updated weights for policy 0, policy_version 92370 (0.0006) [2023-03-07 18:15:05,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12851.2, 300 sec: 12673.3). Total num frames: 94592000. Throughput: 0: 12841.0. Samples: 94569689. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:15:05,070][231894] Avg episode reward: [(0, '194.071')] [2023-03-07 18:15:05,438][232226] Updated weights for policy 0, policy_version 92380 (0.0006) [2023-03-07 18:15:06,238][232226] Updated weights for policy 0, policy_version 92390 (0.0006) [2023-03-07 18:15:07,025][232226] Updated weights for policy 0, policy_version 92400 (0.0007) [2023-03-07 18:15:07,825][232226] Updated weights for policy 0, policy_version 92410 (0.0006) [2023-03-07 18:15:08,628][232226] Updated weights for policy 0, policy_version 92420 (0.0006) [2023-03-07 18:15:09,416][232226] Updated weights for policy 0, policy_version 92430 (0.0006) [2023-03-07 18:15:10,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12851.2, 300 sec: 12683.7). Total num frames: 94656512. Throughput: 0: 12850.4. Samples: 94646754. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:15:10,070][231894] Avg episode reward: [(0, '193.271')] [2023-03-07 18:15:10,207][232226] Updated weights for policy 0, policy_version 92440 (0.0006) [2023-03-07 18:15:11,019][232226] Updated weights for policy 0, policy_version 92450 (0.0006) [2023-03-07 18:15:11,814][232226] Updated weights for policy 0, policy_version 92460 (0.0007) [2023-03-07 18:15:12,589][232226] Updated weights for policy 0, policy_version 92470 (0.0006) [2023-03-07 18:15:13,417][232226] Updated weights for policy 0, policy_version 92480 (0.0006) [2023-03-07 18:15:14,189][232226] Updated weights for policy 0, policy_version 92490 (0.0006) [2023-03-07 18:15:14,985][232226] Updated weights for policy 0, policy_version 92500 (0.0006) [2023-03-07 18:15:15,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12697.6). Total num frames: 94720000. Throughput: 0: 12855.2. Samples: 94685491. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:15:15,070][231894] Avg episode reward: [(0, '197.043')] [2023-03-07 18:15:15,809][232226] Updated weights for policy 0, policy_version 92510 (0.0006) [2023-03-07 18:15:16,606][232226] Updated weights for policy 0, policy_version 92520 (0.0007) [2023-03-07 18:15:17,406][232226] Updated weights for policy 0, policy_version 92530 (0.0005) [2023-03-07 18:15:18,192][232226] Updated weights for policy 0, policy_version 92540 (0.0006) [2023-03-07 18:15:19,013][232226] Updated weights for policy 0, policy_version 92550 (0.0006) [2023-03-07 18:15:19,808][232226] Updated weights for policy 0, policy_version 92560 (0.0006) [2023-03-07 18:15:20,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12851.2, 300 sec: 12711.5). Total num frames: 94784512. Throughput: 0: 12848.1. Samples: 94762322. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:15:20,069][231894] Avg episode reward: [(0, '196.604')] [2023-03-07 18:15:20,586][232226] Updated weights for policy 0, policy_version 92570 (0.0006) [2023-03-07 18:15:21,386][232226] Updated weights for policy 0, policy_version 92580 (0.0007) [2023-03-07 18:15:22,169][232226] Updated weights for policy 0, policy_version 92590 (0.0007) [2023-03-07 18:15:22,978][232226] Updated weights for policy 0, policy_version 92600 (0.0007) [2023-03-07 18:15:23,759][232226] Updated weights for policy 0, policy_version 92610 (0.0005) [2023-03-07 18:15:24,558][232226] Updated weights for policy 0, policy_version 92620 (0.0007) [2023-03-07 18:15:25,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12725.4). Total num frames: 94849024. Throughput: 0: 12856.2. Samples: 94839736. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:15:25,069][231894] Avg episode reward: [(0, '192.356')] [2023-03-07 18:15:25,355][232226] Updated weights for policy 0, policy_version 92630 (0.0007) [2023-03-07 18:15:26,148][232226] Updated weights for policy 0, policy_version 92640 (0.0008) [2023-03-07 18:15:26,931][232226] Updated weights for policy 0, policy_version 92650 (0.0007) [2023-03-07 18:15:27,732][232226] Updated weights for policy 0, policy_version 92660 (0.0006) [2023-03-07 18:15:28,514][232226] Updated weights for policy 0, policy_version 92670 (0.0006) [2023-03-07 18:15:29,311][232226] Updated weights for policy 0, policy_version 92680 (0.0006) [2023-03-07 18:15:30,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12851.2, 300 sec: 12739.3). Total num frames: 94913536. Throughput: 0: 12858.3. Samples: 94878518. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:15:30,069][231894] Avg episode reward: [(0, '197.876')] [2023-03-07 18:15:30,101][232226] Updated weights for policy 0, policy_version 92690 (0.0006) [2023-03-07 18:15:30,894][232226] Updated weights for policy 0, policy_version 92700 (0.0006) [2023-03-07 18:15:31,702][232226] Updated weights for policy 0, policy_version 92710 (0.0006) [2023-03-07 18:15:32,485][232226] Updated weights for policy 0, policy_version 92720 (0.0007) [2023-03-07 18:15:33,281][232226] Updated weights for policy 0, policy_version 92730 (0.0007) [2023-03-07 18:15:34,095][232226] Updated weights for policy 0, policy_version 92740 (0.0006) [2023-03-07 18:15:34,870][232226] Updated weights for policy 0, policy_version 92750 (0.0007) [2023-03-07 18:15:35,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.2, 300 sec: 12746.2). Total num frames: 94978048. Throughput: 0: 12866.2. Samples: 94955818. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:15:35,070][231894] Avg episode reward: [(0, '191.086')] [2023-03-07 18:15:35,677][232226] Updated weights for policy 0, policy_version 92760 (0.0006) [2023-03-07 18:15:36,476][232226] Updated weights for policy 0, policy_version 92770 (0.0006) [2023-03-07 18:15:37,254][232226] Updated weights for policy 0, policy_version 92780 (0.0006) [2023-03-07 18:15:38,064][232226] Updated weights for policy 0, policy_version 92790 (0.0007) [2023-03-07 18:15:38,855][232226] Updated weights for policy 0, policy_version 92800 (0.0007) [2023-03-07 18:15:39,649][232226] Updated weights for policy 0, policy_version 92810 (0.0006) [2023-03-07 18:15:40,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12749.7). Total num frames: 95042560. Throughput: 0: 12873.5. Samples: 95033062. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:15:40,069][231894] Avg episode reward: [(0, '201.376')] [2023-03-07 18:15:40,434][232226] Updated weights for policy 0, policy_version 92820 (0.0006) [2023-03-07 18:15:41,248][232226] Updated weights for policy 0, policy_version 92830 (0.0006) [2023-03-07 18:15:42,020][232226] Updated weights for policy 0, policy_version 92840 (0.0007) [2023-03-07 18:15:42,817][232226] Updated weights for policy 0, policy_version 92850 (0.0007) [2023-03-07 18:15:43,622][232226] Updated weights for policy 0, policy_version 92860 (0.0006) [2023-03-07 18:15:44,404][232226] Updated weights for policy 0, policy_version 92870 (0.0006) [2023-03-07 18:15:45,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12770.5). Total num frames: 95107072. Throughput: 0: 12877.4. Samples: 95071964. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:15:45,069][231894] Avg episode reward: [(0, '197.138')] [2023-03-07 18:15:45,190][232226] Updated weights for policy 0, policy_version 92880 (0.0006) [2023-03-07 18:15:45,985][232226] Updated weights for policy 0, policy_version 92890 (0.0006) [2023-03-07 18:15:46,763][232226] Updated weights for policy 0, policy_version 92900 (0.0006) [2023-03-07 18:15:47,591][232226] Updated weights for policy 0, policy_version 92910 (0.0006) [2023-03-07 18:15:48,382][232226] Updated weights for policy 0, policy_version 92920 (0.0006) [2023-03-07 18:15:49,188][232226] Updated weights for policy 0, policy_version 92930 (0.0006) [2023-03-07 18:15:49,977][232226] Updated weights for policy 0, policy_version 92940 (0.0006) [2023-03-07 18:15:50,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12770.5). Total num frames: 95171584. Throughput: 0: 12875.8. Samples: 95149100. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:15:50,069][231894] Avg episode reward: [(0, '186.753')] [2023-03-07 18:15:50,770][232226] Updated weights for policy 0, policy_version 92950 (0.0006) [2023-03-07 18:15:51,563][232226] Updated weights for policy 0, policy_version 92960 (0.0006) [2023-03-07 18:15:52,373][232226] Updated weights for policy 0, policy_version 92970 (0.0007) [2023-03-07 18:15:53,171][232226] Updated weights for policy 0, policy_version 92980 (0.0006) [2023-03-07 18:15:53,969][232226] Updated weights for policy 0, policy_version 92990 (0.0006) [2023-03-07 18:15:54,747][232226] Updated weights for policy 0, policy_version 93000 (0.0006) [2023-03-07 18:15:55,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12767.0). Total num frames: 95235072. Throughput: 0: 12879.6. Samples: 95226337. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:15:55,069][231894] Avg episode reward: [(0, '200.043')] [2023-03-07 18:15:55,549][232226] Updated weights for policy 0, policy_version 93010 (0.0007) [2023-03-07 18:15:56,372][232226] Updated weights for policy 0, policy_version 93020 (0.0006) [2023-03-07 18:15:57,164][232226] Updated weights for policy 0, policy_version 93030 (0.0007) [2023-03-07 18:15:57,958][232226] Updated weights for policy 0, policy_version 93040 (0.0006) [2023-03-07 18:15:58,771][232226] Updated weights for policy 0, policy_version 93050 (0.0007) [2023-03-07 18:15:59,550][232226] Updated weights for policy 0, policy_version 93060 (0.0006) [2023-03-07 18:16:00,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12868.3, 300 sec: 12767.0). Total num frames: 95299584. Throughput: 0: 12872.6. Samples: 95264754. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:16:00,069][231894] Avg episode reward: [(0, '191.881')] [2023-03-07 18:16:00,342][232226] Updated weights for policy 0, policy_version 93070 (0.0006) [2023-03-07 18:16:01,173][232226] Updated weights for policy 0, policy_version 93080 (0.0006) [2023-03-07 18:16:01,961][232226] Updated weights for policy 0, policy_version 93090 (0.0006) [2023-03-07 18:16:02,761][232226] Updated weights for policy 0, policy_version 93100 (0.0006) [2023-03-07 18:16:03,546][232226] Updated weights for policy 0, policy_version 93110 (0.0006) [2023-03-07 18:16:04,356][232226] Updated weights for policy 0, policy_version 93120 (0.0006) [2023-03-07 18:16:05,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12851.2, 300 sec: 12763.6). Total num frames: 95363072. Throughput: 0: 12870.2. Samples: 95341483. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:16:05,069][231894] Avg episode reward: [(0, '190.467')] [2023-03-07 18:16:05,154][232226] Updated weights for policy 0, policy_version 93130 (0.0006) [2023-03-07 18:16:05,942][232226] Updated weights for policy 0, policy_version 93140 (0.0007) [2023-03-07 18:16:06,741][232226] Updated weights for policy 0, policy_version 93150 (0.0007) [2023-03-07 18:16:07,551][232226] Updated weights for policy 0, policy_version 93160 (0.0006) [2023-03-07 18:16:08,342][232226] Updated weights for policy 0, policy_version 93170 (0.0006) [2023-03-07 18:16:09,132][232226] Updated weights for policy 0, policy_version 93180 (0.0006) [2023-03-07 18:16:09,947][232226] Updated weights for policy 0, policy_version 93190 (0.0006) [2023-03-07 18:16:10,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12763.6). Total num frames: 95427584. Throughput: 0: 12861.2. Samples: 95418487. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:16:10,069][231894] Avg episode reward: [(0, '196.734')] [2023-03-07 18:16:10,722][232226] Updated weights for policy 0, policy_version 93200 (0.0006) [2023-03-07 18:16:11,531][232226] Updated weights for policy 0, policy_version 93210 (0.0006) [2023-03-07 18:16:12,336][232226] Updated weights for policy 0, policy_version 93220 (0.0007) [2023-03-07 18:16:13,138][232226] Updated weights for policy 0, policy_version 93230 (0.0007) [2023-03-07 18:16:13,931][232226] Updated weights for policy 0, policy_version 93240 (0.0008) [2023-03-07 18:16:14,725][232226] Updated weights for policy 0, policy_version 93250 (0.0007) [2023-03-07 18:16:15,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12868.3, 300 sec: 12763.6). Total num frames: 95492096. Throughput: 0: 12856.7. Samples: 95457071. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:16:15,069][231894] Avg episode reward: [(0, '196.471')] [2023-03-07 18:16:15,517][232226] Updated weights for policy 0, policy_version 93260 (0.0006) [2023-03-07 18:16:16,314][232226] Updated weights for policy 0, policy_version 93270 (0.0006) [2023-03-07 18:16:17,113][232226] Updated weights for policy 0, policy_version 93280 (0.0006) [2023-03-07 18:16:17,915][232226] Updated weights for policy 0, policy_version 93290 (0.0006) [2023-03-07 18:16:18,695][232226] Updated weights for policy 0, policy_version 93300 (0.0007) [2023-03-07 18:16:19,499][232226] Updated weights for policy 0, policy_version 93310 (0.0007) [2023-03-07 18:16:20,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12763.5). Total num frames: 95556608. Throughput: 0: 12853.9. Samples: 95534241. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:16:20,070][231894] Avg episode reward: [(0, '192.006')] [2023-03-07 18:16:20,283][232226] Updated weights for policy 0, policy_version 93320 (0.0006) [2023-03-07 18:16:21,104][232226] Updated weights for policy 0, policy_version 93330 (0.0007) [2023-03-07 18:16:21,885][232226] Updated weights for policy 0, policy_version 93340 (0.0006) [2023-03-07 18:16:22,685][232226] Updated weights for policy 0, policy_version 93350 (0.0007) [2023-03-07 18:16:23,477][232226] Updated weights for policy 0, policy_version 93360 (0.0006) [2023-03-07 18:16:24,255][232226] Updated weights for policy 0, policy_version 93370 (0.0006) [2023-03-07 18:16:25,047][232226] Updated weights for policy 0, policy_version 93380 (0.0006) [2023-03-07 18:16:25,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12763.6). Total num frames: 95621120. Throughput: 0: 12852.6. Samples: 95611428. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:16:25,069][231894] Avg episode reward: [(0, '191.373')] [2023-03-07 18:16:25,074][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000093380_95621120.pth... [2023-03-07 18:16:25,105][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000090368_92536832.pth [2023-03-07 18:16:25,862][232226] Updated weights for policy 0, policy_version 93390 (0.0007) [2023-03-07 18:16:26,653][232226] Updated weights for policy 0, policy_version 93400 (0.0007) [2023-03-07 18:16:27,445][232226] Updated weights for policy 0, policy_version 93410 (0.0006) [2023-03-07 18:16:28,254][232226] Updated weights for policy 0, policy_version 93420 (0.0006) [2023-03-07 18:16:29,038][232226] Updated weights for policy 0, policy_version 93430 (0.0006) [2023-03-07 18:16:29,851][232226] Updated weights for policy 0, policy_version 93440 (0.0007) [2023-03-07 18:16:30,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12763.6). Total num frames: 95685632. Throughput: 0: 12847.0. Samples: 95650077. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:16:30,069][231894] Avg episode reward: [(0, '204.019')] [2023-03-07 18:16:30,646][232226] Updated weights for policy 0, policy_version 93450 (0.0007) [2023-03-07 18:16:31,433][232226] Updated weights for policy 0, policy_version 93460 (0.0006) [2023-03-07 18:16:32,239][232226] Updated weights for policy 0, policy_version 93470 (0.0007) [2023-03-07 18:16:33,029][232226] Updated weights for policy 0, policy_version 93480 (0.0007) [2023-03-07 18:16:33,826][232226] Updated weights for policy 0, policy_version 93490 (0.0006) [2023-03-07 18:16:34,630][232226] Updated weights for policy 0, policy_version 93500 (0.0007) [2023-03-07 18:16:35,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12851.2, 300 sec: 12760.1). Total num frames: 95749120. Throughput: 0: 12849.1. Samples: 95727309. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:16:35,070][231894] Avg episode reward: [(0, '197.428')] [2023-03-07 18:16:35,441][232226] Updated weights for policy 0, policy_version 93510 (0.0006) [2023-03-07 18:16:36,224][232226] Updated weights for policy 0, policy_version 93520 (0.0006) [2023-03-07 18:16:37,032][232226] Updated weights for policy 0, policy_version 93530 (0.0006) [2023-03-07 18:16:37,838][232226] Updated weights for policy 0, policy_version 93540 (0.0006) [2023-03-07 18:16:38,649][232226] Updated weights for policy 0, policy_version 93550 (0.0007) [2023-03-07 18:16:39,442][232226] Updated weights for policy 0, policy_version 93560 (0.0006) [2023-03-07 18:16:40,069][231894] Fps is (10 sec: 12697.7, 60 sec: 12834.1, 300 sec: 12767.0). Total num frames: 95812608. Throughput: 0: 12830.7. Samples: 95803719. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:16:40,069][231894] Avg episode reward: [(0, '197.754')] [2023-03-07 18:16:40,236][232226] Updated weights for policy 0, policy_version 93570 (0.0007) [2023-03-07 18:16:41,038][232226] Updated weights for policy 0, policy_version 93580 (0.0006) [2023-03-07 18:16:41,807][232226] Updated weights for policy 0, policy_version 93590 (0.0006) [2023-03-07 18:16:42,629][232226] Updated weights for policy 0, policy_version 93600 (0.0006) [2023-03-07 18:16:43,438][232226] Updated weights for policy 0, policy_version 93610 (0.0006) [2023-03-07 18:16:44,215][232226] Updated weights for policy 0, policy_version 93620 (0.0006) [2023-03-07 18:16:45,015][232226] Updated weights for policy 0, policy_version 93630 (0.0006) [2023-03-07 18:16:45,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12780.9). Total num frames: 95877120. Throughput: 0: 12835.2. Samples: 95842342. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:16:45,070][231894] Avg episode reward: [(0, '199.231')] [2023-03-07 18:16:45,830][232226] Updated weights for policy 0, policy_version 93640 (0.0006) [2023-03-07 18:16:46,618][232226] Updated weights for policy 0, policy_version 93650 (0.0006) [2023-03-07 18:16:47,394][232226] Updated weights for policy 0, policy_version 93660 (0.0007) [2023-03-07 18:16:48,201][232226] Updated weights for policy 0, policy_version 93670 (0.0006) [2023-03-07 18:16:49,012][232226] Updated weights for policy 0, policy_version 93680 (0.0006) [2023-03-07 18:16:49,819][232226] Updated weights for policy 0, policy_version 93690 (0.0006) [2023-03-07 18:16:50,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12834.1, 300 sec: 12794.8). Total num frames: 95941632. Throughput: 0: 12841.6. Samples: 95919354. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:16:50,069][231894] Avg episode reward: [(0, '196.701')] [2023-03-07 18:16:50,617][232226] Updated weights for policy 0, policy_version 93700 (0.0008) [2023-03-07 18:16:51,399][232226] Updated weights for policy 0, policy_version 93710 (0.0006) [2023-03-07 18:16:52,210][232226] Updated weights for policy 0, policy_version 93720 (0.0006) [2023-03-07 18:16:53,005][232226] Updated weights for policy 0, policy_version 93730 (0.0005) [2023-03-07 18:16:53,803][232226] Updated weights for policy 0, policy_version 93740 (0.0006) [2023-03-07 18:16:54,591][232226] Updated weights for policy 0, policy_version 93750 (0.0006) [2023-03-07 18:16:55,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12834.1, 300 sec: 12805.2). Total num frames: 96005120. Throughput: 0: 12840.2. Samples: 95996295. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:16:55,069][231894] Avg episode reward: [(0, '192.674')] [2023-03-07 18:16:55,405][232226] Updated weights for policy 0, policy_version 93760 (0.0006) [2023-03-07 18:16:56,205][232226] Updated weights for policy 0, policy_version 93770 (0.0007) [2023-03-07 18:16:57,017][232226] Updated weights for policy 0, policy_version 93780 (0.0006) [2023-03-07 18:16:57,826][232226] Updated weights for policy 0, policy_version 93790 (0.0007) [2023-03-07 18:16:58,608][232226] Updated weights for policy 0, policy_version 93800 (0.0006) [2023-03-07 18:16:59,427][232226] Updated weights for policy 0, policy_version 93810 (0.0007) [2023-03-07 18:17:00,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12834.1, 300 sec: 12819.1). Total num frames: 96069632. Throughput: 0: 12832.5. Samples: 96034532. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:17:00,069][231894] Avg episode reward: [(0, '192.115')] [2023-03-07 18:17:00,225][232226] Updated weights for policy 0, policy_version 93820 (0.0006) [2023-03-07 18:17:01,007][232226] Updated weights for policy 0, policy_version 93830 (0.0006) [2023-03-07 18:17:01,807][232226] Updated weights for policy 0, policy_version 93840 (0.0007) [2023-03-07 18:17:02,599][232226] Updated weights for policy 0, policy_version 93850 (0.0006) [2023-03-07 18:17:03,387][232226] Updated weights for policy 0, policy_version 93860 (0.0007) [2023-03-07 18:17:04,181][232226] Updated weights for policy 0, policy_version 93870 (0.0006) [2023-03-07 18:17:05,000][232226] Updated weights for policy 0, policy_version 93880 (0.0006) [2023-03-07 18:17:05,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12834.2, 300 sec: 12822.6). Total num frames: 96133120. Throughput: 0: 12827.5. Samples: 96111477. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 18:17:05,069][231894] Avg episode reward: [(0, '184.227')] [2023-03-07 18:17:05,784][232226] Updated weights for policy 0, policy_version 93890 (0.0006) [2023-03-07 18:17:06,569][232226] Updated weights for policy 0, policy_version 93900 (0.0006) [2023-03-07 18:17:07,375][232226] Updated weights for policy 0, policy_version 93910 (0.0006) [2023-03-07 18:17:08,166][232226] Updated weights for policy 0, policy_version 93920 (0.0006) [2023-03-07 18:17:08,951][232226] Updated weights for policy 0, policy_version 93930 (0.0006) [2023-03-07 18:17:09,775][232226] Updated weights for policy 0, policy_version 93940 (0.0006) [2023-03-07 18:17:10,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12834.1, 300 sec: 12826.0). Total num frames: 96197632. Throughput: 0: 12827.4. Samples: 96188660. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 18:17:10,069][231894] Avg episode reward: [(0, '202.035')] [2023-03-07 18:17:10,564][232226] Updated weights for policy 0, policy_version 93950 (0.0006) [2023-03-07 18:17:11,375][232226] Updated weights for policy 0, policy_version 93960 (0.0006) [2023-03-07 18:17:12,173][232226] Updated weights for policy 0, policy_version 93970 (0.0006) [2023-03-07 18:17:12,963][232226] Updated weights for policy 0, policy_version 93980 (0.0006) [2023-03-07 18:17:13,766][232226] Updated weights for policy 0, policy_version 93990 (0.0006) [2023-03-07 18:17:14,555][232226] Updated weights for policy 0, policy_version 94000 (0.0006) [2023-03-07 18:17:15,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12834.1, 300 sec: 12846.9). Total num frames: 96262144. Throughput: 0: 12826.6. Samples: 96227273. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 18:17:15,069][231894] Avg episode reward: [(0, '198.446')] [2023-03-07 18:17:15,365][232226] Updated weights for policy 0, policy_version 94010 (0.0007) [2023-03-07 18:17:16,151][232226] Updated weights for policy 0, policy_version 94020 (0.0007) [2023-03-07 18:17:16,926][232226] Updated weights for policy 0, policy_version 94030 (0.0006) [2023-03-07 18:17:17,730][232226] Updated weights for policy 0, policy_version 94040 (0.0007) [2023-03-07 18:17:18,521][232226] Updated weights for policy 0, policy_version 94050 (0.0006) [2023-03-07 18:17:19,304][232226] Updated weights for policy 0, policy_version 94060 (0.0007) [2023-03-07 18:17:20,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12834.1, 300 sec: 12846.9). Total num frames: 96326656. Throughput: 0: 12826.4. Samples: 96304498. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 18:17:20,069][231894] Avg episode reward: [(0, '194.635')] [2023-03-07 18:17:20,114][232226] Updated weights for policy 0, policy_version 94070 (0.0006) [2023-03-07 18:17:20,914][232226] Updated weights for policy 0, policy_version 94080 (0.0006) [2023-03-07 18:17:21,713][232226] Updated weights for policy 0, policy_version 94090 (0.0006) [2023-03-07 18:17:22,506][232226] Updated weights for policy 0, policy_version 94100 (0.0006) [2023-03-07 18:17:23,309][232226] Updated weights for policy 0, policy_version 94110 (0.0006) [2023-03-07 18:17:24,106][232226] Updated weights for policy 0, policy_version 94120 (0.0008) [2023-03-07 18:17:24,897][232226] Updated weights for policy 0, policy_version 94130 (0.0006) [2023-03-07 18:17:25,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12834.1, 300 sec: 12846.9). Total num frames: 96391168. Throughput: 0: 12837.6. Samples: 96381413. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 18:17:25,069][231894] Avg episode reward: [(0, '197.035')] [2023-03-07 18:17:25,716][232226] Updated weights for policy 0, policy_version 94140 (0.0006) [2023-03-07 18:17:26,507][232226] Updated weights for policy 0, policy_version 94150 (0.0005) [2023-03-07 18:17:27,302][232226] Updated weights for policy 0, policy_version 94160 (0.0006) [2023-03-07 18:17:28,110][232226] Updated weights for policy 0, policy_version 94170 (0.0006) [2023-03-07 18:17:28,895][232226] Updated weights for policy 0, policy_version 94180 (0.0006) [2023-03-07 18:17:29,718][232226] Updated weights for policy 0, policy_version 94190 (0.0007) [2023-03-07 18:17:30,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12817.0, 300 sec: 12843.4). Total num frames: 96454656. Throughput: 0: 12835.2. Samples: 96419924. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 18:17:30,070][231894] Avg episode reward: [(0, '195.499')] [2023-03-07 18:17:30,502][232226] Updated weights for policy 0, policy_version 94200 (0.0006) [2023-03-07 18:17:31,306][232226] Updated weights for policy 0, policy_version 94210 (0.0006) [2023-03-07 18:17:32,111][232226] Updated weights for policy 0, policy_version 94220 (0.0006) [2023-03-07 18:17:32,894][232226] Updated weights for policy 0, policy_version 94230 (0.0007) [2023-03-07 18:17:33,694][232226] Updated weights for policy 0, policy_version 94240 (0.0007) [2023-03-07 18:17:34,498][232226] Updated weights for policy 0, policy_version 94250 (0.0006) [2023-03-07 18:17:35,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12834.1, 300 sec: 12843.4). Total num frames: 96519168. Throughput: 0: 12833.4. Samples: 96496856. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 18:17:35,069][231894] Avg episode reward: [(0, '192.454')] [2023-03-07 18:17:35,286][232226] Updated weights for policy 0, policy_version 94260 (0.0006) [2023-03-07 18:17:36,081][232226] Updated weights for policy 0, policy_version 94270 (0.0006) [2023-03-07 18:17:36,881][232226] Updated weights for policy 0, policy_version 94280 (0.0007) [2023-03-07 18:17:37,682][232226] Updated weights for policy 0, policy_version 94290 (0.0006) [2023-03-07 18:17:38,481][232226] Updated weights for policy 0, policy_version 94300 (0.0006) [2023-03-07 18:17:39,284][232226] Updated weights for policy 0, policy_version 94310 (0.0006) [2023-03-07 18:17:40,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12834.1, 300 sec: 12839.9). Total num frames: 96582656. Throughput: 0: 12833.3. Samples: 96573794. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 18:17:40,070][231894] Avg episode reward: [(0, '201.401')] [2023-03-07 18:17:40,087][232226] Updated weights for policy 0, policy_version 94320 (0.0007) [2023-03-07 18:17:40,887][232226] Updated weights for policy 0, policy_version 94330 (0.0006) [2023-03-07 18:17:41,672][232226] Updated weights for policy 0, policy_version 94340 (0.0006) [2023-03-07 18:17:42,465][232226] Updated weights for policy 0, policy_version 94350 (0.0006) [2023-03-07 18:17:43,281][232226] Updated weights for policy 0, policy_version 94360 (0.0007) [2023-03-07 18:17:44,077][232226] Updated weights for policy 0, policy_version 94370 (0.0008) [2023-03-07 18:17:44,855][232226] Updated weights for policy 0, policy_version 94380 (0.0007) [2023-03-07 18:17:45,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12843.4). Total num frames: 96647168. Throughput: 0: 12840.3. Samples: 96612346. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 18:17:45,069][231894] Avg episode reward: [(0, '197.319')] [2023-03-07 18:17:45,660][232226] Updated weights for policy 0, policy_version 94390 (0.0006) [2023-03-07 18:17:46,457][232226] Updated weights for policy 0, policy_version 94400 (0.0006) [2023-03-07 18:17:47,254][232226] Updated weights for policy 0, policy_version 94410 (0.0006) [2023-03-07 18:17:48,057][232226] Updated weights for policy 0, policy_version 94420 (0.0007) [2023-03-07 18:17:48,853][232226] Updated weights for policy 0, policy_version 94430 (0.0007) [2023-03-07 18:17:49,668][232226] Updated weights for policy 0, policy_version 94440 (0.0007) [2023-03-07 18:17:50,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12834.1, 300 sec: 12843.4). Total num frames: 96711680. Throughput: 0: 12843.2. Samples: 96689423. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 18:17:50,069][231894] Avg episode reward: [(0, '191.212')] [2023-03-07 18:17:50,451][232226] Updated weights for policy 0, policy_version 94450 (0.0005) [2023-03-07 18:17:51,230][232226] Updated weights for policy 0, policy_version 94460 (0.0006) [2023-03-07 18:17:52,021][232226] Updated weights for policy 0, policy_version 94470 (0.0006) [2023-03-07 18:17:52,815][232226] Updated weights for policy 0, policy_version 94480 (0.0006) [2023-03-07 18:17:53,604][232226] Updated weights for policy 0, policy_version 94490 (0.0006) [2023-03-07 18:17:54,414][232226] Updated weights for policy 0, policy_version 94500 (0.0006) [2023-03-07 18:17:55,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12851.2, 300 sec: 12846.9). Total num frames: 96776192. Throughput: 0: 12844.0. Samples: 96766638. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 18:17:55,069][231894] Avg episode reward: [(0, '197.062')] [2023-03-07 18:17:55,198][232226] Updated weights for policy 0, policy_version 94510 (0.0006) [2023-03-07 18:17:56,020][232226] Updated weights for policy 0, policy_version 94520 (0.0006) [2023-03-07 18:17:56,811][232226] Updated weights for policy 0, policy_version 94530 (0.0006) [2023-03-07 18:17:57,603][232226] Updated weights for policy 0, policy_version 94540 (0.0006) [2023-03-07 18:17:58,429][232226] Updated weights for policy 0, policy_version 94550 (0.0007) [2023-03-07 18:17:59,223][232226] Updated weights for policy 0, policy_version 94560 (0.0007) [2023-03-07 18:17:59,998][232226] Updated weights for policy 0, policy_version 94570 (0.0006) [2023-03-07 18:18:00,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12851.2, 300 sec: 12846.9). Total num frames: 96840704. Throughput: 0: 12838.7. Samples: 96805012. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:18:00,069][231894] Avg episode reward: [(0, '193.997')] [2023-03-07 18:18:00,805][232226] Updated weights for policy 0, policy_version 94580 (0.0007) [2023-03-07 18:18:01,596][232226] Updated weights for policy 0, policy_version 94590 (0.0006) [2023-03-07 18:18:02,410][232226] Updated weights for policy 0, policy_version 94600 (0.0006) [2023-03-07 18:18:03,212][232226] Updated weights for policy 0, policy_version 94610 (0.0006) [2023-03-07 18:18:03,994][232226] Updated weights for policy 0, policy_version 94620 (0.0007) [2023-03-07 18:18:04,798][232226] Updated weights for policy 0, policy_version 94630 (0.0006) [2023-03-07 18:18:05,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12843.4). Total num frames: 96904192. Throughput: 0: 12836.5. Samples: 96882140. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:18:05,069][231894] Avg episode reward: [(0, '199.310')] [2023-03-07 18:18:05,595][232226] Updated weights for policy 0, policy_version 94640 (0.0007) [2023-03-07 18:18:06,385][232226] Updated weights for policy 0, policy_version 94650 (0.0006) [2023-03-07 18:18:07,183][232226] Updated weights for policy 0, policy_version 94660 (0.0006) [2023-03-07 18:18:07,992][232226] Updated weights for policy 0, policy_version 94670 (0.0006) [2023-03-07 18:18:08,773][232226] Updated weights for policy 0, policy_version 94680 (0.0007) [2023-03-07 18:18:09,561][232226] Updated weights for policy 0, policy_version 94690 (0.0006) [2023-03-07 18:18:10,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12851.2, 300 sec: 12846.9). Total num frames: 96968704. Throughput: 0: 12840.8. Samples: 96959248. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:18:10,069][231894] Avg episode reward: [(0, '200.206')] [2023-03-07 18:18:10,369][232226] Updated weights for policy 0, policy_version 94700 (0.0007) [2023-03-07 18:18:11,165][232226] Updated weights for policy 0, policy_version 94710 (0.0006) [2023-03-07 18:18:11,976][232226] Updated weights for policy 0, policy_version 94720 (0.0007) [2023-03-07 18:18:12,756][232226] Updated weights for policy 0, policy_version 94730 (0.0007) [2023-03-07 18:18:13,545][232226] Updated weights for policy 0, policy_version 94740 (0.0006) [2023-03-07 18:18:14,354][232226] Updated weights for policy 0, policy_version 94750 (0.0006) [2023-03-07 18:18:15,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12851.2, 300 sec: 12846.9). Total num frames: 97033216. Throughput: 0: 12840.2. Samples: 96997732. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:18:15,069][231894] Avg episode reward: [(0, '192.935')] [2023-03-07 18:18:15,144][232226] Updated weights for policy 0, policy_version 94760 (0.0006) [2023-03-07 18:18:15,927][232226] Updated weights for policy 0, policy_version 94770 (0.0006) [2023-03-07 18:18:16,726][232226] Updated weights for policy 0, policy_version 94780 (0.0006) [2023-03-07 18:18:17,512][232226] Updated weights for policy 0, policy_version 94790 (0.0006) [2023-03-07 18:18:18,304][232226] Updated weights for policy 0, policy_version 94800 (0.0006) [2023-03-07 18:18:19,094][232226] Updated weights for policy 0, policy_version 94810 (0.0006) [2023-03-07 18:18:19,890][232226] Updated weights for policy 0, policy_version 94820 (0.0007) [2023-03-07 18:18:20,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12846.9). Total num frames: 97097728. Throughput: 0: 12853.4. Samples: 97075257. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:18:20,069][231894] Avg episode reward: [(0, '200.566')] [2023-03-07 18:18:20,705][232226] Updated weights for policy 0, policy_version 94830 (0.0007) [2023-03-07 18:18:21,486][232226] Updated weights for policy 0, policy_version 94840 (0.0006) [2023-03-07 18:18:22,311][232226] Updated weights for policy 0, policy_version 94850 (0.0007) [2023-03-07 18:18:23,091][232226] Updated weights for policy 0, policy_version 94860 (0.0007) [2023-03-07 18:18:23,854][232226] Updated weights for policy 0, policy_version 94870 (0.0006) [2023-03-07 18:18:24,670][232226] Updated weights for policy 0, policy_version 94880 (0.0006) [2023-03-07 18:18:25,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12851.2, 300 sec: 12850.3). Total num frames: 97162240. Throughput: 0: 12859.3. Samples: 97152463. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:18:25,069][231894] Avg episode reward: [(0, '202.618')] [2023-03-07 18:18:25,074][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000094885_97162240.pth... [2023-03-07 18:18:25,104][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000091873_94077952.pth [2023-03-07 18:18:25,457][232226] Updated weights for policy 0, policy_version 94890 (0.0007) [2023-03-07 18:18:26,263][232226] Updated weights for policy 0, policy_version 94900 (0.0006) [2023-03-07 18:18:27,054][232226] Updated weights for policy 0, policy_version 94910 (0.0006) [2023-03-07 18:18:27,852][232226] Updated weights for policy 0, policy_version 94920 (0.0006) [2023-03-07 18:18:28,640][232226] Updated weights for policy 0, policy_version 94930 (0.0007) [2023-03-07 18:18:29,427][232226] Updated weights for policy 0, policy_version 94940 (0.0007) [2023-03-07 18:18:30,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12850.3). Total num frames: 97225728. Throughput: 0: 12861.1. Samples: 97191097. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:18:30,069][231894] Avg episode reward: [(0, '196.132')] [2023-03-07 18:18:30,229][232226] Updated weights for policy 0, policy_version 94950 (0.0006) [2023-03-07 18:18:31,037][232226] Updated weights for policy 0, policy_version 94960 (0.0007) [2023-03-07 18:18:31,826][232226] Updated weights for policy 0, policy_version 94970 (0.0006) [2023-03-07 18:18:32,614][232226] Updated weights for policy 0, policy_version 94980 (0.0006) [2023-03-07 18:18:33,416][232226] Updated weights for policy 0, policy_version 94990 (0.0007) [2023-03-07 18:18:34,218][232226] Updated weights for policy 0, policy_version 95000 (0.0006) [2023-03-07 18:18:35,004][232226] Updated weights for policy 0, policy_version 95010 (0.0006) [2023-03-07 18:18:35,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12851.2, 300 sec: 12850.3). Total num frames: 97290240. Throughput: 0: 12867.3. Samples: 97268451. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:18:35,069][231894] Avg episode reward: [(0, '191.880')] [2023-03-07 18:18:35,803][232226] Updated weights for policy 0, policy_version 95020 (0.0006) [2023-03-07 18:18:36,578][232226] Updated weights for policy 0, policy_version 95030 (0.0006) [2023-03-07 18:18:37,367][232226] Updated weights for policy 0, policy_version 95040 (0.0007) [2023-03-07 18:18:38,164][232226] Updated weights for policy 0, policy_version 95050 (0.0007) [2023-03-07 18:18:38,961][232226] Updated weights for policy 0, policy_version 95060 (0.0006) [2023-03-07 18:18:39,763][232226] Updated weights for policy 0, policy_version 95070 (0.0006) [2023-03-07 18:18:40,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12850.3). Total num frames: 97354752. Throughput: 0: 12872.8. Samples: 97345914. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:18:40,069][231894] Avg episode reward: [(0, '198.052')] [2023-03-07 18:18:40,556][232226] Updated weights for policy 0, policy_version 95080 (0.0007) [2023-03-07 18:18:41,348][232226] Updated weights for policy 0, policy_version 95090 (0.0007) [2023-03-07 18:18:42,152][232226] Updated weights for policy 0, policy_version 95100 (0.0006) [2023-03-07 18:18:42,944][232226] Updated weights for policy 0, policy_version 95110 (0.0006) [2023-03-07 18:18:43,733][232226] Updated weights for policy 0, policy_version 95120 (0.0006) [2023-03-07 18:18:44,550][232226] Updated weights for policy 0, policy_version 95130 (0.0006) [2023-03-07 18:18:45,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12850.3). Total num frames: 97419264. Throughput: 0: 12876.3. Samples: 97384445. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:18:45,069][231894] Avg episode reward: [(0, '201.917')] [2023-03-07 18:18:45,332][232226] Updated weights for policy 0, policy_version 95140 (0.0006) [2023-03-07 18:18:46,129][232226] Updated weights for policy 0, policy_version 95150 (0.0006) [2023-03-07 18:18:46,911][232226] Updated weights for policy 0, policy_version 95160 (0.0007) [2023-03-07 18:18:47,713][232226] Updated weights for policy 0, policy_version 95170 (0.0006) [2023-03-07 18:18:48,514][232226] Updated weights for policy 0, policy_version 95180 (0.0006) [2023-03-07 18:18:49,293][232226] Updated weights for policy 0, policy_version 95190 (0.0006) [2023-03-07 18:18:50,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12850.3). Total num frames: 97483776. Throughput: 0: 12879.8. Samples: 97461731. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:18:50,069][231894] Avg episode reward: [(0, '197.087')] [2023-03-07 18:18:50,074][232226] Updated weights for policy 0, policy_version 95200 (0.0007) [2023-03-07 18:18:50,878][232226] Updated weights for policy 0, policy_version 95210 (0.0007) [2023-03-07 18:18:51,683][232226] Updated weights for policy 0, policy_version 95220 (0.0006) [2023-03-07 18:18:52,492][232226] Updated weights for policy 0, policy_version 95230 (0.0006) [2023-03-07 18:18:53,282][232226] Updated weights for policy 0, policy_version 95240 (0.0007) [2023-03-07 18:18:54,069][232226] Updated weights for policy 0, policy_version 95250 (0.0006) [2023-03-07 18:18:54,878][232226] Updated weights for policy 0, policy_version 95260 (0.0007) [2023-03-07 18:18:55,069][231894] Fps is (10 sec: 12902.2, 60 sec: 12868.2, 300 sec: 12850.3). Total num frames: 97548288. Throughput: 0: 12884.8. Samples: 97539065. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:18:55,069][231894] Avg episode reward: [(0, '193.711')] [2023-03-07 18:18:55,677][232226] Updated weights for policy 0, policy_version 95270 (0.0006) [2023-03-07 18:18:56,458][232226] Updated weights for policy 0, policy_version 95280 (0.0006) [2023-03-07 18:18:57,262][232226] Updated weights for policy 0, policy_version 95290 (0.0006) [2023-03-07 18:18:58,064][232226] Updated weights for policy 0, policy_version 95300 (0.0006) [2023-03-07 18:18:58,856][232226] Updated weights for policy 0, policy_version 95310 (0.0006) [2023-03-07 18:18:59,655][232226] Updated weights for policy 0, policy_version 95320 (0.0006) [2023-03-07 18:19:00,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.2, 300 sec: 12853.8). Total num frames: 97612800. Throughput: 0: 12882.4. Samples: 97577441. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:19:00,069][231894] Avg episode reward: [(0, '195.243')] [2023-03-07 18:19:00,446][232226] Updated weights for policy 0, policy_version 95330 (0.0007) [2023-03-07 18:19:01,231][232226] Updated weights for policy 0, policy_version 95340 (0.0006) [2023-03-07 18:19:02,046][232226] Updated weights for policy 0, policy_version 95350 (0.0006) [2023-03-07 18:19:02,835][232226] Updated weights for policy 0, policy_version 95360 (0.0006) [2023-03-07 18:19:03,617][232226] Updated weights for policy 0, policy_version 95370 (0.0007) [2023-03-07 18:19:04,416][232226] Updated weights for policy 0, policy_version 95380 (0.0006) [2023-03-07 18:19:05,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12853.8). Total num frames: 97677312. Throughput: 0: 12879.1. Samples: 97654818. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:19:05,069][231894] Avg episode reward: [(0, '194.696')] [2023-03-07 18:19:05,212][232226] Updated weights for policy 0, policy_version 95390 (0.0008) [2023-03-07 18:19:06,002][232226] Updated weights for policy 0, policy_version 95400 (0.0007) [2023-03-07 18:19:06,808][232226] Updated weights for policy 0, policy_version 95410 (0.0006) [2023-03-07 18:19:07,590][232226] Updated weights for policy 0, policy_version 95420 (0.0006) [2023-03-07 18:19:08,370][232226] Updated weights for policy 0, policy_version 95430 (0.0007) [2023-03-07 18:19:09,179][232226] Updated weights for policy 0, policy_version 95440 (0.0006) [2023-03-07 18:19:09,986][232226] Updated weights for policy 0, policy_version 95450 (0.0008) [2023-03-07 18:19:10,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12850.3). Total num frames: 97740800. Throughput: 0: 12881.5. Samples: 97732132. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:19:10,069][231894] Avg episode reward: [(0, '195.006')] [2023-03-07 18:19:10,776][232226] Updated weights for policy 0, policy_version 95460 (0.0006) [2023-03-07 18:19:11,593][232226] Updated weights for policy 0, policy_version 95470 (0.0007) [2023-03-07 18:19:12,381][232226] Updated weights for policy 0, policy_version 95480 (0.0006) [2023-03-07 18:19:13,174][232226] Updated weights for policy 0, policy_version 95490 (0.0006) [2023-03-07 18:19:13,979][232226] Updated weights for policy 0, policy_version 95500 (0.0007) [2023-03-07 18:19:14,765][232226] Updated weights for policy 0, policy_version 95510 (0.0007) [2023-03-07 18:19:15,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12868.3, 300 sec: 12853.8). Total num frames: 97805312. Throughput: 0: 12877.6. Samples: 97770588. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:19:15,069][231894] Avg episode reward: [(0, '194.771')] [2023-03-07 18:19:15,562][232226] Updated weights for policy 0, policy_version 95520 (0.0006) [2023-03-07 18:19:16,345][232226] Updated weights for policy 0, policy_version 95530 (0.0006) [2023-03-07 18:19:17,133][232226] Updated weights for policy 0, policy_version 95540 (0.0005) [2023-03-07 18:19:17,928][232226] Updated weights for policy 0, policy_version 95550 (0.0006) [2023-03-07 18:19:18,722][232226] Updated weights for policy 0, policy_version 95560 (0.0007) [2023-03-07 18:19:19,521][232226] Updated weights for policy 0, policy_version 95570 (0.0006) [2023-03-07 18:19:20,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12853.8). Total num frames: 97869824. Throughput: 0: 12883.0. Samples: 97848188. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:19:20,069][231894] Avg episode reward: [(0, '189.455')] [2023-03-07 18:19:20,341][232226] Updated weights for policy 0, policy_version 95580 (0.0006) [2023-03-07 18:19:21,114][232226] Updated weights for policy 0, policy_version 95590 (0.0006) [2023-03-07 18:19:21,907][232226] Updated weights for policy 0, policy_version 95600 (0.0006) [2023-03-07 18:19:22,706][232226] Updated weights for policy 0, policy_version 95610 (0.0006) [2023-03-07 18:19:23,496][232226] Updated weights for policy 0, policy_version 95620 (0.0006) [2023-03-07 18:19:24,301][232226] Updated weights for policy 0, policy_version 95630 (0.0006) [2023-03-07 18:19:25,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12853.8). Total num frames: 97934336. Throughput: 0: 12873.1. Samples: 97925204. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:19:25,069][231894] Avg episode reward: [(0, '190.547')] [2023-03-07 18:19:25,100][232226] Updated weights for policy 0, policy_version 95640 (0.0006) [2023-03-07 18:19:25,899][232226] Updated weights for policy 0, policy_version 95650 (0.0006) [2023-03-07 18:19:26,699][232226] Updated weights for policy 0, policy_version 95660 (0.0006) [2023-03-07 18:19:27,513][232226] Updated weights for policy 0, policy_version 95670 (0.0006) [2023-03-07 18:19:28,300][232226] Updated weights for policy 0, policy_version 95680 (0.0006) [2023-03-07 18:19:29,105][232226] Updated weights for policy 0, policy_version 95690 (0.0007) [2023-03-07 18:19:29,909][232226] Updated weights for policy 0, policy_version 95700 (0.0006) [2023-03-07 18:19:30,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12857.3). Total num frames: 97998848. Throughput: 0: 12868.2. Samples: 97963516. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:19:30,069][231894] Avg episode reward: [(0, '190.494')] [2023-03-07 18:19:30,693][232226] Updated weights for policy 0, policy_version 95710 (0.0006) [2023-03-07 18:19:31,497][232226] Updated weights for policy 0, policy_version 95720 (0.0006) [2023-03-07 18:19:32,304][232226] Updated weights for policy 0, policy_version 95730 (0.0007) [2023-03-07 18:19:33,110][232226] Updated weights for policy 0, policy_version 95740 (0.0006) [2023-03-07 18:19:33,898][232226] Updated weights for policy 0, policy_version 95750 (0.0006) [2023-03-07 18:19:34,694][232226] Updated weights for policy 0, policy_version 95760 (0.0006) [2023-03-07 18:19:35,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12868.2, 300 sec: 12853.8). Total num frames: 98062336. Throughput: 0: 12859.7. Samples: 98040417. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:19:35,069][231894] Avg episode reward: [(0, '195.166')] [2023-03-07 18:19:35,497][232226] Updated weights for policy 0, policy_version 95770 (0.0007) [2023-03-07 18:19:36,285][232226] Updated weights for policy 0, policy_version 95780 (0.0005) [2023-03-07 18:19:37,103][232226] Updated weights for policy 0, policy_version 95790 (0.0006) [2023-03-07 18:19:37,895][232226] Updated weights for policy 0, policy_version 95800 (0.0006) [2023-03-07 18:19:38,690][232226] Updated weights for policy 0, policy_version 95810 (0.0006) [2023-03-07 18:19:39,488][232226] Updated weights for policy 0, policy_version 95820 (0.0008) [2023-03-07 18:19:40,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12868.3, 300 sec: 12853.8). Total num frames: 98126848. Throughput: 0: 12851.7. Samples: 98117391. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:19:40,080][231894] Avg episode reward: [(0, '201.072')] [2023-03-07 18:19:40,302][232226] Updated weights for policy 0, policy_version 95830 (0.0007) [2023-03-07 18:19:41,094][232226] Updated weights for policy 0, policy_version 95840 (0.0006) [2023-03-07 18:19:41,880][232226] Updated weights for policy 0, policy_version 95850 (0.0007) [2023-03-07 18:19:42,686][232226] Updated weights for policy 0, policy_version 95860 (0.0007) [2023-03-07 18:19:43,486][232226] Updated weights for policy 0, policy_version 95870 (0.0006) [2023-03-07 18:19:44,274][232226] Updated weights for policy 0, policy_version 95880 (0.0005) [2023-03-07 18:19:45,066][232226] Updated weights for policy 0, policy_version 95890 (0.0007) [2023-03-07 18:19:45,069][231894] Fps is (10 sec: 12902.5, 60 sec: 12868.2, 300 sec: 12857.3). Total num frames: 98191360. Throughput: 0: 12853.3. Samples: 98155838. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:19:45,080][231894] Avg episode reward: [(0, '204.464')] [2023-03-07 18:19:45,866][232226] Updated weights for policy 0, policy_version 95900 (0.0006) [2023-03-07 18:19:46,669][232226] Updated weights for policy 0, policy_version 95910 (0.0006) [2023-03-07 18:19:47,461][232226] Updated weights for policy 0, policy_version 95920 (0.0007) [2023-03-07 18:19:48,255][232226] Updated weights for policy 0, policy_version 95930 (0.0006) [2023-03-07 18:19:49,039][232226] Updated weights for policy 0, policy_version 95940 (0.0006) [2023-03-07 18:19:49,850][232226] Updated weights for policy 0, policy_version 95950 (0.0007) [2023-03-07 18:19:50,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12851.2, 300 sec: 12853.8). Total num frames: 98254848. Throughput: 0: 12848.4. Samples: 98232999. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:19:50,080][231894] Avg episode reward: [(0, '191.747')] [2023-03-07 18:19:50,654][232226] Updated weights for policy 0, policy_version 95960 (0.0007) [2023-03-07 18:19:51,429][232226] Updated weights for policy 0, policy_version 95970 (0.0007) [2023-03-07 18:19:52,247][232226] Updated weights for policy 0, policy_version 95980 (0.0006) [2023-03-07 18:19:53,040][232226] Updated weights for policy 0, policy_version 95990 (0.0007) [2023-03-07 18:19:53,822][232226] Updated weights for policy 0, policy_version 96000 (0.0006) [2023-03-07 18:19:54,626][232226] Updated weights for policy 0, policy_version 96010 (0.0006) [2023-03-07 18:19:55,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12851.2, 300 sec: 12853.8). Total num frames: 98319360. Throughput: 0: 12844.6. Samples: 98310138. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:19:55,080][231894] Avg episode reward: [(0, '200.211')] [2023-03-07 18:19:55,429][232226] Updated weights for policy 0, policy_version 96020 (0.0006) [2023-03-07 18:19:56,215][232226] Updated weights for policy 0, policy_version 96030 (0.0005) [2023-03-07 18:19:57,015][232226] Updated weights for policy 0, policy_version 96040 (0.0007) [2023-03-07 18:19:57,800][232226] Updated weights for policy 0, policy_version 96050 (0.0007) [2023-03-07 18:19:58,620][232226] Updated weights for policy 0, policy_version 96060 (0.0006) [2023-03-07 18:19:59,409][232226] Updated weights for policy 0, policy_version 96070 (0.0007) [2023-03-07 18:20:00,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12853.8). Total num frames: 98383872. Throughput: 0: 12847.1. Samples: 98348709. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:20:00,070][231894] Avg episode reward: [(0, '205.024')] [2023-03-07 18:20:00,197][232226] Updated weights for policy 0, policy_version 96080 (0.0006) [2023-03-07 18:20:01,025][232226] Updated weights for policy 0, policy_version 96090 (0.0006) [2023-03-07 18:20:01,806][232226] Updated weights for policy 0, policy_version 96100 (0.0006) [2023-03-07 18:20:02,633][232226] Updated weights for policy 0, policy_version 96110 (0.0007) [2023-03-07 18:20:03,443][232226] Updated weights for policy 0, policy_version 96120 (0.0007) [2023-03-07 18:20:04,248][232226] Updated weights for policy 0, policy_version 96130 (0.0007) [2023-03-07 18:20:05,029][232226] Updated weights for policy 0, policy_version 96140 (0.0007) [2023-03-07 18:20:05,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12834.1, 300 sec: 12850.3). Total num frames: 98447360. Throughput: 0: 12825.1. Samples: 98425316. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:20:05,069][231894] Avg episode reward: [(0, '202.782')] [2023-03-07 18:20:05,823][232226] Updated weights for policy 0, policy_version 96150 (0.0007) [2023-03-07 18:20:06,639][232226] Updated weights for policy 0, policy_version 96160 (0.0006) [2023-03-07 18:20:07,427][232226] Updated weights for policy 0, policy_version 96170 (0.0007) [2023-03-07 18:20:08,229][232226] Updated weights for policy 0, policy_version 96180 (0.0006) [2023-03-07 18:20:09,018][232226] Updated weights for policy 0, policy_version 96190 (0.0006) [2023-03-07 18:20:09,806][232226] Updated weights for policy 0, policy_version 96200 (0.0007) [2023-03-07 18:20:10,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12853.8). Total num frames: 98511872. Throughput: 0: 12826.5. Samples: 98502396. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:20:10,069][231894] Avg episode reward: [(0, '197.800')] [2023-03-07 18:20:10,604][232226] Updated weights for policy 0, policy_version 96210 (0.0006) [2023-03-07 18:20:11,405][232226] Updated weights for policy 0, policy_version 96220 (0.0008) [2023-03-07 18:20:12,221][232226] Updated weights for policy 0, policy_version 96230 (0.0006) [2023-03-07 18:20:13,013][232226] Updated weights for policy 0, policy_version 96240 (0.0006) [2023-03-07 18:20:13,818][232226] Updated weights for policy 0, policy_version 96250 (0.0007) [2023-03-07 18:20:14,610][232226] Updated weights for policy 0, policy_version 96260 (0.0006) [2023-03-07 18:20:15,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12834.1, 300 sec: 12850.3). Total num frames: 98575360. Throughput: 0: 12829.8. Samples: 98540857. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:20:15,070][231894] Avg episode reward: [(0, '197.516')] [2023-03-07 18:20:15,408][232226] Updated weights for policy 0, policy_version 96270 (0.0007) [2023-03-07 18:20:16,196][232226] Updated weights for policy 0, policy_version 96280 (0.0005) [2023-03-07 18:20:17,012][232226] Updated weights for policy 0, policy_version 96290 (0.0005) [2023-03-07 18:20:17,804][232226] Updated weights for policy 0, policy_version 96300 (0.0006) [2023-03-07 18:20:18,594][232226] Updated weights for policy 0, policy_version 96310 (0.0006) [2023-03-07 18:20:19,385][232226] Updated weights for policy 0, policy_version 96320 (0.0006) [2023-03-07 18:20:20,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12834.1, 300 sec: 12850.3). Total num frames: 98639872. Throughput: 0: 12832.0. Samples: 98617856. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:20:20,069][231894] Avg episode reward: [(0, '199.077')] [2023-03-07 18:20:20,162][232226] Updated weights for policy 0, policy_version 96330 (0.0006) [2023-03-07 18:20:20,967][232226] Updated weights for policy 0, policy_version 96340 (0.0006) [2023-03-07 18:20:21,755][232226] Updated weights for policy 0, policy_version 96350 (0.0006) [2023-03-07 18:20:22,559][232226] Updated weights for policy 0, policy_version 96360 (0.0006) [2023-03-07 18:20:23,353][232226] Updated weights for policy 0, policy_version 96370 (0.0007) [2023-03-07 18:20:24,155][232226] Updated weights for policy 0, policy_version 96380 (0.0006) [2023-03-07 18:20:24,978][232226] Updated weights for policy 0, policy_version 96390 (0.0006) [2023-03-07 18:20:25,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12834.1, 300 sec: 12850.3). Total num frames: 98704384. Throughput: 0: 12831.7. Samples: 98694816. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:20:25,069][231894] Avg episode reward: [(0, '189.363')] [2023-03-07 18:20:25,074][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000096391_98704384.pth... [2023-03-07 18:20:25,107][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000093380_95621120.pth [2023-03-07 18:20:25,771][232226] Updated weights for policy 0, policy_version 96400 (0.0006) [2023-03-07 18:20:26,577][232226] Updated weights for policy 0, policy_version 96410 (0.0006) [2023-03-07 18:20:27,385][232226] Updated weights for policy 0, policy_version 96420 (0.0006) [2023-03-07 18:20:28,171][232226] Updated weights for policy 0, policy_version 96430 (0.0006) [2023-03-07 18:20:28,978][232226] Updated weights for policy 0, policy_version 96440 (0.0006) [2023-03-07 18:20:29,773][232226] Updated weights for policy 0, policy_version 96450 (0.0006) [2023-03-07 18:20:30,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12817.0, 300 sec: 12846.9). Total num frames: 98767872. Throughput: 0: 12827.6. Samples: 98733080. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:20:30,069][231894] Avg episode reward: [(0, '200.987')] [2023-03-07 18:20:30,557][232226] Updated weights for policy 0, policy_version 96460 (0.0005) [2023-03-07 18:20:31,368][232226] Updated weights for policy 0, policy_version 96470 (0.0007) [2023-03-07 18:20:32,152][232226] Updated weights for policy 0, policy_version 96480 (0.0007) [2023-03-07 18:20:32,940][232226] Updated weights for policy 0, policy_version 96490 (0.0006) [2023-03-07 18:20:33,750][232226] Updated weights for policy 0, policy_version 96500 (0.0006) [2023-03-07 18:20:34,566][232226] Updated weights for policy 0, policy_version 96510 (0.0007) [2023-03-07 18:20:35,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12846.9). Total num frames: 98832384. Throughput: 0: 12830.2. Samples: 98810359. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:20:35,069][231894] Avg episode reward: [(0, '190.998')] [2023-03-07 18:20:35,362][232226] Updated weights for policy 0, policy_version 96520 (0.0006) [2023-03-07 18:20:36,151][232226] Updated weights for policy 0, policy_version 96530 (0.0006) [2023-03-07 18:20:36,975][232226] Updated weights for policy 0, policy_version 96540 (0.0007) [2023-03-07 18:20:37,749][232226] Updated weights for policy 0, policy_version 96550 (0.0008) [2023-03-07 18:20:38,568][232226] Updated weights for policy 0, policy_version 96560 (0.0007) [2023-03-07 18:20:39,376][232226] Updated weights for policy 0, policy_version 96570 (0.0006) [2023-03-07 18:20:40,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12843.4). Total num frames: 98895872. Throughput: 0: 12818.0. Samples: 98886950. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:20:40,069][231894] Avg episode reward: [(0, '191.911')] [2023-03-07 18:20:40,165][232226] Updated weights for policy 0, policy_version 96580 (0.0006) [2023-03-07 18:20:40,963][232226] Updated weights for policy 0, policy_version 96590 (0.0007) [2023-03-07 18:20:41,741][232226] Updated weights for policy 0, policy_version 96600 (0.0006) [2023-03-07 18:20:42,534][232226] Updated weights for policy 0, policy_version 96610 (0.0006) [2023-03-07 18:20:43,338][232226] Updated weights for policy 0, policy_version 96620 (0.0007) [2023-03-07 18:20:44,115][232226] Updated weights for policy 0, policy_version 96630 (0.0007) [2023-03-07 18:20:44,924][232226] Updated weights for policy 0, policy_version 96640 (0.0007) [2023-03-07 18:20:45,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12843.4). Total num frames: 98960384. Throughput: 0: 12819.5. Samples: 98925589. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:20:45,070][231894] Avg episode reward: [(0, '194.255')] [2023-03-07 18:20:45,723][232226] Updated weights for policy 0, policy_version 96650 (0.0006) [2023-03-07 18:20:46,513][232226] Updated weights for policy 0, policy_version 96660 (0.0006) [2023-03-07 18:20:47,326][232226] Updated weights for policy 0, policy_version 96670 (0.0006) [2023-03-07 18:20:48,113][232226] Updated weights for policy 0, policy_version 96680 (0.0006) [2023-03-07 18:20:48,893][232226] Updated weights for policy 0, policy_version 96690 (0.0006) [2023-03-07 18:20:49,700][232226] Updated weights for policy 0, policy_version 96700 (0.0006) [2023-03-07 18:20:50,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12834.1, 300 sec: 12846.9). Total num frames: 99024896. Throughput: 0: 12836.6. Samples: 99002964. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 18:20:50,069][231894] Avg episode reward: [(0, '194.129')] [2023-03-07 18:20:50,483][232226] Updated weights for policy 0, policy_version 96710 (0.0006) [2023-03-07 18:20:51,286][232226] Updated weights for policy 0, policy_version 96720 (0.0007) [2023-03-07 18:20:52,085][232226] Updated weights for policy 0, policy_version 96730 (0.0006) [2023-03-07 18:20:52,881][232226] Updated weights for policy 0, policy_version 96740 (0.0006) [2023-03-07 18:20:53,677][232226] Updated weights for policy 0, policy_version 96750 (0.0006) [2023-03-07 18:20:54,477][232226] Updated weights for policy 0, policy_version 96760 (0.0006) [2023-03-07 18:20:55,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12834.1, 300 sec: 12846.9). Total num frames: 99089408. Throughput: 0: 12838.7. Samples: 99080138. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:20:55,069][231894] Avg episode reward: [(0, '201.621')] [2023-03-07 18:20:55,261][232226] Updated weights for policy 0, policy_version 96770 (0.0006) [2023-03-07 18:20:56,069][232226] Updated weights for policy 0, policy_version 96780 (0.0006) [2023-03-07 18:20:56,868][232226] Updated weights for policy 0, policy_version 96790 (0.0006) [2023-03-07 18:20:57,669][232226] Updated weights for policy 0, policy_version 96800 (0.0007) [2023-03-07 18:20:58,459][232226] Updated weights for policy 0, policy_version 96810 (0.0006) [2023-03-07 18:20:59,238][232226] Updated weights for policy 0, policy_version 96820 (0.0006) [2023-03-07 18:21:00,061][232226] Updated weights for policy 0, policy_version 96830 (0.0006) [2023-03-07 18:21:00,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12834.2, 300 sec: 12850.3). Total num frames: 99153920. Throughput: 0: 12839.5. Samples: 99118631. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:21:00,069][231894] Avg episode reward: [(0, '197.837')] [2023-03-07 18:21:00,839][232226] Updated weights for policy 0, policy_version 96840 (0.0006) [2023-03-07 18:21:01,642][232226] Updated weights for policy 0, policy_version 96850 (0.0007) [2023-03-07 18:21:02,442][232226] Updated weights for policy 0, policy_version 96860 (0.0006) [2023-03-07 18:21:03,246][232226] Updated weights for policy 0, policy_version 96870 (0.0006) [2023-03-07 18:21:04,039][232226] Updated weights for policy 0, policy_version 96880 (0.0007) [2023-03-07 18:21:04,831][232226] Updated weights for policy 0, policy_version 96890 (0.0007) [2023-03-07 18:21:05,069][231894] Fps is (10 sec: 12902.3, 60 sec: 12851.2, 300 sec: 12850.3). Total num frames: 99218432. Throughput: 0: 12842.0. Samples: 99195745. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:21:05,070][231894] Avg episode reward: [(0, '182.934')] [2023-03-07 18:21:05,598][232226] Updated weights for policy 0, policy_version 96900 (0.0006) [2023-03-07 18:21:06,410][232226] Updated weights for policy 0, policy_version 96910 (0.0006) [2023-03-07 18:21:07,214][232226] Updated weights for policy 0, policy_version 96920 (0.0007) [2023-03-07 18:21:07,990][232226] Updated weights for policy 0, policy_version 96930 (0.0007) [2023-03-07 18:21:08,794][232226] Updated weights for policy 0, policy_version 96940 (0.0006) [2023-03-07 18:21:09,589][232226] Updated weights for policy 0, policy_version 96950 (0.0006) [2023-03-07 18:21:10,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12834.1, 300 sec: 12846.9). Total num frames: 99281920. Throughput: 0: 12849.9. Samples: 99273063. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:21:10,069][231894] Avg episode reward: [(0, '200.101')] [2023-03-07 18:21:10,392][232226] Updated weights for policy 0, policy_version 96960 (0.0006) [2023-03-07 18:21:11,182][232226] Updated weights for policy 0, policy_version 96970 (0.0006) [2023-03-07 18:21:11,955][232226] Updated weights for policy 0, policy_version 96980 (0.0006) [2023-03-07 18:21:12,764][232226] Updated weights for policy 0, policy_version 96990 (0.0006) [2023-03-07 18:21:13,569][232226] Updated weights for policy 0, policy_version 97000 (0.0006) [2023-03-07 18:21:14,354][232226] Updated weights for policy 0, policy_version 97010 (0.0007) [2023-03-07 18:21:15,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12850.3). Total num frames: 99347456. Throughput: 0: 12862.5. Samples: 99311891. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:21:15,069][231894] Avg episode reward: [(0, '202.075')] [2023-03-07 18:21:15,146][232226] Updated weights for policy 0, policy_version 97020 (0.0006) [2023-03-07 18:21:15,953][232226] Updated weights for policy 0, policy_version 97030 (0.0007) [2023-03-07 18:21:16,729][232226] Updated weights for policy 0, policy_version 97040 (0.0006) [2023-03-07 18:21:17,517][232226] Updated weights for policy 0, policy_version 97050 (0.0006) [2023-03-07 18:21:18,316][232226] Updated weights for policy 0, policy_version 97060 (0.0006) [2023-03-07 18:21:19,098][232226] Updated weights for policy 0, policy_version 97070 (0.0007) [2023-03-07 18:21:19,896][232226] Updated weights for policy 0, policy_version 97080 (0.0006) [2023-03-07 18:21:20,069][231894] Fps is (10 sec: 13004.9, 60 sec: 12868.3, 300 sec: 12850.3). Total num frames: 99411968. Throughput: 0: 12864.0. Samples: 99389240. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:21:20,069][231894] Avg episode reward: [(0, '195.318')] [2023-03-07 18:21:20,677][232226] Updated weights for policy 0, policy_version 97090 (0.0006) [2023-03-07 18:21:21,473][232226] Updated weights for policy 0, policy_version 97100 (0.0007) [2023-03-07 18:21:22,293][232226] Updated weights for policy 0, policy_version 97110 (0.0005) [2023-03-07 18:21:23,066][232226] Updated weights for policy 0, policy_version 97120 (0.0006) [2023-03-07 18:21:23,865][232226] Updated weights for policy 0, policy_version 97130 (0.0006) [2023-03-07 18:21:24,681][232226] Updated weights for policy 0, policy_version 97140 (0.0006) [2023-03-07 18:21:25,069][231894] Fps is (10 sec: 12902.6, 60 sec: 12868.3, 300 sec: 12850.3). Total num frames: 99476480. Throughput: 0: 12885.3. Samples: 99466786. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:21:25,080][231894] Avg episode reward: [(0, '190.740')] [2023-03-07 18:21:25,468][232226] Updated weights for policy 0, policy_version 97150 (0.0006) [2023-03-07 18:21:26,252][232226] Updated weights for policy 0, policy_version 97160 (0.0006) [2023-03-07 18:21:27,075][232226] Updated weights for policy 0, policy_version 97170 (0.0007) [2023-03-07 18:21:27,873][232226] Updated weights for policy 0, policy_version 97180 (0.0006) [2023-03-07 18:21:28,656][232226] Updated weights for policy 0, policy_version 97190 (0.0006) [2023-03-07 18:21:29,442][232226] Updated weights for policy 0, policy_version 97200 (0.0007) [2023-03-07 18:21:30,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12850.3). Total num frames: 99539968. Throughput: 0: 12882.7. Samples: 99505307. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:21:30,080][231894] Avg episode reward: [(0, '196.497')] [2023-03-07 18:21:30,252][232226] Updated weights for policy 0, policy_version 97210 (0.0007) [2023-03-07 18:21:31,063][232226] Updated weights for policy 0, policy_version 97220 (0.0007) [2023-03-07 18:21:31,862][232226] Updated weights for policy 0, policy_version 97230 (0.0006) [2023-03-07 18:21:32,641][232226] Updated weights for policy 0, policy_version 97240 (0.0006) [2023-03-07 18:21:33,457][232226] Updated weights for policy 0, policy_version 97250 (0.0006) [2023-03-07 18:21:34,248][232226] Updated weights for policy 0, policy_version 97260 (0.0007) [2023-03-07 18:21:35,053][232226] Updated weights for policy 0, policy_version 97270 (0.0006) [2023-03-07 18:21:35,069][231894] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12853.8). Total num frames: 99604480. Throughput: 0: 12870.9. Samples: 99582153. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:21:35,080][231894] Avg episode reward: [(0, '196.482')] [2023-03-07 18:21:35,832][232226] Updated weights for policy 0, policy_version 97280 (0.0007) [2023-03-07 18:21:36,625][232226] Updated weights for policy 0, policy_version 97290 (0.0007) [2023-03-07 18:21:37,409][232226] Updated weights for policy 0, policy_version 97300 (0.0006) [2023-03-07 18:21:38,201][232226] Updated weights for policy 0, policy_version 97310 (0.0005) [2023-03-07 18:21:38,998][232226] Updated weights for policy 0, policy_version 97320 (0.0006) [2023-03-07 18:21:39,799][232226] Updated weights for policy 0, policy_version 97330 (0.0007) [2023-03-07 18:21:40,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12853.8). Total num frames: 99668992. Throughput: 0: 12877.0. Samples: 99659603. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:21:40,080][231894] Avg episode reward: [(0, '194.216')] [2023-03-07 18:21:40,581][232226] Updated weights for policy 0, policy_version 97340 (0.0007) [2023-03-07 18:21:41,389][232226] Updated weights for policy 0, policy_version 97350 (0.0007) [2023-03-07 18:21:42,178][232226] Updated weights for policy 0, policy_version 97360 (0.0006) [2023-03-07 18:21:42,975][232226] Updated weights for policy 0, policy_version 97370 (0.0007) [2023-03-07 18:21:43,789][232226] Updated weights for policy 0, policy_version 97380 (0.0006) [2023-03-07 18:21:44,589][232226] Updated weights for policy 0, policy_version 97390 (0.0007) [2023-03-07 18:21:45,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12885.4, 300 sec: 12853.8). Total num frames: 99733504. Throughput: 0: 12881.0. Samples: 99698276. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:21:45,080][231894] Avg episode reward: [(0, '190.373')] [2023-03-07 18:21:45,383][232226] Updated weights for policy 0, policy_version 97400 (0.0006) [2023-03-07 18:21:46,194][232226] Updated weights for policy 0, policy_version 97410 (0.0006) [2023-03-07 18:21:47,004][232226] Updated weights for policy 0, policy_version 97420 (0.0007) [2023-03-07 18:21:47,779][232226] Updated weights for policy 0, policy_version 97430 (0.0006) [2023-03-07 18:21:48,572][232226] Updated weights for policy 0, policy_version 97440 (0.0006) [2023-03-07 18:21:49,370][232226] Updated weights for policy 0, policy_version 97450 (0.0006) [2023-03-07 18:21:50,069][231894] Fps is (10 sec: 12800.1, 60 sec: 12868.3, 300 sec: 12853.8). Total num frames: 99796992. Throughput: 0: 12874.2. Samples: 99775082. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:21:50,080][231894] Avg episode reward: [(0, '201.908')] [2023-03-07 18:21:50,155][232226] Updated weights for policy 0, policy_version 97460 (0.0006) [2023-03-07 18:21:50,982][232226] Updated weights for policy 0, policy_version 97470 (0.0006) [2023-03-07 18:21:51,757][232226] Updated weights for policy 0, policy_version 97480 (0.0006) [2023-03-07 18:21:52,566][232226] Updated weights for policy 0, policy_version 97490 (0.0007) [2023-03-07 18:21:53,348][232226] Updated weights for policy 0, policy_version 97500 (0.0006) [2023-03-07 18:21:54,132][232226] Updated weights for policy 0, policy_version 97510 (0.0006) [2023-03-07 18:21:54,924][232226] Updated weights for policy 0, policy_version 97520 (0.0007) [2023-03-07 18:21:55,069][231894] Fps is (10 sec: 12799.9, 60 sec: 12868.3, 300 sec: 12853.8). Total num frames: 99861504. Throughput: 0: 12875.3. Samples: 99852453. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:21:55,080][231894] Avg episode reward: [(0, '195.393')] [2023-03-07 18:21:55,709][232226] Updated weights for policy 0, policy_version 97530 (0.0006) [2023-03-07 18:21:56,506][232226] Updated weights for policy 0, policy_version 97540 (0.0007) [2023-03-07 18:21:57,315][232226] Updated weights for policy 0, policy_version 97550 (0.0006) [2023-03-07 18:21:58,082][232226] Updated weights for policy 0, policy_version 97560 (0.0006) [2023-03-07 18:21:58,869][232226] Updated weights for policy 0, policy_version 97570 (0.0006) [2023-03-07 18:21:59,678][232226] Updated weights for policy 0, policy_version 97580 (0.0007) [2023-03-07 18:22:00,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12857.3). Total num frames: 99926016. Throughput: 0: 12874.8. Samples: 99891254. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:22:00,069][231894] Avg episode reward: [(0, '194.746')] [2023-03-07 18:22:00,466][232226] Updated weights for policy 0, policy_version 97590 (0.0006) [2023-03-07 18:22:01,293][232226] Updated weights for policy 0, policy_version 97600 (0.0006) [2023-03-07 18:22:02,080][232226] Updated weights for policy 0, policy_version 97610 (0.0007) [2023-03-07 18:22:02,892][232226] Updated weights for policy 0, policy_version 97620 (0.0007) [2023-03-07 18:22:03,682][232226] Updated weights for policy 0, policy_version 97630 (0.0006) [2023-03-07 18:22:04,495][232226] Updated weights for policy 0, policy_version 97640 (0.0006) [2023-03-07 18:22:05,069][231894] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12857.3). Total num frames: 99990528. Throughput: 0: 12866.9. Samples: 99968250. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 18:22:05,069][231894] Avg episode reward: [(0, '196.091')] [2023-03-07 18:22:05,289][232226] Updated weights for policy 0, policy_version 97650 (0.0007) [2023-03-07 18:22:05,930][232355] Stopping RolloutWorker_w5... [2023-03-07 18:22:05,930][232496] Stopping RolloutWorker_w23... [2023-03-07 18:22:05,930][232431] Stopping RolloutWorker_w11... [2023-03-07 18:22:05,930][232356] Stopping RolloutWorker_w17... [2023-03-07 18:22:05,930][232229] Stopping RolloutWorker_w4... [2023-03-07 18:22:05,930][232463] Stopping RolloutWorker_w21... [2023-03-07 18:22:05,930][232430] Stopping RolloutWorker_w9... [2023-03-07 18:22:05,930][232427] Stopping RolloutWorker_w20... [2023-03-07 18:22:05,930][232357] Stopping RolloutWorker_w16... [2023-03-07 18:22:05,930][232565] Stopping RolloutWorker_w30... [2023-03-07 18:22:05,930][232428] Stopping RolloutWorker_w8... [2023-03-07 18:22:05,930][232692] Stopping RolloutWorker_w29... [2023-03-07 18:22:05,931][232496] Loop rollout_proc23_evt_loop terminating... [2023-03-07 18:22:05,931][232390] Stopping RolloutWorker_w13... [2023-03-07 18:22:05,931][232355] Loop rollout_proc5_evt_loop terminating... [2023-03-07 18:22:05,931][232431] Loop rollout_proc11_evt_loop terminating... [2023-03-07 18:22:05,931][232354] Stopping RolloutWorker_w6... [2023-03-07 18:22:05,931][232225] Stopping RolloutWorker_w1... [2023-03-07 18:22:05,931][232356] Loop rollout_proc17_evt_loop terminating... [2023-03-07 18:22:05,931][232426] Stopping RolloutWorker_w18... [2023-03-07 18:22:05,931][232429] Stopping RolloutWorker_w15... [2023-03-07 18:22:05,931][232430] Loop rollout_proc9_evt_loop terminating... [2023-03-07 18:22:05,931][232389] Stopping RolloutWorker_w7... [2023-03-07 18:22:05,931][232229] Loop rollout_proc4_evt_loop terminating... [2023-03-07 18:22:05,931][232173] Stopping Batcher_0... [2023-03-07 18:22:05,931][232498] Stopping RolloutWorker_w24... [2023-03-07 18:22:05,931][232224] Stopping RolloutWorker_w0... [2023-03-07 18:22:05,931][232463] Loop rollout_proc21_evt_loop terminating... [2023-03-07 18:22:05,931][232392] Stopping RolloutWorker_w14... [2023-03-07 18:22:05,931][232357] Loop rollout_proc16_evt_loop terminating... [2023-03-07 18:22:05,931][232755] Stopping RolloutWorker_w31... [2023-03-07 18:22:05,931][232228] Stopping RolloutWorker_w3... [2023-03-07 18:22:05,931][232427] Loop rollout_proc20_evt_loop terminating... [2023-03-07 18:22:05,931][232428] Loop rollout_proc8_evt_loop terminating... [2023-03-07 18:22:05,931][232426] Loop rollout_proc18_evt_loop terminating... [2023-03-07 18:22:05,931][232692] Loop rollout_proc29_evt_loop terminating... [2023-03-07 18:22:05,931][232227] Stopping RolloutWorker_w2... [2023-03-07 18:22:05,931][232390] Loop rollout_proc13_evt_loop terminating... [2023-03-07 18:22:05,931][232354] Loop rollout_proc6_evt_loop terminating... [2023-03-07 18:22:05,931][232391] Stopping RolloutWorker_w19... [2023-03-07 18:22:05,931][232225] Loop rollout_proc1_evt_loop terminating... [2023-03-07 18:22:05,931][232565] Loop rollout_proc30_evt_loop terminating... [2023-03-07 18:22:05,931][232392] Loop rollout_proc14_evt_loop terminating... [2023-03-07 18:22:05,931][232429] Loop rollout_proc15_evt_loop terminating... [2023-03-07 18:22:05,931][232598] Stopping RolloutWorker_w28... [2023-03-07 18:22:05,931][232389] Loop rollout_proc7_evt_loop terminating... [2023-03-07 18:22:05,931][232498] Loop rollout_proc24_evt_loop terminating... [2023-03-07 18:22:05,931][232224] Loop rollout_proc0_evt_loop terminating... [2023-03-07 18:22:05,931][232228] Loop rollout_proc3_evt_loop terminating... [2023-03-07 18:22:05,931][232411] Stopping RolloutWorker_w10... [2023-03-07 18:22:05,931][232755] Loop rollout_proc31_evt_loop terminating... [2023-03-07 18:22:05,931][232227] Loop rollout_proc2_evt_loop terminating... [2023-03-07 18:22:05,931][232391] Loop rollout_proc19_evt_loop terminating... [2023-03-07 18:22:05,931][232598] Loop rollout_proc28_evt_loop terminating... [2023-03-07 18:22:05,931][232173] Loop batcher_evt_loop terminating... [2023-03-07 18:22:05,931][232411] Loop rollout_proc10_evt_loop terminating... [2023-03-07 18:22:05,931][232501] Stopping RolloutWorker_w26... [2023-03-07 18:22:05,931][231894] Component RolloutWorker_w5 stopped! [2023-03-07 18:22:05,932][232501] Loop rollout_proc26_evt_loop terminating... [2023-03-07 18:22:05,931][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000097658_100001792.pth... [2023-03-07 18:22:05,931][232425] Stopping RolloutWorker_w12... [2023-03-07 18:22:05,932][232425] Loop rollout_proc12_evt_loop terminating... [2023-03-07 18:22:05,932][232566] Stopping RolloutWorker_w27... [2023-03-07 18:22:05,932][231894] Component RolloutWorker_w11 stopped! [2023-03-07 18:22:05,932][232566] Loop rollout_proc27_evt_loop terminating... [2023-03-07 18:22:05,932][231894] Component RolloutWorker_w23 stopped! [2023-03-07 18:22:05,933][231894] Component RolloutWorker_w17 stopped! [2023-03-07 18:22:05,933][231894] Component RolloutWorker_w4 stopped! [2023-03-07 18:22:05,933][231894] Component RolloutWorker_w21 stopped! [2023-03-07 18:22:05,934][231894] Component RolloutWorker_w9 stopped! [2023-03-07 18:22:05,934][231894] Component RolloutWorker_w20 stopped! [2023-03-07 18:22:05,934][231894] Component RolloutWorker_w30 stopped! [2023-03-07 18:22:05,935][231894] Component RolloutWorker_w16 stopped! [2023-03-07 18:22:05,934][232500] Stopping RolloutWorker_w25... [2023-03-07 18:22:05,935][231894] Component RolloutWorker_w8 stopped! [2023-03-07 18:22:05,935][231894] Component RolloutWorker_w29 stopped! [2023-03-07 18:22:05,936][231894] Component RolloutWorker_w6 stopped! [2023-03-07 18:22:05,936][231894] Component RolloutWorker_w13 stopped! [2023-03-07 18:22:05,937][231894] Component Batcher_0 stopped! [2023-03-07 18:22:05,937][231894] Component RolloutWorker_w1 stopped! [2023-03-07 18:22:05,937][231894] Component RolloutWorker_w15 stopped! [2023-03-07 18:22:05,938][231894] Component RolloutWorker_w7 stopped! [2023-03-07 18:22:05,938][231894] Component RolloutWorker_w18 stopped! [2023-03-07 18:22:05,939][231894] Component RolloutWorker_w24 stopped! [2023-03-07 18:22:05,939][231894] Component RolloutWorker_w0 stopped! [2023-03-07 18:22:05,939][231894] Component RolloutWorker_w14 stopped! [2023-03-07 18:22:05,940][231894] Component RolloutWorker_w31 stopped! [2023-03-07 18:22:05,940][231894] Component RolloutWorker_w3 stopped! [2023-03-07 18:22:05,941][231894] Component RolloutWorker_w2 stopped! [2023-03-07 18:22:05,941][231894] Component RolloutWorker_w19 stopped! [2023-03-07 18:22:05,941][231894] Component RolloutWorker_w28 stopped! [2023-03-07 18:22:05,942][231894] Component RolloutWorker_w10 stopped! [2023-03-07 18:22:05,943][232495] Stopping RolloutWorker_w22... [2023-03-07 18:22:05,944][232495] Loop rollout_proc22_evt_loop terminating... [2023-03-07 18:22:05,942][231894] Component RolloutWorker_w26 stopped! [2023-03-07 18:22:05,948][231894] Component RolloutWorker_w12 stopped! [2023-03-07 18:22:05,949][231894] Component RolloutWorker_w27 stopped! [2023-03-07 18:22:05,949][231894] Component RolloutWorker_w25 stopped! [2023-03-07 18:22:05,949][231894] Component RolloutWorker_w22 stopped! [2023-03-07 18:22:05,959][232500] Loop rollout_proc25_evt_loop terminating... [2023-03-07 18:22:06,002][232226] Weights refcount: 2 0 [2023-03-07 18:22:06,010][232226] Stopping InferenceWorker_p0-w0... [2023-03-07 18:22:06,011][232226] Loop inference_proc0-0_evt_loop terminating... [2023-03-07 18:22:06,011][231894] Component InferenceWorker_p0-w0 stopped! [2023-03-07 18:22:06,043][232173] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000094885_97162240.pth [2023-03-07 18:22:06,051][232173] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/disassemble-v2/checkpoint_p0/checkpoint_000097658_100001792.pth... [2023-03-07 18:22:06,138][232173] Stopping LearnerWorker_p0... [2023-03-07 18:22:06,138][232173] Loop learner_proc0_evt_loop terminating... [2023-03-07 18:22:06,138][231894] Component LearnerWorker_p0 stopped! [2023-03-07 18:22:06,139][231894] Waiting for process learner_proc0 to stop... [2023-03-07 18:22:07,332][231894] Waiting for process inference_proc0-0 to join... [2023-03-07 18:22:07,333][231894] Waiting for process rollout_proc0 to join... [2023-03-07 18:22:07,333][231894] Waiting for process rollout_proc1 to join... [2023-03-07 18:22:07,334][231894] Waiting for process rollout_proc2 to join... [2023-03-07 18:22:07,334][231894] Waiting for process rollout_proc3 to join... [2023-03-07 18:22:07,334][231894] Waiting for process rollout_proc4 to join... [2023-03-07 18:22:07,335][231894] Waiting for process rollout_proc5 to join... [2023-03-07 18:22:07,335][231894] Waiting for process rollout_proc6 to join... [2023-03-07 18:22:07,335][231894] Waiting for process rollout_proc7 to join... [2023-03-07 18:22:07,336][231894] Waiting for process rollout_proc8 to join... [2023-03-07 18:22:07,336][231894] Waiting for process rollout_proc9 to join... [2023-03-07 18:22:07,337][231894] Waiting for process rollout_proc10 to join... [2023-03-07 18:22:07,337][231894] Waiting for process rollout_proc11 to join... [2023-03-07 18:22:07,337][231894] Waiting for process rollout_proc12 to join... [2023-03-07 18:22:07,338][231894] Waiting for process rollout_proc13 to join... [2023-03-07 18:22:07,338][231894] Waiting for process rollout_proc14 to join... [2023-03-07 18:22:07,338][231894] Waiting for process rollout_proc15 to join... [2023-03-07 18:22:07,339][231894] Waiting for process rollout_proc16 to join... [2023-03-07 18:22:07,339][231894] Waiting for process rollout_proc17 to join... [2023-03-07 18:22:07,340][231894] Waiting for process rollout_proc18 to join... [2023-03-07 18:22:07,340][231894] Waiting for process rollout_proc19 to join... [2023-03-07 18:22:07,340][231894] Waiting for process rollout_proc20 to join... [2023-03-07 18:22:07,341][231894] Waiting for process rollout_proc21 to join... [2023-03-07 18:22:07,341][231894] Waiting for process rollout_proc22 to join... [2023-03-07 18:22:07,341][231894] Waiting for process rollout_proc23 to join... [2023-03-07 18:22:07,342][231894] Waiting for process rollout_proc24 to join... [2023-03-07 18:22:07,342][231894] Waiting for process rollout_proc25 to join... [2023-03-07 18:22:07,342][231894] Waiting for process rollout_proc26 to join... [2023-03-07 18:22:07,343][231894] Waiting for process rollout_proc27 to join... [2023-03-07 18:22:07,343][231894] Waiting for process rollout_proc28 to join... [2023-03-07 18:22:07,344][231894] Waiting for process rollout_proc29 to join... [2023-03-07 18:22:07,344][231894] Waiting for process rollout_proc30 to join... [2023-03-07 18:22:07,344][231894] Waiting for process rollout_proc31 to join... [2023-03-07 18:22:07,345][231894] Batcher 0 profile tree view: batching: 830.6021, releasing_batches: 1.6006 [2023-03-07 18:22:07,345][231894] InferenceWorker_p0-w0 profile tree view: wait_policy: 0.0001 wait_policy_total: 234.8079 update_model: 136.9400 weight_update: 0.0006 one_step: 0.0069 handle_policy_step: 7021.1943 deserialize: 208.9150, stack: 35.4075, obs_to_device_normalize: 1226.3975, forward: 3147.8750, send_messages: 1394.4966 prepare_outputs: 733.1722 to_cpu: 371.4035 [2023-03-07 18:22:07,345][231894] Learner 0 profile tree view: misc: 0.4471, prepare_batch: 398.5236 train: 894.7256 epoch_init: 0.3716, minibatch_init: 0.3885, losses_postprocess: 30.8738, kl_divergence: 35.3813, after_optimizer: 99.8185 calculate_losses: 295.1244 losses_init: 0.2129, forward_head: 16.2446, bptt_initial: 107.2656, tail: 59.9261, advantages_returns: 7.3966, losses: 28.1099 bptt: 67.2992 bptt_forward_core: 64.9171 update: 410.4445 clip: 55.2160 [2023-03-07 18:22:07,346][231894] RolloutWorker_w0 profile tree view: wait_for_trajectories: 3.6764, enqueue_policy_requests: 176.1179, env_step: 3105.3834, overhead: 163.4948, complete_rollouts: 9.1177 save_policy_outputs: 208.8446 split_output_tensors: 102.2220 [2023-03-07 18:22:07,346][231894] RolloutWorker_w31 profile tree view: wait_for_trajectories: 3.7440, enqueue_policy_requests: 179.2705, env_step: 3191.0645, overhead: 164.8364, complete_rollouts: 9.4454 save_policy_outputs: 205.0848 split_output_tensors: 100.8530 [2023-03-07 18:22:07,346][231894] Loop Runner_EvtLoop terminating... [2023-03-07 18:22:07,347][231894] Runner profile tree view: main_loop: 7778.8317 [2023-03-07 18:22:07,347][231894] Collected {0: 100001792}, FPS: 12855.6