[2023-03-07 09:43:51,334][175405] Saving configuration to /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/config.json... [2023-03-07 09:43:51,348][175405] Rollout worker 0 uses device cpu [2023-03-07 09:43:51,348][175405] Rollout worker 1 uses device cpu [2023-03-07 09:43:51,349][175405] Rollout worker 2 uses device cpu [2023-03-07 09:43:51,349][175405] Rollout worker 3 uses device cpu [2023-03-07 09:43:51,349][175405] Rollout worker 4 uses device cpu [2023-03-07 09:43:51,349][175405] Rollout worker 5 uses device cpu [2023-03-07 09:43:51,349][175405] Rollout worker 6 uses device cpu [2023-03-07 09:43:51,349][175405] Rollout worker 7 uses device cpu [2023-03-07 09:43:51,349][175405] Rollout worker 8 uses device cpu [2023-03-07 09:43:51,349][175405] Rollout worker 9 uses device cpu [2023-03-07 09:43:51,350][175405] Rollout worker 10 uses device cpu [2023-03-07 09:43:51,350][175405] Rollout worker 11 uses device cpu [2023-03-07 09:43:51,350][175405] Rollout worker 12 uses device cpu [2023-03-07 09:43:51,350][175405] Rollout worker 13 uses device cpu [2023-03-07 09:43:51,350][175405] Rollout worker 14 uses device cpu [2023-03-07 09:43:51,350][175405] Rollout worker 15 uses device cpu [2023-03-07 09:43:51,350][175405] Rollout worker 16 uses device cpu [2023-03-07 09:43:51,350][175405] Rollout worker 17 uses device cpu [2023-03-07 09:43:51,350][175405] Rollout worker 18 uses device cpu [2023-03-07 09:43:51,351][175405] Rollout worker 19 uses device cpu [2023-03-07 09:43:51,351][175405] Rollout worker 20 uses device cpu [2023-03-07 09:43:51,351][175405] Rollout worker 21 uses device cpu [2023-03-07 09:43:51,351][175405] Rollout worker 22 uses device cpu [2023-03-07 09:43:51,351][175405] Rollout worker 23 uses device cpu [2023-03-07 09:43:51,351][175405] Rollout worker 24 uses device cpu [2023-03-07 09:43:51,351][175405] Rollout worker 25 uses device cpu [2023-03-07 09:43:51,351][175405] Rollout worker 26 uses device cpu [2023-03-07 09:43:51,351][175405] 
Rollout worker 27 uses device cpu [2023-03-07 09:43:51,352][175405] Rollout worker 28 uses device cpu [2023-03-07 09:43:51,352][175405] Rollout worker 29 uses device cpu [2023-03-07 09:43:51,352][175405] Rollout worker 30 uses device cpu [2023-03-07 09:43:51,352][175405] Rollout worker 31 uses device cpu [2023-03-07 09:43:51,365][175405] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-03-07 09:43:51,365][175405] InferenceWorker_p0-w0: min num requests: 10 [2023-03-07 09:43:51,455][175405] Starting all processes... [2023-03-07 09:43:51,456][175405] Starting process learner_proc0 [2023-03-07 09:43:51,505][175405] Starting all processes... [2023-03-07 09:43:51,509][175405] Starting process inference_proc0-0 [2023-03-07 09:43:51,519][175405] Starting process rollout_proc0 [2023-03-07 09:43:51,519][175405] Starting process rollout_proc1 [2023-03-07 09:43:51,529][175405] Starting process rollout_proc16 [2023-03-07 09:43:51,519][175405] Starting process rollout_proc3 [2023-03-07 09:43:51,519][175405] Starting process rollout_proc4 [2023-03-07 09:43:51,519][175405] Starting process rollout_proc5 [2023-03-07 09:43:51,519][175405] Starting process rollout_proc6 [2023-03-07 09:43:51,519][175405] Starting process rollout_proc7 [2023-03-07 09:43:51,519][175405] Starting process rollout_proc8 [2023-03-07 09:43:51,519][175405] Starting process rollout_proc9 [2023-03-07 09:43:51,520][175405] Starting process rollout_proc10 [2023-03-07 09:43:51,520][175405] Starting process rollout_proc11 [2023-03-07 09:43:51,520][175405] Starting process rollout_proc12 [2023-03-07 09:43:51,520][175405] Starting process rollout_proc13 [2023-03-07 09:43:51,520][175405] Starting process rollout_proc14 [2023-03-07 09:43:51,520][175405] Starting process rollout_proc15 [2023-03-07 09:43:51,519][175405] Starting process rollout_proc2 [2023-03-07 09:43:51,555][175405] Starting process rollout_proc17 [2023-03-07 09:43:51,557][175405] Starting process rollout_proc18 [2023-03-07 
09:43:51,649][175405] Starting process rollout_proc19 [2023-03-07 09:43:51,657][175405] Starting process rollout_proc20 [2023-03-07 09:43:51,684][175405] Starting process rollout_proc21 [2023-03-07 09:43:51,685][175405] Starting process rollout_proc22 [2023-03-07 09:43:51,685][175405] Starting process rollout_proc23 [2023-03-07 09:43:51,685][175405] Starting process rollout_proc24 [2023-03-07 09:43:51,695][175405] Starting process rollout_proc25 [2023-03-07 09:43:51,695][175405] Starting process rollout_proc26 [2023-03-07 09:43:51,698][175405] Starting process rollout_proc27 [2023-03-07 09:43:51,698][175405] Starting process rollout_proc28 [2023-03-07 09:43:51,698][175405] Starting process rollout_proc29 [2023-03-07 09:43:51,716][175405] Starting process rollout_proc30 [2023-03-07 09:43:51,725][175405] Starting process rollout_proc31 [2023-03-07 09:43:53,414][175680] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-03-07 09:43:53,414][175680] Set environment var CUDA_VISIBLE_DEVICES to '0' (GPU indices [0]) for learning process 0 [2023-03-07 09:43:53,427][175680] Num visible devices: 1 [2023-03-07 09:43:53,477][175680] WARNING! It is generally recommended to enable Fixed KL loss (https://arxiv.org/pdf/1707.06347.pdf) for continuous action tasks to avoid potential numerical issues. I.e. 
set --kl_loss_coeff=0.1
[2023-03-07 09:43:53,478][175680] Starting seed is not provided
[2023-03-07 09:43:53,478][175680] Using GPUs [0] for process 0 (actually maps to GPUs [0])
[2023-03-07 09:43:53,478][175680] Initializing actor-critic model on device cuda:0
[2023-03-07 09:43:53,478][175680] RunningMeanStd input shape: (39,)
[2023-03-07 09:43:53,479][175680] RunningMeanStd input shape: (1,)
[2023-03-07 09:43:53,535][175732] Worker 1 uses CPU cores [1]
[2023-03-07 09:43:53,583][175865] Worker 2 uses CPU cores [2]
[2023-03-07 09:43:53,600][175680] Created Actor Critic model with architecture:
[2023-03-07 09:43:53,600][175680] ActorCriticSharedWeights(
  (obs_normalizer): ObservationNormalizer(
    (running_mean_std): RunningMeanStdDictInPlace(
      (running_mean_std): ModuleDict(
        (obs): RunningMeanStdInPlace()
      )
    )
  )
  (returns_normalizer): RecursiveScriptModule(original_name=RunningMeanStdInPlace)
  (encoder): MultiInputEncoder(
    (encoders): ModuleDict(
      (obs): MlpEncoder(
        (mlp_head): RecursiveScriptModule(
          original_name=Sequential
          (0): RecursiveScriptModule(original_name=Linear)
          (1): RecursiveScriptModule(original_name=ELU)
          (2): RecursiveScriptModule(original_name=Linear)
          (3): RecursiveScriptModule(original_name=ELU)
        )
      )
    )
  )
  (core): ModelCoreRNN(
    (core): GRU(512, 512)
  )
  (decoder): MlpDecoder(
    (mlp): Identity()
  )
  (critic_linear): Linear(in_features=512, out_features=1, bias=True)
  (action_parameterization): ActionParameterizationDefault(
    (distribution_linear): Linear(in_features=512, out_features=8, bias=True)
  )
)
[2023-03-07 09:43:53,718][176126] Worker 22 uses CPU cores [22]
[2023-03-07 09:43:53,762][175864] Worker 10 uses CPU cores [10]
[2023-03-07 09:43:53,888][175932] Worker 7 uses CPU cores [7]
[2023-03-07 09:43:54,067][175861] Worker 6 uses CPU cores [6]
[2023-03-07 09:43:54,111][175873] Worker 14 uses CPU cores [14]
[2023-03-07 09:43:54,357][176161] Worker 26 uses CPU cores [26]
[2023-03-07 09:43:54,398][175863] Worker 16 uses CPU cores [16]
[2023-03-07
09:43:54,463][176358] Worker 29 uses CPU cores [29] [2023-03-07 09:43:54,552][176356] Worker 30 uses CPU cores [30] [2023-03-07 09:43:54,559][175731] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-03-07 09:43:54,559][175731] Set environment var CUDA_VISIBLE_DEVICES to '0' (GPU indices [0]) for inference process 0 [2023-03-07 09:43:54,569][175731] Num visible devices: 1 [2023-03-07 09:43:54,757][175871] Worker 12 uses CPU cores [12] [2023-03-07 09:43:54,917][175868] Worker 18 uses CPU cores [18] [2023-03-07 09:43:55,007][175866] Worker 11 uses CPU cores [11] [2023-03-07 09:43:55,075][175870] Worker 13 uses CPU cores [13] [2023-03-07 09:43:55,215][175859] Worker 5 uses CPU cores [5] [2023-03-07 09:43:55,258][175680] Using optimizer [2023-03-07 09:43:55,258][175680] No checkpoints found [2023-03-07 09:43:55,259][175680] Did not load from checkpoint, starting from scratch! [2023-03-07 09:43:55,259][175680] Initialized policy 0 weights for model version 0 [2023-03-07 09:43:55,271][175680] LearnerWorker_p0 finished initialization! [2023-03-07 09:43:55,272][175680] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-03-07 09:43:55,311][175860] Worker 0 uses CPU cores [0] [2023-03-07 09:43:55,343][175731] RunningMeanStd input shape: (39,) [2023-03-07 09:43:55,343][175731] RunningMeanStd input shape: (1,) [2023-03-07 09:43:55,399][176036] Worker 19 uses CPU cores [19] [2023-03-07 09:43:55,445][175869] Worker 9 uses CPU cores [9] [2023-03-07 09:43:55,635][175862] Worker 15 uses CPU cores [15] [2023-03-07 09:43:55,660][176355] Worker 28 uses CPU cores [28] [2023-03-07 09:43:55,899][176158] Worker 23 uses CPU cores [23] [2023-03-07 09:43:55,931][176125] Worker 21 uses CPU cores [21] [2023-03-07 09:43:56,068][175867] Worker 17 uses CPU cores [17] [2023-03-07 09:43:56,095][175872] Worker 8 uses CPU cores [8] [2023-03-07 09:43:56,140][175405] Inference worker 0-0 is ready! [2023-03-07 09:43:56,140][175405] All inference workers are ready! 
Signal rollout workers to start! [2023-03-07 09:43:56,199][176218] Worker 24 uses CPU cores [24] [2023-03-07 09:43:56,279][175734] Worker 4 uses CPU cores [4] [2023-03-07 09:43:56,530][175733] Worker 3 uses CPU cores [3] [2023-03-07 09:43:56,742][176294] Worker 27 uses CPU cores [27] [2023-03-07 09:43:56,886][176319] Worker 31 uses CPU cores [31] [2023-03-07 09:43:56,904][176110] Worker 20 uses CPU cores [20] [2023-03-07 09:43:57,093][176321] Worker 25 uses CPU cores [25] [2023-03-07 09:43:57,588][175869] Decorrelating experience for 0 frames... [2023-03-07 09:43:57,624][175866] Decorrelating experience for 0 frames... [2023-03-07 09:43:57,717][176036] Decorrelating experience for 0 frames... [2023-03-07 09:43:57,769][175732] Decorrelating experience for 0 frames... [2023-03-07 09:43:57,792][175932] Decorrelating experience for 0 frames... [2023-03-07 09:43:57,867][175872] Decorrelating experience for 0 frames... [2023-03-07 09:43:57,914][175868] Decorrelating experience for 0 frames... [2023-03-07 09:43:57,918][175864] Decorrelating experience for 0 frames... [2023-03-07 09:43:57,934][176356] Decorrelating experience for 0 frames... [2023-03-07 09:43:57,963][176125] Decorrelating experience for 0 frames... [2023-03-07 09:43:57,966][175859] Decorrelating experience for 0 frames... [2023-03-07 09:43:57,974][176355] Decorrelating experience for 0 frames... [2023-03-07 09:43:57,978][175862] Decorrelating experience for 0 frames... [2023-03-07 09:43:57,979][175861] Decorrelating experience for 0 frames... [2023-03-07 09:43:57,979][176126] Decorrelating experience for 0 frames... [2023-03-07 09:43:57,980][175871] Decorrelating experience for 0 frames... [2023-03-07 09:43:57,988][176161] Decorrelating experience for 0 frames... [2023-03-07 09:43:57,990][175865] Decorrelating experience for 0 frames... [2023-03-07 09:43:57,996][175860] Decorrelating experience for 0 frames... [2023-03-07 09:43:57,997][175863] Decorrelating experience for 0 frames... 
[2023-03-07 09:43:58,034][175870] Decorrelating experience for 0 frames... [2023-03-07 09:43:58,051][175873] Decorrelating experience for 0 frames... [2023-03-07 09:43:58,114][176158] Decorrelating experience for 0 frames... [2023-03-07 09:43:58,151][176358] Decorrelating experience for 0 frames... [2023-03-07 09:43:58,189][175867] Decorrelating experience for 0 frames... [2023-03-07 09:43:58,315][175734] Decorrelating experience for 0 frames... [2023-03-07 09:43:58,321][175405] Fps is (10 sec: nan, 60 sec: nan, 300 sec: nan). Total num frames: 0. Throughput: 0: nan. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0) [2023-03-07 09:43:58,354][176218] Decorrelating experience for 0 frames... [2023-03-07 09:43:58,805][175733] Decorrelating experience for 0 frames... [2023-03-07 09:43:58,870][176294] Decorrelating experience for 0 frames... [2023-03-07 09:43:59,184][176319] Decorrelating experience for 0 frames... [2023-03-07 09:43:59,227][176110] Decorrelating experience for 0 frames... [2023-03-07 09:43:59,247][176321] Decorrelating experience for 0 frames... [2023-03-07 09:43:59,381][175869] Decorrelating experience for 32 frames... [2023-03-07 09:43:59,478][175866] Decorrelating experience for 32 frames... [2023-03-07 09:43:59,533][176036] Decorrelating experience for 32 frames... [2023-03-07 09:43:59,550][175732] Decorrelating experience for 32 frames... [2023-03-07 09:43:59,551][175932] Decorrelating experience for 32 frames... [2023-03-07 09:43:59,629][175872] Decorrelating experience for 32 frames... [2023-03-07 09:43:59,680][176356] Decorrelating experience for 32 frames... [2023-03-07 09:43:59,722][175864] Decorrelating experience for 32 frames... [2023-03-07 09:43:59,748][175868] Decorrelating experience for 32 frames... [2023-03-07 09:43:59,772][175862] Decorrelating experience for 32 frames... [2023-03-07 09:43:59,783][176161] Decorrelating experience for 32 frames... [2023-03-07 09:43:59,786][175870] Decorrelating experience for 32 frames... 
[2023-03-07 09:43:59,788][176158] Decorrelating experience for 32 frames... [2023-03-07 09:43:59,794][175865] Decorrelating experience for 32 frames... [2023-03-07 09:43:59,803][176125] Decorrelating experience for 32 frames... [2023-03-07 09:43:59,813][175867] Decorrelating experience for 32 frames... [2023-03-07 09:43:59,813][175859] Decorrelating experience for 32 frames... [2023-03-07 09:43:59,816][176355] Decorrelating experience for 32 frames... [2023-03-07 09:43:59,817][175861] Decorrelating experience for 32 frames... [2023-03-07 09:43:59,817][176126] Decorrelating experience for 32 frames... [2023-03-07 09:43:59,818][175873] Decorrelating experience for 32 frames... [2023-03-07 09:43:59,822][175871] Decorrelating experience for 32 frames... [2023-03-07 09:43:59,832][175860] Decorrelating experience for 32 frames... [2023-03-07 09:43:59,834][175863] Decorrelating experience for 32 frames... [2023-03-07 09:43:59,864][176358] Decorrelating experience for 32 frames... [2023-03-07 09:43:59,951][176218] Decorrelating experience for 32 frames... [2023-03-07 09:44:00,064][175734] Decorrelating experience for 32 frames... [2023-03-07 09:44:00,219][175680] Signal inference workers to stop experience collection... [2023-03-07 09:44:00,222][175731] InferenceWorker_p0-w0: stopping experience collection [2023-03-07 09:44:00,252][175733] Decorrelating experience for 32 frames... [2023-03-07 09:44:00,271][176294] Decorrelating experience for 32 frames... [2023-03-07 09:44:00,514][175680] Signal inference workers to resume experience collection... [2023-03-07 09:44:00,515][175731] InferenceWorker_p0-w0: resuming experience collection [2023-03-07 09:44:00,526][176319] Decorrelating experience for 32 frames... [2023-03-07 09:44:00,566][176321] Decorrelating experience for 32 frames... [2023-03-07 09:44:00,639][176110] Decorrelating experience for 32 frames... 
[2023-03-07 09:44:01,733][175731] Updated weights for policy 0, policy_version 10 (0.0217) [2023-03-07 09:44:02,517][175731] Updated weights for policy 0, policy_version 20 (0.0007) [2023-03-07 09:44:03,321][175405] Fps is (10 sec: 5939.5, 60 sec: 5939.5, 300 sec: 5939.5). Total num frames: 29696. Throughput: 0: 3429.6. Samples: 17147. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:44:03,322][175405] Avg episode reward: [(0, '19.745')] [2023-03-07 09:44:03,338][175731] Updated weights for policy 0, policy_version 30 (0.0006) [2023-03-07 09:44:04,130][175731] Updated weights for policy 0, policy_version 40 (0.0007) [2023-03-07 09:44:04,916][175731] Updated weights for policy 0, policy_version 50 (0.0008) [2023-03-07 09:44:05,715][175731] Updated weights for policy 0, policy_version 60 (0.0006) [2023-03-07 09:44:06,525][175731] Updated weights for policy 0, policy_version 70 (0.0006) [2023-03-07 09:44:07,317][175731] Updated weights for policy 0, policy_version 80 (0.0008) [2023-03-07 09:44:08,136][175731] Updated weights for policy 0, policy_version 90 (0.0007) [2023-03-07 09:44:08,321][175405] Fps is (10 sec: 9421.0, 60 sec: 9421.0, 300 sec: 9421.0). Total num frames: 94208. Throughput: 0: 9362.1. Samples: 93619. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:44:08,322][175405] Avg episode reward: [(0, '20.100')] [2023-03-07 09:44:08,929][175731] Updated weights for policy 0, policy_version 100 (0.0007) [2023-03-07 09:44:09,706][175731] Updated weights for policy 0, policy_version 110 (0.0007) [2023-03-07 09:44:10,506][175731] Updated weights for policy 0, policy_version 120 (0.0006) [2023-03-07 09:44:11,312][175731] Updated weights for policy 0, policy_version 130 (0.0006) [2023-03-07 09:44:11,360][175405] Heartbeat connected on Batcher_0 [2023-03-07 09:44:11,363][175405] Heartbeat connected on LearnerWorker_p0 [2023-03-07 09:44:11,368][175405] Heartbeat connected on RolloutWorker_w0 [2023-03-07 09:44:11,368][175405] Heartbeat connected on InferenceWorker_p0-w0 [2023-03-07 09:44:11,369][175405] Heartbeat connected on RolloutWorker_w1 [2023-03-07 09:44:11,372][175405] Heartbeat connected on RolloutWorker_w2 [2023-03-07 09:44:11,374][175405] Heartbeat connected on RolloutWorker_w3 [2023-03-07 09:44:11,375][175405] Heartbeat connected on RolloutWorker_w4 [2023-03-07 09:44:11,377][175405] Heartbeat connected on RolloutWorker_w5 [2023-03-07 09:44:11,378][175405] Heartbeat connected on RolloutWorker_w6 [2023-03-07 09:44:11,380][175405] Heartbeat connected on RolloutWorker_w7 [2023-03-07 09:44:11,382][175405] Heartbeat connected on RolloutWorker_w8 [2023-03-07 09:44:11,385][175405] Heartbeat connected on RolloutWorker_w9 [2023-03-07 09:44:11,386][175405] Heartbeat connected on RolloutWorker_w10 [2023-03-07 09:44:11,418][175405] Heartbeat connected on RolloutWorker_w11 [2023-03-07 09:44:11,421][175405] Heartbeat connected on RolloutWorker_w12 [2023-03-07 09:44:11,421][175405] Heartbeat connected on RolloutWorker_w13 [2023-03-07 09:44:11,423][175405] Heartbeat connected on RolloutWorker_w14 [2023-03-07 09:44:11,425][175405] Heartbeat connected on RolloutWorker_w15 [2023-03-07 09:44:11,427][175405] Heartbeat connected on RolloutWorker_w16 [2023-03-07 
09:44:11,429][175405] Heartbeat connected on RolloutWorker_w17 [2023-03-07 09:44:11,430][175405] Heartbeat connected on RolloutWorker_w18 [2023-03-07 09:44:11,432][175405] Heartbeat connected on RolloutWorker_w19 [2023-03-07 09:44:11,435][175405] Heartbeat connected on RolloutWorker_w20 [2023-03-07 09:44:11,436][175405] Heartbeat connected on RolloutWorker_w21 [2023-03-07 09:44:11,438][175405] Heartbeat connected on RolloutWorker_w22 [2023-03-07 09:44:11,440][175405] Heartbeat connected on RolloutWorker_w23 [2023-03-07 09:44:11,442][175405] Heartbeat connected on RolloutWorker_w24 [2023-03-07 09:44:11,443][175405] Heartbeat connected on RolloutWorker_w25 [2023-03-07 09:44:11,445][175405] Heartbeat connected on RolloutWorker_w26 [2023-03-07 09:44:11,447][175405] Heartbeat connected on RolloutWorker_w27 [2023-03-07 09:44:11,449][175405] Heartbeat connected on RolloutWorker_w28 [2023-03-07 09:44:11,450][175405] Heartbeat connected on RolloutWorker_w29 [2023-03-07 09:44:11,453][175405] Heartbeat connected on RolloutWorker_w30 [2023-03-07 09:44:11,454][175405] Heartbeat connected on RolloutWorker_w31 [2023-03-07 09:44:12,101][175731] Updated weights for policy 0, policy_version 140 (0.0006) [2023-03-07 09:44:12,898][175731] Updated weights for policy 0, policy_version 150 (0.0006) [2023-03-07 09:44:13,321][175405] Fps is (10 sec: 12902.5, 60 sec: 10581.5, 300 sec: 10581.5). Total num frames: 158720. Throughput: 0: 8806.5. Samples: 132095. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 09:44:13,321][175405] Avg episode reward: [(0, '19.318')] [2023-03-07 09:44:13,322][175680] Saving new best policy, reward=19.318! 
[2023-03-07 09:44:13,726][175731] Updated weights for policy 0, policy_version 160 (0.0007) [2023-03-07 09:44:14,533][175731] Updated weights for policy 0, policy_version 170 (0.0007) [2023-03-07 09:44:15,309][175731] Updated weights for policy 0, policy_version 180 (0.0007) [2023-03-07 09:44:16,110][175731] Updated weights for policy 0, policy_version 190 (0.0006) [2023-03-07 09:44:16,912][175731] Updated weights for policy 0, policy_version 200 (0.0006) [2023-03-07 09:44:17,697][175731] Updated weights for policy 0, policy_version 210 (0.0007) [2023-03-07 09:44:18,321][175405] Fps is (10 sec: 12799.9, 60 sec: 11110.5, 300 sec: 11110.5). Total num frames: 222208. Throughput: 0: 10460.2. Samples: 209202. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 09:44:18,322][175405] Avg episode reward: [(0, '16.879')] [2023-03-07 09:44:18,477][175731] Updated weights for policy 0, policy_version 220 (0.0006) [2023-03-07 09:44:19,277][175731] Updated weights for policy 0, policy_version 230 (0.0007) [2023-03-07 09:44:20,073][175731] Updated weights for policy 0, policy_version 240 (0.0006) [2023-03-07 09:44:20,894][175731] Updated weights for policy 0, policy_version 250 (0.0007) [2023-03-07 09:44:21,676][175731] Updated weights for policy 0, policy_version 260 (0.0007) [2023-03-07 09:44:22,464][175731] Updated weights for policy 0, policy_version 270 (0.0006) [2023-03-07 09:44:23,278][175731] Updated weights for policy 0, policy_version 280 (0.0006) [2023-03-07 09:44:23,321][175405] Fps is (10 sec: 12799.9, 60 sec: 11468.9, 300 sec: 11468.9). Total num frames: 286720. Throughput: 0: 11457.3. Samples: 286429. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:44:23,322][175405] Avg episode reward: [(0, '16.328')] [2023-03-07 09:44:24,086][175731] Updated weights for policy 0, policy_version 290 (0.0006) [2023-03-07 09:44:24,867][175731] Updated weights for policy 0, policy_version 300 (0.0006) [2023-03-07 09:44:25,670][175731] Updated weights for policy 0, policy_version 310 (0.0007) [2023-03-07 09:44:26,476][175731] Updated weights for policy 0, policy_version 320 (0.0006) [2023-03-07 09:44:27,266][175731] Updated weights for policy 0, policy_version 330 (0.0006) [2023-03-07 09:44:28,073][175731] Updated weights for policy 0, policy_version 340 (0.0007) [2023-03-07 09:44:28,321][175405] Fps is (10 sec: 12902.6, 60 sec: 11707.9, 300 sec: 11707.9). Total num frames: 351232. Throughput: 0: 10821.1. Samples: 324630. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:44:28,321][175405] Avg episode reward: [(0, '11.020')] [2023-03-07 09:44:28,871][175731] Updated weights for policy 0, policy_version 350 (0.0006) [2023-03-07 09:44:29,672][175731] Updated weights for policy 0, policy_version 360 (0.0007) [2023-03-07 09:44:30,470][175731] Updated weights for policy 0, policy_version 370 (0.0007) [2023-03-07 09:44:31,281][175731] Updated weights for policy 0, policy_version 380 (0.0006) [2023-03-07 09:44:32,066][175731] Updated weights for policy 0, policy_version 390 (0.0006) [2023-03-07 09:44:32,866][175731] Updated weights for policy 0, policy_version 400 (0.0006) [2023-03-07 09:44:33,321][175405] Fps is (10 sec: 12800.1, 60 sec: 11849.3, 300 sec: 11849.3). Total num frames: 414720. Throughput: 0: 11473.1. Samples: 401555. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 09:44:33,321][175405] Avg episode reward: [(0, '30.828')] [2023-03-07 09:44:33,322][175680] Saving new best policy, reward=30.828! 
[2023-03-07 09:44:33,692][175731] Updated weights for policy 0, policy_version 410 (0.0007) [2023-03-07 09:44:34,488][175731] Updated weights for policy 0, policy_version 420 (0.0006) [2023-03-07 09:44:35,301][175731] Updated weights for policy 0, policy_version 430 (0.0006) [2023-03-07 09:44:36,113][175731] Updated weights for policy 0, policy_version 440 (0.0006) [2023-03-07 09:44:36,923][175731] Updated weights for policy 0, policy_version 450 (0.0007) [2023-03-07 09:44:37,711][175731] Updated weights for policy 0, policy_version 460 (0.0006) [2023-03-07 09:44:38,321][175405] Fps is (10 sec: 12697.4, 60 sec: 11955.3, 300 sec: 11955.3). Total num frames: 478208. Throughput: 0: 11941.1. Samples: 477641. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:44:38,322][175405] Avg episode reward: [(0, '33.680')] [2023-03-07 09:44:38,324][175680] Saving new best policy, reward=33.680! [2023-03-07 09:44:38,515][175731] Updated weights for policy 0, policy_version 470 (0.0006) [2023-03-07 09:44:39,315][175731] Updated weights for policy 0, policy_version 480 (0.0006) [2023-03-07 09:44:40,098][175731] Updated weights for policy 0, policy_version 490 (0.0007) [2023-03-07 09:44:40,939][175731] Updated weights for policy 0, policy_version 500 (0.0007) [2023-03-07 09:44:41,747][175731] Updated weights for policy 0, policy_version 510 (0.0006) [2023-03-07 09:44:42,547][175731] Updated weights for policy 0, policy_version 520 (0.0007) [2023-03-07 09:44:43,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12037.8, 300 sec: 12037.8). Total num frames: 541696. Throughput: 0: 11460.6. Samples: 515721. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:44:43,321][175405] Avg episode reward: [(0, '29.879')] [2023-03-07 09:44:43,353][175731] Updated weights for policy 0, policy_version 530 (0.0007) [2023-03-07 09:44:44,170][175731] Updated weights for policy 0, policy_version 540 (0.0006) [2023-03-07 09:44:44,962][175731] Updated weights for policy 0, policy_version 550 (0.0006) [2023-03-07 09:44:45,759][175731] Updated weights for policy 0, policy_version 560 (0.0006) [2023-03-07 09:44:46,565][175731] Updated weights for policy 0, policy_version 570 (0.0006) [2023-03-07 09:44:47,382][175731] Updated weights for policy 0, policy_version 580 (0.0007) [2023-03-07 09:44:48,188][175731] Updated weights for policy 0, policy_version 590 (0.0006) [2023-03-07 09:44:48,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12103.7, 300 sec: 12103.7). Total num frames: 605184. Throughput: 0: 12779.3. Samples: 592217. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:44:48,322][175405] Avg episode reward: [(0, '36.675')] [2023-03-07 09:44:48,325][175680] Saving new best policy, reward=36.675! [2023-03-07 09:44:48,998][175731] Updated weights for policy 0, policy_version 600 (0.0007) [2023-03-07 09:44:49,809][175731] Updated weights for policy 0, policy_version 610 (0.0006) [2023-03-07 09:44:50,602][175731] Updated weights for policy 0, policy_version 620 (0.0007) [2023-03-07 09:44:51,407][175731] Updated weights for policy 0, policy_version 630 (0.0006) [2023-03-07 09:44:52,209][175731] Updated weights for policy 0, policy_version 640 (0.0007) [2023-03-07 09:44:53,003][175731] Updated weights for policy 0, policy_version 650 (0.0007) [2023-03-07 09:44:53,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12176.4, 300 sec: 12176.4). Total num frames: 669696. Throughput: 0: 12773.3. Samples: 668416. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:44:53,321][175405] Avg episode reward: [(0, '33.672')] [2023-03-07 09:44:53,806][175731] Updated weights for policy 0, policy_version 660 (0.0006) [2023-03-07 09:44:54,617][175731] Updated weights for policy 0, policy_version 670 (0.0007) [2023-03-07 09:44:55,411][175731] Updated weights for policy 0, policy_version 680 (0.0006) [2023-03-07 09:44:56,228][175731] Updated weights for policy 0, policy_version 690 (0.0006) [2023-03-07 09:44:57,041][175731] Updated weights for policy 0, policy_version 700 (0.0006) [2023-03-07 09:44:57,841][175731] Updated weights for policy 0, policy_version 710 (0.0007) [2023-03-07 09:44:58,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12202.7, 300 sec: 12202.7). Total num frames: 732160. Throughput: 0: 12770.8. Samples: 706784. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:44:58,322][175405] Avg episode reward: [(0, '40.664')] [2023-03-07 09:44:58,325][175680] Saving new best policy, reward=40.664! [2023-03-07 09:44:58,645][175731] Updated weights for policy 0, policy_version 720 (0.0006) [2023-03-07 09:44:59,447][175731] Updated weights for policy 0, policy_version 730 (0.0007) [2023-03-07 09:45:00,249][175731] Updated weights for policy 0, policy_version 740 (0.0007) [2023-03-07 09:45:01,074][175731] Updated weights for policy 0, policy_version 750 (0.0006) [2023-03-07 09:45:01,853][175731] Updated weights for policy 0, policy_version 760 (0.0007) [2023-03-07 09:45:02,670][175731] Updated weights for policy 0, policy_version 770 (0.0006) [2023-03-07 09:45:03,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12782.9, 300 sec: 12256.5). Total num frames: 796672. Throughput: 0: 12755.1. Samples: 783180. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:45:03,322][175405] Avg episode reward: [(0, '41.624')] [2023-03-07 09:45:03,323][175680] Saving new best policy, reward=41.624! 
[2023-03-07 09:45:03,479][175731] Updated weights for policy 0, policy_version 780 (0.0006) [2023-03-07 09:45:04,270][175731] Updated weights for policy 0, policy_version 790 (0.0007) [2023-03-07 09:45:05,096][175731] Updated weights for policy 0, policy_version 800 (0.0006) [2023-03-07 09:45:05,879][175731] Updated weights for policy 0, policy_version 810 (0.0006) [2023-03-07 09:45:06,703][175731] Updated weights for policy 0, policy_version 820 (0.0006) [2023-03-07 09:45:07,499][175731] Updated weights for policy 0, policy_version 830 (0.0007) [2023-03-07 09:45:08,319][175731] Updated weights for policy 0, policy_version 840 (0.0006) [2023-03-07 09:45:08,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12765.9, 300 sec: 12288.1). Total num frames: 860160. Throughput: 0: 12731.2. Samples: 859333. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:45:08,321][175405] Avg episode reward: [(0, '55.373')] [2023-03-07 09:45:08,325][175680] Saving new best policy, reward=55.373! [2023-03-07 09:45:09,149][175731] Updated weights for policy 0, policy_version 850 (0.0006) [2023-03-07 09:45:09,943][175731] Updated weights for policy 0, policy_version 860 (0.0007) [2023-03-07 09:45:10,755][175731] Updated weights for policy 0, policy_version 870 (0.0006) [2023-03-07 09:45:11,571][175731] Updated weights for policy 0, policy_version 880 (0.0006) [2023-03-07 09:45:12,366][175731] Updated weights for policy 0, policy_version 890 (0.0006) [2023-03-07 09:45:13,176][175731] Updated weights for policy 0, policy_version 900 (0.0007) [2023-03-07 09:45:13,321][175405] Fps is (10 sec: 12595.2, 60 sec: 12731.7, 300 sec: 12301.7). Total num frames: 922624. Throughput: 0: 12722.2. Samples: 897130. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:45:13,322][175405] Avg episode reward: [(0, '56.935')] [2023-03-07 09:45:13,330][175680] Saving new best policy, reward=56.935! 
[2023-03-07 09:45:13,989][175731] Updated weights for policy 0, policy_version 910 (0.0007) [2023-03-07 09:45:14,794][175731] Updated weights for policy 0, policy_version 920 (0.0007) [2023-03-07 09:45:15,598][175731] Updated weights for policy 0, policy_version 930 (0.0006) [2023-03-07 09:45:16,406][175731] Updated weights for policy 0, policy_version 940 (0.0007) [2023-03-07 09:45:17,200][175731] Updated weights for policy 0, policy_version 950 (0.0006) [2023-03-07 09:45:17,996][175731] Updated weights for policy 0, policy_version 960 (0.0006) [2023-03-07 09:45:18,321][175405] Fps is (10 sec: 12595.2, 60 sec: 12731.8, 300 sec: 12326.5). Total num frames: 986112. Throughput: 0: 12706.0. Samples: 973326. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:45:18,321][175405] Avg episode reward: [(0, '47.718')] [2023-03-07 09:45:18,822][175731] Updated weights for policy 0, policy_version 970 (0.0006) [2023-03-07 09:45:19,625][175731] Updated weights for policy 0, policy_version 980 (0.0006) [2023-03-07 09:45:20,426][175731] Updated weights for policy 0, policy_version 990 (0.0006) [2023-03-07 09:45:21,230][175731] Updated weights for policy 0, policy_version 1000 (0.0006) [2023-03-07 09:45:22,034][175731] Updated weights for policy 0, policy_version 1010 (0.0006) [2023-03-07 09:45:22,840][175731] Updated weights for policy 0, policy_version 1020 (0.0007) [2023-03-07 09:45:23,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12731.8, 300 sec: 12360.3). Total num frames: 1050624. Throughput: 0: 12713.3. Samples: 1049738. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:45:23,321][175405] Avg episode reward: [(0, '45.145')] [2023-03-07 09:45:23,633][175731] Updated weights for policy 0, policy_version 1030 (0.0007) [2023-03-07 09:45:24,437][175731] Updated weights for policy 0, policy_version 1040 (0.0006) [2023-03-07 09:45:25,229][175731] Updated weights for policy 0, policy_version 1050 (0.0007) [2023-03-07 09:45:26,035][175731] Updated weights for policy 0, policy_version 1060 (0.0007) [2023-03-07 09:45:26,830][175731] Updated weights for policy 0, policy_version 1070 (0.0007) [2023-03-07 09:45:27,641][175731] Updated weights for policy 0, policy_version 1080 (0.0006) [2023-03-07 09:45:28,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12714.7, 300 sec: 12379.1). Total num frames: 1114112. Throughput: 0: 12721.0. Samples: 1088165. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 09:45:28,321][175405] Avg episode reward: [(0, '40.212')] [2023-03-07 09:45:28,441][175731] Updated weights for policy 0, policy_version 1090 (0.0006) [2023-03-07 09:45:29,237][175731] Updated weights for policy 0, policy_version 1100 (0.0006) [2023-03-07 09:45:30,065][175731] Updated weights for policy 0, policy_version 1110 (0.0007) [2023-03-07 09:45:30,868][175731] Updated weights for policy 0, policy_version 1120 (0.0007) [2023-03-07 09:45:31,665][175731] Updated weights for policy 0, policy_version 1130 (0.0006) [2023-03-07 09:45:32,466][175731] Updated weights for policy 0, policy_version 1140 (0.0007) [2023-03-07 09:45:33,268][175731] Updated weights for policy 0, policy_version 1150 (0.0006) [2023-03-07 09:45:33,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12714.6, 300 sec: 12395.8). Total num frames: 1177600. Throughput: 0: 12718.0. Samples: 1164525. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:45:33,322][175405] Avg episode reward: [(0, '40.021')] [2023-03-07 09:45:34,061][175731] Updated weights for policy 0, policy_version 1160 (0.0007) [2023-03-07 09:45:34,881][175731] Updated weights for policy 0, policy_version 1170 (0.0006) [2023-03-07 09:45:35,673][175731] Updated weights for policy 0, policy_version 1180 (0.0006) [2023-03-07 09:45:36,489][175731] Updated weights for policy 0, policy_version 1190 (0.0007) [2023-03-07 09:45:37,281][175731] Updated weights for policy 0, policy_version 1200 (0.0007) [2023-03-07 09:45:38,092][175731] Updated weights for policy 0, policy_version 1210 (0.0006) [2023-03-07 09:45:38,321][175405] Fps is (10 sec: 12697.4, 60 sec: 12714.7, 300 sec: 12410.9). Total num frames: 1241088. Throughput: 0: 12725.8. Samples: 1241079. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 09:45:38,322][175405] Avg episode reward: [(0, '35.979')] [2023-03-07 09:45:38,889][175731] Updated weights for policy 0, policy_version 1220 (0.0007) [2023-03-07 09:45:39,687][175731] Updated weights for policy 0, policy_version 1230 (0.0006) [2023-03-07 09:45:40,482][175731] Updated weights for policy 0, policy_version 1240 (0.0007) [2023-03-07 09:45:41,294][175731] Updated weights for policy 0, policy_version 1250 (0.0006) [2023-03-07 09:45:42,082][175731] Updated weights for policy 0, policy_version 1260 (0.0007) [2023-03-07 09:45:42,878][175731] Updated weights for policy 0, policy_version 1270 (0.0006) [2023-03-07 09:45:43,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12434.3). Total num frames: 1305600. Throughput: 0: 12721.8. Samples: 1279264. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:45:43,321][175405] Avg episode reward: [(0, '41.095')] [2023-03-07 09:45:43,674][175731] Updated weights for policy 0, policy_version 1280 (0.0006) [2023-03-07 09:45:44,494][175731] Updated weights for policy 0, policy_version 1290 (0.0006) [2023-03-07 09:45:45,297][175731] Updated weights for policy 0, policy_version 1300 (0.0006) [2023-03-07 09:45:46,102][175731] Updated weights for policy 0, policy_version 1310 (0.0005) [2023-03-07 09:45:46,910][175731] Updated weights for policy 0, policy_version 1320 (0.0006) [2023-03-07 09:45:47,712][175731] Updated weights for policy 0, policy_version 1330 (0.0005) [2023-03-07 09:45:48,321][175405] Fps is (10 sec: 12800.2, 60 sec: 12731.8, 300 sec: 12446.3). Total num frames: 1369088. Throughput: 0: 12725.8. Samples: 1355838. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:45:48,321][175405] Avg episode reward: [(0, '48.304')] [2023-03-07 09:45:48,325][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000001337_1369088.pth... [2023-03-07 09:45:48,533][175731] Updated weights for policy 0, policy_version 1340 (0.0006) [2023-03-07 09:45:49,329][175731] Updated weights for policy 0, policy_version 1350 (0.0007) [2023-03-07 09:45:50,140][175731] Updated weights for policy 0, policy_version 1360 (0.0007) [2023-03-07 09:45:50,943][175731] Updated weights for policy 0, policy_version 1370 (0.0006) [2023-03-07 09:45:51,761][175731] Updated weights for policy 0, policy_version 1380 (0.0006) [2023-03-07 09:45:52,558][175731] Updated weights for policy 0, policy_version 1390 (0.0007) [2023-03-07 09:45:53,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12457.2). Total num frames: 1432576. Throughput: 0: 12731.7. Samples: 1432260. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:45:53,321][175405] Avg episode reward: [(0, '46.900')] [2023-03-07 09:45:53,358][175731] Updated weights for policy 0, policy_version 1400 (0.0006) [2023-03-07 09:45:54,159][175731] Updated weights for policy 0, policy_version 1410 (0.0007) [2023-03-07 09:45:54,964][175731] Updated weights for policy 0, policy_version 1420 (0.0006) [2023-03-07 09:45:55,751][175731] Updated weights for policy 0, policy_version 1430 (0.0006) [2023-03-07 09:45:56,553][175731] Updated weights for policy 0, policy_version 1440 (0.0005) [2023-03-07 09:45:57,357][175731] Updated weights for policy 0, policy_version 1450 (0.0006) [2023-03-07 09:45:58,154][175731] Updated weights for policy 0, policy_version 1460 (0.0006) [2023-03-07 09:45:58,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12475.8). Total num frames: 1497088. Throughput: 0: 12739.4. Samples: 1470400. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:45:58,321][175405] Avg episode reward: [(0, '60.136')] [2023-03-07 09:45:58,324][175680] Saving new best policy, reward=60.136! [2023-03-07 09:45:58,954][175731] Updated weights for policy 0, policy_version 1470 (0.0006) [2023-03-07 09:45:59,749][175731] Updated weights for policy 0, policy_version 1480 (0.0007) [2023-03-07 09:46:00,560][175731] Updated weights for policy 0, policy_version 1490 (0.0007) [2023-03-07 09:46:01,365][175731] Updated weights for policy 0, policy_version 1500 (0.0006) [2023-03-07 09:46:02,183][175731] Updated weights for policy 0, policy_version 1510 (0.0006) [2023-03-07 09:46:02,985][175731] Updated weights for policy 0, policy_version 1520 (0.0006) [2023-03-07 09:46:03,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12731.7, 300 sec: 12484.6). Total num frames: 1560576. Throughput: 0: 12752.2. Samples: 1547176. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:46:03,322][175405] Avg episode reward: [(0, '60.213')] [2023-03-07 09:46:03,322][175680] Saving new best policy, reward=60.213! [2023-03-07 09:46:03,820][175731] Updated weights for policy 0, policy_version 1530 (0.0007) [2023-03-07 09:46:04,617][175731] Updated weights for policy 0, policy_version 1540 (0.0007) [2023-03-07 09:46:05,416][175731] Updated weights for policy 0, policy_version 1550 (0.0006) [2023-03-07 09:46:06,202][175731] Updated weights for policy 0, policy_version 1560 (0.0006) [2023-03-07 09:46:07,014][175731] Updated weights for policy 0, policy_version 1570 (0.0007) [2023-03-07 09:46:07,829][175731] Updated weights for policy 0, policy_version 1580 (0.0006) [2023-03-07 09:46:08,321][175405] Fps is (10 sec: 12595.1, 60 sec: 12714.7, 300 sec: 12484.9). Total num frames: 1623040. Throughput: 0: 12746.3. Samples: 1623321. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 09:46:08,332][175405] Avg episode reward: [(0, '52.408')] [2023-03-07 09:46:08,651][175731] Updated weights for policy 0, policy_version 1590 (0.0006) [2023-03-07 09:46:09,427][175731] Updated weights for policy 0, policy_version 1600 (0.0006) [2023-03-07 09:46:10,237][175731] Updated weights for policy 0, policy_version 1610 (0.0006) [2023-03-07 09:46:11,037][175731] Updated weights for policy 0, policy_version 1620 (0.0007) [2023-03-07 09:46:11,825][175731] Updated weights for policy 0, policy_version 1630 (0.0007) [2023-03-07 09:46:12,638][175731] Updated weights for policy 0, policy_version 1640 (0.0007) [2023-03-07 09:46:13,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12748.8, 300 sec: 12500.4). Total num frames: 1687552. Throughput: 0: 12740.1. Samples: 1661471. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:46:13,322][175405] Avg episode reward: [(0, '58.839')] [2023-03-07 09:46:13,442][175731] Updated weights for policy 0, policy_version 1650 (0.0006) [2023-03-07 09:46:14,261][175731] Updated weights for policy 0, policy_version 1660 (0.0006) [2023-03-07 09:46:15,057][175731] Updated weights for policy 0, policy_version 1670 (0.0006) [2023-03-07 09:46:15,872][175731] Updated weights for policy 0, policy_version 1680 (0.0007) [2023-03-07 09:46:16,665][175731] Updated weights for policy 0, policy_version 1690 (0.0006) [2023-03-07 09:46:17,487][175731] Updated weights for policy 0, policy_version 1700 (0.0007) [2023-03-07 09:46:18,283][175731] Updated weights for policy 0, policy_version 1710 (0.0006) [2023-03-07 09:46:18,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12748.8, 300 sec: 12507.4). Total num frames: 1751040. Throughput: 0: 12740.3. Samples: 1737839. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:46:18,332][175405] Avg episode reward: [(0, '68.911')] [2023-03-07 09:46:18,337][175680] Saving new best policy, reward=68.911! [2023-03-07 09:46:19,110][175731] Updated weights for policy 0, policy_version 1720 (0.0006) [2023-03-07 09:46:19,915][175731] Updated weights for policy 0, policy_version 1730 (0.0006) [2023-03-07 09:46:20,720][175731] Updated weights for policy 0, policy_version 1740 (0.0006) [2023-03-07 09:46:21,539][175731] Updated weights for policy 0, policy_version 1750 (0.0006) [2023-03-07 09:46:22,345][175731] Updated weights for policy 0, policy_version 1760 (0.0006) [2023-03-07 09:46:23,142][175731] Updated weights for policy 0, policy_version 1770 (0.0006) [2023-03-07 09:46:23,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12514.0). Total num frames: 1814528. Throughput: 0: 12724.8. Samples: 1813696. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:46:23,332][175405] Avg episode reward: [(0, '74.184')] [2023-03-07 09:46:23,333][175680] Saving new best policy, reward=74.184! [2023-03-07 09:46:23,935][175731] Updated weights for policy 0, policy_version 1780 (0.0006) [2023-03-07 09:46:24,749][175731] Updated weights for policy 0, policy_version 1790 (0.0006) [2023-03-07 09:46:25,546][175731] Updated weights for policy 0, policy_version 1800 (0.0005) [2023-03-07 09:46:26,358][175731] Updated weights for policy 0, policy_version 1810 (0.0006) [2023-03-07 09:46:27,162][175731] Updated weights for policy 0, policy_version 1820 (0.0007) [2023-03-07 09:46:27,968][175731] Updated weights for policy 0, policy_version 1830 (0.0007) [2023-03-07 09:46:28,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12731.7, 300 sec: 12520.1). Total num frames: 1878016. Throughput: 0: 12728.6. Samples: 1852052. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 09:46:28,332][175405] Avg episode reward: [(0, '72.848')] [2023-03-07 09:46:28,777][175731] Updated weights for policy 0, policy_version 1840 (0.0006) [2023-03-07 09:46:29,574][175731] Updated weights for policy 0, policy_version 1850 (0.0006) [2023-03-07 09:46:30,396][175731] Updated weights for policy 0, policy_version 1860 (0.0006) [2023-03-07 09:46:31,206][175731] Updated weights for policy 0, policy_version 1870 (0.0007) [2023-03-07 09:46:32,011][175731] Updated weights for policy 0, policy_version 1880 (0.0006) [2023-03-07 09:46:32,818][175731] Updated weights for policy 0, policy_version 1890 (0.0006) [2023-03-07 09:46:33,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12731.7, 300 sec: 12525.9). Total num frames: 1941504. Throughput: 0: 12716.0. Samples: 1928061. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:46:33,332][175405] Avg episode reward: [(0, '88.796')] [2023-03-07 09:46:33,333][175680] Saving new best policy, reward=88.796! 
[2023-03-07 09:46:33,628][175731] Updated weights for policy 0, policy_version 1900 (0.0007) [2023-03-07 09:46:34,443][175731] Updated weights for policy 0, policy_version 1910 (0.0007) [2023-03-07 09:46:35,241][175731] Updated weights for policy 0, policy_version 1920 (0.0006) [2023-03-07 09:46:36,054][175731] Updated weights for policy 0, policy_version 1930 (0.0006) [2023-03-07 09:46:36,861][175731] Updated weights for policy 0, policy_version 1940 (0.0007) [2023-03-07 09:46:37,659][175731] Updated weights for policy 0, policy_version 1950 (0.0006) [2023-03-07 09:46:38,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12731.8, 300 sec: 12531.2). Total num frames: 2004992. Throughput: 0: 12704.8. Samples: 2003976. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:46:38,332][175405] Avg episode reward: [(0, '97.168')] [2023-03-07 09:46:38,336][175680] Saving new best policy, reward=97.168! [2023-03-07 09:46:38,483][175731] Updated weights for policy 0, policy_version 1960 (0.0006) [2023-03-07 09:46:39,286][175731] Updated weights for policy 0, policy_version 1970 (0.0007) [2023-03-07 09:46:40,090][175731] Updated weights for policy 0, policy_version 1980 (0.0006) [2023-03-07 09:46:40,907][175731] Updated weights for policy 0, policy_version 1990 (0.0006) [2023-03-07 09:46:41,723][175731] Updated weights for policy 0, policy_version 2000 (0.0006) [2023-03-07 09:46:42,530][175731] Updated weights for policy 0, policy_version 2010 (0.0006) [2023-03-07 09:46:43,321][175405] Fps is (10 sec: 12595.2, 60 sec: 12697.6, 300 sec: 12530.1). Total num frames: 2067456. Throughput: 0: 12699.6. Samples: 2041885. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:46:43,332][175405] Avg episode reward: [(0, '83.282')] [2023-03-07 09:46:43,334][175731] Updated weights for policy 0, policy_version 2020 (0.0007) [2023-03-07 09:46:44,139][175731] Updated weights for policy 0, policy_version 2030 (0.0007) [2023-03-07 09:46:44,941][175731] Updated weights for policy 0, policy_version 2040 (0.0007) [2023-03-07 09:46:45,749][175731] Updated weights for policy 0, policy_version 2050 (0.0007) [2023-03-07 09:46:46,558][175731] Updated weights for policy 0, policy_version 2060 (0.0006) [2023-03-07 09:46:47,400][175731] Updated weights for policy 0, policy_version 2070 (0.0007) [2023-03-07 09:46:48,194][175731] Updated weights for policy 0, policy_version 2080 (0.0007) [2023-03-07 09:46:48,321][175405] Fps is (10 sec: 12595.2, 60 sec: 12697.6, 300 sec: 12535.0). Total num frames: 2130944. Throughput: 0: 12683.5. Samples: 2117935. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:46:48,321][175405] Avg episode reward: [(0, '108.921')] [2023-03-07 09:46:48,325][175680] Saving new best policy, reward=108.921! [2023-03-07 09:46:48,989][175731] Updated weights for policy 0, policy_version 2090 (0.0007) [2023-03-07 09:46:49,799][175731] Updated weights for policy 0, policy_version 2100 (0.0007) [2023-03-07 09:46:50,613][175731] Updated weights for policy 0, policy_version 2110 (0.0007) [2023-03-07 09:46:51,410][175731] Updated weights for policy 0, policy_version 2120 (0.0007) [2023-03-07 09:46:52,228][175731] Updated weights for policy 0, policy_version 2130 (0.0007) [2023-03-07 09:46:53,040][175731] Updated weights for policy 0, policy_version 2140 (0.0006) [2023-03-07 09:46:53,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12539.6). Total num frames: 2194432. Throughput: 0: 12684.6. Samples: 2194127. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:46:53,322][175405] Avg episode reward: [(0, '149.404')] [2023-03-07 09:46:53,322][175680] Saving new best policy, reward=149.404! [2023-03-07 09:46:53,844][175731] Updated weights for policy 0, policy_version 2150 (0.0006) [2023-03-07 09:46:54,646][175731] Updated weights for policy 0, policy_version 2160 (0.0006) [2023-03-07 09:46:55,439][175731] Updated weights for policy 0, policy_version 2170 (0.0006) [2023-03-07 09:46:56,266][175731] Updated weights for policy 0, policy_version 2180 (0.0007) [2023-03-07 09:46:57,062][175731] Updated weights for policy 0, policy_version 2190 (0.0006) [2023-03-07 09:46:57,870][175731] Updated weights for policy 0, policy_version 2200 (0.0007) [2023-03-07 09:46:58,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12680.5, 300 sec: 12544.0). Total num frames: 2257920. Throughput: 0: 12683.0. Samples: 2232205. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 09:46:58,332][175405] Avg episode reward: [(0, '199.205')] [2023-03-07 09:46:58,336][175680] Saving new best policy, reward=199.205! [2023-03-07 09:46:58,683][175731] Updated weights for policy 0, policy_version 2210 (0.0006) [2023-03-07 09:46:59,510][175731] Updated weights for policy 0, policy_version 2220 (0.0006) [2023-03-07 09:47:00,313][175731] Updated weights for policy 0, policy_version 2230 (0.0007) [2023-03-07 09:47:01,106][175731] Updated weights for policy 0, policy_version 2240 (0.0006) [2023-03-07 09:47:01,921][175731] Updated weights for policy 0, policy_version 2250 (0.0007) [2023-03-07 09:47:02,741][175731] Updated weights for policy 0, policy_version 2260 (0.0006) [2023-03-07 09:47:03,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12680.5, 300 sec: 12548.2). Total num frames: 2321408. Throughput: 0: 12675.3. Samples: 2308226. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:47:03,332][175405] Avg episode reward: [(0, '195.209')] [2023-03-07 09:47:03,535][175731] Updated weights for policy 0, policy_version 2270 (0.0006) [2023-03-07 09:47:04,347][175731] Updated weights for policy 0, policy_version 2280 (0.0007) [2023-03-07 09:47:05,140][175731] Updated weights for policy 0, policy_version 2290 (0.0007) [2023-03-07 09:47:05,960][175731] Updated weights for policy 0, policy_version 2300 (0.0007) [2023-03-07 09:47:06,758][175731] Updated weights for policy 0, policy_version 2310 (0.0006) [2023-03-07 09:47:07,527][175731] Updated weights for policy 0, policy_version 2320 (0.0006) [2023-03-07 09:47:08,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12552.1). Total num frames: 2384896. Throughput: 0: 12687.0. Samples: 2384609. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:47:08,332][175405] Avg episode reward: [(0, '155.177')] [2023-03-07 09:47:08,342][175731] Updated weights for policy 0, policy_version 2330 (0.0006) [2023-03-07 09:47:09,068][175680] KL-divergence is very high: 137.9490 [2023-03-07 09:47:09,146][175731] Updated weights for policy 0, policy_version 2340 (0.0006) [2023-03-07 09:47:09,946][175731] Updated weights for policy 0, policy_version 2350 (0.0006) [2023-03-07 09:47:10,738][175731] Updated weights for policy 0, policy_version 2360 (0.0006) [2023-03-07 09:47:11,542][175731] Updated weights for policy 0, policy_version 2370 (0.0006) [2023-03-07 09:47:12,338][175731] Updated weights for policy 0, policy_version 2380 (0.0006) [2023-03-07 09:47:13,157][175731] Updated weights for policy 0, policy_version 2390 (0.0006) [2023-03-07 09:47:13,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12697.6, 300 sec: 12561.1). Total num frames: 2449408. Throughput: 0: 12688.0. Samples: 2423011. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:47:13,332][175405] Avg episode reward: [(0, '126.321')] [2023-03-07 09:47:13,950][175731] Updated weights for policy 0, policy_version 2400 (0.0006) [2023-03-07 09:47:14,745][175731] Updated weights for policy 0, policy_version 2410 (0.0006) [2023-03-07 09:47:15,556][175731] Updated weights for policy 0, policy_version 2420 (0.0006) [2023-03-07 09:47:16,375][175731] Updated weights for policy 0, policy_version 2430 (0.0006) [2023-03-07 09:47:17,166][175731] Updated weights for policy 0, policy_version 2440 (0.0007) [2023-03-07 09:47:17,986][175731] Updated weights for policy 0, policy_version 2450 (0.0008) [2023-03-07 09:47:18,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12697.6, 300 sec: 12564.5). Total num frames: 2512896. Throughput: 0: 12695.9. Samples: 2499379. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:47:18,332][175405] Avg episode reward: [(0, '169.458')] [2023-03-07 09:47:18,782][175731] Updated weights for policy 0, policy_version 2460 (0.0006) [2023-03-07 09:47:19,582][175731] Updated weights for policy 0, policy_version 2470 (0.0006) [2023-03-07 09:47:20,403][175731] Updated weights for policy 0, policy_version 2480 (0.0006) [2023-03-07 09:47:21,199][175731] Updated weights for policy 0, policy_version 2490 (0.0006) [2023-03-07 09:47:21,996][175731] Updated weights for policy 0, policy_version 2500 (0.0007) [2023-03-07 09:47:22,798][175731] Updated weights for policy 0, policy_version 2510 (0.0006) [2023-03-07 09:47:22,865][175680] KL-divergence is very high: 109.4033 [2023-03-07 09:47:23,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12567.7). Total num frames: 2576384. Throughput: 0: 12710.1. Samples: 2575929. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:47:23,332][175405] Avg episode reward: [(0, '101.154')] [2023-03-07 09:47:23,603][175731] Updated weights for policy 0, policy_version 2520 (0.0006) [2023-03-07 09:47:24,417][175731] Updated weights for policy 0, policy_version 2530 (0.0007) [2023-03-07 09:47:25,229][175731] Updated weights for policy 0, policy_version 2540 (0.0006) [2023-03-07 09:47:26,030][175731] Updated weights for policy 0, policy_version 2550 (0.0006) [2023-03-07 09:47:26,829][175731] Updated weights for policy 0, policy_version 2560 (0.0006) [2023-03-07 09:47:27,648][175731] Updated weights for policy 0, policy_version 2570 (0.0006) [2023-03-07 09:47:28,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12570.8). Total num frames: 2639872. Throughput: 0: 12712.4. Samples: 2613945. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:47:28,322][175405] Avg episode reward: [(0, '98.916')] [2023-03-07 09:47:28,465][175731] Updated weights for policy 0, policy_version 2580 (0.0006) [2023-03-07 09:47:29,246][175731] Updated weights for policy 0, policy_version 2590 (0.0006) [2023-03-07 09:47:30,052][175731] Updated weights for policy 0, policy_version 2600 (0.0007) [2023-03-07 09:47:30,848][175731] Updated weights for policy 0, policy_version 2610 (0.0007) [2023-03-07 09:47:31,633][175731] Updated weights for policy 0, policy_version 2620 (0.0006) [2023-03-07 09:47:32,458][175731] Updated weights for policy 0, policy_version 2630 (0.0007) [2023-03-07 09:47:33,249][175731] Updated weights for policy 0, policy_version 2640 (0.0006) [2023-03-07 09:47:33,321][175405] Fps is (10 sec: 12697.4, 60 sec: 12697.6, 300 sec: 12573.8). Total num frames: 2703360. Throughput: 0: 12722.2. Samples: 2690436. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:47:33,322][175405] Avg episode reward: [(0, '118.567')] [2023-03-07 09:47:34,061][175731] Updated weights for policy 0, policy_version 2650 (0.0006) [2023-03-07 09:47:34,876][175731] Updated weights for policy 0, policy_version 2660 (0.0006) [2023-03-07 09:47:35,674][175731] Updated weights for policy 0, policy_version 2670 (0.0006) [2023-03-07 09:47:36,470][175731] Updated weights for policy 0, policy_version 2680 (0.0006) [2023-03-07 09:47:37,294][175731] Updated weights for policy 0, policy_version 2690 (0.0007) [2023-03-07 09:47:38,084][175731] Updated weights for policy 0, policy_version 2700 (0.0006) [2023-03-07 09:47:38,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12714.7, 300 sec: 12581.3). Total num frames: 2767872. Throughput: 0: 12731.8. Samples: 2767055. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:47:38,321][175405] Avg episode reward: [(0, '147.602')] [2023-03-07 09:47:38,869][175731] Updated weights for policy 0, policy_version 2710 (0.0007) [2023-03-07 09:47:39,665][175731] Updated weights for policy 0, policy_version 2720 (0.0006) [2023-03-07 09:47:40,478][175731] Updated weights for policy 0, policy_version 2730 (0.0007) [2023-03-07 09:47:41,262][175731] Updated weights for policy 0, policy_version 2740 (0.0006) [2023-03-07 09:47:42,083][175731] Updated weights for policy 0, policy_version 2750 (0.0006) [2023-03-07 09:47:42,873][175731] Updated weights for policy 0, policy_version 2760 (0.0007) [2023-03-07 09:47:43,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12731.7, 300 sec: 12583.8). Total num frames: 2831360. Throughput: 0: 12741.8. Samples: 2805585. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 09:47:43,322][175405] Avg episode reward: [(0, '170.609')] [2023-03-07 09:47:43,682][175731] Updated weights for policy 0, policy_version 2770 (0.0006) [2023-03-07 09:47:44,483][175731] Updated weights for policy 0, policy_version 2780 (0.0006) [2023-03-07 09:47:45,265][175731] Updated weights for policy 0, policy_version 2790 (0.0007) [2023-03-07 09:47:46,084][175731] Updated weights for policy 0, policy_version 2800 (0.0006) [2023-03-07 09:47:46,870][175731] Updated weights for policy 0, policy_version 2810 (0.0008) [2023-03-07 09:47:47,684][175731] Updated weights for policy 0, policy_version 2820 (0.0006) [2023-03-07 09:47:48,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12586.3). Total num frames: 2894848. Throughput: 0: 12749.7. Samples: 2881962. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:47:48,322][175405] Avg episode reward: [(0, '91.278')] [2023-03-07 09:47:48,338][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000002828_2895872.pth... [2023-03-07 09:47:48,502][175731] Updated weights for policy 0, policy_version 2830 (0.0006) [2023-03-07 09:47:49,295][175731] Updated weights for policy 0, policy_version 2840 (0.0006) [2023-03-07 09:47:50,086][175731] Updated weights for policy 0, policy_version 2850 (0.0006) [2023-03-07 09:47:50,897][175731] Updated weights for policy 0, policy_version 2860 (0.0006) [2023-03-07 09:47:51,704][175731] Updated weights for policy 0, policy_version 2870 (0.0006) [2023-03-07 09:47:52,523][175731] Updated weights for policy 0, policy_version 2880 (0.0006) [2023-03-07 09:47:53,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12731.8, 300 sec: 12588.7). Total num frames: 2958336. Throughput: 0: 12750.3. Samples: 2958372. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:47:53,322][175405] Avg episode reward: [(0, '120.794')] [2023-03-07 09:47:53,329][175731] Updated weights for policy 0, policy_version 2890 (0.0007) [2023-03-07 09:47:54,123][175731] Updated weights for policy 0, policy_version 2900 (0.0007) [2023-03-07 09:47:54,938][175731] Updated weights for policy 0, policy_version 2910 (0.0006) [2023-03-07 09:47:55,741][175731] Updated weights for policy 0, policy_version 2920 (0.0006) [2023-03-07 09:47:56,529][175731] Updated weights for policy 0, policy_version 2930 (0.0008) [2023-03-07 09:47:57,345][175731] Updated weights for policy 0, policy_version 2940 (0.0007) [2023-03-07 09:47:58,142][175731] Updated weights for policy 0, policy_version 2950 (0.0006) [2023-03-07 09:47:58,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12748.8, 300 sec: 12595.2). Total num frames: 3022848. Throughput: 0: 12744.8. Samples: 2996529. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:47:58,322][175405] Avg episode reward: [(0, '178.375')] [2023-03-07 09:47:58,961][175731] Updated weights for policy 0, policy_version 2960 (0.0007) [2023-03-07 09:47:59,757][175731] Updated weights for policy 0, policy_version 2970 (0.0006) [2023-03-07 09:48:00,545][175731] Updated weights for policy 0, policy_version 2980 (0.0006) [2023-03-07 09:48:01,369][175731] Updated weights for policy 0, policy_version 2990 (0.0007) [2023-03-07 09:48:02,180][175731] Updated weights for policy 0, policy_version 3000 (0.0007) [2023-03-07 09:48:02,990][175731] Updated weights for policy 0, policy_version 3010 (0.0006) [2023-03-07 09:48:03,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12593.1). Total num frames: 3085312. Throughput: 0: 12743.6. Samples: 3072841. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 09:48:03,321][175405] Avg episode reward: [(0, '133.002')] [2023-03-07 09:48:03,790][175731] Updated weights for policy 0, policy_version 3020 (0.0006) [2023-03-07 09:48:04,577][175731] Updated weights for policy 0, policy_version 3030 (0.0006) [2023-03-07 09:48:05,394][175731] Updated weights for policy 0, policy_version 3040 (0.0008) [2023-03-07 09:48:06,213][175731] Updated weights for policy 0, policy_version 3050 (0.0007) [2023-03-07 09:48:06,986][175731] Updated weights for policy 0, policy_version 3060 (0.0006) [2023-03-07 09:48:07,805][175731] Updated weights for policy 0, policy_version 3070 (0.0006) [2023-03-07 09:48:08,321][175405] Fps is (10 sec: 12697.8, 60 sec: 12748.8, 300 sec: 12599.3). Total num frames: 3149824. Throughput: 0: 12742.3. Samples: 3149332. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:48:08,321][175405] Avg episode reward: [(0, '152.472')] [2023-03-07 09:48:08,618][175731] Updated weights for policy 0, policy_version 3080 (0.0007) [2023-03-07 09:48:09,432][175731] Updated weights for policy 0, policy_version 3090 (0.0007) [2023-03-07 09:48:10,222][175731] Updated weights for policy 0, policy_version 3100 (0.0006) [2023-03-07 09:48:11,034][175731] Updated weights for policy 0, policy_version 3110 (0.0006) [2023-03-07 09:48:11,817][175731] Updated weights for policy 0, policy_version 3120 (0.0006) [2023-03-07 09:48:12,628][175731] Updated weights for policy 0, policy_version 3130 (0.0006) [2023-03-07 09:48:13,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12601.2). Total num frames: 3213312. Throughput: 0: 12742.1. Samples: 3187341. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:48:13,322][175405] Avg episode reward: [(0, '105.880')] [2023-03-07 09:48:13,415][175731] Updated weights for policy 0, policy_version 3140 (0.0007) [2023-03-07 09:48:14,210][175731] Updated weights for policy 0, policy_version 3150 (0.0007) [2023-03-07 09:48:15,018][175731] Updated weights for policy 0, policy_version 3160 (0.0006) [2023-03-07 09:48:15,811][175731] Updated weights for policy 0, policy_version 3170 (0.0007) [2023-03-07 09:48:16,632][175731] Updated weights for policy 0, policy_version 3180 (0.0007) [2023-03-07 09:48:17,434][175731] Updated weights for policy 0, policy_version 3190 (0.0006) [2023-03-07 09:48:18,226][175731] Updated weights for policy 0, policy_version 3200 (0.0006) [2023-03-07 09:48:18,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12607.0). Total num frames: 3277824. Throughput: 0: 12754.0. Samples: 3264365. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:48:18,321][175405] Avg episode reward: [(0, '101.975')] [2023-03-07 09:48:19,021][175731] Updated weights for policy 0, policy_version 3210 (0.0006) [2023-03-07 09:48:19,828][175731] Updated weights for policy 0, policy_version 3220 (0.0006) [2023-03-07 09:48:20,613][175731] Updated weights for policy 0, policy_version 3230 (0.0006) [2023-03-07 09:48:21,429][175731] Updated weights for policy 0, policy_version 3240 (0.0007) [2023-03-07 09:48:22,211][175731] Updated weights for policy 0, policy_version 3250 (0.0006) [2023-03-07 09:48:23,002][175731] Updated weights for policy 0, policy_version 3260 (0.0006) [2023-03-07 09:48:23,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12748.8, 300 sec: 12608.7). Total num frames: 3341312. Throughput: 0: 12758.9. Samples: 3341207. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:48:23,321][175405] Avg episode reward: [(0, '116.209')] [2023-03-07 09:48:23,794][175731] Updated weights for policy 0, policy_version 3270 (0.0006) [2023-03-07 09:48:24,597][175731] Updated weights for policy 0, policy_version 3280 (0.0007) [2023-03-07 09:48:25,401][175731] Updated weights for policy 0, policy_version 3290 (0.0006) [2023-03-07 09:48:26,199][175731] Updated weights for policy 0, policy_version 3300 (0.0006) [2023-03-07 09:48:26,997][175731] Updated weights for policy 0, policy_version 3310 (0.0006) [2023-03-07 09:48:27,797][175731] Updated weights for policy 0, policy_version 3320 (0.0007) [2023-03-07 09:48:28,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12765.9, 300 sec: 12614.2). Total num frames: 3405824. Throughput: 0: 12757.3. Samples: 3379664. Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) [2023-03-07 09:48:28,321][175405] Avg episode reward: [(0, '84.814')] [2023-03-07 09:48:28,598][175731] Updated weights for policy 0, policy_version 3330 (0.0006) [2023-03-07 09:48:29,394][175731] Updated weights for policy 0, policy_version 3340 (0.0006) [2023-03-07 09:48:30,191][175731] Updated weights for policy 0, policy_version 3350 (0.0007) [2023-03-07 09:48:31,001][175731] Updated weights for policy 0, policy_version 3360 (0.0006) [2023-03-07 09:48:31,793][175731] Updated weights for policy 0, policy_version 3370 (0.0008) [2023-03-07 09:48:32,594][175731] Updated weights for policy 0, policy_version 3380 (0.0006) [2023-03-07 09:48:33,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12782.9, 300 sec: 12619.4). Total num frames: 3470336. Throughput: 0: 12771.3. Samples: 3456672. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 09:48:33,322][175405] Avg episode reward: [(0, '119.782')] [2023-03-07 09:48:33,394][175731] Updated weights for policy 0, policy_version 3390 (0.0006) [2023-03-07 09:48:34,188][175731] Updated weights for policy 0, policy_version 3400 (0.0008) [2023-03-07 09:48:34,991][175731] Updated weights for policy 0, policy_version 3410 (0.0007) [2023-03-07 09:48:35,810][175731] Updated weights for policy 0, policy_version 3420 (0.0006) [2023-03-07 09:48:36,589][175731] Updated weights for policy 0, policy_version 3430 (0.0007) [2023-03-07 09:48:37,389][175731] Updated weights for policy 0, policy_version 3440 (0.0006) [2023-03-07 09:48:38,193][175731] Updated weights for policy 0, policy_version 3450 (0.0006) [2023-03-07 09:48:38,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12765.8, 300 sec: 12620.8). Total num frames: 3533824. Throughput: 0: 12777.1. Samples: 3533341. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:48:38,322][175405] Avg episode reward: [(0, '116.789')] [2023-03-07 09:48:38,998][175731] Updated weights for policy 0, policy_version 3460 (0.0006) [2023-03-07 09:48:39,795][175731] Updated weights for policy 0, policy_version 3470 (0.0006) [2023-03-07 09:48:40,585][175731] Updated weights for policy 0, policy_version 3480 (0.0008) [2023-03-07 09:48:41,390][175731] Updated weights for policy 0, policy_version 3490 (0.0007) [2023-03-07 09:48:42,171][175731] Updated weights for policy 0, policy_version 3500 (0.0006) [2023-03-07 09:48:42,975][175731] Updated weights for policy 0, policy_version 3510 (0.0006) [2023-03-07 09:48:43,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12782.9, 300 sec: 12625.8). Total num frames: 3598336. Throughput: 0: 12785.0. Samples: 3571854. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:48:43,321][175405] Avg episode reward: [(0, '59.106')] [2023-03-07 09:48:43,769][175731] Updated weights for policy 0, policy_version 3520 (0.0007) [2023-03-07 09:48:44,552][175731] Updated weights for policy 0, policy_version 3530 (0.0006) [2023-03-07 09:48:45,369][175731] Updated weights for policy 0, policy_version 3540 (0.0007) [2023-03-07 09:48:46,163][175731] Updated weights for policy 0, policy_version 3550 (0.0007) [2023-03-07 09:48:46,956][175731] Updated weights for policy 0, policy_version 3560 (0.0007) [2023-03-07 09:48:47,808][175731] Updated weights for policy 0, policy_version 3570 (0.0006) [2023-03-07 09:48:48,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12782.9, 300 sec: 12627.0). Total num frames: 3661824. Throughput: 0: 12801.9. Samples: 3648927. Policy #0 lag: (min: 0.0, avg: 1.3, max: 4.0) [2023-03-07 09:48:48,322][175405] Avg episode reward: [(0, '59.353')] [2023-03-07 09:48:48,589][175731] Updated weights for policy 0, policy_version 3580 (0.0006) [2023-03-07 09:48:49,378][175731] Updated weights for policy 0, policy_version 3590 (0.0006) [2023-03-07 09:48:50,196][175731] Updated weights for policy 0, policy_version 3600 (0.0007) [2023-03-07 09:48:50,347][175680] KL-divergence is very high: 3434.0444 [2023-03-07 09:48:50,987][175731] Updated weights for policy 0, policy_version 3610 (0.0007) [2023-03-07 09:48:51,786][175731] Updated weights for policy 0, policy_version 3620 (0.0006) [2023-03-07 09:48:52,569][175731] Updated weights for policy 0, policy_version 3630 (0.0006) [2023-03-07 09:48:53,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12800.0, 300 sec: 12631.7). Total num frames: 3726336. Throughput: 0: 12806.7. Samples: 3725633. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:48:53,321][175405] Avg episode reward: [(0, '36.284')] [2023-03-07 09:48:53,361][175731] Updated weights for policy 0, policy_version 3640 (0.0006) [2023-03-07 09:48:54,170][175731] Updated weights for policy 0, policy_version 3650 (0.0007) [2023-03-07 09:48:54,973][175731] Updated weights for policy 0, policy_version 3660 (0.0007) [2023-03-07 09:48:55,770][175731] Updated weights for policy 0, policy_version 3670 (0.0006) [2023-03-07 09:48:56,562][175731] Updated weights for policy 0, policy_version 3680 (0.0007) [2023-03-07 09:48:57,378][175731] Updated weights for policy 0, policy_version 3690 (0.0006) [2023-03-07 09:48:58,178][175731] Updated weights for policy 0, policy_version 3700 (0.0006) [2023-03-07 09:48:58,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12783.0, 300 sec: 12746.2). Total num frames: 3789824. Throughput: 0: 12817.9. Samples: 3764146. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:48:58,322][175405] Avg episode reward: [(0, '68.937')] [2023-03-07 09:48:58,955][175731] Updated weights for policy 0, policy_version 3710 (0.0007) [2023-03-07 09:48:59,764][175731] Updated weights for policy 0, policy_version 3720 (0.0007) [2023-03-07 09:49:00,589][175731] Updated weights for policy 0, policy_version 3730 (0.0007) [2023-03-07 09:49:01,370][175731] Updated weights for policy 0, policy_version 3740 (0.0007) [2023-03-07 09:49:02,179][175731] Updated weights for policy 0, policy_version 3750 (0.0007) [2023-03-07 09:49:02,982][175731] Updated weights for policy 0, policy_version 3760 (0.0006) [2023-03-07 09:49:03,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12746.2). Total num frames: 3854336. Throughput: 0: 12810.9. Samples: 3840856. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 4.0) [2023-03-07 09:49:03,321][175405] Avg episode reward: [(0, '49.599')] [2023-03-07 09:49:03,766][175731] Updated weights for policy 0, policy_version 3770 (0.0006) [2023-03-07 09:49:04,561][175731] Updated weights for policy 0, policy_version 3780 (0.0006) [2023-03-07 09:49:05,383][175731] Updated weights for policy 0, policy_version 3790 (0.0006) [2023-03-07 09:49:06,175][175731] Updated weights for policy 0, policy_version 3800 (0.0006) [2023-03-07 09:49:06,964][175731] Updated weights for policy 0, policy_version 3810 (0.0006) [2023-03-07 09:49:07,774][175731] Updated weights for policy 0, policy_version 3820 (0.0006) [2023-03-07 09:49:08,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12742.7). Total num frames: 3917824. Throughput: 0: 12814.2. Samples: 3917847. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 09:49:08,322][175405] Avg episode reward: [(0, '36.110')] [2023-03-07 09:49:08,550][175731] Updated weights for policy 0, policy_version 3830 (0.0006) [2023-03-07 09:49:09,382][175731] Updated weights for policy 0, policy_version 3840 (0.0006) [2023-03-07 09:49:10,172][175731] Updated weights for policy 0, policy_version 3850 (0.0006) [2023-03-07 09:49:10,962][175731] Updated weights for policy 0, policy_version 3860 (0.0006) [2023-03-07 09:49:11,768][175731] Updated weights for policy 0, policy_version 3870 (0.0008) [2023-03-07 09:49:12,581][175731] Updated weights for policy 0, policy_version 3880 (0.0007) [2023-03-07 09:49:13,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12817.1, 300 sec: 12746.2). Total num frames: 3982336. Throughput: 0: 12810.0. Samples: 3956115. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:49:13,322][175405] Avg episode reward: [(0, '31.565')] [2023-03-07 09:49:13,381][175731] Updated weights for policy 0, policy_version 3890 (0.0007) [2023-03-07 09:49:14,193][175731] Updated weights for policy 0, policy_version 3900 (0.0007) [2023-03-07 09:49:15,018][175731] Updated weights for policy 0, policy_version 3910 (0.0006) [2023-03-07 09:49:15,804][175731] Updated weights for policy 0, policy_version 3920 (0.0007) [2023-03-07 09:49:16,613][175731] Updated weights for policy 0, policy_version 3930 (0.0006) [2023-03-07 09:49:17,409][175731] Updated weights for policy 0, policy_version 3940 (0.0006) [2023-03-07 09:49:18,228][175731] Updated weights for policy 0, policy_version 3950 (0.0007) [2023-03-07 09:49:18,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12800.0, 300 sec: 12742.7). Total num frames: 4045824. Throughput: 0: 12797.2. Samples: 4032546. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 09:49:18,322][175405] Avg episode reward: [(0, '33.849')] [2023-03-07 09:49:19,028][175731] Updated weights for policy 0, policy_version 3960 (0.0007) [2023-03-07 09:49:19,822][175731] Updated weights for policy 0, policy_version 3970 (0.0006) [2023-03-07 09:49:20,633][175731] Updated weights for policy 0, policy_version 3980 (0.0006) [2023-03-07 09:49:21,421][175731] Updated weights for policy 0, policy_version 3990 (0.0006) [2023-03-07 09:49:22,201][175731] Updated weights for policy 0, policy_version 4000 (0.0005) [2023-03-07 09:49:23,007][175731] Updated weights for policy 0, policy_version 4010 (0.0007) [2023-03-07 09:49:23,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12800.0, 300 sec: 12739.2). Total num frames: 4109312. Throughput: 0: 12800.7. Samples: 4109373. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 09:49:23,322][175405] Avg episode reward: [(0, '31.890')] [2023-03-07 09:49:23,812][175731] Updated weights for policy 0, policy_version 4020 (0.0006) [2023-03-07 09:49:24,598][175731] Updated weights for policy 0, policy_version 4030 (0.0006) [2023-03-07 09:49:25,399][175731] Updated weights for policy 0, policy_version 4040 (0.0006) [2023-03-07 09:49:26,198][175731] Updated weights for policy 0, policy_version 4050 (0.0007) [2023-03-07 09:49:26,997][175731] Updated weights for policy 0, policy_version 4060 (0.0006) [2023-03-07 09:49:27,797][175731] Updated weights for policy 0, policy_version 4070 (0.0006) [2023-03-07 09:49:28,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12800.0, 300 sec: 12742.7). Total num frames: 4173824. Throughput: 0: 12796.3. Samples: 4147686. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:49:28,322][175405] Avg episode reward: [(0, '28.625')] [2023-03-07 09:49:28,599][175731] Updated weights for policy 0, policy_version 4080 (0.0007) [2023-03-07 09:49:29,391][175731] Updated weights for policy 0, policy_version 4090 (0.0007) [2023-03-07 09:49:30,189][175731] Updated weights for policy 0, policy_version 4100 (0.0006) [2023-03-07 09:49:30,988][175731] Updated weights for policy 0, policy_version 4110 (0.0007) [2023-03-07 09:49:31,797][175731] Updated weights for policy 0, policy_version 4120 (0.0006) [2023-03-07 09:49:32,591][175731] Updated weights for policy 0, policy_version 4130 (0.0006) [2023-03-07 09:49:33,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12800.0, 300 sec: 12746.2). Total num frames: 4238336. Throughput: 0: 12793.1. Samples: 4224618. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:49:33,321][175405] Avg episode reward: [(0, '28.158')] [2023-03-07 09:49:33,403][175731] Updated weights for policy 0, policy_version 4140 (0.0006) [2023-03-07 09:49:34,198][175731] Updated weights for policy 0, policy_version 4150 (0.0006) [2023-03-07 09:49:34,994][175731] Updated weights for policy 0, policy_version 4160 (0.0006) [2023-03-07 09:49:35,804][175731] Updated weights for policy 0, policy_version 4170 (0.0007) [2023-03-07 09:49:36,602][175731] Updated weights for policy 0, policy_version 4180 (0.0006) [2023-03-07 09:49:37,399][175731] Updated weights for policy 0, policy_version 4190 (0.0007) [2023-03-07 09:49:38,202][175731] Updated weights for policy 0, policy_version 4200 (0.0006) [2023-03-07 09:49:38,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12746.2). Total num frames: 4301824. Throughput: 0: 12791.4. Samples: 4301245. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:49:38,322][175405] Avg episode reward: [(0, '29.066')] [2023-03-07 09:49:39,015][175731] Updated weights for policy 0, policy_version 4210 (0.0006) [2023-03-07 09:49:39,824][175731] Updated weights for policy 0, policy_version 4220 (0.0006) [2023-03-07 09:49:40,637][175731] Updated weights for policy 0, policy_version 4230 (0.0006) [2023-03-07 09:49:41,434][175731] Updated weights for policy 0, policy_version 4240 (0.0006) [2023-03-07 09:49:42,245][175731] Updated weights for policy 0, policy_version 4250 (0.0006) [2023-03-07 09:49:43,036][175731] Updated weights for policy 0, policy_version 4260 (0.0007) [2023-03-07 09:49:43,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12782.9, 300 sec: 12746.2). Total num frames: 4365312. Throughput: 0: 12782.8. Samples: 4339371. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:49:43,322][175405] Avg episode reward: [(0, '29.655')] [2023-03-07 09:49:43,825][175731] Updated weights for policy 0, policy_version 4270 (0.0008) [2023-03-07 09:49:44,624][175731] Updated weights for policy 0, policy_version 4280 (0.0006) [2023-03-07 09:49:45,432][175731] Updated weights for policy 0, policy_version 4290 (0.0006) [2023-03-07 09:49:46,210][175731] Updated weights for policy 0, policy_version 4300 (0.0006) [2023-03-07 09:49:47,006][175731] Updated weights for policy 0, policy_version 4310 (0.0006) [2023-03-07 09:49:47,818][175731] Updated weights for policy 0, policy_version 4320 (0.0007) [2023-03-07 09:49:48,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12746.2). Total num frames: 4429824. Throughput: 0: 12788.2. Samples: 4416325. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:49:48,322][175405] Avg episode reward: [(0, '28.192')] [2023-03-07 09:49:48,326][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000004326_4429824.pth... [2023-03-07 09:49:48,355][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000001337_1369088.pth [2023-03-07 09:49:48,618][175731] Updated weights for policy 0, policy_version 4330 (0.0007) [2023-03-07 09:49:49,411][175731] Updated weights for policy 0, policy_version 4340 (0.0007) [2023-03-07 09:49:50,228][175731] Updated weights for policy 0, policy_version 4350 (0.0006) [2023-03-07 09:49:51,018][175731] Updated weights for policy 0, policy_version 4360 (0.0006) [2023-03-07 09:49:51,818][175731] Updated weights for policy 0, policy_version 4370 (0.0006) [2023-03-07 09:49:52,613][175731] Updated weights for policy 0, policy_version 4380 (0.0006) [2023-03-07 09:49:53,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12782.9, 300 sec: 12749.7). Total num frames: 4493312. Throughput: 0: 12785.0. Samples: 4493174. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:49:53,322][175405] Avg episode reward: [(0, '28.961')] [2023-03-07 09:49:53,398][175731] Updated weights for policy 0, policy_version 4390 (0.0006) [2023-03-07 09:49:54,204][175731] Updated weights for policy 0, policy_version 4400 (0.0007) [2023-03-07 09:49:55,025][175731] Updated weights for policy 0, policy_version 4410 (0.0006) [2023-03-07 09:49:55,807][175731] Updated weights for policy 0, policy_version 4420 (0.0006) [2023-03-07 09:49:56,597][175731] Updated weights for policy 0, policy_version 4430 (0.0006) [2023-03-07 09:49:57,405][175731] Updated weights for policy 0, policy_version 4440 (0.0007) [2023-03-07 09:49:58,198][175731] Updated weights for policy 0, policy_version 4450 (0.0006) [2023-03-07 09:49:58,321][175405] Fps is (10 sec: 12800.2, 60 sec: 12800.0, 300 sec: 12749.7). Total num frames: 4557824. Throughput: 0: 12790.4. Samples: 4531681. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:49:58,332][175405] Avg episode reward: [(0, '31.844')] [2023-03-07 09:49:59,010][175731] Updated weights for policy 0, policy_version 4460 (0.0007) [2023-03-07 09:49:59,805][175731] Updated weights for policy 0, policy_version 4470 (0.0007) [2023-03-07 09:50:00,604][175731] Updated weights for policy 0, policy_version 4480 (0.0006) [2023-03-07 09:50:01,393][175731] Updated weights for policy 0, policy_version 4490 (0.0006) [2023-03-07 09:50:02,183][175731] Updated weights for policy 0, policy_version 4500 (0.0007) [2023-03-07 09:50:02,997][175731] Updated weights for policy 0, policy_version 4510 (0.0006) [2023-03-07 09:50:03,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12800.0, 300 sec: 12753.1). Total num frames: 4622336. Throughput: 0: 12802.4. Samples: 4608653. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:50:03,332][175405] Avg episode reward: [(0, '29.274')] [2023-03-07 09:50:03,791][175731] Updated weights for policy 0, policy_version 4520 (0.0007) [2023-03-07 09:50:04,594][175731] Updated weights for policy 0, policy_version 4530 (0.0006) [2023-03-07 09:50:05,379][175731] Updated weights for policy 0, policy_version 4540 (0.0006) [2023-03-07 09:50:06,185][175731] Updated weights for policy 0, policy_version 4550 (0.0006) [2023-03-07 09:50:06,983][175731] Updated weights for policy 0, policy_version 4560 (0.0007) [2023-03-07 09:50:07,811][175731] Updated weights for policy 0, policy_version 4570 (0.0006) [2023-03-07 09:50:08,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12756.6). Total num frames: 4685824. Throughput: 0: 12799.1. Samples: 4685333. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:50:08,332][175405] Avg episode reward: [(0, '28.954')] [2023-03-07 09:50:08,597][175731] Updated weights for policy 0, policy_version 4580 (0.0007) [2023-03-07 09:50:09,416][175731] Updated weights for policy 0, policy_version 4590 (0.0006) [2023-03-07 09:50:10,202][175731] Updated weights for policy 0, policy_version 4600 (0.0006) [2023-03-07 09:50:10,989][175731] Updated weights for policy 0, policy_version 4610 (0.0006) [2023-03-07 09:50:11,789][175731] Updated weights for policy 0, policy_version 4620 (0.0006) [2023-03-07 09:50:12,566][175731] Updated weights for policy 0, policy_version 4630 (0.0006) [2023-03-07 09:50:13,327][175405] Fps is (10 sec: 12792.7, 60 sec: 12798.8, 300 sec: 12759.8). Total num frames: 4750336. Throughput: 0: 12800.3. Samples: 4723771. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:50:13,334][175405] Avg episode reward: [(0, '28.643')] [2023-03-07 09:50:13,381][175731] Updated weights for policy 0, policy_version 4640 (0.0006) [2023-03-07 09:50:14,185][175731] Updated weights for policy 0, policy_version 4650 (0.0006) [2023-03-07 09:50:15,006][175731] Updated weights for policy 0, policy_version 4660 (0.0007) [2023-03-07 09:50:15,794][175731] Updated weights for policy 0, policy_version 4670 (0.0006) [2023-03-07 09:50:16,584][175731] Updated weights for policy 0, policy_version 4680 (0.0006) [2023-03-07 09:50:17,398][175731] Updated weights for policy 0, policy_version 4690 (0.0006) [2023-03-07 09:50:18,190][175731] Updated weights for policy 0, policy_version 4700 (0.0006) [2023-03-07 09:50:18,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12800.0, 300 sec: 12756.6). Total num frames: 4813824. Throughput: 0: 12799.6. Samples: 4800600. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:50:18,332][175405] Avg episode reward: [(0, '28.166')] [2023-03-07 09:50:18,992][175731] Updated weights for policy 0, policy_version 4710 (0.0006) [2023-03-07 09:50:19,801][175731] Updated weights for policy 0, policy_version 4720 (0.0006) [2023-03-07 09:50:20,603][175731] Updated weights for policy 0, policy_version 4730 (0.0007) [2023-03-07 09:50:21,398][175731] Updated weights for policy 0, policy_version 4740 (0.0007) [2023-03-07 09:50:22,204][175731] Updated weights for policy 0, policy_version 4750 (0.0006) [2023-03-07 09:50:23,004][175731] Updated weights for policy 0, policy_version 4760 (0.0005) [2023-03-07 09:50:23,321][175405] Fps is (10 sec: 12704.8, 60 sec: 12800.0, 300 sec: 12756.6). Total num frames: 4877312. Throughput: 0: 12799.8. Samples: 4877234. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:50:23,322][175405] Avg episode reward: [(0, '28.359')] [2023-03-07 09:50:23,797][175731] Updated weights for policy 0, policy_version 4770 (0.0006) [2023-03-07 09:50:24,609][175731] Updated weights for policy 0, policy_version 4780 (0.0007) [2023-03-07 09:50:25,409][175731] Updated weights for policy 0, policy_version 4790 (0.0007) [2023-03-07 09:50:26,195][175731] Updated weights for policy 0, policy_version 4800 (0.0006) [2023-03-07 09:50:26,996][175731] Updated weights for policy 0, policy_version 4810 (0.0006) [2023-03-07 09:50:27,789][175731] Updated weights for policy 0, policy_version 4820 (0.0006) [2023-03-07 09:50:28,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12760.1). Total num frames: 4941824. Throughput: 0: 12810.4. Samples: 4915841. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:50:28,322][175405] Avg episode reward: [(0, '26.804')] [2023-03-07 09:50:28,599][175731] Updated weights for policy 0, policy_version 4830 (0.0006) [2023-03-07 09:50:29,398][175731] Updated weights for policy 0, policy_version 4840 (0.0006) [2023-03-07 09:50:30,200][175731] Updated weights for policy 0, policy_version 4850 (0.0006) [2023-03-07 09:50:30,974][175731] Updated weights for policy 0, policy_version 4860 (0.0006) [2023-03-07 09:50:31,785][175731] Updated weights for policy 0, policy_version 4870 (0.0006) [2023-03-07 09:50:32,570][175731] Updated weights for policy 0, policy_version 4880 (0.0007) [2023-03-07 09:50:33,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12800.0, 300 sec: 12763.6). Total num frames: 5006336. Throughput: 0: 12812.6. Samples: 4992892. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:50:33,321][175405] Avg episode reward: [(0, '28.664')] [2023-03-07 09:50:33,352][175731] Updated weights for policy 0, policy_version 4890 (0.0006) [2023-03-07 09:50:34,133][175731] Updated weights for policy 0, policy_version 4900 (0.0006) [2023-03-07 09:50:34,957][175731] Updated weights for policy 0, policy_version 4910 (0.0007) [2023-03-07 09:50:35,749][175731] Updated weights for policy 0, policy_version 4920 (0.0006) [2023-03-07 09:50:36,522][175731] Updated weights for policy 0, policy_version 4930 (0.0007) [2023-03-07 09:50:37,343][175731] Updated weights for policy 0, policy_version 4940 (0.0007) [2023-03-07 09:50:38,153][175731] Updated weights for policy 0, policy_version 4950 (0.0007) [2023-03-07 09:50:38,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12817.1, 300 sec: 12763.6). Total num frames: 5070848. Throughput: 0: 12821.9. Samples: 5070158. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:50:38,322][175405] Avg episode reward: [(0, '26.062')] [2023-03-07 09:50:38,954][175731] Updated weights for policy 0, policy_version 4960 (0.0006) [2023-03-07 09:50:39,741][175731] Updated weights for policy 0, policy_version 4970 (0.0006) [2023-03-07 09:50:40,550][175731] Updated weights for policy 0, policy_version 4980 (0.0006) [2023-03-07 09:50:41,365][175731] Updated weights for policy 0, policy_version 4990 (0.0007) [2023-03-07 09:50:42,169][175731] Updated weights for policy 0, policy_version 5000 (0.0007) [2023-03-07 09:50:42,957][175731] Updated weights for policy 0, policy_version 5010 (0.0006) [2023-03-07 09:50:43,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12817.1, 300 sec: 12763.5). Total num frames: 5134336. Throughput: 0: 12816.0. Samples: 5108404. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:50:43,322][175405] Avg episode reward: [(0, '26.867')] [2023-03-07 09:50:43,737][175731] Updated weights for policy 0, policy_version 5020 (0.0006) [2023-03-07 09:50:44,546][175731] Updated weights for policy 0, policy_version 5030 (0.0006) [2023-03-07 09:50:45,343][175731] Updated weights for policy 0, policy_version 5040 (0.0007) [2023-03-07 09:50:46,140][175731] Updated weights for policy 0, policy_version 5050 (0.0007) [2023-03-07 09:50:46,928][175731] Updated weights for policy 0, policy_version 5060 (0.0006) [2023-03-07 09:50:47,740][175731] Updated weights for policy 0, policy_version 5070 (0.0007) [2023-03-07 09:50:48,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12767.0). Total num frames: 5198848. Throughput: 0: 12816.0. Samples: 5185372. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:50:48,322][175405] Avg episode reward: [(0, '27.688')] [2023-03-07 09:50:48,524][175731] Updated weights for policy 0, policy_version 5080 (0.0007) [2023-03-07 09:50:49,342][175731] Updated weights for policy 0, policy_version 5090 (0.0007) [2023-03-07 09:50:50,131][175731] Updated weights for policy 0, policy_version 5100 (0.0007) [2023-03-07 09:50:50,940][175731] Updated weights for policy 0, policy_version 5110 (0.0006) [2023-03-07 09:50:51,727][175731] Updated weights for policy 0, policy_version 5120 (0.0007) [2023-03-07 09:50:52,523][175731] Updated weights for policy 0, policy_version 5130 (0.0006) [2023-03-07 09:50:53,307][175731] Updated weights for policy 0, policy_version 5140 (0.0006) [2023-03-07 09:50:53,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12834.1, 300 sec: 12767.0). Total num frames: 5263360. Throughput: 0: 12823.6. Samples: 5262396. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:50:53,322][175405] Avg episode reward: [(0, '26.503')] [2023-03-07 09:50:54,110][175731] Updated weights for policy 0, policy_version 5150 (0.0006) [2023-03-07 09:50:54,913][175731] Updated weights for policy 0, policy_version 5160 (0.0007) [2023-03-07 09:50:55,701][175731] Updated weights for policy 0, policy_version 5170 (0.0007) [2023-03-07 09:50:56,506][175731] Updated weights for policy 0, policy_version 5180 (0.0006) [2023-03-07 09:50:57,297][175731] Updated weights for policy 0, policy_version 5190 (0.0007) [2023-03-07 09:50:58,106][175731] Updated weights for policy 0, policy_version 5200 (0.0006) [2023-03-07 09:50:58,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12767.0). Total num frames: 5326848. Throughput: 0: 12825.4. Samples: 5300841. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:50:58,332][175405] Avg episode reward: [(0, '26.846')] [2023-03-07 09:50:58,905][175731] Updated weights for policy 0, policy_version 5210 (0.0006) [2023-03-07 09:50:59,703][175731] Updated weights for policy 0, policy_version 5220 (0.0007) [2023-03-07 09:51:00,501][175731] Updated weights for policy 0, policy_version 5230 (0.0006) [2023-03-07 09:51:01,292][175731] Updated weights for policy 0, policy_version 5240 (0.0006) [2023-03-07 09:51:02,097][175731] Updated weights for policy 0, policy_version 5250 (0.0007) [2023-03-07 09:51:02,886][175731] Updated weights for policy 0, policy_version 5260 (0.0006) [2023-03-07 09:51:03,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12774.0). Total num frames: 5391360. Throughput: 0: 12828.7. Samples: 5377888. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:51:03,332][175405] Avg episode reward: [(0, '30.346')] [2023-03-07 09:51:03,691][175731] Updated weights for policy 0, policy_version 5270 (0.0007) [2023-03-07 09:51:04,502][175731] Updated weights for policy 0, policy_version 5280 (0.0006) [2023-03-07 09:51:05,297][175731] Updated weights for policy 0, policy_version 5290 (0.0006) [2023-03-07 09:51:06,085][175731] Updated weights for policy 0, policy_version 5300 (0.0007) [2023-03-07 09:51:06,884][175731] Updated weights for policy 0, policy_version 5310 (0.0008) [2023-03-07 09:51:07,682][175731] Updated weights for policy 0, policy_version 5320 (0.0007) [2023-03-07 09:51:08,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12834.1, 300 sec: 12774.0). Total num frames: 5455872. Throughput: 0: 12836.2. Samples: 5454861. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:51:08,332][175405] Avg episode reward: [(0, '28.772')] [2023-03-07 09:51:08,493][175731] Updated weights for policy 0, policy_version 5330 (0.0008) [2023-03-07 09:51:09,273][175731] Updated weights for policy 0, policy_version 5340 (0.0007) [2023-03-07 09:51:10,081][175731] Updated weights for policy 0, policy_version 5350 (0.0006) [2023-03-07 09:51:10,884][175731] Updated weights for policy 0, policy_version 5360 (0.0007) [2023-03-07 09:51:11,670][175731] Updated weights for policy 0, policy_version 5370 (0.0005) [2023-03-07 09:51:12,469][175731] Updated weights for policy 0, policy_version 5380 (0.0007) [2023-03-07 09:51:13,265][175731] Updated weights for policy 0, policy_version 5390 (0.0007) [2023-03-07 09:51:13,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12818.3, 300 sec: 12774.0). Total num frames: 5519360. Throughput: 0: 12829.3. Samples: 5493159. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 09:51:13,333][175405] Avg episode reward: [(0, '30.068')] [2023-03-07 09:51:14,064][175731] Updated weights for policy 0, policy_version 5400 (0.0006) [2023-03-07 09:51:14,867][175731] Updated weights for policy 0, policy_version 5410 (0.0008) [2023-03-07 09:51:15,668][175731] Updated weights for policy 0, policy_version 5420 (0.0007) [2023-03-07 09:51:16,461][175731] Updated weights for policy 0, policy_version 5430 (0.0006) [2023-03-07 09:51:17,294][175731] Updated weights for policy 0, policy_version 5440 (0.0007) [2023-03-07 09:51:18,073][175731] Updated weights for policy 0, policy_version 5450 (0.0006) [2023-03-07 09:51:18,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.2, 300 sec: 12777.4). Total num frames: 5583872. Throughput: 0: 12826.9. Samples: 5570100. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:51:18,321][175405] Avg episode reward: [(0, '27.578')] [2023-03-07 09:51:18,855][175731] Updated weights for policy 0, policy_version 5460 (0.0007) [2023-03-07 09:51:19,675][175731] Updated weights for policy 0, policy_version 5470 (0.0007) [2023-03-07 09:51:20,473][175731] Updated weights for policy 0, policy_version 5480 (0.0007) [2023-03-07 09:51:21,260][175731] Updated weights for policy 0, policy_version 5490 (0.0006) [2023-03-07 09:51:22,078][175731] Updated weights for policy 0, policy_version 5500 (0.0006) [2023-03-07 09:51:22,878][175731] Updated weights for policy 0, policy_version 5510 (0.0006) [2023-03-07 09:51:23,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12834.1, 300 sec: 12777.4). Total num frames: 5647360. Throughput: 0: 12819.4. Samples: 5647033. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:51:23,332][175405] Avg episode reward: [(0, '28.688')] [2023-03-07 09:51:23,671][175731] Updated weights for policy 0, policy_version 5520 (0.0007) [2023-03-07 09:51:24,478][175731] Updated weights for policy 0, policy_version 5530 (0.0006) [2023-03-07 09:51:25,270][175731] Updated weights for policy 0, policy_version 5540 (0.0007) [2023-03-07 09:51:26,074][175731] Updated weights for policy 0, policy_version 5550 (0.0007) [2023-03-07 09:51:26,878][175731] Updated weights for policy 0, policy_version 5560 (0.0007) [2023-03-07 09:51:27,680][175731] Updated weights for policy 0, policy_version 5570 (0.0007) [2023-03-07 09:51:28,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12817.1, 300 sec: 12777.4). Total num frames: 5710848. Throughput: 0: 12819.3. Samples: 5685273. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:51:28,332][175405] Avg episode reward: [(0, '29.341')] [2023-03-07 09:51:28,481][175731] Updated weights for policy 0, policy_version 5580 (0.0006) [2023-03-07 09:51:29,279][175731] Updated weights for policy 0, policy_version 5590 (0.0007) [2023-03-07 09:51:30,100][175731] Updated weights for policy 0, policy_version 5600 (0.0008) [2023-03-07 09:51:30,898][175731] Updated weights for policy 0, policy_version 5610 (0.0007) [2023-03-07 09:51:31,697][175731] Updated weights for policy 0, policy_version 5620 (0.0007) [2023-03-07 09:51:32,506][175731] Updated weights for policy 0, policy_version 5630 (0.0006) [2023-03-07 09:51:33,289][175731] Updated weights for policy 0, policy_version 5640 (0.0006) [2023-03-07 09:51:33,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12780.9). Total num frames: 5775360. Throughput: 0: 12809.3. Samples: 5761792. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:51:33,332][175405] Avg episode reward: [(0, '30.125')] [2023-03-07 09:51:34,075][175731] Updated weights for policy 0, policy_version 5650 (0.0006) [2023-03-07 09:51:34,898][175731] Updated weights for policy 0, policy_version 5660 (0.0007) [2023-03-07 09:51:35,684][175731] Updated weights for policy 0, policy_version 5670 (0.0007) [2023-03-07 09:51:36,486][175731] Updated weights for policy 0, policy_version 5680 (0.0006) [2023-03-07 09:51:37,299][175731] Updated weights for policy 0, policy_version 5690 (0.0007) [2023-03-07 09:51:38,097][175731] Updated weights for policy 0, policy_version 5700 (0.0007) [2023-03-07 09:51:38,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12800.0, 300 sec: 12784.4). Total num frames: 5838848. Throughput: 0: 12806.5. Samples: 5838690. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:51:38,332][175405] Avg episode reward: [(0, '28.033')] [2023-03-07 09:51:38,905][175731] Updated weights for policy 0, policy_version 5710 (0.0006) [2023-03-07 09:51:39,705][175731] Updated weights for policy 0, policy_version 5720 (0.0007) [2023-03-07 09:51:40,501][175731] Updated weights for policy 0, policy_version 5730 (0.0007) [2023-03-07 09:51:41,306][175731] Updated weights for policy 0, policy_version 5740 (0.0007) [2023-03-07 09:51:42,104][175731] Updated weights for policy 0, policy_version 5750 (0.0007) [2023-03-07 09:51:42,900][175731] Updated weights for policy 0, policy_version 5760 (0.0006) [2023-03-07 09:51:43,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12787.9). Total num frames: 5903360. Throughput: 0: 12804.5. Samples: 5877045. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:51:43,332][175405] Avg episode reward: [(0, '29.769')] [2023-03-07 09:51:43,694][175731] Updated weights for policy 0, policy_version 5770 (0.0006) [2023-03-07 09:51:44,513][175731] Updated weights for policy 0, policy_version 5780 (0.0006) [2023-03-07 09:51:45,280][175731] Updated weights for policy 0, policy_version 5790 (0.0007) [2023-03-07 09:51:46,082][175731] Updated weights for policy 0, policy_version 5800 (0.0007) [2023-03-07 09:51:46,893][175731] Updated weights for policy 0, policy_version 5810 (0.0006) [2023-03-07 09:51:47,685][175731] Updated weights for policy 0, policy_version 5820 (0.0006) [2023-03-07 09:51:48,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12787.9). Total num frames: 5966848. Throughput: 0: 12800.1. Samples: 5953892. Policy #0 lag: (min: 0.0, avg: 1.4, max: 4.0) [2023-03-07 09:51:48,332][175405] Avg episode reward: [(0, '29.694')] [2023-03-07 09:51:48,336][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000005828_5967872.pth... 
[2023-03-07 09:51:48,365][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000002828_2895872.pth [2023-03-07 09:51:48,494][175731] Updated weights for policy 0, policy_version 5830 (0.0006) [2023-03-07 09:51:49,302][175731] Updated weights for policy 0, policy_version 5840 (0.0006) [2023-03-07 09:51:50,111][175731] Updated weights for policy 0, policy_version 5850 (0.0006) [2023-03-07 09:51:50,929][175731] Updated weights for policy 0, policy_version 5860 (0.0008) [2023-03-07 09:51:51,723][175731] Updated weights for policy 0, policy_version 5870 (0.0006) [2023-03-07 09:51:52,522][175731] Updated weights for policy 0, policy_version 5880 (0.0006) [2023-03-07 09:51:53,319][175731] Updated weights for policy 0, policy_version 5890 (0.0006) [2023-03-07 09:51:53,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12800.0, 300 sec: 12791.3). Total num frames: 6031360. Throughput: 0: 12788.7. Samples: 6030352. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:51:53,330][175405] Avg episode reward: [(0, '27.633')] [2023-03-07 09:51:54,118][175731] Updated weights for policy 0, policy_version 5900 (0.0006) [2023-03-07 09:51:54,919][175731] Updated weights for policy 0, policy_version 5910 (0.0006) [2023-03-07 09:51:55,726][175731] Updated weights for policy 0, policy_version 5920 (0.0006) [2023-03-07 09:51:56,530][175731] Updated weights for policy 0, policy_version 5930 (0.0006) [2023-03-07 09:51:57,334][175731] Updated weights for policy 0, policy_version 5940 (0.0007) [2023-03-07 09:51:58,121][175731] Updated weights for policy 0, policy_version 5950 (0.0007) [2023-03-07 09:51:58,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12800.0, 300 sec: 12791.3). Total num frames: 6094848. Throughput: 0: 12789.1. Samples: 6068666. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:51:58,332][175405] Avg episode reward: [(0, '26.910')] [2023-03-07 09:51:58,909][175731] Updated weights for policy 0, policy_version 5960 (0.0007) [2023-03-07 09:51:59,707][175731] Updated weights for policy 0, policy_version 5970 (0.0005) [2023-03-07 09:52:00,497][175731] Updated weights for policy 0, policy_version 5980 (0.0006) [2023-03-07 09:52:01,290][175731] Updated weights for policy 0, policy_version 5990 (0.0007) [2023-03-07 09:52:02,101][175731] Updated weights for policy 0, policy_version 6000 (0.0007) [2023-03-07 09:52:02,900][175731] Updated weights for policy 0, policy_version 6010 (0.0006) [2023-03-07 09:52:03,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12800.0, 300 sec: 12794.8). Total num frames: 6159360. Throughput: 0: 12793.9. Samples: 6145828. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:52:03,332][175405] Avg episode reward: [(0, '29.764')] [2023-03-07 09:52:03,699][175731] Updated weights for policy 0, policy_version 6020 (0.0007) [2023-03-07 09:52:04,494][175731] Updated weights for policy 0, policy_version 6030 (0.0006) [2023-03-07 09:52:05,293][175731] Updated weights for policy 0, policy_version 6040 (0.0007) [2023-03-07 09:52:06,094][175731] Updated weights for policy 0, policy_version 6050 (0.0006) [2023-03-07 09:52:06,902][175731] Updated weights for policy 0, policy_version 6060 (0.0007) [2023-03-07 09:52:07,689][175731] Updated weights for policy 0, policy_version 6070 (0.0006) [2023-03-07 09:52:08,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12782.9, 300 sec: 12791.3). Total num frames: 6222848. Throughput: 0: 12796.7. Samples: 6222882. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:52:08,322][175405] Avg episode reward: [(0, '30.792')] [2023-03-07 09:52:08,494][175731] Updated weights for policy 0, policy_version 6080 (0.0006) [2023-03-07 09:52:09,279][175731] Updated weights for policy 0, policy_version 6090 (0.0007) [2023-03-07 09:52:10,085][175731] Updated weights for policy 0, policy_version 6100 (0.0006) [2023-03-07 09:52:10,873][175731] Updated weights for policy 0, policy_version 6110 (0.0006) [2023-03-07 09:52:11,673][175731] Updated weights for policy 0, policy_version 6120 (0.0007) [2023-03-07 09:52:12,474][175731] Updated weights for policy 0, policy_version 6130 (0.0005) [2023-03-07 09:52:13,264][175731] Updated weights for policy 0, policy_version 6140 (0.0006) [2023-03-07 09:52:13,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12800.0, 300 sec: 12794.8). Total num frames: 6287360. Throughput: 0: 12802.1. Samples: 6261367. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:52:13,322][175405] Avg episode reward: [(0, '28.460')] [2023-03-07 09:52:14,069][175731] Updated weights for policy 0, policy_version 6150 (0.0007) [2023-03-07 09:52:14,879][175731] Updated weights for policy 0, policy_version 6160 (0.0006) [2023-03-07 09:52:15,669][175731] Updated weights for policy 0, policy_version 6170 (0.0007) [2023-03-07 09:52:16,474][175731] Updated weights for policy 0, policy_version 6180 (0.0006) [2023-03-07 09:52:17,261][175731] Updated weights for policy 0, policy_version 6190 (0.0006) [2023-03-07 09:52:18,068][175731] Updated weights for policy 0, policy_version 6200 (0.0006) [2023-03-07 09:52:18,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12800.0, 300 sec: 12798.3). Total num frames: 6351872. Throughput: 0: 12811.3. Samples: 6338303. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:52:18,322][175405] Avg episode reward: [(0, '28.657')] [2023-03-07 09:52:18,890][175731] Updated weights for policy 0, policy_version 6210 (0.0006) [2023-03-07 09:52:19,676][175731] Updated weights for policy 0, policy_version 6220 (0.0007) [2023-03-07 09:52:20,482][175731] Updated weights for policy 0, policy_version 6230 (0.0006) [2023-03-07 09:52:21,288][175731] Updated weights for policy 0, policy_version 6240 (0.0007) [2023-03-07 09:52:22,083][175731] Updated weights for policy 0, policy_version 6250 (0.0007) [2023-03-07 09:52:22,876][175731] Updated weights for policy 0, policy_version 6260 (0.0006) [2023-03-07 09:52:23,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12800.0, 300 sec: 12798.3). Total num frames: 6415360. Throughput: 0: 12807.4. Samples: 6415022. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:52:23,321][175405] Avg episode reward: [(0, '27.683')] [2023-03-07 09:52:23,675][175731] Updated weights for policy 0, policy_version 6270 (0.0006) [2023-03-07 09:52:24,464][175731] Updated weights for policy 0, policy_version 6280 (0.0007) [2023-03-07 09:52:25,263][175731] Updated weights for policy 0, policy_version 6290 (0.0007) [2023-03-07 09:52:26,058][175731] Updated weights for policy 0, policy_version 6300 (0.0007) [2023-03-07 09:52:26,842][175731] Updated weights for policy 0, policy_version 6310 (0.0007) [2023-03-07 09:52:27,642][175731] Updated weights for policy 0, policy_version 6320 (0.0006) [2023-03-07 09:52:28,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12801.7). Total num frames: 6479872. Throughput: 0: 12813.1. Samples: 6453637. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:52:28,321][175405] Avg episode reward: [(0, '26.694')] [2023-03-07 09:52:28,430][175731] Updated weights for policy 0, policy_version 6330 (0.0006) [2023-03-07 09:52:29,212][175731] Updated weights for policy 0, policy_version 6340 (0.0006) [2023-03-07 09:52:30,014][175731] Updated weights for policy 0, policy_version 6350 (0.0006) [2023-03-07 09:52:30,825][175731] Updated weights for policy 0, policy_version 6360 (0.0006) [2023-03-07 09:52:31,613][175731] Updated weights for policy 0, policy_version 6370 (0.0007) [2023-03-07 09:52:32,423][175731] Updated weights for policy 0, policy_version 6380 (0.0008) [2023-03-07 09:52:33,227][175731] Updated weights for policy 0, policy_version 6390 (0.0006) [2023-03-07 09:52:33,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12817.1, 300 sec: 12801.7). Total num frames: 6544384. Throughput: 0: 12820.9. Samples: 6530830. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:52:33,333][175405] Avg episode reward: [(0, '30.053')] [2023-03-07 09:52:34,040][175731] Updated weights for policy 0, policy_version 6400 (0.0007) [2023-03-07 09:52:34,833][175731] Updated weights for policy 0, policy_version 6410 (0.0007) [2023-03-07 09:52:35,641][175731] Updated weights for policy 0, policy_version 6420 (0.0006) [2023-03-07 09:52:36,438][175731] Updated weights for policy 0, policy_version 6430 (0.0006) [2023-03-07 09:52:37,246][175731] Updated weights for policy 0, policy_version 6440 (0.0007) [2023-03-07 09:52:38,047][175731] Updated weights for policy 0, policy_version 6450 (0.0006) [2023-03-07 09:52:38,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12801.7). Total num frames: 6607872. Throughput: 0: 12820.8. Samples: 6607290. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:52:38,332][175405] Avg episode reward: [(0, '28.750')] [2023-03-07 09:52:38,833][175731] Updated weights for policy 0, policy_version 6460 (0.0006) [2023-03-07 09:52:39,624][175731] Updated weights for policy 0, policy_version 6470 (0.0006) [2023-03-07 09:52:40,432][175731] Updated weights for policy 0, policy_version 6480 (0.0006) [2023-03-07 09:52:41,220][175731] Updated weights for policy 0, policy_version 6490 (0.0006) [2023-03-07 09:52:42,005][175731] Updated weights for policy 0, policy_version 6500 (0.0006) [2023-03-07 09:52:42,797][175731] Updated weights for policy 0, policy_version 6510 (0.0007) [2023-03-07 09:52:43,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12817.0, 300 sec: 12805.2). Total num frames: 6672384. Throughput: 0: 12829.9. Samples: 6646014. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:52:43,332][175405] Avg episode reward: [(0, '27.246')] [2023-03-07 09:52:43,605][175731] Updated weights for policy 0, policy_version 6520 (0.0008) [2023-03-07 09:52:44,404][175731] Updated weights for policy 0, policy_version 6530 (0.0006) [2023-03-07 09:52:45,202][175731] Updated weights for policy 0, policy_version 6540 (0.0006) [2023-03-07 09:52:46,011][175731] Updated weights for policy 0, policy_version 6550 (0.0006) [2023-03-07 09:52:46,812][175731] Updated weights for policy 0, policy_version 6560 (0.0007) [2023-03-07 09:52:47,599][175731] Updated weights for policy 0, policy_version 6570 (0.0006) [2023-03-07 09:52:48,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12834.1, 300 sec: 12808.7). Total num frames: 6736896. Throughput: 0: 12822.2. Samples: 6722827. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:52:48,332][175405] Avg episode reward: [(0, '28.434')] [2023-03-07 09:52:48,406][175731] Updated weights for policy 0, policy_version 6580 (0.0007) [2023-03-07 09:52:49,205][175731] Updated weights for policy 0, policy_version 6590 (0.0006) [2023-03-07 09:52:50,011][175731] Updated weights for policy 0, policy_version 6600 (0.0007) [2023-03-07 09:52:50,819][175731] Updated weights for policy 0, policy_version 6610 (0.0007) [2023-03-07 09:52:51,616][175731] Updated weights for policy 0, policy_version 6620 (0.0007) [2023-03-07 09:52:52,422][175731] Updated weights for policy 0, policy_version 6630 (0.0006) [2023-03-07 09:52:53,219][175731] Updated weights for policy 0, policy_version 6640 (0.0006) [2023-03-07 09:52:53,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.0, 300 sec: 12805.2). Total num frames: 6800384. Throughput: 0: 12818.4. Samples: 6799712. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:52:53,322][175405] Avg episode reward: [(0, '28.245')] [2023-03-07 09:52:54,015][175731] Updated weights for policy 0, policy_version 6650 (0.0006) [2023-03-07 09:52:54,824][175731] Updated weights for policy 0, policy_version 6660 (0.0007) [2023-03-07 09:52:55,629][175731] Updated weights for policy 0, policy_version 6670 (0.0006) [2023-03-07 09:52:56,400][175731] Updated weights for policy 0, policy_version 6680 (0.0006) [2023-03-07 09:52:57,199][175731] Updated weights for policy 0, policy_version 6690 (0.0007) [2023-03-07 09:52:58,005][175731] Updated weights for policy 0, policy_version 6700 (0.0006) [2023-03-07 09:52:58,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12817.0, 300 sec: 12808.7). Total num frames: 6863872. Throughput: 0: 12814.7. Samples: 6838030. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:52:58,326][175405] Avg episode reward: [(0, '30.044')] [2023-03-07 09:52:58,824][175731] Updated weights for policy 0, policy_version 6710 (0.0007) [2023-03-07 09:52:59,636][175731] Updated weights for policy 0, policy_version 6720 (0.0007) [2023-03-07 09:53:00,431][175731] Updated weights for policy 0, policy_version 6730 (0.0006) [2023-03-07 09:53:01,232][175731] Updated weights for policy 0, policy_version 6740 (0.0007) [2023-03-07 09:53:02,019][175731] Updated weights for policy 0, policy_version 6750 (0.0006) [2023-03-07 09:53:02,808][175731] Updated weights for policy 0, policy_version 6760 (0.0006) [2023-03-07 09:53:03,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12808.7). Total num frames: 6928384. Throughput: 0: 12813.0. Samples: 6914886. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:53:03,321][175405] Avg episode reward: [(0, '50.521')] [2023-03-07 09:53:03,623][175731] Updated weights for policy 0, policy_version 6770 (0.0006) [2023-03-07 09:53:04,434][175731] Updated weights for policy 0, policy_version 6780 (0.0007) [2023-03-07 09:53:05,227][175731] Updated weights for policy 0, policy_version 6790 (0.0006) [2023-03-07 09:53:06,008][175731] Updated weights for policy 0, policy_version 6800 (0.0006) [2023-03-07 09:53:06,805][175731] Updated weights for policy 0, policy_version 6810 (0.0006) [2023-03-07 09:53:07,594][175731] Updated weights for policy 0, policy_version 6820 (0.0006) [2023-03-07 09:53:08,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12834.1, 300 sec: 12812.2). Total num frames: 6992896. Throughput: 0: 12820.6. Samples: 6991949. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:53:08,321][175405] Avg episode reward: [(0, '29.128')] [2023-03-07 09:53:08,388][175731] Updated weights for policy 0, policy_version 6830 (0.0006) [2023-03-07 09:53:09,180][175731] Updated weights for policy 0, policy_version 6840 (0.0007) [2023-03-07 09:53:09,970][175731] Updated weights for policy 0, policy_version 6850 (0.0006) [2023-03-07 09:53:10,773][175731] Updated weights for policy 0, policy_version 6860 (0.0006) [2023-03-07 09:53:11,567][175731] Updated weights for policy 0, policy_version 6870 (0.0007) [2023-03-07 09:53:12,358][175731] Updated weights for policy 0, policy_version 6880 (0.0006) [2023-03-07 09:53:13,161][175731] Updated weights for policy 0, policy_version 6890 (0.0006) [2023-03-07 09:53:13,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12817.1, 300 sec: 12808.7). Total num frames: 7056384. Throughput: 0: 12821.6. Samples: 7030611. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:53:13,322][175405] Avg episode reward: [(0, '27.659')] [2023-03-07 09:53:13,973][175731] Updated weights for policy 0, policy_version 6900 (0.0008) [2023-03-07 09:53:14,771][175731] Updated weights for policy 0, policy_version 6910 (0.0006) [2023-03-07 09:53:15,565][175731] Updated weights for policy 0, policy_version 6920 (0.0006) [2023-03-07 09:53:16,363][175731] Updated weights for policy 0, policy_version 6930 (0.0006) [2023-03-07 09:53:17,173][175731] Updated weights for policy 0, policy_version 6940 (0.0007) [2023-03-07 09:53:17,964][175731] Updated weights for policy 0, policy_version 6950 (0.0006) [2023-03-07 09:53:18,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12817.1, 300 sec: 12812.1). Total num frames: 7120896. Throughput: 0: 12816.5. Samples: 7107575. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:53:18,322][175405] Avg episode reward: [(0, '28.742')] [2023-03-07 09:53:18,756][175731] Updated weights for policy 0, policy_version 6960 (0.0007) [2023-03-07 09:53:19,547][175731] Updated weights for policy 0, policy_version 6970 (0.0007) [2023-03-07 09:53:20,338][175731] Updated weights for policy 0, policy_version 6980 (0.0006) [2023-03-07 09:53:21,140][175731] Updated weights for policy 0, policy_version 6990 (0.0006) [2023-03-07 09:53:21,949][175731] Updated weights for policy 0, policy_version 7000 (0.0007) [2023-03-07 09:53:22,763][175731] Updated weights for policy 0, policy_version 7010 (0.0007) [2023-03-07 09:53:23,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12808.7). Total num frames: 7184384. Throughput: 0: 12824.9. Samples: 7184410. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:53:23,321][175405] Avg episode reward: [(0, '27.786')] [2023-03-07 09:53:23,572][175731] Updated weights for policy 0, policy_version 7020 (0.0006) [2023-03-07 09:53:24,365][175731] Updated weights for policy 0, policy_version 7030 (0.0007) [2023-03-07 09:53:25,190][175731] Updated weights for policy 0, policy_version 7040 (0.0007) [2023-03-07 09:53:25,974][175731] Updated weights for policy 0, policy_version 7050 (0.0006) [2023-03-07 09:53:26,780][175731] Updated weights for policy 0, policy_version 7060 (0.0007) [2023-03-07 09:53:27,581][175731] Updated weights for policy 0, policy_version 7070 (0.0006) [2023-03-07 09:53:28,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12808.7). Total num frames: 7248896. Throughput: 0: 12811.6. Samples: 7222535. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:53:28,321][175405] Avg episode reward: [(0, '27.361')] [2023-03-07 09:53:28,376][175731] Updated weights for policy 0, policy_version 7080 (0.0006) [2023-03-07 09:53:29,176][175731] Updated weights for policy 0, policy_version 7090 (0.0006) [2023-03-07 09:53:29,988][175731] Updated weights for policy 0, policy_version 7100 (0.0007) [2023-03-07 09:53:30,783][175731] Updated weights for policy 0, policy_version 7110 (0.0006) [2023-03-07 09:53:31,589][175731] Updated weights for policy 0, policy_version 7120 (0.0007) [2023-03-07 09:53:32,399][175731] Updated weights for policy 0, policy_version 7130 (0.0006) [2023-03-07 09:53:33,190][175731] Updated weights for policy 0, policy_version 7140 (0.0006) [2023-03-07 09:53:33,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12800.0, 300 sec: 12808.7). Total num frames: 7312384. Throughput: 0: 12807.6. Samples: 7299170. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 09:53:33,322][175405] Avg episode reward: [(0, '28.407')] [2023-03-07 09:53:34,006][175731] Updated weights for policy 0, policy_version 7150 (0.0007) [2023-03-07 09:53:34,782][175731] Updated weights for policy 0, policy_version 7160 (0.0007) [2023-03-07 09:53:35,566][175731] Updated weights for policy 0, policy_version 7170 (0.0006) [2023-03-07 09:53:36,381][175731] Updated weights for policy 0, policy_version 7180 (0.0006) [2023-03-07 09:53:37,165][175731] Updated weights for policy 0, policy_version 7190 (0.0006) [2023-03-07 09:53:37,977][175731] Updated weights for policy 0, policy_version 7200 (0.0006) [2023-03-07 09:53:38,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12817.1, 300 sec: 12808.7). Total num frames: 7376896. Throughput: 0: 12808.1. Samples: 7376075. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:53:38,322][175405] Avg episode reward: [(0, '27.419')] [2023-03-07 09:53:38,781][175731] Updated weights for policy 0, policy_version 7210 (0.0006) [2023-03-07 09:53:39,577][175731] Updated weights for policy 0, policy_version 7220 (0.0007) [2023-03-07 09:53:40,401][175731] Updated weights for policy 0, policy_version 7230 (0.0007) [2023-03-07 09:53:41,197][175731] Updated weights for policy 0, policy_version 7240 (0.0007) [2023-03-07 09:53:41,988][175731] Updated weights for policy 0, policy_version 7250 (0.0006) [2023-03-07 09:53:42,777][175731] Updated weights for policy 0, policy_version 7260 (0.0006) [2023-03-07 09:53:43,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12817.1, 300 sec: 12812.1). Total num frames: 7441408. Throughput: 0: 12809.3. Samples: 7414447. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:53:43,321][175405] Avg episode reward: [(0, '28.262')] [2023-03-07 09:53:43,569][175731] Updated weights for policy 0, policy_version 7270 (0.0007) [2023-03-07 09:53:44,370][175731] Updated weights for policy 0, policy_version 7280 (0.0007) [2023-03-07 09:53:45,153][175731] Updated weights for policy 0, policy_version 7290 (0.0006) [2023-03-07 09:53:45,965][175731] Updated weights for policy 0, policy_version 7300 (0.0007) [2023-03-07 09:53:46,742][175731] Updated weights for policy 0, policy_version 7310 (0.0006) [2023-03-07 09:53:47,549][175731] Updated weights for policy 0, policy_version 7320 (0.0006) [2023-03-07 09:53:48,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12800.0, 300 sec: 12808.7). Total num frames: 7504896. Throughput: 0: 12817.1. Samples: 7491659. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:53:48,322][175405] Avg episode reward: [(0, '28.841')] [2023-03-07 09:53:48,327][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000007329_7504896.pth... 
[2023-03-07 09:53:48,358][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000004326_4429824.pth [2023-03-07 09:53:48,377][175731] Updated weights for policy 0, policy_version 7330 (0.0006) [2023-03-07 09:53:49,147][175731] Updated weights for policy 0, policy_version 7340 (0.0006) [2023-03-07 09:53:49,941][175731] Updated weights for policy 0, policy_version 7350 (0.0007) [2023-03-07 09:53:50,759][175731] Updated weights for policy 0, policy_version 7360 (0.0006) [2023-03-07 09:53:51,570][175731] Updated weights for policy 0, policy_version 7370 (0.0007) [2023-03-07 09:53:52,351][175731] Updated weights for policy 0, policy_version 7380 (0.0007) [2023-03-07 09:53:53,166][175731] Updated weights for policy 0, policy_version 7390 (0.0006) [2023-03-07 09:53:53,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12812.2). Total num frames: 7569408. Throughput: 0: 12810.9. Samples: 7568439. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:53:53,321][175405] Avg episode reward: [(0, '27.479')] [2023-03-07 09:53:53,973][175731] Updated weights for policy 0, policy_version 7400 (0.0007) [2023-03-07 09:53:54,777][175731] Updated weights for policy 0, policy_version 7410 (0.0007) [2023-03-07 09:53:55,577][175731] Updated weights for policy 0, policy_version 7420 (0.0006) [2023-03-07 09:53:56,366][175731] Updated weights for policy 0, policy_version 7430 (0.0006) [2023-03-07 09:53:57,187][175731] Updated weights for policy 0, policy_version 7440 (0.0006) [2023-03-07 09:53:57,980][175731] Updated weights for policy 0, policy_version 7450 (0.0006) [2023-03-07 09:53:58,321][175405] Fps is (10 sec: 12800.2, 60 sec: 12817.1, 300 sec: 12808.7). Total num frames: 7632896. Throughput: 0: 12798.8. Samples: 7606558. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:53:58,321][175405] Avg episode reward: [(0, '30.981')] [2023-03-07 09:53:58,772][175731] Updated weights for policy 0, policy_version 7460 (0.0006) [2023-03-07 09:53:59,565][175731] Updated weights for policy 0, policy_version 7470 (0.0007) [2023-03-07 09:54:00,362][175731] Updated weights for policy 0, policy_version 7480 (0.0007) [2023-03-07 09:54:01,162][175731] Updated weights for policy 0, policy_version 7490 (0.0006) [2023-03-07 09:54:01,977][175731] Updated weights for policy 0, policy_version 7500 (0.0008) [2023-03-07 09:54:02,766][175731] Updated weights for policy 0, policy_version 7510 (0.0007) [2023-03-07 09:54:03,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12800.0, 300 sec: 12808.7). Total num frames: 7696384. Throughput: 0: 12794.0. Samples: 7683303. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:54:03,322][175405] Avg episode reward: [(0, '30.776')] [2023-03-07 09:54:03,575][175731] Updated weights for policy 0, policy_version 7520 (0.0007) [2023-03-07 09:54:04,370][175731] Updated weights for policy 0, policy_version 7530 (0.0006) [2023-03-07 09:54:04,438][175680] KL-divergence is very high: 141.8302 [2023-03-07 09:54:04,678][175680] KL-divergence is very high: 3169.7449 [2023-03-07 09:54:04,992][175680] KL-divergence is very high: 626.5637 [2023-03-07 09:54:05,145][175680] KL-divergence is very high: 1750.3737 [2023-03-07 09:54:05,154][175731] Updated weights for policy 0, policy_version 7540 (0.0007) [2023-03-07 09:54:05,238][175680] KL-divergence is very high: 418.7138 [2023-03-07 09:54:05,322][175680] KL-divergence is very high: 616.4938 [2023-03-07 09:54:05,967][175680] KL-divergence is very high: 124.7411 [2023-03-07 09:54:05,973][175731] Updated weights for policy 0, policy_version 7550 (0.0007) [2023-03-07 09:54:06,277][175680] KL-divergence is very high: 184.0284 [2023-03-07 09:54:06,431][175680] KL-divergence is very high: 105.8285
[2023-03-07 09:54:06,588][175680] KL-divergence is very high: 1444.8889 [2023-03-07 09:54:06,761][175731] Updated weights for policy 0, policy_version 7560 (0.0006) [2023-03-07 09:54:07,233][175680] KL-divergence is very high: 323.6562 [2023-03-07 09:54:07,390][175680] KL-divergence is very high: 199.0942 [2023-03-07 09:54:07,568][175731] Updated weights for policy 0, policy_version 7570 (0.0006) [2023-03-07 09:54:07,801][175680] KL-divergence is very high: 631.8397 [2023-03-07 09:54:07,873][175680] KL-divergence is very high: 161.2656 [2023-03-07 09:54:07,952][175680] KL-divergence is very high: 188.8569 [2023-03-07 09:54:08,113][175680] KL-divergence is very high: 331.7034 [2023-03-07 09:54:08,184][175680] KL-divergence is very high: 141.2439 [2023-03-07 09:54:08,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12800.0, 300 sec: 12808.7). Total num frames: 7760896. Throughput: 0: 12799.9. Samples: 7760407. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:54:08,322][175405] Avg episode reward: [(0, '28.723')] [2023-03-07 09:54:08,356][175731] Updated weights for policy 0, policy_version 7580 (0.0007) [2023-03-07 09:54:08,594][175680] KL-divergence is very high: 132.5317 [2023-03-07 09:54:09,166][175731] Updated weights for policy 0, policy_version 7590 (0.0006) [2023-03-07 09:54:09,950][175731] Updated weights for policy 0, policy_version 7600 (0.0006) [2023-03-07 09:54:10,748][175731] Updated weights for policy 0, policy_version 7610 (0.0007) [2023-03-07 09:54:11,545][175731] Updated weights for policy 0, policy_version 7620 (0.0008) [2023-03-07 09:54:12,341][175731] Updated weights for policy 0, policy_version 7630 (0.0007) [2023-03-07 09:54:13,144][175731] Updated weights for policy 0, policy_version 7640 (0.0007) [2023-03-07 09:54:13,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12817.1, 300 sec: 12812.2). Total num frames: 7825408. Throughput: 0: 12810.2. Samples: 7798995. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:54:13,322][175405] Avg episode reward: [(0, '29.304')] [2023-03-07 09:54:13,937][175731] Updated weights for policy 0, policy_version 7650 (0.0006) [2023-03-07 09:54:14,731][175731] Updated weights for policy 0, policy_version 7660 (0.0006) [2023-03-07 09:54:15,525][175731] Updated weights for policy 0, policy_version 7670 (0.0006) [2023-03-07 09:54:16,314][175731] Updated weights for policy 0, policy_version 7680 (0.0007) [2023-03-07 09:54:17,112][175731] Updated weights for policy 0, policy_version 7690 (0.0007) [2023-03-07 09:54:17,911][175731] Updated weights for policy 0, policy_version 7700 (0.0006) [2023-03-07 09:54:18,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12817.1, 300 sec: 12815.6). Total num frames: 7889920. Throughput: 0: 12820.3. Samples: 7876084. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:54:18,322][175405] Avg episode reward: [(0, '25.307')] [2023-03-07 09:54:18,724][175731] Updated weights for policy 0, policy_version 7710 (0.0007) [2023-03-07 09:54:19,521][175731] Updated weights for policy 0, policy_version 7720 (0.0007) [2023-03-07 09:54:20,327][175731] Updated weights for policy 0, policy_version 7730 (0.0007) [2023-03-07 09:54:21,139][175731] Updated weights for policy 0, policy_version 7740 (0.0007) [2023-03-07 09:54:21,933][175731] Updated weights for policy 0, policy_version 7750 (0.0007) [2023-03-07 09:54:22,734][175731] Updated weights for policy 0, policy_version 7760 (0.0008) [2023-03-07 09:54:23,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12817.1, 300 sec: 12812.1). Total num frames: 7953408. Throughput: 0: 12810.1. Samples: 7952529. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:54:23,322][175405] Avg episode reward: [(0, '27.306')] [2023-03-07 09:54:23,542][175731] Updated weights for policy 0, policy_version 7770 (0.0006) [2023-03-07 09:54:24,342][175731] Updated weights for policy 0, policy_version 7780 (0.0007) [2023-03-07 09:54:25,153][175731] Updated weights for policy 0, policy_version 7790 (0.0006) [2023-03-07 09:54:25,969][175731] Updated weights for policy 0, policy_version 7800 (0.0006) [2023-03-07 09:54:26,754][175731] Updated weights for policy 0, policy_version 7810 (0.0006) [2023-03-07 09:54:27,557][175731] Updated weights for policy 0, policy_version 7820 (0.0005) [2023-03-07 09:54:28,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12800.0, 300 sec: 12808.7). Total num frames: 8016896. Throughput: 0: 12809.0. Samples: 7990852. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:54:28,322][175405] Avg episode reward: [(0, '31.614')] [2023-03-07 09:54:28,370][175731] Updated weights for policy 0, policy_version 7830 (0.0006) [2023-03-07 09:54:29,162][175731] Updated weights for policy 0, policy_version 7840 (0.0006) [2023-03-07 09:54:29,964][175731] Updated weights for policy 0, policy_version 7850 (0.0006) [2023-03-07 09:54:30,742][175731] Updated weights for policy 0, policy_version 7860 (0.0006) [2023-03-07 09:54:31,536][175731] Updated weights for policy 0, policy_version 7870 (0.0006) [2023-03-07 09:54:32,341][175731] Updated weights for policy 0, policy_version 7880 (0.0006) [2023-03-07 09:54:33,152][175731] Updated weights for policy 0, policy_version 7890 (0.0006) [2023-03-07 09:54:33,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12812.1). Total num frames: 8081408. Throughput: 0: 12805.1. Samples: 8067888. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:54:33,322][175405] Avg episode reward: [(0, '30.015')] [2023-03-07 09:54:33,950][175731] Updated weights for policy 0, policy_version 7900 (0.0007) [2023-03-07 09:54:34,753][175731] Updated weights for policy 0, policy_version 7910 (0.0006) [2023-03-07 09:54:35,549][175731] Updated weights for policy 0, policy_version 7920 (0.0007) [2023-03-07 09:54:36,331][175731] Updated weights for policy 0, policy_version 7930 (0.0006) [2023-03-07 09:54:37,128][175731] Updated weights for policy 0, policy_version 7940 (0.0006) [2023-03-07 09:54:37,934][175731] Updated weights for policy 0, policy_version 7950 (0.0006) [2023-03-07 09:54:38,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12800.0, 300 sec: 12812.1). Total num frames: 8144896. Throughput: 0: 12803.7. Samples: 8144609. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:54:38,322][175405] Avg episode reward: [(0, '28.424')] [2023-03-07 09:54:38,734][175731] Updated weights for policy 0, policy_version 7960 (0.0006) [2023-03-07 09:54:39,518][175731] Updated weights for policy 0, policy_version 7970 (0.0006) [2023-03-07 09:54:40,316][175731] Updated weights for policy 0, policy_version 7980 (0.0006) [2023-03-07 09:54:41,117][175731] Updated weights for policy 0, policy_version 7990 (0.0006) [2023-03-07 09:54:41,913][175731] Updated weights for policy 0, policy_version 8000 (0.0007) [2023-03-07 09:54:42,719][175731] Updated weights for policy 0, policy_version 8010 (0.0007) [2023-03-07 09:54:43,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12800.0, 300 sec: 12812.2). Total num frames: 8209408. Throughput: 0: 12814.5. Samples: 8183211. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:54:43,321][175405] Avg episode reward: [(0, '30.515')] [2023-03-07 09:54:43,538][175731] Updated weights for policy 0, policy_version 8020 (0.0006) [2023-03-07 09:54:44,320][175731] Updated weights for policy 0, policy_version 8030 (0.0006) [2023-03-07 09:54:45,119][175731] Updated weights for policy 0, policy_version 8040 (0.0006) [2023-03-07 09:54:45,908][175731] Updated weights for policy 0, policy_version 8050 (0.0006) [2023-03-07 09:54:46,712][175731] Updated weights for policy 0, policy_version 8060 (0.0006) [2023-03-07 09:54:47,503][175731] Updated weights for policy 0, policy_version 8070 (0.0006) [2023-03-07 09:54:48,305][175731] Updated weights for policy 0, policy_version 8080 (0.0007) [2023-03-07 09:54:48,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12817.1, 300 sec: 12815.6). Total num frames: 8273920. Throughput: 0: 12824.1. Samples: 8260386. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:54:48,321][175405] Avg episode reward: [(0, '28.326')] [2023-03-07 09:54:49,093][175731] Updated weights for policy 0, policy_version 8090 (0.0006) [2023-03-07 09:54:49,884][175731] Updated weights for policy 0, policy_version 8100 (0.0006) [2023-03-07 09:54:50,681][175731] Updated weights for policy 0, policy_version 8110 (0.0007) [2023-03-07 09:54:51,484][175731] Updated weights for policy 0, policy_version 8120 (0.0006) [2023-03-07 09:54:52,270][175731] Updated weights for policy 0, policy_version 8130 (0.0007) [2023-03-07 09:54:53,082][175731] Updated weights for policy 0, policy_version 8140 (0.0006) [2023-03-07 09:54:53,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12812.1). Total num frames: 8337408. Throughput: 0: 12820.8. Samples: 8337341. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:54:53,322][175405] Avg episode reward: [(0, '25.922')] [2023-03-07 09:54:53,881][175731] Updated weights for policy 0, policy_version 8150 (0.0007) [2023-03-07 09:54:54,686][175731] Updated weights for policy 0, policy_version 8160 (0.0006) [2023-03-07 09:54:55,487][175731] Updated weights for policy 0, policy_version 8170 (0.0006) [2023-03-07 09:54:56,275][175731] Updated weights for policy 0, policy_version 8180 (0.0006) [2023-03-07 09:54:57,081][175731] Updated weights for policy 0, policy_version 8190 (0.0006) [2023-03-07 09:54:57,875][175731] Updated weights for policy 0, policy_version 8200 (0.0007) [2023-03-07 09:54:58,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12817.0, 300 sec: 12812.1). Total num frames: 8401920. Throughput: 0: 12818.5. Samples: 8375826. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:54:58,322][175405] Avg episode reward: [(0, '28.201')] [2023-03-07 09:54:58,658][175731] Updated weights for policy 0, policy_version 8210 (0.0006) [2023-03-07 09:54:59,465][175731] Updated weights for policy 0, policy_version 8220 (0.0006) [2023-03-07 09:55:00,262][175731] Updated weights for policy 0, policy_version 8230 (0.0007) [2023-03-07 09:55:01,068][175731] Updated weights for policy 0, policy_version 8240 (0.0007) [2023-03-07 09:55:01,876][175731] Updated weights for policy 0, policy_version 8250 (0.0007) [2023-03-07 09:55:02,672][175731] Updated weights for policy 0, policy_version 8260 (0.0007) [2023-03-07 09:55:03,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12834.1, 300 sec: 12815.6). Total num frames: 8466432. Throughput: 0: 12816.8. Samples: 8452838. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:55:03,321][175405] Avg episode reward: [(0, '27.727')] [2023-03-07 09:55:03,462][175731] Updated weights for policy 0, policy_version 8270 (0.0007) [2023-03-07 09:55:04,251][175731] Updated weights for policy 0, policy_version 8280 (0.0007) [2023-03-07 09:55:05,052][175731] Updated weights for policy 0, policy_version 8290 (0.0006) [2023-03-07 09:55:05,861][175731] Updated weights for policy 0, policy_version 8300 (0.0008) [2023-03-07 09:55:06,646][175731] Updated weights for policy 0, policy_version 8310 (0.0006) [2023-03-07 09:55:07,444][175731] Updated weights for policy 0, policy_version 8320 (0.0007) [2023-03-07 09:55:08,254][175731] Updated weights for policy 0, policy_version 8330 (0.0006) [2023-03-07 09:55:08,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12812.4). Total num frames: 8529920. Throughput: 0: 12829.9. Samples: 8529874. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:55:08,321][175405] Avg episode reward: [(0, '28.508')] [2023-03-07 09:55:09,053][175731] Updated weights for policy 0, policy_version 8340 (0.0006) [2023-03-07 09:55:09,829][175731] Updated weights for policy 0, policy_version 8350 (0.0006) [2023-03-07 09:55:10,648][175731] Updated weights for policy 0, policy_version 8360 (0.0006) [2023-03-07 09:55:11,432][175731] Updated weights for policy 0, policy_version 8370 (0.0006) [2023-03-07 09:55:12,222][175731] Updated weights for policy 0, policy_version 8380 (0.0006) [2023-03-07 09:55:13,026][175731] Updated weights for policy 0, policy_version 8390 (0.0007) [2023-03-07 09:55:13,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12815.6). Total num frames: 8594432. Throughput: 0: 12833.1. Samples: 8568342. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:55:13,322][175405] Avg episode reward: [(0, '27.399')] [2023-03-07 09:55:13,833][175731] Updated weights for policy 0, policy_version 8400 (0.0007) [2023-03-07 09:55:14,619][175731] Updated weights for policy 0, policy_version 8410 (0.0007) [2023-03-07 09:55:15,431][175731] Updated weights for policy 0, policy_version 8420 (0.0007) [2023-03-07 09:55:16,223][175731] Updated weights for policy 0, policy_version 8430 (0.0006) [2023-03-07 09:55:17,004][175731] Updated weights for policy 0, policy_version 8440 (0.0007) [2023-03-07 09:55:17,808][175731] Updated weights for policy 0, policy_version 8450 (0.0006) [2023-03-07 09:55:18,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12817.1, 300 sec: 12819.1). Total num frames: 8658944. Throughput: 0: 12833.3. Samples: 8645384. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:55:18,322][175405] Avg episode reward: [(0, '28.875')] [2023-03-07 09:55:18,618][175731] Updated weights for policy 0, policy_version 8460 (0.0007) [2023-03-07 09:55:19,412][175731] Updated weights for policy 0, policy_version 8470 (0.0006) [2023-03-07 09:55:20,206][175731] Updated weights for policy 0, policy_version 8480 (0.0006) [2023-03-07 09:55:21,005][175731] Updated weights for policy 0, policy_version 8490 (0.0007) [2023-03-07 09:55:21,805][175731] Updated weights for policy 0, policy_version 8500 (0.0006) [2023-03-07 09:55:22,611][175731] Updated weights for policy 0, policy_version 8510 (0.0006) [2023-03-07 09:55:23,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12815.6). Total num frames: 8722432. Throughput: 0: 12840.5. Samples: 8722432. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:55:23,322][175405] Avg episode reward: [(0, '30.017')] [2023-03-07 09:55:23,396][175731] Updated weights for policy 0, policy_version 8520 (0.0006) [2023-03-07 09:55:24,212][175731] Updated weights for policy 0, policy_version 8530 (0.0006) [2023-03-07 09:55:25,007][175731] Updated weights for policy 0, policy_version 8540 (0.0007) [2023-03-07 09:55:25,806][175731] Updated weights for policy 0, policy_version 8550 (0.0007) [2023-03-07 09:55:26,607][175731] Updated weights for policy 0, policy_version 8560 (0.0008) [2023-03-07 09:55:27,417][175731] Updated weights for policy 0, policy_version 8570 (0.0007) [2023-03-07 09:55:28,210][175731] Updated weights for policy 0, policy_version 8580 (0.0006) [2023-03-07 09:55:28,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12834.1, 300 sec: 12815.6). Total num frames: 8786944. Throughput: 0: 12833.3. Samples: 8760713. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:55:28,322][175405] Avg episode reward: [(0, '29.450')] [2023-03-07 09:55:29,006][175731] Updated weights for policy 0, policy_version 8590 (0.0006) [2023-03-07 09:55:29,821][175731] Updated weights for policy 0, policy_version 8600 (0.0006) [2023-03-07 09:55:30,602][175731] Updated weights for policy 0, policy_version 8610 (0.0006) [2023-03-07 09:55:31,406][175731] Updated weights for policy 0, policy_version 8620 (0.0006) [2023-03-07 09:55:32,212][175731] Updated weights for policy 0, policy_version 8630 (0.0007) [2023-03-07 09:55:33,028][175731] Updated weights for policy 0, policy_version 8640 (0.0006) [2023-03-07 09:55:33,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12817.1, 300 sec: 12812.1). Total num frames: 8850432. Throughput: 0: 12825.7. Samples: 8837546. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:55:33,322][175405] Avg episode reward: [(0, '29.301')] [2023-03-07 09:55:33,837][175731] Updated weights for policy 0, policy_version 8650 (0.0006) [2023-03-07 09:55:34,611][175731] Updated weights for policy 0, policy_version 8660 (0.0007) [2023-03-07 09:55:35,418][175731] Updated weights for policy 0, policy_version 8670 (0.0006) [2023-03-07 09:55:36,219][175731] Updated weights for policy 0, policy_version 8680 (0.0006) [2023-03-07 09:55:37,010][175731] Updated weights for policy 0, policy_version 8690 (0.0007) [2023-03-07 09:55:37,833][175731] Updated weights for policy 0, policy_version 8700 (0.0006) [2023-03-07 09:55:38,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12815.6). Total num frames: 8914944. Throughput: 0: 12816.9. Samples: 8914102. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:55:38,322][175405] Avg episode reward: [(0, '31.620')] [2023-03-07 09:55:38,631][175731] Updated weights for policy 0, policy_version 8710 (0.0006) [2023-03-07 09:55:39,421][175731] Updated weights for policy 0, policy_version 8720 (0.0006) [2023-03-07 09:55:40,227][175731] Updated weights for policy 0, policy_version 8730 (0.0006) [2023-03-07 09:55:41,026][175731] Updated weights for policy 0, policy_version 8740 (0.0007) [2023-03-07 09:55:41,813][175731] Updated weights for policy 0, policy_version 8750 (0.0007) [2023-03-07 09:55:42,618][175731] Updated weights for policy 0, policy_version 8760 (0.0006) [2023-03-07 09:55:43,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12812.2). Total num frames: 8978432. Throughput: 0: 12815.9. Samples: 8952543. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:55:43,322][175405] Avg episode reward: [(0, '32.033')] [2023-03-07 09:55:43,403][175731] Updated weights for policy 0, policy_version 8770 (0.0006) [2023-03-07 09:55:44,204][175731] Updated weights for policy 0, policy_version 8780 (0.0006) [2023-03-07 09:55:45,004][175731] Updated weights for policy 0, policy_version 8790 (0.0006) [2023-03-07 09:55:45,818][175731] Updated weights for policy 0, policy_version 8800 (0.0007) [2023-03-07 09:55:46,600][175731] Updated weights for policy 0, policy_version 8810 (0.0006) [2023-03-07 09:55:47,395][175731] Updated weights for policy 0, policy_version 8820 (0.0006) [2023-03-07 09:55:48,197][175731] Updated weights for policy 0, policy_version 8830 (0.0007) [2023-03-07 09:55:48,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12812.1). Total num frames: 9042944. Throughput: 0: 12819.2. Samples: 9029703. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:55:48,322][175405] Avg episode reward: [(0, '29.537')] [2023-03-07 09:55:48,325][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000008831_9042944.pth... 
[2023-03-07 09:55:48,353][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000005828_5967872.pth [2023-03-07 09:55:48,980][175731] Updated weights for policy 0, policy_version 8840 (0.0006) [2023-03-07 09:55:49,780][175731] Updated weights for policy 0, policy_version 8850 (0.0006) [2023-03-07 09:55:50,586][175731] Updated weights for policy 0, policy_version 8860 (0.0006) [2023-03-07 09:55:51,383][175731] Updated weights for policy 0, policy_version 8870 (0.0006) [2023-03-07 09:55:52,211][175731] Updated weights for policy 0, policy_version 8880 (0.0007) [2023-03-07 09:55:53,022][175731] Updated weights for policy 0, policy_version 8890 (0.0007) [2023-03-07 09:55:53,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12812.1). Total num frames: 9106432. Throughput: 0: 12809.4. Samples: 9106297. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:55:53,322][175405] Avg episode reward: [(0, '29.336')] [2023-03-07 09:55:53,822][175731] Updated weights for policy 0, policy_version 8900 (0.0007) [2023-03-07 09:55:54,635][175731] Updated weights for policy 0, policy_version 8910 (0.0006) [2023-03-07 09:55:55,420][175731] Updated weights for policy 0, policy_version 8920 (0.0006) [2023-03-07 09:55:56,232][175731] Updated weights for policy 0, policy_version 8930 (0.0006) [2023-03-07 09:55:57,035][175731] Updated weights for policy 0, policy_version 8940 (0.0007) [2023-03-07 09:55:57,822][175731] Updated weights for policy 0, policy_version 8950 (0.0006) [2023-03-07 09:55:58,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12812.1). Total num frames: 9170944. Throughput: 0: 12802.6. Samples: 9144458. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:55:58,322][175405] Avg episode reward: [(0, '35.388')] [2023-03-07 09:55:58,619][175731] Updated weights for policy 0, policy_version 8960 (0.0006) [2023-03-07 09:55:59,409][175731] Updated weights for policy 0, policy_version 8970 (0.0007) [2023-03-07 09:56:00,198][175731] Updated weights for policy 0, policy_version 8980 (0.0006) [2023-03-07 09:56:00,997][175731] Updated weights for policy 0, policy_version 8990 (0.0006) [2023-03-07 09:56:01,801][175731] Updated weights for policy 0, policy_version 9000 (0.0007) [2023-03-07 09:56:02,615][175731] Updated weights for policy 0, policy_version 9010 (0.0007) [2023-03-07 09:56:03,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12808.7). Total num frames: 9234432. Throughput: 0: 12801.2. Samples: 9221436. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:56:03,322][175405] Avg episode reward: [(0, '30.443')] [2023-03-07 09:56:03,408][175731] Updated weights for policy 0, policy_version 9020 (0.0006) [2023-03-07 09:56:04,206][175731] Updated weights for policy 0, policy_version 9030 (0.0006) [2023-03-07 09:56:05,007][175731] Updated weights for policy 0, policy_version 9040 (0.0006) [2023-03-07 09:56:05,814][175731] Updated weights for policy 0, policy_version 9050 (0.0007) [2023-03-07 09:56:06,616][175731] Updated weights for policy 0, policy_version 9060 (0.0006) [2023-03-07 09:56:07,411][175731] Updated weights for policy 0, policy_version 9070 (0.0007) [2023-03-07 09:56:08,213][175731] Updated weights for policy 0, policy_version 9080 (0.0007) [2023-03-07 09:56:08,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12817.0, 300 sec: 12812.1). Total num frames: 9298944. Throughput: 0: 12796.0. Samples: 9298251. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:56:08,322][175405] Avg episode reward: [(0, '30.376')] [2023-03-07 09:56:09,015][175731] Updated weights for policy 0, policy_version 9090 (0.0006) [2023-03-07 09:56:09,818][175731] Updated weights for policy 0, policy_version 9100 (0.0006) [2023-03-07 09:56:10,630][175731] Updated weights for policy 0, policy_version 9110 (0.0006) [2023-03-07 09:56:11,419][175731] Updated weights for policy 0, policy_version 9120 (0.0007) [2023-03-07 09:56:12,229][175731] Updated weights for policy 0, policy_version 9130 (0.0007) [2023-03-07 09:56:13,026][175731] Updated weights for policy 0, policy_version 9140 (0.0006) [2023-03-07 09:56:13,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12800.0, 300 sec: 12808.7). Total num frames: 9362432. Throughput: 0: 12799.7. Samples: 9336697. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:56:13,321][175405] Avg episode reward: [(0, '32.467')] [2023-03-07 09:56:13,817][175731] Updated weights for policy 0, policy_version 9150 (0.0005) [2023-03-07 09:56:14,613][175731] Updated weights for policy 0, policy_version 9160 (0.0007) [2023-03-07 09:56:15,408][175731] Updated weights for policy 0, policy_version 9170 (0.0007) [2023-03-07 09:56:16,217][175731] Updated weights for policy 0, policy_version 9180 (0.0006) [2023-03-07 09:56:17,013][175731] Updated weights for policy 0, policy_version 9190 (0.0006) [2023-03-07 09:56:17,813][175731] Updated weights for policy 0, policy_version 9200 (0.0006) [2023-03-07 09:56:18,321][175405] Fps is (10 sec: 12800.2, 60 sec: 12800.0, 300 sec: 12812.2). Total num frames: 9426944. Throughput: 0: 12800.8. Samples: 9413580. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:56:18,321][175405] Avg episode reward: [(0, '35.180')] [2023-03-07 09:56:18,604][175731] Updated weights for policy 0, policy_version 9210 (0.0006) [2023-03-07 09:56:19,401][175731] Updated weights for policy 0, policy_version 9220 (0.0007) [2023-03-07 09:56:20,210][175731] Updated weights for policy 0, policy_version 9230 (0.0007) [2023-03-07 09:56:21,012][175731] Updated weights for policy 0, policy_version 9240 (0.0007) [2023-03-07 09:56:21,814][175731] Updated weights for policy 0, policy_version 9250 (0.0007) [2023-03-07 09:56:22,624][175731] Updated weights for policy 0, policy_version 9260 (0.0007) [2023-03-07 09:56:23,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12812.2). Total num frames: 9490432. Throughput: 0: 12804.9. Samples: 9490321. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:56:23,322][175405] Avg episode reward: [(0, '36.133')] [2023-03-07 09:56:23,405][175731] Updated weights for policy 0, policy_version 9270 (0.0006) [2023-03-07 09:56:24,214][175731] Updated weights for policy 0, policy_version 9280 (0.0006) [2023-03-07 09:56:25,004][175731] Updated weights for policy 0, policy_version 9290 (0.0006) [2023-03-07 09:56:25,798][175731] Updated weights for policy 0, policy_version 9300 (0.0006) [2023-03-07 09:56:26,609][175731] Updated weights for policy 0, policy_version 9310 (0.0006) [2023-03-07 09:56:27,421][175731] Updated weights for policy 0, policy_version 9320 (0.0006) [2023-03-07 09:56:28,211][175731] Updated weights for policy 0, policy_version 9330 (0.0007) [2023-03-07 09:56:28,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12800.0, 300 sec: 12812.1). Total num frames: 9554944. Throughput: 0: 12804.6. Samples: 9528752. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:56:28,322][175405] Avg episode reward: [(0, '33.752')] [2023-03-07 09:56:29,028][175731] Updated weights for policy 0, policy_version 9340 (0.0006) [2023-03-07 09:56:29,839][175731] Updated weights for policy 0, policy_version 9350 (0.0006) [2023-03-07 09:56:30,627][175731] Updated weights for policy 0, policy_version 9360 (0.0006) [2023-03-07 09:56:31,426][175731] Updated weights for policy 0, policy_version 9370 (0.0006) [2023-03-07 09:56:32,242][175731] Updated weights for policy 0, policy_version 9380 (0.0006) [2023-03-07 09:56:33,028][175731] Updated weights for policy 0, policy_version 9390 (0.0006) [2023-03-07 09:56:33,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12800.0, 300 sec: 12812.2). Total num frames: 9618432. Throughput: 0: 12791.1. Samples: 9605304. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:56:33,332][175405] Avg episode reward: [(0, '34.650')] [2023-03-07 09:56:33,830][175731] Updated weights for policy 0, policy_version 9400 (0.0006) [2023-03-07 09:56:34,632][175731] Updated weights for policy 0, policy_version 9410 (0.0007) [2023-03-07 09:56:35,422][175731] Updated weights for policy 0, policy_version 9420 (0.0006) [2023-03-07 09:56:36,252][175731] Updated weights for policy 0, policy_version 9430 (0.0007) [2023-03-07 09:56:37,035][175731] Updated weights for policy 0, policy_version 9440 (0.0006) [2023-03-07 09:56:37,846][175731] Updated weights for policy 0, policy_version 9450 (0.0006) [2023-03-07 09:56:38,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12800.0, 300 sec: 12812.1). Total num frames: 9682944. Throughput: 0: 12797.6. Samples: 9682186. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) [2023-03-07 09:56:38,332][175405] Avg episode reward: [(0, '34.234')] [2023-03-07 09:56:38,644][175731] Updated weights for policy 0, policy_version 9460 (0.0007) [2023-03-07 09:56:39,458][175731] Updated weights for policy 0, policy_version 9470 (0.0007) [2023-03-07 09:56:40,275][175731] Updated weights for policy 0, policy_version 9480 (0.0006) [2023-03-07 09:56:41,053][175731] Updated weights for policy 0, policy_version 9490 (0.0007) [2023-03-07 09:56:41,850][175731] Updated weights for policy 0, policy_version 9500 (0.0006) [2023-03-07 09:56:42,653][175731] Updated weights for policy 0, policy_version 9510 (0.0006) [2023-03-07 09:56:43,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12800.0, 300 sec: 12812.1). Total num frames: 9746432. Throughput: 0: 12795.4. Samples: 9720254. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:56:43,332][175405] Avg episode reward: [(0, '37.868')] [2023-03-07 09:56:43,442][175731] Updated weights for policy 0, policy_version 9520 (0.0006) [2023-03-07 09:56:44,263][175731] Updated weights for policy 0, policy_version 9530 (0.0006) [2023-03-07 09:56:45,047][175731] Updated weights for policy 0, policy_version 9540 (0.0006) [2023-03-07 09:56:45,205][175680] KL-divergence is very high: 251.6855 [2023-03-07 09:56:45,851][175731] Updated weights for policy 0, policy_version 9550 (0.0007) [2023-03-07 09:56:46,671][175731] Updated weights for policy 0, policy_version 9560 (0.0007) [2023-03-07 09:56:47,471][175731] Updated weights for policy 0, policy_version 9570 (0.0006) [2023-03-07 09:56:48,285][175731] Updated weights for policy 0, policy_version 9580 (0.0007) [2023-03-07 09:56:48,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12782.9, 300 sec: 12808.7). Total num frames: 9809920. Throughput: 0: 12785.7. Samples: 9796793. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:56:48,332][175405] Avg episode reward: [(0, '36.817')] [2023-03-07 09:56:49,079][175731] Updated weights for policy 0, policy_version 9590 (0.0006) [2023-03-07 09:56:49,864][175731] Updated weights for policy 0, policy_version 9600 (0.0006) [2023-03-07 09:56:50,672][175731] Updated weights for policy 0, policy_version 9610 (0.0006) [2023-03-07 09:56:51,477][175731] Updated weights for policy 0, policy_version 9620 (0.0007) [2023-03-07 09:56:52,286][175731] Updated weights for policy 0, policy_version 9630 (0.0007) [2023-03-07 09:56:53,114][175731] Updated weights for policy 0, policy_version 9640 (0.0007) [2023-03-07 09:56:53,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12782.9, 300 sec: 12808.7). Total num frames: 9873408. Throughput: 0: 12775.0. Samples: 9873126. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:56:53,332][175405] Avg episode reward: [(0, '37.683')] [2023-03-07 09:56:53,916][175731] Updated weights for policy 0, policy_version 9650 (0.0006) [2023-03-07 09:56:54,712][175731] Updated weights for policy 0, policy_version 9660 (0.0007) [2023-03-07 09:56:55,506][175731] Updated weights for policy 0, policy_version 9670 (0.0006) [2023-03-07 09:56:56,305][175731] Updated weights for policy 0, policy_version 9680 (0.0006) [2023-03-07 09:56:57,093][175731] Updated weights for policy 0, policy_version 9690 (0.0006) [2023-03-07 09:56:57,886][175731] Updated weights for policy 0, policy_version 9700 (0.0007) [2023-03-07 09:56:58,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12782.9, 300 sec: 12808.7). Total num frames: 9937920. Throughput: 0: 12775.2. Samples: 9911581. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:56:58,322][175405] Avg episode reward: [(0, '34.590')] [2023-03-07 09:56:58,713][175731] Updated weights for policy 0, policy_version 9710 (0.0006) [2023-03-07 09:56:59,490][175731] Updated weights for policy 0, policy_version 9720 (0.0006) [2023-03-07 09:57:00,282][175731] Updated weights for policy 0, policy_version 9730 (0.0005) [2023-03-07 09:57:01,088][175731] Updated weights for policy 0, policy_version 9740 (0.0008) [2023-03-07 09:57:01,873][175731] Updated weights for policy 0, policy_version 9750 (0.0006) [2023-03-07 09:57:02,670][175731] Updated weights for policy 0, policy_version 9760 (0.0008) [2023-03-07 09:57:03,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12800.0, 300 sec: 12812.2). Total num frames: 10002432. Throughput: 0: 12780.6. Samples: 9988709. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:57:03,321][175405] Avg episode reward: [(0, '36.295')] [2023-03-07 09:57:03,486][175731] Updated weights for policy 0, policy_version 9770 (0.0006) [2023-03-07 09:57:04,288][175731] Updated weights for policy 0, policy_version 9780 (0.0007) [2023-03-07 09:57:05,088][175731] Updated weights for policy 0, policy_version 9790 (0.0006) [2023-03-07 09:57:05,871][175731] Updated weights for policy 0, policy_version 9800 (0.0006) [2023-03-07 09:57:06,683][175731] Updated weights for policy 0, policy_version 9810 (0.0006) [2023-03-07 09:57:07,505][175731] Updated weights for policy 0, policy_version 9820 (0.0007) [2023-03-07 09:57:08,305][175731] Updated weights for policy 0, policy_version 9830 (0.0006) [2023-03-07 09:57:08,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12783.0, 300 sec: 12808.7). Total num frames: 10065920. Throughput: 0: 12779.2. Samples: 10065386. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:57:08,321][175405] Avg episode reward: [(0, '33.832')] [2023-03-07 09:57:09,103][175731] Updated weights for policy 0, policy_version 9840 (0.0006) [2023-03-07 09:57:09,913][175731] Updated weights for policy 0, policy_version 9850 (0.0006) [2023-03-07 09:57:10,710][175731] Updated weights for policy 0, policy_version 9860 (0.0006) [2023-03-07 09:57:11,500][175731] Updated weights for policy 0, policy_version 9870 (0.0006) [2023-03-07 09:57:12,303][175731] Updated weights for policy 0, policy_version 9880 (0.0007) [2023-03-07 09:57:13,103][175731] Updated weights for policy 0, policy_version 9890 (0.0007) [2023-03-07 09:57:13,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12782.9, 300 sec: 12805.2). Total num frames: 10129408. Throughput: 0: 12773.0. Samples: 10103537. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:57:13,322][175405] Avg episode reward: [(0, '32.776')] [2023-03-07 09:57:13,898][175731] Updated weights for policy 0, policy_version 9900 (0.0006) [2023-03-07 09:57:14,697][175731] Updated weights for policy 0, policy_version 9910 (0.0006) [2023-03-07 09:57:15,497][175731] Updated weights for policy 0, policy_version 9920 (0.0006) [2023-03-07 09:57:16,285][175731] Updated weights for policy 0, policy_version 9930 (0.0006) [2023-03-07 09:57:17,092][175731] Updated weights for policy 0, policy_version 9940 (0.0007) [2023-03-07 09:57:17,897][175731] Updated weights for policy 0, policy_version 9950 (0.0006) [2023-03-07 09:57:18,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12782.9, 300 sec: 12808.7). Total num frames: 10193920. Throughput: 0: 12779.0. Samples: 10180359. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:57:18,321][175405] Avg episode reward: [(0, '34.130')] [2023-03-07 09:57:18,709][175731] Updated weights for policy 0, policy_version 9960 (0.0006) [2023-03-07 09:57:19,513][175731] Updated weights for policy 0, policy_version 9970 (0.0006) [2023-03-07 09:57:20,305][175731] Updated weights for policy 0, policy_version 9980 (0.0006) [2023-03-07 09:57:21,117][175731] Updated weights for policy 0, policy_version 9990 (0.0006) [2023-03-07 09:57:21,912][175731] Updated weights for policy 0, policy_version 10000 (0.0007) [2023-03-07 09:57:22,702][175731] Updated weights for policy 0, policy_version 10010 (0.0007) [2023-03-07 09:57:23,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12782.9, 300 sec: 12805.2). Total num frames: 10257408. Throughput: 0: 12774.0. Samples: 10257017. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:57:23,332][175405] Avg episode reward: [(0, '36.173')] [2023-03-07 09:57:23,505][175731] Updated weights for policy 0, policy_version 10020 (0.0007) [2023-03-07 09:57:24,314][175731] Updated weights for policy 0, policy_version 10030 (0.0006) [2023-03-07 09:57:25,117][175731] Updated weights for policy 0, policy_version 10040 (0.0006) [2023-03-07 09:57:25,935][175731] Updated weights for policy 0, policy_version 10050 (0.0007) [2023-03-07 09:57:26,749][175731] Updated weights for policy 0, policy_version 10060 (0.0006) [2023-03-07 09:57:27,554][175731] Updated weights for policy 0, policy_version 10070 (0.0006) [2023-03-07 09:57:28,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12765.9, 300 sec: 12801.7). Total num frames: 10320896. Throughput: 0: 12774.6. Samples: 10295111. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:57:28,323][175405] Avg episode reward: [(0, '32.627')] [2023-03-07 09:57:28,349][175731] Updated weights for policy 0, policy_version 10080 (0.0007) [2023-03-07 09:57:29,146][175731] Updated weights for policy 0, policy_version 10090 (0.0006) [2023-03-07 09:57:29,946][175731] Updated weights for policy 0, policy_version 10100 (0.0006) [2023-03-07 09:57:30,739][175731] Updated weights for policy 0, policy_version 10110 (0.0006) [2023-03-07 09:57:31,558][175731] Updated weights for policy 0, policy_version 10120 (0.0007) [2023-03-07 09:57:32,359][175731] Updated weights for policy 0, policy_version 10130 (0.0007) [2023-03-07 09:57:33,158][175731] Updated weights for policy 0, policy_version 10140 (0.0006) [2023-03-07 09:57:33,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12782.9, 300 sec: 12805.2). Total num frames: 10385408. Throughput: 0: 12772.9. Samples: 10371576. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:57:33,332][175405] Avg episode reward: [(0, '35.070')] [2023-03-07 09:57:33,961][175731] Updated weights for policy 0, policy_version 10150 (0.0006) [2023-03-07 09:57:34,772][175731] Updated weights for policy 0, policy_version 10160 (0.0007) [2023-03-07 09:57:35,557][175731] Updated weights for policy 0, policy_version 10170 (0.0007) [2023-03-07 09:57:36,358][175731] Updated weights for policy 0, policy_version 10180 (0.0007) [2023-03-07 09:57:37,159][175731] Updated weights for policy 0, policy_version 10190 (0.0006) [2023-03-07 09:57:37,951][175731] Updated weights for policy 0, policy_version 10200 (0.0006) [2023-03-07 09:57:38,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12765.9, 300 sec: 12801.7). Total num frames: 10448896. Throughput: 0: 12785.0. Samples: 10448448. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:57:38,332][175405] Avg episode reward: [(0, '32.316')] [2023-03-07 09:57:38,751][175731] Updated weights for policy 0, policy_version 10210 (0.0007) [2023-03-07 09:57:39,551][175731] Updated weights for policy 0, policy_version 10220 (0.0006) [2023-03-07 09:57:40,338][175731] Updated weights for policy 0, policy_version 10230 (0.0006) [2023-03-07 09:57:41,141][175731] Updated weights for policy 0, policy_version 10240 (0.0006) [2023-03-07 09:57:41,946][175731] Updated weights for policy 0, policy_version 10250 (0.0006) [2023-03-07 09:57:42,744][175731] Updated weights for policy 0, policy_version 10260 (0.0007) [2023-03-07 09:57:43,321][175405] Fps is (10 sec: 12697.8, 60 sec: 12765.9, 300 sec: 12798.3). Total num frames: 10512384. Throughput: 0: 12784.4. Samples: 10486879. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:57:43,332][175405] Avg episode reward: [(0, '32.789')] [2023-03-07 09:57:43,551][175731] Updated weights for policy 0, policy_version 10270 (0.0006) [2023-03-07 09:57:44,351][175731] Updated weights for policy 0, policy_version 10280 (0.0006) [2023-03-07 09:57:45,146][175731] Updated weights for policy 0, policy_version 10290 (0.0007) [2023-03-07 09:57:45,949][175731] Updated weights for policy 0, policy_version 10300 (0.0006) [2023-03-07 09:57:46,761][175731] Updated weights for policy 0, policy_version 10310 (0.0007) [2023-03-07 09:57:47,549][175731] Updated weights for policy 0, policy_version 10320 (0.0007) [2023-03-07 09:57:48,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12782.9, 300 sec: 12801.7). Total num frames: 10576896. Throughput: 0: 12777.6. Samples: 10563704. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:57:48,332][175405] Avg episode reward: [(0, '32.639')] [2023-03-07 09:57:48,348][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000010330_10577920.pth... 
[2023-03-07 09:57:48,350][175731] Updated weights for policy 0, policy_version 10330 (0.0006) [2023-03-07 09:57:48,377][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000007329_7504896.pth [2023-03-07 09:57:49,137][175731] Updated weights for policy 0, policy_version 10340 (0.0007) [2023-03-07 09:57:49,954][175731] Updated weights for policy 0, policy_version 10350 (0.0006) [2023-03-07 09:57:50,743][175731] Updated weights for policy 0, policy_version 10360 (0.0006) [2023-03-07 09:57:51,540][175731] Updated weights for policy 0, policy_version 10370 (0.0006) [2023-03-07 09:57:52,348][175731] Updated weights for policy 0, policy_version 10380 (0.0006) [2023-03-07 09:57:53,137][175731] Updated weights for policy 0, policy_version 10390 (0.0005) [2023-03-07 09:57:53,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12800.0, 300 sec: 12805.2). Total num frames: 10641408. Throughput: 0: 12786.0. Samples: 10640756. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:57:53,332][175405] Avg episode reward: [(0, '32.180')] [2023-03-07 09:57:53,950][175731] Updated weights for policy 0, policy_version 10400 (0.0007) [2023-03-07 09:57:54,730][175731] Updated weights for policy 0, policy_version 10410 (0.0007) [2023-03-07 09:57:55,515][175731] Updated weights for policy 0, policy_version 10420 (0.0006) [2023-03-07 09:57:56,310][175731] Updated weights for policy 0, policy_version 10430 (0.0006) [2023-03-07 09:57:57,106][175731] Updated weights for policy 0, policy_version 10440 (0.0005) [2023-03-07 09:57:57,906][175731] Updated weights for policy 0, policy_version 10450 (0.0006) [2023-03-07 09:57:58,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12800.0, 300 sec: 12805.2). Total num frames: 10705920. Throughput: 0: 12797.1. Samples: 10679408. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:57:58,332][175405] Avg episode reward: [(0, '33.133')] [2023-03-07 09:57:58,695][175731] Updated weights for policy 0, policy_version 10460 (0.0006) [2023-03-07 09:57:59,498][175731] Updated weights for policy 0, policy_version 10470 (0.0007) [2023-03-07 09:58:00,318][175731] Updated weights for policy 0, policy_version 10480 (0.0006) [2023-03-07 09:58:01,113][175731] Updated weights for policy 0, policy_version 10490 (0.0006) [2023-03-07 09:58:01,918][175731] Updated weights for policy 0, policy_version 10500 (0.0007) [2023-03-07 09:58:02,736][175731] Updated weights for policy 0, policy_version 10510 (0.0007) [2023-03-07 09:58:03,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12782.9, 300 sec: 12801.7). Total num frames: 10769408. Throughput: 0: 12790.8. Samples: 10755947. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:58:03,322][175405] Avg episode reward: [(0, '32.622')] [2023-03-07 09:58:03,537][175731] Updated weights for policy 0, policy_version 10520 (0.0006) [2023-03-07 09:58:04,344][175731] Updated weights for policy 0, policy_version 10530 (0.0006) [2023-03-07 09:58:05,140][175731] Updated weights for policy 0, policy_version 10540 (0.0008) [2023-03-07 09:58:05,938][175731] Updated weights for policy 0, policy_version 10550 (0.0006) [2023-03-07 09:58:06,739][175731] Updated weights for policy 0, policy_version 10560 (0.0006) [2023-03-07 09:58:07,550][175731] Updated weights for policy 0, policy_version 10570 (0.0006) [2023-03-07 09:58:08,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12782.9, 300 sec: 12801.7). Total num frames: 10832896. Throughput: 0: 12793.0. Samples: 10832705. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:58:08,322][175405] Avg episode reward: [(0, '35.601')] [2023-03-07 09:58:08,342][175731] Updated weights for policy 0, policy_version 10580 (0.0006) [2023-03-07 09:58:09,129][175731] Updated weights for policy 0, policy_version 10590 (0.0006) [2023-03-07 09:58:09,915][175731] Updated weights for policy 0, policy_version 10600 (0.0006) [2023-03-07 09:58:10,718][175731] Updated weights for policy 0, policy_version 10610 (0.0006) [2023-03-07 09:58:11,525][175731] Updated weights for policy 0, policy_version 10620 (0.0006) [2023-03-07 09:58:12,321][175731] Updated weights for policy 0, policy_version 10630 (0.0006) [2023-03-07 09:58:13,110][175731] Updated weights for policy 0, policy_version 10640 (0.0006) [2023-03-07 09:58:13,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12801.7). Total num frames: 10897408. Throughput: 0: 12802.2. Samples: 10871208. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:58:13,321][175405] Avg episode reward: [(0, '34.742')] [2023-03-07 09:58:13,918][175731] Updated weights for policy 0, policy_version 10650 (0.0007) [2023-03-07 09:58:14,721][175731] Updated weights for policy 0, policy_version 10660 (0.0006) [2023-03-07 09:58:15,506][175731] Updated weights for policy 0, policy_version 10670 (0.0006) [2023-03-07 09:58:16,313][175731] Updated weights for policy 0, policy_version 10680 (0.0007) [2023-03-07 09:58:17,101][175731] Updated weights for policy 0, policy_version 10690 (0.0006) [2023-03-07 09:58:17,903][175731] Updated weights for policy 0, policy_version 10700 (0.0007) [2023-03-07 09:58:18,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12800.0, 300 sec: 12805.2). Total num frames: 10961920. Throughput: 0: 12817.0. Samples: 10948342. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:58:18,322][175405] Avg episode reward: [(0, '35.382')] [2023-03-07 09:58:18,706][175731] Updated weights for policy 0, policy_version 10710 (0.0006) [2023-03-07 09:58:19,522][175731] Updated weights for policy 0, policy_version 10720 (0.0006) [2023-03-07 09:58:20,329][175731] Updated weights for policy 0, policy_version 10730 (0.0007) [2023-03-07 09:58:21,137][175731] Updated weights for policy 0, policy_version 10740 (0.0006) [2023-03-07 09:58:21,942][175731] Updated weights for policy 0, policy_version 10750 (0.0006) [2023-03-07 09:58:22,736][175731] Updated weights for policy 0, policy_version 10760 (0.0006) [2023-03-07 09:58:23,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12801.7). Total num frames: 11025408. Throughput: 0: 12805.3. Samples: 11024687. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:58:23,322][175405] Avg episode reward: [(0, '35.866')] [2023-03-07 09:58:23,536][175731] Updated weights for policy 0, policy_version 10770 (0.0007) [2023-03-07 09:58:24,344][175731] Updated weights for policy 0, policy_version 10780 (0.0006) [2023-03-07 09:58:25,128][175731] Updated weights for policy 0, policy_version 10790 (0.0006) [2023-03-07 09:58:25,926][175731] Updated weights for policy 0, policy_version 10800 (0.0006) [2023-03-07 09:58:26,709][175731] Updated weights for policy 0, policy_version 10810 (0.0006) [2023-03-07 09:58:27,498][175731] Updated weights for policy 0, policy_version 10820 (0.0007) [2023-03-07 09:58:28,306][175731] Updated weights for policy 0, policy_version 10830 (0.0006) [2023-03-07 09:58:28,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12805.2). Total num frames: 11089920. Throughput: 0: 12807.4. Samples: 11063211. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:58:28,321][175405] Avg episode reward: [(0, '32.787')] [2023-03-07 09:58:29,115][175731] Updated weights for policy 0, policy_version 10840 (0.0006) [2023-03-07 09:58:29,933][175731] Updated weights for policy 0, policy_version 10850 (0.0006) [2023-03-07 09:58:30,732][175731] Updated weights for policy 0, policy_version 10860 (0.0006) [2023-03-07 09:58:31,534][175731] Updated weights for policy 0, policy_version 10870 (0.0006) [2023-03-07 09:58:32,343][175731] Updated weights for policy 0, policy_version 10880 (0.0006) [2023-03-07 09:58:33,140][175731] Updated weights for policy 0, policy_version 10890 (0.0006) [2023-03-07 09:58:33,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12800.0, 300 sec: 12801.7). Total num frames: 11153408. Throughput: 0: 12803.5. Samples: 11139858. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:58:33,321][175405] Avg episode reward: [(0, '32.823')] [2023-03-07 09:58:33,940][175731] Updated weights for policy 0, policy_version 10900 (0.0006) [2023-03-07 09:58:34,735][175731] Updated weights for policy 0, policy_version 10910 (0.0006) [2023-03-07 09:58:35,525][175731] Updated weights for policy 0, policy_version 10920 (0.0006) [2023-03-07 09:58:36,318][175731] Updated weights for policy 0, policy_version 10930 (0.0006) [2023-03-07 09:58:37,114][175731] Updated weights for policy 0, policy_version 10940 (0.0006) [2023-03-07 09:58:37,914][175731] Updated weights for policy 0, policy_version 10950 (0.0005) [2023-03-07 09:58:38,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12817.1, 300 sec: 12801.7). Total num frames: 11217920. Throughput: 0: 12806.9. Samples: 11217064. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:58:38,321][175405] Avg episode reward: [(0, '35.160')] [2023-03-07 09:58:38,711][175731] Updated weights for policy 0, policy_version 10960 (0.0007) [2023-03-07 09:58:39,532][175731] Updated weights for policy 0, policy_version 10970 (0.0007) [2023-03-07 09:58:40,328][175731] Updated weights for policy 0, policy_version 10980 (0.0006) [2023-03-07 09:58:41,128][175731] Updated weights for policy 0, policy_version 10990 (0.0007) [2023-03-07 09:58:41,933][175731] Updated weights for policy 0, policy_version 11000 (0.0007) [2023-03-07 09:58:42,733][175731] Updated weights for policy 0, policy_version 11010 (0.0006) [2023-03-07 09:58:43,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12801.7). Total num frames: 11281408. Throughput: 0: 12792.4. Samples: 11255063. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:58:43,322][175405] Avg episode reward: [(0, '33.729')] [2023-03-07 09:58:43,544][175731] Updated weights for policy 0, policy_version 11020 (0.0006) [2023-03-07 09:58:44,343][175731] Updated weights for policy 0, policy_version 11030 (0.0007) [2023-03-07 09:58:45,140][175731] Updated weights for policy 0, policy_version 11040 (0.0006) [2023-03-07 09:58:45,939][175731] Updated weights for policy 0, policy_version 11050 (0.0008) [2023-03-07 09:58:46,753][175731] Updated weights for policy 0, policy_version 11060 (0.0007) [2023-03-07 09:58:47,557][175731] Updated weights for policy 0, policy_version 11070 (0.0006) [2023-03-07 09:58:48,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12800.0, 300 sec: 12798.3). Total num frames: 11344896. Throughput: 0: 12793.4. Samples: 11331651. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:58:48,322][175405] Avg episode reward: [(0, '33.834')] [2023-03-07 09:58:48,346][175731] Updated weights for policy 0, policy_version 11080 (0.0006) [2023-03-07 09:58:49,154][175731] Updated weights for policy 0, policy_version 11090 (0.0006) [2023-03-07 09:58:49,961][175731] Updated weights for policy 0, policy_version 11100 (0.0006) [2023-03-07 09:58:50,733][175731] Updated weights for policy 0, policy_version 11110 (0.0007) [2023-03-07 09:58:51,541][175731] Updated weights for policy 0, policy_version 11120 (0.0007) [2023-03-07 09:58:52,350][175731] Updated weights for policy 0, policy_version 11130 (0.0006) [2023-03-07 09:58:53,151][175731] Updated weights for policy 0, policy_version 11140 (0.0006) [2023-03-07 09:58:53,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12800.0, 300 sec: 12801.7). Total num frames: 11409408. Throughput: 0: 12797.8. Samples: 11408604. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:58:53,322][175405] Avg episode reward: [(0, '33.950')] [2023-03-07 09:58:53,946][175731] Updated weights for policy 0, policy_version 11150 (0.0006) [2023-03-07 09:58:54,754][175731] Updated weights for policy 0, policy_version 11160 (0.0007) [2023-03-07 09:58:55,560][175731] Updated weights for policy 0, policy_version 11170 (0.0007) [2023-03-07 09:58:56,346][175731] Updated weights for policy 0, policy_version 11180 (0.0006) [2023-03-07 09:58:57,135][175731] Updated weights for policy 0, policy_version 11190 (0.0005) [2023-03-07 09:58:57,938][175731] Updated weights for policy 0, policy_version 11200 (0.0007) [2023-03-07 09:58:58,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12782.9, 300 sec: 12801.7). Total num frames: 11472896. Throughput: 0: 12793.0. Samples: 11446895. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:58:58,322][175405] Avg episode reward: [(0, '32.972')] [2023-03-07 09:58:58,730][175731] Updated weights for policy 0, policy_version 11210 (0.0007) [2023-03-07 09:58:59,533][175731] Updated weights for policy 0, policy_version 11220 (0.0006) [2023-03-07 09:59:00,327][175731] Updated weights for policy 0, policy_version 11230 (0.0007) [2023-03-07 09:59:01,123][175731] Updated weights for policy 0, policy_version 11240 (0.0006) [2023-03-07 09:59:01,946][175731] Updated weights for policy 0, policy_version 11250 (0.0006) [2023-03-07 09:59:02,758][175731] Updated weights for policy 0, policy_version 11260 (0.0006) [2023-03-07 09:59:03,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12782.9, 300 sec: 12798.3). Total num frames: 11536384. Throughput: 0: 12789.4. Samples: 11523866. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:59:03,322][175405] Avg episode reward: [(0, '33.124')] [2023-03-07 09:59:03,558][175731] Updated weights for policy 0, policy_version 11270 (0.0006) [2023-03-07 09:59:04,365][175731] Updated weights for policy 0, policy_version 11280 (0.0006) [2023-03-07 09:59:05,169][175731] Updated weights for policy 0, policy_version 11290 (0.0007) [2023-03-07 09:59:05,977][175731] Updated weights for policy 0, policy_version 11300 (0.0007) [2023-03-07 09:59:06,781][175731] Updated weights for policy 0, policy_version 11310 (0.0006) [2023-03-07 09:59:07,605][175731] Updated weights for policy 0, policy_version 11320 (0.0006) [2023-03-07 09:59:08,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12800.0, 300 sec: 12798.3). Total num frames: 11600896. Throughput: 0: 12785.5. Samples: 11600032. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:59:08,321][175405] Avg episode reward: [(0, '41.255')] [2023-03-07 09:59:08,402][175731] Updated weights for policy 0, policy_version 11330 (0.0006) [2023-03-07 09:59:09,189][175731] Updated weights for policy 0, policy_version 11340 (0.0006) [2023-03-07 09:59:09,997][175731] Updated weights for policy 0, policy_version 11350 (0.0006) [2023-03-07 09:59:10,797][175731] Updated weights for policy 0, policy_version 11360 (0.0006) [2023-03-07 09:59:11,579][175731] Updated weights for policy 0, policy_version 11370 (0.0007) [2023-03-07 09:59:12,385][175731] Updated weights for policy 0, policy_version 11380 (0.0006) [2023-03-07 09:59:13,185][175731] Updated weights for policy 0, policy_version 11390 (0.0007) [2023-03-07 09:59:13,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12782.9, 300 sec: 12794.8). Total num frames: 11664384. Throughput: 0: 12785.3. Samples: 11638551. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 09:59:13,322][175405] Avg episode reward: [(0, '32.807')] [2023-03-07 09:59:14,010][175731] Updated weights for policy 0, policy_version 11400 (0.0007) [2023-03-07 09:59:14,782][175731] Updated weights for policy 0, policy_version 11410 (0.0006) [2023-03-07 09:59:15,590][175731] Updated weights for policy 0, policy_version 11420 (0.0006) [2023-03-07 09:59:16,373][175731] Updated weights for policy 0, policy_version 11430 (0.0006) [2023-03-07 09:59:17,165][175731] Updated weights for policy 0, policy_version 11440 (0.0007) [2023-03-07 09:59:17,966][175731] Updated weights for policy 0, policy_version 11450 (0.0006) [2023-03-07 09:59:18,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12782.9, 300 sec: 12798.3). Total num frames: 11728896. Throughput: 0: 12789.3. Samples: 11715375. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:59:18,322][175405] Avg episode reward: [(0, '30.928')] [2023-03-07 09:59:18,761][175731] Updated weights for policy 0, policy_version 11460 (0.0006) [2023-03-07 09:59:19,577][175731] Updated weights for policy 0, policy_version 11470 (0.0006) [2023-03-07 09:59:20,401][175731] Updated weights for policy 0, policy_version 11480 (0.0006) [2023-03-07 09:59:21,203][175731] Updated weights for policy 0, policy_version 11490 (0.0007) [2023-03-07 09:59:22,003][175731] Updated weights for policy 0, policy_version 11500 (0.0006) [2023-03-07 09:59:22,835][175731] Updated weights for policy 0, policy_version 11510 (0.0006) [2023-03-07 09:59:23,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12783.0, 300 sec: 12798.3). Total num frames: 11792384. Throughput: 0: 12765.2. Samples: 11791497. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:59:23,321][175405] Avg episode reward: [(0, '32.943')] [2023-03-07 09:59:23,647][175731] Updated weights for policy 0, policy_version 11520 (0.0006) [2023-03-07 09:59:24,446][175731] Updated weights for policy 0, policy_version 11530 (0.0006) [2023-03-07 09:59:25,257][175731] Updated weights for policy 0, policy_version 11540 (0.0006) [2023-03-07 09:59:26,056][175731] Updated weights for policy 0, policy_version 11550 (0.0007) [2023-03-07 09:59:26,852][175731] Updated weights for policy 0, policy_version 11560 (0.0006) [2023-03-07 09:59:27,645][175731] Updated weights for policy 0, policy_version 11570 (0.0006) [2023-03-07 09:59:28,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12765.8, 300 sec: 12794.8). Total num frames: 11855872. Throughput: 0: 12770.3. Samples: 11829730. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:59:28,322][175405] Avg episode reward: [(0, '36.583')] [2023-03-07 09:59:28,445][175731] Updated weights for policy 0, policy_version 11580 (0.0006) [2023-03-07 09:59:29,237][175731] Updated weights for policy 0, policy_version 11590 (0.0007) [2023-03-07 09:59:30,049][175731] Updated weights for policy 0, policy_version 11600 (0.0006) [2023-03-07 09:59:30,862][175731] Updated weights for policy 0, policy_version 11610 (0.0006) [2023-03-07 09:59:31,675][175731] Updated weights for policy 0, policy_version 11620 (0.0006) [2023-03-07 09:59:32,465][175731] Updated weights for policy 0, policy_version 11630 (0.0006) [2023-03-07 09:59:33,270][175731] Updated weights for policy 0, policy_version 11640 (0.0007) [2023-03-07 09:59:33,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12765.9, 300 sec: 12794.8). Total num frames: 11919360. Throughput: 0: 12768.3. Samples: 11906225. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:59:33,322][175405] Avg episode reward: [(0, '36.316')] [2023-03-07 09:59:34,052][175731] Updated weights for policy 0, policy_version 11650 (0.0006) [2023-03-07 09:59:34,850][175731] Updated weights for policy 0, policy_version 11660 (0.0006) [2023-03-07 09:59:35,661][175731] Updated weights for policy 0, policy_version 11670 (0.0006) [2023-03-07 09:59:36,457][175731] Updated weights for policy 0, policy_version 11680 (0.0007) [2023-03-07 09:59:37,251][175731] Updated weights for policy 0, policy_version 11690 (0.0006) [2023-03-07 09:59:38,080][175731] Updated weights for policy 0, policy_version 11700 (0.0007) [2023-03-07 09:59:38,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12765.9, 300 sec: 12794.8). Total num frames: 11983872. Throughput: 0: 12765.1. Samples: 11983033. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 09:59:38,322][175405] Avg episode reward: [(0, '33.051')] [2023-03-07 09:59:38,870][175731] Updated weights for policy 0, policy_version 11710 (0.0006) [2023-03-07 09:59:39,670][175731] Updated weights for policy 0, policy_version 11720 (0.0007) [2023-03-07 09:59:40,474][175731] Updated weights for policy 0, policy_version 11730 (0.0007) [2023-03-07 09:59:41,305][175731] Updated weights for policy 0, policy_version 11740 (0.0006) [2023-03-07 09:59:42,096][175731] Updated weights for policy 0, policy_version 11750 (0.0007) [2023-03-07 09:59:42,885][175731] Updated weights for policy 0, policy_version 11760 (0.0006) [2023-03-07 09:59:43,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12765.9, 300 sec: 12791.3). Total num frames: 12047360. Throughput: 0: 12763.3. Samples: 12021242. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:59:43,322][175405] Avg episode reward: [(0, '32.981')] [2023-03-07 09:59:43,688][175731] Updated weights for policy 0, policy_version 11770 (0.0006) [2023-03-07 09:59:44,489][175731] Updated weights for policy 0, policy_version 11780 (0.0006) [2023-03-07 09:59:45,296][175731] Updated weights for policy 0, policy_version 11790 (0.0006) [2023-03-07 09:59:46,112][175731] Updated weights for policy 0, policy_version 11800 (0.0006) [2023-03-07 09:59:46,927][175731] Updated weights for policy 0, policy_version 11810 (0.0006) [2023-03-07 09:59:47,736][175731] Updated weights for policy 0, policy_version 11820 (0.0006) [2023-03-07 09:59:48,321][175405] Fps is (10 sec: 12697.4, 60 sec: 12765.8, 300 sec: 12791.3). Total num frames: 12110848. Throughput: 0: 12747.7. Samples: 12097516. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:59:48,322][175405] Avg episode reward: [(0, '32.178')] [2023-03-07 09:59:48,325][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000011827_12110848.pth... 
[2023-03-07 09:59:48,361][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000008831_9042944.pth [2023-03-07 09:59:48,534][175731] Updated weights for policy 0, policy_version 11830 (0.0006) [2023-03-07 09:59:49,323][175731] Updated weights for policy 0, policy_version 11840 (0.0006) [2023-03-07 09:59:50,149][175731] Updated weights for policy 0, policy_version 11850 (0.0006) [2023-03-07 09:59:50,938][175731] Updated weights for policy 0, policy_version 11860 (0.0007) [2023-03-07 09:59:51,739][175731] Updated weights for policy 0, policy_version 11870 (0.0006) [2023-03-07 09:59:52,545][175731] Updated weights for policy 0, policy_version 11880 (0.0007) [2023-03-07 09:59:53,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12748.8, 300 sec: 12787.9). Total num frames: 12174336. Throughput: 0: 12754.2. Samples: 12173973. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:59:53,322][175405] Avg episode reward: [(0, '74.281')] [2023-03-07 09:59:53,345][175731] Updated weights for policy 0, policy_version 11890 (0.0006) [2023-03-07 09:59:54,157][175731] Updated weights for policy 0, policy_version 11900 (0.0007) [2023-03-07 09:59:54,953][175731] Updated weights for policy 0, policy_version 11910 (0.0005) [2023-03-07 09:59:55,762][175731] Updated weights for policy 0, policy_version 11920 (0.0006) [2023-03-07 09:59:56,547][175731] Updated weights for policy 0, policy_version 11930 (0.0007) [2023-03-07 09:59:57,367][175731] Updated weights for policy 0, policy_version 11940 (0.0006) [2023-03-07 09:59:58,162][175731] Updated weights for policy 0, policy_version 11950 (0.0006) [2023-03-07 09:59:58,321][175405] Fps is (10 sec: 12697.8, 60 sec: 12748.8, 300 sec: 12784.4). Total num frames: 12237824. Throughput: 0: 12748.5. Samples: 12212232. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 09:59:58,322][175405] Avg episode reward: [(0, '32.725')] [2023-03-07 09:59:58,959][175731] Updated weights for policy 0, policy_version 11960 (0.0006) [2023-03-07 09:59:59,761][175731] Updated weights for policy 0, policy_version 11970 (0.0006) [2023-03-07 10:00:00,569][175731] Updated weights for policy 0, policy_version 11980 (0.0006) [2023-03-07 10:00:01,370][175731] Updated weights for policy 0, policy_version 11990 (0.0007) [2023-03-07 10:00:02,176][175731] Updated weights for policy 0, policy_version 12000 (0.0006) [2023-03-07 10:00:02,977][175731] Updated weights for policy 0, policy_version 12010 (0.0006) [2023-03-07 10:00:03,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12765.9, 300 sec: 12787.9). Total num frames: 12302336. Throughput: 0: 12746.3. Samples: 12288958. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:00:03,321][175405] Avg episode reward: [(0, '34.898')] [2023-03-07 10:00:03,782][175731] Updated weights for policy 0, policy_version 12020 (0.0006) [2023-03-07 10:00:04,572][175731] Updated weights for policy 0, policy_version 12030 (0.0007) [2023-03-07 10:00:05,367][175731] Updated weights for policy 0, policy_version 12040 (0.0007) [2023-03-07 10:00:06,181][175731] Updated weights for policy 0, policy_version 12050 (0.0006) [2023-03-07 10:00:06,974][175731] Updated weights for policy 0, policy_version 12060 (0.0007) [2023-03-07 10:00:07,790][175731] Updated weights for policy 0, policy_version 12070 (0.0007) [2023-03-07 10:00:08,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12765.9, 300 sec: 12787.9). Total num frames: 12366848. Throughput: 0: 12759.7. Samples: 12365684. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:00:08,322][175405] Avg episode reward: [(0, '34.228')] [2023-03-07 10:00:08,581][175731] Updated weights for policy 0, policy_version 12080 (0.0006) [2023-03-07 10:00:09,381][175731] Updated weights for policy 0, policy_version 12090 (0.0007) [2023-03-07 10:00:10,190][175731] Updated weights for policy 0, policy_version 12100 (0.0006) [2023-03-07 10:00:10,988][175731] Updated weights for policy 0, policy_version 12110 (0.0007) [2023-03-07 10:00:11,786][175731] Updated weights for policy 0, policy_version 12120 (0.0006) [2023-03-07 10:00:12,597][175731] Updated weights for policy 0, policy_version 12130 (0.0007) [2023-03-07 10:00:13,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12765.9, 300 sec: 12784.4). Total num frames: 12430336. Throughput: 0: 12761.5. Samples: 12403996. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:00:13,321][175405] Avg episode reward: [(0, '35.476')] [2023-03-07 10:00:13,413][175731] Updated weights for policy 0, policy_version 12140 (0.0007) [2023-03-07 10:00:14,218][175731] Updated weights for policy 0, policy_version 12150 (0.0007) [2023-03-07 10:00:15,019][175731] Updated weights for policy 0, policy_version 12160 (0.0007) [2023-03-07 10:00:15,828][175731] Updated weights for policy 0, policy_version 12170 (0.0008) [2023-03-07 10:00:16,631][175731] Updated weights for policy 0, policy_version 12180 (0.0006) [2023-03-07 10:00:17,432][175731] Updated weights for policy 0, policy_version 12190 (0.0007) [2023-03-07 10:00:18,230][175731] Updated weights for policy 0, policy_version 12200 (0.0006) [2023-03-07 10:00:18,321][175405] Fps is (10 sec: 12697.4, 60 sec: 12748.8, 300 sec: 12784.4). Total num frames: 12493824. Throughput: 0: 12753.2. Samples: 12480122. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:00:18,322][175405] Avg episode reward: [(0, '36.936')] [2023-03-07 10:00:19,051][175731] Updated weights for policy 0, policy_version 12210 (0.0006) [2023-03-07 10:00:19,839][175731] Updated weights for policy 0, policy_version 12220 (0.0006) [2023-03-07 10:00:20,643][175731] Updated weights for policy 0, policy_version 12230 (0.0006) [2023-03-07 10:00:21,440][175731] Updated weights for policy 0, policy_version 12240 (0.0006) [2023-03-07 10:00:22,234][175731] Updated weights for policy 0, policy_version 12250 (0.0006) [2023-03-07 10:00:23,030][175731] Updated weights for policy 0, policy_version 12260 (0.0006) [2023-03-07 10:00:23,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12748.8, 300 sec: 12780.9). Total num frames: 12557312. Throughput: 0: 12751.0. Samples: 12556831. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:00:23,322][175405] Avg episode reward: [(0, '36.793')] [2023-03-07 10:00:23,825][175731] Updated weights for policy 0, policy_version 12270 (0.0007) [2023-03-07 10:00:24,624][175731] Updated weights for policy 0, policy_version 12280 (0.0006) [2023-03-07 10:00:25,448][175731] Updated weights for policy 0, policy_version 12290 (0.0008) [2023-03-07 10:00:26,261][175731] Updated weights for policy 0, policy_version 12300 (0.0006) [2023-03-07 10:00:27,061][175731] Updated weights for policy 0, policy_version 12310 (0.0006) [2023-03-07 10:00:27,865][175731] Updated weights for policy 0, policy_version 12320 (0.0007) [2023-03-07 10:00:28,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12748.8, 300 sec: 12780.9). Total num frames: 12620800. Throughput: 0: 12749.8. Samples: 12594985. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:00:28,322][175405] Avg episode reward: [(0, '33.010')] [2023-03-07 10:00:28,657][175731] Updated weights for policy 0, policy_version 12330 (0.0007) [2023-03-07 10:00:29,457][175731] Updated weights for policy 0, policy_version 12340 (0.0006) [2023-03-07 10:00:30,266][175731] Updated weights for policy 0, policy_version 12350 (0.0006) [2023-03-07 10:00:31,068][175731] Updated weights for policy 0, policy_version 12360 (0.0006) [2023-03-07 10:00:31,858][175731] Updated weights for policy 0, policy_version 12370 (0.0005) [2023-03-07 10:00:32,654][175731] Updated weights for policy 0, policy_version 12380 (0.0006) [2023-03-07 10:00:33,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12765.9, 300 sec: 12780.9). Total num frames: 12685312. Throughput: 0: 12763.7. Samples: 12671881. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:00:33,322][175405] Avg episode reward: [(0, '37.063')] [2023-03-07 10:00:33,466][175731] Updated weights for policy 0, policy_version 12390 (0.0007) [2023-03-07 10:00:34,282][175731] Updated weights for policy 0, policy_version 12400 (0.0006) [2023-03-07 10:00:35,091][175731] Updated weights for policy 0, policy_version 12410 (0.0006) [2023-03-07 10:00:35,868][175731] Updated weights for policy 0, policy_version 12420 (0.0006) [2023-03-07 10:00:36,689][175731] Updated weights for policy 0, policy_version 12430 (0.0007) [2023-03-07 10:00:37,475][175731] Updated weights for policy 0, policy_version 12440 (0.0006) [2023-03-07 10:00:38,281][175731] Updated weights for policy 0, policy_version 12450 (0.0006) [2023-03-07 10:00:38,321][175405] Fps is (10 sec: 12800.2, 60 sec: 12748.8, 300 sec: 12780.9). Total num frames: 12748800. Throughput: 0: 12759.5. Samples: 12748151. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:00:38,322][175405] Avg episode reward: [(0, '34.615')] [2023-03-07 10:00:39,092][175731] Updated weights for policy 0, policy_version 12460 (0.0006) [2023-03-07 10:00:39,884][175731] Updated weights for policy 0, policy_version 12470 (0.0006) [2023-03-07 10:00:40,702][175731] Updated weights for policy 0, policy_version 12480 (0.0007) [2023-03-07 10:00:41,502][175731] Updated weights for policy 0, policy_version 12490 (0.0007) [2023-03-07 10:00:42,289][175731] Updated weights for policy 0, policy_version 12500 (0.0006) [2023-03-07 10:00:43,090][175731] Updated weights for policy 0, policy_version 12510 (0.0006) [2023-03-07 10:00:43,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12748.8, 300 sec: 12777.4). Total num frames: 12812288. Throughput: 0: 12762.3. Samples: 12786536. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:00:43,322][175405] Avg episode reward: [(0, '33.362')] [2023-03-07 10:00:43,915][175731] Updated weights for policy 0, policy_version 12520 (0.0005) [2023-03-07 10:00:44,713][175731] Updated weights for policy 0, policy_version 12530 (0.0007) [2023-03-07 10:00:45,515][175731] Updated weights for policy 0, policy_version 12540 (0.0007) [2023-03-07 10:00:46,336][175731] Updated weights for policy 0, policy_version 12550 (0.0006) [2023-03-07 10:00:47,111][175731] Updated weights for policy 0, policy_version 12560 (0.0006) [2023-03-07 10:00:47,905][175731] Updated weights for policy 0, policy_version 12570 (0.0006) [2023-03-07 10:00:48,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12765.9, 300 sec: 12780.9). Total num frames: 12876800. Throughput: 0: 12758.5. Samples: 12863093. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:00:48,322][175405] Avg episode reward: [(0, '35.225')] [2023-03-07 10:00:48,726][175731] Updated weights for policy 0, policy_version 12580 (0.0006) [2023-03-07 10:00:49,499][175731] Updated weights for policy 0, policy_version 12590 (0.0006) [2023-03-07 10:00:50,318][175731] Updated weights for policy 0, policy_version 12600 (0.0006) [2023-03-07 10:00:51,102][175731] Updated weights for policy 0, policy_version 12610 (0.0007) [2023-03-07 10:00:51,902][175731] Updated weights for policy 0, policy_version 12620 (0.0006) [2023-03-07 10:00:52,706][175731] Updated weights for policy 0, policy_version 12630 (0.0005) [2023-03-07 10:00:53,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12765.9, 300 sec: 12777.4). Total num frames: 12940288. Throughput: 0: 12755.9. Samples: 12939700. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:00:53,322][175405] Avg episode reward: [(0, '35.706')] [2023-03-07 10:00:53,512][175731] Updated weights for policy 0, policy_version 12640 (0.0006) [2023-03-07 10:00:54,325][175731] Updated weights for policy 0, policy_version 12650 (0.0007) [2023-03-07 10:00:55,131][175731] Updated weights for policy 0, policy_version 12660 (0.0007) [2023-03-07 10:00:55,931][175731] Updated weights for policy 0, policy_version 12670 (0.0006) [2023-03-07 10:00:56,738][175731] Updated weights for policy 0, policy_version 12680 (0.0006) [2023-03-07 10:00:57,516][175731] Updated weights for policy 0, policy_version 12690 (0.0006) [2023-03-07 10:00:58,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12765.9, 300 sec: 12777.4). Total num frames: 13003776. Throughput: 0: 12754.4. Samples: 12977946. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:00:58,332][175405] Avg episode reward: [(0, '35.470')] [2023-03-07 10:00:58,359][175731] Updated weights for policy 0, policy_version 12700 (0.0006) [2023-03-07 10:00:59,145][175731] Updated weights for policy 0, policy_version 12710 (0.0006) [2023-03-07 10:00:59,938][175731] Updated weights for policy 0, policy_version 12720 (0.0007) [2023-03-07 10:01:00,742][175731] Updated weights for policy 0, policy_version 12730 (0.0006) [2023-03-07 10:01:01,530][175731] Updated weights for policy 0, policy_version 12740 (0.0006) [2023-03-07 10:01:02,327][175731] Updated weights for policy 0, policy_version 12750 (0.0007) [2023-03-07 10:01:03,130][175731] Updated weights for policy 0, policy_version 12760 (0.0006) [2023-03-07 10:01:03,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12765.8, 300 sec: 12777.4). Total num frames: 13068288. Throughput: 0: 12769.8. Samples: 13054764. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:01:03,332][175405] Avg episode reward: [(0, '36.989')] [2023-03-07 10:01:03,919][175731] Updated weights for policy 0, policy_version 12770 (0.0006) [2023-03-07 10:01:04,726][175731] Updated weights for policy 0, policy_version 12780 (0.0007) [2023-03-07 10:01:05,560][175731] Updated weights for policy 0, policy_version 12790 (0.0006) [2023-03-07 10:01:06,349][175731] Updated weights for policy 0, policy_version 12800 (0.0006) [2023-03-07 10:01:07,158][175731] Updated weights for policy 0, policy_version 12810 (0.0006) [2023-03-07 10:01:07,963][175731] Updated weights for policy 0, policy_version 12820 (0.0007) [2023-03-07 10:01:08,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12777.4). Total num frames: 13131776. Throughput: 0: 12765.4. Samples: 13131272. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:01:08,332][175405] Avg episode reward: [(0, '32.086')] [2023-03-07 10:01:08,760][175731] Updated weights for policy 0, policy_version 12830 (0.0006) [2023-03-07 10:01:09,570][175731] Updated weights for policy 0, policy_version 12840 (0.0006) [2023-03-07 10:01:10,389][175731] Updated weights for policy 0, policy_version 12850 (0.0006) [2023-03-07 10:01:11,182][175731] Updated weights for policy 0, policy_version 12860 (0.0007) [2023-03-07 10:01:11,982][175731] Updated weights for policy 0, policy_version 12870 (0.0007) [2023-03-07 10:01:12,778][175731] Updated weights for policy 0, policy_version 12880 (0.0005) [2023-03-07 10:01:13,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12748.8, 300 sec: 12774.0). Total num frames: 13195264. Throughput: 0: 12766.2. Samples: 13169463. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:01:13,332][175405] Avg episode reward: [(0, '35.168')] [2023-03-07 10:01:13,568][175731] Updated weights for policy 0, policy_version 12890 (0.0006) [2023-03-07 10:01:14,389][175731] Updated weights for policy 0, policy_version 12900 (0.0007) [2023-03-07 10:01:15,207][175731] Updated weights for policy 0, policy_version 12910 (0.0006) [2023-03-07 10:01:16,014][175731] Updated weights for policy 0, policy_version 12920 (0.0007) [2023-03-07 10:01:16,813][175731] Updated weights for policy 0, policy_version 12930 (0.0006) [2023-03-07 10:01:17,629][175731] Updated weights for policy 0, policy_version 12940 (0.0006) [2023-03-07 10:01:18,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12748.8, 300 sec: 12774.0). Total num frames: 13258752. Throughput: 0: 12750.6. Samples: 13245658. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:01:18,332][175405] Avg episode reward: [(0, '38.929')] [2023-03-07 10:01:18,426][175731] Updated weights for policy 0, policy_version 12950 (0.0006) [2023-03-07 10:01:19,228][175731] Updated weights for policy 0, policy_version 12960 (0.0007) [2023-03-07 10:01:20,040][175731] Updated weights for policy 0, policy_version 12970 (0.0006) [2023-03-07 10:01:20,828][175731] Updated weights for policy 0, policy_version 12980 (0.0006) [2023-03-07 10:01:21,634][175731] Updated weights for policy 0, policy_version 12990 (0.0006) [2023-03-07 10:01:22,438][175731] Updated weights for policy 0, policy_version 13000 (0.0007) [2023-03-07 10:01:23,251][175731] Updated weights for policy 0, policy_version 13010 (0.0006) [2023-03-07 10:01:23,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12765.9, 300 sec: 12774.0). Total num frames: 13323264. Throughput: 0: 12758.9. Samples: 13322299. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:01:23,332][175405] Avg episode reward: [(0, '37.957')] [2023-03-07 10:01:24,040][175731] Updated weights for policy 0, policy_version 13020 (0.0007) [2023-03-07 10:01:24,827][175731] Updated weights for policy 0, policy_version 13030 (0.0007) [2023-03-07 10:01:25,629][175731] Updated weights for policy 0, policy_version 13040 (0.0007) [2023-03-07 10:01:26,445][175731] Updated weights for policy 0, policy_version 13050 (0.0006) [2023-03-07 10:01:27,242][175731] Updated weights for policy 0, policy_version 13060 (0.0006) [2023-03-07 10:01:28,052][175731] Updated weights for policy 0, policy_version 13070 (0.0007) [2023-03-07 10:01:28,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12765.9, 300 sec: 12774.0). Total num frames: 13386752. Throughput: 0: 12759.1. Samples: 13360697. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:01:28,322][175405] Avg episode reward: [(0, '33.422')] [2023-03-07 10:01:28,844][175731] Updated weights for policy 0, policy_version 13080 (0.0007) [2023-03-07 10:01:29,653][175731] Updated weights for policy 0, policy_version 13090 (0.0007) [2023-03-07 10:01:30,455][175731] Updated weights for policy 0, policy_version 13100 (0.0006) [2023-03-07 10:01:31,249][175731] Updated weights for policy 0, policy_version 13110 (0.0007) [2023-03-07 10:01:32,053][175731] Updated weights for policy 0, policy_version 13120 (0.0006) [2023-03-07 10:01:32,854][175731] Updated weights for policy 0, policy_version 13130 (0.0006) [2023-03-07 10:01:33,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12748.8, 300 sec: 12770.5). Total num frames: 13450240. Throughput: 0: 12758.6. Samples: 13437230. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:01:33,322][175405] Avg episode reward: [(0, '35.077')] [2023-03-07 10:01:33,651][175731] Updated weights for policy 0, policy_version 13140 (0.0007) [2023-03-07 10:01:34,470][175731] Updated weights for policy 0, policy_version 13150 (0.0007) [2023-03-07 10:01:35,267][175731] Updated weights for policy 0, policy_version 13160 (0.0006) [2023-03-07 10:01:36,066][175731] Updated weights for policy 0, policy_version 13170 (0.0007) [2023-03-07 10:01:36,877][175731] Updated weights for policy 0, policy_version 13180 (0.0006) [2023-03-07 10:01:37,665][175731] Updated weights for policy 0, policy_version 13190 (0.0007) [2023-03-07 10:01:38,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12765.9, 300 sec: 12774.0). Total num frames: 13514752. Throughput: 0: 12757.7. Samples: 13513795. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:01:38,322][175405] Avg episode reward: [(0, '35.245')] [2023-03-07 10:01:38,481][175731] Updated weights for policy 0, policy_version 13200 (0.0006) [2023-03-07 10:01:39,266][175731] Updated weights for policy 0, policy_version 13210 (0.0006) [2023-03-07 10:01:40,069][175731] Updated weights for policy 0, policy_version 13220 (0.0006) [2023-03-07 10:01:40,865][175731] Updated weights for policy 0, policy_version 13230 (0.0005) [2023-03-07 10:01:41,669][175731] Updated weights for policy 0, policy_version 13240 (0.0007) [2023-03-07 10:01:42,453][175731] Updated weights for policy 0, policy_version 13250 (0.0006) [2023-03-07 10:01:43,257][175731] Updated weights for policy 0, policy_version 13260 (0.0006) [2023-03-07 10:01:43,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12765.9, 300 sec: 12774.0). Total num frames: 13578240. Throughput: 0: 12763.5. Samples: 13552304. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:01:43,322][175405] Avg episode reward: [(0, '35.535')] [2023-03-07 10:01:44,059][175731] Updated weights for policy 0, policy_version 13270 (0.0006) [2023-03-07 10:01:44,858][175731] Updated weights for policy 0, policy_version 13280 (0.0006) [2023-03-07 10:01:45,647][175731] Updated weights for policy 0, policy_version 13290 (0.0006) [2023-03-07 10:01:46,460][175731] Updated weights for policy 0, policy_version 13300 (0.0006) [2023-03-07 10:01:47,245][175731] Updated weights for policy 0, policy_version 13310 (0.0008) [2023-03-07 10:01:48,074][175731] Updated weights for policy 0, policy_version 13320 (0.0007) [2023-03-07 10:01:48,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12765.9, 300 sec: 12777.4). Total num frames: 13642752. Throughput: 0: 12770.0. Samples: 13629412. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:01:48,322][175405] Avg episode reward: [(0, '35.313')] [2023-03-07 10:01:48,326][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000013323_13642752.pth... [2023-03-07 10:01:48,356][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000010330_10577920.pth [2023-03-07 10:01:48,868][175731] Updated weights for policy 0, policy_version 13330 (0.0007) [2023-03-07 10:01:49,654][175731] Updated weights for policy 0, policy_version 13340 (0.0006) [2023-03-07 10:01:50,465][175731] Updated weights for policy 0, policy_version 13350 (0.0006) [2023-03-07 10:01:51,249][175731] Updated weights for policy 0, policy_version 13360 (0.0006) [2023-03-07 10:01:52,058][175731] Updated weights for policy 0, policy_version 13370 (0.0005) [2023-03-07 10:01:52,855][175731] Updated weights for policy 0, policy_version 13380 (0.0006) [2023-03-07 10:01:53,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12765.9, 300 sec: 12774.0). Total num frames: 13706240. Throughput: 0: 12768.1. Samples: 13705835. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:01:53,322][175405] Avg episode reward: [(0, '38.163')] [2023-03-07 10:01:53,675][175731] Updated weights for policy 0, policy_version 13390 (0.0006) [2023-03-07 10:01:54,476][175731] Updated weights for policy 0, policy_version 13400 (0.0007) [2023-03-07 10:01:55,281][175731] Updated weights for policy 0, policy_version 13410 (0.0006) [2023-03-07 10:01:56,071][175731] Updated weights for policy 0, policy_version 13420 (0.0007) [2023-03-07 10:01:56,878][175731] Updated weights for policy 0, policy_version 13430 (0.0006) [2023-03-07 10:01:57,678][175731] Updated weights for policy 0, policy_version 13440 (0.0006) [2023-03-07 10:01:58,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12765.9, 300 sec: 12770.5). Total num frames: 13769728. Throughput: 0: 12770.3. 
Samples: 13744128. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:01:58,322][175405] Avg episode reward: [(0, '59.855')] [2023-03-07 10:01:58,484][175731] Updated weights for policy 0, policy_version 13450 (0.0007) [2023-03-07 10:01:59,295][175731] Updated weights for policy 0, policy_version 13460 (0.0006) [2023-03-07 10:02:00,081][175731] Updated weights for policy 0, policy_version 13470 (0.0006) [2023-03-07 10:02:00,905][175731] Updated weights for policy 0, policy_version 13480 (0.0007) [2023-03-07 10:02:01,693][175731] Updated weights for policy 0, policy_version 13490 (0.0007) [2023-03-07 10:02:02,483][175731] Updated weights for policy 0, policy_version 13500 (0.0008) [2023-03-07 10:02:03,306][175731] Updated weights for policy 0, policy_version 13510 (0.0007) [2023-03-07 10:02:03,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12765.9, 300 sec: 12774.0). Total num frames: 13834240. Throughput: 0: 12782.2. Samples: 13820857. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:02:03,321][175405] Avg episode reward: [(0, '33.199')] [2023-03-07 10:02:04,097][175731] Updated weights for policy 0, policy_version 13520 (0.0007) [2023-03-07 10:02:04,883][175731] Updated weights for policy 0, policy_version 13530 (0.0007) [2023-03-07 10:02:05,685][175731] Updated weights for policy 0, policy_version 13540 (0.0006) [2023-03-07 10:02:06,474][175731] Updated weights for policy 0, policy_version 13550 (0.0006) [2023-03-07 10:02:07,264][175731] Updated weights for policy 0, policy_version 13560 (0.0008) [2023-03-07 10:02:08,062][175731] Updated weights for policy 0, policy_version 13570 (0.0006) [2023-03-07 10:02:08,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12782.9, 300 sec: 12777.4). Total num frames: 13898752. Throughput: 0: 12790.5. Samples: 13897873. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:02:08,322][175405] Avg episode reward: [(0, '38.053')] [2023-03-07 10:02:08,851][175731] Updated weights for policy 0, policy_version 13580 (0.0006) [2023-03-07 10:02:09,672][175731] Updated weights for policy 0, policy_version 13590 (0.0007) [2023-03-07 10:02:10,466][175731] Updated weights for policy 0, policy_version 13600 (0.0006) [2023-03-07 10:02:11,262][175731] Updated weights for policy 0, policy_version 13610 (0.0007) [2023-03-07 10:02:12,069][175731] Updated weights for policy 0, policy_version 13620 (0.0006) [2023-03-07 10:02:12,883][175731] Updated weights for policy 0, policy_version 13630 (0.0006) [2023-03-07 10:02:13,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12782.9, 300 sec: 12774.0). Total num frames: 13962240. Throughput: 0: 12792.3. Samples: 13936349. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:02:13,322][175405] Avg episode reward: [(0, '37.034')] [2023-03-07 10:02:13,691][175731] Updated weights for policy 0, policy_version 13640 (0.0007) [2023-03-07 10:02:14,486][175731] Updated weights for policy 0, policy_version 13650 (0.0007) [2023-03-07 10:02:15,289][175731] Updated weights for policy 0, policy_version 13660 (0.0006) [2023-03-07 10:02:16,098][175731] Updated weights for policy 0, policy_version 13670 (0.0005) [2023-03-07 10:02:16,895][175731] Updated weights for policy 0, policy_version 13680 (0.0006) [2023-03-07 10:02:17,694][175731] Updated weights for policy 0, policy_version 13690 (0.0006) [2023-03-07 10:02:18,321][175405] Fps is (10 sec: 12697.8, 60 sec: 12783.0, 300 sec: 12774.0). Total num frames: 14025728. Throughput: 0: 12788.7. Samples: 14012719. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:02:18,322][175405] Avg episode reward: [(0, '32.901')] [2023-03-07 10:02:18,501][175731] Updated weights for policy 0, policy_version 13700 (0.0006) [2023-03-07 10:02:19,303][175731] Updated weights for policy 0, policy_version 13710 (0.0006) [2023-03-07 10:02:20,094][175731] Updated weights for policy 0, policy_version 13720 (0.0006) [2023-03-07 10:02:20,894][175731] Updated weights for policy 0, policy_version 13730 (0.0007) [2023-03-07 10:02:21,688][175731] Updated weights for policy 0, policy_version 13740 (0.0007) [2023-03-07 10:02:22,508][175731] Updated weights for policy 0, policy_version 13750 (0.0006) [2023-03-07 10:02:23,303][175731] Updated weights for policy 0, policy_version 13760 (0.0006) [2023-03-07 10:02:23,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12782.9, 300 sec: 12777.4). Total num frames: 14090240. Throughput: 0: 12793.2. Samples: 14089490. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:02:23,322][175405] Avg episode reward: [(0, '32.889')] [2023-03-07 10:02:23,532][175680] KL-divergence is very high: 139.2737 [2023-03-07 10:02:24,118][175731] Updated weights for policy 0, policy_version 13770 (0.0007) [2023-03-07 10:02:24,902][175731] Updated weights for policy 0, policy_version 13780 (0.0007) [2023-03-07 10:02:25,724][175731] Updated weights for policy 0, policy_version 13790 (0.0006) [2023-03-07 10:02:26,515][175731] Updated weights for policy 0, policy_version 13800 (0.0007) [2023-03-07 10:02:27,318][175731] Updated weights for policy 0, policy_version 13810 (0.0007) [2023-03-07 10:02:28,129][175731] Updated weights for policy 0, policy_version 13820 (0.0007) [2023-03-07 10:02:28,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12782.9, 300 sec: 12774.0). Total num frames: 14153728. Throughput: 0: 12786.9. Samples: 14127712. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:02:28,322][175405] Avg episode reward: [(0, '33.609')] [2023-03-07 10:02:28,949][175731] Updated weights for policy 0, policy_version 13830 (0.0007) [2023-03-07 10:02:29,734][175731] Updated weights for policy 0, policy_version 13840 (0.0007) [2023-03-07 10:02:30,542][175731] Updated weights for policy 0, policy_version 13850 (0.0007) [2023-03-07 10:02:31,351][175731] Updated weights for policy 0, policy_version 13860 (0.0006) [2023-03-07 10:02:32,142][175731] Updated weights for policy 0, policy_version 13870 (0.0007) [2023-03-07 10:02:32,947][175731] Updated weights for policy 0, policy_version 13880 (0.0006) [2023-03-07 10:02:33,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12782.9, 300 sec: 12774.0). Total num frames: 14217216. Throughput: 0: 12769.3. Samples: 14204032. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:02:33,322][175405] Avg episode reward: [(0, '32.785')] [2023-03-07 10:02:33,758][175731] Updated weights for policy 0, policy_version 13890 (0.0006) [2023-03-07 10:02:34,537][175731] Updated weights for policy 0, policy_version 13900 (0.0006) [2023-03-07 10:02:35,357][175731] Updated weights for policy 0, policy_version 13910 (0.0006) [2023-03-07 10:02:36,142][175731] Updated weights for policy 0, policy_version 13920 (0.0006) [2023-03-07 10:02:36,953][175731] Updated weights for policy 0, policy_version 13930 (0.0006) [2023-03-07 10:02:37,749][175731] Updated weights for policy 0, policy_version 13940 (0.0008) [2023-03-07 10:02:38,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12782.9, 300 sec: 12777.4). Total num frames: 14281728. Throughput: 0: 12781.5. Samples: 14281003. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:02:38,322][175405] Avg episode reward: [(0, '34.575')] [2023-03-07 10:02:38,529][175731] Updated weights for policy 0, policy_version 13950 (0.0007) [2023-03-07 10:02:39,325][175731] Updated weights for policy 0, policy_version 13960 (0.0006) [2023-03-07 10:02:40,126][175731] Updated weights for policy 0, policy_version 13970 (0.0006) [2023-03-07 10:02:40,953][175731] Updated weights for policy 0, policy_version 13980 (0.0006) [2023-03-07 10:02:41,773][175731] Updated weights for policy 0, policy_version 13990 (0.0007) [2023-03-07 10:02:42,581][175731] Updated weights for policy 0, policy_version 14000 (0.0006) [2023-03-07 10:02:43,321][175405] Fps is (10 sec: 12800.2, 60 sec: 12783.0, 300 sec: 12774.0). Total num frames: 14345216. Throughput: 0: 12777.7. Samples: 14319123. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:02:43,321][175405] Avg episode reward: [(0, '35.506')] [2023-03-07 10:02:43,379][175731] Updated weights for policy 0, policy_version 14010 (0.0006) [2023-03-07 10:02:44,184][175731] Updated weights for policy 0, policy_version 14020 (0.0006) [2023-03-07 10:02:44,994][175731] Updated weights for policy 0, policy_version 14030 (0.0007) [2023-03-07 10:02:45,813][175731] Updated weights for policy 0, policy_version 14040 (0.0006) [2023-03-07 10:02:46,599][175731] Updated weights for policy 0, policy_version 14050 (0.0006) [2023-03-07 10:02:47,389][175731] Updated weights for policy 0, policy_version 14060 (0.0006) [2023-03-07 10:02:48,204][175731] Updated weights for policy 0, policy_version 14070 (0.0006) [2023-03-07 10:02:48,321][175405] Fps is (10 sec: 12697.8, 60 sec: 12765.9, 300 sec: 12770.5). Total num frames: 14408704. Throughput: 0: 12770.0. Samples: 14395508. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:02:48,321][175405] Avg episode reward: [(0, '32.259')] [2023-03-07 10:02:49,005][175731] Updated weights for policy 0, policy_version 14080 (0.0006) [2023-03-07 10:02:49,803][175731] Updated weights for policy 0, policy_version 14090 (0.0006) [2023-03-07 10:02:50,609][175731] Updated weights for policy 0, policy_version 14100 (0.0006) [2023-03-07 10:02:51,398][175731] Updated weights for policy 0, policy_version 14110 (0.0007) [2023-03-07 10:02:52,180][175731] Updated weights for policy 0, policy_version 14120 (0.0006) [2023-03-07 10:02:52,998][175731] Updated weights for policy 0, policy_version 14130 (0.0007) [2023-03-07 10:02:53,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12782.9, 300 sec: 12770.5). Total num frames: 14473216. Throughput: 0: 12764.9. Samples: 14472294. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:02:53,322][175405] Avg episode reward: [(0, '51.736')] [2023-03-07 10:02:53,797][175731] Updated weights for policy 0, policy_version 14140 (0.0007) [2023-03-07 10:02:54,588][175731] Updated weights for policy 0, policy_version 14150 (0.0007) [2023-03-07 10:02:55,376][175731] Updated weights for policy 0, policy_version 14160 (0.0007) [2023-03-07 10:02:56,202][175731] Updated weights for policy 0, policy_version 14170 (0.0006) [2023-03-07 10:02:56,988][175731] Updated weights for policy 0, policy_version 14180 (0.0007) [2023-03-07 10:02:57,785][175731] Updated weights for policy 0, policy_version 14190 (0.0006) [2023-03-07 10:02:58,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12782.9, 300 sec: 12770.5). Total num frames: 14536704. Throughput: 0: 12764.3. Samples: 14510742. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:02:58,322][175405] Avg episode reward: [(0, '34.935')] [2023-03-07 10:02:58,585][175731] Updated weights for policy 0, policy_version 14200 (0.0006) [2023-03-07 10:02:59,405][175731] Updated weights for policy 0, policy_version 14210 (0.0007) [2023-03-07 10:03:00,218][175731] Updated weights for policy 0, policy_version 14220 (0.0008) [2023-03-07 10:03:01,022][175731] Updated weights for policy 0, policy_version 14230 (0.0006) [2023-03-07 10:03:01,829][175731] Updated weights for policy 0, policy_version 14240 (0.0007) [2023-03-07 10:03:02,632][175731] Updated weights for policy 0, policy_version 14250 (0.0007) [2023-03-07 10:03:03,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12765.9, 300 sec: 12770.5). Total num frames: 14600192. Throughput: 0: 12763.8. Samples: 14587091. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:03:03,322][175405] Avg episode reward: [(0, '37.504')] [2023-03-07 10:03:03,439][175731] Updated weights for policy 0, policy_version 14260 (0.0006) [2023-03-07 10:03:04,234][175731] Updated weights for policy 0, policy_version 14270 (0.0006) [2023-03-07 10:03:05,026][175731] Updated weights for policy 0, policy_version 14280 (0.0006) [2023-03-07 10:03:05,841][175731] Updated weights for policy 0, policy_version 14290 (0.0006) [2023-03-07 10:03:06,637][175731] Updated weights for policy 0, policy_version 14300 (0.0007) [2023-03-07 10:03:07,425][175731] Updated weights for policy 0, policy_version 14310 (0.0006) [2023-03-07 10:03:08,253][175731] Updated weights for policy 0, policy_version 14320 (0.0006) [2023-03-07 10:03:08,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12748.8, 300 sec: 12767.0). Total num frames: 14663680. Throughput: 0: 12757.0. Samples: 14663554. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:03:08,322][175405] Avg episode reward: [(0, '32.864')] [2023-03-07 10:03:09,045][175731] Updated weights for policy 0, policy_version 14330 (0.0006) [2023-03-07 10:03:09,854][175731] Updated weights for policy 0, policy_version 14340 (0.0006) [2023-03-07 10:03:10,662][175731] Updated weights for policy 0, policy_version 14350 (0.0006) [2023-03-07 10:03:11,454][175731] Updated weights for policy 0, policy_version 14360 (0.0007) [2023-03-07 10:03:12,246][175731] Updated weights for policy 0, policy_version 14370 (0.0006) [2023-03-07 10:03:13,057][175731] Updated weights for policy 0, policy_version 14380 (0.0006) [2023-03-07 10:03:13,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12765.9, 300 sec: 12767.0). Total num frames: 14728192. Throughput: 0: 12758.9. Samples: 14701864. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:03:13,322][175405] Avg episode reward: [(0, '34.359')] [2023-03-07 10:03:13,864][175731] Updated weights for policy 0, policy_version 14390 (0.0006) [2023-03-07 10:03:14,670][175731] Updated weights for policy 0, policy_version 14400 (0.0006) [2023-03-07 10:03:15,471][175731] Updated weights for policy 0, policy_version 14410 (0.0006) [2023-03-07 10:03:16,259][175731] Updated weights for policy 0, policy_version 14420 (0.0006) [2023-03-07 10:03:17,059][175731] Updated weights for policy 0, policy_version 14430 (0.0006) [2023-03-07 10:03:17,859][175731] Updated weights for policy 0, policy_version 14440 (0.0006) [2023-03-07 10:03:18,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12765.9, 300 sec: 12767.0). Total num frames: 14791680. Throughput: 0: 12768.9. Samples: 14778630. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:03:18,322][175405] Avg episode reward: [(0, '32.620')] [2023-03-07 10:03:18,643][175731] Updated weights for policy 0, policy_version 14450 (0.0006) [2023-03-07 10:03:19,442][175731] Updated weights for policy 0, policy_version 14460 (0.0006) [2023-03-07 10:03:20,230][175731] Updated weights for policy 0, policy_version 14470 (0.0006) [2023-03-07 10:03:21,025][175731] Updated weights for policy 0, policy_version 14480 (0.0007) [2023-03-07 10:03:21,826][175731] Updated weights for policy 0, policy_version 14490 (0.0006) [2023-03-07 10:03:22,633][175731] Updated weights for policy 0, policy_version 14500 (0.0007) [2023-03-07 10:03:23,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12765.9, 300 sec: 12767.0). Total num frames: 14856192. Throughput: 0: 12776.6. Samples: 14855946. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:03:23,322][175405] Avg episode reward: [(0, '33.602')] [2023-03-07 10:03:23,424][175731] Updated weights for policy 0, policy_version 14510 (0.0006) [2023-03-07 10:03:24,221][175731] Updated weights for policy 0, policy_version 14520 (0.0007) [2023-03-07 10:03:25,033][175731] Updated weights for policy 0, policy_version 14530 (0.0008) [2023-03-07 10:03:25,840][175731] Updated weights for policy 0, policy_version 14540 (0.0006) [2023-03-07 10:03:26,637][175731] Updated weights for policy 0, policy_version 14550 (0.0006) [2023-03-07 10:03:27,447][175731] Updated weights for policy 0, policy_version 14560 (0.0006) [2023-03-07 10:03:28,260][175731] Updated weights for policy 0, policy_version 14570 (0.0007) [2023-03-07 10:03:28,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12765.9, 300 sec: 12767.0). Total num frames: 14919680. Throughput: 0: 12777.6. Samples: 14894115. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:03:28,322][175405] Avg episode reward: [(0, '34.077')] [2023-03-07 10:03:29,064][175731] Updated weights for policy 0, policy_version 14580 (0.0006) [2023-03-07 10:03:29,877][175731] Updated weights for policy 0, policy_version 14590 (0.0006) [2023-03-07 10:03:30,671][175731] Updated weights for policy 0, policy_version 14600 (0.0006) [2023-03-07 10:03:31,453][175731] Updated weights for policy 0, policy_version 14610 (0.0006) [2023-03-07 10:03:32,247][175731] Updated weights for policy 0, policy_version 14620 (0.0006) [2023-03-07 10:03:33,048][175731] Updated weights for policy 0, policy_version 14630 (0.0007) [2023-03-07 10:03:33,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12782.9, 300 sec: 12767.0). Total num frames: 14984192. Throughput: 0: 12786.1. Samples: 14970885. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:03:33,321][175405] Avg episode reward: [(0, '33.760')] [2023-03-07 10:03:33,846][175731] Updated weights for policy 0, policy_version 14640 (0.0006) [2023-03-07 10:03:34,630][175731] Updated weights for policy 0, policy_version 14650 (0.0006) [2023-03-07 10:03:35,432][175731] Updated weights for policy 0, policy_version 14660 (0.0007) [2023-03-07 10:03:36,234][175731] Updated weights for policy 0, policy_version 14670 (0.0007) [2023-03-07 10:03:37,042][175731] Updated weights for policy 0, policy_version 14680 (0.0006) [2023-03-07 10:03:37,839][175731] Updated weights for policy 0, policy_version 14690 (0.0006) [2023-03-07 10:03:38,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12765.9, 300 sec: 12767.0). Total num frames: 15047680. Throughput: 0: 12786.1. Samples: 15047666. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:03:38,321][175405] Avg episode reward: [(0, '55.083')] [2023-03-07 10:03:38,640][175731] Updated weights for policy 0, policy_version 14700 (0.0006) [2023-03-07 10:03:39,452][175731] Updated weights for policy 0, policy_version 14710 (0.0007) [2023-03-07 10:03:40,240][175731] Updated weights for policy 0, policy_version 14720 (0.0006) [2023-03-07 10:03:41,057][175731] Updated weights for policy 0, policy_version 14730 (0.0006) [2023-03-07 10:03:41,861][175731] Updated weights for policy 0, policy_version 14740 (0.0005) [2023-03-07 10:03:42,639][175731] Updated weights for policy 0, policy_version 14750 (0.0007) [2023-03-07 10:03:43,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12782.9, 300 sec: 12770.5). Total num frames: 15112192. Throughput: 0: 12779.7. Samples: 15085830. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:03:43,322][175405] Avg episode reward: [(0, '32.176')] [2023-03-07 10:03:43,463][175731] Updated weights for policy 0, policy_version 14760 (0.0006) [2023-03-07 10:03:44,248][175731] Updated weights for policy 0, policy_version 14770 (0.0007) [2023-03-07 10:03:45,041][175731] Updated weights for policy 0, policy_version 14780 (0.0006) [2023-03-07 10:03:45,856][175731] Updated weights for policy 0, policy_version 14790 (0.0006) [2023-03-07 10:03:46,645][175731] Updated weights for policy 0, policy_version 14800 (0.0006) [2023-03-07 10:03:47,438][175731] Updated weights for policy 0, policy_version 14810 (0.0006) [2023-03-07 10:03:48,242][175731] Updated weights for policy 0, policy_version 14820 (0.0006) [2023-03-07 10:03:48,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12782.9, 300 sec: 12767.0). Total num frames: 15175680. Throughput: 0: 12792.1. Samples: 15162736. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:03:48,321][175405] Avg episode reward: [(0, '34.771')] [2023-03-07 10:03:48,325][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000014820_15175680.pth... [2023-03-07 10:03:48,355][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000011827_12110848.pth [2023-03-07 10:03:49,046][175731] Updated weights for policy 0, policy_version 14830 (0.0006) [2023-03-07 10:03:49,851][175731] Updated weights for policy 0, policy_version 14840 (0.0006) [2023-03-07 10:03:50,659][175731] Updated weights for policy 0, policy_version 14850 (0.0006) [2023-03-07 10:03:51,467][175731] Updated weights for policy 0, policy_version 14860 (0.0006) [2023-03-07 10:03:52,261][175731] Updated weights for policy 0, policy_version 14870 (0.0006) [2023-03-07 10:03:53,071][175731] Updated weights for policy 0, policy_version 14880 (0.0008) [2023-03-07 10:03:53,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12765.9, 300 sec: 12767.0). Total num frames: 15239168. Throughput: 0: 12794.7. Samples: 15239314. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:03:53,322][175405] Avg episode reward: [(0, '36.833')] [2023-03-07 10:03:53,870][175731] Updated weights for policy 0, policy_version 14890 (0.0006) [2023-03-07 10:03:54,667][175731] Updated weights for policy 0, policy_version 14900 (0.0006) [2023-03-07 10:03:55,476][175731] Updated weights for policy 0, policy_version 14910 (0.0006) [2023-03-07 10:03:56,278][175731] Updated weights for policy 0, policy_version 14920 (0.0006) [2023-03-07 10:03:57,074][175731] Updated weights for policy 0, policy_version 14930 (0.0007) [2023-03-07 10:03:57,884][175731] Updated weights for policy 0, policy_version 14940 (0.0007) [2023-03-07 10:03:58,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12782.9, 300 sec: 12770.5). Total num frames: 15303680. Throughput: 0: 12793.2. 
Samples: 15277558. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:03:58,322][175405] Avg episode reward: [(0, '39.035')] [2023-03-07 10:03:58,685][175731] Updated weights for policy 0, policy_version 14950 (0.0007) [2023-03-07 10:03:59,499][175731] Updated weights for policy 0, policy_version 14960 (0.0007) [2023-03-07 10:04:00,306][175731] Updated weights for policy 0, policy_version 14970 (0.0006) [2023-03-07 10:04:01,118][175731] Updated weights for policy 0, policy_version 14980 (0.0007) [2023-03-07 10:04:01,909][175731] Updated weights for policy 0, policy_version 14990 (0.0007) [2023-03-07 10:04:02,717][175731] Updated weights for policy 0, policy_version 15000 (0.0006) [2023-03-07 10:04:03,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12782.9, 300 sec: 12767.0). Total num frames: 15367168. Throughput: 0: 12783.5. Samples: 15353886. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:04:03,322][175405] Avg episode reward: [(0, '71.144')] [2023-03-07 10:04:03,521][175731] Updated weights for policy 0, policy_version 15010 (0.0006) [2023-03-07 10:04:04,336][175731] Updated weights for policy 0, policy_version 15020 (0.0007) [2023-03-07 10:04:05,142][175731] Updated weights for policy 0, policy_version 15030 (0.0007) [2023-03-07 10:04:05,947][175731] Updated weights for policy 0, policy_version 15040 (0.0007) [2023-03-07 10:04:06,761][175731] Updated weights for policy 0, policy_version 15050 (0.0007) [2023-03-07 10:04:07,571][175731] Updated weights for policy 0, policy_version 15060 (0.0006) [2023-03-07 10:04:08,321][175405] Fps is (10 sec: 12697.8, 60 sec: 12783.0, 300 sec: 12767.0). Total num frames: 15430656. Throughput: 0: 12758.5. Samples: 15430080. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 10:04:08,321][175405] Avg episode reward: [(0, '35.328')] [2023-03-07 10:04:08,372][175731] Updated weights for policy 0, policy_version 15070 (0.0006) [2023-03-07 10:04:09,188][175731] Updated weights for policy 0, policy_version 15080 (0.0007) [2023-03-07 10:04:09,993][175731] Updated weights for policy 0, policy_version 15090 (0.0006) [2023-03-07 10:04:10,789][175731] Updated weights for policy 0, policy_version 15100 (0.0006) [2023-03-07 10:04:11,599][175731] Updated weights for policy 0, policy_version 15110 (0.0008) [2023-03-07 10:04:12,383][175731] Updated weights for policy 0, policy_version 15120 (0.0007) [2023-03-07 10:04:13,192][175731] Updated weights for policy 0, policy_version 15130 (0.0007) [2023-03-07 10:04:13,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12765.9, 300 sec: 12763.6). Total num frames: 15494144. Throughput: 0: 12756.6. Samples: 15468162. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 10:04:13,321][175405] Avg episode reward: [(0, '34.960')] [2023-03-07 10:04:14,000][175731] Updated weights for policy 0, policy_version 15140 (0.0005) [2023-03-07 10:04:14,802][175731] Updated weights for policy 0, policy_version 15150 (0.0007) [2023-03-07 10:04:15,607][175731] Updated weights for policy 0, policy_version 15160 (0.0007) [2023-03-07 10:04:16,407][175731] Updated weights for policy 0, policy_version 15170 (0.0006) [2023-03-07 10:04:17,210][175731] Updated weights for policy 0, policy_version 15180 (0.0006) [2023-03-07 10:04:17,993][175731] Updated weights for policy 0, policy_version 15190 (0.0007) [2023-03-07 10:04:18,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12765.9, 300 sec: 12763.5). Total num frames: 15557632. Throughput: 0: 12751.8. Samples: 15544717. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:04:18,322][175405] Avg episode reward: [(0, '35.292')] [2023-03-07 10:04:18,800][175731] Updated weights for policy 0, policy_version 15200 (0.0007) [2023-03-07 10:04:19,625][175731] Updated weights for policy 0, policy_version 15210 (0.0007) [2023-03-07 10:04:20,420][175731] Updated weights for policy 0, policy_version 15220 (0.0007) [2023-03-07 10:04:21,217][175731] Updated weights for policy 0, policy_version 15230 (0.0006) [2023-03-07 10:04:22,044][175731] Updated weights for policy 0, policy_version 15240 (0.0008) [2023-03-07 10:04:22,831][175731] Updated weights for policy 0, policy_version 15250 (0.0006) [2023-03-07 10:04:23,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12765.9, 300 sec: 12767.0). Total num frames: 15622144. Throughput: 0: 12743.6. Samples: 15621129. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:04:23,321][175405] Avg episode reward: [(0, '36.345')] [2023-03-07 10:04:23,635][175731] Updated weights for policy 0, policy_version 15260 (0.0006) [2023-03-07 10:04:24,461][175731] Updated weights for policy 0, policy_version 15270 (0.0007) [2023-03-07 10:04:25,252][175731] Updated weights for policy 0, policy_version 15280 (0.0006) [2023-03-07 10:04:26,071][175731] Updated weights for policy 0, policy_version 15290 (0.0006) [2023-03-07 10:04:26,876][175731] Updated weights for policy 0, policy_version 15300 (0.0006) [2023-03-07 10:04:27,693][175731] Updated weights for policy 0, policy_version 15310 (0.0006) [2023-03-07 10:04:28,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12748.8, 300 sec: 12763.6). Total num frames: 15684608. Throughput: 0: 12738.1. Samples: 15659043. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:04:28,321][175405] Avg episode reward: [(0, '33.481')] [2023-03-07 10:04:28,486][175731] Updated weights for policy 0, policy_version 15320 (0.0008) [2023-03-07 10:04:29,293][175731] Updated weights for policy 0, policy_version 15330 (0.0006) [2023-03-07 10:04:30,106][175731] Updated weights for policy 0, policy_version 15340 (0.0006) [2023-03-07 10:04:30,917][175731] Updated weights for policy 0, policy_version 15350 (0.0006) [2023-03-07 10:04:31,711][175731] Updated weights for policy 0, policy_version 15360 (0.0007) [2023-03-07 10:04:32,503][175731] Updated weights for policy 0, policy_version 15370 (0.0006) [2023-03-07 10:04:33,316][175731] Updated weights for policy 0, policy_version 15380 (0.0005) [2023-03-07 10:04:33,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12748.8, 300 sec: 12763.5). Total num frames: 15749120. Throughput: 0: 12726.5. Samples: 15735427. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:04:33,322][175405] Avg episode reward: [(0, '35.582')] [2023-03-07 10:04:34,114][175731] Updated weights for policy 0, policy_version 15390 (0.0006) [2023-03-07 10:04:34,914][175731] Updated weights for policy 0, policy_version 15400 (0.0006) [2023-03-07 10:04:35,722][175731] Updated weights for policy 0, policy_version 15410 (0.0006) [2023-03-07 10:04:36,522][175731] Updated weights for policy 0, policy_version 15420 (0.0006) [2023-03-07 10:04:37,313][175731] Updated weights for policy 0, policy_version 15430 (0.0006) [2023-03-07 10:04:38,109][175731] Updated weights for policy 0, policy_version 15440 (0.0006) [2023-03-07 10:04:38,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12748.8, 300 sec: 12763.6). Total num frames: 15812608. Throughput: 0: 12732.4. Samples: 15812270. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:04:38,322][175405] Avg episode reward: [(0, '33.152')] [2023-03-07 10:04:38,901][175731] Updated weights for policy 0, policy_version 15450 (0.0007) [2023-03-07 10:04:39,706][175731] Updated weights for policy 0, policy_version 15460 (0.0006) [2023-03-07 10:04:40,494][175731] Updated weights for policy 0, policy_version 15470 (0.0006) [2023-03-07 10:04:41,304][175731] Updated weights for policy 0, policy_version 15480 (0.0006) [2023-03-07 10:04:42,100][175731] Updated weights for policy 0, policy_version 15490 (0.0006) [2023-03-07 10:04:42,901][175731] Updated weights for policy 0, policy_version 15500 (0.0007) [2023-03-07 10:04:43,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12748.8, 300 sec: 12767.0). Total num frames: 15877120. Throughput: 0: 12736.8. Samples: 15850713. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:04:43,322][175405] Avg episode reward: [(0, '36.060')] [2023-03-07 10:04:43,704][175731] Updated weights for policy 0, policy_version 15510 (0.0006) [2023-03-07 10:04:44,509][175731] Updated weights for policy 0, policy_version 15520 (0.0006) [2023-03-07 10:04:45,313][175731] Updated weights for policy 0, policy_version 15530 (0.0006) [2023-03-07 10:04:46,097][175731] Updated weights for policy 0, policy_version 15540 (0.0007) [2023-03-07 10:04:46,896][175731] Updated weights for policy 0, policy_version 15550 (0.0007) [2023-03-07 10:04:47,693][175731] Updated weights for policy 0, policy_version 15560 (0.0008) [2023-03-07 10:04:48,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12748.8, 300 sec: 12767.0). Total num frames: 15940608. Throughput: 0: 12746.3. Samples: 15927471. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:04:48,321][175405] Avg episode reward: [(0, '34.033')] [2023-03-07 10:04:48,493][175731] Updated weights for policy 0, policy_version 15570 (0.0006) [2023-03-07 10:04:49,315][175731] Updated weights for policy 0, policy_version 15580 (0.0006) [2023-03-07 10:04:50,107][175731] Updated weights for policy 0, policy_version 15590 (0.0007) [2023-03-07 10:04:50,908][175731] Updated weights for policy 0, policy_version 15600 (0.0006) [2023-03-07 10:04:51,720][175731] Updated weights for policy 0, policy_version 15610 (0.0006) [2023-03-07 10:04:52,544][175731] Updated weights for policy 0, policy_version 15620 (0.0007) [2023-03-07 10:04:53,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12748.8, 300 sec: 12767.0). Total num frames: 16004096. Throughput: 0: 12752.8. Samples: 16003957. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 10:04:53,322][175405] Avg episode reward: [(0, '33.478')] [2023-03-07 10:04:53,348][175731] Updated weights for policy 0, policy_version 15630 (0.0006) [2023-03-07 10:04:54,142][175731] Updated weights for policy 0, policy_version 15640 (0.0007) [2023-03-07 10:04:54,947][175731] Updated weights for policy 0, policy_version 15650 (0.0006) [2023-03-07 10:04:55,751][175731] Updated weights for policy 0, policy_version 15660 (0.0006) [2023-03-07 10:04:56,568][175731] Updated weights for policy 0, policy_version 15670 (0.0006) [2023-03-07 10:04:57,352][175731] Updated weights for policy 0, policy_version 15680 (0.0006) [2023-03-07 10:04:58,157][175731] Updated weights for policy 0, policy_version 15690 (0.0007) [2023-03-07 10:04:58,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12767.0). Total num frames: 16068608. Throughput: 0: 12753.8. Samples: 16042083. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 10:04:58,322][175405] Avg episode reward: [(0, '33.865')] [2023-03-07 10:04:58,951][175731] Updated weights for policy 0, policy_version 15700 (0.0006) [2023-03-07 10:04:59,745][175731] Updated weights for policy 0, policy_version 15710 (0.0008) [2023-03-07 10:05:00,546][175731] Updated weights for policy 0, policy_version 15720 (0.0005) [2023-03-07 10:05:01,337][175731] Updated weights for policy 0, policy_version 15730 (0.0006) [2023-03-07 10:05:02,147][175731] Updated weights for policy 0, policy_version 15740 (0.0007) [2023-03-07 10:05:02,956][175731] Updated weights for policy 0, policy_version 15750 (0.0007) [2023-03-07 10:05:03,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12748.8, 300 sec: 12763.6). Total num frames: 16132096. Throughput: 0: 12761.3. Samples: 16118974. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:05:03,322][175405] Avg episode reward: [(0, '33.219')] [2023-03-07 10:05:03,742][175731] Updated weights for policy 0, policy_version 15760 (0.0007) [2023-03-07 10:05:04,564][175731] Updated weights for policy 0, policy_version 15770 (0.0006) [2023-03-07 10:05:05,362][175731] Updated weights for policy 0, policy_version 15780 (0.0006) [2023-03-07 10:05:06,169][175731] Updated weights for policy 0, policy_version 15790 (0.0006) [2023-03-07 10:05:06,967][175731] Updated weights for policy 0, policy_version 15800 (0.0007) [2023-03-07 10:05:07,784][175731] Updated weights for policy 0, policy_version 15810 (0.0008) [2023-03-07 10:05:08,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12765.9, 300 sec: 12767.0). Total num frames: 16196608. Throughput: 0: 12763.3. Samples: 16195476. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:05:08,321][175405] Avg episode reward: [(0, '33.727')] [2023-03-07 10:05:08,575][175731] Updated weights for policy 0, policy_version 15820 (0.0007) [2023-03-07 10:05:09,373][175731] Updated weights for policy 0, policy_version 15830 (0.0007) [2023-03-07 10:05:10,183][175731] Updated weights for policy 0, policy_version 15840 (0.0006) [2023-03-07 10:05:10,989][175731] Updated weights for policy 0, policy_version 15850 (0.0007) [2023-03-07 10:05:11,787][175731] Updated weights for policy 0, policy_version 15860 (0.0007) [2023-03-07 10:05:12,592][175731] Updated weights for policy 0, policy_version 15870 (0.0006) [2023-03-07 10:05:13,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12748.8, 300 sec: 12763.6). Total num frames: 16259072. Throughput: 0: 12768.8. Samples: 16233639. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:05:13,322][175405] Avg episode reward: [(0, '39.060')] [2023-03-07 10:05:13,409][175731] Updated weights for policy 0, policy_version 15880 (0.0006) [2023-03-07 10:05:14,203][175731] Updated weights for policy 0, policy_version 15890 (0.0007) [2023-03-07 10:05:14,992][175731] Updated weights for policy 0, policy_version 15900 (0.0006) [2023-03-07 10:05:15,812][175731] Updated weights for policy 0, policy_version 15910 (0.0006) [2023-03-07 10:05:16,596][175731] Updated weights for policy 0, policy_version 15920 (0.0006) [2023-03-07 10:05:17,391][175731] Updated weights for policy 0, policy_version 15930 (0.0006) [2023-03-07 10:05:18,195][175731] Updated weights for policy 0, policy_version 15940 (0.0007) [2023-03-07 10:05:18,321][175405] Fps is (10 sec: 12697.4, 60 sec: 12765.9, 300 sec: 12767.0). Total num frames: 16323584. Throughput: 0: 12777.0. Samples: 16310393. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:05:18,322][175405] Avg episode reward: [(0, '33.744')] [2023-03-07 10:05:18,998][175731] Updated weights for policy 0, policy_version 15950 (0.0007) [2023-03-07 10:05:19,818][175731] Updated weights for policy 0, policy_version 15960 (0.0008) [2023-03-07 10:05:20,634][175731] Updated weights for policy 0, policy_version 15970 (0.0006) [2023-03-07 10:05:21,432][175731] Updated weights for policy 0, policy_version 15980 (0.0006) [2023-03-07 10:05:22,224][175731] Updated weights for policy 0, policy_version 15990 (0.0006) [2023-03-07 10:05:23,030][175731] Updated weights for policy 0, policy_version 16000 (0.0008) [2023-03-07 10:05:23,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12748.8, 300 sec: 12767.0). Total num frames: 16387072. Throughput: 0: 12765.2. Samples: 16386704. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:05:23,322][175405] Avg episode reward: [(0, '33.026')] [2023-03-07 10:05:23,839][175731] Updated weights for policy 0, policy_version 16010 (0.0006) [2023-03-07 10:05:24,642][175731] Updated weights for policy 0, policy_version 16020 (0.0006) [2023-03-07 10:05:25,449][175731] Updated weights for policy 0, policy_version 16030 (0.0007) [2023-03-07 10:05:26,251][175731] Updated weights for policy 0, policy_version 16040 (0.0006) [2023-03-07 10:05:27,058][175731] Updated weights for policy 0, policy_version 16050 (0.0007) [2023-03-07 10:05:27,869][175731] Updated weights for policy 0, policy_version 16060 (0.0006) [2023-03-07 10:05:28,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12765.9, 300 sec: 12763.6). Total num frames: 16450560. Throughput: 0: 12757.4. Samples: 16424796. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:05:28,322][175405] Avg episode reward: [(0, '36.396')] [2023-03-07 10:05:28,676][175731] Updated weights for policy 0, policy_version 16070 (0.0006) [2023-03-07 10:05:29,466][175731] Updated weights for policy 0, policy_version 16080 (0.0008) [2023-03-07 10:05:30,281][175731] Updated weights for policy 0, policy_version 16090 (0.0007) [2023-03-07 10:05:31,097][175731] Updated weights for policy 0, policy_version 16100 (0.0007) [2023-03-07 10:05:31,898][175731] Updated weights for policy 0, policy_version 16110 (0.0006) [2023-03-07 10:05:32,718][175731] Updated weights for policy 0, policy_version 16120 (0.0007) [2023-03-07 10:05:33,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12748.8, 300 sec: 12763.6). Total num frames: 16514048. Throughput: 0: 12743.8. Samples: 16500943. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:05:33,322][175405] Avg episode reward: [(0, '38.631')] [2023-03-07 10:05:33,504][175731] Updated weights for policy 0, policy_version 16130 (0.0007) [2023-03-07 10:05:34,313][175731] Updated weights for policy 0, policy_version 16140 (0.0006) [2023-03-07 10:05:35,101][175731] Updated weights for policy 0, policy_version 16150 (0.0007) [2023-03-07 10:05:35,905][175731] Updated weights for policy 0, policy_version 16160 (0.0007) [2023-03-07 10:05:36,701][175731] Updated weights for policy 0, policy_version 16170 (0.0006) [2023-03-07 10:05:37,502][175731] Updated weights for policy 0, policy_version 16180 (0.0006) [2023-03-07 10:05:38,304][175731] Updated weights for policy 0, policy_version 16190 (0.0007) [2023-03-07 10:05:38,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12765.9, 300 sec: 12767.0). Total num frames: 16578560. Throughput: 0: 12753.6. Samples: 16577871. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:05:38,322][175405] Avg episode reward: [(0, '40.118')] [2023-03-07 10:05:39,111][175731] Updated weights for policy 0, policy_version 16200 (0.0007) [2023-03-07 10:05:39,926][175731] Updated weights for policy 0, policy_version 16210 (0.0006) [2023-03-07 10:05:40,726][175731] Updated weights for policy 0, policy_version 16220 (0.0006) [2023-03-07 10:05:41,535][175731] Updated weights for policy 0, policy_version 16230 (0.0006) [2023-03-07 10:05:42,346][175731] Updated weights for policy 0, policy_version 16240 (0.0006) [2023-03-07 10:05:43,161][175731] Updated weights for policy 0, policy_version 16250 (0.0007) [2023-03-07 10:05:43,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12748.8, 300 sec: 12763.6). Total num frames: 16642048. Throughput: 0: 12750.1. Samples: 16615836. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:05:43,322][175405] Avg episode reward: [(0, '38.959')] [2023-03-07 10:05:43,958][175731] Updated weights for policy 0, policy_version 16260 (0.0006) [2023-03-07 10:05:44,751][175731] Updated weights for policy 0, policy_version 16270 (0.0007) [2023-03-07 10:05:45,561][175731] Updated weights for policy 0, policy_version 16280 (0.0006) [2023-03-07 10:05:46,370][175731] Updated weights for policy 0, policy_version 16290 (0.0006) [2023-03-07 10:05:47,183][175731] Updated weights for policy 0, policy_version 16300 (0.0006) [2023-03-07 10:05:47,990][175731] Updated weights for policy 0, policy_version 16310 (0.0007) [2023-03-07 10:05:48,147][175680] KL-divergence is very high: 688.3939 [2023-03-07 10:05:48,309][175680] KL-divergence is very high: 435.1739 [2023-03-07 10:05:48,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12748.8, 300 sec: 12763.6). Total num frames: 16705536. Throughput: 0: 12734.9. Samples: 16692044. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:05:48,322][175405] Avg episode reward: [(0, '35.715')] [2023-03-07 10:05:48,326][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000016314_16705536.pth... [2023-03-07 10:05:48,356][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000013323_13642752.pth [2023-03-07 10:05:48,704][175680] KL-divergence is very high: 237.9562 [2023-03-07 10:05:48,798][175731] Updated weights for policy 0, policy_version 16320 (0.0006) [2023-03-07 10:05:48,947][175680] KL-divergence is very high: 1595926528.0000 [2023-03-07 10:05:49,034][175680] KL-divergence is very high: 31897544.0000 [2023-03-07 10:05:49,605][175731] Updated weights for policy 0, policy_version 16330 (0.0006) [2023-03-07 10:05:50,413][175731] Updated weights for policy 0, policy_version 16340 (0.0007) [2023-03-07 10:05:51,237][175731] Updated weights for policy 0, policy_version 16350 (0.0006) [2023-03-07 10:05:52,037][175731] Updated weights for policy 0, policy_version 16360 (0.0007) [2023-03-07 10:05:52,833][175731] Updated weights for policy 0, policy_version 16370 (0.0006) [2023-03-07 10:05:53,321][175405] Fps is (10 sec: 12595.2, 60 sec: 12731.8, 300 sec: 12760.1). Total num frames: 16768000. Throughput: 0: 12722.2. Samples: 16767974. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:05:53,321][175405] Avg episode reward: [(0, '35.896')] [2023-03-07 10:05:53,656][175731] Updated weights for policy 0, policy_version 16380 (0.0007) [2023-03-07 10:05:54,465][175731] Updated weights for policy 0, policy_version 16390 (0.0005) [2023-03-07 10:05:55,257][175731] Updated weights for policy 0, policy_version 16400 (0.0006) [2023-03-07 10:05:56,064][175731] Updated weights for policy 0, policy_version 16410 (0.0007) [2023-03-07 10:05:56,870][175731] Updated weights for policy 0, policy_version 16420 (0.0005) [2023-03-07 10:05:57,682][175731] Updated weights for policy 0, policy_version 16430 (0.0007) [2023-03-07 10:05:58,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12760.1). Total num frames: 16832512. Throughput: 0: 12722.7. Samples: 16806159. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:05:58,322][175405] Avg episode reward: [(0, '37.883')] [2023-03-07 10:05:58,501][175731] Updated weights for policy 0, policy_version 16440 (0.0006) [2023-03-07 10:05:59,309][175731] Updated weights for policy 0, policy_version 16450 (0.0007) [2023-03-07 10:06:00,089][175731] Updated weights for policy 0, policy_version 16460 (0.0006) [2023-03-07 10:06:00,894][175731] Updated weights for policy 0, policy_version 16470 (0.0007) [2023-03-07 10:06:01,719][175731] Updated weights for policy 0, policy_version 16480 (0.0007) [2023-03-07 10:06:02,503][175731] Updated weights for policy 0, policy_version 16490 (0.0007) [2023-03-07 10:06:03,314][175731] Updated weights for policy 0, policy_version 16500 (0.0007) [2023-03-07 10:06:03,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12731.7, 300 sec: 12760.1). Total num frames: 16896000. Throughput: 0: 12708.3. Samples: 16882265. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:06:03,332][175405] Avg episode reward: [(0, '38.237')] [2023-03-07 10:06:04,126][175731] Updated weights for policy 0, policy_version 16510 (0.0007) [2023-03-07 10:06:04,927][175731] Updated weights for policy 0, policy_version 16520 (0.0006) [2023-03-07 10:06:05,758][175731] Updated weights for policy 0, policy_version 16530 (0.0007) [2023-03-07 10:06:06,559][175731] Updated weights for policy 0, policy_version 16540 (0.0006) [2023-03-07 10:06:07,350][175731] Updated weights for policy 0, policy_version 16550 (0.0006) [2023-03-07 10:06:08,163][175731] Updated weights for policy 0, policy_version 16560 (0.0006) [2023-03-07 10:06:08,321][175405] Fps is (10 sec: 12595.3, 60 sec: 12697.6, 300 sec: 12756.6). Total num frames: 16958464. Throughput: 0: 12705.3. Samples: 16958441. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:06:08,332][175405] Avg episode reward: [(0, '42.980')] [2023-03-07 10:06:08,983][175731] Updated weights for policy 0, policy_version 16570 (0.0007) [2023-03-07 10:06:09,788][175731] Updated weights for policy 0, policy_version 16580 (0.0007) [2023-03-07 10:06:10,601][175731] Updated weights for policy 0, policy_version 16590 (0.0007) [2023-03-07 10:06:11,414][175731] Updated weights for policy 0, policy_version 16600 (0.0007) [2023-03-07 10:06:12,205][175731] Updated weights for policy 0, policy_version 16610 (0.0007) [2023-03-07 10:06:13,019][175731] Updated weights for policy 0, policy_version 16620 (0.0007) [2023-03-07 10:06:13,321][175405] Fps is (10 sec: 12595.3, 60 sec: 12714.7, 300 sec: 12756.6). Total num frames: 17021952. Throughput: 0: 12701.3. Samples: 16996353. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:06:13,332][175405] Avg episode reward: [(0, '38.928')] [2023-03-07 10:06:13,828][175731] Updated weights for policy 0, policy_version 16630 (0.0006) [2023-03-07 10:06:14,627][175731] Updated weights for policy 0, policy_version 16640 (0.0006) [2023-03-07 10:06:15,422][175731] Updated weights for policy 0, policy_version 16650 (0.0006) [2023-03-07 10:06:16,237][175731] Updated weights for policy 0, policy_version 16660 (0.0007) [2023-03-07 10:06:17,039][175731] Updated weights for policy 0, policy_version 16670 (0.0007) [2023-03-07 10:06:17,847][175731] Updated weights for policy 0, policy_version 16680 (0.0006) [2023-03-07 10:06:18,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12753.1). Total num frames: 17085440. Throughput: 0: 12705.0. Samples: 17072669. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:06:18,332][175405] Avg episode reward: [(0, '42.332')] [2023-03-07 10:06:18,662][175731] Updated weights for policy 0, policy_version 16690 (0.0006) [2023-03-07 10:06:19,471][175731] Updated weights for policy 0, policy_version 16700 (0.0007) [2023-03-07 10:06:20,263][175731] Updated weights for policy 0, policy_version 16710 (0.0007) [2023-03-07 10:06:21,091][175731] Updated weights for policy 0, policy_version 16720 (0.0007) [2023-03-07 10:06:21,895][175731] Updated weights for policy 0, policy_version 16730 (0.0006) [2023-03-07 10:06:22,713][175731] Updated weights for policy 0, policy_version 16740 (0.0006) [2023-03-07 10:06:23,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12753.1). Total num frames: 17148928. Throughput: 0: 12680.1. Samples: 17148474. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:06:23,322][175405] Avg episode reward: [(0, '38.224')] [2023-03-07 10:06:23,529][175731] Updated weights for policy 0, policy_version 16750 (0.0006) [2023-03-07 10:06:24,339][175731] Updated weights for policy 0, policy_version 16760 (0.0006) [2023-03-07 10:06:25,146][175731] Updated weights for policy 0, policy_version 16770 (0.0007) [2023-03-07 10:06:25,958][175731] Updated weights for policy 0, policy_version 16780 (0.0007) [2023-03-07 10:06:26,764][175731] Updated weights for policy 0, policy_version 16790 (0.0008) [2023-03-07 10:06:27,583][175731] Updated weights for policy 0, policy_version 16800 (0.0006) [2023-03-07 10:06:28,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12753.1). Total num frames: 17212416. Throughput: 0: 12675.2. Samples: 17186223. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:06:28,321][175405] Avg episode reward: [(0, '41.346')] [2023-03-07 10:06:28,384][175731] Updated weights for policy 0, policy_version 16810 (0.0006) [2023-03-07 10:06:29,181][175731] Updated weights for policy 0, policy_version 16820 (0.0006) [2023-03-07 10:06:30,010][175731] Updated weights for policy 0, policy_version 16830 (0.0006) [2023-03-07 10:06:30,781][175731] Updated weights for policy 0, policy_version 16840 (0.0006) [2023-03-07 10:06:31,605][175731] Updated weights for policy 0, policy_version 16850 (0.0006) [2023-03-07 10:06:32,406][175731] Updated weights for policy 0, policy_version 16860 (0.0007) [2023-03-07 10:06:33,209][175731] Updated weights for policy 0, policy_version 16870 (0.0006) [2023-03-07 10:06:33,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12749.7). Total num frames: 17275904. Throughput: 0: 12680.3. Samples: 17262657. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:06:33,322][175405] Avg episode reward: [(0, '42.345')] [2023-03-07 10:06:34,017][175731] Updated weights for policy 0, policy_version 16880 (0.0006) [2023-03-07 10:06:34,827][175731] Updated weights for policy 0, policy_version 16890 (0.0006) [2023-03-07 10:06:35,626][175731] Updated weights for policy 0, policy_version 16900 (0.0006) [2023-03-07 10:06:36,422][175731] Updated weights for policy 0, policy_version 16910 (0.0007) [2023-03-07 10:06:37,230][175731] Updated weights for policy 0, policy_version 16920 (0.0006) [2023-03-07 10:06:38,053][175731] Updated weights for policy 0, policy_version 16930 (0.0007) [2023-03-07 10:06:38,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12680.5, 300 sec: 12749.7). Total num frames: 17339392. Throughput: 0: 12685.6. Samples: 17338830. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:06:38,322][175405] Avg episode reward: [(0, '46.152')] [2023-03-07 10:06:38,857][175731] Updated weights for policy 0, policy_version 16940 (0.0006) [2023-03-07 10:06:39,674][175731] Updated weights for policy 0, policy_version 16950 (0.0006) [2023-03-07 10:06:40,473][175731] Updated weights for policy 0, policy_version 16960 (0.0006) [2023-03-07 10:06:41,281][175731] Updated weights for policy 0, policy_version 16970 (0.0006) [2023-03-07 10:06:42,082][175731] Updated weights for policy 0, policy_version 16980 (0.0007) [2023-03-07 10:06:42,899][175731] Updated weights for policy 0, policy_version 16990 (0.0006) [2023-03-07 10:06:43,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12746.2). Total num frames: 17402880. Throughput: 0: 12684.6. Samples: 17376966. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:06:43,322][175405] Avg episode reward: [(0, '41.123')] [2023-03-07 10:06:43,708][175731] Updated weights for policy 0, policy_version 17000 (0.0006) [2023-03-07 10:06:44,514][175731] Updated weights for policy 0, policy_version 17010 (0.0006) [2023-03-07 10:06:45,314][175731] Updated weights for policy 0, policy_version 17020 (0.0006) [2023-03-07 10:06:46,119][175731] Updated weights for policy 0, policy_version 17030 (0.0006) [2023-03-07 10:06:46,950][175731] Updated weights for policy 0, policy_version 17040 (0.0007) [2023-03-07 10:06:47,746][175731] Updated weights for policy 0, policy_version 17050 (0.0006) [2023-03-07 10:06:48,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12680.5, 300 sec: 12746.2). Total num frames: 17466368. Throughput: 0: 12677.7. Samples: 17452760. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:06:48,322][175405] Avg episode reward: [(0, '45.250')] [2023-03-07 10:06:48,549][175731] Updated weights for policy 0, policy_version 17060 (0.0006) [2023-03-07 10:06:49,354][175731] Updated weights for policy 0, policy_version 17070 (0.0006) [2023-03-07 10:06:50,158][175731] Updated weights for policy 0, policy_version 17080 (0.0006) [2023-03-07 10:06:50,966][175731] Updated weights for policy 0, policy_version 17090 (0.0007) [2023-03-07 10:06:51,761][175731] Updated weights for policy 0, policy_version 17100 (0.0007) [2023-03-07 10:06:52,554][175731] Updated weights for policy 0, policy_version 17110 (0.0006) [2023-03-07 10:06:53,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12746.2). Total num frames: 17529856. Throughput: 0: 12688.5. Samples: 17529423. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:06:53,322][175405] Avg episode reward: [(0, '43.635')] [2023-03-07 10:06:53,365][175731] Updated weights for policy 0, policy_version 17120 (0.0007) [2023-03-07 10:06:54,162][175731] Updated weights for policy 0, policy_version 17130 (0.0006) [2023-03-07 10:06:54,958][175731] Updated weights for policy 0, policy_version 17140 (0.0008) [2023-03-07 10:06:55,767][175731] Updated weights for policy 0, policy_version 17150 (0.0006) [2023-03-07 10:06:56,557][175731] Updated weights for policy 0, policy_version 17160 (0.0006) [2023-03-07 10:06:57,354][175731] Updated weights for policy 0, policy_version 17170 (0.0007) [2023-03-07 10:06:58,163][175731] Updated weights for policy 0, policy_version 17180 (0.0006) [2023-03-07 10:06:58,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12742.7). Total num frames: 17593344. Throughput: 0: 12695.9. Samples: 17567671. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:06:58,322][175405] Avg episode reward: [(0, '46.123')] [2023-03-07 10:06:58,961][175731] Updated weights for policy 0, policy_version 17190 (0.0007) [2023-03-07 10:06:59,762][175731] Updated weights for policy 0, policy_version 17200 (0.0006) [2023-03-07 10:07:00,584][175731] Updated weights for policy 0, policy_version 17210 (0.0007) [2023-03-07 10:07:01,401][175731] Updated weights for policy 0, policy_version 17220 (0.0006) [2023-03-07 10:07:02,220][175731] Updated weights for policy 0, policy_version 17230 (0.0007) [2023-03-07 10:07:03,025][175731] Updated weights for policy 0, policy_version 17240 (0.0006) [2023-03-07 10:07:03,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12739.3). Total num frames: 17656832. Throughput: 0: 12690.6. Samples: 17643746. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:07:03,322][175405] Avg episode reward: [(0, '44.687')] [2023-03-07 10:07:03,831][175731] Updated weights for policy 0, policy_version 17250 (0.0006) [2023-03-07 10:07:04,642][175731] Updated weights for policy 0, policy_version 17260 (0.0007) [2023-03-07 10:07:05,438][175731] Updated weights for policy 0, policy_version 17270 (0.0006) [2023-03-07 10:07:06,246][175731] Updated weights for policy 0, policy_version 17280 (0.0007) [2023-03-07 10:07:07,047][175731] Updated weights for policy 0, policy_version 17290 (0.0006) [2023-03-07 10:07:07,856][175731] Updated weights for policy 0, policy_version 17300 (0.0006) [2023-03-07 10:07:08,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12739.3). Total num frames: 17720320. Throughput: 0: 12704.3. Samples: 17720168. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:07:08,321][175405] Avg episode reward: [(0, '54.353')] [2023-03-07 10:07:08,654][175731] Updated weights for policy 0, policy_version 17310 (0.0006) [2023-03-07 10:07:09,457][175731] Updated weights for policy 0, policy_version 17320 (0.0006) [2023-03-07 10:07:10,263][175731] Updated weights for policy 0, policy_version 17330 (0.0007) [2023-03-07 10:07:11,065][175731] Updated weights for policy 0, policy_version 17340 (0.0006) [2023-03-07 10:07:11,883][175731] Updated weights for policy 0, policy_version 17350 (0.0007) [2023-03-07 10:07:12,685][175731] Updated weights for policy 0, policy_version 17360 (0.0006) [2023-03-07 10:07:13,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12739.3). Total num frames: 17783808. Throughput: 0: 12714.8. Samples: 17758389. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:07:13,322][175405] Avg episode reward: [(0, '43.897')] [2023-03-07 10:07:13,490][175731] Updated weights for policy 0, policy_version 17370 (0.0006) [2023-03-07 10:07:14,315][175731] Updated weights for policy 0, policy_version 17380 (0.0006) [2023-03-07 10:07:15,107][175731] Updated weights for policy 0, policy_version 17390 (0.0006) [2023-03-07 10:07:15,909][175731] Updated weights for policy 0, policy_version 17400 (0.0007) [2023-03-07 10:07:16,710][175731] Updated weights for policy 0, policy_version 17410 (0.0006) [2023-03-07 10:07:17,549][175731] Updated weights for policy 0, policy_version 17420 (0.0006) [2023-03-07 10:07:18,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12735.8). Total num frames: 17847296. Throughput: 0: 12704.4. Samples: 17834355. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:07:18,322][175405] Avg episode reward: [(0, '41.380')] [2023-03-07 10:07:18,345][175731] Updated weights for policy 0, policy_version 17430 (0.0006) [2023-03-07 10:07:19,172][175731] Updated weights for policy 0, policy_version 17440 (0.0006) [2023-03-07 10:07:19,974][175731] Updated weights for policy 0, policy_version 17450 (0.0007) [2023-03-07 10:07:20,777][175731] Updated weights for policy 0, policy_version 17460 (0.0006) [2023-03-07 10:07:21,586][175731] Updated weights for policy 0, policy_version 17470 (0.0007) [2023-03-07 10:07:22,393][175731] Updated weights for policy 0, policy_version 17480 (0.0006) [2023-03-07 10:07:23,198][175731] Updated weights for policy 0, policy_version 17490 (0.0006) [2023-03-07 10:07:23,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12735.8). Total num frames: 17910784. Throughput: 0: 12705.3. Samples: 17910565. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:07:23,322][175405] Avg episode reward: [(0, '37.871')] [2023-03-07 10:07:23,984][175731] Updated weights for policy 0, policy_version 17500 (0.0006) [2023-03-07 10:07:24,782][175731] Updated weights for policy 0, policy_version 17510 (0.0006) [2023-03-07 10:07:25,594][175731] Updated weights for policy 0, policy_version 17520 (0.0006) [2023-03-07 10:07:26,388][175731] Updated weights for policy 0, policy_version 17530 (0.0007) [2023-03-07 10:07:27,193][175731] Updated weights for policy 0, policy_version 17540 (0.0006) [2023-03-07 10:07:28,005][175731] Updated weights for policy 0, policy_version 17550 (0.0006) [2023-03-07 10:07:28,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12714.7, 300 sec: 12739.3). Total num frames: 17975296. Throughput: 0: 12707.2. Samples: 17948791. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:07:28,322][175405] Avg episode reward: [(0, '43.701')] [2023-03-07 10:07:28,798][175731] Updated weights for policy 0, policy_version 17560 (0.0006) [2023-03-07 10:07:29,608][175731] Updated weights for policy 0, policy_version 17570 (0.0007) [2023-03-07 10:07:30,408][175731] Updated weights for policy 0, policy_version 17580 (0.0006) [2023-03-07 10:07:31,222][175731] Updated weights for policy 0, policy_version 17590 (0.0007) [2023-03-07 10:07:32,029][175731] Updated weights for policy 0, policy_version 17600 (0.0005) [2023-03-07 10:07:32,814][175731] Updated weights for policy 0, policy_version 17610 (0.0007) [2023-03-07 10:07:33,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12714.7, 300 sec: 12735.8). Total num frames: 18038784. Throughput: 0: 12721.9. Samples: 18025245. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:07:33,322][175405] Avg episode reward: [(0, '43.065')] [2023-03-07 10:07:33,624][175731] Updated weights for policy 0, policy_version 17620 (0.0007) [2023-03-07 10:07:34,436][175731] Updated weights for policy 0, policy_version 17630 (0.0006) [2023-03-07 10:07:35,233][175731] Updated weights for policy 0, policy_version 17640 (0.0007) [2023-03-07 10:07:36,053][175731] Updated weights for policy 0, policy_version 17650 (0.0008) [2023-03-07 10:07:36,855][175731] Updated weights for policy 0, policy_version 17660 (0.0006) [2023-03-07 10:07:37,660][175731] Updated weights for policy 0, policy_version 17670 (0.0006) [2023-03-07 10:07:38,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12735.8). Total num frames: 18102272. Throughput: 0: 12713.9. Samples: 18101550. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:07:38,321][175405] Avg episode reward: [(0, '69.068')] [2023-03-07 10:07:38,446][175731] Updated weights for policy 0, policy_version 17680 (0.0006) [2023-03-07 10:07:39,253][175731] Updated weights for policy 0, policy_version 17690 (0.0007) [2023-03-07 10:07:40,073][175731] Updated weights for policy 0, policy_version 17700 (0.0006) [2023-03-07 10:07:40,875][175731] Updated weights for policy 0, policy_version 17710 (0.0006) [2023-03-07 10:07:41,676][175731] Updated weights for policy 0, policy_version 17720 (0.0006) [2023-03-07 10:07:42,491][175731] Updated weights for policy 0, policy_version 17730 (0.0006) [2023-03-07 10:07:43,296][175731] Updated weights for policy 0, policy_version 17740 (0.0007) [2023-03-07 10:07:43,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12735.8). Total num frames: 18165760. Throughput: 0: 12710.6. Samples: 18139648. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:07:43,322][175405] Avg episode reward: [(0, '56.430')] [2023-03-07 10:07:44,110][175731] Updated weights for policy 0, policy_version 17750 (0.0006) [2023-03-07 10:07:44,922][175731] Updated weights for policy 0, policy_version 17760 (0.0006) [2023-03-07 10:07:45,738][175731] Updated weights for policy 0, policy_version 17770 (0.0006) [2023-03-07 10:07:46,553][175731] Updated weights for policy 0, policy_version 17780 (0.0007) [2023-03-07 10:07:47,365][175731] Updated weights for policy 0, policy_version 17790 (0.0007) [2023-03-07 10:07:48,149][175731] Updated weights for policy 0, policy_version 17800 (0.0006) [2023-03-07 10:07:48,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12732.3). Total num frames: 18229248. Throughput: 0: 12709.1. Samples: 18215654. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:07:48,322][175405] Avg episode reward: [(0, '54.164')] [2023-03-07 10:07:48,325][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000017802_18229248.pth... [2023-03-07 10:07:48,355][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000014820_15175680.pth [2023-03-07 10:07:48,960][175731] Updated weights for policy 0, policy_version 17810 (0.0006) [2023-03-07 10:07:49,758][175731] Updated weights for policy 0, policy_version 17820 (0.0006) [2023-03-07 10:07:50,585][175731] Updated weights for policy 0, policy_version 17830 (0.0006) [2023-03-07 10:07:51,393][175731] Updated weights for policy 0, policy_version 17840 (0.0007) [2023-03-07 10:07:52,201][175731] Updated weights for policy 0, policy_version 17850 (0.0007) [2023-03-07 10:07:53,002][175731] Updated weights for policy 0, policy_version 17860 (0.0006) [2023-03-07 10:07:53,321][175405] Fps is (10 sec: 12595.2, 60 sec: 12697.6, 300 sec: 12728.8). Total num frames: 18291712. Throughput: 0: 12700.3. 
Samples: 18291681. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:07:53,322][175405] Avg episode reward: [(0, '64.500')] [2023-03-07 10:07:53,830][175731] Updated weights for policy 0, policy_version 17870 (0.0006) [2023-03-07 10:07:54,637][175731] Updated weights for policy 0, policy_version 17880 (0.0006) [2023-03-07 10:07:55,469][175731] Updated weights for policy 0, policy_version 17890 (0.0007) [2023-03-07 10:07:56,255][175731] Updated weights for policy 0, policy_version 17900 (0.0007) [2023-03-07 10:07:57,059][175731] Updated weights for policy 0, policy_version 17910 (0.0007) [2023-03-07 10:07:57,854][175731] Updated weights for policy 0, policy_version 17920 (0.0006) [2023-03-07 10:07:58,321][175405] Fps is (10 sec: 12595.2, 60 sec: 12697.6, 300 sec: 12728.8). Total num frames: 18355200. Throughput: 0: 12692.3. Samples: 18329542. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:07:58,321][175405] Avg episode reward: [(0, '52.524')] [2023-03-07 10:07:58,677][175731] Updated weights for policy 0, policy_version 17930 (0.0006) [2023-03-07 10:07:59,494][175731] Updated weights for policy 0, policy_version 17940 (0.0006) [2023-03-07 10:08:00,307][175731] Updated weights for policy 0, policy_version 17950 (0.0007) [2023-03-07 10:08:01,121][175731] Updated weights for policy 0, policy_version 17960 (0.0007) [2023-03-07 10:08:01,928][175731] Updated weights for policy 0, policy_version 17970 (0.0006) [2023-03-07 10:08:02,721][175731] Updated weights for policy 0, policy_version 17980 (0.0006) [2023-03-07 10:08:03,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12728.8). Total num frames: 18418688. Throughput: 0: 12689.1. Samples: 18405364. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:08:03,321][175405] Avg episode reward: [(0, '53.904')] [2023-03-07 10:08:03,548][175731] Updated weights for policy 0, policy_version 17990 (0.0007) [2023-03-07 10:08:04,354][175731] Updated weights for policy 0, policy_version 18000 (0.0006) [2023-03-07 10:08:05,176][175731] Updated weights for policy 0, policy_version 18010 (0.0007) [2023-03-07 10:08:05,973][175731] Updated weights for policy 0, policy_version 18020 (0.0007) [2023-03-07 10:08:06,787][175731] Updated weights for policy 0, policy_version 18030 (0.0006) [2023-03-07 10:08:07,598][175731] Updated weights for policy 0, policy_version 18040 (0.0007) [2023-03-07 10:08:08,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12725.4). Total num frames: 18482176. Throughput: 0: 12686.1. Samples: 18481437. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:08:08,321][175405] Avg episode reward: [(0, '68.386')] [2023-03-07 10:08:08,369][175731] Updated weights for policy 0, policy_version 18050 (0.0006) [2023-03-07 10:08:09,178][175731] Updated weights for policy 0, policy_version 18060 (0.0006) [2023-03-07 10:08:09,989][175731] Updated weights for policy 0, policy_version 18070 (0.0006) [2023-03-07 10:08:10,781][175731] Updated weights for policy 0, policy_version 18080 (0.0006) [2023-03-07 10:08:11,596][175731] Updated weights for policy 0, policy_version 18090 (0.0006) [2023-03-07 10:08:12,385][175731] Updated weights for policy 0, policy_version 18100 (0.0006) [2023-03-07 10:08:13,193][175731] Updated weights for policy 0, policy_version 18110 (0.0006) [2023-03-07 10:08:13,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12725.4). Total num frames: 18545664. Throughput: 0: 12684.9. Samples: 18519613. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:08:13,322][175405] Avg episode reward: [(0, '59.080')] [2023-03-07 10:08:14,004][175731] Updated weights for policy 0, policy_version 18120 (0.0007) [2023-03-07 10:08:14,819][175731] Updated weights for policy 0, policy_version 18130 (0.0006) [2023-03-07 10:08:15,654][175731] Updated weights for policy 0, policy_version 18140 (0.0006) [2023-03-07 10:08:16,464][175731] Updated weights for policy 0, policy_version 18150 (0.0006) [2023-03-07 10:08:17,257][175731] Updated weights for policy 0, policy_version 18160 (0.0007) [2023-03-07 10:08:18,074][175731] Updated weights for policy 0, policy_version 18170 (0.0007) [2023-03-07 10:08:18,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12721.9). Total num frames: 18609152. Throughput: 0: 12678.6. Samples: 18595781. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:08:18,321][175405] Avg episode reward: [(0, '58.907')] [2023-03-07 10:08:18,870][175731] Updated weights for policy 0, policy_version 18180 (0.0006) [2023-03-07 10:08:19,664][175731] Updated weights for policy 0, policy_version 18190 (0.0006) [2023-03-07 10:08:20,474][175731] Updated weights for policy 0, policy_version 18200 (0.0006) [2023-03-07 10:08:21,262][175731] Updated weights for policy 0, policy_version 18210 (0.0007) [2023-03-07 10:08:22,077][175731] Updated weights for policy 0, policy_version 18220 (0.0007) [2023-03-07 10:08:22,879][175731] Updated weights for policy 0, policy_version 18230 (0.0006) [2023-03-07 10:08:23,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12721.9). Total num frames: 18672640. Throughput: 0: 12680.9. Samples: 18672194. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:08:23,322][175405] Avg episode reward: [(0, '68.998')] [2023-03-07 10:08:23,685][175731] Updated weights for policy 0, policy_version 18240 (0.0006) [2023-03-07 10:08:24,492][175731] Updated weights for policy 0, policy_version 18250 (0.0007) [2023-03-07 10:08:25,294][175731] Updated weights for policy 0, policy_version 18260 (0.0007) [2023-03-07 10:08:26,101][175731] Updated weights for policy 0, policy_version 18270 (0.0006) [2023-03-07 10:08:26,910][175731] Updated weights for policy 0, policy_version 18280 (0.0007) [2023-03-07 10:08:27,712][175731] Updated weights for policy 0, policy_version 18290 (0.0006) [2023-03-07 10:08:28,321][175405] Fps is (10 sec: 12697.4, 60 sec: 12680.5, 300 sec: 12718.4). Total num frames: 18736128. Throughput: 0: 12682.3. Samples: 18710353. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:08:28,322][175405] Avg episode reward: [(0, '95.694')] [2023-03-07 10:08:28,523][175731] Updated weights for policy 0, policy_version 18300 (0.0006) [2023-03-07 10:08:29,335][175731] Updated weights for policy 0, policy_version 18310 (0.0007) [2023-03-07 10:08:30,147][175731] Updated weights for policy 0, policy_version 18320 (0.0006) [2023-03-07 10:08:30,945][175731] Updated weights for policy 0, policy_version 18330 (0.0006) [2023-03-07 10:08:31,755][175731] Updated weights for policy 0, policy_version 18340 (0.0006) [2023-03-07 10:08:32,562][175731] Updated weights for policy 0, policy_version 18350 (0.0007) [2023-03-07 10:08:33,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12680.5, 300 sec: 12718.4). Total num frames: 18799616. Throughput: 0: 12679.5. Samples: 18786229. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:08:33,322][175405] Avg episode reward: [(0, '59.375')] [2023-03-07 10:08:33,375][175731] Updated weights for policy 0, policy_version 18360 (0.0006) [2023-03-07 10:08:34,190][175731] Updated weights for policy 0, policy_version 18370 (0.0006) [2023-03-07 10:08:34,995][175731] Updated weights for policy 0, policy_version 18380 (0.0006) [2023-03-07 10:08:35,819][175731] Updated weights for policy 0, policy_version 18390 (0.0006) [2023-03-07 10:08:36,604][175731] Updated weights for policy 0, policy_version 18400 (0.0006) [2023-03-07 10:08:37,413][175731] Updated weights for policy 0, policy_version 18410 (0.0006) [2023-03-07 10:08:38,213][175731] Updated weights for policy 0, policy_version 18420 (0.0006) [2023-03-07 10:08:38,321][175405] Fps is (10 sec: 12697.8, 60 sec: 12680.5, 300 sec: 12715.0). Total num frames: 18863104. Throughput: 0: 12679.3. Samples: 18862249. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:08:38,321][175405] Avg episode reward: [(0, '75.895')] [2023-03-07 10:08:39,028][175731] Updated weights for policy 0, policy_version 18430 (0.0007) [2023-03-07 10:08:39,821][175731] Updated weights for policy 0, policy_version 18440 (0.0007) [2023-03-07 10:08:40,645][175731] Updated weights for policy 0, policy_version 18450 (0.0006) [2023-03-07 10:08:41,447][175731] Updated weights for policy 0, policy_version 18460 (0.0006) [2023-03-07 10:08:42,254][175731] Updated weights for policy 0, policy_version 18470 (0.0006) [2023-03-07 10:08:43,061][175731] Updated weights for policy 0, policy_version 18480 (0.0006) [2023-03-07 10:08:43,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12680.5, 300 sec: 12715.0). Total num frames: 18926592. Throughput: 0: 12687.0. Samples: 18900458. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:08:43,322][175405] Avg episode reward: [(0, '65.869')] [2023-03-07 10:08:43,872][175731] Updated weights for policy 0, policy_version 18490 (0.0007) [2023-03-07 10:08:44,670][175731] Updated weights for policy 0, policy_version 18500 (0.0006) [2023-03-07 10:08:45,493][175731] Updated weights for policy 0, policy_version 18510 (0.0006) [2023-03-07 10:08:46,299][175731] Updated weights for policy 0, policy_version 18520 (0.0006) [2023-03-07 10:08:47,115][175731] Updated weights for policy 0, policy_version 18530 (0.0008) [2023-03-07 10:08:47,915][175731] Updated weights for policy 0, policy_version 18540 (0.0007) [2023-03-07 10:08:48,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12680.5, 300 sec: 12715.0). Total num frames: 18990080. Throughput: 0: 12693.7. Samples: 18976581. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:08:48,322][175405] Avg episode reward: [(0, '58.534')] [2023-03-07 10:08:48,722][175731] Updated weights for policy 0, policy_version 18550 (0.0006) [2023-03-07 10:08:49,538][175731] Updated weights for policy 0, policy_version 18560 (0.0006) [2023-03-07 10:08:50,341][175731] Updated weights for policy 0, policy_version 18570 (0.0006) [2023-03-07 10:08:51,125][175731] Updated weights for policy 0, policy_version 18580 (0.0006) [2023-03-07 10:08:51,934][175731] Updated weights for policy 0, policy_version 18590 (0.0007) [2023-03-07 10:08:52,738][175731] Updated weights for policy 0, policy_version 18600 (0.0006) [2023-03-07 10:08:53,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12711.5). Total num frames: 19053568. Throughput: 0: 12694.9. Samples: 19052707. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:08:53,322][175405] Avg episode reward: [(0, '57.585')] [2023-03-07 10:08:53,551][175731] Updated weights for policy 0, policy_version 18610 (0.0006) [2023-03-07 10:08:54,346][175731] Updated weights for policy 0, policy_version 18620 (0.0005) [2023-03-07 10:08:55,162][175731] Updated weights for policy 0, policy_version 18630 (0.0006) [2023-03-07 10:08:55,964][175731] Updated weights for policy 0, policy_version 18640 (0.0007) [2023-03-07 10:08:56,758][175731] Updated weights for policy 0, policy_version 18650 (0.0006) [2023-03-07 10:08:57,571][175731] Updated weights for policy 0, policy_version 18660 (0.0007) [2023-03-07 10:08:58,321][175405] Fps is (10 sec: 12595.3, 60 sec: 12680.5, 300 sec: 12708.0). Total num frames: 19116032. Throughput: 0: 12692.8. Samples: 19090790. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:08:58,322][175405] Avg episode reward: [(0, '73.745')] [2023-03-07 10:08:58,389][175731] Updated weights for policy 0, policy_version 18670 (0.0007) [2023-03-07 10:08:59,197][175731] Updated weights for policy 0, policy_version 18680 (0.0006) [2023-03-07 10:08:59,998][175731] Updated weights for policy 0, policy_version 18690 (0.0007) [2023-03-07 10:09:00,818][175731] Updated weights for policy 0, policy_version 18700 (0.0006) [2023-03-07 10:09:01,631][175731] Updated weights for policy 0, policy_version 18710 (0.0006) [2023-03-07 10:09:02,456][175731] Updated weights for policy 0, policy_version 18720 (0.0007) [2023-03-07 10:09:03,255][175731] Updated weights for policy 0, policy_version 18730 (0.0006) [2023-03-07 10:09:03,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12711.5). Total num frames: 19180544. Throughput: 0: 12686.9. Samples: 19166694. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:09:03,322][175405] Avg episode reward: [(0, '68.876')] [2023-03-07 10:09:04,048][175731] Updated weights for policy 0, policy_version 18740 (0.0006) [2023-03-07 10:09:04,850][175731] Updated weights for policy 0, policy_version 18750 (0.0006) [2023-03-07 10:09:05,658][175731] Updated weights for policy 0, policy_version 18760 (0.0006) [2023-03-07 10:09:06,473][175731] Updated weights for policy 0, policy_version 18770 (0.0006) [2023-03-07 10:09:07,270][175731] Updated weights for policy 0, policy_version 18780 (0.0006) [2023-03-07 10:09:08,063][175731] Updated weights for policy 0, policy_version 18790 (0.0006) [2023-03-07 10:09:08,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12708.0). Total num frames: 19243008. Throughput: 0: 12688.2. Samples: 19243160. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:09:08,321][175405] Avg episode reward: [(0, '65.263')] [2023-03-07 10:09:08,893][175731] Updated weights for policy 0, policy_version 18800 (0.0006) [2023-03-07 10:09:09,715][175731] Updated weights for policy 0, policy_version 18810 (0.0006) [2023-03-07 10:09:10,512][175731] Updated weights for policy 0, policy_version 18820 (0.0006) [2023-03-07 10:09:11,321][175731] Updated weights for policy 0, policy_version 18830 (0.0006) [2023-03-07 10:09:12,145][175731] Updated weights for policy 0, policy_version 18840 (0.0006) [2023-03-07 10:09:12,961][175731] Updated weights for policy 0, policy_version 18850 (0.0007) [2023-03-07 10:09:13,321][175405] Fps is (10 sec: 12595.2, 60 sec: 12680.5, 300 sec: 12708.0). Total num frames: 19306496. Throughput: 0: 12679.6. Samples: 19280932. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:09:13,322][175405] Avg episode reward: [(0, '72.802')] [2023-03-07 10:09:13,762][175731] Updated weights for policy 0, policy_version 18860 (0.0006) [2023-03-07 10:09:14,554][175731] Updated weights for policy 0, policy_version 18870 (0.0006) [2023-03-07 10:09:15,365][175731] Updated weights for policy 0, policy_version 18880 (0.0007) [2023-03-07 10:09:16,176][175731] Updated weights for policy 0, policy_version 18890 (0.0007) [2023-03-07 10:09:17,004][175731] Updated weights for policy 0, policy_version 18900 (0.0006) [2023-03-07 10:09:17,826][175731] Updated weights for policy 0, policy_version 18910 (0.0007) [2023-03-07 10:09:18,321][175405] Fps is (10 sec: 12697.4, 60 sec: 12680.5, 300 sec: 12704.5). Total num frames: 19369984. Throughput: 0: 12676.0. Samples: 19356650. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:09:18,322][175405] Avg episode reward: [(0, '76.735')] [2023-03-07 10:09:18,638][175731] Updated weights for policy 0, policy_version 18920 (0.0006) [2023-03-07 10:09:19,438][175731] Updated weights for policy 0, policy_version 18930 (0.0006) [2023-03-07 10:09:20,246][175731] Updated weights for policy 0, policy_version 18940 (0.0007) [2023-03-07 10:09:21,036][175731] Updated weights for policy 0, policy_version 18950 (0.0007) [2023-03-07 10:09:21,848][175731] Updated weights for policy 0, policy_version 18960 (0.0008) [2023-03-07 10:09:22,664][175731] Updated weights for policy 0, policy_version 18970 (0.0006) [2023-03-07 10:09:23,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12680.6, 300 sec: 12708.0). Total num frames: 19433472. Throughput: 0: 12675.2. Samples: 19432632. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:09:23,321][175405] Avg episode reward: [(0, '116.121')] [2023-03-07 10:09:23,470][175731] Updated weights for policy 0, policy_version 18980 (0.0007) [2023-03-07 10:09:24,285][175731] Updated weights for policy 0, policy_version 18990 (0.0006) [2023-03-07 10:09:25,081][175731] Updated weights for policy 0, policy_version 19000 (0.0007) [2023-03-07 10:09:25,895][175731] Updated weights for policy 0, policy_version 19010 (0.0007) [2023-03-07 10:09:26,703][175731] Updated weights for policy 0, policy_version 19020 (0.0007) [2023-03-07 10:09:27,505][175731] Updated weights for policy 0, policy_version 19030 (0.0006) [2023-03-07 10:09:28,313][175731] Updated weights for policy 0, policy_version 19040 (0.0007) [2023-03-07 10:09:28,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12680.6, 300 sec: 12704.5). Total num frames: 19496960. Throughput: 0: 12671.9. Samples: 19470692. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:09:28,322][175405] Avg episode reward: [(0, '72.781')] [2023-03-07 10:09:29,138][175731] Updated weights for policy 0, policy_version 19050 (0.0007) [2023-03-07 10:09:29,941][175731] Updated weights for policy 0, policy_version 19060 (0.0006) [2023-03-07 10:09:30,762][175731] Updated weights for policy 0, policy_version 19070 (0.0006) [2023-03-07 10:09:31,549][175731] Updated weights for policy 0, policy_version 19080 (0.0006) [2023-03-07 10:09:32,357][175731] Updated weights for policy 0, policy_version 19090 (0.0007) [2023-03-07 10:09:33,173][175731] Updated weights for policy 0, policy_version 19100 (0.0006) [2023-03-07 10:09:33,321][175405] Fps is (10 sec: 12595.1, 60 sec: 12663.4, 300 sec: 12701.1). Total num frames: 19559424. Throughput: 0: 12669.1. Samples: 19546692. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:09:33,322][175405] Avg episode reward: [(0, '69.563')] [2023-03-07 10:09:33,973][175731] Updated weights for policy 0, policy_version 19110 (0.0007) [2023-03-07 10:09:34,794][175731] Updated weights for policy 0, policy_version 19120 (0.0007) [2023-03-07 10:09:35,590][175731] Updated weights for policy 0, policy_version 19130 (0.0006) [2023-03-07 10:09:36,404][175731] Updated weights for policy 0, policy_version 19140 (0.0007) [2023-03-07 10:09:37,203][175731] Updated weights for policy 0, policy_version 19150 (0.0006) [2023-03-07 10:09:38,001][175731] Updated weights for policy 0, policy_version 19160 (0.0006) [2023-03-07 10:09:38,321][175405] Fps is (10 sec: 12595.2, 60 sec: 12663.5, 300 sec: 12697.6). Total num frames: 19622912. Throughput: 0: 12669.2. Samples: 19622821. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:09:38,321][175405] Avg episode reward: [(0, '75.490')] [2023-03-07 10:09:38,809][175731] Updated weights for policy 0, policy_version 19170 (0.0007) [2023-03-07 10:09:39,617][175731] Updated weights for policy 0, policy_version 19180 (0.0007) [2023-03-07 10:09:40,422][175731] Updated weights for policy 0, policy_version 19190 (0.0005) [2023-03-07 10:09:41,232][175731] Updated weights for policy 0, policy_version 19200 (0.0006) [2023-03-07 10:09:42,048][175731] Updated weights for policy 0, policy_version 19210 (0.0006) [2023-03-07 10:09:42,852][175731] Updated weights for policy 0, policy_version 19220 (0.0006) [2023-03-07 10:09:43,321][175405] Fps is (10 sec: 12697.8, 60 sec: 12663.5, 300 sec: 12697.6). Total num frames: 19686400. Throughput: 0: 12670.0. Samples: 19660942. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:09:43,332][175405] Avg episode reward: [(0, '76.790')] [2023-03-07 10:09:43,665][175731] Updated weights for policy 0, policy_version 19230 (0.0007) [2023-03-07 10:09:44,475][175731] Updated weights for policy 0, policy_version 19240 (0.0006) [2023-03-07 10:09:45,274][175731] Updated weights for policy 0, policy_version 19250 (0.0006) [2023-03-07 10:09:46,071][175731] Updated weights for policy 0, policy_version 19260 (0.0006) [2023-03-07 10:09:46,870][175731] Updated weights for policy 0, policy_version 19270 (0.0006) [2023-03-07 10:09:47,673][175731] Updated weights for policy 0, policy_version 19280 (0.0006) [2023-03-07 10:09:48,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12663.5, 300 sec: 12697.6). Total num frames: 19749888. Throughput: 0: 12677.6. Samples: 19737184. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:09:48,332][175405] Avg episode reward: [(0, '123.599')] [2023-03-07 10:09:48,338][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000019288_19750912.pth... [2023-03-07 10:09:48,368][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000016314_16705536.pth [2023-03-07 10:09:48,486][175731] Updated weights for policy 0, policy_version 19290 (0.0006) [2023-03-07 10:09:49,306][175731] Updated weights for policy 0, policy_version 19300 (0.0007) [2023-03-07 10:09:50,112][175731] Updated weights for policy 0, policy_version 19310 (0.0006) [2023-03-07 10:09:50,946][175731] Updated weights for policy 0, policy_version 19320 (0.0006) [2023-03-07 10:09:51,767][175731] Updated weights for policy 0, policy_version 19330 (0.0006) [2023-03-07 10:09:52,580][175731] Updated weights for policy 0, policy_version 19340 (0.0006) [2023-03-07 10:09:53,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12663.5, 300 sec: 12694.1). Total num frames: 19813376. Throughput: 0: 12659.2. 
Samples: 19812823. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:09:53,332][175405] Avg episode reward: [(0, '84.999')] [2023-03-07 10:09:53,377][175731] Updated weights for policy 0, policy_version 19350 (0.0007) [2023-03-07 10:09:54,172][175731] Updated weights for policy 0, policy_version 19360 (0.0006) [2023-03-07 10:09:54,978][175731] Updated weights for policy 0, policy_version 19370 (0.0006) [2023-03-07 10:09:55,798][175731] Updated weights for policy 0, policy_version 19380 (0.0007) [2023-03-07 10:09:56,606][175731] Updated weights for policy 0, policy_version 19390 (0.0007) [2023-03-07 10:09:57,391][175731] Updated weights for policy 0, policy_version 19400 (0.0006) [2023-03-07 10:09:58,193][175731] Updated weights for policy 0, policy_version 19410 (0.0006) [2023-03-07 10:09:58,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12680.5, 300 sec: 12694.1). Total num frames: 19876864. Throughput: 0: 12665.9. Samples: 19850897. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:09:58,332][175405] Avg episode reward: [(0, '82.696')] [2023-03-07 10:09:59,005][175731] Updated weights for policy 0, policy_version 19420 (0.0006) [2023-03-07 10:09:59,802][175731] Updated weights for policy 0, policy_version 19430 (0.0006) [2023-03-07 10:10:00,621][175731] Updated weights for policy 0, policy_version 19440 (0.0005) [2023-03-07 10:10:01,427][175731] Updated weights for policy 0, policy_version 19450 (0.0006) [2023-03-07 10:10:02,242][175731] Updated weights for policy 0, policy_version 19460 (0.0006) [2023-03-07 10:10:03,059][175731] Updated weights for policy 0, policy_version 19470 (0.0007) [2023-03-07 10:10:03,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12663.5, 300 sec: 12690.7). Total num frames: 19940352. Throughput: 0: 12675.1. Samples: 19927027. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:10:03,332][175405] Avg episode reward: [(0, '118.934')] [2023-03-07 10:10:03,850][175731] Updated weights for policy 0, policy_version 19480 (0.0007) [2023-03-07 10:10:04,661][175731] Updated weights for policy 0, policy_version 19490 (0.0007) [2023-03-07 10:10:05,458][175731] Updated weights for policy 0, policy_version 19500 (0.0006) [2023-03-07 10:10:06,259][175731] Updated weights for policy 0, policy_version 19510 (0.0006) [2023-03-07 10:10:07,070][175731] Updated weights for policy 0, policy_version 19520 (0.0007) [2023-03-07 10:10:07,873][175731] Updated weights for policy 0, policy_version 19530 (0.0006) [2023-03-07 10:10:08,321][175405] Fps is (10 sec: 12697.8, 60 sec: 12680.5, 300 sec: 12694.1). Total num frames: 20003840. Throughput: 0: 12678.6. Samples: 20003167. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:10:08,332][175405] Avg episode reward: [(0, '80.190')] [2023-03-07 10:10:08,684][175731] Updated weights for policy 0, policy_version 19540 (0.0006) [2023-03-07 10:10:09,498][175731] Updated weights for policy 0, policy_version 19550 (0.0007) [2023-03-07 10:10:10,298][175731] Updated weights for policy 0, policy_version 19560 (0.0006) [2023-03-07 10:10:11,133][175731] Updated weights for policy 0, policy_version 19570 (0.0007) [2023-03-07 10:10:11,946][175731] Updated weights for policy 0, policy_version 19580 (0.0007) [2023-03-07 10:10:12,765][175731] Updated weights for policy 0, policy_version 19590 (0.0006) [2023-03-07 10:10:13,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12680.5, 300 sec: 12690.7). Total num frames: 20067328. Throughput: 0: 12672.7. Samples: 20040961. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:10:13,332][175405] Avg episode reward: [(0, '78.314')] [2023-03-07 10:10:13,561][175731] Updated weights for policy 0, policy_version 19600 (0.0007) [2023-03-07 10:10:14,367][175731] Updated weights for policy 0, policy_version 19610 (0.0006) [2023-03-07 10:10:15,181][175731] Updated weights for policy 0, policy_version 19620 (0.0007) [2023-03-07 10:10:15,985][175731] Updated weights for policy 0, policy_version 19630 (0.0006) [2023-03-07 10:10:16,794][175731] Updated weights for policy 0, policy_version 19640 (0.0006) [2023-03-07 10:10:17,604][175731] Updated weights for policy 0, policy_version 19650 (0.0007) [2023-03-07 10:10:18,321][175405] Fps is (10 sec: 12595.0, 60 sec: 12663.5, 300 sec: 12687.2). Total num frames: 20129792. Throughput: 0: 12672.2. Samples: 20116939. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:10:18,324][175405] Avg episode reward: [(0, '64.218')] [2023-03-07 10:10:18,398][175731] Updated weights for policy 0, policy_version 19660 (0.0008) [2023-03-07 10:10:19,231][175731] Updated weights for policy 0, policy_version 19670 (0.0006) [2023-03-07 10:10:20,034][175731] Updated weights for policy 0, policy_version 19680 (0.0007) [2023-03-07 10:10:20,844][175731] Updated weights for policy 0, policy_version 19690 (0.0007) [2023-03-07 10:10:21,658][175731] Updated weights for policy 0, policy_version 19700 (0.0006) [2023-03-07 10:10:22,494][175731] Updated weights for policy 0, policy_version 19710 (0.0006) [2023-03-07 10:10:23,287][175731] Updated weights for policy 0, policy_version 19720 (0.0006) [2023-03-07 10:10:23,321][175405] Fps is (10 sec: 12595.2, 60 sec: 12663.5, 300 sec: 12687.2). Total num frames: 20193280. Throughput: 0: 12665.6. Samples: 20192773. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:10:23,321][175405] Avg episode reward: [(0, '80.651')] [2023-03-07 10:10:24,087][175731] Updated weights for policy 0, policy_version 19730 (0.0007) [2023-03-07 10:10:24,893][175731] Updated weights for policy 0, policy_version 19740 (0.0006) [2023-03-07 10:10:25,682][175731] Updated weights for policy 0, policy_version 19750 (0.0006) [2023-03-07 10:10:26,510][175731] Updated weights for policy 0, policy_version 19760 (0.0006) [2023-03-07 10:10:27,309][175731] Updated weights for policy 0, policy_version 19770 (0.0006) [2023-03-07 10:10:28,118][175731] Updated weights for policy 0, policy_version 19780 (0.0006) [2023-03-07 10:10:28,321][175405] Fps is (10 sec: 12697.8, 60 sec: 12663.5, 300 sec: 12687.2). Total num frames: 20256768. Throughput: 0: 12668.1. Samples: 20231008. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:10:28,322][175405] Avg episode reward: [(0, '84.329')] [2023-03-07 10:10:28,937][175731] Updated weights for policy 0, policy_version 19790 (0.0006) [2023-03-07 10:10:29,745][175731] Updated weights for policy 0, policy_version 19800 (0.0007) [2023-03-07 10:10:30,564][175731] Updated weights for policy 0, policy_version 19810 (0.0008) [2023-03-07 10:10:31,354][175731] Updated weights for policy 0, policy_version 19820 (0.0006) [2023-03-07 10:10:32,166][175731] Updated weights for policy 0, policy_version 19830 (0.0006) [2023-03-07 10:10:32,974][175731] Updated weights for policy 0, policy_version 19840 (0.0006) [2023-03-07 10:10:33,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12680.6, 300 sec: 12683.7). Total num frames: 20320256. Throughput: 0: 12664.4. Samples: 20307082. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:10:33,321][175405] Avg episode reward: [(0, '93.014')] [2023-03-07 10:10:33,782][175731] Updated weights for policy 0, policy_version 19850 (0.0007) [2023-03-07 10:10:34,609][175731] Updated weights for policy 0, policy_version 19860 (0.0008) [2023-03-07 10:10:35,422][175731] Updated weights for policy 0, policy_version 19870 (0.0006) [2023-03-07 10:10:36,221][175731] Updated weights for policy 0, policy_version 19880 (0.0006) [2023-03-07 10:10:37,035][175731] Updated weights for policy 0, policy_version 19890 (0.0006) [2023-03-07 10:10:37,849][175731] Updated weights for policy 0, policy_version 19900 (0.0006) [2023-03-07 10:10:38,321][175405] Fps is (10 sec: 12594.9, 60 sec: 12663.4, 300 sec: 12680.2). Total num frames: 20382720. Throughput: 0: 12663.1. Samples: 20382665. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:10:38,322][175405] Avg episode reward: [(0, '88.934')] [2023-03-07 10:10:38,660][175731] Updated weights for policy 0, policy_version 19910 (0.0007) [2023-03-07 10:10:39,484][175731] Updated weights for policy 0, policy_version 19920 (0.0006) [2023-03-07 10:10:40,278][175731] Updated weights for policy 0, policy_version 19930 (0.0006) [2023-03-07 10:10:41,085][175731] Updated weights for policy 0, policy_version 19940 (0.0006) [2023-03-07 10:10:41,879][175731] Updated weights for policy 0, policy_version 19950 (0.0006) [2023-03-07 10:10:42,684][175731] Updated weights for policy 0, policy_version 19960 (0.0007) [2023-03-07 10:10:43,321][175405] Fps is (10 sec: 12595.1, 60 sec: 12663.5, 300 sec: 12680.2). Total num frames: 20446208. Throughput: 0: 12661.8. Samples: 20420676. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:10:43,322][175405] Avg episode reward: [(0, '81.682')] [2023-03-07 10:10:43,496][175731] Updated weights for policy 0, policy_version 19970 (0.0007) [2023-03-07 10:10:44,311][175731] Updated weights for policy 0, policy_version 19980 (0.0006) [2023-03-07 10:10:45,127][175731] Updated weights for policy 0, policy_version 19990 (0.0006) [2023-03-07 10:10:45,925][175731] Updated weights for policy 0, policy_version 20000 (0.0006) [2023-03-07 10:10:46,726][175731] Updated weights for policy 0, policy_version 20010 (0.0007) [2023-03-07 10:10:47,529][175731] Updated weights for policy 0, policy_version 20020 (0.0007) [2023-03-07 10:10:48,321][175405] Fps is (10 sec: 12697.8, 60 sec: 12663.5, 300 sec: 12683.7). Total num frames: 20509696. Throughput: 0: 12661.9. Samples: 20496811. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:10:48,322][175405] Avg episode reward: [(0, '81.512')] [2023-03-07 10:10:48,340][175731] Updated weights for policy 0, policy_version 20030 (0.0005) [2023-03-07 10:10:49,133][175731] Updated weights for policy 0, policy_version 20040 (0.0006) [2023-03-07 10:10:49,946][175731] Updated weights for policy 0, policy_version 20050 (0.0006) [2023-03-07 10:10:50,762][175731] Updated weights for policy 0, policy_version 20060 (0.0006) [2023-03-07 10:10:51,562][175731] Updated weights for policy 0, policy_version 20070 (0.0006) [2023-03-07 10:10:52,367][175731] Updated weights for policy 0, policy_version 20080 (0.0005) [2023-03-07 10:10:53,169][175731] Updated weights for policy 0, policy_version 20090 (0.0006) [2023-03-07 10:10:53,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12663.5, 300 sec: 12680.3). Total num frames: 20573184. Throughput: 0: 12663.5. Samples: 20573025. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:10:53,321][175405] Avg episode reward: [(0, '80.656')] [2023-03-07 10:10:53,959][175731] Updated weights for policy 0, policy_version 20100 (0.0007) [2023-03-07 10:10:54,777][175731] Updated weights for policy 0, policy_version 20110 (0.0006) [2023-03-07 10:10:55,590][175731] Updated weights for policy 0, policy_version 20120 (0.0007) [2023-03-07 10:10:56,401][175731] Updated weights for policy 0, policy_version 20130 (0.0006) [2023-03-07 10:10:57,197][175731] Updated weights for policy 0, policy_version 20140 (0.0006) [2023-03-07 10:10:58,007][175731] Updated weights for policy 0, policy_version 20150 (0.0006) [2023-03-07 10:10:58,321][175405] Fps is (10 sec: 12697.4, 60 sec: 12663.5, 300 sec: 12680.2). Total num frames: 20636672. Throughput: 0: 12669.6. Samples: 20611096. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:10:58,332][175405] Avg episode reward: [(0, '89.509')] [2023-03-07 10:10:58,808][175731] Updated weights for policy 0, policy_version 20160 (0.0007) [2023-03-07 10:10:59,615][175731] Updated weights for policy 0, policy_version 20170 (0.0006) [2023-03-07 10:11:00,428][175731] Updated weights for policy 0, policy_version 20180 (0.0006) [2023-03-07 10:11:01,229][175731] Updated weights for policy 0, policy_version 20190 (0.0007) [2023-03-07 10:11:02,042][175731] Updated weights for policy 0, policy_version 20200 (0.0006) [2023-03-07 10:11:02,866][175731] Updated weights for policy 0, policy_version 20210 (0.0006) [2023-03-07 10:11:03,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12663.5, 300 sec: 12683.7). Total num frames: 20700160. Throughput: 0: 12676.1. Samples: 20687361. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:11:03,332][175405] Avg episode reward: [(0, '75.634')] [2023-03-07 10:11:03,664][175731] Updated weights for policy 0, policy_version 20220 (0.0006) [2023-03-07 10:11:04,471][175731] Updated weights for policy 0, policy_version 20230 (0.0005) [2023-03-07 10:11:05,273][175731] Updated weights for policy 0, policy_version 20240 (0.0006) [2023-03-07 10:11:06,069][175731] Updated weights for policy 0, policy_version 20250 (0.0007) [2023-03-07 10:11:06,869][175731] Updated weights for policy 0, policy_version 20260 (0.0006) [2023-03-07 10:11:07,674][175731] Updated weights for policy 0, policy_version 20270 (0.0006) [2023-03-07 10:11:08,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12663.4, 300 sec: 12683.7). Total num frames: 20763648. Throughput: 0: 12687.7. Samples: 20763721. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:11:08,332][175405] Avg episode reward: [(0, '84.414')] [2023-03-07 10:11:08,485][175731] Updated weights for policy 0, policy_version 20280 (0.0006) [2023-03-07 10:11:09,289][175731] Updated weights for policy 0, policy_version 20290 (0.0008) [2023-03-07 10:11:10,101][175731] Updated weights for policy 0, policy_version 20300 (0.0007) [2023-03-07 10:11:10,898][175731] Updated weights for policy 0, policy_version 20310 (0.0007) [2023-03-07 10:11:11,714][175731] Updated weights for policy 0, policy_version 20320 (0.0006) [2023-03-07 10:11:12,507][175731] Updated weights for policy 0, policy_version 20330 (0.0007) [2023-03-07 10:11:13,318][175731] Updated weights for policy 0, policy_version 20340 (0.0006) [2023-03-07 10:11:13,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12680.5, 300 sec: 12687.2). Total num frames: 20828160. Throughput: 0: 12682.7. Samples: 20801733. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:11:13,332][175405] Avg episode reward: [(0, '84.480')] [2023-03-07 10:11:14,131][175731] Updated weights for policy 0, policy_version 20350 (0.0006) [2023-03-07 10:11:14,949][175731] Updated weights for policy 0, policy_version 20360 (0.0007) [2023-03-07 10:11:15,765][175731] Updated weights for policy 0, policy_version 20370 (0.0006) [2023-03-07 10:11:16,566][175731] Updated weights for policy 0, policy_version 20380 (0.0006) [2023-03-07 10:11:17,396][175731] Updated weights for policy 0, policy_version 20390 (0.0006) [2023-03-07 10:11:18,209][175731] Updated weights for policy 0, policy_version 20400 (0.0006) [2023-03-07 10:11:18,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12683.7). Total num frames: 20890624. Throughput: 0: 12678.7. Samples: 20877626. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:11:18,332][175405] Avg episode reward: [(0, '94.472')] [2023-03-07 10:11:19,011][175731] Updated weights for policy 0, policy_version 20410 (0.0007) [2023-03-07 10:11:19,820][175731] Updated weights for policy 0, policy_version 20420 (0.0007) [2023-03-07 10:11:20,629][175731] Updated weights for policy 0, policy_version 20430 (0.0007) [2023-03-07 10:11:21,434][175731] Updated weights for policy 0, policy_version 20440 (0.0006) [2023-03-07 10:11:22,253][175731] Updated weights for policy 0, policy_version 20450 (0.0006) [2023-03-07 10:11:23,052][175731] Updated weights for policy 0, policy_version 20460 (0.0006) [2023-03-07 10:11:23,321][175405] Fps is (10 sec: 12595.2, 60 sec: 12680.5, 300 sec: 12683.7). Total num frames: 20954112. Throughput: 0: 12684.5. Samples: 20953466. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:11:23,332][175405] Avg episode reward: [(0, '89.161')] [2023-03-07 10:11:23,857][175731] Updated weights for policy 0, policy_version 20470 (0.0007) [2023-03-07 10:11:24,656][175731] Updated weights for policy 0, policy_version 20480 (0.0008) [2023-03-07 10:11:25,455][175731] Updated weights for policy 0, policy_version 20490 (0.0007) [2023-03-07 10:11:26,266][175731] Updated weights for policy 0, policy_version 20500 (0.0006) [2023-03-07 10:11:27,083][175731] Updated weights for policy 0, policy_version 20510 (0.0006) [2023-03-07 10:11:27,890][175731] Updated weights for policy 0, policy_version 20520 (0.0007) [2023-03-07 10:11:28,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12680.5, 300 sec: 12683.7). Total num frames: 21017600. Throughput: 0: 12691.6. Samples: 20991796. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:11:28,332][175405] Avg episode reward: [(0, '86.734')] [2023-03-07 10:11:28,667][175731] Updated weights for policy 0, policy_version 20530 (0.0007) [2023-03-07 10:11:29,498][175731] Updated weights for policy 0, policy_version 20540 (0.0006) [2023-03-07 10:11:30,302][175731] Updated weights for policy 0, policy_version 20550 (0.0006) [2023-03-07 10:11:31,114][175731] Updated weights for policy 0, policy_version 20560 (0.0006) [2023-03-07 10:11:31,948][175731] Updated weights for policy 0, policy_version 20570 (0.0006) [2023-03-07 10:11:32,752][175731] Updated weights for policy 0, policy_version 20580 (0.0007) [2023-03-07 10:11:33,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12683.7). Total num frames: 21081088. Throughput: 0: 12683.8. Samples: 21067581. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:11:33,332][175405] Avg episode reward: [(0, '92.878')] [2023-03-07 10:11:33,541][175731] Updated weights for policy 0, policy_version 20590 (0.0006) [2023-03-07 10:11:34,350][175731] Updated weights for policy 0, policy_version 20600 (0.0006) [2023-03-07 10:11:35,143][175731] Updated weights for policy 0, policy_version 20610 (0.0007) [2023-03-07 10:11:35,950][175731] Updated weights for policy 0, policy_version 20620 (0.0007) [2023-03-07 10:11:36,749][175731] Updated weights for policy 0, policy_version 20630 (0.0006) [2023-03-07 10:11:37,537][175731] Updated weights for policy 0, policy_version 20640 (0.0006) [2023-03-07 10:11:38,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12683.7). Total num frames: 21144576. Throughput: 0: 12691.5. Samples: 21144143. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:11:38,332][175405] Avg episode reward: [(0, '89.229')] [2023-03-07 10:11:38,358][175731] Updated weights for policy 0, policy_version 20650 (0.0006) [2023-03-07 10:11:39,163][175731] Updated weights for policy 0, policy_version 20660 (0.0006) [2023-03-07 10:11:39,966][175731] Updated weights for policy 0, policy_version 20670 (0.0007) [2023-03-07 10:11:40,790][175731] Updated weights for policy 0, policy_version 20680 (0.0007) [2023-03-07 10:11:41,597][175731] Updated weights for policy 0, policy_version 20690 (0.0006) [2023-03-07 10:11:42,402][175731] Updated weights for policy 0, policy_version 20700 (0.0007) [2023-03-07 10:11:43,210][175731] Updated weights for policy 0, policy_version 20710 (0.0006) [2023-03-07 10:11:43,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12683.7). Total num frames: 21208064. Throughput: 0: 12683.2. Samples: 21181841. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:11:43,322][175405] Avg episode reward: [(0, '98.326')] [2023-03-07 10:11:44,017][175731] Updated weights for policy 0, policy_version 20720 (0.0006) [2023-03-07 10:11:44,822][175731] Updated weights for policy 0, policy_version 20730 (0.0006) [2023-03-07 10:11:45,640][175731] Updated weights for policy 0, policy_version 20740 (0.0006) [2023-03-07 10:11:46,444][175731] Updated weights for policy 0, policy_version 20750 (0.0006) [2023-03-07 10:11:47,268][175731] Updated weights for policy 0, policy_version 20760 (0.0006) [2023-03-07 10:11:48,064][175731] Updated weights for policy 0, policy_version 20770 (0.0006) [2023-03-07 10:11:48,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12683.7). Total num frames: 21271552. Throughput: 0: 12678.5. Samples: 21257894. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:11:48,322][175405] Avg episode reward: [(0, '89.696')] [2023-03-07 10:11:48,326][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000020773_21271552.pth... [2023-03-07 10:11:48,357][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000017802_18229248.pth [2023-03-07 10:11:48,881][175731] Updated weights for policy 0, policy_version 20780 (0.0006) [2023-03-07 10:11:49,676][175731] Updated weights for policy 0, policy_version 20790 (0.0006) [2023-03-07 10:11:50,475][175731] Updated weights for policy 0, policy_version 20800 (0.0006) [2023-03-07 10:11:51,275][175731] Updated weights for policy 0, policy_version 20810 (0.0007) [2023-03-07 10:11:52,096][175731] Updated weights for policy 0, policy_version 20820 (0.0007) [2023-03-07 10:11:52,886][175731] Updated weights for policy 0, policy_version 20830 (0.0007) [2023-03-07 10:11:53,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12683.7). Total num frames: 21335040. Throughput: 0: 12681.8. 
Samples: 21334403. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:11:53,322][175405] Avg episode reward: [(0, '91.113')] [2023-03-07 10:11:53,695][175731] Updated weights for policy 0, policy_version 20840 (0.0007) [2023-03-07 10:11:54,521][175731] Updated weights for policy 0, policy_version 20850 (0.0006) [2023-03-07 10:11:55,298][175731] Updated weights for policy 0, policy_version 20860 (0.0007) [2023-03-07 10:11:56,113][175731] Updated weights for policy 0, policy_version 20870 (0.0007) [2023-03-07 10:11:56,925][175731] Updated weights for policy 0, policy_version 20880 (0.0006) [2023-03-07 10:11:57,732][175731] Updated weights for policy 0, policy_version 20890 (0.0006) [2023-03-07 10:11:58,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12683.7). Total num frames: 21398528. Throughput: 0: 12684.1. Samples: 21372518. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:11:58,322][175405] Avg episode reward: [(0, '94.140')] [2023-03-07 10:11:58,534][175731] Updated weights for policy 0, policy_version 20900 (0.0006) [2023-03-07 10:11:59,319][175731] Updated weights for policy 0, policy_version 20910 (0.0006) [2023-03-07 10:12:00,134][175731] Updated weights for policy 0, policy_version 20920 (0.0005) [2023-03-07 10:12:00,954][175731] Updated weights for policy 0, policy_version 20930 (0.0006) [2023-03-07 10:12:01,747][175731] Updated weights for policy 0, policy_version 20940 (0.0006) [2023-03-07 10:12:02,545][175731] Updated weights for policy 0, policy_version 20950 (0.0007) [2023-03-07 10:12:03,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12683.7). Total num frames: 21462016. Throughput: 0: 12697.5. Samples: 21449013. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:12:03,322][175405] Avg episode reward: [(0, '94.224')] [2023-03-07 10:12:03,359][175731] Updated weights for policy 0, policy_version 20960 (0.0007) [2023-03-07 10:12:04,152][175731] Updated weights for policy 0, policy_version 20970 (0.0007) [2023-03-07 10:12:04,959][175731] Updated weights for policy 0, policy_version 20980 (0.0006) [2023-03-07 10:12:05,771][175731] Updated weights for policy 0, policy_version 20990 (0.0007) [2023-03-07 10:12:06,579][175731] Updated weights for policy 0, policy_version 21000 (0.0006) [2023-03-07 10:12:07,390][175731] Updated weights for policy 0, policy_version 21010 (0.0006) [2023-03-07 10:12:08,182][175731] Updated weights for policy 0, policy_version 21020 (0.0007) [2023-03-07 10:12:08,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12683.7). Total num frames: 21525504. Throughput: 0: 12707.3. Samples: 21525295. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:12:08,321][175405] Avg episode reward: [(0, '95.396')] [2023-03-07 10:12:08,988][175731] Updated weights for policy 0, policy_version 21030 (0.0007) [2023-03-07 10:12:09,797][175731] Updated weights for policy 0, policy_version 21040 (0.0007) [2023-03-07 10:12:10,600][175731] Updated weights for policy 0, policy_version 21050 (0.0006) [2023-03-07 10:12:11,417][175731] Updated weights for policy 0, policy_version 21060 (0.0006) [2023-03-07 10:12:12,217][175731] Updated weights for policy 0, policy_version 21070 (0.0006) [2023-03-07 10:12:13,020][175731] Updated weights for policy 0, policy_version 21080 (0.0006) [2023-03-07 10:12:13,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12683.7). Total num frames: 21588992. Throughput: 0: 12701.7. Samples: 21563372. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:12:13,322][175405] Avg episode reward: [(0, '112.282')] [2023-03-07 10:12:13,834][175731] Updated weights for policy 0, policy_version 21090 (0.0006) [2023-03-07 10:12:14,639][175731] Updated weights for policy 0, policy_version 21100 (0.0006) [2023-03-07 10:12:15,449][175731] Updated weights for policy 0, policy_version 21110 (0.0007) [2023-03-07 10:12:16,249][175731] Updated weights for policy 0, policy_version 21120 (0.0007) [2023-03-07 10:12:17,086][175731] Updated weights for policy 0, policy_version 21130 (0.0006) [2023-03-07 10:12:17,871][175731] Updated weights for policy 0, policy_version 21140 (0.0007) [2023-03-07 10:12:18,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12683.7). Total num frames: 21652480. Throughput: 0: 12706.9. Samples: 21639394. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:12:18,322][175405] Avg episode reward: [(0, '121.666')] [2023-03-07 10:12:18,677][175731] Updated weights for policy 0, policy_version 21150 (0.0006) [2023-03-07 10:12:19,494][175731] Updated weights for policy 0, policy_version 21160 (0.0006) [2023-03-07 10:12:20,289][175731] Updated weights for policy 0, policy_version 21170 (0.0007) [2023-03-07 10:12:21,097][175731] Updated weights for policy 0, policy_version 21180 (0.0006) [2023-03-07 10:12:21,909][175731] Updated weights for policy 0, policy_version 21190 (0.0006) [2023-03-07 10:12:22,716][175731] Updated weights for policy 0, policy_version 21200 (0.0007) [2023-03-07 10:12:23,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12680.2). Total num frames: 21715968. Throughput: 0: 12697.9. Samples: 21715551. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:12:23,322][175405] Avg episode reward: [(0, '123.966')] [2023-03-07 10:12:23,513][175731] Updated weights for policy 0, policy_version 21210 (0.0005) [2023-03-07 10:12:24,326][175731] Updated weights for policy 0, policy_version 21220 (0.0006) [2023-03-07 10:12:25,148][175731] Updated weights for policy 0, policy_version 21230 (0.0006) [2023-03-07 10:12:25,947][175731] Updated weights for policy 0, policy_version 21240 (0.0007) [2023-03-07 10:12:26,733][175731] Updated weights for policy 0, policy_version 21250 (0.0006) [2023-03-07 10:12:27,555][175731] Updated weights for policy 0, policy_version 21260 (0.0006) [2023-03-07 10:12:28,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12680.2). Total num frames: 21779456. Throughput: 0: 12704.7. Samples: 21753556. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:12:28,322][175405] Avg episode reward: [(0, '124.152')] [2023-03-07 10:12:28,359][175731] Updated weights for policy 0, policy_version 21270 (0.0006) [2023-03-07 10:12:29,171][175731] Updated weights for policy 0, policy_version 21280 (0.0005) [2023-03-07 10:12:29,989][175731] Updated weights for policy 0, policy_version 21290 (0.0007) [2023-03-07 10:12:30,797][175731] Updated weights for policy 0, policy_version 21300 (0.0006) [2023-03-07 10:12:31,610][175731] Updated weights for policy 0, policy_version 21310 (0.0006) [2023-03-07 10:12:32,411][175731] Updated weights for policy 0, policy_version 21320 (0.0006) [2023-03-07 10:12:33,216][175731] Updated weights for policy 0, policy_version 21330 (0.0006) [2023-03-07 10:12:33,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12680.2). Total num frames: 21842944. Throughput: 0: 12705.4. Samples: 21829635. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:12:33,321][175405] Avg episode reward: [(0, '91.040')] [2023-03-07 10:12:34,026][175731] Updated weights for policy 0, policy_version 21340 (0.0006) [2023-03-07 10:12:34,826][175731] Updated weights for policy 0, policy_version 21350 (0.0006) [2023-03-07 10:12:35,640][175731] Updated weights for policy 0, policy_version 21360 (0.0006) [2023-03-07 10:12:36,449][175731] Updated weights for policy 0, policy_version 21370 (0.0006) [2023-03-07 10:12:37,249][175731] Updated weights for policy 0, policy_version 21380 (0.0007) [2023-03-07 10:12:38,066][175731] Updated weights for policy 0, policy_version 21390 (0.0006) [2023-03-07 10:12:38,321][175405] Fps is (10 sec: 12697.8, 60 sec: 12697.6, 300 sec: 12680.2). Total num frames: 21906432. Throughput: 0: 12694.6. Samples: 21905660. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:12:38,321][175405] Avg episode reward: [(0, '87.670')] [2023-03-07 10:12:38,875][175731] Updated weights for policy 0, policy_version 21400 (0.0006) [2023-03-07 10:12:39,671][175731] Updated weights for policy 0, policy_version 21410 (0.0006) [2023-03-07 10:12:40,500][175731] Updated weights for policy 0, policy_version 21420 (0.0007) [2023-03-07 10:12:41,309][175731] Updated weights for policy 0, policy_version 21430 (0.0007) [2023-03-07 10:12:42,134][175731] Updated weights for policy 0, policy_version 21440 (0.0007) [2023-03-07 10:12:42,947][175731] Updated weights for policy 0, policy_version 21450 (0.0006) [2023-03-07 10:12:43,321][175405] Fps is (10 sec: 12595.2, 60 sec: 12680.5, 300 sec: 12676.8). Total num frames: 21968896. Throughput: 0: 12693.2. Samples: 21943710. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:12:43,322][175405] Avg episode reward: [(0, '96.558')] [2023-03-07 10:12:43,756][175731] Updated weights for policy 0, policy_version 21460 (0.0007) [2023-03-07 10:12:44,566][175731] Updated weights for policy 0, policy_version 21470 (0.0006) [2023-03-07 10:12:45,366][175731] Updated weights for policy 0, policy_version 21480 (0.0006) [2023-03-07 10:12:46,174][175731] Updated weights for policy 0, policy_version 21490 (0.0006) [2023-03-07 10:12:46,984][175731] Updated weights for policy 0, policy_version 21500 (0.0006) [2023-03-07 10:12:47,787][175731] Updated weights for policy 0, policy_version 21510 (0.0007) [2023-03-07 10:12:48,321][175405] Fps is (10 sec: 12595.0, 60 sec: 12680.5, 300 sec: 12680.2). Total num frames: 22032384. Throughput: 0: 12674.4. Samples: 22019363. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:12:48,322][175405] Avg episode reward: [(0, '97.160')] [2023-03-07 10:12:48,578][175731] Updated weights for policy 0, policy_version 21520 (0.0006) [2023-03-07 10:12:49,396][175731] Updated weights for policy 0, policy_version 21530 (0.0008) [2023-03-07 10:12:50,207][175731] Updated weights for policy 0, policy_version 21540 (0.0006) [2023-03-07 10:12:51,014][175731] Updated weights for policy 0, policy_version 21550 (0.0007) [2023-03-07 10:12:51,799][175731] Updated weights for policy 0, policy_version 21560 (0.0006) [2023-03-07 10:12:52,622][175731] Updated weights for policy 0, policy_version 21570 (0.0006) [2023-03-07 10:12:53,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12680.5, 300 sec: 12680.2). Total num frames: 22095872. Throughput: 0: 12674.0. Samples: 22095627. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:12:53,322][175405] Avg episode reward: [(0, '94.626')] [2023-03-07 10:12:53,415][175731] Updated weights for policy 0, policy_version 21580 (0.0005) [2023-03-07 10:12:54,235][175731] Updated weights for policy 0, policy_version 21590 (0.0006) [2023-03-07 10:12:55,042][175731] Updated weights for policy 0, policy_version 21600 (0.0006) [2023-03-07 10:12:55,847][175731] Updated weights for policy 0, policy_version 21610 (0.0006) [2023-03-07 10:12:56,655][175731] Updated weights for policy 0, policy_version 21620 (0.0006) [2023-03-07 10:12:57,475][175731] Updated weights for policy 0, policy_version 21630 (0.0006) [2023-03-07 10:12:58,281][175731] Updated weights for policy 0, policy_version 21640 (0.0007) [2023-03-07 10:12:58,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12680.2). Total num frames: 22159360. Throughput: 0: 12673.0. Samples: 22133659. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:12:58,322][175405] Avg episode reward: [(0, '120.040')] [2023-03-07 10:12:59,087][175731] Updated weights for policy 0, policy_version 21650 (0.0006) [2023-03-07 10:12:59,907][175731] Updated weights for policy 0, policy_version 21660 (0.0007) [2023-03-07 10:13:00,714][175731] Updated weights for policy 0, policy_version 21670 (0.0006) [2023-03-07 10:13:01,506][175731] Updated weights for policy 0, policy_version 21680 (0.0006) [2023-03-07 10:13:02,333][175731] Updated weights for policy 0, policy_version 21690 (0.0006) [2023-03-07 10:13:03,124][175731] Updated weights for policy 0, policy_version 21700 (0.0007) [2023-03-07 10:13:03,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12680.2). Total num frames: 22222848. Throughput: 0: 12670.8. Samples: 22209580. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:13:03,321][175405] Avg episode reward: [(0, '105.470')] [2023-03-07 10:13:03,933][175731] Updated weights for policy 0, policy_version 21710 (0.0006) [2023-03-07 10:13:04,731][175731] Updated weights for policy 0, policy_version 21720 (0.0006) [2023-03-07 10:13:05,542][175731] Updated weights for policy 0, policy_version 21730 (0.0007) [2023-03-07 10:13:06,341][175731] Updated weights for policy 0, policy_version 21740 (0.0007) [2023-03-07 10:13:07,157][175731] Updated weights for policy 0, policy_version 21750 (0.0006) [2023-03-07 10:13:07,945][175731] Updated weights for policy 0, policy_version 21760 (0.0005) [2023-03-07 10:13:08,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12680.5, 300 sec: 12680.2). Total num frames: 22286336. Throughput: 0: 12672.9. Samples: 22285831. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:13:08,322][175405] Avg episode reward: [(0, '91.868')] [2023-03-07 10:13:08,759][175731] Updated weights for policy 0, policy_version 21770 (0.0007) [2023-03-07 10:13:09,566][175731] Updated weights for policy 0, policy_version 21780 (0.0006) [2023-03-07 10:13:10,380][175731] Updated weights for policy 0, policy_version 21790 (0.0006) [2023-03-07 10:13:11,184][175731] Updated weights for policy 0, policy_version 21800 (0.0006) [2023-03-07 10:13:12,005][175731] Updated weights for policy 0, policy_version 21810 (0.0006) [2023-03-07 10:13:12,818][175731] Updated weights for policy 0, policy_version 21820 (0.0006) [2023-03-07 10:13:13,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12680.2). Total num frames: 22349824. Throughput: 0: 12672.4. Samples: 22323814. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:13:13,322][175405] Avg episode reward: [(0, '91.310')] [2023-03-07 10:13:13,626][175731] Updated weights for policy 0, policy_version 21830 (0.0006) [2023-03-07 10:13:14,420][175731] Updated weights for policy 0, policy_version 21840 (0.0006) [2023-03-07 10:13:15,219][175731] Updated weights for policy 0, policy_version 21850 (0.0006) [2023-03-07 10:13:16,008][175731] Updated weights for policy 0, policy_version 21860 (0.0006) [2023-03-07 10:13:16,830][175731] Updated weights for policy 0, policy_version 21870 (0.0006) [2023-03-07 10:13:17,638][175731] Updated weights for policy 0, policy_version 21880 (0.0006) [2023-03-07 10:13:18,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12680.2). Total num frames: 22413312. Throughput: 0: 12674.3. Samples: 22399978. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:13:18,322][175405] Avg episode reward: [(0, '92.763')] [2023-03-07 10:13:18,434][175731] Updated weights for policy 0, policy_version 21890 (0.0007) [2023-03-07 10:13:19,251][175731] Updated weights for policy 0, policy_version 21900 (0.0007) [2023-03-07 10:13:20,070][175731] Updated weights for policy 0, policy_version 21910 (0.0006) [2023-03-07 10:13:20,860][175731] Updated weights for policy 0, policy_version 21920 (0.0006) [2023-03-07 10:13:21,669][175731] Updated weights for policy 0, policy_version 21930 (0.0006) [2023-03-07 10:13:22,459][175731] Updated weights for policy 0, policy_version 21940 (0.0006) [2023-03-07 10:13:23,277][175731] Updated weights for policy 0, policy_version 21950 (0.0007) [2023-03-07 10:13:23,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12680.3). Total num frames: 22476800. Throughput: 0: 12681.9. Samples: 22476346. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:13:23,321][175405] Avg episode reward: [(0, '91.054')] [2023-03-07 10:13:24,079][175731] Updated weights for policy 0, policy_version 21960 (0.0006) [2023-03-07 10:13:24,895][175731] Updated weights for policy 0, policy_version 21970 (0.0006) [2023-03-07 10:13:25,694][175731] Updated weights for policy 0, policy_version 21980 (0.0006) [2023-03-07 10:13:26,521][175731] Updated weights for policy 0, policy_version 21990 (0.0006) [2023-03-07 10:13:27,349][175731] Updated weights for policy 0, policy_version 22000 (0.0007) [2023-03-07 10:13:28,153][175731] Updated weights for policy 0, policy_version 22010 (0.0006) [2023-03-07 10:13:28,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12680.6, 300 sec: 12680.2). Total num frames: 22540288. Throughput: 0: 12678.0. Samples: 22514219. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:13:28,322][175405] Avg episode reward: [(0, '130.225')] [2023-03-07 10:13:28,953][175731] Updated weights for policy 0, policy_version 22020 (0.0006) [2023-03-07 10:13:29,760][175731] Updated weights for policy 0, policy_version 22030 (0.0006) [2023-03-07 10:13:30,575][175731] Updated weights for policy 0, policy_version 22040 (0.0006) [2023-03-07 10:13:31,391][175731] Updated weights for policy 0, policy_version 22050 (0.0007) [2023-03-07 10:13:32,188][175731] Updated weights for policy 0, policy_version 22060 (0.0006) [2023-03-07 10:13:32,991][175731] Updated weights for policy 0, policy_version 22070 (0.0006) [2023-03-07 10:13:33,321][175405] Fps is (10 sec: 12595.3, 60 sec: 12663.5, 300 sec: 12676.8). Total num frames: 22602752. Throughput: 0: 12685.3. Samples: 22590201. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:13:33,321][175405] Avg episode reward: [(0, '157.507')] [2023-03-07 10:13:33,816][175731] Updated weights for policy 0, policy_version 22080 (0.0008) [2023-03-07 10:13:34,610][175731] Updated weights for policy 0, policy_version 22090 (0.0007) [2023-03-07 10:13:35,413][175731] Updated weights for policy 0, policy_version 22100 (0.0007) [2023-03-07 10:13:36,231][175731] Updated weights for policy 0, policy_version 22110 (0.0006) [2023-03-07 10:13:37,043][175731] Updated weights for policy 0, policy_version 22120 (0.0006) [2023-03-07 10:13:37,859][175731] Updated weights for policy 0, policy_version 22130 (0.0006) [2023-03-07 10:13:38,321][175405] Fps is (10 sec: 12595.2, 60 sec: 12663.5, 300 sec: 12676.8). Total num frames: 22666240. Throughput: 0: 12676.9. Samples: 22666089. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:13:38,321][175405] Avg episode reward: [(0, '85.142')] [2023-03-07 10:13:38,646][175731] Updated weights for policy 0, policy_version 22140 (0.0006) [2023-03-07 10:13:39,458][175731] Updated weights for policy 0, policy_version 22150 (0.0006) [2023-03-07 10:13:40,259][175731] Updated weights for policy 0, policy_version 22160 (0.0006) [2023-03-07 10:13:41,098][175731] Updated weights for policy 0, policy_version 22170 (0.0006) [2023-03-07 10:13:41,888][175731] Updated weights for policy 0, policy_version 22180 (0.0007) [2023-03-07 10:13:42,694][175731] Updated weights for policy 0, policy_version 22190 (0.0006) [2023-03-07 10:13:43,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12680.5, 300 sec: 12676.8). Total num frames: 22729728. Throughput: 0: 12676.9. Samples: 22704119. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:13:43,322][175405] Avg episode reward: [(0, '89.697')] [2023-03-07 10:13:43,500][175731] Updated weights for policy 0, policy_version 22200 (0.0007) [2023-03-07 10:13:44,318][175731] Updated weights for policy 0, policy_version 22210 (0.0006) [2023-03-07 10:13:45,097][175731] Updated weights for policy 0, policy_version 22220 (0.0007) [2023-03-07 10:13:45,894][175731] Updated weights for policy 0, policy_version 22230 (0.0007) [2023-03-07 10:13:46,702][175731] Updated weights for policy 0, policy_version 22240 (0.0006) [2023-03-07 10:13:47,499][175731] Updated weights for policy 0, policy_version 22250 (0.0006) [2023-03-07 10:13:48,305][175731] Updated weights for policy 0, policy_version 22260 (0.0006) [2023-03-07 10:13:48,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12697.6, 300 sec: 12680.2). Total num frames: 22794240. Throughput: 0: 12694.9. Samples: 22780850. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:13:48,322][175405] Avg episode reward: [(0, '91.660')] [2023-03-07 10:13:48,326][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000022260_22794240.pth... 
[2023-03-07 10:13:48,359][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000019288_19750912.pth [2023-03-07 10:13:49,116][175731] Updated weights for policy 0, policy_version 22270 (0.0007) [2023-03-07 10:13:49,917][175731] Updated weights for policy 0, policy_version 22280 (0.0006) [2023-03-07 10:13:50,729][175731] Updated weights for policy 0, policy_version 22290 (0.0006) [2023-03-07 10:13:51,521][175731] Updated weights for policy 0, policy_version 22300 (0.0006) [2023-03-07 10:13:52,329][175731] Updated weights for policy 0, policy_version 22310 (0.0006) [2023-03-07 10:13:53,138][175731] Updated weights for policy 0, policy_version 22320 (0.0006) [2023-03-07 10:13:53,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12697.6, 300 sec: 12683.7). Total num frames: 22857728. Throughput: 0: 12691.2. Samples: 22856935. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:13:53,322][175405] Avg episode reward: [(0, '82.974')] [2023-03-07 10:13:53,958][175731] Updated weights for policy 0, policy_version 22330 (0.0006) [2023-03-07 10:13:54,761][175731] Updated weights for policy 0, policy_version 22340 (0.0006) [2023-03-07 10:13:55,579][175731] Updated weights for policy 0, policy_version 22350 (0.0007) [2023-03-07 10:13:56,394][175731] Updated weights for policy 0, policy_version 22360 (0.0006) [2023-03-07 10:13:57,182][175731] Updated weights for policy 0, policy_version 22370 (0.0007) [2023-03-07 10:13:57,999][175731] Updated weights for policy 0, policy_version 22380 (0.0006) [2023-03-07 10:13:58,321][175405] Fps is (10 sec: 12595.3, 60 sec: 12680.6, 300 sec: 12676.8). Total num frames: 22920192. Throughput: 0: 12687.3. Samples: 22894741. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:13:58,321][175405] Avg episode reward: [(0, '87.929')] [2023-03-07 10:13:58,836][175731] Updated weights for policy 0, policy_version 22390 (0.0007) [2023-03-07 10:13:59,633][175731] Updated weights for policy 0, policy_version 22400 (0.0006) [2023-03-07 10:14:00,432][175731] Updated weights for policy 0, policy_version 22410 (0.0006) [2023-03-07 10:14:01,238][175731] Updated weights for policy 0, policy_version 22420 (0.0006) [2023-03-07 10:14:02,059][175731] Updated weights for policy 0, policy_version 22430 (0.0006) [2023-03-07 10:14:02,854][175731] Updated weights for policy 0, policy_version 22440 (0.0006) [2023-03-07 10:14:03,321][175405] Fps is (10 sec: 12595.3, 60 sec: 12680.5, 300 sec: 12680.2). Total num frames: 22983680. Throughput: 0: 12684.1. Samples: 22970761. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:14:03,321][175405] Avg episode reward: [(0, '129.417')] [2023-03-07 10:14:03,664][175731] Updated weights for policy 0, policy_version 22450 (0.0006) [2023-03-07 10:14:04,452][175731] Updated weights for policy 0, policy_version 22460 (0.0007) [2023-03-07 10:14:05,279][175731] Updated weights for policy 0, policy_version 22470 (0.0006) [2023-03-07 10:14:06,087][175731] Updated weights for policy 0, policy_version 22480 (0.0006) [2023-03-07 10:14:06,871][175731] Updated weights for policy 0, policy_version 22490 (0.0006) [2023-03-07 10:14:07,678][175731] Updated weights for policy 0, policy_version 22500 (0.0006) [2023-03-07 10:14:08,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12680.5, 300 sec: 12680.2). Total num frames: 23047168. Throughput: 0: 12685.3. Samples: 23047186. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:14:08,322][175405] Avg episode reward: [(0, '100.105')] [2023-03-07 10:14:08,493][175731] Updated weights for policy 0, policy_version 22510 (0.0007) [2023-03-07 10:14:09,290][175731] Updated weights for policy 0, policy_version 22520 (0.0006) [2023-03-07 10:14:10,087][175731] Updated weights for policy 0, policy_version 22530 (0.0006) [2023-03-07 10:14:10,896][175731] Updated weights for policy 0, policy_version 22540 (0.0007) [2023-03-07 10:14:11,714][175731] Updated weights for policy 0, policy_version 22550 (0.0007) [2023-03-07 10:14:12,510][175731] Updated weights for policy 0, policy_version 22560 (0.0007) [2023-03-07 10:14:13,308][175731] Updated weights for policy 0, policy_version 22570 (0.0006) [2023-03-07 10:14:13,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12697.6, 300 sec: 12683.7). Total num frames: 23111680. Throughput: 0: 12689.1. Samples: 23085227. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:14:13,322][175405] Avg episode reward: [(0, '95.558')] [2023-03-07 10:14:14,102][175731] Updated weights for policy 0, policy_version 22580 (0.0007) [2023-03-07 10:14:14,924][175731] Updated weights for policy 0, policy_version 22590 (0.0006) [2023-03-07 10:14:15,719][175731] Updated weights for policy 0, policy_version 22600 (0.0006) [2023-03-07 10:14:16,545][175731] Updated weights for policy 0, policy_version 22610 (0.0006) [2023-03-07 10:14:17,357][175731] Updated weights for policy 0, policy_version 22620 (0.0006) [2023-03-07 10:14:18,165][175731] Updated weights for policy 0, policy_version 22630 (0.0006) [2023-03-07 10:14:18,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12680.2). Total num frames: 23174144. Throughput: 0: 12696.9. Samples: 23161562. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:14:18,322][175405] Avg episode reward: [(0, '87.109')] [2023-03-07 10:14:18,979][175731] Updated weights for policy 0, policy_version 22640 (0.0006) [2023-03-07 10:14:19,784][175731] Updated weights for policy 0, policy_version 22650 (0.0006) [2023-03-07 10:14:20,606][175731] Updated weights for policy 0, policy_version 22660 (0.0006) [2023-03-07 10:14:21,402][175731] Updated weights for policy 0, policy_version 22670 (0.0007) [2023-03-07 10:14:22,194][175731] Updated weights for policy 0, policy_version 22680 (0.0008) [2023-03-07 10:14:23,016][175731] Updated weights for policy 0, policy_version 22690 (0.0006) [2023-03-07 10:14:23,321][175405] Fps is (10 sec: 12595.3, 60 sec: 12680.5, 300 sec: 12680.2). Total num frames: 23237632. Throughput: 0: 12702.1. Samples: 23237685. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:14:23,321][175405] Avg episode reward: [(0, '89.342')] [2023-03-07 10:14:23,803][175731] Updated weights for policy 0, policy_version 22700 (0.0006) [2023-03-07 10:14:24,610][175731] Updated weights for policy 0, policy_version 22710 (0.0007) [2023-03-07 10:14:25,404][175731] Updated weights for policy 0, policy_version 22720 (0.0006) [2023-03-07 10:14:26,218][175731] Updated weights for policy 0, policy_version 22730 (0.0006) [2023-03-07 10:14:27,027][175731] Updated weights for policy 0, policy_version 22740 (0.0007) [2023-03-07 10:14:27,834][175731] Updated weights for policy 0, policy_version 22750 (0.0006) [2023-03-07 10:14:28,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12683.7). Total num frames: 23301120. Throughput: 0: 12704.7. Samples: 23275833. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:14:28,322][175405] Avg episode reward: [(0, '92.582')] [2023-03-07 10:14:28,650][175731] Updated weights for policy 0, policy_version 22760 (0.0006) [2023-03-07 10:14:29,456][175731] Updated weights for policy 0, policy_version 22770 (0.0007) [2023-03-07 10:14:30,266][175731] Updated weights for policy 0, policy_version 22780 (0.0007) [2023-03-07 10:14:31,076][175731] Updated weights for policy 0, policy_version 22790 (0.0006) [2023-03-07 10:14:31,880][175731] Updated weights for policy 0, policy_version 22800 (0.0007) [2023-03-07 10:14:32,681][175731] Updated weights for policy 0, policy_version 22810 (0.0007) [2023-03-07 10:14:33,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12683.7). Total num frames: 23364608. Throughput: 0: 12692.8. Samples: 23352025. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:14:33,322][175405] Avg episode reward: [(0, '97.452')] [2023-03-07 10:14:33,514][175731] Updated weights for policy 0, policy_version 22820 (0.0006) [2023-03-07 10:14:34,312][175731] Updated weights for policy 0, policy_version 22830 (0.0007) [2023-03-07 10:14:35,120][175731] Updated weights for policy 0, policy_version 22840 (0.0006) [2023-03-07 10:14:35,932][175731] Updated weights for policy 0, policy_version 22850 (0.0006) [2023-03-07 10:14:36,717][175731] Updated weights for policy 0, policy_version 22860 (0.0006) [2023-03-07 10:14:37,528][175731] Updated weights for policy 0, policy_version 22870 (0.0006) [2023-03-07 10:14:38,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12683.7). Total num frames: 23428096. Throughput: 0: 12687.7. Samples: 23427881. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:14:38,322][175405] Avg episode reward: [(0, '93.055')] [2023-03-07 10:14:38,353][175731] Updated weights for policy 0, policy_version 22880 (0.0006) [2023-03-07 10:14:39,147][175731] Updated weights for policy 0, policy_version 22890 (0.0007) [2023-03-07 10:14:39,957][175731] Updated weights for policy 0, policy_version 22900 (0.0007) [2023-03-07 10:14:40,759][175731] Updated weights for policy 0, policy_version 22910 (0.0006) [2023-03-07 10:14:41,577][175731] Updated weights for policy 0, policy_version 22920 (0.0006) [2023-03-07 10:14:42,377][175731] Updated weights for policy 0, policy_version 22930 (0.0006) [2023-03-07 10:14:43,174][175731] Updated weights for policy 0, policy_version 22940 (0.0007) [2023-03-07 10:14:43,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12683.7). Total num frames: 23491584. Throughput: 0: 12695.5. Samples: 23466040. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:14:43,322][175405] Avg episode reward: [(0, '98.640')] [2023-03-07 10:14:43,978][175731] Updated weights for policy 0, policy_version 22950 (0.0006) [2023-03-07 10:14:44,789][175731] Updated weights for policy 0, policy_version 22960 (0.0006) [2023-03-07 10:14:45,597][175731] Updated weights for policy 0, policy_version 22970 (0.0006) [2023-03-07 10:14:46,406][175731] Updated weights for policy 0, policy_version 22980 (0.0006) [2023-03-07 10:14:47,238][175731] Updated weights for policy 0, policy_version 22990 (0.0006) [2023-03-07 10:14:48,029][175731] Updated weights for policy 0, policy_version 23000 (0.0006) [2023-03-07 10:14:48,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12683.7). Total num frames: 23555072. Throughput: 0: 12692.0. Samples: 23541903. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:14:48,322][175405] Avg episode reward: [(0, '89.286')] [2023-03-07 10:14:48,838][175731] Updated weights for policy 0, policy_version 23010 (0.0006) [2023-03-07 10:14:49,658][175731] Updated weights for policy 0, policy_version 23020 (0.0006) [2023-03-07 10:14:50,470][175731] Updated weights for policy 0, policy_version 23030 (0.0006) [2023-03-07 10:14:51,273][175731] Updated weights for policy 0, policy_version 23040 (0.0007) [2023-03-07 10:14:52,076][175731] Updated weights for policy 0, policy_version 23050 (0.0006) [2023-03-07 10:14:52,869][175731] Updated weights for policy 0, policy_version 23060 (0.0006) [2023-03-07 10:14:53,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12680.5, 300 sec: 12683.7). Total num frames: 23618560. Throughput: 0: 12688.9. Samples: 23618186. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:14:53,322][175405] Avg episode reward: [(0, '88.985')] [2023-03-07 10:14:53,682][175731] Updated weights for policy 0, policy_version 23070 (0.0006) [2023-03-07 10:14:54,496][175731] Updated weights for policy 0, policy_version 23080 (0.0007) [2023-03-07 10:14:55,288][175731] Updated weights for policy 0, policy_version 23090 (0.0005) [2023-03-07 10:14:56,114][175731] Updated weights for policy 0, policy_version 23100 (0.0007) [2023-03-07 10:14:56,908][175731] Updated weights for policy 0, policy_version 23110 (0.0006) [2023-03-07 10:14:57,709][175731] Updated weights for policy 0, policy_version 23120 (0.0007) [2023-03-07 10:14:58,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12683.7). Total num frames: 23682048. Throughput: 0: 12685.8. Samples: 23656090. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:14:58,322][175405] Avg episode reward: [(0, '99.877')] [2023-03-07 10:14:58,551][175731] Updated weights for policy 0, policy_version 23130 (0.0006) [2023-03-07 10:14:59,356][175731] Updated weights for policy 0, policy_version 23140 (0.0006) [2023-03-07 10:15:00,137][175731] Updated weights for policy 0, policy_version 23150 (0.0006) [2023-03-07 10:15:00,966][175731] Updated weights for policy 0, policy_version 23160 (0.0007) [2023-03-07 10:15:01,772][175731] Updated weights for policy 0, policy_version 23170 (0.0008) [2023-03-07 10:15:02,570][175731] Updated weights for policy 0, policy_version 23180 (0.0006) [2023-03-07 10:15:03,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12683.7). Total num frames: 23745536. Throughput: 0: 12678.5. Samples: 23732094. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:15:03,322][175405] Avg episode reward: [(0, '96.185')] [2023-03-07 10:15:03,399][175731] Updated weights for policy 0, policy_version 23190 (0.0006) [2023-03-07 10:15:04,210][175731] Updated weights for policy 0, policy_version 23200 (0.0007) [2023-03-07 10:15:05,021][175731] Updated weights for policy 0, policy_version 23210 (0.0006) [2023-03-07 10:15:05,821][175731] Updated weights for policy 0, policy_version 23220 (0.0007) [2023-03-07 10:15:06,650][175731] Updated weights for policy 0, policy_version 23230 (0.0007) [2023-03-07 10:15:07,442][175731] Updated weights for policy 0, policy_version 23240 (0.0006) [2023-03-07 10:15:08,243][175731] Updated weights for policy 0, policy_version 23250 (0.0006) [2023-03-07 10:15:08,321][175405] Fps is (10 sec: 12595.4, 60 sec: 12680.6, 300 sec: 12680.2). Total num frames: 23808000. Throughput: 0: 12671.0. Samples: 23807882. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:15:08,321][175405] Avg episode reward: [(0, '95.656')] [2023-03-07 10:15:09,068][175731] Updated weights for policy 0, policy_version 23260 (0.0006) [2023-03-07 10:15:09,861][175731] Updated weights for policy 0, policy_version 23270 (0.0006) [2023-03-07 10:15:10,662][175731] Updated weights for policy 0, policy_version 23280 (0.0006) [2023-03-07 10:15:11,459][175731] Updated weights for policy 0, policy_version 23290 (0.0006) [2023-03-07 10:15:12,268][175731] Updated weights for policy 0, policy_version 23300 (0.0007) [2023-03-07 10:15:13,086][175731] Updated weights for policy 0, policy_version 23310 (0.0006) [2023-03-07 10:15:13,321][175405] Fps is (10 sec: 12595.2, 60 sec: 12663.5, 300 sec: 12683.7). Total num frames: 23871488. Throughput: 0: 12673.7. Samples: 23846151. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:15:13,322][175405] Avg episode reward: [(0, '93.783')] [2023-03-07 10:15:13,887][175731] Updated weights for policy 0, policy_version 23320 (0.0006) [2023-03-07 10:15:14,677][175731] Updated weights for policy 0, policy_version 23330 (0.0006) [2023-03-07 10:15:15,499][175731] Updated weights for policy 0, policy_version 23340 (0.0006) [2023-03-07 10:15:16,309][175731] Updated weights for policy 0, policy_version 23350 (0.0006) [2023-03-07 10:15:17,113][175731] Updated weights for policy 0, policy_version 23360 (0.0007) [2023-03-07 10:15:17,913][175731] Updated weights for policy 0, policy_version 23370 (0.0007) [2023-03-07 10:15:18,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12697.6, 300 sec: 12687.2). Total num frames: 23936000. Throughput: 0: 12671.2. Samples: 23922229. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:15:18,321][175405] Avg episode reward: [(0, '101.134')] [2023-03-07 10:15:18,730][175731] Updated weights for policy 0, policy_version 23380 (0.0006) [2023-03-07 10:15:19,509][175731] Updated weights for policy 0, policy_version 23390 (0.0006) [2023-03-07 10:15:20,309][175731] Updated weights for policy 0, policy_version 23400 (0.0007) [2023-03-07 10:15:21,122][175731] Updated weights for policy 0, policy_version 23410 (0.0007) [2023-03-07 10:15:21,929][175731] Updated weights for policy 0, policy_version 23420 (0.0007) [2023-03-07 10:15:22,733][175731] Updated weights for policy 0, policy_version 23430 (0.0006) [2023-03-07 10:15:23,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12697.6, 300 sec: 12687.2). Total num frames: 23999488. Throughput: 0: 12685.9. Samples: 23998743. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:15:23,321][175405] Avg episode reward: [(0, '141.619')] [2023-03-07 10:15:23,542][175731] Updated weights for policy 0, policy_version 23440 (0.0005) [2023-03-07 10:15:24,347][175731] Updated weights for policy 0, policy_version 23450 (0.0006) [2023-03-07 10:15:25,157][175731] Updated weights for policy 0, policy_version 23460 (0.0007) [2023-03-07 10:15:25,973][175731] Updated weights for policy 0, policy_version 23470 (0.0007) [2023-03-07 10:15:26,793][175731] Updated weights for policy 0, policy_version 23480 (0.0006) [2023-03-07 10:15:27,591][175731] Updated weights for policy 0, policy_version 23490 (0.0006) [2023-03-07 10:15:28,321][175405] Fps is (10 sec: 12595.1, 60 sec: 12680.5, 300 sec: 12683.7). Total num frames: 24061952. Throughput: 0: 12681.2. Samples: 24036694. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:15:28,322][175405] Avg episode reward: [(0, '112.193')] [2023-03-07 10:15:28,407][175731] Updated weights for policy 0, policy_version 23500 (0.0006) [2023-03-07 10:15:29,212][175731] Updated weights for policy 0, policy_version 23510 (0.0006) [2023-03-07 10:15:30,018][175731] Updated weights for policy 0, policy_version 23520 (0.0006) [2023-03-07 10:15:30,827][175731] Updated weights for policy 0, policy_version 23530 (0.0006) [2023-03-07 10:15:31,631][175731] Updated weights for policy 0, policy_version 23540 (0.0007) [2023-03-07 10:15:32,439][175731] Updated weights for policy 0, policy_version 23550 (0.0006) [2023-03-07 10:15:33,256][175731] Updated weights for policy 0, policy_version 23560 (0.0006) [2023-03-07 10:15:33,321][175405] Fps is (10 sec: 12595.2, 60 sec: 12680.5, 300 sec: 12687.2). Total num frames: 24125440. Throughput: 0: 12683.5. Samples: 24112659. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 10:15:33,321][175405] Avg episode reward: [(0, '93.016')] [2023-03-07 10:15:34,068][175731] Updated weights for policy 0, policy_version 23570 (0.0007) [2023-03-07 10:15:34,878][175731] Updated weights for policy 0, policy_version 23580 (0.0007) [2023-03-07 10:15:35,682][175731] Updated weights for policy 0, policy_version 23590 (0.0007) [2023-03-07 10:15:36,493][175731] Updated weights for policy 0, policy_version 23600 (0.0007) [2023-03-07 10:15:37,288][175731] Updated weights for policy 0, policy_version 23610 (0.0007) [2023-03-07 10:15:38,105][175731] Updated weights for policy 0, policy_version 23620 (0.0006) [2023-03-07 10:15:38,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12687.2). Total num frames: 24188928. Throughput: 0: 12673.2. Samples: 24188479. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 10:15:38,321][175405] Avg episode reward: [(0, '90.561')] [2023-03-07 10:15:38,927][175731] Updated weights for policy 0, policy_version 23630 (0.0007) [2023-03-07 10:15:39,716][175731] Updated weights for policy 0, policy_version 23640 (0.0006) [2023-03-07 10:15:40,534][175731] Updated weights for policy 0, policy_version 23650 (0.0006) [2023-03-07 10:15:41,344][175731] Updated weights for policy 0, policy_version 23660 (0.0006) [2023-03-07 10:15:42,165][175731] Updated weights for policy 0, policy_version 23670 (0.0007) [2023-03-07 10:15:42,958][175731] Updated weights for policy 0, policy_version 23680 (0.0007) [2023-03-07 10:15:43,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12680.5, 300 sec: 12687.2). Total num frames: 24252416. Throughput: 0: 12679.3. Samples: 24226660. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 10:15:43,322][175405] Avg episode reward: [(0, '82.678')] [2023-03-07 10:15:43,758][175731] Updated weights for policy 0, policy_version 23690 (0.0006) [2023-03-07 10:15:44,563][175731] Updated weights for policy 0, policy_version 23700 (0.0006) [2023-03-07 10:15:45,361][175731] Updated weights for policy 0, policy_version 23710 (0.0006) [2023-03-07 10:15:46,170][175731] Updated weights for policy 0, policy_version 23720 (0.0006) [2023-03-07 10:15:46,978][175731] Updated weights for policy 0, policy_version 23730 (0.0007) [2023-03-07 10:15:47,759][175731] Updated weights for policy 0, policy_version 23740 (0.0006) [2023-03-07 10:15:48,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12680.6, 300 sec: 12687.2). Total num frames: 24315904. Throughput: 0: 12686.8. Samples: 24302999. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:15:48,321][175405] Avg episode reward: [(0, '93.015')] [2023-03-07 10:15:48,326][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000023746_24315904.pth... 
[2023-03-07 10:15:48,358][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000020773_21271552.pth [2023-03-07 10:15:48,579][175731] Updated weights for policy 0, policy_version 23750 (0.0007) [2023-03-07 10:15:49,387][175731] Updated weights for policy 0, policy_version 23760 (0.0006) [2023-03-07 10:15:50,187][175731] Updated weights for policy 0, policy_version 23770 (0.0007) [2023-03-07 10:15:51,014][175731] Updated weights for policy 0, policy_version 23780 (0.0006) [2023-03-07 10:15:51,809][175731] Updated weights for policy 0, policy_version 23790 (0.0006) [2023-03-07 10:15:52,623][175731] Updated weights for policy 0, policy_version 23800 (0.0006) [2023-03-07 10:15:53,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12680.5, 300 sec: 12687.2). Total num frames: 24379392. Throughput: 0: 12692.7. Samples: 24379054. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:15:53,322][175405] Avg episode reward: [(0, '96.338')] [2023-03-07 10:15:53,433][175731] Updated weights for policy 0, policy_version 23810 (0.0005) [2023-03-07 10:15:54,253][175731] Updated weights for policy 0, policy_version 23820 (0.0006) [2023-03-07 10:15:55,053][175731] Updated weights for policy 0, policy_version 23830 (0.0007) [2023-03-07 10:15:55,853][175731] Updated weights for policy 0, policy_version 23840 (0.0006) [2023-03-07 10:15:56,649][175731] Updated weights for policy 0, policy_version 23850 (0.0007) [2023-03-07 10:15:57,451][175731] Updated weights for policy 0, policy_version 23860 (0.0007) [2023-03-07 10:15:58,245][175731] Updated weights for policy 0, policy_version 23870 (0.0006) [2023-03-07 10:15:58,321][175405] Fps is (10 sec: 12697.4, 60 sec: 12680.5, 300 sec: 12687.2). Total num frames: 24442880. Throughput: 0: 12686.9. Samples: 24417062. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:15:58,322][175405] Avg episode reward: [(0, '116.520')] [2023-03-07 10:15:59,067][175731] Updated weights for policy 0, policy_version 23880 (0.0007) [2023-03-07 10:15:59,872][175731] Updated weights for policy 0, policy_version 23890 (0.0006) [2023-03-07 10:16:00,704][175731] Updated weights for policy 0, policy_version 23900 (0.0007) [2023-03-07 10:16:01,517][175731] Updated weights for policy 0, policy_version 23910 (0.0006) [2023-03-07 10:16:02,303][175731] Updated weights for policy 0, policy_version 23920 (0.0006) [2023-03-07 10:16:03,105][175731] Updated weights for policy 0, policy_version 23930 (0.0006) [2023-03-07 10:16:03,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12687.2). Total num frames: 24506368. Throughput: 0: 12687.8. Samples: 24493180. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:16:03,321][175405] Avg episode reward: [(0, '104.401')] [2023-03-07 10:16:03,908][175731] Updated weights for policy 0, policy_version 23940 (0.0006) [2023-03-07 10:16:04,731][175731] Updated weights for policy 0, policy_version 23950 (0.0006) [2023-03-07 10:16:05,522][175731] Updated weights for policy 0, policy_version 23960 (0.0006) [2023-03-07 10:16:06,330][175731] Updated weights for policy 0, policy_version 23970 (0.0006) [2023-03-07 10:16:07,139][175731] Updated weights for policy 0, policy_version 23980 (0.0006) [2023-03-07 10:16:07,941][175731] Updated weights for policy 0, policy_version 23990 (0.0007) [2023-03-07 10:16:08,321][175405] Fps is (10 sec: 12697.8, 60 sec: 12697.6, 300 sec: 12683.7). Total num frames: 24569856. Throughput: 0: 12685.0. Samples: 24569569. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:16:08,321][175405] Avg episode reward: [(0, '103.903')] [2023-03-07 10:16:08,753][175731] Updated weights for policy 0, policy_version 24000 (0.0006) [2023-03-07 10:16:09,547][175731] Updated weights for policy 0, policy_version 24010 (0.0006) [2023-03-07 10:16:10,336][175731] Updated weights for policy 0, policy_version 24020 (0.0006) [2023-03-07 10:16:11,162][175731] Updated weights for policy 0, policy_version 24030 (0.0007) [2023-03-07 10:16:11,968][175731] Updated weights for policy 0, policy_version 24040 (0.0007) [2023-03-07 10:16:12,785][175731] Updated weights for policy 0, policy_version 24050 (0.0007) [2023-03-07 10:16:13,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12714.7, 300 sec: 12690.7). Total num frames: 24634368. Throughput: 0: 12694.5. Samples: 24607945. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:16:13,322][175405] Avg episode reward: [(0, '99.303')] [2023-03-07 10:16:13,595][175731] Updated weights for policy 0, policy_version 24060 (0.0006) [2023-03-07 10:16:14,404][175731] Updated weights for policy 0, policy_version 24070 (0.0006) [2023-03-07 10:16:15,207][175731] Updated weights for policy 0, policy_version 24080 (0.0006) [2023-03-07 10:16:16,017][175731] Updated weights for policy 0, policy_version 24090 (0.0006) [2023-03-07 10:16:16,823][175731] Updated weights for policy 0, policy_version 24100 (0.0006) [2023-03-07 10:16:17,629][175731] Updated weights for policy 0, policy_version 24110 (0.0007) [2023-03-07 10:16:18,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12687.2). Total num frames: 24696832. Throughput: 0: 12692.1. Samples: 24683802. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:16:18,321][175405] Avg episode reward: [(0, '91.065')] [2023-03-07 10:16:18,432][175731] Updated weights for policy 0, policy_version 24120 (0.0006) [2023-03-07 10:16:19,254][175731] Updated weights for policy 0, policy_version 24130 (0.0006) [2023-03-07 10:16:20,065][175731] Updated weights for policy 0, policy_version 24140 (0.0006) [2023-03-07 10:16:20,879][175731] Updated weights for policy 0, policy_version 24150 (0.0008) [2023-03-07 10:16:21,688][175731] Updated weights for policy 0, policy_version 24160 (0.0006) [2023-03-07 10:16:22,497][175731] Updated weights for policy 0, policy_version 24170 (0.0006) [2023-03-07 10:16:23,309][175731] Updated weights for policy 0, policy_version 24180 (0.0007) [2023-03-07 10:16:23,321][175405] Fps is (10 sec: 12595.2, 60 sec: 12680.5, 300 sec: 12687.2). Total num frames: 24760320. Throughput: 0: 12689.7. Samples: 24759514. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:16:23,321][175405] Avg episode reward: [(0, '89.892')] [2023-03-07 10:16:24,109][175731] Updated weights for policy 0, policy_version 24190 (0.0006) [2023-03-07 10:16:24,922][175731] Updated weights for policy 0, policy_version 24200 (0.0007) [2023-03-07 10:16:25,716][175731] Updated weights for policy 0, policy_version 24210 (0.0006) [2023-03-07 10:16:26,533][175731] Updated weights for policy 0, policy_version 24220 (0.0006) [2023-03-07 10:16:27,344][175731] Updated weights for policy 0, policy_version 24230 (0.0006) [2023-03-07 10:16:28,150][175731] Updated weights for policy 0, policy_version 24240 (0.0006) [2023-03-07 10:16:28,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12687.2). Total num frames: 24823808. Throughput: 0: 12689.7. Samples: 24797695. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:16:28,322][175405] Avg episode reward: [(0, '102.321')] [2023-03-07 10:16:28,959][175731] Updated weights for policy 0, policy_version 24250 (0.0006) [2023-03-07 10:16:29,764][175731] Updated weights for policy 0, policy_version 24260 (0.0006) [2023-03-07 10:16:30,572][175731] Updated weights for policy 0, policy_version 24270 (0.0006) [2023-03-07 10:16:31,388][175731] Updated weights for policy 0, policy_version 24280 (0.0006) [2023-03-07 10:16:32,196][175731] Updated weights for policy 0, policy_version 24290 (0.0007) [2023-03-07 10:16:32,997][175731] Updated weights for policy 0, policy_version 24300 (0.0007) [2023-03-07 10:16:33,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12687.2). Total num frames: 24887296. Throughput: 0: 12679.2. Samples: 24873562. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:16:33,322][175405] Avg episode reward: [(0, '89.407')] [2023-03-07 10:16:33,785][175731] Updated weights for policy 0, policy_version 24310 (0.0007) [2023-03-07 10:16:34,595][175731] Updated weights for policy 0, policy_version 24320 (0.0006) [2023-03-07 10:16:35,394][175731] Updated weights for policy 0, policy_version 24330 (0.0006) [2023-03-07 10:16:36,194][175731] Updated weights for policy 0, policy_version 24340 (0.0007) [2023-03-07 10:16:37,013][175731] Updated weights for policy 0, policy_version 24350 (0.0006) [2023-03-07 10:16:37,818][175731] Updated weights for policy 0, policy_version 24360 (0.0006) [2023-03-07 10:16:38,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12687.2). Total num frames: 24950784. Throughput: 0: 12687.3. Samples: 24949983. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:16:38,322][175405] Avg episode reward: [(0, '98.003')] [2023-03-07 10:16:38,630][175731] Updated weights for policy 0, policy_version 24370 (0.0006) [2023-03-07 10:16:39,434][175731] Updated weights for policy 0, policy_version 24380 (0.0006) [2023-03-07 10:16:40,229][175731] Updated weights for policy 0, policy_version 24390 (0.0007) [2023-03-07 10:16:41,034][175731] Updated weights for policy 0, policy_version 24400 (0.0007) [2023-03-07 10:16:41,836][175731] Updated weights for policy 0, policy_version 24410 (0.0006) [2023-03-07 10:16:42,651][175731] Updated weights for policy 0, policy_version 24420 (0.0007) [2023-03-07 10:16:43,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12687.2). Total num frames: 25014272. Throughput: 0: 12693.4. Samples: 24988262. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:16:43,322][175405] Avg episode reward: [(0, '108.343')] [2023-03-07 10:16:43,475][175731] Updated weights for policy 0, policy_version 24430 (0.0006) [2023-03-07 10:16:44,254][175731] Updated weights for policy 0, policy_version 24440 (0.0006) [2023-03-07 10:16:45,071][175731] Updated weights for policy 0, policy_version 24450 (0.0006) [2023-03-07 10:16:45,885][175731] Updated weights for policy 0, policy_version 24460 (0.0006) [2023-03-07 10:16:46,678][175731] Updated weights for policy 0, policy_version 24470 (0.0007) [2023-03-07 10:16:47,485][175731] Updated weights for policy 0, policy_version 24480 (0.0006) [2023-03-07 10:16:48,285][175731] Updated weights for policy 0, policy_version 24490 (0.0006) [2023-03-07 10:16:48,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12687.2). Total num frames: 25077760. Throughput: 0: 12693.0. Samples: 25064366. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:16:48,322][175405] Avg episode reward: [(0, '91.183')] [2023-03-07 10:16:49,094][175731] Updated weights for policy 0, policy_version 24500 (0.0006) [2023-03-07 10:16:49,908][175731] Updated weights for policy 0, policy_version 24510 (0.0006) [2023-03-07 10:16:50,726][175731] Updated weights for policy 0, policy_version 24520 (0.0006) [2023-03-07 10:16:51,523][175731] Updated weights for policy 0, policy_version 24530 (0.0007) [2023-03-07 10:16:52,326][175731] Updated weights for policy 0, policy_version 24540 (0.0006) [2023-03-07 10:16:53,134][175731] Updated weights for policy 0, policy_version 24550 (0.0006) [2023-03-07 10:16:53,321][175405] Fps is (10 sec: 12697.4, 60 sec: 12697.6, 300 sec: 12687.2). Total num frames: 25141248. Throughput: 0: 12687.0. Samples: 25140487. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:16:53,322][175405] Avg episode reward: [(0, '105.156')] [2023-03-07 10:16:53,949][175731] Updated weights for policy 0, policy_version 24560 (0.0007) [2023-03-07 10:16:54,748][175731] Updated weights for policy 0, policy_version 24570 (0.0006) [2023-03-07 10:16:55,545][175731] Updated weights for policy 0, policy_version 24580 (0.0007) [2023-03-07 10:16:56,368][175731] Updated weights for policy 0, policy_version 24590 (0.0007) [2023-03-07 10:16:57,164][175731] Updated weights for policy 0, policy_version 24600 (0.0006) [2023-03-07 10:16:57,963][175731] Updated weights for policy 0, policy_version 24610 (0.0007) [2023-03-07 10:16:58,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12687.2). Total num frames: 25204736. Throughput: 0: 12681.8. Samples: 25178624. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:16:58,322][175405] Avg episode reward: [(0, '96.773')] [2023-03-07 10:16:58,761][175731] Updated weights for policy 0, policy_version 24620 (0.0006) [2023-03-07 10:16:59,575][175731] Updated weights for policy 0, policy_version 24630 (0.0007) [2023-03-07 10:17:00,382][175731] Updated weights for policy 0, policy_version 24640 (0.0006) [2023-03-07 10:17:01,210][175731] Updated weights for policy 0, policy_version 24650 (0.0007) [2023-03-07 10:17:02,019][175731] Updated weights for policy 0, policy_version 24660 (0.0007) [2023-03-07 10:17:02,844][175731] Updated weights for policy 0, policy_version 24670 (0.0007) [2023-03-07 10:17:03,321][175405] Fps is (10 sec: 12595.3, 60 sec: 12680.5, 300 sec: 12683.7). Total num frames: 25267200. Throughput: 0: 12686.4. Samples: 25254691. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:17:03,321][175405] Avg episode reward: [(0, '95.923')] [2023-03-07 10:17:03,641][175731] Updated weights for policy 0, policy_version 24680 (0.0006) [2023-03-07 10:17:04,455][175731] Updated weights for policy 0, policy_version 24690 (0.0007) [2023-03-07 10:17:05,257][175731] Updated weights for policy 0, policy_version 24700 (0.0006) [2023-03-07 10:17:06,062][175731] Updated weights for policy 0, policy_version 24710 (0.0007) [2023-03-07 10:17:06,873][175731] Updated weights for policy 0, policy_version 24720 (0.0006) [2023-03-07 10:17:07,673][175731] Updated weights for policy 0, policy_version 24730 (0.0006) [2023-03-07 10:17:08,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12687.2). Total num frames: 25331712. Throughput: 0: 12697.7. Samples: 25330911. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:17:08,322][175405] Avg episode reward: [(0, '97.463')] [2023-03-07 10:17:08,476][175731] Updated weights for policy 0, policy_version 24740 (0.0007) [2023-03-07 10:17:09,286][175731] Updated weights for policy 0, policy_version 24750 (0.0007) [2023-03-07 10:17:10,082][175731] Updated weights for policy 0, policy_version 24760 (0.0007) [2023-03-07 10:17:10,882][175731] Updated weights for policy 0, policy_version 24770 (0.0006) [2023-03-07 10:17:11,685][175731] Updated weights for policy 0, policy_version 24780 (0.0007) [2023-03-07 10:17:12,501][175731] Updated weights for policy 0, policy_version 24790 (0.0006) [2023-03-07 10:17:13,300][175731] Updated weights for policy 0, policy_version 24800 (0.0006) [2023-03-07 10:17:13,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12680.5, 300 sec: 12687.2). Total num frames: 25395200. Throughput: 0: 12693.6. Samples: 25368905. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:17:13,322][175405] Avg episode reward: [(0, '93.016')] [2023-03-07 10:17:14,118][175731] Updated weights for policy 0, policy_version 24810 (0.0006) [2023-03-07 10:17:14,939][175731] Updated weights for policy 0, policy_version 24820 (0.0007) [2023-03-07 10:17:15,749][175731] Updated weights for policy 0, policy_version 24830 (0.0006) [2023-03-07 10:17:16,531][175731] Updated weights for policy 0, policy_version 24840 (0.0006) [2023-03-07 10:17:17,336][175731] Updated weights for policy 0, policy_version 24850 (0.0006) [2023-03-07 10:17:18,149][175731] Updated weights for policy 0, policy_version 24860 (0.0006) [2023-03-07 10:17:18,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12687.2). Total num frames: 25458688. Throughput: 0: 12700.1. Samples: 25445066. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:17:18,321][175405] Avg episode reward: [(0, '87.687')] [2023-03-07 10:17:18,961][175731] Updated weights for policy 0, policy_version 24870 (0.0006) [2023-03-07 10:17:19,784][175731] Updated weights for policy 0, policy_version 24880 (0.0007) [2023-03-07 10:17:20,575][175731] Updated weights for policy 0, policy_version 24890 (0.0006) [2023-03-07 10:17:21,381][175731] Updated weights for policy 0, policy_version 24900 (0.0007) [2023-03-07 10:17:22,198][175731] Updated weights for policy 0, policy_version 24910 (0.0006) [2023-03-07 10:17:22,990][175731] Updated weights for policy 0, policy_version 24920 (0.0006) [2023-03-07 10:17:23,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12687.2). Total num frames: 25522176. Throughput: 0: 12692.6. Samples: 25521148. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:17:23,321][175405] Avg episode reward: [(0, '88.952')] [2023-03-07 10:17:23,815][175731] Updated weights for policy 0, policy_version 24930 (0.0007) [2023-03-07 10:17:24,609][175731] Updated weights for policy 0, policy_version 24940 (0.0006) [2023-03-07 10:17:25,418][175731] Updated weights for policy 0, policy_version 24950 (0.0006) [2023-03-07 10:17:26,226][175731] Updated weights for policy 0, policy_version 24960 (0.0006) [2023-03-07 10:17:27,049][175731] Updated weights for policy 0, policy_version 24970 (0.0007) [2023-03-07 10:17:27,849][175731] Updated weights for policy 0, policy_version 24980 (0.0006) [2023-03-07 10:17:28,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12687.2). Total num frames: 25585664. Throughput: 0: 12686.6. Samples: 25559158. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:17:28,321][175405] Avg episode reward: [(0, '94.052')] [2023-03-07 10:17:28,657][175731] Updated weights for policy 0, policy_version 24990 (0.0007) [2023-03-07 10:17:29,461][175731] Updated weights for policy 0, policy_version 25000 (0.0006) [2023-03-07 10:17:30,257][175731] Updated weights for policy 0, policy_version 25010 (0.0006) [2023-03-07 10:17:31,064][175731] Updated weights for policy 0, policy_version 25020 (0.0007) [2023-03-07 10:17:31,869][175731] Updated weights for policy 0, policy_version 25030 (0.0006) [2023-03-07 10:17:32,688][175731] Updated weights for policy 0, policy_version 25040 (0.0008) [2023-03-07 10:17:33,321][175405] Fps is (10 sec: 12595.0, 60 sec: 12680.5, 300 sec: 12683.7). Total num frames: 25648128. Throughput: 0: 12692.6. Samples: 25635535. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:17:33,322][175405] Avg episode reward: [(0, '85.337')] [2023-03-07 10:17:33,480][175731] Updated weights for policy 0, policy_version 25050 (0.0008) [2023-03-07 10:17:34,292][175731] Updated weights for policy 0, policy_version 25060 (0.0006) [2023-03-07 10:17:35,105][175731] Updated weights for policy 0, policy_version 25070 (0.0006) [2023-03-07 10:17:35,904][175731] Updated weights for policy 0, policy_version 25080 (0.0007) [2023-03-07 10:17:36,697][175731] Updated weights for policy 0, policy_version 25090 (0.0006) [2023-03-07 10:17:37,501][175731] Updated weights for policy 0, policy_version 25100 (0.0007) [2023-03-07 10:17:38,312][175731] Updated weights for policy 0, policy_version 25110 (0.0007) [2023-03-07 10:17:38,321][175405] Fps is (10 sec: 12697.4, 60 sec: 12697.6, 300 sec: 12690.7). Total num frames: 25712640. Throughput: 0: 12696.6. Samples: 25711835. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:17:38,322][175405] Avg episode reward: [(0, '101.747')] [2023-03-07 10:17:39,118][175731] Updated weights for policy 0, policy_version 25120 (0.0006) [2023-03-07 10:17:39,921][175731] Updated weights for policy 0, policy_version 25130 (0.0006) [2023-03-07 10:17:40,728][175731] Updated weights for policy 0, policy_version 25140 (0.0007) [2023-03-07 10:17:41,554][175731] Updated weights for policy 0, policy_version 25150 (0.0006) [2023-03-07 10:17:42,333][175731] Updated weights for policy 0, policy_version 25160 (0.0006) [2023-03-07 10:17:43,155][175731] Updated weights for policy 0, policy_version 25170 (0.0006) [2023-03-07 10:17:43,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12697.6, 300 sec: 12690.7). Total num frames: 25776128. Throughput: 0: 12692.6. Samples: 25749789. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:17:43,321][175405] Avg episode reward: [(0, '88.100')] [2023-03-07 10:17:43,958][175731] Updated weights for policy 0, policy_version 25180 (0.0006) [2023-03-07 10:17:44,785][175731] Updated weights for policy 0, policy_version 25190 (0.0006) [2023-03-07 10:17:45,577][175731] Updated weights for policy 0, policy_version 25200 (0.0006) [2023-03-07 10:17:46,369][175731] Updated weights for policy 0, policy_version 25210 (0.0006) [2023-03-07 10:17:47,193][175731] Updated weights for policy 0, policy_version 25220 (0.0006) [2023-03-07 10:17:47,984][175731] Updated weights for policy 0, policy_version 25230 (0.0006) [2023-03-07 10:17:48,321][175405] Fps is (10 sec: 12595.3, 60 sec: 12680.5, 300 sec: 12687.2). Total num frames: 25838592. Throughput: 0: 12694.0. Samples: 25825921. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:17:48,321][175405] Avg episode reward: [(0, '87.254')] [2023-03-07 10:17:48,324][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000025234_25839616.pth... 
[2023-03-07 10:17:48,356][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000022260_22794240.pth [2023-03-07 10:17:48,804][175731] Updated weights for policy 0, policy_version 25240 (0.0007) [2023-03-07 10:17:49,622][175731] Updated weights for policy 0, policy_version 25250 (0.0007) [2023-03-07 10:17:50,419][175731] Updated weights for policy 0, policy_version 25260 (0.0006) [2023-03-07 10:17:51,227][175731] Updated weights for policy 0, policy_version 25270 (0.0007) [2023-03-07 10:17:52,033][175731] Updated weights for policy 0, policy_version 25280 (0.0007) [2023-03-07 10:17:52,854][175731] Updated weights for policy 0, policy_version 25290 (0.0007) [2023-03-07 10:17:53,321][175405] Fps is (10 sec: 12595.2, 60 sec: 12680.6, 300 sec: 12687.2). Total num frames: 25902080. Throughput: 0: 12689.6. Samples: 25901941. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:17:53,322][175405] Avg episode reward: [(0, '87.989')] [2023-03-07 10:17:53,650][175731] Updated weights for policy 0, policy_version 25300 (0.0006) [2023-03-07 10:17:54,449][175731] Updated weights for policy 0, policy_version 25310 (0.0007) [2023-03-07 10:17:55,257][175731] Updated weights for policy 0, policy_version 25320 (0.0006) [2023-03-07 10:17:56,057][175731] Updated weights for policy 0, policy_version 25330 (0.0007) [2023-03-07 10:17:56,839][175731] Updated weights for policy 0, policy_version 25340 (0.0006) [2023-03-07 10:17:57,660][175731] Updated weights for policy 0, policy_version 25350 (0.0007) [2023-03-07 10:17:58,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12697.6, 300 sec: 12690.7). Total num frames: 25966592. Throughput: 0: 12700.6. Samples: 25940430. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:17:58,321][175405] Avg episode reward: [(0, '130.139')] [2023-03-07 10:17:58,475][175731] Updated weights for policy 0, policy_version 25360 (0.0007) [2023-03-07 10:17:59,275][175731] Updated weights for policy 0, policy_version 25370 (0.0006) [2023-03-07 10:18:00,081][175731] Updated weights for policy 0, policy_version 25380 (0.0006) [2023-03-07 10:18:00,884][175731] Updated weights for policy 0, policy_version 25390 (0.0007) [2023-03-07 10:18:01,692][175731] Updated weights for policy 0, policy_version 25400 (0.0006) [2023-03-07 10:18:02,494][175731] Updated weights for policy 0, policy_version 25410 (0.0007) [2023-03-07 10:18:03,281][175731] Updated weights for policy 0, policy_version 25420 (0.0006) [2023-03-07 10:18:03,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12714.7, 300 sec: 12690.7). Total num frames: 26030080. Throughput: 0: 12706.1. Samples: 26016843. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:18:03,322][175405] Avg episode reward: [(0, '81.792')] [2023-03-07 10:18:04,084][175731] Updated weights for policy 0, policy_version 25430 (0.0006) [2023-03-07 10:18:04,890][175731] Updated weights for policy 0, policy_version 25440 (0.0006) [2023-03-07 10:18:05,706][175731] Updated weights for policy 0, policy_version 25450 (0.0008) [2023-03-07 10:18:06,512][175731] Updated weights for policy 0, policy_version 25460 (0.0007) [2023-03-07 10:18:07,315][175731] Updated weights for policy 0, policy_version 25470 (0.0006) [2023-03-07 10:18:08,122][175731] Updated weights for policy 0, policy_version 25480 (0.0006) [2023-03-07 10:18:08,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12690.7). Total num frames: 26093568. Throughput: 0: 12705.9. Samples: 26092916. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:18:08,322][175405] Avg episode reward: [(0, '84.441')] [2023-03-07 10:18:08,936][175731] Updated weights for policy 0, policy_version 25490 (0.0006) [2023-03-07 10:18:09,749][175731] Updated weights for policy 0, policy_version 25500 (0.0006) [2023-03-07 10:18:10,554][175731] Updated weights for policy 0, policy_version 25510 (0.0006) [2023-03-07 10:18:11,366][175731] Updated weights for policy 0, policy_version 25520 (0.0006) [2023-03-07 10:18:12,172][175731] Updated weights for policy 0, policy_version 25530 (0.0007) [2023-03-07 10:18:12,983][175731] Updated weights for policy 0, policy_version 25540 (0.0006) [2023-03-07 10:18:13,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12690.7). Total num frames: 26157056. Throughput: 0: 12707.1. Samples: 26130978. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 10:18:13,322][175405] Avg episode reward: [(0, '88.660')] [2023-03-07 10:18:13,798][175731] Updated weights for policy 0, policy_version 25550 (0.0007) [2023-03-07 10:18:14,602][175731] Updated weights for policy 0, policy_version 25560 (0.0006) [2023-03-07 10:18:15,410][175731] Updated weights for policy 0, policy_version 25570 (0.0006) [2023-03-07 10:18:16,231][175731] Updated weights for policy 0, policy_version 25580 (0.0006) [2023-03-07 10:18:17,034][175731] Updated weights for policy 0, policy_version 25590 (0.0007) [2023-03-07 10:18:17,818][175731] Updated weights for policy 0, policy_version 25600 (0.0006) [2023-03-07 10:18:18,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12690.7). Total num frames: 26220544. Throughput: 0: 12698.5. Samples: 26206967. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 10:18:18,321][175405] Avg episode reward: [(0, '83.190')] [2023-03-07 10:18:18,641][175731] Updated weights for policy 0, policy_version 25610 (0.0006) [2023-03-07 10:18:19,431][175731] Updated weights for policy 0, policy_version 25620 (0.0006) [2023-03-07 10:18:20,235][175731] Updated weights for policy 0, policy_version 25630 (0.0007) [2023-03-07 10:18:21,052][175731] Updated weights for policy 0, policy_version 25640 (0.0007) [2023-03-07 10:18:21,868][175731] Updated weights for policy 0, policy_version 25650 (0.0006) [2023-03-07 10:18:22,668][175731] Updated weights for policy 0, policy_version 25660 (0.0006) [2023-03-07 10:18:23,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12690.7). Total num frames: 26284032. Throughput: 0: 12694.7. Samples: 26283097. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 10:18:23,322][175405] Avg episode reward: [(0, '91.965')] [2023-03-07 10:18:23,480][175731] Updated weights for policy 0, policy_version 25670 (0.0007) [2023-03-07 10:18:24,275][175731] Updated weights for policy 0, policy_version 25680 (0.0006) [2023-03-07 10:18:25,085][175731] Updated weights for policy 0, policy_version 25690 (0.0007) [2023-03-07 10:18:25,896][175731] Updated weights for policy 0, policy_version 25700 (0.0007) [2023-03-07 10:18:26,710][175731] Updated weights for policy 0, policy_version 25710 (0.0006) [2023-03-07 10:18:27,498][175731] Updated weights for policy 0, policy_version 25720 (0.0006) [2023-03-07 10:18:28,321][175731] Updated weights for policy 0, policy_version 25730 (0.0006) [2023-03-07 10:18:28,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12694.1). Total num frames: 26347520. Throughput: 0: 12698.6. Samples: 26321226. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 10:18:28,321][175405] Avg episode reward: [(0, '123.458')] [2023-03-07 10:18:29,129][175731] Updated weights for policy 0, policy_version 25740 (0.0006) [2023-03-07 10:18:29,926][175731] Updated weights for policy 0, policy_version 25750 (0.0005) [2023-03-07 10:18:30,740][175731] Updated weights for policy 0, policy_version 25760 (0.0008) [2023-03-07 10:18:31,531][175731] Updated weights for policy 0, policy_version 25770 (0.0006) [2023-03-07 10:18:32,325][175731] Updated weights for policy 0, policy_version 25780 (0.0006) [2023-03-07 10:18:33,130][175731] Updated weights for policy 0, policy_version 25790 (0.0006) [2023-03-07 10:18:33,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12694.1). Total num frames: 26411008. Throughput: 0: 12703.8. Samples: 26397593. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:18:33,322][175405] Avg episode reward: [(0, '83.290')] [2023-03-07 10:18:33,942][175731] Updated weights for policy 0, policy_version 25800 (0.0006) [2023-03-07 10:18:34,749][175731] Updated weights for policy 0, policy_version 25810 (0.0007) [2023-03-07 10:18:35,554][175731] Updated weights for policy 0, policy_version 25820 (0.0006) [2023-03-07 10:18:36,349][175731] Updated weights for policy 0, policy_version 25830 (0.0006) [2023-03-07 10:18:37,171][175731] Updated weights for policy 0, policy_version 25840 (0.0006) [2023-03-07 10:18:37,981][175731] Updated weights for policy 0, policy_version 25850 (0.0006) [2023-03-07 10:18:38,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12694.1). Total num frames: 26474496. Throughput: 0: 12710.1. Samples: 26473897. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:18:38,332][175405] Avg episode reward: [(0, '133.003')] [2023-03-07 10:18:38,765][175731] Updated weights for policy 0, policy_version 25860 (0.0006) [2023-03-07 10:18:39,569][175731] Updated weights for policy 0, policy_version 25870 (0.0006) [2023-03-07 10:18:40,361][175731] Updated weights for policy 0, policy_version 25880 (0.0006) [2023-03-07 10:18:41,175][175731] Updated weights for policy 0, policy_version 25890 (0.0006) [2023-03-07 10:18:41,985][175731] Updated weights for policy 0, policy_version 25900 (0.0006) [2023-03-07 10:18:42,795][175731] Updated weights for policy 0, policy_version 25910 (0.0006) [2023-03-07 10:18:43,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12690.7). Total num frames: 26537984. Throughput: 0: 12707.7. Samples: 26512277. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:18:43,322][175405] Avg episode reward: [(0, '93.796')] [2023-03-07 10:18:43,598][175731] Updated weights for policy 0, policy_version 25920 (0.0006) [2023-03-07 10:18:44,404][175731] Updated weights for policy 0, policy_version 25930 (0.0006) [2023-03-07 10:18:45,201][175731] Updated weights for policy 0, policy_version 25940 (0.0006) [2023-03-07 10:18:46,022][175731] Updated weights for policy 0, policy_version 25950 (0.0007) [2023-03-07 10:18:46,820][175731] Updated weights for policy 0, policy_version 25960 (0.0007) [2023-03-07 10:18:47,624][175731] Updated weights for policy 0, policy_version 25970 (0.0005) [2023-03-07 10:18:48,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12714.6, 300 sec: 12690.7). Total num frames: 26601472. Throughput: 0: 12697.8. Samples: 26588245. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:18:48,322][175405] Avg episode reward: [(0, '97.003')] [2023-03-07 10:18:48,427][175731] Updated weights for policy 0, policy_version 25980 (0.0007) [2023-03-07 10:18:49,235][175731] Updated weights for policy 0, policy_version 25990 (0.0008) [2023-03-07 10:18:50,050][175731] Updated weights for policy 0, policy_version 26000 (0.0006) [2023-03-07 10:18:50,854][175731] Updated weights for policy 0, policy_version 26010 (0.0007) [2023-03-07 10:18:51,667][175731] Updated weights for policy 0, policy_version 26020 (0.0006) [2023-03-07 10:18:52,470][175731] Updated weights for policy 0, policy_version 26030 (0.0007) [2023-03-07 10:18:53,289][175731] Updated weights for policy 0, policy_version 26040 (0.0007) [2023-03-07 10:18:53,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12694.1). Total num frames: 26664960. Throughput: 0: 12700.2. Samples: 26664422. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:18:53,321][175405] Avg episode reward: [(0, '93.599')] [2023-03-07 10:18:54,102][175731] Updated weights for policy 0, policy_version 26050 (0.0007) [2023-03-07 10:18:54,889][175731] Updated weights for policy 0, policy_version 26060 (0.0006) [2023-03-07 10:18:55,696][175731] Updated weights for policy 0, policy_version 26070 (0.0006) [2023-03-07 10:18:56,495][175731] Updated weights for policy 0, policy_version 26080 (0.0007) [2023-03-07 10:18:57,300][175731] Updated weights for policy 0, policy_version 26090 (0.0007) [2023-03-07 10:18:58,118][175731] Updated weights for policy 0, policy_version 26100 (0.0006) [2023-03-07 10:18:58,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12694.1). Total num frames: 26728448. Throughput: 0: 12704.9. Samples: 26702699. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:18:58,322][175405] Avg episode reward: [(0, '88.940')] [2023-03-07 10:18:58,942][175731] Updated weights for policy 0, policy_version 26110 (0.0006) [2023-03-07 10:18:59,720][175731] Updated weights for policy 0, policy_version 26120 (0.0006) [2023-03-07 10:19:00,543][175731] Updated weights for policy 0, policy_version 26130 (0.0007) [2023-03-07 10:19:01,353][175731] Updated weights for policy 0, policy_version 26140 (0.0005) [2023-03-07 10:19:02,154][175731] Updated weights for policy 0, policy_version 26150 (0.0007) [2023-03-07 10:19:02,960][175731] Updated weights for policy 0, policy_version 26160 (0.0006) [2023-03-07 10:19:03,321][175405] Fps is (10 sec: 12697.4, 60 sec: 12697.6, 300 sec: 12694.1). Total num frames: 26791936. Throughput: 0: 12705.1. Samples: 26778698. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:19:03,322][175405] Avg episode reward: [(0, '100.184')] [2023-03-07 10:19:03,765][175731] Updated weights for policy 0, policy_version 26170 (0.0007) [2023-03-07 10:19:04,592][175731] Updated weights for policy 0, policy_version 26180 (0.0006) [2023-03-07 10:19:05,386][175731] Updated weights for policy 0, policy_version 26190 (0.0006) [2023-03-07 10:19:06,196][175731] Updated weights for policy 0, policy_version 26200 (0.0007) [2023-03-07 10:19:07,015][175731] Updated weights for policy 0, policy_version 26210 (0.0007) [2023-03-07 10:19:07,801][175731] Updated weights for policy 0, policy_version 26220 (0.0007) [2023-03-07 10:19:08,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12690.7). Total num frames: 26855424. Throughput: 0: 12705.9. Samples: 26854861. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:19:08,321][175405] Avg episode reward: [(0, '88.666')] [2023-03-07 10:19:08,618][175731] Updated weights for policy 0, policy_version 26230 (0.0006) [2023-03-07 10:19:09,434][175731] Updated weights for policy 0, policy_version 26240 (0.0008) [2023-03-07 10:19:10,242][175731] Updated weights for policy 0, policy_version 26250 (0.0006) [2023-03-07 10:19:11,058][175731] Updated weights for policy 0, policy_version 26260 (0.0006) [2023-03-07 10:19:11,850][175731] Updated weights for policy 0, policy_version 26270 (0.0006) [2023-03-07 10:19:12,641][175731] Updated weights for policy 0, policy_version 26280 (0.0006) [2023-03-07 10:19:13,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12694.1). Total num frames: 26918912. Throughput: 0: 12699.2. Samples: 26892692. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:19:13,322][175405] Avg episode reward: [(0, '96.996')] [2023-03-07 10:19:13,458][175731] Updated weights for policy 0, policy_version 26290 (0.0009) [2023-03-07 10:19:14,251][175731] Updated weights for policy 0, policy_version 26300 (0.0007) [2023-03-07 10:19:15,053][175731] Updated weights for policy 0, policy_version 26310 (0.0006) [2023-03-07 10:19:15,871][175731] Updated weights for policy 0, policy_version 26320 (0.0006) [2023-03-07 10:19:16,674][175731] Updated weights for policy 0, policy_version 26330 (0.0006) [2023-03-07 10:19:17,474][175731] Updated weights for policy 0, policy_version 26340 (0.0006) [2023-03-07 10:19:18,275][175731] Updated weights for policy 0, policy_version 26350 (0.0007) [2023-03-07 10:19:18,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12694.1). Total num frames: 26982400. Throughput: 0: 12699.8. Samples: 26969086. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:19:18,321][175405] Avg episode reward: [(0, '101.648')] [2023-03-07 10:19:19,100][175731] Updated weights for policy 0, policy_version 26360 (0.0006) [2023-03-07 10:19:19,890][175731] Updated weights for policy 0, policy_version 26370 (0.0006) [2023-03-07 10:19:20,703][175731] Updated weights for policy 0, policy_version 26380 (0.0006) [2023-03-07 10:19:21,501][175731] Updated weights for policy 0, policy_version 26390 (0.0006) [2023-03-07 10:19:22,316][175731] Updated weights for policy 0, policy_version 26400 (0.0007) [2023-03-07 10:19:23,114][175731] Updated weights for policy 0, policy_version 26410 (0.0006) [2023-03-07 10:19:23,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12694.1). Total num frames: 27045888. Throughput: 0: 12698.6. Samples: 27045335. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:19:23,322][175405] Avg episode reward: [(0, '139.681')] [2023-03-07 10:19:23,918][175731] Updated weights for policy 0, policy_version 26420 (0.0006) [2023-03-07 10:19:24,716][175731] Updated weights for policy 0, policy_version 26430 (0.0005) [2023-03-07 10:19:25,522][175731] Updated weights for policy 0, policy_version 26440 (0.0006) [2023-03-07 10:19:26,335][175731] Updated weights for policy 0, policy_version 26450 (0.0007) [2023-03-07 10:19:27,142][175731] Updated weights for policy 0, policy_version 26460 (0.0006) [2023-03-07 10:19:27,940][175731] Updated weights for policy 0, policy_version 26470 (0.0007) [2023-03-07 10:19:28,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12694.1). Total num frames: 27109376. Throughput: 0: 12692.6. Samples: 27083443. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:19:28,322][175405] Avg episode reward: [(0, '103.214')] [2023-03-07 10:19:28,758][175731] Updated weights for policy 0, policy_version 26480 (0.0006) [2023-03-07 10:19:29,563][175731] Updated weights for policy 0, policy_version 26490 (0.0007) [2023-03-07 10:19:30,366][175731] Updated weights for policy 0, policy_version 26500 (0.0006) [2023-03-07 10:19:31,182][175731] Updated weights for policy 0, policy_version 26510 (0.0007) [2023-03-07 10:19:31,974][175731] Updated weights for policy 0, policy_version 26520 (0.0006) [2023-03-07 10:19:32,779][175731] Updated weights for policy 0, policy_version 26530 (0.0006) [2023-03-07 10:19:33,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12694.1). Total num frames: 27172864. Throughput: 0: 12702.5. Samples: 27159858. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:19:33,321][175405] Avg episode reward: [(0, '126.496')] [2023-03-07 10:19:33,604][175731] Updated weights for policy 0, policy_version 26540 (0.0005) [2023-03-07 10:19:34,398][175731] Updated weights for policy 0, policy_version 26550 (0.0006) [2023-03-07 10:19:35,194][175731] Updated weights for policy 0, policy_version 26560 (0.0006) [2023-03-07 10:19:36,010][175731] Updated weights for policy 0, policy_version 26570 (0.0006) [2023-03-07 10:19:36,810][175731] Updated weights for policy 0, policy_version 26580 (0.0006) [2023-03-07 10:19:37,614][175731] Updated weights for policy 0, policy_version 26590 (0.0006) [2023-03-07 10:19:38,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12694.1). Total num frames: 27236352. Throughput: 0: 12705.8. Samples: 27236182. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:19:38,332][175405] Avg episode reward: [(0, '86.608')] [2023-03-07 10:19:38,426][175731] Updated weights for policy 0, policy_version 26600 (0.0006) [2023-03-07 10:19:39,222][175731] Updated weights for policy 0, policy_version 26610 (0.0007) [2023-03-07 10:19:40,037][175731] Updated weights for policy 0, policy_version 26620 (0.0006) [2023-03-07 10:19:40,844][175731] Updated weights for policy 0, policy_version 26630 (0.0007) [2023-03-07 10:19:41,662][175731] Updated weights for policy 0, policy_version 26640 (0.0007) [2023-03-07 10:19:42,454][175731] Updated weights for policy 0, policy_version 26650 (0.0006) [2023-03-07 10:19:43,262][175731] Updated weights for policy 0, policy_version 26660 (0.0007) [2023-03-07 10:19:43,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12694.1). Total num frames: 27299840. Throughput: 0: 12701.1. Samples: 27274247. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:19:43,332][175405] Avg episode reward: [(0, '97.073')] [2023-03-07 10:19:44,072][175731] Updated weights for policy 0, policy_version 26670 (0.0006) [2023-03-07 10:19:44,862][175731] Updated weights for policy 0, policy_version 26680 (0.0006) [2023-03-07 10:19:45,665][175731] Updated weights for policy 0, policy_version 26690 (0.0007) [2023-03-07 10:19:46,470][175731] Updated weights for policy 0, policy_version 26700 (0.0006) [2023-03-07 10:19:47,250][175731] Updated weights for policy 0, policy_version 26710 (0.0005) [2023-03-07 10:19:48,059][175731] Updated weights for policy 0, policy_version 26720 (0.0006) [2023-03-07 10:19:48,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12714.7, 300 sec: 12697.6). Total num frames: 27364352. Throughput: 0: 12711.8. Samples: 27350726. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:19:48,332][175405] Avg episode reward: [(0, '101.528')] [2023-03-07 10:19:48,336][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000026723_27364352.pth... [2023-03-07 10:19:48,367][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000023746_24315904.pth [2023-03-07 10:19:48,856][175731] Updated weights for policy 0, policy_version 26730 (0.0006) [2023-03-07 10:19:49,669][175731] Updated weights for policy 0, policy_version 26740 (0.0006) [2023-03-07 10:19:50,479][175731] Updated weights for policy 0, policy_version 26750 (0.0006) [2023-03-07 10:19:51,287][175731] Updated weights for policy 0, policy_version 26760 (0.0006) [2023-03-07 10:19:52,102][175731] Updated weights for policy 0, policy_version 26770 (0.0007) [2023-03-07 10:19:52,902][175731] Updated weights for policy 0, policy_version 26780 (0.0006) [2023-03-07 10:19:53,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12714.7, 300 sec: 12697.6). Total num frames: 27427840. Throughput: 0: 12714.6. Samples: 27427016. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:19:53,332][175405] Avg episode reward: [(0, '96.374')] [2023-03-07 10:19:53,707][175731] Updated weights for policy 0, policy_version 26790 (0.0006) [2023-03-07 10:19:54,523][175731] Updated weights for policy 0, policy_version 26800 (0.0006) [2023-03-07 10:19:55,341][175731] Updated weights for policy 0, policy_version 26810 (0.0006) [2023-03-07 10:19:56,147][175731] Updated weights for policy 0, policy_version 26820 (0.0006) [2023-03-07 10:19:56,942][175731] Updated weights for policy 0, policy_version 26830 (0.0006) [2023-03-07 10:19:57,756][175731] Updated weights for policy 0, policy_version 26840 (0.0006) [2023-03-07 10:19:58,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12697.6). Total num frames: 27491328. Throughput: 0: 12717.9. 
Samples: 27464998. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:19:58,332][175405] Avg episode reward: [(0, '104.992')] [2023-03-07 10:19:58,554][175731] Updated weights for policy 0, policy_version 26850 (0.0007) [2023-03-07 10:19:59,354][175731] Updated weights for policy 0, policy_version 26860 (0.0006) [2023-03-07 10:20:00,168][175731] Updated weights for policy 0, policy_version 26870 (0.0006) [2023-03-07 10:20:00,974][175731] Updated weights for policy 0, policy_version 26880 (0.0006) [2023-03-07 10:20:01,777][175731] Updated weights for policy 0, policy_version 26890 (0.0007) [2023-03-07 10:20:02,575][175731] Updated weights for policy 0, policy_version 26900 (0.0006) [2023-03-07 10:20:03,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12701.1). Total num frames: 27554816. Throughput: 0: 12715.5. Samples: 27541283. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:20:03,332][175405] Avg episode reward: [(0, '97.562')] [2023-03-07 10:20:03,385][175731] Updated weights for policy 0, policy_version 26910 (0.0007) [2023-03-07 10:20:04,194][175731] Updated weights for policy 0, policy_version 26920 (0.0007) [2023-03-07 10:20:04,997][175731] Updated weights for policy 0, policy_version 26930 (0.0008) [2023-03-07 10:20:05,814][175731] Updated weights for policy 0, policy_version 26940 (0.0005) [2023-03-07 10:20:06,624][175731] Updated weights for policy 0, policy_version 26950 (0.0007) [2023-03-07 10:20:07,413][175731] Updated weights for policy 0, policy_version 26960 (0.0006) [2023-03-07 10:20:08,244][175731] Updated weights for policy 0, policy_version 26970 (0.0006) [2023-03-07 10:20:08,323][175405] Fps is (10 sec: 12592.1, 60 sec: 12697.1, 300 sec: 12697.5). Total num frames: 27617280. Throughput: 0: 12707.1. Samples: 27617184. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:20:08,335][175405] Avg episode reward: [(0, '110.388')] [2023-03-07 10:20:09,044][175731] Updated weights for policy 0, policy_version 26980 (0.0007) [2023-03-07 10:20:09,843][175731] Updated weights for policy 0, policy_version 26990 (0.0006) [2023-03-07 10:20:10,659][175731] Updated weights for policy 0, policy_version 27000 (0.0006) [2023-03-07 10:20:11,496][175731] Updated weights for policy 0, policy_version 27010 (0.0006) [2023-03-07 10:20:12,283][175731] Updated weights for policy 0, policy_version 27020 (0.0006) [2023-03-07 10:20:13,100][175731] Updated weights for policy 0, policy_version 27030 (0.0006) [2023-03-07 10:20:13,321][175405] Fps is (10 sec: 12595.3, 60 sec: 12697.6, 300 sec: 12694.1). Total num frames: 27680768. Throughput: 0: 12706.1. Samples: 27655218. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:20:13,322][175405] Avg episode reward: [(0, '113.573')] [2023-03-07 10:20:13,901][175731] Updated weights for policy 0, policy_version 27040 (0.0007) [2023-03-07 10:20:14,712][175731] Updated weights for policy 0, policy_version 27050 (0.0007) [2023-03-07 10:20:15,513][175731] Updated weights for policy 0, policy_version 27060 (0.0006) [2023-03-07 10:20:16,305][175731] Updated weights for policy 0, policy_version 27070 (0.0006) [2023-03-07 10:20:17,112][175731] Updated weights for policy 0, policy_version 27080 (0.0007) [2023-03-07 10:20:17,925][175731] Updated weights for policy 0, policy_version 27090 (0.0007) [2023-03-07 10:20:18,321][175405] Fps is (10 sec: 12700.7, 60 sec: 12697.6, 300 sec: 12694.1). Total num frames: 27744256. Throughput: 0: 12700.1. Samples: 27731365. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:20:18,322][175405] Avg episode reward: [(0, '105.986')] [2023-03-07 10:20:18,722][175731] Updated weights for policy 0, policy_version 27100 (0.0006) [2023-03-07 10:20:19,558][175731] Updated weights for policy 0, policy_version 27110 (0.0006) [2023-03-07 10:20:20,357][175731] Updated weights for policy 0, policy_version 27120 (0.0006) [2023-03-07 10:20:21,180][175731] Updated weights for policy 0, policy_version 27130 (0.0006) [2023-03-07 10:20:22,000][175731] Updated weights for policy 0, policy_version 27140 (0.0007) [2023-03-07 10:20:22,805][175731] Updated weights for policy 0, policy_version 27150 (0.0006) [2023-03-07 10:20:23,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12697.6). Total num frames: 27807744. Throughput: 0: 12691.5. Samples: 27807298. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:20:23,322][175405] Avg episode reward: [(0, '101.772')] [2023-03-07 10:20:23,617][175731] Updated weights for policy 0, policy_version 27160 (0.0006) [2023-03-07 10:20:24,416][175731] Updated weights for policy 0, policy_version 27170 (0.0006) [2023-03-07 10:20:25,215][175731] Updated weights for policy 0, policy_version 27180 (0.0006) [2023-03-07 10:20:26,041][175731] Updated weights for policy 0, policy_version 27190 (0.0006) [2023-03-07 10:20:26,843][175731] Updated weights for policy 0, policy_version 27200 (0.0006) [2023-03-07 10:20:27,655][175731] Updated weights for policy 0, policy_version 27210 (0.0006) [2023-03-07 10:20:28,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12697.6). Total num frames: 27871232. Throughput: 0: 12690.8. Samples: 27845335. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:20:28,322][175405] Avg episode reward: [(0, '100.223')] [2023-03-07 10:20:28,443][175731] Updated weights for policy 0, policy_version 27220 (0.0007) [2023-03-07 10:20:29,253][175731] Updated weights for policy 0, policy_version 27230 (0.0007) [2023-03-07 10:20:30,048][175731] Updated weights for policy 0, policy_version 27240 (0.0006) [2023-03-07 10:20:30,872][175731] Updated weights for policy 0, policy_version 27250 (0.0006) [2023-03-07 10:20:31,669][175731] Updated weights for policy 0, policy_version 27260 (0.0006) [2023-03-07 10:20:32,458][175731] Updated weights for policy 0, policy_version 27270 (0.0006) [2023-03-07 10:20:33,262][175731] Updated weights for policy 0, policy_version 27280 (0.0007) [2023-03-07 10:20:33,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12697.6). Total num frames: 27934720. Throughput: 0: 12686.3. Samples: 27921611. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 10:20:33,321][175405] Avg episode reward: [(0, '102.172')] [2023-03-07 10:20:34,083][175731] Updated weights for policy 0, policy_version 27290 (0.0006) [2023-03-07 10:20:34,889][175731] Updated weights for policy 0, policy_version 27300 (0.0006) [2023-03-07 10:20:35,676][175731] Updated weights for policy 0, policy_version 27310 (0.0007) [2023-03-07 10:20:36,491][175731] Updated weights for policy 0, policy_version 27320 (0.0006) [2023-03-07 10:20:37,291][175731] Updated weights for policy 0, policy_version 27330 (0.0006) [2023-03-07 10:20:38,103][175731] Updated weights for policy 0, policy_version 27340 (0.0007) [2023-03-07 10:20:38,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12697.6). Total num frames: 27998208. Throughput: 0: 12689.5. Samples: 27998041. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 10:20:38,321][175405] Avg episode reward: [(0, '99.230')] [2023-03-07 10:20:38,916][175731] Updated weights for policy 0, policy_version 27350 (0.0006) [2023-03-07 10:20:39,720][175731] Updated weights for policy 0, policy_version 27360 (0.0006) [2023-03-07 10:20:40,524][175731] Updated weights for policy 0, policy_version 27370 (0.0006) [2023-03-07 10:20:41,339][175731] Updated weights for policy 0, policy_version 27380 (0.0009) [2023-03-07 10:20:42,143][175731] Updated weights for policy 0, policy_version 27390 (0.0006) [2023-03-07 10:20:42,934][175731] Updated weights for policy 0, policy_version 27400 (0.0007) [2023-03-07 10:20:43,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12697.6). Total num frames: 28061696. Throughput: 0: 12689.5. Samples: 28036028. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 10:20:43,321][175405] Avg episode reward: [(0, '132.618')] [2023-03-07 10:20:43,742][175731] Updated weights for policy 0, policy_version 27410 (0.0006) [2023-03-07 10:20:44,538][175731] Updated weights for policy 0, policy_version 27420 (0.0006) [2023-03-07 10:20:45,332][175731] Updated weights for policy 0, policy_version 27430 (0.0007) [2023-03-07 10:20:46,137][175731] Updated weights for policy 0, policy_version 27440 (0.0006) [2023-03-07 10:20:46,949][175731] Updated weights for policy 0, policy_version 27450 (0.0006) [2023-03-07 10:20:47,757][175731] Updated weights for policy 0, policy_version 27460 (0.0007) [2023-03-07 10:20:48,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12697.6, 300 sec: 12701.1). Total num frames: 28126208. Throughput: 0: 12693.6. Samples: 28112497. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 10:20:48,322][175405] Avg episode reward: [(0, '153.616')] [2023-03-07 10:20:48,556][175731] Updated weights for policy 0, policy_version 27470 (0.0006) [2023-03-07 10:20:49,371][175731] Updated weights for policy 0, policy_version 27480 (0.0007) [2023-03-07 10:20:50,161][175731] Updated weights for policy 0, policy_version 27490 (0.0006) [2023-03-07 10:20:50,975][175731] Updated weights for policy 0, policy_version 27500 (0.0006) [2023-03-07 10:20:51,799][175731] Updated weights for policy 0, policy_version 27510 (0.0006) [2023-03-07 10:20:52,604][175731] Updated weights for policy 0, policy_version 27520 (0.0006) [2023-03-07 10:20:53,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12697.6, 300 sec: 12701.1). Total num frames: 28189696. Throughput: 0: 12704.0. Samples: 28188830. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:20:53,322][175405] Avg episode reward: [(0, '160.475')] [2023-03-07 10:20:53,384][175731] Updated weights for policy 0, policy_version 27530 (0.0006) [2023-03-07 10:20:54,202][175731] Updated weights for policy 0, policy_version 27540 (0.0008) [2023-03-07 10:20:55,006][175731] Updated weights for policy 0, policy_version 27550 (0.0006) [2023-03-07 10:20:55,814][175731] Updated weights for policy 0, policy_version 27560 (0.0006) [2023-03-07 10:20:56,608][175731] Updated weights for policy 0, policy_version 27570 (0.0007) [2023-03-07 10:20:57,435][175731] Updated weights for policy 0, policy_version 27580 (0.0006) [2023-03-07 10:20:58,229][175731] Updated weights for policy 0, policy_version 27590 (0.0006) [2023-03-07 10:20:58,321][175405] Fps is (10 sec: 12697.8, 60 sec: 12697.6, 300 sec: 12701.1). Total num frames: 28253184. Throughput: 0: 12703.1. Samples: 28226857. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:20:58,321][175405] Avg episode reward: [(0, '157.539')] [2023-03-07 10:20:59,036][175731] Updated weights for policy 0, policy_version 27600 (0.0006) [2023-03-07 10:20:59,853][175731] Updated weights for policy 0, policy_version 27610 (0.0006) [2023-03-07 10:21:00,647][175731] Updated weights for policy 0, policy_version 27620 (0.0007) [2023-03-07 10:21:01,463][175731] Updated weights for policy 0, policy_version 27630 (0.0006) [2023-03-07 10:21:02,269][175731] Updated weights for policy 0, policy_version 27640 (0.0006) [2023-03-07 10:21:03,078][175731] Updated weights for policy 0, policy_version 27650 (0.0006) [2023-03-07 10:21:03,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12701.1). Total num frames: 28316672. Throughput: 0: 12705.9. Samples: 28303131. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:21:03,332][175405] Avg episode reward: [(0, '176.291')] [2023-03-07 10:21:03,893][175731] Updated weights for policy 0, policy_version 27660 (0.0007) [2023-03-07 10:21:04,693][175731] Updated weights for policy 0, policy_version 27670 (0.0006) [2023-03-07 10:21:05,491][175731] Updated weights for policy 0, policy_version 27680 (0.0006) [2023-03-07 10:21:06,320][175731] Updated weights for policy 0, policy_version 27690 (0.0006) [2023-03-07 10:21:07,135][175731] Updated weights for policy 0, policy_version 27700 (0.0006) [2023-03-07 10:21:07,937][175731] Updated weights for policy 0, policy_version 27710 (0.0006) [2023-03-07 10:21:08,321][175405] Fps is (10 sec: 12595.1, 60 sec: 12698.1, 300 sec: 12694.1). Total num frames: 28379136. Throughput: 0: 12705.6. Samples: 28379053. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:21:08,332][175405] Avg episode reward: [(0, '115.452')] [2023-03-07 10:21:08,746][175731] Updated weights for policy 0, policy_version 27720 (0.0006) [2023-03-07 10:21:09,544][175731] Updated weights for policy 0, policy_version 27730 (0.0006) [2023-03-07 10:21:10,355][175731] Updated weights for policy 0, policy_version 27740 (0.0006) [2023-03-07 10:21:11,158][175731] Updated weights for policy 0, policy_version 27750 (0.0006) [2023-03-07 10:21:11,974][175731] Updated weights for policy 0, policy_version 27760 (0.0007) [2023-03-07 10:21:12,755][175731] Updated weights for policy 0, policy_version 27770 (0.0005) [2023-03-07 10:21:13,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12701.1). Total num frames: 28443648. Throughput: 0: 12706.3. Samples: 28417118. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:21:13,332][175405] Avg episode reward: [(0, '142.270')] [2023-03-07 10:21:13,558][175731] Updated weights for policy 0, policy_version 27780 (0.0008) [2023-03-07 10:21:14,389][175731] Updated weights for policy 0, policy_version 27790 (0.0007) [2023-03-07 10:21:15,186][175731] Updated weights for policy 0, policy_version 27800 (0.0006) [2023-03-07 10:21:15,986][175731] Updated weights for policy 0, policy_version 27810 (0.0007) [2023-03-07 10:21:16,803][175731] Updated weights for policy 0, policy_version 27820 (0.0007) [2023-03-07 10:21:17,596][175731] Updated weights for policy 0, policy_version 27830 (0.0006) [2023-03-07 10:21:18,321][175405] Fps is (10 sec: 12697.8, 60 sec: 12697.6, 300 sec: 12697.6). Total num frames: 28506112. Throughput: 0: 12705.7. Samples: 28493368. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:21:18,332][175405] Avg episode reward: [(0, '143.708')] [2023-03-07 10:21:18,408][175731] Updated weights for policy 0, policy_version 27840 (0.0007) [2023-03-07 10:21:19,197][175731] Updated weights for policy 0, policy_version 27850 (0.0006) [2023-03-07 10:21:19,996][175731] Updated weights for policy 0, policy_version 27860 (0.0006) [2023-03-07 10:21:20,813][175731] Updated weights for policy 0, policy_version 27870 (0.0006) [2023-03-07 10:21:21,619][175731] Updated weights for policy 0, policy_version 27880 (0.0006) [2023-03-07 10:21:22,423][175731] Updated weights for policy 0, policy_version 27890 (0.0006) [2023-03-07 10:21:23,215][175731] Updated weights for policy 0, policy_version 27900 (0.0006) [2023-03-07 10:21:23,321][175405] Fps is (10 sec: 12595.2, 60 sec: 12697.6, 300 sec: 12697.6). Total num frames: 28569600. Throughput: 0: 12705.7. Samples: 28569800. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:21:23,332][175405] Avg episode reward: [(0, '124.178')] [2023-03-07 10:21:24,045][175731] Updated weights for policy 0, policy_version 27910 (0.0008) [2023-03-07 10:21:24,844][175731] Updated weights for policy 0, policy_version 27920 (0.0007) [2023-03-07 10:21:25,658][175731] Updated weights for policy 0, policy_version 27930 (0.0007) [2023-03-07 10:21:26,481][175731] Updated weights for policy 0, policy_version 27940 (0.0006) [2023-03-07 10:21:27,275][175731] Updated weights for policy 0, policy_version 27950 (0.0006) [2023-03-07 10:21:28,060][175731] Updated weights for policy 0, policy_version 27960 (0.0007) [2023-03-07 10:21:28,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12714.7, 300 sec: 12701.1). Total num frames: 28634112. Throughput: 0: 12705.1. Samples: 28607760. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:21:28,332][175405] Avg episode reward: [(0, '119.119')] [2023-03-07 10:21:28,882][175731] Updated weights for policy 0, policy_version 27970 (0.0006) [2023-03-07 10:21:29,681][175731] Updated weights for policy 0, policy_version 27980 (0.0007) [2023-03-07 10:21:30,479][175731] Updated weights for policy 0, policy_version 27990 (0.0007) [2023-03-07 10:21:31,292][175731] Updated weights for policy 0, policy_version 28000 (0.0006) [2023-03-07 10:21:32,091][175731] Updated weights for policy 0, policy_version 28010 (0.0006) [2023-03-07 10:21:32,897][175731] Updated weights for policy 0, policy_version 28020 (0.0006) [2023-03-07 10:21:33,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12714.7, 300 sec: 12701.1). Total num frames: 28697600. Throughput: 0: 12703.0. Samples: 28684131. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:21:33,332][175405] Avg episode reward: [(0, '107.467')] [2023-03-07 10:21:33,693][175731] Updated weights for policy 0, policy_version 28030 (0.0006) [2023-03-07 10:21:34,503][175731] Updated weights for policy 0, policy_version 28040 (0.0006) [2023-03-07 10:21:35,300][175731] Updated weights for policy 0, policy_version 28050 (0.0006) [2023-03-07 10:21:36,115][175731] Updated weights for policy 0, policy_version 28060 (0.0006) [2023-03-07 10:21:36,911][175731] Updated weights for policy 0, policy_version 28070 (0.0007) [2023-03-07 10:21:37,704][175731] Updated weights for policy 0, policy_version 28080 (0.0006) [2023-03-07 10:21:38,321][175405] Fps is (10 sec: 12697.8, 60 sec: 12714.7, 300 sec: 12701.1). Total num frames: 28761088. Throughput: 0: 12708.6. Samples: 28760715. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:21:38,321][175405] Avg episode reward: [(0, '121.852')] [2023-03-07 10:21:38,505][175731] Updated weights for policy 0, policy_version 28090 (0.0006) [2023-03-07 10:21:39,308][175731] Updated weights for policy 0, policy_version 28100 (0.0008) [2023-03-07 10:21:40,109][175731] Updated weights for policy 0, policy_version 28110 (0.0008) [2023-03-07 10:21:40,929][175731] Updated weights for policy 0, policy_version 28120 (0.0006) [2023-03-07 10:21:41,725][175731] Updated weights for policy 0, policy_version 28130 (0.0006) [2023-03-07 10:21:42,538][175731] Updated weights for policy 0, policy_version 28140 (0.0006) [2023-03-07 10:21:43,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12701.1). Total num frames: 28824576. Throughput: 0: 12711.9. Samples: 28798893. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:21:43,321][175405] Avg episode reward: [(0, '126.642')] [2023-03-07 10:21:43,342][175731] Updated weights for policy 0, policy_version 28150 (0.0006) [2023-03-07 10:21:44,138][175731] Updated weights for policy 0, policy_version 28160 (0.0006) [2023-03-07 10:21:44,934][175731] Updated weights for policy 0, policy_version 28170 (0.0006) [2023-03-07 10:21:45,733][175731] Updated weights for policy 0, policy_version 28180 (0.0006) [2023-03-07 10:21:46,550][175731] Updated weights for policy 0, policy_version 28190 (0.0005) [2023-03-07 10:21:47,366][175731] Updated weights for policy 0, policy_version 28200 (0.0007) [2023-03-07 10:21:48,165][175731] Updated weights for policy 0, policy_version 28210 (0.0006) [2023-03-07 10:21:48,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12701.1). Total num frames: 28888064. Throughput: 0: 12715.4. Samples: 28875324. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:21:48,322][175405] Avg episode reward: [(0, '148.723')] [2023-03-07 10:21:48,325][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000028212_28889088.pth... [2023-03-07 10:21:48,354][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000025234_25839616.pth [2023-03-07 10:21:48,983][175731] Updated weights for policy 0, policy_version 28220 (0.0006) [2023-03-07 10:21:49,800][175731] Updated weights for policy 0, policy_version 28230 (0.0006) [2023-03-07 10:21:50,611][175731] Updated weights for policy 0, policy_version 28240 (0.0006) [2023-03-07 10:21:51,405][175731] Updated weights for policy 0, policy_version 28250 (0.0006) [2023-03-07 10:21:52,222][175731] Updated weights for policy 0, policy_version 28260 (0.0006) [2023-03-07 10:21:53,019][175731] Updated weights for policy 0, policy_version 28270 (0.0007) [2023-03-07 10:21:53,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12701.1). Total num frames: 28951552. Throughput: 0: 12715.7. Samples: 28951257. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:21:53,321][175405] Avg episode reward: [(0, '144.697')] [2023-03-07 10:21:53,837][175731] Updated weights for policy 0, policy_version 28280 (0.0007) [2023-03-07 10:21:54,641][175731] Updated weights for policy 0, policy_version 28290 (0.0007) [2023-03-07 10:21:55,447][175731] Updated weights for policy 0, policy_version 28300 (0.0007) [2023-03-07 10:21:56,250][175731] Updated weights for policy 0, policy_version 28310 (0.0007) [2023-03-07 10:21:57,042][175731] Updated weights for policy 0, policy_version 28320 (0.0006) [2023-03-07 10:21:57,841][175731] Updated weights for policy 0, policy_version 28330 (0.0007) [2023-03-07 10:21:58,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12704.5). Total num frames: 29015040. Throughput: 0: 12715.9. 
Samples: 28989335. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:21:58,321][175405] Avg episode reward: [(0, '155.279')] [2023-03-07 10:21:58,653][175731] Updated weights for policy 0, policy_version 28340 (0.0007) [2023-03-07 10:21:59,490][175731] Updated weights for policy 0, policy_version 28350 (0.0007) [2023-03-07 10:22:00,290][175731] Updated weights for policy 0, policy_version 28360 (0.0006) [2023-03-07 10:22:01,094][175731] Updated weights for policy 0, policy_version 28370 (0.0006) [2023-03-07 10:22:01,913][175731] Updated weights for policy 0, policy_version 28380 (0.0006) [2023-03-07 10:22:02,703][175731] Updated weights for policy 0, policy_version 28390 (0.0007) [2023-03-07 10:22:03,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12701.1). Total num frames: 29078528. Throughput: 0: 12711.4. Samples: 29065380. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:22:03,321][175405] Avg episode reward: [(0, '152.377')] [2023-03-07 10:22:03,511][175731] Updated weights for policy 0, policy_version 28400 (0.0007) [2023-03-07 10:22:04,308][175731] Updated weights for policy 0, policy_version 28410 (0.0006) [2023-03-07 10:22:05,116][175731] Updated weights for policy 0, policy_version 28420 (0.0006) [2023-03-07 10:22:05,935][175731] Updated weights for policy 0, policy_version 28430 (0.0006) [2023-03-07 10:22:06,712][175731] Updated weights for policy 0, policy_version 28440 (0.0006) [2023-03-07 10:22:07,538][175731] Updated weights for policy 0, policy_version 28450 (0.0006) [2023-03-07 10:22:08,321][175405] Fps is (10 sec: 12697.4, 60 sec: 12714.7, 300 sec: 12701.1). Total num frames: 29142016. Throughput: 0: 12710.2. Samples: 29141759. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:22:08,322][175405] Avg episode reward: [(0, '161.616')] [2023-03-07 10:22:08,343][175731] Updated weights for policy 0, policy_version 28460 (0.0007) [2023-03-07 10:22:09,145][175731] Updated weights for policy 0, policy_version 28470 (0.0007) [2023-03-07 10:22:09,969][175731] Updated weights for policy 0, policy_version 28480 (0.0007) [2023-03-07 10:22:10,759][175731] Updated weights for policy 0, policy_version 28490 (0.0007) [2023-03-07 10:22:11,575][175731] Updated weights for policy 0, policy_version 28500 (0.0006) [2023-03-07 10:22:12,378][175731] Updated weights for policy 0, policy_version 28510 (0.0006) [2023-03-07 10:22:13,192][175731] Updated weights for policy 0, policy_version 28520 (0.0007) [2023-03-07 10:22:13,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12701.1). Total num frames: 29205504. Throughput: 0: 12712.4. Samples: 29179818. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:22:13,321][175405] Avg episode reward: [(0, '154.376')] [2023-03-07 10:22:13,990][175731] Updated weights for policy 0, policy_version 28530 (0.0007) [2023-03-07 10:22:14,786][175731] Updated weights for policy 0, policy_version 28540 (0.0007) [2023-03-07 10:22:15,590][175731] Updated weights for policy 0, policy_version 28550 (0.0006) [2023-03-07 10:22:16,419][175731] Updated weights for policy 0, policy_version 28560 (0.0006) [2023-03-07 10:22:17,217][175731] Updated weights for policy 0, policy_version 28570 (0.0007) [2023-03-07 10:22:18,033][175731] Updated weights for policy 0, policy_version 28580 (0.0006) [2023-03-07 10:22:18,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12701.1). Total num frames: 29268992. Throughput: 0: 12707.5. Samples: 29255968. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:22:18,322][175405] Avg episode reward: [(0, '158.394')] [2023-03-07 10:22:18,838][175731] Updated weights for policy 0, policy_version 28590 (0.0006) [2023-03-07 10:22:19,645][175731] Updated weights for policy 0, policy_version 28600 (0.0007) [2023-03-07 10:22:20,451][175731] Updated weights for policy 0, policy_version 28610 (0.0006) [2023-03-07 10:22:21,282][175731] Updated weights for policy 0, policy_version 28620 (0.0006) [2023-03-07 10:22:22,080][175731] Updated weights for policy 0, policy_version 28630 (0.0006) [2023-03-07 10:22:22,878][175731] Updated weights for policy 0, policy_version 28640 (0.0006) [2023-03-07 10:22:23,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12701.1). Total num frames: 29332480. Throughput: 0: 12696.0. Samples: 29332034. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:22:23,321][175405] Avg episode reward: [(0, '152.244')] [2023-03-07 10:22:23,682][175731] Updated weights for policy 0, policy_version 28650 (0.0006) [2023-03-07 10:22:24,494][175731] Updated weights for policy 0, policy_version 28660 (0.0007) [2023-03-07 10:22:25,283][175731] Updated weights for policy 0, policy_version 28670 (0.0006) [2023-03-07 10:22:26,102][175731] Updated weights for policy 0, policy_version 28680 (0.0006) [2023-03-07 10:22:26,916][175731] Updated weights for policy 0, policy_version 28690 (0.0007) [2023-03-07 10:22:27,731][175731] Updated weights for policy 0, policy_version 28700 (0.0006) [2023-03-07 10:22:28,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12704.5). Total num frames: 29395968. Throughput: 0: 12691.0. Samples: 29369990. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:22:28,322][175405] Avg episode reward: [(0, '173.747')] [2023-03-07 10:22:28,537][175731] Updated weights for policy 0, policy_version 28710 (0.0008) [2023-03-07 10:22:29,359][175731] Updated weights for policy 0, policy_version 28720 (0.0006) [2023-03-07 10:22:30,159][175731] Updated weights for policy 0, policy_version 28730 (0.0006) [2023-03-07 10:22:30,957][175731] Updated weights for policy 0, policy_version 28740 (0.0006) [2023-03-07 10:22:31,764][175731] Updated weights for policy 0, policy_version 28750 (0.0007) [2023-03-07 10:22:32,585][175731] Updated weights for policy 0, policy_version 28760 (0.0006) [2023-03-07 10:22:33,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12697.6, 300 sec: 12701.1). Total num frames: 29459456. Throughput: 0: 12681.8. Samples: 29446006. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:22:33,321][175405] Avg episode reward: [(0, '191.639')] [2023-03-07 10:22:33,399][175731] Updated weights for policy 0, policy_version 28770 (0.0006) [2023-03-07 10:22:34,208][175731] Updated weights for policy 0, policy_version 28780 (0.0006) [2023-03-07 10:22:35,026][175731] Updated weights for policy 0, policy_version 28790 (0.0006) [2023-03-07 10:22:35,843][175731] Updated weights for policy 0, policy_version 28800 (0.0006) [2023-03-07 10:22:36,633][175731] Updated weights for policy 0, policy_version 28810 (0.0006) [2023-03-07 10:22:37,454][175731] Updated weights for policy 0, policy_version 28820 (0.0006) [2023-03-07 10:22:38,267][175731] Updated weights for policy 0, policy_version 28830 (0.0006) [2023-03-07 10:22:38,321][175405] Fps is (10 sec: 12595.2, 60 sec: 12680.5, 300 sec: 12697.6). Total num frames: 29521920. Throughput: 0: 12677.2. Samples: 29521731. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:22:38,322][175405] Avg episode reward: [(0, '197.159')] [2023-03-07 10:22:39,077][175731] Updated weights for policy 0, policy_version 28840 (0.0007) [2023-03-07 10:22:39,877][175731] Updated weights for policy 0, policy_version 28850 (0.0007) [2023-03-07 10:22:40,666][175731] Updated weights for policy 0, policy_version 28860 (0.0006) [2023-03-07 10:22:41,477][175731] Updated weights for policy 0, policy_version 28870 (0.0007) [2023-03-07 10:22:42,294][175731] Updated weights for policy 0, policy_version 28880 (0.0007) [2023-03-07 10:22:43,104][175731] Updated weights for policy 0, policy_version 28890 (0.0008) [2023-03-07 10:22:43,321][175405] Fps is (10 sec: 12595.1, 60 sec: 12680.5, 300 sec: 12701.1). Total num frames: 29585408. Throughput: 0: 12677.4. Samples: 29559818. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:22:43,332][175405] Avg episode reward: [(0, '152.403')] [2023-03-07 10:22:43,913][175731] Updated weights for policy 0, policy_version 28900 (0.0006) [2023-03-07 10:22:44,729][175731] Updated weights for policy 0, policy_version 28910 (0.0008) [2023-03-07 10:22:45,537][175731] Updated weights for policy 0, policy_version 28920 (0.0006) [2023-03-07 10:22:46,345][175731] Updated weights for policy 0, policy_version 28930 (0.0006) [2023-03-07 10:22:47,146][175731] Updated weights for policy 0, policy_version 28940 (0.0007) [2023-03-07 10:22:47,958][175731] Updated weights for policy 0, policy_version 28950 (0.0006) [2023-03-07 10:22:48,321][175405] Fps is (10 sec: 12697.8, 60 sec: 12680.5, 300 sec: 12701.1). Total num frames: 29648896. Throughput: 0: 12670.9. Samples: 29635573. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:22:48,332][175405] Avg episode reward: [(0, '158.021')] [2023-03-07 10:22:48,748][175731] Updated weights for policy 0, policy_version 28960 (0.0006) [2023-03-07 10:22:49,566][175731] Updated weights for policy 0, policy_version 28970 (0.0006) [2023-03-07 10:22:50,371][175731] Updated weights for policy 0, policy_version 28980 (0.0007) [2023-03-07 10:22:51,166][175731] Updated weights for policy 0, policy_version 28990 (0.0006) [2023-03-07 10:22:51,978][175731] Updated weights for policy 0, policy_version 29000 (0.0007) [2023-03-07 10:22:52,794][175731] Updated weights for policy 0, policy_version 29010 (0.0006) [2023-03-07 10:22:53,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12680.5, 300 sec: 12697.6). Total num frames: 29712384. Throughput: 0: 12671.9. Samples: 29711993. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:22:53,332][175405] Avg episode reward: [(0, '169.897')] [2023-03-07 10:22:53,585][175731] Updated weights for policy 0, policy_version 29020 (0.0008) [2023-03-07 10:22:54,392][175731] Updated weights for policy 0, policy_version 29030 (0.0006) [2023-03-07 10:22:55,199][175731] Updated weights for policy 0, policy_version 29040 (0.0007) [2023-03-07 10:22:56,009][175731] Updated weights for policy 0, policy_version 29050 (0.0006) [2023-03-07 10:22:56,811][175731] Updated weights for policy 0, policy_version 29060 (0.0006) [2023-03-07 10:22:57,635][175731] Updated weights for policy 0, policy_version 29070 (0.0007) [2023-03-07 10:22:58,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12697.6). Total num frames: 29775872. Throughput: 0: 12675.1. Samples: 29750198. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:22:58,332][175405] Avg episode reward: [(0, '172.243')] [2023-03-07 10:22:58,419][175731] Updated weights for policy 0, policy_version 29080 (0.0006) [2023-03-07 10:22:59,249][175731] Updated weights for policy 0, policy_version 29090 (0.0006) [2023-03-07 10:23:00,047][175731] Updated weights for policy 0, policy_version 29100 (0.0006) [2023-03-07 10:23:00,518][175680] KL-divergence is very high: 113.5622 [2023-03-07 10:23:00,838][175731] Updated weights for policy 0, policy_version 29110 (0.0006) [2023-03-07 10:23:01,673][175731] Updated weights for policy 0, policy_version 29120 (0.0007) [2023-03-07 10:23:02,471][175731] Updated weights for policy 0, policy_version 29130 (0.0007) [2023-03-07 10:23:03,291][175731] Updated weights for policy 0, policy_version 29140 (0.0007) [2023-03-07 10:23:03,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12680.5, 300 sec: 12697.6). Total num frames: 29839360. Throughput: 0: 12673.5. Samples: 29826278. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:23:03,332][175405] Avg episode reward: [(0, '190.899')] [2023-03-07 10:23:04,078][175731] Updated weights for policy 0, policy_version 29150 (0.0006) [2023-03-07 10:23:04,893][175731] Updated weights for policy 0, policy_version 29160 (0.0007) [2023-03-07 10:23:05,699][175731] Updated weights for policy 0, policy_version 29170 (0.0007) [2023-03-07 10:23:06,498][175731] Updated weights for policy 0, policy_version 29180 (0.0006) [2023-03-07 10:23:07,303][175731] Updated weights for policy 0, policy_version 29190 (0.0006) [2023-03-07 10:23:08,120][175731] Updated weights for policy 0, policy_version 29200 (0.0006) [2023-03-07 10:23:08,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12680.6, 300 sec: 12697.6). Total num frames: 29902848. Throughput: 0: 12674.1. Samples: 29902367. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:23:08,332][175405] Avg episode reward: [(0, '199.886')] [2023-03-07 10:23:08,336][175680] Saving new best policy, reward=199.886! [2023-03-07 10:23:08,930][175731] Updated weights for policy 0, policy_version 29210 (0.0006) [2023-03-07 10:23:09,723][175731] Updated weights for policy 0, policy_version 29220 (0.0006) [2023-03-07 10:23:10,541][175731] Updated weights for policy 0, policy_version 29230 (0.0007) [2023-03-07 10:23:11,351][175731] Updated weights for policy 0, policy_version 29240 (0.0006) [2023-03-07 10:23:12,148][175731] Updated weights for policy 0, policy_version 29250 (0.0006) [2023-03-07 10:23:12,934][175731] Updated weights for policy 0, policy_version 29260 (0.0006) [2023-03-07 10:23:13,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12680.5, 300 sec: 12697.6). Total num frames: 29966336. Throughput: 0: 12677.9. Samples: 29940497. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:23:13,322][175405] Avg episode reward: [(0, '206.599')] [2023-03-07 10:23:13,322][175680] Saving new best policy, reward=206.599! [2023-03-07 10:23:13,752][175731] Updated weights for policy 0, policy_version 29270 (0.0006) [2023-03-07 10:23:14,558][175731] Updated weights for policy 0, policy_version 29280 (0.0006) [2023-03-07 10:23:15,365][175731] Updated weights for policy 0, policy_version 29290 (0.0006) [2023-03-07 10:23:16,170][175731] Updated weights for policy 0, policy_version 29300 (0.0006) [2023-03-07 10:23:16,967][175731] Updated weights for policy 0, policy_version 29310 (0.0006) [2023-03-07 10:23:17,765][175731] Updated weights for policy 0, policy_version 29320 (0.0007) [2023-03-07 10:23:18,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12697.6). Total num frames: 30029824. Throughput: 0: 12685.8. Samples: 30016868. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:23:18,322][175405] Avg episode reward: [(0, '217.914')] [2023-03-07 10:23:18,330][175680] Saving new best policy, reward=217.914! [2023-03-07 10:23:18,564][175731] Updated weights for policy 0, policy_version 29330 (0.0007) [2023-03-07 10:23:19,359][175731] Updated weights for policy 0, policy_version 29340 (0.0007) [2023-03-07 10:23:20,180][175731] Updated weights for policy 0, policy_version 29350 (0.0007) [2023-03-07 10:23:20,985][175731] Updated weights for policy 0, policy_version 29360 (0.0007) [2023-03-07 10:23:21,789][175731] Updated weights for policy 0, policy_version 29370 (0.0006) [2023-03-07 10:23:22,606][175680] KL-divergence is very high: 478.1151 [2023-03-07 10:23:22,613][175731] Updated weights for policy 0, policy_version 29380 (0.0006) [2023-03-07 10:23:23,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12680.5, 300 sec: 12697.6). Total num frames: 30093312. Throughput: 0: 12696.6. Samples: 30093076. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:23:23,322][175405] Avg episode reward: [(0, '245.955')] [2023-03-07 10:23:23,323][175680] Saving new best policy, reward=245.955! 
[2023-03-07 10:23:23,436][175731] Updated weights for policy 0, policy_version 29390 (0.0007) [2023-03-07 10:23:24,231][175731] Updated weights for policy 0, policy_version 29400 (0.0005) [2023-03-07 10:23:24,385][175680] KL-divergence is very high: 173.2374 [2023-03-07 10:23:24,627][175680] KL-divergence is very high: 125.4685 [2023-03-07 10:23:24,959][175680] KL-divergence is very high: 104.0291 [2023-03-07 10:23:25,047][175731] Updated weights for policy 0, policy_version 29410 (0.0006) [2023-03-07 10:23:25,829][175731] Updated weights for policy 0, policy_version 29420 (0.0006) [2023-03-07 10:23:26,642][175731] Updated weights for policy 0, policy_version 29430 (0.0007) [2023-03-07 10:23:27,460][175731] Updated weights for policy 0, policy_version 29440 (0.0006) [2023-03-07 10:23:28,267][175731] Updated weights for policy 0, policy_version 29450 (0.0006) [2023-03-07 10:23:28,321][175405] Fps is (10 sec: 12697.4, 60 sec: 12680.5, 300 sec: 12697.6). Total num frames: 30156800. Throughput: 0: 12698.0. Samples: 30131227. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:23:28,322][175405] Avg episode reward: [(0, '204.368')] [2023-03-07 10:23:29,088][175731] Updated weights for policy 0, policy_version 29460 (0.0007) [2023-03-07 10:23:29,565][175680] KL-divergence is very high: 359.9844 [2023-03-07 10:23:29,632][175680] KL-divergence is very high: 338.1262 [2023-03-07 10:23:29,890][175731] Updated weights for policy 0, policy_version 29470 (0.0006) [2023-03-07 10:23:30,134][175680] KL-divergence is very high: 331.3398 [2023-03-07 10:23:30,209][175680] KL-divergence is very high: 660.3090 [2023-03-07 10:23:30,714][175731] Updated weights for policy 0, policy_version 29480 (0.0006) [2023-03-07 10:23:31,525][175731] Updated weights for policy 0, policy_version 29490 (0.0006) [2023-03-07 10:23:32,146][175680] KL-divergence is very high: 122.1724 [2023-03-07 10:23:32,305][175680] KL-divergence is very high: 112.1696 [2023-03-07 10:23:32,313][175731] Updated weights for policy 0, policy_version 29500 (0.0006) [2023-03-07 10:23:33,146][175731] Updated weights for policy 0, policy_version 29510 (0.0006) [2023-03-07 10:23:33,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12697.6). Total num frames: 30220288. Throughput: 0: 12701.7. Samples: 30207148. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:23:33,322][175405] Avg episode reward: [(0, '209.697')] [2023-03-07 10:23:33,925][175731] Updated weights for policy 0, policy_version 29520 (0.0006) [2023-03-07 10:23:34,406][175680] KL-divergence is very high: 128.6823 [2023-03-07 10:23:34,730][175731] Updated weights for policy 0, policy_version 29530 (0.0006) [2023-03-07 10:23:35,297][175680] KL-divergence is very high: 120.2804 [2023-03-07 10:23:35,538][175731] Updated weights for policy 0, policy_version 29540 (0.0006) [2023-03-07 10:23:36,344][175731] Updated weights for policy 0, policy_version 29550 (0.0006) [2023-03-07 10:23:36,875][175680] KL-divergence is very high: 118.5824 [2023-03-07 10:23:36,975][175680] KL-divergence is very high: 562.6223 [2023-03-07 10:23:37,146][175731] Updated weights for policy 0, policy_version 29560 (0.0006) [2023-03-07 10:23:37,225][175680] KL-divergence is very high: 164.8904 [2023-03-07 10:23:37,950][175731] Updated weights for policy 0, policy_version 29570 (0.0006) [2023-03-07 10:23:38,321][175405] Fps is (10 sec: 12697.8, 60 sec: 12697.6, 300 sec: 12697.6). Total num frames: 30283776. Throughput: 0: 12698.7. Samples: 30283431. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:23:38,321][175405] Avg episode reward: [(0, '212.874')] [2023-03-07 10:23:38,745][175731] Updated weights for policy 0, policy_version 29580 (0.0006) [2023-03-07 10:23:39,228][175680] KL-divergence is very high: 182.6170 [2023-03-07 10:23:39,546][175731] Updated weights for policy 0, policy_version 29590 (0.0007) [2023-03-07 10:23:40,269][175680] KL-divergence is very high: 126.7869 [2023-03-07 10:23:40,354][175731] Updated weights for policy 0, policy_version 29600 (0.0006) [2023-03-07 10:23:40,430][175680] KL-divergence is very high: 205.4879 [2023-03-07 10:23:40,585][175680] KL-divergence is very high: 139.2995 [2023-03-07 10:23:41,154][175731] Updated weights for policy 0, policy_version 29610 (0.0007) [2023-03-07 10:23:41,968][175731] Updated weights for policy 0, policy_version 29620 (0.0007) [2023-03-07 10:23:42,355][175680] KL-divergence is very high: 142.6002 [2023-03-07 10:23:42,775][175731] Updated weights for policy 0, policy_version 29630 (0.0006) [2023-03-07 10:23:43,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12714.7, 300 sec: 12701.1). Total num frames: 30348288. Throughput: 0: 12699.6. Samples: 30321680. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:23:43,322][175405] Avg episode reward: [(0, '217.590')] [2023-03-07 10:23:43,564][175680] KL-divergence is very high: 123.7062 [2023-03-07 10:23:43,572][175731] Updated weights for policy 0, policy_version 29640 (0.0006) [2023-03-07 10:23:44,393][175731] Updated weights for policy 0, policy_version 29650 (0.0006) [2023-03-07 10:23:45,030][175680] KL-divergence is very high: 170.3768 [2023-03-07 10:23:45,203][175731] Updated weights for policy 0, policy_version 29660 (0.0006) [2023-03-07 10:23:46,012][175731] Updated weights for policy 0, policy_version 29670 (0.0006) [2023-03-07 10:23:46,829][175731] Updated weights for policy 0, policy_version 29680 (0.0006) [2023-03-07 10:23:47,650][175731] Updated weights for policy 0, policy_version 29690 (0.0006) [2023-03-07 10:23:48,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12697.6). Total num frames: 30410752. Throughput: 0: 12698.1. Samples: 30397691. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:23:48,322][175405] Avg episode reward: [(0, '229.048')] [2023-03-07 10:23:48,325][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000029698_30410752.pth... 
[2023-03-07 10:23:48,357][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000026723_27364352.pth [2023-03-07 10:23:48,458][175731] Updated weights for policy 0, policy_version 29700 (0.0006) [2023-03-07 10:23:49,262][175731] Updated weights for policy 0, policy_version 29710 (0.0006) [2023-03-07 10:23:50,066][175731] Updated weights for policy 0, policy_version 29720 (0.0006) [2023-03-07 10:23:50,878][175731] Updated weights for policy 0, policy_version 29730 (0.0006) [2023-03-07 10:23:51,688][175731] Updated weights for policy 0, policy_version 29740 (0.0006) [2023-03-07 10:23:52,513][175731] Updated weights for policy 0, policy_version 29750 (0.0006) [2023-03-07 10:23:53,321][175405] Fps is (10 sec: 12492.7, 60 sec: 12680.5, 300 sec: 12694.1). Total num frames: 30473216. Throughput: 0: 12686.6. Samples: 30473264. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:23:53,322][175405] Avg episode reward: [(0, '235.035')] [2023-03-07 10:23:53,324][175731] Updated weights for policy 0, policy_version 29760 (0.0006) [2023-03-07 10:23:53,549][175680] KL-divergence is very high: 113.2554 [2023-03-07 10:23:54,152][175731] Updated weights for policy 0, policy_version 29770 (0.0006) [2023-03-07 10:23:54,685][175680] KL-divergence is very high: 1414.1547 [2023-03-07 10:23:54,841][175680] KL-divergence is very high: 289.7794 [2023-03-07 10:23:54,948][175731] Updated weights for policy 0, policy_version 29780 (0.0006) [2023-03-07 10:23:55,752][175731] Updated weights for policy 0, policy_version 29790 (0.0006) [2023-03-07 10:23:56,554][175731] Updated weights for policy 0, policy_version 29800 (0.0008) [2023-03-07 10:23:56,627][175680] KL-divergence is very high: 514.6101 [2023-03-07 10:23:56,944][175680] KL-divergence is very high: 115.5418 [2023-03-07 10:23:57,108][175680] KL-divergence is very high: 167.7484 [2023-03-07 10:23:57,350][175731] Updated weights for policy 0, policy_version 29810 (0.0007) 
[2023-03-07 10:23:57,509][175680] KL-divergence is very high: 136.1508 [2023-03-07 10:23:58,151][175731] Updated weights for policy 0, policy_version 29820 (0.0006) [2023-03-07 10:23:58,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12697.6). Total num frames: 30537728. Throughput: 0: 12685.5. Samples: 30511344. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:23:58,322][175405] Avg episode reward: [(0, '220.555')] [2023-03-07 10:23:58,554][175680] KL-divergence is very high: 185.1103 [2023-03-07 10:23:58,870][175680] KL-divergence is very high: 104.3950 [2023-03-07 10:23:58,947][175731] Updated weights for policy 0, policy_version 29830 (0.0006) [2023-03-07 10:23:59,757][175731] Updated weights for policy 0, policy_version 29840 (0.0006) [2023-03-07 10:24:00,569][175731] Updated weights for policy 0, policy_version 29850 (0.0008) [2023-03-07 10:24:01,390][175731] Updated weights for policy 0, policy_version 29860 (0.0007) [2023-03-07 10:24:01,927][175680] KL-divergence is very high: 234.6439 [2023-03-07 10:24:02,084][175680] KL-divergence is very high: 507.5903 [2023-03-07 10:24:02,194][175731] Updated weights for policy 0, policy_version 29870 (0.0007) [2023-03-07 10:24:02,243][175680] KL-divergence is very high: 102.4051 [2023-03-07 10:24:02,340][175680] KL-divergence is very high: 432.6976 [2023-03-07 10:24:02,412][175680] KL-divergence is very high: 195.8509 [2023-03-07 10:24:02,663][175680] KL-divergence is very high: 141.6211 [2023-03-07 10:24:02,983][175731] Updated weights for policy 0, policy_version 29880 (0.0006) [2023-03-07 10:24:03,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12697.6, 300 sec: 12697.6). Total num frames: 30601216. Throughput: 0: 12685.2. Samples: 30587702. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:24:03,322][175405] Avg episode reward: [(0, '204.072')] [2023-03-07 10:24:03,785][175731] Updated weights for policy 0, policy_version 29890 (0.0007) [2023-03-07 10:24:04,594][175731] Updated weights for policy 0, policy_version 29900 (0.0006) [2023-03-07 10:24:05,406][175731] Updated weights for policy 0, policy_version 29910 (0.0006) [2023-03-07 10:24:06,253][175731] Updated weights for policy 0, policy_version 29920 (0.0006) [2023-03-07 10:24:07,039][175731] Updated weights for policy 0, policy_version 29930 (0.0006) [2023-03-07 10:24:07,190][175680] KL-divergence is very high: 858.0700 [2023-03-07 10:24:07,278][175680] KL-divergence is very high: 3907.8889 [2023-03-07 10:24:07,342][175680] KL-divergence is very high: 471.3178 [2023-03-07 10:24:07,506][175680] KL-divergence is very high: 151.9905 [2023-03-07 10:24:07,673][175680] KL-divergence is very high: 767.7336 [2023-03-07 10:24:07,841][175731] Updated weights for policy 0, policy_version 29940 (0.0006) [2023-03-07 10:24:07,911][175680] KL-divergence is very high: 594.5944 [2023-03-07 10:24:08,321][175405] Fps is (10 sec: 12595.1, 60 sec: 12680.5, 300 sec: 12694.1). Total num frames: 30663680. Throughput: 0: 12680.3. Samples: 30663693. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:24:08,322][175405] Avg episode reward: [(0, '180.887')] [2023-03-07 10:24:08,666][175731] Updated weights for policy 0, policy_version 29950 (0.0007) [2023-03-07 10:24:09,471][175731] Updated weights for policy 0, policy_version 29960 (0.0006) [2023-03-07 10:24:09,701][175680] KL-divergence is very high: 188.7717 [2023-03-07 10:24:09,798][175680] KL-divergence is very high: 120.2473 [2023-03-07 10:24:10,287][175731] Updated weights for policy 0, policy_version 29970 (0.0007) [2023-03-07 10:24:11,086][175731] Updated weights for policy 0, policy_version 29980 (0.0006) [2023-03-07 10:24:11,872][175680] KL-divergence is very high: 166.8187 [2023-03-07 10:24:11,880][175731] Updated weights for policy 0, policy_version 29990 (0.0006) [2023-03-07 10:24:12,030][175680] KL-divergence is very high: 166.1744 [2023-03-07 10:24:12,196][175680] KL-divergence is very high: 717.7037 [2023-03-07 10:24:12,281][175680] KL-divergence is very high: 604.1915 [2023-03-07 10:24:12,522][175680] KL-divergence is very high: 362.0722 [2023-03-07 10:24:12,692][175680] KL-divergence is very high: 125.3390 [2023-03-07 10:24:12,700][175731] Updated weights for policy 0, policy_version 30000 (0.0006) [2023-03-07 10:24:13,321][175405] Fps is (10 sec: 12595.1, 60 sec: 12680.5, 300 sec: 12694.1). Total num frames: 30727168. Throughput: 0: 12676.0. Samples: 30701646. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:24:13,322][175405] Avg episode reward: [(0, '166.178')] [2023-03-07 10:24:13,510][175731] Updated weights for policy 0, policy_version 30010 (0.0006) [2023-03-07 10:24:14,298][175731] Updated weights for policy 0, policy_version 30020 (0.0006) [2023-03-07 10:24:14,537][175680] KL-divergence is very high: 354.7731 [2023-03-07 10:24:14,621][175680] KL-divergence is very high: 929.6456 [2023-03-07 10:24:15,111][175731] Updated weights for policy 0, policy_version 30030 (0.0006) [2023-03-07 10:24:15,427][175680] KL-divergence is very high: 342.4931 [2023-03-07 10:24:15,501][175680] KL-divergence is very high: 161.3655 [2023-03-07 10:24:15,592][175680] KL-divergence is very high: 119.3842 [2023-03-07 10:24:15,925][175731] Updated weights for policy 0, policy_version 30040 (0.0006) [2023-03-07 10:24:16,726][175731] Updated weights for policy 0, policy_version 30050 (0.0006) [2023-03-07 10:24:17,541][175731] Updated weights for policy 0, policy_version 30060 (0.0006) [2023-03-07 10:24:17,616][175680] KL-divergence is very high: 254.7469 [2023-03-07 10:24:17,701][175680] KL-divergence is very high: 521.0743 [2023-03-07 10:24:17,776][175680] KL-divergence is very high: 505.5013 [2023-03-07 10:24:17,935][175680] KL-divergence is very high: 1119.0859 [2023-03-07 10:24:18,089][175680] KL-divergence is very high: 540.1917 [2023-03-07 10:24:18,321][175405] Fps is (10 sec: 12697.8, 60 sec: 12680.5, 300 sec: 12694.1). Total num frames: 30790656. Throughput: 0: 12681.4. Samples: 30777812. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:24:18,322][175405] Avg episode reward: [(0, '182.853')] [2023-03-07 10:24:18,335][175731] Updated weights for policy 0, policy_version 30070 (0.0006) [2023-03-07 10:24:18,573][175680] KL-divergence is very high: 144.3796 [2023-03-07 10:24:19,144][175731] Updated weights for policy 0, policy_version 30080 (0.0007) [2023-03-07 10:24:19,946][175731] Updated weights for policy 0, policy_version 30090 (0.0007) [2023-03-07 10:24:20,762][175731] Updated weights for policy 0, policy_version 30100 (0.0006) [2023-03-07 10:24:21,569][175731] Updated weights for policy 0, policy_version 30110 (0.0007) [2023-03-07 10:24:22,374][175731] Updated weights for policy 0, policy_version 30120 (0.0006) [2023-03-07 10:24:23,205][175731] Updated weights for policy 0, policy_version 30130 (0.0007) [2023-03-07 10:24:23,321][175405] Fps is (10 sec: 12697.8, 60 sec: 12680.5, 300 sec: 12694.1). Total num frames: 30854144. Throughput: 0: 12674.7. Samples: 30853794. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:24:23,322][175405] Avg episode reward: [(0, '166.935')] [2023-03-07 10:24:24,009][175731] Updated weights for policy 0, policy_version 30140 (0.0006) [2023-03-07 10:24:24,813][175731] Updated weights for policy 0, policy_version 30150 (0.0006) [2023-03-07 10:24:25,362][175680] KL-divergence is very high: 215.2612 [2023-03-07 10:24:25,447][175680] KL-divergence is very high: 252.1096 [2023-03-07 10:24:25,610][175680] KL-divergence is very high: 191.9124 [2023-03-07 10:24:25,617][175731] Updated weights for policy 0, policy_version 30160 (0.0006) [2023-03-07 10:24:26,434][175731] Updated weights for policy 0, policy_version 30170 (0.0006) [2023-03-07 10:24:27,239][175731] Updated weights for policy 0, policy_version 30180 (0.0007) [2023-03-07 10:24:28,054][175731] Updated weights for policy 0, policy_version 30190 (0.0007) [2023-03-07 10:24:28,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12680.5, 300 sec: 12694.1). Total num frames: 30917632. Throughput: 0: 12669.9. Samples: 30891826. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:24:28,322][175405] Avg episode reward: [(0, '193.839')] [2023-03-07 10:24:28,866][175731] Updated weights for policy 0, policy_version 30200 (0.0006) [2023-03-07 10:24:29,671][175731] Updated weights for policy 0, policy_version 30210 (0.0006) [2023-03-07 10:24:30,489][175731] Updated weights for policy 0, policy_version 30220 (0.0006) [2023-03-07 10:24:31,279][175731] Updated weights for policy 0, policy_version 30230 (0.0007) [2023-03-07 10:24:32,092][175731] Updated weights for policy 0, policy_version 30240 (0.0007) [2023-03-07 10:24:32,893][175731] Updated weights for policy 0, policy_version 30250 (0.0006) [2023-03-07 10:24:33,321][175405] Fps is (10 sec: 12697.4, 60 sec: 12680.5, 300 sec: 12694.1). Total num frames: 30981120. Throughput: 0: 12667.7. Samples: 30967739. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:24:33,322][175405] Avg episode reward: [(0, '197.380')] [2023-03-07 10:24:33,695][175731] Updated weights for policy 0, policy_version 30260 (0.0007) [2023-03-07 10:24:34,512][175731] Updated weights for policy 0, policy_version 30270 (0.0006) [2023-03-07 10:24:35,322][175731] Updated weights for policy 0, policy_version 30280 (0.0006) [2023-03-07 10:24:36,122][175731] Updated weights for policy 0, policy_version 30290 (0.0007) [2023-03-07 10:24:36,927][175731] Updated weights for policy 0, policy_version 30300 (0.0007) [2023-03-07 10:24:37,730][175731] Updated weights for policy 0, policy_version 30310 (0.0006) [2023-03-07 10:24:38,321][175405] Fps is (10 sec: 12697.8, 60 sec: 12680.5, 300 sec: 12694.1). Total num frames: 31044608. Throughput: 0: 12678.6. Samples: 31043798. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:24:38,322][175405] Avg episode reward: [(0, '165.380')] [2023-03-07 10:24:38,554][175731] Updated weights for policy 0, policy_version 30320 (0.0006) [2023-03-07 10:24:39,350][175731] Updated weights for policy 0, policy_version 30330 (0.0006) [2023-03-07 10:24:40,154][175731] Updated weights for policy 0, policy_version 30340 (0.0006) [2023-03-07 10:24:40,984][175731] Updated weights for policy 0, policy_version 30350 (0.0007) [2023-03-07 10:24:41,787][175731] Updated weights for policy 0, policy_version 30360 (0.0006) [2023-03-07 10:24:42,609][175731] Updated weights for policy 0, policy_version 30370 (0.0006) [2023-03-07 10:24:43,321][175405] Fps is (10 sec: 12595.3, 60 sec: 12646.4, 300 sec: 12687.2). Total num frames: 31107072. Throughput: 0: 12673.0. Samples: 31081629. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:24:43,322][175405] Avg episode reward: [(0, '201.418')] [2023-03-07 10:24:43,435][175731] Updated weights for policy 0, policy_version 30380 (0.0007) [2023-03-07 10:24:44,235][175731] Updated weights for policy 0, policy_version 30390 (0.0007) [2023-03-07 10:24:45,032][175731] Updated weights for policy 0, policy_version 30400 (0.0007) [2023-03-07 10:24:45,850][175731] Updated weights for policy 0, policy_version 30410 (0.0007) [2023-03-07 10:24:46,642][175731] Updated weights for policy 0, policy_version 30420 (0.0006) [2023-03-07 10:24:47,429][175731] Updated weights for policy 0, policy_version 30430 (0.0007) [2023-03-07 10:24:48,249][175731] Updated weights for policy 0, policy_version 30440 (0.0006) [2023-03-07 10:24:48,321][175405] Fps is (10 sec: 12595.1, 60 sec: 12663.5, 300 sec: 12687.2). Total num frames: 31170560. Throughput: 0: 12669.2. Samples: 31157818. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:24:48,322][175405] Avg episode reward: [(0, '176.917')] [2023-03-07 10:24:49,050][175731] Updated weights for policy 0, policy_version 30450 (0.0006) [2023-03-07 10:24:49,845][175731] Updated weights for policy 0, policy_version 30460 (0.0006) [2023-03-07 10:24:50,683][175731] Updated weights for policy 0, policy_version 30470 (0.0006) [2023-03-07 10:24:51,475][175731] Updated weights for policy 0, policy_version 30480 (0.0006) [2023-03-07 10:24:52,290][175731] Updated weights for policy 0, policy_version 30490 (0.0006) [2023-03-07 10:24:53,091][175731] Updated weights for policy 0, policy_version 30500 (0.0007) [2023-03-07 10:24:53,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12687.2). Total num frames: 31234048. Throughput: 0: 12668.4. Samples: 31233770. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:24:53,321][175405] Avg episode reward: [(0, '180.107')] [2023-03-07 10:24:53,920][175731] Updated weights for policy 0, policy_version 30510 (0.0006) [2023-03-07 10:24:54,727][175731] Updated weights for policy 0, policy_version 30520 (0.0007) [2023-03-07 10:24:55,533][175731] Updated weights for policy 0, policy_version 30530 (0.0006) [2023-03-07 10:24:56,352][175731] Updated weights for policy 0, policy_version 30540 (0.0006) [2023-03-07 10:24:57,164][175731] Updated weights for policy 0, policy_version 30550 (0.0006) [2023-03-07 10:24:57,956][175731] Updated weights for policy 0, policy_version 30560 (0.0006) [2023-03-07 10:24:58,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12663.5, 300 sec: 12687.2). Total num frames: 31297536. Throughput: 0: 12667.0. Samples: 31271661. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:24:58,322][175405] Avg episode reward: [(0, '177.161')] [2023-03-07 10:24:58,775][175731] Updated weights for policy 0, policy_version 30570 (0.0007) [2023-03-07 10:24:59,590][175731] Updated weights for policy 0, policy_version 30580 (0.0007) [2023-03-07 10:25:00,384][175731] Updated weights for policy 0, policy_version 30590 (0.0007) [2023-03-07 10:25:01,190][175731] Updated weights for policy 0, policy_version 30600 (0.0007) [2023-03-07 10:25:01,993][175731] Updated weights for policy 0, policy_version 30610 (0.0007) [2023-03-07 10:25:02,473][175680] KL-divergence is very high: 175.7074 [2023-03-07 10:25:02,801][175731] Updated weights for policy 0, policy_version 30620 (0.0007) [2023-03-07 10:25:03,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12663.5, 300 sec: 12690.8). Total num frames: 31361024. Throughput: 0: 12666.1. Samples: 31347786. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:25:03,322][175405] Avg episode reward: [(0, '264.617')] [2023-03-07 10:25:03,322][175680] Saving new best policy, reward=264.617! 
[2023-03-07 10:25:03,603][175731] Updated weights for policy 0, policy_version 30630 (0.0007)
[2023-03-07 10:25:04,402][175731] Updated weights for policy 0, policy_version 30640 (0.0007)
[2023-03-07 10:25:05,192][175731] Updated weights for policy 0, policy_version 30650 (0.0006)
[2023-03-07 10:25:05,995][175731] Updated weights for policy 0, policy_version 30660 (0.0006)
[2023-03-07 10:25:06,786][175731] Updated weights for policy 0, policy_version 30670 (0.0006)
[2023-03-07 10:25:07,592][175731] Updated weights for policy 0, policy_version 30680 (0.0007)
[2023-03-07 10:25:08,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12680.6, 300 sec: 12690.7). Total num frames: 31424512. Throughput: 0: 12682.2. Samples: 31424494. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0)
[2023-03-07 10:25:08,322][175405] Avg episode reward: [(0, '215.748')]
[2023-03-07 10:25:08,387][175731] Updated weights for policy 0, policy_version 30690 (0.0006)
[2023-03-07 10:25:09,204][175731] Updated weights for policy 0, policy_version 30700 (0.0007)
[2023-03-07 10:25:10,023][175731] Updated weights for policy 0, policy_version 30710 (0.0007)
[2023-03-07 10:25:10,813][175731] Updated weights for policy 0, policy_version 30720 (0.0007)
[2023-03-07 10:25:11,629][175731] Updated weights for policy 0, policy_version 30730 (0.0006)
[2023-03-07 10:25:12,443][175731] Updated weights for policy 0, policy_version 30740 (0.0006)
[2023-03-07 10:25:13,273][175731] Updated weights for policy 0, policy_version 30750 (0.0007)
[2023-03-07 10:25:13,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12680.5, 300 sec: 12690.7). Total num frames: 31488000. Throughput: 0: 12679.9. Samples: 31462422. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0)
[2023-03-07 10:25:13,322][175405] Avg episode reward: [(0, '252.200')]
[2023-03-07 10:25:14,073][175731] Updated weights for policy 0, policy_version 30760 (0.0007)
[2023-03-07 10:25:14,857][175731] Updated weights for policy 0, policy_version 30770 (0.0006)
[2023-03-07 10:25:15,678][175731] Updated weights for policy 0, policy_version 30780 (0.0006)
[2023-03-07 10:25:16,486][175731] Updated weights for policy 0, policy_version 30790 (0.0007)
[2023-03-07 10:25:17,293][175731] Updated weights for policy 0, policy_version 30800 (0.0006)
[2023-03-07 10:25:18,107][175731] Updated weights for policy 0, policy_version 30810 (0.0008)
[2023-03-07 10:25:18,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12680.5, 300 sec: 12690.7). Total num frames: 31551488. Throughput: 0: 12683.8. Samples: 31538509. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0)
[2023-03-07 10:25:18,322][175405] Avg episode reward: [(0, '261.135')]
[2023-03-07 10:25:18,917][175731] Updated weights for policy 0, policy_version 30820 (0.0006)
[2023-03-07 10:25:19,557][175680] KL-divergence is very high: 169.9197
[2023-03-07 10:25:19,721][175731] Updated weights for policy 0, policy_version 30830 (0.0006)
[2023-03-07 10:25:20,538][175731] Updated weights for policy 0, policy_version 30840 (0.0007)
[2023-03-07 10:25:21,356][175731] Updated weights for policy 0, policy_version 30850 (0.0007)
[2023-03-07 10:25:22,156][175731] Updated weights for policy 0, policy_version 30860 (0.0007)
[2023-03-07 10:25:22,967][175731] Updated weights for policy 0, policy_version 30870 (0.0007)
[2023-03-07 10:25:23,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12680.5, 300 sec: 12690.7). Total num frames: 31614976. Throughput: 0: 12678.1. Samples: 31614313. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0)
[2023-03-07 10:25:23,321][175405] Avg episode reward: [(0, '219.720')]
[2023-03-07 10:25:23,764][175731] Updated weights for policy 0, policy_version 30880 (0.0006)
[2023-03-07 10:25:24,568][175731] Updated weights for policy 0, policy_version 30890 (0.0008)
[2023-03-07 10:25:25,392][175731] Updated weights for policy 0, policy_version 30900 (0.0007)
[2023-03-07 10:25:26,205][175731] Updated weights for policy 0, policy_version 30910 (0.0006)
[2023-03-07 10:25:27,017][175731] Updated weights for policy 0, policy_version 30920 (0.0006)
[2023-03-07 10:25:27,829][175731] Updated weights for policy 0, policy_version 30930 (0.0006)
[2023-03-07 10:25:28,321][175405] Fps is (10 sec: 12697.8, 60 sec: 12680.6, 300 sec: 12690.7). Total num frames: 31678464. Throughput: 0: 12683.7. Samples: 31652393. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0)
[2023-03-07 10:25:28,321][175405] Avg episode reward: [(0, '235.825')]
[2023-03-07 10:25:28,632][175731] Updated weights for policy 0, policy_version 30940 (0.0006)
[2023-03-07 10:25:29,446][175731] Updated weights for policy 0, policy_version 30950 (0.0006)
[2023-03-07 10:25:30,243][175731] Updated weights for policy 0, policy_version 30960 (0.0007)
[2023-03-07 10:25:31,043][175731] Updated weights for policy 0, policy_version 30970 (0.0006)
[2023-03-07 10:25:31,858][175731] Updated weights for policy 0, policy_version 30980 (0.0007)
[2023-03-07 10:25:32,662][175731] Updated weights for policy 0, policy_version 30990 (0.0008)
[2023-03-07 10:25:33,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12680.5, 300 sec: 12690.7). Total num frames: 31741952. Throughput: 0: 12679.0. Samples: 31728372. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0)
[2023-03-07 10:25:33,322][175405] Avg episode reward: [(0, '202.304')]
[2023-03-07 10:25:33,467][175731] Updated weights for policy 0, policy_version 31000 (0.0006)
[2023-03-07 10:25:34,265][175731] Updated weights for policy 0, policy_version 31010 (0.0006)
[2023-03-07 10:25:35,073][175731] Updated weights for policy 0, policy_version 31020 (0.0006)
[2023-03-07 10:25:35,881][175731] Updated weights for policy 0, policy_version 31030 (0.0006)
[2023-03-07 10:25:36,705][175731] Updated weights for policy 0, policy_version 31040 (0.0007)
[2023-03-07 10:25:37,508][175731] Updated weights for policy 0, policy_version 31050 (0.0006)
[2023-03-07 10:25:38,321][175405] Fps is (10 sec: 12595.1, 60 sec: 12663.5, 300 sec: 12687.2). Total num frames: 31804416. Throughput: 0: 12684.4. Samples: 31804570. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0)
[2023-03-07 10:25:38,321][175405] Avg episode reward: [(0, '234.906')]
[2023-03-07 10:25:38,326][175731] Updated weights for policy 0, policy_version 31060 (0.0006)
[2023-03-07 10:25:39,137][175731] Updated weights for policy 0, policy_version 31070 (0.0006)
[2023-03-07 10:25:39,933][175731] Updated weights for policy 0, policy_version 31080 (0.0007)
[2023-03-07 10:25:40,729][175731] Updated weights for policy 0, policy_version 31090 (0.0007)
[2023-03-07 10:25:41,275][175680] KL-divergence is very high: 117.5497
[2023-03-07 10:25:41,525][175731] Updated weights for policy 0, policy_version 31100 (0.0006)
[2023-03-07 10:25:42,326][175731] Updated weights for policy 0, policy_version 31110 (0.0007)
[2023-03-07 10:25:43,114][175680] KL-divergence is very high: 321.8772
[2023-03-07 10:25:43,122][175731] Updated weights for policy 0, policy_version 31120 (0.0007)
[2023-03-07 10:25:43,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12697.6, 300 sec: 12687.2). Total num frames: 31868928. Throughput: 0: 12691.4. Samples: 31842771. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0)
[2023-03-07 10:25:43,322][175405] Avg episode reward: [(0, '182.526')]
[2023-03-07 10:25:43,439][175680] KL-divergence is very high: 102.1349
[2023-03-07 10:25:43,527][175680] KL-divergence is very high: 147.2781
[2023-03-07 10:25:43,933][175731] Updated weights for policy 0, policy_version 31130 (0.0006)
[2023-03-07 10:25:44,737][175731] Updated weights for policy 0, policy_version 31140 (0.0007)
[2023-03-07 10:25:45,540][175731] Updated weights for policy 0, policy_version 31150 (0.0006)
[2023-03-07 10:25:46,339][175731] Updated weights for policy 0, policy_version 31160 (0.0007)
[2023-03-07 10:25:47,143][175731] Updated weights for policy 0, policy_version 31170 (0.0006)
[2023-03-07 10:25:47,943][175731] Updated weights for policy 0, policy_version 31180 (0.0007)
[2023-03-07 10:25:48,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12697.6, 300 sec: 12687.2). Total num frames: 31932416. Throughput: 0: 12699.9. Samples: 31919283. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0)
[2023-03-07 10:25:48,321][175405] Avg episode reward: [(0, '148.935')]
[2023-03-07 10:25:48,337][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000031185_31933440.pth...
[2023-03-07 10:25:48,367][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000028212_28889088.pth [2023-03-07 10:25:48,755][175731] Updated weights for policy 0, policy_version 31190 (0.0007) [2023-03-07 10:25:49,550][175731] Updated weights for policy 0, policy_version 31200 (0.0006) [2023-03-07 10:25:50,335][175731] Updated weights for policy 0, policy_version 31210 (0.0006) [2023-03-07 10:25:51,135][175731] Updated weights for policy 0, policy_version 31220 (0.0006) [2023-03-07 10:25:51,935][175731] Updated weights for policy 0, policy_version 31230 (0.0007) [2023-03-07 10:25:52,730][175731] Updated weights for policy 0, policy_version 31240 (0.0006) [2023-03-07 10:25:53,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12714.6, 300 sec: 12690.7). Total num frames: 31996928. Throughput: 0: 12705.6. Samples: 31996248. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:25:53,322][175405] Avg episode reward: [(0, '209.042')] [2023-03-07 10:25:53,532][175731] Updated weights for policy 0, policy_version 31250 (0.0006) [2023-03-07 10:25:54,329][175731] Updated weights for policy 0, policy_version 31260 (0.0007) [2023-03-07 10:25:55,134][175731] Updated weights for policy 0, policy_version 31270 (0.0007) [2023-03-07 10:25:55,945][175731] Updated weights for policy 0, policy_version 31280 (0.0007) [2023-03-07 10:25:56,742][175731] Updated weights for policy 0, policy_version 31290 (0.0007) [2023-03-07 10:25:57,560][175731] Updated weights for policy 0, policy_version 31300 (0.0006) [2023-03-07 10:25:58,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12714.7, 300 sec: 12690.6). Total num frames: 32060416. Throughput: 0: 12711.7. Samples: 32034449. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:25:58,322][175405] Avg episode reward: [(0, '221.566')] [2023-03-07 10:25:58,337][175731] Updated weights for policy 0, policy_version 31310 (0.0006) [2023-03-07 10:25:59,148][175731] Updated weights for policy 0, policy_version 31320 (0.0006) [2023-03-07 10:25:59,779][175680] KL-divergence is very high: 112.7455 [2023-03-07 10:25:59,865][175680] KL-divergence is very high: 129.9062 [2023-03-07 10:25:59,944][175731] Updated weights for policy 0, policy_version 31330 (0.0007) [2023-03-07 10:26:00,023][175680] KL-divergence is very high: 185.6182 [2023-03-07 10:26:00,751][175731] Updated weights for policy 0, policy_version 31340 (0.0006) [2023-03-07 10:26:01,566][175731] Updated weights for policy 0, policy_version 31350 (0.0007) [2023-03-07 10:26:02,369][175731] Updated weights for policy 0, policy_version 31360 (0.0007) [2023-03-07 10:26:03,190][175731] Updated weights for policy 0, policy_version 31370 (0.0006) [2023-03-07 10:26:03,321][175405] Fps is (10 sec: 12697.8, 60 sec: 12714.7, 300 sec: 12694.1). Total num frames: 32123904. Throughput: 0: 12720.0. Samples: 32110906. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:26:03,332][175405] Avg episode reward: [(0, '114.802')] [2023-03-07 10:26:03,997][175731] Updated weights for policy 0, policy_version 31380 (0.0006) [2023-03-07 10:26:04,780][175731] Updated weights for policy 0, policy_version 31390 (0.0006) [2023-03-07 10:26:05,583][175731] Updated weights for policy 0, policy_version 31400 (0.0006) [2023-03-07 10:26:06,364][175731] Updated weights for policy 0, policy_version 31410 (0.0006) [2023-03-07 10:26:07,175][175731] Updated weights for policy 0, policy_version 31420 (0.0008) [2023-03-07 10:26:07,977][175731] Updated weights for policy 0, policy_version 31430 (0.0007) [2023-03-07 10:26:08,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12731.7, 300 sec: 12694.1). Total num frames: 32188416. Throughput: 0: 12743.7. 
Samples: 32187781. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:26:08,322][175405] Avg episode reward: [(0, '137.391')] [2023-03-07 10:26:08,768][175731] Updated weights for policy 0, policy_version 31440 (0.0006) [2023-03-07 10:26:09,568][175731] Updated weights for policy 0, policy_version 31450 (0.0006) [2023-03-07 10:26:10,371][175731] Updated weights for policy 0, policy_version 31460 (0.0007) [2023-03-07 10:26:11,161][175731] Updated weights for policy 0, policy_version 31470 (0.0006) [2023-03-07 10:26:11,986][175731] Updated weights for policy 0, policy_version 31480 (0.0006) [2023-03-07 10:26:12,778][175731] Updated weights for policy 0, policy_version 31490 (0.0006) [2023-03-07 10:26:13,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12697.6). Total num frames: 32251904. Throughput: 0: 12750.8. Samples: 32226180. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:26:13,321][175405] Avg episode reward: [(0, '203.293')] [2023-03-07 10:26:13,582][175731] Updated weights for policy 0, policy_version 31500 (0.0007) [2023-03-07 10:26:14,386][175731] Updated weights for policy 0, policy_version 31510 (0.0006) [2023-03-07 10:26:15,191][175731] Updated weights for policy 0, policy_version 31520 (0.0006) [2023-03-07 10:26:15,740][175680] KL-divergence is very high: 346.2697 [2023-03-07 10:26:15,964][175731] Updated weights for policy 0, policy_version 31530 (0.0006) [2023-03-07 10:26:16,786][175731] Updated weights for policy 0, policy_version 31540 (0.0006) [2023-03-07 10:26:17,316][175680] KL-divergence is very high: 143.8280 [2023-03-07 10:26:17,578][175731] Updated weights for policy 0, policy_version 31550 (0.0007) [2023-03-07 10:26:17,896][175680] KL-divergence is very high: 296.4497 [2023-03-07 10:26:18,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12748.8, 300 sec: 12701.1). Total num frames: 32316416. Throughput: 0: 12766.0. Samples: 32302843. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:26:18,322][175405] Avg episode reward: [(0, '101.303')] [2023-03-07 10:26:18,362][175731] Updated weights for policy 0, policy_version 31560 (0.0006) [2023-03-07 10:26:18,773][175680] KL-divergence is very high: 260.6092 [2023-03-07 10:26:19,091][175680] KL-divergence is very high: 1949.3467 [2023-03-07 10:26:19,169][175731] Updated weights for policy 0, policy_version 31570 (0.0007) [2023-03-07 10:26:19,255][175680] KL-divergence is very high: 883.7693 [2023-03-07 10:26:19,966][175731] Updated weights for policy 0, policy_version 31580 (0.0006) [2023-03-07 10:26:20,206][175680] KL-divergence is very high: 406.6461 [2023-03-07 10:26:20,765][175731] Updated weights for policy 0, policy_version 31590 (0.0006) [2023-03-07 10:26:20,996][175680] KL-divergence is very high: 240.9830 [2023-03-07 10:26:21,556][175731] Updated weights for policy 0, policy_version 31600 (0.0007) [2023-03-07 10:26:22,362][175731] Updated weights for policy 0, policy_version 31610 (0.0006) [2023-03-07 10:26:22,909][175680] KL-divergence is very high: 250.7964 [2023-03-07 10:26:23,171][175731] Updated weights for policy 0, policy_version 31620 (0.0007) [2023-03-07 10:26:23,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12748.8, 300 sec: 12697.6). Total num frames: 32379904. Throughput: 0: 12785.8. Samples: 32379932. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:26:23,322][175405] Avg episode reward: [(0, '103.782')] [2023-03-07 10:26:23,549][175680] KL-divergence is very high: 192.9202 [2023-03-07 10:26:23,804][175680] KL-divergence is very high: 322.5970 [2023-03-07 10:26:23,958][175680] KL-divergence is very high: 127.2462 [2023-03-07 10:26:23,965][175731] Updated weights for policy 0, policy_version 31630 (0.0008) [2023-03-07 10:26:24,031][175680] KL-divergence is very high: 100.6385 [2023-03-07 10:26:24,117][175680] KL-divergence is very high: 109.6421 [2023-03-07 10:26:24,447][175680] KL-divergence is very high: 545.1400 [2023-03-07 10:26:24,764][175680] KL-divergence is very high: 222.1684 [2023-03-07 10:26:24,774][175731] Updated weights for policy 0, policy_version 31640 (0.0006) [2023-03-07 10:26:25,556][175731] Updated weights for policy 0, policy_version 31650 (0.0006) [2023-03-07 10:26:26,357][175731] Updated weights for policy 0, policy_version 31660 (0.0007) [2023-03-07 10:26:27,173][175731] Updated weights for policy 0, policy_version 31670 (0.0007) [2023-03-07 10:26:27,951][175731] Updated weights for policy 0, policy_version 31680 (0.0007) [2023-03-07 10:26:28,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12765.8, 300 sec: 12701.1). Total num frames: 32444416. Throughput: 0: 12789.8. Samples: 32418314. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:26:28,322][175405] Avg episode reward: [(0, '81.631')] [2023-03-07 10:26:28,763][175731] Updated weights for policy 0, policy_version 31690 (0.0006) [2023-03-07 10:26:29,569][175731] Updated weights for policy 0, policy_version 31700 (0.0006) [2023-03-07 10:26:29,794][175680] KL-divergence is very high: 231.8912 [2023-03-07 10:26:29,875][175680] KL-divergence is very high: 136.4165 [2023-03-07 10:26:30,359][175731] Updated weights for policy 0, policy_version 31710 (0.0006) [2023-03-07 10:26:31,146][175731] Updated weights for policy 0, policy_version 31720 (0.0007) [2023-03-07 10:26:31,947][175731] Updated weights for policy 0, policy_version 31730 (0.0006) [2023-03-07 10:26:32,756][175731] Updated weights for policy 0, policy_version 31740 (0.0006) [2023-03-07 10:26:33,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12782.9, 300 sec: 12704.5). Total num frames: 32508928. Throughput: 0: 12799.4. Samples: 32495257. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:26:33,322][175405] Avg episode reward: [(0, '69.221')] [2023-03-07 10:26:33,551][175731] Updated weights for policy 0, policy_version 31750 (0.0006) [2023-03-07 10:26:34,345][175731] Updated weights for policy 0, policy_version 31760 (0.0007) [2023-03-07 10:26:35,144][175731] Updated weights for policy 0, policy_version 31770 (0.0007) [2023-03-07 10:26:35,958][175731] Updated weights for policy 0, policy_version 31780 (0.0006) [2023-03-07 10:26:36,748][175731] Updated weights for policy 0, policy_version 31790 (0.0006) [2023-03-07 10:26:37,545][175731] Updated weights for policy 0, policy_version 31800 (0.0006) [2023-03-07 10:26:38,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12800.0, 300 sec: 12704.5). Total num frames: 32572416. Throughput: 0: 12796.9. Samples: 32572109. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:26:38,322][175405] Avg episode reward: [(0, '139.422')] [2023-03-07 10:26:38,334][175731] Updated weights for policy 0, policy_version 31810 (0.0007) [2023-03-07 10:26:39,138][175731] Updated weights for policy 0, policy_version 31820 (0.0006) [2023-03-07 10:26:39,939][175731] Updated weights for policy 0, policy_version 31830 (0.0006) [2023-03-07 10:26:40,765][175731] Updated weights for policy 0, policy_version 31840 (0.0007) [2023-03-07 10:26:41,574][175731] Updated weights for policy 0, policy_version 31850 (0.0007) [2023-03-07 10:26:42,364][175731] Updated weights for policy 0, policy_version 31860 (0.0006) [2023-03-07 10:26:43,142][175731] Updated weights for policy 0, policy_version 31870 (0.0006) [2023-03-07 10:26:43,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12800.0, 300 sec: 12708.0). Total num frames: 32636928. Throughput: 0: 12797.2. Samples: 32610319. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:26:43,321][175405] Avg episode reward: [(0, '175.743')] [2023-03-07 10:26:43,949][175731] Updated weights for policy 0, policy_version 31880 (0.0006) [2023-03-07 10:26:44,753][175731] Updated weights for policy 0, policy_version 31890 (0.0006) [2023-03-07 10:26:45,546][175731] Updated weights for policy 0, policy_version 31900 (0.0006) [2023-03-07 10:26:46,349][175731] Updated weights for policy 0, policy_version 31910 (0.0006) [2023-03-07 10:26:47,156][175731] Updated weights for policy 0, policy_version 31920 (0.0007) [2023-03-07 10:26:47,959][175731] Updated weights for policy 0, policy_version 31930 (0.0006) [2023-03-07 10:26:48,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12800.0, 300 sec: 12708.0). Total num frames: 32700416. Throughput: 0: 12802.3. Samples: 32687013. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:26:48,322][175405] Avg episode reward: [(0, '149.420')] [2023-03-07 10:26:48,771][175731] Updated weights for policy 0, policy_version 31940 (0.0007) [2023-03-07 10:26:49,583][175731] Updated weights for policy 0, policy_version 31950 (0.0006) [2023-03-07 10:26:50,405][175731] Updated weights for policy 0, policy_version 31960 (0.0006) [2023-03-07 10:26:51,197][175731] Updated weights for policy 0, policy_version 31970 (0.0007) [2023-03-07 10:26:51,993][175731] Updated weights for policy 0, policy_version 31980 (0.0007) [2023-03-07 10:26:52,805][175731] Updated weights for policy 0, policy_version 31990 (0.0006) [2023-03-07 10:26:53,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12782.9, 300 sec: 12708.0). Total num frames: 32763904. Throughput: 0: 12789.8. Samples: 32763319. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:26:53,321][175405] Avg episode reward: [(0, '180.130')] [2023-03-07 10:26:53,638][175731] Updated weights for policy 0, policy_version 32000 (0.0006) [2023-03-07 10:26:54,438][175731] Updated weights for policy 0, policy_version 32010 (0.0006) [2023-03-07 10:26:55,241][175731] Updated weights for policy 0, policy_version 32020 (0.0007) [2023-03-07 10:26:56,049][175731] Updated weights for policy 0, policy_version 32030 (0.0006) [2023-03-07 10:26:56,875][175731] Updated weights for policy 0, policy_version 32040 (0.0007) [2023-03-07 10:26:57,674][175731] Updated weights for policy 0, policy_version 32050 (0.0006) [2023-03-07 10:26:58,321][175405] Fps is (10 sec: 12697.8, 60 sec: 12783.0, 300 sec: 12708.0). Total num frames: 32827392. Throughput: 0: 12778.4. Samples: 32801206. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:26:58,322][175405] Avg episode reward: [(0, '227.369')] [2023-03-07 10:26:58,472][175731] Updated weights for policy 0, policy_version 32060 (0.0006) [2023-03-07 10:26:59,286][175731] Updated weights for policy 0, policy_version 32070 (0.0006) [2023-03-07 10:27:00,073][175731] Updated weights for policy 0, policy_version 32080 (0.0006) [2023-03-07 10:27:00,894][175731] Updated weights for policy 0, policy_version 32090 (0.0006) [2023-03-07 10:27:01,685][175731] Updated weights for policy 0, policy_version 32100 (0.0006) [2023-03-07 10:27:02,492][175731] Updated weights for policy 0, policy_version 32110 (0.0006) [2023-03-07 10:27:03,286][175731] Updated weights for policy 0, policy_version 32120 (0.0006) [2023-03-07 10:27:03,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12782.9, 300 sec: 12708.0). Total num frames: 32890880. Throughput: 0: 12772.8. Samples: 32877619. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:27:03,321][175405] Avg episode reward: [(0, '197.533')] [2023-03-07 10:27:04,104][175731] Updated weights for policy 0, policy_version 32130 (0.0007) [2023-03-07 10:27:04,899][175731] Updated weights for policy 0, policy_version 32140 (0.0007) [2023-03-07 10:27:05,714][175731] Updated weights for policy 0, policy_version 32150 (0.0006) [2023-03-07 10:27:06,516][175731] Updated weights for policy 0, policy_version 32160 (0.0008) [2023-03-07 10:27:07,327][175731] Updated weights for policy 0, policy_version 32170 (0.0006) [2023-03-07 10:27:08,138][175731] Updated weights for policy 0, policy_version 32180 (0.0007) [2023-03-07 10:27:08,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12765.9, 300 sec: 12708.0). Total num frames: 32954368. Throughput: 0: 12747.4. Samples: 32953566. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:27:08,322][175405] Avg episode reward: [(0, '217.378')] [2023-03-07 10:27:08,958][175731] Updated weights for policy 0, policy_version 32190 (0.0006) [2023-03-07 10:27:09,758][175731] Updated weights for policy 0, policy_version 32200 (0.0006) [2023-03-07 10:27:10,573][175731] Updated weights for policy 0, policy_version 32210 (0.0007) [2023-03-07 10:27:11,370][175731] Updated weights for policy 0, policy_version 32220 (0.0007) [2023-03-07 10:27:12,162][175731] Updated weights for policy 0, policy_version 32230 (0.0007) [2023-03-07 10:27:12,963][175731] Updated weights for policy 0, policy_version 32240 (0.0006) [2023-03-07 10:27:13,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12765.9, 300 sec: 12708.0). Total num frames: 33017856. Throughput: 0: 12744.8. Samples: 32991830. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:27:13,322][175405] Avg episode reward: [(0, '168.283')] [2023-03-07 10:27:13,780][175731] Updated weights for policy 0, policy_version 32250 (0.0007) [2023-03-07 10:27:14,590][175731] Updated weights for policy 0, policy_version 32260 (0.0007) [2023-03-07 10:27:15,390][175731] Updated weights for policy 0, policy_version 32270 (0.0006) [2023-03-07 10:27:16,220][175731] Updated weights for policy 0, policy_version 32280 (0.0007) [2023-03-07 10:27:17,034][175731] Updated weights for policy 0, policy_version 32290 (0.0006) [2023-03-07 10:27:17,844][175731] Updated weights for policy 0, policy_version 32300 (0.0006) [2023-03-07 10:27:18,321][175405] Fps is (10 sec: 12697.8, 60 sec: 12748.8, 300 sec: 12708.0). Total num frames: 33081344. Throughput: 0: 12721.5. Samples: 33067722. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:27:18,321][175405] Avg episode reward: [(0, '192.810')] [2023-03-07 10:27:18,636][175731] Updated weights for policy 0, policy_version 32310 (0.0006) [2023-03-07 10:27:19,469][175731] Updated weights for policy 0, policy_version 32320 (0.0006) [2023-03-07 10:27:20,270][175731] Updated weights for policy 0, policy_version 32330 (0.0006) [2023-03-07 10:27:21,071][175731] Updated weights for policy 0, policy_version 32340 (0.0006) [2023-03-07 10:27:21,866][175731] Updated weights for policy 0, policy_version 32350 (0.0005) [2023-03-07 10:27:22,665][175731] Updated weights for policy 0, policy_version 32360 (0.0006) [2023-03-07 10:27:23,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12748.8, 300 sec: 12708.0). Total num frames: 33144832. Throughput: 0: 12706.3. Samples: 33143892. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:27:23,322][175405] Avg episode reward: [(0, '181.481')] [2023-03-07 10:27:23,478][175731] Updated weights for policy 0, policy_version 32370 (0.0006) [2023-03-07 10:27:24,277][175731] Updated weights for policy 0, policy_version 32380 (0.0007) [2023-03-07 10:27:25,079][175731] Updated weights for policy 0, policy_version 32390 (0.0007) [2023-03-07 10:27:25,881][175731] Updated weights for policy 0, policy_version 32400 (0.0006) [2023-03-07 10:27:26,686][175731] Updated weights for policy 0, policy_version 32410 (0.0007) [2023-03-07 10:27:27,484][175731] Updated weights for policy 0, policy_version 32420 (0.0006) [2023-03-07 10:27:28,283][175731] Updated weights for policy 0, policy_version 32430 (0.0008) [2023-03-07 10:27:28,321][175405] Fps is (10 sec: 12697.4, 60 sec: 12731.7, 300 sec: 12708.0). Total num frames: 33208320. Throughput: 0: 12708.1. Samples: 33182185. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:27:28,322][175405] Avg episode reward: [(0, '175.128')] [2023-03-07 10:27:29,074][175731] Updated weights for policy 0, policy_version 32440 (0.0007) [2023-03-07 10:27:29,878][175731] Updated weights for policy 0, policy_version 32450 (0.0006) [2023-03-07 10:27:30,676][175731] Updated weights for policy 0, policy_version 32460 (0.0007) [2023-03-07 10:27:31,473][175731] Updated weights for policy 0, policy_version 32470 (0.0006) [2023-03-07 10:27:32,270][175731] Updated weights for policy 0, policy_version 32480 (0.0006) [2023-03-07 10:27:33,071][175731] Updated weights for policy 0, policy_version 32490 (0.0006) [2023-03-07 10:27:33,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12711.5). Total num frames: 33271808. Throughput: 0: 12712.0. Samples: 33259053. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:27:33,332][175405] Avg episode reward: [(0, '111.003')] [2023-03-07 10:27:33,889][175731] Updated weights for policy 0, policy_version 32500 (0.0007) [2023-03-07 10:27:34,670][175731] Updated weights for policy 0, policy_version 32510 (0.0006) [2023-03-07 10:27:35,482][175731] Updated weights for policy 0, policy_version 32520 (0.0006) [2023-03-07 10:27:36,286][175731] Updated weights for policy 0, policy_version 32530 (0.0006) [2023-03-07 10:27:37,076][175731] Updated weights for policy 0, policy_version 32540 (0.0007) [2023-03-07 10:27:37,872][175731] Updated weights for policy 0, policy_version 32550 (0.0007) [2023-03-07 10:27:38,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12731.8, 300 sec: 12715.0). Total num frames: 33336320. Throughput: 0: 12724.9. Samples: 33335939. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:27:38,332][175405] Avg episode reward: [(0, '100.752')] [2023-03-07 10:27:38,663][175731] Updated weights for policy 0, policy_version 32560 (0.0007) [2023-03-07 10:27:39,463][175731] Updated weights for policy 0, policy_version 32570 (0.0006) [2023-03-07 10:27:40,256][175731] Updated weights for policy 0, policy_version 32580 (0.0006) [2023-03-07 10:27:41,037][175731] Updated weights for policy 0, policy_version 32590 (0.0007) [2023-03-07 10:27:41,854][175731] Updated weights for policy 0, policy_version 32600 (0.0006) [2023-03-07 10:27:42,629][175731] Updated weights for policy 0, policy_version 32610 (0.0006) [2023-03-07 10:27:43,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12731.7, 300 sec: 12718.4). Total num frames: 33400832. Throughput: 0: 12742.8. Samples: 33374630. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:27:43,332][175405] Avg episode reward: [(0, '90.884')] [2023-03-07 10:27:43,454][175731] Updated weights for policy 0, policy_version 32620 (0.0007) [2023-03-07 10:27:44,258][175731] Updated weights for policy 0, policy_version 32630 (0.0006) [2023-03-07 10:27:45,072][175731] Updated weights for policy 0, policy_version 32640 (0.0006) [2023-03-07 10:27:45,855][175731] Updated weights for policy 0, policy_version 32650 (0.0006) [2023-03-07 10:27:46,648][175731] Updated weights for policy 0, policy_version 32660 (0.0006) [2023-03-07 10:27:47,459][175731] Updated weights for policy 0, policy_version 32670 (0.0008) [2023-03-07 10:27:48,257][175731] Updated weights for policy 0, policy_version 32680 (0.0006) [2023-03-07 10:27:48,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12731.7, 300 sec: 12718.4). Total num frames: 33464320. Throughput: 0: 12749.1. Samples: 33451333. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:27:48,332][175405] Avg episode reward: [(0, '139.268')] [2023-03-07 10:27:48,336][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000032681_33465344.pth... [2023-03-07 10:27:48,368][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000029698_30410752.pth [2023-03-07 10:27:49,075][175731] Updated weights for policy 0, policy_version 32690 (0.0006) [2023-03-07 10:27:49,869][175731] Updated weights for policy 0, policy_version 32700 (0.0006) [2023-03-07 10:27:50,655][175731] Updated weights for policy 0, policy_version 32710 (0.0006) [2023-03-07 10:27:51,474][175731] Updated weights for policy 0, policy_version 32720 (0.0007) [2023-03-07 10:27:52,278][175731] Updated weights for policy 0, policy_version 32730 (0.0006) [2023-03-07 10:27:53,068][175731] Updated weights for policy 0, policy_version 32740 (0.0006) [2023-03-07 10:27:53,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12748.8, 300 sec: 12721.9). Total num frames: 33528832. Throughput: 0: 12765.3. Samples: 33528005. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:27:53,322][175405] Avg episode reward: [(0, '105.595')] [2023-03-07 10:27:53,845][175731] Updated weights for policy 0, policy_version 32750 (0.0006) [2023-03-07 10:27:54,661][175731] Updated weights for policy 0, policy_version 32760 (0.0006) [2023-03-07 10:27:55,451][175731] Updated weights for policy 0, policy_version 32770 (0.0006) [2023-03-07 10:27:56,245][175731] Updated weights for policy 0, policy_version 32780 (0.0006) [2023-03-07 10:27:57,062][175731] Updated weights for policy 0, policy_version 32790 (0.0006) [2023-03-07 10:27:57,863][175731] Updated weights for policy 0, policy_version 32800 (0.0007) [2023-03-07 10:27:58,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12748.8, 300 sec: 12721.9). Total num frames: 33592320. Throughput: 0: 12773.8. 
Samples: 33566652. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:27:58,322][175405] Avg episode reward: [(0, '177.251')] [2023-03-07 10:27:58,661][175731] Updated weights for policy 0, policy_version 32810 (0.0007) [2023-03-07 10:27:59,480][175731] Updated weights for policy 0, policy_version 32820 (0.0007) [2023-03-07 10:28:00,293][175731] Updated weights for policy 0, policy_version 32830 (0.0007) [2023-03-07 10:28:01,098][175731] Updated weights for policy 0, policy_version 32840 (0.0007) [2023-03-07 10:28:01,897][175731] Updated weights for policy 0, policy_version 32850 (0.0005) [2023-03-07 10:28:02,681][175731] Updated weights for policy 0, policy_version 32860 (0.0006) [2023-03-07 10:28:03,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12748.8, 300 sec: 12721.9). Total num frames: 33655808. Throughput: 0: 12782.7. Samples: 33642946. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:28:03,321][175405] Avg episode reward: [(0, '194.086')] [2023-03-07 10:28:03,490][175731] Updated weights for policy 0, policy_version 32870 (0.0006) [2023-03-07 10:28:04,313][175731] Updated weights for policy 0, policy_version 32880 (0.0006) [2023-03-07 10:28:05,107][175731] Updated weights for policy 0, policy_version 32890 (0.0007) [2023-03-07 10:28:05,915][175731] Updated weights for policy 0, policy_version 32900 (0.0007) [2023-03-07 10:28:06,703][175731] Updated weights for policy 0, policy_version 32910 (0.0006) [2023-03-07 10:28:07,507][175731] Updated weights for policy 0, policy_version 32920 (0.0007) [2023-03-07 10:28:08,315][175731] Updated weights for policy 0, policy_version 32930 (0.0007) [2023-03-07 10:28:08,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12765.9, 300 sec: 12725.4). Total num frames: 33720320. Throughput: 0: 12790.8. Samples: 33719478. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:28:08,321][175405] Avg episode reward: [(0, '143.075')] [2023-03-07 10:28:09,108][175731] Updated weights for policy 0, policy_version 32940 (0.0006) [2023-03-07 10:28:09,934][175731] Updated weights for policy 0, policy_version 32950 (0.0007) [2023-03-07 10:28:10,722][175731] Updated weights for policy 0, policy_version 32960 (0.0007) [2023-03-07 10:28:11,516][175731] Updated weights for policy 0, policy_version 32970 (0.0007) [2023-03-07 10:28:12,317][175731] Updated weights for policy 0, policy_version 32980 (0.0006) [2023-03-07 10:28:13,112][175731] Updated weights for policy 0, policy_version 32990 (0.0006) [2023-03-07 10:28:13,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12765.9, 300 sec: 12725.4). Total num frames: 33783808. Throughput: 0: 12789.4. Samples: 33757707. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:28:13,322][175405] Avg episode reward: [(0, '123.416')] [2023-03-07 10:28:13,913][175731] Updated weights for policy 0, policy_version 33000 (0.0006) [2023-03-07 10:28:14,736][175731] Updated weights for policy 0, policy_version 33010 (0.0006) [2023-03-07 10:28:15,524][175731] Updated weights for policy 0, policy_version 33020 (0.0007) [2023-03-07 10:28:16,331][175731] Updated weights for policy 0, policy_version 33030 (0.0006) [2023-03-07 10:28:17,132][175731] Updated weights for policy 0, policy_version 33040 (0.0006) [2023-03-07 10:28:17,930][175731] Updated weights for policy 0, policy_version 33050 (0.0006) [2023-03-07 10:28:18,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12782.9, 300 sec: 12728.8). Total num frames: 33848320. Throughput: 0: 12785.7. Samples: 33834410. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:28:18,322][175405] Avg episode reward: [(0, '105.157')] [2023-03-07 10:28:18,707][175731] Updated weights for policy 0, policy_version 33060 (0.0006) [2023-03-07 10:28:19,529][175731] Updated weights for policy 0, policy_version 33070 (0.0007) [2023-03-07 10:28:20,321][175731] Updated weights for policy 0, policy_version 33080 (0.0007) [2023-03-07 10:28:21,122][175731] Updated weights for policy 0, policy_version 33090 (0.0005) [2023-03-07 10:28:21,914][175731] Updated weights for policy 0, policy_version 33100 (0.0007) [2023-03-07 10:28:22,738][175731] Updated weights for policy 0, policy_version 33110 (0.0007) [2023-03-07 10:28:23,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12783.0, 300 sec: 12728.8). Total num frames: 33911808. Throughput: 0: 12784.7. Samples: 33911252. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:28:23,321][175405] Avg episode reward: [(0, '102.774')] [2023-03-07 10:28:23,523][175731] Updated weights for policy 0, policy_version 33120 (0.0007) [2023-03-07 10:28:24,305][175731] Updated weights for policy 0, policy_version 33130 (0.0006) [2023-03-07 10:28:25,119][175731] Updated weights for policy 0, policy_version 33140 (0.0007) [2023-03-07 10:28:25,908][175731] Updated weights for policy 0, policy_version 33150 (0.0006) [2023-03-07 10:28:26,726][175731] Updated weights for policy 0, policy_version 33160 (0.0006) [2023-03-07 10:28:27,521][175731] Updated weights for policy 0, policy_version 33170 (0.0007) [2023-03-07 10:28:28,318][175731] Updated weights for policy 0, policy_version 33180 (0.0007) [2023-03-07 10:28:28,321][175405] Fps is (10 sec: 12800.3, 60 sec: 12800.0, 300 sec: 12732.3). Total num frames: 33976320. Throughput: 0: 12782.6. Samples: 33949844. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:28:28,329][175405] Avg episode reward: [(0, '108.863')] [2023-03-07 10:28:29,134][175731] Updated weights for policy 0, policy_version 33190 (0.0006) [2023-03-07 10:28:29,951][175731] Updated weights for policy 0, policy_version 33200 (0.0007) [2023-03-07 10:28:30,731][175731] Updated weights for policy 0, policy_version 33210 (0.0007) [2023-03-07 10:28:31,534][175731] Updated weights for policy 0, policy_version 33220 (0.0007) [2023-03-07 10:28:32,350][175731] Updated weights for policy 0, policy_version 33230 (0.0006) [2023-03-07 10:28:33,149][175731] Updated weights for policy 0, policy_version 33240 (0.0007) [2023-03-07 10:28:33,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12800.0, 300 sec: 12732.3). Total num frames: 34039808. Throughput: 0: 12778.7. Samples: 34026370. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:28:33,332][175405] Avg episode reward: [(0, '107.373')] [2023-03-07 10:28:33,938][175731] Updated weights for policy 0, policy_version 33250 (0.0006) [2023-03-07 10:28:34,752][175731] Updated weights for policy 0, policy_version 33260 (0.0006) [2023-03-07 10:28:35,536][175731] Updated weights for policy 0, policy_version 33270 (0.0007) [2023-03-07 10:28:36,336][175731] Updated weights for policy 0, policy_version 33280 (0.0006) [2023-03-07 10:28:37,133][175731] Updated weights for policy 0, policy_version 33290 (0.0007) [2023-03-07 10:28:37,931][175731] Updated weights for policy 0, policy_version 33300 (0.0007) [2023-03-07 10:28:38,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12782.9, 300 sec: 12728.8). Total num frames: 34103296. Throughput: 0: 12781.6. Samples: 34103178. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:28:38,332][175405] Avg episode reward: [(0, '74.166')] [2023-03-07 10:28:38,734][175731] Updated weights for policy 0, policy_version 33310 (0.0007) [2023-03-07 10:28:39,548][175731] Updated weights for policy 0, policy_version 33320 (0.0005) [2023-03-07 10:28:40,337][175731] Updated weights for policy 0, policy_version 33330 (0.0006) [2023-03-07 10:28:41,143][175731] Updated weights for policy 0, policy_version 33340 (0.0006) [2023-03-07 10:28:41,962][175731] Updated weights for policy 0, policy_version 33350 (0.0007) [2023-03-07 10:28:42,367][175680] KL-divergence is very high: 436.3680 [2023-03-07 10:28:42,768][175731] Updated weights for policy 0, policy_version 33360 (0.0007) [2023-03-07 10:28:43,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12765.9, 300 sec: 12732.3). Total num frames: 34166784. Throughput: 0: 12773.0. Samples: 34141437. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:28:43,332][175405] Avg episode reward: [(0, '90.456')] [2023-03-07 10:28:43,554][175731] Updated weights for policy 0, policy_version 33370 (0.0006) [2023-03-07 10:28:44,356][175731] Updated weights for policy 0, policy_version 33380 (0.0007) [2023-03-07 10:28:45,144][175731] Updated weights for policy 0, policy_version 33390 (0.0007) [2023-03-07 10:28:45,946][175731] Updated weights for policy 0, policy_version 33400 (0.0006) [2023-03-07 10:28:46,741][175731] Updated weights for policy 0, policy_version 33410 (0.0006) [2023-03-07 10:28:47,547][175731] Updated weights for policy 0, policy_version 33420 (0.0006) [2023-03-07 10:28:48,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12782.9, 300 sec: 12739.3). Total num frames: 34231296. Throughput: 0: 12780.1. Samples: 34218055. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:28:48,332][175405] Avg episode reward: [(0, '88.074')] [2023-03-07 10:28:48,350][175731] Updated weights for policy 0, policy_version 33430 (0.0006) [2023-03-07 10:28:49,150][175731] Updated weights for policy 0, policy_version 33440 (0.0006) [2023-03-07 10:28:49,960][175731] Updated weights for policy 0, policy_version 33450 (0.0007) [2023-03-07 10:28:50,733][175731] Updated weights for policy 0, policy_version 33460 (0.0006) [2023-03-07 10:28:51,525][175731] Updated weights for policy 0, policy_version 33470 (0.0007) [2023-03-07 10:28:52,350][175731] Updated weights for policy 0, policy_version 33480 (0.0006) [2023-03-07 10:28:53,139][175731] Updated weights for policy 0, policy_version 33490 (0.0006) [2023-03-07 10:28:53,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12783.0, 300 sec: 12739.3). Total num frames: 34295808. Throughput: 0: 12787.7. Samples: 34294925. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:28:53,332][175405] Avg episode reward: [(0, '158.433')] [2023-03-07 10:28:53,950][175731] Updated weights for policy 0, policy_version 33500 (0.0006) [2023-03-07 10:28:54,763][175731] Updated weights for policy 0, policy_version 33510 (0.0007) [2023-03-07 10:28:55,549][175731] Updated weights for policy 0, policy_version 33520 (0.0007) [2023-03-07 10:28:56,340][175731] Updated weights for policy 0, policy_version 33530 (0.0006) [2023-03-07 10:28:57,166][175731] Updated weights for policy 0, policy_version 33540 (0.0006) [2023-03-07 10:28:57,947][175731] Updated weights for policy 0, policy_version 33550 (0.0006) [2023-03-07 10:28:58,321][175405] Fps is (10 sec: 12800.2, 60 sec: 12782.9, 300 sec: 12739.3). Total num frames: 34359296. Throughput: 0: 12793.3. Samples: 34333405. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:28:58,332][175405] Avg episode reward: [(0, '172.385')] [2023-03-07 10:28:58,758][175731] Updated weights for policy 0, policy_version 33560 (0.0006) [2023-03-07 10:28:59,565][175731] Updated weights for policy 0, policy_version 33570 (0.0007) [2023-03-07 10:29:00,353][175731] Updated weights for policy 0, policy_version 33580 (0.0006) [2023-03-07 10:29:01,164][175731] Updated weights for policy 0, policy_version 33590 (0.0006) [2023-03-07 10:29:01,954][175731] Updated weights for policy 0, policy_version 33600 (0.0006) [2023-03-07 10:29:02,760][175731] Updated weights for policy 0, policy_version 33610 (0.0007) [2023-03-07 10:29:03,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12800.0, 300 sec: 12746.2). Total num frames: 34423808. Throughput: 0: 12790.9. Samples: 34410000. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:29:03,332][175405] Avg episode reward: [(0, '240.066')] [2023-03-07 10:29:03,559][175731] Updated weights for policy 0, policy_version 33620 (0.0007) [2023-03-07 10:29:04,355][175731] Updated weights for policy 0, policy_version 33630 (0.0007) [2023-03-07 10:29:05,182][175731] Updated weights for policy 0, policy_version 33640 (0.0006) [2023-03-07 10:29:05,983][175731] Updated weights for policy 0, policy_version 33650 (0.0007) [2023-03-07 10:29:06,798][175731] Updated weights for policy 0, policy_version 33660 (0.0007) [2023-03-07 10:29:07,618][175731] Updated weights for policy 0, policy_version 33670 (0.0007) [2023-03-07 10:29:08,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12765.9, 300 sec: 12742.7). Total num frames: 34486272. Throughput: 0: 12775.5. Samples: 34486149. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:29:08,332][175405] Avg episode reward: [(0, '184.597')] [2023-03-07 10:29:08,402][175731] Updated weights for policy 0, policy_version 33680 (0.0007) [2023-03-07 10:29:09,227][175731] Updated weights for policy 0, policy_version 33690 (0.0007) [2023-03-07 10:29:10,024][175731] Updated weights for policy 0, policy_version 33700 (0.0006) [2023-03-07 10:29:10,822][175731] Updated weights for policy 0, policy_version 33710 (0.0006) [2023-03-07 10:29:11,622][175731] Updated weights for policy 0, policy_version 33720 (0.0006) [2023-03-07 10:29:12,430][175731] Updated weights for policy 0, policy_version 33730 (0.0007) [2023-03-07 10:29:13,226][175731] Updated weights for policy 0, policy_version 33740 (0.0006) [2023-03-07 10:29:13,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12782.9, 300 sec: 12746.2). Total num frames: 34550784. Throughput: 0: 12767.2. Samples: 34524370. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:29:13,332][175405] Avg episode reward: [(0, '189.153')] [2023-03-07 10:29:14,026][175731] Updated weights for policy 0, policy_version 33750 (0.0006) [2023-03-07 10:29:14,818][175731] Updated weights for policy 0, policy_version 33760 (0.0007) [2023-03-07 10:29:15,625][175731] Updated weights for policy 0, policy_version 33770 (0.0006) [2023-03-07 10:29:16,434][175731] Updated weights for policy 0, policy_version 33780 (0.0006) [2023-03-07 10:29:17,245][175731] Updated weights for policy 0, policy_version 33790 (0.0006) [2023-03-07 10:29:18,034][175731] Updated weights for policy 0, policy_version 33800 (0.0005) [2023-03-07 10:29:18,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12765.9, 300 sec: 12746.2). Total num frames: 34614272. Throughput: 0: 12772.7. Samples: 34601145. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:29:18,332][175405] Avg episode reward: [(0, '187.160')] [2023-03-07 10:29:18,853][175731] Updated weights for policy 0, policy_version 33810 (0.0006) [2023-03-07 10:29:19,662][175731] Updated weights for policy 0, policy_version 33820 (0.0006) [2023-03-07 10:29:20,448][175731] Updated weights for policy 0, policy_version 33830 (0.0006) [2023-03-07 10:29:21,250][175731] Updated weights for policy 0, policy_version 33840 (0.0006) [2023-03-07 10:29:22,049][175731] Updated weights for policy 0, policy_version 33850 (0.0008) [2023-03-07 10:29:22,861][175731] Updated weights for policy 0, policy_version 33860 (0.0007) [2023-03-07 10:29:23,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12765.9, 300 sec: 12746.2). Total num frames: 34677760. Throughput: 0: 12764.3. Samples: 34677574. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:29:23,322][175405] Avg episode reward: [(0, '146.452')] [2023-03-07 10:29:23,647][175731] Updated weights for policy 0, policy_version 33870 (0.0006) [2023-03-07 10:29:24,443][175731] Updated weights for policy 0, policy_version 33880 (0.0007) [2023-03-07 10:29:25,246][175731] Updated weights for policy 0, policy_version 33890 (0.0006) [2023-03-07 10:29:26,045][175731] Updated weights for policy 0, policy_version 33900 (0.0007) [2023-03-07 10:29:26,859][175731] Updated weights for policy 0, policy_version 33910 (0.0007) [2023-03-07 10:29:27,680][175731] Updated weights for policy 0, policy_version 33920 (0.0007) [2023-03-07 10:29:28,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12765.8, 300 sec: 12749.7). Total num frames: 34742272. Throughput: 0: 12767.5. Samples: 34715974. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:29:28,322][175405] Avg episode reward: [(0, '193.078')] [2023-03-07 10:29:28,470][175731] Updated weights for policy 0, policy_version 33930 (0.0006) [2023-03-07 10:29:29,280][175731] Updated weights for policy 0, policy_version 33940 (0.0007) [2023-03-07 10:29:30,066][175731] Updated weights for policy 0, policy_version 33950 (0.0007) [2023-03-07 10:29:30,874][175731] Updated weights for policy 0, policy_version 33960 (0.0006) [2023-03-07 10:29:31,662][175731] Updated weights for policy 0, policy_version 33970 (0.0006) [2023-03-07 10:29:32,474][175731] Updated weights for policy 0, policy_version 33980 (0.0006) [2023-03-07 10:29:33,263][175731] Updated weights for policy 0, policy_version 33990 (0.0006) [2023-03-07 10:29:33,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12765.9, 300 sec: 12749.7). Total num frames: 34805760. Throughput: 0: 12764.9. Samples: 34792475. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:29:33,321][175405] Avg episode reward: [(0, '225.400')] [2023-03-07 10:29:34,070][175731] Updated weights for policy 0, policy_version 34000 (0.0006) [2023-03-07 10:29:34,866][175731] Updated weights for policy 0, policy_version 34010 (0.0006) [2023-03-07 10:29:35,645][175731] Updated weights for policy 0, policy_version 34020 (0.0006) [2023-03-07 10:29:36,465][175731] Updated weights for policy 0, policy_version 34030 (0.0006) [2023-03-07 10:29:37,293][175731] Updated weights for policy 0, policy_version 34040 (0.0007) [2023-03-07 10:29:38,094][175731] Updated weights for policy 0, policy_version 34050 (0.0006) [2023-03-07 10:29:38,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12765.9, 300 sec: 12753.1). Total num frames: 34869248. Throughput: 0: 12758.9. Samples: 34869075. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:29:38,322][175405] Avg episode reward: [(0, '185.695')] [2023-03-07 10:29:38,902][175731] Updated weights for policy 0, policy_version 34060 (0.0006) [2023-03-07 10:29:39,679][175731] Updated weights for policy 0, policy_version 34070 (0.0006) [2023-03-07 10:29:40,502][175731] Updated weights for policy 0, policy_version 34080 (0.0006) [2023-03-07 10:29:41,284][175731] Updated weights for policy 0, policy_version 34090 (0.0007) [2023-03-07 10:29:42,092][175731] Updated weights for policy 0, policy_version 34100 (0.0006) [2023-03-07 10:29:42,895][175731] Updated weights for policy 0, policy_version 34110 (0.0006) [2023-03-07 10:29:43,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12782.9, 300 sec: 12756.6). Total num frames: 34933760. Throughput: 0: 12758.2. Samples: 34907522. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:29:43,321][175405] Avg episode reward: [(0, '170.241')] [2023-03-07 10:29:43,696][175731] Updated weights for policy 0, policy_version 34120 (0.0005) [2023-03-07 10:29:44,517][175731] Updated weights for policy 0, policy_version 34130 (0.0007) [2023-03-07 10:29:45,317][175731] Updated weights for policy 0, policy_version 34140 (0.0005) [2023-03-07 10:29:46,118][175731] Updated weights for policy 0, policy_version 34150 (0.0006) [2023-03-07 10:29:46,933][175731] Updated weights for policy 0, policy_version 34160 (0.0006) [2023-03-07 10:29:47,730][175731] Updated weights for policy 0, policy_version 34170 (0.0007) [2023-03-07 10:29:48,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12765.9, 300 sec: 12756.6). Total num frames: 34997248. Throughput: 0: 12748.6. Samples: 34983686. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:29:48,321][175405] Avg episode reward: [(0, '188.836')] [2023-03-07 10:29:48,326][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000034177_34997248.pth... 
[2023-03-07 10:29:48,356][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000031185_31933440.pth [2023-03-07 10:29:48,545][175731] Updated weights for policy 0, policy_version 34180 (0.0006) [2023-03-07 10:29:49,355][175731] Updated weights for policy 0, policy_version 34190 (0.0006) [2023-03-07 10:29:50,159][175731] Updated weights for policy 0, policy_version 34200 (0.0006) [2023-03-07 10:29:50,950][175731] Updated weights for policy 0, policy_version 34210 (0.0006) [2023-03-07 10:29:51,761][175731] Updated weights for policy 0, policy_version 34220 (0.0007) [2023-03-07 10:29:52,570][175731] Updated weights for policy 0, policy_version 34230 (0.0007) [2023-03-07 10:29:53,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12748.8, 300 sec: 12756.6). Total num frames: 35060736. Throughput: 0: 12756.0. Samples: 35060170. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:29:53,322][175405] Avg episode reward: [(0, '215.859')] [2023-03-07 10:29:53,378][175731] Updated weights for policy 0, policy_version 34240 (0.0007) [2023-03-07 10:29:54,173][175731] Updated weights for policy 0, policy_version 34250 (0.0006) [2023-03-07 10:29:54,975][175731] Updated weights for policy 0, policy_version 34260 (0.0007) [2023-03-07 10:29:55,778][175731] Updated weights for policy 0, policy_version 34270 (0.0007) [2023-03-07 10:29:56,569][175731] Updated weights for policy 0, policy_version 34280 (0.0007) [2023-03-07 10:29:57,378][175731] Updated weights for policy 0, policy_version 34290 (0.0007) [2023-03-07 10:29:58,189][175731] Updated weights for policy 0, policy_version 34300 (0.0006) [2023-03-07 10:29:58,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12748.8, 300 sec: 12756.6). Total num frames: 35124224. Throughput: 0: 12757.4. Samples: 35098455. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:29:58,321][175405] Avg episode reward: [(0, '238.150')] [2023-03-07 10:29:58,985][175731] Updated weights for policy 0, policy_version 34310 (0.0005) [2023-03-07 10:29:59,797][175731] Updated weights for policy 0, policy_version 34320 (0.0006) [2023-03-07 10:30:00,598][175731] Updated weights for policy 0, policy_version 34330 (0.0006) [2023-03-07 10:30:01,412][175731] Updated weights for policy 0, policy_version 34340 (0.0006) [2023-03-07 10:30:02,213][175731] Updated weights for policy 0, policy_version 34350 (0.0006) [2023-03-07 10:30:03,018][175731] Updated weights for policy 0, policy_version 34360 (0.0006) [2023-03-07 10:30:03,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12756.6). Total num frames: 35187712. Throughput: 0: 12750.7. Samples: 35174928. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:30:03,321][175405] Avg episode reward: [(0, '215.826')] [2023-03-07 10:30:03,807][175731] Updated weights for policy 0, policy_version 34370 (0.0006) [2023-03-07 10:30:04,622][175731] Updated weights for policy 0, policy_version 34380 (0.0006) [2023-03-07 10:30:05,400][175731] Updated weights for policy 0, policy_version 34390 (0.0006) [2023-03-07 10:30:06,208][175731] Updated weights for policy 0, policy_version 34400 (0.0006) [2023-03-07 10:30:07,001][175731] Updated weights for policy 0, policy_version 34410 (0.0006) [2023-03-07 10:30:07,809][175731] Updated weights for policy 0, policy_version 34420 (0.0006) [2023-03-07 10:30:08,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12765.9, 300 sec: 12760.1). Total num frames: 35252224. Throughput: 0: 12753.6. Samples: 35251485. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:30:08,321][175405] Avg episode reward: [(0, '172.806')] [2023-03-07 10:30:08,625][175731] Updated weights for policy 0, policy_version 34430 (0.0007) [2023-03-07 10:30:09,428][175731] Updated weights for policy 0, policy_version 34440 (0.0006) [2023-03-07 10:30:10,230][175731] Updated weights for policy 0, policy_version 34450 (0.0006) [2023-03-07 10:30:11,048][175731] Updated weights for policy 0, policy_version 34460 (0.0007) [2023-03-07 10:30:11,861][175731] Updated weights for policy 0, policy_version 34470 (0.0006) [2023-03-07 10:30:12,658][175731] Updated weights for policy 0, policy_version 34480 (0.0006) [2023-03-07 10:30:13,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12760.1). Total num frames: 35315712. Throughput: 0: 12742.8. Samples: 35289398. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:30:13,322][175405] Avg episode reward: [(0, '201.308')] [2023-03-07 10:30:13,470][175731] Updated weights for policy 0, policy_version 34490 (0.0007) [2023-03-07 10:30:14,266][175731] Updated weights for policy 0, policy_version 34500 (0.0006) [2023-03-07 10:30:15,065][175731] Updated weights for policy 0, policy_version 34510 (0.0007) [2023-03-07 10:30:15,857][175731] Updated weights for policy 0, policy_version 34520 (0.0007) [2023-03-07 10:30:16,665][175731] Updated weights for policy 0, policy_version 34530 (0.0007) [2023-03-07 10:30:17,455][175731] Updated weights for policy 0, policy_version 34540 (0.0006) [2023-03-07 10:30:18,266][175731] Updated weights for policy 0, policy_version 34550 (0.0007) [2023-03-07 10:30:18,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12748.8, 300 sec: 12760.1). Total num frames: 35379200. Throughput: 0: 12744.3. Samples: 35365966. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:30:18,321][175405] Avg episode reward: [(0, '206.027')] [2023-03-07 10:30:19,067][175731] Updated weights for policy 0, policy_version 34560 (0.0006) [2023-03-07 10:30:19,859][175731] Updated weights for policy 0, policy_version 34570 (0.0006) [2023-03-07 10:30:20,660][175731] Updated weights for policy 0, policy_version 34580 (0.0007) [2023-03-07 10:30:21,444][175731] Updated weights for policy 0, policy_version 34590 (0.0007) [2023-03-07 10:30:22,258][175731] Updated weights for policy 0, policy_version 34600 (0.0006) [2023-03-07 10:30:23,043][175731] Updated weights for policy 0, policy_version 34610 (0.0006) [2023-03-07 10:30:23,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12765.9, 300 sec: 12763.5). Total num frames: 35443712. Throughput: 0: 12755.9. Samples: 35443089. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:30:23,322][175405] Avg episode reward: [(0, '216.573')] [2023-03-07 10:30:23,863][175731] Updated weights for policy 0, policy_version 34620 (0.0006) [2023-03-07 10:30:24,657][175731] Updated weights for policy 0, policy_version 34630 (0.0007) [2023-03-07 10:30:25,458][175731] Updated weights for policy 0, policy_version 34640 (0.0006) [2023-03-07 10:30:26,255][175731] Updated weights for policy 0, policy_version 34650 (0.0006) [2023-03-07 10:30:27,082][175731] Updated weights for policy 0, policy_version 34660 (0.0006) [2023-03-07 10:30:27,860][175731] Updated weights for policy 0, policy_version 34670 (0.0006) [2023-03-07 10:30:28,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12748.8, 300 sec: 12763.5). Total num frames: 35507200. Throughput: 0: 12748.2. Samples: 35481195. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:30:28,322][175405] Avg episode reward: [(0, '281.701')] [2023-03-07 10:30:28,326][175680] Saving new best policy, reward=281.701! 
[2023-03-07 10:30:28,687][175731] Updated weights for policy 0, policy_version 34680 (0.0007) [2023-03-07 10:30:29,482][175731] Updated weights for policy 0, policy_version 34690 (0.0006) [2023-03-07 10:30:30,283][175731] Updated weights for policy 0, policy_version 34700 (0.0006) [2023-03-07 10:30:31,104][175731] Updated weights for policy 0, policy_version 34710 (0.0007) [2023-03-07 10:30:31,902][175731] Updated weights for policy 0, policy_version 34720 (0.0006) [2023-03-07 10:30:32,718][175731] Updated weights for policy 0, policy_version 34730 (0.0006) [2023-03-07 10:30:33,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12748.8, 300 sec: 12767.0). Total num frames: 35570688. Throughput: 0: 12752.2. Samples: 35557537. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:30:33,322][175405] Avg episode reward: [(0, '233.272')] [2023-03-07 10:30:33,528][175731] Updated weights for policy 0, policy_version 34740 (0.0006) [2023-03-07 10:30:34,330][175731] Updated weights for policy 0, policy_version 34750 (0.0006) [2023-03-07 10:30:35,136][175731] Updated weights for policy 0, policy_version 34760 (0.0007) [2023-03-07 10:30:35,924][175731] Updated weights for policy 0, policy_version 34770 (0.0006) [2023-03-07 10:30:36,718][175731] Updated weights for policy 0, policy_version 34780 (0.0006) [2023-03-07 10:30:37,532][175731] Updated weights for policy 0, policy_version 34790 (0.0006) [2023-03-07 10:30:38,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12748.8, 300 sec: 12763.5). Total num frames: 35634176. Throughput: 0: 12752.6. Samples: 35634037. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:30:38,322][175405] Avg episode reward: [(0, '178.678')] [2023-03-07 10:30:38,330][175731] Updated weights for policy 0, policy_version 34800 (0.0006) [2023-03-07 10:30:39,133][175731] Updated weights for policy 0, policy_version 34810 (0.0006) [2023-03-07 10:30:39,934][175731] Updated weights for policy 0, policy_version 34820 (0.0006) [2023-03-07 10:30:40,729][175731] Updated weights for policy 0, policy_version 34830 (0.0007) [2023-03-07 10:30:41,525][175731] Updated weights for policy 0, policy_version 34840 (0.0006) [2023-03-07 10:30:42,320][175731] Updated weights for policy 0, policy_version 34850 (0.0007) [2023-03-07 10:30:43,106][175731] Updated weights for policy 0, policy_version 34860 (0.0007) [2023-03-07 10:30:43,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12767.0). Total num frames: 35698688. Throughput: 0: 12757.2. Samples: 35672529. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:30:43,322][175405] Avg episode reward: [(0, '38.468')] [2023-03-07 10:30:43,900][175731] Updated weights for policy 0, policy_version 34870 (0.0007) [2023-03-07 10:30:44,702][175731] Updated weights for policy 0, policy_version 34880 (0.0007) [2023-03-07 10:30:45,501][175731] Updated weights for policy 0, policy_version 34890 (0.0007) [2023-03-07 10:30:46,312][175731] Updated weights for policy 0, policy_version 34900 (0.0008) [2023-03-07 10:30:47,107][175731] Updated weights for policy 0, policy_version 34910 (0.0006) [2023-03-07 10:30:47,912][175731] Updated weights for policy 0, policy_version 34920 (0.0007) [2023-03-07 10:30:48,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12765.8, 300 sec: 12767.0). Total num frames: 35763200. Throughput: 0: 12769.8. Samples: 35749571. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:30:48,322][175405] Avg episode reward: [(0, '26.389')] [2023-03-07 10:30:48,694][175731] Updated weights for policy 0, policy_version 34930 (0.0006) [2023-03-07 10:30:49,509][175731] Updated weights for policy 0, policy_version 34940 (0.0007) [2023-03-07 10:30:50,307][175731] Updated weights for policy 0, policy_version 34950 (0.0006) [2023-03-07 10:30:51,095][175731] Updated weights for policy 0, policy_version 34960 (0.0006) [2023-03-07 10:30:51,901][175731] Updated weights for policy 0, policy_version 34970 (0.0006) [2023-03-07 10:30:52,702][175731] Updated weights for policy 0, policy_version 34980 (0.0007) [2023-03-07 10:30:53,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12782.9, 300 sec: 12770.5). Total num frames: 35827712. Throughput: 0: 12779.5. Samples: 35826562. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:30:53,322][175405] Avg episode reward: [(0, '36.540')] [2023-03-07 10:30:53,493][175731] Updated weights for policy 0, policy_version 34990 (0.0006) [2023-03-07 10:30:54,304][175731] Updated weights for policy 0, policy_version 35000 (0.0007) [2023-03-07 10:30:55,106][175731] Updated weights for policy 0, policy_version 35010 (0.0007) [2023-03-07 10:30:55,894][175731] Updated weights for policy 0, policy_version 35020 (0.0007) [2023-03-07 10:30:56,698][175731] Updated weights for policy 0, policy_version 35030 (0.0006) [2023-03-07 10:30:57,332][175680] KL-divergence is very high: 7371.6729 [2023-03-07 10:30:57,490][175680] KL-divergence is very high: 1135267.2500 [2023-03-07 10:30:57,498][175731] Updated weights for policy 0, policy_version 35040 (0.0007) [2023-03-07 10:30:57,656][175680] KL-divergence is very high: 5326.8696 [2023-03-07 10:30:57,730][175680] KL-divergence is very high: 155.3832 [2023-03-07 10:30:57,892][175680] KL-divergence is very high: 8475.3447 [2023-03-07 10:30:58,047][175680] KL-divergence is very high: 409829.3750 [2023-03-07 10:30:58,210][175680] 
KL-divergence is very high: 37170.3906 [2023-03-07 10:30:58,303][175731] Updated weights for policy 0, policy_version 35050 (0.0008) [2023-03-07 10:30:58,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12782.9, 300 sec: 12770.5). Total num frames: 35891200. Throughput: 0: 12789.6. Samples: 35864932. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:30:58,322][175405] Avg episode reward: [(0, '34.961')] [2023-03-07 10:30:58,456][175680] KL-divergence is very high: 115.9033 [2023-03-07 10:30:59,115][175731] Updated weights for policy 0, policy_version 35060 (0.0006) [2023-03-07 10:30:59,910][175731] Updated weights for policy 0, policy_version 35070 (0.0006) [2023-03-07 10:31:00,725][175731] Updated weights for policy 0, policy_version 35080 (0.0007) [2023-03-07 10:31:01,526][175731] Updated weights for policy 0, policy_version 35090 (0.0007) [2023-03-07 10:31:02,315][175731] Updated weights for policy 0, policy_version 35100 (0.0006) [2023-03-07 10:31:03,117][175731] Updated weights for policy 0, policy_version 35110 (0.0006) [2023-03-07 10:31:03,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12782.9, 300 sec: 12767.0). Total num frames: 35954688. Throughput: 0: 12788.4. Samples: 35941443. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:31:03,321][175405] Avg episode reward: [(0, '32.650')] [2023-03-07 10:31:03,896][175731] Updated weights for policy 0, policy_version 35120 (0.0006) [2023-03-07 10:31:04,703][175731] Updated weights for policy 0, policy_version 35130 (0.0007) [2023-03-07 10:31:05,498][175731] Updated weights for policy 0, policy_version 35140 (0.0005) [2023-03-07 10:31:06,300][175731] Updated weights for policy 0, policy_version 35150 (0.0007) [2023-03-07 10:31:07,105][175731] Updated weights for policy 0, policy_version 35160 (0.0007) [2023-03-07 10:31:07,907][175731] Updated weights for policy 0, policy_version 35170 (0.0007) [2023-03-07 10:31:08,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12782.9, 300 sec: 12770.5). 
Total num frames: 36019200. Throughput: 0: 12783.9. Samples: 36018366. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:31:08,322][175405] Avg episode reward: [(0, '24.486')] [2023-03-07 10:31:08,716][175731] Updated weights for policy 0, policy_version 35180 (0.0007) [2023-03-07 10:31:09,518][175731] Updated weights for policy 0, policy_version 35190 (0.0006) [2023-03-07 10:31:10,316][175731] Updated weights for policy 0, policy_version 35200 (0.0008) [2023-03-07 10:31:11,109][175731] Updated weights for policy 0, policy_version 35210 (0.0006) [2023-03-07 10:31:11,915][175731] Updated weights for policy 0, policy_version 35220 (0.0006) [2023-03-07 10:31:12,705][175731] Updated weights for policy 0, policy_version 35230 (0.0007) [2023-03-07 10:31:13,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12782.9, 300 sec: 12767.0). Total num frames: 36082688. Throughput: 0: 12789.8. Samples: 36056736. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:31:13,322][175405] Avg episode reward: [(0, '27.741')] [2023-03-07 10:31:13,505][175731] Updated weights for policy 0, policy_version 35240 (0.0006) [2023-03-07 10:31:14,303][175731] Updated weights for policy 0, policy_version 35250 (0.0007) [2023-03-07 10:31:15,100][175731] Updated weights for policy 0, policy_version 35260 (0.0006) [2023-03-07 10:31:15,885][175731] Updated weights for policy 0, policy_version 35270 (0.0006) [2023-03-07 10:31:16,687][175731] Updated weights for policy 0, policy_version 35280 (0.0006) [2023-03-07 10:31:17,487][175731] Updated weights for policy 0, policy_version 35290 (0.0007) [2023-03-07 10:31:18,276][175731] Updated weights for policy 0, policy_version 35300 (0.0006) [2023-03-07 10:31:18,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12800.0, 300 sec: 12770.5). Total num frames: 36147200. Throughput: 0: 12806.8. Samples: 36133843. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:31:18,321][175405] Avg episode reward: [(0, '26.784')] [2023-03-07 10:31:19,081][175731] Updated weights for policy 0, policy_version 35310 (0.0007) [2023-03-07 10:31:19,883][175731] Updated weights for policy 0, policy_version 35320 (0.0008) [2023-03-07 10:31:20,678][175731] Updated weights for policy 0, policy_version 35330 (0.0006) [2023-03-07 10:31:21,486][175731] Updated weights for policy 0, policy_version 35340 (0.0006) [2023-03-07 10:31:22,288][175731] Updated weights for policy 0, policy_version 35350 (0.0006) [2023-03-07 10:31:23,078][175731] Updated weights for policy 0, policy_version 35360 (0.0007) [2023-03-07 10:31:23,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12782.9, 300 sec: 12767.0). Total num frames: 36210688. Throughput: 0: 12812.1. Samples: 36210581. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:31:23,332][175405] Avg episode reward: [(0, '41.886')] [2023-03-07 10:31:23,879][175731] Updated weights for policy 0, policy_version 35370 (0.0007) [2023-03-07 10:31:24,709][175731] Updated weights for policy 0, policy_version 35380 (0.0006) [2023-03-07 10:31:25,504][175731] Updated weights for policy 0, policy_version 35390 (0.0007) [2023-03-07 10:31:26,295][175731] Updated weights for policy 0, policy_version 35400 (0.0006) [2023-03-07 10:31:27,084][175731] Updated weights for policy 0, policy_version 35410 (0.0006) [2023-03-07 10:31:27,890][175731] Updated weights for policy 0, policy_version 35420 (0.0006) [2023-03-07 10:31:28,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12800.0, 300 sec: 12767.0). Total num frames: 36275200. Throughput: 0: 12807.1. Samples: 36248847. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:31:28,332][175405] Avg episode reward: [(0, '34.678')] [2023-03-07 10:31:28,693][175731] Updated weights for policy 0, policy_version 35430 (0.0006) [2023-03-07 10:31:29,478][175731] Updated weights for policy 0, policy_version 35440 (0.0006) [2023-03-07 10:31:30,279][175731] Updated weights for policy 0, policy_version 35450 (0.0007) [2023-03-07 10:31:31,090][175731] Updated weights for policy 0, policy_version 35460 (0.0006) [2023-03-07 10:31:31,886][175731] Updated weights for policy 0, policy_version 35470 (0.0007) [2023-03-07 10:31:32,702][175731] Updated weights for policy 0, policy_version 35480 (0.0006) [2023-03-07 10:31:33,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12767.0). Total num frames: 36338688. Throughput: 0: 12805.3. Samples: 36325809. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:31:33,332][175405] Avg episode reward: [(0, '40.612')] [2023-03-07 10:31:33,501][175731] Updated weights for policy 0, policy_version 35490 (0.0007) [2023-03-07 10:31:34,311][175731] Updated weights for policy 0, policy_version 35500 (0.0006) [2023-03-07 10:31:35,121][175731] Updated weights for policy 0, policy_version 35510 (0.0006) [2023-03-07 10:31:35,912][175731] Updated weights for policy 0, policy_version 35520 (0.0006) [2023-03-07 10:31:36,698][175731] Updated weights for policy 0, policy_version 35530 (0.0006) [2023-03-07 10:31:37,505][175731] Updated weights for policy 0, policy_version 35540 (0.0007) [2023-03-07 10:31:38,303][175731] Updated weights for policy 0, policy_version 35550 (0.0006) [2023-03-07 10:31:38,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12767.0). Total num frames: 36403200. Throughput: 0: 12795.7. Samples: 36402368. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:31:38,332][175405] Avg episode reward: [(0, '35.816')] [2023-03-07 10:31:39,115][175731] Updated weights for policy 0, policy_version 35560 (0.0006) [2023-03-07 10:31:39,906][175731] Updated weights for policy 0, policy_version 35570 (0.0006) [2023-03-07 10:31:40,705][175731] Updated weights for policy 0, policy_version 35580 (0.0006) [2023-03-07 10:31:41,508][175731] Updated weights for policy 0, policy_version 35590 (0.0007) [2023-03-07 10:31:42,310][175731] Updated weights for policy 0, policy_version 35600 (0.0007) [2023-03-07 10:31:43,106][175731] Updated weights for policy 0, policy_version 35610 (0.0007) [2023-03-07 10:31:43,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12800.0, 300 sec: 12767.0). Total num frames: 36466688. Throughput: 0: 12796.6. Samples: 36440776. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:31:43,321][175405] Avg episode reward: [(0, '37.174')] [2023-03-07 10:31:43,908][175731] Updated weights for policy 0, policy_version 35620 (0.0007) [2023-03-07 10:31:44,717][175731] Updated weights for policy 0, policy_version 35630 (0.0006) [2023-03-07 10:31:45,509][175731] Updated weights for policy 0, policy_version 35640 (0.0006) [2023-03-07 10:31:46,330][175731] Updated weights for policy 0, policy_version 35650 (0.0007) [2023-03-07 10:31:47,132][175731] Updated weights for policy 0, policy_version 35660 (0.0007) [2023-03-07 10:31:47,937][175731] Updated weights for policy 0, policy_version 35670 (0.0006) [2023-03-07 10:31:48,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12782.9, 300 sec: 12767.0). Total num frames: 36530176. Throughput: 0: 12796.9. Samples: 36517305. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:31:48,332][175405] Avg episode reward: [(0, '38.558')] [2023-03-07 10:31:48,337][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000035675_36531200.pth... 
[2023-03-07 10:31:48,366][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000032681_33465344.pth [2023-03-07 10:31:48,736][175731] Updated weights for policy 0, policy_version 35680 (0.0006) [2023-03-07 10:31:49,539][175731] Updated weights for policy 0, policy_version 35690 (0.0007) [2023-03-07 10:31:50,333][175731] Updated weights for policy 0, policy_version 35700 (0.0006) [2023-03-07 10:31:51,126][175731] Updated weights for policy 0, policy_version 35710 (0.0006) [2023-03-07 10:31:51,924][175731] Updated weights for policy 0, policy_version 35720 (0.0007) [2023-03-07 10:31:52,711][175731] Updated weights for policy 0, policy_version 35730 (0.0006) [2023-03-07 10:31:53,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12782.9, 300 sec: 12770.5). Total num frames: 36594688. Throughput: 0: 12797.1. Samples: 36594235. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:31:53,332][175405] Avg episode reward: [(0, '37.387')] [2023-03-07 10:31:53,500][175731] Updated weights for policy 0, policy_version 35740 (0.0006) [2023-03-07 10:31:54,316][175731] Updated weights for policy 0, policy_version 35750 (0.0006) [2023-03-07 10:31:55,115][175731] Updated weights for policy 0, policy_version 35760 (0.0006) [2023-03-07 10:31:55,930][175731] Updated weights for policy 0, policy_version 35770 (0.0006) [2023-03-07 10:31:56,730][175731] Updated weights for policy 0, policy_version 35780 (0.0006) [2023-03-07 10:31:57,521][175731] Updated weights for policy 0, policy_version 35790 (0.0006) [2023-03-07 10:31:58,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12782.9, 300 sec: 12770.5). Total num frames: 36658176. Throughput: 0: 12794.0. Samples: 36632469. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:31:58,322][175405] Avg episode reward: [(0, '48.476')] [2023-03-07 10:31:58,346][175731] Updated weights for policy 0, policy_version 35800 (0.0007) [2023-03-07 10:31:59,123][175731] Updated weights for policy 0, policy_version 35810 (0.0006) [2023-03-07 10:31:59,929][175731] Updated weights for policy 0, policy_version 35820 (0.0006) [2023-03-07 10:32:00,713][175731] Updated weights for policy 0, policy_version 35830 (0.0007) [2023-03-07 10:32:01,527][175731] Updated weights for policy 0, policy_version 35840 (0.0007) [2023-03-07 10:32:02,320][175731] Updated weights for policy 0, policy_version 35850 (0.0006) [2023-03-07 10:32:03,118][175731] Updated weights for policy 0, policy_version 35860 (0.0007) [2023-03-07 10:32:03,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12774.0). Total num frames: 36722688. Throughput: 0: 12793.3. Samples: 36709541. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:32:03,322][175405] Avg episode reward: [(0, '42.745')] [2023-03-07 10:32:03,925][175731] Updated weights for policy 0, policy_version 35870 (0.0007) [2023-03-07 10:32:04,725][175731] Updated weights for policy 0, policy_version 35880 (0.0006) [2023-03-07 10:32:05,534][175731] Updated weights for policy 0, policy_version 35890 (0.0006) [2023-03-07 10:32:06,317][175731] Updated weights for policy 0, policy_version 35900 (0.0007) [2023-03-07 10:32:07,113][175731] Updated weights for policy 0, policy_version 35910 (0.0006) [2023-03-07 10:32:07,933][175731] Updated weights for policy 0, policy_version 35920 (0.0006) [2023-03-07 10:32:08,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12782.9, 300 sec: 12774.0). Total num frames: 36786176. Throughput: 0: 12790.8. Samples: 36786168. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:32:08,322][175405] Avg episode reward: [(0, '45.611')] [2023-03-07 10:32:08,714][175731] Updated weights for policy 0, policy_version 35930 (0.0007) [2023-03-07 10:32:09,520][175731] Updated weights for policy 0, policy_version 35940 (0.0006) [2023-03-07 10:32:10,323][175731] Updated weights for policy 0, policy_version 35950 (0.0006) [2023-03-07 10:32:11,137][175731] Updated weights for policy 0, policy_version 35960 (0.0007) [2023-03-07 10:32:11,927][175731] Updated weights for policy 0, policy_version 35970 (0.0005) [2023-03-07 10:32:12,726][175731] Updated weights for policy 0, policy_version 35980 (0.0006) [2023-03-07 10:32:13,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12777.4). Total num frames: 36850688. Throughput: 0: 12789.8. Samples: 36824389. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:32:13,322][175405] Avg episode reward: [(0, '73.788')] [2023-03-07 10:32:13,537][175731] Updated weights for policy 0, policy_version 35990 (0.0006) [2023-03-07 10:32:14,342][175731] Updated weights for policy 0, policy_version 36000 (0.0006) [2023-03-07 10:32:15,141][175731] Updated weights for policy 0, policy_version 36010 (0.0006) [2023-03-07 10:32:15,953][175731] Updated weights for policy 0, policy_version 36020 (0.0006) [2023-03-07 10:32:16,757][175731] Updated weights for policy 0, policy_version 36030 (0.0006) [2023-03-07 10:32:17,553][175731] Updated weights for policy 0, policy_version 36040 (0.0006) [2023-03-07 10:32:18,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12782.9, 300 sec: 12777.4). Total num frames: 36914176. Throughput: 0: 12785.3. Samples: 36901145. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:32:18,321][175405] Avg episode reward: [(0, '56.495')] [2023-03-07 10:32:18,363][175731] Updated weights for policy 0, policy_version 36050 (0.0006) [2023-03-07 10:32:19,149][175731] Updated weights for policy 0, policy_version 36060 (0.0006) [2023-03-07 10:32:19,960][175731] Updated weights for policy 0, policy_version 36070 (0.0007) [2023-03-07 10:32:20,765][175731] Updated weights for policy 0, policy_version 36080 (0.0006) [2023-03-07 10:32:21,566][175731] Updated weights for policy 0, policy_version 36090 (0.0006) [2023-03-07 10:32:22,361][175731] Updated weights for policy 0, policy_version 36100 (0.0007) [2023-03-07 10:32:23,147][175731] Updated weights for policy 0, policy_version 36110 (0.0006) [2023-03-07 10:32:23,321][175405] Fps is (10 sec: 12800.2, 60 sec: 12800.0, 300 sec: 12780.9). Total num frames: 36978688. Throughput: 0: 12789.5. Samples: 36977896. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:32:23,321][175405] Avg episode reward: [(0, '55.877')] [2023-03-07 10:32:23,956][175731] Updated weights for policy 0, policy_version 36120 (0.0006) [2023-03-07 10:32:24,764][175731] Updated weights for policy 0, policy_version 36130 (0.0006) [2023-03-07 10:32:25,574][175731] Updated weights for policy 0, policy_version 36140 (0.0007) [2023-03-07 10:32:26,370][175731] Updated weights for policy 0, policy_version 36150 (0.0006) [2023-03-07 10:32:27,158][175731] Updated weights for policy 0, policy_version 36160 (0.0006) [2023-03-07 10:32:27,963][175731] Updated weights for policy 0, policy_version 36170 (0.0007) [2023-03-07 10:32:28,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12782.9, 300 sec: 12780.9). Total num frames: 37042176. Throughput: 0: 12783.5. Samples: 37016033. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:32:28,322][175405] Avg episode reward: [(0, '90.169')] [2023-03-07 10:32:28,773][175731] Updated weights for policy 0, policy_version 36180 (0.0006) [2023-03-07 10:32:29,572][175731] Updated weights for policy 0, policy_version 36190 (0.0007) [2023-03-07 10:32:30,373][175731] Updated weights for policy 0, policy_version 36200 (0.0007) [2023-03-07 10:32:31,175][175731] Updated weights for policy 0, policy_version 36210 (0.0006) [2023-03-07 10:32:31,969][175731] Updated weights for policy 0, policy_version 36220 (0.0008) [2023-03-07 10:32:32,775][175731] Updated weights for policy 0, policy_version 36230 (0.0008) [2023-03-07 10:32:33,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12782.9, 300 sec: 12777.4). Total num frames: 37105664. Throughput: 0: 12785.5. Samples: 37092651. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:32:33,321][175405] Avg episode reward: [(0, '47.234')] [2023-03-07 10:32:33,586][175731] Updated weights for policy 0, policy_version 36240 (0.0006) [2023-03-07 10:32:34,368][175731] Updated weights for policy 0, policy_version 36250 (0.0006) [2023-03-07 10:32:35,160][175731] Updated weights for policy 0, policy_version 36260 (0.0007) [2023-03-07 10:32:35,960][175731] Updated weights for policy 0, policy_version 36270 (0.0006) [2023-03-07 10:32:36,761][175731] Updated weights for policy 0, policy_version 36280 (0.0007) [2023-03-07 10:32:37,555][175731] Updated weights for policy 0, policy_version 36290 (0.0007) [2023-03-07 10:32:38,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12782.9, 300 sec: 12777.4). Total num frames: 37170176. Throughput: 0: 12782.1. Samples: 37169428. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:32:38,322][175405] Avg episode reward: [(0, '58.834')] [2023-03-07 10:32:38,382][175731] Updated weights for policy 0, policy_version 36300 (0.0006) [2023-03-07 10:32:39,186][175731] Updated weights for policy 0, policy_version 36310 (0.0006) [2023-03-07 10:32:39,982][175731] Updated weights for policy 0, policy_version 36320 (0.0006) [2023-03-07 10:32:40,800][175731] Updated weights for policy 0, policy_version 36330 (0.0007) [2023-03-07 10:32:41,592][175731] Updated weights for policy 0, policy_version 36340 (0.0007) [2023-03-07 10:32:42,394][175731] Updated weights for policy 0, policy_version 36350 (0.0006) [2023-03-07 10:32:43,204][175731] Updated weights for policy 0, policy_version 36360 (0.0006) [2023-03-07 10:32:43,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12782.9, 300 sec: 12777.4). Total num frames: 37233664. Throughput: 0: 12780.1. Samples: 37207575. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:32:43,322][175405] Avg episode reward: [(0, '57.443')] [2023-03-07 10:32:43,998][175731] Updated weights for policy 0, policy_version 36370 (0.0007) [2023-03-07 10:32:44,793][175731] Updated weights for policy 0, policy_version 36380 (0.0006) [2023-03-07 10:32:45,601][175731] Updated weights for policy 0, policy_version 36390 (0.0007) [2023-03-07 10:32:46,404][175731] Updated weights for policy 0, policy_version 36400 (0.0006) [2023-03-07 10:32:47,170][175731] Updated weights for policy 0, policy_version 36410 (0.0007) [2023-03-07 10:32:47,978][175731] Updated weights for policy 0, policy_version 36420 (0.0007) [2023-03-07 10:32:48,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12777.4). Total num frames: 37298176. Throughput: 0: 12777.9. Samples: 37284546. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:32:48,322][175405] Avg episode reward: [(0, '50.460')] [2023-03-07 10:32:48,798][175731] Updated weights for policy 0, policy_version 36430 (0.0006) [2023-03-07 10:32:49,602][175731] Updated weights for policy 0, policy_version 36440 (0.0006) [2023-03-07 10:32:50,402][175731] Updated weights for policy 0, policy_version 36450 (0.0006) [2023-03-07 10:32:51,199][175731] Updated weights for policy 0, policy_version 36460 (0.0007) [2023-03-07 10:32:51,990][175731] Updated weights for policy 0, policy_version 36470 (0.0006) [2023-03-07 10:32:52,795][175731] Updated weights for policy 0, policy_version 36480 (0.0007) [2023-03-07 10:32:53,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12782.9, 300 sec: 12777.4). Total num frames: 37361664. Throughput: 0: 12774.1. Samples: 37361001. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:32:53,321][175405] Avg episode reward: [(0, '56.625')] [2023-03-07 10:32:53,602][175731] Updated weights for policy 0, policy_version 36490 (0.0006) [2023-03-07 10:32:54,417][175731] Updated weights for policy 0, policy_version 36500 (0.0007) [2023-03-07 10:32:55,215][175731] Updated weights for policy 0, policy_version 36510 (0.0007) [2023-03-07 10:32:56,019][175731] Updated weights for policy 0, policy_version 36520 (0.0007) [2023-03-07 10:32:56,838][175731] Updated weights for policy 0, policy_version 36530 (0.0007) [2023-03-07 10:32:57,634][175731] Updated weights for policy 0, policy_version 36540 (0.0006) [2023-03-07 10:32:58,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12783.0, 300 sec: 12777.4). Total num frames: 37425152. Throughput: 0: 12774.8. Samples: 37399255. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:32:58,321][175405] Avg episode reward: [(0, '63.287')] [2023-03-07 10:32:58,434][175731] Updated weights for policy 0, policy_version 36550 (0.0007) [2023-03-07 10:32:59,230][175731] Updated weights for policy 0, policy_version 36560 (0.0007) [2023-03-07 10:33:00,037][175731] Updated weights for policy 0, policy_version 36570 (0.0005) [2023-03-07 10:33:00,834][175731] Updated weights for policy 0, policy_version 36580 (0.0006) [2023-03-07 10:33:01,633][175731] Updated weights for policy 0, policy_version 36590 (0.0006) [2023-03-07 10:33:02,431][175731] Updated weights for policy 0, policy_version 36600 (0.0006) [2023-03-07 10:33:03,243][175731] Updated weights for policy 0, policy_version 36610 (0.0006) [2023-03-07 10:33:03,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12782.9, 300 sec: 12777.4). Total num frames: 37489664. Throughput: 0: 12775.3. Samples: 37476032. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:33:03,322][175405] Avg episode reward: [(0, '52.796')] [2023-03-07 10:33:04,043][175731] Updated weights for policy 0, policy_version 36620 (0.0006) [2023-03-07 10:33:04,854][175731] Updated weights for policy 0, policy_version 36630 (0.0006) [2023-03-07 10:33:05,676][175731] Updated weights for policy 0, policy_version 36640 (0.0006) [2023-03-07 10:33:06,450][175731] Updated weights for policy 0, policy_version 36650 (0.0006) [2023-03-07 10:33:07,247][175731] Updated weights for policy 0, policy_version 36660 (0.0006) [2023-03-07 10:33:08,050][175731] Updated weights for policy 0, policy_version 36670 (0.0006) [2023-03-07 10:33:08,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12782.9, 300 sec: 12777.4). Total num frames: 37553152. Throughput: 0: 12772.1. Samples: 37552643. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:33:08,322][175405] Avg episode reward: [(0, '56.099')] [2023-03-07 10:33:08,852][175731] Updated weights for policy 0, policy_version 36680 (0.0007) [2023-03-07 10:33:09,636][175731] Updated weights for policy 0, policy_version 36690 (0.0005) [2023-03-07 10:33:10,422][175731] Updated weights for policy 0, policy_version 36700 (0.0007) [2023-03-07 10:33:11,235][175731] Updated weights for policy 0, policy_version 36710 (0.0006) [2023-03-07 10:33:12,031][175731] Updated weights for policy 0, policy_version 36720 (0.0006) [2023-03-07 10:33:12,823][175731] Updated weights for policy 0, policy_version 36730 (0.0006) [2023-03-07 10:33:13,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12783.0, 300 sec: 12777.4). Total num frames: 37617664. Throughput: 0: 12781.0. Samples: 37591177. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:33:13,321][175405] Avg episode reward: [(0, '79.059')] [2023-03-07 10:33:13,625][175731] Updated weights for policy 0, policy_version 36740 (0.0007) [2023-03-07 10:33:14,435][175731] Updated weights for policy 0, policy_version 36750 (0.0007) [2023-03-07 10:33:15,245][175731] Updated weights for policy 0, policy_version 36760 (0.0006) [2023-03-07 10:33:16,050][175731] Updated weights for policy 0, policy_version 36770 (0.0006) [2023-03-07 10:33:16,847][175731] Updated weights for policy 0, policy_version 36780 (0.0007) [2023-03-07 10:33:17,649][175731] Updated weights for policy 0, policy_version 36790 (0.0006) [2023-03-07 10:33:18,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12782.9, 300 sec: 12777.4). Total num frames: 37681152. Throughput: 0: 12782.6. Samples: 37667866. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:33:18,322][175405] Avg episode reward: [(0, '55.899')] [2023-03-07 10:33:18,447][175731] Updated weights for policy 0, policy_version 36800 (0.0007) [2023-03-07 10:33:19,244][175731] Updated weights for policy 0, policy_version 36810 (0.0007) [2023-03-07 10:33:20,056][175731] Updated weights for policy 0, policy_version 36820 (0.0007) [2023-03-07 10:33:20,861][175731] Updated weights for policy 0, policy_version 36830 (0.0007) [2023-03-07 10:33:21,662][175731] Updated weights for policy 0, policy_version 36840 (0.0005) [2023-03-07 10:33:22,452][175731] Updated weights for policy 0, policy_version 36850 (0.0006) [2023-03-07 10:33:23,259][175731] Updated weights for policy 0, policy_version 36860 (0.0006) [2023-03-07 10:33:23,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12765.8, 300 sec: 12774.0). Total num frames: 37744640. Throughput: 0: 12777.1. Samples: 37744399. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:33:23,322][175405] Avg episode reward: [(0, '57.155')] [2023-03-07 10:33:24,057][175731] Updated weights for policy 0, policy_version 36870 (0.0007) [2023-03-07 10:33:24,841][175731] Updated weights for policy 0, policy_version 36880 (0.0006) [2023-03-07 10:33:25,670][175731] Updated weights for policy 0, policy_version 36890 (0.0006) [2023-03-07 10:33:26,457][175731] Updated weights for policy 0, policy_version 36900 (0.0006) [2023-03-07 10:33:27,258][175731] Updated weights for policy 0, policy_version 36910 (0.0006) [2023-03-07 10:33:28,048][175731] Updated weights for policy 0, policy_version 36920 (0.0006) [2023-03-07 10:33:28,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12782.9, 300 sec: 12777.4). Total num frames: 37809152. Throughput: 0: 12781.5. Samples: 37782742. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:33:28,322][175405] Avg episode reward: [(0, '63.241')] [2023-03-07 10:33:28,877][175731] Updated weights for policy 0, policy_version 36930 (0.0006) [2023-03-07 10:33:29,664][175731] Updated weights for policy 0, policy_version 36940 (0.0006) [2023-03-07 10:33:30,472][175731] Updated weights for policy 0, policy_version 36950 (0.0007) [2023-03-07 10:33:31,287][175731] Updated weights for policy 0, policy_version 36960 (0.0006) [2023-03-07 10:33:32,107][175731] Updated weights for policy 0, policy_version 36970 (0.0007) [2023-03-07 10:33:32,902][175731] Updated weights for policy 0, policy_version 36980 (0.0006) [2023-03-07 10:33:33,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12782.9, 300 sec: 12777.4). Total num frames: 37872640. Throughput: 0: 12766.4. Samples: 37859034. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:33:33,322][175405] Avg episode reward: [(0, '71.786')] [2023-03-07 10:33:33,697][175731] Updated weights for policy 0, policy_version 36990 (0.0007) [2023-03-07 10:33:34,500][175731] Updated weights for policy 0, policy_version 37000 (0.0006) [2023-03-07 10:33:35,322][175731] Updated weights for policy 0, policy_version 37010 (0.0006) [2023-03-07 10:33:36,126][175731] Updated weights for policy 0, policy_version 37020 (0.0006) [2023-03-07 10:33:36,915][175731] Updated weights for policy 0, policy_version 37030 (0.0007) [2023-03-07 10:33:37,718][175731] Updated weights for policy 0, policy_version 37040 (0.0006) [2023-03-07 10:33:38,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12765.9, 300 sec: 12777.4). Total num frames: 37936128. Throughput: 0: 12772.0. Samples: 37935743. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:33:38,322][175405] Avg episode reward: [(0, '78.757')] [2023-03-07 10:33:38,535][175731] Updated weights for policy 0, policy_version 37050 (0.0006) [2023-03-07 10:33:39,349][175731] Updated weights for policy 0, policy_version 37060 (0.0006) [2023-03-07 10:33:40,145][175731] Updated weights for policy 0, policy_version 37070 (0.0006) [2023-03-07 10:33:40,945][175731] Updated weights for policy 0, policy_version 37080 (0.0006) [2023-03-07 10:33:41,750][175731] Updated weights for policy 0, policy_version 37090 (0.0007) [2023-03-07 10:33:42,558][175731] Updated weights for policy 0, policy_version 37100 (0.0006) [2023-03-07 10:33:43,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12765.9, 300 sec: 12774.0). Total num frames: 37999616. Throughput: 0: 12768.8. Samples: 37973852. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:33:43,321][175405] Avg episode reward: [(0, '71.110')] [2023-03-07 10:33:43,357][175731] Updated weights for policy 0, policy_version 37110 (0.0006) [2023-03-07 10:33:44,157][175731] Updated weights for policy 0, policy_version 37120 (0.0006) [2023-03-07 10:33:44,969][175731] Updated weights for policy 0, policy_version 37130 (0.0006) [2023-03-07 10:33:45,760][175731] Updated weights for policy 0, policy_version 37140 (0.0006) [2023-03-07 10:33:46,584][175731] Updated weights for policy 0, policy_version 37150 (0.0007) [2023-03-07 10:33:47,386][175731] Updated weights for policy 0, policy_version 37160 (0.0006) [2023-03-07 10:33:48,177][175731] Updated weights for policy 0, policy_version 37170 (0.0006) [2023-03-07 10:33:48,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12748.8, 300 sec: 12770.5). Total num frames: 38063104. Throughput: 0: 12751.8. Samples: 38049866. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:33:48,322][175405] Avg episode reward: [(0, '79.244')] [2023-03-07 10:33:48,326][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000037171_38063104.pth... [2023-03-07 10:33:48,357][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000034177_34997248.pth [2023-03-07 10:33:48,998][175731] Updated weights for policy 0, policy_version 37180 (0.0006) [2023-03-07 10:33:49,810][175731] Updated weights for policy 0, policy_version 37190 (0.0008) [2023-03-07 10:33:50,601][175731] Updated weights for policy 0, policy_version 37200 (0.0006) [2023-03-07 10:33:51,406][175731] Updated weights for policy 0, policy_version 37210 (0.0006) [2023-03-07 10:33:52,209][175731] Updated weights for policy 0, policy_version 37220 (0.0008) [2023-03-07 10:33:53,002][175731] Updated weights for policy 0, policy_version 37230 (0.0006) [2023-03-07 10:33:53,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12765.9, 300 sec: 12774.0). Total num frames: 38127616. Throughput: 0: 12751.4. Samples: 38126454. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:33:53,321][175405] Avg episode reward: [(0, '66.419')] [2023-03-07 10:33:53,810][175731] Updated weights for policy 0, policy_version 37240 (0.0006) [2023-03-07 10:33:54,613][175731] Updated weights for policy 0, policy_version 37250 (0.0007) [2023-03-07 10:33:55,406][175731] Updated weights for policy 0, policy_version 37260 (0.0007) [2023-03-07 10:33:56,221][175731] Updated weights for policy 0, policy_version 37270 (0.0006) [2023-03-07 10:33:57,025][175731] Updated weights for policy 0, policy_version 37280 (0.0007) [2023-03-07 10:33:57,825][175731] Updated weights for policy 0, policy_version 37290 (0.0007) [2023-03-07 10:33:58,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12765.9, 300 sec: 12770.5). Total num frames: 38191104. Throughput: 0: 12746.6. 
Samples: 38164774. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:33:58,321][175405] Avg episode reward: [(0, '65.891')] [2023-03-07 10:33:58,633][175731] Updated weights for policy 0, policy_version 37300 (0.0007) [2023-03-07 10:33:59,422][175731] Updated weights for policy 0, policy_version 37310 (0.0006) [2023-03-07 10:34:00,219][175731] Updated weights for policy 0, policy_version 37320 (0.0006) [2023-03-07 10:34:01,037][175731] Updated weights for policy 0, policy_version 37330 (0.0007) [2023-03-07 10:34:01,831][175731] Updated weights for policy 0, policy_version 37340 (0.0006) [2023-03-07 10:34:02,645][175731] Updated weights for policy 0, policy_version 37350 (0.0006) [2023-03-07 10:34:03,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12748.8, 300 sec: 12774.0). Total num frames: 38254592. Throughput: 0: 12744.4. Samples: 38241364. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:34:03,321][175405] Avg episode reward: [(0, '64.390')] [2023-03-07 10:34:03,458][175731] Updated weights for policy 0, policy_version 37360 (0.0006) [2023-03-07 10:34:04,263][175731] Updated weights for policy 0, policy_version 37370 (0.0006) [2023-03-07 10:34:05,079][175731] Updated weights for policy 0, policy_version 37380 (0.0006) [2023-03-07 10:34:05,883][175731] Updated weights for policy 0, policy_version 37390 (0.0007) [2023-03-07 10:34:06,682][175731] Updated weights for policy 0, policy_version 37400 (0.0007) [2023-03-07 10:34:07,494][175731] Updated weights for policy 0, policy_version 37410 (0.0005) [2023-03-07 10:34:08,289][175731] Updated weights for policy 0, policy_version 37420 (0.0006) [2023-03-07 10:34:08,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12748.8, 300 sec: 12770.5). Total num frames: 38318080. Throughput: 0: 12735.6. Samples: 38317502. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:34:08,322][175405] Avg episode reward: [(0, '73.245')] [2023-03-07 10:34:09,098][175731] Updated weights for policy 0, policy_version 37430 (0.0006) [2023-03-07 10:34:09,894][175731] Updated weights for policy 0, policy_version 37440 (0.0006) [2023-03-07 10:34:10,707][175731] Updated weights for policy 0, policy_version 37450 (0.0006) [2023-03-07 10:34:11,520][175731] Updated weights for policy 0, policy_version 37460 (0.0007) [2023-03-07 10:34:12,317][175731] Updated weights for policy 0, policy_version 37470 (0.0006) [2023-03-07 10:34:13,140][175731] Updated weights for policy 0, policy_version 37480 (0.0006) [2023-03-07 10:34:13,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12770.5). Total num frames: 38381568. Throughput: 0: 12729.6. Samples: 38355572. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:34:13,321][175405] Avg episode reward: [(0, '73.651')] [2023-03-07 10:34:13,927][175731] Updated weights for policy 0, policy_version 37490 (0.0007) [2023-03-07 10:34:14,741][175731] Updated weights for policy 0, policy_version 37500 (0.0006) [2023-03-07 10:34:15,537][175731] Updated weights for policy 0, policy_version 37510 (0.0006) [2023-03-07 10:34:16,326][175731] Updated weights for policy 0, policy_version 37520 (0.0006) [2023-03-07 10:34:17,114][175731] Updated weights for policy 0, policy_version 37530 (0.0006) [2023-03-07 10:34:17,922][175731] Updated weights for policy 0, policy_version 37540 (0.0006) [2023-03-07 10:34:18,321][175405] Fps is (10 sec: 12800.2, 60 sec: 12748.8, 300 sec: 12774.0). Total num frames: 38446080. Throughput: 0: 12736.7. Samples: 38432186. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:34:18,321][175405] Avg episode reward: [(0, '72.433')] [2023-03-07 10:34:18,716][175731] Updated weights for policy 0, policy_version 37550 (0.0006) [2023-03-07 10:34:19,514][175731] Updated weights for policy 0, policy_version 37560 (0.0007) [2023-03-07 10:34:20,324][175731] Updated weights for policy 0, policy_version 37570 (0.0006) [2023-03-07 10:34:21,145][175731] Updated weights for policy 0, policy_version 37580 (0.0006) [2023-03-07 10:34:21,968][175731] Updated weights for policy 0, policy_version 37590 (0.0006) [2023-03-07 10:34:22,770][175731] Updated weights for policy 0, policy_version 37600 (0.0006) [2023-03-07 10:34:23,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12767.0). Total num frames: 38508544. Throughput: 0: 12724.2. Samples: 38508332. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:34:23,321][175405] Avg episode reward: [(0, '68.146')] [2023-03-07 10:34:23,576][175731] Updated weights for policy 0, policy_version 37610 (0.0006) [2023-03-07 10:34:24,365][175731] Updated weights for policy 0, policy_version 37620 (0.0006) [2023-03-07 10:34:25,181][175731] Updated weights for policy 0, policy_version 37630 (0.0007) [2023-03-07 10:34:25,973][175731] Updated weights for policy 0, policy_version 37640 (0.0007) [2023-03-07 10:34:26,777][175731] Updated weights for policy 0, policy_version 37650 (0.0007) [2023-03-07 10:34:27,593][175731] Updated weights for policy 0, policy_version 37660 (0.0006) [2023-03-07 10:34:28,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12731.8, 300 sec: 12770.5). Total num frames: 38573056. Throughput: 0: 12727.9. Samples: 38546608. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:34:28,321][175405] Avg episode reward: [(0, '76.695')] [2023-03-07 10:34:28,385][175731] Updated weights for policy 0, policy_version 37670 (0.0007) [2023-03-07 10:34:29,205][175731] Updated weights for policy 0, policy_version 37680 (0.0006) [2023-03-07 10:34:30,001][175731] Updated weights for policy 0, policy_version 37690 (0.0007) [2023-03-07 10:34:30,810][175731] Updated weights for policy 0, policy_version 37700 (0.0007) [2023-03-07 10:34:31,631][175731] Updated weights for policy 0, policy_version 37710 (0.0007) [2023-03-07 10:34:32,430][175731] Updated weights for policy 0, policy_version 37720 (0.0007) [2023-03-07 10:34:33,242][175731] Updated weights for policy 0, policy_version 37730 (0.0006) [2023-03-07 10:34:33,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12770.5). Total num frames: 38636544. Throughput: 0: 12735.5. Samples: 38622963. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:34:33,321][175405] Avg episode reward: [(0, '83.150')] [2023-03-07 10:34:34,059][175731] Updated weights for policy 0, policy_version 37740 (0.0007) [2023-03-07 10:34:34,881][175731] Updated weights for policy 0, policy_version 37750 (0.0006) [2023-03-07 10:34:35,684][175731] Updated weights for policy 0, policy_version 37760 (0.0006) [2023-03-07 10:34:36,491][175731] Updated weights for policy 0, policy_version 37770 (0.0007) [2023-03-07 10:34:37,289][175731] Updated weights for policy 0, policy_version 37780 (0.0006) [2023-03-07 10:34:38,099][175731] Updated weights for policy 0, policy_version 37790 (0.0006) [2023-03-07 10:34:38,321][175405] Fps is (10 sec: 12595.1, 60 sec: 12714.7, 300 sec: 12763.6). Total num frames: 38699008. Throughput: 0: 12718.8. Samples: 38698802. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:34:38,321][175405] Avg episode reward: [(0, '73.985')] [2023-03-07 10:34:38,898][175731] Updated weights for policy 0, policy_version 37800 (0.0006) [2023-03-07 10:34:39,696][175731] Updated weights for policy 0, policy_version 37810 (0.0006) [2023-03-07 10:34:40,502][175731] Updated weights for policy 0, policy_version 37820 (0.0007) [2023-03-07 10:34:41,302][175731] Updated weights for policy 0, policy_version 37830 (0.0006) [2023-03-07 10:34:42,109][175731] Updated weights for policy 0, policy_version 37840 (0.0007) [2023-03-07 10:34:42,915][175731] Updated weights for policy 0, policy_version 37850 (0.0006) [2023-03-07 10:34:43,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12767.0). Total num frames: 38763520. Throughput: 0: 12718.2. Samples: 38737092. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:34:43,322][175405] Avg episode reward: [(0, '105.518')] [2023-03-07 10:34:43,715][175731] Updated weights for policy 0, policy_version 37860 (0.0007) [2023-03-07 10:34:44,520][175731] Updated weights for policy 0, policy_version 37870 (0.0006) [2023-03-07 10:34:45,313][175731] Updated weights for policy 0, policy_version 37880 (0.0006) [2023-03-07 10:34:46,114][175731] Updated weights for policy 0, policy_version 37890 (0.0007) [2023-03-07 10:34:46,926][175731] Updated weights for policy 0, policy_version 37900 (0.0006) [2023-03-07 10:34:47,729][175731] Updated weights for policy 0, policy_version 37910 (0.0006) [2023-03-07 10:34:48,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12731.7, 300 sec: 12767.0). Total num frames: 38827008. Throughput: 0: 12718.9. Samples: 38813715. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:34:48,322][175405] Avg episode reward: [(0, '74.310')] [2023-03-07 10:34:48,531][175731] Updated weights for policy 0, policy_version 37920 (0.0007) [2023-03-07 10:34:49,346][175731] Updated weights for policy 0, policy_version 37930 (0.0006) [2023-03-07 10:34:50,126][175731] Updated weights for policy 0, policy_version 37940 (0.0007) [2023-03-07 10:34:50,938][175731] Updated weights for policy 0, policy_version 37950 (0.0006) [2023-03-07 10:34:51,735][175731] Updated weights for policy 0, policy_version 37960 (0.0007) [2023-03-07 10:34:52,553][175731] Updated weights for policy 0, policy_version 37970 (0.0006) [2023-03-07 10:34:53,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12767.0). Total num frames: 38890496. Throughput: 0: 12721.9. Samples: 38889984. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:34:53,322][175405] Avg episode reward: [(0, '77.129')] [2023-03-07 10:34:53,367][175731] Updated weights for policy 0, policy_version 37980 (0.0007) [2023-03-07 10:34:54,165][175731] Updated weights for policy 0, policy_version 37990 (0.0007) [2023-03-07 10:34:54,944][175731] Updated weights for policy 0, policy_version 38000 (0.0007) [2023-03-07 10:34:55,765][175731] Updated weights for policy 0, policy_version 38010 (0.0006) [2023-03-07 10:34:56,543][175731] Updated weights for policy 0, policy_version 38020 (0.0007) [2023-03-07 10:34:57,342][175731] Updated weights for policy 0, policy_version 38030 (0.0007) [2023-03-07 10:34:58,154][175731] Updated weights for policy 0, policy_version 38040 (0.0006) [2023-03-07 10:34:58,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12731.7, 300 sec: 12770.5). Total num frames: 38955008. Throughput: 0: 12731.5. Samples: 38928491. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:34:58,322][175405] Avg episode reward: [(0, '68.776')] [2023-03-07 10:34:58,959][175731] Updated weights for policy 0, policy_version 38050 (0.0007) [2023-03-07 10:34:59,747][175731] Updated weights for policy 0, policy_version 38060 (0.0006) [2023-03-07 10:35:00,574][175731] Updated weights for policy 0, policy_version 38070 (0.0006) [2023-03-07 10:35:01,397][175731] Updated weights for policy 0, policy_version 38080 (0.0006) [2023-03-07 10:35:02,190][175731] Updated weights for policy 0, policy_version 38090 (0.0006) [2023-03-07 10:35:02,986][175731] Updated weights for policy 0, policy_version 38100 (0.0007) [2023-03-07 10:35:03,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12763.6). Total num frames: 39017472. Throughput: 0: 12725.7. Samples: 39004843. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:35:03,321][175405] Avg episode reward: [(0, '55.899')] [2023-03-07 10:35:03,815][175731] Updated weights for policy 0, policy_version 38110 (0.0007) [2023-03-07 10:35:04,619][175731] Updated weights for policy 0, policy_version 38120 (0.0006) [2023-03-07 10:35:05,429][175731] Updated weights for policy 0, policy_version 38130 (0.0007) [2023-03-07 10:35:06,250][175731] Updated weights for policy 0, policy_version 38140 (0.0007) [2023-03-07 10:35:07,031][175731] Updated weights for policy 0, policy_version 38150 (0.0005) [2023-03-07 10:35:07,847][175731] Updated weights for policy 0, policy_version 38160 (0.0006) [2023-03-07 10:35:08,321][175405] Fps is (10 sec: 12595.3, 60 sec: 12714.7, 300 sec: 12763.6). Total num frames: 39080960. Throughput: 0: 12724.8. Samples: 39080949. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:35:08,321][175405] Avg episode reward: [(0, '62.554')] [2023-03-07 10:35:08,649][175731] Updated weights for policy 0, policy_version 38170 (0.0007) [2023-03-07 10:35:09,477][175731] Updated weights for policy 0, policy_version 38180 (0.0006) [2023-03-07 10:35:10,276][175731] Updated weights for policy 0, policy_version 38190 (0.0006) [2023-03-07 10:35:11,086][175731] Updated weights for policy 0, policy_version 38200 (0.0006) [2023-03-07 10:35:11,893][175731] Updated weights for policy 0, policy_version 38210 (0.0007) [2023-03-07 10:35:12,676][175731] Updated weights for policy 0, policy_version 38220 (0.0006) [2023-03-07 10:35:13,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12763.6). Total num frames: 39144448. Throughput: 0: 12718.1. Samples: 39118924. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:35:13,321][175405] Avg episode reward: [(0, '56.285')] [2023-03-07 10:35:13,507][175731] Updated weights for policy 0, policy_version 38230 (0.0006) [2023-03-07 10:35:14,299][175731] Updated weights for policy 0, policy_version 38240 (0.0006) [2023-03-07 10:35:15,118][175731] Updated weights for policy 0, policy_version 38250 (0.0006) [2023-03-07 10:35:15,933][175731] Updated weights for policy 0, policy_version 38260 (0.0006) [2023-03-07 10:35:16,732][175731] Updated weights for policy 0, policy_version 38270 (0.0006) [2023-03-07 10:35:17,534][175731] Updated weights for policy 0, policy_version 38280 (0.0007) [2023-03-07 10:35:18,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12760.1). Total num frames: 39207936. Throughput: 0: 12713.9. Samples: 39195091. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:35:18,322][175405] Avg episode reward: [(0, '68.193')] [2023-03-07 10:35:18,344][175731] Updated weights for policy 0, policy_version 38290 (0.0007) [2023-03-07 10:35:19,162][175731] Updated weights for policy 0, policy_version 38300 (0.0006) [2023-03-07 10:35:19,942][175731] Updated weights for policy 0, policy_version 38310 (0.0006) [2023-03-07 10:35:20,737][175731] Updated weights for policy 0, policy_version 38320 (0.0007) [2023-03-07 10:35:21,554][175731] Updated weights for policy 0, policy_version 38330 (0.0007) [2023-03-07 10:35:22,356][175731] Updated weights for policy 0, policy_version 38340 (0.0007) [2023-03-07 10:35:23,153][175731] Updated weights for policy 0, policy_version 38350 (0.0006) [2023-03-07 10:35:23,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12731.7, 300 sec: 12763.6). Total num frames: 39272448. Throughput: 0: 12724.5. Samples: 39271406. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:35:23,322][175405] Avg episode reward: [(0, '50.803')] [2023-03-07 10:35:23,961][175731] Updated weights for policy 0, policy_version 38360 (0.0007) [2023-03-07 10:35:24,756][175731] Updated weights for policy 0, policy_version 38370 (0.0006) [2023-03-07 10:35:25,550][175731] Updated weights for policy 0, policy_version 38380 (0.0006) [2023-03-07 10:35:26,357][175731] Updated weights for policy 0, policy_version 38390 (0.0007) [2023-03-07 10:35:27,166][175731] Updated weights for policy 0, policy_version 38400 (0.0007) [2023-03-07 10:35:27,944][175731] Updated weights for policy 0, policy_version 38410 (0.0006) [2023-03-07 10:35:28,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12714.6, 300 sec: 12763.6). Total num frames: 39335936. Throughput: 0: 12724.8. Samples: 39309709. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:35:28,322][175405] Avg episode reward: [(0, '57.131')] [2023-03-07 10:35:28,767][175731] Updated weights for policy 0, policy_version 38420 (0.0007) [2023-03-07 10:35:29,561][175731] Updated weights for policy 0, policy_version 38430 (0.0007) [2023-03-07 10:35:30,369][175731] Updated weights for policy 0, policy_version 38440 (0.0007) [2023-03-07 10:35:31,178][175731] Updated weights for policy 0, policy_version 38450 (0.0007) [2023-03-07 10:35:31,974][175731] Updated weights for policy 0, policy_version 38460 (0.0006) [2023-03-07 10:35:32,767][175731] Updated weights for policy 0, policy_version 38470 (0.0007) [2023-03-07 10:35:33,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12763.6). Total num frames: 39399424. Throughput: 0: 12727.7. Samples: 39386459. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:35:33,332][175405] Avg episode reward: [(0, '62.331')] [2023-03-07 10:35:33,595][175731] Updated weights for policy 0, policy_version 38480 (0.0006) [2023-03-07 10:35:34,398][175731] Updated weights for policy 0, policy_version 38490 (0.0006) [2023-03-07 10:35:35,194][175731] Updated weights for policy 0, policy_version 38500 (0.0006) [2023-03-07 10:35:35,995][175731] Updated weights for policy 0, policy_version 38510 (0.0006) [2023-03-07 10:35:36,815][175731] Updated weights for policy 0, policy_version 38520 (0.0007) [2023-03-07 10:35:37,606][175731] Updated weights for policy 0, policy_version 38530 (0.0006) [2023-03-07 10:35:38,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12748.8, 300 sec: 12763.6). Total num frames: 39463936. Throughput: 0: 12734.2. Samples: 39463022. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:35:38,326][175405] Avg episode reward: [(0, '48.076')] [2023-03-07 10:35:38,407][175731] Updated weights for policy 0, policy_version 38540 (0.0006) [2023-03-07 10:35:39,203][175731] Updated weights for policy 0, policy_version 38550 (0.0006) [2023-03-07 10:35:40,022][175731] Updated weights for policy 0, policy_version 38560 (0.0006) [2023-03-07 10:35:40,852][175731] Updated weights for policy 0, policy_version 38570 (0.0007) [2023-03-07 10:35:41,649][175731] Updated weights for policy 0, policy_version 38580 (0.0007) [2023-03-07 10:35:42,454][175731] Updated weights for policy 0, policy_version 38590 (0.0006) [2023-03-07 10:35:43,266][175731] Updated weights for policy 0, policy_version 38600 (0.0007) [2023-03-07 10:35:43,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12756.6). Total num frames: 39526400. Throughput: 0: 12718.6. Samples: 39500828. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:35:43,321][175405] Avg episode reward: [(0, '65.292')] [2023-03-07 10:35:44,061][175731] Updated weights for policy 0, policy_version 38610 (0.0007) [2023-03-07 10:35:44,860][175731] Updated weights for policy 0, policy_version 38620 (0.0006) [2023-03-07 10:35:45,668][175731] Updated weights for policy 0, policy_version 38630 (0.0006) [2023-03-07 10:35:46,457][175731] Updated weights for policy 0, policy_version 38640 (0.0007) [2023-03-07 10:35:47,267][175731] Updated weights for policy 0, policy_version 38650 (0.0006) [2023-03-07 10:35:48,067][175731] Updated weights for policy 0, policy_version 38660 (0.0007) [2023-03-07 10:35:48,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12731.8, 300 sec: 12756.6). Total num frames: 39590912. Throughput: 0: 12720.2. Samples: 39577251. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:35:48,321][175405] Avg episode reward: [(0, '61.608')] [2023-03-07 10:35:48,325][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000038663_39590912.pth... [2023-03-07 10:35:48,358][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000035675_36531200.pth [2023-03-07 10:35:48,880][175731] Updated weights for policy 0, policy_version 38670 (0.0007) [2023-03-07 10:35:49,682][175731] Updated weights for policy 0, policy_version 38680 (0.0005) [2023-03-07 10:35:50,481][175731] Updated weights for policy 0, policy_version 38690 (0.0006) [2023-03-07 10:35:51,277][175731] Updated weights for policy 0, policy_version 38700 (0.0006) [2023-03-07 10:35:52,061][175731] Updated weights for policy 0, policy_version 38710 (0.0006) [2023-03-07 10:35:52,890][175731] Updated weights for policy 0, policy_version 38720 (0.0006) [2023-03-07 10:35:53,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12731.7, 300 sec: 12756.6). Total num frames: 39654400. Throughput: 0: 12734.6. Samples: 39654005. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:35:53,322][175405] Avg episode reward: [(0, '78.546')] [2023-03-07 10:35:53,693][175731] Updated weights for policy 0, policy_version 38730 (0.0008) [2023-03-07 10:35:54,475][175731] Updated weights for policy 0, policy_version 38740 (0.0006) [2023-03-07 10:35:55,284][175731] Updated weights for policy 0, policy_version 38750 (0.0006) [2023-03-07 10:35:56,085][175731] Updated weights for policy 0, policy_version 38760 (0.0005) [2023-03-07 10:35:56,881][175731] Updated weights for policy 0, policy_version 38770 (0.0007) [2023-03-07 10:35:57,689][175731] Updated weights for policy 0, policy_version 38780 (0.0006) [2023-03-07 10:35:58,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12756.6). Total num frames: 39717888. Throughput: 0: 12740.2. 
Samples: 39692233. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:35:58,332][175405] Avg episode reward: [(0, '71.513')] [2023-03-07 10:35:58,493][175731] Updated weights for policy 0, policy_version 38790 (0.0006) [2023-03-07 10:35:59,301][175731] Updated weights for policy 0, policy_version 38800 (0.0006) [2023-03-07 10:36:00,098][175731] Updated weights for policy 0, policy_version 38810 (0.0006) [2023-03-07 10:36:00,907][175731] Updated weights for policy 0, policy_version 38820 (0.0006) [2023-03-07 10:36:01,723][175731] Updated weights for policy 0, policy_version 38830 (0.0006) [2023-03-07 10:36:02,511][175731] Updated weights for policy 0, policy_version 38840 (0.0008) [2023-03-07 10:36:03,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12748.8, 300 sec: 12756.6). Total num frames: 39782400. Throughput: 0: 12746.4. Samples: 39768679. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 10:36:03,322][175731] Updated weights for policy 0, policy_version 38850 (0.0006) [2023-03-07 10:36:03,332][175405] Avg episode reward: [(0, '107.839')] [2023-03-07 10:36:04,133][175731] Updated weights for policy 0, policy_version 38860 (0.0006) [2023-03-07 10:36:04,926][175731] Updated weights for policy 0, policy_version 38870 (0.0007) [2023-03-07 10:36:05,719][175731] Updated weights for policy 0, policy_version 38880 (0.0006) [2023-03-07 10:36:06,533][175731] Updated weights for policy 0, policy_version 38890 (0.0008) [2023-03-07 10:36:07,349][175731] Updated weights for policy 0, policy_version 38900 (0.0006) [2023-03-07 10:36:08,149][175731] Updated weights for policy 0, policy_version 38910 (0.0006) [2023-03-07 10:36:08,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12748.8, 300 sec: 12756.6). Total num frames: 39845888. Throughput: 0: 12745.8. Samples: 39844967. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 10:36:08,332][175405] Avg episode reward: [(0, '78.634')] [2023-03-07 10:36:08,947][175731] Updated weights for policy 0, policy_version 38920 (0.0006) [2023-03-07 10:36:09,731][175731] Updated weights for policy 0, policy_version 38930 (0.0006) [2023-03-07 10:36:10,553][175731] Updated weights for policy 0, policy_version 38940 (0.0006) [2023-03-07 10:36:11,358][175731] Updated weights for policy 0, policy_version 38950 (0.0006) [2023-03-07 10:36:12,160][175731] Updated weights for policy 0, policy_version 38960 (0.0006) [2023-03-07 10:36:12,969][175731] Updated weights for policy 0, policy_version 38970 (0.0006) [2023-03-07 10:36:13,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12748.8, 300 sec: 12753.1). Total num frames: 39909376. Throughput: 0: 12746.7. Samples: 39883309. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 10:36:13,332][175405] Avg episode reward: [(0, '77.059')] [2023-03-07 10:36:13,783][175731] Updated weights for policy 0, policy_version 38980 (0.0006) [2023-03-07 10:36:14,586][175731] Updated weights for policy 0, policy_version 38990 (0.0005) [2023-03-07 10:36:15,394][175731] Updated weights for policy 0, policy_version 39000 (0.0007) [2023-03-07 10:36:16,209][175731] Updated weights for policy 0, policy_version 39010 (0.0006) [2023-03-07 10:36:17,001][175731] Updated weights for policy 0, policy_version 39020 (0.0007) [2023-03-07 10:36:17,806][175731] Updated weights for policy 0, policy_version 39030 (0.0006) [2023-03-07 10:36:18,321][175405] Fps is (10 sec: 12697.4, 60 sec: 12748.8, 300 sec: 12753.1). Total num frames: 39972864. Throughput: 0: 12736.6. Samples: 39959606. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 10:36:18,332][175405] Avg episode reward: [(0, '76.551')] [2023-03-07 10:36:18,596][175731] Updated weights for policy 0, policy_version 39040 (0.0007) [2023-03-07 10:36:19,390][175731] Updated weights for policy 0, policy_version 39050 (0.0006) [2023-03-07 10:36:20,195][175731] Updated weights for policy 0, policy_version 39060 (0.0006) [2023-03-07 10:36:21,004][175731] Updated weights for policy 0, policy_version 39070 (0.0006) [2023-03-07 10:36:21,808][175731] Updated weights for policy 0, policy_version 39080 (0.0006) [2023-03-07 10:36:22,598][175731] Updated weights for policy 0, policy_version 39090 (0.0006) [2023-03-07 10:36:23,321][175405] Fps is (10 sec: 12697.8, 60 sec: 12731.8, 300 sec: 12749.7). Total num frames: 40036352. Throughput: 0: 12739.0. Samples: 40036274. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 10:36:23,332][175405] Avg episode reward: [(0, '70.679')] [2023-03-07 10:36:23,406][175731] Updated weights for policy 0, policy_version 39100 (0.0006) [2023-03-07 10:36:24,197][175731] Updated weights for policy 0, policy_version 39110 (0.0006) [2023-03-07 10:36:25,006][175731] Updated weights for policy 0, policy_version 39120 (0.0006) [2023-03-07 10:36:25,807][175731] Updated weights for policy 0, policy_version 39130 (0.0007) [2023-03-07 10:36:26,617][175731] Updated weights for policy 0, policy_version 39140 (0.0007) [2023-03-07 10:36:27,438][175731] Updated weights for policy 0, policy_version 39150 (0.0006) [2023-03-07 10:36:28,242][175731] Updated weights for policy 0, policy_version 39160 (0.0006) [2023-03-07 10:36:28,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12731.7, 300 sec: 12749.7). Total num frames: 40099840. Throughput: 0: 12747.5. Samples: 40074465. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:36:28,332][175405] Avg episode reward: [(0, '69.506')] [2023-03-07 10:36:29,064][175731] Updated weights for policy 0, policy_version 39170 (0.0007) [2023-03-07 10:36:29,870][175731] Updated weights for policy 0, policy_version 39180 (0.0006) [2023-03-07 10:36:30,665][175731] Updated weights for policy 0, policy_version 39190 (0.0006) [2023-03-07 10:36:31,470][175731] Updated weights for policy 0, policy_version 39200 (0.0006) [2023-03-07 10:36:32,286][175731] Updated weights for policy 0, policy_version 39210 (0.0006) [2023-03-07 10:36:33,093][175731] Updated weights for policy 0, policy_version 39220 (0.0007) [2023-03-07 10:36:33,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12746.2). Total num frames: 40163328. Throughput: 0: 12739.8. Samples: 40150540. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:36:33,332][175405] Avg episode reward: [(0, '83.668')] [2023-03-07 10:36:33,878][175731] Updated weights for policy 0, policy_version 39230 (0.0006) [2023-03-07 10:36:34,685][175731] Updated weights for policy 0, policy_version 39240 (0.0006) [2023-03-07 10:36:35,487][175731] Updated weights for policy 0, policy_version 39250 (0.0006) [2023-03-07 10:36:36,314][175731] Updated weights for policy 0, policy_version 39260 (0.0006) [2023-03-07 10:36:37,122][175731] Updated weights for policy 0, policy_version 39270 (0.0006) [2023-03-07 10:36:37,919][175731] Updated weights for policy 0, policy_version 39280 (0.0006) [2023-03-07 10:36:38,321][175405] Fps is (10 sec: 12697.8, 60 sec: 12714.7, 300 sec: 12746.2). Total num frames: 40226816. Throughput: 0: 12727.8. Samples: 40226754. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:36:38,332][175405] Avg episode reward: [(0, '105.937')] [2023-03-07 10:36:38,719][175731] Updated weights for policy 0, policy_version 39290 (0.0006) [2023-03-07 10:36:39,538][175731] Updated weights for policy 0, policy_version 39300 (0.0007) [2023-03-07 10:36:40,336][175731] Updated weights for policy 0, policy_version 39310 (0.0006) [2023-03-07 10:36:41,135][175731] Updated weights for policy 0, policy_version 39320 (0.0007) [2023-03-07 10:36:41,943][175731] Updated weights for policy 0, policy_version 39330 (0.0007) [2023-03-07 10:36:42,745][175731] Updated weights for policy 0, policy_version 39340 (0.0006) [2023-03-07 10:36:43,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12746.2). Total num frames: 40290304. Throughput: 0: 12725.8. Samples: 40264892. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:36:43,332][175405] Avg episode reward: [(0, '75.521')] [2023-03-07 10:36:43,549][175731] Updated weights for policy 0, policy_version 39350 (0.0006) [2023-03-07 10:36:44,369][175731] Updated weights for policy 0, policy_version 39360 (0.0006) [2023-03-07 10:36:45,196][175731] Updated weights for policy 0, policy_version 39370 (0.0007) [2023-03-07 10:36:45,993][175731] Updated weights for policy 0, policy_version 39380 (0.0006) [2023-03-07 10:36:46,798][175731] Updated weights for policy 0, policy_version 39390 (0.0006) [2023-03-07 10:36:47,619][175731] Updated weights for policy 0, policy_version 39400 (0.0005) [2023-03-07 10:36:48,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12742.7). Total num frames: 40353792. Throughput: 0: 12715.9. Samples: 40340893. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:36:48,332][175405] Avg episode reward: [(0, '83.979')] [2023-03-07 10:36:48,409][175731] Updated weights for policy 0, policy_version 39410 (0.0007) [2023-03-07 10:36:49,242][175731] Updated weights for policy 0, policy_version 39420 (0.0006) [2023-03-07 10:36:50,031][175731] Updated weights for policy 0, policy_version 39430 (0.0007) [2023-03-07 10:36:50,837][175731] Updated weights for policy 0, policy_version 39440 (0.0006) [2023-03-07 10:36:51,630][175731] Updated weights for policy 0, policy_version 39450 (0.0006) [2023-03-07 10:36:52,431][175731] Updated weights for policy 0, policy_version 39460 (0.0006) [2023-03-07 10:36:53,262][175731] Updated weights for policy 0, policy_version 39470 (0.0006) [2023-03-07 10:36:53,321][175405] Fps is (10 sec: 12800.2, 60 sec: 12731.7, 300 sec: 12746.2). Total num frames: 40418304. Throughput: 0: 12717.1. Samples: 40417234. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:36:53,332][175405] Avg episode reward: [(0, '74.768')] [2023-03-07 10:36:54,061][175731] Updated weights for policy 0, policy_version 39480 (0.0006) [2023-03-07 10:36:54,851][175731] Updated weights for policy 0, policy_version 39490 (0.0006) [2023-03-07 10:36:55,653][175731] Updated weights for policy 0, policy_version 39500 (0.0007) [2023-03-07 10:36:56,461][175731] Updated weights for policy 0, policy_version 39510 (0.0006) [2023-03-07 10:36:57,303][175731] Updated weights for policy 0, policy_version 39520 (0.0006) [2023-03-07 10:36:58,082][175731] Updated weights for policy 0, policy_version 39530 (0.0006) [2023-03-07 10:36:58,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12714.7, 300 sec: 12739.3). Total num frames: 40480768. Throughput: 0: 12715.3. Samples: 40455495. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:36:58,322][175405] Avg episode reward: [(0, '68.388')] [2023-03-07 10:36:58,885][175731] Updated weights for policy 0, policy_version 39540 (0.0005) [2023-03-07 10:36:59,698][175731] Updated weights for policy 0, policy_version 39550 (0.0007) [2023-03-07 10:37:00,492][175731] Updated weights for policy 0, policy_version 39560 (0.0006) [2023-03-07 10:37:01,317][175731] Updated weights for policy 0, policy_version 39570 (0.0006) [2023-03-07 10:37:02,105][175731] Updated weights for policy 0, policy_version 39580 (0.0007) [2023-03-07 10:37:02,935][175731] Updated weights for policy 0, policy_version 39590 (0.0007) [2023-03-07 10:37:03,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12742.7). Total num frames: 40545280. Throughput: 0: 12712.3. Samples: 40531659. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:37:03,321][175405] Avg episode reward: [(0, '85.891')] [2023-03-07 10:37:03,739][175731] Updated weights for policy 0, policy_version 39600 (0.0006) [2023-03-07 10:37:04,530][175731] Updated weights for policy 0, policy_version 39610 (0.0006) [2023-03-07 10:37:05,326][175731] Updated weights for policy 0, policy_version 39620 (0.0007) [2023-03-07 10:37:06,139][175731] Updated weights for policy 0, policy_version 39630 (0.0005) [2023-03-07 10:37:06,937][175731] Updated weights for policy 0, policy_version 39640 (0.0007) [2023-03-07 10:37:07,762][175731] Updated weights for policy 0, policy_version 39650 (0.0006) [2023-03-07 10:37:08,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12714.7, 300 sec: 12739.3). Total num frames: 40608768. Throughput: 0: 12700.3. Samples: 40607790. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:37:08,322][175405] Avg episode reward: [(0, '78.143')] [2023-03-07 10:37:08,569][175731] Updated weights for policy 0, policy_version 39660 (0.0007) [2023-03-07 10:37:09,386][175731] Updated weights for policy 0, policy_version 39670 (0.0006) [2023-03-07 10:37:10,190][175731] Updated weights for policy 0, policy_version 39680 (0.0006) [2023-03-07 10:37:10,997][175731] Updated weights for policy 0, policy_version 39690 (0.0006) [2023-03-07 10:37:11,807][175731] Updated weights for policy 0, policy_version 39700 (0.0006) [2023-03-07 10:37:12,609][175731] Updated weights for policy 0, policy_version 39710 (0.0006) [2023-03-07 10:37:13,321][175405] Fps is (10 sec: 12595.2, 60 sec: 12697.6, 300 sec: 12735.8). Total num frames: 40671232. Throughput: 0: 12693.0. Samples: 40645647. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:37:13,321][175405] Avg episode reward: [(0, '92.813')] [2023-03-07 10:37:13,417][175731] Updated weights for policy 0, policy_version 39720 (0.0007) [2023-03-07 10:37:14,226][175731] Updated weights for policy 0, policy_version 39730 (0.0007) [2023-03-07 10:37:15,036][175731] Updated weights for policy 0, policy_version 39740 (0.0007) [2023-03-07 10:37:15,838][175731] Updated weights for policy 0, policy_version 39750 (0.0006) [2023-03-07 10:37:16,642][175731] Updated weights for policy 0, policy_version 39760 (0.0007) [2023-03-07 10:37:17,444][175731] Updated weights for policy 0, policy_version 39770 (0.0006) [2023-03-07 10:37:18,243][175731] Updated weights for policy 0, policy_version 39780 (0.0006) [2023-03-07 10:37:18,321][175405] Fps is (10 sec: 12595.1, 60 sec: 12697.6, 300 sec: 12732.3). Total num frames: 40734720. Throughput: 0: 12695.3. Samples: 40721830. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:37:18,322][175405] Avg episode reward: [(0, '80.681')] [2023-03-07 10:37:19,054][175731] Updated weights for policy 0, policy_version 39790 (0.0007) [2023-03-07 10:37:19,866][175731] Updated weights for policy 0, policy_version 39800 (0.0006) [2023-03-07 10:37:20,674][175731] Updated weights for policy 0, policy_version 39810 (0.0006) [2023-03-07 10:37:21,487][175731] Updated weights for policy 0, policy_version 39820 (0.0006) [2023-03-07 10:37:22,293][175731] Updated weights for policy 0, policy_version 39830 (0.0006) [2023-03-07 10:37:23,095][175731] Updated weights for policy 0, policy_version 39840 (0.0007) [2023-03-07 10:37:23,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12697.6, 300 sec: 12732.3). Total num frames: 40798208. Throughput: 0: 12697.1. Samples: 40798127. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:37:23,322][175405] Avg episode reward: [(0, '76.491')] [2023-03-07 10:37:23,907][175731] Updated weights for policy 0, policy_version 39850 (0.0008) [2023-03-07 10:37:24,716][175731] Updated weights for policy 0, policy_version 39860 (0.0007) [2023-03-07 10:37:25,509][175731] Updated weights for policy 0, policy_version 39870 (0.0006) [2023-03-07 10:37:26,312][175731] Updated weights for policy 0, policy_version 39880 (0.0006) [2023-03-07 10:37:27,107][175731] Updated weights for policy 0, policy_version 39890 (0.0007) [2023-03-07 10:37:27,895][175731] Updated weights for policy 0, policy_version 39900 (0.0006) [2023-03-07 10:37:28,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12714.7, 300 sec: 12735.8). Total num frames: 40862720. Throughput: 0: 12700.2. Samples: 40836401. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:37:28,322][175405] Avg episode reward: [(0, '100.430')] [2023-03-07 10:37:28,714][175731] Updated weights for policy 0, policy_version 39910 (0.0006) [2023-03-07 10:37:29,502][175731] Updated weights for policy 0, policy_version 39920 (0.0006) [2023-03-07 10:37:30,319][175731] Updated weights for policy 0, policy_version 39930 (0.0006) [2023-03-07 10:37:31,126][175731] Updated weights for policy 0, policy_version 39940 (0.0006) [2023-03-07 10:37:31,925][175731] Updated weights for policy 0, policy_version 39950 (0.0006) [2023-03-07 10:37:32,737][175731] Updated weights for policy 0, policy_version 39960 (0.0006) [2023-03-07 10:37:33,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12714.6, 300 sec: 12732.3). Total num frames: 40926208. Throughput: 0: 12711.9. Samples: 40912931. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:37:33,322][175405] Avg episode reward: [(0, '87.101')] [2023-03-07 10:37:33,552][175731] Updated weights for policy 0, policy_version 39970 (0.0008) [2023-03-07 10:37:34,360][175731] Updated weights for policy 0, policy_version 39980 (0.0006) [2023-03-07 10:37:35,162][175731] Updated weights for policy 0, policy_version 39990 (0.0005) [2023-03-07 10:37:35,968][175731] Updated weights for policy 0, policy_version 40000 (0.0007) [2023-03-07 10:37:36,792][175731] Updated weights for policy 0, policy_version 40010 (0.0006) [2023-03-07 10:37:37,589][175731] Updated weights for policy 0, policy_version 40020 (0.0007) [2023-03-07 10:37:38,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12714.6, 300 sec: 12732.3). Total num frames: 40989696. Throughput: 0: 12700.2. Samples: 40988744. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:37:38,322][175405] Avg episode reward: [(0, '88.147')] [2023-03-07 10:37:38,402][175731] Updated weights for policy 0, policy_version 40030 (0.0006) [2023-03-07 10:37:39,197][175731] Updated weights for policy 0, policy_version 40040 (0.0006) [2023-03-07 10:37:40,007][175731] Updated weights for policy 0, policy_version 40050 (0.0007) [2023-03-07 10:37:40,794][175731] Updated weights for policy 0, policy_version 40060 (0.0006) [2023-03-07 10:37:41,602][175731] Updated weights for policy 0, policy_version 40070 (0.0006) [2023-03-07 10:37:42,414][175731] Updated weights for policy 0, policy_version 40080 (0.0006) [2023-03-07 10:37:43,202][175731] Updated weights for policy 0, policy_version 40090 (0.0005) [2023-03-07 10:37:43,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12728.8). Total num frames: 41053184. Throughput: 0: 12702.1. Samples: 41027090. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 10:37:43,322][175405] Avg episode reward: [(0, '106.757')] [2023-03-07 10:37:44,014][175731] Updated weights for policy 0, policy_version 40100 (0.0007) [2023-03-07 10:37:44,808][175731] Updated weights for policy 0, policy_version 40110 (0.0007) [2023-03-07 10:37:45,616][175731] Updated weights for policy 0, policy_version 40120 (0.0006) [2023-03-07 10:37:46,429][175731] Updated weights for policy 0, policy_version 40130 (0.0007) [2023-03-07 10:37:47,221][175731] Updated weights for policy 0, policy_version 40140 (0.0007) [2023-03-07 10:37:48,014][175731] Updated weights for policy 0, policy_version 40150 (0.0006) [2023-03-07 10:37:48,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12714.7, 300 sec: 12728.8). Total num frames: 41116672. Throughput: 0: 12708.9. Samples: 41103561. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 10:37:48,322][175405] Avg episode reward: [(0, '54.554')] [2023-03-07 10:37:48,331][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000040154_41117696.pth... [2023-03-07 10:37:48,364][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000037171_38063104.pth [2023-03-07 10:37:48,829][175731] Updated weights for policy 0, policy_version 40160 (0.0007) [2023-03-07 10:37:49,637][175731] Updated weights for policy 0, policy_version 40170 (0.0008) [2023-03-07 10:37:50,437][175731] Updated weights for policy 0, policy_version 40180 (0.0007) [2023-03-07 10:37:51,232][175731] Updated weights for policy 0, policy_version 40190 (0.0006) [2023-03-07 10:37:52,033][175731] Updated weights for policy 0, policy_version 40200 (0.0006) [2023-03-07 10:37:52,830][175731] Updated weights for policy 0, policy_version 40210 (0.0006) [2023-03-07 10:37:53,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12714.6, 300 sec: 12732.3). Total num frames: 41181184. Throughput: 0: 12723.3. Samples: 41180341. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 10:37:53,322][175405] Avg episode reward: [(0, '50.660')] [2023-03-07 10:37:53,628][175731] Updated weights for policy 0, policy_version 40220 (0.0006) [2023-03-07 10:37:54,443][175731] Updated weights for policy 0, policy_version 40230 (0.0007) [2023-03-07 10:37:55,236][175731] Updated weights for policy 0, policy_version 40240 (0.0007) [2023-03-07 10:37:56,037][175731] Updated weights for policy 0, policy_version 40250 (0.0007) [2023-03-07 10:37:56,857][175731] Updated weights for policy 0, policy_version 40260 (0.0007) [2023-03-07 10:37:57,649][175731] Updated weights for policy 0, policy_version 40270 (0.0007) [2023-03-07 10:37:58,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12731.7, 300 sec: 12728.8). Total num frames: 41244672. Throughput: 0: 12730.1. 
Samples: 41218504. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 10:37:58,322][175405] Avg episode reward: [(0, '63.890')] [2023-03-07 10:37:58,466][175731] Updated weights for policy 0, policy_version 40280 (0.0007) [2023-03-07 10:37:59,261][175731] Updated weights for policy 0, policy_version 40290 (0.0007) [2023-03-07 10:38:00,049][175731] Updated weights for policy 0, policy_version 40300 (0.0006) [2023-03-07 10:38:00,867][175731] Updated weights for policy 0, policy_version 40310 (0.0006) [2023-03-07 10:38:01,659][175731] Updated weights for policy 0, policy_version 40320 (0.0007) [2023-03-07 10:38:02,474][175731] Updated weights for policy 0, policy_version 40330 (0.0006) [2023-03-07 10:38:03,275][175731] Updated weights for policy 0, policy_version 40340 (0.0006) [2023-03-07 10:38:03,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12714.6, 300 sec: 12728.8). Total num frames: 41308160. Throughput: 0: 12738.4. Samples: 41295058. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 10:38:03,322][175405] Avg episode reward: [(0, '60.584')] [2023-03-07 10:38:04,078][175731] Updated weights for policy 0, policy_version 40350 (0.0008) [2023-03-07 10:38:04,880][175731] Updated weights for policy 0, policy_version 40360 (0.0007) [2023-03-07 10:38:05,689][175731] Updated weights for policy 0, policy_version 40370 (0.0006) [2023-03-07 10:38:06,494][175731] Updated weights for policy 0, policy_version 40380 (0.0006) [2023-03-07 10:38:07,315][175731] Updated weights for policy 0, policy_version 40390 (0.0006) [2023-03-07 10:38:08,095][175731] Updated weights for policy 0, policy_version 40400 (0.0007) [2023-03-07 10:38:08,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12714.7, 300 sec: 12725.4). Total num frames: 41371648. Throughput: 0: 12737.0. Samples: 41371294. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 10:38:08,321][175405] Avg episode reward: [(0, '58.481')] [2023-03-07 10:38:08,918][175731] Updated weights for policy 0, policy_version 40410 (0.0007) [2023-03-07 10:38:09,734][175731] Updated weights for policy 0, policy_version 40420 (0.0006) [2023-03-07 10:38:10,545][175731] Updated weights for policy 0, policy_version 40430 (0.0006) [2023-03-07 10:38:11,333][175731] Updated weights for policy 0, policy_version 40440 (0.0007) [2023-03-07 10:38:12,148][175731] Updated weights for policy 0, policy_version 40450 (0.0007) [2023-03-07 10:38:12,934][175731] Updated weights for policy 0, policy_version 40460 (0.0007) [2023-03-07 10:38:13,321][175405] Fps is (10 sec: 12697.8, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 41435136. Throughput: 0: 12733.6. Samples: 41409411. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 10:38:13,321][175405] Avg episode reward: [(0, '68.540')] [2023-03-07 10:38:13,740][175731] Updated weights for policy 0, policy_version 40470 (0.0006) [2023-03-07 10:38:14,546][175731] Updated weights for policy 0, policy_version 40480 (0.0007) [2023-03-07 10:38:15,346][175731] Updated weights for policy 0, policy_version 40490 (0.0006) [2023-03-07 10:38:16,149][175731] Updated weights for policy 0, policy_version 40500 (0.0006) [2023-03-07 10:38:16,961][175731] Updated weights for policy 0, policy_version 40510 (0.0007) [2023-03-07 10:38:17,755][175731] Updated weights for policy 0, policy_version 40520 (0.0006) [2023-03-07 10:38:18,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 41498624. Throughput: 0: 12731.1. Samples: 41485832. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 10:38:18,322][175405] Avg episode reward: [(0, '62.558')] [2023-03-07 10:38:18,559][175731] Updated weights for policy 0, policy_version 40530 (0.0006) [2023-03-07 10:38:19,374][175731] Updated weights for policy 0, policy_version 40540 (0.0005) [2023-03-07 10:38:20,176][175731] Updated weights for policy 0, policy_version 40550 (0.0006) [2023-03-07 10:38:20,986][175731] Updated weights for policy 0, policy_version 40560 (0.0007) [2023-03-07 10:38:21,781][175731] Updated weights for policy 0, policy_version 40570 (0.0006) [2023-03-07 10:38:22,585][175731] Updated weights for policy 0, policy_version 40580 (0.0007) [2023-03-07 10:38:23,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12721.9). Total num frames: 41562112. Throughput: 0: 12744.5. Samples: 41562248. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 10:38:23,322][175405] Avg episode reward: [(0, '66.393')] [2023-03-07 10:38:23,383][175731] Updated weights for policy 0, policy_version 40590 (0.0007) [2023-03-07 10:38:24,181][175731] Updated weights for policy 0, policy_version 40600 (0.0008) [2023-03-07 10:38:24,988][175731] Updated weights for policy 0, policy_version 40610 (0.0007) [2023-03-07 10:38:25,793][175731] Updated weights for policy 0, policy_version 40620 (0.0006) [2023-03-07 10:38:26,602][175731] Updated weights for policy 0, policy_version 40630 (0.0007) [2023-03-07 10:38:27,395][175731] Updated weights for policy 0, policy_version 40640 (0.0006) [2023-03-07 10:38:28,202][175731] Updated weights for policy 0, policy_version 40650 (0.0006) [2023-03-07 10:38:28,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12731.8, 300 sec: 12725.4). Total num frames: 41626624. Throughput: 0: 12743.0. Samples: 41600525. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 10:38:28,321][175405] Avg episode reward: [(0, '60.874')] [2023-03-07 10:38:29,007][175731] Updated weights for policy 0, policy_version 40660 (0.0006) [2023-03-07 10:38:29,833][175731] Updated weights for policy 0, policy_version 40670 (0.0007) [2023-03-07 10:38:30,630][175731] Updated weights for policy 0, policy_version 40680 (0.0005) [2023-03-07 10:38:31,431][175731] Updated weights for policy 0, policy_version 40690 (0.0006) [2023-03-07 10:38:32,234][175731] Updated weights for policy 0, policy_version 40700 (0.0007) [2023-03-07 10:38:33,032][175731] Updated weights for policy 0, policy_version 40710 (0.0006) [2023-03-07 10:38:33,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 41690112. Throughput: 0: 12741.5. Samples: 41676930. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:38:33,322][175405] Avg episode reward: [(0, '58.861')] [2023-03-07 10:38:33,822][175731] Updated weights for policy 0, policy_version 40720 (0.0006) [2023-03-07 10:38:34,619][175731] Updated weights for policy 0, policy_version 40730 (0.0007) [2023-03-07 10:38:35,434][175731] Updated weights for policy 0, policy_version 40740 (0.0006) [2023-03-07 10:38:36,246][175731] Updated weights for policy 0, policy_version 40750 (0.0006) [2023-03-07 10:38:37,044][175731] Updated weights for policy 0, policy_version 40760 (0.0007) [2023-03-07 10:38:37,870][175731] Updated weights for policy 0, policy_version 40770 (0.0007) [2023-03-07 10:38:38,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 41753600. Throughput: 0: 12731.1. Samples: 41753240. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:38:38,322][175405] Avg episode reward: [(0, '92.277')] [2023-03-07 10:38:38,666][175731] Updated weights for policy 0, policy_version 40780 (0.0007) [2023-03-07 10:38:39,459][175731] Updated weights for policy 0, policy_version 40790 (0.0007) [2023-03-07 10:38:40,261][175731] Updated weights for policy 0, policy_version 40800 (0.0006) [2023-03-07 10:38:41,055][175731] Updated weights for policy 0, policy_version 40810 (0.0007) [2023-03-07 10:38:41,855][175731] Updated weights for policy 0, policy_version 40820 (0.0006) [2023-03-07 10:38:42,662][175731] Updated weights for policy 0, policy_version 40830 (0.0006) [2023-03-07 10:38:43,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12728.8). Total num frames: 41818112. Throughput: 0: 12740.9. Samples: 41791846. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:38:43,322][175405] Avg episode reward: [(0, '56.888')] [2023-03-07 10:38:43,471][175731] Updated weights for policy 0, policy_version 40840 (0.0006) [2023-03-07 10:38:44,262][175731] Updated weights for policy 0, policy_version 40850 (0.0007) [2023-03-07 10:38:45,065][175731] Updated weights for policy 0, policy_version 40860 (0.0007) [2023-03-07 10:38:45,873][175731] Updated weights for policy 0, policy_version 40870 (0.0006) [2023-03-07 10:38:46,654][175731] Updated weights for policy 0, policy_version 40880 (0.0007) [2023-03-07 10:38:47,458][175731] Updated weights for policy 0, policy_version 40890 (0.0006) [2023-03-07 10:38:48,274][175731] Updated weights for policy 0, policy_version 40900 (0.0006) [2023-03-07 10:38:48,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12748.8, 300 sec: 12725.4). Total num frames: 41881600. Throughput: 0: 12741.5. Samples: 41868425. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:38:48,322][175405] Avg episode reward: [(0, '63.897')] [2023-03-07 10:38:49,064][175731] Updated weights for policy 0, policy_version 40910 (0.0006) [2023-03-07 10:38:49,876][175731] Updated weights for policy 0, policy_version 40920 (0.0006) [2023-03-07 10:38:50,685][175731] Updated weights for policy 0, policy_version 40930 (0.0006) [2023-03-07 10:38:51,481][175731] Updated weights for policy 0, policy_version 40940 (0.0006) [2023-03-07 10:38:52,292][175731] Updated weights for policy 0, policy_version 40950 (0.0006) [2023-03-07 10:38:53,089][175731] Updated weights for policy 0, policy_version 40960 (0.0006) [2023-03-07 10:38:53,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 41945088. Throughput: 0: 12747.9. Samples: 41944951. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:38:53,322][175405] Avg episode reward: [(0, '60.731')] [2023-03-07 10:38:53,913][175731] Updated weights for policy 0, policy_version 40970 (0.0007) [2023-03-07 10:38:54,703][175731] Updated weights for policy 0, policy_version 40980 (0.0007) [2023-03-07 10:38:55,501][175731] Updated weights for policy 0, policy_version 40990 (0.0006) [2023-03-07 10:38:56,304][175731] Updated weights for policy 0, policy_version 41000 (0.0006) [2023-03-07 10:38:57,112][175731] Updated weights for policy 0, policy_version 41010 (0.0006) [2023-03-07 10:38:57,587][175680] KL-divergence is very high: 133.7207 [2023-03-07 10:38:57,930][175731] Updated weights for policy 0, policy_version 41020 (0.0007) [2023-03-07 10:38:58,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12731.7, 300 sec: 12725.4). Total num frames: 42008576. Throughput: 0: 12752.1. Samples: 41983257. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:38:58,322][175405] Avg episode reward: [(0, '58.657')] [2023-03-07 10:38:58,726][175731] Updated weights for policy 0, policy_version 41030 (0.0007) [2023-03-07 10:38:59,519][175731] Updated weights for policy 0, policy_version 41040 (0.0006) [2023-03-07 10:39:00,319][175731] Updated weights for policy 0, policy_version 41050 (0.0006) [2023-03-07 10:39:01,109][175731] Updated weights for policy 0, policy_version 41060 (0.0007) [2023-03-07 10:39:01,898][175731] Updated weights for policy 0, policy_version 41070 (0.0006) [2023-03-07 10:39:02,686][175731] Updated weights for policy 0, policy_version 41080 (0.0006) [2023-03-07 10:39:03,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12748.8, 300 sec: 12728.8). Total num frames: 42073088. Throughput: 0: 12760.9. Samples: 42060074. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:39:03,322][175405] Avg episode reward: [(0, '37.037')] [2023-03-07 10:39:03,516][175731] Updated weights for policy 0, policy_version 41090 (0.0006) [2023-03-07 10:39:04,310][175731] Updated weights for policy 0, policy_version 41100 (0.0006) [2023-03-07 10:39:05,111][175731] Updated weights for policy 0, policy_version 41110 (0.0006) [2023-03-07 10:39:05,909][175731] Updated weights for policy 0, policy_version 41120 (0.0006) [2023-03-07 10:39:06,705][175731] Updated weights for policy 0, policy_version 41130 (0.0006) [2023-03-07 10:39:07,489][175731] Updated weights for policy 0, policy_version 41140 (0.0005) [2023-03-07 10:39:08,291][175731] Updated weights for policy 0, policy_version 41150 (0.0006) [2023-03-07 10:39:08,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12765.8, 300 sec: 12732.3). Total num frames: 42137600. Throughput: 0: 12771.3. Samples: 42136959. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:39:08,322][175405] Avg episode reward: [(0, '40.293')] [2023-03-07 10:39:09,096][175731] Updated weights for policy 0, policy_version 41160 (0.0007) [2023-03-07 10:39:09,901][175731] Updated weights for policy 0, policy_version 41170 (0.0006) [2023-03-07 10:39:10,677][175731] Updated weights for policy 0, policy_version 41180 (0.0007) [2023-03-07 10:39:11,463][175731] Updated weights for policy 0, policy_version 41190 (0.0006) [2023-03-07 10:39:12,259][175731] Updated weights for policy 0, policy_version 41200 (0.0007) [2023-03-07 10:39:13,069][175731] Updated weights for policy 0, policy_version 41210 (0.0007) [2023-03-07 10:39:13,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12782.9, 300 sec: 12732.3). Total num frames: 42202112. Throughput: 0: 12778.2. Samples: 42175546. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:39:13,322][175405] Avg episode reward: [(0, '38.485')] [2023-03-07 10:39:13,856][175731] Updated weights for policy 0, policy_version 41220 (0.0007) [2023-03-07 10:39:14,658][175731] Updated weights for policy 0, policy_version 41230 (0.0006) [2023-03-07 10:39:15,451][175731] Updated weights for policy 0, policy_version 41240 (0.0006) [2023-03-07 10:39:16,242][175731] Updated weights for policy 0, policy_version 41250 (0.0006) [2023-03-07 10:39:17,064][175731] Updated weights for policy 0, policy_version 41260 (0.0007) [2023-03-07 10:39:17,884][175731] Updated weights for policy 0, policy_version 41270 (0.0007) [2023-03-07 10:39:18,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12782.9, 300 sec: 12735.8). Total num frames: 42265600. Throughput: 0: 12787.7. Samples: 42252376. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:39:18,322][175405] Avg episode reward: [(0, '34.496')] [2023-03-07 10:39:18,658][175731] Updated weights for policy 0, policy_version 41280 (0.0006) [2023-03-07 10:39:19,478][175731] Updated weights for policy 0, policy_version 41290 (0.0007) [2023-03-07 10:39:20,275][175731] Updated weights for policy 0, policy_version 41300 (0.0006) [2023-03-07 10:39:21,071][175731] Updated weights for policy 0, policy_version 41310 (0.0006) [2023-03-07 10:39:21,869][175731] Updated weights for policy 0, policy_version 41320 (0.0007) [2023-03-07 10:39:22,686][175731] Updated weights for policy 0, policy_version 41330 (0.0007) [2023-03-07 10:39:23,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12800.0, 300 sec: 12735.8). Total num frames: 42330112. Throughput: 0: 12799.2. Samples: 42329202. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:39:23,322][175405] Avg episode reward: [(0, '45.565')] [2023-03-07 10:39:23,481][175731] Updated weights for policy 0, policy_version 41340 (0.0006) [2023-03-07 10:39:24,273][175731] Updated weights for policy 0, policy_version 41350 (0.0006) [2023-03-07 10:39:25,068][175731] Updated weights for policy 0, policy_version 41360 (0.0007) [2023-03-07 10:39:25,878][175731] Updated weights for policy 0, policy_version 41370 (0.0007) [2023-03-07 10:39:26,698][175731] Updated weights for policy 0, policy_version 41380 (0.0007) [2023-03-07 10:39:27,475][175731] Updated weights for policy 0, policy_version 41390 (0.0007) [2023-03-07 10:39:28,276][175731] Updated weights for policy 0, policy_version 41400 (0.0006) [2023-03-07 10:39:28,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12782.9, 300 sec: 12735.8). Total num frames: 42393600. Throughput: 0: 12794.3. Samples: 42367590. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:39:28,322][175405] Avg episode reward: [(0, '31.570')] [2023-03-07 10:39:29,091][175731] Updated weights for policy 0, policy_version 41410 (0.0006) [2023-03-07 10:39:29,877][175731] Updated weights for policy 0, policy_version 41420 (0.0007) [2023-03-07 10:39:30,693][175731] Updated weights for policy 0, policy_version 41430 (0.0006) [2023-03-07 10:39:31,506][175731] Updated weights for policy 0, policy_version 41440 (0.0006) [2023-03-07 10:39:32,296][175731] Updated weights for policy 0, policy_version 41450 (0.0006) [2023-03-07 10:39:33,078][175731] Updated weights for policy 0, policy_version 41460 (0.0006) [2023-03-07 10:39:33,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12742.7). Total num frames: 42458112. Throughput: 0: 12797.6. Samples: 42444316. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:39:33,322][175405] Avg episode reward: [(0, '31.851')] [2023-03-07 10:39:33,877][175731] Updated weights for policy 0, policy_version 41470 (0.0006) [2023-03-07 10:39:34,689][175731] Updated weights for policy 0, policy_version 41480 (0.0007) [2023-03-07 10:39:35,495][175731] Updated weights for policy 0, policy_version 41490 (0.0006) [2023-03-07 10:39:36,293][175731] Updated weights for policy 0, policy_version 41500 (0.0007) [2023-03-07 10:39:36,521][175680] KL-divergence is very high: 207.4741 [2023-03-07 10:39:37,092][175731] Updated weights for policy 0, policy_version 41510 (0.0007) [2023-03-07 10:39:37,884][175731] Updated weights for policy 0, policy_version 41520 (0.0007) [2023-03-07 10:39:38,321][175405] Fps is (10 sec: 12800.2, 60 sec: 12800.0, 300 sec: 12739.3). Total num frames: 42521600. Throughput: 0: 12801.8. Samples: 42521032. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:39:38,322][175405] Avg episode reward: [(0, '32.541')] [2023-03-07 10:39:38,684][175731] Updated weights for policy 0, policy_version 41530 (0.0006) [2023-03-07 10:39:39,496][175731] Updated weights for policy 0, policy_version 41540 (0.0006) [2023-03-07 10:39:40,288][175731] Updated weights for policy 0, policy_version 41550 (0.0007) [2023-03-07 10:39:41,089][175731] Updated weights for policy 0, policy_version 41560 (0.0006) [2023-03-07 10:39:41,887][175731] Updated weights for policy 0, policy_version 41570 (0.0007) [2023-03-07 10:39:42,680][175731] Updated weights for policy 0, policy_version 41580 (0.0007) [2023-03-07 10:39:43,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12782.9, 300 sec: 12739.3). Total num frames: 42585088. Throughput: 0: 12803.1. Samples: 42559399. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:39:43,322][175405] Avg episode reward: [(0, '25.210')] [2023-03-07 10:39:43,483][175731] Updated weights for policy 0, policy_version 41590 (0.0006) [2023-03-07 10:39:44,289][175731] Updated weights for policy 0, policy_version 41600 (0.0008) [2023-03-07 10:39:45,085][175731] Updated weights for policy 0, policy_version 41610 (0.0006) [2023-03-07 10:39:45,888][175731] Updated weights for policy 0, policy_version 41620 (0.0006) [2023-03-07 10:39:46,687][175731] Updated weights for policy 0, policy_version 41630 (0.0006) [2023-03-07 10:39:47,485][175731] Updated weights for policy 0, policy_version 41640 (0.0007) [2023-03-07 10:39:48,283][175731] Updated weights for policy 0, policy_version 41650 (0.0007) [2023-03-07 10:39:48,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12800.0, 300 sec: 12742.7). Total num frames: 42649600. Throughput: 0: 12802.1. Samples: 42636170. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:39:48,332][175405] Avg episode reward: [(0, '26.625')] [2023-03-07 10:39:48,336][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000041650_42649600.pth... [2023-03-07 10:39:48,366][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000038663_39590912.pth [2023-03-07 10:39:49,091][175731] Updated weights for policy 0, policy_version 41660 (0.0006) [2023-03-07 10:39:49,897][175731] Updated weights for policy 0, policy_version 41670 (0.0006) [2023-03-07 10:39:50,704][175731] Updated weights for policy 0, policy_version 41680 (0.0007) [2023-03-07 10:39:51,500][175731] Updated weights for policy 0, policy_version 41690 (0.0007) [2023-03-07 10:39:52,279][175731] Updated weights for policy 0, policy_version 41700 (0.0006) [2023-03-07 10:39:53,072][175731] Updated weights for policy 0, policy_version 41710 (0.0005) [2023-03-07 10:39:53,321][175405] Fps is (10 sec: 12902.6, 60 sec: 12817.1, 300 sec: 12742.7). Total num frames: 42714112. Throughput: 0: 12805.0. Samples: 42713183. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:39:53,332][175405] Avg episode reward: [(0, '31.981')] [2023-03-07 10:39:53,870][175731] Updated weights for policy 0, policy_version 41720 (0.0007) [2023-03-07 10:39:54,674][175731] Updated weights for policy 0, policy_version 41730 (0.0006) [2023-03-07 10:39:55,479][175731] Updated weights for policy 0, policy_version 41740 (0.0006) [2023-03-07 10:39:56,256][175731] Updated weights for policy 0, policy_version 41750 (0.0006) [2023-03-07 10:39:57,052][175731] Updated weights for policy 0, policy_version 41760 (0.0006) [2023-03-07 10:39:57,857][175731] Updated weights for policy 0, policy_version 41770 (0.0007) [2023-03-07 10:39:58,321][175405] Fps is (10 sec: 12902.6, 60 sec: 12834.2, 300 sec: 12749.7). Total num frames: 42778624. Throughput: 0: 12804.8. 
Samples: 42751762. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:39:58,332][175405] Avg episode reward: [(0, '32.852')] [2023-03-07 10:39:58,635][175731] Updated weights for policy 0, policy_version 41780 (0.0006) [2023-03-07 10:39:59,423][175731] Updated weights for policy 0, policy_version 41790 (0.0006) [2023-03-07 10:40:00,228][175731] Updated weights for policy 0, policy_version 41800 (0.0006) [2023-03-07 10:40:01,030][175731] Updated weights for policy 0, policy_version 41810 (0.0007) [2023-03-07 10:40:01,827][175731] Updated weights for policy 0, policy_version 41820 (0.0007) [2023-03-07 10:40:02,620][175731] Updated weights for policy 0, policy_version 41830 (0.0006) [2023-03-07 10:40:03,321][175405] Fps is (10 sec: 12799.7, 60 sec: 12817.1, 300 sec: 12749.7). Total num frames: 42842112. Throughput: 0: 12813.6. Samples: 42828991. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:40:03,332][175405] Avg episode reward: [(0, '29.270')] [2023-03-07 10:40:03,412][175731] Updated weights for policy 0, policy_version 41840 (0.0006) [2023-03-07 10:40:04,204][175731] Updated weights for policy 0, policy_version 41850 (0.0006) [2023-03-07 10:40:04,996][175731] Updated weights for policy 0, policy_version 41860 (0.0006) [2023-03-07 10:40:05,807][175731] Updated weights for policy 0, policy_version 41870 (0.0006) [2023-03-07 10:40:06,614][175731] Updated weights for policy 0, policy_version 41880 (0.0007) [2023-03-07 10:40:07,389][175731] Updated weights for policy 0, policy_version 41890 (0.0007) [2023-03-07 10:40:08,197][175731] Updated weights for policy 0, policy_version 41900 (0.0008) [2023-03-07 10:40:08,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12817.1, 300 sec: 12753.1). Total num frames: 42906624. Throughput: 0: 12824.6. Samples: 42906308. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:40:08,332][175405] Avg episode reward: [(0, '33.853')] [2023-03-07 10:40:08,994][175731] Updated weights for policy 0, policy_version 41910 (0.0006) [2023-03-07 10:40:09,794][175731] Updated weights for policy 0, policy_version 41920 (0.0007) [2023-03-07 10:40:10,595][175731] Updated weights for policy 0, policy_version 41930 (0.0006) [2023-03-07 10:40:11,373][175731] Updated weights for policy 0, policy_version 41940 (0.0007) [2023-03-07 10:40:12,169][175731] Updated weights for policy 0, policy_version 41950 (0.0006) [2023-03-07 10:40:12,967][175731] Updated weights for policy 0, policy_version 41960 (0.0007) [2023-03-07 10:40:13,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12817.1, 300 sec: 12756.6). Total num frames: 42971136. Throughput: 0: 12827.8. Samples: 42944838. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:40:13,332][175405] Avg episode reward: [(0, '30.274')] [2023-03-07 10:40:13,743][175731] Updated weights for policy 0, policy_version 41970 (0.0006) [2023-03-07 10:40:14,538][175731] Updated weights for policy 0, policy_version 41980 (0.0007) [2023-03-07 10:40:15,334][175731] Updated weights for policy 0, policy_version 41990 (0.0007) [2023-03-07 10:40:16,140][175731] Updated weights for policy 0, policy_version 42000 (0.0007) [2023-03-07 10:40:16,952][175731] Updated weights for policy 0, policy_version 42010 (0.0006) [2023-03-07 10:40:17,737][175731] Updated weights for policy 0, policy_version 42020 (0.0008) [2023-03-07 10:40:18,321][175405] Fps is (10 sec: 12902.2, 60 sec: 12834.1, 300 sec: 12756.6). Total num frames: 43035648. Throughput: 0: 12844.1. Samples: 43022305. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 10:40:18,332][175405] Avg episode reward: [(0, '26.024')] [2023-03-07 10:40:18,531][175731] Updated weights for policy 0, policy_version 42030 (0.0006) [2023-03-07 10:40:19,325][175731] Updated weights for policy 0, policy_version 42040 (0.0006) [2023-03-07 10:40:20,126][175731] Updated weights for policy 0, policy_version 42050 (0.0007) [2023-03-07 10:40:20,917][175731] Updated weights for policy 0, policy_version 42060 (0.0006) [2023-03-07 10:40:21,726][175731] Updated weights for policy 0, policy_version 42070 (0.0007) [2023-03-07 10:40:22,217][175680] KL-divergence is very high: 69220.9766 [2023-03-07 10:40:22,538][175731] Updated weights for policy 0, policy_version 42080 (0.0007) [2023-03-07 10:40:23,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12756.6). Total num frames: 43099136. Throughput: 0: 12845.3. Samples: 43099073. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 10:40:23,324][175731] Updated weights for policy 0, policy_version 42090 (0.0007) [2023-03-07 10:40:23,332][175405] Avg episode reward: [(0, '26.309')] [2023-03-07 10:40:24,165][175731] Updated weights for policy 0, policy_version 42100 (0.0007) [2023-03-07 10:40:24,939][175731] Updated weights for policy 0, policy_version 42110 (0.0006) [2023-03-07 10:40:25,750][175731] Updated weights for policy 0, policy_version 42120 (0.0007) [2023-03-07 10:40:26,541][175731] Updated weights for policy 0, policy_version 42130 (0.0007) [2023-03-07 10:40:27,340][175731] Updated weights for policy 0, policy_version 42140 (0.0006) [2023-03-07 10:40:28,136][175731] Updated weights for policy 0, policy_version 42150 (0.0006) [2023-03-07 10:40:28,321][175405] Fps is (10 sec: 12800.2, 60 sec: 12834.1, 300 sec: 12760.1). Total num frames: 43163648. Throughput: 0: 12843.4. Samples: 43137353. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 10:40:28,322][175405] Avg episode reward: [(0, '24.711')] [2023-03-07 10:40:28,921][175731] Updated weights for policy 0, policy_version 42160 (0.0006) [2023-03-07 10:40:29,702][175731] Updated weights for policy 0, policy_version 42170 (0.0008) [2023-03-07 10:40:30,497][175731] Updated weights for policy 0, policy_version 42180 (0.0007) [2023-03-07 10:40:31,302][175731] Updated weights for policy 0, policy_version 42190 (0.0007) [2023-03-07 10:40:32,085][175731] Updated weights for policy 0, policy_version 42200 (0.0006) [2023-03-07 10:40:32,891][175731] Updated weights for policy 0, policy_version 42210 (0.0006) [2023-03-07 10:40:33,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12834.1, 300 sec: 12760.1). Total num frames: 43228160. Throughput: 0: 12859.1. Samples: 43214827. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 10:40:33,322][175405] Avg episode reward: [(0, '27.105')] [2023-03-07 10:40:33,688][175731] Updated weights for policy 0, policy_version 42220 (0.0007) [2023-03-07 10:40:34,490][175731] Updated weights for policy 0, policy_version 42230 (0.0006) [2023-03-07 10:40:35,305][175731] Updated weights for policy 0, policy_version 42240 (0.0007) [2023-03-07 10:40:36,098][175731] Updated weights for policy 0, policy_version 42250 (0.0006) [2023-03-07 10:40:36,916][175731] Updated weights for policy 0, policy_version 42260 (0.0005) [2023-03-07 10:40:37,690][175731] Updated weights for policy 0, policy_version 42270 (0.0007) [2023-03-07 10:40:38,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12834.1, 300 sec: 12763.6). Total num frames: 43291648. Throughput: 0: 12850.9. Samples: 43291473. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 10:40:38,321][175405] Avg episode reward: [(0, '34.397')] [2023-03-07 10:40:38,490][175731] Updated weights for policy 0, policy_version 42280 (0.0007) [2023-03-07 10:40:39,300][175731] Updated weights for policy 0, policy_version 42290 (0.0007) [2023-03-07 10:40:40,093][175731] Updated weights for policy 0, policy_version 42300 (0.0006) [2023-03-07 10:40:40,893][175731] Updated weights for policy 0, policy_version 42310 (0.0006) [2023-03-07 10:40:41,706][175731] Updated weights for policy 0, policy_version 42320 (0.0007) [2023-03-07 10:40:42,484][175731] Updated weights for policy 0, policy_version 42330 (0.0007) [2023-03-07 10:40:43,294][175731] Updated weights for policy 0, policy_version 42340 (0.0007) [2023-03-07 10:40:43,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12851.2, 300 sec: 12763.6). Total num frames: 43356160. Throughput: 0: 12847.6. Samples: 43329903. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 10:40:43,321][175405] Avg episode reward: [(0, '35.514')] [2023-03-07 10:40:44,125][175731] Updated weights for policy 0, policy_version 42350 (0.0006) [2023-03-07 10:40:44,903][175731] Updated weights for policy 0, policy_version 42360 (0.0007) [2023-03-07 10:40:45,692][175731] Updated weights for policy 0, policy_version 42370 (0.0006) [2023-03-07 10:40:46,516][175731] Updated weights for policy 0, policy_version 42380 (0.0007) [2023-03-07 10:40:47,315][175731] Updated weights for policy 0, policy_version 42390 (0.0006) [2023-03-07 10:40:48,118][175731] Updated weights for policy 0, policy_version 42400 (0.0007) [2023-03-07 10:40:48,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.2, 300 sec: 12763.6). Total num frames: 43419648. Throughput: 0: 12833.9. Samples: 43406514. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:40:48,321][175405] Avg episode reward: [(0, '37.216')] [2023-03-07 10:40:48,910][175731] Updated weights for policy 0, policy_version 42410 (0.0006) [2023-03-07 10:40:49,716][175731] Updated weights for policy 0, policy_version 42420 (0.0007) [2023-03-07 10:40:50,513][175731] Updated weights for policy 0, policy_version 42430 (0.0006) [2023-03-07 10:40:51,319][175731] Updated weights for policy 0, policy_version 42440 (0.0006) [2023-03-07 10:40:52,117][175731] Updated weights for policy 0, policy_version 42450 (0.0006) [2023-03-07 10:40:52,902][175731] Updated weights for policy 0, policy_version 42460 (0.0006) [2023-03-07 10:40:53,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12834.1, 300 sec: 12767.0). Total num frames: 43484160. Throughput: 0: 12825.6. Samples: 43483459. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:40:53,322][175405] Avg episode reward: [(0, '32.581')] [2023-03-07 10:40:53,703][175731] Updated weights for policy 0, policy_version 42470 (0.0007) [2023-03-07 10:40:54,489][175731] Updated weights for policy 0, policy_version 42480 (0.0006) [2023-03-07 10:40:55,290][175731] Updated weights for policy 0, policy_version 42490 (0.0007) [2023-03-07 10:40:56,085][175731] Updated weights for policy 0, policy_version 42500 (0.0006) [2023-03-07 10:40:56,883][175731] Updated weights for policy 0, policy_version 42510 (0.0007) [2023-03-07 10:40:57,701][175731] Updated weights for policy 0, policy_version 42520 (0.0007) [2023-03-07 10:40:58,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12763.6). Total num frames: 43547648. Throughput: 0: 12824.4. Samples: 43521935. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:40:58,326][175405] Avg episode reward: [(0, '31.378')] [2023-03-07 10:40:58,485][175731] Updated weights for policy 0, policy_version 42530 (0.0006) [2023-03-07 10:40:59,288][175731] Updated weights for policy 0, policy_version 42540 (0.0006) [2023-03-07 10:41:00,073][175731] Updated weights for policy 0, policy_version 42550 (0.0007) [2023-03-07 10:41:00,857][175731] Updated weights for policy 0, policy_version 42560 (0.0006) [2023-03-07 10:41:01,648][175731] Updated weights for policy 0, policy_version 42570 (0.0007) [2023-03-07 10:41:02,446][175731] Updated weights for policy 0, policy_version 42580 (0.0007) [2023-03-07 10:41:03,252][175731] Updated weights for policy 0, policy_version 42590 (0.0006) [2023-03-07 10:41:03,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12834.2, 300 sec: 12767.0). Total num frames: 43612160. Throughput: 0: 12819.2. Samples: 43599167. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:41:03,332][175405] Avg episode reward: [(0, '28.137')] [2023-03-07 10:41:04,065][175731] Updated weights for policy 0, policy_version 42600 (0.0007) [2023-03-07 10:41:04,869][175731] Updated weights for policy 0, policy_version 42610 (0.0006) [2023-03-07 10:41:05,663][175731] Updated weights for policy 0, policy_version 42620 (0.0006) [2023-03-07 10:41:06,477][175731] Updated weights for policy 0, policy_version 42630 (0.0007) [2023-03-07 10:41:07,254][175731] Updated weights for policy 0, policy_version 42640 (0.0006) [2023-03-07 10:41:08,048][175731] Updated weights for policy 0, policy_version 42650 (0.0007) [2023-03-07 10:41:08,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12834.1, 300 sec: 12770.5). Total num frames: 43676672. Throughput: 0: 12821.2. Samples: 43676025. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:41:08,332][175405] Avg episode reward: [(0, '23.329')] [2023-03-07 10:41:08,844][175731] Updated weights for policy 0, policy_version 42660 (0.0005) [2023-03-07 10:41:09,637][175731] Updated weights for policy 0, policy_version 42670 (0.0006) [2023-03-07 10:41:10,429][175731] Updated weights for policy 0, policy_version 42680 (0.0007) [2023-03-07 10:41:11,234][175731] Updated weights for policy 0, policy_version 42690 (0.0006) [2023-03-07 10:41:12,034][175731] Updated weights for policy 0, policy_version 42700 (0.0006) [2023-03-07 10:41:12,825][175731] Updated weights for policy 0, policy_version 42710 (0.0006) [2023-03-07 10:41:13,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12834.1, 300 sec: 12774.0). Total num frames: 43741184. Throughput: 0: 12831.9. Samples: 43714789. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:41:13,332][175405] Avg episode reward: [(0, '22.658')] [2023-03-07 10:41:13,626][175731] Updated weights for policy 0, policy_version 42720 (0.0006) [2023-03-07 10:41:14,427][175731] Updated weights for policy 0, policy_version 42730 (0.0006) [2023-03-07 10:41:15,223][175731] Updated weights for policy 0, policy_version 42740 (0.0007) [2023-03-07 10:41:16,003][175731] Updated weights for policy 0, policy_version 42750 (0.0007) [2023-03-07 10:41:16,801][175731] Updated weights for policy 0, policy_version 42760 (0.0006) [2023-03-07 10:41:17,607][175731] Updated weights for policy 0, policy_version 42770 (0.0007) [2023-03-07 10:41:18,321][175405] Fps is (10 sec: 12902.2, 60 sec: 12834.1, 300 sec: 12777.4). Total num frames: 43805696. Throughput: 0: 12826.6. Samples: 43792027. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:41:18,332][175405] Avg episode reward: [(0, '25.738')] [2023-03-07 10:41:18,397][175731] Updated weights for policy 0, policy_version 42780 (0.0006) [2023-03-07 10:41:19,190][175731] Updated weights for policy 0, policy_version 42790 (0.0006) [2023-03-07 10:41:19,990][175731] Updated weights for policy 0, policy_version 42800 (0.0006) [2023-03-07 10:41:20,783][175731] Updated weights for policy 0, policy_version 42810 (0.0006) [2023-03-07 10:41:21,560][175731] Updated weights for policy 0, policy_version 42820 (0.0008) [2023-03-07 10:41:22,370][175731] Updated weights for policy 0, policy_version 42830 (0.0006) [2023-03-07 10:41:23,165][175731] Updated weights for policy 0, policy_version 42840 (0.0007) [2023-03-07 10:41:23,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12851.2, 300 sec: 12780.9). Total num frames: 43870208. Throughput: 0: 12837.9. Samples: 43869179. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:41:23,332][175405] Avg episode reward: [(0, '23.422')] [2023-03-07 10:41:23,971][175731] Updated weights for policy 0, policy_version 42850 (0.0007) [2023-03-07 10:41:24,739][175731] Updated weights for policy 0, policy_version 42860 (0.0006) [2023-03-07 10:41:25,532][175731] Updated weights for policy 0, policy_version 42870 (0.0006) [2023-03-07 10:41:26,337][175731] Updated weights for policy 0, policy_version 42880 (0.0007) [2023-03-07 10:41:27,127][175731] Updated weights for policy 0, policy_version 42890 (0.0006) [2023-03-07 10:41:27,929][175731] Updated weights for policy 0, policy_version 42900 (0.0006) [2023-03-07 10:41:28,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12834.1, 300 sec: 12780.9). Total num frames: 43933696. Throughput: 0: 12843.4. Samples: 43907855. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:41:28,332][175405] Avg episode reward: [(0, '24.590')] [2023-03-07 10:41:28,719][175731] Updated weights for policy 0, policy_version 42910 (0.0006) [2023-03-07 10:41:29,514][175731] Updated weights for policy 0, policy_version 42920 (0.0006) [2023-03-07 10:41:30,325][175731] Updated weights for policy 0, policy_version 42930 (0.0006) [2023-03-07 10:41:31,108][175731] Updated weights for policy 0, policy_version 42940 (0.0007) [2023-03-07 10:41:31,914][175731] Updated weights for policy 0, policy_version 42950 (0.0006) [2023-03-07 10:41:32,709][175731] Updated weights for policy 0, policy_version 42960 (0.0006) [2023-03-07 10:41:33,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12834.2, 300 sec: 12784.4). Total num frames: 43998208. Throughput: 0: 12856.1. Samples: 43985037. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:41:33,332][175405] Avg episode reward: [(0, '22.605')] [2023-03-07 10:41:33,518][175731] Updated weights for policy 0, policy_version 42970 (0.0006) [2023-03-07 10:41:34,314][175731] Updated weights for policy 0, policy_version 42980 (0.0006) [2023-03-07 10:41:35,098][175731] Updated weights for policy 0, policy_version 42990 (0.0007) [2023-03-07 10:41:35,912][175731] Updated weights for policy 0, policy_version 43000 (0.0007) [2023-03-07 10:41:36,726][175731] Updated weights for policy 0, policy_version 43010 (0.0007) [2023-03-07 10:41:37,518][175731] Updated weights for policy 0, policy_version 43020 (0.0007) [2023-03-07 10:41:38,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12784.4). Total num frames: 44061696. Throughput: 0: 12852.7. Samples: 44061830. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:41:38,325][175731] Updated weights for policy 0, policy_version 43030 (0.0006) [2023-03-07 10:41:38,332][175405] Avg episode reward: [(0, '22.787')] [2023-03-07 10:41:39,121][175731] Updated weights for policy 0, policy_version 43040 (0.0006) [2023-03-07 10:41:39,937][175731] Updated weights for policy 0, policy_version 43050 (0.0006) [2023-03-07 10:41:40,729][175731] Updated weights for policy 0, policy_version 43060 (0.0006) [2023-03-07 10:41:41,514][175731] Updated weights for policy 0, policy_version 43070 (0.0007) [2023-03-07 10:41:42,306][175731] Updated weights for policy 0, policy_version 43080 (0.0006) [2023-03-07 10:41:43,099][175731] Updated weights for policy 0, policy_version 43090 (0.0007) [2023-03-07 10:41:43,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12834.1, 300 sec: 12787.8). Total num frames: 44126208. Throughput: 0: 12848.8. Samples: 44100133. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:41:43,332][175405] Avg episode reward: [(0, '25.093')] [2023-03-07 10:41:43,909][175731] Updated weights for policy 0, policy_version 43100 (0.0006) [2023-03-07 10:41:44,700][175731] Updated weights for policy 0, policy_version 43110 (0.0007) [2023-03-07 10:41:45,489][175731] Updated weights for policy 0, policy_version 43120 (0.0008) [2023-03-07 10:41:46,291][175731] Updated weights for policy 0, policy_version 43130 (0.0006) [2023-03-07 10:41:47,063][175731] Updated weights for policy 0, policy_version 43140 (0.0007) [2023-03-07 10:41:47,887][175731] Updated weights for policy 0, policy_version 43150 (0.0006) [2023-03-07 10:41:48,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12851.2, 300 sec: 12787.8). Total num frames: 44190720. Throughput: 0: 12850.9. Samples: 44177458. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:41:48,332][175405] Avg episode reward: [(0, '23.484')] [2023-03-07 10:41:48,337][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000043155_44190720.pth... [2023-03-07 10:41:48,368][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000040154_41117696.pth [2023-03-07 10:41:48,678][175731] Updated weights for policy 0, policy_version 43160 (0.0006) [2023-03-07 10:41:49,457][175731] Updated weights for policy 0, policy_version 43170 (0.0007) [2023-03-07 10:41:50,263][175731] Updated weights for policy 0, policy_version 43180 (0.0006) [2023-03-07 10:41:51,054][175731] Updated weights for policy 0, policy_version 43190 (0.0006) [2023-03-07 10:41:51,849][175731] Updated weights for policy 0, policy_version 43200 (0.0007) [2023-03-07 10:41:52,646][175731] Updated weights for policy 0, policy_version 43210 (0.0006) [2023-03-07 10:41:53,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12851.2, 300 sec: 12794.8). Total num frames: 44255232. Throughput: 0: 12857.3. Samples: 44254604. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:41:53,332][175405] Avg episode reward: [(0, '23.746')] [2023-03-07 10:41:53,446][175731] Updated weights for policy 0, policy_version 43220 (0.0006) [2023-03-07 10:41:54,242][175731] Updated weights for policy 0, policy_version 43230 (0.0007) [2023-03-07 10:41:55,043][175731] Updated weights for policy 0, policy_version 43240 (0.0008) [2023-03-07 10:41:55,837][175731] Updated weights for policy 0, policy_version 43250 (0.0006) [2023-03-07 10:41:56,642][175731] Updated weights for policy 0, policy_version 43260 (0.0006) [2023-03-07 10:41:57,428][175731] Updated weights for policy 0, policy_version 43270 (0.0007) [2023-03-07 10:41:58,238][175731] Updated weights for policy 0, policy_version 43280 (0.0007) [2023-03-07 10:41:58,321][175405] Fps is (10 sec: 12902.6, 60 sec: 12868.3, 300 sec: 12794.8). Total num frames: 44319744. Throughput: 0: 12853.1. Samples: 44293177. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:41:58,332][175405] Avg episode reward: [(0, '23.357')] [2023-03-07 10:41:59,051][175731] Updated weights for policy 0, policy_version 43290 (0.0006) [2023-03-07 10:41:59,834][175731] Updated weights for policy 0, policy_version 43300 (0.0006) [2023-03-07 10:42:00,634][175731] Updated weights for policy 0, policy_version 43310 (0.0006) [2023-03-07 10:42:01,455][175731] Updated weights for policy 0, policy_version 43320 (0.0007) [2023-03-07 10:42:02,234][175731] Updated weights for policy 0, policy_version 43330 (0.0006) [2023-03-07 10:42:03,024][175731] Updated weights for policy 0, policy_version 43340 (0.0006) [2023-03-07 10:42:03,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12851.2, 300 sec: 12794.8). Total num frames: 44383232. Throughput: 0: 12843.1. Samples: 44369966. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:42:03,332][175405] Avg episode reward: [(0, '21.462')] [2023-03-07 10:42:03,833][175731] Updated weights for policy 0, policy_version 43350 (0.0008) [2023-03-07 10:42:04,618][175731] Updated weights for policy 0, policy_version 43360 (0.0006) [2023-03-07 10:42:05,407][175731] Updated weights for policy 0, policy_version 43370 (0.0006) [2023-03-07 10:42:06,206][175731] Updated weights for policy 0, policy_version 43380 (0.0006) [2023-03-07 10:42:07,018][175731] Updated weights for policy 0, policy_version 43390 (0.0007) [2023-03-07 10:42:07,808][175731] Updated weights for policy 0, policy_version 43400 (0.0006) [2023-03-07 10:42:08,321][175405] Fps is (10 sec: 12799.7, 60 sec: 12851.2, 300 sec: 12801.7). Total num frames: 44447744. Throughput: 0: 12842.7. Samples: 44447103. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:42:08,332][175405] Avg episode reward: [(0, '23.691')] [2023-03-07 10:42:08,608][175731] Updated weights for policy 0, policy_version 43410 (0.0006) [2023-03-07 10:42:09,405][175731] Updated weights for policy 0, policy_version 43420 (0.0006) [2023-03-07 10:42:10,199][175731] Updated weights for policy 0, policy_version 43430 (0.0007) [2023-03-07 10:42:10,999][175731] Updated weights for policy 0, policy_version 43440 (0.0006) [2023-03-07 10:42:11,804][175731] Updated weights for policy 0, policy_version 43450 (0.0006) [2023-03-07 10:42:12,581][175731] Updated weights for policy 0, policy_version 43460 (0.0006) [2023-03-07 10:42:13,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12805.2). Total num frames: 44512256. Throughput: 0: 12843.7. Samples: 44485821. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:42:13,321][175405] Avg episode reward: [(0, '22.638')] [2023-03-07 10:42:13,374][175731] Updated weights for policy 0, policy_version 43470 (0.0007) [2023-03-07 10:42:14,180][175731] Updated weights for policy 0, policy_version 43480 (0.0006) [2023-03-07 10:42:14,976][175731] Updated weights for policy 0, policy_version 43490 (0.0007) [2023-03-07 10:42:15,777][175731] Updated weights for policy 0, policy_version 43500 (0.0008) [2023-03-07 10:42:16,573][175731] Updated weights for policy 0, policy_version 43510 (0.0006) [2023-03-07 10:42:17,380][175731] Updated weights for policy 0, policy_version 43520 (0.0006) [2023-03-07 10:42:18,158][175731] Updated weights for policy 0, policy_version 43530 (0.0006) [2023-03-07 10:42:18,321][175405] Fps is (10 sec: 12902.6, 60 sec: 12851.2, 300 sec: 12808.7). Total num frames: 44576768. Throughput: 0: 12845.7. Samples: 44563094. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:42:18,321][175405] Avg episode reward: [(0, '23.236')] [2023-03-07 10:42:18,952][175731] Updated weights for policy 0, policy_version 43540 (0.0007) [2023-03-07 10:42:19,733][175731] Updated weights for policy 0, policy_version 43550 (0.0006) [2023-03-07 10:42:20,515][175731] Updated weights for policy 0, policy_version 43560 (0.0007) [2023-03-07 10:42:21,321][175731] Updated weights for policy 0, policy_version 43570 (0.0007) [2023-03-07 10:42:22,117][175731] Updated weights for policy 0, policy_version 43580 (0.0006) [2023-03-07 10:42:22,917][175731] Updated weights for policy 0, policy_version 43590 (0.0006) [2023-03-07 10:42:23,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12851.2, 300 sec: 12808.7). Total num frames: 44641280. Throughput: 0: 12858.9. Samples: 44640480. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:42:23,322][175405] Avg episode reward: [(0, '22.471')] [2023-03-07 10:42:23,703][175731] Updated weights for policy 0, policy_version 43600 (0.0006) [2023-03-07 10:42:24,495][175731] Updated weights for policy 0, policy_version 43610 (0.0007) [2023-03-07 10:42:25,301][175731] Updated weights for policy 0, policy_version 43620 (0.0006) [2023-03-07 10:42:26,102][175731] Updated weights for policy 0, policy_version 43630 (0.0006) [2023-03-07 10:42:26,898][175731] Updated weights for policy 0, policy_version 43640 (0.0006) [2023-03-07 10:42:27,710][175731] Updated weights for policy 0, policy_version 43650 (0.0006) [2023-03-07 10:42:28,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12851.2, 300 sec: 12808.7). Total num frames: 44704768. Throughput: 0: 12860.3. Samples: 44678848. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:42:28,322][175405] Avg episode reward: [(0, '23.377')] [2023-03-07 10:42:28,521][175731] Updated weights for policy 0, policy_version 43660 (0.0007) [2023-03-07 10:42:29,299][175731] Updated weights for policy 0, policy_version 43670 (0.0006) [2023-03-07 10:42:30,106][175731] Updated weights for policy 0, policy_version 43680 (0.0007) [2023-03-07 10:42:30,920][175731] Updated weights for policy 0, policy_version 43690 (0.0006) [2023-03-07 10:42:31,723][175731] Updated weights for policy 0, policy_version 43700 (0.0006) [2023-03-07 10:42:32,509][175731] Updated weights for policy 0, policy_version 43710 (0.0007) [2023-03-07 10:42:33,306][175731] Updated weights for policy 0, policy_version 43720 (0.0006) [2023-03-07 10:42:33,321][175405] Fps is (10 sec: 12800.2, 60 sec: 12851.2, 300 sec: 12812.2). Total num frames: 44769280. Throughput: 0: 12849.3. Samples: 44755677. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:42:33,321][175405] Avg episode reward: [(0, '21.746')] [2023-03-07 10:42:34,125][175731] Updated weights for policy 0, policy_version 43730 (0.0006) [2023-03-07 10:42:34,908][175731] Updated weights for policy 0, policy_version 43740 (0.0006) [2023-03-07 10:42:35,716][175731] Updated weights for policy 0, policy_version 43750 (0.0007) [2023-03-07 10:42:36,510][175731] Updated weights for policy 0, policy_version 43760 (0.0006) [2023-03-07 10:42:37,293][175731] Updated weights for policy 0, policy_version 43770 (0.0007) [2023-03-07 10:42:38,090][175731] Updated weights for policy 0, policy_version 43780 (0.0006) [2023-03-07 10:42:38,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12868.2, 300 sec: 12815.6). Total num frames: 44833792. Throughput: 0: 12849.6. Samples: 44832839. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:42:38,322][175405] Avg episode reward: [(0, '24.238')] [2023-03-07 10:42:38,874][175731] Updated weights for policy 0, policy_version 43790 (0.0007) [2023-03-07 10:42:39,663][175731] Updated weights for policy 0, policy_version 43800 (0.0006) [2023-03-07 10:42:40,462][175731] Updated weights for policy 0, policy_version 43810 (0.0006) [2023-03-07 10:42:41,272][175731] Updated weights for policy 0, policy_version 43820 (0.0006) [2023-03-07 10:42:42,064][175731] Updated weights for policy 0, policy_version 43830 (0.0006) [2023-03-07 10:42:42,869][175731] Updated weights for policy 0, policy_version 43840 (0.0007) [2023-03-07 10:42:43,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12815.6). Total num frames: 44897280. Throughput: 0: 12845.6. Samples: 44871229. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:42:43,321][175405] Avg episode reward: [(0, '22.903')] [2023-03-07 10:42:43,651][175731] Updated weights for policy 0, policy_version 43850 (0.0006) [2023-03-07 10:42:44,460][175731] Updated weights for policy 0, policy_version 43860 (0.0006) [2023-03-07 10:42:45,262][175731] Updated weights for policy 0, policy_version 43870 (0.0006) [2023-03-07 10:42:46,054][175731] Updated weights for policy 0, policy_version 43880 (0.0007) [2023-03-07 10:42:46,873][175731] Updated weights for policy 0, policy_version 43890 (0.0006) [2023-03-07 10:42:47,647][175731] Updated weights for policy 0, policy_version 43900 (0.0007) [2023-03-07 10:42:48,321][175405] Fps is (10 sec: 12800.2, 60 sec: 12851.2, 300 sec: 12815.6). Total num frames: 44961792. Throughput: 0: 12852.3. Samples: 44948319. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:42:48,322][175405] Avg episode reward: [(0, '23.259')] [2023-03-07 10:42:48,468][175731] Updated weights for policy 0, policy_version 43910 (0.0006) [2023-03-07 10:42:49,261][175731] Updated weights for policy 0, policy_version 43920 (0.0007) [2023-03-07 10:42:50,030][175731] Updated weights for policy 0, policy_version 43930 (0.0006) [2023-03-07 10:42:50,830][175731] Updated weights for policy 0, policy_version 43940 (0.0006) [2023-03-07 10:42:51,637][175731] Updated weights for policy 0, policy_version 43950 (0.0006) [2023-03-07 10:42:52,419][175731] Updated weights for policy 0, policy_version 43960 (0.0007) [2023-03-07 10:42:53,216][175731] Updated weights for policy 0, policy_version 43970 (0.0007) [2023-03-07 10:42:53,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12851.2, 300 sec: 12819.1). Total num frames: 45026304. Throughput: 0: 12854.9. Samples: 45025572. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:42:53,321][175405] Avg episode reward: [(0, '22.438')] [2023-03-07 10:42:54,013][175731] Updated weights for policy 0, policy_version 43980 (0.0006) [2023-03-07 10:42:54,806][175731] Updated weights for policy 0, policy_version 43990 (0.0006) [2023-03-07 10:42:55,612][175731] Updated weights for policy 0, policy_version 44000 (0.0006) [2023-03-07 10:42:56,408][175731] Updated weights for policy 0, policy_version 44010 (0.0007) [2023-03-07 10:42:57,194][175731] Updated weights for policy 0, policy_version 44020 (0.0006) [2023-03-07 10:42:57,993][175731] Updated weights for policy 0, policy_version 44030 (0.0006) [2023-03-07 10:42:58,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12851.2, 300 sec: 12822.6). Total num frames: 45090816. Throughput: 0: 12852.1. Samples: 45064166. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:42:58,322][175405] Avg episode reward: [(0, '22.282')] [2023-03-07 10:42:58,789][175731] Updated weights for policy 0, policy_version 44040 (0.0006) [2023-03-07 10:42:59,594][175731] Updated weights for policy 0, policy_version 44050 (0.0007) [2023-03-07 10:43:00,383][175731] Updated weights for policy 0, policy_version 44060 (0.0007) [2023-03-07 10:43:01,184][175731] Updated weights for policy 0, policy_version 44070 (0.0006) [2023-03-07 10:43:01,990][175731] Updated weights for policy 0, policy_version 44080 (0.0006) [2023-03-07 10:43:02,792][175731] Updated weights for policy 0, policy_version 44090 (0.0006) [2023-03-07 10:43:03,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12826.0). Total num frames: 45155328. Throughput: 0: 12846.8. Samples: 45141197. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:43:03,321][175405] Avg episode reward: [(0, '22.176')] [2023-03-07 10:43:03,584][175731] Updated weights for policy 0, policy_version 44100 (0.0007) [2023-03-07 10:43:04,353][175731] Updated weights for policy 0, policy_version 44110 (0.0006) [2023-03-07 10:43:05,146][175731] Updated weights for policy 0, policy_version 44120 (0.0006) [2023-03-07 10:43:05,959][175731] Updated weights for policy 0, policy_version 44130 (0.0006) [2023-03-07 10:43:06,738][175731] Updated weights for policy 0, policy_version 44140 (0.0006) [2023-03-07 10:43:07,545][175731] Updated weights for policy 0, policy_version 44150 (0.0006) [2023-03-07 10:43:08,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12851.2, 300 sec: 12826.0). Total num frames: 45218816. Throughput: 0: 12850.6. Samples: 45218757. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:43:08,322][175405] Avg episode reward: [(0, '23.146')] [2023-03-07 10:43:08,331][175731] Updated weights for policy 0, policy_version 44160 (0.0006) [2023-03-07 10:43:09,121][175731] Updated weights for policy 0, policy_version 44170 (0.0007) [2023-03-07 10:43:09,938][175731] Updated weights for policy 0, policy_version 44180 (0.0007) [2023-03-07 10:43:10,725][175731] Updated weights for policy 0, policy_version 44190 (0.0006) [2023-03-07 10:43:11,517][175731] Updated weights for policy 0, policy_version 44200 (0.0007) [2023-03-07 10:43:12,310][175731] Updated weights for policy 0, policy_version 44210 (0.0006) [2023-03-07 10:43:13,089][175731] Updated weights for policy 0, policy_version 44220 (0.0006) [2023-03-07 10:43:13,321][175405] Fps is (10 sec: 12902.2, 60 sec: 12868.3, 300 sec: 12833.0). Total num frames: 45284352. Throughput: 0: 12855.1. Samples: 45257329. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:43:13,322][175405] Avg episode reward: [(0, '22.445')] [2023-03-07 10:43:13,880][175731] Updated weights for policy 0, policy_version 44230 (0.0007) [2023-03-07 10:43:14,681][175731] Updated weights for policy 0, policy_version 44240 (0.0007) [2023-03-07 10:43:15,478][175731] Updated weights for policy 0, policy_version 44250 (0.0006) [2023-03-07 10:43:16,288][175731] Updated weights for policy 0, policy_version 44260 (0.0007) [2023-03-07 10:43:17,079][175731] Updated weights for policy 0, policy_version 44270 (0.0006) [2023-03-07 10:43:17,863][175731] Updated weights for policy 0, policy_version 44280 (0.0006) [2023-03-07 10:43:18,321][175405] Fps is (10 sec: 12902.6, 60 sec: 12851.2, 300 sec: 12833.0). Total num frames: 45347840. Throughput: 0: 12866.4. Samples: 45334663. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:43:18,321][175405] Avg episode reward: [(0, '22.595')] [2023-03-07 10:43:18,671][175731] Updated weights for policy 0, policy_version 44290 (0.0007) [2023-03-07 10:43:19,469][175731] Updated weights for policy 0, policy_version 44300 (0.0007) [2023-03-07 10:43:20,260][175731] Updated weights for policy 0, policy_version 44310 (0.0007) [2023-03-07 10:43:21,056][175731] Updated weights for policy 0, policy_version 44320 (0.0007) [2023-03-07 10:43:21,842][175731] Updated weights for policy 0, policy_version 44330 (0.0006) [2023-03-07 10:43:22,637][175731] Updated weights for policy 0, policy_version 44340 (0.0007) [2023-03-07 10:43:23,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12833.0). Total num frames: 45412352. Throughput: 0: 12872.0. Samples: 45412077. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:43:23,322][175405] Avg episode reward: [(0, '23.822')] [2023-03-07 10:43:23,438][175731] Updated weights for policy 0, policy_version 44350 (0.0006) [2023-03-07 10:43:24,218][175731] Updated weights for policy 0, policy_version 44360 (0.0007) [2023-03-07 10:43:25,019][175731] Updated weights for policy 0, policy_version 44370 (0.0007) [2023-03-07 10:43:25,821][175731] Updated weights for policy 0, policy_version 44380 (0.0006) [2023-03-07 10:43:26,605][175731] Updated weights for policy 0, policy_version 44390 (0.0006) [2023-03-07 10:43:27,412][175731] Updated weights for policy 0, policy_version 44400 (0.0006) [2023-03-07 10:43:28,205][175731] Updated weights for policy 0, policy_version 44410 (0.0006) [2023-03-07 10:43:28,321][175405] Fps is (10 sec: 12902.1, 60 sec: 12868.2, 300 sec: 12836.4). Total num frames: 45476864. Throughput: 0: 12871.3. Samples: 45450440. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:43:28,322][175405] Avg episode reward: [(0, '23.679')] [2023-03-07 10:43:28,997][175731] Updated weights for policy 0, policy_version 44420 (0.0006) [2023-03-07 10:43:29,790][175731] Updated weights for policy 0, policy_version 44430 (0.0006) [2023-03-07 10:43:30,573][175731] Updated weights for policy 0, policy_version 44440 (0.0007) [2023-03-07 10:43:31,364][175731] Updated weights for policy 0, policy_version 44450 (0.0007) [2023-03-07 10:43:32,156][175731] Updated weights for policy 0, policy_version 44460 (0.0006) [2023-03-07 10:43:32,935][175731] Updated weights for policy 0, policy_version 44470 (0.0006) [2023-03-07 10:43:33,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12839.9). Total num frames: 45541376. Throughput: 0: 12883.8. Samples: 45528089. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:43:33,321][175405] Avg episode reward: [(0, '21.843')] [2023-03-07 10:43:33,737][175731] Updated weights for policy 0, policy_version 44480 (0.0007) [2023-03-07 10:43:34,536][175731] Updated weights for policy 0, policy_version 44490 (0.0007) [2023-03-07 10:43:35,306][175731] Updated weights for policy 0, policy_version 44500 (0.0007) [2023-03-07 10:43:36,113][175731] Updated weights for policy 0, policy_version 44510 (0.0006) [2023-03-07 10:43:36,904][175731] Updated weights for policy 0, policy_version 44520 (0.0005) [2023-03-07 10:43:37,690][175731] Updated weights for policy 0, policy_version 44530 (0.0006) [2023-03-07 10:43:38,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12839.9). Total num frames: 45605888. Throughput: 0: 12893.9. Samples: 45605800. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:43:38,322][175405] Avg episode reward: [(0, '22.900')] [2023-03-07 10:43:38,488][175731] Updated weights for policy 0, policy_version 44540 (0.0006) [2023-03-07 10:43:39,277][175731] Updated weights for policy 0, policy_version 44550 (0.0006) [2023-03-07 10:43:40,060][175731] Updated weights for policy 0, policy_version 44560 (0.0006) [2023-03-07 10:43:40,871][175731] Updated weights for policy 0, policy_version 44570 (0.0006) [2023-03-07 10:43:41,666][175731] Updated weights for policy 0, policy_version 44580 (0.0007) [2023-03-07 10:43:42,464][175731] Updated weights for policy 0, policy_version 44590 (0.0007) [2023-03-07 10:43:43,275][175731] Updated weights for policy 0, policy_version 44600 (0.0006) [2023-03-07 10:43:43,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12843.4). Total num frames: 45670400. Throughput: 0: 12895.1. Samples: 45644444. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:43:43,322][175405] Avg episode reward: [(0, '23.249')] [2023-03-07 10:43:44,061][175731] Updated weights for policy 0, policy_version 44610 (0.0007) [2023-03-07 10:43:44,861][175731] Updated weights for policy 0, policy_version 44620 (0.0007) [2023-03-07 10:43:45,669][175731] Updated weights for policy 0, policy_version 44630 (0.0006) [2023-03-07 10:43:46,453][175731] Updated weights for policy 0, policy_version 44640 (0.0007) [2023-03-07 10:43:47,255][175731] Updated weights for policy 0, policy_version 44650 (0.0006) [2023-03-07 10:43:48,047][175731] Updated weights for policy 0, policy_version 44660 (0.0007) [2023-03-07 10:43:48,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12846.9). Total num frames: 45734912. Throughput: 0: 12896.1. Samples: 45721522. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:43:48,322][175405] Avg episode reward: [(0, '23.695')] [2023-03-07 10:43:48,326][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000044663_45734912.pth... [2023-03-07 10:43:48,358][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000041650_42649600.pth [2023-03-07 10:43:48,856][175731] Updated weights for policy 0, policy_version 44670 (0.0006) [2023-03-07 10:43:49,636][175731] Updated weights for policy 0, policy_version 44680 (0.0006) [2023-03-07 10:43:50,421][175731] Updated weights for policy 0, policy_version 44690 (0.0007) [2023-03-07 10:43:51,218][175731] Updated weights for policy 0, policy_version 44700 (0.0006) [2023-03-07 10:43:52,012][175731] Updated weights for policy 0, policy_version 44710 (0.0006) [2023-03-07 10:43:52,810][175731] Updated weights for policy 0, policy_version 44720 (0.0007) [2023-03-07 10:43:53,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12850.3). Total num frames: 45799424. Throughput: 0: 12887.9. 
Samples: 45798712. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:43:53,321][175405] Avg episode reward: [(0, '23.569')] [2023-03-07 10:43:53,620][175731] Updated weights for policy 0, policy_version 44730 (0.0007) [2023-03-07 10:43:54,409][175731] Updated weights for policy 0, policy_version 44740 (0.0007) [2023-03-07 10:43:55,210][175731] Updated weights for policy 0, policy_version 44750 (0.0006) [2023-03-07 10:43:56,010][175731] Updated weights for policy 0, policy_version 44760 (0.0006) [2023-03-07 10:43:56,801][175731] Updated weights for policy 0, policy_version 44770 (0.0006) [2023-03-07 10:43:57,583][175731] Updated weights for policy 0, policy_version 44780 (0.0006) [2023-03-07 10:43:58,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12850.3). Total num frames: 45863936. Throughput: 0: 12888.6. Samples: 45837319. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:43:58,322][175405] Avg episode reward: [(0, '22.812')] [2023-03-07 10:43:58,378][175731] Updated weights for policy 0, policy_version 44790 (0.0006) [2023-03-07 10:43:59,183][175731] Updated weights for policy 0, policy_version 44800 (0.0006) [2023-03-07 10:43:59,958][175731] Updated weights for policy 0, policy_version 44810 (0.0006) [2023-03-07 10:44:00,748][175731] Updated weights for policy 0, policy_version 44820 (0.0006) [2023-03-07 10:44:01,553][175731] Updated weights for policy 0, policy_version 44830 (0.0007) [2023-03-07 10:44:02,340][175731] Updated weights for policy 0, policy_version 44840 (0.0007) [2023-03-07 10:44:03,134][175731] Updated weights for policy 0, policy_version 44850 (0.0007) [2023-03-07 10:44:03,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12850.3). Total num frames: 45928448. Throughput: 0: 12894.1. Samples: 45914899. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:44:03,321][175405] Avg episode reward: [(0, '22.900')] [2023-03-07 10:44:03,948][175731] Updated weights for policy 0, policy_version 44860 (0.0007) [2023-03-07 10:44:04,755][175731] Updated weights for policy 0, policy_version 44870 (0.0006) [2023-03-07 10:44:05,547][175731] Updated weights for policy 0, policy_version 44880 (0.0007) [2023-03-07 10:44:06,349][175731] Updated weights for policy 0, policy_version 44890 (0.0007) [2023-03-07 10:44:07,130][175731] Updated weights for policy 0, policy_version 44900 (0.0006) [2023-03-07 10:44:07,932][175731] Updated weights for policy 0, policy_version 44910 (0.0006) [2023-03-07 10:44:08,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12885.3, 300 sec: 12846.9). Total num frames: 45991936. Throughput: 0: 12881.9. Samples: 45991761. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:44:08,322][175405] Avg episode reward: [(0, '22.225')] [2023-03-07 10:44:08,746][175731] Updated weights for policy 0, policy_version 44920 (0.0006) [2023-03-07 10:44:09,542][175731] Updated weights for policy 0, policy_version 44930 (0.0007) [2023-03-07 10:44:10,339][175731] Updated weights for policy 0, policy_version 44940 (0.0006) [2023-03-07 10:44:11,144][175731] Updated weights for policy 0, policy_version 44950 (0.0007) [2023-03-07 10:44:11,929][175731] Updated weights for policy 0, policy_version 44960 (0.0006) [2023-03-07 10:44:12,723][175731] Updated weights for policy 0, policy_version 44970 (0.0007) [2023-03-07 10:44:13,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12850.3). Total num frames: 46056448. Throughput: 0: 12881.1. Samples: 46030089. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:44:13,321][175405] Avg episode reward: [(0, '22.885')] [2023-03-07 10:44:13,537][175731] Updated weights for policy 0, policy_version 44980 (0.0007) [2023-03-07 10:44:14,343][175731] Updated weights for policy 0, policy_version 44990 (0.0006) [2023-03-07 10:44:15,125][175731] Updated weights for policy 0, policy_version 45000 (0.0006) [2023-03-07 10:44:15,923][175731] Updated weights for policy 0, policy_version 45010 (0.0006) [2023-03-07 10:44:16,728][175731] Updated weights for policy 0, policy_version 45020 (0.0007) [2023-03-07 10:44:17,510][175731] Updated weights for policy 0, policy_version 45030 (0.0006) [2023-03-07 10:44:18,303][175731] Updated weights for policy 0, policy_version 45040 (0.0006) [2023-03-07 10:44:18,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12850.3). Total num frames: 46120960. Throughput: 0: 12871.7. Samples: 46107314. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:44:18,322][175405] Avg episode reward: [(0, '22.640')] [2023-03-07 10:44:19,099][175731] Updated weights for policy 0, policy_version 45050 (0.0007) [2023-03-07 10:44:19,901][175731] Updated weights for policy 0, policy_version 45060 (0.0007) [2023-03-07 10:44:20,675][175731] Updated weights for policy 0, policy_version 45070 (0.0007) [2023-03-07 10:44:21,485][175731] Updated weights for policy 0, policy_version 45080 (0.0006) [2023-03-07 10:44:22,294][175731] Updated weights for policy 0, policy_version 45090 (0.0006) [2023-03-07 10:44:23,074][175731] Updated weights for policy 0, policy_version 45100 (0.0006) [2023-03-07 10:44:23,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12850.3). Total num frames: 46184448. Throughput: 0: 12860.2. Samples: 46184508. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:44:23,322][175405] Avg episode reward: [(0, '22.900')] [2023-03-07 10:44:23,885][175731] Updated weights for policy 0, policy_version 45110 (0.0007) [2023-03-07 10:44:24,674][175731] Updated weights for policy 0, policy_version 45120 (0.0006) [2023-03-07 10:44:25,457][175731] Updated weights for policy 0, policy_version 45130 (0.0007) [2023-03-07 10:44:26,253][175731] Updated weights for policy 0, policy_version 45140 (0.0006) [2023-03-07 10:44:27,055][175731] Updated weights for policy 0, policy_version 45150 (0.0006) [2023-03-07 10:44:27,844][175731] Updated weights for policy 0, policy_version 45160 (0.0007) [2023-03-07 10:44:28,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12885.4, 300 sec: 12853.8). Total num frames: 46249984. Throughput: 0: 12859.3. Samples: 46223113. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:44:28,322][175405] Avg episode reward: [(0, '23.183')] [2023-03-07 10:44:28,640][175731] Updated weights for policy 0, policy_version 45170 (0.0006) [2023-03-07 10:44:29,442][175731] Updated weights for policy 0, policy_version 45180 (0.0007) [2023-03-07 10:44:30,224][175731] Updated weights for policy 0, policy_version 45190 (0.0006) [2023-03-07 10:44:31,042][175731] Updated weights for policy 0, policy_version 45200 (0.0007) [2023-03-07 10:44:31,840][175731] Updated weights for policy 0, policy_version 45210 (0.0007) [2023-03-07 10:44:32,615][175731] Updated weights for policy 0, policy_version 45220 (0.0006) [2023-03-07 10:44:33,321][175405] Fps is (10 sec: 12902.2, 60 sec: 12868.2, 300 sec: 12853.8). Total num frames: 46313472. Throughput: 0: 12864.6. Samples: 46300428. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:44:33,332][175405] Avg episode reward: [(0, '21.978')] [2023-03-07 10:44:33,402][175731] Updated weights for policy 0, policy_version 45230 (0.0007) [2023-03-07 10:44:34,206][175731] Updated weights for policy 0, policy_version 45240 (0.0007) [2023-03-07 10:44:34,994][175731] Updated weights for policy 0, policy_version 45250 (0.0006) [2023-03-07 10:44:35,795][175731] Updated weights for policy 0, policy_version 45260 (0.0006) [2023-03-07 10:44:36,582][175731] Updated weights for policy 0, policy_version 45270 (0.0006) [2023-03-07 10:44:37,367][175731] Updated weights for policy 0, policy_version 45280 (0.0007) [2023-03-07 10:44:38,164][175731] Updated weights for policy 0, policy_version 45290 (0.0006) [2023-03-07 10:44:38,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12885.4, 300 sec: 12860.8). Total num frames: 46379008. Throughput: 0: 12872.5. Samples: 46377973. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:44:38,332][175405] Avg episode reward: [(0, '22.884')] [2023-03-07 10:44:38,986][175731] Updated weights for policy 0, policy_version 45300 (0.0006) [2023-03-07 10:44:39,764][175731] Updated weights for policy 0, policy_version 45310 (0.0007) [2023-03-07 10:44:40,565][175731] Updated weights for policy 0, policy_version 45320 (0.0006) [2023-03-07 10:44:41,357][175731] Updated weights for policy 0, policy_version 45330 (0.0006) [2023-03-07 10:44:42,154][175731] Updated weights for policy 0, policy_version 45340 (0.0006) [2023-03-07 10:44:42,929][175731] Updated weights for policy 0, policy_version 45350 (0.0007) [2023-03-07 10:44:43,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12857.3). Total num frames: 46442496. Throughput: 0: 12868.2. Samples: 46416385. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:44:43,332][175405] Avg episode reward: [(0, '21.460')] [2023-03-07 10:44:43,726][175731] Updated weights for policy 0, policy_version 45360 (0.0007) [2023-03-07 10:44:44,514][175731] Updated weights for policy 0, policy_version 45370 (0.0006) [2023-03-07 10:44:45,318][175731] Updated weights for policy 0, policy_version 45380 (0.0006) [2023-03-07 10:44:46,118][175731] Updated weights for policy 0, policy_version 45390 (0.0007) [2023-03-07 10:44:46,917][175731] Updated weights for policy 0, policy_version 45400 (0.0006) [2023-03-07 10:44:47,724][175731] Updated weights for policy 0, policy_version 45410 (0.0007) [2023-03-07 10:44:48,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12868.2, 300 sec: 12857.3). Total num frames: 46507008. Throughput: 0: 12862.0. Samples: 46493693. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:44:48,332][175405] Avg episode reward: [(0, '22.118')] [2023-03-07 10:44:48,528][175731] Updated weights for policy 0, policy_version 45420 (0.0006) [2023-03-07 10:44:49,315][175731] Updated weights for policy 0, policy_version 45430 (0.0007) [2023-03-07 10:44:50,111][175731] Updated weights for policy 0, policy_version 45440 (0.0007) [2023-03-07 10:44:50,910][175731] Updated weights for policy 0, policy_version 45450 (0.0006) [2023-03-07 10:44:51,701][175731] Updated weights for policy 0, policy_version 45460 (0.0006) [2023-03-07 10:44:52,491][175731] Updated weights for policy 0, policy_version 45470 (0.0007) [2023-03-07 10:44:53,296][175731] Updated weights for policy 0, policy_version 45480 (0.0006) [2023-03-07 10:44:53,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12868.2, 300 sec: 12857.3). Total num frames: 46571520. Throughput: 0: 12869.2. Samples: 46570874. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:44:53,332][175405] Avg episode reward: [(0, '23.531')] [2023-03-07 10:44:54,080][175731] Updated weights for policy 0, policy_version 45490 (0.0007) [2023-03-07 10:44:54,902][175731] Updated weights for policy 0, policy_version 45500 (0.0006) [2023-03-07 10:44:55,702][175731] Updated weights for policy 0, policy_version 45510 (0.0006) [2023-03-07 10:44:56,509][175731] Updated weights for policy 0, policy_version 45520 (0.0007) [2023-03-07 10:44:57,311][175731] Updated weights for policy 0, policy_version 45530 (0.0007) [2023-03-07 10:44:58,083][175731] Updated weights for policy 0, policy_version 45540 (0.0006) [2023-03-07 10:44:58,321][175405] Fps is (10 sec: 12902.7, 60 sec: 12868.3, 300 sec: 12860.8). Total num frames: 46636032. Throughput: 0: 12871.1. Samples: 46609289. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:44:58,332][175405] Avg episode reward: [(0, '21.925')] [2023-03-07 10:44:58,880][175731] Updated weights for policy 0, policy_version 45550 (0.0006) [2023-03-07 10:44:59,679][175731] Updated weights for policy 0, policy_version 45560 (0.0006) [2023-03-07 10:45:00,458][175731] Updated weights for policy 0, policy_version 45570 (0.0006) [2023-03-07 10:45:01,263][175731] Updated weights for policy 0, policy_version 45580 (0.0007) [2023-03-07 10:45:02,066][175731] Updated weights for policy 0, policy_version 45590 (0.0007) [2023-03-07 10:45:02,846][175731] Updated weights for policy 0, policy_version 45600 (0.0006) [2023-03-07 10:45:03,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12860.7). Total num frames: 46700544. Throughput: 0: 12870.3. Samples: 46686475. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:45:03,332][175405] Avg episode reward: [(0, '22.178')] [2023-03-07 10:45:03,637][175731] Updated weights for policy 0, policy_version 45610 (0.0007) [2023-03-07 10:45:04,432][175731] Updated weights for policy 0, policy_version 45620 (0.0006) [2023-03-07 10:45:05,231][175731] Updated weights for policy 0, policy_version 45630 (0.0006) [2023-03-07 10:45:06,018][175731] Updated weights for policy 0, policy_version 45640 (0.0006) [2023-03-07 10:45:06,817][175731] Updated weights for policy 0, policy_version 45650 (0.0006) [2023-03-07 10:45:07,595][175731] Updated weights for policy 0, policy_version 45660 (0.0006) [2023-03-07 10:45:08,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12885.4, 300 sec: 12860.7). Total num frames: 46765056. Throughput: 0: 12881.8. Samples: 46764191. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:45:08,332][175405] Avg episode reward: [(0, '23.934')] [2023-03-07 10:45:08,392][175731] Updated weights for policy 0, policy_version 45670 (0.0006) [2023-03-07 10:45:09,194][175731] Updated weights for policy 0, policy_version 45680 (0.0007) [2023-03-07 10:45:09,995][175731] Updated weights for policy 0, policy_version 45690 (0.0007) [2023-03-07 10:45:10,798][175731] Updated weights for policy 0, policy_version 45700 (0.0007) [2023-03-07 10:45:11,588][175731] Updated weights for policy 0, policy_version 45710 (0.0007) [2023-03-07 10:45:12,395][175731] Updated weights for policy 0, policy_version 45720 (0.0006) [2023-03-07 10:45:13,183][175731] Updated weights for policy 0, policy_version 45730 (0.0006) [2023-03-07 10:45:13,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12857.3). Total num frames: 46828544. Throughput: 0: 12876.5. Samples: 46802556. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:45:13,332][175405] Avg episode reward: [(0, '23.248')] [2023-03-07 10:45:13,986][175731] Updated weights for policy 0, policy_version 45740 (0.0006) [2023-03-07 10:45:14,767][175731] Updated weights for policy 0, policy_version 45750 (0.0006) [2023-03-07 10:45:15,559][175731] Updated weights for policy 0, policy_version 45760 (0.0006) [2023-03-07 10:45:16,363][175731] Updated weights for policy 0, policy_version 45770 (0.0007) [2023-03-07 10:45:17,144][175731] Updated weights for policy 0, policy_version 45780 (0.0007) [2023-03-07 10:45:17,941][175731] Updated weights for policy 0, policy_version 45790 (0.0006) [2023-03-07 10:45:18,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12868.3, 300 sec: 12860.8). Total num frames: 46893056. Throughput: 0: 12877.5. Samples: 46879912. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:45:18,332][175405] Avg episode reward: [(0, '22.082')] [2023-03-07 10:45:18,729][175731] Updated weights for policy 0, policy_version 45800 (0.0006) [2023-03-07 10:45:19,509][175731] Updated weights for policy 0, policy_version 45810 (0.0007) [2023-03-07 10:45:20,325][175731] Updated weights for policy 0, policy_version 45820 (0.0006) [2023-03-07 10:45:21,109][175731] Updated weights for policy 0, policy_version 45830 (0.0006) [2023-03-07 10:45:21,909][175731] Updated weights for policy 0, policy_version 45840 (0.0006) [2023-03-07 10:45:22,700][175731] Updated weights for policy 0, policy_version 45850 (0.0006) [2023-03-07 10:45:23,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12885.3, 300 sec: 12860.7). Total num frames: 46957568. Throughput: 0: 12873.9. Samples: 46957301. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:45:23,332][175405] Avg episode reward: [(0, '23.858')] [2023-03-07 10:45:23,493][175731] Updated weights for policy 0, policy_version 45860 (0.0007) [2023-03-07 10:45:24,306][175731] Updated weights for policy 0, policy_version 45870 (0.0007) [2023-03-07 10:45:25,094][175731] Updated weights for policy 0, policy_version 45880 (0.0006) [2023-03-07 10:45:25,892][175731] Updated weights for policy 0, policy_version 45890 (0.0006) [2023-03-07 10:45:26,707][175731] Updated weights for policy 0, policy_version 45900 (0.0006) [2023-03-07 10:45:27,479][175731] Updated weights for policy 0, policy_version 45910 (0.0006) [2023-03-07 10:45:28,285][175731] Updated weights for policy 0, policy_version 45920 (0.0007) [2023-03-07 10:45:28,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12860.7). Total num frames: 47022080. Throughput: 0: 12874.4. Samples: 46995734. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:45:28,332][175405] Avg episode reward: [(0, '22.298')] [2023-03-07 10:45:29,081][175731] Updated weights for policy 0, policy_version 45930 (0.0006) [2023-03-07 10:45:29,873][175731] Updated weights for policy 0, policy_version 45940 (0.0006) [2023-03-07 10:45:30,674][175731] Updated weights for policy 0, policy_version 45950 (0.0006) [2023-03-07 10:45:31,473][175731] Updated weights for policy 0, policy_version 45960 (0.0006) [2023-03-07 10:45:32,266][175731] Updated weights for policy 0, policy_version 45970 (0.0006) [2023-03-07 10:45:33,070][175731] Updated weights for policy 0, policy_version 45980 (0.0006) [2023-03-07 10:45:33,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12864.2). Total num frames: 47086592. Throughput: 0: 12873.4. Samples: 47072993. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:45:33,322][175405] Avg episode reward: [(0, '22.754')] [2023-03-07 10:45:33,858][175731] Updated weights for policy 0, policy_version 45990 (0.0006) [2023-03-07 10:45:34,665][175731] Updated weights for policy 0, policy_version 46000 (0.0006) [2023-03-07 10:45:35,455][175731] Updated weights for policy 0, policy_version 46010 (0.0006) [2023-03-07 10:45:36,254][175731] Updated weights for policy 0, policy_version 46020 (0.0006) [2023-03-07 10:45:37,057][175731] Updated weights for policy 0, policy_version 46030 (0.0006) [2023-03-07 10:45:37,866][175731] Updated weights for policy 0, policy_version 46040 (0.0006) [2023-03-07 10:45:38,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12851.2, 300 sec: 12860.7). Total num frames: 47150080. Throughput: 0: 12869.2. Samples: 47149986. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:45:38,322][175405] Avg episode reward: [(0, '21.893')] [2023-03-07 10:45:38,659][175731] Updated weights for policy 0, policy_version 46050 (0.0006) [2023-03-07 10:45:39,447][175731] Updated weights for policy 0, policy_version 46060 (0.0008) [2023-03-07 10:45:40,224][175731] Updated weights for policy 0, policy_version 46070 (0.0007) [2023-03-07 10:45:41,012][175731] Updated weights for policy 0, policy_version 46080 (0.0007) [2023-03-07 10:45:41,824][175731] Updated weights for policy 0, policy_version 46090 (0.0007) [2023-03-07 10:45:42,625][175731] Updated weights for policy 0, policy_version 46100 (0.0006) [2023-03-07 10:45:43,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12864.2). Total num frames: 47214592. Throughput: 0: 12876.5. Samples: 47188734. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:45:43,322][175405] Avg episode reward: [(0, '22.520')] [2023-03-07 10:45:43,417][175731] Updated weights for policy 0, policy_version 46110 (0.0007) [2023-03-07 10:45:44,222][175731] Updated weights for policy 0, policy_version 46120 (0.0007) [2023-03-07 10:45:45,006][175731] Updated weights for policy 0, policy_version 46130 (0.0007) [2023-03-07 10:45:45,812][175731] Updated weights for policy 0, policy_version 46140 (0.0007) [2023-03-07 10:45:46,606][175731] Updated weights for policy 0, policy_version 46150 (0.0006) [2023-03-07 10:45:47,400][175731] Updated weights for policy 0, policy_version 46160 (0.0007) [2023-03-07 10:45:48,190][175731] Updated weights for policy 0, policy_version 46170 (0.0007) [2023-03-07 10:45:48,321][175405] Fps is (10 sec: 12902.2, 60 sec: 12868.3, 300 sec: 12864.2). Total num frames: 47279104. Throughput: 0: 12876.1. Samples: 47265901. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:45:48,322][175405] Avg episode reward: [(0, '21.864')] [2023-03-07 10:45:48,327][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000046171_47279104.pth... 
[2023-03-07 10:45:48,360][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000043155_44190720.pth [2023-03-07 10:45:48,978][175731] Updated weights for policy 0, policy_version 46180 (0.0006) [2023-03-07 10:45:49,783][175731] Updated weights for policy 0, policy_version 46190 (0.0006) [2023-03-07 10:45:50,595][175731] Updated weights for policy 0, policy_version 46200 (0.0007) [2023-03-07 10:45:51,370][175731] Updated weights for policy 0, policy_version 46210 (0.0006) [2023-03-07 10:45:52,173][175731] Updated weights for policy 0, policy_version 46220 (0.0006) [2023-03-07 10:45:52,969][175731] Updated weights for policy 0, policy_version 46230 (0.0006) [2023-03-07 10:45:53,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12867.7). Total num frames: 47343616. Throughput: 0: 12866.5. Samples: 47343183. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:45:53,322][175405] Avg episode reward: [(0, '23.616')] [2023-03-07 10:45:53,757][175731] Updated weights for policy 0, policy_version 46240 (0.0008) [2023-03-07 10:45:54,557][175731] Updated weights for policy 0, policy_version 46250 (0.0007) [2023-03-07 10:45:55,350][175731] Updated weights for policy 0, policy_version 46260 (0.0007) [2023-03-07 10:45:56,139][175731] Updated weights for policy 0, policy_version 46270 (0.0006) [2023-03-07 10:45:56,953][175731] Updated weights for policy 0, policy_version 46280 (0.0008) [2023-03-07 10:45:57,734][175731] Updated weights for policy 0, policy_version 46290 (0.0006) [2023-03-07 10:45:58,321][175405] Fps is (10 sec: 12902.6, 60 sec: 12868.2, 300 sec: 12867.7). Total num frames: 47408128. Throughput: 0: 12872.1. Samples: 47381802. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:45:58,332][175405] Avg episode reward: [(0, '23.036')] [2023-03-07 10:45:58,519][175731] Updated weights for policy 0, policy_version 46300 (0.0006) [2023-03-07 10:45:59,322][175731] Updated weights for policy 0, policy_version 46310 (0.0006) [2023-03-07 10:46:00,114][175731] Updated weights for policy 0, policy_version 46320 (0.0008) [2023-03-07 10:46:00,904][175731] Updated weights for policy 0, policy_version 46330 (0.0006) [2023-03-07 10:46:01,706][175731] Updated weights for policy 0, policy_version 46340 (0.0006) [2023-03-07 10:46:02,501][175731] Updated weights for policy 0, policy_version 46350 (0.0006) [2023-03-07 10:46:03,300][175731] Updated weights for policy 0, policy_version 46360 (0.0006) [2023-03-07 10:46:03,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12868.2, 300 sec: 12867.7). Total num frames: 47472640. Throughput: 0: 12870.9. Samples: 47459103. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:46:03,332][175405] Avg episode reward: [(0, '22.666')] [2023-03-07 10:46:04,074][175731] Updated weights for policy 0, policy_version 46370 (0.0006) [2023-03-07 10:46:04,885][175731] Updated weights for policy 0, policy_version 46380 (0.0006) [2023-03-07 10:46:05,686][175731] Updated weights for policy 0, policy_version 46390 (0.0006) [2023-03-07 10:46:06,477][175731] Updated weights for policy 0, policy_version 46400 (0.0006) [2023-03-07 10:46:07,278][175731] Updated weights for policy 0, policy_version 46410 (0.0006) [2023-03-07 10:46:08,083][175731] Updated weights for policy 0, policy_version 46420 (0.0007) [2023-03-07 10:46:08,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12868.2, 300 sec: 12867.7). Total num frames: 47537152. Throughput: 0: 12862.9. Samples: 47536130. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:46:08,332][175405] Avg episode reward: [(0, '21.872')] [2023-03-07 10:46:08,878][175731] Updated weights for policy 0, policy_version 46430 (0.0006) [2023-03-07 10:46:09,653][175731] Updated weights for policy 0, policy_version 46440 (0.0007) [2023-03-07 10:46:10,472][175731] Updated weights for policy 0, policy_version 46450 (0.0006) [2023-03-07 10:46:11,256][175731] Updated weights for policy 0, policy_version 46460 (0.0006) [2023-03-07 10:46:12,027][175731] Updated weights for policy 0, policy_version 46470 (0.0006) [2023-03-07 10:46:12,841][175731] Updated weights for policy 0, policy_version 46480 (0.0006) [2023-03-07 10:46:13,321][175405] Fps is (10 sec: 12902.6, 60 sec: 12885.3, 300 sec: 12867.7). Total num frames: 47601664. Throughput: 0: 12872.5. Samples: 47574995. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:46:13,332][175405] Avg episode reward: [(0, '22.546')] [2023-03-07 10:46:13,635][175731] Updated weights for policy 0, policy_version 46490 (0.0006) [2023-03-07 10:46:14,422][175731] Updated weights for policy 0, policy_version 46500 (0.0007) [2023-03-07 10:46:15,233][175731] Updated weights for policy 0, policy_version 46510 (0.0006) [2023-03-07 10:46:16,032][175731] Updated weights for policy 0, policy_version 46520 (0.0007) [2023-03-07 10:46:16,843][175731] Updated weights for policy 0, policy_version 46530 (0.0006) [2023-03-07 10:46:17,644][175731] Updated weights for policy 0, policy_version 46540 (0.0006) [2023-03-07 10:46:18,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12868.3, 300 sec: 12864.2). Total num frames: 47665152. Throughput: 0: 12862.5. Samples: 47651803. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:46:18,332][175405] Avg episode reward: [(0, '21.964')] [2023-03-07 10:46:18,449][175731] Updated weights for policy 0, policy_version 46550 (0.0006) [2023-03-07 10:46:19,244][175731] Updated weights for policy 0, policy_version 46560 (0.0006) [2023-03-07 10:46:20,045][175731] Updated weights for policy 0, policy_version 46570 (0.0006) [2023-03-07 10:46:20,849][175731] Updated weights for policy 0, policy_version 46580 (0.0008) [2023-03-07 10:46:21,640][175731] Updated weights for policy 0, policy_version 46590 (0.0006) [2023-03-07 10:46:22,443][175731] Updated weights for policy 0, policy_version 46600 (0.0006) [2023-03-07 10:46:23,234][175731] Updated weights for policy 0, policy_version 46610 (0.0007) [2023-03-07 10:46:23,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12868.3, 300 sec: 12867.7). Total num frames: 47729664. Throughput: 0: 12862.0. Samples: 47728775. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:46:23,332][175405] Avg episode reward: [(0, '23.173')] [2023-03-07 10:46:24,035][175731] Updated weights for policy 0, policy_version 46620 (0.0006) [2023-03-07 10:46:24,817][175731] Updated weights for policy 0, policy_version 46630 (0.0006) [2023-03-07 10:46:25,619][175731] Updated weights for policy 0, policy_version 46640 (0.0006) [2023-03-07 10:46:26,423][175731] Updated weights for policy 0, policy_version 46650 (0.0006) [2023-03-07 10:46:27,237][175731] Updated weights for policy 0, policy_version 46660 (0.0006) [2023-03-07 10:46:28,038][175731] Updated weights for policy 0, policy_version 46670 (0.0007) [2023-03-07 10:46:28,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12851.2, 300 sec: 12864.2). Total num frames: 47793152. Throughput: 0: 12856.5. Samples: 47767276. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:46:28,332][175405] Avg episode reward: [(0, '22.069')] [2023-03-07 10:46:28,823][175731] Updated weights for policy 0, policy_version 46680 (0.0006) [2023-03-07 10:46:29,627][175731] Updated weights for policy 0, policy_version 46690 (0.0007) [2023-03-07 10:46:30,433][175731] Updated weights for policy 0, policy_version 46700 (0.0007) [2023-03-07 10:46:31,218][175731] Updated weights for policy 0, policy_version 46710 (0.0006) [2023-03-07 10:46:31,999][175731] Updated weights for policy 0, policy_version 46720 (0.0005) [2023-03-07 10:46:32,810][175731] Updated weights for policy 0, policy_version 46730 (0.0007) [2023-03-07 10:46:33,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12867.7). Total num frames: 47857664. Throughput: 0: 12849.6. Samples: 47844131. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:46:33,333][175405] Avg episode reward: [(0, '22.141')] [2023-03-07 10:46:33,597][175731] Updated weights for policy 0, policy_version 46740 (0.0007) [2023-03-07 10:46:34,387][175731] Updated weights for policy 0, policy_version 46750 (0.0006) [2023-03-07 10:46:35,180][175731] Updated weights for policy 0, policy_version 46760 (0.0006) [2023-03-07 10:46:35,967][175731] Updated weights for policy 0, policy_version 46770 (0.0006) [2023-03-07 10:46:36,776][175731] Updated weights for policy 0, policy_version 46780 (0.0006) [2023-03-07 10:46:37,580][175731] Updated weights for policy 0, policy_version 46790 (0.0007) [2023-03-07 10:46:38,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12867.7). Total num frames: 47922176. Throughput: 0: 12854.5. Samples: 47921634. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:46:38,332][175405] Avg episode reward: [(0, '22.379')] [2023-03-07 10:46:38,373][175731] Updated weights for policy 0, policy_version 46800 (0.0006) [2023-03-07 10:46:39,169][175731] Updated weights for policy 0, policy_version 46810 (0.0007) [2023-03-07 10:46:39,959][175731] Updated weights for policy 0, policy_version 46820 (0.0007) [2023-03-07 10:46:40,759][175731] Updated weights for policy 0, policy_version 46830 (0.0006) [2023-03-07 10:46:41,550][175731] Updated weights for policy 0, policy_version 46840 (0.0006) [2023-03-07 10:46:42,345][175731] Updated weights for policy 0, policy_version 46850 (0.0007) [2023-03-07 10:46:43,133][175731] Updated weights for policy 0, policy_version 46860 (0.0006) [2023-03-07 10:46:43,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12867.7). Total num frames: 47986688. Throughput: 0: 12852.3. Samples: 47960156. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:46:43,332][175405] Avg episode reward: [(0, '22.136')] [2023-03-07 10:46:43,930][175731] Updated weights for policy 0, policy_version 46870 (0.0006) [2023-03-07 10:46:44,731][175731] Updated weights for policy 0, policy_version 46880 (0.0007) [2023-03-07 10:46:45,529][175731] Updated weights for policy 0, policy_version 46890 (0.0007) [2023-03-07 10:46:46,325][175731] Updated weights for policy 0, policy_version 46900 (0.0006) [2023-03-07 10:46:47,141][175731] Updated weights for policy 0, policy_version 46910 (0.0006) [2023-03-07 10:46:47,931][175731] Updated weights for policy 0, policy_version 46920 (0.0007) [2023-03-07 10:46:48,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12851.2, 300 sec: 12864.2). Total num frames: 48050176. Throughput: 0: 12847.3. Samples: 48037231. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:46:48,332][175405] Avg episode reward: [(0, '22.492')] [2023-03-07 10:46:48,712][175731] Updated weights for policy 0, policy_version 46930 (0.0006) [2023-03-07 10:46:49,521][175731] Updated weights for policy 0, policy_version 46940 (0.0005) [2023-03-07 10:46:50,302][175731] Updated weights for policy 0, policy_version 46950 (0.0006) [2023-03-07 10:46:51,099][175731] Updated weights for policy 0, policy_version 46960 (0.0007) [2023-03-07 10:46:51,902][175731] Updated weights for policy 0, policy_version 46970 (0.0007) [2023-03-07 10:46:52,689][175731] Updated weights for policy 0, policy_version 46980 (0.0006) [2023-03-07 10:46:53,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12864.2). Total num frames: 48114688. Throughput: 0: 12853.2. Samples: 48114522. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:46:53,332][175405] Avg episode reward: [(0, '22.544')] [2023-03-07 10:46:53,484][175731] Updated weights for policy 0, policy_version 46990 (0.0006) [2023-03-07 10:46:54,277][175731] Updated weights for policy 0, policy_version 47000 (0.0006) [2023-03-07 10:46:55,081][175731] Updated weights for policy 0, policy_version 47010 (0.0006) [2023-03-07 10:46:55,864][175731] Updated weights for policy 0, policy_version 47020 (0.0006) [2023-03-07 10:46:56,656][175731] Updated weights for policy 0, policy_version 47030 (0.0007) [2023-03-07 10:46:57,455][175731] Updated weights for policy 0, policy_version 47040 (0.0006) [2023-03-07 10:46:58,265][175731] Updated weights for policy 0, policy_version 47050 (0.0007) [2023-03-07 10:46:58,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12867.7). Total num frames: 48179200. Throughput: 0: 12849.7. Samples: 48153234. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:46:58,332][175405] Avg episode reward: [(0, '23.149')] [2023-03-07 10:46:59,042][175731] Updated weights for policy 0, policy_version 47060 (0.0006) [2023-03-07 10:46:59,825][175731] Updated weights for policy 0, policy_version 47070 (0.0007) [2023-03-07 10:47:00,644][175731] Updated weights for policy 0, policy_version 47080 (0.0006) [2023-03-07 10:47:01,427][175731] Updated weights for policy 0, policy_version 47090 (0.0006) [2023-03-07 10:47:02,221][175731] Updated weights for policy 0, policy_version 47100 (0.0006) [2023-03-07 10:47:03,010][175731] Updated weights for policy 0, policy_version 47110 (0.0006) [2023-03-07 10:47:03,321][175405] Fps is (10 sec: 13004.7, 60 sec: 12868.3, 300 sec: 12871.2). Total num frames: 48244736. Throughput: 0: 12863.6. Samples: 48230668. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:47:03,332][175405] Avg episode reward: [(0, '23.559')] [2023-03-07 10:47:03,815][175731] Updated weights for policy 0, policy_version 47120 (0.0007) [2023-03-07 10:47:04,621][175731] Updated weights for policy 0, policy_version 47130 (0.0006) [2023-03-07 10:47:05,413][175731] Updated weights for policy 0, policy_version 47140 (0.0007) [2023-03-07 10:47:06,222][175731] Updated weights for policy 0, policy_version 47150 (0.0006) [2023-03-07 10:47:07,009][175731] Updated weights for policy 0, policy_version 47160 (0.0007) [2023-03-07 10:47:07,801][175731] Updated weights for policy 0, policy_version 47170 (0.0007) [2023-03-07 10:47:08,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12851.2, 300 sec: 12867.7). Total num frames: 48308224. Throughput: 0: 12866.8. Samples: 48307783. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:47:08,332][175405] Avg episode reward: [(0, '22.821')] [2023-03-07 10:47:08,594][175731] Updated weights for policy 0, policy_version 47180 (0.0006) [2023-03-07 10:47:09,407][175731] Updated weights for policy 0, policy_version 47190 (0.0007) [2023-03-07 10:47:10,186][175731] Updated weights for policy 0, policy_version 47200 (0.0005) [2023-03-07 10:47:10,965][175731] Updated weights for policy 0, policy_version 47210 (0.0007) [2023-03-07 10:47:11,754][175731] Updated weights for policy 0, policy_version 47220 (0.0007) [2023-03-07 10:47:12,569][175731] Updated weights for policy 0, policy_version 47230 (0.0006) [2023-03-07 10:47:13,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12867.7). Total num frames: 48372736. Throughput: 0: 12873.9. Samples: 48346603. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:47:13,322][175405] Avg episode reward: [(0, '23.912')] [2023-03-07 10:47:13,370][175731] Updated weights for policy 0, policy_version 47240 (0.0006) [2023-03-07 10:47:14,177][175731] Updated weights for policy 0, policy_version 47250 (0.0006) [2023-03-07 10:47:14,969][175731] Updated weights for policy 0, policy_version 47260 (0.0006) [2023-03-07 10:47:15,759][175731] Updated weights for policy 0, policy_version 47270 (0.0006) [2023-03-07 10:47:16,574][175731] Updated weights for policy 0, policy_version 47280 (0.0007) [2023-03-07 10:47:17,368][175731] Updated weights for policy 0, policy_version 47290 (0.0006) [2023-03-07 10:47:18,170][175731] Updated weights for policy 0, policy_version 47300 (0.0007) [2023-03-07 10:47:18,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12864.2). Total num frames: 48436224. Throughput: 0: 12872.6. Samples: 48423400. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:47:18,322][175405] Avg episode reward: [(0, '21.811')] [2023-03-07 10:47:18,950][175731] Updated weights for policy 0, policy_version 47310 (0.0006) [2023-03-07 10:47:19,745][175731] Updated weights for policy 0, policy_version 47320 (0.0006) [2023-03-07 10:47:20,544][175731] Updated weights for policy 0, policy_version 47330 (0.0007) [2023-03-07 10:47:21,351][175731] Updated weights for policy 0, policy_version 47340 (0.0007) [2023-03-07 10:47:22,133][175731] Updated weights for policy 0, policy_version 47350 (0.0006) [2023-03-07 10:47:22,939][175731] Updated weights for policy 0, policy_version 47360 (0.0006) [2023-03-07 10:47:23,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12851.2, 300 sec: 12867.7). Total num frames: 48500736. Throughput: 0: 12865.8. Samples: 48500593. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:47:23,321][175405] Avg episode reward: [(0, '22.266')] [2023-03-07 10:47:23,725][175731] Updated weights for policy 0, policy_version 47370 (0.0006) [2023-03-07 10:47:24,520][175731] Updated weights for policy 0, policy_version 47380 (0.0007) [2023-03-07 10:47:25,321][175731] Updated weights for policy 0, policy_version 47390 (0.0005) [2023-03-07 10:47:26,129][175731] Updated weights for policy 0, policy_version 47400 (0.0006) [2023-03-07 10:47:26,930][175731] Updated weights for policy 0, policy_version 47410 (0.0007) [2023-03-07 10:47:27,726][175731] Updated weights for policy 0, policy_version 47420 (0.0008) [2023-03-07 10:47:28,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12867.7). Total num frames: 48565248. Throughput: 0: 12862.2. Samples: 48538954. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:47:28,321][175405] Avg episode reward: [(0, '21.531')] [2023-03-07 10:47:28,514][175731] Updated weights for policy 0, policy_version 47430 (0.0006) [2023-03-07 10:47:29,321][175731] Updated weights for policy 0, policy_version 47440 (0.0006) [2023-03-07 10:47:30,140][175731] Updated weights for policy 0, policy_version 47450 (0.0006) [2023-03-07 10:47:30,926][175731] Updated weights for policy 0, policy_version 47460 (0.0006) [2023-03-07 10:47:31,722][175731] Updated weights for policy 0, policy_version 47470 (0.0006) [2023-03-07 10:47:32,527][175731] Updated weights for policy 0, policy_version 47480 (0.0007) [2023-03-07 10:47:33,305][175731] Updated weights for policy 0, policy_version 47490 (0.0007) [2023-03-07 10:47:33,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12867.7). Total num frames: 48629760. Throughput: 0: 12864.4. Samples: 48616127. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:47:33,322][175405] Avg episode reward: [(0, '22.111')] [2023-03-07 10:47:34,112][175731] Updated weights for policy 0, policy_version 47500 (0.0005) [2023-03-07 10:47:34,894][175731] Updated weights for policy 0, policy_version 47510 (0.0006) [2023-03-07 10:47:35,691][175731] Updated weights for policy 0, policy_version 47520 (0.0006) [2023-03-07 10:47:36,478][175731] Updated weights for policy 0, policy_version 47530 (0.0006) [2023-03-07 10:47:37,274][175731] Updated weights for policy 0, policy_version 47540 (0.0005) [2023-03-07 10:47:38,070][175731] Updated weights for policy 0, policy_version 47550 (0.0008) [2023-03-07 10:47:38,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12867.7). Total num frames: 48693248. Throughput: 0: 12862.6. Samples: 48693337. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:47:38,321][175405] Avg episode reward: [(0, '23.006')] [2023-03-07 10:47:38,890][175731] Updated weights for policy 0, policy_version 47560 (0.0006) [2023-03-07 10:47:39,689][175731] Updated weights for policy 0, policy_version 47570 (0.0007) [2023-03-07 10:47:40,496][175731] Updated weights for policy 0, policy_version 47580 (0.0007) [2023-03-07 10:47:41,297][175731] Updated weights for policy 0, policy_version 47590 (0.0006) [2023-03-07 10:47:42,086][175731] Updated weights for policy 0, policy_version 47600 (0.0006) [2023-03-07 10:47:42,881][175731] Updated weights for policy 0, policy_version 47610 (0.0007) [2023-03-07 10:47:43,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12851.2, 300 sec: 12867.7). Total num frames: 48757760. Throughput: 0: 12851.6. Samples: 48731555. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:47:43,321][175405] Avg episode reward: [(0, '22.297')] [2023-03-07 10:47:43,691][175731] Updated weights for policy 0, policy_version 47620 (0.0006) [2023-03-07 10:47:44,488][175731] Updated weights for policy 0, policy_version 47630 (0.0006) [2023-03-07 10:47:45,261][175731] Updated weights for policy 0, policy_version 47640 (0.0006) [2023-03-07 10:47:46,078][175731] Updated weights for policy 0, policy_version 47650 (0.0006) [2023-03-07 10:47:46,876][175731] Updated weights for policy 0, policy_version 47660 (0.0006) [2023-03-07 10:47:47,666][175731] Updated weights for policy 0, policy_version 47670 (0.0006) [2023-03-07 10:47:48,321][175405] Fps is (10 sec: 12902.1, 60 sec: 12868.2, 300 sec: 12867.7). Total num frames: 48822272. Throughput: 0: 12842.3. Samples: 48808572. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:47:48,322][175405] Avg episode reward: [(0, '22.728')] [2023-03-07 10:47:48,326][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000047678_48822272.pth... 
[2023-03-07 10:47:48,358][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000044663_45734912.pth [2023-03-07 10:47:48,453][175731] Updated weights for policy 0, policy_version 47680 (0.0006) [2023-03-07 10:47:49,259][175731] Updated weights for policy 0, policy_version 47690 (0.0006) [2023-03-07 10:47:50,065][175731] Updated weights for policy 0, policy_version 47700 (0.0006) [2023-03-07 10:47:50,870][175731] Updated weights for policy 0, policy_version 47710 (0.0006) [2023-03-07 10:47:51,683][175731] Updated weights for policy 0, policy_version 47720 (0.0006) [2023-03-07 10:47:52,479][175731] Updated weights for policy 0, policy_version 47730 (0.0006) [2023-03-07 10:47:53,291][175731] Updated weights for policy 0, policy_version 47740 (0.0007) [2023-03-07 10:47:53,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12851.2, 300 sec: 12864.2). Total num frames: 48885760. Throughput: 0: 12833.4. Samples: 48885284. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:47:53,322][175405] Avg episode reward: [(0, '23.358')] [2023-03-07 10:47:54,084][175731] Updated weights for policy 0, policy_version 47750 (0.0008) [2023-03-07 10:47:54,879][175731] Updated weights for policy 0, policy_version 47760 (0.0007) [2023-03-07 10:47:55,680][175731] Updated weights for policy 0, policy_version 47770 (0.0006) [2023-03-07 10:47:56,471][175731] Updated weights for policy 0, policy_version 47780 (0.0006) [2023-03-07 10:47:57,262][175731] Updated weights for policy 0, policy_version 47790 (0.0006) [2023-03-07 10:47:58,048][175731] Updated weights for policy 0, policy_version 47800 (0.0007) [2023-03-07 10:47:58,321][175405] Fps is (10 sec: 12800.2, 60 sec: 12851.2, 300 sec: 12864.2). Total num frames: 48950272. Throughput: 0: 12829.4. Samples: 48923927. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:47:58,322][175405] Avg episode reward: [(0, '22.459')] [2023-03-07 10:47:58,849][175731] Updated weights for policy 0, policy_version 47810 (0.0006) [2023-03-07 10:47:59,655][175731] Updated weights for policy 0, policy_version 47820 (0.0006) [2023-03-07 10:48:00,455][175731] Updated weights for policy 0, policy_version 47830 (0.0007) [2023-03-07 10:48:01,237][175731] Updated weights for policy 0, policy_version 47840 (0.0007) [2023-03-07 10:48:02,038][175731] Updated weights for policy 0, policy_version 47850 (0.0007) [2023-03-07 10:48:02,836][175731] Updated weights for policy 0, policy_version 47860 (0.0007) [2023-03-07 10:48:03,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12834.1, 300 sec: 12867.7). Total num frames: 49014784. Throughput: 0: 12838.3. Samples: 49001125. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:48:03,322][175405] Avg episode reward: [(0, '22.214')] [2023-03-07 10:48:03,625][175731] Updated weights for policy 0, policy_version 47870 (0.0006) [2023-03-07 10:48:04,425][175731] Updated weights for policy 0, policy_version 47880 (0.0007) [2023-03-07 10:48:05,219][175731] Updated weights for policy 0, policy_version 47890 (0.0006) [2023-03-07 10:48:06,029][175731] Updated weights for policy 0, policy_version 47900 (0.0006) [2023-03-07 10:48:06,829][175731] Updated weights for policy 0, policy_version 47910 (0.0007) [2023-03-07 10:48:07,626][175731] Updated weights for policy 0, policy_version 47920 (0.0006) [2023-03-07 10:48:08,321][175405] Fps is (10 sec: 12902.6, 60 sec: 12851.2, 300 sec: 12864.2). Total num frames: 49079296. Throughput: 0: 12837.8. Samples: 49078293. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:48:08,321][175405] Avg episode reward: [(0, '22.051')] [2023-03-07 10:48:08,407][175731] Updated weights for policy 0, policy_version 47930 (0.0006) [2023-03-07 10:48:09,200][175731] Updated weights for policy 0, policy_version 47940 (0.0006) [2023-03-07 10:48:09,990][175731] Updated weights for policy 0, policy_version 47950 (0.0006) [2023-03-07 10:48:10,784][175731] Updated weights for policy 0, policy_version 47960 (0.0006) [2023-03-07 10:48:11,582][175731] Updated weights for policy 0, policy_version 47970 (0.0007) [2023-03-07 10:48:12,388][175731] Updated weights for policy 0, policy_version 47980 (0.0007) [2023-03-07 10:48:13,188][175731] Updated weights for policy 0, policy_version 47990 (0.0007) [2023-03-07 10:48:13,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12834.1, 300 sec: 12864.2). Total num frames: 49142784. Throughput: 0: 12845.0. Samples: 49116979. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:48:13,322][175405] Avg episode reward: [(0, '24.042')] [2023-03-07 10:48:13,983][175731] Updated weights for policy 0, policy_version 48000 (0.0007) [2023-03-07 10:48:14,777][175731] Updated weights for policy 0, policy_version 48010 (0.0007) [2023-03-07 10:48:15,562][175731] Updated weights for policy 0, policy_version 48020 (0.0006) [2023-03-07 10:48:16,366][175731] Updated weights for policy 0, policy_version 48030 (0.0006) [2023-03-07 10:48:17,145][175731] Updated weights for policy 0, policy_version 48040 (0.0006) [2023-03-07 10:48:17,952][175731] Updated weights for policy 0, policy_version 48050 (0.0006) [2023-03-07 10:48:18,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12851.2, 300 sec: 12864.2). Total num frames: 49207296. Throughput: 0: 12841.7. Samples: 49194005. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:48:18,321][175405] Avg episode reward: [(0, '21.226')] [2023-03-07 10:48:18,770][175731] Updated weights for policy 0, policy_version 48060 (0.0006) [2023-03-07 10:48:19,571][175731] Updated weights for policy 0, policy_version 48070 (0.0006) [2023-03-07 10:48:20,366][175731] Updated weights for policy 0, policy_version 48080 (0.0007) [2023-03-07 10:48:21,156][175731] Updated weights for policy 0, policy_version 48090 (0.0006) [2023-03-07 10:48:21,953][175731] Updated weights for policy 0, policy_version 48100 (0.0007) [2023-03-07 10:48:22,765][175731] Updated weights for policy 0, policy_version 48110 (0.0006) [2023-03-07 10:48:23,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12864.2). Total num frames: 49271808. Throughput: 0: 12832.0. Samples: 49270776. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:48:23,322][175405] Avg episode reward: [(0, '22.668')] [2023-03-07 10:48:23,567][175731] Updated weights for policy 0, policy_version 48120 (0.0007) [2023-03-07 10:48:24,376][175731] Updated weights for policy 0, policy_version 48130 (0.0006) [2023-03-07 10:48:25,177][175731] Updated weights for policy 0, policy_version 48140 (0.0006) [2023-03-07 10:48:25,967][175731] Updated weights for policy 0, policy_version 48150 (0.0006) [2023-03-07 10:48:26,761][175731] Updated weights for policy 0, policy_version 48160 (0.0006) [2023-03-07 10:48:27,558][175731] Updated weights for policy 0, policy_version 48170 (0.0006) [2023-03-07 10:48:28,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12860.7). Total num frames: 49335296. Throughput: 0: 12835.1. Samples: 49309133. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:48:28,332][175405] Avg episode reward: [(0, '23.731')] [2023-03-07 10:48:28,353][175731] Updated weights for policy 0, policy_version 48180 (0.0006) [2023-03-07 10:48:29,132][175731] Updated weights for policy 0, policy_version 48190 (0.0007) [2023-03-07 10:48:29,926][175731] Updated weights for policy 0, policy_version 48200 (0.0006) [2023-03-07 10:48:30,726][175731] Updated weights for policy 0, policy_version 48210 (0.0006) [2023-03-07 10:48:31,536][175731] Updated weights for policy 0, policy_version 48220 (0.0006) [2023-03-07 10:48:32,321][175731] Updated weights for policy 0, policy_version 48230 (0.0007) [2023-03-07 10:48:33,111][175731] Updated weights for policy 0, policy_version 48240 (0.0006) [2023-03-07 10:48:33,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12860.8). Total num frames: 49399808. Throughput: 0: 12843.2. Samples: 49386514. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:48:33,332][175405] Avg episode reward: [(0, '22.396')] [2023-03-07 10:48:33,902][175731] Updated weights for policy 0, policy_version 48250 (0.0006) [2023-03-07 10:48:34,710][175731] Updated weights for policy 0, policy_version 48260 (0.0006) [2023-03-07 10:48:35,516][175731] Updated weights for policy 0, policy_version 48270 (0.0007) [2023-03-07 10:48:36,337][175731] Updated weights for policy 0, policy_version 48280 (0.0007) [2023-03-07 10:48:37,125][175731] Updated weights for policy 0, policy_version 48290 (0.0006) [2023-03-07 10:48:37,932][175731] Updated weights for policy 0, policy_version 48300 (0.0006) [2023-03-07 10:48:38,321][175405] Fps is (10 sec: 12902.6, 60 sec: 12851.2, 300 sec: 12860.8). Total num frames: 49464320. Throughput: 0: 12843.7. Samples: 49463251. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:48:38,332][175405] Avg episode reward: [(0, '22.705')] [2023-03-07 10:48:38,749][175731] Updated weights for policy 0, policy_version 48310 (0.0007) [2023-03-07 10:48:39,535][175731] Updated weights for policy 0, policy_version 48320 (0.0007) [2023-03-07 10:48:40,337][175731] Updated weights for policy 0, policy_version 48330 (0.0006) [2023-03-07 10:48:41,125][175731] Updated weights for policy 0, policy_version 48340 (0.0006) [2023-03-07 10:48:41,922][175731] Updated weights for policy 0, policy_version 48350 (0.0006) [2023-03-07 10:48:42,712][175731] Updated weights for policy 0, policy_version 48360 (0.0007) [2023-03-07 10:48:43,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12857.3). Total num frames: 49527808. Throughput: 0: 12841.2. Samples: 49501778. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:48:43,332][175405] Avg episode reward: [(0, '22.406')] [2023-03-07 10:48:43,511][175731] Updated weights for policy 0, policy_version 48370 (0.0007) [2023-03-07 10:48:44,322][175731] Updated weights for policy 0, policy_version 48380 (0.0007) [2023-03-07 10:48:45,130][175731] Updated weights for policy 0, policy_version 48390 (0.0006) [2023-03-07 10:48:45,912][175731] Updated weights for policy 0, policy_version 48400 (0.0007) [2023-03-07 10:48:46,703][175731] Updated weights for policy 0, policy_version 48410 (0.0006) [2023-03-07 10:48:47,495][175731] Updated weights for policy 0, policy_version 48420 (0.0007) [2023-03-07 10:48:48,309][175731] Updated weights for policy 0, policy_version 48430 (0.0007) [2023-03-07 10:48:48,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12834.2, 300 sec: 12857.3). Total num frames: 49592320. Throughput: 0: 12836.2. Samples: 49578752. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:48:48,332][175405] Avg episode reward: [(0, '21.780')] [2023-03-07 10:48:49,121][175731] Updated weights for policy 0, policy_version 48440 (0.0006) [2023-03-07 10:48:49,917][175731] Updated weights for policy 0, policy_version 48450 (0.0006) [2023-03-07 10:48:50,700][175731] Updated weights for policy 0, policy_version 48460 (0.0006) [2023-03-07 10:48:51,504][175731] Updated weights for policy 0, policy_version 48470 (0.0007) [2023-03-07 10:48:52,297][175731] Updated weights for policy 0, policy_version 48480 (0.0006) [2023-03-07 10:48:53,102][175731] Updated weights for policy 0, policy_version 48490 (0.0006) [2023-03-07 10:48:53,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12857.3). Total num frames: 49656832. Throughput: 0: 12833.7. Samples: 49655811. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:48:53,322][175405] Avg episode reward: [(0, '23.561')] [2023-03-07 10:48:53,899][175731] Updated weights for policy 0, policy_version 48500 (0.0006) [2023-03-07 10:48:54,691][175731] Updated weights for policy 0, policy_version 48510 (0.0007) [2023-03-07 10:48:55,477][175731] Updated weights for policy 0, policy_version 48520 (0.0006) [2023-03-07 10:48:56,288][175731] Updated weights for policy 0, policy_version 48530 (0.0007) [2023-03-07 10:48:57,088][175731] Updated weights for policy 0, policy_version 48540 (0.0007) [2023-03-07 10:48:57,884][175731] Updated weights for policy 0, policy_version 48550 (0.0007) [2023-03-07 10:48:58,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12834.1, 300 sec: 12853.8). Total num frames: 49720320. Throughput: 0: 12827.2. Samples: 49694205. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:48:58,322][175405] Avg episode reward: [(0, '23.071')] [2023-03-07 10:48:58,698][175731] Updated weights for policy 0, policy_version 48560 (0.0007) [2023-03-07 10:48:59,485][175731] Updated weights for policy 0, policy_version 48570 (0.0006) [2023-03-07 10:49:00,277][175731] Updated weights for policy 0, policy_version 48580 (0.0006) [2023-03-07 10:49:01,073][175731] Updated weights for policy 0, policy_version 48590 (0.0007) [2023-03-07 10:49:01,856][175731] Updated weights for policy 0, policy_version 48600 (0.0006) [2023-03-07 10:49:02,661][175731] Updated weights for policy 0, policy_version 48610 (0.0006) [2023-03-07 10:49:03,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.2, 300 sec: 12857.3). Total num frames: 49784832. Throughput: 0: 12827.4. Samples: 49771238. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:49:03,322][175405] Avg episode reward: [(0, '23.206')] [2023-03-07 10:49:03,451][175731] Updated weights for policy 0, policy_version 48620 (0.0007) [2023-03-07 10:49:04,240][175731] Updated weights for policy 0, policy_version 48630 (0.0007) [2023-03-07 10:49:05,049][175731] Updated weights for policy 0, policy_version 48640 (0.0006) [2023-03-07 10:49:05,844][175731] Updated weights for policy 0, policy_version 48650 (0.0006) [2023-03-07 10:49:06,658][175731] Updated weights for policy 0, policy_version 48660 (0.0006) [2023-03-07 10:49:07,450][175731] Updated weights for policy 0, policy_version 48670 (0.0006) [2023-03-07 10:49:08,227][175731] Updated weights for policy 0, policy_version 48680 (0.0007) [2023-03-07 10:49:08,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.0, 300 sec: 12853.8). Total num frames: 49848320. Throughput: 0: 12835.5. Samples: 49848373. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:49:08,322][175405] Avg episode reward: [(0, '21.751')] [2023-03-07 10:49:09,033][175731] Updated weights for policy 0, policy_version 48690 (0.0007) [2023-03-07 10:49:09,823][175731] Updated weights for policy 0, policy_version 48700 (0.0006) [2023-03-07 10:49:10,619][175731] Updated weights for policy 0, policy_version 48710 (0.0007) [2023-03-07 10:49:11,408][175731] Updated weights for policy 0, policy_version 48720 (0.0007) [2023-03-07 10:49:12,201][175731] Updated weights for policy 0, policy_version 48730 (0.0006) [2023-03-07 10:49:12,998][175731] Updated weights for policy 0, policy_version 48740 (0.0006) [2023-03-07 10:49:13,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12851.2, 300 sec: 12857.3). Total num frames: 49913856. Throughput: 0: 12840.7. Samples: 49886963. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:49:13,322][175405] Avg episode reward: [(0, '23.080')] [2023-03-07 10:49:13,777][175731] Updated weights for policy 0, policy_version 48750 (0.0006) [2023-03-07 10:49:14,574][175731] Updated weights for policy 0, policy_version 48760 (0.0007) [2023-03-07 10:49:15,359][175731] Updated weights for policy 0, policy_version 48770 (0.0006) [2023-03-07 10:49:16,170][175731] Updated weights for policy 0, policy_version 48780 (0.0006) [2023-03-07 10:49:16,973][175731] Updated weights for policy 0, policy_version 48790 (0.0006) [2023-03-07 10:49:17,757][175731] Updated weights for policy 0, policy_version 48800 (0.0006) [2023-03-07 10:49:18,321][175405] Fps is (10 sec: 13005.0, 60 sec: 12851.2, 300 sec: 12860.7). Total num frames: 49978368. Throughput: 0: 12845.0. Samples: 49964538. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:49:18,321][175405] Avg episode reward: [(0, '23.611')] [2023-03-07 10:49:18,546][175731] Updated weights for policy 0, policy_version 48810 (0.0007) [2023-03-07 10:49:19,347][175731] Updated weights for policy 0, policy_version 48820 (0.0006) [2023-03-07 10:49:20,147][175731] Updated weights for policy 0, policy_version 48830 (0.0006) [2023-03-07 10:49:20,933][175731] Updated weights for policy 0, policy_version 48840 (0.0006) [2023-03-07 10:49:21,734][175731] Updated weights for policy 0, policy_version 48850 (0.0005) [2023-03-07 10:49:22,521][175731] Updated weights for policy 0, policy_version 48860 (0.0007) [2023-03-07 10:49:23,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12834.1, 300 sec: 12853.8). Total num frames: 50041856. Throughput: 0: 12858.0. Samples: 50041860. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:49:23,321][175405] Avg episode reward: [(0, '24.883')] [2023-03-07 10:49:23,332][175731] Updated weights for policy 0, policy_version 48870 (0.0006) [2023-03-07 10:49:24,132][175731] Updated weights for policy 0, policy_version 48880 (0.0006) [2023-03-07 10:49:24,937][175731] Updated weights for policy 0, policy_version 48890 (0.0007) [2023-03-07 10:49:25,725][175731] Updated weights for policy 0, policy_version 48900 (0.0006) [2023-03-07 10:49:26,506][175731] Updated weights for policy 0, policy_version 48910 (0.0006) [2023-03-07 10:49:27,297][175731] Updated weights for policy 0, policy_version 48920 (0.0006) [2023-03-07 10:49:28,101][175731] Updated weights for policy 0, policy_version 48930 (0.0006) [2023-03-07 10:49:28,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12860.7). Total num frames: 50107392. Throughput: 0: 12856.8. Samples: 50080334. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:49:28,322][175405] Avg episode reward: [(0, '22.144')] [2023-03-07 10:49:28,890][175731] Updated weights for policy 0, policy_version 48940 (0.0007) [2023-03-07 10:49:29,682][175731] Updated weights for policy 0, policy_version 48950 (0.0006) [2023-03-07 10:49:30,476][175731] Updated weights for policy 0, policy_version 48960 (0.0007) [2023-03-07 10:49:31,269][175731] Updated weights for policy 0, policy_version 48970 (0.0007) [2023-03-07 10:49:32,061][175731] Updated weights for policy 0, policy_version 48980 (0.0006) [2023-03-07 10:49:32,862][175731] Updated weights for policy 0, policy_version 48990 (0.0006) [2023-03-07 10:49:33,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12851.2, 300 sec: 12853.8). Total num frames: 50170880. Throughput: 0: 12866.4. Samples: 50157741. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:49:33,322][175405] Avg episode reward: [(0, '24.836')] [2023-03-07 10:49:33,642][175731] Updated weights for policy 0, policy_version 49000 (0.0006) [2023-03-07 10:49:34,437][175731] Updated weights for policy 0, policy_version 49010 (0.0007) [2023-03-07 10:49:35,245][175731] Updated weights for policy 0, policy_version 49020 (0.0007) [2023-03-07 10:49:36,036][175731] Updated weights for policy 0, policy_version 49030 (0.0006) [2023-03-07 10:49:36,845][175731] Updated weights for policy 0, policy_version 49040 (0.0006) [2023-03-07 10:49:37,654][175731] Updated weights for policy 0, policy_version 49050 (0.0007) [2023-03-07 10:49:38,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12851.2, 300 sec: 12857.3). Total num frames: 50235392. Throughput: 0: 12863.7. Samples: 50234676. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:49:38,321][175405] Avg episode reward: [(0, '23.549')] [2023-03-07 10:49:38,458][175731] Updated weights for policy 0, policy_version 49060 (0.0006) [2023-03-07 10:49:39,256][175731] Updated weights for policy 0, policy_version 49070 (0.0006) [2023-03-07 10:49:40,054][175731] Updated weights for policy 0, policy_version 49080 (0.0007) [2023-03-07 10:49:40,836][175731] Updated weights for policy 0, policy_version 49090 (0.0006) [2023-03-07 10:49:41,626][175731] Updated weights for policy 0, policy_version 49100 (0.0008) [2023-03-07 10:49:42,432][175731] Updated weights for policy 0, policy_version 49110 (0.0006) [2023-03-07 10:49:43,234][175731] Updated weights for policy 0, policy_version 49120 (0.0007) [2023-03-07 10:49:43,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12857.3). Total num frames: 50299904. Throughput: 0: 12869.7. Samples: 50273339. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:49:43,322][175405] Avg episode reward: [(0, '23.140')] [2023-03-07 10:49:44,038][175731] Updated weights for policy 0, policy_version 49130 (0.0006) [2023-03-07 10:49:44,823][175731] Updated weights for policy 0, policy_version 49140 (0.0007) [2023-03-07 10:49:45,613][175731] Updated weights for policy 0, policy_version 49150 (0.0006) [2023-03-07 10:49:46,391][175731] Updated weights for policy 0, policy_version 49160 (0.0006) [2023-03-07 10:49:47,186][175731] Updated weights for policy 0, policy_version 49170 (0.0006) [2023-03-07 10:49:47,981][175731] Updated weights for policy 0, policy_version 49180 (0.0006) [2023-03-07 10:49:48,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12868.2, 300 sec: 12857.3). Total num frames: 50364416. Throughput: 0: 12875.2. Samples: 50350623. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:49:48,322][175405] Avg episode reward: [(0, '22.489')] [2023-03-07 10:49:48,326][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000049184_50364416.pth... [2023-03-07 10:49:48,359][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000046171_47279104.pth [2023-03-07 10:49:48,778][175731] Updated weights for policy 0, policy_version 49190 (0.0007) [2023-03-07 10:49:49,574][175731] Updated weights for policy 0, policy_version 49200 (0.0008) [2023-03-07 10:49:50,386][175731] Updated weights for policy 0, policy_version 49210 (0.0006) [2023-03-07 10:49:51,166][175731] Updated weights for policy 0, policy_version 49220 (0.0006) [2023-03-07 10:49:51,966][175731] Updated weights for policy 0, policy_version 49230 (0.0007) [2023-03-07 10:49:52,770][175731] Updated weights for policy 0, policy_version 49240 (0.0006) [2023-03-07 10:49:53,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12857.3). Total num frames: 50428928. Throughput: 0: 12878.9. Samples: 50427922. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:49:53,322][175405] Avg episode reward: [(0, '22.305')] [2023-03-07 10:49:53,554][175731] Updated weights for policy 0, policy_version 49250 (0.0007) [2023-03-07 10:49:54,348][175731] Updated weights for policy 0, policy_version 49260 (0.0006) [2023-03-07 10:49:55,159][175731] Updated weights for policy 0, policy_version 49270 (0.0006) [2023-03-07 10:49:55,949][175731] Updated weights for policy 0, policy_version 49280 (0.0006) [2023-03-07 10:49:56,757][175731] Updated weights for policy 0, policy_version 49290 (0.0006) [2023-03-07 10:49:57,541][175731] Updated weights for policy 0, policy_version 49300 (0.0006) [2023-03-07 10:49:58,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12868.3, 300 sec: 12853.8). Total num frames: 50492416. Throughput: 0: 12877.9. 
Samples: 50466468. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:49:58,321][175405] Avg episode reward: [(0, '23.764')] [2023-03-07 10:49:58,329][175731] Updated weights for policy 0, policy_version 49310 (0.0006) [2023-03-07 10:49:59,128][175731] Updated weights for policy 0, policy_version 49320 (0.0006) [2023-03-07 10:49:59,924][175731] Updated weights for policy 0, policy_version 49330 (0.0006) [2023-03-07 10:50:00,720][175731] Updated weights for policy 0, policy_version 49340 (0.0007) [2023-03-07 10:50:01,513][175731] Updated weights for policy 0, policy_version 49350 (0.0006) [2023-03-07 10:50:02,314][175731] Updated weights for policy 0, policy_version 49360 (0.0007) [2023-03-07 10:50:03,097][175731] Updated weights for policy 0, policy_version 49370 (0.0007) [2023-03-07 10:50:03,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12868.2, 300 sec: 12853.8). Total num frames: 50556928. Throughput: 0: 12872.0. Samples: 50543781. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:50:03,322][175405] Avg episode reward: [(0, '22.653')] [2023-03-07 10:50:03,910][175731] Updated weights for policy 0, policy_version 49380 (0.0007) [2023-03-07 10:50:04,699][175731] Updated weights for policy 0, policy_version 49390 (0.0006) [2023-03-07 10:50:05,497][175731] Updated weights for policy 0, policy_version 49400 (0.0007) [2023-03-07 10:50:06,297][175731] Updated weights for policy 0, policy_version 49410 (0.0006) [2023-03-07 10:50:07,098][175731] Updated weights for policy 0, policy_version 49420 (0.0006) [2023-03-07 10:50:07,893][175731] Updated weights for policy 0, policy_version 49430 (0.0006) [2023-03-07 10:50:08,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12857.3). Total num frames: 50621440. Throughput: 0: 12864.1. Samples: 50620748. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:50:08,322][175405] Avg episode reward: [(0, '22.311')] [2023-03-07 10:50:08,687][175731] Updated weights for policy 0, policy_version 49440 (0.0006) [2023-03-07 10:50:09,497][175731] Updated weights for policy 0, policy_version 49450 (0.0007) [2023-03-07 10:50:10,286][175731] Updated weights for policy 0, policy_version 49460 (0.0006) [2023-03-07 10:50:11,088][175731] Updated weights for policy 0, policy_version 49470 (0.0007) [2023-03-07 10:50:11,896][175731] Updated weights for policy 0, policy_version 49480 (0.0007) [2023-03-07 10:50:12,684][175731] Updated weights for policy 0, policy_version 49490 (0.0007) [2023-03-07 10:50:13,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12851.2, 300 sec: 12853.8). Total num frames: 50684928. Throughput: 0: 12863.4. Samples: 50659186. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:50:13,322][175405] Avg episode reward: [(0, '23.711')] [2023-03-07 10:50:13,479][175731] Updated weights for policy 0, policy_version 49500 (0.0007) [2023-03-07 10:50:14,277][175731] Updated weights for policy 0, policy_version 49510 (0.0007) [2023-03-07 10:50:15,083][175731] Updated weights for policy 0, policy_version 49520 (0.0007) [2023-03-07 10:50:15,881][175731] Updated weights for policy 0, policy_version 49530 (0.0006) [2023-03-07 10:50:16,515][175680] KL-divergence is very high: 18029842.0000 [2023-03-07 10:50:16,691][175731] Updated weights for policy 0, policy_version 49540 (0.0007) [2023-03-07 10:50:16,760][175680] KL-divergence is very high: 2978.5444 [2023-03-07 10:50:17,490][175731] Updated weights for policy 0, policy_version 49550 (0.0006) [2023-03-07 10:50:18,283][175731] Updated weights for policy 0, policy_version 49560 (0.0007) [2023-03-07 10:50:18,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12853.8). Total num frames: 50749440. Throughput: 0: 12850.9. Samples: 50736032. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:50:18,322][175405] Avg episode reward: [(0, '21.409')] [2023-03-07 10:50:19,076][175731] Updated weights for policy 0, policy_version 49570 (0.0006) [2023-03-07 10:50:19,860][175731] Updated weights for policy 0, policy_version 49580 (0.0007) [2023-03-07 10:50:20,662][175731] Updated weights for policy 0, policy_version 49590 (0.0006) [2023-03-07 10:50:21,437][175731] Updated weights for policy 0, policy_version 49600 (0.0006) [2023-03-07 10:50:22,260][175731] Updated weights for policy 0, policy_version 49610 (0.0007) [2023-03-07 10:50:23,025][175731] Updated weights for policy 0, policy_version 49620 (0.0006) [2023-03-07 10:50:23,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12853.8). Total num frames: 50813952. Throughput: 0: 12863.8. Samples: 50813546. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:50:23,322][175405] Avg episode reward: [(0, '23.336')] [2023-03-07 10:50:23,843][175731] Updated weights for policy 0, policy_version 49630 (0.0006) [2023-03-07 10:50:24,647][175731] Updated weights for policy 0, policy_version 49640 (0.0006) [2023-03-07 10:50:25,441][175731] Updated weights for policy 0, policy_version 49650 (0.0007) [2023-03-07 10:50:26,245][175731] Updated weights for policy 0, policy_version 49660 (0.0006) [2023-03-07 10:50:27,044][175731] Updated weights for policy 0, policy_version 49670 (0.0006) [2023-03-07 10:50:27,838][175731] Updated weights for policy 0, policy_version 49680 (0.0007) [2023-03-07 10:50:28,321][175405] Fps is (10 sec: 12902.6, 60 sec: 12851.2, 300 sec: 12853.8). Total num frames: 50878464. Throughput: 0: 12859.0. Samples: 50851995. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:50:28,321][175405] Avg episode reward: [(0, '22.735')] [2023-03-07 10:50:28,639][175731] Updated weights for policy 0, policy_version 49690 (0.0007) [2023-03-07 10:50:29,437][175731] Updated weights for policy 0, policy_version 49700 (0.0007) [2023-03-07 10:50:30,217][175731] Updated weights for policy 0, policy_version 49710 (0.0006) [2023-03-07 10:50:31,028][175731] Updated weights for policy 0, policy_version 49720 (0.0007) [2023-03-07 10:50:31,816][175731] Updated weights for policy 0, policy_version 49730 (0.0007) [2023-03-07 10:50:32,613][175731] Updated weights for policy 0, policy_version 49740 (0.0007) [2023-03-07 10:50:33,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12851.2, 300 sec: 12853.8). Total num frames: 50941952. Throughput: 0: 12854.6. Samples: 50929079. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:50:33,321][175405] Avg episode reward: [(0, '22.857')] [2023-03-07 10:50:33,416][175731] Updated weights for policy 0, policy_version 49750 (0.0006) [2023-03-07 10:50:34,204][175731] Updated weights for policy 0, policy_version 49760 (0.0006) [2023-03-07 10:50:35,016][175731] Updated weights for policy 0, policy_version 49770 (0.0006) [2023-03-07 10:50:35,803][175731] Updated weights for policy 0, policy_version 49780 (0.0006) [2023-03-07 10:50:36,578][175731] Updated weights for policy 0, policy_version 49790 (0.0006) [2023-03-07 10:50:37,383][175731] Updated weights for policy 0, policy_version 49800 (0.0007) [2023-03-07 10:50:38,181][175731] Updated weights for policy 0, policy_version 49810 (0.0006) [2023-03-07 10:50:38,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12851.2, 300 sec: 12853.8). Total num frames: 51006464. Throughput: 0: 12853.3. Samples: 51006323. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:50:38,322][175405] Avg episode reward: [(0, '22.013')] [2023-03-07 10:50:38,974][175731] Updated weights for policy 0, policy_version 49820 (0.0007) [2023-03-07 10:50:39,776][175731] Updated weights for policy 0, policy_version 49830 (0.0007) [2023-03-07 10:50:40,586][175731] Updated weights for policy 0, policy_version 49840 (0.0007) [2023-03-07 10:50:41,382][175731] Updated weights for policy 0, policy_version 49850 (0.0006) [2023-03-07 10:50:42,184][175731] Updated weights for policy 0, policy_version 49860 (0.0007) [2023-03-07 10:50:42,981][175731] Updated weights for policy 0, policy_version 49870 (0.0006) [2023-03-07 10:50:43,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12851.2, 300 sec: 12853.8). Total num frames: 51070976. Throughput: 0: 12850.3. Samples: 51044734. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:50:43,322][175405] Avg episode reward: [(0, '22.407')] [2023-03-07 10:50:43,783][175731] Updated weights for policy 0, policy_version 49880 (0.0006) [2023-03-07 10:50:44,577][175731] Updated weights for policy 0, policy_version 49890 (0.0006) [2023-03-07 10:50:45,373][175731] Updated weights for policy 0, policy_version 49900 (0.0006) [2023-03-07 10:50:46,177][175731] Updated weights for policy 0, policy_version 49910 (0.0006) [2023-03-07 10:50:46,977][175731] Updated weights for policy 0, policy_version 49920 (0.0007) [2023-03-07 10:50:47,754][175731] Updated weights for policy 0, policy_version 49930 (0.0007) [2023-03-07 10:50:48,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12851.2, 300 sec: 12853.8). Total num frames: 51135488. Throughput: 0: 12844.4. Samples: 51121780. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:50:48,322][175405] Avg episode reward: [(0, '23.397')] [2023-03-07 10:50:48,542][175731] Updated weights for policy 0, policy_version 49940 (0.0006) [2023-03-07 10:50:49,341][175731] Updated weights for policy 0, policy_version 49950 (0.0006) [2023-03-07 10:50:50,129][175731] Updated weights for policy 0, policy_version 49960 (0.0007) [2023-03-07 10:50:50,935][175731] Updated weights for policy 0, policy_version 49970 (0.0006) [2023-03-07 10:50:51,735][175731] Updated weights for policy 0, policy_version 49980 (0.0006) [2023-03-07 10:50:52,534][175731] Updated weights for policy 0, policy_version 49990 (0.0007) [2023-03-07 10:50:53,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12850.3). Total num frames: 51198976. Throughput: 0: 12845.9. Samples: 51198812. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:50:53,322][175405] Avg episode reward: [(0, '22.601')] [2023-03-07 10:50:53,336][175731] Updated weights for policy 0, policy_version 50000 (0.0006) [2023-03-07 10:50:54,118][175731] Updated weights for policy 0, policy_version 50010 (0.0007) [2023-03-07 10:50:54,945][175731] Updated weights for policy 0, policy_version 50020 (0.0007) [2023-03-07 10:50:55,747][175731] Updated weights for policy 0, policy_version 50030 (0.0006) [2023-03-07 10:50:56,537][175731] Updated weights for policy 0, policy_version 50040 (0.0007) [2023-03-07 10:50:57,320][175731] Updated weights for policy 0, policy_version 50050 (0.0006) [2023-03-07 10:50:58,114][175731] Updated weights for policy 0, policy_version 50060 (0.0006) [2023-03-07 10:50:58,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12850.3). Total num frames: 51263488. Throughput: 0: 12847.3. Samples: 51237313. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:50:58,321][175405] Avg episode reward: [(0, '23.449')] [2023-03-07 10:50:58,910][175731] Updated weights for policy 0, policy_version 50070 (0.0006) [2023-03-07 10:50:59,697][175731] Updated weights for policy 0, policy_version 50080 (0.0007) [2023-03-07 10:51:00,473][175731] Updated weights for policy 0, policy_version 50090 (0.0007) [2023-03-07 10:51:01,286][175731] Updated weights for policy 0, policy_version 50100 (0.0006) [2023-03-07 10:51:02,075][175731] Updated weights for policy 0, policy_version 50110 (0.0007) [2023-03-07 10:51:02,866][175731] Updated weights for policy 0, policy_version 50120 (0.0007) [2023-03-07 10:51:03,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12851.2, 300 sec: 12850.3). Total num frames: 51328000. Throughput: 0: 12862.6. Samples: 51314847. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:51:03,321][175405] Avg episode reward: [(0, '23.337')] [2023-03-07 10:51:03,651][175731] Updated weights for policy 0, policy_version 50130 (0.0006) [2023-03-07 10:51:04,442][175731] Updated weights for policy 0, policy_version 50140 (0.0006) [2023-03-07 10:51:05,239][175731] Updated weights for policy 0, policy_version 50150 (0.0007) [2023-03-07 10:51:06,038][175731] Updated weights for policy 0, policy_version 50160 (0.0007) [2023-03-07 10:51:06,833][175731] Updated weights for policy 0, policy_version 50170 (0.0007) [2023-03-07 10:51:07,624][175731] Updated weights for policy 0, policy_version 50180 (0.0006) [2023-03-07 10:51:08,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12850.3). Total num frames: 51392512. Throughput: 0: 12858.7. Samples: 51392188. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:51:08,332][175405] Avg episode reward: [(0, '21.860')] [2023-03-07 10:51:08,433][175731] Updated weights for policy 0, policy_version 50190 (0.0007) [2023-03-07 10:51:09,228][175731] Updated weights for policy 0, policy_version 50200 (0.0007) [2023-03-07 10:51:10,031][175731] Updated weights for policy 0, policy_version 50210 (0.0005) [2023-03-07 10:51:10,823][175731] Updated weights for policy 0, policy_version 50220 (0.0007) [2023-03-07 10:51:11,621][175731] Updated weights for policy 0, policy_version 50230 (0.0007) [2023-03-07 10:51:12,413][175731] Updated weights for policy 0, policy_version 50240 (0.0007) [2023-03-07 10:51:13,205][175731] Updated weights for policy 0, policy_version 50250 (0.0007) [2023-03-07 10:51:13,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12853.8). Total num frames: 51457024. Throughput: 0: 12856.1. Samples: 51430519. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:51:13,332][175405] Avg episode reward: [(0, '22.881')] [2023-03-07 10:51:14,016][175731] Updated weights for policy 0, policy_version 50260 (0.0007) [2023-03-07 10:51:14,804][175731] Updated weights for policy 0, policy_version 50270 (0.0006) [2023-03-07 10:51:15,586][175731] Updated weights for policy 0, policy_version 50280 (0.0006) [2023-03-07 10:51:16,391][175731] Updated weights for policy 0, policy_version 50290 (0.0007) [2023-03-07 10:51:17,192][175731] Updated weights for policy 0, policy_version 50300 (0.0007) [2023-03-07 10:51:18,000][175731] Updated weights for policy 0, policy_version 50310 (0.0006) [2023-03-07 10:51:18,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12850.3). Total num frames: 51520512. Throughput: 0: 12859.6. Samples: 51507762. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:51:18,332][175405] Avg episode reward: [(0, '22.851')] [2023-03-07 10:51:18,798][175731] Updated weights for policy 0, policy_version 50320 (0.0007) [2023-03-07 10:51:19,593][175731] Updated weights for policy 0, policy_version 50330 (0.0006) [2023-03-07 10:51:20,386][175731] Updated weights for policy 0, policy_version 50340 (0.0007) [2023-03-07 10:51:21,189][175731] Updated weights for policy 0, policy_version 50350 (0.0007) [2023-03-07 10:51:21,961][175731] Updated weights for policy 0, policy_version 50360 (0.0006) [2023-03-07 10:51:22,777][175731] Updated weights for policy 0, policy_version 50370 (0.0007) [2023-03-07 10:51:23,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12853.8). Total num frames: 51585024. Throughput: 0: 12857.9. Samples: 51584925. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:51:23,332][175405] Avg episode reward: [(0, '23.031')] [2023-03-07 10:51:23,594][175731] Updated weights for policy 0, policy_version 50380 (0.0006) [2023-03-07 10:51:24,379][175731] Updated weights for policy 0, policy_version 50390 (0.0007) [2023-03-07 10:51:25,158][175731] Updated weights for policy 0, policy_version 50400 (0.0007) [2023-03-07 10:51:25,967][175731] Updated weights for policy 0, policy_version 50410 (0.0007) [2023-03-07 10:51:26,763][175731] Updated weights for policy 0, policy_version 50420 (0.0007) [2023-03-07 10:51:27,570][175731] Updated weights for policy 0, policy_version 50430 (0.0007) [2023-03-07 10:51:28,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12853.8). Total num frames: 51649536. Throughput: 0: 12857.0. Samples: 51623299. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:51:28,321][175405] Avg episode reward: [(0, '22.176')] [2023-03-07 10:51:28,359][175731] Updated weights for policy 0, policy_version 50440 (0.0007) [2023-03-07 10:51:29,168][175731] Updated weights for policy 0, policy_version 50450 (0.0007) [2023-03-07 10:51:29,954][175731] Updated weights for policy 0, policy_version 50460 (0.0006) [2023-03-07 10:51:30,773][175731] Updated weights for policy 0, policy_version 50470 (0.0007) [2023-03-07 10:51:31,568][175731] Updated weights for policy 0, policy_version 50480 (0.0007) [2023-03-07 10:51:32,341][175731] Updated weights for policy 0, policy_version 50490 (0.0006) [2023-03-07 10:51:33,134][175731] Updated weights for policy 0, policy_version 50500 (0.0007) [2023-03-07 10:51:33,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12868.2, 300 sec: 12853.8). Total num frames: 51714048. Throughput: 0: 12861.4. Samples: 51700541. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:51:33,322][175405] Avg episode reward: [(0, '23.442')] [2023-03-07 10:51:33,946][175731] Updated weights for policy 0, policy_version 50510 (0.0006) [2023-03-07 10:51:34,732][175731] Updated weights for policy 0, policy_version 50520 (0.0006) [2023-03-07 10:51:35,512][175731] Updated weights for policy 0, policy_version 50530 (0.0007) [2023-03-07 10:51:36,317][175731] Updated weights for policy 0, policy_version 50540 (0.0007) [2023-03-07 10:51:37,098][175731] Updated weights for policy 0, policy_version 50550 (0.0006) [2023-03-07 10:51:37,895][175731] Updated weights for policy 0, policy_version 50560 (0.0006) [2023-03-07 10:51:38,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12853.8). Total num frames: 51778560. Throughput: 0: 12868.2. Samples: 51777881. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:51:38,321][175405] Avg episode reward: [(0, '23.845')] [2023-03-07 10:51:38,691][175731] Updated weights for policy 0, policy_version 50570 (0.0006) [2023-03-07 10:51:39,471][175731] Updated weights for policy 0, policy_version 50580 (0.0006) [2023-03-07 10:51:40,275][175731] Updated weights for policy 0, policy_version 50590 (0.0006) [2023-03-07 10:51:41,077][175731] Updated weights for policy 0, policy_version 50600 (0.0006) [2023-03-07 10:51:41,864][175731] Updated weights for policy 0, policy_version 50610 (0.0006) [2023-03-07 10:51:42,646][175731] Updated weights for policy 0, policy_version 50620 (0.0006) [2023-03-07 10:51:43,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12857.3). Total num frames: 51843072. Throughput: 0: 12869.4. Samples: 51816436. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:51:43,322][175405] Avg episode reward: [(0, '22.108')] [2023-03-07 10:51:43,453][175731] Updated weights for policy 0, policy_version 50630 (0.0006) [2023-03-07 10:51:44,253][175731] Updated weights for policy 0, policy_version 50640 (0.0006) [2023-03-07 10:51:45,054][175731] Updated weights for policy 0, policy_version 50650 (0.0006) [2023-03-07 10:51:45,871][175731] Updated weights for policy 0, policy_version 50660 (0.0006) [2023-03-07 10:51:46,674][175731] Updated weights for policy 0, policy_version 50670 (0.0006) [2023-03-07 10:51:47,465][175731] Updated weights for policy 0, policy_version 50680 (0.0006) [2023-03-07 10:51:48,265][175731] Updated weights for policy 0, policy_version 50690 (0.0006) [2023-03-07 10:51:48,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12851.2, 300 sec: 12853.8). Total num frames: 51906560. Throughput: 0: 12860.3. Samples: 51893561. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:51:48,322][175405] Avg episode reward: [(0, '22.882')] [2023-03-07 10:51:48,326][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000050690_51906560.pth... [2023-03-07 10:51:48,357][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000047678_48822272.pth [2023-03-07 10:51:49,073][175731] Updated weights for policy 0, policy_version 50700 (0.0007) [2023-03-07 10:51:49,871][175731] Updated weights for policy 0, policy_version 50710 (0.0006) [2023-03-07 10:51:50,664][175731] Updated weights for policy 0, policy_version 50720 (0.0006) [2023-03-07 10:51:51,471][175731] Updated weights for policy 0, policy_version 50730 (0.0006) [2023-03-07 10:51:52,269][175731] Updated weights for policy 0, policy_version 50740 (0.0006) [2023-03-07 10:51:53,057][175731] Updated weights for policy 0, policy_version 50750 (0.0006) [2023-03-07 10:51:53,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12868.3, 300 sec: 12853.8). Total num frames: 51971072. Throughput: 0: 12849.5. Samples: 51970416. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:51:53,322][175405] Avg episode reward: [(0, '22.829')] [2023-03-07 10:51:53,860][175731] Updated weights for policy 0, policy_version 50760 (0.0006) [2023-03-07 10:51:54,644][175731] Updated weights for policy 0, policy_version 50770 (0.0006) [2023-03-07 10:51:55,419][175731] Updated weights for policy 0, policy_version 50780 (0.0006) [2023-03-07 10:51:56,224][175731] Updated weights for policy 0, policy_version 50790 (0.0007) [2023-03-07 10:51:57,014][175731] Updated weights for policy 0, policy_version 50800 (0.0006) [2023-03-07 10:51:57,813][175731] Updated weights for policy 0, policy_version 50810 (0.0006) [2023-03-07 10:51:58,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12868.2, 300 sec: 12850.3). Total num frames: 52035584. Throughput: 0: 12860.0. 
Samples: 52009223. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:51:58,322][175405] Avg episode reward: [(0, '22.421')] [2023-03-07 10:51:58,611][175731] Updated weights for policy 0, policy_version 50820 (0.0006) [2023-03-07 10:51:59,406][175731] Updated weights for policy 0, policy_version 50830 (0.0007) [2023-03-07 10:52:00,190][175731] Updated weights for policy 0, policy_version 50840 (0.0006) [2023-03-07 10:52:01,002][175731] Updated weights for policy 0, policy_version 50850 (0.0006) [2023-03-07 10:52:01,793][175731] Updated weights for policy 0, policy_version 50860 (0.0007) [2023-03-07 10:52:02,619][175731] Updated weights for policy 0, policy_version 50870 (0.0006) [2023-03-07 10:52:03,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12851.2, 300 sec: 12850.3). Total num frames: 52099072. Throughput: 0: 12855.7. Samples: 52086269. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:52:03,322][175405] Avg episode reward: [(0, '23.613')] [2023-03-07 10:52:03,397][175731] Updated weights for policy 0, policy_version 50880 (0.0006) [2023-03-07 10:52:04,205][175731] Updated weights for policy 0, policy_version 50890 (0.0006) [2023-03-07 10:52:05,004][175731] Updated weights for policy 0, policy_version 50900 (0.0007) [2023-03-07 10:52:05,818][175731] Updated weights for policy 0, policy_version 50910 (0.0006) [2023-03-07 10:52:06,595][175731] Updated weights for policy 0, policy_version 50920 (0.0006) [2023-03-07 10:52:07,394][175731] Updated weights for policy 0, policy_version 50930 (0.0007) [2023-03-07 10:52:08,179][175731] Updated weights for policy 0, policy_version 50940 (0.0006) [2023-03-07 10:52:08,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12851.2, 300 sec: 12850.3). Total num frames: 52163584. Throughput: 0: 12855.7. Samples: 52163434. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:52:08,322][175405] Avg episode reward: [(0, '23.592')] [2023-03-07 10:52:08,978][175731] Updated weights for policy 0, policy_version 50950 (0.0006) [2023-03-07 10:52:09,769][175731] Updated weights for policy 0, policy_version 50960 (0.0007) [2023-03-07 10:52:10,570][175731] Updated weights for policy 0, policy_version 50970 (0.0007) [2023-03-07 10:52:11,376][175731] Updated weights for policy 0, policy_version 50980 (0.0005) [2023-03-07 10:52:12,170][175731] Updated weights for policy 0, policy_version 50990 (0.0006) [2023-03-07 10:52:12,969][175731] Updated weights for policy 0, policy_version 51000 (0.0006) [2023-03-07 10:52:13,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12851.2, 300 sec: 12853.8). Total num frames: 52228096. Throughput: 0: 12854.5. Samples: 52201752. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:52:13,321][175405] Avg episode reward: [(0, '22.704')] [2023-03-07 10:52:13,759][175731] Updated weights for policy 0, policy_version 51010 (0.0006) [2023-03-07 10:52:14,553][175731] Updated weights for policy 0, policy_version 51020 (0.0007) [2023-03-07 10:52:15,349][175731] Updated weights for policy 0, policy_version 51030 (0.0007) [2023-03-07 10:52:16,147][175731] Updated weights for policy 0, policy_version 51040 (0.0007) [2023-03-07 10:52:16,939][175731] Updated weights for policy 0, policy_version 51050 (0.0007) [2023-03-07 10:52:17,749][175731] Updated weights for policy 0, policy_version 51060 (0.0007) [2023-03-07 10:52:18,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12853.8). Total num frames: 52292608. Throughput: 0: 12854.0. Samples: 52278969. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:52:18,322][175405] Avg episode reward: [(0, '23.104')] [2023-03-07 10:52:18,542][175731] Updated weights for policy 0, policy_version 51070 (0.0007) [2023-03-07 10:52:19,346][175731] Updated weights for policy 0, policy_version 51080 (0.0005) [2023-03-07 10:52:20,142][175731] Updated weights for policy 0, policy_version 51090 (0.0007) [2023-03-07 10:52:20,911][175731] Updated weights for policy 0, policy_version 51100 (0.0006) [2023-03-07 10:52:21,709][175731] Updated weights for policy 0, policy_version 51110 (0.0006) [2023-03-07 10:52:22,503][175731] Updated weights for policy 0, policy_version 51120 (0.0006) [2023-03-07 10:52:23,302][175731] Updated weights for policy 0, policy_version 51130 (0.0006) [2023-03-07 10:52:23,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12853.8). Total num frames: 52357120. Throughput: 0: 12853.7. Samples: 52356296. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:52:23,322][175405] Avg episode reward: [(0, '23.226')] [2023-03-07 10:52:24,112][175731] Updated weights for policy 0, policy_version 51140 (0.0007) [2023-03-07 10:52:24,910][175731] Updated weights for policy 0, policy_version 51150 (0.0007) [2023-03-07 10:52:25,709][175731] Updated weights for policy 0, policy_version 51160 (0.0007) [2023-03-07 10:52:26,514][175731] Updated weights for policy 0, policy_version 51170 (0.0006) [2023-03-07 10:52:27,301][175731] Updated weights for policy 0, policy_version 51180 (0.0006) [2023-03-07 10:52:28,097][175731] Updated weights for policy 0, policy_version 51190 (0.0006) [2023-03-07 10:52:28,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12850.3). Total num frames: 52420608. Throughput: 0: 12849.4. Samples: 52394660. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:52:28,322][175405] Avg episode reward: [(0, '25.337')] [2023-03-07 10:52:28,894][175731] Updated weights for policy 0, policy_version 51200 (0.0006) [2023-03-07 10:52:29,690][175731] Updated weights for policy 0, policy_version 51210 (0.0006) [2023-03-07 10:52:30,482][175731] Updated weights for policy 0, policy_version 51220 (0.0006) [2023-03-07 10:52:31,288][175731] Updated weights for policy 0, policy_version 51230 (0.0007) [2023-03-07 10:52:32,086][175731] Updated weights for policy 0, policy_version 51240 (0.0006) [2023-03-07 10:52:32,885][175731] Updated weights for policy 0, policy_version 51250 (0.0007) [2023-03-07 10:52:33,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12851.2, 300 sec: 12853.8). Total num frames: 52485120. Throughput: 0: 12850.1. Samples: 52471817. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:52:33,322][175405] Avg episode reward: [(0, '23.003')] [2023-03-07 10:52:33,679][175731] Updated weights for policy 0, policy_version 51260 (0.0006) [2023-03-07 10:52:34,470][175731] Updated weights for policy 0, policy_version 51270 (0.0007) [2023-03-07 10:52:35,276][175731] Updated weights for policy 0, policy_version 51280 (0.0006) [2023-03-07 10:52:36,087][175731] Updated weights for policy 0, policy_version 51290 (0.0006) [2023-03-07 10:52:36,878][175731] Updated weights for policy 0, policy_version 51300 (0.0006) [2023-03-07 10:52:37,669][175731] Updated weights for policy 0, policy_version 51310 (0.0006) [2023-03-07 10:52:38,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12853.8). Total num frames: 52549632. Throughput: 0: 12851.9. Samples: 52548752. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:52:38,321][175405] Avg episode reward: [(0, '22.811')] [2023-03-07 10:52:38,485][175731] Updated weights for policy 0, policy_version 51320 (0.0006) [2023-03-07 10:52:39,275][175731] Updated weights for policy 0, policy_version 51330 (0.0006) [2023-03-07 10:52:40,079][175731] Updated weights for policy 0, policy_version 51340 (0.0007) [2023-03-07 10:52:40,870][175731] Updated weights for policy 0, policy_version 51350 (0.0007) [2023-03-07 10:52:41,673][175731] Updated weights for policy 0, policy_version 51360 (0.0006) [2023-03-07 10:52:42,463][175731] Updated weights for policy 0, policy_version 51370 (0.0006) [2023-03-07 10:52:43,277][175731] Updated weights for policy 0, policy_version 51380 (0.0007) [2023-03-07 10:52:43,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12834.1, 300 sec: 12850.3). Total num frames: 52613120. Throughput: 0: 12844.8. Samples: 52587237. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:52:43,322][175405] Avg episode reward: [(0, '22.063')] [2023-03-07 10:52:44,080][175731] Updated weights for policy 0, policy_version 51390 (0.0007) [2023-03-07 10:52:44,868][175731] Updated weights for policy 0, policy_version 51400 (0.0006) [2023-03-07 10:52:45,648][175731] Updated weights for policy 0, policy_version 51410 (0.0007) [2023-03-07 10:52:46,463][175731] Updated weights for policy 0, policy_version 51420 (0.0007) [2023-03-07 10:52:47,245][175731] Updated weights for policy 0, policy_version 51430 (0.0006) [2023-03-07 10:52:48,029][175731] Updated weights for policy 0, policy_version 51440 (0.0006) [2023-03-07 10:52:48,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12851.2, 300 sec: 12853.8). Total num frames: 52677632. Throughput: 0: 12844.2. Samples: 52664261. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:52:48,322][175405] Avg episode reward: [(0, '22.666')] [2023-03-07 10:52:48,833][175731] Updated weights for policy 0, policy_version 51450 (0.0008) [2023-03-07 10:52:49,618][175731] Updated weights for policy 0, policy_version 51460 (0.0005) [2023-03-07 10:52:50,408][175731] Updated weights for policy 0, policy_version 51470 (0.0006) [2023-03-07 10:52:51,197][175731] Updated weights for policy 0, policy_version 51480 (0.0007) [2023-03-07 10:52:52,002][175731] Updated weights for policy 0, policy_version 51490 (0.0006) [2023-03-07 10:52:52,790][175731] Updated weights for policy 0, policy_version 51500 (0.0007) [2023-03-07 10:52:53,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12853.8). Total num frames: 52742144. Throughput: 0: 12854.2. Samples: 52741873. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:52:53,322][175405] Avg episode reward: [(0, '23.398')] [2023-03-07 10:52:53,586][175731] Updated weights for policy 0, policy_version 51510 (0.0007) [2023-03-07 10:52:54,379][175731] Updated weights for policy 0, policy_version 51520 (0.0007) [2023-03-07 10:52:55,174][175731] Updated weights for policy 0, policy_version 51530 (0.0006) [2023-03-07 10:52:55,962][175731] Updated weights for policy 0, policy_version 51540 (0.0006) [2023-03-07 10:52:56,773][175731] Updated weights for policy 0, policy_version 51550 (0.0007) [2023-03-07 10:52:57,573][175731] Updated weights for policy 0, policy_version 51560 (0.0006) [2023-03-07 10:52:58,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12853.8). Total num frames: 52806656. Throughput: 0: 12857.4. Samples: 52780339. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:52:58,322][175405] Avg episode reward: [(0, '22.806')] [2023-03-07 10:52:58,380][175731] Updated weights for policy 0, policy_version 51570 (0.0007) [2023-03-07 10:52:59,166][175731] Updated weights for policy 0, policy_version 51580 (0.0007) [2023-03-07 10:52:59,983][175731] Updated weights for policy 0, policy_version 51590 (0.0007) [2023-03-07 10:53:00,761][175731] Updated weights for policy 0, policy_version 51600 (0.0006) [2023-03-07 10:53:01,553][175731] Updated weights for policy 0, policy_version 51610 (0.0007) [2023-03-07 10:53:02,341][175731] Updated weights for policy 0, policy_version 51620 (0.0006) [2023-03-07 10:53:03,148][175731] Updated weights for policy 0, policy_version 51630 (0.0006) [2023-03-07 10:53:03,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12853.8). Total num frames: 52871168. Throughput: 0: 12856.3. Samples: 52857501. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:53:03,321][175405] Avg episode reward: [(0, '22.319')] [2023-03-07 10:53:03,950][175731] Updated weights for policy 0, policy_version 51640 (0.0007) [2023-03-07 10:53:04,748][175731] Updated weights for policy 0, policy_version 51650 (0.0006) [2023-03-07 10:53:05,555][175731] Updated weights for policy 0, policy_version 51660 (0.0007) [2023-03-07 10:53:06,330][175731] Updated weights for policy 0, policy_version 51670 (0.0006) [2023-03-07 10:53:07,141][175731] Updated weights for policy 0, policy_version 51680 (0.0007) [2023-03-07 10:53:07,942][175731] Updated weights for policy 0, policy_version 51690 (0.0007) [2023-03-07 10:53:08,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12853.8). Total num frames: 52934656. Throughput: 0: 12847.7. Samples: 52934443. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:53:08,322][175405] Avg episode reward: [(0, '23.005')] [2023-03-07 10:53:08,749][175731] Updated weights for policy 0, policy_version 51700 (0.0006) [2023-03-07 10:53:09,551][175731] Updated weights for policy 0, policy_version 51710 (0.0006) [2023-03-07 10:53:10,353][175731] Updated weights for policy 0, policy_version 51720 (0.0007) [2023-03-07 10:53:11,135][175731] Updated weights for policy 0, policy_version 51730 (0.0006) [2023-03-07 10:53:11,922][175731] Updated weights for policy 0, policy_version 51740 (0.0006) [2023-03-07 10:53:12,713][175731] Updated weights for policy 0, policy_version 51750 (0.0006) [2023-03-07 10:53:13,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12851.2, 300 sec: 12853.8). Total num frames: 52999168. Throughput: 0: 12850.0. Samples: 52972908. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:53:13,322][175405] Avg episode reward: [(0, '23.335')] [2023-03-07 10:53:13,517][175731] Updated weights for policy 0, policy_version 51760 (0.0006) [2023-03-07 10:53:14,312][175731] Updated weights for policy 0, policy_version 51770 (0.0006) [2023-03-07 10:53:15,105][175731] Updated weights for policy 0, policy_version 51780 (0.0007) [2023-03-07 10:53:15,898][175731] Updated weights for policy 0, policy_version 51790 (0.0008) [2023-03-07 10:53:16,701][175731] Updated weights for policy 0, policy_version 51800 (0.0007) [2023-03-07 10:53:17,497][175731] Updated weights for policy 0, policy_version 51810 (0.0005) [2023-03-07 10:53:18,294][175731] Updated weights for policy 0, policy_version 51820 (0.0006) [2023-03-07 10:53:18,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12851.2, 300 sec: 12853.8). Total num frames: 53063680. Throughput: 0: 12853.0. Samples: 53050201. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:53:18,322][175405] Avg episode reward: [(0, '21.986')] [2023-03-07 10:53:19,082][175731] Updated weights for policy 0, policy_version 51830 (0.0007) [2023-03-07 10:53:19,886][175731] Updated weights for policy 0, policy_version 51840 (0.0007) [2023-03-07 10:53:20,672][175731] Updated weights for policy 0, policy_version 51850 (0.0006) [2023-03-07 10:53:21,453][175731] Updated weights for policy 0, policy_version 51860 (0.0006) [2023-03-07 10:53:22,262][175731] Updated weights for policy 0, policy_version 51870 (0.0007) [2023-03-07 10:53:23,063][175731] Updated weights for policy 0, policy_version 51880 (0.0006) [2023-03-07 10:53:23,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12851.2, 300 sec: 12857.3). Total num frames: 53128192. Throughput: 0: 12858.2. Samples: 53127371. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:53:23,321][175405] Avg episode reward: [(0, '22.433')] [2023-03-07 10:53:23,858][175731] Updated weights for policy 0, policy_version 51890 (0.0006) [2023-03-07 10:53:24,665][175731] Updated weights for policy 0, policy_version 51900 (0.0006) [2023-03-07 10:53:25,462][175731] Updated weights for policy 0, policy_version 51910 (0.0006) [2023-03-07 10:53:26,265][175731] Updated weights for policy 0, policy_version 51920 (0.0006) [2023-03-07 10:53:27,042][175731] Updated weights for policy 0, policy_version 51930 (0.0007) [2023-03-07 10:53:27,837][175731] Updated weights for policy 0, policy_version 51940 (0.0006) [2023-03-07 10:53:28,321][175405] Fps is (10 sec: 12902.6, 60 sec: 12868.3, 300 sec: 12857.3). Total num frames: 53192704. Throughput: 0: 12859.5. Samples: 53165912. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:53:28,321][175405] Avg episode reward: [(0, '22.589')] [2023-03-07 10:53:28,630][175731] Updated weights for policy 0, policy_version 51950 (0.0006) [2023-03-07 10:53:29,417][175731] Updated weights for policy 0, policy_version 51960 (0.0007) [2023-03-07 10:53:30,214][175731] Updated weights for policy 0, policy_version 51970 (0.0007) [2023-03-07 10:53:31,005][175731] Updated weights for policy 0, policy_version 51980 (0.0007) [2023-03-07 10:53:31,794][175731] Updated weights for policy 0, policy_version 51990 (0.0006) [2023-03-07 10:53:32,606][175731] Updated weights for policy 0, policy_version 52000 (0.0006) [2023-03-07 10:53:33,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12857.3). Total num frames: 53257216. Throughput: 0: 12872.2. Samples: 53243510. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:53:33,332][175405] Avg episode reward: [(0, '22.198')] [2023-03-07 10:53:33,391][175731] Updated weights for policy 0, policy_version 52010 (0.0006) [2023-03-07 10:53:34,187][175731] Updated weights for policy 0, policy_version 52020 (0.0007) [2023-03-07 10:53:34,973][175731] Updated weights for policy 0, policy_version 52030 (0.0006) [2023-03-07 10:53:35,777][175731] Updated weights for policy 0, policy_version 52040 (0.0006) [2023-03-07 10:53:36,572][175731] Updated weights for policy 0, policy_version 52050 (0.0008) [2023-03-07 10:53:37,369][175731] Updated weights for policy 0, policy_version 52060 (0.0006) [2023-03-07 10:53:38,154][175731] Updated weights for policy 0, policy_version 52070 (0.0006) [2023-03-07 10:53:38,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12868.2, 300 sec: 12860.7). Total num frames: 53321728. Throughput: 0: 12864.5. Samples: 53320777. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:53:38,332][175405] Avg episode reward: [(0, '23.902')] [2023-03-07 10:53:38,953][175731] Updated weights for policy 0, policy_version 52080 (0.0006) [2023-03-07 10:53:39,753][175731] Updated weights for policy 0, policy_version 52090 (0.0007) [2023-03-07 10:53:40,540][175731] Updated weights for policy 0, policy_version 52100 (0.0007) [2023-03-07 10:53:41,336][175731] Updated weights for policy 0, policy_version 52110 (0.0007) [2023-03-07 10:53:42,139][175731] Updated weights for policy 0, policy_version 52120 (0.0006) [2023-03-07 10:53:42,944][175731] Updated weights for policy 0, policy_version 52130 (0.0006) [2023-03-07 10:53:43,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12868.3, 300 sec: 12857.3). Total num frames: 53385216. Throughput: 0: 12871.2. Samples: 53359542. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:53:43,332][175405] Avg episode reward: [(0, '23.957')] [2023-03-07 10:53:43,728][175731] Updated weights for policy 0, policy_version 52140 (0.0007) [2023-03-07 10:53:44,529][175731] Updated weights for policy 0, policy_version 52150 (0.0007) [2023-03-07 10:53:45,341][175731] Updated weights for policy 0, policy_version 52160 (0.0007) [2023-03-07 10:53:46,128][175731] Updated weights for policy 0, policy_version 52170 (0.0006) [2023-03-07 10:53:46,941][175731] Updated weights for policy 0, policy_version 52180 (0.0006) [2023-03-07 10:53:47,720][175731] Updated weights for policy 0, policy_version 52190 (0.0007) [2023-03-07 10:53:48,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12868.3, 300 sec: 12857.3). Total num frames: 53449728. Throughput: 0: 12864.6. Samples: 53436409. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:53:48,332][175405] Avg episode reward: [(0, '22.887')] [2023-03-07 10:53:48,350][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000052198_53450752.pth... 
[2023-03-07 10:53:48,380][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000049184_50364416.pth [2023-03-07 10:53:48,520][175731] Updated weights for policy 0, policy_version 52200 (0.0006) [2023-03-07 10:53:49,322][175731] Updated weights for policy 0, policy_version 52210 (0.0006) [2023-03-07 10:53:50,116][175731] Updated weights for policy 0, policy_version 52220 (0.0007) [2023-03-07 10:53:50,906][175731] Updated weights for policy 0, policy_version 52230 (0.0006) [2023-03-07 10:53:51,717][175731] Updated weights for policy 0, policy_version 52240 (0.0006) [2023-03-07 10:53:52,511][175731] Updated weights for policy 0, policy_version 52250 (0.0008) [2023-03-07 10:53:53,315][175731] Updated weights for policy 0, policy_version 52260 (0.0006) [2023-03-07 10:53:53,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12860.8). Total num frames: 53514240. Throughput: 0: 12866.2. Samples: 53513422. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:53:53,332][175405] Avg episode reward: [(0, '23.058')] [2023-03-07 10:53:54,114][175731] Updated weights for policy 0, policy_version 52270 (0.0007) [2023-03-07 10:53:54,912][175731] Updated weights for policy 0, policy_version 52280 (0.0007) [2023-03-07 10:53:55,700][175731] Updated weights for policy 0, policy_version 52290 (0.0006) [2023-03-07 10:53:56,496][175731] Updated weights for policy 0, policy_version 52300 (0.0007) [2023-03-07 10:53:57,295][175731] Updated weights for policy 0, policy_version 52310 (0.0007) [2023-03-07 10:53:58,088][175731] Updated weights for policy 0, policy_version 52320 (0.0007) [2023-03-07 10:53:58,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12860.7). Total num frames: 53578752. Throughput: 0: 12868.2. Samples: 53551977. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:53:58,321][175405] Avg episode reward: [(0, '22.263')] [2023-03-07 10:53:58,877][175731] Updated weights for policy 0, policy_version 52330 (0.0007) [2023-03-07 10:53:59,672][175731] Updated weights for policy 0, policy_version 52340 (0.0006) [2023-03-07 10:54:00,475][175731] Updated weights for policy 0, policy_version 52350 (0.0006) [2023-03-07 10:54:01,264][175731] Updated weights for policy 0, policy_version 52360 (0.0007) [2023-03-07 10:54:02,058][175731] Updated weights for policy 0, policy_version 52370 (0.0006) [2023-03-07 10:54:02,850][175731] Updated weights for policy 0, policy_version 52380 (0.0006) [2023-03-07 10:54:03,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12860.8). Total num frames: 53642240. Throughput: 0: 12865.0. Samples: 53629126. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:54:03,321][175405] Avg episode reward: [(0, '22.504')] [2023-03-07 10:54:03,640][175731] Updated weights for policy 0, policy_version 52390 (0.0006) [2023-03-07 10:54:04,444][175731] Updated weights for policy 0, policy_version 52400 (0.0007) [2023-03-07 10:54:05,257][175731] Updated weights for policy 0, policy_version 52410 (0.0006) [2023-03-07 10:54:06,073][175731] Updated weights for policy 0, policy_version 52420 (0.0006) [2023-03-07 10:54:06,878][175731] Updated weights for policy 0, policy_version 52430 (0.0006) [2023-03-07 10:54:07,677][175731] Updated weights for policy 0, policy_version 52440 (0.0007) [2023-03-07 10:54:08,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12868.3, 300 sec: 12857.3). Total num frames: 53706752. Throughput: 0: 12861.3. Samples: 53706129. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:54:08,321][175405] Avg episode reward: [(0, '23.163')] [2023-03-07 10:54:08,465][175731] Updated weights for policy 0, policy_version 52450 (0.0006) [2023-03-07 10:54:09,248][175731] Updated weights for policy 0, policy_version 52460 (0.0007) [2023-03-07 10:54:10,050][175731] Updated weights for policy 0, policy_version 52470 (0.0006) [2023-03-07 10:54:10,857][175731] Updated weights for policy 0, policy_version 52480 (0.0006) [2023-03-07 10:54:11,653][175731] Updated weights for policy 0, policy_version 52490 (0.0007) [2023-03-07 10:54:12,431][175731] Updated weights for policy 0, policy_version 52500 (0.0007) [2023-03-07 10:54:13,238][175731] Updated weights for policy 0, policy_version 52510 (0.0008) [2023-03-07 10:54:13,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12857.3). Total num frames: 53771264. Throughput: 0: 12859.3. Samples: 53744583. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:54:13,322][175405] Avg episode reward: [(0, '23.602')] [2023-03-07 10:54:14,017][175731] Updated weights for policy 0, policy_version 52520 (0.0007) [2023-03-07 10:54:14,821][175731] Updated weights for policy 0, policy_version 52530 (0.0007) [2023-03-07 10:54:15,610][175731] Updated weights for policy 0, policy_version 52540 (0.0006) [2023-03-07 10:54:16,427][175731] Updated weights for policy 0, policy_version 52550 (0.0007) [2023-03-07 10:54:17,226][175731] Updated weights for policy 0, policy_version 52560 (0.0007) [2023-03-07 10:54:18,013][175731] Updated weights for policy 0, policy_version 52570 (0.0007) [2023-03-07 10:54:18,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12851.2, 300 sec: 12857.3). Total num frames: 53834752. Throughput: 0: 12846.3. Samples: 53821595. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:54:18,322][175405] Avg episode reward: [(0, '23.271')] [2023-03-07 10:54:18,811][175731] Updated weights for policy 0, policy_version 52580 (0.0007) [2023-03-07 10:54:19,618][175731] Updated weights for policy 0, policy_version 52590 (0.0007) [2023-03-07 10:54:20,405][175731] Updated weights for policy 0, policy_version 52600 (0.0006) [2023-03-07 10:54:21,191][175731] Updated weights for policy 0, policy_version 52610 (0.0007) [2023-03-07 10:54:21,984][175731] Updated weights for policy 0, policy_version 52620 (0.0006) [2023-03-07 10:54:22,799][175731] Updated weights for policy 0, policy_version 52630 (0.0007) [2023-03-07 10:54:23,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12851.2, 300 sec: 12853.8). Total num frames: 53899264. Throughput: 0: 12849.0. Samples: 53898983. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:54:23,322][175405] Avg episode reward: [(0, '22.920')] [2023-03-07 10:54:23,588][175731] Updated weights for policy 0, policy_version 52640 (0.0007) [2023-03-07 10:54:24,372][175731] Updated weights for policy 0, policy_version 52650 (0.0007) [2023-03-07 10:54:25,157][175731] Updated weights for policy 0, policy_version 52660 (0.0006) [2023-03-07 10:54:25,963][175731] Updated weights for policy 0, policy_version 52670 (0.0006) [2023-03-07 10:54:26,749][175731] Updated weights for policy 0, policy_version 52680 (0.0007) [2023-03-07 10:54:27,550][175731] Updated weights for policy 0, policy_version 52690 (0.0006) [2023-03-07 10:54:28,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12857.3). Total num frames: 53963776. Throughput: 0: 12847.2. Samples: 53937668. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:54:28,321][175405] Avg episode reward: [(0, '22.046')] [2023-03-07 10:54:28,350][175731] Updated weights for policy 0, policy_version 52700 (0.0006) [2023-03-07 10:54:29,136][175731] Updated weights for policy 0, policy_version 52710 (0.0006) [2023-03-07 10:54:29,934][175731] Updated weights for policy 0, policy_version 52720 (0.0007) [2023-03-07 10:54:30,712][175731] Updated weights for policy 0, policy_version 52730 (0.0007) [2023-03-07 10:54:31,503][175731] Updated weights for policy 0, policy_version 52740 (0.0007) [2023-03-07 10:54:32,310][175731] Updated weights for policy 0, policy_version 52750 (0.0006) [2023-03-07 10:54:33,119][175731] Updated weights for policy 0, policy_version 52760 (0.0007) [2023-03-07 10:54:33,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12857.3). Total num frames: 54028288. Throughput: 0: 12858.5. Samples: 54015040. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:54:33,322][175405] Avg episode reward: [(0, '23.592')] [2023-03-07 10:54:33,929][175731] Updated weights for policy 0, policy_version 52770 (0.0006) [2023-03-07 10:54:34,725][175731] Updated weights for policy 0, policy_version 52780 (0.0008) [2023-03-07 10:54:35,507][175731] Updated weights for policy 0, policy_version 52790 (0.0007) [2023-03-07 10:54:36,293][175731] Updated weights for policy 0, policy_version 52800 (0.0006) [2023-03-07 10:54:37,093][175731] Updated weights for policy 0, policy_version 52810 (0.0007) [2023-03-07 10:54:37,905][175731] Updated weights for policy 0, policy_version 52820 (0.0005) [2023-03-07 10:54:38,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12851.2, 300 sec: 12857.3). Total num frames: 54092800. Throughput: 0: 12857.7. Samples: 54092021. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:54:38,322][175405] Avg episode reward: [(0, '22.811')] [2023-03-07 10:54:38,690][175731] Updated weights for policy 0, policy_version 52830 (0.0007) [2023-03-07 10:54:39,480][175731] Updated weights for policy 0, policy_version 52840 (0.0005) [2023-03-07 10:54:40,269][175731] Updated weights for policy 0, policy_version 52850 (0.0006) [2023-03-07 10:54:41,077][175731] Updated weights for policy 0, policy_version 52860 (0.0006) [2023-03-07 10:54:41,881][175731] Updated weights for policy 0, policy_version 52870 (0.0007) [2023-03-07 10:54:42,653][175731] Updated weights for policy 0, policy_version 52880 (0.0006) [2023-03-07 10:54:43,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12857.3). Total num frames: 54157312. Throughput: 0: 12862.0. Samples: 54130765. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:54:43,322][175405] Avg episode reward: [(0, '21.139')] [2023-03-07 10:54:43,462][175731] Updated weights for policy 0, policy_version 52890 (0.0006) [2023-03-07 10:54:44,252][175731] Updated weights for policy 0, policy_version 52900 (0.0007) [2023-03-07 10:54:45,069][175731] Updated weights for policy 0, policy_version 52910 (0.0006) [2023-03-07 10:54:45,864][175731] Updated weights for policy 0, policy_version 52920 (0.0006) [2023-03-07 10:54:46,643][175731] Updated weights for policy 0, policy_version 52930 (0.0006) [2023-03-07 10:54:47,447][175731] Updated weights for policy 0, policy_version 52940 (0.0008) [2023-03-07 10:54:48,243][175731] Updated weights for policy 0, policy_version 52950 (0.0008) [2023-03-07 10:54:48,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12851.2, 300 sec: 12853.8). Total num frames: 54220800. Throughput: 0: 12862.3. Samples: 54207931. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:54:48,321][175405] Avg episode reward: [(0, '22.693')] [2023-03-07 10:54:49,047][175731] Updated weights for policy 0, policy_version 52960 (0.0006) [2023-03-07 10:54:49,855][175731] Updated weights for policy 0, policy_version 52970 (0.0006) [2023-03-07 10:54:50,635][175731] Updated weights for policy 0, policy_version 52980 (0.0006) [2023-03-07 10:54:51,440][175731] Updated weights for policy 0, policy_version 52990 (0.0006) [2023-03-07 10:54:52,234][175731] Updated weights for policy 0, policy_version 53000 (0.0005) [2023-03-07 10:54:53,033][175731] Updated weights for policy 0, policy_version 53010 (0.0007) [2023-03-07 10:54:53,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12857.3). Total num frames: 54285312. Throughput: 0: 12865.6. Samples: 54285081. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:54:53,321][175405] Avg episode reward: [(0, '23.051')] [2023-03-07 10:54:53,814][175731] Updated weights for policy 0, policy_version 53020 (0.0006) [2023-03-07 10:54:54,603][175731] Updated weights for policy 0, policy_version 53030 (0.0006) [2023-03-07 10:54:55,403][175731] Updated weights for policy 0, policy_version 53040 (0.0007) [2023-03-07 10:54:56,209][175731] Updated weights for policy 0, policy_version 53050 (0.0006) [2023-03-07 10:54:57,009][175731] Updated weights for policy 0, policy_version 53060 (0.0006) [2023-03-07 10:54:57,803][175731] Updated weights for policy 0, policy_version 53070 (0.0006) [2023-03-07 10:54:58,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12851.2, 300 sec: 12857.3). Total num frames: 54349824. Throughput: 0: 12867.4. Samples: 54323617. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:54:58,322][175405] Avg episode reward: [(0, '24.828')] [2023-03-07 10:54:58,589][175731] Updated weights for policy 0, policy_version 53080 (0.0006) [2023-03-07 10:54:59,387][175731] Updated weights for policy 0, policy_version 53090 (0.0006) [2023-03-07 10:55:00,164][175731] Updated weights for policy 0, policy_version 53100 (0.0005) [2023-03-07 10:55:00,969][175731] Updated weights for policy 0, policy_version 53110 (0.0006) [2023-03-07 10:55:01,762][175731] Updated weights for policy 0, policy_version 53120 (0.0006) [2023-03-07 10:55:02,557][175731] Updated weights for policy 0, policy_version 53130 (0.0006) [2023-03-07 10:55:03,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12857.3). Total num frames: 54414336. Throughput: 0: 12876.6. Samples: 54401043. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 10:55:03,321][175405] Avg episode reward: [(0, '22.662')] [2023-03-07 10:55:03,352][175731] Updated weights for policy 0, policy_version 53140 (0.0008) [2023-03-07 10:55:04,150][175731] Updated weights for policy 0, policy_version 53150 (0.0006) [2023-03-07 10:55:04,949][175731] Updated weights for policy 0, policy_version 53160 (0.0007) [2023-03-07 10:55:05,741][175731] Updated weights for policy 0, policy_version 53170 (0.0006) [2023-03-07 10:55:06,542][175731] Updated weights for policy 0, policy_version 53180 (0.0006) [2023-03-07 10:55:07,308][175731] Updated weights for policy 0, policy_version 53190 (0.0005) [2023-03-07 10:55:08,118][175731] Updated weights for policy 0, policy_version 53200 (0.0006) [2023-03-07 10:55:08,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12860.7). Total num frames: 54478848. Throughput: 0: 12873.8. Samples: 54478302. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:55:08,321][175405] Avg episode reward: [(0, '23.614')] [2023-03-07 10:55:08,917][175731] Updated weights for policy 0, policy_version 53210 (0.0007) [2023-03-07 10:55:09,722][175731] Updated weights for policy 0, policy_version 53220 (0.0007) [2023-03-07 10:55:10,524][175731] Updated weights for policy 0, policy_version 53230 (0.0007) [2023-03-07 10:55:11,340][175731] Updated weights for policy 0, policy_version 53240 (0.0007) [2023-03-07 10:55:12,128][175731] Updated weights for policy 0, policy_version 53250 (0.0006) [2023-03-07 10:55:12,917][175731] Updated weights for policy 0, policy_version 53260 (0.0006) [2023-03-07 10:55:13,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12860.8). Total num frames: 54543360. Throughput: 0: 12867.5. Samples: 54516706. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:55:13,322][175405] Avg episode reward: [(0, '22.733')] [2023-03-07 10:55:13,717][175731] Updated weights for policy 0, policy_version 53270 (0.0006) [2023-03-07 10:55:14,505][175731] Updated weights for policy 0, policy_version 53280 (0.0006) [2023-03-07 10:55:15,293][175731] Updated weights for policy 0, policy_version 53290 (0.0006) [2023-03-07 10:55:16,078][175731] Updated weights for policy 0, policy_version 53300 (0.0007) [2023-03-07 10:55:16,872][175731] Updated weights for policy 0, policy_version 53310 (0.0006) [2023-03-07 10:55:17,666][175731] Updated weights for policy 0, policy_version 53320 (0.0007) [2023-03-07 10:55:18,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12885.3, 300 sec: 12860.7). Total num frames: 54607872. Throughput: 0: 12868.7. Samples: 54594133. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:55:18,322][175405] Avg episode reward: [(0, '23.744')] [2023-03-07 10:55:18,461][175731] Updated weights for policy 0, policy_version 53330 (0.0007) [2023-03-07 10:55:19,255][175731] Updated weights for policy 0, policy_version 53340 (0.0006) [2023-03-07 10:55:20,037][175731] Updated weights for policy 0, policy_version 53350 (0.0007) [2023-03-07 10:55:20,833][175731] Updated weights for policy 0, policy_version 53360 (0.0006) [2023-03-07 10:55:21,641][175731] Updated weights for policy 0, policy_version 53370 (0.0006) [2023-03-07 10:55:22,433][175731] Updated weights for policy 0, policy_version 53380 (0.0007) [2023-03-07 10:55:23,255][175731] Updated weights for policy 0, policy_version 53390 (0.0006) [2023-03-07 10:55:23,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12857.3). Total num frames: 54671360. Throughput: 0: 12874.6. Samples: 54671379. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:55:23,322][175405] Avg episode reward: [(0, '23.902')] [2023-03-07 10:55:24,034][175731] Updated weights for policy 0, policy_version 53400 (0.0006) [2023-03-07 10:55:24,838][175731] Updated weights for policy 0, policy_version 53410 (0.0006) [2023-03-07 10:55:25,654][175731] Updated weights for policy 0, policy_version 53420 (0.0006) [2023-03-07 10:55:26,441][175731] Updated weights for policy 0, policy_version 53430 (0.0007) [2023-03-07 10:55:27,242][175731] Updated weights for policy 0, policy_version 53440 (0.0007) [2023-03-07 10:55:28,031][175731] Updated weights for policy 0, policy_version 53450 (0.0007) [2023-03-07 10:55:28,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12860.7). Total num frames: 54735872. Throughput: 0: 12866.3. Samples: 54709748. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:55:28,322][175405] Avg episode reward: [(0, '22.572')] [2023-03-07 10:55:28,839][175731] Updated weights for policy 0, policy_version 53460 (0.0007) [2023-03-07 10:55:29,630][175731] Updated weights for policy 0, policy_version 53470 (0.0006) [2023-03-07 10:55:30,437][175731] Updated weights for policy 0, policy_version 53480 (0.0006) [2023-03-07 10:55:31,225][175731] Updated weights for policy 0, policy_version 53490 (0.0007) [2023-03-07 10:55:32,018][175731] Updated weights for policy 0, policy_version 53500 (0.0006) [2023-03-07 10:55:32,807][175731] Updated weights for policy 0, policy_version 53510 (0.0006) [2023-03-07 10:55:33,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12860.8). Total num frames: 54800384. Throughput: 0: 12864.6. Samples: 54786839. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:55:33,322][175405] Avg episode reward: [(0, '23.705')] [2023-03-07 10:55:33,613][175731] Updated weights for policy 0, policy_version 53520 (0.0007) [2023-03-07 10:55:34,402][175731] Updated weights for policy 0, policy_version 53530 (0.0006) [2023-03-07 10:55:35,219][175731] Updated weights for policy 0, policy_version 53540 (0.0007) [2023-03-07 10:55:36,016][175731] Updated weights for policy 0, policy_version 53550 (0.0007) [2023-03-07 10:55:36,814][175731] Updated weights for policy 0, policy_version 53560 (0.0006) [2023-03-07 10:55:37,619][175731] Updated weights for policy 0, policy_version 53570 (0.0007) [2023-03-07 10:55:38,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12857.3). Total num frames: 54863872. Throughput: 0: 12860.4. Samples: 54863801. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:55:38,322][175405] Avg episode reward: [(0, '23.883')] [2023-03-07 10:55:38,389][175731] Updated weights for policy 0, policy_version 53580 (0.0006) [2023-03-07 10:55:39,203][175731] Updated weights for policy 0, policy_version 53590 (0.0006) [2023-03-07 10:55:40,002][175731] Updated weights for policy 0, policy_version 53600 (0.0006) [2023-03-07 10:55:40,784][175731] Updated weights for policy 0, policy_version 53610 (0.0006) [2023-03-07 10:55:41,592][175731] Updated weights for policy 0, policy_version 53620 (0.0006) [2023-03-07 10:55:42,400][175731] Updated weights for policy 0, policy_version 53630 (0.0006) [2023-03-07 10:55:43,205][175731] Updated weights for policy 0, policy_version 53640 (0.0007) [2023-03-07 10:55:43,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12857.3). Total num frames: 54928384. Throughput: 0: 12859.3. Samples: 54902284. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:55:43,322][175405] Avg episode reward: [(0, '22.918')] [2023-03-07 10:55:43,992][175731] Updated weights for policy 0, policy_version 53650 (0.0006) [2023-03-07 10:55:44,773][175731] Updated weights for policy 0, policy_version 53660 (0.0006) [2023-03-07 10:55:45,578][175731] Updated weights for policy 0, policy_version 53670 (0.0006) [2023-03-07 10:55:46,374][175731] Updated weights for policy 0, policy_version 53680 (0.0006) [2023-03-07 10:55:47,163][175731] Updated weights for policy 0, policy_version 53690 (0.0006) [2023-03-07 10:55:47,968][175731] Updated weights for policy 0, policy_version 53700 (0.0006) [2023-03-07 10:55:48,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12860.7). Total num frames: 54992896. Throughput: 0: 12857.8. Samples: 54979644. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:55:48,322][175405] Avg episode reward: [(0, '23.547')] [2023-03-07 10:55:48,326][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000053704_54992896.pth... [2023-03-07 10:55:48,357][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000050690_51906560.pth [2023-03-07 10:55:48,778][175731] Updated weights for policy 0, policy_version 53710 (0.0007) [2023-03-07 10:55:49,565][175731] Updated weights for policy 0, policy_version 53720 (0.0006) [2023-03-07 10:55:50,373][175731] Updated weights for policy 0, policy_version 53730 (0.0007) [2023-03-07 10:55:51,169][175731] Updated weights for policy 0, policy_version 53740 (0.0006) [2023-03-07 10:55:51,953][175731] Updated weights for policy 0, policy_version 53750 (0.0006) [2023-03-07 10:55:52,745][175731] Updated weights for policy 0, policy_version 53760 (0.0007) [2023-03-07 10:55:53,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12857.3). Total num frames: 55056384. Throughput: 0: 12850.5. Samples: 55056576. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:55:53,322][175405] Avg episode reward: [(0, '22.566')] [2023-03-07 10:55:53,561][175731] Updated weights for policy 0, policy_version 53770 (0.0007) [2023-03-07 10:55:54,338][175731] Updated weights for policy 0, policy_version 53780 (0.0006) [2023-03-07 10:55:55,142][175731] Updated weights for policy 0, policy_version 53790 (0.0006) [2023-03-07 10:55:55,947][175731] Updated weights for policy 0, policy_version 53800 (0.0006) [2023-03-07 10:55:56,738][175731] Updated weights for policy 0, policy_version 53810 (0.0007) [2023-03-07 10:55:57,546][175731] Updated weights for policy 0, policy_version 53820 (0.0006) [2023-03-07 10:55:58,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12851.2, 300 sec: 12857.3). Total num frames: 55120896. Throughput: 0: 12848.8. 
Samples: 55094902. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:55:58,332][175405] Avg episode reward: [(0, '24.319')] [2023-03-07 10:55:58,349][175731] Updated weights for policy 0, policy_version 53830 (0.0007) [2023-03-07 10:55:59,130][175731] Updated weights for policy 0, policy_version 53840 (0.0006) [2023-03-07 10:55:59,939][175731] Updated weights for policy 0, policy_version 53850 (0.0007) [2023-03-07 10:56:00,754][175731] Updated weights for policy 0, policy_version 53860 (0.0007) [2023-03-07 10:56:01,533][175731] Updated weights for policy 0, policy_version 53870 (0.0008) [2023-03-07 10:56:02,335][175731] Updated weights for policy 0, policy_version 53880 (0.0006) [2023-03-07 10:56:03,134][175731] Updated weights for policy 0, policy_version 53890 (0.0007) [2023-03-07 10:56:03,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12851.2, 300 sec: 12857.3). Total num frames: 55185408. Throughput: 0: 12843.6. Samples: 55172095. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:56:03,332][175405] Avg episode reward: [(0, '24.282')] [2023-03-07 10:56:03,927][175731] Updated weights for policy 0, policy_version 53900 (0.0006) [2023-03-07 10:56:04,702][175731] Updated weights for policy 0, policy_version 53910 (0.0006) [2023-03-07 10:56:05,492][175731] Updated weights for policy 0, policy_version 53920 (0.0007) [2023-03-07 10:56:06,294][175731] Updated weights for policy 0, policy_version 53930 (0.0007) [2023-03-07 10:56:07,091][175731] Updated weights for policy 0, policy_version 53940 (0.0006) [2023-03-07 10:56:07,883][175731] Updated weights for policy 0, policy_version 53950 (0.0006) [2023-03-07 10:56:08,321][175405] Fps is (10 sec: 12902.6, 60 sec: 12851.2, 300 sec: 12857.3). Total num frames: 55249920. Throughput: 0: 12843.9. Samples: 55249354. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:56:08,332][175405] Avg episode reward: [(0, '23.159')] [2023-03-07 10:56:08,681][175731] Updated weights for policy 0, policy_version 53960 (0.0007) [2023-03-07 10:56:09,480][175731] Updated weights for policy 0, policy_version 53970 (0.0006) [2023-03-07 10:56:10,277][175731] Updated weights for policy 0, policy_version 53980 (0.0007) [2023-03-07 10:56:11,079][175731] Updated weights for policy 0, policy_version 53990 (0.0007) [2023-03-07 10:56:11,886][175731] Updated weights for policy 0, policy_version 54000 (0.0007) [2023-03-07 10:56:12,678][175731] Updated weights for policy 0, policy_version 54010 (0.0008) [2023-03-07 10:56:13,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12851.2, 300 sec: 12860.7). Total num frames: 55314432. Throughput: 0: 12848.9. Samples: 55287950. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:56:13,332][175405] Avg episode reward: [(0, '22.480')] [2023-03-07 10:56:13,482][175731] Updated weights for policy 0, policy_version 54020 (0.0006) [2023-03-07 10:56:14,277][175731] Updated weights for policy 0, policy_version 54030 (0.0007) [2023-03-07 10:56:15,072][175731] Updated weights for policy 0, policy_version 54040 (0.0006) [2023-03-07 10:56:15,873][175731] Updated weights for policy 0, policy_version 54050 (0.0007) [2023-03-07 10:56:16,662][175731] Updated weights for policy 0, policy_version 54060 (0.0006) [2023-03-07 10:56:17,448][175731] Updated weights for policy 0, policy_version 54070 (0.0006) [2023-03-07 10:56:18,241][175731] Updated weights for policy 0, policy_version 54080 (0.0007) [2023-03-07 10:56:18,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12834.1, 300 sec: 12857.3). Total num frames: 55377920. Throughput: 0: 12846.1. Samples: 55364912. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:56:18,332][175405] Avg episode reward: [(0, '22.628')] [2023-03-07 10:56:19,036][175731] Updated weights for policy 0, policy_version 54090 (0.0006) [2023-03-07 10:56:19,834][175731] Updated weights for policy 0, policy_version 54100 (0.0007) [2023-03-07 10:56:20,618][175731] Updated weights for policy 0, policy_version 54110 (0.0006) [2023-03-07 10:56:21,426][175731] Updated weights for policy 0, policy_version 54120 (0.0006) [2023-03-07 10:56:22,231][175731] Updated weights for policy 0, policy_version 54130 (0.0007) [2023-03-07 10:56:23,022][175731] Updated weights for policy 0, policy_version 54140 (0.0006) [2023-03-07 10:56:23,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12857.3). Total num frames: 55442432. Throughput: 0: 12853.4. Samples: 55442202. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:56:23,332][175405] Avg episode reward: [(0, '23.025')] [2023-03-07 10:56:23,820][175731] Updated weights for policy 0, policy_version 54150 (0.0006) [2023-03-07 10:56:24,623][175731] Updated weights for policy 0, policy_version 54160 (0.0006) [2023-03-07 10:56:25,418][175731] Updated weights for policy 0, policy_version 54170 (0.0006) [2023-03-07 10:56:26,214][175731] Updated weights for policy 0, policy_version 54180 (0.0007) [2023-03-07 10:56:27,027][175731] Updated weights for policy 0, policy_version 54190 (0.0006) [2023-03-07 10:56:27,814][175731] Updated weights for policy 0, policy_version 54200 (0.0007) [2023-03-07 10:56:28,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12857.3). Total num frames: 55506944. Throughput: 0: 12851.8. Samples: 55480614. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:56:28,332][175405] Avg episode reward: [(0, '24.274')] [2023-03-07 10:56:28,601][175731] Updated weights for policy 0, policy_version 54210 (0.0006) [2023-03-07 10:56:29,402][175731] Updated weights for policy 0, policy_version 54220 (0.0006) [2023-03-07 10:56:30,173][175731] Updated weights for policy 0, policy_version 54230 (0.0006) [2023-03-07 10:56:30,990][175731] Updated weights for policy 0, policy_version 54240 (0.0006) [2023-03-07 10:56:31,800][175731] Updated weights for policy 0, policy_version 54250 (0.0008) [2023-03-07 10:56:32,590][175731] Updated weights for policy 0, policy_version 54260 (0.0007) [2023-03-07 10:56:33,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12851.2, 300 sec: 12857.3). Total num frames: 55571456. Throughput: 0: 12847.8. Samples: 55557797. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:56:33,332][175405] Avg episode reward: [(0, '23.408')] [2023-03-07 10:56:33,386][175731] Updated weights for policy 0, policy_version 54270 (0.0008) [2023-03-07 10:56:34,185][175731] Updated weights for policy 0, policy_version 54280 (0.0006) [2023-03-07 10:56:34,969][175731] Updated weights for policy 0, policy_version 54290 (0.0007) [2023-03-07 10:56:35,751][175731] Updated weights for policy 0, policy_version 54300 (0.0006) [2023-03-07 10:56:36,569][175731] Updated weights for policy 0, policy_version 54310 (0.0006) [2023-03-07 10:56:37,364][175731] Updated weights for policy 0, policy_version 54320 (0.0008) [2023-03-07 10:56:38,153][175731] Updated weights for policy 0, policy_version 54330 (0.0006) [2023-03-07 10:56:38,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12857.3). Total num frames: 55635968. Throughput: 0: 12856.9. Samples: 55635136. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:56:38,332][175405] Avg episode reward: [(0, '22.233')] [2023-03-07 10:56:38,942][175731] Updated weights for policy 0, policy_version 54340 (0.0006) [2023-03-07 10:56:39,742][175731] Updated weights for policy 0, policy_version 54350 (0.0006) [2023-03-07 10:56:40,543][175731] Updated weights for policy 0, policy_version 54360 (0.0006) [2023-03-07 10:56:41,329][175731] Updated weights for policy 0, policy_version 54370 (0.0006) [2023-03-07 10:56:42,127][175731] Updated weights for policy 0, policy_version 54380 (0.0006) [2023-03-07 10:56:42,920][175731] Updated weights for policy 0, policy_version 54390 (0.0006) [2023-03-07 10:56:43,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12860.8). Total num frames: 55700480. Throughput: 0: 12862.4. Samples: 55673708. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:56:43,332][175405] Avg episode reward: [(0, '22.820')] [2023-03-07 10:56:43,707][175731] Updated weights for policy 0, policy_version 54400 (0.0006) [2023-03-07 10:56:44,508][175731] Updated weights for policy 0, policy_version 54410 (0.0006) [2023-03-07 10:56:45,306][175731] Updated weights for policy 0, policy_version 54420 (0.0006) [2023-03-07 10:56:46,091][175731] Updated weights for policy 0, policy_version 54430 (0.0008) [2023-03-07 10:56:46,910][175731] Updated weights for policy 0, policy_version 54440 (0.0007) [2023-03-07 10:56:47,701][175731] Updated weights for policy 0, policy_version 54450 (0.0008) [2023-03-07 10:56:48,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12851.2, 300 sec: 12857.3). Total num frames: 55763968. Throughput: 0: 12860.7. Samples: 55750829. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:56:48,332][175405] Avg episode reward: [(0, '22.712')] [2023-03-07 10:56:48,505][175731] Updated weights for policy 0, policy_version 54460 (0.0006) [2023-03-07 10:56:49,294][175731] Updated weights for policy 0, policy_version 54470 (0.0006) [2023-03-07 10:56:50,094][175731] Updated weights for policy 0, policy_version 54480 (0.0006) [2023-03-07 10:56:50,875][175731] Updated weights for policy 0, policy_version 54490 (0.0006) [2023-03-07 10:56:51,663][175731] Updated weights for policy 0, policy_version 54500 (0.0006) [2023-03-07 10:56:52,463][175731] Updated weights for policy 0, policy_version 54510 (0.0007) [2023-03-07 10:56:53,259][175731] Updated weights for policy 0, policy_version 54520 (0.0006) [2023-03-07 10:56:53,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12868.3, 300 sec: 12857.3). Total num frames: 55828480. Throughput: 0: 12865.3. Samples: 55828295. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:56:53,332][175405] Avg episode reward: [(0, '22.892')] [2023-03-07 10:56:54,060][175731] Updated weights for policy 0, policy_version 54530 (0.0006) [2023-03-07 10:56:54,859][175731] Updated weights for policy 0, policy_version 54540 (0.0006) [2023-03-07 10:56:55,669][175731] Updated weights for policy 0, policy_version 54550 (0.0006) [2023-03-07 10:56:56,458][175731] Updated weights for policy 0, policy_version 54560 (0.0006) [2023-03-07 10:56:57,226][175731] Updated weights for policy 0, policy_version 54570 (0.0006) [2023-03-07 10:56:58,029][175731] Updated weights for policy 0, policy_version 54580 (0.0006) [2023-03-07 10:56:58,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12868.3, 300 sec: 12860.7). Total num frames: 55892992. Throughput: 0: 12862.4. Samples: 55866761. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:56:58,333][175405] Avg episode reward: [(0, '23.318')] [2023-03-07 10:56:58,807][175731] Updated weights for policy 0, policy_version 54590 (0.0006) [2023-03-07 10:56:59,596][175731] Updated weights for policy 0, policy_version 54600 (0.0006) [2023-03-07 10:57:00,393][175731] Updated weights for policy 0, policy_version 54610 (0.0006) [2023-03-07 10:57:01,194][175731] Updated weights for policy 0, policy_version 54620 (0.0007) [2023-03-07 10:57:01,982][175731] Updated weights for policy 0, policy_version 54630 (0.0006) [2023-03-07 10:57:02,805][175731] Updated weights for policy 0, policy_version 54640 (0.0006) [2023-03-07 10:57:03,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12868.2, 300 sec: 12860.7). Total num frames: 55957504. Throughput: 0: 12878.2. Samples: 55944431. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:57:03,332][175405] Avg episode reward: [(0, '23.721')] [2023-03-07 10:57:03,572][175731] Updated weights for policy 0, policy_version 54650 (0.0007) [2023-03-07 10:57:04,363][175731] Updated weights for policy 0, policy_version 54660 (0.0006) [2023-03-07 10:57:05,168][175731] Updated weights for policy 0, policy_version 54670 (0.0006) [2023-03-07 10:57:05,959][175731] Updated weights for policy 0, policy_version 54680 (0.0007) [2023-03-07 10:57:06,756][175731] Updated weights for policy 0, policy_version 54690 (0.0007) [2023-03-07 10:57:07,561][175731] Updated weights for policy 0, policy_version 54700 (0.0006) [2023-03-07 10:57:08,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12868.2, 300 sec: 12860.7). Total num frames: 56022016. Throughput: 0: 12878.5. Samples: 56021736. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:57:08,322][175405] Avg episode reward: [(0, '24.784')] [2023-03-07 10:57:08,349][175731] Updated weights for policy 0, policy_version 54710 (0.0007) [2023-03-07 10:57:09,149][175731] Updated weights for policy 0, policy_version 54720 (0.0006) [2023-03-07 10:57:09,958][175731] Updated weights for policy 0, policy_version 54730 (0.0006) [2023-03-07 10:57:10,754][175731] Updated weights for policy 0, policy_version 54740 (0.0006) [2023-03-07 10:57:11,542][175731] Updated weights for policy 0, policy_version 54750 (0.0007) [2023-03-07 10:57:12,347][175731] Updated weights for policy 0, policy_version 54760 (0.0006) [2023-03-07 10:57:13,125][175731] Updated weights for policy 0, policy_version 54770 (0.0006) [2023-03-07 10:57:13,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12860.7). Total num frames: 56086528. Throughput: 0: 12877.9. Samples: 56060122. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:57:13,322][175405] Avg episode reward: [(0, '23.340')] [2023-03-07 10:57:13,930][175731] Updated weights for policy 0, policy_version 54780 (0.0007) [2023-03-07 10:57:14,730][175731] Updated weights for policy 0, policy_version 54790 (0.0006) [2023-03-07 10:57:15,514][175731] Updated weights for policy 0, policy_version 54800 (0.0007) [2023-03-07 10:57:16,312][175731] Updated weights for policy 0, policy_version 54810 (0.0008) [2023-03-07 10:57:17,105][175731] Updated weights for policy 0, policy_version 54820 (0.0007) [2023-03-07 10:57:17,906][175731] Updated weights for policy 0, policy_version 54830 (0.0006) [2023-03-07 10:57:18,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12885.3, 300 sec: 12860.7). Total num frames: 56151040. Throughput: 0: 12877.0. Samples: 56137262. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:57:18,321][175405] Avg episode reward: [(0, '22.713')] [2023-03-07 10:57:18,717][175731] Updated weights for policy 0, policy_version 54840 (0.0006) [2023-03-07 10:57:19,517][175731] Updated weights for policy 0, policy_version 54850 (0.0007) [2023-03-07 10:57:20,309][175731] Updated weights for policy 0, policy_version 54860 (0.0006) [2023-03-07 10:57:21,123][175731] Updated weights for policy 0, policy_version 54870 (0.0007) [2023-03-07 10:57:21,912][175731] Updated weights for policy 0, policy_version 54880 (0.0007) [2023-03-07 10:57:22,701][175731] Updated weights for policy 0, policy_version 54890 (0.0007) [2023-03-07 10:57:23,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12868.2, 300 sec: 12860.7). Total num frames: 56214528. Throughput: 0: 12871.8. Samples: 56214366. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:57:23,322][175405] Avg episode reward: [(0, '23.005')] [2023-03-07 10:57:23,490][175731] Updated weights for policy 0, policy_version 54900 (0.0006) [2023-03-07 10:57:24,293][175731] Updated weights for policy 0, policy_version 54910 (0.0006) [2023-03-07 10:57:25,106][175731] Updated weights for policy 0, policy_version 54920 (0.0006) [2023-03-07 10:57:25,894][175731] Updated weights for policy 0, policy_version 54930 (0.0006) [2023-03-07 10:57:26,687][175731] Updated weights for policy 0, policy_version 54940 (0.0006) [2023-03-07 10:57:27,493][175731] Updated weights for policy 0, policy_version 54950 (0.0006) [2023-03-07 10:57:28,284][175731] Updated weights for policy 0, policy_version 54960 (0.0007) [2023-03-07 10:57:28,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12868.3, 300 sec: 12860.8). Total num frames: 56279040. Throughput: 0: 12867.1. Samples: 56252729. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:57:28,321][175405] Avg episode reward: [(0, '22.570')] [2023-03-07 10:57:29,089][175731] Updated weights for policy 0, policy_version 54970 (0.0006) [2023-03-07 10:57:29,880][175731] Updated weights for policy 0, policy_version 54980 (0.0007) [2023-03-07 10:57:30,679][175731] Updated weights for policy 0, policy_version 54990 (0.0006) [2023-03-07 10:57:31,473][175731] Updated weights for policy 0, policy_version 55000 (0.0007) [2023-03-07 10:57:32,266][175731] Updated weights for policy 0, policy_version 55010 (0.0007) [2023-03-07 10:57:33,066][175731] Updated weights for policy 0, policy_version 55020 (0.0006) [2023-03-07 10:57:33,321][175405] Fps is (10 sec: 12902.6, 60 sec: 12868.3, 300 sec: 12860.7). Total num frames: 56343552. Throughput: 0: 12866.7. Samples: 56329831. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:57:33,322][175405] Avg episode reward: [(0, '23.481')] [2023-03-07 10:57:33,862][175731] Updated weights for policy 0, policy_version 55030 (0.0007) [2023-03-07 10:57:34,656][175731] Updated weights for policy 0, policy_version 55040 (0.0006) [2023-03-07 10:57:35,453][175731] Updated weights for policy 0, policy_version 55050 (0.0006) [2023-03-07 10:57:36,242][175731] Updated weights for policy 0, policy_version 55060 (0.0007) [2023-03-07 10:57:37,031][175731] Updated weights for policy 0, policy_version 55070 (0.0006) [2023-03-07 10:57:37,840][175731] Updated weights for policy 0, policy_version 55080 (0.0006) [2023-03-07 10:57:38,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12864.2). Total num frames: 56408064. Throughput: 0: 12865.1. Samples: 56407223. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:57:38,322][175405] Avg episode reward: [(0, '23.787')] [2023-03-07 10:57:38,637][175731] Updated weights for policy 0, policy_version 55090 (0.0007) [2023-03-07 10:57:39,433][175731] Updated weights for policy 0, policy_version 55100 (0.0006) [2023-03-07 10:57:40,238][175731] Updated weights for policy 0, policy_version 55110 (0.0007) [2023-03-07 10:57:41,036][175731] Updated weights for policy 0, policy_version 55120 (0.0006) [2023-03-07 10:57:41,829][175731] Updated weights for policy 0, policy_version 55130 (0.0007) [2023-03-07 10:57:42,623][175731] Updated weights for policy 0, policy_version 55140 (0.0006) [2023-03-07 10:57:43,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12864.2). Total num frames: 56472576. Throughput: 0: 12864.7. Samples: 56445672. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:57:43,322][175405] Avg episode reward: [(0, '23.013')] [2023-03-07 10:57:43,413][175731] Updated weights for policy 0, policy_version 55150 (0.0007) [2023-03-07 10:57:44,206][175731] Updated weights for policy 0, policy_version 55160 (0.0006) [2023-03-07 10:57:44,989][175731] Updated weights for policy 0, policy_version 55170 (0.0006) [2023-03-07 10:57:45,788][175731] Updated weights for policy 0, policy_version 55180 (0.0006) [2023-03-07 10:57:46,605][175731] Updated weights for policy 0, policy_version 55190 (0.0006) [2023-03-07 10:57:47,398][175731] Updated weights for policy 0, policy_version 55200 (0.0007) [2023-03-07 10:57:48,206][175731] Updated weights for policy 0, policy_version 55210 (0.0006) [2023-03-07 10:57:48,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12868.2, 300 sec: 12860.7). Total num frames: 56536064. Throughput: 0: 12852.1. Samples: 56522778. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:57:48,322][175405] Avg episode reward: [(0, '22.637')] [2023-03-07 10:57:48,326][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000055211_56536064.pth... [2023-03-07 10:57:48,355][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000052198_53450752.pth [2023-03-07 10:57:49,005][175731] Updated weights for policy 0, policy_version 55220 (0.0006) [2023-03-07 10:57:49,810][175731] Updated weights for policy 0, policy_version 55230 (0.0006) [2023-03-07 10:57:50,617][175731] Updated weights for policy 0, policy_version 55240 (0.0007) [2023-03-07 10:57:51,397][175731] Updated weights for policy 0, policy_version 55250 (0.0006) [2023-03-07 10:57:52,213][175731] Updated weights for policy 0, policy_version 55260 (0.0006) [2023-03-07 10:57:53,008][175731] Updated weights for policy 0, policy_version 55270 (0.0007) [2023-03-07 10:57:53,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12851.2, 300 sec: 12857.3). Total num frames: 56599552. Throughput: 0: 12840.1. Samples: 56599539. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:57:53,322][175405] Avg episode reward: [(0, '23.912')] [2023-03-07 10:57:53,806][175731] Updated weights for policy 0, policy_version 55280 (0.0006) [2023-03-07 10:57:54,601][175731] Updated weights for policy 0, policy_version 55290 (0.0006) [2023-03-07 10:57:55,410][175731] Updated weights for policy 0, policy_version 55300 (0.0006) [2023-03-07 10:57:56,196][175731] Updated weights for policy 0, policy_version 55310 (0.0006) [2023-03-07 10:57:56,999][175731] Updated weights for policy 0, policy_version 55320 (0.0007) [2023-03-07 10:57:57,799][175731] Updated weights for policy 0, policy_version 55330 (0.0006) [2023-03-07 10:57:58,321][175405] Fps is (10 sec: 12800.2, 60 sec: 12851.2, 300 sec: 12857.3). Total num frames: 56664064. Throughput: 0: 12838.6. 
Samples: 56637858. Policy #0 lag: (min: 0.0, avg: 1.3, max: 4.0) [2023-03-07 10:57:58,321][175405] Avg episode reward: [(0, '23.319')] [2023-03-07 10:57:58,594][175731] Updated weights for policy 0, policy_version 55340 (0.0006) [2023-03-07 10:57:59,377][175731] Updated weights for policy 0, policy_version 55350 (0.0008) [2023-03-07 10:58:00,179][175731] Updated weights for policy 0, policy_version 55360 (0.0007) [2023-03-07 10:58:00,975][175731] Updated weights for policy 0, policy_version 55370 (0.0007) [2023-03-07 10:58:01,780][175731] Updated weights for policy 0, policy_version 55380 (0.0006) [2023-03-07 10:58:02,590][175731] Updated weights for policy 0, policy_version 55390 (0.0007) [2023-03-07 10:58:03,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12860.7). Total num frames: 56728576. Throughput: 0: 12835.8. Samples: 56714876. Policy #0 lag: (min: 0.0, avg: 1.3, max: 4.0) [2023-03-07 10:58:03,322][175405] Avg episode reward: [(0, '24.051')] [2023-03-07 10:58:03,375][175731] Updated weights for policy 0, policy_version 55400 (0.0006) [2023-03-07 10:58:04,182][175731] Updated weights for policy 0, policy_version 55410 (0.0006) [2023-03-07 10:58:04,953][175731] Updated weights for policy 0, policy_version 55420 (0.0006) [2023-03-07 10:58:05,774][175731] Updated weights for policy 0, policy_version 55430 (0.0007) [2023-03-07 10:58:06,566][175731] Updated weights for policy 0, policy_version 55440 (0.0006) [2023-03-07 10:58:07,377][175731] Updated weights for policy 0, policy_version 55450 (0.0006) [2023-03-07 10:58:08,166][175731] Updated weights for policy 0, policy_version 55460 (0.0006) [2023-03-07 10:58:08,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12857.3). Total num frames: 56792064. Throughput: 0: 12836.2. Samples: 56791994. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 4.0) [2023-03-07 10:58:08,321][175405] Avg episode reward: [(0, '22.012')] [2023-03-07 10:58:08,976][175731] Updated weights for policy 0, policy_version 55470 (0.0007) [2023-03-07 10:58:09,779][175731] Updated weights for policy 0, policy_version 55480 (0.0008) [2023-03-07 10:58:10,573][175731] Updated weights for policy 0, policy_version 55490 (0.0007) [2023-03-07 10:58:11,368][175731] Updated weights for policy 0, policy_version 55500 (0.0006) [2023-03-07 10:58:12,159][175731] Updated weights for policy 0, policy_version 55510 (0.0007) [2023-03-07 10:58:12,953][175731] Updated weights for policy 0, policy_version 55520 (0.0006) [2023-03-07 10:58:13,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12834.1, 300 sec: 12857.3). Total num frames: 56856576. Throughput: 0: 12838.0. Samples: 56830441. Policy #0 lag: (min: 0.0, avg: 1.3, max: 4.0) [2023-03-07 10:58:13,321][175405] Avg episode reward: [(0, '24.767')] [2023-03-07 10:58:13,761][175731] Updated weights for policy 0, policy_version 55530 (0.0007) [2023-03-07 10:58:14,550][175731] Updated weights for policy 0, policy_version 55540 (0.0006) [2023-03-07 10:58:15,349][175731] Updated weights for policy 0, policy_version 55550 (0.0006) [2023-03-07 10:58:16,138][175731] Updated weights for policy 0, policy_version 55560 (0.0007) [2023-03-07 10:58:16,942][175731] Updated weights for policy 0, policy_version 55570 (0.0006) [2023-03-07 10:58:17,761][175731] Updated weights for policy 0, policy_version 55580 (0.0006) [2023-03-07 10:58:18,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12817.0, 300 sec: 12853.8). Total num frames: 56920064. Throughput: 0: 12833.4. Samples: 56907336. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 4.0) [2023-03-07 10:58:18,322][175405] Avg episode reward: [(0, '23.770')] [2023-03-07 10:58:18,546][175731] Updated weights for policy 0, policy_version 55590 (0.0007) [2023-03-07 10:58:19,330][175731] Updated weights for policy 0, policy_version 55600 (0.0007) [2023-03-07 10:58:20,133][175731] Updated weights for policy 0, policy_version 55610 (0.0006) [2023-03-07 10:58:20,926][175731] Updated weights for policy 0, policy_version 55620 (0.0006) [2023-03-07 10:58:21,722][175731] Updated weights for policy 0, policy_version 55630 (0.0006) [2023-03-07 10:58:22,523][175731] Updated weights for policy 0, policy_version 55640 (0.0007) [2023-03-07 10:58:23,301][175731] Updated weights for policy 0, policy_version 55650 (0.0006) [2023-03-07 10:58:23,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12857.3). Total num frames: 56985600. Throughput: 0: 12830.7. Samples: 56984603. Policy #0 lag: (min: 0.0, avg: 1.3, max: 4.0) [2023-03-07 10:58:23,322][175405] Avg episode reward: [(0, '23.112')] [2023-03-07 10:58:24,123][175731] Updated weights for policy 0, policy_version 55660 (0.0007) [2023-03-07 10:58:24,914][175731] Updated weights for policy 0, policy_version 55670 (0.0006) [2023-03-07 10:58:25,710][175731] Updated weights for policy 0, policy_version 55680 (0.0006) [2023-03-07 10:58:26,504][175731] Updated weights for policy 0, policy_version 55690 (0.0007) [2023-03-07 10:58:27,304][175731] Updated weights for policy 0, policy_version 55700 (0.0007) [2023-03-07 10:58:28,096][175731] Updated weights for policy 0, policy_version 55710 (0.0008) [2023-03-07 10:58:28,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12834.1, 300 sec: 12853.8). Total num frames: 57049088. Throughput: 0: 12832.3. Samples: 57023128. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 4.0) [2023-03-07 10:58:28,322][175405] Avg episode reward: [(0, '22.928')] [2023-03-07 10:58:28,880][175731] Updated weights for policy 0, policy_version 55720 (0.0006) [2023-03-07 10:58:29,692][175731] Updated weights for policy 0, policy_version 55730 (0.0006) [2023-03-07 10:58:30,480][175731] Updated weights for policy 0, policy_version 55740 (0.0006) [2023-03-07 10:58:31,290][175731] Updated weights for policy 0, policy_version 55750 (0.0006) [2023-03-07 10:58:32,074][175731] Updated weights for policy 0, policy_version 55760 (0.0006) [2023-03-07 10:58:32,873][175731] Updated weights for policy 0, policy_version 55770 (0.0006) [2023-03-07 10:58:33,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12853.8). Total num frames: 57113600. Throughput: 0: 12835.2. Samples: 57100361. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:58:33,321][175405] Avg episode reward: [(0, '22.233')] [2023-03-07 10:58:33,669][175731] Updated weights for policy 0, policy_version 55780 (0.0006) [2023-03-07 10:58:34,470][175731] Updated weights for policy 0, policy_version 55790 (0.0007) [2023-03-07 10:58:35,268][175731] Updated weights for policy 0, policy_version 55800 (0.0007) [2023-03-07 10:58:36,063][175731] Updated weights for policy 0, policy_version 55810 (0.0006) [2023-03-07 10:58:36,871][175731] Updated weights for policy 0, policy_version 55820 (0.0006) [2023-03-07 10:58:37,682][175731] Updated weights for policy 0, policy_version 55830 (0.0007) [2023-03-07 10:58:38,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12853.8). Total num frames: 57177088. Throughput: 0: 12837.2. Samples: 57177214. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:58:38,322][175405] Avg episode reward: [(0, '21.805')] [2023-03-07 10:58:38,487][175731] Updated weights for policy 0, policy_version 55840 (0.0007) [2023-03-07 10:58:39,272][175731] Updated weights for policy 0, policy_version 55850 (0.0007) [2023-03-07 10:58:40,076][175731] Updated weights for policy 0, policy_version 55860 (0.0006) [2023-03-07 10:58:40,898][175731] Updated weights for policy 0, policy_version 55870 (0.0006) [2023-03-07 10:58:41,694][175731] Updated weights for policy 0, policy_version 55880 (0.0006) [2023-03-07 10:58:42,477][175731] Updated weights for policy 0, policy_version 55890 (0.0007) [2023-03-07 10:58:43,265][175731] Updated weights for policy 0, policy_version 55900 (0.0006) [2023-03-07 10:58:43,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12853.8). Total num frames: 57241600. Throughput: 0: 12838.3. Samples: 57215582. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:58:43,321][175405] Avg episode reward: [(0, '22.894')] [2023-03-07 10:58:44,054][175731] Updated weights for policy 0, policy_version 55910 (0.0006) [2023-03-07 10:58:44,858][175731] Updated weights for policy 0, policy_version 55920 (0.0006) [2023-03-07 10:58:45,650][175731] Updated weights for policy 0, policy_version 55930 (0.0007) [2023-03-07 10:58:46,452][175731] Updated weights for policy 0, policy_version 55940 (0.0006) [2023-03-07 10:58:47,255][175731] Updated weights for policy 0, policy_version 55950 (0.0007) [2023-03-07 10:58:48,045][175731] Updated weights for policy 0, policy_version 55960 (0.0006) [2023-03-07 10:58:48,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12834.1, 300 sec: 12853.8). Total num frames: 57306112. Throughput: 0: 12840.6. Samples: 57292702. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:58:48,333][175405] Avg episode reward: [(0, '23.669')] [2023-03-07 10:58:48,857][175731] Updated weights for policy 0, policy_version 55970 (0.0006) [2023-03-07 10:58:49,657][175731] Updated weights for policy 0, policy_version 55980 (0.0007) [2023-03-07 10:58:50,446][175731] Updated weights for policy 0, policy_version 55990 (0.0007) [2023-03-07 10:58:51,239][175731] Updated weights for policy 0, policy_version 56000 (0.0006) [2023-03-07 10:58:52,033][175731] Updated weights for policy 0, policy_version 56010 (0.0006) [2023-03-07 10:58:52,823][175731] Updated weights for policy 0, policy_version 56020 (0.0007) [2023-03-07 10:58:53,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12853.8). Total num frames: 57370624. Throughput: 0: 12839.1. Samples: 57369751. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:58:53,332][175405] Avg episode reward: [(0, '23.932')] [2023-03-07 10:58:53,641][175731] Updated weights for policy 0, policy_version 56030 (0.0007) [2023-03-07 10:58:54,437][175731] Updated weights for policy 0, policy_version 56040 (0.0006) [2023-03-07 10:58:55,223][175731] Updated weights for policy 0, policy_version 56050 (0.0006) [2023-03-07 10:58:56,023][175731] Updated weights for policy 0, policy_version 56060 (0.0006) [2023-03-07 10:58:56,802][175731] Updated weights for policy 0, policy_version 56070 (0.0006) [2023-03-07 10:58:57,609][175731] Updated weights for policy 0, policy_version 56080 (0.0007) [2023-03-07 10:58:58,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12857.3). Total num frames: 57435136. Throughput: 0: 12843.5. Samples: 57408401. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:58:58,333][175405] Avg episode reward: [(0, '23.929')] [2023-03-07 10:58:58,400][175731] Updated weights for policy 0, policy_version 56090 (0.0006) [2023-03-07 10:58:59,201][175731] Updated weights for policy 0, policy_version 56100 (0.0007) [2023-03-07 10:58:59,977][175731] Updated weights for policy 0, policy_version 56110 (0.0007) [2023-03-07 10:59:00,796][175731] Updated weights for policy 0, policy_version 56120 (0.0007) [2023-03-07 10:59:01,583][175731] Updated weights for policy 0, policy_version 56130 (0.0006) [2023-03-07 10:59:02,370][175731] Updated weights for policy 0, policy_version 56140 (0.0007) [2023-03-07 10:59:03,172][175731] Updated weights for policy 0, policy_version 56150 (0.0006) [2023-03-07 10:59:03,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12834.1, 300 sec: 12853.8). Total num frames: 57498624. Throughput: 0: 12850.8. Samples: 57485623. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:59:03,332][175405] Avg episode reward: [(0, '21.623')] [2023-03-07 10:59:03,973][175731] Updated weights for policy 0, policy_version 56160 (0.0006) [2023-03-07 10:59:04,765][175731] Updated weights for policy 0, policy_version 56170 (0.0006) [2023-03-07 10:59:05,561][175731] Updated weights for policy 0, policy_version 56180 (0.0007) [2023-03-07 10:59:06,368][175731] Updated weights for policy 0, policy_version 56190 (0.0006) [2023-03-07 10:59:07,142][175731] Updated weights for policy 0, policy_version 56200 (0.0006) [2023-03-07 10:59:07,962][175731] Updated weights for policy 0, policy_version 56210 (0.0007) [2023-03-07 10:59:08,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12853.8). Total num frames: 57563136. Throughput: 0: 12846.9. Samples: 57562716. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:59:08,332][175405] Avg episode reward: [(0, '23.165')] [2023-03-07 10:59:08,749][175731] Updated weights for policy 0, policy_version 56220 (0.0007) [2023-03-07 10:59:09,538][175731] Updated weights for policy 0, policy_version 56230 (0.0008) [2023-03-07 10:59:10,337][175731] Updated weights for policy 0, policy_version 56240 (0.0006) [2023-03-07 10:59:11,147][175731] Updated weights for policy 0, policy_version 56250 (0.0006) [2023-03-07 10:59:11,918][175731] Updated weights for policy 0, policy_version 56260 (0.0006) [2023-03-07 10:59:12,734][175731] Updated weights for policy 0, policy_version 56270 (0.0006) [2023-03-07 10:59:13,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12857.3). Total num frames: 57627648. Throughput: 0: 12850.0. Samples: 57601378. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:59:13,322][175405] Avg episode reward: [(0, '22.619')] [2023-03-07 10:59:13,531][175731] Updated weights for policy 0, policy_version 56280 (0.0006) [2023-03-07 10:59:14,335][175731] Updated weights for policy 0, policy_version 56290 (0.0006) [2023-03-07 10:59:15,128][175731] Updated weights for policy 0, policy_version 56300 (0.0008) [2023-03-07 10:59:15,921][175731] Updated weights for policy 0, policy_version 56310 (0.0006) [2023-03-07 10:59:16,721][175731] Updated weights for policy 0, policy_version 56320 (0.0007) [2023-03-07 10:59:17,525][175731] Updated weights for policy 0, policy_version 56330 (0.0006) [2023-03-07 10:59:18,314][175731] Updated weights for policy 0, policy_version 56340 (0.0006) [2023-03-07 10:59:18,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12857.3). Total num frames: 57692160. Throughput: 0: 12846.9. Samples: 57678471. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:59:18,321][175405] Avg episode reward: [(0, '23.718')] [2023-03-07 10:59:19,104][175731] Updated weights for policy 0, policy_version 56350 (0.0006) [2023-03-07 10:59:19,908][175731] Updated weights for policy 0, policy_version 56360 (0.0006) [2023-03-07 10:59:20,698][175731] Updated weights for policy 0, policy_version 56370 (0.0008) [2023-03-07 10:59:21,508][175731] Updated weights for policy 0, policy_version 56380 (0.0007) [2023-03-07 10:59:22,310][175731] Updated weights for policy 0, policy_version 56390 (0.0006) [2023-03-07 10:59:23,119][175731] Updated weights for policy 0, policy_version 56400 (0.0006) [2023-03-07 10:59:23,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12853.8). Total num frames: 57755648. Throughput: 0: 12846.8. Samples: 57755318. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:59:23,322][175405] Avg episode reward: [(0, '23.239')] [2023-03-07 10:59:23,906][175731] Updated weights for policy 0, policy_version 56410 (0.0006) [2023-03-07 10:59:24,707][175731] Updated weights for policy 0, policy_version 56420 (0.0007) [2023-03-07 10:59:25,479][175731] Updated weights for policy 0, policy_version 56430 (0.0005) [2023-03-07 10:59:26,292][175731] Updated weights for policy 0, policy_version 56440 (0.0006) [2023-03-07 10:59:27,085][175731] Updated weights for policy 0, policy_version 56450 (0.0006) [2023-03-07 10:59:27,873][175731] Updated weights for policy 0, policy_version 56460 (0.0006) [2023-03-07 10:59:28,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12851.2, 300 sec: 12853.8). Total num frames: 57820160. Throughput: 0: 12848.9. Samples: 57793784. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:59:28,322][175405] Avg episode reward: [(0, '23.072')] [2023-03-07 10:59:28,659][175731] Updated weights for policy 0, policy_version 56470 (0.0006) [2023-03-07 10:59:29,477][175731] Updated weights for policy 0, policy_version 56480 (0.0006) [2023-03-07 10:59:30,292][175731] Updated weights for policy 0, policy_version 56490 (0.0007) [2023-03-07 10:59:31,074][175731] Updated weights for policy 0, policy_version 56500 (0.0007) [2023-03-07 10:59:31,873][175731] Updated weights for policy 0, policy_version 56510 (0.0006) [2023-03-07 10:59:32,669][175731] Updated weights for policy 0, policy_version 56520 (0.0006) [2023-03-07 10:59:33,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12851.2, 300 sec: 12853.8). Total num frames: 57884672. Throughput: 0: 12847.3. Samples: 57870829. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 10:59:33,321][175405] Avg episode reward: [(0, '23.385')] [2023-03-07 10:59:33,482][175731] Updated weights for policy 0, policy_version 56530 (0.0006) [2023-03-07 10:59:34,277][175731] Updated weights for policy 0, policy_version 56540 (0.0006) [2023-03-07 10:59:35,060][175731] Updated weights for policy 0, policy_version 56550 (0.0007) [2023-03-07 10:59:35,865][175731] Updated weights for policy 0, policy_version 56560 (0.0006) [2023-03-07 10:59:36,655][175731] Updated weights for policy 0, policy_version 56570 (0.0006) [2023-03-07 10:59:37,454][175731] Updated weights for policy 0, policy_version 56580 (0.0006) [2023-03-07 10:59:38,246][175731] Updated weights for policy 0, policy_version 56590 (0.0006) [2023-03-07 10:59:38,321][175405] Fps is (10 sec: 12902.6, 60 sec: 12868.3, 300 sec: 12853.8). Total num frames: 57949184. Throughput: 0: 12852.8. Samples: 57948127. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:59:38,322][175405] Avg episode reward: [(0, '23.451')] [2023-03-07 10:59:39,026][175731] Updated weights for policy 0, policy_version 56600 (0.0007) [2023-03-07 10:59:39,826][175731] Updated weights for policy 0, policy_version 56610 (0.0006) [2023-03-07 10:59:40,643][175731] Updated weights for policy 0, policy_version 56620 (0.0007) [2023-03-07 10:59:41,435][175731] Updated weights for policy 0, policy_version 56630 (0.0006) [2023-03-07 10:59:42,218][175731] Updated weights for policy 0, policy_version 56640 (0.0006) [2023-03-07 10:59:43,024][175731] Updated weights for policy 0, policy_version 56650 (0.0005) [2023-03-07 10:59:43,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12853.8). Total num frames: 58012672. Throughput: 0: 12853.5. Samples: 57986810. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:59:43,322][175405] Avg episode reward: [(0, '22.855')] [2023-03-07 10:59:43,814][175731] Updated weights for policy 0, policy_version 56660 (0.0007) [2023-03-07 10:59:44,605][175731] Updated weights for policy 0, policy_version 56670 (0.0006) [2023-03-07 10:59:45,397][175731] Updated weights for policy 0, policy_version 56680 (0.0007) [2023-03-07 10:59:46,201][175731] Updated weights for policy 0, policy_version 56690 (0.0006) [2023-03-07 10:59:46,993][175731] Updated weights for policy 0, policy_version 56700 (0.0006) [2023-03-07 10:59:47,814][175731] Updated weights for policy 0, policy_version 56710 (0.0007) [2023-03-07 10:59:48,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12851.2, 300 sec: 12853.8). Total num frames: 58077184. Throughput: 0: 12850.8. Samples: 58063912. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:59:48,322][175405] Avg episode reward: [(0, '23.251')] [2023-03-07 10:59:48,327][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000056716_58077184.pth... 
[2023-03-07 10:59:48,356][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000053704_54992896.pth [2023-03-07 10:59:48,598][175731] Updated weights for policy 0, policy_version 56720 (0.0006) [2023-03-07 10:59:49,398][175731] Updated weights for policy 0, policy_version 56730 (0.0006) [2023-03-07 10:59:50,202][175731] Updated weights for policy 0, policy_version 56740 (0.0007) [2023-03-07 10:59:51,009][175731] Updated weights for policy 0, policy_version 56750 (0.0006) [2023-03-07 10:59:51,798][175731] Updated weights for policy 0, policy_version 56760 (0.0006) [2023-03-07 10:59:52,585][175731] Updated weights for policy 0, policy_version 56770 (0.0006) [2023-03-07 10:59:53,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12853.8). Total num frames: 58141696. Throughput: 0: 12846.9. Samples: 58140826. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:59:53,322][175405] Avg episode reward: [(0, '23.867')] [2023-03-07 10:59:53,389][175731] Updated weights for policy 0, policy_version 56780 (0.0006) [2023-03-07 10:59:54,202][175731] Updated weights for policy 0, policy_version 56790 (0.0006) [2023-03-07 10:59:54,984][175731] Updated weights for policy 0, policy_version 56800 (0.0006) [2023-03-07 10:59:55,786][175731] Updated weights for policy 0, policy_version 56810 (0.0006) [2023-03-07 10:59:56,577][175731] Updated weights for policy 0, policy_version 56820 (0.0007) [2023-03-07 10:59:57,384][175731] Updated weights for policy 0, policy_version 56830 (0.0007) [2023-03-07 10:59:58,177][175731] Updated weights for policy 0, policy_version 56840 (0.0007) [2023-03-07 10:59:58,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12834.1, 300 sec: 12850.3). Total num frames: 58205184. Throughput: 0: 12838.4. Samples: 58179105. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 10:59:58,322][175405] Avg episode reward: [(0, '22.715')] [2023-03-07 10:59:58,968][175731] Updated weights for policy 0, policy_version 56850 (0.0006) [2023-03-07 10:59:59,759][175731] Updated weights for policy 0, policy_version 56860 (0.0005) [2023-03-07 11:00:00,576][175731] Updated weights for policy 0, policy_version 56870 (0.0006) [2023-03-07 11:00:01,369][175731] Updated weights for policy 0, policy_version 56880 (0.0006) [2023-03-07 11:00:02,188][175731] Updated weights for policy 0, policy_version 56890 (0.0006) [2023-03-07 11:00:02,978][175731] Updated weights for policy 0, policy_version 56900 (0.0006) [2023-03-07 11:00:03,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12851.2, 300 sec: 12850.3). Total num frames: 58269696. Throughput: 0: 12840.2. Samples: 58256280. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:00:03,322][175405] Avg episode reward: [(0, '23.204')] [2023-03-07 11:00:03,763][175731] Updated weights for policy 0, policy_version 56910 (0.0006) [2023-03-07 11:00:04,562][175731] Updated weights for policy 0, policy_version 56920 (0.0006) [2023-03-07 11:00:05,347][175731] Updated weights for policy 0, policy_version 56930 (0.0007) [2023-03-07 11:00:06,189][175731] Updated weights for policy 0, policy_version 56940 (0.0007) [2023-03-07 11:00:06,959][175731] Updated weights for policy 0, policy_version 56950 (0.0006) [2023-03-07 11:00:07,756][175731] Updated weights for policy 0, policy_version 56960 (0.0007) [2023-03-07 11:00:08,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12846.9). Total num frames: 58333184. Throughput: 0: 12843.4. Samples: 58333269. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:00:08,322][175405] Avg episode reward: [(0, '22.742')] [2023-03-07 11:00:08,545][175731] Updated weights for policy 0, policy_version 56970 (0.0006) [2023-03-07 11:00:09,343][175731] Updated weights for policy 0, policy_version 56980 (0.0006) [2023-03-07 11:00:10,120][175731] Updated weights for policy 0, policy_version 56990 (0.0007) [2023-03-07 11:00:10,915][175731] Updated weights for policy 0, policy_version 57000 (0.0005) [2023-03-07 11:00:11,726][175731] Updated weights for policy 0, policy_version 57010 (0.0006) [2023-03-07 11:00:12,498][175731] Updated weights for policy 0, policy_version 57020 (0.0006) [2023-03-07 11:00:13,301][175731] Updated weights for policy 0, policy_version 57030 (0.0007) [2023-03-07 11:00:13,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12851.2, 300 sec: 12850.3). Total num frames: 58398720. Throughput: 0: 12853.5. Samples: 58372190. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:00:13,321][175405] Avg episode reward: [(0, '22.015')] [2023-03-07 11:00:14,083][175731] Updated weights for policy 0, policy_version 57040 (0.0008) [2023-03-07 11:00:14,884][175731] Updated weights for policy 0, policy_version 57050 (0.0007) [2023-03-07 11:00:15,674][175731] Updated weights for policy 0, policy_version 57060 (0.0006) [2023-03-07 11:00:16,453][175731] Updated weights for policy 0, policy_version 57070 (0.0007) [2023-03-07 11:00:17,267][175731] Updated weights for policy 0, policy_version 57080 (0.0007) [2023-03-07 11:00:18,075][175731] Updated weights for policy 0, policy_version 57090 (0.0006) [2023-03-07 11:00:18,321][175405] Fps is (10 sec: 13004.7, 60 sec: 12851.2, 300 sec: 12853.8). Total num frames: 58463232. Throughput: 0: 12863.0. Samples: 58449664. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:00:18,322][175405] Avg episode reward: [(0, '24.804')] [2023-03-07 11:00:18,881][175731] Updated weights for policy 0, policy_version 57100 (0.0006) [2023-03-07 11:00:19,675][175731] Updated weights for policy 0, policy_version 57110 (0.0007) [2023-03-07 11:00:20,469][175731] Updated weights for policy 0, policy_version 57120 (0.0006) [2023-03-07 11:00:21,272][175731] Updated weights for policy 0, policy_version 57130 (0.0007) [2023-03-07 11:00:22,077][175731] Updated weights for policy 0, policy_version 57140 (0.0006) [2023-03-07 11:00:22,845][175731] Updated weights for policy 0, policy_version 57150 (0.0006) [2023-03-07 11:00:23,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12850.3). Total num frames: 58526720. Throughput: 0: 12852.3. Samples: 58526481. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:00:23,322][175405] Avg episode reward: [(0, '22.416')] [2023-03-07 11:00:23,662][175731] Updated weights for policy 0, policy_version 57160 (0.0006) [2023-03-07 11:00:24,463][175731] Updated weights for policy 0, policy_version 57170 (0.0007) [2023-03-07 11:00:25,262][175731] Updated weights for policy 0, policy_version 57180 (0.0006) [2023-03-07 11:00:26,065][175731] Updated weights for policy 0, policy_version 57190 (0.0006) [2023-03-07 11:00:26,858][175731] Updated weights for policy 0, policy_version 57200 (0.0007) [2023-03-07 11:00:27,653][175731] Updated weights for policy 0, policy_version 57210 (0.0006) [2023-03-07 11:00:28,321][175405] Fps is (10 sec: 12800.2, 60 sec: 12851.2, 300 sec: 12850.3). Total num frames: 58591232. Throughput: 0: 12844.8. Samples: 58564825. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:00:28,321][175405] Avg episode reward: [(0, '24.042')] [2023-03-07 11:00:28,448][175731] Updated weights for policy 0, policy_version 57220 (0.0007) [2023-03-07 11:00:29,253][175731] Updated weights for policy 0, policy_version 57230 (0.0007) [2023-03-07 11:00:30,034][175731] Updated weights for policy 0, policy_version 57240 (0.0006) [2023-03-07 11:00:30,831][175731] Updated weights for policy 0, policy_version 57250 (0.0007) [2023-03-07 11:00:31,627][175731] Updated weights for policy 0, policy_version 57260 (0.0006) [2023-03-07 11:00:32,410][175731] Updated weights for policy 0, policy_version 57270 (0.0006) [2023-03-07 11:00:33,215][175731] Updated weights for policy 0, policy_version 57280 (0.0006) [2023-03-07 11:00:33,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12853.8). Total num frames: 58655744. Throughput: 0: 12849.8. Samples: 58642151. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:00:33,322][175405] Avg episode reward: [(0, '23.913')] [2023-03-07 11:00:34,014][175731] Updated weights for policy 0, policy_version 57290 (0.0006) [2023-03-07 11:00:34,814][175731] Updated weights for policy 0, policy_version 57300 (0.0007) [2023-03-07 11:00:35,594][175731] Updated weights for policy 0, policy_version 57310 (0.0006) [2023-03-07 11:00:36,409][175731] Updated weights for policy 0, policy_version 57320 (0.0006) [2023-03-07 11:00:37,203][175731] Updated weights for policy 0, policy_version 57330 (0.0006) [2023-03-07 11:00:38,006][175731] Updated weights for policy 0, policy_version 57340 (0.0007) [2023-03-07 11:00:38,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12850.3). Total num frames: 58719232. Throughput: 0: 12853.9. Samples: 58719249. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:00:38,321][175405] Avg episode reward: [(0, '23.220')] [2023-03-07 11:00:38,810][175731] Updated weights for policy 0, policy_version 57350 (0.0006) [2023-03-07 11:00:39,605][175731] Updated weights for policy 0, policy_version 57360 (0.0006) [2023-03-07 11:00:40,386][175731] Updated weights for policy 0, policy_version 57370 (0.0006) [2023-03-07 11:00:41,186][175731] Updated weights for policy 0, policy_version 57380 (0.0007) [2023-03-07 11:00:41,974][175731] Updated weights for policy 0, policy_version 57390 (0.0007) [2023-03-07 11:00:42,780][175731] Updated weights for policy 0, policy_version 57400 (0.0007) [2023-03-07 11:00:43,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12850.3). Total num frames: 58783744. Throughput: 0: 12859.6. Samples: 58757789. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:00:43,322][175405] Avg episode reward: [(0, '23.226')] [2023-03-07 11:00:43,582][175731] Updated weights for policy 0, policy_version 57410 (0.0006) [2023-03-07 11:00:44,381][175731] Updated weights for policy 0, policy_version 57420 (0.0006) [2023-03-07 11:00:45,165][175731] Updated weights for policy 0, policy_version 57430 (0.0006) [2023-03-07 11:00:45,990][175731] Updated weights for policy 0, policy_version 57440 (0.0006) [2023-03-07 11:00:46,772][175731] Updated weights for policy 0, policy_version 57450 (0.0007) [2023-03-07 11:00:47,578][175731] Updated weights for policy 0, policy_version 57460 (0.0007) [2023-03-07 11:00:48,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12851.2, 300 sec: 12853.8). Total num frames: 58848256. Throughput: 0: 12856.9. Samples: 58834842. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:00:48,322][175405] Avg episode reward: [(0, '23.684')] [2023-03-07 11:00:48,385][175731] Updated weights for policy 0, policy_version 57470 (0.0007) [2023-03-07 11:00:49,171][175731] Updated weights for policy 0, policy_version 57480 (0.0006) [2023-03-07 11:00:49,989][175731] Updated weights for policy 0, policy_version 57490 (0.0006) [2023-03-07 11:00:50,778][175731] Updated weights for policy 0, policy_version 57500 (0.0006) [2023-03-07 11:00:51,567][175731] Updated weights for policy 0, policy_version 57510 (0.0006) [2023-03-07 11:00:52,373][175731] Updated weights for policy 0, policy_version 57520 (0.0007) [2023-03-07 11:00:53,175][175731] Updated weights for policy 0, policy_version 57530 (0.0006) [2023-03-07 11:00:53,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12834.1, 300 sec: 12850.3). Total num frames: 58911744. Throughput: 0: 12852.4. Samples: 58911626. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:00:53,321][175405] Avg episode reward: [(0, '21.585')] [2023-03-07 11:00:53,973][175731] Updated weights for policy 0, policy_version 57540 (0.0006) [2023-03-07 11:00:54,765][175731] Updated weights for policy 0, policy_version 57550 (0.0006) [2023-03-07 11:00:55,559][175731] Updated weights for policy 0, policy_version 57560 (0.0006) [2023-03-07 11:00:56,363][175731] Updated weights for policy 0, policy_version 57570 (0.0007) [2023-03-07 11:00:57,155][175731] Updated weights for policy 0, policy_version 57580 (0.0007) [2023-03-07 11:00:57,951][175731] Updated weights for policy 0, policy_version 57590 (0.0007) [2023-03-07 11:00:58,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12851.2, 300 sec: 12850.3). Total num frames: 58976256. Throughput: 0: 12843.2. Samples: 58950134. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:00:58,322][175405] Avg episode reward: [(0, '23.528')] [2023-03-07 11:00:58,757][175731] Updated weights for policy 0, policy_version 57600 (0.0006) [2023-03-07 11:00:59,536][175731] Updated weights for policy 0, policy_version 57610 (0.0006) [2023-03-07 11:01:00,322][175731] Updated weights for policy 0, policy_version 57620 (0.0006) [2023-03-07 11:01:01,121][175731] Updated weights for policy 0, policy_version 57630 (0.0007) [2023-03-07 11:01:01,937][175731] Updated weights for policy 0, policy_version 57640 (0.0006) [2023-03-07 11:01:02,730][175731] Updated weights for policy 0, policy_version 57650 (0.0007) [2023-03-07 11:01:03,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12851.2, 300 sec: 12850.3). Total num frames: 59040768. Throughput: 0: 12837.5. Samples: 59027349. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:01:03,332][175405] Avg episode reward: [(0, '23.043')] [2023-03-07 11:01:03,547][175731] Updated weights for policy 0, policy_version 57660 (0.0006) [2023-03-07 11:01:04,336][175731] Updated weights for policy 0, policy_version 57670 (0.0006) [2023-03-07 11:01:05,122][175731] Updated weights for policy 0, policy_version 57680 (0.0006) [2023-03-07 11:01:05,904][175731] Updated weights for policy 0, policy_version 57690 (0.0006) [2023-03-07 11:01:06,703][175731] Updated weights for policy 0, policy_version 57700 (0.0006) [2023-03-07 11:01:07,497][175731] Updated weights for policy 0, policy_version 57710 (0.0006) [2023-03-07 11:01:08,282][175731] Updated weights for policy 0, policy_version 57720 (0.0006) [2023-03-07 11:01:08,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12850.3). Total num frames: 59105280. Throughput: 0: 12848.4. Samples: 59104661. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:01:08,332][175405] Avg episode reward: [(0, '24.578')] [2023-03-07 11:01:09,099][175731] Updated weights for policy 0, policy_version 57730 (0.0007) [2023-03-07 11:01:09,898][175731] Updated weights for policy 0, policy_version 57740 (0.0007) [2023-03-07 11:01:10,703][175731] Updated weights for policy 0, policy_version 57750 (0.0006) [2023-03-07 11:01:11,506][175731] Updated weights for policy 0, policy_version 57760 (0.0007) [2023-03-07 11:01:12,296][175731] Updated weights for policy 0, policy_version 57770 (0.0006) [2023-03-07 11:01:13,081][175731] Updated weights for policy 0, policy_version 57780 (0.0006) [2023-03-07 11:01:13,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12851.2, 300 sec: 12853.8). Total num frames: 59169792. Throughput: 0: 12847.1. Samples: 59142947. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:01:13,332][175405] Avg episode reward: [(0, '23.669')] [2023-03-07 11:01:13,876][175731] Updated weights for policy 0, policy_version 57790 (0.0006) [2023-03-07 11:01:14,681][175731] Updated weights for policy 0, policy_version 57800 (0.0007) [2023-03-07 11:01:15,475][175731] Updated weights for policy 0, policy_version 57810 (0.0007) [2023-03-07 11:01:16,262][175731] Updated weights for policy 0, policy_version 57820 (0.0006) [2023-03-07 11:01:17,073][175731] Updated weights for policy 0, policy_version 57830 (0.0007) [2023-03-07 11:01:17,863][175731] Updated weights for policy 0, policy_version 57840 (0.0006) [2023-03-07 11:01:18,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12834.1, 300 sec: 12850.3). Total num frames: 59233280. Throughput: 0: 12842.4. Samples: 59220059. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:01:18,332][175405] Avg episode reward: [(0, '23.627')] [2023-03-07 11:01:18,672][175731] Updated weights for policy 0, policy_version 57850 (0.0006) [2023-03-07 11:01:19,466][175731] Updated weights for policy 0, policy_version 57860 (0.0006) [2023-03-07 11:01:20,279][175731] Updated weights for policy 0, policy_version 57870 (0.0008) [2023-03-07 11:01:21,048][175731] Updated weights for policy 0, policy_version 57880 (0.0007) [2023-03-07 11:01:21,870][175731] Updated weights for policy 0, policy_version 57890 (0.0007) [2023-03-07 11:01:22,656][175731] Updated weights for policy 0, policy_version 57900 (0.0007) [2023-03-07 11:01:23,321][175405] Fps is (10 sec: 12800.2, 60 sec: 12851.2, 300 sec: 12850.3). Total num frames: 59297792. Throughput: 0: 12841.9. Samples: 59297132. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:01:23,330][175405] Avg episode reward: [(0, '22.972')] [2023-03-07 11:01:23,462][175731] Updated weights for policy 0, policy_version 57910 (0.0007) [2023-03-07 11:01:24,242][175731] Updated weights for policy 0, policy_version 57920 (0.0008) [2023-03-07 11:01:25,037][175731] Updated weights for policy 0, policy_version 57930 (0.0006) [2023-03-07 11:01:25,830][175731] Updated weights for policy 0, policy_version 57940 (0.0007) [2023-03-07 11:01:26,616][175731] Updated weights for policy 0, policy_version 57950 (0.0006) [2023-03-07 11:01:27,409][175731] Updated weights for policy 0, policy_version 57960 (0.0007) [2023-03-07 11:01:28,209][175731] Updated weights for policy 0, policy_version 57970 (0.0007) [2023-03-07 11:01:28,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12851.2, 300 sec: 12850.3). Total num frames: 59362304. Throughput: 0: 12847.2. Samples: 59335911. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:01:28,332][175405] Avg episode reward: [(0, '22.358')] [2023-03-07 11:01:29,015][175731] Updated weights for policy 0, policy_version 57980 (0.0006) [2023-03-07 11:01:29,799][175731] Updated weights for policy 0, policy_version 57990 (0.0006) [2023-03-07 11:01:30,606][175731] Updated weights for policy 0, policy_version 58000 (0.0007) [2023-03-07 11:01:31,386][175731] Updated weights for policy 0, policy_version 58010 (0.0007) [2023-03-07 11:01:32,217][175731] Updated weights for policy 0, policy_version 58020 (0.0007) [2023-03-07 11:01:33,009][175731] Updated weights for policy 0, policy_version 58030 (0.0007) [2023-03-07 11:01:33,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12834.1, 300 sec: 12846.9). Total num frames: 59425792. Throughput: 0: 12847.1. Samples: 59412964. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:01:33,332][175405] Avg episode reward: [(0, '26.857')] [2023-03-07 11:01:33,782][175731] Updated weights for policy 0, policy_version 58040 (0.0007) [2023-03-07 11:01:34,592][175731] Updated weights for policy 0, policy_version 58050 (0.0007) [2023-03-07 11:01:35,394][175731] Updated weights for policy 0, policy_version 58060 (0.0007) [2023-03-07 11:01:36,194][175731] Updated weights for policy 0, policy_version 58070 (0.0006) [2023-03-07 11:01:36,993][175731] Updated weights for policy 0, policy_version 58080 (0.0007) [2023-03-07 11:01:37,782][175731] Updated weights for policy 0, policy_version 58090 (0.0006) [2023-03-07 11:01:38,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12846.9). Total num frames: 59490304. Throughput: 0: 12854.3. Samples: 59490071. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:01:38,332][175405] Avg episode reward: [(0, '22.498')] [2023-03-07 11:01:38,595][175731] Updated weights for policy 0, policy_version 58100 (0.0006) [2023-03-07 11:01:39,386][175731] Updated weights for policy 0, policy_version 58110 (0.0006) [2023-03-07 11:01:40,179][175731] Updated weights for policy 0, policy_version 58120 (0.0006) [2023-03-07 11:01:40,991][175731] Updated weights for policy 0, policy_version 58130 (0.0007) [2023-03-07 11:01:41,799][175731] Updated weights for policy 0, policy_version 58140 (0.0006) [2023-03-07 11:01:42,583][175731] Updated weights for policy 0, policy_version 58150 (0.0006) [2023-03-07 11:01:43,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12850.3). Total num frames: 59554816. Throughput: 0: 12849.8. Samples: 59528373. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:01:43,332][175405] Avg episode reward: [(0, '21.876')] [2023-03-07 11:01:43,378][175731] Updated weights for policy 0, policy_version 58160 (0.0006) [2023-03-07 11:01:44,191][175731] Updated weights for policy 0, policy_version 58170 (0.0006) [2023-03-07 11:01:44,973][175731] Updated weights for policy 0, policy_version 58180 (0.0007) [2023-03-07 11:01:45,780][175731] Updated weights for policy 0, policy_version 58190 (0.0007) [2023-03-07 11:01:46,578][175731] Updated weights for policy 0, policy_version 58200 (0.0007) [2023-03-07 11:01:47,405][175731] Updated weights for policy 0, policy_version 58210 (0.0007) [2023-03-07 11:01:48,200][175731] Updated weights for policy 0, policy_version 58220 (0.0006) [2023-03-07 11:01:48,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12834.1, 300 sec: 12846.9). Total num frames: 59618304. Throughput: 0: 12838.1. Samples: 59605065. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:01:48,332][175405] Avg episode reward: [(0, '22.834')] [2023-03-07 11:01:48,337][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000058221_59618304.pth... [2023-03-07 11:01:48,364][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000055211_56536064.pth [2023-03-07 11:01:48,984][175731] Updated weights for policy 0, policy_version 58230 (0.0007) [2023-03-07 11:01:49,799][175731] Updated weights for policy 0, policy_version 58240 (0.0006) [2023-03-07 11:01:50,587][175731] Updated weights for policy 0, policy_version 58250 (0.0007) [2023-03-07 11:01:51,389][175731] Updated weights for policy 0, policy_version 58260 (0.0006) [2023-03-07 11:01:52,178][175731] Updated weights for policy 0, policy_version 58270 (0.0007) [2023-03-07 11:01:52,971][175731] Updated weights for policy 0, policy_version 58280 (0.0006) [2023-03-07 11:01:53,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12851.2, 300 sec: 12846.9). Total num frames: 59682816. Throughput: 0: 12833.3. Samples: 59682161. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:01:53,332][175405] Avg episode reward: [(0, '21.921')] [2023-03-07 11:01:53,758][175731] Updated weights for policy 0, policy_version 58290 (0.0006) [2023-03-07 11:01:54,559][175731] Updated weights for policy 0, policy_version 58300 (0.0006) [2023-03-07 11:01:55,364][175731] Updated weights for policy 0, policy_version 58310 (0.0007) [2023-03-07 11:01:56,167][175731] Updated weights for policy 0, policy_version 58320 (0.0006) [2023-03-07 11:01:56,977][175731] Updated weights for policy 0, policy_version 58330 (0.0006) [2023-03-07 11:01:57,786][175731] Updated weights for policy 0, policy_version 58340 (0.0006) [2023-03-07 11:01:58,321][175405] Fps is (10 sec: 12800.2, 60 sec: 12834.2, 300 sec: 12843.4). Total num frames: 59746304. Throughput: 0: 12836.6. 
Samples: 59720591. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:01:58,332][175405] Avg episode reward: [(0, '23.360')] [2023-03-07 11:01:58,587][175731] Updated weights for policy 0, policy_version 58350 (0.0006) [2023-03-07 11:01:59,388][175731] Updated weights for policy 0, policy_version 58360 (0.0006) [2023-03-07 11:02:00,177][175731] Updated weights for policy 0, policy_version 58370 (0.0007) [2023-03-07 11:02:00,956][175731] Updated weights for policy 0, policy_version 58380 (0.0007) [2023-03-07 11:02:01,766][175731] Updated weights for policy 0, policy_version 58390 (0.0007) [2023-03-07 11:02:02,552][175731] Updated weights for policy 0, policy_version 58400 (0.0005) [2023-03-07 11:02:03,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12834.1, 300 sec: 12843.4). Total num frames: 59810816. Throughput: 0: 12835.5. Samples: 59797655. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:02:03,332][175405] Avg episode reward: [(0, '23.080')] [2023-03-07 11:02:03,337][175731] Updated weights for policy 0, policy_version 58410 (0.0006) [2023-03-07 11:02:04,103][175731] Updated weights for policy 0, policy_version 58420 (0.0007) [2023-03-07 11:02:04,902][175731] Updated weights for policy 0, policy_version 58430 (0.0006) [2023-03-07 11:02:05,705][175731] Updated weights for policy 0, policy_version 58440 (0.0006) [2023-03-07 11:02:06,503][175731] Updated weights for policy 0, policy_version 58450 (0.0005) [2023-03-07 11:02:07,296][175731] Updated weights for policy 0, policy_version 58460 (0.0006) [2023-03-07 11:02:08,086][175731] Updated weights for policy 0, policy_version 58470 (0.0006) [2023-03-07 11:02:08,321][175405] Fps is (10 sec: 13004.8, 60 sec: 12851.2, 300 sec: 12846.9). Total num frames: 59876352. Throughput: 0: 12846.9. Samples: 59875242. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:02:08,322][175405] Avg episode reward: [(0, '23.232')] [2023-03-07 11:02:08,892][175731] Updated weights for policy 0, policy_version 58480 (0.0006) [2023-03-07 11:02:09,691][175731] Updated weights for policy 0, policy_version 58490 (0.0006) [2023-03-07 11:02:10,493][175731] Updated weights for policy 0, policy_version 58500 (0.0006) [2023-03-07 11:02:11,269][175731] Updated weights for policy 0, policy_version 58510 (0.0006) [2023-03-07 11:02:12,086][175731] Updated weights for policy 0, policy_version 58520 (0.0007) [2023-03-07 11:02:12,871][175731] Updated weights for policy 0, policy_version 58530 (0.0007) [2023-03-07 11:02:13,321][175405] Fps is (10 sec: 12902.2, 60 sec: 12834.1, 300 sec: 12843.4). Total num frames: 59939840. Throughput: 0: 12842.6. Samples: 59913831. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:02:13,322][175405] Avg episode reward: [(0, '22.941')] [2023-03-07 11:02:13,658][175731] Updated weights for policy 0, policy_version 58540 (0.0007) [2023-03-07 11:02:14,455][175731] Updated weights for policy 0, policy_version 58550 (0.0007) [2023-03-07 11:02:15,268][175731] Updated weights for policy 0, policy_version 58560 (0.0006) [2023-03-07 11:02:16,064][175731] Updated weights for policy 0, policy_version 58570 (0.0007) [2023-03-07 11:02:16,843][175731] Updated weights for policy 0, policy_version 58580 (0.0006) [2023-03-07 11:02:17,630][175731] Updated weights for policy 0, policy_version 58590 (0.0007) [2023-03-07 11:02:18,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12851.2, 300 sec: 12846.9). Total num frames: 60004352. Throughput: 0: 12846.9. Samples: 59991073. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:02:18,322][175405] Avg episode reward: [(0, '24.009')] [2023-03-07 11:02:18,433][175731] Updated weights for policy 0, policy_version 58600 (0.0006) [2023-03-07 11:02:19,214][175731] Updated weights for policy 0, policy_version 58610 (0.0007) [2023-03-07 11:02:20,010][175731] Updated weights for policy 0, policy_version 58620 (0.0007) [2023-03-07 11:02:20,825][175731] Updated weights for policy 0, policy_version 58630 (0.0007) [2023-03-07 11:02:21,613][175731] Updated weights for policy 0, policy_version 58640 (0.0007) [2023-03-07 11:02:22,413][175731] Updated weights for policy 0, policy_version 58650 (0.0006) [2023-03-07 11:02:23,235][175731] Updated weights for policy 0, policy_version 58660 (0.0006) [2023-03-07 11:02:23,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12851.2, 300 sec: 12846.9). Total num frames: 60068864. Throughput: 0: 12845.4. Samples: 60068114. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:02:23,322][175405] Avg episode reward: [(0, '22.721')] [2023-03-07 11:02:24,027][175731] Updated weights for policy 0, policy_version 58670 (0.0006) [2023-03-07 11:02:24,815][175731] Updated weights for policy 0, policy_version 58680 (0.0006) [2023-03-07 11:02:25,606][175731] Updated weights for policy 0, policy_version 58690 (0.0006) [2023-03-07 11:02:26,392][175731] Updated weights for policy 0, policy_version 58700 (0.0006) [2023-03-07 11:02:27,187][175731] Updated weights for policy 0, policy_version 58710 (0.0007) [2023-03-07 11:02:27,997][175731] Updated weights for policy 0, policy_version 58720 (0.0007) [2023-03-07 11:02:28,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12851.2, 300 sec: 12846.9). Total num frames: 60133376. Throughput: 0: 12854.6. Samples: 60106831. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:02:28,321][175405] Avg episode reward: [(0, '23.002')] [2023-03-07 11:02:28,783][175731] Updated weights for policy 0, policy_version 58730 (0.0006) [2023-03-07 11:02:29,597][175731] Updated weights for policy 0, policy_version 58740 (0.0006) [2023-03-07 11:02:30,386][175731] Updated weights for policy 0, policy_version 58750 (0.0006) [2023-03-07 11:02:31,177][175731] Updated weights for policy 0, policy_version 58760 (0.0007) [2023-03-07 11:02:31,968][175731] Updated weights for policy 0, policy_version 58770 (0.0006) [2023-03-07 11:02:32,764][175731] Updated weights for policy 0, policy_version 58780 (0.0006) [2023-03-07 11:02:33,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12851.2, 300 sec: 12843.4). Total num frames: 60196864. Throughput: 0: 12865.0. Samples: 60183989. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:02:33,322][175405] Avg episode reward: [(0, '22.823')] [2023-03-07 11:02:33,541][175731] Updated weights for policy 0, policy_version 58790 (0.0006) [2023-03-07 11:02:34,337][175731] Updated weights for policy 0, policy_version 58800 (0.0006) [2023-03-07 11:02:35,142][175731] Updated weights for policy 0, policy_version 58810 (0.0007) [2023-03-07 11:02:35,941][175731] Updated weights for policy 0, policy_version 58820 (0.0006) [2023-03-07 11:02:36,737][175731] Updated weights for policy 0, policy_version 58830 (0.0007) [2023-03-07 11:02:37,521][175731] Updated weights for policy 0, policy_version 58840 (0.0005) [2023-03-07 11:02:38,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12843.4). Total num frames: 60261376. Throughput: 0: 12871.5. Samples: 60261378. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:02:38,322][175405] Avg episode reward: [(0, '23.469')] [2023-03-07 11:02:38,334][175731] Updated weights for policy 0, policy_version 58850 (0.0006) [2023-03-07 11:02:39,122][175731] Updated weights for policy 0, policy_version 58860 (0.0006) [2023-03-07 11:02:39,922][175731] Updated weights for policy 0, policy_version 58870 (0.0006) [2023-03-07 11:02:40,713][175731] Updated weights for policy 0, policy_version 58880 (0.0007) [2023-03-07 11:02:41,502][175731] Updated weights for policy 0, policy_version 58890 (0.0007) [2023-03-07 11:02:42,297][175731] Updated weights for policy 0, policy_version 58900 (0.0007) [2023-03-07 11:02:43,113][175731] Updated weights for policy 0, policy_version 58910 (0.0007) [2023-03-07 11:02:43,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12851.2, 300 sec: 12846.9). Total num frames: 60325888. Throughput: 0: 12873.0. Samples: 60299878. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:02:43,322][175405] Avg episode reward: [(0, '24.659')] [2023-03-07 11:02:43,917][175731] Updated weights for policy 0, policy_version 58920 (0.0007) [2023-03-07 11:02:44,707][175731] Updated weights for policy 0, policy_version 58930 (0.0006) [2023-03-07 11:02:45,521][175731] Updated weights for policy 0, policy_version 58940 (0.0007) [2023-03-07 11:02:46,318][175731] Updated weights for policy 0, policy_version 58950 (0.0007) [2023-03-07 11:02:47,116][175731] Updated weights for policy 0, policy_version 58960 (0.0007) [2023-03-07 11:02:47,928][175731] Updated weights for policy 0, policy_version 58970 (0.0007) [2023-03-07 11:02:48,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12850.3). Total num frames: 60390400. Throughput: 0: 12868.1. Samples: 60376718. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:02:48,322][175405] Avg episode reward: [(0, '32.650')] [2023-03-07 11:02:48,719][175731] Updated weights for policy 0, policy_version 58980 (0.0006) [2023-03-07 11:02:49,529][175731] Updated weights for policy 0, policy_version 58990 (0.0007) [2023-03-07 11:02:50,307][175731] Updated weights for policy 0, policy_version 59000 (0.0006) [2023-03-07 11:02:51,111][175731] Updated weights for policy 0, policy_version 59010 (0.0006) [2023-03-07 11:02:51,903][175731] Updated weights for policy 0, policy_version 59020 (0.0006) [2023-03-07 11:02:52,699][175731] Updated weights for policy 0, policy_version 59030 (0.0006) [2023-03-07 11:02:53,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12851.2, 300 sec: 12846.9). Total num frames: 60453888. Throughput: 0: 12854.2. Samples: 60453680. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:02:53,321][175405] Avg episode reward: [(0, '23.348')] [2023-03-07 11:02:53,512][175731] Updated weights for policy 0, policy_version 59040 (0.0007) [2023-03-07 11:02:54,294][175731] Updated weights for policy 0, policy_version 59050 (0.0006) [2023-03-07 11:02:55,091][175731] Updated weights for policy 0, policy_version 59060 (0.0007) [2023-03-07 11:02:55,882][175731] Updated weights for policy 0, policy_version 59070 (0.0006) [2023-03-07 11:02:56,669][175731] Updated weights for policy 0, policy_version 59080 (0.0006) [2023-03-07 11:02:57,485][175731] Updated weights for policy 0, policy_version 59090 (0.0006) [2023-03-07 11:02:58,291][175731] Updated weights for policy 0, policy_version 59100 (0.0006) [2023-03-07 11:02:58,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12868.3, 300 sec: 12846.9). Total num frames: 60518400. Throughput: 0: 12853.6. Samples: 60492241. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:02:58,322][175405] Avg episode reward: [(0, '24.696')] [2023-03-07 11:02:59,083][175731] Updated weights for policy 0, policy_version 59110 (0.0007) [2023-03-07 11:02:59,882][175731] Updated weights for policy 0, policy_version 59120 (0.0006) [2023-03-07 11:03:00,674][175731] Updated weights for policy 0, policy_version 59130 (0.0007) [2023-03-07 11:03:01,470][175731] Updated weights for policy 0, policy_version 59140 (0.0007) [2023-03-07 11:03:02,274][175731] Updated weights for policy 0, policy_version 59150 (0.0007) [2023-03-07 11:03:03,066][175731] Updated weights for policy 0, policy_version 59160 (0.0006) [2023-03-07 11:03:03,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12868.2, 300 sec: 12850.3). Total num frames: 60582912. Throughput: 0: 12849.2. Samples: 60569286. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:03:03,322][175405] Avg episode reward: [(0, '24.219')] [2023-03-07 11:03:03,868][175731] Updated weights for policy 0, policy_version 59170 (0.0007) [2023-03-07 11:03:04,661][175731] Updated weights for policy 0, policy_version 59180 (0.0006) [2023-03-07 11:03:05,455][175731] Updated weights for policy 0, policy_version 59190 (0.0007) [2023-03-07 11:03:06,252][175731] Updated weights for policy 0, policy_version 59200 (0.0006) [2023-03-07 11:03:07,046][175731] Updated weights for policy 0, policy_version 59210 (0.0006) [2023-03-07 11:03:07,845][175731] Updated weights for policy 0, policy_version 59220 (0.0006) [2023-03-07 11:03:08,321][175405] Fps is (10 sec: 12800.2, 60 sec: 12834.2, 300 sec: 12846.9). Total num frames: 60646400. Throughput: 0: 12844.9. Samples: 60646134. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:03:08,321][175405] Avg episode reward: [(0, '21.835')] [2023-03-07 11:03:08,656][175731] Updated weights for policy 0, policy_version 59230 (0.0008) [2023-03-07 11:03:09,448][175731] Updated weights for policy 0, policy_version 59240 (0.0006) [2023-03-07 11:03:10,245][175731] Updated weights for policy 0, policy_version 59250 (0.0006) [2023-03-07 11:03:11,033][175731] Updated weights for policy 0, policy_version 59260 (0.0006) [2023-03-07 11:03:11,845][175731] Updated weights for policy 0, policy_version 59270 (0.0006) [2023-03-07 11:03:12,655][175731] Updated weights for policy 0, policy_version 59280 (0.0006) [2023-03-07 11:03:13,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12850.3). Total num frames: 60710912. Throughput: 0: 12842.2. Samples: 60684732. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 11:03:13,322][175405] Avg episode reward: [(0, '23.612')] [2023-03-07 11:03:13,460][175731] Updated weights for policy 0, policy_version 59290 (0.0006) [2023-03-07 11:03:14,235][175731] Updated weights for policy 0, policy_version 59300 (0.0006) [2023-03-07 11:03:15,030][175731] Updated weights for policy 0, policy_version 59310 (0.0006) [2023-03-07 11:03:15,823][175731] Updated weights for policy 0, policy_version 59320 (0.0007) [2023-03-07 11:03:16,613][175731] Updated weights for policy 0, policy_version 59330 (0.0006) [2023-03-07 11:03:17,408][175731] Updated weights for policy 0, policy_version 59340 (0.0006) [2023-03-07 11:03:18,176][175731] Updated weights for policy 0, policy_version 59350 (0.0007) [2023-03-07 11:03:18,321][175405] Fps is (10 sec: 12902.2, 60 sec: 12851.2, 300 sec: 12846.9). Total num frames: 60775424. Throughput: 0: 12848.7. Samples: 60762178. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 11:03:18,321][175405] Avg episode reward: [(0, '23.987')] [2023-03-07 11:03:18,974][175731] Updated weights for policy 0, policy_version 59360 (0.0007) [2023-03-07 11:03:19,770][175731] Updated weights for policy 0, policy_version 59370 (0.0006) [2023-03-07 11:03:20,579][175731] Updated weights for policy 0, policy_version 59380 (0.0008) [2023-03-07 11:03:21,362][175731] Updated weights for policy 0, policy_version 59390 (0.0007) [2023-03-07 11:03:22,158][175731] Updated weights for policy 0, policy_version 59400 (0.0007) [2023-03-07 11:03:22,957][175731] Updated weights for policy 0, policy_version 59410 (0.0007) [2023-03-07 11:03:23,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12850.3). Total num frames: 60839936. Throughput: 0: 12845.5. Samples: 60839428. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 11:03:23,322][175405] Avg episode reward: [(0, '23.143')] [2023-03-07 11:03:23,751][175731] Updated weights for policy 0, policy_version 59420 (0.0006) [2023-03-07 11:03:24,561][175731] Updated weights for policy 0, policy_version 59430 (0.0006) [2023-03-07 11:03:25,360][175731] Updated weights for policy 0, policy_version 59440 (0.0007) [2023-03-07 11:03:26,157][175731] Updated weights for policy 0, policy_version 59450 (0.0007) [2023-03-07 11:03:26,955][175731] Updated weights for policy 0, policy_version 59460 (0.0006) [2023-03-07 11:03:27,754][175731] Updated weights for policy 0, policy_version 59470 (0.0006) [2023-03-07 11:03:28,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12850.3). Total num frames: 60904448. Throughput: 0: 12844.8. Samples: 60877893. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 11:03:28,322][175405] Avg episode reward: [(0, '23.430')] [2023-03-07 11:03:28,553][175731] Updated weights for policy 0, policy_version 59480 (0.0006) [2023-03-07 11:03:29,341][175731] Updated weights for policy 0, policy_version 59490 (0.0006) [2023-03-07 11:03:30,140][175731] Updated weights for policy 0, policy_version 59500 (0.0007) [2023-03-07 11:03:30,941][175731] Updated weights for policy 0, policy_version 59510 (0.0006) [2023-03-07 11:03:31,742][175731] Updated weights for policy 0, policy_version 59520 (0.0007) [2023-03-07 11:03:32,526][175731] Updated weights for policy 0, policy_version 59530 (0.0006) [2023-03-07 11:03:33,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12850.3). Total num frames: 60967936. Throughput: 0: 12847.8. Samples: 60954872. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 11:03:33,322][175405] Avg episode reward: [(0, '23.470')] [2023-03-07 11:03:33,335][175731] Updated weights for policy 0, policy_version 59540 (0.0006) [2023-03-07 11:03:34,131][175731] Updated weights for policy 0, policy_version 59550 (0.0007) [2023-03-07 11:03:34,917][175731] Updated weights for policy 0, policy_version 59560 (0.0006) [2023-03-07 11:03:35,712][175731] Updated weights for policy 0, policy_version 59570 (0.0006) [2023-03-07 11:03:36,529][175731] Updated weights for policy 0, policy_version 59580 (0.0006) [2023-03-07 11:03:37,313][175731] Updated weights for policy 0, policy_version 59590 (0.0007) [2023-03-07 11:03:38,127][175731] Updated weights for policy 0, policy_version 59600 (0.0006) [2023-03-07 11:03:38,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12851.2, 300 sec: 12850.3). Total num frames: 61032448. Throughput: 0: 12851.2. Samples: 61031984. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 11:03:38,321][175405] Avg episode reward: [(0, '23.012')] [2023-03-07 11:03:38,914][175731] Updated weights for policy 0, policy_version 59610 (0.0006) [2023-03-07 11:03:39,712][175731] Updated weights for policy 0, policy_version 59620 (0.0006) [2023-03-07 11:03:40,512][175731] Updated weights for policy 0, policy_version 59630 (0.0006) [2023-03-07 11:03:41,301][175731] Updated weights for policy 0, policy_version 59640 (0.0006) [2023-03-07 11:03:42,100][175731] Updated weights for policy 0, policy_version 59650 (0.0006) [2023-03-07 11:03:42,878][175731] Updated weights for policy 0, policy_version 59660 (0.0006) [2023-03-07 11:03:43,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12851.2, 300 sec: 12850.3). Total num frames: 61096960. Throughput: 0: 12851.8. Samples: 61070571. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 11:03:43,322][175405] Avg episode reward: [(0, '24.564')] [2023-03-07 11:03:43,678][175731] Updated weights for policy 0, policy_version 59670 (0.0006) [2023-03-07 11:03:44,475][175731] Updated weights for policy 0, policy_version 59680 (0.0007) [2023-03-07 11:03:45,252][175731] Updated weights for policy 0, policy_version 59690 (0.0006) [2023-03-07 11:03:46,055][175731] Updated weights for policy 0, policy_version 59700 (0.0007) [2023-03-07 11:03:46,859][175731] Updated weights for policy 0, policy_version 59710 (0.0006) [2023-03-07 11:03:47,654][175731] Updated weights for policy 0, policy_version 59720 (0.0007) [2023-03-07 11:03:48,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12850.3). Total num frames: 61161472. Throughput: 0: 12861.4. Samples: 61148046. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 11:03:48,321][175405] Avg episode reward: [(0, '23.211')] [2023-03-07 11:03:48,325][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000059728_61161472.pth... 
[2023-03-07 11:03:48,354][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000056716_58077184.pth [2023-03-07 11:03:48,458][175731] Updated weights for policy 0, policy_version 59730 (0.0006) [2023-03-07 11:03:49,237][175731] Updated weights for policy 0, policy_version 59740 (0.0007) [2023-03-07 11:03:50,027][175731] Updated weights for policy 0, policy_version 59750 (0.0007) [2023-03-07 11:03:50,837][175731] Updated weights for policy 0, policy_version 59760 (0.0007) [2023-03-07 11:03:51,634][175731] Updated weights for policy 0, policy_version 59770 (0.0006) [2023-03-07 11:03:52,427][175731] Updated weights for policy 0, policy_version 59780 (0.0007) [2023-03-07 11:03:53,225][175731] Updated weights for policy 0, policy_version 59790 (0.0006) [2023-03-07 11:03:53,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12868.2, 300 sec: 12850.3). Total num frames: 61225984. Throughput: 0: 12865.8. Samples: 61225098. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:03:53,322][175405] Avg episode reward: [(0, '24.375')] [2023-03-07 11:03:54,013][175731] Updated weights for policy 0, policy_version 59800 (0.0007) [2023-03-07 11:03:54,825][175731] Updated weights for policy 0, policy_version 59810 (0.0006) [2023-03-07 11:03:55,633][175731] Updated weights for policy 0, policy_version 59820 (0.0006) [2023-03-07 11:03:56,403][175731] Updated weights for policy 0, policy_version 59830 (0.0006) [2023-03-07 11:03:57,237][175731] Updated weights for policy 0, policy_version 59840 (0.0007) [2023-03-07 11:03:58,049][175731] Updated weights for policy 0, policy_version 59850 (0.0006) [2023-03-07 11:03:58,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12851.2, 300 sec: 12850.3). Total num frames: 61289472. Throughput: 0: 12866.5. Samples: 61263723. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:03:58,322][175405] Avg episode reward: [(0, '23.531')] [2023-03-07 11:03:58,837][175731] Updated weights for policy 0, policy_version 59860 (0.0007) [2023-03-07 11:03:59,642][175731] Updated weights for policy 0, policy_version 59870 (0.0006) [2023-03-07 11:04:00,429][175731] Updated weights for policy 0, policy_version 59880 (0.0006) [2023-03-07 11:04:01,262][175731] Updated weights for policy 0, policy_version 59890 (0.0006) [2023-03-07 11:04:02,055][175731] Updated weights for policy 0, policy_version 59900 (0.0007) [2023-03-07 11:04:02,836][175731] Updated weights for policy 0, policy_version 59910 (0.0007) [2023-03-07 11:04:03,321][175405] Fps is (10 sec: 12800.2, 60 sec: 12851.2, 300 sec: 12850.3). Total num frames: 61353984. Throughput: 0: 12842.8. Samples: 61340102. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:04:03,322][175405] Avg episode reward: [(0, '24.112')] [2023-03-07 11:04:03,646][175731] Updated weights for policy 0, policy_version 59920 (0.0007) [2023-03-07 11:04:04,421][175731] Updated weights for policy 0, policy_version 59930 (0.0006) [2023-03-07 11:04:05,230][175731] Updated weights for policy 0, policy_version 59940 (0.0007) [2023-03-07 11:04:06,028][175731] Updated weights for policy 0, policy_version 59950 (0.0006) [2023-03-07 11:04:06,803][175731] Updated weights for policy 0, policy_version 59960 (0.0006) [2023-03-07 11:04:07,613][175731] Updated weights for policy 0, policy_version 59970 (0.0007) [2023-03-07 11:04:08,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12868.2, 300 sec: 12850.3). Total num frames: 61418496. Throughput: 0: 12846.8. Samples: 61417533. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:04:08,321][175405] Avg episode reward: [(0, '23.376')] [2023-03-07 11:04:08,409][175731] Updated weights for policy 0, policy_version 59980 (0.0006) [2023-03-07 11:04:09,195][175731] Updated weights for policy 0, policy_version 59990 (0.0005) [2023-03-07 11:04:10,009][175731] Updated weights for policy 0, policy_version 60000 (0.0007) [2023-03-07 11:04:10,806][175731] Updated weights for policy 0, policy_version 60010 (0.0006) [2023-03-07 11:04:11,593][175731] Updated weights for policy 0, policy_version 60020 (0.0006) [2023-03-07 11:04:12,383][175731] Updated weights for policy 0, policy_version 60030 (0.0006) [2023-03-07 11:04:13,188][175731] Updated weights for policy 0, policy_version 60040 (0.0007) [2023-03-07 11:04:13,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12851.2, 300 sec: 12846.9). Total num frames: 61481984. Throughput: 0: 12842.8. Samples: 61455819. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:04:13,322][175405] Avg episode reward: [(0, '24.035')] [2023-03-07 11:04:13,970][175731] Updated weights for policy 0, policy_version 60050 (0.0008) [2023-03-07 11:04:14,786][175731] Updated weights for policy 0, policy_version 60060 (0.0007) [2023-03-07 11:04:15,575][175731] Updated weights for policy 0, policy_version 60070 (0.0006) [2023-03-07 11:04:16,365][175731] Updated weights for policy 0, policy_version 60080 (0.0006) [2023-03-07 11:04:17,190][175731] Updated weights for policy 0, policy_version 60090 (0.0006) [2023-03-07 11:04:17,987][175731] Updated weights for policy 0, policy_version 60100 (0.0007) [2023-03-07 11:04:18,321][175405] Fps is (10 sec: 12697.3, 60 sec: 12834.1, 300 sec: 12846.9). Total num frames: 61545472. Throughput: 0: 12846.9. Samples: 61532983. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:04:18,322][175405] Avg episode reward: [(0, '25.179')] [2023-03-07 11:04:18,777][175731] Updated weights for policy 0, policy_version 60110 (0.0008) [2023-03-07 11:04:19,584][175731] Updated weights for policy 0, policy_version 60120 (0.0006) [2023-03-07 11:04:20,378][175731] Updated weights for policy 0, policy_version 60130 (0.0006) [2023-03-07 11:04:21,173][175731] Updated weights for policy 0, policy_version 60140 (0.0007) [2023-03-07 11:04:21,962][175731] Updated weights for policy 0, policy_version 60150 (0.0006) [2023-03-07 11:04:22,770][175731] Updated weights for policy 0, policy_version 60160 (0.0006) [2023-03-07 11:04:23,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12851.2, 300 sec: 12850.3). Total num frames: 61611008. Throughput: 0: 12843.4. Samples: 61609938. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:04:23,322][175405] Avg episode reward: [(0, '23.002')] [2023-03-07 11:04:23,565][175731] Updated weights for policy 0, policy_version 60170 (0.0005) [2023-03-07 11:04:24,374][175731] Updated weights for policy 0, policy_version 60180 (0.0006) [2023-03-07 11:04:25,155][175731] Updated weights for policy 0, policy_version 60190 (0.0007) [2023-03-07 11:04:25,950][175731] Updated weights for policy 0, policy_version 60200 (0.0007) [2023-03-07 11:04:26,765][175731] Updated weights for policy 0, policy_version 60210 (0.0007) [2023-03-07 11:04:27,580][175731] Updated weights for policy 0, policy_version 60220 (0.0006) [2023-03-07 11:04:28,321][175405] Fps is (10 sec: 12902.6, 60 sec: 12834.1, 300 sec: 12846.9). Total num frames: 61674496. Throughput: 0: 12846.1. Samples: 61648645. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:04:28,322][175405] Avg episode reward: [(0, '27.093')] [2023-03-07 11:04:28,353][175731] Updated weights for policy 0, policy_version 60230 (0.0006) [2023-03-07 11:04:29,148][175731] Updated weights for policy 0, policy_version 60240 (0.0006) [2023-03-07 11:04:29,957][175731] Updated weights for policy 0, policy_version 60250 (0.0006) [2023-03-07 11:04:30,749][175731] Updated weights for policy 0, policy_version 60260 (0.0006) [2023-03-07 11:04:31,561][175731] Updated weights for policy 0, policy_version 60270 (0.0006) [2023-03-07 11:04:32,350][175731] Updated weights for policy 0, policy_version 60280 (0.0006) [2023-03-07 11:04:33,149][175731] Updated weights for policy 0, policy_version 60290 (0.0007) [2023-03-07 11:04:33,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12851.2, 300 sec: 12846.9). Total num frames: 61739008. Throughput: 0: 12828.8. Samples: 61725344. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:04:33,322][175405] Avg episode reward: [(0, '24.101')] [2023-03-07 11:04:33,959][175731] Updated weights for policy 0, policy_version 60300 (0.0006) [2023-03-07 11:04:34,740][175731] Updated weights for policy 0, policy_version 60310 (0.0006) [2023-03-07 11:04:35,557][175731] Updated weights for policy 0, policy_version 60320 (0.0009) [2023-03-07 11:04:36,338][175731] Updated weights for policy 0, policy_version 60330 (0.0006) [2023-03-07 11:04:37,146][175731] Updated weights for policy 0, policy_version 60340 (0.0006) [2023-03-07 11:04:37,946][175731] Updated weights for policy 0, policy_version 60350 (0.0006) [2023-03-07 11:04:38,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12834.1, 300 sec: 12846.9). Total num frames: 61802496. Throughput: 0: 12828.4. Samples: 61802373. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:04:38,321][175405] Avg episode reward: [(0, '24.180')] [2023-03-07 11:04:38,733][175731] Updated weights for policy 0, policy_version 60360 (0.0007) [2023-03-07 11:04:39,526][175731] Updated weights for policy 0, policy_version 60370 (0.0007) [2023-03-07 11:04:40,307][175731] Updated weights for policy 0, policy_version 60380 (0.0006) [2023-03-07 11:04:41,107][175731] Updated weights for policy 0, policy_version 60390 (0.0006) [2023-03-07 11:04:41,918][175731] Updated weights for policy 0, policy_version 60400 (0.0007) [2023-03-07 11:04:42,712][175731] Updated weights for policy 0, policy_version 60410 (0.0007) [2023-03-07 11:04:43,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.2, 300 sec: 12846.9). Total num frames: 61867008. Throughput: 0: 12828.9. Samples: 61841023. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:04:43,321][175405] Avg episode reward: [(0, '24.120')] [2023-03-07 11:04:43,519][175731] Updated weights for policy 0, policy_version 60420 (0.0006) [2023-03-07 11:04:44,310][175731] Updated weights for policy 0, policy_version 60430 (0.0007) [2023-03-07 11:04:45,099][175731] Updated weights for policy 0, policy_version 60440 (0.0006) [2023-03-07 11:04:45,922][175731] Updated weights for policy 0, policy_version 60450 (0.0006) [2023-03-07 11:04:46,691][175731] Updated weights for policy 0, policy_version 60460 (0.0007) [2023-03-07 11:04:47,498][175731] Updated weights for policy 0, policy_version 60470 (0.0007) [2023-03-07 11:04:48,286][175731] Updated weights for policy 0, policy_version 60480 (0.0006) [2023-03-07 11:04:48,321][175405] Fps is (10 sec: 12902.2, 60 sec: 12834.1, 300 sec: 12846.9). Total num frames: 61931520. Throughput: 0: 12841.4. Samples: 61917968. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:04:48,322][175405] Avg episode reward: [(0, '26.319')] [2023-03-07 11:04:49,093][175731] Updated weights for policy 0, policy_version 60490 (0.0007) [2023-03-07 11:04:49,901][175731] Updated weights for policy 0, policy_version 60500 (0.0007) [2023-03-07 11:04:50,688][175731] Updated weights for policy 0, policy_version 60510 (0.0006) [2023-03-07 11:04:51,485][175731] Updated weights for policy 0, policy_version 60520 (0.0005) [2023-03-07 11:04:52,282][175731] Updated weights for policy 0, policy_version 60530 (0.0006) [2023-03-07 11:04:53,074][175731] Updated weights for policy 0, policy_version 60540 (0.0007) [2023-03-07 11:04:53,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12834.1, 300 sec: 12850.3). Total num frames: 61996032. Throughput: 0: 12836.3. Samples: 61995167. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:04:53,322][175405] Avg episode reward: [(0, '24.094')] [2023-03-07 11:04:53,863][175731] Updated weights for policy 0, policy_version 60550 (0.0007) [2023-03-07 11:04:54,664][175731] Updated weights for policy 0, policy_version 60560 (0.0007) [2023-03-07 11:04:55,469][175731] Updated weights for policy 0, policy_version 60570 (0.0006) [2023-03-07 11:04:56,260][175731] Updated weights for policy 0, policy_version 60580 (0.0007) [2023-03-07 11:04:57,066][175731] Updated weights for policy 0, policy_version 60590 (0.0007) [2023-03-07 11:04:57,866][175731] Updated weights for policy 0, policy_version 60600 (0.0007) [2023-03-07 11:04:58,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12834.1, 300 sec: 12846.9). Total num frames: 62059520. Throughput: 0: 12841.6. Samples: 62033693. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:04:58,322][175405] Avg episode reward: [(0, '24.700')] [2023-03-07 11:04:58,645][175731] Updated weights for policy 0, policy_version 60610 (0.0006) [2023-03-07 11:04:59,454][175731] Updated weights for policy 0, policy_version 60620 (0.0007) [2023-03-07 11:05:00,257][175731] Updated weights for policy 0, policy_version 60630 (0.0007) [2023-03-07 11:05:01,070][175731] Updated weights for policy 0, policy_version 60640 (0.0007) [2023-03-07 11:05:01,871][175731] Updated weights for policy 0, policy_version 60650 (0.0007) [2023-03-07 11:05:02,651][175731] Updated weights for policy 0, policy_version 60660 (0.0006) [2023-03-07 11:05:03,321][175405] Fps is (10 sec: 12800.2, 60 sec: 12834.1, 300 sec: 12850.3). Total num frames: 62124032. Throughput: 0: 12830.1. Samples: 62110334. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:05:03,322][175405] Avg episode reward: [(0, '23.166')] [2023-03-07 11:05:03,468][175731] Updated weights for policy 0, policy_version 60670 (0.0007) [2023-03-07 11:05:04,250][175731] Updated weights for policy 0, policy_version 60680 (0.0006) [2023-03-07 11:05:05,074][175731] Updated weights for policy 0, policy_version 60690 (0.0007) [2023-03-07 11:05:05,875][175731] Updated weights for policy 0, policy_version 60700 (0.0007) [2023-03-07 11:05:06,665][175731] Updated weights for policy 0, policy_version 60710 (0.0006) [2023-03-07 11:05:07,473][175731] Updated weights for policy 0, policy_version 60720 (0.0006) [2023-03-07 11:05:08,257][175731] Updated weights for policy 0, policy_version 60730 (0.0006) [2023-03-07 11:05:08,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12843.4). Total num frames: 62187520. Throughput: 0: 12830.4. Samples: 62187305. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:05:08,322][175405] Avg episode reward: [(0, '23.633')] [2023-03-07 11:05:09,066][175731] Updated weights for policy 0, policy_version 60740 (0.0006) [2023-03-07 11:05:09,864][175731] Updated weights for policy 0, policy_version 60750 (0.0006) [2023-03-07 11:05:10,672][175731] Updated weights for policy 0, policy_version 60760 (0.0006) [2023-03-07 11:05:11,448][175731] Updated weights for policy 0, policy_version 60770 (0.0006) [2023-03-07 11:05:12,249][175731] Updated weights for policy 0, policy_version 60780 (0.0006) [2023-03-07 11:05:13,054][175731] Updated weights for policy 0, policy_version 60790 (0.0007) [2023-03-07 11:05:13,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12843.4). Total num frames: 62252032. Throughput: 0: 12826.0. Samples: 62225814. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:05:13,322][175405] Avg episode reward: [(0, '23.309')] [2023-03-07 11:05:13,865][175731] Updated weights for policy 0, policy_version 60800 (0.0007) [2023-03-07 11:05:14,653][175731] Updated weights for policy 0, policy_version 60810 (0.0006) [2023-03-07 11:05:15,446][175731] Updated weights for policy 0, policy_version 60820 (0.0006) [2023-03-07 11:05:16,249][175731] Updated weights for policy 0, policy_version 60830 (0.0006) [2023-03-07 11:05:17,046][175731] Updated weights for policy 0, policy_version 60840 (0.0006) [2023-03-07 11:05:17,848][175731] Updated weights for policy 0, policy_version 60850 (0.0006) [2023-03-07 11:05:18,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12851.3, 300 sec: 12846.9). Total num frames: 62316544. Throughput: 0: 12827.2. Samples: 62302566. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:05:18,321][175405] Avg episode reward: [(0, '24.142')] [2023-03-07 11:05:18,633][175731] Updated weights for policy 0, policy_version 60860 (0.0006) [2023-03-07 11:05:19,434][175731] Updated weights for policy 0, policy_version 60870 (0.0006) [2023-03-07 11:05:20,243][175731] Updated weights for policy 0, policy_version 60880 (0.0006) [2023-03-07 11:05:21,032][175731] Updated weights for policy 0, policy_version 60890 (0.0006) [2023-03-07 11:05:21,841][175731] Updated weights for policy 0, policy_version 60900 (0.0006) [2023-03-07 11:05:22,635][175731] Updated weights for policy 0, policy_version 60910 (0.0006) [2023-03-07 11:05:23,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12817.0, 300 sec: 12843.4). Total num frames: 62380032. Throughput: 0: 12828.0. Samples: 62379636. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:05:23,322][175405] Avg episode reward: [(0, '24.218')] [2023-03-07 11:05:23,417][175731] Updated weights for policy 0, policy_version 60920 (0.0006) [2023-03-07 11:05:24,238][175731] Updated weights for policy 0, policy_version 60930 (0.0006) [2023-03-07 11:05:25,020][175731] Updated weights for policy 0, policy_version 60940 (0.0006) [2023-03-07 11:05:25,827][175731] Updated weights for policy 0, policy_version 60950 (0.0007) [2023-03-07 11:05:26,609][175731] Updated weights for policy 0, policy_version 60960 (0.0007) [2023-03-07 11:05:27,410][175731] Updated weights for policy 0, policy_version 60970 (0.0007) [2023-03-07 11:05:28,227][175731] Updated weights for policy 0, policy_version 60980 (0.0007) [2023-03-07 11:05:28,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.2, 300 sec: 12843.4). Total num frames: 62444544. Throughput: 0: 12827.5. Samples: 62418261. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:05:28,321][175405] Avg episode reward: [(0, '24.444')] [2023-03-07 11:05:29,021][175731] Updated weights for policy 0, policy_version 60990 (0.0007) [2023-03-07 11:05:29,824][175731] Updated weights for policy 0, policy_version 61000 (0.0006) [2023-03-07 11:05:30,608][175731] Updated weights for policy 0, policy_version 61010 (0.0007) [2023-03-07 11:05:31,408][175731] Updated weights for policy 0, policy_version 61020 (0.0006) [2023-03-07 11:05:32,211][175731] Updated weights for policy 0, policy_version 61030 (0.0007) [2023-03-07 11:05:33,004][175731] Updated weights for policy 0, policy_version 61040 (0.0006) [2023-03-07 11:05:33,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12843.4). Total num frames: 62508032. Throughput: 0: 12826.2. Samples: 62495146. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:05:33,322][175405] Avg episode reward: [(0, '24.286')] [2023-03-07 11:05:33,807][175731] Updated weights for policy 0, policy_version 61050 (0.0006) [2023-03-07 11:05:34,630][175731] Updated weights for policy 0, policy_version 61060 (0.0007) [2023-03-07 11:05:35,413][175731] Updated weights for policy 0, policy_version 61070 (0.0006) [2023-03-07 11:05:36,204][175731] Updated weights for policy 0, policy_version 61080 (0.0006) [2023-03-07 11:05:37,013][175731] Updated weights for policy 0, policy_version 61090 (0.0006) [2023-03-07 11:05:37,834][175731] Updated weights for policy 0, policy_version 61100 (0.0008) [2023-03-07 11:05:38,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12834.1, 300 sec: 12843.4). Total num frames: 62572544. Throughput: 0: 12813.2. Samples: 62571760. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:05:38,322][175405] Avg episode reward: [(0, '24.351')] [2023-03-07 11:05:38,627][175731] Updated weights for policy 0, policy_version 61110 (0.0007) [2023-03-07 11:05:39,433][175731] Updated weights for policy 0, policy_version 61120 (0.0006) [2023-03-07 11:05:40,228][175731] Updated weights for policy 0, policy_version 61130 (0.0006) [2023-03-07 11:05:41,013][175731] Updated weights for policy 0, policy_version 61140 (0.0006) [2023-03-07 11:05:41,826][175731] Updated weights for policy 0, policy_version 61150 (0.0006) [2023-03-07 11:05:42,619][175731] Updated weights for policy 0, policy_version 61160 (0.0006) [2023-03-07 11:05:43,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12839.9). Total num frames: 62636032. Throughput: 0: 12810.2. Samples: 62610150. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:05:43,322][175405] Avg episode reward: [(0, '23.884')] [2023-03-07 11:05:43,418][175731] Updated weights for policy 0, policy_version 61170 (0.0006) [2023-03-07 11:05:44,221][175731] Updated weights for policy 0, policy_version 61180 (0.0006) [2023-03-07 11:05:45,017][175731] Updated weights for policy 0, policy_version 61190 (0.0007) [2023-03-07 11:05:45,828][175731] Updated weights for policy 0, policy_version 61200 (0.0006) [2023-03-07 11:05:46,617][175731] Updated weights for policy 0, policy_version 61210 (0.0007) [2023-03-07 11:05:47,421][175731] Updated weights for policy 0, policy_version 61220 (0.0007) [2023-03-07 11:05:48,218][175731] Updated weights for policy 0, policy_version 61230 (0.0006) [2023-03-07 11:05:48,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12843.4). Total num frames: 62700544. Throughput: 0: 12816.0. Samples: 62687053. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:05:48,321][175405] Avg episode reward: [(0, '23.861')] [2023-03-07 11:05:48,326][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000061231_62700544.pth... [2023-03-07 11:05:48,355][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000058221_59618304.pth [2023-03-07 11:05:49,013][175731] Updated weights for policy 0, policy_version 61240 (0.0007) [2023-03-07 11:05:49,808][175731] Updated weights for policy 0, policy_version 61250 (0.0006) [2023-03-07 11:05:50,614][175731] Updated weights for policy 0, policy_version 61260 (0.0007) [2023-03-07 11:05:51,409][175731] Updated weights for policy 0, policy_version 61270 (0.0007) [2023-03-07 11:05:52,199][175731] Updated weights for policy 0, policy_version 61280 (0.0006) [2023-03-07 11:05:52,994][175731] Updated weights for policy 0, policy_version 61290 (0.0006) [2023-03-07 11:05:53,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12817.1, 300 sec: 12843.4). Total num frames: 62765056. Throughput: 0: 12819.9. Samples: 62764202. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:05:53,322][175405] Avg episode reward: [(0, '23.122')] [2023-03-07 11:05:53,776][175731] Updated weights for policy 0, policy_version 61300 (0.0006) [2023-03-07 11:05:54,569][175731] Updated weights for policy 0, policy_version 61310 (0.0007) [2023-03-07 11:05:55,385][175731] Updated weights for policy 0, policy_version 61320 (0.0006) [2023-03-07 11:05:56,179][175731] Updated weights for policy 0, policy_version 61330 (0.0008) [2023-03-07 11:05:56,971][175731] Updated weights for policy 0, policy_version 61340 (0.0006) [2023-03-07 11:05:57,756][175731] Updated weights for policy 0, policy_version 61350 (0.0006) [2023-03-07 11:05:58,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12817.1, 300 sec: 12839.9). Total num frames: 62828544. Throughput: 0: 12822.7. 
Samples: 62802835. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:05:58,322][175405] Avg episode reward: [(0, '24.903')] [2023-03-07 11:05:58,549][175731] Updated weights for policy 0, policy_version 61360 (0.0006) [2023-03-07 11:05:59,353][175731] Updated weights for policy 0, policy_version 61370 (0.0007) [2023-03-07 11:06:00,149][175731] Updated weights for policy 0, policy_version 61380 (0.0006) [2023-03-07 11:06:00,963][175731] Updated weights for policy 0, policy_version 61390 (0.0006) [2023-03-07 11:06:01,754][175731] Updated weights for policy 0, policy_version 61400 (0.0007) [2023-03-07 11:06:02,534][175731] Updated weights for policy 0, policy_version 61410 (0.0007) [2023-03-07 11:06:03,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12817.0, 300 sec: 12839.9). Total num frames: 62893056. Throughput: 0: 12830.2. Samples: 62879925. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:06:03,322][175405] Avg episode reward: [(0, '25.036')] [2023-03-07 11:06:03,343][175731] Updated weights for policy 0, policy_version 61420 (0.0006) [2023-03-07 11:06:04,141][175731] Updated weights for policy 0, policy_version 61430 (0.0006) [2023-03-07 11:06:04,926][175731] Updated weights for policy 0, policy_version 61440 (0.0006) [2023-03-07 11:06:05,735][175731] Updated weights for policy 0, policy_version 61450 (0.0007) [2023-03-07 11:06:06,528][175731] Updated weights for policy 0, policy_version 61460 (0.0008) [2023-03-07 11:06:07,322][175731] Updated weights for policy 0, policy_version 61470 (0.0007) [2023-03-07 11:06:08,110][175731] Updated weights for policy 0, policy_version 61480 (0.0006) [2023-03-07 11:06:08,321][175405] Fps is (10 sec: 12902.6, 60 sec: 12834.1, 300 sec: 12839.9). Total num frames: 62957568. Throughput: 0: 12829.6. Samples: 62956966. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:06:08,321][175405] Avg episode reward: [(0, '25.158')] [2023-03-07 11:06:08,929][175731] Updated weights for policy 0, policy_version 61490 (0.0006) [2023-03-07 11:06:09,733][175731] Updated weights for policy 0, policy_version 61500 (0.0006) [2023-03-07 11:06:10,544][175731] Updated weights for policy 0, policy_version 61510 (0.0006) [2023-03-07 11:06:11,346][175731] Updated weights for policy 0, policy_version 61520 (0.0007) [2023-03-07 11:06:12,148][175731] Updated weights for policy 0, policy_version 61530 (0.0006) [2023-03-07 11:06:12,944][175731] Updated weights for policy 0, policy_version 61540 (0.0007) [2023-03-07 11:06:13,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12839.9). Total num frames: 63021056. Throughput: 0: 12819.8. Samples: 62995154. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:06:13,321][175405] Avg episode reward: [(0, '24.063')] [2023-03-07 11:06:13,730][175731] Updated weights for policy 0, policy_version 61550 (0.0006) [2023-03-07 11:06:14,538][175731] Updated weights for policy 0, policy_version 61560 (0.0006) [2023-03-07 11:06:15,337][175731] Updated weights for policy 0, policy_version 61570 (0.0006) [2023-03-07 11:06:16,119][175731] Updated weights for policy 0, policy_version 61580 (0.0007) [2023-03-07 11:06:16,924][175731] Updated weights for policy 0, policy_version 61590 (0.0006) [2023-03-07 11:06:17,721][175731] Updated weights for policy 0, policy_version 61600 (0.0006) [2023-03-07 11:06:18,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12817.0, 300 sec: 12839.9). Total num frames: 63085568. Throughput: 0: 12825.8. Samples: 63072306. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:06:18,322][175405] Avg episode reward: [(0, '24.201')] [2023-03-07 11:06:18,517][175731] Updated weights for policy 0, policy_version 61610 (0.0007) [2023-03-07 11:06:19,328][175731] Updated weights for policy 0, policy_version 61620 (0.0007) [2023-03-07 11:06:20,118][175731] Updated weights for policy 0, policy_version 61630 (0.0007) [2023-03-07 11:06:20,920][175731] Updated weights for policy 0, policy_version 61640 (0.0006) [2023-03-07 11:06:21,698][175731] Updated weights for policy 0, policy_version 61650 (0.0006) [2023-03-07 11:06:22,514][175731] Updated weights for policy 0, policy_version 61660 (0.0006) [2023-03-07 11:06:23,304][175731] Updated weights for policy 0, policy_version 61670 (0.0006) [2023-03-07 11:06:23,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12834.2, 300 sec: 12839.9). Total num frames: 63150080. Throughput: 0: 12834.1. Samples: 63149294. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:06:23,321][175405] Avg episode reward: [(0, '25.594')] [2023-03-07 11:06:24,098][175731] Updated weights for policy 0, policy_version 61680 (0.0007) [2023-03-07 11:06:24,893][175731] Updated weights for policy 0, policy_version 61690 (0.0007) [2023-03-07 11:06:25,698][175731] Updated weights for policy 0, policy_version 61700 (0.0007) [2023-03-07 11:06:26,497][175731] Updated weights for policy 0, policy_version 61710 (0.0006) [2023-03-07 11:06:27,285][175731] Updated weights for policy 0, policy_version 61720 (0.0006) [2023-03-07 11:06:28,099][175731] Updated weights for policy 0, policy_version 61730 (0.0006) [2023-03-07 11:06:28,321][175405] Fps is (10 sec: 12800.2, 60 sec: 12817.1, 300 sec: 12839.9). Total num frames: 63213568. Throughput: 0: 12836.6. Samples: 63187795. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:06:28,321][175405] Avg episode reward: [(0, '23.515')] [2023-03-07 11:06:28,885][175731] Updated weights for policy 0, policy_version 61740 (0.0006) [2023-03-07 11:06:29,686][175731] Updated weights for policy 0, policy_version 61750 (0.0006) [2023-03-07 11:06:30,490][175731] Updated weights for policy 0, policy_version 61760 (0.0007) [2023-03-07 11:06:31,274][175731] Updated weights for policy 0, policy_version 61770 (0.0006) [2023-03-07 11:06:32,082][175731] Updated weights for policy 0, policy_version 61780 (0.0006) [2023-03-07 11:06:32,874][175731] Updated weights for policy 0, policy_version 61790 (0.0007) [2023-03-07 11:06:33,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12839.9). Total num frames: 63278080. Throughput: 0: 12838.6. Samples: 63264789. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:06:33,322][175405] Avg episode reward: [(0, '24.027')] [2023-03-07 11:06:33,664][175731] Updated weights for policy 0, policy_version 61800 (0.0007) [2023-03-07 11:06:34,458][175731] Updated weights for policy 0, policy_version 61810 (0.0006) [2023-03-07 11:06:35,262][175731] Updated weights for policy 0, policy_version 61820 (0.0006) [2023-03-07 11:06:36,046][175731] Updated weights for policy 0, policy_version 61830 (0.0007) [2023-03-07 11:06:36,838][175731] Updated weights for policy 0, policy_version 61840 (0.0007) [2023-03-07 11:06:37,649][175731] Updated weights for policy 0, policy_version 61850 (0.0006) [2023-03-07 11:06:38,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12834.2, 300 sec: 12839.9). Total num frames: 63342592. Throughput: 0: 12842.6. Samples: 63342119. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:06:38,322][175405] Avg episode reward: [(0, '23.102')] [2023-03-07 11:06:38,430][175731] Updated weights for policy 0, policy_version 61860 (0.0006) [2023-03-07 11:06:39,220][175731] Updated weights for policy 0, policy_version 61870 (0.0006) [2023-03-07 11:06:40,026][175731] Updated weights for policy 0, policy_version 61880 (0.0006) [2023-03-07 11:06:40,832][175731] Updated weights for policy 0, policy_version 61890 (0.0006) [2023-03-07 11:06:41,625][175731] Updated weights for policy 0, policy_version 61900 (0.0006) [2023-03-07 11:06:42,422][175731] Updated weights for policy 0, policy_version 61910 (0.0006) [2023-03-07 11:06:43,234][175731] Updated weights for policy 0, policy_version 61920 (0.0006) [2023-03-07 11:06:43,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12843.4). Total num frames: 63407104. Throughput: 0: 12842.0. Samples: 63380725. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:06:43,322][175405] Avg episode reward: [(0, '24.268')] [2023-03-07 11:06:44,037][175731] Updated weights for policy 0, policy_version 61930 (0.0006) [2023-03-07 11:06:44,830][175731] Updated weights for policy 0, policy_version 61940 (0.0006) [2023-03-07 11:06:45,621][175731] Updated weights for policy 0, policy_version 61950 (0.0006) [2023-03-07 11:06:46,405][175731] Updated weights for policy 0, policy_version 61960 (0.0007) [2023-03-07 11:06:47,190][175731] Updated weights for policy 0, policy_version 61970 (0.0006) [2023-03-07 11:06:48,006][175731] Updated weights for policy 0, policy_version 61980 (0.0006) [2023-03-07 11:06:48,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12843.4). Total num frames: 63471616. Throughput: 0: 12842.8. Samples: 63457848. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:06:48,322][175405] Avg episode reward: [(0, '24.920')] [2023-03-07 11:06:48,804][175731] Updated weights for policy 0, policy_version 61990 (0.0006) [2023-03-07 11:06:49,611][175731] Updated weights for policy 0, policy_version 62000 (0.0007) [2023-03-07 11:06:50,421][175731] Updated weights for policy 0, policy_version 62010 (0.0007) [2023-03-07 11:06:51,216][175731] Updated weights for policy 0, policy_version 62020 (0.0006) [2023-03-07 11:06:51,993][175731] Updated weights for policy 0, policy_version 62030 (0.0007) [2023-03-07 11:06:52,803][175731] Updated weights for policy 0, policy_version 62040 (0.0007) [2023-03-07 11:06:53,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12843.4). Total num frames: 63535104. Throughput: 0: 12835.8. Samples: 63534580. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:06:53,322][175405] Avg episode reward: [(0, '25.053')] [2023-03-07 11:06:53,612][175731] Updated weights for policy 0, policy_version 62050 (0.0007) [2023-03-07 11:06:54,401][175731] Updated weights for policy 0, policy_version 62060 (0.0006) [2023-03-07 11:06:55,208][175731] Updated weights for policy 0, policy_version 62070 (0.0006) [2023-03-07 11:06:55,999][175731] Updated weights for policy 0, policy_version 62080 (0.0008) [2023-03-07 11:06:56,790][175731] Updated weights for policy 0, policy_version 62090 (0.0007) [2023-03-07 11:06:57,586][175731] Updated weights for policy 0, policy_version 62100 (0.0006) [2023-03-07 11:06:58,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12851.2, 300 sec: 12843.4). Total num frames: 63599616. Throughput: 0: 12843.4. Samples: 63573108. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:06:58,322][175405] Avg episode reward: [(0, '24.710')] [2023-03-07 11:06:58,365][175731] Updated weights for policy 0, policy_version 62110 (0.0006) [2023-03-07 11:06:59,158][175731] Updated weights for policy 0, policy_version 62120 (0.0006) [2023-03-07 11:06:59,970][175731] Updated weights for policy 0, policy_version 62130 (0.0007) [2023-03-07 11:07:00,754][175731] Updated weights for policy 0, policy_version 62140 (0.0006) [2023-03-07 11:07:01,538][175731] Updated weights for policy 0, policy_version 62150 (0.0007) [2023-03-07 11:07:02,333][175731] Updated weights for policy 0, policy_version 62160 (0.0006) [2023-03-07 11:07:03,141][175731] Updated weights for policy 0, policy_version 62170 (0.0008) [2023-03-07 11:07:03,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12851.2, 300 sec: 12839.9). Total num frames: 63664128. Throughput: 0: 12847.6. Samples: 63650449. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:07:03,322][175405] Avg episode reward: [(0, '22.714')] [2023-03-07 11:07:03,937][175731] Updated weights for policy 0, policy_version 62180 (0.0006) [2023-03-07 11:07:04,729][175731] Updated weights for policy 0, policy_version 62190 (0.0006) [2023-03-07 11:07:05,526][175731] Updated weights for policy 0, policy_version 62200 (0.0007) [2023-03-07 11:07:06,367][175731] Updated weights for policy 0, policy_version 62210 (0.0006) [2023-03-07 11:07:07,160][175731] Updated weights for policy 0, policy_version 62220 (0.0007) [2023-03-07 11:07:07,946][175731] Updated weights for policy 0, policy_version 62230 (0.0007) [2023-03-07 11:07:08,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12839.9). Total num frames: 63727616. Throughput: 0: 12844.1. Samples: 63727282. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:07:08,322][175405] Avg episode reward: [(0, '24.227')] [2023-03-07 11:07:08,762][175731] Updated weights for policy 0, policy_version 62240 (0.0007) [2023-03-07 11:07:09,567][175731] Updated weights for policy 0, policy_version 62250 (0.0006) [2023-03-07 11:07:10,360][175731] Updated weights for policy 0, policy_version 62260 (0.0008) [2023-03-07 11:07:11,161][175731] Updated weights for policy 0, policy_version 62270 (0.0006) [2023-03-07 11:07:11,941][175731] Updated weights for policy 0, policy_version 62280 (0.0007) [2023-03-07 11:07:12,738][175731] Updated weights for policy 0, policy_version 62290 (0.0007) [2023-03-07 11:07:13,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12851.2, 300 sec: 12839.9). Total num frames: 63792128. Throughput: 0: 12843.1. Samples: 63765737. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:07:13,322][175405] Avg episode reward: [(0, '25.746')] [2023-03-07 11:07:13,566][175731] Updated weights for policy 0, policy_version 62300 (0.0008) [2023-03-07 11:07:14,343][175731] Updated weights for policy 0, policy_version 62310 (0.0007) [2023-03-07 11:07:15,114][175731] Updated weights for policy 0, policy_version 62320 (0.0006) [2023-03-07 11:07:15,926][175731] Updated weights for policy 0, policy_version 62330 (0.0006) [2023-03-07 11:07:16,707][175731] Updated weights for policy 0, policy_version 62340 (0.0006) [2023-03-07 11:07:17,521][175731] Updated weights for policy 0, policy_version 62350 (0.0007) [2023-03-07 11:07:18,321][175405] Fps is (10 sec: 12800.2, 60 sec: 12834.2, 300 sec: 12836.4). Total num frames: 63855616. Throughput: 0: 12846.8. Samples: 63842893. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:07:18,326][175731] Updated weights for policy 0, policy_version 62360 (0.0007) [2023-03-07 11:07:18,332][175405] Avg episode reward: [(0, '24.133')] [2023-03-07 11:07:19,127][175731] Updated weights for policy 0, policy_version 62370 (0.0007) [2023-03-07 11:07:19,933][175731] Updated weights for policy 0, policy_version 62380 (0.0007) [2023-03-07 11:07:20,734][175731] Updated weights for policy 0, policy_version 62390 (0.0007) [2023-03-07 11:07:21,519][175731] Updated weights for policy 0, policy_version 62400 (0.0006) [2023-03-07 11:07:22,334][175731] Updated weights for policy 0, policy_version 62410 (0.0006) [2023-03-07 11:07:23,129][175731] Updated weights for policy 0, policy_version 62420 (0.0007) [2023-03-07 11:07:23,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12834.1, 300 sec: 12836.4). Total num frames: 63920128. Throughput: 0: 12833.2. Samples: 63919613. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:07:23,332][175405] Avg episode reward: [(0, '23.155')] [2023-03-07 11:07:23,916][175731] Updated weights for policy 0, policy_version 62430 (0.0006) [2023-03-07 11:07:24,743][175731] Updated weights for policy 0, policy_version 62440 (0.0006) [2023-03-07 11:07:25,517][175731] Updated weights for policy 0, policy_version 62450 (0.0006) [2023-03-07 11:07:26,317][175731] Updated weights for policy 0, policy_version 62460 (0.0006) [2023-03-07 11:07:27,121][175731] Updated weights for policy 0, policy_version 62470 (0.0006) [2023-03-07 11:07:27,922][175731] Updated weights for policy 0, policy_version 62480 (0.0006) [2023-03-07 11:07:28,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12839.9). Total num frames: 63984640. Throughput: 0: 12828.8. Samples: 63958020. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:07:28,332][175405] Avg episode reward: [(0, '23.752')] [2023-03-07 11:07:28,705][175731] Updated weights for policy 0, policy_version 62490 (0.0006) [2023-03-07 11:07:29,513][175731] Updated weights for policy 0, policy_version 62500 (0.0007) [2023-03-07 11:07:30,322][175731] Updated weights for policy 0, policy_version 62510 (0.0007) [2023-03-07 11:07:31,125][175731] Updated weights for policy 0, policy_version 62520 (0.0007) [2023-03-07 11:07:31,942][175731] Updated weights for policy 0, policy_version 62530 (0.0006) [2023-03-07 11:07:32,728][175731] Updated weights for policy 0, policy_version 62540 (0.0007) [2023-03-07 11:07:33,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12834.1, 300 sec: 12836.4). Total num frames: 64048128. Throughput: 0: 12823.0. Samples: 64034882. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:07:33,332][175405] Avg episode reward: [(0, '23.801')] [2023-03-07 11:07:33,531][175731] Updated weights for policy 0, policy_version 62550 (0.0006) [2023-03-07 11:07:34,329][175731] Updated weights for policy 0, policy_version 62560 (0.0006) [2023-03-07 11:07:35,117][175731] Updated weights for policy 0, policy_version 62570 (0.0006) [2023-03-07 11:07:35,923][175731] Updated weights for policy 0, policy_version 62580 (0.0007) [2023-03-07 11:07:36,731][175731] Updated weights for policy 0, policy_version 62590 (0.0006) [2023-03-07 11:07:37,526][175731] Updated weights for policy 0, policy_version 62600 (0.0006) [2023-03-07 11:07:38,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12817.1, 300 sec: 12833.0). Total num frames: 64111616. Throughput: 0: 12822.6. Samples: 64111595. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:07:38,323][175731] Updated weights for policy 0, policy_version 62610 (0.0006) [2023-03-07 11:07:38,326][175405] Avg episode reward: [(0, '25.876')] [2023-03-07 11:07:39,118][175731] Updated weights for policy 0, policy_version 62620 (0.0006) [2023-03-07 11:07:39,922][175731] Updated weights for policy 0, policy_version 62630 (0.0006) [2023-03-07 11:07:40,723][175731] Updated weights for policy 0, policy_version 62640 (0.0006) [2023-03-07 11:07:41,521][175731] Updated weights for policy 0, policy_version 62650 (0.0006) [2023-03-07 11:07:42,333][175731] Updated weights for policy 0, policy_version 62660 (0.0006) [2023-03-07 11:07:43,125][175731] Updated weights for policy 0, policy_version 62670 (0.0006) [2023-03-07 11:07:43,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12817.0, 300 sec: 12833.0). Total num frames: 64176128. Throughput: 0: 12817.3. Samples: 64149886. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:07:43,332][175405] Avg episode reward: [(0, '24.475')] [2023-03-07 11:07:43,910][175731] Updated weights for policy 0, policy_version 62680 (0.0007) [2023-03-07 11:07:44,701][175731] Updated weights for policy 0, policy_version 62690 (0.0006) [2023-03-07 11:07:45,526][175731] Updated weights for policy 0, policy_version 62700 (0.0006) [2023-03-07 11:07:46,317][175731] Updated weights for policy 0, policy_version 62710 (0.0006) [2023-03-07 11:07:47,106][175731] Updated weights for policy 0, policy_version 62720 (0.0006) [2023-03-07 11:07:47,913][175731] Updated weights for policy 0, policy_version 62730 (0.0007) [2023-03-07 11:07:48,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12817.0, 300 sec: 12836.4). Total num frames: 64240640. Throughput: 0: 12813.6. Samples: 64227060. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:07:48,332][175405] Avg episode reward: [(0, '24.733')] [2023-03-07 11:07:48,337][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000062735_64240640.pth... [2023-03-07 11:07:48,367][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000059728_61161472.pth [2023-03-07 11:07:48,722][175731] Updated weights for policy 0, policy_version 62740 (0.0006) [2023-03-07 11:07:49,514][175731] Updated weights for policy 0, policy_version 62750 (0.0006) [2023-03-07 11:07:50,316][175731] Updated weights for policy 0, policy_version 62760 (0.0007) [2023-03-07 11:07:51,122][175731] Updated weights for policy 0, policy_version 62770 (0.0006) [2023-03-07 11:07:51,920][175731] Updated weights for policy 0, policy_version 62780 (0.0007) [2023-03-07 11:07:52,710][175731] Updated weights for policy 0, policy_version 62790 (0.0007) [2023-03-07 11:07:53,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12833.0). Total num frames: 64304128. Throughput: 0: 12812.4. Samples: 64303838. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:07:53,322][175405] Avg episode reward: [(0, '25.304')] [2023-03-07 11:07:53,502][175731] Updated weights for policy 0, policy_version 62800 (0.0006) [2023-03-07 11:07:54,298][175731] Updated weights for policy 0, policy_version 62810 (0.0008) [2023-03-07 11:07:55,084][175731] Updated weights for policy 0, policy_version 62820 (0.0006) [2023-03-07 11:07:55,889][175731] Updated weights for policy 0, policy_version 62830 (0.0006) [2023-03-07 11:07:56,677][175731] Updated weights for policy 0, policy_version 62840 (0.0006) [2023-03-07 11:07:57,481][175731] Updated weights for policy 0, policy_version 62850 (0.0007) [2023-03-07 11:07:58,269][175731] Updated weights for policy 0, policy_version 62860 (0.0006) [2023-03-07 11:07:58,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12833.0). Total num frames: 64368640. Throughput: 0: 12817.0. Samples: 64342502. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:07:58,322][175405] Avg episode reward: [(0, '23.449')] [2023-03-07 11:07:59,067][175731] Updated weights for policy 0, policy_version 62870 (0.0007) [2023-03-07 11:07:59,873][175731] Updated weights for policy 0, policy_version 62880 (0.0006) [2023-03-07 11:08:00,672][175731] Updated weights for policy 0, policy_version 62890 (0.0006) [2023-03-07 11:08:01,443][175731] Updated weights for policy 0, policy_version 62900 (0.0007) [2023-03-07 11:08:02,236][175731] Updated weights for policy 0, policy_version 62910 (0.0006) [2023-03-07 11:08:03,023][175731] Updated weights for policy 0, policy_version 62920 (0.0006) [2023-03-07 11:08:03,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12817.1, 300 sec: 12836.4). Total num frames: 64433152. Throughput: 0: 12821.5. Samples: 64419862. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:08:03,321][175405] Avg episode reward: [(0, '24.749')] [2023-03-07 11:08:03,812][175731] Updated weights for policy 0, policy_version 62930 (0.0006) [2023-03-07 11:08:04,619][175731] Updated weights for policy 0, policy_version 62940 (0.0006) [2023-03-07 11:08:05,422][175731] Updated weights for policy 0, policy_version 62950 (0.0006) [2023-03-07 11:08:06,210][175731] Updated weights for policy 0, policy_version 62960 (0.0007) [2023-03-07 11:08:06,997][175731] Updated weights for policy 0, policy_version 62970 (0.0006) [2023-03-07 11:08:07,798][175731] Updated weights for policy 0, policy_version 62980 (0.0006) [2023-03-07 11:08:08,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12834.1, 300 sec: 12836.4). Total num frames: 64497664. Throughput: 0: 12834.0. Samples: 64497145. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:08:08,322][175405] Avg episode reward: [(0, '24.141')] [2023-03-07 11:08:08,593][175731] Updated weights for policy 0, policy_version 62990 (0.0006) [2023-03-07 11:08:09,389][175731] Updated weights for policy 0, policy_version 63000 (0.0006) [2023-03-07 11:08:10,189][175731] Updated weights for policy 0, policy_version 63010 (0.0006) [2023-03-07 11:08:10,995][175731] Updated weights for policy 0, policy_version 63020 (0.0006) [2023-03-07 11:08:11,794][175731] Updated weights for policy 0, policy_version 63030 (0.0006) [2023-03-07 11:08:12,606][175731] Updated weights for policy 0, policy_version 63040 (0.0006) [2023-03-07 11:08:13,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12834.1, 300 sec: 12836.4). Total num frames: 64562176. Throughput: 0: 12834.7. Samples: 64535583. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:08:13,322][175405] Avg episode reward: [(0, '23.816')] [2023-03-07 11:08:13,405][175731] Updated weights for policy 0, policy_version 63050 (0.0006) [2023-03-07 11:08:14,198][175731] Updated weights for policy 0, policy_version 63060 (0.0007) [2023-03-07 11:08:14,978][175731] Updated weights for policy 0, policy_version 63070 (0.0007) [2023-03-07 11:08:15,790][175731] Updated weights for policy 0, policy_version 63080 (0.0006) [2023-03-07 11:08:16,586][175731] Updated weights for policy 0, policy_version 63090 (0.0007) [2023-03-07 11:08:17,375][175731] Updated weights for policy 0, policy_version 63100 (0.0006) [2023-03-07 11:08:18,194][175731] Updated weights for policy 0, policy_version 63110 (0.0006) [2023-03-07 11:08:18,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12833.0). Total num frames: 64625664. Throughput: 0: 12838.4. Samples: 64612611. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:08:18,322][175405] Avg episode reward: [(0, '24.172')] [2023-03-07 11:08:18,974][175731] Updated weights for policy 0, policy_version 63120 (0.0008) [2023-03-07 11:08:19,770][175731] Updated weights for policy 0, policy_version 63130 (0.0007) [2023-03-07 11:08:20,589][175731] Updated weights for policy 0, policy_version 63140 (0.0007) [2023-03-07 11:08:21,385][175731] Updated weights for policy 0, policy_version 63150 (0.0007) [2023-03-07 11:08:22,172][175731] Updated weights for policy 0, policy_version 63160 (0.0007) [2023-03-07 11:08:22,975][175731] Updated weights for policy 0, policy_version 63170 (0.0007) [2023-03-07 11:08:23,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12833.0). Total num frames: 64690176. Throughput: 0: 12841.4. Samples: 64689459. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:08:23,322][175405] Avg episode reward: [(0, '25.209')] [2023-03-07 11:08:23,768][175731] Updated weights for policy 0, policy_version 63180 (0.0007) [2023-03-07 11:08:24,561][175731] Updated weights for policy 0, policy_version 63190 (0.0006) [2023-03-07 11:08:25,365][175731] Updated weights for policy 0, policy_version 63200 (0.0007) [2023-03-07 11:08:26,159][175731] Updated weights for policy 0, policy_version 63210 (0.0006) [2023-03-07 11:08:26,936][175731] Updated weights for policy 0, policy_version 63220 (0.0006) [2023-03-07 11:08:27,751][175731] Updated weights for policy 0, policy_version 63230 (0.0007) [2023-03-07 11:08:28,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12834.1, 300 sec: 12836.4). Total num frames: 64754688. Throughput: 0: 12849.2. Samples: 64728099. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:08:28,322][175405] Avg episode reward: [(0, '25.211')] [2023-03-07 11:08:28,549][175731] Updated weights for policy 0, policy_version 63240 (0.0006) [2023-03-07 11:08:29,324][175731] Updated weights for policy 0, policy_version 63250 (0.0007) [2023-03-07 11:08:30,137][175731] Updated weights for policy 0, policy_version 63260 (0.0006) [2023-03-07 11:08:30,932][175731] Updated weights for policy 0, policy_version 63270 (0.0006) [2023-03-07 11:08:31,728][175731] Updated weights for policy 0, policy_version 63280 (0.0006) [2023-03-07 11:08:32,509][175731] Updated weights for policy 0, policy_version 63290 (0.0007) [2023-03-07 11:08:33,315][175731] Updated weights for policy 0, policy_version 63300 (0.0007) [2023-03-07 11:08:33,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12851.2, 300 sec: 12836.4). Total num frames: 64819200. Throughput: 0: 12853.6. Samples: 64805468. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:08:33,322][175405] Avg episode reward: [(0, '23.761')] [2023-03-07 11:08:34,106][175731] Updated weights for policy 0, policy_version 63310 (0.0006) [2023-03-07 11:08:34,889][175731] Updated weights for policy 0, policy_version 63320 (0.0006) [2023-03-07 11:08:35,708][175731] Updated weights for policy 0, policy_version 63330 (0.0007) [2023-03-07 11:08:36,510][175731] Updated weights for policy 0, policy_version 63340 (0.0007) [2023-03-07 11:08:37,304][175731] Updated weights for policy 0, policy_version 63350 (0.0006) [2023-03-07 11:08:38,096][175731] Updated weights for policy 0, policy_version 63360 (0.0006) [2023-03-07 11:08:38,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12833.0). Total num frames: 64882688. Throughput: 0: 12859.7. Samples: 64882523. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:08:38,321][175405] Avg episode reward: [(0, '25.017')] [2023-03-07 11:08:38,887][175731] Updated weights for policy 0, policy_version 63370 (0.0007) [2023-03-07 11:08:39,711][175731] Updated weights for policy 0, policy_version 63380 (0.0006) [2023-03-07 11:08:40,518][175731] Updated weights for policy 0, policy_version 63390 (0.0007) [2023-03-07 11:08:41,303][175731] Updated weights for policy 0, policy_version 63400 (0.0007) [2023-03-07 11:08:42,098][175731] Updated weights for policy 0, policy_version 63410 (0.0007) [2023-03-07 11:08:42,891][175731] Updated weights for policy 0, policy_version 63420 (0.0006) [2023-03-07 11:08:43,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12851.2, 300 sec: 12833.0). Total num frames: 64947200. Throughput: 0: 12850.1. Samples: 64920755. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:08:43,322][175405] Avg episode reward: [(0, '23.706')] [2023-03-07 11:08:43,688][175731] Updated weights for policy 0, policy_version 63430 (0.0006) [2023-03-07 11:08:44,481][175731] Updated weights for policy 0, policy_version 63440 (0.0006) [2023-03-07 11:08:45,282][175731] Updated weights for policy 0, policy_version 63450 (0.0006) [2023-03-07 11:08:46,071][175731] Updated weights for policy 0, policy_version 63460 (0.0007) [2023-03-07 11:08:46,881][175731] Updated weights for policy 0, policy_version 63470 (0.0006) [2023-03-07 11:08:47,661][175731] Updated weights for policy 0, policy_version 63480 (0.0006) [2023-03-07 11:08:48,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12851.2, 300 sec: 12833.0). Total num frames: 65011712. Throughput: 0: 12849.3. Samples: 64998082. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:08:48,322][175405] Avg episode reward: [(0, '24.735')] [2023-03-07 11:08:48,474][175731] Updated weights for policy 0, policy_version 63490 (0.0007) [2023-03-07 11:08:49,289][175731] Updated weights for policy 0, policy_version 63500 (0.0007) [2023-03-07 11:08:50,092][175731] Updated weights for policy 0, policy_version 63510 (0.0006) [2023-03-07 11:08:50,902][175731] Updated weights for policy 0, policy_version 63520 (0.0007) [2023-03-07 11:08:51,693][175731] Updated weights for policy 0, policy_version 63530 (0.0006) [2023-03-07 11:08:52,482][175731] Updated weights for policy 0, policy_version 63540 (0.0005) [2023-03-07 11:08:53,278][175731] Updated weights for policy 0, policy_version 63550 (0.0007) [2023-03-07 11:08:53,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12833.0). Total num frames: 65075200. Throughput: 0: 12835.9. Samples: 65074760. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:08:53,322][175405] Avg episode reward: [(0, '25.789')] [2023-03-07 11:08:54,073][175731] Updated weights for policy 0, policy_version 63560 (0.0006) [2023-03-07 11:08:54,876][175731] Updated weights for policy 0, policy_version 63570 (0.0006) [2023-03-07 11:08:55,673][175731] Updated weights for policy 0, policy_version 63580 (0.0008) [2023-03-07 11:08:56,461][175731] Updated weights for policy 0, policy_version 63590 (0.0007) [2023-03-07 11:08:57,268][175731] Updated weights for policy 0, policy_version 63600 (0.0006) [2023-03-07 11:08:58,081][175731] Updated weights for policy 0, policy_version 63610 (0.0006) [2023-03-07 11:08:58,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12833.0). Total num frames: 65139712. Throughput: 0: 12836.7. Samples: 65113236. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:08:58,322][175405] Avg episode reward: [(0, '24.681')] [2023-03-07 11:08:58,874][175731] Updated weights for policy 0, policy_version 63620 (0.0006) [2023-03-07 11:08:59,702][175731] Updated weights for policy 0, policy_version 63630 (0.0007) [2023-03-07 11:09:00,482][175731] Updated weights for policy 0, policy_version 63640 (0.0007) [2023-03-07 11:09:01,271][175731] Updated weights for policy 0, policy_version 63650 (0.0007) [2023-03-07 11:09:02,086][175731] Updated weights for policy 0, policy_version 63660 (0.0006) [2023-03-07 11:09:02,886][175731] Updated weights for policy 0, policy_version 63670 (0.0006) [2023-03-07 11:09:03,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12829.5). Total num frames: 65203200. Throughput: 0: 12828.6. Samples: 65189896. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:09:03,322][175405] Avg episode reward: [(0, '24.068')] [2023-03-07 11:09:03,682][175731] Updated weights for policy 0, policy_version 63680 (0.0006) [2023-03-07 11:09:04,496][175731] Updated weights for policy 0, policy_version 63690 (0.0007) [2023-03-07 11:09:05,289][175731] Updated weights for policy 0, policy_version 63700 (0.0006) [2023-03-07 11:09:06,089][175731] Updated weights for policy 0, policy_version 63710 (0.0007) [2023-03-07 11:09:06,918][175731] Updated weights for policy 0, policy_version 63720 (0.0007) [2023-03-07 11:09:07,706][175731] Updated weights for policy 0, policy_version 63730 (0.0005) [2023-03-07 11:09:08,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12817.1, 300 sec: 12829.5). Total num frames: 65266688. Throughput: 0: 12822.3. Samples: 65266462. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:09:08,322][175405] Avg episode reward: [(0, '25.884')] [2023-03-07 11:09:08,505][175731] Updated weights for policy 0, policy_version 63740 (0.0007) [2023-03-07 11:09:09,307][175731] Updated weights for policy 0, policy_version 63750 (0.0006) [2023-03-07 11:09:10,123][175731] Updated weights for policy 0, policy_version 63760 (0.0007) [2023-03-07 11:09:10,905][175731] Updated weights for policy 0, policy_version 63770 (0.0006) [2023-03-07 11:09:11,706][175731] Updated weights for policy 0, policy_version 63780 (0.0006) [2023-03-07 11:09:12,509][175731] Updated weights for policy 0, policy_version 63790 (0.0006) [2023-03-07 11:09:13,309][175731] Updated weights for policy 0, policy_version 63800 (0.0007) [2023-03-07 11:09:13,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12833.0). Total num frames: 65331200. Throughput: 0: 12815.4. Samples: 65304791. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:09:13,322][175405] Avg episode reward: [(0, '23.268')] [2023-03-07 11:09:14,102][175731] Updated weights for policy 0, policy_version 63810 (0.0006) [2023-03-07 11:09:14,889][175731] Updated weights for policy 0, policy_version 63820 (0.0007) [2023-03-07 11:09:15,706][175731] Updated weights for policy 0, policy_version 63830 (0.0006) [2023-03-07 11:09:16,510][175731] Updated weights for policy 0, policy_version 63840 (0.0006) [2023-03-07 11:09:17,294][175731] Updated weights for policy 0, policy_version 63850 (0.0006) [2023-03-07 11:09:18,104][175731] Updated weights for policy 0, policy_version 63860 (0.0007) [2023-03-07 11:09:18,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12826.0). Total num frames: 65394688. Throughput: 0: 12803.7. Samples: 65381633. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:09:18,322][175405] Avg episode reward: [(0, '23.586')] [2023-03-07 11:09:18,885][175731] Updated weights for policy 0, policy_version 63870 (0.0006) [2023-03-07 11:09:19,697][175731] Updated weights for policy 0, policy_version 63880 (0.0007) [2023-03-07 11:09:20,508][175731] Updated weights for policy 0, policy_version 63890 (0.0006) [2023-03-07 11:09:21,313][175731] Updated weights for policy 0, policy_version 63900 (0.0008) [2023-03-07 11:09:22,128][175731] Updated weights for policy 0, policy_version 63910 (0.0007) [2023-03-07 11:09:22,917][175731] Updated weights for policy 0, policy_version 63920 (0.0006) [2023-03-07 11:09:23,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12817.1, 300 sec: 12829.5). Total num frames: 65459200. Throughput: 0: 12794.3. Samples: 65458269. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:09:23,322][175405] Avg episode reward: [(0, '25.283')] [2023-03-07 11:09:23,717][175731] Updated weights for policy 0, policy_version 63930 (0.0007) [2023-03-07 11:09:24,537][175731] Updated weights for policy 0, policy_version 63940 (0.0006) [2023-03-07 11:09:25,329][175731] Updated weights for policy 0, policy_version 63950 (0.0007) [2023-03-07 11:09:26,123][175731] Updated weights for policy 0, policy_version 63960 (0.0006) [2023-03-07 11:09:26,929][175731] Updated weights for policy 0, policy_version 63970 (0.0006) [2023-03-07 11:09:27,718][175731] Updated weights for policy 0, policy_version 63980 (0.0006) [2023-03-07 11:09:28,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12800.0, 300 sec: 12826.0). Total num frames: 65522688. Throughput: 0: 12796.2. Samples: 65496584. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:09:28,321][175405] Avg episode reward: [(0, '25.405')] [2023-03-07 11:09:28,518][175731] Updated weights for policy 0, policy_version 63990 (0.0007) [2023-03-07 11:09:29,318][175731] Updated weights for policy 0, policy_version 64000 (0.0006) [2023-03-07 11:09:30,117][175731] Updated weights for policy 0, policy_version 64010 (0.0007) [2023-03-07 11:09:30,901][175731] Updated weights for policy 0, policy_version 64020 (0.0007) [2023-03-07 11:09:31,690][175731] Updated weights for policy 0, policy_version 64030 (0.0007) [2023-03-07 11:09:32,498][175731] Updated weights for policy 0, policy_version 64040 (0.0006) [2023-03-07 11:09:33,279][175731] Updated weights for policy 0, policy_version 64050 (0.0008) [2023-03-07 11:09:33,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12800.0, 300 sec: 12829.5). Total num frames: 65587200. Throughput: 0: 12791.1. Samples: 65573682. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:09:33,322][175405] Avg episode reward: [(0, '25.556')] [2023-03-07 11:09:34,069][175731] Updated weights for policy 0, policy_version 64060 (0.0006) [2023-03-07 11:09:34,874][175731] Updated weights for policy 0, policy_version 64070 (0.0006) [2023-03-07 11:09:35,676][175731] Updated weights for policy 0, policy_version 64080 (0.0006) [2023-03-07 11:09:36,485][175731] Updated weights for policy 0, policy_version 64090 (0.0006) [2023-03-07 11:09:37,301][175731] Updated weights for policy 0, policy_version 64100 (0.0006) [2023-03-07 11:09:38,080][175731] Updated weights for policy 0, policy_version 64110 (0.0006) [2023-03-07 11:09:38,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12800.0, 300 sec: 12826.0). Total num frames: 65650688. Throughput: 0: 12795.3. Samples: 65650548. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:09:38,322][175405] Avg episode reward: [(0, '24.208')] [2023-03-07 11:09:38,894][175731] Updated weights for policy 0, policy_version 64120 (0.0007) [2023-03-07 11:09:39,696][175731] Updated weights for policy 0, policy_version 64130 (0.0006) [2023-03-07 11:09:40,503][175731] Updated weights for policy 0, policy_version 64140 (0.0007) [2023-03-07 11:09:41,288][175731] Updated weights for policy 0, policy_version 64150 (0.0006) [2023-03-07 11:09:42,089][175731] Updated weights for policy 0, policy_version 64160 (0.0006) [2023-03-07 11:09:42,897][175731] Updated weights for policy 0, policy_version 64170 (0.0006) [2023-03-07 11:09:43,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12800.0, 300 sec: 12826.0). Total num frames: 65715200. Throughput: 0: 12795.4. Samples: 65689027. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:09:43,322][175405] Avg episode reward: [(0, '24.204')] [2023-03-07 11:09:43,664][175731] Updated weights for policy 0, policy_version 64180 (0.0006) [2023-03-07 11:09:44,475][175731] Updated weights for policy 0, policy_version 64190 (0.0007) [2023-03-07 11:09:45,258][175731] Updated weights for policy 0, policy_version 64200 (0.0006) [2023-03-07 11:09:46,056][175731] Updated weights for policy 0, policy_version 64210 (0.0006) [2023-03-07 11:09:46,868][175731] Updated weights for policy 0, policy_version 64220 (0.0007) [2023-03-07 11:09:47,650][175731] Updated weights for policy 0, policy_version 64230 (0.0006) [2023-03-07 11:09:48,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12800.0, 300 sec: 12826.0). Total num frames: 65779712. Throughput: 0: 12804.3. Samples: 65766089. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:09:48,322][175405] Avg episode reward: [(0, '25.473')] [2023-03-07 11:09:48,327][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000064238_65779712.pth... 
[2023-03-07 11:09:48,356][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000061231_62700544.pth [2023-03-07 11:09:48,453][175731] Updated weights for policy 0, policy_version 64240 (0.0006) [2023-03-07 11:09:49,264][175731] Updated weights for policy 0, policy_version 64250 (0.0007) [2023-03-07 11:09:50,073][175731] Updated weights for policy 0, policy_version 64260 (0.0007) [2023-03-07 11:09:50,869][175731] Updated weights for policy 0, policy_version 64270 (0.0007) [2023-03-07 11:09:51,667][175731] Updated weights for policy 0, policy_version 64280 (0.0006) [2023-03-07 11:09:52,449][175731] Updated weights for policy 0, policy_version 64290 (0.0007) [2023-03-07 11:09:53,251][175731] Updated weights for policy 0, policy_version 64300 (0.0007) [2023-03-07 11:09:53,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12826.0). Total num frames: 65843200. Throughput: 0: 12816.4. Samples: 65843198. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:09:53,322][175405] Avg episode reward: [(0, '24.674')] [2023-03-07 11:09:54,035][175731] Updated weights for policy 0, policy_version 64310 (0.0006) [2023-03-07 11:09:54,819][175731] Updated weights for policy 0, policy_version 64320 (0.0006) [2023-03-07 11:09:55,625][175731] Updated weights for policy 0, policy_version 64330 (0.0006) [2023-03-07 11:09:56,410][175731] Updated weights for policy 0, policy_version 64340 (0.0006) [2023-03-07 11:09:57,217][175731] Updated weights for policy 0, policy_version 64350 (0.0006) [2023-03-07 11:09:58,046][175731] Updated weights for policy 0, policy_version 64360 (0.0007) [2023-03-07 11:09:58,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12800.0, 300 sec: 12826.0). Total num frames: 65907712. Throughput: 0: 12823.4. Samples: 65881844. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:09:58,321][175405] Avg episode reward: [(0, '24.332')] [2023-03-07 11:09:58,842][175731] Updated weights for policy 0, policy_version 64370 (0.0006) [2023-03-07 11:09:59,636][175731] Updated weights for policy 0, policy_version 64380 (0.0006) [2023-03-07 11:10:00,426][175731] Updated weights for policy 0, policy_version 64390 (0.0007) [2023-03-07 11:10:01,221][175731] Updated weights for policy 0, policy_version 64400 (0.0007) [2023-03-07 11:10:02,024][175731] Updated weights for policy 0, policy_version 64410 (0.0006) [2023-03-07 11:10:02,832][175731] Updated weights for policy 0, policy_version 64420 (0.0007) [2023-03-07 11:10:03,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12817.1, 300 sec: 12829.5). Total num frames: 65972224. Throughput: 0: 12823.0. Samples: 65958669. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:10:03,322][175405] Avg episode reward: [(0, '27.414')] [2023-03-07 11:10:03,624][175731] Updated weights for policy 0, policy_version 64430 (0.0006) [2023-03-07 11:10:04,421][175731] Updated weights for policy 0, policy_version 64440 (0.0006) [2023-03-07 11:10:05,226][175731] Updated weights for policy 0, policy_version 64450 (0.0007) [2023-03-07 11:10:06,009][175731] Updated weights for policy 0, policy_version 64460 (0.0006) [2023-03-07 11:10:06,821][175731] Updated weights for policy 0, policy_version 64470 (0.0007) [2023-03-07 11:10:07,595][175731] Updated weights for policy 0, policy_version 64480 (0.0006) [2023-03-07 11:10:08,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12817.0, 300 sec: 12826.0). Total num frames: 66035712. Throughput: 0: 12832.0. Samples: 66035711. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:10:08,322][175405] Avg episode reward: [(0, '23.337')] [2023-03-07 11:10:08,400][175731] Updated weights for policy 0, policy_version 64490 (0.0006) [2023-03-07 11:10:09,194][175731] Updated weights for policy 0, policy_version 64500 (0.0007) [2023-03-07 11:10:09,981][175731] Updated weights for policy 0, policy_version 64510 (0.0006) [2023-03-07 11:10:10,772][175731] Updated weights for policy 0, policy_version 64520 (0.0006) [2023-03-07 11:10:11,565][175731] Updated weights for policy 0, policy_version 64530 (0.0007) [2023-03-07 11:10:12,372][175731] Updated weights for policy 0, policy_version 64540 (0.0007) [2023-03-07 11:10:13,172][175731] Updated weights for policy 0, policy_version 64550 (0.0007) [2023-03-07 11:10:13,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12826.0). Total num frames: 66100224. Throughput: 0: 12842.4. Samples: 66074489. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:10:13,321][175405] Avg episode reward: [(0, '26.132')] [2023-03-07 11:10:13,974][175731] Updated weights for policy 0, policy_version 64560 (0.0006) [2023-03-07 11:10:14,782][175731] Updated weights for policy 0, policy_version 64570 (0.0007) [2023-03-07 11:10:15,565][175731] Updated weights for policy 0, policy_version 64580 (0.0006) [2023-03-07 11:10:16,368][175731] Updated weights for policy 0, policy_version 64590 (0.0006) [2023-03-07 11:10:17,162][175731] Updated weights for policy 0, policy_version 64600 (0.0007) [2023-03-07 11:10:17,954][175731] Updated weights for policy 0, policy_version 64610 (0.0007) [2023-03-07 11:10:18,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12834.1, 300 sec: 12829.5). Total num frames: 66164736. Throughput: 0: 12838.6. Samples: 66151418. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:10:18,322][175405] Avg episode reward: [(0, '24.411')] [2023-03-07 11:10:18,757][175731] Updated weights for policy 0, policy_version 64620 (0.0007) [2023-03-07 11:10:19,559][175731] Updated weights for policy 0, policy_version 64630 (0.0006) [2023-03-07 11:10:20,353][175731] Updated weights for policy 0, policy_version 64640 (0.0006) [2023-03-07 11:10:21,156][175731] Updated weights for policy 0, policy_version 64650 (0.0007) [2023-03-07 11:10:21,954][175731] Updated weights for policy 0, policy_version 64660 (0.0006) [2023-03-07 11:10:22,739][175731] Updated weights for policy 0, policy_version 64670 (0.0006) [2023-03-07 11:10:23,321][175405] Fps is (10 sec: 12902.2, 60 sec: 12834.1, 300 sec: 12829.5). Total num frames: 66229248. Throughput: 0: 12840.3. Samples: 66228360. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:10:23,322][175405] Avg episode reward: [(0, '23.738')] [2023-03-07 11:10:23,547][175731] Updated weights for policy 0, policy_version 64680 (0.0006) [2023-03-07 11:10:24,352][175731] Updated weights for policy 0, policy_version 64690 (0.0007) [2023-03-07 11:10:25,151][175731] Updated weights for policy 0, policy_version 64700 (0.0006) [2023-03-07 11:10:25,941][175731] Updated weights for policy 0, policy_version 64710 (0.0007) [2023-03-07 11:10:26,755][175731] Updated weights for policy 0, policy_version 64720 (0.0006) [2023-03-07 11:10:27,553][175731] Updated weights for policy 0, policy_version 64730 (0.0007) [2023-03-07 11:10:28,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12834.1, 300 sec: 12829.5). Total num frames: 66292736. Throughput: 0: 12838.5. Samples: 66266758. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:10:28,322][175405] Avg episode reward: [(0, '24.790')] [2023-03-07 11:10:28,352][175731] Updated weights for policy 0, policy_version 64740 (0.0006) [2023-03-07 11:10:29,158][175731] Updated weights for policy 0, policy_version 64750 (0.0006) [2023-03-07 11:10:29,951][175731] Updated weights for policy 0, policy_version 64760 (0.0007) [2023-03-07 11:10:30,745][175731] Updated weights for policy 0, policy_version 64770 (0.0007) [2023-03-07 11:10:31,555][175731] Updated weights for policy 0, policy_version 64780 (0.0006) [2023-03-07 11:10:32,345][175731] Updated weights for policy 0, policy_version 64790 (0.0006) [2023-03-07 11:10:33,154][175731] Updated weights for policy 0, policy_version 64800 (0.0006) [2023-03-07 11:10:33,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12834.1, 300 sec: 12829.5). Total num frames: 66357248. Throughput: 0: 12834.6. Samples: 66343644. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:10:33,322][175405] Avg episode reward: [(0, '25.458')] [2023-03-07 11:10:33,940][175731] Updated weights for policy 0, policy_version 64810 (0.0006) [2023-03-07 11:10:34,737][175731] Updated weights for policy 0, policy_version 64820 (0.0007) [2023-03-07 11:10:35,531][175731] Updated weights for policy 0, policy_version 64830 (0.0006) [2023-03-07 11:10:36,344][175731] Updated weights for policy 0, policy_version 64840 (0.0006) [2023-03-07 11:10:37,126][175731] Updated weights for policy 0, policy_version 64850 (0.0006) [2023-03-07 11:10:37,912][175731] Updated weights for policy 0, policy_version 64860 (0.0007) [2023-03-07 11:10:38,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12833.0). Total num frames: 66421760. Throughput: 0: 12840.1. Samples: 66421001. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:10:38,322][175405] Avg episode reward: [(0, '24.773')] [2023-03-07 11:10:38,701][175731] Updated weights for policy 0, policy_version 64870 (0.0006) [2023-03-07 11:10:39,491][175731] Updated weights for policy 0, policy_version 64880 (0.0007) [2023-03-07 11:10:40,298][175731] Updated weights for policy 0, policy_version 64890 (0.0006) [2023-03-07 11:10:41,090][175731] Updated weights for policy 0, policy_version 64900 (0.0006) [2023-03-07 11:10:41,899][175731] Updated weights for policy 0, policy_version 64910 (0.0007) [2023-03-07 11:10:42,684][175731] Updated weights for policy 0, policy_version 64920 (0.0007) [2023-03-07 11:10:43,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12833.0). Total num frames: 66486272. Throughput: 0: 12838.1. Samples: 66459557. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:10:43,322][175405] Avg episode reward: [(0, '25.987')] [2023-03-07 11:10:43,465][175731] Updated weights for policy 0, policy_version 64930 (0.0006) [2023-03-07 11:10:44,278][175731] Updated weights for policy 0, policy_version 64940 (0.0007) [2023-03-07 11:10:45,091][175731] Updated weights for policy 0, policy_version 64950 (0.0006) [2023-03-07 11:10:45,896][175731] Updated weights for policy 0, policy_version 64960 (0.0006) [2023-03-07 11:10:46,706][175731] Updated weights for policy 0, policy_version 64970 (0.0006) [2023-03-07 11:10:47,494][175731] Updated weights for policy 0, policy_version 64980 (0.0007) [2023-03-07 11:10:48,299][175731] Updated weights for policy 0, policy_version 64990 (0.0007) [2023-03-07 11:10:48,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.2, 300 sec: 12829.5). Total num frames: 66549760. Throughput: 0: 12836.2. Samples: 66536299. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:10:48,322][175405] Avg episode reward: [(0, '26.700')] [2023-03-07 11:10:49,099][175731] Updated weights for policy 0, policy_version 65000 (0.0008) [2023-03-07 11:10:49,886][175731] Updated weights for policy 0, policy_version 65010 (0.0007) [2023-03-07 11:10:50,714][175731] Updated weights for policy 0, policy_version 65020 (0.0006) [2023-03-07 11:10:51,498][175731] Updated weights for policy 0, policy_version 65030 (0.0006) [2023-03-07 11:10:52,290][175731] Updated weights for policy 0, policy_version 65040 (0.0006) [2023-03-07 11:10:53,097][175731] Updated weights for policy 0, policy_version 65050 (0.0007) [2023-03-07 11:10:53,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12834.1, 300 sec: 12829.5). Total num frames: 66613248. Throughput: 0: 12830.2. Samples: 66613068. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:10:53,322][175405] Avg episode reward: [(0, '24.736')] [2023-03-07 11:10:53,886][175731] Updated weights for policy 0, policy_version 65060 (0.0006) [2023-03-07 11:10:54,681][175731] Updated weights for policy 0, policy_version 65070 (0.0006) [2023-03-07 11:10:55,493][175731] Updated weights for policy 0, policy_version 65080 (0.0007) [2023-03-07 11:10:56,287][175731] Updated weights for policy 0, policy_version 65090 (0.0007) [2023-03-07 11:10:57,090][175731] Updated weights for policy 0, policy_version 65100 (0.0006) [2023-03-07 11:10:57,887][175731] Updated weights for policy 0, policy_version 65110 (0.0007) [2023-03-07 11:10:58,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12829.5). Total num frames: 66677760. Throughput: 0: 12826.4. Samples: 66651679. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:10:58,332][175405] Avg episode reward: [(0, '24.179')] [2023-03-07 11:10:58,685][175731] Updated weights for policy 0, policy_version 65120 (0.0006) [2023-03-07 11:10:59,481][175731] Updated weights for policy 0, policy_version 65130 (0.0006) [2023-03-07 11:11:00,281][175731] Updated weights for policy 0, policy_version 65140 (0.0007) [2023-03-07 11:11:01,091][175731] Updated weights for policy 0, policy_version 65150 (0.0006) [2023-03-07 11:11:01,898][175731] Updated weights for policy 0, policy_version 65160 (0.0007) [2023-03-07 11:11:02,685][175731] Updated weights for policy 0, policy_version 65170 (0.0007) [2023-03-07 11:11:03,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12826.0). Total num frames: 66741248. Throughput: 0: 12819.1. Samples: 66728274. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:11:03,332][175405] Avg episode reward: [(0, '26.281')] [2023-03-07 11:11:03,501][175731] Updated weights for policy 0, policy_version 65180 (0.0006) [2023-03-07 11:11:04,307][175731] Updated weights for policy 0, policy_version 65190 (0.0007) [2023-03-07 11:11:05,086][175731] Updated weights for policy 0, policy_version 65200 (0.0006) [2023-03-07 11:11:05,890][175731] Updated weights for policy 0, policy_version 65210 (0.0007) [2023-03-07 11:11:06,671][175731] Updated weights for policy 0, policy_version 65220 (0.0006) [2023-03-07 11:11:07,468][175731] Updated weights for policy 0, policy_version 65230 (0.0005) [2023-03-07 11:11:08,249][175731] Updated weights for policy 0, policy_version 65240 (0.0006) [2023-03-07 11:11:08,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.2, 300 sec: 12829.5). Total num frames: 66805760. Throughput: 0: 12827.2. Samples: 66805584. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:11:08,332][175405] Avg episode reward: [(0, '26.542')] [2023-03-07 11:11:09,049][175731] Updated weights for policy 0, policy_version 65250 (0.0007) [2023-03-07 11:11:09,838][175731] Updated weights for policy 0, policy_version 65260 (0.0007) [2023-03-07 11:11:10,651][175731] Updated weights for policy 0, policy_version 65270 (0.0006) [2023-03-07 11:11:11,451][175731] Updated weights for policy 0, policy_version 65280 (0.0006) [2023-03-07 11:11:12,255][175731] Updated weights for policy 0, policy_version 65290 (0.0006) [2023-03-07 11:11:13,059][175731] Updated weights for policy 0, policy_version 65300 (0.0006) [2023-03-07 11:11:13,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12834.1, 300 sec: 12829.5). Total num frames: 66870272. Throughput: 0: 12829.9. Samples: 66844104. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:11:13,332][175405] Avg episode reward: [(0, '24.668')] [2023-03-07 11:11:13,849][175731] Updated weights for policy 0, policy_version 65310 (0.0007) [2023-03-07 11:11:14,657][175731] Updated weights for policy 0, policy_version 65320 (0.0007) [2023-03-07 11:11:15,465][175731] Updated weights for policy 0, policy_version 65330 (0.0006) [2023-03-07 11:11:16,278][175731] Updated weights for policy 0, policy_version 65340 (0.0006) [2023-03-07 11:11:17,059][175731] Updated weights for policy 0, policy_version 65350 (0.0006) [2023-03-07 11:11:17,855][175731] Updated weights for policy 0, policy_version 65360 (0.0006) [2023-03-07 11:11:18,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12817.0, 300 sec: 12826.0). Total num frames: 66933760. Throughput: 0: 12826.2. Samples: 66920826. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:11:18,332][175405] Avg episode reward: [(0, '24.667')] [2023-03-07 11:11:18,665][175731] Updated weights for policy 0, policy_version 65370 (0.0006) [2023-03-07 11:11:19,455][175731] Updated weights for policy 0, policy_version 65380 (0.0006) [2023-03-07 11:11:20,250][175731] Updated weights for policy 0, policy_version 65390 (0.0007) [2023-03-07 11:11:21,034][175731] Updated weights for policy 0, policy_version 65400 (0.0006) [2023-03-07 11:11:21,838][175731] Updated weights for policy 0, policy_version 65410 (0.0006) [2023-03-07 11:11:22,645][175731] Updated weights for policy 0, policy_version 65420 (0.0006) [2023-03-07 11:11:23,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12829.5). Total num frames: 66998272. Throughput: 0: 12818.8. Samples: 66997847. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:11:23,321][175405] Avg episode reward: [(0, '26.120')] [2023-03-07 11:11:23,426][175731] Updated weights for policy 0, policy_version 65430 (0.0007) [2023-03-07 11:11:24,224][175731] Updated weights for policy 0, policy_version 65440 (0.0007) [2023-03-07 11:11:25,022][175731] Updated weights for policy 0, policy_version 65450 (0.0006) [2023-03-07 11:11:25,833][175731] Updated weights for policy 0, policy_version 65460 (0.0006) [2023-03-07 11:11:26,610][175731] Updated weights for policy 0, policy_version 65470 (0.0006) [2023-03-07 11:11:27,406][175731] Updated weights for policy 0, policy_version 65480 (0.0006) [2023-03-07 11:11:28,219][175731] Updated weights for policy 0, policy_version 65490 (0.0006) [2023-03-07 11:11:28,321][175405] Fps is (10 sec: 12902.7, 60 sec: 12834.1, 300 sec: 12829.5). Total num frames: 67062784. Throughput: 0: 12819.2. Samples: 67036421. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:11:28,321][175405] Avg episode reward: [(0, '27.557')] [2023-03-07 11:11:29,007][175731] Updated weights for policy 0, policy_version 65500 (0.0006) [2023-03-07 11:11:29,798][175731] Updated weights for policy 0, policy_version 65510 (0.0006) [2023-03-07 11:11:30,586][175731] Updated weights for policy 0, policy_version 65520 (0.0007) [2023-03-07 11:11:31,385][175731] Updated weights for policy 0, policy_version 65530 (0.0007) [2023-03-07 11:11:32,192][175731] Updated weights for policy 0, policy_version 65540 (0.0007) [2023-03-07 11:11:32,985][175731] Updated weights for policy 0, policy_version 65550 (0.0006) [2023-03-07 11:11:33,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12834.1, 300 sec: 12829.5). Total num frames: 67127296. Throughput: 0: 12830.5. Samples: 67113670. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:11:33,321][175405] Avg episode reward: [(0, '25.733')] [2023-03-07 11:11:33,792][175731] Updated weights for policy 0, policy_version 65560 (0.0007) [2023-03-07 11:11:34,590][175731] Updated weights for policy 0, policy_version 65570 (0.0006) [2023-03-07 11:11:35,385][175731] Updated weights for policy 0, policy_version 65580 (0.0007) [2023-03-07 11:11:36,191][175731] Updated weights for policy 0, policy_version 65590 (0.0007) [2023-03-07 11:11:36,985][175731] Updated weights for policy 0, policy_version 65600 (0.0008) [2023-03-07 11:11:37,788][175731] Updated weights for policy 0, policy_version 65610 (0.0007) [2023-03-07 11:11:38,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12817.1, 300 sec: 12826.0). Total num frames: 67190784. Throughput: 0: 12829.8. Samples: 67190409. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:11:38,322][175405] Avg episode reward: [(0, '27.258')] [2023-03-07 11:11:38,577][175731] Updated weights for policy 0, policy_version 65620 (0.0007) [2023-03-07 11:11:39,383][175731] Updated weights for policy 0, policy_version 65630 (0.0006) [2023-03-07 11:11:40,202][175731] Updated weights for policy 0, policy_version 65640 (0.0007) [2023-03-07 11:11:41,024][175731] Updated weights for policy 0, policy_version 65650 (0.0007) [2023-03-07 11:11:41,809][175731] Updated weights for policy 0, policy_version 65660 (0.0007) [2023-03-07 11:11:42,612][175731] Updated weights for policy 0, policy_version 65670 (0.0007) [2023-03-07 11:11:43,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12800.0, 300 sec: 12822.6). Total num frames: 67254272. Throughput: 0: 12819.0. Samples: 67228534. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:11:43,321][175405] Avg episode reward: [(0, '27.101')] [2023-03-07 11:11:43,411][175731] Updated weights for policy 0, policy_version 65680 (0.0006) [2023-03-07 11:11:44,198][175731] Updated weights for policy 0, policy_version 65690 (0.0008) [2023-03-07 11:11:45,001][175731] Updated weights for policy 0, policy_version 65700 (0.0007) [2023-03-07 11:11:45,798][175731] Updated weights for policy 0, policy_version 65710 (0.0007) [2023-03-07 11:11:46,595][175731] Updated weights for policy 0, policy_version 65720 (0.0007) [2023-03-07 11:11:47,397][175731] Updated weights for policy 0, policy_version 65730 (0.0006) [2023-03-07 11:11:48,205][175731] Updated weights for policy 0, policy_version 65740 (0.0006) [2023-03-07 11:11:48,321][175405] Fps is (10 sec: 12800.2, 60 sec: 12817.1, 300 sec: 12826.0). Total num frames: 67318784. Throughput: 0: 12829.0. Samples: 67305577. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:11:48,321][175405] Avg episode reward: [(0, '24.607')] [2023-03-07 11:11:48,325][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000065741_67318784.pth... [2023-03-07 11:11:48,358][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000062735_64240640.pth [2023-03-07 11:11:49,003][175731] Updated weights for policy 0, policy_version 65750 (0.0006) [2023-03-07 11:11:49,814][175731] Updated weights for policy 0, policy_version 65760 (0.0006) [2023-03-07 11:11:50,615][175731] Updated weights for policy 0, policy_version 65770 (0.0007) [2023-03-07 11:11:51,416][175731] Updated weights for policy 0, policy_version 65780 (0.0006) [2023-03-07 11:11:52,202][175731] Updated weights for policy 0, policy_version 65790 (0.0006) [2023-03-07 11:11:53,005][175731] Updated weights for policy 0, policy_version 65800 (0.0007) [2023-03-07 11:11:53,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12822.6). Total num frames: 67382272. Throughput: 0: 12812.5. Samples: 67382148. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:11:53,322][175405] Avg episode reward: [(0, '26.089')] [2023-03-07 11:11:53,801][175731] Updated weights for policy 0, policy_version 65810 (0.0006) [2023-03-07 11:11:54,618][175731] Updated weights for policy 0, policy_version 65820 (0.0007) [2023-03-07 11:11:55,397][175731] Updated weights for policy 0, policy_version 65830 (0.0006) [2023-03-07 11:11:56,210][175731] Updated weights for policy 0, policy_version 65840 (0.0007) [2023-03-07 11:11:56,996][175731] Updated weights for policy 0, policy_version 65850 (0.0006) [2023-03-07 11:11:57,792][175731] Updated weights for policy 0, policy_version 65860 (0.0006) [2023-03-07 11:11:58,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12817.1, 300 sec: 12822.6). Total num frames: 67446784. Throughput: 0: 12810.3. 
Samples: 67420567. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:11:58,322][175405] Avg episode reward: [(0, '26.391')] [2023-03-07 11:11:58,596][175731] Updated weights for policy 0, policy_version 65870 (0.0006) [2023-03-07 11:11:59,418][175731] Updated weights for policy 0, policy_version 65880 (0.0007) [2023-03-07 11:12:00,199][175731] Updated weights for policy 0, policy_version 65890 (0.0007) [2023-03-07 11:12:01,021][175731] Updated weights for policy 0, policy_version 65900 (0.0006) [2023-03-07 11:12:01,806][175731] Updated weights for policy 0, policy_version 65910 (0.0006) [2023-03-07 11:12:02,617][175731] Updated weights for policy 0, policy_version 65920 (0.0007) [2023-03-07 11:12:03,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12822.6). Total num frames: 67510272. Throughput: 0: 12811.1. Samples: 67497325. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:12:03,321][175405] Avg episode reward: [(0, '25.957')] [2023-03-07 11:12:03,430][175731] Updated weights for policy 0, policy_version 65930 (0.0006) [2023-03-07 11:12:04,217][175731] Updated weights for policy 0, policy_version 65940 (0.0006) [2023-03-07 11:12:05,017][175731] Updated weights for policy 0, policy_version 65950 (0.0006) [2023-03-07 11:12:05,820][175731] Updated weights for policy 0, policy_version 65960 (0.0007) [2023-03-07 11:12:06,626][175731] Updated weights for policy 0, policy_version 65970 (0.0006) [2023-03-07 11:12:07,412][175731] Updated weights for policy 0, policy_version 65980 (0.0006) [2023-03-07 11:12:08,225][175731] Updated weights for policy 0, policy_version 65990 (0.0006) [2023-03-07 11:12:08,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12822.6). Total num frames: 67574784. Throughput: 0: 12807.3. Samples: 67574177. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:12:08,322][175405] Avg episode reward: [(0, '26.050')] [2023-03-07 11:12:09,010][175731] Updated weights for policy 0, policy_version 66000 (0.0006) [2023-03-07 11:12:09,812][175731] Updated weights for policy 0, policy_version 66010 (0.0006) [2023-03-07 11:12:10,598][175731] Updated weights for policy 0, policy_version 66020 (0.0006) [2023-03-07 11:12:11,395][175731] Updated weights for policy 0, policy_version 66030 (0.0006) [2023-03-07 11:12:12,189][175731] Updated weights for policy 0, policy_version 66040 (0.0006) [2023-03-07 11:12:12,993][175731] Updated weights for policy 0, policy_version 66050 (0.0007) [2023-03-07 11:12:13,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12817.1, 300 sec: 12826.0). Total num frames: 67639296. Throughput: 0: 12808.7. Samples: 67612814. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:12:13,322][175405] Avg episode reward: [(0, '26.428')] [2023-03-07 11:12:13,794][175731] Updated weights for policy 0, policy_version 66060 (0.0007) [2023-03-07 11:12:14,613][175731] Updated weights for policy 0, policy_version 66070 (0.0006) [2023-03-07 11:12:15,413][175731] Updated weights for policy 0, policy_version 66080 (0.0007) [2023-03-07 11:12:16,212][175731] Updated weights for policy 0, policy_version 66090 (0.0006) [2023-03-07 11:12:16,988][175731] Updated weights for policy 0, policy_version 66100 (0.0007) [2023-03-07 11:12:17,782][175731] Updated weights for policy 0, policy_version 66110 (0.0005) [2023-03-07 11:12:18,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12822.6). Total num frames: 67702784. Throughput: 0: 12799.3. Samples: 67689639. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:12:18,322][175405] Avg episode reward: [(0, '27.812')] [2023-03-07 11:12:18,579][175731] Updated weights for policy 0, policy_version 66120 (0.0007) [2023-03-07 11:12:19,373][175731] Updated weights for policy 0, policy_version 66130 (0.0007) [2023-03-07 11:12:20,169][175731] Updated weights for policy 0, policy_version 66140 (0.0006) [2023-03-07 11:12:20,977][175731] Updated weights for policy 0, policy_version 66150 (0.0006) [2023-03-07 11:12:21,799][175731] Updated weights for policy 0, policy_version 66160 (0.0006) [2023-03-07 11:12:22,583][175731] Updated weights for policy 0, policy_version 66170 (0.0007) [2023-03-07 11:12:23,321][175405] Fps is (10 sec: 12800.2, 60 sec: 12817.1, 300 sec: 12822.6). Total num frames: 67767296. Throughput: 0: 12800.1. Samples: 67766413. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:12:23,321][175405] Avg episode reward: [(0, '25.346')] [2023-03-07 11:12:23,388][175731] Updated weights for policy 0, policy_version 66180 (0.0006) [2023-03-07 11:12:24,188][175731] Updated weights for policy 0, policy_version 66190 (0.0006) [2023-03-07 11:12:24,983][175731] Updated weights for policy 0, policy_version 66200 (0.0007) [2023-03-07 11:12:25,776][175731] Updated weights for policy 0, policy_version 66210 (0.0008) [2023-03-07 11:12:26,585][175731] Updated weights for policy 0, policy_version 66220 (0.0006) [2023-03-07 11:12:27,401][175731] Updated weights for policy 0, policy_version 66230 (0.0007) [2023-03-07 11:12:28,196][175731] Updated weights for policy 0, policy_version 66240 (0.0007) [2023-03-07 11:12:28,321][175405] Fps is (10 sec: 12800.2, 60 sec: 12800.0, 300 sec: 12822.6). Total num frames: 67830784. Throughput: 0: 12811.5. Samples: 67805051. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:12:28,321][175405] Avg episode reward: [(0, '26.458')] [2023-03-07 11:12:28,995][175731] Updated weights for policy 0, policy_version 66250 (0.0007) [2023-03-07 11:12:29,785][175731] Updated weights for policy 0, policy_version 66260 (0.0006) [2023-03-07 11:12:30,583][175731] Updated weights for policy 0, policy_version 66270 (0.0007) [2023-03-07 11:12:31,379][175731] Updated weights for policy 0, policy_version 66280 (0.0007) [2023-03-07 11:12:32,185][175731] Updated weights for policy 0, policy_version 66290 (0.0006) [2023-03-07 11:12:32,980][175731] Updated weights for policy 0, policy_version 66300 (0.0006) [2023-03-07 11:12:33,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12800.0, 300 sec: 12826.0). Total num frames: 67895296. Throughput: 0: 12804.3. Samples: 67881771. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:12:33,322][175405] Avg episode reward: [(0, '25.599')] [2023-03-07 11:12:33,772][175731] Updated weights for policy 0, policy_version 66310 (0.0006) [2023-03-07 11:12:34,576][175731] Updated weights for policy 0, policy_version 66320 (0.0006) [2023-03-07 11:12:35,361][175731] Updated weights for policy 0, policy_version 66330 (0.0007) [2023-03-07 11:12:36,166][175731] Updated weights for policy 0, policy_version 66340 (0.0006) [2023-03-07 11:12:36,968][175731] Updated weights for policy 0, policy_version 66350 (0.0006) [2023-03-07 11:12:37,758][175731] Updated weights for policy 0, policy_version 66360 (0.0007) [2023-03-07 11:12:38,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12800.0, 300 sec: 12822.6). Total num frames: 67958784. Throughput: 0: 12814.6. Samples: 67958806. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:12:38,322][175405] Avg episode reward: [(0, '26.891')] [2023-03-07 11:12:38,566][175731] Updated weights for policy 0, policy_version 66370 (0.0008) [2023-03-07 11:12:39,354][175731] Updated weights for policy 0, policy_version 66380 (0.0007) [2023-03-07 11:12:40,174][175731] Updated weights for policy 0, policy_version 66390 (0.0007) [2023-03-07 11:12:40,973][175731] Updated weights for policy 0, policy_version 66400 (0.0006) [2023-03-07 11:12:41,772][175731] Updated weights for policy 0, policy_version 66410 (0.0006) [2023-03-07 11:12:42,566][175731] Updated weights for policy 0, policy_version 66420 (0.0007) [2023-03-07 11:12:43,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12822.6). Total num frames: 68023296. Throughput: 0: 12811.1. Samples: 67997067. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:12:43,322][175405] Avg episode reward: [(0, '25.250')] [2023-03-07 11:12:43,366][175731] Updated weights for policy 0, policy_version 66430 (0.0007) [2023-03-07 11:12:44,158][175731] Updated weights for policy 0, policy_version 66440 (0.0006) [2023-03-07 11:12:44,958][175731] Updated weights for policy 0, policy_version 66450 (0.0006) [2023-03-07 11:12:45,769][175731] Updated weights for policy 0, policy_version 66460 (0.0006) [2023-03-07 11:12:46,569][175731] Updated weights for policy 0, policy_version 66470 (0.0006) [2023-03-07 11:12:47,369][175731] Updated weights for policy 0, policy_version 66480 (0.0006) [2023-03-07 11:12:48,165][175731] Updated weights for policy 0, policy_version 66490 (0.0007) [2023-03-07 11:12:48,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12800.0, 300 sec: 12822.6). Total num frames: 68086784. Throughput: 0: 12811.6. Samples: 68073847. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:12:48,332][175405] Avg episode reward: [(0, '27.476')] [2023-03-07 11:12:48,960][175731] Updated weights for policy 0, policy_version 66500 (0.0007) [2023-03-07 11:12:49,755][175731] Updated weights for policy 0, policy_version 66510 (0.0006) [2023-03-07 11:12:50,561][175731] Updated weights for policy 0, policy_version 66520 (0.0006) [2023-03-07 11:12:51,350][175731] Updated weights for policy 0, policy_version 66530 (0.0007) [2023-03-07 11:12:52,166][175731] Updated weights for policy 0, policy_version 66540 (0.0006) [2023-03-07 11:12:52,954][175731] Updated weights for policy 0, policy_version 66550 (0.0006) [2023-03-07 11:12:53,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12822.6). Total num frames: 68151296. Throughput: 0: 12812.2. Samples: 68150723. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:12:53,332][175405] Avg episode reward: [(0, '26.949')] [2023-03-07 11:12:53,748][175731] Updated weights for policy 0, policy_version 66560 (0.0006) [2023-03-07 11:12:54,576][175731] Updated weights for policy 0, policy_version 66570 (0.0007) [2023-03-07 11:12:55,345][175731] Updated weights for policy 0, policy_version 66580 (0.0007) [2023-03-07 11:12:56,156][175731] Updated weights for policy 0, policy_version 66590 (0.0006) [2023-03-07 11:12:56,958][175731] Updated weights for policy 0, policy_version 66600 (0.0006) [2023-03-07 11:12:57,740][175731] Updated weights for policy 0, policy_version 66610 (0.0008) [2023-03-07 11:12:58,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12817.1, 300 sec: 12822.6). Total num frames: 68215808. Throughput: 0: 12811.0. Samples: 68189310. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:12:58,332][175405] Avg episode reward: [(0, '23.945')] [2023-03-07 11:12:58,553][175731] Updated weights for policy 0, policy_version 66620 (0.0006) [2023-03-07 11:12:59,341][175731] Updated weights for policy 0, policy_version 66630 (0.0006) [2023-03-07 11:13:00,134][175731] Updated weights for policy 0, policy_version 66640 (0.0006) [2023-03-07 11:13:00,936][175731] Updated weights for policy 0, policy_version 66650 (0.0008) [2023-03-07 11:13:01,726][175731] Updated weights for policy 0, policy_version 66660 (0.0007) [2023-03-07 11:13:02,514][175731] Updated weights for policy 0, policy_version 66670 (0.0008) [2023-03-07 11:13:03,320][175731] Updated weights for policy 0, policy_version 66680 (0.0007) [2023-03-07 11:13:03,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12834.1, 300 sec: 12822.6). Total num frames: 68280320. Throughput: 0: 12817.3. Samples: 68266418. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:13:03,331][175405] Avg episode reward: [(0, '25.683')] [2023-03-07 11:13:04,121][175731] Updated weights for policy 0, policy_version 66690 (0.0006) [2023-03-07 11:13:04,933][175731] Updated weights for policy 0, policy_version 66700 (0.0006) [2023-03-07 11:13:05,717][175731] Updated weights for policy 0, policy_version 66710 (0.0006) [2023-03-07 11:13:06,505][175731] Updated weights for policy 0, policy_version 66720 (0.0007) [2023-03-07 11:13:07,311][175731] Updated weights for policy 0, policy_version 66730 (0.0006) [2023-03-07 11:13:08,122][175731] Updated weights for policy 0, policy_version 66740 (0.0006) [2023-03-07 11:13:08,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12819.1). Total num frames: 68343808. Throughput: 0: 12821.4. Samples: 68343378. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:13:08,332][175405] Avg episode reward: [(0, '24.263')] [2023-03-07 11:13:08,924][175731] Updated weights for policy 0, policy_version 66750 (0.0006) [2023-03-07 11:13:09,735][175731] Updated weights for policy 0, policy_version 66760 (0.0006) [2023-03-07 11:13:10,517][175731] Updated weights for policy 0, policy_version 66770 (0.0006) [2023-03-07 11:13:11,329][175731] Updated weights for policy 0, policy_version 66780 (0.0006) [2023-03-07 11:13:12,126][175731] Updated weights for policy 0, policy_version 66790 (0.0006) [2023-03-07 11:13:12,936][175731] Updated weights for policy 0, policy_version 66800 (0.0006) [2023-03-07 11:13:13,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12800.0, 300 sec: 12819.1). Total num frames: 68407296. Throughput: 0: 12814.8. Samples: 68381718. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:13:13,321][175405] Avg episode reward: [(0, '26.260')] [2023-03-07 11:13:13,730][175731] Updated weights for policy 0, policy_version 66810 (0.0007) [2023-03-07 11:13:14,543][175731] Updated weights for policy 0, policy_version 66820 (0.0006) [2023-03-07 11:13:15,333][175731] Updated weights for policy 0, policy_version 66830 (0.0006) [2023-03-07 11:13:16,119][175731] Updated weights for policy 0, policy_version 66840 (0.0006) [2023-03-07 11:13:16,919][175731] Updated weights for policy 0, policy_version 66850 (0.0007) [2023-03-07 11:13:17,729][175731] Updated weights for policy 0, policy_version 66860 (0.0007) [2023-03-07 11:13:18,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12819.1). Total num frames: 68471808. Throughput: 0: 12818.1. Samples: 68458586. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:13:18,322][175405] Avg episode reward: [(0, '26.601')] [2023-03-07 11:13:18,524][175731] Updated weights for policy 0, policy_version 66870 (0.0006) [2023-03-07 11:13:19,318][175731] Updated weights for policy 0, policy_version 66880 (0.0006) [2023-03-07 11:13:20,120][175731] Updated weights for policy 0, policy_version 66890 (0.0006) [2023-03-07 11:13:20,902][175731] Updated weights for policy 0, policy_version 66900 (0.0007) [2023-03-07 11:13:21,717][175731] Updated weights for policy 0, policy_version 66910 (0.0007) [2023-03-07 11:13:22,534][175731] Updated weights for policy 0, policy_version 66920 (0.0006) [2023-03-07 11:13:23,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12800.0, 300 sec: 12815.6). Total num frames: 68535296. Throughput: 0: 12809.6. Samples: 68535239. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:13:23,322][175405] Avg episode reward: [(0, '24.995')] [2023-03-07 11:13:23,334][175731] Updated weights for policy 0, policy_version 66930 (0.0006) [2023-03-07 11:13:24,131][175731] Updated weights for policy 0, policy_version 66940 (0.0007) [2023-03-07 11:13:24,933][175731] Updated weights for policy 0, policy_version 66950 (0.0006) [2023-03-07 11:13:25,721][175731] Updated weights for policy 0, policy_version 66960 (0.0006) [2023-03-07 11:13:26,537][175731] Updated weights for policy 0, policy_version 66970 (0.0007) [2023-03-07 11:13:27,328][175731] Updated weights for policy 0, policy_version 66980 (0.0007) [2023-03-07 11:13:28,139][175731] Updated weights for policy 0, policy_version 66990 (0.0006) [2023-03-07 11:13:28,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.0, 300 sec: 12815.6). Total num frames: 68599808. Throughput: 0: 12812.3. Samples: 68573622. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:13:28,321][175405] Avg episode reward: [(0, '25.371')] [2023-03-07 11:13:28,937][175731] Updated weights for policy 0, policy_version 67000 (0.0006) [2023-03-07 11:13:29,731][175731] Updated weights for policy 0, policy_version 67010 (0.0006) [2023-03-07 11:13:30,523][175731] Updated weights for policy 0, policy_version 67020 (0.0006) [2023-03-07 11:13:31,327][175731] Updated weights for policy 0, policy_version 67030 (0.0006) [2023-03-07 11:13:32,124][175731] Updated weights for policy 0, policy_version 67040 (0.0007) [2023-03-07 11:13:32,908][175731] Updated weights for policy 0, policy_version 67050 (0.0006) [2023-03-07 11:13:33,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12815.6). Total num frames: 68663296. Throughput: 0: 12811.2. Samples: 68650353. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:13:33,322][175405] Avg episode reward: [(0, '27.637')] [2023-03-07 11:13:33,707][175731] Updated weights for policy 0, policy_version 67060 (0.0006) [2023-03-07 11:13:34,493][175731] Updated weights for policy 0, policy_version 67070 (0.0007) [2023-03-07 11:13:35,293][175731] Updated weights for policy 0, policy_version 67080 (0.0007) [2023-03-07 11:13:36,101][175731] Updated weights for policy 0, policy_version 67090 (0.0007) [2023-03-07 11:13:36,893][175731] Updated weights for policy 0, policy_version 67100 (0.0006) [2023-03-07 11:13:37,702][175731] Updated weights for policy 0, policy_version 67110 (0.0006) [2023-03-07 11:13:38,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12815.6). Total num frames: 68727808. Throughput: 0: 12824.1. Samples: 68727806. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:13:38,322][175405] Avg episode reward: [(0, '25.906')] [2023-03-07 11:13:38,479][175731] Updated weights for policy 0, policy_version 67120 (0.0007) [2023-03-07 11:13:39,288][175731] Updated weights for policy 0, policy_version 67130 (0.0007) [2023-03-07 11:13:40,085][175731] Updated weights for policy 0, policy_version 67140 (0.0006) [2023-03-07 11:13:40,869][175731] Updated weights for policy 0, policy_version 67150 (0.0007) [2023-03-07 11:13:41,681][175731] Updated weights for policy 0, policy_version 67160 (0.0006) [2023-03-07 11:13:42,484][175731] Updated weights for policy 0, policy_version 67170 (0.0006) [2023-03-07 11:13:43,277][175731] Updated weights for policy 0, policy_version 67180 (0.0006) [2023-03-07 11:13:43,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12817.1, 300 sec: 12815.6). Total num frames: 68792320. Throughput: 0: 12822.2. Samples: 68766307. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:13:43,322][175405] Avg episode reward: [(0, '24.550')] [2023-03-07 11:13:44,065][175731] Updated weights for policy 0, policy_version 67190 (0.0006) [2023-03-07 11:13:44,878][175731] Updated weights for policy 0, policy_version 67200 (0.0006) [2023-03-07 11:13:45,677][175731] Updated weights for policy 0, policy_version 67210 (0.0007) [2023-03-07 11:13:46,456][175731] Updated weights for policy 0, policy_version 67220 (0.0006) [2023-03-07 11:13:47,266][175731] Updated weights for policy 0, policy_version 67230 (0.0006) [2023-03-07 11:13:48,066][175731] Updated weights for policy 0, policy_version 67240 (0.0007) [2023-03-07 11:13:48,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12834.1, 300 sec: 12819.1). Total num frames: 68856832. Throughput: 0: 12819.7. Samples: 68843307. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:13:48,321][175405] Avg episode reward: [(0, '23.754')] [2023-03-07 11:13:48,326][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000067243_68856832.pth... [2023-03-07 11:13:48,357][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000064238_65779712.pth [2023-03-07 11:13:48,851][175731] Updated weights for policy 0, policy_version 67250 (0.0006) [2023-03-07 11:13:49,651][175731] Updated weights for policy 0, policy_version 67260 (0.0007) [2023-03-07 11:13:50,457][175731] Updated weights for policy 0, policy_version 67270 (0.0005) [2023-03-07 11:13:51,259][175731] Updated weights for policy 0, policy_version 67280 (0.0007) [2023-03-07 11:13:52,055][175731] Updated weights for policy 0, policy_version 67290 (0.0007) [2023-03-07 11:13:52,873][175731] Updated weights for policy 0, policy_version 67300 (0.0006) [2023-03-07 11:13:53,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12815.6). Total num frames: 68920320. Throughput: 0: 12812.6. Samples: 68919942. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:13:53,321][175405] Avg episode reward: [(0, '24.280')] [2023-03-07 11:13:53,666][175731] Updated weights for policy 0, policy_version 67310 (0.0006) [2023-03-07 11:13:54,473][175731] Updated weights for policy 0, policy_version 67320 (0.0006) [2023-03-07 11:13:55,261][175731] Updated weights for policy 0, policy_version 67330 (0.0007) [2023-03-07 11:13:56,050][175731] Updated weights for policy 0, policy_version 67340 (0.0007) [2023-03-07 11:13:56,847][175731] Updated weights for policy 0, policy_version 67350 (0.0006) [2023-03-07 11:13:57,650][175731] Updated weights for policy 0, policy_version 67360 (0.0007) [2023-03-07 11:13:58,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12819.1). Total num frames: 68984832. Throughput: 0: 12817.7. 
Samples: 68958516. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:13:58,322][175405] Avg episode reward: [(0, '24.981')] [2023-03-07 11:13:58,455][175731] Updated weights for policy 0, policy_version 67370 (0.0007) [2023-03-07 11:13:59,247][175731] Updated weights for policy 0, policy_version 67380 (0.0007) [2023-03-07 11:14:00,042][175731] Updated weights for policy 0, policy_version 67390 (0.0006) [2023-03-07 11:14:00,838][175731] Updated weights for policy 0, policy_version 67400 (0.0006) [2023-03-07 11:14:01,639][175731] Updated weights for policy 0, policy_version 67410 (0.0007) [2023-03-07 11:14:02,447][175731] Updated weights for policy 0, policy_version 67420 (0.0006) [2023-03-07 11:14:03,246][175731] Updated weights for policy 0, policy_version 67430 (0.0006) [2023-03-07 11:14:03,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12817.1, 300 sec: 12822.6). Total num frames: 69049344. Throughput: 0: 12823.2. Samples: 69035629. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:14:03,321][175405] Avg episode reward: [(0, '25.348')] [2023-03-07 11:14:04,045][175731] Updated weights for policy 0, policy_version 67440 (0.0006) [2023-03-07 11:14:04,853][175731] Updated weights for policy 0, policy_version 67450 (0.0006) [2023-03-07 11:14:05,645][175731] Updated weights for policy 0, policy_version 67460 (0.0007) [2023-03-07 11:14:06,422][175731] Updated weights for policy 0, policy_version 67470 (0.0007) [2023-03-07 11:14:07,238][175731] Updated weights for policy 0, policy_version 67480 (0.0006) [2023-03-07 11:14:08,034][175731] Updated weights for policy 0, policy_version 67490 (0.0006) [2023-03-07 11:14:08,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12819.1). Total num frames: 69112832. Throughput: 0: 12827.6. Samples: 69112480. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:14:08,322][175405] Avg episode reward: [(0, '23.877')] [2023-03-07 11:14:08,832][175731] Updated weights for policy 0, policy_version 67500 (0.0006) [2023-03-07 11:14:09,622][175731] Updated weights for policy 0, policy_version 67510 (0.0006) [2023-03-07 11:14:10,414][175731] Updated weights for policy 0, policy_version 67520 (0.0006) [2023-03-07 11:14:11,225][175731] Updated weights for policy 0, policy_version 67530 (0.0006) [2023-03-07 11:14:12,018][175731] Updated weights for policy 0, policy_version 67540 (0.0007) [2023-03-07 11:14:12,825][175731] Updated weights for policy 0, policy_version 67550 (0.0006) [2023-03-07 11:14:13,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12834.1, 300 sec: 12822.6). Total num frames: 69177344. Throughput: 0: 12832.3. Samples: 69151074. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:14:13,322][175405] Avg episode reward: [(0, '25.724')] [2023-03-07 11:14:13,632][175731] Updated weights for policy 0, policy_version 67560 (0.0007) [2023-03-07 11:14:14,420][175731] Updated weights for policy 0, policy_version 67570 (0.0006) [2023-03-07 11:14:15,215][175731] Updated weights for policy 0, policy_version 67580 (0.0006) [2023-03-07 11:14:16,009][175731] Updated weights for policy 0, policy_version 67590 (0.0006) [2023-03-07 11:14:16,806][175731] Updated weights for policy 0, policy_version 67600 (0.0006) [2023-03-07 11:14:17,590][175731] Updated weights for policy 0, policy_version 67610 (0.0006) [2023-03-07 11:14:18,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12834.2, 300 sec: 12822.6). Total num frames: 69241856. Throughput: 0: 12838.5. Samples: 69228085. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:14:18,322][175405] Avg episode reward: [(0, '24.774')] [2023-03-07 11:14:18,400][175731] Updated weights for policy 0, policy_version 67620 (0.0006) [2023-03-07 11:14:19,187][175731] Updated weights for policy 0, policy_version 67630 (0.0006) [2023-03-07 11:14:19,973][175731] Updated weights for policy 0, policy_version 67640 (0.0006) [2023-03-07 11:14:20,785][175731] Updated weights for policy 0, policy_version 67650 (0.0006) [2023-03-07 11:14:21,589][175731] Updated weights for policy 0, policy_version 67660 (0.0006) [2023-03-07 11:14:22,381][175731] Updated weights for policy 0, policy_version 67670 (0.0006) [2023-03-07 11:14:23,160][175731] Updated weights for policy 0, policy_version 67680 (0.0006) [2023-03-07 11:14:23,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12834.1, 300 sec: 12822.6). Total num frames: 69305344. Throughput: 0: 12833.4. Samples: 69305308. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:14:23,322][175405] Avg episode reward: [(0, '24.447')] [2023-03-07 11:14:23,981][175731] Updated weights for policy 0, policy_version 67690 (0.0006) [2023-03-07 11:14:24,771][175731] Updated weights for policy 0, policy_version 67700 (0.0007) [2023-03-07 11:14:25,567][175731] Updated weights for policy 0, policy_version 67710 (0.0007) [2023-03-07 11:14:26,378][175731] Updated weights for policy 0, policy_version 67720 (0.0007) [2023-03-07 11:14:27,176][175731] Updated weights for policy 0, policy_version 67730 (0.0006) [2023-03-07 11:14:27,983][175731] Updated weights for policy 0, policy_version 67740 (0.0006) [2023-03-07 11:14:28,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12834.1, 300 sec: 12822.6). Total num frames: 69369856. Throughput: 0: 12827.0. Samples: 69343522. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:14:28,321][175405] Avg episode reward: [(0, '25.313')] [2023-03-07 11:14:28,757][175731] Updated weights for policy 0, policy_version 67750 (0.0005) [2023-03-07 11:14:29,548][175731] Updated weights for policy 0, policy_version 67760 (0.0007) [2023-03-07 11:14:30,355][175731] Updated weights for policy 0, policy_version 67770 (0.0006) [2023-03-07 11:14:31,147][175731] Updated weights for policy 0, policy_version 67780 (0.0007) [2023-03-07 11:14:31,940][175731] Updated weights for policy 0, policy_version 67790 (0.0007) [2023-03-07 11:14:32,737][175731] Updated weights for policy 0, policy_version 67800 (0.0007) [2023-03-07 11:14:33,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12826.0). Total num frames: 69434368. Throughput: 0: 12833.3. Samples: 69420805. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:14:33,332][175405] Avg episode reward: [(0, '23.875')] [2023-03-07 11:14:33,547][175731] Updated weights for policy 0, policy_version 67810 (0.0006) [2023-03-07 11:14:34,328][175731] Updated weights for policy 0, policy_version 67820 (0.0006) [2023-03-07 11:14:35,147][175731] Updated weights for policy 0, policy_version 67830 (0.0006) [2023-03-07 11:14:35,941][175731] Updated weights for policy 0, policy_version 67840 (0.0007) [2023-03-07 11:14:36,729][175731] Updated weights for policy 0, policy_version 67850 (0.0006) [2023-03-07 11:14:37,533][175731] Updated weights for policy 0, policy_version 67860 (0.0007) [2023-03-07 11:14:38,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12834.1, 300 sec: 12822.6). Total num frames: 69497856. Throughput: 0: 12837.4. Samples: 69497628. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:14:38,332][175405] Avg episode reward: [(0, '23.961')] [2023-03-07 11:14:38,350][175731] Updated weights for policy 0, policy_version 67870 (0.0006) [2023-03-07 11:14:39,143][175731] Updated weights for policy 0, policy_version 67880 (0.0006) [2023-03-07 11:14:39,958][175731] Updated weights for policy 0, policy_version 67890 (0.0007) [2023-03-07 11:14:40,749][175731] Updated weights for policy 0, policy_version 67900 (0.0006) [2023-03-07 11:14:41,544][175731] Updated weights for policy 0, policy_version 67910 (0.0007) [2023-03-07 11:14:42,347][175731] Updated weights for policy 0, policy_version 67920 (0.0007) [2023-03-07 11:14:43,134][175731] Updated weights for policy 0, policy_version 67930 (0.0006) [2023-03-07 11:14:43,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12822.6). Total num frames: 69562368. Throughput: 0: 12831.1. Samples: 69535917. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:14:43,332][175405] Avg episode reward: [(0, '23.622')] [2023-03-07 11:14:43,949][175731] Updated weights for policy 0, policy_version 67940 (0.0008) [2023-03-07 11:14:44,718][175731] Updated weights for policy 0, policy_version 67950 (0.0006) [2023-03-07 11:14:45,498][175731] Updated weights for policy 0, policy_version 67960 (0.0006) [2023-03-07 11:14:46,288][175731] Updated weights for policy 0, policy_version 67970 (0.0006) [2023-03-07 11:14:47,109][175731] Updated weights for policy 0, policy_version 67980 (0.0006) [2023-03-07 11:14:47,894][175731] Updated weights for policy 0, policy_version 67990 (0.0006) [2023-03-07 11:14:48,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12834.1, 300 sec: 12826.0). Total num frames: 69626880. Throughput: 0: 12836.1. Samples: 69613256. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:14:48,332][175405] Avg episode reward: [(0, '24.852')] [2023-03-07 11:14:48,707][175731] Updated weights for policy 0, policy_version 68000 (0.0007) [2023-03-07 11:14:49,500][175731] Updated weights for policy 0, policy_version 68010 (0.0006) [2023-03-07 11:14:50,293][175731] Updated weights for policy 0, policy_version 68020 (0.0007) [2023-03-07 11:14:51,086][175731] Updated weights for policy 0, policy_version 68030 (0.0006) [2023-03-07 11:14:51,874][175731] Updated weights for policy 0, policy_version 68040 (0.0006) [2023-03-07 11:14:52,674][175731] Updated weights for policy 0, policy_version 68050 (0.0007) [2023-03-07 11:14:53,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12826.0). Total num frames: 69691392. Throughput: 0: 12843.6. Samples: 69690440. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:14:53,322][175405] Avg episode reward: [(0, '24.743')] [2023-03-07 11:14:53,484][175731] Updated weights for policy 0, policy_version 68060 (0.0006) [2023-03-07 11:14:54,281][175731] Updated weights for policy 0, policy_version 68070 (0.0006) [2023-03-07 11:14:55,069][175731] Updated weights for policy 0, policy_version 68080 (0.0006) [2023-03-07 11:14:55,878][175731] Updated weights for policy 0, policy_version 68090 (0.0007) [2023-03-07 11:14:56,683][175731] Updated weights for policy 0, policy_version 68100 (0.0007) [2023-03-07 11:14:57,489][175731] Updated weights for policy 0, policy_version 68110 (0.0007) [2023-03-07 11:14:58,291][175731] Updated weights for policy 0, policy_version 68120 (0.0007) [2023-03-07 11:14:58,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12834.1, 300 sec: 12822.6). Total num frames: 69754880. Throughput: 0: 12837.4. Samples: 69728756. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:14:58,322][175405] Avg episode reward: [(0, '27.440')] [2023-03-07 11:14:59,094][175731] Updated weights for policy 0, policy_version 68130 (0.0007) [2023-03-07 11:14:59,921][175731] Updated weights for policy 0, policy_version 68140 (0.0007) [2023-03-07 11:15:00,713][175731] Updated weights for policy 0, policy_version 68150 (0.0006) [2023-03-07 11:15:01,513][175731] Updated weights for policy 0, policy_version 68160 (0.0006) [2023-03-07 11:15:02,320][175731] Updated weights for policy 0, policy_version 68170 (0.0008) [2023-03-07 11:15:03,119][175731] Updated weights for policy 0, policy_version 68180 (0.0006) [2023-03-07 11:15:03,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12817.1, 300 sec: 12822.6). Total num frames: 69818368. Throughput: 0: 12822.6. Samples: 69805104. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:15:03,321][175405] Avg episode reward: [(0, '24.097')] [2023-03-07 11:15:03,909][175731] Updated weights for policy 0, policy_version 68190 (0.0007) [2023-03-07 11:15:04,717][175731] Updated weights for policy 0, policy_version 68200 (0.0006) [2023-03-07 11:15:05,511][175731] Updated weights for policy 0, policy_version 68210 (0.0006) [2023-03-07 11:15:06,315][175731] Updated weights for policy 0, policy_version 68220 (0.0007) [2023-03-07 11:15:07,109][175731] Updated weights for policy 0, policy_version 68230 (0.0006) [2023-03-07 11:15:07,905][175731] Updated weights for policy 0, policy_version 68240 (0.0006) [2023-03-07 11:15:08,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12822.6). Total num frames: 69882880. Throughput: 0: 12818.5. Samples: 69882141. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:15:08,321][175405] Avg episode reward: [(0, '24.043')] [2023-03-07 11:15:08,693][175731] Updated weights for policy 0, policy_version 68250 (0.0007) [2023-03-07 11:15:09,510][175731] Updated weights for policy 0, policy_version 68260 (0.0006) [2023-03-07 11:15:10,302][175731] Updated weights for policy 0, policy_version 68270 (0.0008) [2023-03-07 11:15:11,105][175731] Updated weights for policy 0, policy_version 68280 (0.0006) [2023-03-07 11:15:11,912][175731] Updated weights for policy 0, policy_version 68290 (0.0006) [2023-03-07 11:15:12,696][175731] Updated weights for policy 0, policy_version 68300 (0.0006) [2023-03-07 11:15:13,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12817.1, 300 sec: 12819.1). Total num frames: 69946368. Throughput: 0: 12822.4. Samples: 69920528. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:15:13,322][175405] Avg episode reward: [(0, '24.258')] [2023-03-07 11:15:13,487][175731] Updated weights for policy 0, policy_version 68310 (0.0007) [2023-03-07 11:15:14,293][175731] Updated weights for policy 0, policy_version 68320 (0.0007) [2023-03-07 11:15:15,090][175731] Updated weights for policy 0, policy_version 68330 (0.0006) [2023-03-07 11:15:15,879][175731] Updated weights for policy 0, policy_version 68340 (0.0007) [2023-03-07 11:15:16,692][175731] Updated weights for policy 0, policy_version 68350 (0.0007) [2023-03-07 11:15:17,495][175731] Updated weights for policy 0, policy_version 68360 (0.0006) [2023-03-07 11:15:18,285][175731] Updated weights for policy 0, policy_version 68370 (0.0007) [2023-03-07 11:15:18,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12817.0, 300 sec: 12819.1). Total num frames: 70010880. Throughput: 0: 12815.0. Samples: 69997481. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:15:18,322][175405] Avg episode reward: [(0, '24.853')] [2023-03-07 11:15:19,090][175731] Updated weights for policy 0, policy_version 68380 (0.0007) [2023-03-07 11:15:19,883][175731] Updated weights for policy 0, policy_version 68390 (0.0007) [2023-03-07 11:15:20,684][175731] Updated weights for policy 0, policy_version 68400 (0.0007) [2023-03-07 11:15:21,473][175731] Updated weights for policy 0, policy_version 68410 (0.0006) [2023-03-07 11:15:22,266][175731] Updated weights for policy 0, policy_version 68420 (0.0005) [2023-03-07 11:15:23,063][175731] Updated weights for policy 0, policy_version 68430 (0.0006) [2023-03-07 11:15:23,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12834.1, 300 sec: 12822.6). Total num frames: 70075392. Throughput: 0: 12820.9. Samples: 70074567. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:15:23,321][175405] Avg episode reward: [(0, '24.643')] [2023-03-07 11:15:23,848][175731] Updated weights for policy 0, policy_version 68440 (0.0006) [2023-03-07 11:15:24,632][175731] Updated weights for policy 0, policy_version 68450 (0.0007) [2023-03-07 11:15:25,449][175731] Updated weights for policy 0, policy_version 68460 (0.0007) [2023-03-07 11:15:26,249][175731] Updated weights for policy 0, policy_version 68470 (0.0007) [2023-03-07 11:15:27,033][175731] Updated weights for policy 0, policy_version 68480 (0.0007) [2023-03-07 11:15:27,839][175731] Updated weights for policy 0, policy_version 68490 (0.0005) [2023-03-07 11:15:28,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12834.1, 300 sec: 12822.6). Total num frames: 70139904. Throughput: 0: 12830.2. Samples: 70113278. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:15:28,322][175405] Avg episode reward: [(0, '24.924')] [2023-03-07 11:15:28,632][175731] Updated weights for policy 0, policy_version 68500 (0.0007) [2023-03-07 11:15:29,425][175731] Updated weights for policy 0, policy_version 68510 (0.0006) [2023-03-07 11:15:30,225][175731] Updated weights for policy 0, policy_version 68520 (0.0007) [2023-03-07 11:15:31,014][175731] Updated weights for policy 0, policy_version 68530 (0.0006) [2023-03-07 11:15:31,786][175731] Updated weights for policy 0, policy_version 68540 (0.0006) [2023-03-07 11:15:32,589][175731] Updated weights for policy 0, policy_version 68550 (0.0006) [2023-03-07 11:15:33,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12834.1, 300 sec: 12822.6). Total num frames: 70204416. Throughput: 0: 12831.0. Samples: 70190649. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:15:33,322][175405] Avg episode reward: [(0, '25.295')] [2023-03-07 11:15:33,405][175731] Updated weights for policy 0, policy_version 68560 (0.0006) [2023-03-07 11:15:34,206][175731] Updated weights for policy 0, policy_version 68570 (0.0008) [2023-03-07 11:15:35,000][175731] Updated weights for policy 0, policy_version 68580 (0.0006) [2023-03-07 11:15:35,802][175731] Updated weights for policy 0, policy_version 68590 (0.0006) [2023-03-07 11:15:36,599][175731] Updated weights for policy 0, policy_version 68600 (0.0006) [2023-03-07 11:15:37,387][175731] Updated weights for policy 0, policy_version 68610 (0.0006) [2023-03-07 11:15:38,202][175731] Updated weights for policy 0, policy_version 68620 (0.0007) [2023-03-07 11:15:38,321][175405] Fps is (10 sec: 12800.2, 60 sec: 12834.2, 300 sec: 12819.1). Total num frames: 70267904. Throughput: 0: 12822.9. Samples: 70267468. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:15:38,321][175405] Avg episode reward: [(0, '24.884')] [2023-03-07 11:15:38,997][175731] Updated weights for policy 0, policy_version 68630 (0.0007) [2023-03-07 11:15:39,803][175731] Updated weights for policy 0, policy_version 68640 (0.0007) [2023-03-07 11:15:40,590][175731] Updated weights for policy 0, policy_version 68650 (0.0006) [2023-03-07 11:15:41,394][175731] Updated weights for policy 0, policy_version 68660 (0.0008) [2023-03-07 11:15:42,173][175731] Updated weights for policy 0, policy_version 68670 (0.0007) [2023-03-07 11:15:42,985][175731] Updated weights for policy 0, policy_version 68680 (0.0006) [2023-03-07 11:15:43,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12834.1, 300 sec: 12822.6). Total num frames: 70332416. Throughput: 0: 12827.9. Samples: 70306011. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:15:43,321][175405] Avg episode reward: [(0, '25.402')] [2023-03-07 11:15:43,779][175731] Updated weights for policy 0, policy_version 68690 (0.0007) [2023-03-07 11:15:44,570][175731] Updated weights for policy 0, policy_version 68700 (0.0007) [2023-03-07 11:15:45,362][175731] Updated weights for policy 0, policy_version 68710 (0.0006) [2023-03-07 11:15:46,174][175731] Updated weights for policy 0, policy_version 68720 (0.0007) [2023-03-07 11:15:46,967][175731] Updated weights for policy 0, policy_version 68730 (0.0006) [2023-03-07 11:15:47,777][175731] Updated weights for policy 0, policy_version 68740 (0.0006) [2023-03-07 11:15:48,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12817.1, 300 sec: 12822.6). Total num frames: 70395904. Throughput: 0: 12836.7. Samples: 70382758. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:15:48,322][175405] Avg episode reward: [(0, '25.321')] [2023-03-07 11:15:48,341][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000068747_70396928.pth... 
[2023-03-07 11:15:48,370][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000065741_67318784.pth [2023-03-07 11:15:48,566][175731] Updated weights for policy 0, policy_version 68750 (0.0006) [2023-03-07 11:15:49,373][175731] Updated weights for policy 0, policy_version 68760 (0.0006) [2023-03-07 11:15:50,154][175731] Updated weights for policy 0, policy_version 68770 (0.0006) [2023-03-07 11:15:50,954][175731] Updated weights for policy 0, policy_version 68780 (0.0006) [2023-03-07 11:15:51,739][175731] Updated weights for policy 0, policy_version 68790 (0.0007) [2023-03-07 11:15:52,546][175731] Updated weights for policy 0, policy_version 68800 (0.0007) [2023-03-07 11:15:53,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12817.1, 300 sec: 12822.6). Total num frames: 70460416. Throughput: 0: 12842.1. Samples: 70460038. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:15:53,322][175405] Avg episode reward: [(0, '24.467')] [2023-03-07 11:15:53,346][175731] Updated weights for policy 0, policy_version 68810 (0.0006) [2023-03-07 11:15:54,165][175731] Updated weights for policy 0, policy_version 68820 (0.0007) [2023-03-07 11:15:54,956][175731] Updated weights for policy 0, policy_version 68830 (0.0007) [2023-03-07 11:15:55,748][175731] Updated weights for policy 0, policy_version 68840 (0.0006) [2023-03-07 11:15:56,554][175731] Updated weights for policy 0, policy_version 68850 (0.0006) [2023-03-07 11:15:57,353][175731] Updated weights for policy 0, policy_version 68860 (0.0007) [2023-03-07 11:15:58,153][175731] Updated weights for policy 0, policy_version 68870 (0.0006) [2023-03-07 11:15:58,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12834.1, 300 sec: 12826.0). Total num frames: 70524928. Throughput: 0: 12845.3. Samples: 70498567. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:15:58,322][175405] Avg episode reward: [(0, '25.212')] [2023-03-07 11:15:58,954][175731] Updated weights for policy 0, policy_version 68880 (0.0006) [2023-03-07 11:15:59,754][175731] Updated weights for policy 0, policy_version 68890 (0.0006) [2023-03-07 11:16:00,552][175731] Updated weights for policy 0, policy_version 68900 (0.0006) [2023-03-07 11:16:01,349][175731] Updated weights for policy 0, policy_version 68910 (0.0006) [2023-03-07 11:16:02,146][175731] Updated weights for policy 0, policy_version 68920 (0.0007) [2023-03-07 11:16:02,945][175731] Updated weights for policy 0, policy_version 68930 (0.0007) [2023-03-07 11:16:03,321][175405] Fps is (10 sec: 12902.6, 60 sec: 12851.2, 300 sec: 12826.0). Total num frames: 70589440. Throughput: 0: 12844.1. Samples: 70575466. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:16:03,322][175405] Avg episode reward: [(0, '25.274')] [2023-03-07 11:16:03,745][175731] Updated weights for policy 0, policy_version 68940 (0.0007) [2023-03-07 11:16:04,530][175731] Updated weights for policy 0, policy_version 68950 (0.0007) [2023-03-07 11:16:05,341][175731] Updated weights for policy 0, policy_version 68960 (0.0007) [2023-03-07 11:16:06,150][175731] Updated weights for policy 0, policy_version 68970 (0.0006) [2023-03-07 11:16:06,941][175731] Updated weights for policy 0, policy_version 68980 (0.0007) [2023-03-07 11:16:07,719][175731] Updated weights for policy 0, policy_version 68990 (0.0006) [2023-03-07 11:16:08,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12834.1, 300 sec: 12822.6). Total num frames: 70652928. Throughput: 0: 12836.1. Samples: 70652194. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:16:08,322][175405] Avg episode reward: [(0, '24.738')] [2023-03-07 11:16:08,540][175731] Updated weights for policy 0, policy_version 69000 (0.0007) [2023-03-07 11:16:09,342][175731] Updated weights for policy 0, policy_version 69010 (0.0006) [2023-03-07 11:16:10,144][175731] Updated weights for policy 0, policy_version 69020 (0.0006) [2023-03-07 11:16:10,941][175731] Updated weights for policy 0, policy_version 69030 (0.0006) [2023-03-07 11:16:11,738][175731] Updated weights for policy 0, policy_version 69040 (0.0006) [2023-03-07 11:16:12,550][175731] Updated weights for policy 0, policy_version 69050 (0.0006) [2023-03-07 11:16:13,321][175405] Fps is (10 sec: 12697.4, 60 sec: 12834.1, 300 sec: 12822.6). Total num frames: 70716416. Throughput: 0: 12827.2. Samples: 70690500. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:16:13,332][175405] Avg episode reward: [(0, '24.666')] [2023-03-07 11:16:13,351][175731] Updated weights for policy 0, policy_version 69060 (0.0007) [2023-03-07 11:16:14,155][175731] Updated weights for policy 0, policy_version 69070 (0.0006) [2023-03-07 11:16:14,937][175731] Updated weights for policy 0, policy_version 69080 (0.0006) [2023-03-07 11:16:15,751][175731] Updated weights for policy 0, policy_version 69090 (0.0006) [2023-03-07 11:16:16,537][175731] Updated weights for policy 0, policy_version 69100 (0.0006) [2023-03-07 11:16:17,343][175731] Updated weights for policy 0, policy_version 69110 (0.0007) [2023-03-07 11:16:18,134][175731] Updated weights for policy 0, policy_version 69120 (0.0007) [2023-03-07 11:16:18,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12822.6). Total num frames: 70780928. Throughput: 0: 12818.7. Samples: 70767492. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:16:18,332][175405] Avg episode reward: [(0, '25.461')] [2023-03-07 11:16:18,936][175731] Updated weights for policy 0, policy_version 69130 (0.0007) [2023-03-07 11:16:19,720][175731] Updated weights for policy 0, policy_version 69140 (0.0006) [2023-03-07 11:16:20,514][175731] Updated weights for policy 0, policy_version 69150 (0.0006) [2023-03-07 11:16:21,327][175731] Updated weights for policy 0, policy_version 69160 (0.0006) [2023-03-07 11:16:22,127][175731] Updated weights for policy 0, policy_version 69170 (0.0006) [2023-03-07 11:16:22,928][175731] Updated weights for policy 0, policy_version 69180 (0.0006) [2023-03-07 11:16:23,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12819.1). Total num frames: 70844416. Throughput: 0: 12815.7. Samples: 70844177. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:16:23,332][175405] Avg episode reward: [(0, '23.055')] [2023-03-07 11:16:23,739][175731] Updated weights for policy 0, policy_version 69190 (0.0006) [2023-03-07 11:16:24,544][175731] Updated weights for policy 0, policy_version 69200 (0.0007) [2023-03-07 11:16:25,344][175731] Updated weights for policy 0, policy_version 69210 (0.0007) [2023-03-07 11:16:26,134][175731] Updated weights for policy 0, policy_version 69220 (0.0006) [2023-03-07 11:16:26,922][175731] Updated weights for policy 0, policy_version 69230 (0.0006) [2023-03-07 11:16:27,726][175731] Updated weights for policy 0, policy_version 69240 (0.0006) [2023-03-07 11:16:28,321][175405] Fps is (10 sec: 12800.2, 60 sec: 12817.1, 300 sec: 12819.1). Total num frames: 70908928. Throughput: 0: 12814.9. Samples: 70882683. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:16:28,332][175405] Avg episode reward: [(0, '24.880')] [2023-03-07 11:16:28,538][175731] Updated weights for policy 0, policy_version 69250 (0.0006) [2023-03-07 11:16:29,334][175731] Updated weights for policy 0, policy_version 69260 (0.0006) [2023-03-07 11:16:30,127][175731] Updated weights for policy 0, policy_version 69270 (0.0006) [2023-03-07 11:16:30,918][175731] Updated weights for policy 0, policy_version 69280 (0.0006) [2023-03-07 11:16:31,722][175731] Updated weights for policy 0, policy_version 69290 (0.0007) [2023-03-07 11:16:32,522][175731] Updated weights for policy 0, policy_version 69300 (0.0007) [2023-03-07 11:16:33,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12800.0, 300 sec: 12819.1). Total num frames: 70972416. Throughput: 0: 12819.5. Samples: 70959633. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:16:33,327][175731] Updated weights for policy 0, policy_version 69310 (0.0006) [2023-03-07 11:16:33,332][175405] Avg episode reward: [(0, '23.715')] [2023-03-07 11:16:34,128][175731] Updated weights for policy 0, policy_version 69320 (0.0006) [2023-03-07 11:16:34,912][175731] Updated weights for policy 0, policy_version 69330 (0.0007) [2023-03-07 11:16:35,731][175731] Updated weights for policy 0, policy_version 69340 (0.0006) [2023-03-07 11:16:36,509][175731] Updated weights for policy 0, policy_version 69350 (0.0006) [2023-03-07 11:16:37,310][175731] Updated weights for policy 0, policy_version 69360 (0.0007) [2023-03-07 11:16:38,118][175731] Updated weights for policy 0, policy_version 69370 (0.0006) [2023-03-07 11:16:38,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12817.1, 300 sec: 12822.6). Total num frames: 71036928. Throughput: 0: 12809.0. Samples: 71036444. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:16:38,332][175405] Avg episode reward: [(0, '24.011')] [2023-03-07 11:16:38,908][175731] Updated weights for policy 0, policy_version 69380 (0.0007) [2023-03-07 11:16:39,690][175731] Updated weights for policy 0, policy_version 69390 (0.0007) [2023-03-07 11:16:40,504][175731] Updated weights for policy 0, policy_version 69400 (0.0007) [2023-03-07 11:16:41,278][175731] Updated weights for policy 0, policy_version 69410 (0.0007) [2023-03-07 11:16:42,081][175731] Updated weights for policy 0, policy_version 69420 (0.0008) [2023-03-07 11:16:42,874][175731] Updated weights for policy 0, policy_version 69430 (0.0006) [2023-03-07 11:16:43,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12817.1, 300 sec: 12822.6). Total num frames: 71101440. Throughput: 0: 12813.5. Samples: 71075176. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:16:43,322][175405] Avg episode reward: [(0, '25.601')] [2023-03-07 11:16:43,687][175731] Updated weights for policy 0, policy_version 69440 (0.0006) [2023-03-07 11:16:44,474][175731] Updated weights for policy 0, policy_version 69450 (0.0007) [2023-03-07 11:16:45,270][175731] Updated weights for policy 0, policy_version 69460 (0.0006) [2023-03-07 11:16:46,078][175731] Updated weights for policy 0, policy_version 69470 (0.0006) [2023-03-07 11:16:46,871][175731] Updated weights for policy 0, policy_version 69480 (0.0006) [2023-03-07 11:16:47,677][175731] Updated weights for policy 0, policy_version 69490 (0.0007) [2023-03-07 11:16:48,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12834.1, 300 sec: 12826.0). Total num frames: 71165952. Throughput: 0: 12820.0. Samples: 71152369. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:16:48,322][175405] Avg episode reward: [(0, '24.003')] [2023-03-07 11:16:48,468][175731] Updated weights for policy 0, policy_version 69500 (0.0006) [2023-03-07 11:16:49,258][175731] Updated weights for policy 0, policy_version 69510 (0.0007) [2023-03-07 11:16:50,080][175731] Updated weights for policy 0, policy_version 69520 (0.0006) [2023-03-07 11:16:50,868][175731] Updated weights for policy 0, policy_version 69530 (0.0006) [2023-03-07 11:16:51,667][175731] Updated weights for policy 0, policy_version 69540 (0.0007) [2023-03-07 11:16:52,471][175731] Updated weights for policy 0, policy_version 69550 (0.0007) [2023-03-07 11:16:53,284][175731] Updated weights for policy 0, policy_version 69560 (0.0005) [2023-03-07 11:16:53,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12822.6). Total num frames: 71229440. Throughput: 0: 12818.2. Samples: 71229009. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:16:53,321][175405] Avg episode reward: [(0, '23.830')] [2023-03-07 11:16:54,065][175731] Updated weights for policy 0, policy_version 69570 (0.0006) [2023-03-07 11:16:54,885][175731] Updated weights for policy 0, policy_version 69580 (0.0007) [2023-03-07 11:16:55,674][175731] Updated weights for policy 0, policy_version 69590 (0.0007) [2023-03-07 11:16:56,469][175731] Updated weights for policy 0, policy_version 69600 (0.0006) [2023-03-07 11:16:57,277][175731] Updated weights for policy 0, policy_version 69610 (0.0006) [2023-03-07 11:16:58,084][175731] Updated weights for policy 0, policy_version 69620 (0.0007) [2023-03-07 11:16:58,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12817.1, 300 sec: 12826.0). Total num frames: 71293952. Throughput: 0: 12821.5. Samples: 71267469. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:16:58,322][175405] Avg episode reward: [(0, '24.251')] [2023-03-07 11:16:58,870][175731] Updated weights for policy 0, policy_version 69630 (0.0006) [2023-03-07 11:16:59,659][175731] Updated weights for policy 0, policy_version 69640 (0.0005) [2023-03-07 11:17:00,473][175731] Updated weights for policy 0, policy_version 69650 (0.0007) [2023-03-07 11:17:01,255][175731] Updated weights for policy 0, policy_version 69660 (0.0006) [2023-03-07 11:17:02,065][175731] Updated weights for policy 0, policy_version 69670 (0.0006) [2023-03-07 11:17:02,865][175731] Updated weights for policy 0, policy_version 69680 (0.0006) [2023-03-07 11:17:03,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12800.0, 300 sec: 12822.6). Total num frames: 71357440. Throughput: 0: 12819.6. Samples: 71344374. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:17:03,322][175405] Avg episode reward: [(0, '25.450')] [2023-03-07 11:17:03,655][175731] Updated weights for policy 0, policy_version 69690 (0.0007) [2023-03-07 11:17:04,453][175731] Updated weights for policy 0, policy_version 69700 (0.0006) [2023-03-07 11:17:05,246][175731] Updated weights for policy 0, policy_version 69710 (0.0007) [2023-03-07 11:17:06,035][175731] Updated weights for policy 0, policy_version 69720 (0.0006) [2023-03-07 11:17:06,831][175731] Updated weights for policy 0, policy_version 69730 (0.0006) [2023-03-07 11:17:07,638][175731] Updated weights for policy 0, policy_version 69740 (0.0006) [2023-03-07 11:17:08,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12822.6). Total num frames: 71421952. Throughput: 0: 12827.4. Samples: 71421411. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:17:08,322][175405] Avg episode reward: [(0, '24.079')] [2023-03-07 11:17:08,440][175731] Updated weights for policy 0, policy_version 69750 (0.0006) [2023-03-07 11:17:09,218][175731] Updated weights for policy 0, policy_version 69760 (0.0006) [2023-03-07 11:17:10,019][175731] Updated weights for policy 0, policy_version 69770 (0.0007) [2023-03-07 11:17:10,816][175731] Updated weights for policy 0, policy_version 69780 (0.0006) [2023-03-07 11:17:11,594][175731] Updated weights for policy 0, policy_version 69790 (0.0006) [2023-03-07 11:17:12,391][175731] Updated weights for policy 0, policy_version 69800 (0.0006) [2023-03-07 11:17:13,182][175731] Updated weights for policy 0, policy_version 69810 (0.0006) [2023-03-07 11:17:13,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12834.1, 300 sec: 12826.0). Total num frames: 71486464. Throughput: 0: 12835.6. Samples: 71460285. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:17:13,322][175405] Avg episode reward: [(0, '23.934')] [2023-03-07 11:17:13,975][175731] Updated weights for policy 0, policy_version 69820 (0.0007) [2023-03-07 11:17:14,789][175731] Updated weights for policy 0, policy_version 69830 (0.0007) [2023-03-07 11:17:15,565][175731] Updated weights for policy 0, policy_version 69840 (0.0006) [2023-03-07 11:17:16,348][175731] Updated weights for policy 0, policy_version 69850 (0.0006) [2023-03-07 11:17:17,161][175731] Updated weights for policy 0, policy_version 69860 (0.0006) [2023-03-07 11:17:17,951][175731] Updated weights for policy 0, policy_version 69870 (0.0006) [2023-03-07 11:17:18,321][175405] Fps is (10 sec: 12902.6, 60 sec: 12834.2, 300 sec: 12826.0). Total num frames: 71550976. Throughput: 0: 12848.2. Samples: 71537802. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:17:18,321][175405] Avg episode reward: [(0, '23.450')] [2023-03-07 11:17:18,750][175731] Updated weights for policy 0, policy_version 69880 (0.0006) [2023-03-07 11:17:19,541][175731] Updated weights for policy 0, policy_version 69890 (0.0006) [2023-03-07 11:17:20,332][175731] Updated weights for policy 0, policy_version 69900 (0.0006) [2023-03-07 11:17:21,144][175731] Updated weights for policy 0, policy_version 69910 (0.0007) [2023-03-07 11:17:21,927][175731] Updated weights for policy 0, policy_version 69920 (0.0006) [2023-03-07 11:17:22,741][175731] Updated weights for policy 0, policy_version 69930 (0.0006) [2023-03-07 11:17:23,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12851.2, 300 sec: 12829.5). Total num frames: 71615488. Throughput: 0: 12851.2. Samples: 71614749. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:17:23,322][175405] Avg episode reward: [(0, '23.995')] [2023-03-07 11:17:23,543][175731] Updated weights for policy 0, policy_version 69940 (0.0006) [2023-03-07 11:17:24,325][175731] Updated weights for policy 0, policy_version 69950 (0.0006) [2023-03-07 11:17:25,110][175731] Updated weights for policy 0, policy_version 69960 (0.0007) [2023-03-07 11:17:25,915][175731] Updated weights for policy 0, policy_version 69970 (0.0007) [2023-03-07 11:17:26,713][175731] Updated weights for policy 0, policy_version 69980 (0.0006) [2023-03-07 11:17:27,493][175731] Updated weights for policy 0, policy_version 69990 (0.0006) [2023-03-07 11:17:28,297][175731] Updated weights for policy 0, policy_version 70000 (0.0006) [2023-03-07 11:17:28,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12829.5). Total num frames: 71680000. Throughput: 0: 12852.6. Samples: 71653542. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:17:28,322][175405] Avg episode reward: [(0, '22.973')] [2023-03-07 11:17:29,108][175731] Updated weights for policy 0, policy_version 70010 (0.0006) [2023-03-07 11:17:29,893][175731] Updated weights for policy 0, policy_version 70020 (0.0006) [2023-03-07 11:17:30,705][175731] Updated weights for policy 0, policy_version 70030 (0.0008) [2023-03-07 11:17:31,502][175731] Updated weights for policy 0, policy_version 70040 (0.0006) [2023-03-07 11:17:32,304][175731] Updated weights for policy 0, policy_version 70050 (0.0006) [2023-03-07 11:17:33,101][175731] Updated weights for policy 0, policy_version 70060 (0.0006) [2023-03-07 11:17:33,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12851.2, 300 sec: 12829.5). Total num frames: 71743488. Throughput: 0: 12849.6. Samples: 71730603. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:17:33,322][175405] Avg episode reward: [(0, '24.308')] [2023-03-07 11:17:33,899][175731] Updated weights for policy 0, policy_version 70070 (0.0006) [2023-03-07 11:17:34,686][175731] Updated weights for policy 0, policy_version 70080 (0.0006) [2023-03-07 11:17:35,505][175731] Updated weights for policy 0, policy_version 70090 (0.0006) [2023-03-07 11:17:36,283][175731] Updated weights for policy 0, policy_version 70100 (0.0007) [2023-03-07 11:17:37,076][175731] Updated weights for policy 0, policy_version 70110 (0.0007) [2023-03-07 11:17:37,886][175731] Updated weights for policy 0, policy_version 70120 (0.0006) [2023-03-07 11:17:38,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12851.2, 300 sec: 12829.5). Total num frames: 71808000. Throughput: 0: 12850.7. Samples: 71807292. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:17:38,322][175405] Avg episode reward: [(0, '24.045')] [2023-03-07 11:17:38,689][175731] Updated weights for policy 0, policy_version 70130 (0.0006) [2023-03-07 11:17:39,481][175731] Updated weights for policy 0, policy_version 70140 (0.0006) [2023-03-07 11:17:40,285][175731] Updated weights for policy 0, policy_version 70150 (0.0006) [2023-03-07 11:17:41,092][175731] Updated weights for policy 0, policy_version 70160 (0.0006) [2023-03-07 11:17:41,883][175731] Updated weights for policy 0, policy_version 70170 (0.0006) [2023-03-07 11:17:42,696][175731] Updated weights for policy 0, policy_version 70180 (0.0006) [2023-03-07 11:17:43,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12834.2, 300 sec: 12829.5). Total num frames: 71871488. Throughput: 0: 12851.7. Samples: 71845791. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:17:43,321][175405] Avg episode reward: [(0, '25.884')] [2023-03-07 11:17:43,480][175731] Updated weights for policy 0, policy_version 70190 (0.0006) [2023-03-07 11:17:44,280][175731] Updated weights for policy 0, policy_version 70200 (0.0006) [2023-03-07 11:17:45,081][175731] Updated weights for policy 0, policy_version 70210 (0.0006) [2023-03-07 11:17:45,862][175731] Updated weights for policy 0, policy_version 70220 (0.0006) [2023-03-07 11:17:46,651][175731] Updated weights for policy 0, policy_version 70230 (0.0007) [2023-03-07 11:17:47,461][175731] Updated weights for policy 0, policy_version 70240 (0.0007) [2023-03-07 11:17:48,250][175731] Updated weights for policy 0, policy_version 70250 (0.0006) [2023-03-07 11:17:48,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12851.2, 300 sec: 12833.0). Total num frames: 71937024. Throughput: 0: 12860.0. Samples: 71923072. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:17:48,332][175405] Avg episode reward: [(0, '24.944')] [2023-03-07 11:17:48,337][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000070251_71937024.pth... [2023-03-07 11:17:48,367][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000067243_68856832.pth [2023-03-07 11:17:49,060][175731] Updated weights for policy 0, policy_version 70260 (0.0006) [2023-03-07 11:17:49,845][175731] Updated weights for policy 0, policy_version 70270 (0.0006) [2023-03-07 11:17:50,646][175731] Updated weights for policy 0, policy_version 70280 (0.0006) [2023-03-07 11:17:51,437][175731] Updated weights for policy 0, policy_version 70290 (0.0006) [2023-03-07 11:17:52,229][175731] Updated weights for policy 0, policy_version 70300 (0.0006) [2023-03-07 11:17:53,016][175731] Updated weights for policy 0, policy_version 70310 (0.0006) [2023-03-07 11:17:53,321][175405] Fps is (10 sec: 12902.2, 60 sec: 12851.2, 300 sec: 12829.5). Total num frames: 72000512. Throughput: 0: 12862.7. Samples: 72000232. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:17:53,332][175405] Avg episode reward: [(0, '25.402')] [2023-03-07 11:17:53,838][175731] Updated weights for policy 0, policy_version 70320 (0.0006) [2023-03-07 11:17:54,617][175731] Updated weights for policy 0, policy_version 70330 (0.0007) [2023-03-07 11:17:55,410][175731] Updated weights for policy 0, policy_version 70340 (0.0006) [2023-03-07 11:17:56,203][175731] Updated weights for policy 0, policy_version 70350 (0.0006) [2023-03-07 11:17:56,991][175731] Updated weights for policy 0, policy_version 70360 (0.0007) [2023-03-07 11:17:57,800][175731] Updated weights for policy 0, policy_version 70370 (0.0007) [2023-03-07 11:17:58,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12851.2, 300 sec: 12829.5). Total num frames: 72065024. Throughput: 0: 12858.3. 
Samples: 72038908. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:17:58,332][175405] Avg episode reward: [(0, '23.412')] [2023-03-07 11:17:58,589][175731] Updated weights for policy 0, policy_version 70380 (0.0006) [2023-03-07 11:17:59,375][175731] Updated weights for policy 0, policy_version 70390 (0.0006) [2023-03-07 11:18:00,167][175731] Updated weights for policy 0, policy_version 70400 (0.0006) [2023-03-07 11:18:01,006][175731] Updated weights for policy 0, policy_version 70410 (0.0006) [2023-03-07 11:18:01,782][175731] Updated weights for policy 0, policy_version 70420 (0.0007) [2023-03-07 11:18:02,578][175731] Updated weights for policy 0, policy_version 70430 (0.0006) [2023-03-07 11:18:03,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12833.0). Total num frames: 72129536. Throughput: 0: 12851.6. Samples: 72116125. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:18:03,332][175405] Avg episode reward: [(0, '25.490')] [2023-03-07 11:18:03,360][175731] Updated weights for policy 0, policy_version 70440 (0.0007) [2023-03-07 11:18:04,145][175731] Updated weights for policy 0, policy_version 70450 (0.0006) [2023-03-07 11:18:04,954][175731] Updated weights for policy 0, policy_version 70460 (0.0007) [2023-03-07 11:18:05,747][175731] Updated weights for policy 0, policy_version 70470 (0.0007) [2023-03-07 11:18:06,531][175731] Updated weights for policy 0, policy_version 70480 (0.0006) [2023-03-07 11:18:07,332][175731] Updated weights for policy 0, policy_version 70490 (0.0007) [2023-03-07 11:18:08,140][175731] Updated weights for policy 0, policy_version 70500 (0.0006) [2023-03-07 11:18:08,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12836.4). Total num frames: 72194048. Throughput: 0: 12858.8. Samples: 72193393. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:18:08,332][175405] Avg episode reward: [(0, '24.557')] [2023-03-07 11:18:08,927][175731] Updated weights for policy 0, policy_version 70510 (0.0006) [2023-03-07 11:18:09,730][175731] Updated weights for policy 0, policy_version 70520 (0.0006) [2023-03-07 11:18:10,525][175731] Updated weights for policy 0, policy_version 70530 (0.0006) [2023-03-07 11:18:11,340][175731] Updated weights for policy 0, policy_version 70540 (0.0007) [2023-03-07 11:18:12,129][175731] Updated weights for policy 0, policy_version 70550 (0.0007) [2023-03-07 11:18:12,925][175731] Updated weights for policy 0, policy_version 70560 (0.0006) [2023-03-07 11:18:13,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12868.3, 300 sec: 12836.5). Total num frames: 72258560. Throughput: 0: 12851.7. Samples: 72231869. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:18:13,332][175405] Avg episode reward: [(0, '25.590')] [2023-03-07 11:18:13,703][175731] Updated weights for policy 0, policy_version 70570 (0.0007) [2023-03-07 11:18:14,514][175731] Updated weights for policy 0, policy_version 70580 (0.0007) [2023-03-07 11:18:15,317][175731] Updated weights for policy 0, policy_version 70590 (0.0007) [2023-03-07 11:18:16,106][175731] Updated weights for policy 0, policy_version 70600 (0.0006) [2023-03-07 11:18:16,913][175731] Updated weights for policy 0, policy_version 70610 (0.0007) [2023-03-07 11:18:17,713][175731] Updated weights for policy 0, policy_version 70620 (0.0006) [2023-03-07 11:18:18,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12851.2, 300 sec: 12836.4). Total num frames: 72322048. Throughput: 0: 12852.6. Samples: 72308972. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:18:18,332][175405] Avg episode reward: [(0, '24.310')] [2023-03-07 11:18:18,494][175731] Updated weights for policy 0, policy_version 70630 (0.0007) [2023-03-07 11:18:19,283][175731] Updated weights for policy 0, policy_version 70640 (0.0007) [2023-03-07 11:18:20,106][175731] Updated weights for policy 0, policy_version 70650 (0.0007) [2023-03-07 11:18:20,898][175731] Updated weights for policy 0, policy_version 70660 (0.0006) [2023-03-07 11:18:21,679][175731] Updated weights for policy 0, policy_version 70670 (0.0005) [2023-03-07 11:18:22,481][175731] Updated weights for policy 0, policy_version 70680 (0.0006) [2023-03-07 11:18:23,273][175731] Updated weights for policy 0, policy_version 70690 (0.0007) [2023-03-07 11:18:23,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12851.2, 300 sec: 12836.4). Total num frames: 72386560. Throughput: 0: 12862.4. Samples: 72386101. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:18:23,332][175405] Avg episode reward: [(0, '25.230')] [2023-03-07 11:18:24,075][175731] Updated weights for policy 0, policy_version 70700 (0.0006) [2023-03-07 11:18:24,871][175731] Updated weights for policy 0, policy_version 70710 (0.0007) [2023-03-07 11:18:25,657][175731] Updated weights for policy 0, policy_version 70720 (0.0006) [2023-03-07 11:18:26,471][175731] Updated weights for policy 0, policy_version 70730 (0.0006) [2023-03-07 11:18:27,249][175731] Updated weights for policy 0, policy_version 70740 (0.0007) [2023-03-07 11:18:28,066][175731] Updated weights for policy 0, policy_version 70750 (0.0006) [2023-03-07 11:18:28,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12839.9). Total num frames: 72451072. Throughput: 0: 12867.8. Samples: 72424844. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:18:28,332][175405] Avg episode reward: [(0, '23.832')] [2023-03-07 11:18:28,842][175731] Updated weights for policy 0, policy_version 70760 (0.0007) [2023-03-07 11:18:29,640][175731] Updated weights for policy 0, policy_version 70770 (0.0007) [2023-03-07 11:18:30,441][175731] Updated weights for policy 0, policy_version 70780 (0.0005) [2023-03-07 11:18:31,243][175731] Updated weights for policy 0, policy_version 70790 (0.0007) [2023-03-07 11:18:32,047][175731] Updated weights for policy 0, policy_version 70800 (0.0006) [2023-03-07 11:18:32,851][175731] Updated weights for policy 0, policy_version 70810 (0.0007) [2023-03-07 11:18:33,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12836.4). Total num frames: 72514560. Throughput: 0: 12857.9. Samples: 72501677. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:18:33,322][175405] Avg episode reward: [(0, '24.026')] [2023-03-07 11:18:33,634][175731] Updated weights for policy 0, policy_version 70820 (0.0006) [2023-03-07 11:18:34,443][175731] Updated weights for policy 0, policy_version 70830 (0.0006) [2023-03-07 11:18:35,249][175731] Updated weights for policy 0, policy_version 70840 (0.0006) [2023-03-07 11:18:36,041][175731] Updated weights for policy 0, policy_version 70850 (0.0007) [2023-03-07 11:18:36,818][175731] Updated weights for policy 0, policy_version 70860 (0.0007) [2023-03-07 11:18:37,645][175731] Updated weights for policy 0, policy_version 70870 (0.0006) [2023-03-07 11:18:38,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12851.2, 300 sec: 12836.4). Total num frames: 72579072. Throughput: 0: 12855.3. Samples: 72578722. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:18:38,322][175405] Avg episode reward: [(0, '24.077')] [2023-03-07 11:18:38,435][175731] Updated weights for policy 0, policy_version 70880 (0.0007) [2023-03-07 11:18:39,230][175731] Updated weights for policy 0, policy_version 70890 (0.0005) [2023-03-07 11:18:40,035][175731] Updated weights for policy 0, policy_version 70900 (0.0006) [2023-03-07 11:18:40,821][175731] Updated weights for policy 0, policy_version 70910 (0.0006) [2023-03-07 11:18:41,610][175731] Updated weights for policy 0, policy_version 70920 (0.0007) [2023-03-07 11:18:42,419][175731] Updated weights for policy 0, policy_version 70930 (0.0006) [2023-03-07 11:18:43,220][175731] Updated weights for policy 0, policy_version 70940 (0.0008) [2023-03-07 11:18:43,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12868.2, 300 sec: 12836.4). Total num frames: 72643584. Throughput: 0: 12852.3. Samples: 72617261. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:18:43,322][175405] Avg episode reward: [(0, '25.878')] [2023-03-07 11:18:44,015][175731] Updated weights for policy 0, policy_version 70950 (0.0007) [2023-03-07 11:18:44,818][175731] Updated weights for policy 0, policy_version 70960 (0.0006) [2023-03-07 11:18:45,612][175731] Updated weights for policy 0, policy_version 70970 (0.0007) [2023-03-07 11:18:46,427][175731] Updated weights for policy 0, policy_version 70980 (0.0007) [2023-03-07 11:18:47,236][175731] Updated weights for policy 0, policy_version 70990 (0.0006) [2023-03-07 11:18:48,018][175731] Updated weights for policy 0, policy_version 71000 (0.0006) [2023-03-07 11:18:48,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12836.4). Total num frames: 72707072. Throughput: 0: 12843.4. Samples: 72694076. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:18:48,322][175405] Avg episode reward: [(0, '24.287')] [2023-03-07 11:18:48,795][175731] Updated weights for policy 0, policy_version 71010 (0.0007) [2023-03-07 11:18:49,599][175731] Updated weights for policy 0, policy_version 71020 (0.0008) [2023-03-07 11:18:50,398][175731] Updated weights for policy 0, policy_version 71030 (0.0006) [2023-03-07 11:18:51,189][175731] Updated weights for policy 0, policy_version 71040 (0.0007) [2023-03-07 11:18:51,988][175731] Updated weights for policy 0, policy_version 71050 (0.0007) [2023-03-07 11:18:52,780][175731] Updated weights for policy 0, policy_version 71060 (0.0006) [2023-03-07 11:18:53,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12836.4). Total num frames: 72771584. Throughput: 0: 12841.7. Samples: 72771269. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:18:53,322][175405] Avg episode reward: [(0, '24.089')] [2023-03-07 11:18:53,574][175731] Updated weights for policy 0, policy_version 71070 (0.0007) [2023-03-07 11:18:54,378][175731] Updated weights for policy 0, policy_version 71080 (0.0006) [2023-03-07 11:18:55,173][175731] Updated weights for policy 0, policy_version 71090 (0.0006) [2023-03-07 11:18:55,976][175731] Updated weights for policy 0, policy_version 71100 (0.0007) [2023-03-07 11:18:56,785][175731] Updated weights for policy 0, policy_version 71110 (0.0007) [2023-03-07 11:18:57,597][175731] Updated weights for policy 0, policy_version 71120 (0.0006) [2023-03-07 11:18:58,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12836.4). Total num frames: 72836096. Throughput: 0: 12838.5. Samples: 72809603. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:18:58,322][175405] Avg episode reward: [(0, '25.791')] [2023-03-07 11:18:58,380][175731] Updated weights for policy 0, policy_version 71130 (0.0006) [2023-03-07 11:18:59,183][175731] Updated weights for policy 0, policy_version 71140 (0.0007) [2023-03-07 11:18:59,992][175731] Updated weights for policy 0, policy_version 71150 (0.0006) [2023-03-07 11:19:00,794][175731] Updated weights for policy 0, policy_version 71160 (0.0007) [2023-03-07 11:19:01,592][175731] Updated weights for policy 0, policy_version 71170 (0.0006) [2023-03-07 11:19:02,395][175731] Updated weights for policy 0, policy_version 71180 (0.0007) [2023-03-07 11:19:03,186][175731] Updated weights for policy 0, policy_version 71190 (0.0007) [2023-03-07 11:19:03,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12834.1, 300 sec: 12836.4). Total num frames: 72899584. Throughput: 0: 12835.0. Samples: 72886546. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:19:03,322][175405] Avg episode reward: [(0, '23.270')] [2023-03-07 11:19:03,975][175731] Updated weights for policy 0, policy_version 71200 (0.0007) [2023-03-07 11:19:04,781][175731] Updated weights for policy 0, policy_version 71210 (0.0006) [2023-03-07 11:19:05,577][175731] Updated weights for policy 0, policy_version 71220 (0.0007) [2023-03-07 11:19:06,349][175731] Updated weights for policy 0, policy_version 71230 (0.0007) [2023-03-07 11:19:07,168][175731] Updated weights for policy 0, policy_version 71240 (0.0007) [2023-03-07 11:19:07,949][175731] Updated weights for policy 0, policy_version 71250 (0.0007) [2023-03-07 11:19:08,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12834.1, 300 sec: 12836.4). Total num frames: 72964096. Throughput: 0: 12833.7. Samples: 72963619. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:19:08,322][175405] Avg episode reward: [(0, '25.763')] [2023-03-07 11:19:08,752][175731] Updated weights for policy 0, policy_version 71260 (0.0006) [2023-03-07 11:19:09,557][175731] Updated weights for policy 0, policy_version 71270 (0.0007) [2023-03-07 11:19:10,354][175731] Updated weights for policy 0, policy_version 71280 (0.0007) [2023-03-07 11:19:11,167][175731] Updated weights for policy 0, policy_version 71290 (0.0006) [2023-03-07 11:19:11,984][175731] Updated weights for policy 0, policy_version 71300 (0.0007) [2023-03-07 11:19:12,779][175731] Updated weights for policy 0, policy_version 71310 (0.0006) [2023-03-07 11:19:13,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.0, 300 sec: 12833.0). Total num frames: 73027584. Throughput: 0: 12824.2. Samples: 73001932. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:19:13,322][175405] Avg episode reward: [(0, '24.663')] [2023-03-07 11:19:13,569][175731] Updated weights for policy 0, policy_version 71320 (0.0007) [2023-03-07 11:19:14,378][175731] Updated weights for policy 0, policy_version 71330 (0.0007) [2023-03-07 11:19:15,161][175731] Updated weights for policy 0, policy_version 71340 (0.0006) [2023-03-07 11:19:15,969][175731] Updated weights for policy 0, policy_version 71350 (0.0006) [2023-03-07 11:19:16,761][175731] Updated weights for policy 0, policy_version 71360 (0.0006) [2023-03-07 11:19:17,569][175731] Updated weights for policy 0, policy_version 71370 (0.0006) [2023-03-07 11:19:18,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12834.1, 300 sec: 12836.4). Total num frames: 73092096. Throughput: 0: 12826.3. Samples: 73078859. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:19:18,322][175405] Avg episode reward: [(0, '25.322')] [2023-03-07 11:19:18,355][175731] Updated weights for policy 0, policy_version 71380 (0.0007) [2023-03-07 11:19:19,134][175731] Updated weights for policy 0, policy_version 71390 (0.0007) [2023-03-07 11:19:19,931][175731] Updated weights for policy 0, policy_version 71400 (0.0007) [2023-03-07 11:19:20,734][175731] Updated weights for policy 0, policy_version 71410 (0.0007) [2023-03-07 11:19:21,536][175731] Updated weights for policy 0, policy_version 71420 (0.0006) [2023-03-07 11:19:22,330][175731] Updated weights for policy 0, policy_version 71430 (0.0006) [2023-03-07 11:19:23,118][175731] Updated weights for policy 0, policy_version 71440 (0.0006) [2023-03-07 11:19:23,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12834.1, 300 sec: 12836.4). Total num frames: 73156608. Throughput: 0: 12830.3. Samples: 73156086. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:19:23,322][175405] Avg episode reward: [(0, '24.415')] [2023-03-07 11:19:23,927][175731] Updated weights for policy 0, policy_version 71450 (0.0008) [2023-03-07 11:19:24,732][175731] Updated weights for policy 0, policy_version 71460 (0.0006) [2023-03-07 11:19:25,518][175731] Updated weights for policy 0, policy_version 71470 (0.0007) [2023-03-07 11:19:26,344][175731] Updated weights for policy 0, policy_version 71480 (0.0007) [2023-03-07 11:19:27,136][175731] Updated weights for policy 0, policy_version 71490 (0.0007) [2023-03-07 11:19:27,933][175731] Updated weights for policy 0, policy_version 71500 (0.0006) [2023-03-07 11:19:28,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12817.1, 300 sec: 12833.0). Total num frames: 73220096. Throughput: 0: 12825.2. Samples: 73194397. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:19:28,322][175405] Avg episode reward: [(0, '25.476')] [2023-03-07 11:19:28,724][175731] Updated weights for policy 0, policy_version 71510 (0.0006) [2023-03-07 11:19:29,530][175731] Updated weights for policy 0, policy_version 71520 (0.0007) [2023-03-07 11:19:30,345][175731] Updated weights for policy 0, policy_version 71530 (0.0007) [2023-03-07 11:19:31,133][175731] Updated weights for policy 0, policy_version 71540 (0.0006) [2023-03-07 11:19:31,933][175731] Updated weights for policy 0, policy_version 71550 (0.0006) [2023-03-07 11:19:32,736][175731] Updated weights for policy 0, policy_version 71560 (0.0005) [2023-03-07 11:19:33,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12834.1, 300 sec: 12836.4). Total num frames: 73284608. Throughput: 0: 12827.8. Samples: 73271329. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:19:33,322][175405] Avg episode reward: [(0, '25.872')] [2023-03-07 11:19:33,530][175731] Updated weights for policy 0, policy_version 71570 (0.0007) [2023-03-07 11:19:34,330][175731] Updated weights for policy 0, policy_version 71580 (0.0006) [2023-03-07 11:19:35,127][175731] Updated weights for policy 0, policy_version 71590 (0.0007) [2023-03-07 11:19:35,928][175731] Updated weights for policy 0, policy_version 71600 (0.0007) [2023-03-07 11:19:36,710][175731] Updated weights for policy 0, policy_version 71610 (0.0006) [2023-03-07 11:19:37,542][175731] Updated weights for policy 0, policy_version 71620 (0.0006) [2023-03-07 11:19:38,321][175405] Fps is (10 sec: 12800.2, 60 sec: 12817.1, 300 sec: 12833.0). Total num frames: 73348096. Throughput: 0: 12820.6. Samples: 73348195. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:19:38,321][175405] Avg episode reward: [(0, '25.787')] [2023-03-07 11:19:38,323][175731] Updated weights for policy 0, policy_version 71630 (0.0006) [2023-03-07 11:19:39,106][175731] Updated weights for policy 0, policy_version 71640 (0.0006) [2023-03-07 11:19:39,918][175731] Updated weights for policy 0, policy_version 71650 (0.0006) [2023-03-07 11:19:40,720][175731] Updated weights for policy 0, policy_version 71660 (0.0006) [2023-03-07 11:19:41,515][175731] Updated weights for policy 0, policy_version 71670 (0.0007) [2023-03-07 11:19:42,315][175731] Updated weights for policy 0, policy_version 71680 (0.0006) [2023-03-07 11:19:43,110][175731] Updated weights for policy 0, policy_version 71690 (0.0006) [2023-03-07 11:19:43,321][175405] Fps is (10 sec: 12800.2, 60 sec: 12817.1, 300 sec: 12833.0). Total num frames: 73412608. Throughput: 0: 12823.7. Samples: 73386670. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:19:43,321][175405] Avg episode reward: [(0, '25.422')] [2023-03-07 11:19:43,904][175731] Updated weights for policy 0, policy_version 71700 (0.0007) [2023-03-07 11:19:44,702][175731] Updated weights for policy 0, policy_version 71710 (0.0007) [2023-03-07 11:19:45,504][175731] Updated weights for policy 0, policy_version 71720 (0.0007) [2023-03-07 11:19:46,293][175731] Updated weights for policy 0, policy_version 71730 (0.0007) [2023-03-07 11:19:47,101][175731] Updated weights for policy 0, policy_version 71740 (0.0006) [2023-03-07 11:19:47,892][175731] Updated weights for policy 0, policy_version 71750 (0.0007) [2023-03-07 11:19:48,321][175405] Fps is (10 sec: 12902.1, 60 sec: 12834.1, 300 sec: 12833.0). Total num frames: 73477120. Throughput: 0: 12824.8. Samples: 73463661. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:19:48,322][175405] Avg episode reward: [(0, '25.475')] [2023-03-07 11:19:48,325][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000071755_73477120.pth... [2023-03-07 11:19:48,356][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000068747_70396928.pth [2023-03-07 11:19:48,692][175731] Updated weights for policy 0, policy_version 71760 (0.0007) [2023-03-07 11:19:49,477][175731] Updated weights for policy 0, policy_version 71770 (0.0007) [2023-03-07 11:19:50,267][175731] Updated weights for policy 0, policy_version 71780 (0.0006) [2023-03-07 11:19:51,078][175731] Updated weights for policy 0, policy_version 71790 (0.0006) [2023-03-07 11:19:51,877][175731] Updated weights for policy 0, policy_version 71800 (0.0006) [2023-03-07 11:19:52,692][175731] Updated weights for policy 0, policy_version 71810 (0.0006) [2023-03-07 11:19:53,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12834.1, 300 sec: 12836.4). Total num frames: 73541632. Throughput: 0: 12824.4. Samples: 73540717. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:19:53,322][175405] Avg episode reward: [(0, '26.480')] [2023-03-07 11:19:53,468][175731] Updated weights for policy 0, policy_version 71820 (0.0007) [2023-03-07 11:19:54,272][175731] Updated weights for policy 0, policy_version 71830 (0.0007) [2023-03-07 11:19:55,087][175731] Updated weights for policy 0, policy_version 71840 (0.0007) [2023-03-07 11:19:55,885][175731] Updated weights for policy 0, policy_version 71850 (0.0007) [2023-03-07 11:19:56,680][175731] Updated weights for policy 0, policy_version 71860 (0.0007) [2023-03-07 11:19:57,485][175731] Updated weights for policy 0, policy_version 71870 (0.0006) [2023-03-07 11:19:58,282][175731] Updated weights for policy 0, policy_version 71880 (0.0007) [2023-03-07 11:19:58,321][175405] Fps is (10 sec: 12800.2, 60 sec: 12817.1, 300 sec: 12836.4). Total num frames: 73605120. Throughput: 0: 12826.4. Samples: 73579119. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:19:58,322][175405] Avg episode reward: [(0, '26.485')] [2023-03-07 11:19:59,076][175731] Updated weights for policy 0, policy_version 71890 (0.0007) [2023-03-07 11:19:59,872][175731] Updated weights for policy 0, policy_version 71900 (0.0006) [2023-03-07 11:20:00,671][175731] Updated weights for policy 0, policy_version 71910 (0.0005) [2023-03-07 11:20:01,465][175731] Updated weights for policy 0, policy_version 71920 (0.0007) [2023-03-07 11:20:02,268][175731] Updated weights for policy 0, policy_version 71930 (0.0006) [2023-03-07 11:20:03,050][175731] Updated weights for policy 0, policy_version 71940 (0.0007) [2023-03-07 11:20:03,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12817.1, 300 sec: 12833.0). Total num frames: 73668608. Throughput: 0: 12826.2. Samples: 73656036. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:20:03,332][175405] Avg episode reward: [(0, '25.614')] [2023-03-07 11:20:03,847][175731] Updated weights for policy 0, policy_version 71950 (0.0007) [2023-03-07 11:20:04,642][175731] Updated weights for policy 0, policy_version 71960 (0.0006) [2023-03-07 11:20:05,458][175731] Updated weights for policy 0, policy_version 71970 (0.0007) [2023-03-07 11:20:06,273][175731] Updated weights for policy 0, policy_version 71980 (0.0006) [2023-03-07 11:20:07,052][175731] Updated weights for policy 0, policy_version 71990 (0.0006) [2023-03-07 11:20:07,850][175731] Updated weights for policy 0, policy_version 72000 (0.0006) [2023-03-07 11:20:08,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12834.2, 300 sec: 12839.9). Total num frames: 73734144. Throughput: 0: 12821.9. Samples: 73733070. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:20:08,332][175405] Avg episode reward: [(0, '26.262')] [2023-03-07 11:20:08,653][175731] Updated weights for policy 0, policy_version 72010 (0.0007) [2023-03-07 11:20:09,460][175731] Updated weights for policy 0, policy_version 72020 (0.0006) [2023-03-07 11:20:10,244][175731] Updated weights for policy 0, policy_version 72030 (0.0006) [2023-03-07 11:20:11,039][175731] Updated weights for policy 0, policy_version 72040 (0.0006) [2023-03-07 11:20:11,828][175731] Updated weights for policy 0, policy_version 72050 (0.0007) [2023-03-07 11:20:12,643][175731] Updated weights for policy 0, policy_version 72060 (0.0006) [2023-03-07 11:20:13,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12834.1, 300 sec: 12836.4). Total num frames: 73797632. Throughput: 0: 12825.7. Samples: 73771552. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:20:13,332][175405] Avg episode reward: [(0, '26.003')] [2023-03-07 11:20:13,444][175731] Updated weights for policy 0, policy_version 72070 (0.0007) [2023-03-07 11:20:14,213][175731] Updated weights for policy 0, policy_version 72080 (0.0006) [2023-03-07 11:20:15,031][175731] Updated weights for policy 0, policy_version 72090 (0.0007) [2023-03-07 11:20:15,833][175731] Updated weights for policy 0, policy_version 72100 (0.0006) [2023-03-07 11:20:16,639][175731] Updated weights for policy 0, policy_version 72110 (0.0006) [2023-03-07 11:20:17,441][175731] Updated weights for policy 0, policy_version 72120 (0.0006) [2023-03-07 11:20:18,236][175731] Updated weights for policy 0, policy_version 72130 (0.0007) [2023-03-07 11:20:18,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12817.1, 300 sec: 12833.0). Total num frames: 73861120. Throughput: 0: 12823.1. Samples: 73848366. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:20:18,332][175405] Avg episode reward: [(0, '26.845')] [2023-03-07 11:20:19,025][175731] Updated weights for policy 0, policy_version 72140 (0.0007) [2023-03-07 11:20:19,817][175731] Updated weights for policy 0, policy_version 72150 (0.0006) [2023-03-07 11:20:20,625][175731] Updated weights for policy 0, policy_version 72160 (0.0006) [2023-03-07 11:20:21,417][175731] Updated weights for policy 0, policy_version 72170 (0.0006) [2023-03-07 11:20:22,202][175731] Updated weights for policy 0, policy_version 72180 (0.0006) [2023-03-07 11:20:22,997][175731] Updated weights for policy 0, policy_version 72190 (0.0006) [2023-03-07 11:20:23,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12833.0). Total num frames: 73925632. Throughput: 0: 12829.7. Samples: 73925531. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:20:23,322][175405] Avg episode reward: [(0, '27.159')] [2023-03-07 11:20:23,799][175731] Updated weights for policy 0, policy_version 72200 (0.0008) [2023-03-07 11:20:24,604][175731] Updated weights for policy 0, policy_version 72210 (0.0008) [2023-03-07 11:20:25,396][175731] Updated weights for policy 0, policy_version 72220 (0.0008) [2023-03-07 11:20:26,192][175731] Updated weights for policy 0, policy_version 72230 (0.0006) [2023-03-07 11:20:26,984][175731] Updated weights for policy 0, policy_version 72240 (0.0008) [2023-03-07 11:20:27,794][175731] Updated weights for policy 0, policy_version 72250 (0.0007) [2023-03-07 11:20:28,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12834.1, 300 sec: 12833.0). Total num frames: 73990144. Throughput: 0: 12831.5. Samples: 73964091. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:20:28,322][175405] Avg episode reward: [(0, '28.954')] [2023-03-07 11:20:28,601][175731] Updated weights for policy 0, policy_version 72260 (0.0007) [2023-03-07 11:20:29,386][175731] Updated weights for policy 0, policy_version 72270 (0.0006) [2023-03-07 11:20:30,193][175731] Updated weights for policy 0, policy_version 72280 (0.0007) [2023-03-07 11:20:30,992][175731] Updated weights for policy 0, policy_version 72290 (0.0006) [2023-03-07 11:20:31,786][175731] Updated weights for policy 0, policy_version 72300 (0.0006) [2023-03-07 11:20:32,590][175731] Updated weights for policy 0, policy_version 72310 (0.0006) [2023-03-07 11:20:33,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12833.0). Total num frames: 74053632. Throughput: 0: 12828.7. Samples: 74040950. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:20:33,322][175405] Avg episode reward: [(0, '26.153')] [2023-03-07 11:20:33,382][175731] Updated weights for policy 0, policy_version 72320 (0.0006) [2023-03-07 11:20:34,199][175731] Updated weights for policy 0, policy_version 72330 (0.0007) [2023-03-07 11:20:35,003][175731] Updated weights for policy 0, policy_version 72340 (0.0006) [2023-03-07 11:20:35,798][175731] Updated weights for policy 0, policy_version 72350 (0.0007) [2023-03-07 11:20:36,589][175731] Updated weights for policy 0, policy_version 72360 (0.0007) [2023-03-07 11:20:37,418][175731] Updated weights for policy 0, policy_version 72370 (0.0006) [2023-03-07 11:20:38,202][175731] Updated weights for policy 0, policy_version 72380 (0.0006) [2023-03-07 11:20:38,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12833.0). Total num frames: 74118144. Throughput: 0: 12816.8. Samples: 74117475. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:20:38,322][175405] Avg episode reward: [(0, '27.072')] [2023-03-07 11:20:39,013][175731] Updated weights for policy 0, policy_version 72390 (0.0007) [2023-03-07 11:20:39,813][175731] Updated weights for policy 0, policy_version 72400 (0.0008) [2023-03-07 11:20:40,628][175731] Updated weights for policy 0, policy_version 72410 (0.0006) [2023-03-07 11:20:41,426][175731] Updated weights for policy 0, policy_version 72420 (0.0007) [2023-03-07 11:20:42,218][175731] Updated weights for policy 0, policy_version 72430 (0.0006) [2023-03-07 11:20:43,026][175731] Updated weights for policy 0, policy_version 72440 (0.0006) [2023-03-07 11:20:43,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12833.0). Total num frames: 74181632. Throughput: 0: 12816.2. Samples: 74155847. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 11:20:43,321][175405] Avg episode reward: [(0, '26.822')] [2023-03-07 11:20:43,820][175731] Updated weights for policy 0, policy_version 72450 (0.0006) [2023-03-07 11:20:44,598][175731] Updated weights for policy 0, policy_version 72460 (0.0006) [2023-03-07 11:20:45,403][175731] Updated weights for policy 0, policy_version 72470 (0.0007) [2023-03-07 11:20:46,210][175731] Updated weights for policy 0, policy_version 72480 (0.0006) [2023-03-07 11:20:47,009][175731] Updated weights for policy 0, policy_version 72490 (0.0006) [2023-03-07 11:20:47,829][175731] Updated weights for policy 0, policy_version 72500 (0.0006) [2023-03-07 11:20:48,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12833.0). Total num frames: 74246144. Throughput: 0: 12814.2. Samples: 74232674. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 11:20:48,322][175405] Avg episode reward: [(0, '26.723')] [2023-03-07 11:20:48,612][175731] Updated weights for policy 0, policy_version 72510 (0.0006) [2023-03-07 11:20:49,398][175731] Updated weights for policy 0, policy_version 72520 (0.0006) [2023-03-07 11:20:50,199][175731] Updated weights for policy 0, policy_version 72530 (0.0007) [2023-03-07 11:20:51,007][175731] Updated weights for policy 0, policy_version 72540 (0.0007) [2023-03-07 11:20:51,795][175731] Updated weights for policy 0, policy_version 72550 (0.0007) [2023-03-07 11:20:52,597][175731] Updated weights for policy 0, policy_version 72560 (0.0006) [2023-03-07 11:20:53,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12817.1, 300 sec: 12833.0). Total num frames: 74310656. Throughput: 0: 12816.5. Samples: 74309814. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 11:20:53,332][175405] Avg episode reward: [(0, '26.613')] [2023-03-07 11:20:53,378][175731] Updated weights for policy 0, policy_version 72570 (0.0007) [2023-03-07 11:20:54,195][175731] Updated weights for policy 0, policy_version 72580 (0.0006) [2023-03-07 11:20:54,991][175731] Updated weights for policy 0, policy_version 72590 (0.0007) [2023-03-07 11:20:55,796][175731] Updated weights for policy 0, policy_version 72600 (0.0006) [2023-03-07 11:20:56,590][175731] Updated weights for policy 0, policy_version 72610 (0.0006) [2023-03-07 11:20:57,394][175731] Updated weights for policy 0, policy_version 72620 (0.0006) [2023-03-07 11:20:58,197][175731] Updated weights for policy 0, policy_version 72630 (0.0006) [2023-03-07 11:20:58,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12829.5). Total num frames: 74374144. Throughput: 0: 12813.4. Samples: 74348154. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 11:20:58,332][175405] Avg episode reward: [(0, '25.932')] [2023-03-07 11:20:58,982][175731] Updated weights for policy 0, policy_version 72640 (0.0006) [2023-03-07 11:20:59,777][175731] Updated weights for policy 0, policy_version 72650 (0.0006) [2023-03-07 11:21:00,580][175731] Updated weights for policy 0, policy_version 72660 (0.0007) [2023-03-07 11:21:01,368][175731] Updated weights for policy 0, policy_version 72670 (0.0006) [2023-03-07 11:21:02,174][175731] Updated weights for policy 0, policy_version 72680 (0.0006) [2023-03-07 11:21:02,981][175731] Updated weights for policy 0, policy_version 72690 (0.0007) [2023-03-07 11:21:03,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12833.0). Total num frames: 74438656. Throughput: 0: 12820.0. Samples: 74425265. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 11:21:03,332][175405] Avg episode reward: [(0, '26.178')] [2023-03-07 11:21:03,790][175731] Updated weights for policy 0, policy_version 72700 (0.0007) [2023-03-07 11:21:04,585][175731] Updated weights for policy 0, policy_version 72710 (0.0006) [2023-03-07 11:21:05,368][175731] Updated weights for policy 0, policy_version 72720 (0.0006) [2023-03-07 11:21:06,179][175731] Updated weights for policy 0, policy_version 72730 (0.0006) [2023-03-07 11:21:06,983][175731] Updated weights for policy 0, policy_version 72740 (0.0006) [2023-03-07 11:21:07,778][175731] Updated weights for policy 0, policy_version 72750 (0.0007) [2023-03-07 11:21:08,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12800.0, 300 sec: 12833.0). Total num frames: 74502144. Throughput: 0: 12811.5. Samples: 74502048. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 11:21:08,332][175405] Avg episode reward: [(0, '25.026')] [2023-03-07 11:21:08,572][175731] Updated weights for policy 0, policy_version 72760 (0.0006) [2023-03-07 11:21:09,363][175731] Updated weights for policy 0, policy_version 72770 (0.0006) [2023-03-07 11:21:10,158][175731] Updated weights for policy 0, policy_version 72780 (0.0006) [2023-03-07 11:21:10,973][175731] Updated weights for policy 0, policy_version 72790 (0.0006) [2023-03-07 11:21:11,770][175731] Updated weights for policy 0, policy_version 72800 (0.0006) [2023-03-07 11:21:12,572][175731] Updated weights for policy 0, policy_version 72810 (0.0007) [2023-03-07 11:21:13,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12817.1, 300 sec: 12833.0). Total num frames: 74566656. Throughput: 0: 12806.4. Samples: 74540380. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 11:21:13,332][175405] Avg episode reward: [(0, '26.567')] [2023-03-07 11:21:13,367][175731] Updated weights for policy 0, policy_version 72820 (0.0006) [2023-03-07 11:21:14,157][175731] Updated weights for policy 0, policy_version 72830 (0.0006) [2023-03-07 11:21:14,975][175731] Updated weights for policy 0, policy_version 72840 (0.0008) [2023-03-07 11:21:15,781][175731] Updated weights for policy 0, policy_version 72850 (0.0006) [2023-03-07 11:21:16,584][175731] Updated weights for policy 0, policy_version 72860 (0.0007) [2023-03-07 11:21:17,373][175731] Updated weights for policy 0, policy_version 72870 (0.0006) [2023-03-07 11:21:18,174][175731] Updated weights for policy 0, policy_version 72880 (0.0007) [2023-03-07 11:21:18,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12833.0). Total num frames: 74630144. Throughput: 0: 12806.2. Samples: 74617227. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 11:21:18,332][175405] Avg episode reward: [(0, '25.195')] [2023-03-07 11:21:18,979][175731] Updated weights for policy 0, policy_version 72890 (0.0007) [2023-03-07 11:21:19,775][175731] Updated weights for policy 0, policy_version 72900 (0.0006) [2023-03-07 11:21:20,567][175731] Updated weights for policy 0, policy_version 72910 (0.0007) [2023-03-07 11:21:21,371][175731] Updated weights for policy 0, policy_version 72920 (0.0007) [2023-03-07 11:21:22,145][175731] Updated weights for policy 0, policy_version 72930 (0.0006) [2023-03-07 11:21:22,954][175731] Updated weights for policy 0, policy_version 72940 (0.0006) [2023-03-07 11:21:23,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12833.0). Total num frames: 74694656. Throughput: 0: 12816.2. Samples: 74694203. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 11:21:23,332][175405] Avg episode reward: [(0, '25.990')] [2023-03-07 11:21:23,761][175731] Updated weights for policy 0, policy_version 72950 (0.0006) [2023-03-07 11:21:24,541][175731] Updated weights for policy 0, policy_version 72960 (0.0006) [2023-03-07 11:21:25,342][175731] Updated weights for policy 0, policy_version 72970 (0.0006) [2023-03-07 11:21:26,144][175731] Updated weights for policy 0, policy_version 72980 (0.0006) [2023-03-07 11:21:26,946][175731] Updated weights for policy 0, policy_version 72990 (0.0005) [2023-03-07 11:21:27,752][175731] Updated weights for policy 0, policy_version 73000 (0.0007) [2023-03-07 11:21:28,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12817.1, 300 sec: 12836.4). Total num frames: 74759168. Throughput: 0: 12820.2. Samples: 74732759. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:21:28,332][175405] Avg episode reward: [(0, '26.309')] [2023-03-07 11:21:28,554][175731] Updated weights for policy 0, policy_version 73010 (0.0007) [2023-03-07 11:21:29,357][175731] Updated weights for policy 0, policy_version 73020 (0.0007) [2023-03-07 11:21:30,155][175731] Updated weights for policy 0, policy_version 73030 (0.0006) [2023-03-07 11:21:30,965][175731] Updated weights for policy 0, policy_version 73040 (0.0007) [2023-03-07 11:21:31,761][175731] Updated weights for policy 0, policy_version 73050 (0.0006) [2023-03-07 11:21:32,563][175731] Updated weights for policy 0, policy_version 73060 (0.0006) [2023-03-07 11:21:33,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12817.0, 300 sec: 12833.0). Total num frames: 74822656. Throughput: 0: 12817.4. Samples: 74809459. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:21:33,332][175405] Avg episode reward: [(0, '25.064')] [2023-03-07 11:21:33,364][175731] Updated weights for policy 0, policy_version 73070 (0.0007) [2023-03-07 11:21:34,165][175731] Updated weights for policy 0, policy_version 73080 (0.0007) [2023-03-07 11:21:34,967][175731] Updated weights for policy 0, policy_version 73090 (0.0006) [2023-03-07 11:21:35,761][175731] Updated weights for policy 0, policy_version 73100 (0.0006) [2023-03-07 11:21:36,552][175731] Updated weights for policy 0, policy_version 73110 (0.0006) [2023-03-07 11:21:37,341][175731] Updated weights for policy 0, policy_version 73120 (0.0006) [2023-03-07 11:21:38,145][175731] Updated weights for policy 0, policy_version 73130 (0.0006) [2023-03-07 11:21:38,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12817.1, 300 sec: 12833.0). Total num frames: 74887168. Throughput: 0: 12812.3. Samples: 74886371. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:21:38,332][175405] Avg episode reward: [(0, '25.155')] [2023-03-07 11:21:38,949][175731] Updated weights for policy 0, policy_version 73140 (0.0006) [2023-03-07 11:21:39,734][175731] Updated weights for policy 0, policy_version 73150 (0.0006) [2023-03-07 11:21:40,548][175731] Updated weights for policy 0, policy_version 73160 (0.0006) [2023-03-07 11:21:41,351][175731] Updated weights for policy 0, policy_version 73170 (0.0007) [2023-03-07 11:21:42,146][175731] Updated weights for policy 0, policy_version 73180 (0.0007) [2023-03-07 11:21:42,957][175731] Updated weights for policy 0, policy_version 73190 (0.0007) [2023-03-07 11:21:43,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12829.5). Total num frames: 74950656. Throughput: 0: 12815.1. Samples: 74924832. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:21:43,332][175405] Avg episode reward: [(0, '27.763')] [2023-03-07 11:21:43,765][175731] Updated weights for policy 0, policy_version 73200 (0.0006) [2023-03-07 11:21:44,545][175731] Updated weights for policy 0, policy_version 73210 (0.0006) [2023-03-07 11:21:45,327][175731] Updated weights for policy 0, policy_version 73220 (0.0006) [2023-03-07 11:21:46,130][175731] Updated weights for policy 0, policy_version 73230 (0.0007) [2023-03-07 11:21:46,914][175731] Updated weights for policy 0, policy_version 73240 (0.0007) [2023-03-07 11:21:47,716][175731] Updated weights for policy 0, policy_version 73250 (0.0007) [2023-03-07 11:21:48,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12833.0). Total num frames: 75015168. Throughput: 0: 12814.6. Samples: 75001923. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:21:48,332][175405] Avg episode reward: [(0, '26.671')] [2023-03-07 11:21:48,337][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000073257_75015168.pth... [2023-03-07 11:21:48,367][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000070251_71937024.pth [2023-03-07 11:21:48,510][175731] Updated weights for policy 0, policy_version 73260 (0.0006) [2023-03-07 11:21:49,305][175731] Updated weights for policy 0, policy_version 73270 (0.0006) [2023-03-07 11:21:50,114][175731] Updated weights for policy 0, policy_version 73280 (0.0006) [2023-03-07 11:21:50,914][175731] Updated weights for policy 0, policy_version 73290 (0.0006) [2023-03-07 11:21:51,721][175731] Updated weights for policy 0, policy_version 73300 (0.0006) [2023-03-07 11:21:52,531][175731] Updated weights for policy 0, policy_version 73310 (0.0006) [2023-03-07 11:21:53,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12800.0, 300 sec: 12829.5). Total num frames: 75078656. Throughput: 0: 12813.1. 
Samples: 75078635. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:21:53,332][175405] Avg episode reward: [(0, '26.134')] [2023-03-07 11:21:53,341][175731] Updated weights for policy 0, policy_version 73320 (0.0006) [2023-03-07 11:21:54,128][175731] Updated weights for policy 0, policy_version 73330 (0.0007) [2023-03-07 11:21:54,948][175731] Updated weights for policy 0, policy_version 73340 (0.0007) [2023-03-07 11:21:55,733][175731] Updated weights for policy 0, policy_version 73350 (0.0006) [2023-03-07 11:21:56,522][175731] Updated weights for policy 0, policy_version 73360 (0.0006) [2023-03-07 11:21:57,344][175731] Updated weights for policy 0, policy_version 73370 (0.0006) [2023-03-07 11:21:58,141][175731] Updated weights for policy 0, policy_version 73380 (0.0007) [2023-03-07 11:21:58,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12833.0). Total num frames: 75143168. Throughput: 0: 12813.5. Samples: 75116989. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:21:58,332][175405] Avg episode reward: [(0, '26.038')] [2023-03-07 11:21:58,946][175731] Updated weights for policy 0, policy_version 73390 (0.0007) [2023-03-07 11:21:59,743][175731] Updated weights for policy 0, policy_version 73400 (0.0006) [2023-03-07 11:22:00,548][175731] Updated weights for policy 0, policy_version 73410 (0.0006) [2023-03-07 11:22:01,325][175731] Updated weights for policy 0, policy_version 73420 (0.0007) [2023-03-07 11:22:02,130][175731] Updated weights for policy 0, policy_version 73430 (0.0006) [2023-03-07 11:22:02,935][175731] Updated weights for policy 0, policy_version 73440 (0.0007) [2023-03-07 11:22:03,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12800.0, 300 sec: 12829.5). Total num frames: 75206656. Throughput: 0: 12810.5. Samples: 75193702. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:22:03,322][175405] Avg episode reward: [(0, '27.079')] [2023-03-07 11:22:03,713][175731] Updated weights for policy 0, policy_version 73450 (0.0006) [2023-03-07 11:22:04,509][175731] Updated weights for policy 0, policy_version 73460 (0.0007) [2023-03-07 11:22:05,307][175731] Updated weights for policy 0, policy_version 73470 (0.0007) [2023-03-07 11:22:06,112][175731] Updated weights for policy 0, policy_version 73480 (0.0006) [2023-03-07 11:22:06,906][175731] Updated weights for policy 0, policy_version 73490 (0.0006) [2023-03-07 11:22:07,693][175731] Updated weights for policy 0, policy_version 73500 (0.0006) [2023-03-07 11:22:08,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12817.1, 300 sec: 12829.5). Total num frames: 75271168. Throughput: 0: 12815.5. Samples: 75270902. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:22:08,322][175405] Avg episode reward: [(0, '25.940')] [2023-03-07 11:22:08,508][175731] Updated weights for policy 0, policy_version 73510 (0.0006) [2023-03-07 11:22:09,293][175731] Updated weights for policy 0, policy_version 73520 (0.0006) [2023-03-07 11:22:10,101][175731] Updated weights for policy 0, policy_version 73530 (0.0007) [2023-03-07 11:22:10,899][175731] Updated weights for policy 0, policy_version 73540 (0.0006) [2023-03-07 11:22:11,701][175731] Updated weights for policy 0, policy_version 73550 (0.0005) [2023-03-07 11:22:12,493][175731] Updated weights for policy 0, policy_version 73560 (0.0006) [2023-03-07 11:22:13,294][175731] Updated weights for policy 0, policy_version 73570 (0.0007) [2023-03-07 11:22:13,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12817.1, 300 sec: 12829.5). Total num frames: 75335680. Throughput: 0: 12813.4. Samples: 75309362. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:22:13,322][175405] Avg episode reward: [(0, '25.451')] [2023-03-07 11:22:14,060][175731] Updated weights for policy 0, policy_version 73580 (0.0006) [2023-03-07 11:22:14,861][175731] Updated weights for policy 0, policy_version 73590 (0.0006) [2023-03-07 11:22:15,663][175731] Updated weights for policy 0, policy_version 73600 (0.0006) [2023-03-07 11:22:16,437][175731] Updated weights for policy 0, policy_version 73610 (0.0006) [2023-03-07 11:22:17,231][175731] Updated weights for policy 0, policy_version 73620 (0.0006) [2023-03-07 11:22:18,047][175731] Updated weights for policy 0, policy_version 73630 (0.0006) [2023-03-07 11:22:18,321][175405] Fps is (10 sec: 12902.6, 60 sec: 12834.2, 300 sec: 12829.5). Total num frames: 75400192. Throughput: 0: 12830.9. Samples: 75386848. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:22:18,321][175405] Avg episode reward: [(0, '27.519')] [2023-03-07 11:22:18,835][175731] Updated weights for policy 0, policy_version 73640 (0.0006) [2023-03-07 11:22:19,641][175731] Updated weights for policy 0, policy_version 73650 (0.0007) [2023-03-07 11:22:20,429][175731] Updated weights for policy 0, policy_version 73660 (0.0007) [2023-03-07 11:22:21,222][175731] Updated weights for policy 0, policy_version 73670 (0.0006) [2023-03-07 11:22:22,031][175731] Updated weights for policy 0, policy_version 73680 (0.0006) [2023-03-07 11:22:22,830][175731] Updated weights for policy 0, policy_version 73690 (0.0006) [2023-03-07 11:22:23,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12834.1, 300 sec: 12829.5). Total num frames: 75464704. Throughput: 0: 12836.1. Samples: 75463993. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:22:23,322][175405] Avg episode reward: [(0, '26.919')] [2023-03-07 11:22:23,613][175731] Updated weights for policy 0, policy_version 73700 (0.0006) [2023-03-07 11:22:24,394][175731] Updated weights for policy 0, policy_version 73710 (0.0006) [2023-03-07 11:22:25,208][175731] Updated weights for policy 0, policy_version 73720 (0.0006) [2023-03-07 11:22:26,006][175731] Updated weights for policy 0, policy_version 73730 (0.0007) [2023-03-07 11:22:26,800][175731] Updated weights for policy 0, policy_version 73740 (0.0006) [2023-03-07 11:22:27,589][175731] Updated weights for policy 0, policy_version 73750 (0.0006) [2023-03-07 11:22:28,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12834.1, 300 sec: 12833.0). Total num frames: 75529216. Throughput: 0: 12840.7. Samples: 75502661. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:22:28,321][175405] Avg episode reward: [(0, '26.374')] [2023-03-07 11:22:28,384][175731] Updated weights for policy 0, policy_version 73760 (0.0007) [2023-03-07 11:22:29,189][175731] Updated weights for policy 0, policy_version 73770 (0.0006) [2023-03-07 11:22:29,987][175731] Updated weights for policy 0, policy_version 73780 (0.0006) [2023-03-07 11:22:30,781][175731] Updated weights for policy 0, policy_version 73790 (0.0006) [2023-03-07 11:22:31,564][175731] Updated weights for policy 0, policy_version 73800 (0.0006) [2023-03-07 11:22:32,371][175731] Updated weights for policy 0, policy_version 73810 (0.0006) [2023-03-07 11:22:33,179][175731] Updated weights for policy 0, policy_version 73820 (0.0007) [2023-03-07 11:22:33,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12834.2, 300 sec: 12829.5). Total num frames: 75592704. Throughput: 0: 12839.3. Samples: 75579690. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:22:33,321][175405] Avg episode reward: [(0, '25.023')] [2023-03-07 11:22:33,957][175731] Updated weights for policy 0, policy_version 73830 (0.0006) [2023-03-07 11:22:34,753][175731] Updated weights for policy 0, policy_version 73840 (0.0007) [2023-03-07 11:22:35,576][175731] Updated weights for policy 0, policy_version 73850 (0.0006) [2023-03-07 11:22:36,372][175731] Updated weights for policy 0, policy_version 73860 (0.0007) [2023-03-07 11:22:37,158][175731] Updated weights for policy 0, policy_version 73870 (0.0006) [2023-03-07 11:22:37,958][175731] Updated weights for policy 0, policy_version 73880 (0.0007) [2023-03-07 11:22:38,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12834.1, 300 sec: 12833.0). Total num frames: 75657216. Throughput: 0: 12849.1. Samples: 75656846. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:22:38,322][175405] Avg episode reward: [(0, '25.206')] [2023-03-07 11:22:38,751][175731] Updated weights for policy 0, policy_version 73890 (0.0007) [2023-03-07 11:22:39,549][175731] Updated weights for policy 0, policy_version 73900 (0.0006) [2023-03-07 11:22:40,345][175731] Updated weights for policy 0, policy_version 73910 (0.0006) [2023-03-07 11:22:41,137][175731] Updated weights for policy 0, policy_version 73920 (0.0007) [2023-03-07 11:22:41,926][175731] Updated weights for policy 0, policy_version 73930 (0.0006) [2023-03-07 11:22:42,721][175731] Updated weights for policy 0, policy_version 73940 (0.0006) [2023-03-07 11:22:43,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12851.2, 300 sec: 12829.5). Total num frames: 75721728. Throughput: 0: 12852.1. Samples: 75695333. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:22:43,321][175405] Avg episode reward: [(0, '26.294')] [2023-03-07 11:22:43,515][175731] Updated weights for policy 0, policy_version 73950 (0.0007) [2023-03-07 11:22:44,328][175731] Updated weights for policy 0, policy_version 73960 (0.0006) [2023-03-07 11:22:45,112][175731] Updated weights for policy 0, policy_version 73970 (0.0007) [2023-03-07 11:22:45,914][175731] Updated weights for policy 0, policy_version 73980 (0.0006) [2023-03-07 11:22:46,716][175731] Updated weights for policy 0, policy_version 73990 (0.0006) [2023-03-07 11:22:47,530][175731] Updated weights for policy 0, policy_version 74000 (0.0007) [2023-03-07 11:22:48,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12829.5). Total num frames: 75785216. Throughput: 0: 12856.0. Samples: 75772222. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:22:48,322][175405] Avg episode reward: [(0, '27.169')] [2023-03-07 11:22:48,331][175731] Updated weights for policy 0, policy_version 74010 (0.0007) [2023-03-07 11:22:49,141][175731] Updated weights for policy 0, policy_version 74020 (0.0006) [2023-03-07 11:22:49,949][175731] Updated weights for policy 0, policy_version 74030 (0.0007) [2023-03-07 11:22:50,739][175731] Updated weights for policy 0, policy_version 74040 (0.0007) [2023-03-07 11:22:51,541][175731] Updated weights for policy 0, policy_version 74050 (0.0006) [2023-03-07 11:22:52,323][175731] Updated weights for policy 0, policy_version 74060 (0.0006) [2023-03-07 11:22:53,121][175731] Updated weights for policy 0, policy_version 74070 (0.0006) [2023-03-07 11:22:53,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12829.5). Total num frames: 75849728. Throughput: 0: 12847.5. Samples: 75849040. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:22:53,322][175405] Avg episode reward: [(0, '26.439')] [2023-03-07 11:22:53,929][175731] Updated weights for policy 0, policy_version 74080 (0.0006) [2023-03-07 11:22:54,725][175731] Updated weights for policy 0, policy_version 74090 (0.0006) [2023-03-07 11:22:55,542][175731] Updated weights for policy 0, policy_version 74100 (0.0007) [2023-03-07 11:22:56,336][175731] Updated weights for policy 0, policy_version 74110 (0.0007) [2023-03-07 11:22:57,137][175731] Updated weights for policy 0, policy_version 74120 (0.0006) [2023-03-07 11:22:57,937][175731] Updated weights for policy 0, policy_version 74130 (0.0006) [2023-03-07 11:22:58,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12826.0). Total num frames: 75913216. Throughput: 0: 12842.0. Samples: 75887250. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:22:58,322][175405] Avg episode reward: [(0, '25.347')] [2023-03-07 11:22:58,736][175731] Updated weights for policy 0, policy_version 74140 (0.0006) [2023-03-07 11:22:59,532][175731] Updated weights for policy 0, policy_version 74150 (0.0006) [2023-03-07 11:23:00,330][175731] Updated weights for policy 0, policy_version 74160 (0.0006) [2023-03-07 11:23:01,123][175731] Updated weights for policy 0, policy_version 74170 (0.0006) [2023-03-07 11:23:01,908][175731] Updated weights for policy 0, policy_version 74180 (0.0006) [2023-03-07 11:23:02,720][175731] Updated weights for policy 0, policy_version 74190 (0.0006) [2023-03-07 11:23:03,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12826.0). Total num frames: 75977728. Throughput: 0: 12833.3. Samples: 75964348. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:23:03,321][175405] Avg episode reward: [(0, '25.991')] [2023-03-07 11:23:03,508][175731] Updated weights for policy 0, policy_version 74200 (0.0006) [2023-03-07 11:23:04,308][175731] Updated weights for policy 0, policy_version 74210 (0.0006) [2023-03-07 11:23:05,097][175731] Updated weights for policy 0, policy_version 74220 (0.0007) [2023-03-07 11:23:05,905][175731] Updated weights for policy 0, policy_version 74230 (0.0007) [2023-03-07 11:23:06,721][175731] Updated weights for policy 0, policy_version 74240 (0.0007) [2023-03-07 11:23:07,504][175731] Updated weights for policy 0, policy_version 74250 (0.0007) [2023-03-07 11:23:08,298][175731] Updated weights for policy 0, policy_version 74260 (0.0006) [2023-03-07 11:23:08,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12851.2, 300 sec: 12826.0). Total num frames: 76042240. Throughput: 0: 12831.4. Samples: 76041407. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:23:08,321][175405] Avg episode reward: [(0, '25.865')] [2023-03-07 11:23:09,091][175731] Updated weights for policy 0, policy_version 74270 (0.0006) [2023-03-07 11:23:09,889][175731] Updated weights for policy 0, policy_version 74280 (0.0006) [2023-03-07 11:23:10,684][175731] Updated weights for policy 0, policy_version 74290 (0.0008) [2023-03-07 11:23:11,494][175731] Updated weights for policy 0, policy_version 74300 (0.0007) [2023-03-07 11:23:12,281][175731] Updated weights for policy 0, policy_version 74310 (0.0006) [2023-03-07 11:23:13,076][175731] Updated weights for policy 0, policy_version 74320 (0.0006) [2023-03-07 11:23:13,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12826.0). Total num frames: 76105728. Throughput: 0: 12828.6. Samples: 76079947. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:23:13,322][175405] Avg episode reward: [(0, '26.530')] [2023-03-07 11:23:13,871][175731] Updated weights for policy 0, policy_version 74330 (0.0007) [2023-03-07 11:23:14,681][175731] Updated weights for policy 0, policy_version 74340 (0.0007) [2023-03-07 11:23:15,478][175731] Updated weights for policy 0, policy_version 74350 (0.0007) [2023-03-07 11:23:16,272][175731] Updated weights for policy 0, policy_version 74360 (0.0007) [2023-03-07 11:23:17,084][175731] Updated weights for policy 0, policy_version 74370 (0.0007) [2023-03-07 11:23:17,871][175731] Updated weights for policy 0, policy_version 74380 (0.0008) [2023-03-07 11:23:18,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12834.1, 300 sec: 12826.0). Total num frames: 76170240. Throughput: 0: 12829.5. Samples: 76157017. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:23:18,321][175405] Avg episode reward: [(0, '25.832')] [2023-03-07 11:23:18,661][175731] Updated weights for policy 0, policy_version 74390 (0.0007) [2023-03-07 11:23:19,441][175731] Updated weights for policy 0, policy_version 74400 (0.0006) [2023-03-07 11:23:20,261][175731] Updated weights for policy 0, policy_version 74410 (0.0007) [2023-03-07 11:23:21,058][175731] Updated weights for policy 0, policy_version 74420 (0.0006) [2023-03-07 11:23:21,851][175731] Updated weights for policy 0, policy_version 74430 (0.0006) [2023-03-07 11:23:22,648][175731] Updated weights for policy 0, policy_version 74440 (0.0006) [2023-03-07 11:23:23,113][175680] KL-divergence is very high: 188.0089 [2023-03-07 11:23:23,273][175680] KL-divergence is very high: 2783.1697 [2023-03-07 11:23:23,321][175405] Fps is (10 sec: 12902.2, 60 sec: 12834.1, 300 sec: 12826.0). Total num frames: 76234752. Throughput: 0: 12831.3. Samples: 76234257. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:23:23,322][175405] Avg episode reward: [(0, '25.475')] [2023-03-07 11:23:23,440][175731] Updated weights for policy 0, policy_version 74450 (0.0006) [2023-03-07 11:23:23,665][175680] KL-divergence is very high: 923.6111 [2023-03-07 11:23:23,827][175680] KL-divergence is very high: 18803.7383 [2023-03-07 11:23:24,239][175731] Updated weights for policy 0, policy_version 74460 (0.0006) [2023-03-07 11:23:24,396][175680] KL-divergence is very high: 2293.4309 [2023-03-07 11:23:24,466][175680] KL-divergence is very high: 1000.9674 [2023-03-07 11:23:24,550][175680] KL-divergence is very high: 33136.6289 [2023-03-07 11:23:25,036][175731] Updated weights for policy 0, policy_version 74470 (0.0006) [2023-03-07 11:23:25,496][175680] KL-divergence is very high: 140.7480 [2023-03-07 11:23:25,824][175731] Updated weights for policy 0, policy_version 74480 (0.0006) [2023-03-07 11:23:26,135][175680] KL-divergence is very high: 19270.4297 [2023-03-07 11:23:26,295][175680] KL-divergence is very high: 3415.9155 [2023-03-07 11:23:26,621][175731] Updated weights for policy 0, policy_version 74490 (0.0006) [2023-03-07 11:23:27,438][175731] Updated weights for policy 0, policy_version 74500 (0.0007) [2023-03-07 11:23:28,232][175731] Updated weights for policy 0, policy_version 74510 (0.0006) [2023-03-07 11:23:28,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12834.1, 300 sec: 12829.5). Total num frames: 76299264. Throughput: 0: 12832.6. Samples: 76272798. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:23:28,322][175405] Avg episode reward: [(0, '25.600')] [2023-03-07 11:23:28,464][175680] KL-divergence is very high: 163426.2656 [2023-03-07 11:23:29,014][175731] Updated weights for policy 0, policy_version 74520 (0.0007) [2023-03-07 11:23:29,492][175680] KL-divergence is very high: 151639.3125 [2023-03-07 11:23:29,810][175731] Updated weights for policy 0, policy_version 74530 (0.0006) [2023-03-07 11:23:30,610][175731] Updated weights for policy 0, policy_version 74540 (0.0007) [2023-03-07 11:23:31,398][175731] Updated weights for policy 0, policy_version 74550 (0.0006) [2023-03-07 11:23:31,559][175680] KL-divergence is very high: 149.4859 [2023-03-07 11:23:32,193][175731] Updated weights for policy 0, policy_version 74560 (0.0006) [2023-03-07 11:23:32,994][175731] Updated weights for policy 0, policy_version 74570 (0.0006) [2023-03-07 11:23:33,321][175405] Fps is (10 sec: 12902.6, 60 sec: 12851.2, 300 sec: 12829.5). Total num frames: 76363776. Throughput: 0: 12839.3. Samples: 76349988. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:23:33,321][175405] Avg episode reward: [(0, '25.632')] [2023-03-07 11:23:33,765][175731] Updated weights for policy 0, policy_version 74580 (0.0007) [2023-03-07 11:23:34,577][175731] Updated weights for policy 0, policy_version 74590 (0.0007) [2023-03-07 11:23:35,400][175731] Updated weights for policy 0, policy_version 74600 (0.0006) [2023-03-07 11:23:36,180][175731] Updated weights for policy 0, policy_version 74610 (0.0006) [2023-03-07 11:23:36,978][175731] Updated weights for policy 0, policy_version 74620 (0.0007) [2023-03-07 11:23:37,780][175731] Updated weights for policy 0, policy_version 74630 (0.0006) [2023-03-07 11:23:38,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12834.1, 300 sec: 12826.0). Total num frames: 76427264. Throughput: 0: 12844.3. Samples: 76427036. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:23:38,322][175405] Avg episode reward: [(0, '26.510')] [2023-03-07 11:23:38,582][175731] Updated weights for policy 0, policy_version 74640 (0.0007) [2023-03-07 11:23:39,378][175731] Updated weights for policy 0, policy_version 74650 (0.0006) [2023-03-07 11:23:40,187][175731] Updated weights for policy 0, policy_version 74660 (0.0007) [2023-03-07 11:23:40,974][175731] Updated weights for policy 0, policy_version 74670 (0.0006) [2023-03-07 11:23:41,788][175731] Updated weights for policy 0, policy_version 74680 (0.0007) [2023-03-07 11:23:42,587][175731] Updated weights for policy 0, policy_version 74690 (0.0007) [2023-03-07 11:23:43,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12829.5). Total num frames: 76491776. Throughput: 0: 12846.3. Samples: 76465331. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:23:43,321][175405] Avg episode reward: [(0, '26.774')] [2023-03-07 11:23:43,368][175731] Updated weights for policy 0, policy_version 74700 (0.0006) [2023-03-07 11:23:44,180][175731] Updated weights for policy 0, policy_version 74710 (0.0005) [2023-03-07 11:23:44,983][175731] Updated weights for policy 0, policy_version 74720 (0.0007) [2023-03-07 11:23:45,768][175731] Updated weights for policy 0, policy_version 74730 (0.0006) [2023-03-07 11:23:46,567][175731] Updated weights for policy 0, policy_version 74740 (0.0006) [2023-03-07 11:23:47,369][175731] Updated weights for policy 0, policy_version 74750 (0.0006) [2023-03-07 11:23:48,174][175731] Updated weights for policy 0, policy_version 74760 (0.0006) [2023-03-07 11:23:48,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12826.0). Total num frames: 76555264. Throughput: 0: 12842.7. Samples: 76542272. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:23:48,322][175405] Avg episode reward: [(0, '26.085')] [2023-03-07 11:23:48,328][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000074762_76556288.pth... [2023-03-07 11:23:48,358][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000071755_73477120.pth [2023-03-07 11:23:48,965][175731] Updated weights for policy 0, policy_version 74770 (0.0006) [2023-03-07 11:23:49,784][175731] Updated weights for policy 0, policy_version 74780 (0.0006) [2023-03-07 11:23:50,551][175731] Updated weights for policy 0, policy_version 74790 (0.0007) [2023-03-07 11:23:51,357][175731] Updated weights for policy 0, policy_version 74800 (0.0006) [2023-03-07 11:23:52,160][175731] Updated weights for policy 0, policy_version 74810 (0.0006) [2023-03-07 11:23:52,955][175731] Updated weights for policy 0, policy_version 74820 (0.0007) [2023-03-07 11:23:53,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12834.1, 300 sec: 12826.0). Total num frames: 76619776. Throughput: 0: 12844.4. Samples: 76619408. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:23:53,322][175405] Avg episode reward: [(0, '27.699')] [2023-03-07 11:23:53,761][175731] Updated weights for policy 0, policy_version 74830 (0.0006) [2023-03-07 11:23:54,546][175731] Updated weights for policy 0, policy_version 74840 (0.0007) [2023-03-07 11:23:55,351][175731] Updated weights for policy 0, policy_version 74850 (0.0006) [2023-03-07 11:23:56,148][175731] Updated weights for policy 0, policy_version 74860 (0.0006) [2023-03-07 11:23:56,962][175731] Updated weights for policy 0, policy_version 74870 (0.0006) [2023-03-07 11:23:57,758][175731] Updated weights for policy 0, policy_version 74880 (0.0006) [2023-03-07 11:23:58,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12851.2, 300 sec: 12829.5). Total num frames: 76684288. Throughput: 0: 12843.3. 
Samples: 76657895. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:23:58,322][175405] Avg episode reward: [(0, '26.358')] [2023-03-07 11:23:58,549][175731] Updated weights for policy 0, policy_version 74890 (0.0006) [2023-03-07 11:23:59,355][175731] Updated weights for policy 0, policy_version 74900 (0.0007) [2023-03-07 11:24:00,163][175731] Updated weights for policy 0, policy_version 74910 (0.0007) [2023-03-07 11:24:00,957][175731] Updated weights for policy 0, policy_version 74920 (0.0007) [2023-03-07 11:24:01,741][175731] Updated weights for policy 0, policy_version 74930 (0.0007) [2023-03-07 11:24:02,557][175731] Updated weights for policy 0, policy_version 74940 (0.0006) [2023-03-07 11:24:03,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12826.0). Total num frames: 76747776. Throughput: 0: 12836.1. Samples: 76734642. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:24:03,322][175405] Avg episode reward: [(0, '25.507')] [2023-03-07 11:24:03,339][175731] Updated weights for policy 0, policy_version 74950 (0.0006) [2023-03-07 11:24:04,128][175731] Updated weights for policy 0, policy_version 74960 (0.0007) [2023-03-07 11:24:04,937][175731] Updated weights for policy 0, policy_version 74970 (0.0007) [2023-03-07 11:24:05,725][175731] Updated weights for policy 0, policy_version 74980 (0.0007) [2023-03-07 11:24:06,516][175731] Updated weights for policy 0, policy_version 74990 (0.0007) [2023-03-07 11:24:07,316][175731] Updated weights for policy 0, policy_version 75000 (0.0006) [2023-03-07 11:24:08,130][175731] Updated weights for policy 0, policy_version 75010 (0.0006) [2023-03-07 11:24:08,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12829.5). Total num frames: 76812288. Throughput: 0: 12831.8. Samples: 76811687. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:24:08,322][175405] Avg episode reward: [(0, '25.997')] [2023-03-07 11:24:08,918][175731] Updated weights for policy 0, policy_version 75020 (0.0006) [2023-03-07 11:24:09,713][175731] Updated weights for policy 0, policy_version 75030 (0.0007) [2023-03-07 11:24:10,518][175731] Updated weights for policy 0, policy_version 75040 (0.0006) [2023-03-07 11:24:11,310][175731] Updated weights for policy 0, policy_version 75050 (0.0006) [2023-03-07 11:24:12,106][175731] Updated weights for policy 0, policy_version 75060 (0.0006) [2023-03-07 11:24:12,912][175731] Updated weights for policy 0, policy_version 75070 (0.0007) [2023-03-07 11:24:13,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12851.2, 300 sec: 12829.5). Total num frames: 76876800. Throughput: 0: 12834.0. Samples: 76850328. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:24:13,322][175405] Avg episode reward: [(0, '25.121')] [2023-03-07 11:24:13,706][175731] Updated weights for policy 0, policy_version 75080 (0.0006) [2023-03-07 11:24:14,510][175731] Updated weights for policy 0, policy_version 75090 (0.0006) [2023-03-07 11:24:15,303][175731] Updated weights for policy 0, policy_version 75100 (0.0006) [2023-03-07 11:24:16,106][175731] Updated weights for policy 0, policy_version 75110 (0.0007) [2023-03-07 11:24:16,901][175731] Updated weights for policy 0, policy_version 75120 (0.0006) [2023-03-07 11:24:17,737][175731] Updated weights for policy 0, policy_version 75130 (0.0006) [2023-03-07 11:24:18,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12834.2, 300 sec: 12826.0). Total num frames: 76940288. Throughput: 0: 12824.2. Samples: 76927077. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:24:18,321][175405] Avg episode reward: [(0, '25.643')] [2023-03-07 11:24:18,534][175731] Updated weights for policy 0, policy_version 75140 (0.0006) [2023-03-07 11:24:19,327][175731] Updated weights for policy 0, policy_version 75150 (0.0006) [2023-03-07 11:24:20,131][175731] Updated weights for policy 0, policy_version 75160 (0.0005) [2023-03-07 11:24:20,940][175731] Updated weights for policy 0, policy_version 75170 (0.0006) [2023-03-07 11:24:21,744][175731] Updated weights for policy 0, policy_version 75180 (0.0008) [2023-03-07 11:24:22,543][175731] Updated weights for policy 0, policy_version 75190 (0.0007) [2023-03-07 11:24:23,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12817.1, 300 sec: 12826.0). Total num frames: 77003776. Throughput: 0: 12814.5. Samples: 77003688. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:24:23,321][175405] Avg episode reward: [(0, '25.289')] [2023-03-07 11:24:23,331][175731] Updated weights for policy 0, policy_version 75200 (0.0006) [2023-03-07 11:24:24,124][175731] Updated weights for policy 0, policy_version 75210 (0.0006) [2023-03-07 11:24:24,349][175680] KL-divergence is very high: 186502.8281 [2023-03-07 11:24:24,659][175680] KL-divergence is very high: 37429.8906 [2023-03-07 11:24:24,829][175680] KL-divergence is very high: 609.9697 [2023-03-07 11:24:24,930][175731] Updated weights for policy 0, policy_version 75220 (0.0006) [2023-03-07 11:24:25,083][175680] KL-divergence is very high: 245.9757 [2023-03-07 11:24:25,724][175731] Updated weights for policy 0, policy_version 75230 (0.0006) [2023-03-07 11:24:26,539][175731] Updated weights for policy 0, policy_version 75240 (0.0006) [2023-03-07 11:24:27,337][175731] Updated weights for policy 0, policy_version 75250 (0.0006) [2023-03-07 11:24:28,121][175731] Updated weights for policy 0, policy_version 75260 (0.0006) [2023-03-07 11:24:28,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12817.1, 300 sec: 
12826.0). Total num frames: 77068288. Throughput: 0: 12814.9. Samples: 77042003. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:24:28,322][175405] Avg episode reward: [(0, '29.099')] [2023-03-07 11:24:28,922][175731] Updated weights for policy 0, policy_version 75270 (0.0007) [2023-03-07 11:24:29,696][175731] Updated weights for policy 0, policy_version 75280 (0.0006) [2023-03-07 11:24:30,502][175731] Updated weights for policy 0, policy_version 75290 (0.0007) [2023-03-07 11:24:31,304][175731] Updated weights for policy 0, policy_version 75300 (0.0006) [2023-03-07 11:24:32,096][175731] Updated weights for policy 0, policy_version 75310 (0.0006) [2023-03-07 11:24:32,917][175731] Updated weights for policy 0, policy_version 75320 (0.0007) [2023-03-07 11:24:33,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12817.1, 300 sec: 12829.5). Total num frames: 77132800. Throughput: 0: 12822.5. Samples: 77119280. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:24:33,321][175405] Avg episode reward: [(0, '26.107')] [2023-03-07 11:24:33,712][175731] Updated weights for policy 0, policy_version 75330 (0.0007) [2023-03-07 11:24:34,521][175731] Updated weights for policy 0, policy_version 75340 (0.0006) [2023-03-07 11:24:35,309][175731] Updated weights for policy 0, policy_version 75350 (0.0006) [2023-03-07 11:24:36,114][175731] Updated weights for policy 0, policy_version 75360 (0.0008) [2023-03-07 11:24:36,907][175731] Updated weights for policy 0, policy_version 75370 (0.0007) [2023-03-07 11:24:37,697][175731] Updated weights for policy 0, policy_version 75380 (0.0007) [2023-03-07 11:24:38,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12826.0). Total num frames: 77196288. Throughput: 0: 12813.0. Samples: 77195993. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:24:38,322][175405] Avg episode reward: [(0, '27.409')] [2023-03-07 11:24:38,503][175731] Updated weights for policy 0, policy_version 75390 (0.0006) [2023-03-07 11:24:39,301][175731] Updated weights for policy 0, policy_version 75400 (0.0007) [2023-03-07 11:24:40,091][175731] Updated weights for policy 0, policy_version 75410 (0.0006) [2023-03-07 11:24:40,906][175731] Updated weights for policy 0, policy_version 75420 (0.0006) [2023-03-07 11:24:41,693][175731] Updated weights for policy 0, policy_version 75430 (0.0006) [2023-03-07 11:24:42,482][175731] Updated weights for policy 0, policy_version 75440 (0.0007) [2023-03-07 11:24:43,290][175731] Updated weights for policy 0, policy_version 75450 (0.0007) [2023-03-07 11:24:43,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12817.1, 300 sec: 12826.0). Total num frames: 77260800. Throughput: 0: 12812.9. Samples: 77234476. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:24:43,321][175405] Avg episode reward: [(0, '26.901')] [2023-03-07 11:24:44,087][175731] Updated weights for policy 0, policy_version 75460 (0.0007) [2023-03-07 11:24:44,875][175731] Updated weights for policy 0, policy_version 75470 (0.0006) [2023-03-07 11:24:45,664][175731] Updated weights for policy 0, policy_version 75480 (0.0006) [2023-03-07 11:24:46,473][175731] Updated weights for policy 0, policy_version 75490 (0.0006) [2023-03-07 11:24:47,255][175731] Updated weights for policy 0, policy_version 75500 (0.0006) [2023-03-07 11:24:48,061][175731] Updated weights for policy 0, policy_version 75510 (0.0006) [2023-03-07 11:24:48,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12834.1, 300 sec: 12826.0). Total num frames: 77325312. Throughput: 0: 12825.5. Samples: 77311791. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:24:48,322][175405] Avg episode reward: [(0, '25.225')] [2023-03-07 11:24:48,850][175731] Updated weights for policy 0, policy_version 75520 (0.0006) [2023-03-07 11:24:49,666][175731] Updated weights for policy 0, policy_version 75530 (0.0006) [2023-03-07 11:24:50,449][175731] Updated weights for policy 0, policy_version 75540 (0.0007) [2023-03-07 11:24:51,257][175731] Updated weights for policy 0, policy_version 75550 (0.0006) [2023-03-07 11:24:52,062][175731] Updated weights for policy 0, policy_version 75560 (0.0007) [2023-03-07 11:24:52,862][175731] Updated weights for policy 0, policy_version 75570 (0.0006) [2023-03-07 11:24:53,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12817.1, 300 sec: 12826.0). Total num frames: 77388800. Throughput: 0: 12819.9. Samples: 77388585. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:24:53,322][175405] Avg episode reward: [(0, '25.778')] [2023-03-07 11:24:53,646][175731] Updated weights for policy 0, policy_version 75580 (0.0006) [2023-03-07 11:24:54,453][175731] Updated weights for policy 0, policy_version 75590 (0.0006) [2023-03-07 11:24:55,273][175731] Updated weights for policy 0, policy_version 75600 (0.0007) [2023-03-07 11:24:56,076][175731] Updated weights for policy 0, policy_version 75610 (0.0007) [2023-03-07 11:24:56,861][175731] Updated weights for policy 0, policy_version 75620 (0.0006) [2023-03-07 11:24:57,673][175731] Updated weights for policy 0, policy_version 75630 (0.0006) [2023-03-07 11:24:58,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12817.0, 300 sec: 12829.5). Total num frames: 77453312. Throughput: 0: 12813.3. Samples: 77426928. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:24:58,322][175405] Avg episode reward: [(0, '26.455')] [2023-03-07 11:24:58,471][175731] Updated weights for policy 0, policy_version 75640 (0.0007) [2023-03-07 11:24:59,273][175731] Updated weights for policy 0, policy_version 75650 (0.0007) [2023-03-07 11:25:00,066][175731] Updated weights for policy 0, policy_version 75660 (0.0007) [2023-03-07 11:25:00,863][175731] Updated weights for policy 0, policy_version 75670 (0.0007) [2023-03-07 11:25:01,666][175731] Updated weights for policy 0, policy_version 75680 (0.0006) [2023-03-07 11:25:02,478][175731] Updated weights for policy 0, policy_version 75690 (0.0007) [2023-03-07 11:25:03,263][175731] Updated weights for policy 0, policy_version 75700 (0.0006) [2023-03-07 11:25:03,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12822.6). Total num frames: 77516800. Throughput: 0: 12815.6. Samples: 77503779. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:25:03,321][175405] Avg episode reward: [(0, '26.471')] [2023-03-07 11:25:04,070][175731] Updated weights for policy 0, policy_version 75710 (0.0008) [2023-03-07 11:25:04,845][175731] Updated weights for policy 0, policy_version 75720 (0.0006) [2023-03-07 11:25:05,653][175731] Updated weights for policy 0, policy_version 75730 (0.0007) [2023-03-07 11:25:06,462][175731] Updated weights for policy 0, policy_version 75740 (0.0006) [2023-03-07 11:25:07,258][175731] Updated weights for policy 0, policy_version 75750 (0.0007) [2023-03-07 11:25:08,033][175731] Updated weights for policy 0, policy_version 75760 (0.0007) [2023-03-07 11:25:08,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12826.0). Total num frames: 77581312. Throughput: 0: 12825.0. Samples: 77580816. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:25:08,332][175405] Avg episode reward: [(0, '26.658')] [2023-03-07 11:25:08,846][175731] Updated weights for policy 0, policy_version 75770 (0.0006) [2023-03-07 11:25:09,644][175731] Updated weights for policy 0, policy_version 75780 (0.0008) [2023-03-07 11:25:10,451][175731] Updated weights for policy 0, policy_version 75790 (0.0005) [2023-03-07 11:25:11,248][175731] Updated weights for policy 0, policy_version 75800 (0.0006) [2023-03-07 11:25:12,034][175731] Updated weights for policy 0, policy_version 75810 (0.0007) [2023-03-07 11:25:12,833][175731] Updated weights for policy 0, policy_version 75820 (0.0006) [2023-03-07 11:25:13,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12817.1, 300 sec: 12829.5). Total num frames: 77645824. Throughput: 0: 12827.9. Samples: 77619256. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:25:13,332][175405] Avg episode reward: [(0, '27.989')] [2023-03-07 11:25:13,627][175731] Updated weights for policy 0, policy_version 75830 (0.0007) [2023-03-07 11:25:14,441][175731] Updated weights for policy 0, policy_version 75840 (0.0007) [2023-03-07 11:25:15,231][175731] Updated weights for policy 0, policy_version 75850 (0.0006) [2023-03-07 11:25:16,022][175731] Updated weights for policy 0, policy_version 75860 (0.0006) [2023-03-07 11:25:16,831][175731] Updated weights for policy 0, policy_version 75870 (0.0006) [2023-03-07 11:25:17,623][175731] Updated weights for policy 0, policy_version 75880 (0.0007) [2023-03-07 11:25:18,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12826.0). Total num frames: 77709312. Throughput: 0: 12819.7. Samples: 77696169. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:25:18,332][175405] Avg episode reward: [(0, '26.896')] [2023-03-07 11:25:18,435][175731] Updated weights for policy 0, policy_version 75890 (0.0006) [2023-03-07 11:25:19,236][175731] Updated weights for policy 0, policy_version 75900 (0.0006) [2023-03-07 11:25:20,027][175731] Updated weights for policy 0, policy_version 75910 (0.0007) [2023-03-07 11:25:20,831][175731] Updated weights for policy 0, policy_version 75920 (0.0007) [2023-03-07 11:25:21,632][175731] Updated weights for policy 0, policy_version 75930 (0.0007) [2023-03-07 11:25:22,431][175731] Updated weights for policy 0, policy_version 75940 (0.0006) [2023-03-07 11:25:23,231][175731] Updated weights for policy 0, policy_version 75950 (0.0006) [2023-03-07 11:25:23,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12826.0). Total num frames: 77773824. Throughput: 0: 12821.7. Samples: 77772971. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:25:23,322][175405] Avg episode reward: [(0, '26.592')] [2023-03-07 11:25:24,029][175731] Updated weights for policy 0, policy_version 75960 (0.0007) [2023-03-07 11:25:24,830][175731] Updated weights for policy 0, policy_version 75970 (0.0007) [2023-03-07 11:25:25,633][175731] Updated weights for policy 0, policy_version 75980 (0.0006) [2023-03-07 11:25:26,400][175731] Updated weights for policy 0, policy_version 75990 (0.0006) [2023-03-07 11:25:27,225][175731] Updated weights for policy 0, policy_version 76000 (0.0006) [2023-03-07 11:25:28,020][175731] Updated weights for policy 0, policy_version 76010 (0.0007) [2023-03-07 11:25:28,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12826.0). Total num frames: 77837312. Throughput: 0: 12825.8. Samples: 77811637. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:25:28,332][175405] Avg episode reward: [(0, '27.375')] [2023-03-07 11:25:28,815][175731] Updated weights for policy 0, policy_version 76020 (0.0007) [2023-03-07 11:25:29,611][175731] Updated weights for policy 0, policy_version 76030 (0.0006) [2023-03-07 11:25:30,422][175731] Updated weights for policy 0, policy_version 76040 (0.0006) [2023-03-07 11:25:31,212][175731] Updated weights for policy 0, policy_version 76050 (0.0007) [2023-03-07 11:25:32,008][175731] Updated weights for policy 0, policy_version 76060 (0.0006) [2023-03-07 11:25:32,793][175731] Updated weights for policy 0, policy_version 76070 (0.0007) [2023-03-07 11:25:33,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12817.0, 300 sec: 12826.0). Total num frames: 77901824. Throughput: 0: 12815.4. Samples: 77888487. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:25:33,332][175405] Avg episode reward: [(0, '25.903')] [2023-03-07 11:25:33,586][175731] Updated weights for policy 0, policy_version 76080 (0.0007) [2023-03-07 11:25:34,394][175731] Updated weights for policy 0, policy_version 76090 (0.0006) [2023-03-07 11:25:35,205][175731] Updated weights for policy 0, policy_version 76100 (0.0006) [2023-03-07 11:25:35,997][175731] Updated weights for policy 0, policy_version 76110 (0.0007) [2023-03-07 11:25:36,814][175731] Updated weights for policy 0, policy_version 76120 (0.0006) [2023-03-07 11:25:37,593][175731] Updated weights for policy 0, policy_version 76130 (0.0006) [2023-03-07 11:25:38,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12817.1, 300 sec: 12826.0). Total num frames: 77965312. Throughput: 0: 12818.6. Samples: 77965423. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:25:38,332][175405] Avg episode reward: [(0, '27.746')] [2023-03-07 11:25:38,389][175731] Updated weights for policy 0, policy_version 76140 (0.0007) [2023-03-07 11:25:39,197][175731] Updated weights for policy 0, policy_version 76150 (0.0007) [2023-03-07 11:25:40,013][175731] Updated weights for policy 0, policy_version 76160 (0.0007) [2023-03-07 11:25:40,798][175731] Updated weights for policy 0, policy_version 76170 (0.0007) [2023-03-07 11:25:41,599][175731] Updated weights for policy 0, policy_version 76180 (0.0006) [2023-03-07 11:25:42,396][175731] Updated weights for policy 0, policy_version 76190 (0.0007) [2023-03-07 11:25:43,202][175731] Updated weights for policy 0, policy_version 76200 (0.0007) [2023-03-07 11:25:43,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12826.0). Total num frames: 78029824. Throughput: 0: 12817.8. Samples: 78003726. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:25:43,332][175405] Avg episode reward: [(0, '26.539')] [2023-03-07 11:25:44,006][175731] Updated weights for policy 0, policy_version 76210 (0.0007) [2023-03-07 11:25:44,791][175731] Updated weights for policy 0, policy_version 76220 (0.0006) [2023-03-07 11:25:45,591][175731] Updated weights for policy 0, policy_version 76230 (0.0007) [2023-03-07 11:25:46,398][175731] Updated weights for policy 0, policy_version 76240 (0.0006) [2023-03-07 11:25:47,193][175731] Updated weights for policy 0, policy_version 76250 (0.0006) [2023-03-07 11:25:47,981][175731] Updated weights for policy 0, policy_version 76260 (0.0006) [2023-03-07 11:25:48,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12817.1, 300 sec: 12826.0). Total num frames: 78094336. Throughput: 0: 12818.9. Samples: 78080630. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:25:48,322][175405] Avg episode reward: [(0, '26.939')] [2023-03-07 11:25:48,325][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000076264_78094336.pth... [2023-03-07 11:25:48,356][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000073257_75015168.pth [2023-03-07 11:25:48,786][175731] Updated weights for policy 0, policy_version 76270 (0.0006) [2023-03-07 11:25:49,583][175731] Updated weights for policy 0, policy_version 76280 (0.0007) [2023-03-07 11:25:50,392][175731] Updated weights for policy 0, policy_version 76290 (0.0006) [2023-03-07 11:25:51,182][175731] Updated weights for policy 0, policy_version 76300 (0.0006) [2023-03-07 11:25:51,974][175731] Updated weights for policy 0, policy_version 76310 (0.0008) [2023-03-07 11:25:52,785][175731] Updated weights for policy 0, policy_version 76320 (0.0007) [2023-03-07 11:25:53,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12826.0). Total num frames: 78157824. Throughput: 0: 12814.1. Samples: 78157451. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:25:53,322][175405] Avg episode reward: [(0, '29.510')] [2023-03-07 11:25:53,586][175731] Updated weights for policy 0, policy_version 76330 (0.0006) [2023-03-07 11:25:54,381][175731] Updated weights for policy 0, policy_version 76340 (0.0007) [2023-03-07 11:25:55,177][175731] Updated weights for policy 0, policy_version 76350 (0.0006) [2023-03-07 11:25:55,968][175731] Updated weights for policy 0, policy_version 76360 (0.0007) [2023-03-07 11:25:56,765][175731] Updated weights for policy 0, policy_version 76370 (0.0006) [2023-03-07 11:25:57,554][175731] Updated weights for policy 0, policy_version 76380 (0.0007) [2023-03-07 11:25:58,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12826.0). Total num frames: 78222336. Throughput: 0: 12816.4. 
Samples: 78195995. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:25:58,321][175405] Avg episode reward: [(0, '25.653')] [2023-03-07 11:25:58,370][175731] Updated weights for policy 0, policy_version 76390 (0.0006) [2023-03-07 11:25:59,173][175731] Updated weights for policy 0, policy_version 76400 (0.0006) [2023-03-07 11:25:59,981][175731] Updated weights for policy 0, policy_version 76410 (0.0006) [2023-03-07 11:26:00,769][175731] Updated weights for policy 0, policy_version 76420 (0.0007) [2023-03-07 11:26:01,580][175731] Updated weights for policy 0, policy_version 76430 (0.0007) [2023-03-07 11:26:02,374][175731] Updated weights for policy 0, policy_version 76440 (0.0007) [2023-03-07 11:26:03,179][175731] Updated weights for policy 0, policy_version 76450 (0.0006) [2023-03-07 11:26:03,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12817.0, 300 sec: 12826.0). Total num frames: 78285824. Throughput: 0: 12812.7. Samples: 78272742. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:26:03,322][175405] Avg episode reward: [(0, '28.163')] [2023-03-07 11:26:03,982][175731] Updated weights for policy 0, policy_version 76460 (0.0006) [2023-03-07 11:26:04,771][175731] Updated weights for policy 0, policy_version 76470 (0.0007) [2023-03-07 11:26:05,571][175731] Updated weights for policy 0, policy_version 76480 (0.0006) [2023-03-07 11:26:06,386][175731] Updated weights for policy 0, policy_version 76490 (0.0006) [2023-03-07 11:26:07,182][175731] Updated weights for policy 0, policy_version 76500 (0.0006) [2023-03-07 11:26:07,993][175731] Updated weights for policy 0, policy_version 76510 (0.0006) [2023-03-07 11:26:08,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12826.0). Total num frames: 78350336. Throughput: 0: 12808.8. Samples: 78349367. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:26:08,321][175405] Avg episode reward: [(0, '25.519')] [2023-03-07 11:26:08,787][175731] Updated weights for policy 0, policy_version 76520 (0.0007) [2023-03-07 11:26:09,581][175731] Updated weights for policy 0, policy_version 76530 (0.0006) [2023-03-07 11:26:10,392][175731] Updated weights for policy 0, policy_version 76540 (0.0007) [2023-03-07 11:26:11,202][175731] Updated weights for policy 0, policy_version 76550 (0.0007) [2023-03-07 11:26:11,987][175731] Updated weights for policy 0, policy_version 76560 (0.0006) [2023-03-07 11:26:12,790][175731] Updated weights for policy 0, policy_version 76570 (0.0007) [2023-03-07 11:26:13,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12800.0, 300 sec: 12826.0). Total num frames: 78413824. Throughput: 0: 12803.6. Samples: 78387797. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:26:13,321][175405] Avg episode reward: [(0, '24.658')] [2023-03-07 11:26:13,581][175731] Updated weights for policy 0, policy_version 76580 (0.0007) [2023-03-07 11:26:14,375][175731] Updated weights for policy 0, policy_version 76590 (0.0007) [2023-03-07 11:26:15,178][175731] Updated weights for policy 0, policy_version 76600 (0.0006) [2023-03-07 11:26:15,989][175731] Updated weights for policy 0, policy_version 76610 (0.0007) [2023-03-07 11:26:16,786][175731] Updated weights for policy 0, policy_version 76620 (0.0006) [2023-03-07 11:26:17,591][175731] Updated weights for policy 0, policy_version 76630 (0.0006) [2023-03-07 11:26:18,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12826.0). Total num frames: 78478336. Throughput: 0: 12804.5. Samples: 78464687. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:26:18,322][175405] Avg episode reward: [(0, '27.676')] [2023-03-07 11:26:18,397][175731] Updated weights for policy 0, policy_version 76640 (0.0007) [2023-03-07 11:26:19,185][175731] Updated weights for policy 0, policy_version 76650 (0.0006) [2023-03-07 11:26:19,988][175731] Updated weights for policy 0, policy_version 76660 (0.0006) [2023-03-07 11:26:20,783][175731] Updated weights for policy 0, policy_version 76670 (0.0005) [2023-03-07 11:26:21,574][175731] Updated weights for policy 0, policy_version 76680 (0.0006) [2023-03-07 11:26:22,366][175731] Updated weights for policy 0, policy_version 76690 (0.0006) [2023-03-07 11:26:23,174][175731] Updated weights for policy 0, policy_version 76700 (0.0006) [2023-03-07 11:26:23,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12822.6). Total num frames: 78541824. Throughput: 0: 12804.1. Samples: 78541606. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:26:23,322][175405] Avg episode reward: [(0, '25.614')] [2023-03-07 11:26:23,971][175731] Updated weights for policy 0, policy_version 76710 (0.0006) [2023-03-07 11:26:24,765][175731] Updated weights for policy 0, policy_version 76720 (0.0006) [2023-03-07 11:26:25,568][175731] Updated weights for policy 0, policy_version 76730 (0.0006) [2023-03-07 11:26:26,369][175731] Updated weights for policy 0, policy_version 76740 (0.0007) [2023-03-07 11:26:27,170][175731] Updated weights for policy 0, policy_version 76750 (0.0006) [2023-03-07 11:26:27,954][175731] Updated weights for policy 0, policy_version 76760 (0.0007) [2023-03-07 11:26:28,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12826.0). Total num frames: 78606336. Throughput: 0: 12808.5. Samples: 78580106. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:26:28,322][175405] Avg episode reward: [(0, '26.143')] [2023-03-07 11:26:28,770][175731] Updated weights for policy 0, policy_version 76770 (0.0008) [2023-03-07 11:26:29,573][175731] Updated weights for policy 0, policy_version 76780 (0.0007) [2023-03-07 11:26:30,350][175731] Updated weights for policy 0, policy_version 76790 (0.0007) [2023-03-07 11:26:31,161][175731] Updated weights for policy 0, policy_version 76800 (0.0007) [2023-03-07 11:26:31,978][175731] Updated weights for policy 0, policy_version 76810 (0.0006) [2023-03-07 11:26:32,130][175680] KL-divergence is very high: 22137614336.0000 [2023-03-07 11:26:32,755][175731] Updated weights for policy 0, policy_version 76820 (0.0007) [2023-03-07 11:26:33,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12817.1, 300 sec: 12826.0). Total num frames: 78670848. Throughput: 0: 12808.1. Samples: 78656992. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:26:33,332][175405] Avg episode reward: [(0, '52.406')] [2023-03-07 11:26:33,552][175731] Updated weights for policy 0, policy_version 76830 (0.0006) [2023-03-07 11:26:34,346][175731] Updated weights for policy 0, policy_version 76840 (0.0006) [2023-03-07 11:26:35,142][175731] Updated weights for policy 0, policy_version 76850 (0.0007) [2023-03-07 11:26:35,946][175731] Updated weights for policy 0, policy_version 76860 (0.0006) [2023-03-07 11:26:36,750][175731] Updated weights for policy 0, policy_version 76870 (0.0007) [2023-03-07 11:26:37,574][175731] Updated weights for policy 0, policy_version 76880 (0.0006) [2023-03-07 11:26:38,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12817.1, 300 sec: 12826.0). Total num frames: 78734336. Throughput: 0: 12808.6. Samples: 78733838. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:26:38,332][175405] Avg episode reward: [(0, '25.493')] [2023-03-07 11:26:38,372][175731] Updated weights for policy 0, policy_version 76890 (0.0008) [2023-03-07 11:26:39,153][175731] Updated weights for policy 0, policy_version 76900 (0.0006) [2023-03-07 11:26:39,961][175731] Updated weights for policy 0, policy_version 76910 (0.0008) [2023-03-07 11:26:40,753][175731] Updated weights for policy 0, policy_version 76920 (0.0006) [2023-03-07 11:26:41,537][175731] Updated weights for policy 0, policy_version 76930 (0.0006) [2023-03-07 11:26:42,341][175731] Updated weights for policy 0, policy_version 76940 (0.0007) [2023-03-07 11:26:43,137][175731] Updated weights for policy 0, policy_version 76950 (0.0006) [2023-03-07 11:26:43,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12826.0). Total num frames: 78798848. Throughput: 0: 12810.5. Samples: 78772469. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:26:43,331][175405] Avg episode reward: [(0, '25.637')] [2023-03-07 11:26:43,942][175731] Updated weights for policy 0, policy_version 76960 (0.0006) [2023-03-07 11:26:44,721][175731] Updated weights for policy 0, policy_version 76970 (0.0006) [2023-03-07 11:26:45,514][175731] Updated weights for policy 0, policy_version 76980 (0.0006) [2023-03-07 11:26:46,307][175731] Updated weights for policy 0, policy_version 76990 (0.0006) [2023-03-07 11:26:47,105][175731] Updated weights for policy 0, policy_version 77000 (0.0006) [2023-03-07 11:26:47,927][175731] Updated weights for policy 0, policy_version 77010 (0.0007) [2023-03-07 11:26:48,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12817.0, 300 sec: 12829.5). Total num frames: 78863360. Throughput: 0: 12823.8. Samples: 78849814. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:26:48,322][175405] Avg episode reward: [(0, '24.989')] [2023-03-07 11:26:48,719][175731] Updated weights for policy 0, policy_version 77020 (0.0006) [2023-03-07 11:26:49,503][175731] Updated weights for policy 0, policy_version 77030 (0.0006) [2023-03-07 11:26:50,296][175731] Updated weights for policy 0, policy_version 77040 (0.0006) [2023-03-07 11:26:51,079][175731] Updated weights for policy 0, policy_version 77050 (0.0006) [2023-03-07 11:26:51,865][175731] Updated weights for policy 0, policy_version 77060 (0.0007) [2023-03-07 11:26:52,683][175731] Updated weights for policy 0, policy_version 77070 (0.0006) [2023-03-07 11:26:53,321][175405] Fps is (10 sec: 12901.8, 60 sec: 12834.0, 300 sec: 12829.5). Total num frames: 78927872. Throughput: 0: 12834.6. Samples: 78926931. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:26:53,332][175405] Avg episode reward: [(0, '26.065')] [2023-03-07 11:26:53,478][175731] Updated weights for policy 0, policy_version 77080 (0.0007) [2023-03-07 11:26:54,269][175731] Updated weights for policy 0, policy_version 77090 (0.0006) [2023-03-07 11:26:55,042][175731] Updated weights for policy 0, policy_version 77100 (0.0007) [2023-03-07 11:26:55,838][175731] Updated weights for policy 0, policy_version 77110 (0.0007) [2023-03-07 11:26:56,634][175731] Updated weights for policy 0, policy_version 77120 (0.0007) [2023-03-07 11:26:57,423][175731] Updated weights for policy 0, policy_version 77130 (0.0007) [2023-03-07 11:26:58,230][175731] Updated weights for policy 0, policy_version 77140 (0.0006) [2023-03-07 11:26:58,321][175405] Fps is (10 sec: 12902.6, 60 sec: 12834.1, 300 sec: 12833.0). Total num frames: 78992384. Throughput: 0: 12844.3. Samples: 78965792. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:26:58,332][175405] Avg episode reward: [(0, '25.308')] [2023-03-07 11:26:59,028][175731] Updated weights for policy 0, policy_version 77150 (0.0006) [2023-03-07 11:26:59,833][175731] Updated weights for policy 0, policy_version 77160 (0.0007) [2023-03-07 11:27:00,614][175731] Updated weights for policy 0, policy_version 77170 (0.0006) [2023-03-07 11:27:01,405][175731] Updated weights for policy 0, policy_version 77180 (0.0007) [2023-03-07 11:27:02,205][175731] Updated weights for policy 0, policy_version 77190 (0.0005) [2023-03-07 11:27:03,006][175731] Updated weights for policy 0, policy_version 77200 (0.0007) [2023-03-07 11:27:03,321][175405] Fps is (10 sec: 12800.6, 60 sec: 12834.1, 300 sec: 12829.5). Total num frames: 79055872. Throughput: 0: 12850.8. Samples: 79042975. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:27:03,332][175405] Avg episode reward: [(0, '25.036')] [2023-03-07 11:27:03,793][175731] Updated weights for policy 0, policy_version 77210 (0.0007) [2023-03-07 11:27:04,604][175731] Updated weights for policy 0, policy_version 77220 (0.0007) [2023-03-07 11:27:05,392][175731] Updated weights for policy 0, policy_version 77230 (0.0006) [2023-03-07 11:27:06,181][175731] Updated weights for policy 0, policy_version 77240 (0.0006) [2023-03-07 11:27:06,993][175731] Updated weights for policy 0, policy_version 77250 (0.0006) [2023-03-07 11:27:07,790][175731] Updated weights for policy 0, policy_version 77260 (0.0006) [2023-03-07 11:27:08,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12834.1, 300 sec: 12829.5). Total num frames: 79120384. Throughput: 0: 12853.3. Samples: 79120005. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:27:08,332][175405] Avg episode reward: [(0, '25.610')] [2023-03-07 11:27:08,579][175731] Updated weights for policy 0, policy_version 77270 (0.0007) [2023-03-07 11:27:09,380][175731] Updated weights for policy 0, policy_version 77280 (0.0006) [2023-03-07 11:27:10,184][175731] Updated weights for policy 0, policy_version 77290 (0.0006) [2023-03-07 11:27:10,970][175731] Updated weights for policy 0, policy_version 77300 (0.0007) [2023-03-07 11:27:11,766][175731] Updated weights for policy 0, policy_version 77310 (0.0006) [2023-03-07 11:27:12,583][175731] Updated weights for policy 0, policy_version 77320 (0.0007) [2023-03-07 11:27:13,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12851.2, 300 sec: 12829.5). Total num frames: 79184896. Throughput: 0: 12853.6. Samples: 79158517. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:27:13,332][175405] Avg episode reward: [(0, '26.453')] [2023-03-07 11:27:13,373][175731] Updated weights for policy 0, policy_version 77330 (0.0006) [2023-03-07 11:27:14,182][175731] Updated weights for policy 0, policy_version 77340 (0.0006) [2023-03-07 11:27:14,969][175731] Updated weights for policy 0, policy_version 77350 (0.0007) [2023-03-07 11:27:15,761][175731] Updated weights for policy 0, policy_version 77360 (0.0006) [2023-03-07 11:27:16,574][175731] Updated weights for policy 0, policy_version 77370 (0.0006) [2023-03-07 11:27:17,376][175731] Updated weights for policy 0, policy_version 77380 (0.0007) [2023-03-07 11:27:18,155][175731] Updated weights for policy 0, policy_version 77390 (0.0006) [2023-03-07 11:27:18,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12851.2, 300 sec: 12829.5). Total num frames: 79249408. Throughput: 0: 12853.6. Samples: 79235405. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:27:18,332][175405] Avg episode reward: [(0, '26.145')] [2023-03-07 11:27:18,968][175731] Updated weights for policy 0, policy_version 77400 (0.0007) [2023-03-07 11:27:19,774][175731] Updated weights for policy 0, policy_version 77410 (0.0006) [2023-03-07 11:27:20,552][175731] Updated weights for policy 0, policy_version 77420 (0.0007) [2023-03-07 11:27:21,343][175731] Updated weights for policy 0, policy_version 77430 (0.0007) [2023-03-07 11:27:22,140][175731] Updated weights for policy 0, policy_version 77440 (0.0006) [2023-03-07 11:27:22,937][175731] Updated weights for policy 0, policy_version 77450 (0.0006) [2023-03-07 11:27:23,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12826.0). Total num frames: 79312896. Throughput: 0: 12862.6. Samples: 79312657. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:27:23,322][175405] Avg episode reward: [(0, '26.470')] [2023-03-07 11:27:23,742][175731] Updated weights for policy 0, policy_version 77460 (0.0006) [2023-03-07 11:27:24,529][175731] Updated weights for policy 0, policy_version 77470 (0.0006) [2023-03-07 11:27:25,319][175731] Updated weights for policy 0, policy_version 77480 (0.0007) [2023-03-07 11:27:26,122][175731] Updated weights for policy 0, policy_version 77490 (0.0006) [2023-03-07 11:27:26,919][175731] Updated weights for policy 0, policy_version 77500 (0.0007) [2023-03-07 11:27:27,720][175731] Updated weights for policy 0, policy_version 77510 (0.0005) [2023-03-07 11:27:28,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12851.2, 300 sec: 12829.5). Total num frames: 79377408. Throughput: 0: 12863.4. Samples: 79351321. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:27:28,322][175405] Avg episode reward: [(0, '26.621')] [2023-03-07 11:27:28,516][175731] Updated weights for policy 0, policy_version 77520 (0.0007) [2023-03-07 11:27:29,292][175731] Updated weights for policy 0, policy_version 77530 (0.0006) [2023-03-07 11:27:30,112][175731] Updated weights for policy 0, policy_version 77540 (0.0006) [2023-03-07 11:27:30,911][175731] Updated weights for policy 0, policy_version 77550 (0.0007) [2023-03-07 11:27:31,698][175731] Updated weights for policy 0, policy_version 77560 (0.0006) [2023-03-07 11:27:32,502][175731] Updated weights for policy 0, policy_version 77570 (0.0006) [2023-03-07 11:27:33,296][175731] Updated weights for policy 0, policy_version 77580 (0.0006) [2023-03-07 11:27:33,321][175405] Fps is (10 sec: 12902.6, 60 sec: 12851.2, 300 sec: 12829.5). Total num frames: 79441920. Throughput: 0: 12857.2. Samples: 79428387. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 11:27:33,321][175405] Avg episode reward: [(0, '25.822')] [2023-03-07 11:27:34,078][175731] Updated weights for policy 0, policy_version 77590 (0.0006) [2023-03-07 11:27:34,879][175731] Updated weights for policy 0, policy_version 77600 (0.0007) [2023-03-07 11:27:35,679][175731] Updated weights for policy 0, policy_version 77610 (0.0006) [2023-03-07 11:27:36,481][175731] Updated weights for policy 0, policy_version 77620 (0.0006) [2023-03-07 11:27:37,274][175731] Updated weights for policy 0, policy_version 77630 (0.0007) [2023-03-07 11:27:38,058][175731] Updated weights for policy 0, policy_version 77640 (0.0006) [2023-03-07 11:27:38,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12829.5). Total num frames: 79506432. Throughput: 0: 12859.5. Samples: 79505601. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 11:27:38,321][175405] Avg episode reward: [(0, '25.172')] [2023-03-07 11:27:38,852][175731] Updated weights for policy 0, policy_version 77650 (0.0007) [2023-03-07 11:27:39,665][175731] Updated weights for policy 0, policy_version 77660 (0.0007) [2023-03-07 11:27:40,463][175731] Updated weights for policy 0, policy_version 77670 (0.0006) [2023-03-07 11:27:41,268][175731] Updated weights for policy 0, policy_version 77680 (0.0006) [2023-03-07 11:27:42,073][175731] Updated weights for policy 0, policy_version 77690 (0.0006) [2023-03-07 11:27:42,864][175731] Updated weights for policy 0, policy_version 77700 (0.0006) [2023-03-07 11:27:43,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12851.2, 300 sec: 12829.5). Total num frames: 79569920. Throughput: 0: 12847.4. Samples: 79543925. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 11:27:43,322][175405] Avg episode reward: [(0, '25.994')] [2023-03-07 11:27:43,653][175731] Updated weights for policy 0, policy_version 77710 (0.0007) [2023-03-07 11:27:44,464][175731] Updated weights for policy 0, policy_version 77720 (0.0006) [2023-03-07 11:27:45,277][175731] Updated weights for policy 0, policy_version 77730 (0.0007) [2023-03-07 11:27:46,063][175731] Updated weights for policy 0, policy_version 77740 (0.0006) [2023-03-07 11:27:46,861][175731] Updated weights for policy 0, policy_version 77750 (0.0007) [2023-03-07 11:27:47,654][175731] Updated weights for policy 0, policy_version 77760 (0.0006) [2023-03-07 11:27:48,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12851.2, 300 sec: 12829.5). Total num frames: 79634432. Throughput: 0: 12841.9. Samples: 79620863. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 11:27:48,322][175405] Avg episode reward: [(0, '27.463')] [2023-03-07 11:27:48,326][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000077768_79634432.pth... 
[2023-03-07 11:27:48,357][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000074762_76556288.pth [2023-03-07 11:27:48,455][175731] Updated weights for policy 0, policy_version 77770 (0.0007) [2023-03-07 11:27:49,240][175731] Updated weights for policy 0, policy_version 77780 (0.0006) [2023-03-07 11:27:50,052][175731] Updated weights for policy 0, policy_version 77790 (0.0006) [2023-03-07 11:27:50,855][175731] Updated weights for policy 0, policy_version 77800 (0.0006) [2023-03-07 11:27:51,656][175731] Updated weights for policy 0, policy_version 77810 (0.0007) [2023-03-07 11:27:52,454][175731] Updated weights for policy 0, policy_version 77820 (0.0007) [2023-03-07 11:27:53,249][175731] Updated weights for policy 0, policy_version 77830 (0.0006) [2023-03-07 11:27:53,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12851.3, 300 sec: 12833.0). Total num frames: 79698944. Throughput: 0: 12840.3. Samples: 79697816. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 11:27:53,322][175405] Avg episode reward: [(0, '27.987')] [2023-03-07 11:27:54,045][175731] Updated weights for policy 0, policy_version 77840 (0.0007) [2023-03-07 11:27:54,837][175731] Updated weights for policy 0, policy_version 77850 (0.0006) [2023-03-07 11:27:55,627][175731] Updated weights for policy 0, policy_version 77860 (0.0006) [2023-03-07 11:27:56,420][175731] Updated weights for policy 0, policy_version 77870 (0.0006) [2023-03-07 11:27:57,207][175731] Updated weights for policy 0, policy_version 77880 (0.0006) [2023-03-07 11:27:58,010][175731] Updated weights for policy 0, policy_version 77890 (0.0006) [2023-03-07 11:27:58,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12834.1, 300 sec: 12829.5). Total num frames: 79762432. Throughput: 0: 12845.1. Samples: 79736545. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 11:27:58,322][175405] Avg episode reward: [(0, '25.546')] [2023-03-07 11:27:58,817][175731] Updated weights for policy 0, policy_version 77900 (0.0007) [2023-03-07 11:27:59,612][175731] Updated weights for policy 0, policy_version 77910 (0.0006) [2023-03-07 11:28:00,419][175731] Updated weights for policy 0, policy_version 77920 (0.0007) [2023-03-07 11:28:01,228][175731] Updated weights for policy 0, policy_version 77930 (0.0007) [2023-03-07 11:28:02,030][175731] Updated weights for policy 0, policy_version 77940 (0.0007) [2023-03-07 11:28:02,824][175731] Updated weights for policy 0, policy_version 77950 (0.0006) [2023-03-07 11:28:03,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12829.5). Total num frames: 79826944. Throughput: 0: 12843.2. Samples: 79813346. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 11:28:03,321][175405] Avg episode reward: [(0, '26.118')] [2023-03-07 11:28:03,640][175731] Updated weights for policy 0, policy_version 77960 (0.0006) [2023-03-07 11:28:04,415][175731] Updated weights for policy 0, policy_version 77970 (0.0006) [2023-03-07 11:28:05,234][175731] Updated weights for policy 0, policy_version 77980 (0.0007) [2023-03-07 11:28:06,031][175731] Updated weights for policy 0, policy_version 77990 (0.0007) [2023-03-07 11:28:06,827][175731] Updated weights for policy 0, policy_version 78000 (0.0007) [2023-03-07 11:28:07,638][175731] Updated weights for policy 0, policy_version 78010 (0.0006) [2023-03-07 11:28:08,321][175405] Fps is (10 sec: 12800.2, 60 sec: 12834.1, 300 sec: 12829.5). Total num frames: 79890432. Throughput: 0: 12831.4. Samples: 79890068. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 11:28:08,321][175405] Avg episode reward: [(0, '25.719')] [2023-03-07 11:28:08,423][175731] Updated weights for policy 0, policy_version 78020 (0.0007) [2023-03-07 11:28:09,235][175731] Updated weights for policy 0, policy_version 78030 (0.0006) [2023-03-07 11:28:10,050][175731] Updated weights for policy 0, policy_version 78040 (0.0007) [2023-03-07 11:28:10,853][175731] Updated weights for policy 0, policy_version 78050 (0.0006) [2023-03-07 11:28:11,642][175731] Updated weights for policy 0, policy_version 78060 (0.0006) [2023-03-07 11:28:12,442][175731] Updated weights for policy 0, policy_version 78070 (0.0006) [2023-03-07 11:28:13,232][175731] Updated weights for policy 0, policy_version 78080 (0.0007) [2023-03-07 11:28:13,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12817.1, 300 sec: 12826.0). Total num frames: 79953920. Throughput: 0: 12819.7. Samples: 79928207. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 11:28:13,322][175405] Avg episode reward: [(0, '25.472')] [2023-03-07 11:28:14,044][175731] Updated weights for policy 0, policy_version 78090 (0.0007) [2023-03-07 11:28:14,836][175731] Updated weights for policy 0, policy_version 78100 (0.0007) [2023-03-07 11:28:15,630][175731] Updated weights for policy 0, policy_version 78110 (0.0006) [2023-03-07 11:28:16,441][175731] Updated weights for policy 0, policy_version 78120 (0.0007) [2023-03-07 11:28:17,234][175731] Updated weights for policy 0, policy_version 78130 (0.0006) [2023-03-07 11:28:18,042][175731] Updated weights for policy 0, policy_version 78140 (0.0006) [2023-03-07 11:28:18,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12817.1, 300 sec: 12826.0). Total num frames: 80018432. Throughput: 0: 12817.8. Samples: 80005190. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:28:18,322][175405] Avg episode reward: [(0, '25.989')] [2023-03-07 11:28:18,850][175731] Updated weights for policy 0, policy_version 78150 (0.0007) [2023-03-07 11:28:19,653][175731] Updated weights for policy 0, policy_version 78160 (0.0007) [2023-03-07 11:28:20,445][175731] Updated weights for policy 0, policy_version 78170 (0.0006) [2023-03-07 11:28:21,258][175731] Updated weights for policy 0, policy_version 78180 (0.0006) [2023-03-07 11:28:22,047][175731] Updated weights for policy 0, policy_version 78190 (0.0006) [2023-03-07 11:28:22,849][175731] Updated weights for policy 0, policy_version 78200 (0.0006) [2023-03-07 11:28:23,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12822.6). Total num frames: 80081920. Throughput: 0: 12807.3. Samples: 80081929. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:28:23,322][175405] Avg episode reward: [(0, '27.295')] [2023-03-07 11:28:23,639][175731] Updated weights for policy 0, policy_version 78210 (0.0006) [2023-03-07 11:28:24,430][175731] Updated weights for policy 0, policy_version 78220 (0.0006) [2023-03-07 11:28:25,228][175731] Updated weights for policy 0, policy_version 78230 (0.0007) [2023-03-07 11:28:26,039][175731] Updated weights for policy 0, policy_version 78240 (0.0006) [2023-03-07 11:28:26,840][175731] Updated weights for policy 0, policy_version 78250 (0.0007) [2023-03-07 11:28:27,654][175731] Updated weights for policy 0, policy_version 78260 (0.0006) [2023-03-07 11:28:28,321][175405] Fps is (10 sec: 12800.2, 60 sec: 12817.1, 300 sec: 12822.6). Total num frames: 80146432. Throughput: 0: 12805.7. Samples: 80120182. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:28:28,321][175405] Avg episode reward: [(0, '27.066')] [2023-03-07 11:28:28,445][175731] Updated weights for policy 0, policy_version 78270 (0.0006) [2023-03-07 11:28:29,266][175731] Updated weights for policy 0, policy_version 78280 (0.0006) [2023-03-07 11:28:30,062][175731] Updated weights for policy 0, policy_version 78290 (0.0006) [2023-03-07 11:28:30,889][175731] Updated weights for policy 0, policy_version 78300 (0.0006) [2023-03-07 11:28:31,673][175731] Updated weights for policy 0, policy_version 78310 (0.0007) [2023-03-07 11:28:32,473][175731] Updated weights for policy 0, policy_version 78320 (0.0006) [2023-03-07 11:28:33,267][175731] Updated weights for policy 0, policy_version 78330 (0.0006) [2023-03-07 11:28:33,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12822.6). Total num frames: 80209920. Throughput: 0: 12796.9. Samples: 80196722. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:28:33,322][175405] Avg episode reward: [(0, '28.484')] [2023-03-07 11:28:34,078][175731] Updated weights for policy 0, policy_version 78340 (0.0007) [2023-03-07 11:28:34,874][175731] Updated weights for policy 0, policy_version 78350 (0.0006) [2023-03-07 11:28:35,693][175731] Updated weights for policy 0, policy_version 78360 (0.0006) [2023-03-07 11:28:36,482][175731] Updated weights for policy 0, policy_version 78370 (0.0006) [2023-03-07 11:28:37,273][175731] Updated weights for policy 0, policy_version 78380 (0.0006) [2023-03-07 11:28:38,081][175731] Updated weights for policy 0, policy_version 78390 (0.0006) [2023-03-07 11:28:38,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12782.9, 300 sec: 12819.1). Total num frames: 80273408. Throughput: 0: 12790.9. Samples: 80273407. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:28:38,321][175405] Avg episode reward: [(0, '25.074')] [2023-03-07 11:28:38,881][175731] Updated weights for policy 0, policy_version 78400 (0.0006) [2023-03-07 11:28:39,677][175731] Updated weights for policy 0, policy_version 78410 (0.0006) [2023-03-07 11:28:40,486][175731] Updated weights for policy 0, policy_version 78420 (0.0006) [2023-03-07 11:28:41,273][175731] Updated weights for policy 0, policy_version 78430 (0.0007) [2023-03-07 11:28:42,078][175731] Updated weights for policy 0, policy_version 78440 (0.0007) [2023-03-07 11:28:42,875][175731] Updated weights for policy 0, policy_version 78450 (0.0006) [2023-03-07 11:28:43,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12800.0, 300 sec: 12822.6). Total num frames: 80337920. Throughput: 0: 12780.5. Samples: 80311668. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:28:43,321][175405] Avg episode reward: [(0, '27.621')] [2023-03-07 11:28:43,662][175731] Updated weights for policy 0, policy_version 78460 (0.0006) [2023-03-07 11:28:44,456][175731] Updated weights for policy 0, policy_version 78470 (0.0007) [2023-03-07 11:28:45,264][175731] Updated weights for policy 0, policy_version 78480 (0.0006) [2023-03-07 11:28:46,077][175731] Updated weights for policy 0, policy_version 78490 (0.0006) [2023-03-07 11:28:46,882][175731] Updated weights for policy 0, policy_version 78500 (0.0007) [2023-03-07 11:28:47,685][175731] Updated weights for policy 0, policy_version 78510 (0.0007) [2023-03-07 11:28:48,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12800.0, 300 sec: 12822.6). Total num frames: 80402432. Throughput: 0: 12783.4. Samples: 80388601. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:28:48,322][175405] Avg episode reward: [(0, '25.482')] [2023-03-07 11:28:48,483][175731] Updated weights for policy 0, policy_version 78520 (0.0006) [2023-03-07 11:28:49,293][175731] Updated weights for policy 0, policy_version 78530 (0.0007) [2023-03-07 11:28:50,081][175731] Updated weights for policy 0, policy_version 78540 (0.0007) [2023-03-07 11:28:50,877][175731] Updated weights for policy 0, policy_version 78550 (0.0007) [2023-03-07 11:28:51,677][175731] Updated weights for policy 0, policy_version 78560 (0.0006) [2023-03-07 11:28:52,470][175731] Updated weights for policy 0, policy_version 78570 (0.0006) [2023-03-07 11:28:53,292][175731] Updated weights for policy 0, policy_version 78580 (0.0006) [2023-03-07 11:28:53,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12782.9, 300 sec: 12819.1). Total num frames: 80465920. Throughput: 0: 12783.9. Samples: 80465345. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:28:53,322][175405] Avg episode reward: [(0, '26.288')] [2023-03-07 11:28:54,097][175731] Updated weights for policy 0, policy_version 78590 (0.0006) [2023-03-07 11:28:54,878][175731] Updated weights for policy 0, policy_version 78600 (0.0007) [2023-03-07 11:28:55,709][175731] Updated weights for policy 0, policy_version 78610 (0.0007) [2023-03-07 11:28:56,498][175731] Updated weights for policy 0, policy_version 78620 (0.0006) [2023-03-07 11:28:57,293][175731] Updated weights for policy 0, policy_version 78630 (0.0006) [2023-03-07 11:28:58,086][175731] Updated weights for policy 0, policy_version 78640 (0.0007) [2023-03-07 11:28:58,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12783.0, 300 sec: 12819.1). Total num frames: 80529408. Throughput: 0: 12787.0. Samples: 80503621. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:28:58,322][175405] Avg episode reward: [(0, '26.121')] [2023-03-07 11:28:58,889][175731] Updated weights for policy 0, policy_version 78650 (0.0006) [2023-03-07 11:28:59,700][175731] Updated weights for policy 0, policy_version 78660 (0.0007) [2023-03-07 11:29:00,474][175731] Updated weights for policy 0, policy_version 78670 (0.0007) [2023-03-07 11:29:01,277][175731] Updated weights for policy 0, policy_version 78680 (0.0007) [2023-03-07 11:29:02,074][175731] Updated weights for policy 0, policy_version 78690 (0.0007) [2023-03-07 11:29:02,880][175731] Updated weights for policy 0, policy_version 78700 (0.0006) [2023-03-07 11:29:03,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12782.9, 300 sec: 12819.1). Total num frames: 80593920. Throughput: 0: 12790.5. Samples: 80580762. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:29:03,322][175405] Avg episode reward: [(0, '27.733')] [2023-03-07 11:29:03,676][175731] Updated weights for policy 0, policy_version 78710 (0.0006) [2023-03-07 11:29:04,469][175731] Updated weights for policy 0, policy_version 78720 (0.0006) [2023-03-07 11:29:05,266][175731] Updated weights for policy 0, policy_version 78730 (0.0006) [2023-03-07 11:29:06,046][175731] Updated weights for policy 0, policy_version 78740 (0.0006) [2023-03-07 11:29:06,874][175731] Updated weights for policy 0, policy_version 78750 (0.0007) [2023-03-07 11:29:07,661][175731] Updated weights for policy 0, policy_version 78760 (0.0006) [2023-03-07 11:29:08,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12800.0, 300 sec: 12819.1). Total num frames: 80658432. Throughput: 0: 12794.3. Samples: 80657675. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:29:08,322][175405] Avg episode reward: [(0, '25.406')] [2023-03-07 11:29:08,460][175731] Updated weights for policy 0, policy_version 78770 (0.0006) [2023-03-07 11:29:09,266][175731] Updated weights for policy 0, policy_version 78780 (0.0007) [2023-03-07 11:29:10,056][175731] Updated weights for policy 0, policy_version 78790 (0.0006) [2023-03-07 11:29:10,846][175731] Updated weights for policy 0, policy_version 78800 (0.0007) [2023-03-07 11:29:11,649][175731] Updated weights for policy 0, policy_version 78810 (0.0006) [2023-03-07 11:29:12,456][175731] Updated weights for policy 0, policy_version 78820 (0.0006) [2023-03-07 11:29:13,237][175731] Updated weights for policy 0, policy_version 78830 (0.0007) [2023-03-07 11:29:13,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12819.1). Total num frames: 80721920. Throughput: 0: 12800.7. Samples: 80696213. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:29:13,321][175405] Avg episode reward: [(0, '25.764')] [2023-03-07 11:29:14,031][175731] Updated weights for policy 0, policy_version 78840 (0.0006) [2023-03-07 11:29:14,833][175731] Updated weights for policy 0, policy_version 78850 (0.0006) [2023-03-07 11:29:15,632][175731] Updated weights for policy 0, policy_version 78860 (0.0006) [2023-03-07 11:29:16,413][175731] Updated weights for policy 0, policy_version 78870 (0.0007) [2023-03-07 11:29:17,229][175731] Updated weights for policy 0, policy_version 78880 (0.0006) [2023-03-07 11:29:18,015][175731] Updated weights for policy 0, policy_version 78890 (0.0006) [2023-03-07 11:29:18,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12800.0, 300 sec: 12822.6). Total num frames: 80786432. Throughput: 0: 12815.5. Samples: 80773418. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:29:18,322][175405] Avg episode reward: [(0, '27.529')] [2023-03-07 11:29:18,813][175731] Updated weights for policy 0, policy_version 78900 (0.0006) [2023-03-07 11:29:19,597][175731] Updated weights for policy 0, policy_version 78910 (0.0007) [2023-03-07 11:29:20,414][175731] Updated weights for policy 0, policy_version 78920 (0.0007) [2023-03-07 11:29:21,201][175731] Updated weights for policy 0, policy_version 78930 (0.0006) [2023-03-07 11:29:22,005][175731] Updated weights for policy 0, policy_version 78940 (0.0005) [2023-03-07 11:29:22,798][175731] Updated weights for policy 0, policy_version 78950 (0.0006) [2023-03-07 11:29:23,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12817.1, 300 sec: 12822.6). Total num frames: 80850944. Throughput: 0: 12823.1. Samples: 80850449. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:29:23,322][175405] Avg episode reward: [(0, '27.043')] [2023-03-07 11:29:23,606][175731] Updated weights for policy 0, policy_version 78960 (0.0006) [2023-03-07 11:29:24,410][175731] Updated weights for policy 0, policy_version 78970 (0.0006) [2023-03-07 11:29:25,222][175731] Updated weights for policy 0, policy_version 78980 (0.0006) [2023-03-07 11:29:26,021][175731] Updated weights for policy 0, policy_version 78990 (0.0006) [2023-03-07 11:29:26,803][175731] Updated weights for policy 0, policy_version 79000 (0.0006) [2023-03-07 11:29:27,605][175731] Updated weights for policy 0, policy_version 79010 (0.0006) [2023-03-07 11:29:28,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12819.1). Total num frames: 80914432. Throughput: 0: 12823.2. Samples: 80888711. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:29:28,321][175405] Avg episode reward: [(0, '27.172')] [2023-03-07 11:29:28,410][175731] Updated weights for policy 0, policy_version 79020 (0.0007) [2023-03-07 11:29:29,212][175731] Updated weights for policy 0, policy_version 79030 (0.0006) [2023-03-07 11:29:30,009][175731] Updated weights for policy 0, policy_version 79040 (0.0007) [2023-03-07 11:29:30,802][175731] Updated weights for policy 0, policy_version 79050 (0.0006) [2023-03-07 11:29:31,613][175731] Updated weights for policy 0, policy_version 79060 (0.0006) [2023-03-07 11:29:32,420][175731] Updated weights for policy 0, policy_version 79070 (0.0006) [2023-03-07 11:29:33,212][175731] Updated weights for policy 0, policy_version 79080 (0.0007) [2023-03-07 11:29:33,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12822.6). Total num frames: 80978944. Throughput: 0: 12822.0. Samples: 80965591. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:29:33,322][175405] Avg episode reward: [(0, '26.756')] [2023-03-07 11:29:34,017][175731] Updated weights for policy 0, policy_version 79090 (0.0007) [2023-03-07 11:29:34,800][175731] Updated weights for policy 0, policy_version 79100 (0.0006) [2023-03-07 11:29:35,601][175731] Updated weights for policy 0, policy_version 79110 (0.0006) [2023-03-07 11:29:36,385][175731] Updated weights for policy 0, policy_version 79120 (0.0006) [2023-03-07 11:29:37,188][175731] Updated weights for policy 0, policy_version 79130 (0.0006) [2023-03-07 11:29:37,990][175731] Updated weights for policy 0, policy_version 79140 (0.0007) [2023-03-07 11:29:38,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12834.1, 300 sec: 12822.6). Total num frames: 81043456. Throughput: 0: 12826.6. Samples: 81042542. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:29:38,322][175405] Avg episode reward: [(0, '25.722')] [2023-03-07 11:29:38,803][175731] Updated weights for policy 0, policy_version 79150 (0.0007) [2023-03-07 11:29:39,588][175731] Updated weights for policy 0, policy_version 79160 (0.0006) [2023-03-07 11:29:40,415][175731] Updated weights for policy 0, policy_version 79170 (0.0007) [2023-03-07 11:29:41,216][175731] Updated weights for policy 0, policy_version 79180 (0.0006) [2023-03-07 11:29:42,005][175731] Updated weights for policy 0, policy_version 79190 (0.0007) [2023-03-07 11:29:42,805][175731] Updated weights for policy 0, policy_version 79200 (0.0006) [2023-03-07 11:29:43,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.0, 300 sec: 12819.1). Total num frames: 81106944. Throughput: 0: 12826.8. Samples: 81080827. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:29:43,322][175405] Avg episode reward: [(0, '26.257')] [2023-03-07 11:29:43,620][175731] Updated weights for policy 0, policy_version 79210 (0.0007) [2023-03-07 11:29:44,410][175731] Updated weights for policy 0, policy_version 79220 (0.0006) [2023-03-07 11:29:45,212][175731] Updated weights for policy 0, policy_version 79230 (0.0006) [2023-03-07 11:29:46,023][175731] Updated weights for policy 0, policy_version 79240 (0.0006) [2023-03-07 11:29:46,821][175731] Updated weights for policy 0, policy_version 79250 (0.0007) [2023-03-07 11:29:47,617][175731] Updated weights for policy 0, policy_version 79260 (0.0007) [2023-03-07 11:29:48,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12800.0, 300 sec: 12819.1). Total num frames: 81170432. Throughput: 0: 12813.9. Samples: 81157391. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:29:48,322][175405] Avg episode reward: [(0, '27.071')] [2023-03-07 11:29:48,332][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000079269_81171456.pth... 
[2023-03-07 11:29:48,361][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000076264_78094336.pth [2023-03-07 11:29:48,401][175731] Updated weights for policy 0, policy_version 79270 (0.0007) [2023-03-07 11:29:49,199][175731] Updated weights for policy 0, policy_version 79280 (0.0006) [2023-03-07 11:29:50,007][175731] Updated weights for policy 0, policy_version 79290 (0.0007) [2023-03-07 11:29:50,780][175731] Updated weights for policy 0, policy_version 79300 (0.0007) [2023-03-07 11:29:51,585][175731] Updated weights for policy 0, policy_version 79310 (0.0007) [2023-03-07 11:29:52,373][175731] Updated weights for policy 0, policy_version 79320 (0.0006) [2023-03-07 11:29:53,182][175731] Updated weights for policy 0, policy_version 79330 (0.0006) [2023-03-07 11:29:53,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12819.1). Total num frames: 81234944. Throughput: 0: 12822.7. Samples: 81234696. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:29:53,322][175405] Avg episode reward: [(0, '24.982')] [2023-03-07 11:29:53,962][175731] Updated weights for policy 0, policy_version 79340 (0.0006) [2023-03-07 11:29:54,774][175731] Updated weights for policy 0, policy_version 79350 (0.0006) [2023-03-07 11:29:55,588][175731] Updated weights for policy 0, policy_version 79360 (0.0007) [2023-03-07 11:29:56,372][175731] Updated weights for policy 0, policy_version 79370 (0.0006) [2023-03-07 11:29:57,160][175731] Updated weights for policy 0, policy_version 79380 (0.0006) [2023-03-07 11:29:57,984][175731] Updated weights for policy 0, policy_version 79390 (0.0006) [2023-03-07 11:29:58,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12834.1, 300 sec: 12822.6). Total num frames: 81299456. Throughput: 0: 12819.8. Samples: 81273106. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:29:58,322][175405] Avg episode reward: [(0, '25.445')] [2023-03-07 11:29:58,791][175731] Updated weights for policy 0, policy_version 79400 (0.0006) [2023-03-07 11:29:59,598][175731] Updated weights for policy 0, policy_version 79410 (0.0005) [2023-03-07 11:30:00,389][175731] Updated weights for policy 0, policy_version 79420 (0.0006) [2023-03-07 11:30:01,193][175731] Updated weights for policy 0, policy_version 79430 (0.0007) [2023-03-07 11:30:01,989][175731] Updated weights for policy 0, policy_version 79440 (0.0006) [2023-03-07 11:30:02,796][175731] Updated weights for policy 0, policy_version 79450 (0.0005) [2023-03-07 11:30:03,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12834.2, 300 sec: 12822.6). Total num frames: 81363968. Throughput: 0: 12809.0. Samples: 81349821. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:30:03,321][175405] Avg episode reward: [(0, '24.499')] [2023-03-07 11:30:03,577][175731] Updated weights for policy 0, policy_version 79460 (0.0006) [2023-03-07 11:30:04,385][175731] Updated weights for policy 0, policy_version 79470 (0.0007) [2023-03-07 11:30:05,165][175731] Updated weights for policy 0, policy_version 79480 (0.0006) [2023-03-07 11:30:05,956][175731] Updated weights for policy 0, policy_version 79490 (0.0007) [2023-03-07 11:30:06,750][175731] Updated weights for policy 0, policy_version 79500 (0.0007) [2023-03-07 11:30:07,552][175731] Updated weights for policy 0, policy_version 79510 (0.0006) [2023-03-07 11:30:08,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12819.1). Total num frames: 81427456. Throughput: 0: 12809.6. Samples: 81426881. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:30:08,322][175405] Avg episode reward: [(0, '27.755')] [2023-03-07 11:30:08,354][175731] Updated weights for policy 0, policy_version 79520 (0.0008) [2023-03-07 11:30:09,161][175731] Updated weights for policy 0, policy_version 79530 (0.0007) [2023-03-07 11:30:09,953][175731] Updated weights for policy 0, policy_version 79540 (0.0007) [2023-03-07 11:30:10,745][175731] Updated weights for policy 0, policy_version 79550 (0.0006) [2023-03-07 11:30:11,543][175731] Updated weights for policy 0, policy_version 79560 (0.0006) [2023-03-07 11:30:12,327][175731] Updated weights for policy 0, policy_version 79570 (0.0006) [2023-03-07 11:30:13,146][175731] Updated weights for policy 0, policy_version 79580 (0.0006) [2023-03-07 11:30:13,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12822.6). Total num frames: 81491968. Throughput: 0: 12818.9. Samples: 81465562. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:30:13,321][175405] Avg episode reward: [(0, '26.368')] [2023-03-07 11:30:13,925][175731] Updated weights for policy 0, policy_version 79590 (0.0007) [2023-03-07 11:30:14,749][175731] Updated weights for policy 0, policy_version 79600 (0.0007) [2023-03-07 11:30:15,538][175731] Updated weights for policy 0, policy_version 79610 (0.0006) [2023-03-07 11:30:16,348][175731] Updated weights for policy 0, policy_version 79620 (0.0007) [2023-03-07 11:30:17,129][175731] Updated weights for policy 0, policy_version 79630 (0.0005) [2023-03-07 11:30:17,958][175731] Updated weights for policy 0, policy_version 79640 (0.0007) [2023-03-07 11:30:18,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.0, 300 sec: 12819.1). Total num frames: 81555456. Throughput: 0: 12820.8. Samples: 81542529. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:30:18,322][175405] Avg episode reward: [(0, '26.107')] [2023-03-07 11:30:18,748][175731] Updated weights for policy 0, policy_version 79650 (0.0007) [2023-03-07 11:30:19,535][175731] Updated weights for policy 0, policy_version 79660 (0.0007) [2023-03-07 11:30:20,334][175731] Updated weights for policy 0, policy_version 79670 (0.0007) [2023-03-07 11:30:21,129][175731] Updated weights for policy 0, policy_version 79680 (0.0006) [2023-03-07 11:30:21,914][175731] Updated weights for policy 0, policy_version 79690 (0.0006) [2023-03-07 11:30:22,721][175731] Updated weights for policy 0, policy_version 79700 (0.0007) [2023-03-07 11:30:23,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12817.1, 300 sec: 12822.6). Total num frames: 81619968. Throughput: 0: 12823.9. Samples: 81619616. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:30:23,322][175405] Avg episode reward: [(0, '26.288')] [2023-03-07 11:30:23,499][175731] Updated weights for policy 0, policy_version 79710 (0.0007) [2023-03-07 11:30:24,290][175731] Updated weights for policy 0, policy_version 79720 (0.0006) [2023-03-07 11:30:25,098][175731] Updated weights for policy 0, policy_version 79730 (0.0007) [2023-03-07 11:30:25,883][175731] Updated weights for policy 0, policy_version 79740 (0.0006) [2023-03-07 11:30:26,675][175731] Updated weights for policy 0, policy_version 79750 (0.0006) [2023-03-07 11:30:27,487][175731] Updated weights for policy 0, policy_version 79760 (0.0006) [2023-03-07 11:30:28,282][175731] Updated weights for policy 0, policy_version 79770 (0.0007) [2023-03-07 11:30:28,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12834.1, 300 sec: 12822.6). Total num frames: 81684480. Throughput: 0: 12831.4. Samples: 81658240. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:30:28,322][175405] Avg episode reward: [(0, '25.483')] [2023-03-07 11:30:29,083][175731] Updated weights for policy 0, policy_version 79780 (0.0008) [2023-03-07 11:30:29,880][175731] Updated weights for policy 0, policy_version 79790 (0.0006) [2023-03-07 11:30:30,691][175731] Updated weights for policy 0, policy_version 79800 (0.0006) [2023-03-07 11:30:31,472][175731] Updated weights for policy 0, policy_version 79810 (0.0006) [2023-03-07 11:30:32,260][175731] Updated weights for policy 0, policy_version 79820 (0.0006) [2023-03-07 11:30:33,078][175731] Updated weights for policy 0, policy_version 79830 (0.0006) [2023-03-07 11:30:33,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12834.1, 300 sec: 12826.0). Total num frames: 81748992. Throughput: 0: 12840.8. Samples: 81735225. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:30:33,322][175405] Avg episode reward: [(0, '26.562')] [2023-03-07 11:30:33,861][175731] Updated weights for policy 0, policy_version 79840 (0.0006) [2023-03-07 11:30:34,661][175731] Updated weights for policy 0, policy_version 79850 (0.0006) [2023-03-07 11:30:35,465][175731] Updated weights for policy 0, policy_version 79860 (0.0006) [2023-03-07 11:30:36,246][175731] Updated weights for policy 0, policy_version 79870 (0.0007) [2023-03-07 11:30:37,049][175731] Updated weights for policy 0, policy_version 79880 (0.0006) [2023-03-07 11:30:37,852][175731] Updated weights for policy 0, policy_version 79890 (0.0006) [2023-03-07 11:30:38,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12822.6). Total num frames: 81812480. Throughput: 0: 12833.6. Samples: 81812206. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:30:38,321][175405] Avg episode reward: [(0, '25.499')] [2023-03-07 11:30:38,674][175731] Updated weights for policy 0, policy_version 79900 (0.0006) [2023-03-07 11:30:39,451][175731] Updated weights for policy 0, policy_version 79910 (0.0006) [2023-03-07 11:30:40,258][175731] Updated weights for policy 0, policy_version 79920 (0.0007) [2023-03-07 11:30:41,059][175731] Updated weights for policy 0, policy_version 79930 (0.0006) [2023-03-07 11:30:41,839][175731] Updated weights for policy 0, policy_version 79940 (0.0005) [2023-03-07 11:30:42,642][175731] Updated weights for policy 0, policy_version 79950 (0.0008) [2023-03-07 11:30:43,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12822.6). Total num frames: 81876992. Throughput: 0: 12837.4. Samples: 81850788. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:30:43,322][175405] Avg episode reward: [(0, '26.169')] [2023-03-07 11:30:43,445][175731] Updated weights for policy 0, policy_version 79960 (0.0007) [2023-03-07 11:30:44,257][175731] Updated weights for policy 0, policy_version 79970 (0.0006) [2023-03-07 11:30:45,062][175731] Updated weights for policy 0, policy_version 79980 (0.0006) [2023-03-07 11:30:45,855][175731] Updated weights for policy 0, policy_version 79990 (0.0006) [2023-03-07 11:30:46,664][175731] Updated weights for policy 0, policy_version 80000 (0.0007) [2023-03-07 11:30:47,476][175731] Updated weights for policy 0, policy_version 80010 (0.0007) [2023-03-07 11:30:48,277][175731] Updated weights for policy 0, policy_version 80020 (0.0007) [2023-03-07 11:30:48,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.2, 300 sec: 12822.6). Total num frames: 81940480. Throughput: 0: 12834.3. Samples: 81927367. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:30:48,322][175405] Avg episode reward: [(0, '26.496')] [2023-03-07 11:30:49,069][175731] Updated weights for policy 0, policy_version 80030 (0.0006) [2023-03-07 11:30:49,845][175731] Updated weights for policy 0, policy_version 80040 (0.0006) [2023-03-07 11:30:50,664][175731] Updated weights for policy 0, policy_version 80050 (0.0007) [2023-03-07 11:30:51,445][175731] Updated weights for policy 0, policy_version 80060 (0.0007) [2023-03-07 11:30:52,253][175731] Updated weights for policy 0, policy_version 80070 (0.0006) [2023-03-07 11:30:53,074][175731] Updated weights for policy 0, policy_version 80080 (0.0006) [2023-03-07 11:30:53,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12834.1, 300 sec: 12822.6). Total num frames: 82004992. Throughput: 0: 12830.1. Samples: 82004236. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:30:53,322][175405] Avg episode reward: [(0, '27.114')] [2023-03-07 11:30:53,861][175731] Updated weights for policy 0, policy_version 80090 (0.0007) [2023-03-07 11:30:54,664][175731] Updated weights for policy 0, policy_version 80100 (0.0006) [2023-03-07 11:30:55,443][175731] Updated weights for policy 0, policy_version 80110 (0.0006) [2023-03-07 11:30:56,253][175731] Updated weights for policy 0, policy_version 80120 (0.0006) [2023-03-07 11:30:57,048][175731] Updated weights for policy 0, policy_version 80130 (0.0006) [2023-03-07 11:30:57,850][175731] Updated weights for policy 0, policy_version 80140 (0.0007) [2023-03-07 11:30:58,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12834.1, 300 sec: 12826.0). Total num frames: 82069504. Throughput: 0: 12826.3. Samples: 82042747. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:30:58,322][175405] Avg episode reward: [(0, '27.296')] [2023-03-07 11:30:58,654][175731] Updated weights for policy 0, policy_version 80150 (0.0006) [2023-03-07 11:30:59,455][175731] Updated weights for policy 0, policy_version 80160 (0.0006) [2023-03-07 11:31:00,243][175731] Updated weights for policy 0, policy_version 80170 (0.0006) [2023-03-07 11:31:01,057][175731] Updated weights for policy 0, policy_version 80180 (0.0007) [2023-03-07 11:31:01,861][175731] Updated weights for policy 0, policy_version 80190 (0.0007) [2023-03-07 11:31:02,654][175731] Updated weights for policy 0, policy_version 80200 (0.0006) [2023-03-07 11:31:03,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12822.6). Total num frames: 82132992. Throughput: 0: 12821.8. Samples: 82119509. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:31:03,321][175405] Avg episode reward: [(0, '27.593')] [2023-03-07 11:31:03,468][175731] Updated weights for policy 0, policy_version 80210 (0.0006) [2023-03-07 11:31:04,262][175731] Updated weights for policy 0, policy_version 80220 (0.0006) [2023-03-07 11:31:05,050][175731] Updated weights for policy 0, policy_version 80230 (0.0007) [2023-03-07 11:31:05,843][175731] Updated weights for policy 0, policy_version 80240 (0.0007) [2023-03-07 11:31:06,630][175731] Updated weights for policy 0, policy_version 80250 (0.0006) [2023-03-07 11:31:07,438][175731] Updated weights for policy 0, policy_version 80260 (0.0006) [2023-03-07 11:31:08,237][175731] Updated weights for policy 0, policy_version 80270 (0.0005) [2023-03-07 11:31:08,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12834.1, 300 sec: 12826.0). Total num frames: 82197504. Throughput: 0: 12822.5. Samples: 82196627. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:31:08,321][175405] Avg episode reward: [(0, '27.059')] [2023-03-07 11:31:09,029][175731] Updated weights for policy 0, policy_version 80280 (0.0006) [2023-03-07 11:31:09,832][175731] Updated weights for policy 0, policy_version 80290 (0.0008) [2023-03-07 11:31:10,640][175731] Updated weights for policy 0, policy_version 80300 (0.0007) [2023-03-07 11:31:11,433][175731] Updated weights for policy 0, policy_version 80310 (0.0006) [2023-03-07 11:31:12,223][175731] Updated weights for policy 0, policy_version 80320 (0.0007) [2023-03-07 11:31:13,018][175731] Updated weights for policy 0, policy_version 80330 (0.0006) [2023-03-07 11:31:13,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12822.6). Total num frames: 82260992. Throughput: 0: 12815.8. Samples: 82234953. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:31:13,322][175405] Avg episode reward: [(0, '25.969')] [2023-03-07 11:31:13,833][175731] Updated weights for policy 0, policy_version 80340 (0.0006) [2023-03-07 11:31:14,641][175731] Updated weights for policy 0, policy_version 80350 (0.0007) [2023-03-07 11:31:15,408][175731] Updated weights for policy 0, policy_version 80360 (0.0007) [2023-03-07 11:31:16,221][175731] Updated weights for policy 0, policy_version 80370 (0.0007) [2023-03-07 11:31:17,024][175731] Updated weights for policy 0, policy_version 80380 (0.0006) [2023-03-07 11:31:17,825][175731] Updated weights for policy 0, policy_version 80390 (0.0005) [2023-03-07 11:31:18,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12834.2, 300 sec: 12826.0). Total num frames: 82325504. Throughput: 0: 12816.3. Samples: 82311956. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:31:18,321][175405] Avg episode reward: [(0, '26.615')] [2023-03-07 11:31:18,615][175731] Updated weights for policy 0, policy_version 80400 (0.0007) [2023-03-07 11:31:19,412][175731] Updated weights for policy 0, policy_version 80410 (0.0007) [2023-03-07 11:31:20,207][175731] Updated weights for policy 0, policy_version 80420 (0.0008) [2023-03-07 11:31:21,042][175731] Updated weights for policy 0, policy_version 80430 (0.0006) [2023-03-07 11:31:21,817][175731] Updated weights for policy 0, policy_version 80440 (0.0006) [2023-03-07 11:31:22,617][175731] Updated weights for policy 0, policy_version 80450 (0.0006) [2023-03-07 11:31:23,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12822.6). Total num frames: 82388992. Throughput: 0: 12810.7. Samples: 82388689. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:31:23,321][175405] Avg episode reward: [(0, '27.003')] [2023-03-07 11:31:23,437][175731] Updated weights for policy 0, policy_version 80460 (0.0006) [2023-03-07 11:31:24,240][175731] Updated weights for policy 0, policy_version 80470 (0.0007) [2023-03-07 11:31:25,034][175731] Updated weights for policy 0, policy_version 80480 (0.0006) [2023-03-07 11:31:25,826][175731] Updated weights for policy 0, policy_version 80490 (0.0007) [2023-03-07 11:31:26,615][175731] Updated weights for policy 0, policy_version 80500 (0.0007) [2023-03-07 11:31:27,417][175731] Updated weights for policy 0, policy_version 80510 (0.0006) [2023-03-07 11:31:28,217][175731] Updated weights for policy 0, policy_version 80520 (0.0007) [2023-03-07 11:31:28,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12822.6). Total num frames: 82453504. Throughput: 0: 12807.5. Samples: 82427122. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:31:28,321][175405] Avg episode reward: [(0, '27.389')] [2023-03-07 11:31:29,009][175731] Updated weights for policy 0, policy_version 80530 (0.0007) [2023-03-07 11:31:29,797][175731] Updated weights for policy 0, policy_version 80540 (0.0007) [2023-03-07 11:31:30,599][175731] Updated weights for policy 0, policy_version 80550 (0.0007) [2023-03-07 11:31:31,398][175731] Updated weights for policy 0, policy_version 80560 (0.0006) [2023-03-07 11:31:32,177][175731] Updated weights for policy 0, policy_version 80570 (0.0006) [2023-03-07 11:31:32,975][175731] Updated weights for policy 0, policy_version 80580 (0.0005) [2023-03-07 11:31:33,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12817.1, 300 sec: 12826.0). Total num frames: 82518016. Throughput: 0: 12825.1. Samples: 82504494. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:31:33,321][175405] Avg episode reward: [(0, '26.566')] [2023-03-07 11:31:33,784][175731] Updated weights for policy 0, policy_version 80590 (0.0007) [2023-03-07 11:31:34,580][175731] Updated weights for policy 0, policy_version 80600 (0.0006) [2023-03-07 11:31:35,366][175731] Updated weights for policy 0, policy_version 80610 (0.0006) [2023-03-07 11:31:36,166][175731] Updated weights for policy 0, policy_version 80620 (0.0006) [2023-03-07 11:31:36,985][175731] Updated weights for policy 0, policy_version 80630 (0.0006) [2023-03-07 11:31:37,780][175731] Updated weights for policy 0, policy_version 80640 (0.0007) [2023-03-07 11:31:38,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12817.1, 300 sec: 12822.6). Total num frames: 82581504. Throughput: 0: 12819.0. Samples: 82581093. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:31:38,322][175405] Avg episode reward: [(0, '27.713')] [2023-03-07 11:31:38,593][175731] Updated weights for policy 0, policy_version 80650 (0.0007) [2023-03-07 11:31:39,382][175731] Updated weights for policy 0, policy_version 80660 (0.0006) [2023-03-07 11:31:40,181][175731] Updated weights for policy 0, policy_version 80670 (0.0006) [2023-03-07 11:31:40,985][175731] Updated weights for policy 0, policy_version 80680 (0.0006) [2023-03-07 11:31:41,774][175731] Updated weights for policy 0, policy_version 80690 (0.0006) [2023-03-07 11:31:42,566][175731] Updated weights for policy 0, policy_version 80700 (0.0006) [2023-03-07 11:31:43,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12817.1, 300 sec: 12822.6). Total num frames: 82646016. Throughput: 0: 12820.2. Samples: 82619657. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:31:43,322][175405] Avg episode reward: [(0, '25.632')] [2023-03-07 11:31:43,354][175731] Updated weights for policy 0, policy_version 80710 (0.0006) [2023-03-07 11:31:44,157][175731] Updated weights for policy 0, policy_version 80720 (0.0007) [2023-03-07 11:31:44,968][175731] Updated weights for policy 0, policy_version 80730 (0.0006) [2023-03-07 11:31:45,783][175731] Updated weights for policy 0, policy_version 80740 (0.0006) [2023-03-07 11:31:46,572][175731] Updated weights for policy 0, policy_version 80750 (0.0006) [2023-03-07 11:31:47,366][175731] Updated weights for policy 0, policy_version 80760 (0.0006) [2023-03-07 11:31:48,172][175731] Updated weights for policy 0, policy_version 80770 (0.0007) [2023-03-07 11:31:48,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12817.0, 300 sec: 12819.1). Total num frames: 82709504. Throughput: 0: 12824.6. Samples: 82696616. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:31:48,322][175405] Avg episode reward: [(0, '27.745')] [2023-03-07 11:31:48,327][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000080772_82710528.pth... [2023-03-07 11:31:48,358][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000077768_79634432.pth [2023-03-07 11:31:48,972][175731] Updated weights for policy 0, policy_version 80780 (0.0006) [2023-03-07 11:31:49,780][175731] Updated weights for policy 0, policy_version 80790 (0.0008) [2023-03-07 11:31:50,582][175731] Updated weights for policy 0, policy_version 80800 (0.0005) [2023-03-07 11:31:51,361][175731] Updated weights for policy 0, policy_version 80810 (0.0006) [2023-03-07 11:31:52,150][175731] Updated weights for policy 0, policy_version 80820 (0.0006) [2023-03-07 11:31:52,942][175731] Updated weights for policy 0, policy_version 80830 (0.0007) [2023-03-07 11:31:53,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12819.1). Total num frames: 82774016. Throughput: 0: 12825.6. Samples: 82773780. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:31:53,322][175405] Avg episode reward: [(0, '27.280')] [2023-03-07 11:31:53,742][175731] Updated weights for policy 0, policy_version 80840 (0.0007) [2023-03-07 11:31:54,525][175731] Updated weights for policy 0, policy_version 80850 (0.0006) [2023-03-07 11:31:55,335][175731] Updated weights for policy 0, policy_version 80860 (0.0006) [2023-03-07 11:31:56,143][175731] Updated weights for policy 0, policy_version 80870 (0.0006) [2023-03-07 11:31:56,969][175731] Updated weights for policy 0, policy_version 80880 (0.0006) [2023-03-07 11:31:57,744][175731] Updated weights for policy 0, policy_version 80890 (0.0007) [2023-03-07 11:31:58,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12817.1, 300 sec: 12822.6). Total num frames: 82838528. Throughput: 0: 12826.5. 
Samples: 82812148. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:31:58,322][175405] Avg episode reward: [(0, '26.204')] [2023-03-07 11:31:58,546][175731] Updated weights for policy 0, policy_version 80900 (0.0006) [2023-03-07 11:31:59,349][175731] Updated weights for policy 0, policy_version 80910 (0.0007) [2023-03-07 11:32:00,137][175731] Updated weights for policy 0, policy_version 80920 (0.0007) [2023-03-07 11:32:00,921][175731] Updated weights for policy 0, policy_version 80930 (0.0007) [2023-03-07 11:32:01,737][175731] Updated weights for policy 0, policy_version 80940 (0.0006) [2023-03-07 11:32:02,524][175731] Updated weights for policy 0, policy_version 80950 (0.0006) [2023-03-07 11:32:03,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12819.1). Total num frames: 82902016. Throughput: 0: 12824.8. Samples: 82889073. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:32:03,322][175405] Avg episode reward: [(0, '26.314')] [2023-03-07 11:32:03,335][175731] Updated weights for policy 0, policy_version 80960 (0.0006) [2023-03-07 11:32:04,126][175731] Updated weights for policy 0, policy_version 80970 (0.0006) [2023-03-07 11:32:04,923][175731] Updated weights for policy 0, policy_version 80980 (0.0006) [2023-03-07 11:32:05,716][175731] Updated weights for policy 0, policy_version 80990 (0.0007) [2023-03-07 11:32:06,504][175731] Updated weights for policy 0, policy_version 81000 (0.0006) [2023-03-07 11:32:07,294][175731] Updated weights for policy 0, policy_version 81010 (0.0006) [2023-03-07 11:32:08,112][175731] Updated weights for policy 0, policy_version 81020 (0.0007) [2023-03-07 11:32:08,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12819.1). Total num frames: 82966528. Throughput: 0: 12832.9. Samples: 82966169. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:32:08,321][175405] Avg episode reward: [(0, '25.502')] [2023-03-07 11:32:08,910][175731] Updated weights for policy 0, policy_version 81030 (0.0006) [2023-03-07 11:32:09,710][175731] Updated weights for policy 0, policy_version 81040 (0.0007) [2023-03-07 11:32:10,493][175731] Updated weights for policy 0, policy_version 81050 (0.0006) [2023-03-07 11:32:11,305][175731] Updated weights for policy 0, policy_version 81060 (0.0006) [2023-03-07 11:32:12,098][175731] Updated weights for policy 0, policy_version 81070 (0.0008) [2023-03-07 11:32:12,869][175731] Updated weights for policy 0, policy_version 81080 (0.0006) [2023-03-07 11:32:13,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12834.1, 300 sec: 12819.1). Total num frames: 83031040. Throughput: 0: 12835.9. Samples: 83004740. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:32:13,322][175405] Avg episode reward: [(0, '27.195')] [2023-03-07 11:32:13,682][175731] Updated weights for policy 0, policy_version 81090 (0.0006) [2023-03-07 11:32:14,490][175731] Updated weights for policy 0, policy_version 81100 (0.0006) [2023-03-07 11:32:15,279][175731] Updated weights for policy 0, policy_version 81110 (0.0007) [2023-03-07 11:32:16,083][175731] Updated weights for policy 0, policy_version 81120 (0.0006) [2023-03-07 11:32:16,900][175731] Updated weights for policy 0, policy_version 81130 (0.0006) [2023-03-07 11:32:17,689][175731] Updated weights for policy 0, policy_version 81140 (0.0006) [2023-03-07 11:32:18,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12834.1, 300 sec: 12822.6). Total num frames: 83095552. Throughput: 0: 12824.6. Samples: 83081599. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:32:18,322][175405] Avg episode reward: [(0, '26.765')] [2023-03-07 11:32:18,480][175731] Updated weights for policy 0, policy_version 81150 (0.0006) [2023-03-07 11:32:19,282][175731] Updated weights for policy 0, policy_version 81160 (0.0006) [2023-03-07 11:32:20,066][175731] Updated weights for policy 0, policy_version 81170 (0.0006) [2023-03-07 11:32:20,888][175731] Updated weights for policy 0, policy_version 81180 (0.0006) [2023-03-07 11:32:21,690][175731] Updated weights for policy 0, policy_version 81190 (0.0007) [2023-03-07 11:32:22,491][175731] Updated weights for policy 0, policy_version 81200 (0.0006) [2023-03-07 11:32:23,282][175731] Updated weights for policy 0, policy_version 81210 (0.0006) [2023-03-07 11:32:23,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12834.1, 300 sec: 12819.1). Total num frames: 83159040. Throughput: 0: 12834.2. Samples: 83158630. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:32:23,321][175405] Avg episode reward: [(0, '26.618')] [2023-03-07 11:32:24,073][175731] Updated weights for policy 0, policy_version 81220 (0.0006) [2023-03-07 11:32:24,869][175731] Updated weights for policy 0, policy_version 81230 (0.0006) [2023-03-07 11:32:25,659][175731] Updated weights for policy 0, policy_version 81240 (0.0006) [2023-03-07 11:32:26,449][175731] Updated weights for policy 0, policy_version 81250 (0.0006) [2023-03-07 11:32:27,264][175731] Updated weights for policy 0, policy_version 81260 (0.0008) [2023-03-07 11:32:28,060][175731] Updated weights for policy 0, policy_version 81270 (0.0007) [2023-03-07 11:32:28,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12834.1, 300 sec: 12819.1). Total num frames: 83223552. Throughput: 0: 12836.8. Samples: 83197312. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:32:28,322][175405] Avg episode reward: [(0, '25.314')] [2023-03-07 11:32:28,846][175731] Updated weights for policy 0, policy_version 81280 (0.0006) [2023-03-07 11:32:29,642][175731] Updated weights for policy 0, policy_version 81290 (0.0006) [2023-03-07 11:32:30,434][175731] Updated weights for policy 0, policy_version 81300 (0.0007) [2023-03-07 11:32:31,245][175731] Updated weights for policy 0, policy_version 81310 (0.0007) [2023-03-07 11:32:32,046][175731] Updated weights for policy 0, policy_version 81320 (0.0006) [2023-03-07 11:32:32,836][175731] Updated weights for policy 0, policy_version 81330 (0.0007) [2023-03-07 11:32:33,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12834.1, 300 sec: 12819.1). Total num frames: 83288064. Throughput: 0: 12836.5. Samples: 83274259. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:32:33,322][175405] Avg episode reward: [(0, '29.926')] [2023-03-07 11:32:33,625][175731] Updated weights for policy 0, policy_version 81340 (0.0007) [2023-03-07 11:32:34,429][175731] Updated weights for policy 0, policy_version 81350 (0.0006) [2023-03-07 11:32:35,235][175731] Updated weights for policy 0, policy_version 81360 (0.0006) [2023-03-07 11:32:36,011][175731] Updated weights for policy 0, policy_version 81370 (0.0007) [2023-03-07 11:32:36,811][175731] Updated weights for policy 0, policy_version 81380 (0.0005) [2023-03-07 11:32:37,629][175731] Updated weights for policy 0, policy_version 81390 (0.0006) [2023-03-07 11:32:38,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12834.1, 300 sec: 12819.1). Total num frames: 83351552. Throughput: 0: 12831.9. Samples: 83351217. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:32:38,322][175405] Avg episode reward: [(0, '25.188')] [2023-03-07 11:32:38,425][175731] Updated weights for policy 0, policy_version 81400 (0.0006) [2023-03-07 11:32:39,237][175731] Updated weights for policy 0, policy_version 81410 (0.0007) [2023-03-07 11:32:40,012][175731] Updated weights for policy 0, policy_version 81420 (0.0006) [2023-03-07 11:32:40,810][175731] Updated weights for policy 0, policy_version 81430 (0.0006) [2023-03-07 11:32:41,629][175731] Updated weights for policy 0, policy_version 81440 (0.0006) [2023-03-07 11:32:42,423][175731] Updated weights for policy 0, policy_version 81450 (0.0006) [2023-03-07 11:32:43,219][175731] Updated weights for policy 0, policy_version 81460 (0.0006) [2023-03-07 11:32:43,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12819.1). Total num frames: 83416064. Throughput: 0: 12834.1. Samples: 83389684. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:32:43,322][175405] Avg episode reward: [(0, '25.461')] [2023-03-07 11:32:44,020][175731] Updated weights for policy 0, policy_version 81470 (0.0007) [2023-03-07 11:32:44,818][175731] Updated weights for policy 0, policy_version 81480 (0.0006) [2023-03-07 11:32:45,626][175731] Updated weights for policy 0, policy_version 81490 (0.0006) [2023-03-07 11:32:46,416][175731] Updated weights for policy 0, policy_version 81500 (0.0006) [2023-03-07 11:32:47,211][175731] Updated weights for policy 0, policy_version 81510 (0.0007) [2023-03-07 11:32:48,017][175731] Updated weights for policy 0, policy_version 81520 (0.0007) [2023-03-07 11:32:48,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12834.1, 300 sec: 12815.6). Total num frames: 83479552. Throughput: 0: 12835.2. Samples: 83466658. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:32:48,332][175405] Avg episode reward: [(0, '27.345')] [2023-03-07 11:32:48,795][175731] Updated weights for policy 0, policy_version 81530 (0.0006) [2023-03-07 11:32:49,591][175731] Updated weights for policy 0, policy_version 81540 (0.0006) [2023-03-07 11:32:50,400][175731] Updated weights for policy 0, policy_version 81550 (0.0006) [2023-03-07 11:32:51,197][175731] Updated weights for policy 0, policy_version 81560 (0.0006) [2023-03-07 11:32:51,994][175731] Updated weights for policy 0, policy_version 81570 (0.0006) [2023-03-07 11:32:52,793][175731] Updated weights for policy 0, policy_version 81580 (0.0007) [2023-03-07 11:32:53,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12834.1, 300 sec: 12819.1). Total num frames: 83544064. Throughput: 0: 12832.9. Samples: 83543650. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:32:53,332][175405] Avg episode reward: [(0, '26.117')] [2023-03-07 11:32:53,594][175731] Updated weights for policy 0, policy_version 81590 (0.0007) [2023-03-07 11:32:54,409][175731] Updated weights for policy 0, policy_version 81600 (0.0006) [2023-03-07 11:32:55,198][175731] Updated weights for policy 0, policy_version 81610 (0.0006) [2023-03-07 11:32:56,002][175731] Updated weights for policy 0, policy_version 81620 (0.0006) [2023-03-07 11:32:56,806][175731] Updated weights for policy 0, policy_version 81630 (0.0006) [2023-03-07 11:32:57,598][175731] Updated weights for policy 0, policy_version 81640 (0.0006) [2023-03-07 11:32:58,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12817.1, 300 sec: 12815.6). Total num frames: 83607552. Throughput: 0: 12825.8. Samples: 83581903. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:32:58,333][175405] Avg episode reward: [(0, '25.717')] [2023-03-07 11:32:58,400][175731] Updated weights for policy 0, policy_version 81650 (0.0007) [2023-03-07 11:32:59,191][175731] Updated weights for policy 0, policy_version 81660 (0.0006) [2023-03-07 11:32:59,996][175731] Updated weights for policy 0, policy_version 81670 (0.0007) [2023-03-07 11:33:00,783][175731] Updated weights for policy 0, policy_version 81680 (0.0006) [2023-03-07 11:33:01,597][175731] Updated weights for policy 0, policy_version 81690 (0.0007) [2023-03-07 11:33:02,404][175731] Updated weights for policy 0, policy_version 81700 (0.0006) [2023-03-07 11:33:03,192][175731] Updated weights for policy 0, policy_version 81710 (0.0006) [2023-03-07 11:33:03,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12834.1, 300 sec: 12819.1). Total num frames: 83672064. Throughput: 0: 12824.7. Samples: 83658712. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:33:03,322][175405] Avg episode reward: [(0, '26.872')] [2023-03-07 11:33:03,973][175731] Updated weights for policy 0, policy_version 81720 (0.0007) [2023-03-07 11:33:04,789][175731] Updated weights for policy 0, policy_version 81730 (0.0006) [2023-03-07 11:33:05,576][175731] Updated weights for policy 0, policy_version 81740 (0.0007) [2023-03-07 11:33:06,382][175731] Updated weights for policy 0, policy_version 81750 (0.0006) [2023-03-07 11:33:07,174][175731] Updated weights for policy 0, policy_version 81760 (0.0006) [2023-03-07 11:33:07,976][175731] Updated weights for policy 0, policy_version 81770 (0.0007) [2023-03-07 11:33:08,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12834.1, 300 sec: 12822.6). Total num frames: 83736576. Throughput: 0: 12830.4. Samples: 83735999. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:33:08,322][175405] Avg episode reward: [(0, '25.681')] [2023-03-07 11:33:08,765][175731] Updated weights for policy 0, policy_version 81780 (0.0007) [2023-03-07 11:33:09,555][175731] Updated weights for policy 0, policy_version 81790 (0.0007) [2023-03-07 11:33:10,361][175731] Updated weights for policy 0, policy_version 81800 (0.0006) [2023-03-07 11:33:11,154][175731] Updated weights for policy 0, policy_version 81810 (0.0006) [2023-03-07 11:33:11,946][175731] Updated weights for policy 0, policy_version 81820 (0.0006) [2023-03-07 11:33:12,740][175731] Updated weights for policy 0, policy_version 81830 (0.0007) [2023-03-07 11:33:13,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12834.1, 300 sec: 12822.6). Total num frames: 83801088. Throughput: 0: 12827.9. Samples: 83774570. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:33:13,322][175405] Avg episode reward: [(0, '26.478')] [2023-03-07 11:33:13,541][175731] Updated weights for policy 0, policy_version 81840 (0.0006) [2023-03-07 11:33:14,355][175731] Updated weights for policy 0, policy_version 81850 (0.0006) [2023-03-07 11:33:15,141][175731] Updated weights for policy 0, policy_version 81860 (0.0006) [2023-03-07 11:33:15,949][175731] Updated weights for policy 0, policy_version 81870 (0.0006) [2023-03-07 11:33:16,759][175731] Updated weights for policy 0, policy_version 81880 (0.0006) [2023-03-07 11:33:17,561][175731] Updated weights for policy 0, policy_version 81890 (0.0007) [2023-03-07 11:33:18,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.0, 300 sec: 12822.6). Total num frames: 83864576. Throughput: 0: 12824.3. Samples: 83851353. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:33:18,321][175405] Avg episode reward: [(0, '25.903')] [2023-03-07 11:33:18,379][175731] Updated weights for policy 0, policy_version 81900 (0.0007) [2023-03-07 11:33:19,148][175731] Updated weights for policy 0, policy_version 81910 (0.0007) [2023-03-07 11:33:19,969][175731] Updated weights for policy 0, policy_version 81920 (0.0006) [2023-03-07 11:33:20,757][175731] Updated weights for policy 0, policy_version 81930 (0.0006) [2023-03-07 11:33:21,556][175731] Updated weights for policy 0, policy_version 81940 (0.0007) [2023-03-07 11:33:22,351][175731] Updated weights for policy 0, policy_version 81950 (0.0006) [2023-03-07 11:33:23,136][175731] Updated weights for policy 0, policy_version 81960 (0.0006) [2023-03-07 11:33:23,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12834.1, 300 sec: 12822.6). Total num frames: 83929088. Throughput: 0: 12825.4. Samples: 83928359. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:33:23,322][175405] Avg episode reward: [(0, '26.223')] [2023-03-07 11:33:23,931][175731] Updated weights for policy 0, policy_version 81970 (0.0006) [2023-03-07 11:33:24,746][175731] Updated weights for policy 0, policy_version 81980 (0.0006) [2023-03-07 11:33:25,528][175731] Updated weights for policy 0, policy_version 81990 (0.0006) [2023-03-07 11:33:26,326][175731] Updated weights for policy 0, policy_version 82000 (0.0007) [2023-03-07 11:33:27,134][175731] Updated weights for policy 0, policy_version 82010 (0.0006) [2023-03-07 11:33:27,922][175731] Updated weights for policy 0, policy_version 82020 (0.0007) [2023-03-07 11:33:28,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12834.1, 300 sec: 12826.0). Total num frames: 83993600. Throughput: 0: 12829.2. Samples: 83966998. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:33:28,321][175405] Avg episode reward: [(0, '25.815')] [2023-03-07 11:33:28,705][175731] Updated weights for policy 0, policy_version 82030 (0.0006) [2023-03-07 11:33:29,498][175731] Updated weights for policy 0, policy_version 82040 (0.0006) [2023-03-07 11:33:30,293][175731] Updated weights for policy 0, policy_version 82050 (0.0007) [2023-03-07 11:33:31,089][175731] Updated weights for policy 0, policy_version 82060 (0.0006) [2023-03-07 11:33:31,880][175731] Updated weights for policy 0, policy_version 82070 (0.0007) [2023-03-07 11:33:32,671][175731] Updated weights for policy 0, policy_version 82080 (0.0006) [2023-03-07 11:33:33,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12834.1, 300 sec: 12829.5). Total num frames: 84058112. Throughput: 0: 12837.6. Samples: 84044351. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:33:33,321][175405] Avg episode reward: [(0, '30.337')] [2023-03-07 11:33:33,455][175731] Updated weights for policy 0, policy_version 82090 (0.0006) [2023-03-07 11:33:34,253][175731] Updated weights for policy 0, policy_version 82100 (0.0006) [2023-03-07 11:33:35,063][175731] Updated weights for policy 0, policy_version 82110 (0.0006) [2023-03-07 11:33:35,867][175731] Updated weights for policy 0, policy_version 82120 (0.0006) [2023-03-07 11:33:36,653][175731] Updated weights for policy 0, policy_version 82130 (0.0006) [2023-03-07 11:33:37,460][175731] Updated weights for policy 0, policy_version 82140 (0.0007) [2023-03-07 11:33:38,264][175731] Updated weights for policy 0, policy_version 82150 (0.0006) [2023-03-07 11:33:38,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.2, 300 sec: 12826.0). Total num frames: 84121600. Throughput: 0: 12836.9. Samples: 84121309. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:33:38,321][175405] Avg episode reward: [(0, '26.301')] [2023-03-07 11:33:39,048][175731] Updated weights for policy 0, policy_version 82160 (0.0007) [2023-03-07 11:33:39,859][175731] Updated weights for policy 0, policy_version 82170 (0.0007) [2023-03-07 11:33:40,660][175731] Updated weights for policy 0, policy_version 82180 (0.0006) [2023-03-07 11:33:41,453][175731] Updated weights for policy 0, policy_version 82190 (0.0005) [2023-03-07 11:33:42,256][175731] Updated weights for policy 0, policy_version 82200 (0.0007) [2023-03-07 11:33:43,061][175731] Updated weights for policy 0, policy_version 82210 (0.0006) [2023-03-07 11:33:43,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12834.1, 300 sec: 12826.0). Total num frames: 84186112. Throughput: 0: 12840.6. Samples: 84159730. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:33:43,322][175405] Avg episode reward: [(0, '27.598')] [2023-03-07 11:33:43,850][175731] Updated weights for policy 0, policy_version 82220 (0.0007) [2023-03-07 11:33:44,648][175731] Updated weights for policy 0, policy_version 82230 (0.0007) [2023-03-07 11:33:45,442][175731] Updated weights for policy 0, policy_version 82240 (0.0006) [2023-03-07 11:33:46,242][175731] Updated weights for policy 0, policy_version 82250 (0.0006) [2023-03-07 11:33:47,039][175731] Updated weights for policy 0, policy_version 82260 (0.0006) [2023-03-07 11:33:47,849][175731] Updated weights for policy 0, policy_version 82270 (0.0006) [2023-03-07 11:33:48,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.2, 300 sec: 12826.0). Total num frames: 84249600. Throughput: 0: 12844.0. Samples: 84236692. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:33:48,332][175405] Avg episode reward: [(0, '29.462')] [2023-03-07 11:33:48,336][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000082276_84250624.pth... 
[2023-03-07 11:33:48,367][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000079269_81171456.pth [2023-03-07 11:33:48,638][175731] Updated weights for policy 0, policy_version 82280 (0.0006) [2023-03-07 11:33:49,446][175731] Updated weights for policy 0, policy_version 82290 (0.0008) [2023-03-07 11:33:50,241][175731] Updated weights for policy 0, policy_version 82300 (0.0006) [2023-03-07 11:33:51,032][175731] Updated weights for policy 0, policy_version 82310 (0.0007) [2023-03-07 11:33:51,822][175731] Updated weights for policy 0, policy_version 82320 (0.0007) [2023-03-07 11:33:52,618][175731] Updated weights for policy 0, policy_version 82330 (0.0006) [2023-03-07 11:33:53,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12851.2, 300 sec: 12833.0). Total num frames: 84315136. Throughput: 0: 12846.7. Samples: 84314098. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:33:53,332][175405] Avg episode reward: [(0, '26.540')] [2023-03-07 11:33:53,408][175731] Updated weights for policy 0, policy_version 82340 (0.0006) [2023-03-07 11:33:54,208][175731] Updated weights for policy 0, policy_version 82350 (0.0007) [2023-03-07 11:33:55,007][175731] Updated weights for policy 0, policy_version 82360 (0.0006) [2023-03-07 11:33:55,798][175731] Updated weights for policy 0, policy_version 82370 (0.0007) [2023-03-07 11:33:56,599][175731] Updated weights for policy 0, policy_version 82380 (0.0006) [2023-03-07 11:33:57,381][175731] Updated weights for policy 0, policy_version 82390 (0.0006) [2023-03-07 11:33:58,180][175731] Updated weights for policy 0, policy_version 82400 (0.0007) [2023-03-07 11:33:58,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12851.2, 300 sec: 12829.5). Total num frames: 84378624. Throughput: 0: 12846.4. Samples: 84352659. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:33:58,332][175405] Avg episode reward: [(0, '28.473')] [2023-03-07 11:33:58,977][175731] Updated weights for policy 0, policy_version 82410 (0.0006) [2023-03-07 11:33:59,768][175731] Updated weights for policy 0, policy_version 82420 (0.0006) [2023-03-07 11:34:00,581][175731] Updated weights for policy 0, policy_version 82430 (0.0006) [2023-03-07 11:34:01,382][175731] Updated weights for policy 0, policy_version 82440 (0.0006) [2023-03-07 11:34:02,176][175731] Updated weights for policy 0, policy_version 82450 (0.0006) [2023-03-07 11:34:02,980][175731] Updated weights for policy 0, policy_version 82460 (0.0007) [2023-03-07 11:34:03,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12851.2, 300 sec: 12829.5). Total num frames: 84443136. Throughput: 0: 12854.8. Samples: 84429820. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:34:03,332][175405] Avg episode reward: [(0, '26.405')] [2023-03-07 11:34:03,777][175731] Updated weights for policy 0, policy_version 82470 (0.0007) [2023-03-07 11:34:04,582][175731] Updated weights for policy 0, policy_version 82480 (0.0007) [2023-03-07 11:34:05,389][175731] Updated weights for policy 0, policy_version 82490 (0.0007) [2023-03-07 11:34:06,180][175731] Updated weights for policy 0, policy_version 82500 (0.0006) [2023-03-07 11:34:06,993][175731] Updated weights for policy 0, policy_version 82510 (0.0007) [2023-03-07 11:34:07,784][175731] Updated weights for policy 0, policy_version 82520 (0.0006) [2023-03-07 11:34:08,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12851.2, 300 sec: 12833.0). Total num frames: 84507648. Throughput: 0: 12850.8. Samples: 84506643. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:34:08,332][175405] Avg episode reward: [(0, '26.403')] [2023-03-07 11:34:08,587][175731] Updated weights for policy 0, policy_version 82530 (0.0007) [2023-03-07 11:34:09,384][175731] Updated weights for policy 0, policy_version 82540 (0.0006) [2023-03-07 11:34:10,168][175731] Updated weights for policy 0, policy_version 82550 (0.0007) [2023-03-07 11:34:10,978][175731] Updated weights for policy 0, policy_version 82560 (0.0007) [2023-03-07 11:34:11,778][175731] Updated weights for policy 0, policy_version 82570 (0.0006) [2023-03-07 11:34:12,562][175731] Updated weights for policy 0, policy_version 82580 (0.0006) [2023-03-07 11:34:13,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12829.5). Total num frames: 84571136. Throughput: 0: 12838.9. Samples: 84544750. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:34:13,332][175405] Avg episode reward: [(0, '27.357')] [2023-03-07 11:34:13,357][175731] Updated weights for policy 0, policy_version 82590 (0.0007) [2023-03-07 11:34:14,153][175731] Updated weights for policy 0, policy_version 82600 (0.0007) [2023-03-07 11:34:14,934][175731] Updated weights for policy 0, policy_version 82610 (0.0008) [2023-03-07 11:34:15,737][175731] Updated weights for policy 0, policy_version 82620 (0.0006) [2023-03-07 11:34:16,550][175731] Updated weights for policy 0, policy_version 82630 (0.0007) [2023-03-07 11:34:17,336][175731] Updated weights for policy 0, policy_version 82640 (0.0006) [2023-03-07 11:34:18,145][175731] Updated weights for policy 0, policy_version 82650 (0.0006) [2023-03-07 11:34:18,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12851.2, 300 sec: 12829.5). Total num frames: 84635648. Throughput: 0: 12838.9. Samples: 84622103. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:34:18,332][175405] Avg episode reward: [(0, '26.208')] [2023-03-07 11:34:18,943][175731] Updated weights for policy 0, policy_version 82660 (0.0006) [2023-03-07 11:34:19,742][175731] Updated weights for policy 0, policy_version 82670 (0.0007) [2023-03-07 11:34:20,545][175731] Updated weights for policy 0, policy_version 82680 (0.0006) [2023-03-07 11:34:21,336][175731] Updated weights for policy 0, policy_version 82690 (0.0006) [2023-03-07 11:34:22,125][175731] Updated weights for policy 0, policy_version 82700 (0.0007) [2023-03-07 11:34:22,925][175731] Updated weights for policy 0, policy_version 82710 (0.0006) [2023-03-07 11:34:23,321][175405] Fps is (10 sec: 12902.6, 60 sec: 12851.2, 300 sec: 12833.0). Total num frames: 84700160. Throughput: 0: 12841.3. Samples: 84699168. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:34:23,332][175405] Avg episode reward: [(0, '27.015')] [2023-03-07 11:34:23,738][175731] Updated weights for policy 0, policy_version 82720 (0.0006) [2023-03-07 11:34:24,527][175731] Updated weights for policy 0, policy_version 82730 (0.0007) [2023-03-07 11:34:25,328][175731] Updated weights for policy 0, policy_version 82740 (0.0006) [2023-03-07 11:34:26,146][175731] Updated weights for policy 0, policy_version 82750 (0.0006) [2023-03-07 11:34:26,943][175731] Updated weights for policy 0, policy_version 82760 (0.0007) [2023-03-07 11:34:27,732][175731] Updated weights for policy 0, policy_version 82770 (0.0006) [2023-03-07 11:34:28,321][175405] Fps is (10 sec: 12800.2, 60 sec: 12834.1, 300 sec: 12829.5). Total num frames: 84763648. Throughput: 0: 12837.6. Samples: 84737419. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:34:28,332][175405] Avg episode reward: [(0, '26.071')] [2023-03-07 11:34:28,537][175731] Updated weights for policy 0, policy_version 82780 (0.0006) [2023-03-07 11:34:29,325][175731] Updated weights for policy 0, policy_version 82790 (0.0006) [2023-03-07 11:34:30,126][175731] Updated weights for policy 0, policy_version 82800 (0.0006) [2023-03-07 11:34:30,913][175731] Updated weights for policy 0, policy_version 82810 (0.0006) [2023-03-07 11:34:31,710][175731] Updated weights for policy 0, policy_version 82820 (0.0006) [2023-03-07 11:34:32,483][175731] Updated weights for policy 0, policy_version 82830 (0.0007) [2023-03-07 11:34:33,290][175731] Updated weights for policy 0, policy_version 82840 (0.0007) [2023-03-07 11:34:33,321][175405] Fps is (10 sec: 12799.7, 60 sec: 12834.1, 300 sec: 12829.5). Total num frames: 84828160. Throughput: 0: 12846.3. Samples: 84814778. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:34:33,332][175405] Avg episode reward: [(0, '27.418')] [2023-03-07 11:34:34,074][175731] Updated weights for policy 0, policy_version 82850 (0.0006) [2023-03-07 11:34:34,885][175731] Updated weights for policy 0, policy_version 82860 (0.0007) [2023-03-07 11:34:35,679][175731] Updated weights for policy 0, policy_version 82870 (0.0006) [2023-03-07 11:34:36,473][175731] Updated weights for policy 0, policy_version 82880 (0.0007) [2023-03-07 11:34:37,253][175731] Updated weights for policy 0, policy_version 82890 (0.0007) [2023-03-07 11:34:38,045][175731] Updated weights for policy 0, policy_version 82900 (0.0006) [2023-03-07 11:34:38,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12851.2, 300 sec: 12833.0). Total num frames: 84892672. Throughput: 0: 12842.0. Samples: 84891987. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:34:38,332][175405] Avg episode reward: [(0, '25.778')] [2023-03-07 11:34:38,853][175731] Updated weights for policy 0, policy_version 82910 (0.0006) [2023-03-07 11:34:39,646][175731] Updated weights for policy 0, policy_version 82920 (0.0005) [2023-03-07 11:34:40,439][175731] Updated weights for policy 0, policy_version 82930 (0.0007) [2023-03-07 11:34:41,225][175731] Updated weights for policy 0, policy_version 82940 (0.0007) [2023-03-07 11:34:42,026][175731] Updated weights for policy 0, policy_version 82950 (0.0005) [2023-03-07 11:34:42,813][175731] Updated weights for policy 0, policy_version 82960 (0.0007) [2023-03-07 11:34:43,321][175405] Fps is (10 sec: 12902.7, 60 sec: 12851.2, 300 sec: 12836.5). Total num frames: 84957184. Throughput: 0: 12847.4. Samples: 84930791. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:34:43,332][175405] Avg episode reward: [(0, '26.654')] [2023-03-07 11:34:43,621][175731] Updated weights for policy 0, policy_version 82970 (0.0007) [2023-03-07 11:34:44,413][175731] Updated weights for policy 0, policy_version 82980 (0.0006) [2023-03-07 11:34:45,198][175731] Updated weights for policy 0, policy_version 82990 (0.0006) [2023-03-07 11:34:46,016][175731] Updated weights for policy 0, policy_version 83000 (0.0007) [2023-03-07 11:34:46,805][175731] Updated weights for policy 0, policy_version 83010 (0.0007) [2023-03-07 11:34:47,622][175731] Updated weights for policy 0, policy_version 83020 (0.0006) [2023-03-07 11:34:48,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12833.0). Total num frames: 85020672. Throughput: 0: 12841.4. Samples: 85007681. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:34:48,332][175405] Avg episode reward: [(0, '26.995')] [2023-03-07 11:34:48,417][175731] Updated weights for policy 0, policy_version 83030 (0.0007) [2023-03-07 11:34:49,219][175731] Updated weights for policy 0, policy_version 83040 (0.0007) [2023-03-07 11:34:50,003][175731] Updated weights for policy 0, policy_version 83050 (0.0007) [2023-03-07 11:34:50,821][175731] Updated weights for policy 0, policy_version 83060 (0.0006) [2023-03-07 11:34:51,610][175731] Updated weights for policy 0, policy_version 83070 (0.0006) [2023-03-07 11:34:52,402][175731] Updated weights for policy 0, policy_version 83080 (0.0005) [2023-03-07 11:34:53,186][175731] Updated weights for policy 0, policy_version 83090 (0.0006) [2023-03-07 11:34:53,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12834.1, 300 sec: 12833.0). Total num frames: 85085184. Throughput: 0: 12849.1. Samples: 85084855. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:34:53,322][175405] Avg episode reward: [(0, '25.810')] [2023-03-07 11:34:53,986][175731] Updated weights for policy 0, policy_version 83100 (0.0007) [2023-03-07 11:34:54,782][175731] Updated weights for policy 0, policy_version 83110 (0.0007) [2023-03-07 11:34:55,573][175731] Updated weights for policy 0, policy_version 83120 (0.0007) [2023-03-07 11:34:56,370][175731] Updated weights for policy 0, policy_version 83130 (0.0006) [2023-03-07 11:34:57,147][175731] Updated weights for policy 0, policy_version 83140 (0.0006) [2023-03-07 11:34:57,949][175731] Updated weights for policy 0, policy_version 83150 (0.0006) [2023-03-07 11:34:58,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12851.2, 300 sec: 12833.0). Total num frames: 85149696. Throughput: 0: 12861.0. Samples: 85123492. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:34:58,321][175405] Avg episode reward: [(0, '26.770')] [2023-03-07 11:34:58,747][175731] Updated weights for policy 0, policy_version 83160 (0.0006) [2023-03-07 11:34:59,558][175731] Updated weights for policy 0, policy_version 83170 (0.0007) [2023-03-07 11:35:00,369][175731] Updated weights for policy 0, policy_version 83180 (0.0006) [2023-03-07 11:35:01,159][175731] Updated weights for policy 0, policy_version 83190 (0.0007) [2023-03-07 11:35:01,954][175731] Updated weights for policy 0, policy_version 83200 (0.0006) [2023-03-07 11:35:02,755][175731] Updated weights for policy 0, policy_version 83210 (0.0006) [2023-03-07 11:35:03,321][175405] Fps is (10 sec: 12902.6, 60 sec: 12851.2, 300 sec: 12836.5). Total num frames: 85214208. Throughput: 0: 12853.1. Samples: 85200489. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:35:03,322][175405] Avg episode reward: [(0, '27.717')] [2023-03-07 11:35:03,548][175731] Updated weights for policy 0, policy_version 83220 (0.0007) [2023-03-07 11:35:04,337][175731] Updated weights for policy 0, policy_version 83230 (0.0006) [2023-03-07 11:35:05,142][175731] Updated weights for policy 0, policy_version 83240 (0.0005) [2023-03-07 11:35:05,940][175731] Updated weights for policy 0, policy_version 83250 (0.0006) [2023-03-07 11:35:06,748][175731] Updated weights for policy 0, policy_version 83260 (0.0007) [2023-03-07 11:35:07,550][175731] Updated weights for policy 0, policy_version 83270 (0.0007) [2023-03-07 11:35:08,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12834.1, 300 sec: 12833.0). Total num frames: 85277696. Throughput: 0: 12851.6. Samples: 85277490. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:35:08,322][175405] Avg episode reward: [(0, '26.324')] [2023-03-07 11:35:08,337][175731] Updated weights for policy 0, policy_version 83280 (0.0007) [2023-03-07 11:35:09,145][175731] Updated weights for policy 0, policy_version 83290 (0.0006) [2023-03-07 11:35:09,930][175731] Updated weights for policy 0, policy_version 83300 (0.0008) [2023-03-07 11:35:10,729][175731] Updated weights for policy 0, policy_version 83310 (0.0007) [2023-03-07 11:35:11,538][175731] Updated weights for policy 0, policy_version 83320 (0.0007) [2023-03-07 11:35:12,318][175731] Updated weights for policy 0, policy_version 83330 (0.0006) [2023-03-07 11:35:13,119][175731] Updated weights for policy 0, policy_version 83340 (0.0006) [2023-03-07 11:35:13,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12851.2, 300 sec: 12836.4). Total num frames: 85342208. Throughput: 0: 12856.0. Samples: 85315941. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:35:13,322][175405] Avg episode reward: [(0, '25.079')] [2023-03-07 11:35:13,917][175731] Updated weights for policy 0, policy_version 83350 (0.0008) [2023-03-07 11:35:14,687][175731] Updated weights for policy 0, policy_version 83360 (0.0006) [2023-03-07 11:35:15,504][175731] Updated weights for policy 0, policy_version 83370 (0.0007) [2023-03-07 11:35:16,286][175731] Updated weights for policy 0, policy_version 83380 (0.0006) [2023-03-07 11:35:17,086][175731] Updated weights for policy 0, policy_version 83390 (0.0006) [2023-03-07 11:35:17,914][175731] Updated weights for policy 0, policy_version 83400 (0.0006) [2023-03-07 11:35:18,321][175405] Fps is (10 sec: 12902.6, 60 sec: 12851.2, 300 sec: 12836.5). Total num frames: 85406720. Throughput: 0: 12853.5. Samples: 85393185. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:35:18,321][175405] Avg episode reward: [(0, '26.058')] [2023-03-07 11:35:18,710][175731] Updated weights for policy 0, policy_version 83410 (0.0007) [2023-03-07 11:35:19,503][175731] Updated weights for policy 0, policy_version 83420 (0.0006) [2023-03-07 11:35:20,310][175731] Updated weights for policy 0, policy_version 83430 (0.0006) [2023-03-07 11:35:21,101][175731] Updated weights for policy 0, policy_version 83440 (0.0006) [2023-03-07 11:35:21,897][175731] Updated weights for policy 0, policy_version 83450 (0.0005) [2023-03-07 11:35:22,687][175731] Updated weights for policy 0, policy_version 83460 (0.0007) [2023-03-07 11:35:23,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12833.0). Total num frames: 85470208. Throughput: 0: 12847.4. Samples: 85470117. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:35:23,332][175405] Avg episode reward: [(0, '25.874')] [2023-03-07 11:35:23,499][175731] Updated weights for policy 0, policy_version 83470 (0.0006) [2023-03-07 11:35:24,298][175731] Updated weights for policy 0, policy_version 83480 (0.0006) [2023-03-07 11:35:25,089][175731] Updated weights for policy 0, policy_version 83490 (0.0006) [2023-03-07 11:35:25,881][175731] Updated weights for policy 0, policy_version 83500 (0.0006) [2023-03-07 11:35:26,667][175731] Updated weights for policy 0, policy_version 83510 (0.0008) [2023-03-07 11:35:27,485][175731] Updated weights for policy 0, policy_version 83520 (0.0006) [2023-03-07 11:35:28,282][175731] Updated weights for policy 0, policy_version 83530 (0.0006) [2023-03-07 11:35:28,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12851.2, 300 sec: 12833.0). Total num frames: 85534720. Throughput: 0: 12841.4. Samples: 85508655. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:35:28,332][175405] Avg episode reward: [(0, '25.023')] [2023-03-07 11:35:29,074][175731] Updated weights for policy 0, policy_version 83540 (0.0006) [2023-03-07 11:35:29,886][175731] Updated weights for policy 0, policy_version 83550 (0.0007) [2023-03-07 11:35:30,685][175731] Updated weights for policy 0, policy_version 83560 (0.0008) [2023-03-07 11:35:31,506][175731] Updated weights for policy 0, policy_version 83570 (0.0006) [2023-03-07 11:35:32,311][175731] Updated weights for policy 0, policy_version 83580 (0.0007) [2023-03-07 11:35:33,120][175731] Updated weights for policy 0, policy_version 83590 (0.0006) [2023-03-07 11:35:33,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12834.2, 300 sec: 12833.0). Total num frames: 85598208. Throughput: 0: 12831.4. Samples: 85585095. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:35:33,332][175405] Avg episode reward: [(0, '28.704')] [2023-03-07 11:35:33,924][175731] Updated weights for policy 0, policy_version 83600 (0.0006) [2023-03-07 11:35:34,720][175731] Updated weights for policy 0, policy_version 83610 (0.0007) [2023-03-07 11:35:35,514][175731] Updated weights for policy 0, policy_version 83620 (0.0007) [2023-03-07 11:35:36,310][175731] Updated weights for policy 0, policy_version 83630 (0.0006) [2023-03-07 11:35:37,114][175731] Updated weights for policy 0, policy_version 83640 (0.0007) [2023-03-07 11:35:37,896][175731] Updated weights for policy 0, policy_version 83650 (0.0006) [2023-03-07 11:35:38,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12834.1, 300 sec: 12833.0). Total num frames: 85662720. Throughput: 0: 12825.8. Samples: 85662013. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:35:38,332][175405] Avg episode reward: [(0, '30.276')] [2023-03-07 11:35:38,692][175731] Updated weights for policy 0, policy_version 83660 (0.0006) [2023-03-07 11:35:39,499][175731] Updated weights for policy 0, policy_version 83670 (0.0006) [2023-03-07 11:35:40,274][175731] Updated weights for policy 0, policy_version 83680 (0.0007) [2023-03-07 11:35:41,085][175731] Updated weights for policy 0, policy_version 83690 (0.0007) [2023-03-07 11:35:41,873][175731] Updated weights for policy 0, policy_version 83700 (0.0007) [2023-03-07 11:35:42,698][175731] Updated weights for policy 0, policy_version 83710 (0.0007) [2023-03-07 11:35:43,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.0, 300 sec: 12833.0). Total num frames: 85726208. Throughput: 0: 12822.8. Samples: 85700520. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:35:43,332][175405] Avg episode reward: [(0, '26.261')] [2023-03-07 11:35:43,498][175731] Updated weights for policy 0, policy_version 83720 (0.0007) [2023-03-07 11:35:44,305][175731] Updated weights for policy 0, policy_version 83730 (0.0007) [2023-03-07 11:35:45,118][175731] Updated weights for policy 0, policy_version 83740 (0.0006) [2023-03-07 11:35:45,896][175731] Updated weights for policy 0, policy_version 83750 (0.0006) [2023-03-07 11:35:46,695][175731] Updated weights for policy 0, policy_version 83760 (0.0006) [2023-03-07 11:35:47,491][175731] Updated weights for policy 0, policy_version 83770 (0.0006) [2023-03-07 11:35:48,292][175731] Updated weights for policy 0, policy_version 83780 (0.0006) [2023-03-07 11:35:48,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12834.1, 300 sec: 12833.0). Total num frames: 85790720. Throughput: 0: 12816.0. Samples: 85777211. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:35:48,322][175405] Avg episode reward: [(0, '26.483')] [2023-03-07 11:35:48,325][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000083780_85790720.pth... [2023-03-07 11:35:48,356][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000080772_82710528.pth [2023-03-07 11:35:49,096][175731] Updated weights for policy 0, policy_version 83790 (0.0006) [2023-03-07 11:35:49,890][175731] Updated weights for policy 0, policy_version 83800 (0.0006) [2023-03-07 11:35:50,693][175731] Updated weights for policy 0, policy_version 83810 (0.0006) [2023-03-07 11:35:51,493][175731] Updated weights for policy 0, policy_version 83820 (0.0006) [2023-03-07 11:35:52,275][175731] Updated weights for policy 0, policy_version 83830 (0.0006) [2023-03-07 11:35:53,070][175731] Updated weights for policy 0, policy_version 83840 (0.0006) [2023-03-07 11:35:53,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12834.2, 300 sec: 12833.0). Total num frames: 85855232. Throughput: 0: 12820.1. Samples: 85854396. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:35:53,332][175405] Avg episode reward: [(0, '26.875')] [2023-03-07 11:35:53,861][175731] Updated weights for policy 0, policy_version 83850 (0.0006) [2023-03-07 11:35:54,676][175731] Updated weights for policy 0, policy_version 83860 (0.0006) [2023-03-07 11:35:55,462][175731] Updated weights for policy 0, policy_version 83870 (0.0007) [2023-03-07 11:35:56,271][175731] Updated weights for policy 0, policy_version 83880 (0.0006) [2023-03-07 11:35:57,089][175731] Updated weights for policy 0, policy_version 83890 (0.0007) [2023-03-07 11:35:57,859][175731] Updated weights for policy 0, policy_version 83900 (0.0006) [2023-03-07 11:35:58,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.0, 300 sec: 12833.0). Total num frames: 85918720. Throughput: 0: 12817.4. 
Samples: 85892727. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:35:58,332][175405] Avg episode reward: [(0, '26.439')] [2023-03-07 11:35:58,658][175731] Updated weights for policy 0, policy_version 83910 (0.0006) [2023-03-07 11:35:59,477][175731] Updated weights for policy 0, policy_version 83920 (0.0008) [2023-03-07 11:36:00,257][175731] Updated weights for policy 0, policy_version 83930 (0.0007) [2023-03-07 11:36:01,057][175731] Updated weights for policy 0, policy_version 83940 (0.0007) [2023-03-07 11:36:01,861][175731] Updated weights for policy 0, policy_version 83950 (0.0006) [2023-03-07 11:36:02,664][175731] Updated weights for policy 0, policy_version 83960 (0.0006) [2023-03-07 11:36:03,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12817.0, 300 sec: 12833.0). Total num frames: 85983232. Throughput: 0: 12808.4. Samples: 85969565. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:36:03,333][175405] Avg episode reward: [(0, '26.574')] [2023-03-07 11:36:03,462][175731] Updated weights for policy 0, policy_version 83970 (0.0006) [2023-03-07 11:36:04,257][175731] Updated weights for policy 0, policy_version 83980 (0.0007) [2023-03-07 11:36:05,070][175731] Updated weights for policy 0, policy_version 83990 (0.0007) [2023-03-07 11:36:05,884][175731] Updated weights for policy 0, policy_version 84000 (0.0008) [2023-03-07 11:36:06,662][175731] Updated weights for policy 0, policy_version 84010 (0.0006) [2023-03-07 11:36:07,481][175731] Updated weights for policy 0, policy_version 84020 (0.0007) [2023-03-07 11:36:08,278][175731] Updated weights for policy 0, policy_version 84030 (0.0006) [2023-03-07 11:36:08,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.0, 300 sec: 12833.0). Total num frames: 86046720. Throughput: 0: 12804.8. Samples: 86046334. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:36:08,332][175405] Avg episode reward: [(0, '28.069')] [2023-03-07 11:36:09,081][175731] Updated weights for policy 0, policy_version 84040 (0.0007) [2023-03-07 11:36:09,885][175731] Updated weights for policy 0, policy_version 84050 (0.0006) [2023-03-07 11:36:10,689][175731] Updated weights for policy 0, policy_version 84060 (0.0007) [2023-03-07 11:36:11,497][175731] Updated weights for policy 0, policy_version 84070 (0.0006) [2023-03-07 11:36:12,274][175731] Updated weights for policy 0, policy_version 84080 (0.0006) [2023-03-07 11:36:13,089][175731] Updated weights for policy 0, policy_version 84090 (0.0006) [2023-03-07 11:36:13,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12817.0, 300 sec: 12833.0). Total num frames: 86111232. Throughput: 0: 12803.0. Samples: 86084791. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:36:13,332][175405] Avg episode reward: [(0, '26.770')] [2023-03-07 11:36:13,894][175731] Updated weights for policy 0, policy_version 84100 (0.0007) [2023-03-07 11:36:14,700][175731] Updated weights for policy 0, policy_version 84110 (0.0006) [2023-03-07 11:36:15,505][175731] Updated weights for policy 0, policy_version 84120 (0.0006) [2023-03-07 11:36:16,302][175731] Updated weights for policy 0, policy_version 84130 (0.0006) [2023-03-07 11:36:17,108][175731] Updated weights for policy 0, policy_version 84140 (0.0006) [2023-03-07 11:36:17,909][175731] Updated weights for policy 0, policy_version 84150 (0.0007) [2023-03-07 11:36:18,321][175405] Fps is (10 sec: 12800.2, 60 sec: 12800.0, 300 sec: 12833.0). Total num frames: 86174720. Throughput: 0: 12803.0. Samples: 86161228. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:36:18,332][175405] Avg episode reward: [(0, '27.125')] [2023-03-07 11:36:18,700][175731] Updated weights for policy 0, policy_version 84160 (0.0008) [2023-03-07 11:36:19,495][175731] Updated weights for policy 0, policy_version 84170 (0.0007) [2023-03-07 11:36:20,298][175731] Updated weights for policy 0, policy_version 84180 (0.0006) [2023-03-07 11:36:21,101][175731] Updated weights for policy 0, policy_version 84190 (0.0006) [2023-03-07 11:36:21,911][175731] Updated weights for policy 0, policy_version 84200 (0.0007) [2023-03-07 11:36:22,696][175731] Updated weights for policy 0, policy_version 84210 (0.0006) [2023-03-07 11:36:23,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12800.0, 300 sec: 12829.5). Total num frames: 86238208. Throughput: 0: 12803.7. Samples: 86238179. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:36:23,332][175405] Avg episode reward: [(0, '30.093')] [2023-03-07 11:36:23,489][175731] Updated weights for policy 0, policy_version 84220 (0.0007) [2023-03-07 11:36:24,268][175731] Updated weights for policy 0, policy_version 84230 (0.0006) [2023-03-07 11:36:25,059][175731] Updated weights for policy 0, policy_version 84240 (0.0007) [2023-03-07 11:36:25,871][175731] Updated weights for policy 0, policy_version 84250 (0.0006) [2023-03-07 11:36:26,679][175731] Updated weights for policy 0, policy_version 84260 (0.0006) [2023-03-07 11:36:27,469][175731] Updated weights for policy 0, policy_version 84270 (0.0007) [2023-03-07 11:36:28,288][175731] Updated weights for policy 0, policy_version 84280 (0.0006) [2023-03-07 11:36:28,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12829.5). Total num frames: 86302720. Throughput: 0: 12803.8. Samples: 86276689. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:36:28,332][175405] Avg episode reward: [(0, '25.615')] [2023-03-07 11:36:29,081][175731] Updated weights for policy 0, policy_version 84290 (0.0006) [2023-03-07 11:36:29,871][175731] Updated weights for policy 0, policy_version 84300 (0.0007) [2023-03-07 11:36:30,681][175731] Updated weights for policy 0, policy_version 84310 (0.0007) [2023-03-07 11:36:31,452][175731] Updated weights for policy 0, policy_version 84320 (0.0007) [2023-03-07 11:36:32,250][175731] Updated weights for policy 0, policy_version 84330 (0.0007) [2023-03-07 11:36:33,054][175731] Updated weights for policy 0, policy_version 84340 (0.0006) [2023-03-07 11:36:33,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12817.1, 300 sec: 12833.0). Total num frames: 86367232. Throughput: 0: 12808.1. Samples: 86353577. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 11:36:33,332][175405] Avg episode reward: [(0, '28.506')] [2023-03-07 11:36:33,849][175731] Updated weights for policy 0, policy_version 84350 (0.0006) [2023-03-07 11:36:34,652][175731] Updated weights for policy 0, policy_version 84360 (0.0007) [2023-03-07 11:36:35,458][175731] Updated weights for policy 0, policy_version 84370 (0.0007) [2023-03-07 11:36:36,249][175731] Updated weights for policy 0, policy_version 84380 (0.0007) [2023-03-07 11:36:37,049][175731] Updated weights for policy 0, policy_version 84390 (0.0006) [2023-03-07 11:36:37,831][175731] Updated weights for policy 0, policy_version 84400 (0.0007) [2023-03-07 11:36:38,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12817.1, 300 sec: 12833.0). Total num frames: 86431744. Throughput: 0: 12811.3. Samples: 86430904. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 11:36:38,332][175405] Avg episode reward: [(0, '27.793')] [2023-03-07 11:36:38,632][175731] Updated weights for policy 0, policy_version 84410 (0.0005) [2023-03-07 11:36:39,416][175731] Updated weights for policy 0, policy_version 84420 (0.0006) [2023-03-07 11:36:40,212][175731] Updated weights for policy 0, policy_version 84430 (0.0006) [2023-03-07 11:36:41,010][175731] Updated weights for policy 0, policy_version 84440 (0.0006) [2023-03-07 11:36:41,806][175731] Updated weights for policy 0, policy_version 84450 (0.0006) [2023-03-07 11:36:42,610][175731] Updated weights for policy 0, policy_version 84460 (0.0007) [2023-03-07 11:36:43,321][175405] Fps is (10 sec: 12902.6, 60 sec: 12834.2, 300 sec: 12836.5). Total num frames: 86496256. Throughput: 0: 12817.2. Samples: 86469497. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 11:36:43,321][175405] Avg episode reward: [(0, '27.315')] [2023-03-07 11:36:43,405][175731] Updated weights for policy 0, policy_version 84470 (0.0007) [2023-03-07 11:36:44,189][175731] Updated weights for policy 0, policy_version 84480 (0.0006) [2023-03-07 11:36:44,995][175731] Updated weights for policy 0, policy_version 84490 (0.0005) [2023-03-07 11:36:45,792][175731] Updated weights for policy 0, policy_version 84500 (0.0007) [2023-03-07 11:36:46,598][175731] Updated weights for policy 0, policy_version 84510 (0.0006) [2023-03-07 11:36:47,386][175731] Updated weights for policy 0, policy_version 84520 (0.0006) [2023-03-07 11:36:48,181][175731] Updated weights for policy 0, policy_version 84530 (0.0006) [2023-03-07 11:36:48,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12817.1, 300 sec: 12833.0). Total num frames: 86559744. Throughput: 0: 12822.9. Samples: 86546598. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 11:36:48,322][175405] Avg episode reward: [(0, '27.236')] [2023-03-07 11:36:48,971][175731] Updated weights for policy 0, policy_version 84540 (0.0007) [2023-03-07 11:36:49,771][175731] Updated weights for policy 0, policy_version 84550 (0.0007) [2023-03-07 11:36:50,565][175731] Updated weights for policy 0, policy_version 84560 (0.0006) [2023-03-07 11:36:51,377][175731] Updated weights for policy 0, policy_version 84570 (0.0006) [2023-03-07 11:36:52,177][175731] Updated weights for policy 0, policy_version 84580 (0.0006) [2023-03-07 11:36:52,970][175731] Updated weights for policy 0, policy_version 84590 (0.0006) [2023-03-07 11:36:53,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12817.0, 300 sec: 12833.0). Total num frames: 86624256. Throughput: 0: 12827.8. Samples: 86623583. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 11:36:53,322][175405] Avg episode reward: [(0, '27.255')] [2023-03-07 11:36:53,776][175731] Updated weights for policy 0, policy_version 84600 (0.0006) [2023-03-07 11:36:54,554][175731] Updated weights for policy 0, policy_version 84610 (0.0007) [2023-03-07 11:36:55,351][175731] Updated weights for policy 0, policy_version 84620 (0.0006) [2023-03-07 11:36:56,164][175731] Updated weights for policy 0, policy_version 84630 (0.0007) [2023-03-07 11:36:56,957][175731] Updated weights for policy 0, policy_version 84640 (0.0007) [2023-03-07 11:36:57,752][175731] Updated weights for policy 0, policy_version 84650 (0.0006) [2023-03-07 11:36:58,321][175405] Fps is (10 sec: 12902.6, 60 sec: 12834.2, 300 sec: 12836.4). Total num frames: 86688768. Throughput: 0: 12829.6. Samples: 86662123. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 11:36:58,321][175405] Avg episode reward: [(0, '29.087')] [2023-03-07 11:36:58,552][175731] Updated weights for policy 0, policy_version 84660 (0.0006) [2023-03-07 11:36:59,341][175731] Updated weights for policy 0, policy_version 84670 (0.0006) [2023-03-07 11:37:00,137][175731] Updated weights for policy 0, policy_version 84680 (0.0007) [2023-03-07 11:37:00,935][175731] Updated weights for policy 0, policy_version 84690 (0.0006) [2023-03-07 11:37:01,724][175731] Updated weights for policy 0, policy_version 84700 (0.0006) [2023-03-07 11:37:02,514][175731] Updated weights for policy 0, policy_version 84710 (0.0006) [2023-03-07 11:37:03,315][175731] Updated weights for policy 0, policy_version 84720 (0.0007) [2023-03-07 11:37:03,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12834.1, 300 sec: 12836.4). Total num frames: 86753280. Throughput: 0: 12850.2. Samples: 86739489. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 11:37:03,321][175405] Avg episode reward: [(0, '27.749')] [2023-03-07 11:37:04,119][175731] Updated weights for policy 0, policy_version 84730 (0.0006) [2023-03-07 11:37:04,932][175731] Updated weights for policy 0, policy_version 84740 (0.0007) [2023-03-07 11:37:05,717][175731] Updated weights for policy 0, policy_version 84750 (0.0006) [2023-03-07 11:37:06,533][175731] Updated weights for policy 0, policy_version 84760 (0.0006) [2023-03-07 11:37:07,324][175731] Updated weights for policy 0, policy_version 84770 (0.0006) [2023-03-07 11:37:08,126][175731] Updated weights for policy 0, policy_version 84780 (0.0005) [2023-03-07 11:37:08,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12834.1, 300 sec: 12833.0). Total num frames: 86816768. Throughput: 0: 12842.9. Samples: 86816111. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 11:37:08,322][175405] Avg episode reward: [(0, '27.317')] [2023-03-07 11:37:08,926][175731] Updated weights for policy 0, policy_version 84790 (0.0007) [2023-03-07 11:37:09,722][175731] Updated weights for policy 0, policy_version 84800 (0.0006) [2023-03-07 11:37:10,517][175731] Updated weights for policy 0, policy_version 84810 (0.0008) [2023-03-07 11:37:11,316][175731] Updated weights for policy 0, policy_version 84820 (0.0007) [2023-03-07 11:37:12,113][175731] Updated weights for policy 0, policy_version 84830 (0.0006) [2023-03-07 11:37:12,931][175731] Updated weights for policy 0, policy_version 84840 (0.0006) [2023-03-07 11:37:13,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12817.1, 300 sec: 12829.5). Total num frames: 86880256. Throughput: 0: 12846.8. Samples: 86854793. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 11:37:13,322][175405] Avg episode reward: [(0, '26.072')] [2023-03-07 11:37:13,716][175731] Updated weights for policy 0, policy_version 84850 (0.0006) [2023-03-07 11:37:14,534][175731] Updated weights for policy 0, policy_version 84860 (0.0006) [2023-03-07 11:37:15,306][175731] Updated weights for policy 0, policy_version 84870 (0.0007) [2023-03-07 11:37:16,099][175731] Updated weights for policy 0, policy_version 84880 (0.0006) [2023-03-07 11:37:16,910][175731] Updated weights for policy 0, policy_version 84890 (0.0007) [2023-03-07 11:37:17,704][175731] Updated weights for policy 0, policy_version 84900 (0.0006) [2023-03-07 11:37:18,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12834.1, 300 sec: 12833.0). Total num frames: 86944768. Throughput: 0: 12847.0. Samples: 86931691. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 11:37:18,321][175405] Avg episode reward: [(0, '25.616')] [2023-03-07 11:37:18,502][175731] Updated weights for policy 0, policy_version 84910 (0.0006) [2023-03-07 11:37:19,310][175731] Updated weights for policy 0, policy_version 84920 (0.0006) [2023-03-07 11:37:20,101][175731] Updated weights for policy 0, policy_version 84930 (0.0007) [2023-03-07 11:37:20,886][175731] Updated weights for policy 0, policy_version 84940 (0.0007) [2023-03-07 11:37:21,699][175731] Updated weights for policy 0, policy_version 84950 (0.0006) [2023-03-07 11:37:22,498][175731] Updated weights for policy 0, policy_version 84960 (0.0006) [2023-03-07 11:37:23,301][175731] Updated weights for policy 0, policy_version 84970 (0.0006) [2023-03-07 11:37:23,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12833.0). Total num frames: 87009280. Throughput: 0: 12833.5. Samples: 87008412. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:37:23,321][175405] Avg episode reward: [(0, '28.409')] [2023-03-07 11:37:24,090][175731] Updated weights for policy 0, policy_version 84980 (0.0007) [2023-03-07 11:37:24,893][175731] Updated weights for policy 0, policy_version 84990 (0.0006) [2023-03-07 11:37:25,687][175731] Updated weights for policy 0, policy_version 85000 (0.0007) [2023-03-07 11:37:26,478][175731] Updated weights for policy 0, policy_version 85010 (0.0006) [2023-03-07 11:37:27,289][175731] Updated weights for policy 0, policy_version 85020 (0.0007) [2023-03-07 11:37:28,074][175731] Updated weights for policy 0, policy_version 85030 (0.0006) [2023-03-07 11:37:28,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12829.5). Total num frames: 87072768. Throughput: 0: 12834.7. Samples: 87047061. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:37:28,322][175405] Avg episode reward: [(0, '27.426')] [2023-03-07 11:37:28,873][175731] Updated weights for policy 0, policy_version 85040 (0.0006) [2023-03-07 11:37:29,693][175731] Updated weights for policy 0, policy_version 85050 (0.0006) [2023-03-07 11:37:30,474][175731] Updated weights for policy 0, policy_version 85060 (0.0007) [2023-03-07 11:37:31,275][175731] Updated weights for policy 0, policy_version 85070 (0.0006) [2023-03-07 11:37:32,070][175731] Updated weights for policy 0, policy_version 85080 (0.0007) [2023-03-07 11:37:32,867][175731] Updated weights for policy 0, policy_version 85090 (0.0006) [2023-03-07 11:37:33,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12834.2, 300 sec: 12833.0). Total num frames: 87137280. Throughput: 0: 12834.2. Samples: 87124136. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:37:33,321][175405] Avg episode reward: [(0, '28.081')] [2023-03-07 11:37:33,667][175731] Updated weights for policy 0, policy_version 85100 (0.0006) [2023-03-07 11:37:34,469][175731] Updated weights for policy 0, policy_version 85110 (0.0007) [2023-03-07 11:37:35,267][175731] Updated weights for policy 0, policy_version 85120 (0.0006) [2023-03-07 11:37:36,061][175731] Updated weights for policy 0, policy_version 85130 (0.0006) [2023-03-07 11:37:36,850][175731] Updated weights for policy 0, policy_version 85140 (0.0006) [2023-03-07 11:37:37,657][175731] Updated weights for policy 0, policy_version 85150 (0.0007) [2023-03-07 11:37:38,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12834.1, 300 sec: 12833.0). Total num frames: 87201792. Throughput: 0: 12833.6. Samples: 87201093. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:37:38,321][175405] Avg episode reward: [(0, '27.280')] [2023-03-07 11:37:38,452][175731] Updated weights for policy 0, policy_version 85160 (0.0006) [2023-03-07 11:37:39,260][175731] Updated weights for policy 0, policy_version 85170 (0.0006) [2023-03-07 11:37:40,058][175731] Updated weights for policy 0, policy_version 85180 (0.0007) [2023-03-07 11:37:40,843][175731] Updated weights for policy 0, policy_version 85190 (0.0006) [2023-03-07 11:37:41,637][175731] Updated weights for policy 0, policy_version 85200 (0.0006) [2023-03-07 11:37:42,441][175731] Updated weights for policy 0, policy_version 85210 (0.0006) [2023-03-07 11:37:43,221][175731] Updated weights for policy 0, policy_version 85220 (0.0006) [2023-03-07 11:37:43,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12834.1, 300 sec: 12836.5). Total num frames: 87266304. Throughput: 0: 12836.2. Samples: 87239752. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:37:43,321][175405] Avg episode reward: [(0, '26.447')] [2023-03-07 11:37:44,028][175731] Updated weights for policy 0, policy_version 85230 (0.0006) [2023-03-07 11:37:44,834][175731] Updated weights for policy 0, policy_version 85240 (0.0006) [2023-03-07 11:37:45,634][175731] Updated weights for policy 0, policy_version 85250 (0.0007) [2023-03-07 11:37:46,408][175731] Updated weights for policy 0, policy_version 85260 (0.0007) [2023-03-07 11:37:47,184][175731] Updated weights for policy 0, policy_version 85270 (0.0006) [2023-03-07 11:37:48,011][175731] Updated weights for policy 0, policy_version 85280 (0.0006) [2023-03-07 11:37:48,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.2, 300 sec: 12833.0). Total num frames: 87329792. Throughput: 0: 12834.9. Samples: 87317060. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:37:48,321][175405] Avg episode reward: [(0, '27.267')] [2023-03-07 11:37:48,327][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000085284_87330816.pth... [2023-03-07 11:37:48,356][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000082276_84250624.pth [2023-03-07 11:37:48,797][175731] Updated weights for policy 0, policy_version 85290 (0.0006) [2023-03-07 11:37:49,596][175731] Updated weights for policy 0, policy_version 85300 (0.0008) [2023-03-07 11:37:50,376][175731] Updated weights for policy 0, policy_version 85310 (0.0007) [2023-03-07 11:37:51,164][175731] Updated weights for policy 0, policy_version 85320 (0.0006) [2023-03-07 11:37:51,950][175731] Updated weights for policy 0, policy_version 85330 (0.0006) [2023-03-07 11:37:52,761][175731] Updated weights for policy 0, policy_version 85340 (0.0006) [2023-03-07 11:37:53,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12834.1, 300 sec: 12836.5). Total num frames: 87394304. Throughput: 0: 12845.8. Samples: 87394172. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:37:53,332][175405] Avg episode reward: [(0, '26.770')] [2023-03-07 11:37:53,566][175731] Updated weights for policy 0, policy_version 85350 (0.0006) [2023-03-07 11:37:54,345][175731] Updated weights for policy 0, policy_version 85360 (0.0006) [2023-03-07 11:37:55,147][175731] Updated weights for policy 0, policy_version 85370 (0.0007) [2023-03-07 11:37:55,962][175731] Updated weights for policy 0, policy_version 85380 (0.0008) [2023-03-07 11:37:56,761][175731] Updated weights for policy 0, policy_version 85390 (0.0005) [2023-03-07 11:37:57,546][175731] Updated weights for policy 0, policy_version 85400 (0.0007) [2023-03-07 11:37:58,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12834.1, 300 sec: 12836.4). Total num frames: 87458816. Throughput: 0: 12841.7. 
Samples: 87432670. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:37:58,332][175405] Avg episode reward: [(0, '26.603')] [2023-03-07 11:37:58,338][175731] Updated weights for policy 0, policy_version 85410 (0.0006) [2023-03-07 11:37:59,151][175731] Updated weights for policy 0, policy_version 85420 (0.0007) [2023-03-07 11:37:59,941][175731] Updated weights for policy 0, policy_version 85430 (0.0006) [2023-03-07 11:38:00,736][175731] Updated weights for policy 0, policy_version 85440 (0.0006) [2023-03-07 11:38:01,531][175731] Updated weights for policy 0, policy_version 85450 (0.0006) [2023-03-07 11:38:02,321][175731] Updated weights for policy 0, policy_version 85460 (0.0006) [2023-03-07 11:38:03,121][175731] Updated weights for policy 0, policy_version 85470 (0.0006) [2023-03-07 11:38:03,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12834.1, 300 sec: 12836.4). Total num frames: 87523328. Throughput: 0: 12850.7. Samples: 87509973. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:38:03,332][175405] Avg episode reward: [(0, '27.652')] [2023-03-07 11:38:03,916][175731] Updated weights for policy 0, policy_version 85480 (0.0006) [2023-03-07 11:38:04,734][175731] Updated weights for policy 0, policy_version 85490 (0.0006) [2023-03-07 11:38:05,518][175731] Updated weights for policy 0, policy_version 85500 (0.0006) [2023-03-07 11:38:06,311][175731] Updated weights for policy 0, policy_version 85510 (0.0007) [2023-03-07 11:38:07,131][175731] Updated weights for policy 0, policy_version 85520 (0.0006) [2023-03-07 11:38:07,917][175731] Updated weights for policy 0, policy_version 85530 (0.0006) [2023-03-07 11:38:08,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12836.4). Total num frames: 87587840. Throughput: 0: 12858.0. Samples: 87587021. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:38:08,332][175405] Avg episode reward: [(0, '26.740')] [2023-03-07 11:38:08,695][175731] Updated weights for policy 0, policy_version 85540 (0.0006) [2023-03-07 11:38:09,502][175731] Updated weights for policy 0, policy_version 85550 (0.0006) [2023-03-07 11:38:10,298][175731] Updated weights for policy 0, policy_version 85560 (0.0006) [2023-03-07 11:38:11,106][175731] Updated weights for policy 0, policy_version 85570 (0.0006) [2023-03-07 11:38:11,901][175731] Updated weights for policy 0, policy_version 85580 (0.0007) [2023-03-07 11:38:12,693][175731] Updated weights for policy 0, policy_version 85590 (0.0007) [2023-03-07 11:38:13,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12851.2, 300 sec: 12836.4). Total num frames: 87651328. Throughput: 0: 12850.3. Samples: 87625327. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:38:13,332][175405] Avg episode reward: [(0, '26.700')] [2023-03-07 11:38:13,486][175731] Updated weights for policy 0, policy_version 85600 (0.0006) [2023-03-07 11:38:14,289][175731] Updated weights for policy 0, policy_version 85610 (0.0006) [2023-03-07 11:38:15,078][175731] Updated weights for policy 0, policy_version 85620 (0.0007) [2023-03-07 11:38:15,865][175731] Updated weights for policy 0, policy_version 85630 (0.0007) [2023-03-07 11:38:16,668][175731] Updated weights for policy 0, policy_version 85640 (0.0006) [2023-03-07 11:38:17,483][175731] Updated weights for policy 0, policy_version 85650 (0.0007) [2023-03-07 11:38:18,275][175731] Updated weights for policy 0, policy_version 85660 (0.0006) [2023-03-07 11:38:18,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12851.2, 300 sec: 12836.4). Total num frames: 87715840. Throughput: 0: 12854.9. Samples: 87702609. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:38:18,332][175405] Avg episode reward: [(0, '26.816')] [2023-03-07 11:38:19,081][175731] Updated weights for policy 0, policy_version 85670 (0.0006) [2023-03-07 11:38:19,875][175731] Updated weights for policy 0, policy_version 85680 (0.0006) [2023-03-07 11:38:20,668][175731] Updated weights for policy 0, policy_version 85690 (0.0007) [2023-03-07 11:38:21,457][175731] Updated weights for policy 0, policy_version 85700 (0.0006) [2023-03-07 11:38:22,263][175731] Updated weights for policy 0, policy_version 85710 (0.0007) [2023-03-07 11:38:23,057][175731] Updated weights for policy 0, policy_version 85720 (0.0007) [2023-03-07 11:38:23,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12851.2, 300 sec: 12836.4). Total num frames: 87780352. Throughput: 0: 12854.5. Samples: 87779545. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:38:23,332][175405] Avg episode reward: [(0, '26.363')] [2023-03-07 11:38:23,861][175731] Updated weights for policy 0, policy_version 85730 (0.0006) [2023-03-07 11:38:24,658][175731] Updated weights for policy 0, policy_version 85740 (0.0006) [2023-03-07 11:38:25,454][175731] Updated weights for policy 0, policy_version 85750 (0.0006) [2023-03-07 11:38:26,261][175731] Updated weights for policy 0, policy_version 85760 (0.0006) [2023-03-07 11:38:27,068][175731] Updated weights for policy 0, policy_version 85770 (0.0007) [2023-03-07 11:38:27,840][175731] Updated weights for policy 0, policy_version 85780 (0.0007) [2023-03-07 11:38:28,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12868.3, 300 sec: 12836.4). Total num frames: 87844864. Throughput: 0: 12854.8. Samples: 87818219. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:38:28,332][175405] Avg episode reward: [(0, '26.467')] [2023-03-07 11:38:28,646][175731] Updated weights for policy 0, policy_version 85790 (0.0006) [2023-03-07 11:38:29,446][175731] Updated weights for policy 0, policy_version 85800 (0.0006) [2023-03-07 11:38:30,239][175731] Updated weights for policy 0, policy_version 85810 (0.0007) [2023-03-07 11:38:31,031][175731] Updated weights for policy 0, policy_version 85820 (0.0006) [2023-03-07 11:38:31,850][175731] Updated weights for policy 0, policy_version 85830 (0.0006) [2023-03-07 11:38:32,660][175731] Updated weights for policy 0, policy_version 85840 (0.0007) [2023-03-07 11:38:33,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12836.4). Total num frames: 87908352. Throughput: 0: 12839.8. Samples: 87894851. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:38:33,321][175405] Avg episode reward: [(0, '24.447')] [2023-03-07 11:38:33,460][175731] Updated weights for policy 0, policy_version 85850 (0.0007) [2023-03-07 11:38:34,250][175731] Updated weights for policy 0, policy_version 85860 (0.0006) [2023-03-07 11:38:35,047][175731] Updated weights for policy 0, policy_version 85870 (0.0006) [2023-03-07 11:38:35,841][175731] Updated weights for policy 0, policy_version 85880 (0.0006) [2023-03-07 11:38:36,639][175731] Updated weights for policy 0, policy_version 85890 (0.0006) [2023-03-07 11:38:37,434][175731] Updated weights for policy 0, policy_version 85900 (0.0007) [2023-03-07 11:38:38,235][175731] Updated weights for policy 0, policy_version 85910 (0.0007) [2023-03-07 11:38:38,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12834.1, 300 sec: 12833.0). Total num frames: 87971840. Throughput: 0: 12839.5. Samples: 87971950. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:38:38,322][175405] Avg episode reward: [(0, '26.184')] [2023-03-07 11:38:39,046][175731] Updated weights for policy 0, policy_version 85920 (0.0006) [2023-03-07 11:38:39,843][175731] Updated weights for policy 0, policy_version 85930 (0.0007) [2023-03-07 11:38:40,633][175731] Updated weights for policy 0, policy_version 85940 (0.0006) [2023-03-07 11:38:41,430][175731] Updated weights for policy 0, policy_version 85950 (0.0006) [2023-03-07 11:38:42,232][175731] Updated weights for policy 0, policy_version 85960 (0.0007) [2023-03-07 11:38:43,038][175731] Updated weights for policy 0, policy_version 85970 (0.0007) [2023-03-07 11:38:43,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12834.1, 300 sec: 12836.4). Total num frames: 88036352. Throughput: 0: 12839.7. Samples: 88010456. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:38:43,322][175405] Avg episode reward: [(0, '26.941')] [2023-03-07 11:38:43,824][175731] Updated weights for policy 0, policy_version 85980 (0.0006) [2023-03-07 11:38:44,630][175731] Updated weights for policy 0, policy_version 85990 (0.0007) [2023-03-07 11:38:45,429][175731] Updated weights for policy 0, policy_version 86000 (0.0006) [2023-03-07 11:38:46,235][175731] Updated weights for policy 0, policy_version 86010 (0.0006) [2023-03-07 11:38:47,027][175731] Updated weights for policy 0, policy_version 86020 (0.0007) [2023-03-07 11:38:47,844][175731] Updated weights for policy 0, policy_version 86030 (0.0007) [2023-03-07 11:38:48,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12833.0). Total num frames: 88100864. Throughput: 0: 12826.6. Samples: 88087172. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:38:48,321][175405] Avg episode reward: [(0, '27.090')] [2023-03-07 11:38:48,638][175731] Updated weights for policy 0, policy_version 86040 (0.0006) [2023-03-07 11:38:49,449][175731] Updated weights for policy 0, policy_version 86050 (0.0007) [2023-03-07 11:38:50,235][175731] Updated weights for policy 0, policy_version 86060 (0.0006) [2023-03-07 11:38:51,050][175731] Updated weights for policy 0, policy_version 86070 (0.0006) [2023-03-07 11:38:51,842][175731] Updated weights for policy 0, policy_version 86080 (0.0006) [2023-03-07 11:38:52,645][175731] Updated weights for policy 0, policy_version 86090 (0.0006) [2023-03-07 11:38:53,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12833.0). Total num frames: 88164352. Throughput: 0: 12819.8. Samples: 88163914. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:38:53,322][175405] Avg episode reward: [(0, '27.213')] [2023-03-07 11:38:53,445][175731] Updated weights for policy 0, policy_version 86100 (0.0006) [2023-03-07 11:38:54,241][175731] Updated weights for policy 0, policy_version 86110 (0.0005) [2023-03-07 11:38:55,042][175731] Updated weights for policy 0, policy_version 86120 (0.0005) [2023-03-07 11:38:55,822][175731] Updated weights for policy 0, policy_version 86130 (0.0007) [2023-03-07 11:38:56,623][175731] Updated weights for policy 0, policy_version 86140 (0.0007) [2023-03-07 11:38:57,408][175731] Updated weights for policy 0, policy_version 86150 (0.0007) [2023-03-07 11:38:58,218][175731] Updated weights for policy 0, policy_version 86160 (0.0006) [2023-03-07 11:38:58,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12834.1, 300 sec: 12833.0). Total num frames: 88228864. Throughput: 0: 12828.8. Samples: 88202620. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:38:58,321][175405] Avg episode reward: [(0, '27.355')] [2023-03-07 11:38:59,003][175731] Updated weights for policy 0, policy_version 86170 (0.0006) [2023-03-07 11:38:59,805][175731] Updated weights for policy 0, policy_version 86180 (0.0006) [2023-03-07 11:39:00,604][175731] Updated weights for policy 0, policy_version 86190 (0.0006) [2023-03-07 11:39:01,385][175731] Updated weights for policy 0, policy_version 86200 (0.0006) [2023-03-07 11:39:02,195][175731] Updated weights for policy 0, policy_version 86210 (0.0006) [2023-03-07 11:39:02,990][175731] Updated weights for policy 0, policy_version 86220 (0.0008) [2023-03-07 11:39:03,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12834.1, 300 sec: 12833.0). Total num frames: 88293376. Throughput: 0: 12822.0. Samples: 88279596. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:39:03,321][175405] Avg episode reward: [(0, '27.266')] [2023-03-07 11:39:03,778][175731] Updated weights for policy 0, policy_version 86230 (0.0006) [2023-03-07 11:39:04,573][175731] Updated weights for policy 0, policy_version 86240 (0.0006) [2023-03-07 11:39:05,392][175731] Updated weights for policy 0, policy_version 86250 (0.0006) [2023-03-07 11:39:06,180][175731] Updated weights for policy 0, policy_version 86260 (0.0006) [2023-03-07 11:39:06,981][175731] Updated weights for policy 0, policy_version 86270 (0.0007) [2023-03-07 11:39:07,794][175731] Updated weights for policy 0, policy_version 86280 (0.0007) [2023-03-07 11:39:08,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12817.1, 300 sec: 12833.0). Total num frames: 88356864. Throughput: 0: 12820.9. Samples: 88356485. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:39:08,322][175405] Avg episode reward: [(0, '26.394')] [2023-03-07 11:39:08,586][175731] Updated weights for policy 0, policy_version 86290 (0.0006) [2023-03-07 11:39:09,386][175731] Updated weights for policy 0, policy_version 86300 (0.0006) [2023-03-07 11:39:10,196][175731] Updated weights for policy 0, policy_version 86310 (0.0006) [2023-03-07 11:39:10,981][175731] Updated weights for policy 0, policy_version 86320 (0.0006) [2023-03-07 11:39:11,770][175731] Updated weights for policy 0, policy_version 86330 (0.0006) [2023-03-07 11:39:12,505][175680] KL-divergence is very high: 12630.8984 [2023-03-07 11:39:12,586][175731] Updated weights for policy 0, policy_version 86340 (0.0006) [2023-03-07 11:39:12,669][175680] KL-divergence is very high: 580616.6250 [2023-03-07 11:39:13,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12834.1, 300 sec: 12833.0). Total num frames: 88421376. Throughput: 0: 12815.7. Samples: 88394925. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:39:13,322][175405] Avg episode reward: [(0, '27.133')] [2023-03-07 11:39:13,397][175731] Updated weights for policy 0, policy_version 86350 (0.0007) [2023-03-07 11:39:14,199][175731] Updated weights for policy 0, policy_version 86360 (0.0006) [2023-03-07 11:39:14,988][175731] Updated weights for policy 0, policy_version 86370 (0.0006) [2023-03-07 11:39:15,766][175731] Updated weights for policy 0, policy_version 86380 (0.0007) [2023-03-07 11:39:16,574][175731] Updated weights for policy 0, policy_version 86390 (0.0006) [2023-03-07 11:39:17,365][175731] Updated weights for policy 0, policy_version 86400 (0.0007) [2023-03-07 11:39:18,167][175731] Updated weights for policy 0, policy_version 86410 (0.0005) [2023-03-07 11:39:18,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12829.5). Total num frames: 88484864. Throughput: 0: 12826.9. Samples: 88472064. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:39:18,321][175405] Avg episode reward: [(0, '25.550')] [2023-03-07 11:39:18,941][175731] Updated weights for policy 0, policy_version 86420 (0.0007) [2023-03-07 11:39:19,738][175731] Updated weights for policy 0, policy_version 86430 (0.0006) [2023-03-07 11:39:20,549][175731] Updated weights for policy 0, policy_version 86440 (0.0007) [2023-03-07 11:39:21,341][175731] Updated weights for policy 0, policy_version 86450 (0.0006) [2023-03-07 11:39:22,132][175731] Updated weights for policy 0, policy_version 86460 (0.0006) [2023-03-07 11:39:22,932][175731] Updated weights for policy 0, policy_version 86470 (0.0007) [2023-03-07 11:39:23,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12833.0). Total num frames: 88549376. Throughput: 0: 12829.5. Samples: 88549277. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:39:23,321][175405] Avg episode reward: [(0, '26.796')] [2023-03-07 11:39:23,714][175731] Updated weights for policy 0, policy_version 86480 (0.0007) [2023-03-07 11:39:24,528][175731] Updated weights for policy 0, policy_version 86490 (0.0006) [2023-03-07 11:39:25,334][175731] Updated weights for policy 0, policy_version 86500 (0.0007) [2023-03-07 11:39:26,128][175731] Updated weights for policy 0, policy_version 86510 (0.0007) [2023-03-07 11:39:26,930][175731] Updated weights for policy 0, policy_version 86520 (0.0006) [2023-03-07 11:39:27,737][175731] Updated weights for policy 0, policy_version 86530 (0.0006) [2023-03-07 11:39:28,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12817.1, 300 sec: 12833.0). Total num frames: 88613888. Throughput: 0: 12825.7. Samples: 88587610. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:39:28,322][175405] Avg episode reward: [(0, '28.843')] [2023-03-07 11:39:28,534][175731] Updated weights for policy 0, policy_version 86540 (0.0007) [2023-03-07 11:39:29,329][175731] Updated weights for policy 0, policy_version 86550 (0.0007) [2023-03-07 11:39:30,138][175731] Updated weights for policy 0, policy_version 86560 (0.0007) [2023-03-07 11:39:30,933][175731] Updated weights for policy 0, policy_version 86570 (0.0007) [2023-03-07 11:39:31,731][175731] Updated weights for policy 0, policy_version 86580 (0.0006) [2023-03-07 11:39:32,529][175731] Updated weights for policy 0, policy_version 86590 (0.0005) [2023-03-07 11:39:33,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12817.0, 300 sec: 12829.5). Total num frames: 88677376. Throughput: 0: 12825.2. Samples: 88664306. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:39:33,322][175405] Avg episode reward: [(0, '25.689')] [2023-03-07 11:39:33,326][175731] Updated weights for policy 0, policy_version 86600 (0.0007) [2023-03-07 11:39:34,127][175731] Updated weights for policy 0, policy_version 86610 (0.0006) [2023-03-07 11:39:34,902][175731] Updated weights for policy 0, policy_version 86620 (0.0006) [2023-03-07 11:39:35,724][175731] Updated weights for policy 0, policy_version 86630 (0.0006) [2023-03-07 11:39:36,528][175731] Updated weights for policy 0, policy_version 86640 (0.0007) [2023-03-07 11:39:37,323][175731] Updated weights for policy 0, policy_version 86650 (0.0006) [2023-03-07 11:39:38,124][175731] Updated weights for policy 0, policy_version 86660 (0.0008) [2023-03-07 11:39:38,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12834.1, 300 sec: 12829.5). Total num frames: 88741888. Throughput: 0: 12828.7. Samples: 88741206. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:39:38,321][175405] Avg episode reward: [(0, '29.401')] [2023-03-07 11:39:38,930][175731] Updated weights for policy 0, policy_version 86670 (0.0006) [2023-03-07 11:39:39,724][175731] Updated weights for policy 0, policy_version 86680 (0.0007) [2023-03-07 11:39:40,521][175731] Updated weights for policy 0, policy_version 86690 (0.0007) [2023-03-07 11:39:41,338][175731] Updated weights for policy 0, policy_version 86700 (0.0006) [2023-03-07 11:39:42,129][175731] Updated weights for policy 0, policy_version 86710 (0.0007) [2023-03-07 11:39:42,926][175731] Updated weights for policy 0, policy_version 86720 (0.0006) [2023-03-07 11:39:43,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12834.2, 300 sec: 12833.0). Total num frames: 88806400. Throughput: 0: 12821.7. Samples: 88779598. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:39:43,332][175405] Avg episode reward: [(0, '28.396')] [2023-03-07 11:39:43,701][175731] Updated weights for policy 0, policy_version 86730 (0.0006) [2023-03-07 11:39:44,505][175731] Updated weights for policy 0, policy_version 86740 (0.0006) [2023-03-07 11:39:45,291][175731] Updated weights for policy 0, policy_version 86750 (0.0006) [2023-03-07 11:39:46,098][175731] Updated weights for policy 0, policy_version 86760 (0.0007) [2023-03-07 11:39:46,880][175731] Updated weights for policy 0, policy_version 86770 (0.0006) [2023-03-07 11:39:47,665][175731] Updated weights for policy 0, policy_version 86780 (0.0006) [2023-03-07 11:39:48,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12817.1, 300 sec: 12829.5). Total num frames: 88869888. Throughput: 0: 12835.5. Samples: 88857195. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:39:48,332][175405] Avg episode reward: [(0, '27.005')] [2023-03-07 11:39:48,336][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000086788_88870912.pth... 
[2023-03-07 11:39:48,368][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000083780_85790720.pth [2023-03-07 11:39:48,477][175731] Updated weights for policy 0, policy_version 86790 (0.0006) [2023-03-07 11:39:49,251][175731] Updated weights for policy 0, policy_version 86800 (0.0007) [2023-03-07 11:39:50,082][175731] Updated weights for policy 0, policy_version 86810 (0.0007) [2023-03-07 11:39:50,864][175731] Updated weights for policy 0, policy_version 86820 (0.0006) [2023-03-07 11:39:51,655][175731] Updated weights for policy 0, policy_version 86830 (0.0006) [2023-03-07 11:39:52,480][175731] Updated weights for policy 0, policy_version 86840 (0.0007) [2023-03-07 11:39:53,263][175731] Updated weights for policy 0, policy_version 86850 (0.0006) [2023-03-07 11:39:53,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12834.1, 300 sec: 12829.5). Total num frames: 88934400. Throughput: 0: 12837.1. Samples: 88934154. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:39:53,332][175405] Avg episode reward: [(0, '25.193')] [2023-03-07 11:39:54,041][175731] Updated weights for policy 0, policy_version 86860 (0.0007) [2023-03-07 11:39:54,859][175731] Updated weights for policy 0, policy_version 86870 (0.0006) [2023-03-07 11:39:55,641][175731] Updated weights for policy 0, policy_version 86880 (0.0007) [2023-03-07 11:39:56,435][175731] Updated weights for policy 0, policy_version 86890 (0.0006) [2023-03-07 11:39:57,245][175731] Updated weights for policy 0, policy_version 86900 (0.0007) [2023-03-07 11:39:58,033][175731] Updated weights for policy 0, policy_version 86910 (0.0007) [2023-03-07 11:39:58,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12834.1, 300 sec: 12829.5). Total num frames: 88998912. Throughput: 0: 12844.5. Samples: 88972927. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:39:58,322][175405] Avg episode reward: [(0, '26.568')] [2023-03-07 11:39:58,839][175731] Updated weights for policy 0, policy_version 86920 (0.0007) [2023-03-07 11:39:59,633][175731] Updated weights for policy 0, policy_version 86930 (0.0008) [2023-03-07 11:40:00,432][175731] Updated weights for policy 0, policy_version 86940 (0.0007) [2023-03-07 11:40:01,230][175731] Updated weights for policy 0, policy_version 86950 (0.0007) [2023-03-07 11:40:02,046][175731] Updated weights for policy 0, policy_version 86960 (0.0007) [2023-03-07 11:40:02,826][175731] Updated weights for policy 0, policy_version 86970 (0.0007) [2023-03-07 11:40:03,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12834.1, 300 sec: 12833.0). Total num frames: 89063424. Throughput: 0: 12837.1. Samples: 89049733. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:40:03,332][175405] Avg episode reward: [(0, '25.952')] [2023-03-07 11:40:03,626][175731] Updated weights for policy 0, policy_version 86980 (0.0007) [2023-03-07 11:40:04,412][175731] Updated weights for policy 0, policy_version 86990 (0.0007) [2023-03-07 11:40:05,225][175731] Updated weights for policy 0, policy_version 87000 (0.0006) [2023-03-07 11:40:06,019][175731] Updated weights for policy 0, policy_version 87010 (0.0007) [2023-03-07 11:40:06,823][175731] Updated weights for policy 0, policy_version 87020 (0.0006) [2023-03-07 11:40:07,621][175731] Updated weights for policy 0, policy_version 87030 (0.0006) [2023-03-07 11:40:08,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12834.1, 300 sec: 12829.5). Total num frames: 89126912. Throughput: 0: 12833.0. Samples: 89126763. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:40:08,332][175405] Avg episode reward: [(0, '27.096')] [2023-03-07 11:40:08,420][175731] Updated weights for policy 0, policy_version 87040 (0.0007) [2023-03-07 11:40:09,230][175731] Updated weights for policy 0, policy_version 87050 (0.0006) [2023-03-07 11:40:10,037][175731] Updated weights for policy 0, policy_version 87060 (0.0006) [2023-03-07 11:40:10,841][175731] Updated weights for policy 0, policy_version 87070 (0.0006) [2023-03-07 11:40:11,642][175731] Updated weights for policy 0, policy_version 87080 (0.0006) [2023-03-07 11:40:12,426][175731] Updated weights for policy 0, policy_version 87090 (0.0007) [2023-03-07 11:40:13,235][175731] Updated weights for policy 0, policy_version 87100 (0.0006) [2023-03-07 11:40:13,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12834.1, 300 sec: 12829.5). Total num frames: 89191424. Throughput: 0: 12829.9. Samples: 89164956. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:40:13,332][175405] Avg episode reward: [(0, '27.152')] [2023-03-07 11:40:14,017][175731] Updated weights for policy 0, policy_version 87110 (0.0007) [2023-03-07 11:40:14,814][175731] Updated weights for policy 0, policy_version 87120 (0.0006) [2023-03-07 11:40:15,631][175731] Updated weights for policy 0, policy_version 87130 (0.0006) [2023-03-07 11:40:16,423][175731] Updated weights for policy 0, policy_version 87140 (0.0006) [2023-03-07 11:40:17,214][175731] Updated weights for policy 0, policy_version 87150 (0.0005) [2023-03-07 11:40:18,006][175731] Updated weights for policy 0, policy_version 87160 (0.0006) [2023-03-07 11:40:18,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12834.1, 300 sec: 12829.5). Total num frames: 89254912. Throughput: 0: 12837.4. Samples: 89241990. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:40:18,332][175405] Avg episode reward: [(0, '26.573')] [2023-03-07 11:40:18,805][175731] Updated weights for policy 0, policy_version 87170 (0.0006) [2023-03-07 11:40:19,604][175731] Updated weights for policy 0, policy_version 87180 (0.0006) [2023-03-07 11:40:20,397][175731] Updated weights for policy 0, policy_version 87190 (0.0007) [2023-03-07 11:40:21,203][175731] Updated weights for policy 0, policy_version 87200 (0.0006) [2023-03-07 11:40:22,000][175731] Updated weights for policy 0, policy_version 87210 (0.0007) [2023-03-07 11:40:22,808][175731] Updated weights for policy 0, policy_version 87220 (0.0006) [2023-03-07 11:40:23,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12829.5). Total num frames: 89319424. Throughput: 0: 12829.7. Samples: 89318544. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:40:23,322][175405] Avg episode reward: [(0, '27.661')] [2023-03-07 11:40:23,604][175731] Updated weights for policy 0, policy_version 87230 (0.0007) [2023-03-07 11:40:24,417][175731] Updated weights for policy 0, policy_version 87240 (0.0006) [2023-03-07 11:40:25,214][175731] Updated weights for policy 0, policy_version 87250 (0.0006) [2023-03-07 11:40:25,999][175731] Updated weights for policy 0, policy_version 87260 (0.0007) [2023-03-07 11:40:26,804][175731] Updated weights for policy 0, policy_version 87270 (0.0007) [2023-03-07 11:40:27,596][175731] Updated weights for policy 0, policy_version 87280 (0.0006) [2023-03-07 11:40:28,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12834.1, 300 sec: 12833.0). Total num frames: 89383936. Throughput: 0: 12835.6. Samples: 89357199. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:40:28,321][175405] Avg episode reward: [(0, '27.260')] [2023-03-07 11:40:28,381][175731] Updated weights for policy 0, policy_version 87290 (0.0006) [2023-03-07 11:40:29,188][175731] Updated weights for policy 0, policy_version 87300 (0.0006) [2023-03-07 11:40:29,994][175731] Updated weights for policy 0, policy_version 87310 (0.0006) [2023-03-07 11:40:30,787][175731] Updated weights for policy 0, policy_version 87320 (0.0006) [2023-03-07 11:40:31,599][175731] Updated weights for policy 0, policy_version 87330 (0.0006) [2023-03-07 11:40:32,397][175731] Updated weights for policy 0, policy_version 87340 (0.0006) [2023-03-07 11:40:33,180][175731] Updated weights for policy 0, policy_version 87350 (0.0006) [2023-03-07 11:40:33,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12834.1, 300 sec: 12829.5). Total num frames: 89447424. Throughput: 0: 12823.1. Samples: 89434233. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:40:33,322][175405] Avg episode reward: [(0, '27.242')] [2023-03-07 11:40:33,974][175731] Updated weights for policy 0, policy_version 87360 (0.0007) [2023-03-07 11:40:34,794][175731] Updated weights for policy 0, policy_version 87370 (0.0007) [2023-03-07 11:40:35,589][175731] Updated weights for policy 0, policy_version 87380 (0.0007) [2023-03-07 11:40:36,371][175731] Updated weights for policy 0, policy_version 87390 (0.0007) [2023-03-07 11:40:37,162][175731] Updated weights for policy 0, policy_version 87400 (0.0007) [2023-03-07 11:40:37,962][175731] Updated weights for policy 0, policy_version 87410 (0.0008) [2023-03-07 11:40:38,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12833.0). Total num frames: 89511936. Throughput: 0: 12827.0. Samples: 89511370. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:40:38,322][175405] Avg episode reward: [(0, '25.966')] [2023-03-07 11:40:38,763][175731] Updated weights for policy 0, policy_version 87420 (0.0007) [2023-03-07 11:40:39,564][175731] Updated weights for policy 0, policy_version 87430 (0.0006) [2023-03-07 11:40:40,350][175731] Updated weights for policy 0, policy_version 87440 (0.0006) [2023-03-07 11:40:41,143][175731] Updated weights for policy 0, policy_version 87450 (0.0006) [2023-03-07 11:40:41,964][175731] Updated weights for policy 0, policy_version 87460 (0.0006) [2023-03-07 11:40:42,746][175731] Updated weights for policy 0, policy_version 87470 (0.0007) [2023-03-07 11:40:43,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12834.1, 300 sec: 12833.0). Total num frames: 89576448. Throughput: 0: 12821.6. Samples: 89549899. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:40:43,321][175405] Avg episode reward: [(0, '26.150')] [2023-03-07 11:40:43,546][175731] Updated weights for policy 0, policy_version 87480 (0.0007) [2023-03-07 11:40:44,332][175731] Updated weights for policy 0, policy_version 87490 (0.0007) [2023-03-07 11:40:45,141][175731] Updated weights for policy 0, policy_version 87500 (0.0006) [2023-03-07 11:40:45,943][175731] Updated weights for policy 0, policy_version 87510 (0.0006) [2023-03-07 11:40:46,750][175731] Updated weights for policy 0, policy_version 87520 (0.0007) [2023-03-07 11:40:47,560][175731] Updated weights for policy 0, policy_version 87530 (0.0006) [2023-03-07 11:40:48,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12834.1, 300 sec: 12829.5). Total num frames: 89639936. Throughput: 0: 12823.4. Samples: 89626785. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:40:48,322][175405] Avg episode reward: [(0, '26.205')] [2023-03-07 11:40:48,351][175731] Updated weights for policy 0, policy_version 87540 (0.0006) [2023-03-07 11:40:49,156][175731] Updated weights for policy 0, policy_version 87550 (0.0007) [2023-03-07 11:40:49,967][175731] Updated weights for policy 0, policy_version 87560 (0.0006) [2023-03-07 11:40:50,757][175731] Updated weights for policy 0, policy_version 87570 (0.0007) [2023-03-07 11:40:51,561][175731] Updated weights for policy 0, policy_version 87580 (0.0006) [2023-03-07 11:40:52,358][175731] Updated weights for policy 0, policy_version 87590 (0.0007) [2023-03-07 11:40:53,147][175731] Updated weights for policy 0, policy_version 87600 (0.0007) [2023-03-07 11:40:53,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12833.0). Total num frames: 89704448. Throughput: 0: 12817.2. Samples: 89703537. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:40:53,321][175405] Avg episode reward: [(0, '26.091')] [2023-03-07 11:40:53,950][175731] Updated weights for policy 0, policy_version 87610 (0.0006) [2023-03-07 11:40:54,757][175731] Updated weights for policy 0, policy_version 87620 (0.0006) [2023-03-07 11:40:55,557][175731] Updated weights for policy 0, policy_version 87630 (0.0007) [2023-03-07 11:40:56,346][175731] Updated weights for policy 0, policy_version 87640 (0.0007) [2023-03-07 11:40:57,137][175731] Updated weights for policy 0, policy_version 87650 (0.0006) [2023-03-07 11:40:57,954][175731] Updated weights for policy 0, policy_version 87660 (0.0006) [2023-03-07 11:40:58,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.0, 300 sec: 12829.5). Total num frames: 89767936. Throughput: 0: 12820.0. Samples: 89741856. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:40:58,322][175405] Avg episode reward: [(0, '27.351')] [2023-03-07 11:40:58,755][175731] Updated weights for policy 0, policy_version 87670 (0.0006) [2023-03-07 11:40:59,555][175731] Updated weights for policy 0, policy_version 87680 (0.0006) [2023-03-07 11:41:00,376][175731] Updated weights for policy 0, policy_version 87690 (0.0007) [2023-03-07 11:41:01,163][175731] Updated weights for policy 0, policy_version 87700 (0.0007) [2023-03-07 11:41:01,959][175731] Updated weights for policy 0, policy_version 87710 (0.0006) [2023-03-07 11:41:02,779][175731] Updated weights for policy 0, policy_version 87720 (0.0006) [2023-03-07 11:41:03,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12800.0, 300 sec: 12829.5). Total num frames: 89831424. Throughput: 0: 12811.6. Samples: 89818513. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:41:03,322][175405] Avg episode reward: [(0, '25.736')] [2023-03-07 11:41:03,567][175731] Updated weights for policy 0, policy_version 87730 (0.0007) [2023-03-07 11:41:04,361][175731] Updated weights for policy 0, policy_version 87740 (0.0007) [2023-03-07 11:41:05,141][175731] Updated weights for policy 0, policy_version 87750 (0.0006) [2023-03-07 11:41:05,943][175731] Updated weights for policy 0, policy_version 87760 (0.0007) [2023-03-07 11:41:06,737][175731] Updated weights for policy 0, policy_version 87770 (0.0007) [2023-03-07 11:41:07,542][175731] Updated weights for policy 0, policy_version 87780 (0.0006) [2023-03-07 11:41:08,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12829.5). Total num frames: 89895936. Throughput: 0: 12821.4. Samples: 89895506. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:41:08,323][175405] Avg episode reward: [(0, '26.519')] [2023-03-07 11:41:08,350][175731] Updated weights for policy 0, policy_version 87790 (0.0006) [2023-03-07 11:41:09,157][175731] Updated weights for policy 0, policy_version 87800 (0.0006) [2023-03-07 11:41:09,948][175731] Updated weights for policy 0, policy_version 87810 (0.0007) [2023-03-07 11:41:10,749][175731] Updated weights for policy 0, policy_version 87820 (0.0007) [2023-03-07 11:41:11,566][175731] Updated weights for policy 0, policy_version 87830 (0.0007) [2023-03-07 11:41:12,355][175731] Updated weights for policy 0, policy_version 87840 (0.0007) [2023-03-07 11:41:13,137][175731] Updated weights for policy 0, policy_version 87850 (0.0006) [2023-03-07 11:41:13,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12817.1, 300 sec: 12833.0). Total num frames: 89960448. Throughput: 0: 12816.3. Samples: 89933932. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:41:13,332][175405] Avg episode reward: [(0, '26.067')] [2023-03-07 11:41:13,957][175731] Updated weights for policy 0, policy_version 87860 (0.0006) [2023-03-07 11:41:14,740][175731] Updated weights for policy 0, policy_version 87870 (0.0007) [2023-03-07 11:41:15,545][175731] Updated weights for policy 0, policy_version 87880 (0.0006) [2023-03-07 11:41:16,335][175731] Updated weights for policy 0, policy_version 87890 (0.0006) [2023-03-07 11:41:17,135][175731] Updated weights for policy 0, policy_version 87900 (0.0006) [2023-03-07 11:41:17,929][175731] Updated weights for policy 0, policy_version 87910 (0.0006) [2023-03-07 11:41:18,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12834.1, 300 sec: 12836.4). Total num frames: 90024960. Throughput: 0: 12815.2. Samples: 90010915. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:41:18,332][175405] Avg episode reward: [(0, '32.101')] [2023-03-07 11:41:18,734][175731] Updated weights for policy 0, policy_version 87920 (0.0006) [2023-03-07 11:41:19,514][175731] Updated weights for policy 0, policy_version 87930 (0.0006) [2023-03-07 11:41:20,325][175731] Updated weights for policy 0, policy_version 87940 (0.0007) [2023-03-07 11:41:21,121][175731] Updated weights for policy 0, policy_version 87950 (0.0007) [2023-03-07 11:41:21,925][175731] Updated weights for policy 0, policy_version 87960 (0.0006) [2023-03-07 11:41:22,701][175731] Updated weights for policy 0, policy_version 87970 (0.0006) [2023-03-07 11:41:23,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12817.1, 300 sec: 12833.0). Total num frames: 90088448. Throughput: 0: 12818.4. Samples: 90088201. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:41:23,332][175405] Avg episode reward: [(0, '26.333')] [2023-03-07 11:41:23,502][175731] Updated weights for policy 0, policy_version 87980 (0.0007) [2023-03-07 11:41:24,308][175731] Updated weights for policy 0, policy_version 87990 (0.0007) [2023-03-07 11:41:25,113][175731] Updated weights for policy 0, policy_version 88000 (0.0006) [2023-03-07 11:41:25,908][175731] Updated weights for policy 0, policy_version 88010 (0.0006) [2023-03-07 11:41:26,715][175731] Updated weights for policy 0, policy_version 88020 (0.0006) [2023-03-07 11:41:27,511][175731] Updated weights for policy 0, policy_version 88030 (0.0006) [2023-03-07 11:41:28,317][175731] Updated weights for policy 0, policy_version 88040 (0.0007) [2023-03-07 11:41:28,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12833.0). Total num frames: 90152960. Throughput: 0: 12811.4. Samples: 90126411. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:41:28,332][175405] Avg episode reward: [(0, '26.376')] [2023-03-07 11:41:29,116][175731] Updated weights for policy 0, policy_version 88050 (0.0006) [2023-03-07 11:41:29,923][175731] Updated weights for policy 0, policy_version 88060 (0.0006) [2023-03-07 11:41:30,710][175731] Updated weights for policy 0, policy_version 88070 (0.0005) [2023-03-07 11:41:31,509][175731] Updated weights for policy 0, policy_version 88080 (0.0007) [2023-03-07 11:41:32,322][175731] Updated weights for policy 0, policy_version 88090 (0.0007) [2023-03-07 11:41:33,109][175731] Updated weights for policy 0, policy_version 88100 (0.0006) [2023-03-07 11:41:33,321][175405] Fps is (10 sec: 12800.2, 60 sec: 12817.1, 300 sec: 12829.5). Total num frames: 90216448. Throughput: 0: 12814.5. Samples: 90203435. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:41:33,332][175405] Avg episode reward: [(0, '25.873')] [2023-03-07 11:41:33,928][175731] Updated weights for policy 0, policy_version 88110 (0.0006) [2023-03-07 11:41:34,705][175731] Updated weights for policy 0, policy_version 88120 (0.0006) [2023-03-07 11:41:35,494][175731] Updated weights for policy 0, policy_version 88130 (0.0007) [2023-03-07 11:41:36,302][175731] Updated weights for policy 0, policy_version 88140 (0.0006) [2023-03-07 11:41:37,106][175731] Updated weights for policy 0, policy_version 88150 (0.0007) [2023-03-07 11:41:37,888][175731] Updated weights for policy 0, policy_version 88160 (0.0006) [2023-03-07 11:41:38,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12817.0, 300 sec: 12829.5). Total num frames: 90280960. Throughput: 0: 12817.4. Samples: 90280322. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:41:38,333][175405] Avg episode reward: [(0, '28.202')] [2023-03-07 11:41:38,693][175731] Updated weights for policy 0, policy_version 88170 (0.0006) [2023-03-07 11:41:39,491][175731] Updated weights for policy 0, policy_version 88180 (0.0006) [2023-03-07 11:41:40,287][175731] Updated weights for policy 0, policy_version 88190 (0.0007) [2023-03-07 11:41:41,095][175731] Updated weights for policy 0, policy_version 88200 (0.0007) [2023-03-07 11:41:41,894][175731] Updated weights for policy 0, policy_version 88210 (0.0006) [2023-03-07 11:41:42,692][175731] Updated weights for policy 0, policy_version 88220 (0.0006) [2023-03-07 11:41:43,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12800.0, 300 sec: 12829.5). Total num frames: 90344448. Throughput: 0: 12820.1. Samples: 90318760. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:41:43,332][175405] Avg episode reward: [(0, '26.724')] [2023-03-07 11:41:43,513][175731] Updated weights for policy 0, policy_version 88230 (0.0007) [2023-03-07 11:41:44,314][175731] Updated weights for policy 0, policy_version 88240 (0.0007) [2023-03-07 11:41:45,106][175731] Updated weights for policy 0, policy_version 88250 (0.0006) [2023-03-07 11:41:45,913][175731] Updated weights for policy 0, policy_version 88260 (0.0006) [2023-03-07 11:41:46,717][175731] Updated weights for policy 0, policy_version 88270 (0.0007) [2023-03-07 11:41:47,506][175731] Updated weights for policy 0, policy_version 88280 (0.0007) [2023-03-07 11:41:48,309][175731] Updated weights for policy 0, policy_version 88290 (0.0006) [2023-03-07 11:41:48,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12829.5). Total num frames: 90408960. Throughput: 0: 12817.9. Samples: 90395319. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:41:48,332][175405] Avg episode reward: [(0, '26.723')] [2023-03-07 11:41:48,337][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000088290_90408960.pth... [2023-03-07 11:41:48,365][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000085284_87330816.pth [2023-03-07 11:41:49,092][175731] Updated weights for policy 0, policy_version 88300 (0.0006) [2023-03-07 11:41:49,897][175731] Updated weights for policy 0, policy_version 88310 (0.0006) [2023-03-07 11:41:50,713][175731] Updated weights for policy 0, policy_version 88320 (0.0005) [2023-03-07 11:41:51,503][175731] Updated weights for policy 0, policy_version 88330 (0.0006) [2023-03-07 11:41:52,301][175731] Updated weights for policy 0, policy_version 88340 (0.0006) [2023-03-07 11:41:53,102][175731] Updated weights for policy 0, policy_version 88350 (0.0006) [2023-03-07 11:41:53,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12800.0, 300 sec: 12826.0). Total num frames: 90472448. Throughput: 0: 12814.2. Samples: 90472146. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:41:53,332][175405] Avg episode reward: [(0, '27.635')] [2023-03-07 11:41:53,903][175731] Updated weights for policy 0, policy_version 88360 (0.0006) [2023-03-07 11:41:54,704][175731] Updated weights for policy 0, policy_version 88370 (0.0007) [2023-03-07 11:41:55,503][175731] Updated weights for policy 0, policy_version 88380 (0.0006) [2023-03-07 11:41:56,288][175731] Updated weights for policy 0, policy_version 88390 (0.0007) [2023-03-07 11:41:57,086][175731] Updated weights for policy 0, policy_version 88400 (0.0006) [2023-03-07 11:41:57,873][175731] Updated weights for policy 0, policy_version 88410 (0.0007) [2023-03-07 11:41:58,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12826.0). Total num frames: 90536960. Throughput: 0: 12815.6. 
Samples: 90510636. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:41:58,332][175405] Avg episode reward: [(0, '26.831')] [2023-03-07 11:41:58,662][175731] Updated weights for policy 0, policy_version 88420 (0.0007) [2023-03-07 11:41:59,470][175731] Updated weights for policy 0, policy_version 88430 (0.0007) [2023-03-07 11:42:00,272][175731] Updated weights for policy 0, policy_version 88440 (0.0006) [2023-03-07 11:42:01,049][175731] Updated weights for policy 0, policy_version 88450 (0.0006) [2023-03-07 11:42:01,861][175731] Updated weights for policy 0, policy_version 88460 (0.0007) [2023-03-07 11:42:02,654][175731] Updated weights for policy 0, policy_version 88470 (0.0006) [2023-03-07 11:42:03,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12834.1, 300 sec: 12829.5). Total num frames: 90601472. Throughput: 0: 12822.9. Samples: 90587947. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:42:03,332][175405] Avg episode reward: [(0, '27.096')] [2023-03-07 11:42:03,453][175731] Updated weights for policy 0, policy_version 88480 (0.0007) [2023-03-07 11:42:04,239][175731] Updated weights for policy 0, policy_version 88490 (0.0006) [2023-03-07 11:42:05,022][175731] Updated weights for policy 0, policy_version 88500 (0.0006) [2023-03-07 11:42:05,841][175731] Updated weights for policy 0, policy_version 88510 (0.0006) [2023-03-07 11:42:06,630][175731] Updated weights for policy 0, policy_version 88520 (0.0006) [2023-03-07 11:42:07,407][175731] Updated weights for policy 0, policy_version 88530 (0.0006) [2023-03-07 11:42:08,201][175731] Updated weights for policy 0, policy_version 88540 (0.0006) [2023-03-07 11:42:08,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12834.1, 300 sec: 12833.0). Total num frames: 90665984. Throughput: 0: 12828.0. Samples: 90665459. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:42:08,332][175405] Avg episode reward: [(0, '26.489')] [2023-03-07 11:42:09,000][175731] Updated weights for policy 0, policy_version 88550 (0.0005) [2023-03-07 11:42:09,797][175731] Updated weights for policy 0, policy_version 88560 (0.0006) [2023-03-07 11:42:10,578][175731] Updated weights for policy 0, policy_version 88570 (0.0006) [2023-03-07 11:42:11,390][175731] Updated weights for policy 0, policy_version 88580 (0.0006) [2023-03-07 11:42:12,180][175731] Updated weights for policy 0, policy_version 88590 (0.0006) [2023-03-07 11:42:12,993][175731] Updated weights for policy 0, policy_version 88600 (0.0007) [2023-03-07 11:42:13,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12829.5). Total num frames: 90729472. Throughput: 0: 12832.9. Samples: 90703894. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:42:13,322][175405] Avg episode reward: [(0, '25.214')] [2023-03-07 11:42:13,798][175731] Updated weights for policy 0, policy_version 88610 (0.0006) [2023-03-07 11:42:14,609][175731] Updated weights for policy 0, policy_version 88620 (0.0006) [2023-03-07 11:42:15,414][175731] Updated weights for policy 0, policy_version 88630 (0.0006) [2023-03-07 11:42:16,211][175731] Updated weights for policy 0, policy_version 88640 (0.0006) [2023-03-07 11:42:17,002][175731] Updated weights for policy 0, policy_version 88650 (0.0006) [2023-03-07 11:42:17,818][175731] Updated weights for policy 0, policy_version 88660 (0.0006) [2023-03-07 11:42:18,321][175405] Fps is (10 sec: 12800.2, 60 sec: 12817.1, 300 sec: 12829.5). Total num frames: 90793984. Throughput: 0: 12828.5. Samples: 90780719. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:42:18,321][175405] Avg episode reward: [(0, '26.128')] [2023-03-07 11:42:18,605][175731] Updated weights for policy 0, policy_version 88670 (0.0007) [2023-03-07 11:42:19,384][175731] Updated weights for policy 0, policy_version 88680 (0.0006) [2023-03-07 11:42:20,171][175731] Updated weights for policy 0, policy_version 88690 (0.0006) [2023-03-07 11:42:20,979][175731] Updated weights for policy 0, policy_version 88700 (0.0006) [2023-03-07 11:42:21,771][175731] Updated weights for policy 0, policy_version 88710 (0.0007) [2023-03-07 11:42:22,573][175731] Updated weights for policy 0, policy_version 88720 (0.0006) [2023-03-07 11:42:23,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12834.2, 300 sec: 12833.0). Total num frames: 90858496. Throughput: 0: 12836.3. Samples: 90857955. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:42:23,321][175405] Avg episode reward: [(0, '26.860')] [2023-03-07 11:42:23,349][175731] Updated weights for policy 0, policy_version 88730 (0.0007) [2023-03-07 11:42:24,148][175731] Updated weights for policy 0, policy_version 88740 (0.0006) [2023-03-07 11:42:24,977][175731] Updated weights for policy 0, policy_version 88750 (0.0007) [2023-03-07 11:42:25,758][175731] Updated weights for policy 0, policy_version 88760 (0.0007) [2023-03-07 11:42:26,553][175731] Updated weights for policy 0, policy_version 88770 (0.0006) [2023-03-07 11:42:27,351][175731] Updated weights for policy 0, policy_version 88780 (0.0007) [2023-03-07 11:42:28,157][175731] Updated weights for policy 0, policy_version 88790 (0.0008) [2023-03-07 11:42:28,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12834.1, 300 sec: 12833.0). Total num frames: 90923008. Throughput: 0: 12836.1. Samples: 90896385. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:42:28,322][175405] Avg episode reward: [(0, '26.256')] [2023-03-07 11:42:28,958][175731] Updated weights for policy 0, policy_version 88800 (0.0006) [2023-03-07 11:42:29,763][175731] Updated weights for policy 0, policy_version 88810 (0.0006) [2023-03-07 11:42:30,566][175731] Updated weights for policy 0, policy_version 88820 (0.0006) [2023-03-07 11:42:31,348][175731] Updated weights for policy 0, policy_version 88830 (0.0007) [2023-03-07 11:42:32,158][175731] Updated weights for policy 0, policy_version 88840 (0.0007) [2023-03-07 11:42:32,946][175731] Updated weights for policy 0, policy_version 88850 (0.0006) [2023-03-07 11:42:33,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12834.1, 300 sec: 12829.5). Total num frames: 90986496. Throughput: 0: 12844.5. Samples: 90973320. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:42:33,322][175405] Avg episode reward: [(0, '26.430')] [2023-03-07 11:42:33,729][175731] Updated weights for policy 0, policy_version 88860 (0.0006) [2023-03-07 11:42:34,563][175731] Updated weights for policy 0, policy_version 88870 (0.0006) [2023-03-07 11:42:35,352][175731] Updated weights for policy 0, policy_version 88880 (0.0007) [2023-03-07 11:42:36,136][175731] Updated weights for policy 0, policy_version 88890 (0.0006) [2023-03-07 11:42:36,949][175731] Updated weights for policy 0, policy_version 88900 (0.0007) [2023-03-07 11:42:37,761][175731] Updated weights for policy 0, policy_version 88910 (0.0007) [2023-03-07 11:42:38,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.2, 300 sec: 12829.5). Total num frames: 91051008. Throughput: 0: 12844.7. Samples: 91050159. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:42:38,322][175405] Avg episode reward: [(0, '26.403')] [2023-03-07 11:42:38,564][175731] Updated weights for policy 0, policy_version 88920 (0.0007) [2023-03-07 11:42:39,340][175731] Updated weights for policy 0, policy_version 88930 (0.0006) [2023-03-07 11:42:40,148][175731] Updated weights for policy 0, policy_version 88940 (0.0006) [2023-03-07 11:42:40,954][175731] Updated weights for policy 0, policy_version 88950 (0.0008) [2023-03-07 11:42:41,739][175731] Updated weights for policy 0, policy_version 88960 (0.0007) [2023-03-07 11:42:42,540][175731] Updated weights for policy 0, policy_version 88970 (0.0006) [2023-03-07 11:42:43,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12834.2, 300 sec: 12829.5). Total num frames: 91114496. Throughput: 0: 12842.5. Samples: 91088549. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:42:43,322][175405] Avg episode reward: [(0, '27.445')] [2023-03-07 11:42:43,342][175731] Updated weights for policy 0, policy_version 88980 (0.0007) [2023-03-07 11:42:44,138][175731] Updated weights for policy 0, policy_version 88990 (0.0006) [2023-03-07 11:42:44,952][175731] Updated weights for policy 0, policy_version 89000 (0.0005) [2023-03-07 11:42:45,730][175731] Updated weights for policy 0, policy_version 89010 (0.0006) [2023-03-07 11:42:46,525][175731] Updated weights for policy 0, policy_version 89020 (0.0006) [2023-03-07 11:42:47,351][175731] Updated weights for policy 0, policy_version 89030 (0.0007) [2023-03-07 11:42:48,146][175731] Updated weights for policy 0, policy_version 89040 (0.0006) [2023-03-07 11:42:48,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12829.5). Total num frames: 91179008. Throughput: 0: 12833.4. Samples: 91165448. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:42:48,322][175405] Avg episode reward: [(0, '26.060')] [2023-03-07 11:42:48,941][175731] Updated weights for policy 0, policy_version 89050 (0.0006) [2023-03-07 11:42:49,741][175731] Updated weights for policy 0, policy_version 89060 (0.0006) [2023-03-07 11:42:50,545][175731] Updated weights for policy 0, policy_version 89070 (0.0007) [2023-03-07 11:42:51,337][175731] Updated weights for policy 0, policy_version 89080 (0.0007) [2023-03-07 11:42:52,127][175731] Updated weights for policy 0, policy_version 89090 (0.0006) [2023-03-07 11:42:52,943][175731] Updated weights for policy 0, policy_version 89100 (0.0006) [2023-03-07 11:42:53,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12834.1, 300 sec: 12826.0). Total num frames: 91242496. Throughput: 0: 12818.1. Samples: 91242273. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:42:53,321][175405] Avg episode reward: [(0, '27.283')] [2023-03-07 11:42:53,738][175731] Updated weights for policy 0, policy_version 89110 (0.0007) [2023-03-07 11:42:54,553][175731] Updated weights for policy 0, policy_version 89120 (0.0006) [2023-03-07 11:42:55,348][175731] Updated weights for policy 0, policy_version 89130 (0.0006) [2023-03-07 11:42:56,165][175731] Updated weights for policy 0, policy_version 89140 (0.0006) [2023-03-07 11:42:56,951][175731] Updated weights for policy 0, policy_version 89150 (0.0006) [2023-03-07 11:42:57,742][175731] Updated weights for policy 0, policy_version 89160 (0.0007) [2023-03-07 11:42:58,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12826.0). Total num frames: 91307008. Throughput: 0: 12814.1. Samples: 91280529. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:42:58,322][175405] Avg episode reward: [(0, '27.149')] [2023-03-07 11:42:58,540][175731] Updated weights for policy 0, policy_version 89170 (0.0006) [2023-03-07 11:42:59,343][175731] Updated weights for policy 0, policy_version 89180 (0.0006) [2023-03-07 11:43:00,138][175731] Updated weights for policy 0, policy_version 89190 (0.0007) [2023-03-07 11:43:00,955][175731] Updated weights for policy 0, policy_version 89200 (0.0006) [2023-03-07 11:43:01,752][175731] Updated weights for policy 0, policy_version 89210 (0.0006) [2023-03-07 11:43:02,556][175731] Updated weights for policy 0, policy_version 89220 (0.0007) [2023-03-07 11:43:03,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12817.1, 300 sec: 12822.6). Total num frames: 91370496. Throughput: 0: 12812.4. Samples: 91357276. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:43:03,322][175405] Avg episode reward: [(0, '26.932')] [2023-03-07 11:43:03,369][175731] Updated weights for policy 0, policy_version 89230 (0.0006) [2023-03-07 11:43:04,167][175731] Updated weights for policy 0, policy_version 89240 (0.0007) [2023-03-07 11:43:04,971][175731] Updated weights for policy 0, policy_version 89250 (0.0006) [2023-03-07 11:43:05,774][175731] Updated weights for policy 0, policy_version 89260 (0.0006) [2023-03-07 11:43:06,570][175731] Updated weights for policy 0, policy_version 89270 (0.0006) [2023-03-07 11:43:07,357][175731] Updated weights for policy 0, policy_version 89280 (0.0006) [2023-03-07 11:43:08,148][175731] Updated weights for policy 0, policy_version 89290 (0.0006) [2023-03-07 11:43:08,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12826.0). Total num frames: 91435008. Throughput: 0: 12803.5. Samples: 91434112. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:43:08,322][175405] Avg episode reward: [(0, '26.551')] [2023-03-07 11:43:08,943][175731] Updated weights for policy 0, policy_version 89300 (0.0007) [2023-03-07 11:43:09,732][175731] Updated weights for policy 0, policy_version 89310 (0.0007) [2023-03-07 11:43:10,527][175731] Updated weights for policy 0, policy_version 89320 (0.0007) [2023-03-07 11:43:11,337][175731] Updated weights for policy 0, policy_version 89330 (0.0007) [2023-03-07 11:43:12,128][175731] Updated weights for policy 0, policy_version 89340 (0.0006) [2023-03-07 11:43:12,931][175731] Updated weights for policy 0, policy_version 89350 (0.0007) [2023-03-07 11:43:13,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12822.6). Total num frames: 91498496. Throughput: 0: 12808.1. Samples: 91472749. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:43:13,322][175405] Avg episode reward: [(0, '26.547')] [2023-03-07 11:43:13,742][175731] Updated weights for policy 0, policy_version 89360 (0.0006) [2023-03-07 11:43:14,545][175731] Updated weights for policy 0, policy_version 89370 (0.0006) [2023-03-07 11:43:15,349][175731] Updated weights for policy 0, policy_version 89380 (0.0007) [2023-03-07 11:43:16,150][175731] Updated weights for policy 0, policy_version 89390 (0.0007) [2023-03-07 11:43:16,954][175731] Updated weights for policy 0, policy_version 89400 (0.0007) [2023-03-07 11:43:17,743][175731] Updated weights for policy 0, policy_version 89410 (0.0005) [2023-03-07 11:43:18,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12822.6). Total num frames: 91563008. Throughput: 0: 12801.7. Samples: 91549394. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:43:18,322][175405] Avg episode reward: [(0, '27.147')] [2023-03-07 11:43:18,526][175731] Updated weights for policy 0, policy_version 89420 (0.0005) [2023-03-07 11:43:19,319][175731] Updated weights for policy 0, policy_version 89430 (0.0006) [2023-03-07 11:43:20,131][175731] Updated weights for policy 0, policy_version 89440 (0.0007) [2023-03-07 11:43:20,940][175731] Updated weights for policy 0, policy_version 89450 (0.0007) [2023-03-07 11:43:21,742][175731] Updated weights for policy 0, policy_version 89460 (0.0006) [2023-03-07 11:43:22,547][175731] Updated weights for policy 0, policy_version 89470 (0.0006) [2023-03-07 11:43:23,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12819.1). Total num frames: 91626496. Throughput: 0: 12801.4. Samples: 91626222. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:43:23,322][175405] Avg episode reward: [(0, '26.636')] [2023-03-07 11:43:23,351][175731] Updated weights for policy 0, policy_version 89480 (0.0006) [2023-03-07 11:43:24,145][175731] Updated weights for policy 0, policy_version 89490 (0.0006) [2023-03-07 11:43:24,947][175731] Updated weights for policy 0, policy_version 89500 (0.0006) [2023-03-07 11:43:25,759][175731] Updated weights for policy 0, policy_version 89510 (0.0006) [2023-03-07 11:43:26,553][175731] Updated weights for policy 0, policy_version 89520 (0.0006) [2023-03-07 11:43:27,354][175731] Updated weights for policy 0, policy_version 89530 (0.0006) [2023-03-07 11:43:28,130][175731] Updated weights for policy 0, policy_version 89540 (0.0007) [2023-03-07 11:43:28,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12822.6). Total num frames: 91691008. Throughput: 0: 12800.3. Samples: 91664561. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:43:28,322][175405] Avg episode reward: [(0, '27.405')] [2023-03-07 11:43:28,927][175731] Updated weights for policy 0, policy_version 89550 (0.0007) [2023-03-07 11:43:29,747][175731] Updated weights for policy 0, policy_version 89560 (0.0007) [2023-03-07 11:43:30,551][175731] Updated weights for policy 0, policy_version 89570 (0.0006) [2023-03-07 11:43:31,337][175731] Updated weights for policy 0, policy_version 89580 (0.0006) [2023-03-07 11:43:32,132][175731] Updated weights for policy 0, policy_version 89590 (0.0007) [2023-03-07 11:43:32,927][175731] Updated weights for policy 0, policy_version 89600 (0.0006) [2023-03-07 11:43:33,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12822.6). Total num frames: 91754496. Throughput: 0: 12802.3. Samples: 91741549. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:43:33,322][175405] Avg episode reward: [(0, '28.421')] [2023-03-07 11:43:33,733][175731] Updated weights for policy 0, policy_version 89610 (0.0006) [2023-03-07 11:43:34,531][175731] Updated weights for policy 0, policy_version 89620 (0.0006) [2023-03-07 11:43:35,320][175731] Updated weights for policy 0, policy_version 89630 (0.0006) [2023-03-07 11:43:36,118][175731] Updated weights for policy 0, policy_version 89640 (0.0007) [2023-03-07 11:43:36,927][175731] Updated weights for policy 0, policy_version 89650 (0.0007) [2023-03-07 11:43:37,724][175731] Updated weights for policy 0, policy_version 89660 (0.0006) [2023-03-07 11:43:38,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12800.0, 300 sec: 12822.6). Total num frames: 91819008. Throughput: 0: 12805.1. Samples: 91818501. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:43:38,321][175405] Avg episode reward: [(0, '27.159')] [2023-03-07 11:43:38,519][175731] Updated weights for policy 0, policy_version 89670 (0.0005) [2023-03-07 11:43:39,315][175731] Updated weights for policy 0, policy_version 89680 (0.0006) [2023-03-07 11:43:40,081][175731] Updated weights for policy 0, policy_version 89690 (0.0006) [2023-03-07 11:43:40,889][175731] Updated weights for policy 0, policy_version 89700 (0.0006) [2023-03-07 11:43:41,714][175731] Updated weights for policy 0, policy_version 89710 (0.0008) [2023-03-07 11:43:42,491][175731] Updated weights for policy 0, policy_version 89720 (0.0006) [2023-03-07 11:43:43,317][175731] Updated weights for policy 0, policy_version 89730 (0.0007) [2023-03-07 11:43:43,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12817.1, 300 sec: 12822.6). Total num frames: 91883520. Throughput: 0: 12813.8. Samples: 91857150. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:43:43,328][175405] Avg episode reward: [(0, '26.284')] [2023-03-07 11:43:44,110][175731] Updated weights for policy 0, policy_version 89740 (0.0006) [2023-03-07 11:43:44,905][175731] Updated weights for policy 0, policy_version 89750 (0.0007) [2023-03-07 11:43:45,699][175731] Updated weights for policy 0, policy_version 89760 (0.0006) [2023-03-07 11:43:46,490][175731] Updated weights for policy 0, policy_version 89770 (0.0006) [2023-03-07 11:43:47,313][175731] Updated weights for policy 0, policy_version 89780 (0.0006) [2023-03-07 11:43:48,109][175731] Updated weights for policy 0, policy_version 89790 (0.0006) [2023-03-07 11:43:48,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12800.0, 300 sec: 12822.6). Total num frames: 91947008. Throughput: 0: 12815.5. Samples: 91933974. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:43:48,332][175405] Avg episode reward: [(0, '29.033')] [2023-03-07 11:43:48,340][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000089793_91948032.pth... [2023-03-07 11:43:48,369][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000086788_88870912.pth [2023-03-07 11:43:48,912][175731] Updated weights for policy 0, policy_version 89800 (0.0008) [2023-03-07 11:43:49,709][175731] Updated weights for policy 0, policy_version 89810 (0.0007) [2023-03-07 11:43:50,516][175731] Updated weights for policy 0, policy_version 89820 (0.0006) [2023-03-07 11:43:51,315][175731] Updated weights for policy 0, policy_version 89830 (0.0006) [2023-03-07 11:43:52,103][175731] Updated weights for policy 0, policy_version 89840 (0.0006) [2023-03-07 11:43:52,904][175731] Updated weights for policy 0, policy_version 89850 (0.0007) [2023-03-07 11:43:53,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12817.1, 300 sec: 12822.6). Total num frames: 92011520. Throughput: 0: 12814.2. Samples: 92010751. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:43:53,332][175405] Avg episode reward: [(0, '28.128')] [2023-03-07 11:43:53,698][175731] Updated weights for policy 0, policy_version 89860 (0.0006) [2023-03-07 11:43:54,482][175731] Updated weights for policy 0, policy_version 89870 (0.0006) [2023-03-07 11:43:55,287][175731] Updated weights for policy 0, policy_version 89880 (0.0006) [2023-03-07 11:43:56,074][175731] Updated weights for policy 0, policy_version 89890 (0.0006) [2023-03-07 11:43:56,873][175731] Updated weights for policy 0, policy_version 89900 (0.0007) [2023-03-07 11:43:57,659][175731] Updated weights for policy 0, policy_version 89910 (0.0007) [2023-03-07 11:43:58,321][175405] Fps is (10 sec: 12902.6, 60 sec: 12817.1, 300 sec: 12822.6). Total num frames: 92076032. Throughput: 0: 12813.7. 
Samples: 92049364. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:43:58,332][175405] Avg episode reward: [(0, '26.531')] [2023-03-07 11:43:58,442][175731] Updated weights for policy 0, policy_version 89920 (0.0005) [2023-03-07 11:43:59,259][175731] Updated weights for policy 0, policy_version 89930 (0.0006) [2023-03-07 11:44:00,065][175731] Updated weights for policy 0, policy_version 89940 (0.0006) [2023-03-07 11:44:00,854][175731] Updated weights for policy 0, policy_version 89950 (0.0005) [2023-03-07 11:44:01,645][175731] Updated weights for policy 0, policy_version 89960 (0.0006) [2023-03-07 11:44:02,441][175731] Updated weights for policy 0, policy_version 89970 (0.0006) [2023-03-07 11:44:03,230][175731] Updated weights for policy 0, policy_version 89980 (0.0006) [2023-03-07 11:44:03,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12834.1, 300 sec: 12826.0). Total num frames: 92140544. Throughput: 0: 12827.4. Samples: 92126626. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:44:03,322][175405] Avg episode reward: [(0, '26.767')] [2023-03-07 11:44:04,026][175731] Updated weights for policy 0, policy_version 89990 (0.0005) [2023-03-07 11:44:04,823][175731] Updated weights for policy 0, policy_version 90000 (0.0007) [2023-03-07 11:44:05,636][175731] Updated weights for policy 0, policy_version 90010 (0.0006) [2023-03-07 11:44:06,430][175731] Updated weights for policy 0, policy_version 90020 (0.0006) [2023-03-07 11:44:07,229][175731] Updated weights for policy 0, policy_version 90030 (0.0006) [2023-03-07 11:44:08,027][175731] Updated weights for policy 0, policy_version 90040 (0.0006) [2023-03-07 11:44:08,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12817.0, 300 sec: 12822.6). Total num frames: 92204032. Throughput: 0: 12836.2. Samples: 92203853. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:44:08,322][175405] Avg episode reward: [(0, '27.386')] [2023-03-07 11:44:08,813][175731] Updated weights for policy 0, policy_version 90050 (0.0006) [2023-03-07 11:44:09,595][175731] Updated weights for policy 0, policy_version 90060 (0.0006) [2023-03-07 11:44:10,391][175731] Updated weights for policy 0, policy_version 90070 (0.0006) [2023-03-07 11:44:11,166][175731] Updated weights for policy 0, policy_version 90080 (0.0006) [2023-03-07 11:44:11,975][175731] Updated weights for policy 0, policy_version 90090 (0.0007) [2023-03-07 11:44:12,781][175731] Updated weights for policy 0, policy_version 90100 (0.0007) [2023-03-07 11:44:13,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12834.1, 300 sec: 12826.0). Total num frames: 92268544. Throughput: 0: 12847.9. Samples: 92242717. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:44:13,321][175405] Avg episode reward: [(0, '27.569')] [2023-03-07 11:44:13,564][175731] Updated weights for policy 0, policy_version 90110 (0.0007) [2023-03-07 11:44:14,367][175731] Updated weights for policy 0, policy_version 90120 (0.0006) [2023-03-07 11:44:15,160][175731] Updated weights for policy 0, policy_version 90130 (0.0008) [2023-03-07 11:44:15,958][175731] Updated weights for policy 0, policy_version 90140 (0.0007) [2023-03-07 11:44:16,768][175731] Updated weights for policy 0, policy_version 90150 (0.0007) [2023-03-07 11:44:17,549][175731] Updated weights for policy 0, policy_version 90160 (0.0006) [2023-03-07 11:44:18,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12834.1, 300 sec: 12826.0). Total num frames: 92333056. Throughput: 0: 12851.8. Samples: 92319882. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:44:18,322][175405] Avg episode reward: [(0, '25.623')] [2023-03-07 11:44:18,361][175731] Updated weights for policy 0, policy_version 90170 (0.0006) [2023-03-07 11:44:19,146][175731] Updated weights for policy 0, policy_version 90180 (0.0007) [2023-03-07 11:44:19,957][175731] Updated weights for policy 0, policy_version 90190 (0.0006) [2023-03-07 11:44:20,779][175731] Updated weights for policy 0, policy_version 90200 (0.0006) [2023-03-07 11:44:21,559][175731] Updated weights for policy 0, policy_version 90210 (0.0006) [2023-03-07 11:44:22,343][175731] Updated weights for policy 0, policy_version 90220 (0.0006) [2023-03-07 11:44:23,156][175731] Updated weights for policy 0, policy_version 90230 (0.0006) [2023-03-07 11:44:23,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12851.2, 300 sec: 12826.0). Total num frames: 92397568. Throughput: 0: 12849.5. Samples: 92396727. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:44:23,321][175405] Avg episode reward: [(0, '27.092')] [2023-03-07 11:44:23,955][175731] Updated weights for policy 0, policy_version 90240 (0.0007) [2023-03-07 11:44:24,735][175731] Updated weights for policy 0, policy_version 90250 (0.0007) [2023-03-07 11:44:25,539][175731] Updated weights for policy 0, policy_version 90260 (0.0006) [2023-03-07 11:44:26,346][175731] Updated weights for policy 0, policy_version 90270 (0.0007) [2023-03-07 11:44:27,145][175731] Updated weights for policy 0, policy_version 90280 (0.0006) [2023-03-07 11:44:27,953][175731] Updated weights for policy 0, policy_version 90290 (0.0006) [2023-03-07 11:44:28,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12834.1, 300 sec: 12826.0). Total num frames: 92461056. Throughput: 0: 12844.7. Samples: 92435162. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:44:28,322][175405] Avg episode reward: [(0, '30.548')] [2023-03-07 11:44:28,754][175731] Updated weights for policy 0, policy_version 90300 (0.0006) [2023-03-07 11:44:29,545][175731] Updated weights for policy 0, policy_version 90310 (0.0007) [2023-03-07 11:44:30,338][175731] Updated weights for policy 0, policy_version 90320 (0.0007) [2023-03-07 11:44:31,141][175731] Updated weights for policy 0, policy_version 90330 (0.0006) [2023-03-07 11:44:31,935][175731] Updated weights for policy 0, policy_version 90340 (0.0007) [2023-03-07 11:44:32,739][175731] Updated weights for policy 0, policy_version 90350 (0.0006) [2023-03-07 11:44:33,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12851.2, 300 sec: 12826.0). Total num frames: 92525568. Throughput: 0: 12846.0. Samples: 92512043. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:44:33,322][175405] Avg episode reward: [(0, '27.433')] [2023-03-07 11:44:33,537][175731] Updated weights for policy 0, policy_version 90360 (0.0006) [2023-03-07 11:44:34,329][175731] Updated weights for policy 0, policy_version 90370 (0.0006) [2023-03-07 11:44:35,117][175731] Updated weights for policy 0, policy_version 90380 (0.0007) [2023-03-07 11:44:35,912][175731] Updated weights for policy 0, policy_version 90390 (0.0006) [2023-03-07 11:44:36,713][175731] Updated weights for policy 0, policy_version 90400 (0.0007) [2023-03-07 11:44:37,522][175731] Updated weights for policy 0, policy_version 90410 (0.0005) [2023-03-07 11:44:38,305][175731] Updated weights for policy 0, policy_version 90420 (0.0006) [2023-03-07 11:44:38,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12826.0). Total num frames: 92590080. Throughput: 0: 12855.2. Samples: 92589235. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:44:38,322][175405] Avg episode reward: [(0, '27.668')] [2023-03-07 11:44:39,123][175731] Updated weights for policy 0, policy_version 90430 (0.0006) [2023-03-07 11:44:39,912][175731] Updated weights for policy 0, policy_version 90440 (0.0007) [2023-03-07 11:44:40,719][175731] Updated weights for policy 0, policy_version 90450 (0.0006) [2023-03-07 11:44:41,518][175731] Updated weights for policy 0, policy_version 90460 (0.0006) [2023-03-07 11:44:42,311][175731] Updated weights for policy 0, policy_version 90470 (0.0006) [2023-03-07 11:44:43,119][175731] Updated weights for policy 0, policy_version 90480 (0.0007) [2023-03-07 11:44:43,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12826.0). Total num frames: 92653568. Throughput: 0: 12849.0. Samples: 92627571. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:44:43,321][175405] Avg episode reward: [(0, '28.778')] [2023-03-07 11:44:43,921][175731] Updated weights for policy 0, policy_version 90490 (0.0007) [2023-03-07 11:44:44,711][175731] Updated weights for policy 0, policy_version 90500 (0.0006) [2023-03-07 11:44:45,507][175731] Updated weights for policy 0, policy_version 90510 (0.0006) [2023-03-07 11:44:46,317][175731] Updated weights for policy 0, policy_version 90520 (0.0007) [2023-03-07 11:44:47,101][175731] Updated weights for policy 0, policy_version 90530 (0.0007) [2023-03-07 11:44:47,893][175731] Updated weights for policy 0, policy_version 90540 (0.0006) [2023-03-07 11:44:48,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12851.2, 300 sec: 12826.0). Total num frames: 92718080. Throughput: 0: 12839.6. Samples: 92704410. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:44:48,322][175405] Avg episode reward: [(0, '26.815')] [2023-03-07 11:44:48,688][175731] Updated weights for policy 0, policy_version 90550 (0.0006) [2023-03-07 11:44:49,470][175731] Updated weights for policy 0, policy_version 90560 (0.0007) [2023-03-07 11:44:50,250][175731] Updated weights for policy 0, policy_version 90570 (0.0007) [2023-03-07 11:44:51,056][175731] Updated weights for policy 0, policy_version 90580 (0.0007) [2023-03-07 11:44:51,868][175731] Updated weights for policy 0, policy_version 90590 (0.0006) [2023-03-07 11:44:52,641][175731] Updated weights for policy 0, policy_version 90600 (0.0007) [2023-03-07 11:44:53,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12826.0). Total num frames: 92782592. Throughput: 0: 12848.1. Samples: 92782015. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:44:53,322][175405] Avg episode reward: [(0, '28.050')] [2023-03-07 11:44:53,456][175731] Updated weights for policy 0, policy_version 90610 (0.0007) [2023-03-07 11:44:54,246][175731] Updated weights for policy 0, policy_version 90620 (0.0006) [2023-03-07 11:44:55,047][175731] Updated weights for policy 0, policy_version 90630 (0.0006) [2023-03-07 11:44:55,839][175731] Updated weights for policy 0, policy_version 90640 (0.0006) [2023-03-07 11:44:56,640][175731] Updated weights for policy 0, policy_version 90650 (0.0006) [2023-03-07 11:44:57,452][175731] Updated weights for policy 0, policy_version 90660 (0.0006) [2023-03-07 11:44:58,243][175731] Updated weights for policy 0, policy_version 90670 (0.0007) [2023-03-07 11:44:58,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12851.2, 300 sec: 12826.0). Total num frames: 92847104. Throughput: 0: 12840.7. Samples: 92820548. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:44:58,322][175405] Avg episode reward: [(0, '30.909')] [2023-03-07 11:44:59,041][175731] Updated weights for policy 0, policy_version 90680 (0.0008) [2023-03-07 11:44:59,848][175731] Updated weights for policy 0, policy_version 90690 (0.0007) [2023-03-07 11:45:00,638][175731] Updated weights for policy 0, policy_version 90700 (0.0006) [2023-03-07 11:45:01,452][175731] Updated weights for policy 0, policy_version 90710 (0.0006) [2023-03-07 11:45:02,242][175731] Updated weights for policy 0, policy_version 90720 (0.0007) [2023-03-07 11:45:03,028][175731] Updated weights for policy 0, policy_version 90730 (0.0006) [2023-03-07 11:45:03,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12834.1, 300 sec: 12826.0). Total num frames: 92910592. Throughput: 0: 12831.9. Samples: 92897317. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:45:03,322][175405] Avg episode reward: [(0, '26.957')] [2023-03-07 11:45:03,841][175731] Updated weights for policy 0, policy_version 90740 (0.0006) [2023-03-07 11:45:04,627][175731] Updated weights for policy 0, policy_version 90750 (0.0006) [2023-03-07 11:45:05,418][175731] Updated weights for policy 0, policy_version 90760 (0.0007) [2023-03-07 11:45:06,230][175731] Updated weights for policy 0, policy_version 90770 (0.0006) [2023-03-07 11:45:07,030][175731] Updated weights for policy 0, policy_version 90780 (0.0006) [2023-03-07 11:45:07,825][175731] Updated weights for policy 0, policy_version 90790 (0.0006) [2023-03-07 11:45:08,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12851.2, 300 sec: 12826.0). Total num frames: 92975104. Throughput: 0: 12835.3. Samples: 92974319. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:45:08,322][175405] Avg episode reward: [(0, '27.850')] [2023-03-07 11:45:08,631][175731] Updated weights for policy 0, policy_version 90800 (0.0007) [2023-03-07 11:45:09,424][175731] Updated weights for policy 0, policy_version 90810 (0.0006) [2023-03-07 11:45:10,233][175731] Updated weights for policy 0, policy_version 90820 (0.0007) [2023-03-07 11:45:11,026][175731] Updated weights for policy 0, policy_version 90830 (0.0006) [2023-03-07 11:45:11,809][175731] Updated weights for policy 0, policy_version 90840 (0.0006) [2023-03-07 11:45:12,627][175731] Updated weights for policy 0, policy_version 90850 (0.0007) [2023-03-07 11:45:13,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12826.0). Total num frames: 93038592. Throughput: 0: 12831.2. Samples: 93012566. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:45:13,322][175405] Avg episode reward: [(0, '26.324')] [2023-03-07 11:45:13,428][175731] Updated weights for policy 0, policy_version 90860 (0.0006) [2023-03-07 11:45:14,221][175731] Updated weights for policy 0, policy_version 90870 (0.0006) [2023-03-07 11:45:15,020][175731] Updated weights for policy 0, policy_version 90880 (0.0006) [2023-03-07 11:45:15,830][175731] Updated weights for policy 0, policy_version 90890 (0.0007) [2023-03-07 11:45:16,627][175731] Updated weights for policy 0, policy_version 90900 (0.0006) [2023-03-07 11:45:17,435][175731] Updated weights for policy 0, policy_version 90910 (0.0007) [2023-03-07 11:45:18,219][175731] Updated weights for policy 0, policy_version 90920 (0.0006) [2023-03-07 11:45:18,321][175405] Fps is (10 sec: 12800.2, 60 sec: 12834.2, 300 sec: 12826.0). Total num frames: 93103104. Throughput: 0: 12832.0. Samples: 93089483. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:45:18,332][175405] Avg episode reward: [(0, '27.170')] [2023-03-07 11:45:19,025][175731] Updated weights for policy 0, policy_version 90930 (0.0007) [2023-03-07 11:45:19,810][175731] Updated weights for policy 0, policy_version 90940 (0.0006) [2023-03-07 11:45:20,609][175731] Updated weights for policy 0, policy_version 90950 (0.0007) [2023-03-07 11:45:21,403][175731] Updated weights for policy 0, policy_version 90960 (0.0007) [2023-03-07 11:45:22,213][175731] Updated weights for policy 0, policy_version 90970 (0.0006) [2023-03-07 11:45:23,005][175731] Updated weights for policy 0, policy_version 90980 (0.0006) [2023-03-07 11:45:23,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12834.1, 300 sec: 12826.0). Total num frames: 93167616. Throughput: 0: 12831.1. Samples: 93166635. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:45:23,332][175405] Avg episode reward: [(0, '28.470')] [2023-03-07 11:45:23,798][175731] Updated weights for policy 0, policy_version 90990 (0.0008) [2023-03-07 11:45:24,607][175731] Updated weights for policy 0, policy_version 91000 (0.0006) [2023-03-07 11:45:25,398][175731] Updated weights for policy 0, policy_version 91010 (0.0006) [2023-03-07 11:45:26,196][175731] Updated weights for policy 0, policy_version 91020 (0.0006) [2023-03-07 11:45:26,992][175731] Updated weights for policy 0, policy_version 91030 (0.0007) [2023-03-07 11:45:27,788][175731] Updated weights for policy 0, policy_version 91040 (0.0007) [2023-03-07 11:45:28,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.2, 300 sec: 12826.0). Total num frames: 93231104. Throughput: 0: 12834.6. Samples: 93205129. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:45:28,332][175405] Avg episode reward: [(0, '27.697')] [2023-03-07 11:45:28,587][175731] Updated weights for policy 0, policy_version 91050 (0.0006) [2023-03-07 11:45:29,381][175731] Updated weights for policy 0, policy_version 91060 (0.0007) [2023-03-07 11:45:30,199][175731] Updated weights for policy 0, policy_version 91070 (0.0006) [2023-03-07 11:45:30,972][175731] Updated weights for policy 0, policy_version 91080 (0.0007) [2023-03-07 11:45:31,788][175731] Updated weights for policy 0, policy_version 91090 (0.0007) [2023-03-07 11:45:32,594][175731] Updated weights for policy 0, policy_version 91100 (0.0007) [2023-03-07 11:45:33,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12834.1, 300 sec: 12826.0). Total num frames: 93295616. Throughput: 0: 12836.7. Samples: 93282061. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:45:33,332][175405] Avg episode reward: [(0, '26.533')] [2023-03-07 11:45:33,379][175731] Updated weights for policy 0, policy_version 91110 (0.0007) [2023-03-07 11:45:34,162][175731] Updated weights for policy 0, policy_version 91120 (0.0007) [2023-03-07 11:45:34,961][175731] Updated weights for policy 0, policy_version 91130 (0.0007) [2023-03-07 11:45:35,773][175731] Updated weights for policy 0, policy_version 91140 (0.0007) [2023-03-07 11:45:36,570][175731] Updated weights for policy 0, policy_version 91150 (0.0007) [2023-03-07 11:45:37,382][175731] Updated weights for policy 0, policy_version 91160 (0.0007) [2023-03-07 11:45:38,179][175731] Updated weights for policy 0, policy_version 91170 (0.0006) [2023-03-07 11:45:38,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12822.6). Total num frames: 93359104. Throughput: 0: 12822.7. Samples: 93359039. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:45:38,332][175405] Avg episode reward: [(0, '27.474')] [2023-03-07 11:45:38,967][175731] Updated weights for policy 0, policy_version 91180 (0.0007) [2023-03-07 11:45:39,767][175731] Updated weights for policy 0, policy_version 91190 (0.0006) [2023-03-07 11:45:40,573][175731] Updated weights for policy 0, policy_version 91200 (0.0008) [2023-03-07 11:45:41,367][175731] Updated weights for policy 0, policy_version 91210 (0.0006) [2023-03-07 11:45:42,169][175731] Updated weights for policy 0, policy_version 91220 (0.0006) [2023-03-07 11:45:42,958][175731] Updated weights for policy 0, policy_version 91230 (0.0006) [2023-03-07 11:45:43,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12834.1, 300 sec: 12826.0). Total num frames: 93423616. Throughput: 0: 12818.3. Samples: 93397370. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:45:43,332][175405] Avg episode reward: [(0, '26.641')] [2023-03-07 11:45:43,758][175731] Updated weights for policy 0, policy_version 91240 (0.0007) [2023-03-07 11:45:44,542][175731] Updated weights for policy 0, policy_version 91250 (0.0006) [2023-03-07 11:45:45,349][175731] Updated weights for policy 0, policy_version 91260 (0.0006) [2023-03-07 11:45:46,151][175731] Updated weights for policy 0, policy_version 91270 (0.0006) [2023-03-07 11:45:46,974][175731] Updated weights for policy 0, policy_version 91280 (0.0007) [2023-03-07 11:45:47,758][175731] Updated weights for policy 0, policy_version 91290 (0.0006) [2023-03-07 11:45:48,321][175405] Fps is (10 sec: 12902.2, 60 sec: 12834.1, 300 sec: 12826.0). Total num frames: 93488128. Throughput: 0: 12822.6. Samples: 93474333. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:45:48,322][175405] Avg episode reward: [(0, '27.494')] [2023-03-07 11:45:48,326][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000091297_93488128.pth... 
[2023-03-07 11:45:48,358][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000088290_90408960.pth [2023-03-07 11:45:48,561][175731] Updated weights for policy 0, policy_version 91300 (0.0007) [2023-03-07 11:45:49,356][175731] Updated weights for policy 0, policy_version 91310 (0.0006) [2023-03-07 11:45:50,153][175731] Updated weights for policy 0, policy_version 91320 (0.0006) [2023-03-07 11:45:50,950][175731] Updated weights for policy 0, policy_version 91330 (0.0006) [2023-03-07 11:45:51,750][175731] Updated weights for policy 0, policy_version 91340 (0.0006) [2023-03-07 11:45:52,557][175731] Updated weights for policy 0, policy_version 91350 (0.0006) [2023-03-07 11:45:53,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.0, 300 sec: 12826.0). Total num frames: 93551616. Throughput: 0: 12817.3. Samples: 93551099. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:45:53,322][175405] Avg episode reward: [(0, '26.830')] [2023-03-07 11:45:53,354][175731] Updated weights for policy 0, policy_version 91360 (0.0007) [2023-03-07 11:45:54,169][175731] Updated weights for policy 0, policy_version 91370 (0.0006) [2023-03-07 11:45:54,961][175731] Updated weights for policy 0, policy_version 91380 (0.0006) [2023-03-07 11:45:55,750][175731] Updated weights for policy 0, policy_version 91390 (0.0006) [2023-03-07 11:45:56,557][175731] Updated weights for policy 0, policy_version 91400 (0.0007) [2023-03-07 11:45:57,361][175731] Updated weights for policy 0, policy_version 91410 (0.0006) [2023-03-07 11:45:58,138][175731] Updated weights for policy 0, policy_version 91420 (0.0006) [2023-03-07 11:45:58,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12829.5). Total num frames: 93616128. Throughput: 0: 12823.5. Samples: 93589624. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:45:58,322][175405] Avg episode reward: [(0, '27.723')] [2023-03-07 11:45:58,945][175731] Updated weights for policy 0, policy_version 91430 (0.0006) [2023-03-07 11:45:59,762][175731] Updated weights for policy 0, policy_version 91440 (0.0007) [2023-03-07 11:46:00,566][175731] Updated weights for policy 0, policy_version 91450 (0.0006) [2023-03-07 11:46:01,354][175731] Updated weights for policy 0, policy_version 91460 (0.0007) [2023-03-07 11:46:02,166][175731] Updated weights for policy 0, policy_version 91470 (0.0006) [2023-03-07 11:46:02,963][175731] Updated weights for policy 0, policy_version 91480 (0.0006) [2023-03-07 11:46:03,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12826.0). Total num frames: 93679616. Throughput: 0: 12822.1. Samples: 93666477. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:46:03,322][175405] Avg episode reward: [(0, '26.856')] [2023-03-07 11:46:03,769][175731] Updated weights for policy 0, policy_version 91490 (0.0007) [2023-03-07 11:46:04,547][175731] Updated weights for policy 0, policy_version 91500 (0.0006) [2023-03-07 11:46:05,357][175731] Updated weights for policy 0, policy_version 91510 (0.0006) [2023-03-07 11:46:06,158][175731] Updated weights for policy 0, policy_version 91520 (0.0006) [2023-03-07 11:46:06,965][175731] Updated weights for policy 0, policy_version 91530 (0.0007) [2023-03-07 11:46:07,751][175731] Updated weights for policy 0, policy_version 91540 (0.0006) [2023-03-07 11:46:08,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12817.1, 300 sec: 12826.0). Total num frames: 93744128. Throughput: 0: 12815.5. Samples: 93743336. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:46:08,322][175405] Avg episode reward: [(0, '28.263')] [2023-03-07 11:46:08,549][175731] Updated weights for policy 0, policy_version 91550 (0.0006) [2023-03-07 11:46:09,365][175731] Updated weights for policy 0, policy_version 91560 (0.0006) [2023-03-07 11:46:10,154][175731] Updated weights for policy 0, policy_version 91570 (0.0006) [2023-03-07 11:46:10,949][175731] Updated weights for policy 0, policy_version 91580 (0.0006) [2023-03-07 11:46:11,789][175731] Updated weights for policy 0, policy_version 91590 (0.0006) [2023-03-07 11:46:12,571][175731] Updated weights for policy 0, policy_version 91600 (0.0006) [2023-03-07 11:46:13,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12822.6). Total num frames: 93807616. Throughput: 0: 12809.0. Samples: 93781535. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:46:13,322][175405] Avg episode reward: [(0, '26.029')] [2023-03-07 11:46:13,368][175731] Updated weights for policy 0, policy_version 91610 (0.0006) [2023-03-07 11:46:14,183][175731] Updated weights for policy 0, policy_version 91620 (0.0006) [2023-03-07 11:46:14,985][175731] Updated weights for policy 0, policy_version 91630 (0.0007) [2023-03-07 11:46:15,792][175731] Updated weights for policy 0, policy_version 91640 (0.0006) [2023-03-07 11:46:16,582][175731] Updated weights for policy 0, policy_version 91650 (0.0007) [2023-03-07 11:46:17,388][175731] Updated weights for policy 0, policy_version 91660 (0.0006) [2023-03-07 11:46:18,190][175731] Updated weights for policy 0, policy_version 91670 (0.0006) [2023-03-07 11:46:18,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12800.0, 300 sec: 12822.6). Total num frames: 93871104. Throughput: 0: 12797.9. Samples: 93857969. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:46:18,322][175405] Avg episode reward: [(0, '26.852')] [2023-03-07 11:46:18,982][175731] Updated weights for policy 0, policy_version 91680 (0.0006) [2023-03-07 11:46:19,782][175731] Updated weights for policy 0, policy_version 91690 (0.0006) [2023-03-07 11:46:20,593][175731] Updated weights for policy 0, policy_version 91700 (0.0006) [2023-03-07 11:46:21,384][175731] Updated weights for policy 0, policy_version 91710 (0.0006) [2023-03-07 11:46:22,181][175731] Updated weights for policy 0, policy_version 91720 (0.0006) [2023-03-07 11:46:22,976][175731] Updated weights for policy 0, policy_version 91730 (0.0007) [2023-03-07 11:46:23,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12822.6). Total num frames: 93935616. Throughput: 0: 12797.2. Samples: 93934915. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:46:23,322][175405] Avg episode reward: [(0, '27.587')] [2023-03-07 11:46:23,766][175731] Updated weights for policy 0, policy_version 91740 (0.0006) [2023-03-07 11:46:24,563][175731] Updated weights for policy 0, policy_version 91750 (0.0006) [2023-03-07 11:46:25,367][175731] Updated weights for policy 0, policy_version 91760 (0.0007) [2023-03-07 11:46:26,159][175731] Updated weights for policy 0, policy_version 91770 (0.0006) [2023-03-07 11:46:26,969][175731] Updated weights for policy 0, policy_version 91780 (0.0007) [2023-03-07 11:46:27,769][175731] Updated weights for policy 0, policy_version 91790 (0.0006) [2023-03-07 11:46:28,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12817.1, 300 sec: 12826.0). Total num frames: 94000128. Throughput: 0: 12805.1. Samples: 93973600. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:46:28,322][175405] Avg episode reward: [(0, '27.740')] [2023-03-07 11:46:28,567][175731] Updated weights for policy 0, policy_version 91800 (0.0007) [2023-03-07 11:46:29,375][175731] Updated weights for policy 0, policy_version 91810 (0.0006) [2023-03-07 11:46:30,171][175731] Updated weights for policy 0, policy_version 91820 (0.0007) [2023-03-07 11:46:30,956][175731] Updated weights for policy 0, policy_version 91830 (0.0006) [2023-03-07 11:46:31,749][175731] Updated weights for policy 0, policy_version 91840 (0.0006) [2023-03-07 11:46:32,557][175731] Updated weights for policy 0, policy_version 91850 (0.0007) [2023-03-07 11:46:33,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12822.6). Total num frames: 94063616. Throughput: 0: 12803.1. Samples: 94050472. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:46:33,322][175405] Avg episode reward: [(0, '26.929')] [2023-03-07 11:46:33,357][175731] Updated weights for policy 0, policy_version 91860 (0.0007) [2023-03-07 11:46:34,148][175731] Updated weights for policy 0, policy_version 91870 (0.0006) [2023-03-07 11:46:34,945][175731] Updated weights for policy 0, policy_version 91880 (0.0006) [2023-03-07 11:46:35,747][175731] Updated weights for policy 0, policy_version 91890 (0.0007) [2023-03-07 11:46:36,549][175731] Updated weights for policy 0, policy_version 91900 (0.0006) [2023-03-07 11:46:37,354][175731] Updated weights for policy 0, policy_version 91910 (0.0006) [2023-03-07 11:46:38,145][175731] Updated weights for policy 0, policy_version 91920 (0.0005) [2023-03-07 11:46:38,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12826.0). Total num frames: 94128128. Throughput: 0: 12803.3. Samples: 94127245. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:46:38,321][175405] Avg episode reward: [(0, '27.993')] [2023-03-07 11:46:38,950][175731] Updated weights for policy 0, policy_version 91930 (0.0007) [2023-03-07 11:46:39,754][175731] Updated weights for policy 0, policy_version 91940 (0.0007) [2023-03-07 11:46:40,555][175731] Updated weights for policy 0, policy_version 91950 (0.0007) [2023-03-07 11:46:41,352][175731] Updated weights for policy 0, policy_version 91960 (0.0007) [2023-03-07 11:46:42,169][175731] Updated weights for policy 0, policy_version 91970 (0.0007) [2023-03-07 11:46:42,966][175731] Updated weights for policy 0, policy_version 91980 (0.0006) [2023-03-07 11:46:43,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12822.6). Total num frames: 94191616. Throughput: 0: 12799.9. Samples: 94165618. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:46:43,322][175405] Avg episode reward: [(0, '26.656')] [2023-03-07 11:46:43,762][175731] Updated weights for policy 0, policy_version 91990 (0.0007) [2023-03-07 11:46:44,553][175731] Updated weights for policy 0, policy_version 92000 (0.0006) [2023-03-07 11:46:45,357][175731] Updated weights for policy 0, policy_version 92010 (0.0007) [2023-03-07 11:46:46,148][175731] Updated weights for policy 0, policy_version 92020 (0.0006) [2023-03-07 11:46:46,957][175731] Updated weights for policy 0, policy_version 92030 (0.0007) [2023-03-07 11:46:47,758][175731] Updated weights for policy 0, policy_version 92040 (0.0006) [2023-03-07 11:46:48,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12783.0, 300 sec: 12822.6). Total num frames: 94255104. Throughput: 0: 12796.0. Samples: 94242297. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:46:48,321][175405] Avg episode reward: [(0, '28.485')] [2023-03-07 11:46:48,553][175731] Updated weights for policy 0, policy_version 92050 (0.0007) [2023-03-07 11:46:49,363][175731] Updated weights for policy 0, policy_version 92060 (0.0006) [2023-03-07 11:46:50,175][175731] Updated weights for policy 0, policy_version 92070 (0.0006) [2023-03-07 11:46:50,955][175731] Updated weights for policy 0, policy_version 92080 (0.0006) [2023-03-07 11:46:51,769][175731] Updated weights for policy 0, policy_version 92090 (0.0007) [2023-03-07 11:46:52,577][175731] Updated weights for policy 0, policy_version 92100 (0.0007) [2023-03-07 11:46:53,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12822.6). Total num frames: 94319616. Throughput: 0: 12789.7. Samples: 94318870. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:46:53,322][175405] Avg episode reward: [(0, '26.792')] [2023-03-07 11:46:53,367][175731] Updated weights for policy 0, policy_version 92110 (0.0007) [2023-03-07 11:46:54,193][175731] Updated weights for policy 0, policy_version 92120 (0.0007) [2023-03-07 11:46:54,978][175731] Updated weights for policy 0, policy_version 92130 (0.0006) [2023-03-07 11:46:55,767][175731] Updated weights for policy 0, policy_version 92140 (0.0005) [2023-03-07 11:46:56,581][175731] Updated weights for policy 0, policy_version 92150 (0.0007) [2023-03-07 11:46:57,385][175731] Updated weights for policy 0, policy_version 92160 (0.0008) [2023-03-07 11:46:58,191][175731] Updated weights for policy 0, policy_version 92170 (0.0006) [2023-03-07 11:46:58,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12782.9, 300 sec: 12819.1). Total num frames: 94383104. Throughput: 0: 12793.7. Samples: 94357255. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:46:58,322][175405] Avg episode reward: [(0, '28.057')] [2023-03-07 11:46:59,001][175731] Updated weights for policy 0, policy_version 92180 (0.0007) [2023-03-07 11:46:59,790][175731] Updated weights for policy 0, policy_version 92190 (0.0007) [2023-03-07 11:47:00,597][175731] Updated weights for policy 0, policy_version 92200 (0.0006) [2023-03-07 11:47:01,398][175731] Updated weights for policy 0, policy_version 92210 (0.0008) [2023-03-07 11:47:02,221][175731] Updated weights for policy 0, policy_version 92220 (0.0006) [2023-03-07 11:47:03,013][175731] Updated weights for policy 0, policy_version 92230 (0.0006) [2023-03-07 11:47:03,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12782.9, 300 sec: 12815.6). Total num frames: 94446592. Throughput: 0: 12795.8. Samples: 94433777. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:47:03,322][175405] Avg episode reward: [(0, '27.169')] [2023-03-07 11:47:03,796][175731] Updated weights for policy 0, policy_version 92240 (0.0007) [2023-03-07 11:47:04,604][175731] Updated weights for policy 0, policy_version 92250 (0.0007) [2023-03-07 11:47:05,389][175731] Updated weights for policy 0, policy_version 92260 (0.0006) [2023-03-07 11:47:06,199][175731] Updated weights for policy 0, policy_version 92270 (0.0007) [2023-03-07 11:47:06,976][175731] Updated weights for policy 0, policy_version 92280 (0.0006) [2023-03-07 11:47:07,784][175731] Updated weights for policy 0, policy_version 92290 (0.0006) [2023-03-07 11:47:08,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12782.9, 300 sec: 12819.1). Total num frames: 94511104. Throughput: 0: 12797.3. Samples: 94510793. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:47:08,322][175405] Avg episode reward: [(0, '29.563')] [2023-03-07 11:47:08,585][175731] Updated weights for policy 0, policy_version 92300 (0.0006) [2023-03-07 11:47:09,408][175731] Updated weights for policy 0, policy_version 92310 (0.0006) [2023-03-07 11:47:10,203][175731] Updated weights for policy 0, policy_version 92320 (0.0007) [2023-03-07 11:47:10,988][175731] Updated weights for policy 0, policy_version 92330 (0.0006) [2023-03-07 11:47:11,791][175731] Updated weights for policy 0, policy_version 92340 (0.0007) [2023-03-07 11:47:12,599][175731] Updated weights for policy 0, policy_version 92350 (0.0006) [2023-03-07 11:47:13,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12800.0, 300 sec: 12819.1). Total num frames: 94575616. Throughput: 0: 12786.6. Samples: 94548998. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:47:13,322][175405] Avg episode reward: [(0, '29.895')] [2023-03-07 11:47:13,375][175731] Updated weights for policy 0, policy_version 92360 (0.0007) [2023-03-07 11:47:14,185][175731] Updated weights for policy 0, policy_version 92370 (0.0007) [2023-03-07 11:47:14,984][175731] Updated weights for policy 0, policy_version 92380 (0.0006) [2023-03-07 11:47:15,783][175731] Updated weights for policy 0, policy_version 92390 (0.0006) [2023-03-07 11:47:16,596][175731] Updated weights for policy 0, policy_version 92400 (0.0006) [2023-03-07 11:47:17,412][175731] Updated weights for policy 0, policy_version 92410 (0.0007) [2023-03-07 11:47:18,210][175731] Updated weights for policy 0, policy_version 92420 (0.0006) [2023-03-07 11:47:18,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12800.0, 300 sec: 12815.6). Total num frames: 94639104. Throughput: 0: 12785.4. Samples: 94625818. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:47:18,322][175405] Avg episode reward: [(0, '30.508')] [2023-03-07 11:47:19,014][175731] Updated weights for policy 0, policy_version 92430 (0.0006) [2023-03-07 11:47:19,806][175731] Updated weights for policy 0, policy_version 92440 (0.0006) [2023-03-07 11:47:20,611][175731] Updated weights for policy 0, policy_version 92450 (0.0007) [2023-03-07 11:47:21,416][175731] Updated weights for policy 0, policy_version 92460 (0.0007) [2023-03-07 11:47:22,219][175731] Updated weights for policy 0, policy_version 92470 (0.0006) [2023-03-07 11:47:23,014][175731] Updated weights for policy 0, policy_version 92480 (0.0006) [2023-03-07 11:47:23,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12782.9, 300 sec: 12812.1). Total num frames: 94702592. Throughput: 0: 12787.0. Samples: 94702660. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:47:23,322][175405] Avg episode reward: [(0, '27.289')] [2023-03-07 11:47:23,814][175731] Updated weights for policy 0, policy_version 92490 (0.0006) [2023-03-07 11:47:24,609][175731] Updated weights for policy 0, policy_version 92500 (0.0007) [2023-03-07 11:47:25,406][175731] Updated weights for policy 0, policy_version 92510 (0.0006) [2023-03-07 11:47:26,202][175731] Updated weights for policy 0, policy_version 92520 (0.0006) [2023-03-07 11:47:27,018][175731] Updated weights for policy 0, policy_version 92530 (0.0006) [2023-03-07 11:47:27,820][175731] Updated weights for policy 0, policy_version 92540 (0.0006) [2023-03-07 11:47:28,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12782.9, 300 sec: 12815.6). Total num frames: 94767104. Throughput: 0: 12784.6. Samples: 94740923. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:47:28,322][175405] Avg episode reward: [(0, '27.748')] [2023-03-07 11:47:28,629][175731] Updated weights for policy 0, policy_version 92550 (0.0006) [2023-03-07 11:47:29,422][175731] Updated weights for policy 0, policy_version 92560 (0.0008) [2023-03-07 11:47:30,227][175731] Updated weights for policy 0, policy_version 92570 (0.0006) [2023-03-07 11:47:31,026][175731] Updated weights for policy 0, policy_version 92580 (0.0006) [2023-03-07 11:47:31,824][175731] Updated weights for policy 0, policy_version 92590 (0.0008) [2023-03-07 11:47:32,622][175731] Updated weights for policy 0, policy_version 92600 (0.0006) [2023-03-07 11:47:33,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12782.9, 300 sec: 12812.1). Total num frames: 94830592. Throughput: 0: 12784.1. Samples: 94817581. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:47:33,322][175405] Avg episode reward: [(0, '27.992')] [2023-03-07 11:47:33,417][175731] Updated weights for policy 0, policy_version 92610 (0.0006) [2023-03-07 11:47:34,221][175731] Updated weights for policy 0, policy_version 92620 (0.0006) [2023-03-07 11:47:35,029][175731] Updated weights for policy 0, policy_version 92630 (0.0007) [2023-03-07 11:47:35,822][175731] Updated weights for policy 0, policy_version 92640 (0.0006) [2023-03-07 11:47:36,621][175731] Updated weights for policy 0, policy_version 92650 (0.0006) [2023-03-07 11:47:37,429][175731] Updated weights for policy 0, policy_version 92660 (0.0006) [2023-03-07 11:47:38,221][175731] Updated weights for policy 0, policy_version 92670 (0.0006) [2023-03-07 11:47:38,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12782.9, 300 sec: 12815.6). Total num frames: 94895104. Throughput: 0: 12789.8. Samples: 94894411. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:47:38,322][175405] Avg episode reward: [(0, '28.005')] [2023-03-07 11:47:39,026][175731] Updated weights for policy 0, policy_version 92680 (0.0006) [2023-03-07 11:47:39,822][175731] Updated weights for policy 0, policy_version 92690 (0.0007) [2023-03-07 11:47:40,625][175731] Updated weights for policy 0, policy_version 92700 (0.0007) [2023-03-07 11:47:41,422][175731] Updated weights for policy 0, policy_version 92710 (0.0006) [2023-03-07 11:47:42,224][175731] Updated weights for policy 0, policy_version 92720 (0.0006) [2023-03-07 11:47:43,016][175731] Updated weights for policy 0, policy_version 92730 (0.0006) [2023-03-07 11:47:43,321][175405] Fps is (10 sec: 12800.2, 60 sec: 12782.9, 300 sec: 12812.2). Total num frames: 94958592. Throughput: 0: 12790.5. Samples: 94932824. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:47:43,321][175405] Avg episode reward: [(0, '28.739')] [2023-03-07 11:47:43,827][175731] Updated weights for policy 0, policy_version 92740 (0.0007) [2023-03-07 11:47:44,645][175731] Updated weights for policy 0, policy_version 92750 (0.0007) [2023-03-07 11:47:45,435][175731] Updated weights for policy 0, policy_version 92760 (0.0007) [2023-03-07 11:47:46,245][175731] Updated weights for policy 0, policy_version 92770 (0.0006) [2023-03-07 11:47:47,026][175731] Updated weights for policy 0, policy_version 92780 (0.0005) [2023-03-07 11:47:47,819][175731] Updated weights for policy 0, policy_version 92790 (0.0007) [2023-03-07 11:47:48,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12815.6). Total num frames: 95023104. Throughput: 0: 12794.6. Samples: 95009536. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:47:48,322][175405] Avg episode reward: [(0, '28.342')] [2023-03-07 11:47:48,325][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000092796_95023104.pth... 
[2023-03-07 11:47:48,356][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000089793_91948032.pth [2023-03-07 11:47:48,617][175731] Updated weights for policy 0, policy_version 92800 (0.0006) [2023-03-07 11:47:49,425][175731] Updated weights for policy 0, policy_version 92810 (0.0007) [2023-03-07 11:47:50,222][175731] Updated weights for policy 0, policy_version 92820 (0.0007) [2023-03-07 11:47:51,011][175731] Updated weights for policy 0, policy_version 92830 (0.0007) [2023-03-07 11:47:51,808][175731] Updated weights for policy 0, policy_version 92840 (0.0006) [2023-03-07 11:47:52,612][175731] Updated weights for policy 0, policy_version 92850 (0.0006) [2023-03-07 11:47:53,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12782.9, 300 sec: 12812.1). Total num frames: 95086592. Throughput: 0: 12793.2. Samples: 95086488. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:47:53,322][175405] Avg episode reward: [(0, '27.852')] [2023-03-07 11:47:53,406][175731] Updated weights for policy 0, policy_version 92860 (0.0005) [2023-03-07 11:47:54,242][175731] Updated weights for policy 0, policy_version 92870 (0.0007) [2023-03-07 11:47:55,038][175731] Updated weights for policy 0, policy_version 92880 (0.0007) [2023-03-07 11:47:55,830][175731] Updated weights for policy 0, policy_version 92890 (0.0006) [2023-03-07 11:47:56,614][175731] Updated weights for policy 0, policy_version 92900 (0.0007) [2023-03-07 11:47:57,409][175731] Updated weights for policy 0, policy_version 92910 (0.0006) [2023-03-07 11:47:58,202][175731] Updated weights for policy 0, policy_version 92920 (0.0006) [2023-03-07 11:47:58,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12815.6). Total num frames: 95151104. Throughput: 0: 12793.7. Samples: 95124713. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:47:58,321][175405] Avg episode reward: [(0, '26.842')] [2023-03-07 11:47:58,986][175731] Updated weights for policy 0, policy_version 92930 (0.0006) [2023-03-07 11:47:59,788][175731] Updated weights for policy 0, policy_version 92940 (0.0007) [2023-03-07 11:48:00,597][175731] Updated weights for policy 0, policy_version 92950 (0.0007) [2023-03-07 11:48:01,374][175731] Updated weights for policy 0, policy_version 92960 (0.0006) [2023-03-07 11:48:02,183][175731] Updated weights for policy 0, policy_version 92970 (0.0006) [2023-03-07 11:48:02,997][175731] Updated weights for policy 0, policy_version 92980 (0.0007) [2023-03-07 11:48:03,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12817.1, 300 sec: 12815.6). Total num frames: 95215616. Throughput: 0: 12802.9. Samples: 95201948. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:48:03,322][175405] Avg episode reward: [(0, '28.351')] [2023-03-07 11:48:03,789][175731] Updated weights for policy 0, policy_version 92990 (0.0007) [2023-03-07 11:48:04,593][175731] Updated weights for policy 0, policy_version 93000 (0.0007) [2023-03-07 11:48:05,396][175731] Updated weights for policy 0, policy_version 93010 (0.0008) [2023-03-07 11:48:06,197][175731] Updated weights for policy 0, policy_version 93020 (0.0007) [2023-03-07 11:48:07,004][175731] Updated weights for policy 0, policy_version 93030 (0.0007) [2023-03-07 11:48:07,803][175731] Updated weights for policy 0, policy_version 93040 (0.0006) [2023-03-07 11:48:08,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12815.6). Total num frames: 95279104. Throughput: 0: 12795.8. Samples: 95278469. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 11:48:08,322][175405] Avg episode reward: [(0, '28.240')] [2023-03-07 11:48:08,606][175731] Updated weights for policy 0, policy_version 93050 (0.0006) [2023-03-07 11:48:09,427][175731] Updated weights for policy 0, policy_version 93060 (0.0006) [2023-03-07 11:48:10,226][175731] Updated weights for policy 0, policy_version 93070 (0.0006) [2023-03-07 11:48:11,019][175731] Updated weights for policy 0, policy_version 93080 (0.0006) [2023-03-07 11:48:11,821][175731] Updated weights for policy 0, policy_version 93090 (0.0006) [2023-03-07 11:48:12,617][175731] Updated weights for policy 0, policy_version 93100 (0.0006) [2023-03-07 11:48:13,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12782.9, 300 sec: 12812.1). Total num frames: 95342592. Throughput: 0: 12794.0. Samples: 95316651. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:48:13,322][175405] Avg episode reward: [(0, '28.214')] [2023-03-07 11:48:13,417][175731] Updated weights for policy 0, policy_version 93110 (0.0007) [2023-03-07 11:48:14,209][175731] Updated weights for policy 0, policy_version 93120 (0.0006) [2023-03-07 11:48:15,010][175731] Updated weights for policy 0, policy_version 93130 (0.0007) [2023-03-07 11:48:15,812][175731] Updated weights for policy 0, policy_version 93140 (0.0007) [2023-03-07 11:48:16,609][175731] Updated weights for policy 0, policy_version 93150 (0.0007) [2023-03-07 11:48:17,421][175731] Updated weights for policy 0, policy_version 93160 (0.0006) [2023-03-07 11:48:18,211][175731] Updated weights for policy 0, policy_version 93170 (0.0006) [2023-03-07 11:48:18,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12815.6). Total num frames: 95407104. Throughput: 0: 12799.1. Samples: 95393540. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:48:18,322][175405] Avg episode reward: [(0, '29.123')] [2023-03-07 11:48:19,016][175731] Updated weights for policy 0, policy_version 93180 (0.0006) [2023-03-07 11:48:19,812][175731] Updated weights for policy 0, policy_version 93190 (0.0007) [2023-03-07 11:48:20,619][175731] Updated weights for policy 0, policy_version 93200 (0.0007) [2023-03-07 11:48:21,409][175731] Updated weights for policy 0, policy_version 93210 (0.0005) [2023-03-07 11:48:22,209][175731] Updated weights for policy 0, policy_version 93220 (0.0006) [2023-03-07 11:48:23,028][175731] Updated weights for policy 0, policy_version 93230 (0.0007) [2023-03-07 11:48:23,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12812.1). Total num frames: 95470592. Throughput: 0: 12798.4. Samples: 95470338. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:48:23,322][175405] Avg episode reward: [(0, '28.339')] [2023-03-07 11:48:23,825][175731] Updated weights for policy 0, policy_version 93240 (0.0007) [2023-03-07 11:48:24,615][175731] Updated weights for policy 0, policy_version 93250 (0.0006) [2023-03-07 11:48:25,444][175731] Updated weights for policy 0, policy_version 93260 (0.0007) [2023-03-07 11:48:26,245][175731] Updated weights for policy 0, policy_version 93270 (0.0008) [2023-03-07 11:48:27,053][175731] Updated weights for policy 0, policy_version 93280 (0.0008) [2023-03-07 11:48:27,848][175731] Updated weights for policy 0, policy_version 93290 (0.0006) [2023-03-07 11:48:28,321][175405] Fps is (10 sec: 12697.8, 60 sec: 12783.0, 300 sec: 12812.2). Total num frames: 95534080. Throughput: 0: 12789.6. Samples: 95508354. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:48:28,321][175405] Avg episode reward: [(0, '28.213')] [2023-03-07 11:48:28,657][175731] Updated weights for policy 0, policy_version 93300 (0.0007) [2023-03-07 11:48:29,458][175731] Updated weights for policy 0, policy_version 93310 (0.0006) [2023-03-07 11:48:30,259][175731] Updated weights for policy 0, policy_version 93320 (0.0008) [2023-03-07 11:48:31,062][175731] Updated weights for policy 0, policy_version 93330 (0.0006) [2023-03-07 11:48:31,876][175731] Updated weights for policy 0, policy_version 93340 (0.0007) [2023-03-07 11:48:32,672][175731] Updated weights for policy 0, policy_version 93350 (0.0006) [2023-03-07 11:48:33,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12812.1). Total num frames: 95598592. Throughput: 0: 12782.8. Samples: 95584762. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:48:33,322][175405] Avg episode reward: [(0, '27.902')] [2023-03-07 11:48:33,454][175731] Updated weights for policy 0, policy_version 93360 (0.0006) [2023-03-07 11:48:34,266][175731] Updated weights for policy 0, policy_version 93370 (0.0006) [2023-03-07 11:48:35,064][175731] Updated weights for policy 0, policy_version 93380 (0.0007) [2023-03-07 11:48:35,850][175731] Updated weights for policy 0, policy_version 93390 (0.0006) [2023-03-07 11:48:36,653][175731] Updated weights for policy 0, policy_version 93400 (0.0006) [2023-03-07 11:48:37,454][175731] Updated weights for policy 0, policy_version 93410 (0.0006) [2023-03-07 11:48:38,246][175731] Updated weights for policy 0, policy_version 93420 (0.0007) [2023-03-07 11:48:38,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12782.9, 300 sec: 12808.7). Total num frames: 95662080. Throughput: 0: 12789.6. Samples: 95662019. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:48:38,321][175405] Avg episode reward: [(0, '31.107')] [2023-03-07 11:48:39,055][175731] Updated weights for policy 0, policy_version 93430 (0.0005) [2023-03-07 11:48:39,860][175731] Updated weights for policy 0, policy_version 93440 (0.0006) [2023-03-07 11:48:40,633][175731] Updated weights for policy 0, policy_version 93450 (0.0007) [2023-03-07 11:48:41,443][175731] Updated weights for policy 0, policy_version 93460 (0.0007) [2023-03-07 11:48:42,240][175731] Updated weights for policy 0, policy_version 93470 (0.0006) [2023-03-07 11:48:43,028][175731] Updated weights for policy 0, policy_version 93480 (0.0006) [2023-03-07 11:48:43,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12812.2). Total num frames: 95726592. Throughput: 0: 12795.4. Samples: 95700505. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:48:43,322][175405] Avg episode reward: [(0, '28.615')] [2023-03-07 11:48:43,834][175731] Updated weights for policy 0, policy_version 93490 (0.0007) [2023-03-07 11:48:44,625][175731] Updated weights for policy 0, policy_version 93500 (0.0006) [2023-03-07 11:48:45,426][175731] Updated weights for policy 0, policy_version 93510 (0.0005) [2023-03-07 11:48:46,214][175731] Updated weights for policy 0, policy_version 93520 (0.0006) [2023-03-07 11:48:47,019][175731] Updated weights for policy 0, policy_version 93530 (0.0006) [2023-03-07 11:48:47,824][175731] Updated weights for policy 0, policy_version 93540 (0.0007) [2023-03-07 11:48:48,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12800.0, 300 sec: 12812.1). Total num frames: 95791104. Throughput: 0: 12787.5. Samples: 95777385. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:48:48,321][175405] Avg episode reward: [(0, '28.517')] [2023-03-07 11:48:48,613][175731] Updated weights for policy 0, policy_version 93550 (0.0006) [2023-03-07 11:48:49,416][175731] Updated weights for policy 0, policy_version 93560 (0.0006) [2023-03-07 11:48:50,226][175731] Updated weights for policy 0, policy_version 93570 (0.0006) [2023-03-07 11:48:51,033][175731] Updated weights for policy 0, policy_version 93580 (0.0006) [2023-03-07 11:48:51,850][175731] Updated weights for policy 0, policy_version 93590 (0.0007) [2023-03-07 11:48:52,641][175731] Updated weights for policy 0, policy_version 93600 (0.0006) [2023-03-07 11:48:53,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12800.0, 300 sec: 12808.7). Total num frames: 95854592. Throughput: 0: 12791.7. Samples: 95854096. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:48:53,322][175405] Avg episode reward: [(0, '28.262')] [2023-03-07 11:48:53,443][175731] Updated weights for policy 0, policy_version 93610 (0.0006) [2023-03-07 11:48:54,247][175731] Updated weights for policy 0, policy_version 93620 (0.0006) [2023-03-07 11:48:55,045][175731] Updated weights for policy 0, policy_version 93630 (0.0007) [2023-03-07 11:48:55,846][175731] Updated weights for policy 0, policy_version 93640 (0.0006) [2023-03-07 11:48:56,645][175731] Updated weights for policy 0, policy_version 93650 (0.0007) [2023-03-07 11:48:57,425][175731] Updated weights for policy 0, policy_version 93660 (0.0006) [2023-03-07 11:48:58,229][175731] Updated weights for policy 0, policy_version 93670 (0.0006) [2023-03-07 11:48:58,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12782.9, 300 sec: 12805.2). Total num frames: 95918080. Throughput: 0: 12796.3. Samples: 95892483. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:48:58,321][175405] Avg episode reward: [(0, '28.162')] [2023-03-07 11:48:59,039][175731] Updated weights for policy 0, policy_version 93680 (0.0006) [2023-03-07 11:48:59,821][175731] Updated weights for policy 0, policy_version 93690 (0.0006) [2023-03-07 11:49:00,623][175731] Updated weights for policy 0, policy_version 93700 (0.0006) [2023-03-07 11:49:01,425][175731] Updated weights for policy 0, policy_version 93710 (0.0007) [2023-03-07 11:49:02,243][175731] Updated weights for policy 0, policy_version 93720 (0.0006) [2023-03-07 11:49:03,040][175731] Updated weights for policy 0, policy_version 93730 (0.0006) [2023-03-07 11:49:03,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12782.9, 300 sec: 12808.7). Total num frames: 95982592. Throughput: 0: 12791.2. Samples: 95969143. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:49:03,322][175405] Avg episode reward: [(0, '29.720')] [2023-03-07 11:49:03,847][175731] Updated weights for policy 0, policy_version 93740 (0.0006) [2023-03-07 11:49:04,631][175731] Updated weights for policy 0, policy_version 93750 (0.0007) [2023-03-07 11:49:05,433][175731] Updated weights for policy 0, policy_version 93760 (0.0006) [2023-03-07 11:49:06,218][175731] Updated weights for policy 0, policy_version 93770 (0.0006) [2023-03-07 11:49:07,030][175731] Updated weights for policy 0, policy_version 93780 (0.0007) [2023-03-07 11:49:07,820][175731] Updated weights for policy 0, policy_version 93790 (0.0007) [2023-03-07 11:49:08,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12800.0, 300 sec: 12808.7). Total num frames: 96047104. Throughput: 0: 12797.5. Samples: 96046227. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:49:08,321][175405] Avg episode reward: [(0, '27.313')] [2023-03-07 11:49:08,628][175731] Updated weights for policy 0, policy_version 93800 (0.0007) [2023-03-07 11:49:09,451][175731] Updated weights for policy 0, policy_version 93810 (0.0007) [2023-03-07 11:49:10,247][175731] Updated weights for policy 0, policy_version 93820 (0.0006) [2023-03-07 11:49:11,043][175731] Updated weights for policy 0, policy_version 93830 (0.0007) [2023-03-07 11:49:11,839][175731] Updated weights for policy 0, policy_version 93840 (0.0007) [2023-03-07 11:49:12,630][175731] Updated weights for policy 0, policy_version 93850 (0.0006) [2023-03-07 11:49:13,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12800.0, 300 sec: 12805.2). Total num frames: 96110592. Throughput: 0: 12805.6. Samples: 96084605. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:49:13,321][175405] Avg episode reward: [(0, '27.313')] [2023-03-07 11:49:13,443][175731] Updated weights for policy 0, policy_version 93860 (0.0006) [2023-03-07 11:49:14,235][175731] Updated weights for policy 0, policy_version 93870 (0.0007) [2023-03-07 11:49:15,037][175731] Updated weights for policy 0, policy_version 93880 (0.0006) [2023-03-07 11:49:15,855][175731] Updated weights for policy 0, policy_version 93890 (0.0007) [2023-03-07 11:49:16,667][175731] Updated weights for policy 0, policy_version 93900 (0.0007) [2023-03-07 11:49:17,467][175731] Updated weights for policy 0, policy_version 93910 (0.0007) [2023-03-07 11:49:18,288][175731] Updated weights for policy 0, policy_version 93920 (0.0007) [2023-03-07 11:49:18,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12782.9, 300 sec: 12801.7). Total num frames: 96174080. Throughput: 0: 12805.7. Samples: 96161017. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:49:18,322][175405] Avg episode reward: [(0, '28.635')] [2023-03-07 11:49:19,079][175731] Updated weights for policy 0, policy_version 93930 (0.0006) [2023-03-07 11:49:19,881][175731] Updated weights for policy 0, policy_version 93940 (0.0005) [2023-03-07 11:49:20,669][175731] Updated weights for policy 0, policy_version 93950 (0.0006) [2023-03-07 11:49:21,473][175731] Updated weights for policy 0, policy_version 93960 (0.0006) [2023-03-07 11:49:22,269][175731] Updated weights for policy 0, policy_version 93970 (0.0007) [2023-03-07 11:49:23,056][175731] Updated weights for policy 0, policy_version 93980 (0.0006) [2023-03-07 11:49:23,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12800.0, 300 sec: 12805.2). Total num frames: 96238592. Throughput: 0: 12796.5. Samples: 96237860. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:49:23,321][175405] Avg episode reward: [(0, '27.970')] [2023-03-07 11:49:23,869][175731] Updated weights for policy 0, policy_version 93990 (0.0007) [2023-03-07 11:49:24,659][175731] Updated weights for policy 0, policy_version 94000 (0.0006) [2023-03-07 11:49:25,459][175731] Updated weights for policy 0, policy_version 94010 (0.0007) [2023-03-07 11:49:26,254][175731] Updated weights for policy 0, policy_version 94020 (0.0006) [2023-03-07 11:49:27,065][175731] Updated weights for policy 0, policy_version 94030 (0.0006) [2023-03-07 11:49:27,853][175731] Updated weights for policy 0, policy_version 94040 (0.0006) [2023-03-07 11:49:28,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12801.7). Total num frames: 96302080. Throughput: 0: 12795.1. Samples: 96276284. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:49:28,322][175405] Avg episode reward: [(0, '28.191')] [2023-03-07 11:49:28,649][175731] Updated weights for policy 0, policy_version 94050 (0.0006) [2023-03-07 11:49:29,446][175731] Updated weights for policy 0, policy_version 94060 (0.0007) [2023-03-07 11:49:30,238][175731] Updated weights for policy 0, policy_version 94070 (0.0007) [2023-03-07 11:49:31,059][175731] Updated weights for policy 0, policy_version 94080 (0.0007) [2023-03-07 11:49:31,862][175731] Updated weights for policy 0, policy_version 94090 (0.0006) [2023-03-07 11:49:32,651][175731] Updated weights for policy 0, policy_version 94100 (0.0006) [2023-03-07 11:49:33,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12800.0, 300 sec: 12801.7). Total num frames: 96366592. Throughput: 0: 12797.1. Samples: 96353253. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:49:33,332][175405] Avg episode reward: [(0, '28.891')] [2023-03-07 11:49:33,449][175731] Updated weights for policy 0, policy_version 94110 (0.0007) [2023-03-07 11:49:34,269][175731] Updated weights for policy 0, policy_version 94120 (0.0007) [2023-03-07 11:49:35,072][175731] Updated weights for policy 0, policy_version 94130 (0.0006) [2023-03-07 11:49:35,868][175731] Updated weights for policy 0, policy_version 94140 (0.0006) [2023-03-07 11:49:36,666][175731] Updated weights for policy 0, policy_version 94150 (0.0007) [2023-03-07 11:49:37,484][175731] Updated weights for policy 0, policy_version 94160 (0.0006) [2023-03-07 11:49:38,265][175731] Updated weights for policy 0, policy_version 94170 (0.0007) [2023-03-07 11:49:38,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12801.7). Total num frames: 96430080. Throughput: 0: 12792.0. Samples: 96429738. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:49:38,332][175405] Avg episode reward: [(0, '29.150')] [2023-03-07 11:49:39,092][175731] Updated weights for policy 0, policy_version 94180 (0.0006) [2023-03-07 11:49:39,888][175731] Updated weights for policy 0, policy_version 94190 (0.0005) [2023-03-07 11:49:40,698][175731] Updated weights for policy 0, policy_version 94200 (0.0006) [2023-03-07 11:49:41,500][175731] Updated weights for policy 0, policy_version 94210 (0.0007) [2023-03-07 11:49:42,296][175731] Updated weights for policy 0, policy_version 94220 (0.0006) [2023-03-07 11:49:43,077][175731] Updated weights for policy 0, policy_version 94230 (0.0007) [2023-03-07 11:49:43,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12800.0, 300 sec: 12801.7). Total num frames: 96494592. Throughput: 0: 12789.0. Samples: 96467989. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:49:43,322][175405] Avg episode reward: [(0, '33.682')] [2023-03-07 11:49:43,865][175731] Updated weights for policy 0, policy_version 94240 (0.0006) [2023-03-07 11:49:44,662][175731] Updated weights for policy 0, policy_version 94250 (0.0006) [2023-03-07 11:49:45,445][175731] Updated weights for policy 0, policy_version 94260 (0.0007) [2023-03-07 11:49:46,248][175731] Updated weights for policy 0, policy_version 94270 (0.0007) [2023-03-07 11:49:47,046][175731] Updated weights for policy 0, policy_version 94280 (0.0006) [2023-03-07 11:49:47,839][175731] Updated weights for policy 0, policy_version 94290 (0.0007) [2023-03-07 11:49:48,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12782.9, 300 sec: 12798.3). Total num frames: 96558080. Throughput: 0: 12803.3. Samples: 96545292. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:49:48,322][175405] Avg episode reward: [(0, '29.350')] [2023-03-07 11:49:48,326][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000094296_96559104.pth... 
[2023-03-07 11:49:48,356][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000091297_93488128.pth [2023-03-07 11:49:48,643][175731] Updated weights for policy 0, policy_version 94300 (0.0007) [2023-03-07 11:49:49,420][175731] Updated weights for policy 0, policy_version 94310 (0.0006) [2023-03-07 11:49:50,228][175731] Updated weights for policy 0, policy_version 94320 (0.0006) [2023-03-07 11:49:51,021][175731] Updated weights for policy 0, policy_version 94330 (0.0007) [2023-03-07 11:49:51,813][175731] Updated weights for policy 0, policy_version 94340 (0.0006) [2023-03-07 11:49:52,611][175731] Updated weights for policy 0, policy_version 94350 (0.0006) [2023-03-07 11:49:53,321][175405] Fps is (10 sec: 12800.2, 60 sec: 12800.0, 300 sec: 12798.3). Total num frames: 96622592. Throughput: 0: 12807.5. Samples: 96622562. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:49:53,321][175405] Avg episode reward: [(0, '30.946')] [2023-03-07 11:49:53,398][175731] Updated weights for policy 0, policy_version 94360 (0.0006) [2023-03-07 11:49:54,199][175731] Updated weights for policy 0, policy_version 94370 (0.0006) [2023-03-07 11:49:54,994][175731] Updated weights for policy 0, policy_version 94380 (0.0007) [2023-03-07 11:49:55,801][175731] Updated weights for policy 0, policy_version 94390 (0.0007) [2023-03-07 11:49:56,596][175731] Updated weights for policy 0, policy_version 94400 (0.0006) [2023-03-07 11:49:57,393][175731] Updated weights for policy 0, policy_version 94410 (0.0007) [2023-03-07 11:49:58,200][175731] Updated weights for policy 0, policy_version 94420 (0.0007) [2023-03-07 11:49:58,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12817.1, 300 sec: 12801.7). Total num frames: 96687104. Throughput: 0: 12809.8. Samples: 96661046. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:49:58,321][175405] Avg episode reward: [(0, '30.357')] [2023-03-07 11:49:59,012][175731] Updated weights for policy 0, policy_version 94430 (0.0007) [2023-03-07 11:49:59,794][175731] Updated weights for policy 0, policy_version 94440 (0.0008) [2023-03-07 11:50:00,585][175731] Updated weights for policy 0, policy_version 94450 (0.0006) [2023-03-07 11:50:01,374][175731] Updated weights for policy 0, policy_version 94460 (0.0007) [2023-03-07 11:50:02,185][175731] Updated weights for policy 0, policy_version 94470 (0.0007) [2023-03-07 11:50:03,002][175731] Updated weights for policy 0, policy_version 94480 (0.0006) [2023-03-07 11:50:03,321][175405] Fps is (10 sec: 12902.3, 60 sec: 12817.1, 300 sec: 12801.7). Total num frames: 96751616. Throughput: 0: 12821.2. Samples: 96737969. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 11:50:03,322][175405] Avg episode reward: [(0, '28.507')] [2023-03-07 11:50:03,788][175731] Updated weights for policy 0, policy_version 94490 (0.0006) [2023-03-07 11:50:04,582][175731] Updated weights for policy 0, policy_version 94500 (0.0007) [2023-03-07 11:50:05,381][175731] Updated weights for policy 0, policy_version 94510 (0.0006) [2023-03-07 11:50:06,201][175731] Updated weights for policy 0, policy_version 94520 (0.0006) [2023-03-07 11:50:06,994][175731] Updated weights for policy 0, policy_version 94530 (0.0006) [2023-03-07 11:50:07,799][175731] Updated weights for policy 0, policy_version 94540 (0.0007) [2023-03-07 11:50:08,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12801.7). Total num frames: 96815104. Throughput: 0: 12817.7. Samples: 96814655. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 11:50:08,322][175405] Avg episode reward: [(0, '29.077')] [2023-03-07 11:50:08,600][175731] Updated weights for policy 0, policy_version 94550 (0.0007) [2023-03-07 11:50:09,395][175731] Updated weights for policy 0, policy_version 94560 (0.0007) [2023-03-07 11:50:10,202][175731] Updated weights for policy 0, policy_version 94570 (0.0007) [2023-03-07 11:50:10,995][175731] Updated weights for policy 0, policy_version 94580 (0.0006) [2023-03-07 11:50:11,801][175731] Updated weights for policy 0, policy_version 94590 (0.0006) [2023-03-07 11:50:12,598][175731] Updated weights for policy 0, policy_version 94600 (0.0006) [2023-03-07 11:50:13,321][175405] Fps is (10 sec: 12800.2, 60 sec: 12817.1, 300 sec: 12801.7). Total num frames: 96879616. Throughput: 0: 12816.8. Samples: 96853040. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 11:50:13,322][175405] Avg episode reward: [(0, '29.149')] [2023-03-07 11:50:13,403][175731] Updated weights for policy 0, policy_version 94610 (0.0007) [2023-03-07 11:50:14,206][175731] Updated weights for policy 0, policy_version 94620 (0.0006) [2023-03-07 11:50:15,001][175731] Updated weights for policy 0, policy_version 94630 (0.0006) [2023-03-07 11:50:15,787][175731] Updated weights for policy 0, policy_version 94640 (0.0007) [2023-03-07 11:50:16,603][175731] Updated weights for policy 0, policy_version 94650 (0.0007) [2023-03-07 11:50:17,406][175731] Updated weights for policy 0, policy_version 94660 (0.0006) [2023-03-07 11:50:18,213][175731] Updated weights for policy 0, policy_version 94670 (0.0006) [2023-03-07 11:50:18,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12798.3). Total num frames: 96943104. Throughput: 0: 12814.0. Samples: 96929882. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 11:50:18,322][175405] Avg episode reward: [(0, '30.230')] [2023-03-07 11:50:19,013][175731] Updated weights for policy 0, policy_version 94680 (0.0006) [2023-03-07 11:50:19,805][175731] Updated weights for policy 0, policy_version 94690 (0.0007) [2023-03-07 11:50:20,609][175731] Updated weights for policy 0, policy_version 94700 (0.0007) [2023-03-07 11:50:21,395][175731] Updated weights for policy 0, policy_version 94710 (0.0006) [2023-03-07 11:50:22,185][175731] Updated weights for policy 0, policy_version 94720 (0.0006) [2023-03-07 11:50:22,982][175731] Updated weights for policy 0, policy_version 94730 (0.0006) [2023-03-07 11:50:23,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12817.1, 300 sec: 12801.7). Total num frames: 97007616. Throughput: 0: 12826.9. Samples: 97006948. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 11:50:23,322][175405] Avg episode reward: [(0, '29.374')] [2023-03-07 11:50:23,781][175731] Updated weights for policy 0, policy_version 94740 (0.0006) [2023-03-07 11:50:24,587][175731] Updated weights for policy 0, policy_version 94750 (0.0006) [2023-03-07 11:50:25,380][175731] Updated weights for policy 0, policy_version 94760 (0.0006) [2023-03-07 11:50:26,194][175731] Updated weights for policy 0, policy_version 94770 (0.0006) [2023-03-07 11:50:27,004][175731] Updated weights for policy 0, policy_version 94780 (0.0007) [2023-03-07 11:50:27,808][175731] Updated weights for policy 0, policy_version 94790 (0.0006) [2023-03-07 11:50:28,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12798.3). Total num frames: 97071104. Throughput: 0: 12825.1. Samples: 97045117. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 11:50:28,322][175405] Avg episode reward: [(0, '27.678')] [2023-03-07 11:50:28,616][175731] Updated weights for policy 0, policy_version 94800 (0.0009) [2023-03-07 11:50:29,422][175731] Updated weights for policy 0, policy_version 94810 (0.0008) [2023-03-07 11:50:30,216][175731] Updated weights for policy 0, policy_version 94820 (0.0007) [2023-03-07 11:50:31,005][175731] Updated weights for policy 0, policy_version 94830 (0.0006) [2023-03-07 11:50:31,812][175731] Updated weights for policy 0, policy_version 94840 (0.0008) [2023-03-07 11:50:32,628][175731] Updated weights for policy 0, policy_version 94850 (0.0008) [2023-03-07 11:50:33,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12800.0, 300 sec: 12798.3). Total num frames: 97134592. Throughput: 0: 12806.9. Samples: 97121603. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 11:50:33,321][175405] Avg episode reward: [(0, '29.178')] [2023-03-07 11:50:33,415][175731] Updated weights for policy 0, policy_version 94860 (0.0006) [2023-03-07 11:50:34,241][175731] Updated weights for policy 0, policy_version 94870 (0.0006) [2023-03-07 11:50:35,022][175731] Updated weights for policy 0, policy_version 94880 (0.0006) [2023-03-07 11:50:35,818][175731] Updated weights for policy 0, policy_version 94890 (0.0006) [2023-03-07 11:50:36,626][175731] Updated weights for policy 0, policy_version 94900 (0.0007) [2023-03-07 11:50:37,444][175731] Updated weights for policy 0, policy_version 94910 (0.0006) [2023-03-07 11:50:38,241][175731] Updated weights for policy 0, policy_version 94920 (0.0006) [2023-03-07 11:50:38,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12798.3). Total num frames: 97199104. Throughput: 0: 12794.1. Samples: 97198296. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 11:50:38,321][175405] Avg episode reward: [(0, '28.163')] [2023-03-07 11:50:39,014][175731] Updated weights for policy 0, policy_version 94930 (0.0007) [2023-03-07 11:50:39,829][175731] Updated weights for policy 0, policy_version 94940 (0.0006) [2023-03-07 11:50:40,640][175731] Updated weights for policy 0, policy_version 94950 (0.0007) [2023-03-07 11:50:41,430][175731] Updated weights for policy 0, policy_version 94960 (0.0006) [2023-03-07 11:50:42,230][175731] Updated weights for policy 0, policy_version 94970 (0.0006) [2023-03-07 11:50:43,017][175731] Updated weights for policy 0, policy_version 94980 (0.0007) [2023-03-07 11:50:43,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12794.8). Total num frames: 97262592. Throughput: 0: 12792.5. Samples: 97236707. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 11:50:43,322][175405] Avg episode reward: [(0, '28.863')] [2023-03-07 11:50:43,811][175731] Updated weights for policy 0, policy_version 94990 (0.0006) [2023-03-07 11:50:44,610][175731] Updated weights for policy 0, policy_version 95000 (0.0006) [2023-03-07 11:50:45,393][175731] Updated weights for policy 0, policy_version 95010 (0.0006) [2023-03-07 11:50:46,186][175731] Updated weights for policy 0, policy_version 95020 (0.0006) [2023-03-07 11:50:46,999][175731] Updated weights for policy 0, policy_version 95030 (0.0006) [2023-03-07 11:50:47,811][175731] Updated weights for policy 0, policy_version 95040 (0.0006) [2023-03-07 11:50:48,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12798.3). Total num frames: 97327104. Throughput: 0: 12797.7. Samples: 97313864. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 11:50:48,321][175405] Avg episode reward: [(0, '29.608')] [2023-03-07 11:50:48,586][175731] Updated weights for policy 0, policy_version 95050 (0.0006) [2023-03-07 11:50:49,418][175731] Updated weights for policy 0, policy_version 95060 (0.0007) [2023-03-07 11:50:50,207][175731] Updated weights for policy 0, policy_version 95070 (0.0007) [2023-03-07 11:50:50,993][175731] Updated weights for policy 0, policy_version 95080 (0.0006) [2023-03-07 11:50:51,808][175731] Updated weights for policy 0, policy_version 95090 (0.0007) [2023-03-07 11:50:52,601][175731] Updated weights for policy 0, policy_version 95100 (0.0006) [2023-03-07 11:50:53,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12817.1, 300 sec: 12798.3). Total num frames: 97391616. Throughput: 0: 12804.3. Samples: 97390849. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 11:50:53,321][175405] Avg episode reward: [(0, '28.367')] [2023-03-07 11:50:53,391][175731] Updated weights for policy 0, policy_version 95110 (0.0006) [2023-03-07 11:50:54,174][175731] Updated weights for policy 0, policy_version 95120 (0.0007) [2023-03-07 11:50:54,966][175731] Updated weights for policy 0, policy_version 95130 (0.0006) [2023-03-07 11:50:55,772][175731] Updated weights for policy 0, policy_version 95140 (0.0006) [2023-03-07 11:50:56,568][175731] Updated weights for policy 0, policy_version 95150 (0.0006) [2023-03-07 11:50:57,374][175731] Updated weights for policy 0, policy_version 95160 (0.0006) [2023-03-07 11:50:58,153][175731] Updated weights for policy 0, policy_version 95170 (0.0007) [2023-03-07 11:50:58,321][175405] Fps is (10 sec: 12902.5, 60 sec: 12817.1, 300 sec: 12801.7). Total num frames: 97456128. Throughput: 0: 12807.8. Samples: 97429391. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 11:50:58,321][175405] Avg episode reward: [(0, '30.186')] [2023-03-07 11:50:58,947][175731] Updated weights for policy 0, policy_version 95180 (0.0006) [2023-03-07 11:50:59,766][175731] Updated weights for policy 0, policy_version 95190 (0.0006) [2023-03-07 11:51:00,555][175731] Updated weights for policy 0, policy_version 95200 (0.0006) [2023-03-07 11:51:01,350][175731] Updated weights for policy 0, policy_version 95210 (0.0007) [2023-03-07 11:51:02,162][175731] Updated weights for policy 0, policy_version 95220 (0.0007) [2023-03-07 11:51:02,942][175731] Updated weights for policy 0, policy_version 95230 (0.0007) [2023-03-07 11:51:03,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12800.0, 300 sec: 12798.3). Total num frames: 97519616. Throughput: 0: 12814.1. Samples: 97506517. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:51:03,332][175405] Avg episode reward: [(0, '29.199')] [2023-03-07 11:51:03,734][175731] Updated weights for policy 0, policy_version 95240 (0.0008) [2023-03-07 11:51:04,525][175731] Updated weights for policy 0, policy_version 95250 (0.0006) [2023-03-07 11:51:05,332][175731] Updated weights for policy 0, policy_version 95260 (0.0007) [2023-03-07 11:51:06,121][175731] Updated weights for policy 0, policy_version 95270 (0.0006) [2023-03-07 11:51:06,946][175731] Updated weights for policy 0, policy_version 95280 (0.0006) [2023-03-07 11:51:07,745][175731] Updated weights for policy 0, policy_version 95290 (0.0007) [2023-03-07 11:51:08,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12817.1, 300 sec: 12801.7). Total num frames: 97584128. Throughput: 0: 12811.2. Samples: 97583451. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:51:08,333][175405] Avg episode reward: [(0, '29.523')] [2023-03-07 11:51:08,521][175731] Updated weights for policy 0, policy_version 95300 (0.0006) [2023-03-07 11:51:09,320][175731] Updated weights for policy 0, policy_version 95310 (0.0006) [2023-03-07 11:51:10,141][175731] Updated weights for policy 0, policy_version 95320 (0.0006) [2023-03-07 11:51:10,936][175731] Updated weights for policy 0, policy_version 95330 (0.0006) [2023-03-07 11:51:11,738][175731] Updated weights for policy 0, policy_version 95340 (0.0006) [2023-03-07 11:51:12,527][175731] Updated weights for policy 0, policy_version 95350 (0.0006) [2023-03-07 11:51:13,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12801.7). Total num frames: 97647616. Throughput: 0: 12814.7. Samples: 97621780. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:51:13,331][175731] Updated weights for policy 0, policy_version 95360 (0.0007) [2023-03-07 11:51:13,332][175405] Avg episode reward: [(0, '28.707')] [2023-03-07 11:51:14,134][175731] Updated weights for policy 0, policy_version 95370 (0.0006) [2023-03-07 11:51:14,936][175731] Updated weights for policy 0, policy_version 95380 (0.0006) [2023-03-07 11:51:15,717][175731] Updated weights for policy 0, policy_version 95390 (0.0007) [2023-03-07 11:51:16,533][175731] Updated weights for policy 0, policy_version 95400 (0.0007) [2023-03-07 11:51:17,323][175731] Updated weights for policy 0, policy_version 95410 (0.0006) [2023-03-07 11:51:18,141][175731] Updated weights for policy 0, policy_version 95420 (0.0006) [2023-03-07 11:51:18,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.0, 300 sec: 12801.7). Total num frames: 97712128. Throughput: 0: 12827.6. Samples: 97698846. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:51:18,332][175405] Avg episode reward: [(0, '29.016')] [2023-03-07 11:51:18,933][175731] Updated weights for policy 0, policy_version 95430 (0.0006) [2023-03-07 11:51:19,745][175731] Updated weights for policy 0, policy_version 95440 (0.0007) [2023-03-07 11:51:20,520][175731] Updated weights for policy 0, policy_version 95450 (0.0006) [2023-03-07 11:51:21,328][175731] Updated weights for policy 0, policy_version 95460 (0.0006) [2023-03-07 11:51:22,113][175731] Updated weights for policy 0, policy_version 95470 (0.0006) [2023-03-07 11:51:22,925][175731] Updated weights for policy 0, policy_version 95480 (0.0006) [2023-03-07 11:51:23,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12800.0, 300 sec: 12798.3). Total num frames: 97775616. Throughput: 0: 12828.0. Samples: 97775556. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:51:23,332][175405] Avg episode reward: [(0, '27.971')] [2023-03-07 11:51:23,722][175731] Updated weights for policy 0, policy_version 95490 (0.0007) [2023-03-07 11:51:24,524][175731] Updated weights for policy 0, policy_version 95500 (0.0006) [2023-03-07 11:51:25,325][175731] Updated weights for policy 0, policy_version 95510 (0.0007) [2023-03-07 11:51:26,127][175731] Updated weights for policy 0, policy_version 95520 (0.0006) [2023-03-07 11:51:26,913][175731] Updated weights for policy 0, policy_version 95530 (0.0007) [2023-03-07 11:51:27,714][175731] Updated weights for policy 0, policy_version 95540 (0.0006) [2023-03-07 11:51:28,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12801.7). Total num frames: 97840128. Throughput: 0: 12828.4. Samples: 97813986. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:51:28,332][175405] Avg episode reward: [(0, '35.878')] [2023-03-07 11:51:28,508][175731] Updated weights for policy 0, policy_version 95550 (0.0006) [2023-03-07 11:51:29,338][175731] Updated weights for policy 0, policy_version 95560 (0.0007) [2023-03-07 11:51:30,121][175731] Updated weights for policy 0, policy_version 95570 (0.0006) [2023-03-07 11:51:30,931][175731] Updated weights for policy 0, policy_version 95580 (0.0007) [2023-03-07 11:51:31,735][175731] Updated weights for policy 0, policy_version 95590 (0.0007) [2023-03-07 11:51:32,533][175731] Updated weights for policy 0, policy_version 95600 (0.0006) [2023-03-07 11:51:33,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12798.3). Total num frames: 97903616. Throughput: 0: 12816.2. Samples: 97890593. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:51:33,321][175405] Avg episode reward: [(0, '29.011')] [2023-03-07 11:51:33,331][175731] Updated weights for policy 0, policy_version 95610 (0.0006) [2023-03-07 11:51:34,149][175731] Updated weights for policy 0, policy_version 95620 (0.0006) [2023-03-07 11:51:34,954][175731] Updated weights for policy 0, policy_version 95630 (0.0006) [2023-03-07 11:51:35,749][175731] Updated weights for policy 0, policy_version 95640 (0.0007) [2023-03-07 11:51:36,539][175731] Updated weights for policy 0, policy_version 95650 (0.0006) [2023-03-07 11:51:37,344][175731] Updated weights for policy 0, policy_version 95660 (0.0006) [2023-03-07 11:51:38,147][175731] Updated weights for policy 0, policy_version 95670 (0.0006) [2023-03-07 11:51:38,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12801.7). Total num frames: 97968128. Throughput: 0: 12810.3. Samples: 97967311. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:51:38,321][175405] Avg episode reward: [(0, '26.754')] [2023-03-07 11:51:38,946][175731] Updated weights for policy 0, policy_version 95680 (0.0007) [2023-03-07 11:51:39,743][175731] Updated weights for policy 0, policy_version 95690 (0.0006) [2023-03-07 11:51:40,543][175731] Updated weights for policy 0, policy_version 95700 (0.0006) [2023-03-07 11:51:41,355][175731] Updated weights for policy 0, policy_version 95710 (0.0006) [2023-03-07 11:51:42,162][175731] Updated weights for policy 0, policy_version 95720 (0.0005) [2023-03-07 11:51:42,960][175731] Updated weights for policy 0, policy_version 95730 (0.0006) [2023-03-07 11:51:43,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12801.7). Total num frames: 98031616. Throughput: 0: 12805.1. Samples: 98005619. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:51:43,321][175405] Avg episode reward: [(0, '29.595')] [2023-03-07 11:51:43,748][175731] Updated weights for policy 0, policy_version 95740 (0.0007) [2023-03-07 11:51:44,559][175731] Updated weights for policy 0, policy_version 95750 (0.0007) [2023-03-07 11:51:45,346][175731] Updated weights for policy 0, policy_version 95760 (0.0005) [2023-03-07 11:51:46,140][175731] Updated weights for policy 0, policy_version 95770 (0.0006) [2023-03-07 11:51:46,942][175731] Updated weights for policy 0, policy_version 95780 (0.0006) [2023-03-07 11:51:47,740][175731] Updated weights for policy 0, policy_version 95790 (0.0008) [2023-03-07 11:51:48,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12817.1, 300 sec: 12801.7). Total num frames: 98096128. Throughput: 0: 12801.5. Samples: 98082585. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:51:48,322][175405] Avg episode reward: [(0, '29.887')] [2023-03-07 11:51:48,325][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000095797_98096128.pth... 
[2023-03-07 11:51:48,357][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000092796_95023104.pth [2023-03-07 11:51:48,545][175731] Updated weights for policy 0, policy_version 95800 (0.0007) [2023-03-07 11:51:49,344][175731] Updated weights for policy 0, policy_version 95810 (0.0007) [2023-03-07 11:51:50,130][175731] Updated weights for policy 0, policy_version 95820 (0.0006) [2023-03-07 11:51:50,935][175731] Updated weights for policy 0, policy_version 95830 (0.0006) [2023-03-07 11:51:51,727][175731] Updated weights for policy 0, policy_version 95840 (0.0006) [2023-03-07 11:51:52,546][175731] Updated weights for policy 0, policy_version 95850 (0.0006) [2023-03-07 11:51:53,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12801.7). Total num frames: 98159616. Throughput: 0: 12798.4. Samples: 98159377. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:51:53,322][175405] Avg episode reward: [(0, '28.000')] [2023-03-07 11:51:53,342][175731] Updated weights for policy 0, policy_version 95860 (0.0006) [2023-03-07 11:51:54,126][175731] Updated weights for policy 0, policy_version 95870 (0.0007) [2023-03-07 11:51:54,920][175731] Updated weights for policy 0, policy_version 95880 (0.0005) [2023-03-07 11:51:55,708][175731] Updated weights for policy 0, policy_version 95890 (0.0006) [2023-03-07 11:51:56,515][175731] Updated weights for policy 0, policy_version 95900 (0.0007) [2023-03-07 11:51:57,335][175731] Updated weights for policy 0, policy_version 95910 (0.0006) [2023-03-07 11:51:58,122][175731] Updated weights for policy 0, policy_version 95920 (0.0006) [2023-03-07 11:51:58,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12805.2). Total num frames: 98224128. Throughput: 0: 12807.7. Samples: 98198125. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:51:58,322][175405] Avg episode reward: [(0, '27.645')] [2023-03-07 11:51:58,917][175731] Updated weights for policy 0, policy_version 95930 (0.0007) [2023-03-07 11:51:59,727][175731] Updated weights for policy 0, policy_version 95940 (0.0007) [2023-03-07 11:52:00,524][175731] Updated weights for policy 0, policy_version 95950 (0.0007) [2023-03-07 11:52:01,332][175731] Updated weights for policy 0, policy_version 95960 (0.0007) [2023-03-07 11:52:02,146][175731] Updated weights for policy 0, policy_version 95970 (0.0007) [2023-03-07 11:52:02,924][175731] Updated weights for policy 0, policy_version 95980 (0.0007) [2023-03-07 11:52:03,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12801.7). Total num frames: 98287616. Throughput: 0: 12795.7. Samples: 98274650. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:52:03,322][175405] Avg episode reward: [(0, '28.637')] [2023-03-07 11:52:03,717][175731] Updated weights for policy 0, policy_version 95990 (0.0006) [2023-03-07 11:52:04,523][175731] Updated weights for policy 0, policy_version 96000 (0.0006) [2023-03-07 11:52:05,312][175731] Updated weights for policy 0, policy_version 96010 (0.0006) [2023-03-07 11:52:06,118][175731] Updated weights for policy 0, policy_version 96020 (0.0007) [2023-03-07 11:52:06,927][175731] Updated weights for policy 0, policy_version 96030 (0.0007) [2023-03-07 11:52:07,721][175731] Updated weights for policy 0, policy_version 96040 (0.0006) [2023-03-07 11:52:08,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12801.7). Total num frames: 98352128. Throughput: 0: 12798.4. Samples: 98351486. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:52:08,322][175405] Avg episode reward: [(0, '30.276')] [2023-03-07 11:52:08,535][175731] Updated weights for policy 0, policy_version 96050 (0.0006) [2023-03-07 11:52:09,320][175731] Updated weights for policy 0, policy_version 96060 (0.0007) [2023-03-07 11:52:10,133][175731] Updated weights for policy 0, policy_version 96070 (0.0007) [2023-03-07 11:52:10,932][175731] Updated weights for policy 0, policy_version 96080 (0.0007) [2023-03-07 11:52:11,738][175731] Updated weights for policy 0, policy_version 96090 (0.0006) [2023-03-07 11:52:12,532][175731] Updated weights for policy 0, policy_version 96100 (0.0006) [2023-03-07 11:52:13,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12801.7). Total num frames: 98415616. Throughput: 0: 12797.4. Samples: 98389868. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:52:13,322][175405] Avg episode reward: [(0, '27.625')] [2023-03-07 11:52:13,326][175731] Updated weights for policy 0, policy_version 96110 (0.0006) [2023-03-07 11:52:14,149][175731] Updated weights for policy 0, policy_version 96120 (0.0007) [2023-03-07 11:52:14,932][175731] Updated weights for policy 0, policy_version 96130 (0.0007) [2023-03-07 11:52:15,743][175731] Updated weights for policy 0, policy_version 96140 (0.0007) [2023-03-07 11:52:16,525][175731] Updated weights for policy 0, policy_version 96150 (0.0007) [2023-03-07 11:52:17,344][175731] Updated weights for policy 0, policy_version 96160 (0.0007) [2023-03-07 11:52:18,141][175731] Updated weights for policy 0, policy_version 96170 (0.0008) [2023-03-07 11:52:18,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12800.0, 300 sec: 12805.2). Total num frames: 98480128. Throughput: 0: 12800.3. Samples: 98466609. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:52:18,322][175405] Avg episode reward: [(0, '30.606')] [2023-03-07 11:52:18,946][175731] Updated weights for policy 0, policy_version 96180 (0.0007) [2023-03-07 11:52:19,771][175731] Updated weights for policy 0, policy_version 96190 (0.0007) [2023-03-07 11:52:20,558][175731] Updated weights for policy 0, policy_version 96200 (0.0006) [2023-03-07 11:52:21,362][175731] Updated weights for policy 0, policy_version 96210 (0.0006) [2023-03-07 11:52:22,146][175731] Updated weights for policy 0, policy_version 96220 (0.0006) [2023-03-07 11:52:22,966][175731] Updated weights for policy 0, policy_version 96230 (0.0006) [2023-03-07 11:52:23,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12800.0, 300 sec: 12801.7). Total num frames: 98543616. Throughput: 0: 12796.9. Samples: 98543172. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:52:23,322][175405] Avg episode reward: [(0, '28.614')] [2023-03-07 11:52:23,749][175731] Updated weights for policy 0, policy_version 96240 (0.0006) [2023-03-07 11:52:24,537][175731] Updated weights for policy 0, policy_version 96250 (0.0006) [2023-03-07 11:52:25,346][175731] Updated weights for policy 0, policy_version 96260 (0.0006) [2023-03-07 11:52:26,152][175731] Updated weights for policy 0, policy_version 96270 (0.0008) [2023-03-07 11:52:26,957][175731] Updated weights for policy 0, policy_version 96280 (0.0006) [2023-03-07 11:52:27,758][175731] Updated weights for policy 0, policy_version 96290 (0.0005) [2023-03-07 11:52:28,321][175405] Fps is (10 sec: 12697.7, 60 sec: 12782.9, 300 sec: 12801.7). Total num frames: 98607104. Throughput: 0: 12799.3. Samples: 98581586. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:52:28,322][175405] Avg episode reward: [(0, '28.939')] [2023-03-07 11:52:28,552][175731] Updated weights for policy 0, policy_version 96300 (0.0006) [2023-03-07 11:52:29,353][175731] Updated weights for policy 0, policy_version 96310 (0.0006) [2023-03-07 11:52:30,149][175731] Updated weights for policy 0, policy_version 96320 (0.0007) [2023-03-07 11:52:30,958][175731] Updated weights for policy 0, policy_version 96330 (0.0006) [2023-03-07 11:52:31,748][175731] Updated weights for policy 0, policy_version 96340 (0.0006) [2023-03-07 11:52:32,546][175731] Updated weights for policy 0, policy_version 96350 (0.0006) [2023-03-07 11:52:33,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12800.0, 300 sec: 12801.7). Total num frames: 98671616. Throughput: 0: 12796.7. Samples: 98658438. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:52:33,322][175405] Avg episode reward: [(0, '28.388')] [2023-03-07 11:52:33,358][175731] Updated weights for policy 0, policy_version 96360 (0.0007) [2023-03-07 11:52:34,163][175731] Updated weights for policy 0, policy_version 96370 (0.0007) [2023-03-07 11:52:34,960][175731] Updated weights for policy 0, policy_version 96380 (0.0007) [2023-03-07 11:52:35,759][175731] Updated weights for policy 0, policy_version 96390 (0.0007) [2023-03-07 11:52:36,574][175731] Updated weights for policy 0, policy_version 96400 (0.0007) [2023-03-07 11:52:37,359][175731] Updated weights for policy 0, policy_version 96410 (0.0007) [2023-03-07 11:52:38,189][175731] Updated weights for policy 0, policy_version 96420 (0.0006) [2023-03-07 11:52:38,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12782.9, 300 sec: 12801.7). Total num frames: 98735104. Throughput: 0: 12788.2. Samples: 98734849. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:52:38,322][175405] Avg episode reward: [(0, '28.512')] [2023-03-07 11:52:38,955][175731] Updated weights for policy 0, policy_version 96430 (0.0006) [2023-03-07 11:52:39,772][175731] Updated weights for policy 0, policy_version 96440 (0.0007) [2023-03-07 11:52:40,574][175731] Updated weights for policy 0, policy_version 96450 (0.0006) [2023-03-07 11:52:41,365][175731] Updated weights for policy 0, policy_version 96460 (0.0007) [2023-03-07 11:52:42,164][175731] Updated weights for policy 0, policy_version 96470 (0.0008) [2023-03-07 11:52:42,972][175731] Updated weights for policy 0, policy_version 96480 (0.0006) [2023-03-07 11:52:43,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12801.7). Total num frames: 98799616. Throughput: 0: 12781.6. Samples: 98773298. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:52:43,321][175405] Avg episode reward: [(0, '28.739')] [2023-03-07 11:52:43,752][175731] Updated weights for policy 0, policy_version 96490 (0.0007) [2023-03-07 11:52:44,570][175731] Updated weights for policy 0, policy_version 96500 (0.0007) [2023-03-07 11:52:45,371][175731] Updated weights for policy 0, policy_version 96510 (0.0006) [2023-03-07 11:52:46,166][175731] Updated weights for policy 0, policy_version 96520 (0.0006) [2023-03-07 11:52:46,976][175731] Updated weights for policy 0, policy_version 96530 (0.0006) [2023-03-07 11:52:47,772][175731] Updated weights for policy 0, policy_version 96540 (0.0006) [2023-03-07 11:52:48,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12782.9, 300 sec: 12801.7). Total num frames: 98863104. Throughput: 0: 12786.6. Samples: 98850050. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:52:48,322][175405] Avg episode reward: [(0, '28.351')] [2023-03-07 11:52:48,577][175731] Updated weights for policy 0, policy_version 96550 (0.0006) [2023-03-07 11:52:49,380][175731] Updated weights for policy 0, policy_version 96560 (0.0007) [2023-03-07 11:52:50,173][175731] Updated weights for policy 0, policy_version 96570 (0.0006) [2023-03-07 11:52:50,975][175731] Updated weights for policy 0, policy_version 96580 (0.0007) [2023-03-07 11:52:51,780][175731] Updated weights for policy 0, policy_version 96590 (0.0007) [2023-03-07 11:52:52,586][175731] Updated weights for policy 0, policy_version 96600 (0.0007) [2023-03-07 11:52:53,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12801.7). Total num frames: 98927616. Throughput: 0: 12786.0. Samples: 98926857. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 11:52:53,322][175405] Avg episode reward: [(0, '28.829')] [2023-03-07 11:52:53,373][175731] Updated weights for policy 0, policy_version 96610 (0.0006) [2023-03-07 11:52:54,177][175731] Updated weights for policy 0, policy_version 96620 (0.0006) [2023-03-07 11:52:54,973][175731] Updated weights for policy 0, policy_version 96630 (0.0007) [2023-03-07 11:52:55,777][175731] Updated weights for policy 0, policy_version 96640 (0.0007) [2023-03-07 11:52:56,574][175731] Updated weights for policy 0, policy_version 96650 (0.0006) [2023-03-07 11:52:57,363][175731] Updated weights for policy 0, policy_version 96660 (0.0007) [2023-03-07 11:52:58,162][175731] Updated weights for policy 0, policy_version 96670 (0.0007) [2023-03-07 11:52:58,321][175405] Fps is (10 sec: 12902.6, 60 sec: 12800.0, 300 sec: 12801.7). Total num frames: 98992128. Throughput: 0: 12792.3. Samples: 98965522. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:52:58,322][175405] Avg episode reward: [(0, '27.418')] [2023-03-07 11:52:58,967][175731] Updated weights for policy 0, policy_version 96680 (0.0007) [2023-03-07 11:52:59,774][175731] Updated weights for policy 0, policy_version 96690 (0.0006) [2023-03-07 11:53:00,578][175731] Updated weights for policy 0, policy_version 96700 (0.0007) [2023-03-07 11:53:01,386][175731] Updated weights for policy 0, policy_version 96710 (0.0006) [2023-03-07 11:53:02,190][175731] Updated weights for policy 0, policy_version 96720 (0.0006) [2023-03-07 11:53:02,989][175731] Updated weights for policy 0, policy_version 96730 (0.0007) [2023-03-07 11:53:03,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12801.7). Total num frames: 99055616. Throughput: 0: 12784.5. Samples: 99041911. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:53:03,322][175405] Avg episode reward: [(0, '28.608')] [2023-03-07 11:53:03,789][175731] Updated weights for policy 0, policy_version 96740 (0.0007) [2023-03-07 11:53:04,573][175731] Updated weights for policy 0, policy_version 96750 (0.0007) [2023-03-07 11:53:05,383][175731] Updated weights for policy 0, policy_version 96760 (0.0006) [2023-03-07 11:53:06,166][175731] Updated weights for policy 0, policy_version 96770 (0.0006) [2023-03-07 11:53:06,966][175731] Updated weights for policy 0, policy_version 96780 (0.0006) [2023-03-07 11:53:07,766][175731] Updated weights for policy 0, policy_version 96790 (0.0006) [2023-03-07 11:53:08,321][175405] Fps is (10 sec: 12697.6, 60 sec: 12782.9, 300 sec: 12801.7). Total num frames: 99119104. Throughput: 0: 12794.9. Samples: 99118941. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:53:08,321][175405] Avg episode reward: [(0, '28.942')] [2023-03-07 11:53:08,571][175731] Updated weights for policy 0, policy_version 96800 (0.0007) [2023-03-07 11:53:09,376][175731] Updated weights for policy 0, policy_version 96810 (0.0006) [2023-03-07 11:53:10,172][175731] Updated weights for policy 0, policy_version 96820 (0.0006) [2023-03-07 11:53:10,977][175731] Updated weights for policy 0, policy_version 96830 (0.0006) [2023-03-07 11:53:11,785][175731] Updated weights for policy 0, policy_version 96840 (0.0007) [2023-03-07 11:53:12,594][175731] Updated weights for policy 0, policy_version 96850 (0.0007) [2023-03-07 11:53:13,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12800.0, 300 sec: 12801.7). Total num frames: 99183616. Throughput: 0: 12792.3. Samples: 99157239. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:53:13,322][175405] Avg episode reward: [(0, '28.519')] [2023-03-07 11:53:13,352][175731] Updated weights for policy 0, policy_version 96860 (0.0006) [2023-03-07 11:53:14,161][175731] Updated weights for policy 0, policy_version 96870 (0.0007) [2023-03-07 11:53:14,978][175731] Updated weights for policy 0, policy_version 96880 (0.0005) [2023-03-07 11:53:15,766][175731] Updated weights for policy 0, policy_version 96890 (0.0007) [2023-03-07 11:53:16,558][175731] Updated weights for policy 0, policy_version 96900 (0.0006) [2023-03-07 11:53:17,371][175731] Updated weights for policy 0, policy_version 96910 (0.0007) [2023-03-07 11:53:18,141][175731] Updated weights for policy 0, policy_version 96920 (0.0006) [2023-03-07 11:53:18,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12800.0, 300 sec: 12805.2). Total num frames: 99248128. Throughput: 0: 12795.9. Samples: 99234253. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:53:18,322][175405] Avg episode reward: [(0, '29.069')] [2023-03-07 11:53:18,956][175731] Updated weights for policy 0, policy_version 96930 (0.0006) [2023-03-07 11:53:19,756][175731] Updated weights for policy 0, policy_version 96940 (0.0007) [2023-03-07 11:53:20,555][175731] Updated weights for policy 0, policy_version 96950 (0.0007) [2023-03-07 11:53:21,338][175731] Updated weights for policy 0, policy_version 96960 (0.0006) [2023-03-07 11:53:22,147][175731] Updated weights for policy 0, policy_version 96970 (0.0006) [2023-03-07 11:53:22,941][175731] Updated weights for policy 0, policy_version 96980 (0.0006) [2023-03-07 11:53:23,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12805.2). Total num frames: 99311616. Throughput: 0: 12807.4. Samples: 99311183. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:53:23,322][175405] Avg episode reward: [(0, '36.754')] [2023-03-07 11:53:23,762][175731] Updated weights for policy 0, policy_version 96990 (0.0006) [2023-03-07 11:53:24,560][175731] Updated weights for policy 0, policy_version 97000 (0.0006) [2023-03-07 11:53:25,351][175731] Updated weights for policy 0, policy_version 97010 (0.0006) [2023-03-07 11:53:26,159][175731] Updated weights for policy 0, policy_version 97020 (0.0006) [2023-03-07 11:53:26,969][175731] Updated weights for policy 0, policy_version 97030 (0.0006) [2023-03-07 11:53:27,768][175731] Updated weights for policy 0, policy_version 97040 (0.0007) [2023-03-07 11:53:28,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12817.1, 300 sec: 12805.2). Total num frames: 99376128. Throughput: 0: 12809.1. Samples: 99349708. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:53:28,321][175405] Avg episode reward: [(0, '31.645')] [2023-03-07 11:53:28,569][175731] Updated weights for policy 0, policy_version 97050 (0.0006) [2023-03-07 11:53:29,367][175731] Updated weights for policy 0, policy_version 97060 (0.0006) [2023-03-07 11:53:30,158][175731] Updated weights for policy 0, policy_version 97070 (0.0006) [2023-03-07 11:53:30,943][175731] Updated weights for policy 0, policy_version 97080 (0.0006) [2023-03-07 11:53:31,757][175731] Updated weights for policy 0, policy_version 97090 (0.0007) [2023-03-07 11:53:32,561][175731] Updated weights for policy 0, policy_version 97100 (0.0006) [2023-03-07 11:53:33,321][175405] Fps is (10 sec: 12800.1, 60 sec: 12800.0, 300 sec: 12805.2). Total num frames: 99439616. Throughput: 0: 12807.2. Samples: 99426371. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:53:33,321][175405] Avg episode reward: [(0, '30.270')] [2023-03-07 11:53:33,345][175731] Updated weights for policy 0, policy_version 97110 (0.0006) [2023-03-07 11:53:34,140][175731] Updated weights for policy 0, policy_version 97120 (0.0006) [2023-03-07 11:53:34,949][175731] Updated weights for policy 0, policy_version 97130 (0.0006) [2023-03-07 11:53:35,754][175731] Updated weights for policy 0, policy_version 97140 (0.0007) [2023-03-07 11:53:36,560][175731] Updated weights for policy 0, policy_version 97150 (0.0006) [2023-03-07 11:53:37,351][175731] Updated weights for policy 0, policy_version 97160 (0.0006) [2023-03-07 11:53:38,156][175731] Updated weights for policy 0, policy_version 97170 (0.0007) [2023-03-07 11:53:38,321][175405] Fps is (10 sec: 12799.8, 60 sec: 12817.1, 300 sec: 12805.2). Total num frames: 99504128. Throughput: 0: 12809.5. Samples: 99503288. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:53:38,322][175405] Avg episode reward: [(0, '27.905')] [2023-03-07 11:53:38,948][175731] Updated weights for policy 0, policy_version 97180 (0.0007) [2023-03-07 11:53:39,743][175731] Updated weights for policy 0, policy_version 97190 (0.0006) [2023-03-07 11:53:40,553][175731] Updated weights for policy 0, policy_version 97200 (0.0007) [2023-03-07 11:53:41,341][175731] Updated weights for policy 0, policy_version 97210 (0.0007) [2023-03-07 11:53:42,124][175731] Updated weights for policy 0, policy_version 97220 (0.0006) [2023-03-07 11:53:42,917][175731] Updated weights for policy 0, policy_version 97230 (0.0007) [2023-03-07 11:53:43,321][175405] Fps is (10 sec: 12902.4, 60 sec: 12817.1, 300 sec: 12805.2). Total num frames: 99568640. Throughput: 0: 12809.3. Samples: 99541940. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:53:43,321][175405] Avg episode reward: [(0, '29.602')] [2023-03-07 11:53:43,721][175731] Updated weights for policy 0, policy_version 97240 (0.0007) [2023-03-07 11:53:44,528][175731] Updated weights for policy 0, policy_version 97250 (0.0006) [2023-03-07 11:53:45,325][175731] Updated weights for policy 0, policy_version 97260 (0.0006) [2023-03-07 11:53:46,105][175731] Updated weights for policy 0, policy_version 97270 (0.0006) [2023-03-07 11:53:46,913][175731] Updated weights for policy 0, policy_version 97280 (0.0006) [2023-03-07 11:53:47,702][175731] Updated weights for policy 0, policy_version 97290 (0.0006) [2023-03-07 11:53:48,321][175405] Fps is (10 sec: 12800.2, 60 sec: 12817.1, 300 sec: 12805.2). Total num frames: 99632128. Throughput: 0: 12822.3. Samples: 99618914. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:53:48,321][175405] Avg episode reward: [(0, '29.170')] [2023-03-07 11:53:48,326][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000097297_99632128.pth... 
[2023-03-07 11:53:48,356][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000094296_96559104.pth [2023-03-07 11:53:48,512][175731] Updated weights for policy 0, policy_version 97300 (0.0007) [2023-03-07 11:53:49,317][175731] Updated weights for policy 0, policy_version 97310 (0.0006) [2023-03-07 11:53:50,097][175731] Updated weights for policy 0, policy_version 97320 (0.0007) [2023-03-07 11:53:50,901][175731] Updated weights for policy 0, policy_version 97330 (0.0007) [2023-03-07 11:53:51,702][175731] Updated weights for policy 0, policy_version 97340 (0.0006) [2023-03-07 11:53:52,492][175731] Updated weights for policy 0, policy_version 97350 (0.0006) [2023-03-07 11:53:53,282][175731] Updated weights for policy 0, policy_version 97360 (0.0006) [2023-03-07 11:53:53,321][175405] Fps is (10 sec: 12799.9, 60 sec: 12817.1, 300 sec: 12808.7). Total num frames: 99696640. Throughput: 0: 12827.0. Samples: 99696157. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:53:53,321][175405] Avg episode reward: [(0, '28.085')] [2023-03-07 11:53:54,073][175731] Updated weights for policy 0, policy_version 97370 (0.0006) [2023-03-07 11:53:54,891][175731] Updated weights for policy 0, policy_version 97380 (0.0006) [2023-03-07 11:53:55,693][175731] Updated weights for policy 0, policy_version 97390 (0.0006) [2023-03-07 11:53:56,501][175731] Updated weights for policy 0, policy_version 97400 (0.0007) [2023-03-07 11:53:57,302][175731] Updated weights for policy 0, policy_version 97410 (0.0007) [2023-03-07 11:53:58,122][175731] Updated weights for policy 0, policy_version 97420 (0.0005) [2023-03-07 11:53:58,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12805.2). Total num frames: 99760128. Throughput: 0: 12824.4. Samples: 99734337. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:53:58,321][175405] Avg episode reward: [(0, '28.017')] [2023-03-07 11:53:58,921][175731] Updated weights for policy 0, policy_version 97430 (0.0006) [2023-03-07 11:53:59,743][175731] Updated weights for policy 0, policy_version 97440 (0.0006) [2023-03-07 11:54:00,541][175731] Updated weights for policy 0, policy_version 97450 (0.0006) [2023-03-07 11:54:01,317][175731] Updated weights for policy 0, policy_version 97460 (0.0006) [2023-03-07 11:54:02,102][175731] Updated weights for policy 0, policy_version 97470 (0.0006) [2023-03-07 11:54:02,903][175731] Updated weights for policy 0, policy_version 97480 (0.0006) [2023-03-07 11:54:03,321][175405] Fps is (10 sec: 12697.5, 60 sec: 12800.0, 300 sec: 12801.7). Total num frames: 99823616. Throughput: 0: 12814.1. Samples: 99810889. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:54:03,322][175405] Avg episode reward: [(0, '27.893')] [2023-03-07 11:54:03,708][175731] Updated weights for policy 0, policy_version 97490 (0.0007) [2023-03-07 11:54:04,506][175731] Updated weights for policy 0, policy_version 97500 (0.0006) [2023-03-07 11:54:05,309][175731] Updated weights for policy 0, policy_version 97510 (0.0008) [2023-03-07 11:54:06,115][175731] Updated weights for policy 0, policy_version 97520 (0.0006) [2023-03-07 11:54:06,913][175731] Updated weights for policy 0, policy_version 97530 (0.0007) [2023-03-07 11:54:07,722][175731] Updated weights for policy 0, policy_version 97540 (0.0006) [2023-03-07 11:54:08,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12817.1, 300 sec: 12805.2). Total num frames: 99888128. Throughput: 0: 12808.6. Samples: 99887568. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:54:08,322][175405] Avg episode reward: [(0, '30.295')] [2023-03-07 11:54:08,522][175731] Updated weights for policy 0, policy_version 97550 (0.0006) [2023-03-07 11:54:09,332][175731] Updated weights for policy 0, policy_version 97560 (0.0006) [2023-03-07 11:54:10,119][175731] Updated weights for policy 0, policy_version 97570 (0.0006) [2023-03-07 11:54:10,928][175731] Updated weights for policy 0, policy_version 97580 (0.0007) [2023-03-07 11:54:11,726][175731] Updated weights for policy 0, policy_version 97590 (0.0006) [2023-03-07 11:54:12,543][175731] Updated weights for policy 0, policy_version 97600 (0.0006) [2023-03-07 11:54:13,321][175405] Fps is (10 sec: 12800.0, 60 sec: 12800.0, 300 sec: 12805.2). Total num frames: 99951616. Throughput: 0: 12807.0. Samples: 99926026. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 11:54:13,322][175405] Avg episode reward: [(0, '29.416')] [2023-03-07 11:54:13,336][175731] Updated weights for policy 0, policy_version 97610 (0.0006) [2023-03-07 11:54:14,141][175731] Updated weights for policy 0, policy_version 97620 (0.0006) [2023-03-07 11:54:14,946][175731] Updated weights for policy 0, policy_version 97630 (0.0006) [2023-03-07 11:54:15,742][175731] Updated weights for policy 0, policy_version 97640 (0.0006) [2023-03-07 11:54:16,535][175731] Updated weights for policy 0, policy_version 97650 (0.0007) [2023-03-07 11:54:17,178][176356] Stopping RolloutWorker_w30... [2023-03-07 11:54:17,178][176158] Stopping RolloutWorker_w23... [2023-03-07 11:54:17,178][176356] Loop rollout_proc30_evt_loop terminating... [2023-03-07 11:54:17,178][175862] Stopping RolloutWorker_w15... [2023-03-07 11:54:17,178][176158] Loop rollout_proc23_evt_loop terminating... [2023-03-07 11:54:17,178][175680] Stopping Batcher_0... [2023-03-07 11:54:17,178][175873] Stopping RolloutWorker_w14... [2023-03-07 11:54:17,178][175860] Stopping RolloutWorker_w0... 
[2023-03-07 11:54:17,178][175680] Loop batcher_evt_loop terminating... [2023-03-07 11:54:17,178][175868] Stopping RolloutWorker_w18... [2023-03-07 11:54:17,178][175862] Loop rollout_proc15_evt_loop terminating... [2023-03-07 11:54:17,178][175873] Loop rollout_proc14_evt_loop terminating... [2023-03-07 11:54:17,178][176161] Stopping RolloutWorker_w26... [2023-03-07 11:54:17,178][175863] Stopping RolloutWorker_w16... [2023-03-07 11:54:17,178][175865] Stopping RolloutWorker_w2... [2023-03-07 11:54:17,179][175860] Loop rollout_proc0_evt_loop terminating... [2023-03-07 11:54:17,179][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000097658_100001792.pth... [2023-03-07 11:54:17,179][175868] Loop rollout_proc18_evt_loop terminating... [2023-03-07 11:54:17,179][176161] Loop rollout_proc26_evt_loop terminating... [2023-03-07 11:54:17,179][175864] Stopping RolloutWorker_w10... [2023-03-07 11:54:17,179][176125] Stopping RolloutWorker_w21... [2023-03-07 11:54:17,179][175865] Loop rollout_proc2_evt_loop terminating... [2023-03-07 11:54:17,179][175863] Loop rollout_proc16_evt_loop terminating... [2023-03-07 11:54:17,179][175864] Loop rollout_proc10_evt_loop terminating... [2023-03-07 11:54:17,179][176125] Loop rollout_proc21_evt_loop terminating... [2023-03-07 11:54:17,179][175861] Stopping RolloutWorker_w6... [2023-03-07 11:54:17,179][175733] Stopping RolloutWorker_w3... [2023-03-07 11:54:17,179][176358] Stopping RolloutWorker_w29... [2023-03-07 11:54:17,178][175405] Component RolloutWorker_w30 stopped! [2023-03-07 11:54:17,179][175932] Stopping RolloutWorker_w7... [2023-03-07 11:54:17,179][175867] Stopping RolloutWorker_w17... [2023-03-07 11:54:17,179][175732] Stopping RolloutWorker_w1... [2023-03-07 11:54:17,179][176319] Stopping RolloutWorker_w31... [2023-03-07 11:54:17,179][176321] Stopping RolloutWorker_w25... [2023-03-07 11:54:17,179][176036] Stopping RolloutWorker_w19... 
[2023-03-07 11:54:17,179][176355] Stopping RolloutWorker_w28... [2023-03-07 11:54:17,179][176218] Stopping RolloutWorker_w24... [2023-03-07 11:54:17,179][176294] Stopping RolloutWorker_w27... [2023-03-07 11:54:17,179][175861] Loop rollout_proc6_evt_loop terminating... [2023-03-07 11:54:17,179][175733] Loop rollout_proc3_evt_loop terminating... [2023-03-07 11:54:17,179][176358] Loop rollout_proc29_evt_loop terminating... [2023-03-07 11:54:17,179][175932] Loop rollout_proc7_evt_loop terminating... [2023-03-07 11:54:17,179][175871] Stopping RolloutWorker_w12... [2023-03-07 11:54:17,179][175867] Loop rollout_proc17_evt_loop terminating... [2023-03-07 11:54:17,179][175732] Loop rollout_proc1_evt_loop terminating... [2023-03-07 11:54:17,179][176319] Loop rollout_proc31_evt_loop terminating... [2023-03-07 11:54:17,179][176321] Loop rollout_proc25_evt_loop terminating... [2023-03-07 11:54:17,179][175869] Stopping RolloutWorker_w9... [2023-03-07 11:54:17,179][176036] Loop rollout_proc19_evt_loop terminating... [2023-03-07 11:54:17,179][176110] Stopping RolloutWorker_w20... [2023-03-07 11:54:17,179][176355] Loop rollout_proc28_evt_loop terminating... [2023-03-07 11:54:17,179][176294] Loop rollout_proc27_evt_loop terminating... [2023-03-07 11:54:17,179][175405] Component RolloutWorker_w23 stopped! [2023-03-07 11:54:17,179][176218] Loop rollout_proc24_evt_loop terminating... [2023-03-07 11:54:17,179][175734] Stopping RolloutWorker_w4... [2023-03-07 11:54:17,179][175870] Stopping RolloutWorker_w13... [2023-03-07 11:54:17,180][175869] Loop rollout_proc9_evt_loop terminating... [2023-03-07 11:54:17,180][176110] Loop rollout_proc20_evt_loop terminating... [2023-03-07 11:54:17,180][175405] Component Batcher_0 stopped! [2023-03-07 11:54:17,180][175734] Loop rollout_proc4_evt_loop terminating... [2023-03-07 11:54:17,180][175870] Loop rollout_proc13_evt_loop terminating... [2023-03-07 11:54:17,180][176126] Stopping RolloutWorker_w22... 
[2023-03-07 11:54:17,180][175405] Component RolloutWorker_w15 stopped! [2023-03-07 11:54:17,180][176126] Loop rollout_proc22_evt_loop terminating... [2023-03-07 11:54:17,181][175871] Loop rollout_proc12_evt_loop terminating... [2023-03-07 11:54:17,181][175405] Component RolloutWorker_w14 stopped! [2023-03-07 11:54:17,181][175405] Component RolloutWorker_w0 stopped! [2023-03-07 11:54:17,181][175405] Component RolloutWorker_w18 stopped! [2023-03-07 11:54:17,182][175405] Component RolloutWorker_w26 stopped! [2023-03-07 11:54:17,182][175405] Component RolloutWorker_w16 stopped! [2023-03-07 11:54:17,182][175405] Component RolloutWorker_w2 stopped! [2023-03-07 11:54:17,183][175405] Component RolloutWorker_w10 stopped! [2023-03-07 11:54:17,183][175405] Component RolloutWorker_w21 stopped! [2023-03-07 11:54:17,183][175405] Component RolloutWorker_w6 stopped! [2023-03-07 11:54:17,184][175405] Component RolloutWorker_w29 stopped! [2023-03-07 11:54:17,184][175405] Component RolloutWorker_w3 stopped! [2023-03-07 11:54:17,184][175405] Component RolloutWorker_w7 stopped! [2023-03-07 11:54:17,184][175859] Stopping RolloutWorker_w5... [2023-03-07 11:54:17,185][175859] Loop rollout_proc5_evt_loop terminating... [2023-03-07 11:54:17,185][175405] Component RolloutWorker_w17 stopped! [2023-03-07 11:54:17,185][175405] Component RolloutWorker_w31 stopped! [2023-03-07 11:54:17,185][175405] Component RolloutWorker_w1 stopped! [2023-03-07 11:54:17,185][175405] Component RolloutWorker_w25 stopped! [2023-03-07 11:54:17,186][175405] Component RolloutWorker_w19 stopped! [2023-03-07 11:54:17,186][175405] Component RolloutWorker_w28 stopped! [2023-03-07 11:54:17,186][175405] Component RolloutWorker_w24 stopped! [2023-03-07 11:54:17,187][175405] Component RolloutWorker_w27 stopped! [2023-03-07 11:54:17,187][175405] Component RolloutWorker_w12 stopped! [2023-03-07 11:54:17,187][175405] Component RolloutWorker_w9 stopped! [2023-03-07 11:54:17,188][175405] Component RolloutWorker_w20 stopped! 
[2023-03-07 11:54:17,188][175405] Component RolloutWorker_w4 stopped! [2023-03-07 11:54:17,188][175405] Component RolloutWorker_w13 stopped! [2023-03-07 11:54:17,189][175405] Component RolloutWorker_w22 stopped! [2023-03-07 11:54:17,194][175872] Stopping RolloutWorker_w8... [2023-03-07 11:54:17,195][175872] Loop rollout_proc8_evt_loop terminating... [2023-03-07 11:54:17,189][175405] Component RolloutWorker_w5 stopped! [2023-03-07 11:54:17,196][175866] Stopping RolloutWorker_w11... [2023-03-07 11:54:17,196][175866] Loop rollout_proc11_evt_loop terminating... [2023-03-07 11:54:17,196][175405] Component RolloutWorker_w8 stopped! [2023-03-07 11:54:17,197][175405] Component RolloutWorker_w11 stopped! [2023-03-07 11:54:17,252][175731] Weights refcount: 2 0 [2023-03-07 11:54:17,254][175731] Stopping InferenceWorker_p0-w0... [2023-03-07 11:54:17,254][175731] Loop inference_proc0-0_evt_loop terminating... [2023-03-07 11:54:17,255][175405] Component InferenceWorker_p0-w0 stopped! [2023-03-07 11:54:17,300][175680] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000095797_98096128.pth [2023-03-07 11:54:17,308][175680] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/coffee-pull-v2/checkpoint_p0/checkpoint_000097658_100001792.pth... [2023-03-07 11:54:17,395][175680] Stopping LearnerWorker_p0... [2023-03-07 11:54:17,395][175680] Loop learner_proc0_evt_loop terminating... [2023-03-07 11:54:17,395][175405] Component LearnerWorker_p0 stopped! [2023-03-07 11:54:17,396][175405] Waiting for process learner_proc0 to stop... [2023-03-07 11:54:18,562][175405] Waiting for process inference_proc0-0 to join... [2023-03-07 11:54:18,562][175405] Waiting for process rollout_proc0 to join... [2023-03-07 11:54:18,563][175405] Waiting for process rollout_proc1 to join... [2023-03-07 11:54:18,563][175405] Waiting for process rollout_proc2 to join... [2023-03-07 11:54:18,563][175405] Waiting for process rollout_proc3 to join... 
[2023-03-07 11:54:18,563][175405] Waiting for process rollout_proc4 to join... [2023-03-07 11:54:18,563][175405] Waiting for process rollout_proc5 to join... [2023-03-07 11:54:18,564][175405] Waiting for process rollout_proc6 to join... [2023-03-07 11:54:18,564][175405] Waiting for process rollout_proc7 to join... [2023-03-07 11:54:18,564][175405] Waiting for process rollout_proc8 to join... [2023-03-07 11:54:18,564][175405] Waiting for process rollout_proc9 to join... [2023-03-07 11:54:18,565][175405] Waiting for process rollout_proc10 to join... [2023-03-07 11:54:18,565][175405] Waiting for process rollout_proc11 to join... [2023-03-07 11:54:18,565][175405] Waiting for process rollout_proc12 to join... [2023-03-07 11:54:18,565][175405] Waiting for process rollout_proc13 to join... [2023-03-07 11:54:18,566][175405] Waiting for process rollout_proc14 to join... [2023-03-07 11:54:18,566][175405] Waiting for process rollout_proc15 to join... [2023-03-07 11:54:18,566][175405] Waiting for process rollout_proc16 to join... [2023-03-07 11:54:18,566][175405] Waiting for process rollout_proc17 to join... [2023-03-07 11:54:18,567][175405] Waiting for process rollout_proc18 to join... [2023-03-07 11:54:18,567][175405] Waiting for process rollout_proc19 to join... [2023-03-07 11:54:18,567][175405] Waiting for process rollout_proc20 to join... [2023-03-07 11:54:18,567][175405] Waiting for process rollout_proc21 to join... [2023-03-07 11:54:18,567][175405] Waiting for process rollout_proc22 to join... [2023-03-07 11:54:18,568][175405] Waiting for process rollout_proc23 to join... [2023-03-07 11:54:18,568][175405] Waiting for process rollout_proc24 to join... [2023-03-07 11:54:18,568][175405] Waiting for process rollout_proc25 to join... [2023-03-07 11:54:18,568][175405] Waiting for process rollout_proc26 to join... [2023-03-07 11:54:18,569][175405] Waiting for process rollout_proc27 to join... [2023-03-07 11:54:18,569][175405] Waiting for process rollout_proc28 to join... 
[2023-03-07 11:54:18,569][175405] Waiting for process rollout_proc29 to join...
[2023-03-07 11:54:18,569][175405] Waiting for process rollout_proc30 to join...
[2023-03-07 11:54:18,570][175405] Waiting for process rollout_proc31 to join...
[2023-03-07 11:54:18,570][175405] Batcher 0 profile tree view:
batching: 801.5629, releasing_batches: 1.5819
[2023-03-07 11:54:18,570][175405] InferenceWorker_p0-w0 profile tree view:
wait_policy: 0.0001
  wait_policy_total: 233.0864
update_model: 139.0945
  weight_update: 0.0006
one_step: 0.0118
  handle_policy_step: 7065.3238
    deserialize: 214.3368, stack: 36.7552, obs_to_device_normalize: 1249.4361, forward: 3186.2621, send_messages: 1373.4812
    prepare_outputs: 729.1373
      to_cpu: 370.0511
[2023-03-07 11:54:18,570][175405] Learner 0 profile tree view:
misc: 0.4541, prepare_batch: 404.8230
train: 904.4702
  epoch_init: 0.3782, minibatch_init: 0.3737, losses_postprocess: 31.2905, kl_divergence: 35.2230, after_optimizer: 102.4214
  calculate_losses: 296.3223
    losses_init: 0.2044, forward_head: 16.5745, bptt_initial: 107.7690, tail: 59.9175, advantages_returns: 7.4067, losses: 28.0440
    bptt: 67.6480
      bptt_forward_core: 65.2484
  update: 415.8446
    clip: 55.7563
[2023-03-07 11:54:18,570][175405] RolloutWorker_w0 profile tree view:
wait_for_trajectories: 3.7165, enqueue_policy_requests: 176.0482, env_step: 3099.4928, overhead: 159.3952, complete_rollouts: 9.2153
save_policy_outputs: 209.4118
  split_output_tensors: 103.9349
[2023-03-07 11:54:18,570][175405] RolloutWorker_w31 profile tree view:
wait_for_trajectories: 3.8535, enqueue_policy_requests: 184.6282, env_step: 3161.4707, overhead: 162.3947, complete_rollouts: 9.3185
save_policy_outputs: 217.1368
  split_output_tensors: 106.8966
[2023-03-07 11:54:18,571][175405] Loop Runner_EvtLoop terminating...
[2023-03-07 11:54:18,571][175405] Runner profile tree view:
main_loop: 7827.1158
[2023-03-07 11:54:18,571][175405] Collected {0: 100001792}, FPS: 12776.3