[2023-03-06 23:09:59,132][81074] Saving configuration to /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/config.json... [2023-03-06 23:09:59,146][81074] Rollout worker 0 uses device cpu [2023-03-06 23:09:59,146][81074] Rollout worker 1 uses device cpu [2023-03-06 23:09:59,147][81074] Rollout worker 2 uses device cpu [2023-03-06 23:09:59,147][81074] Rollout worker 3 uses device cpu [2023-03-06 23:09:59,147][81074] Rollout worker 4 uses device cpu [2023-03-06 23:09:59,147][81074] Rollout worker 5 uses device cpu [2023-03-06 23:09:59,147][81074] Rollout worker 6 uses device cpu [2023-03-06 23:09:59,147][81074] Rollout worker 7 uses device cpu [2023-03-06 23:09:59,147][81074] Rollout worker 8 uses device cpu [2023-03-06 23:09:59,147][81074] Rollout worker 9 uses device cpu [2023-03-06 23:09:59,147][81074] Rollout worker 10 uses device cpu [2023-03-06 23:09:59,148][81074] Rollout worker 11 uses device cpu [2023-03-06 23:09:59,148][81074] Rollout worker 12 uses device cpu [2023-03-06 23:09:59,148][81074] Rollout worker 13 uses device cpu [2023-03-06 23:09:59,148][81074] Rollout worker 14 uses device cpu [2023-03-06 23:09:59,148][81074] Rollout worker 15 uses device cpu [2023-03-06 23:09:59,148][81074] Rollout worker 16 uses device cpu [2023-03-06 23:09:59,148][81074] Rollout worker 17 uses device cpu [2023-03-06 23:09:59,148][81074] Rollout worker 18 uses device cpu [2023-03-06 23:09:59,149][81074] Rollout worker 19 uses device cpu [2023-03-06 23:09:59,149][81074] Rollout worker 20 uses device cpu [2023-03-06 23:09:59,149][81074] Rollout worker 21 uses device cpu [2023-03-06 23:09:59,149][81074] Rollout worker 22 uses device cpu [2023-03-06 23:09:59,149][81074] Rollout worker 23 uses device cpu [2023-03-06 23:09:59,149][81074] Rollout worker 24 uses device cpu [2023-03-06 23:09:59,149][81074] Rollout worker 25 uses device cpu [2023-03-06 23:09:59,149][81074] Rollout worker 26 uses device cpu [2023-03-06 23:09:59,149][81074] Rollout worker 27 
uses device cpu [2023-03-06 23:09:59,150][81074] Rollout worker 28 uses device cpu [2023-03-06 23:09:59,150][81074] Rollout worker 29 uses device cpu [2023-03-06 23:09:59,150][81074] Rollout worker 30 uses device cpu [2023-03-06 23:09:59,150][81074] Rollout worker 31 uses device cpu [2023-03-06 23:09:59,163][81074] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-03-06 23:09:59,163][81074] InferenceWorker_p0-w0: min num requests: 10 [2023-03-06 23:09:59,239][81074] Starting all processes... [2023-03-06 23:09:59,240][81074] Starting process learner_proc0 [2023-03-06 23:09:59,289][81074] Starting all processes... [2023-03-06 23:09:59,337][81074] Starting process inference_proc0-0 [2023-03-06 23:09:59,345][81074] Starting process rollout_proc0 [2023-03-06 23:09:59,346][81074] Starting process rollout_proc1 [2023-03-06 23:09:59,346][81074] Starting process rollout_proc2 [2023-03-06 23:09:59,346][81074] Starting process rollout_proc3 [2023-03-06 23:09:59,346][81074] Starting process rollout_proc4 [2023-03-06 23:09:59,346][81074] Starting process rollout_proc5 [2023-03-06 23:09:59,346][81074] Starting process rollout_proc6 [2023-03-06 23:09:59,346][81074] Starting process rollout_proc7 [2023-03-06 23:09:59,347][81074] Starting process rollout_proc8 [2023-03-06 23:09:59,347][81074] Starting process rollout_proc9 [2023-03-06 23:09:59,347][81074] Starting process rollout_proc10 [2023-03-06 23:09:59,350][81074] Starting process rollout_proc11 [2023-03-06 23:09:59,350][81074] Starting process rollout_proc12 [2023-03-06 23:09:59,357][81074] Starting process rollout_proc13 [2023-03-06 23:09:59,358][81074] Starting process rollout_proc14 [2023-03-06 23:09:59,365][81074] Starting process rollout_proc15 [2023-03-06 23:09:59,365][81074] Starting process rollout_proc16 [2023-03-06 23:09:59,366][81074] Starting process rollout_proc17 [2023-03-06 23:09:59,378][81074] Starting process rollout_proc18 [2023-03-06 23:09:59,379][81074] Starting process rollout_proc19 
[2023-03-06 23:09:59,384][81074] Starting process rollout_proc20 [2023-03-06 23:09:59,395][81074] Starting process rollout_proc21 [2023-03-06 23:09:59,485][81074] Starting process rollout_proc22 [2023-03-06 23:09:59,489][81074] Starting process rollout_proc23 [2023-03-06 23:09:59,509][81074] Starting process rollout_proc24 [2023-03-06 23:09:59,515][81074] Starting process rollout_proc25 [2023-03-06 23:09:59,529][81074] Starting process rollout_proc26 [2023-03-06 23:09:59,529][81074] Starting process rollout_proc27 [2023-03-06 23:09:59,537][81074] Starting process rollout_proc28 [2023-03-06 23:09:59,537][81074] Starting process rollout_proc29 [2023-03-06 23:09:59,538][81074] Starting process rollout_proc30 [2023-03-06 23:09:59,546][81074] Starting process rollout_proc31 [2023-03-06 23:10:01,312][81349] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-03-06 23:10:01,312][81349] Set environment var CUDA_VISIBLE_DEVICES to '0' (GPU indices [0]) for learning process 0 [2023-03-06 23:10:01,322][81349] Num visible devices: 1 [2023-03-06 23:10:01,348][81349] WARNING! It is generally recommended to enable Fixed KL loss (https://arxiv.org/pdf/1707.06347.pdf) for continuous action tasks to avoid potential numerical issues. I.e. 
set --kl_loss_coeff=0.1 [2023-03-06 23:10:01,349][81349] Starting seed is not provided [2023-03-06 23:10:01,349][81349] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-03-06 23:10:01,349][81349] Initializing actor-critic model on device cuda:0 [2023-03-06 23:10:01,349][81349] RunningMeanStd input shape: (39,) [2023-03-06 23:10:01,350][81349] RunningMeanStd input shape: (1,) [2023-03-06 23:10:01,383][81402] Worker 2 uses CPU cores [2] [2023-03-06 23:10:01,463][81400] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-03-06 23:10:01,464][81400] Set environment var CUDA_VISIBLE_DEVICES to '0' (GPU indices [0]) for inference process 0 [2023-03-06 23:10:01,474][81400] Num visible devices: 1 [2023-03-06 23:10:01,531][81349] Created Actor Critic model with architecture: [2023-03-06 23:10:01,531][81349] ActorCriticSharedWeights( (obs_normalizer): ObservationNormalizer( (running_mean_std): RunningMeanStdDictInPlace( (running_mean_std): ModuleDict( (obs): RunningMeanStdInPlace() ) ) ) (returns_normalizer): RecursiveScriptModule(original_name=RunningMeanStdInPlace) (encoder): MultiInputEncoder( (encoders): ModuleDict( (obs): MlpEncoder( (mlp_head): RecursiveScriptModule( original_name=Sequential (0): RecursiveScriptModule(original_name=Linear) (1): RecursiveScriptModule(original_name=ELU) (2): RecursiveScriptModule(original_name=Linear) (3): RecursiveScriptModule(original_name=ELU) ) ) ) ) (core): ModelCoreRNN( (core): GRU(512, 512) ) (decoder): MlpDecoder( (mlp): Identity() ) (critic_linear): Linear(in_features=512, out_features=1, bias=True) (action_parameterization): ActionParameterizationDefault( (distribution_linear): Linear(in_features=512, out_features=8, bias=True) ) ) [2023-03-06 23:10:01,630][81640] Worker 15 uses CPU cores [15] [2023-03-06 23:10:01,674][81639] Worker 10 uses CPU cores [10] [2023-03-06 23:10:01,737][81846] Worker 31 uses CPU cores [31] [2023-03-06 23:10:01,907][81405] Worker 4 uses CPU cores [4] [2023-03-06 
23:10:02,003][81643] Worker 9 uses CPU cores [9] [2023-03-06 23:10:02,019][81637] Worker 7 uses CPU cores [7] [2023-03-06 23:10:02,143][81740] Worker 28 uses CPU cores [28] [2023-03-06 23:10:02,190][81868] Worker 25 uses CPU cores [25] [2023-03-06 23:10:02,335][81755] Worker 29 uses CPU cores [29] [2023-03-06 23:10:02,355][81401] Worker 1 uses CPU cores [1] [2023-03-06 23:10:02,582][81602] Worker 21 uses CPU cores [21] [2023-03-06 23:10:02,735][81403] Worker 0 uses CPU cores [0] [2023-03-06 23:10:02,861][81566] Worker 11 uses CPU cores [11] [2023-03-06 23:10:02,963][81641] Worker 23 uses CPU cores [23] [2023-03-06 23:10:03,088][81349] Using optimizer [2023-03-06 23:10:03,089][81349] No checkpoints found [2023-03-06 23:10:03,089][81349] Did not load from checkpoint, starting from scratch! [2023-03-06 23:10:03,089][81349] Initialized policy 0 weights for model version 0 [2023-03-06 23:10:03,090][81349] LearnerWorker_p0 finished initialization! [2023-03-06 23:10:03,090][81349] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2023-03-06 23:10:03,133][81644] Worker 24 uses CPU cores [24] [2023-03-06 23:10:03,157][81400] RunningMeanStd input shape: (39,) [2023-03-06 23:10:03,158][81400] RunningMeanStd input shape: (1,) [2023-03-06 23:10:03,191][81567] Worker 18 uses CPU cores [18] [2023-03-06 23:10:03,233][81565] Worker 16 uses CPU cores [16] [2023-03-06 23:10:03,244][81636] Worker 22 uses CPU cores [22] [2023-03-06 23:10:03,495][81738] Worker 26 uses CPU cores [26] [2023-03-06 23:10:03,628][81555] Worker 20 uses CPU cores [20] [2023-03-06 23:10:03,738][81404] Worker 3 uses CPU cores [3] [2023-03-06 23:10:03,803][81437] Worker 5 uses CPU cores [5] [2023-03-06 23:10:03,850][81845] Worker 30 uses CPU cores [30] [2023-03-06 23:10:03,998][81601] Worker 8 uses CPU cores [8] [2023-03-06 23:10:04,005][81074] Inference worker 0-0 is ready! [2023-03-06 23:10:04,006][81074] All inference workers are ready! Signal rollout workers to start! 
[2023-03-06 23:10:04,171][81553] Worker 12 uses CPU cores [12] [2023-03-06 23:10:04,447][81603] Worker 14 uses CPU cores [14] [2023-03-06 23:10:04,538][81600] Worker 13 uses CPU cores [13] [2023-03-06 23:10:04,573][81564] Worker 6 uses CPU cores [6] [2023-03-06 23:10:04,743][81604] Worker 17 uses CPU cores [17] [2023-03-06 23:10:04,744][81739] Worker 27 uses CPU cores [27] [2023-03-06 23:10:05,185][81599] Worker 19 uses CPU cores [19] [2023-03-06 23:10:05,258][81404] Decorrelating experience for 0 frames... [2023-03-06 23:10:05,373][81401] Decorrelating experience for 0 frames... [2023-03-06 23:10:05,423][81636] Decorrelating experience for 0 frames... [2023-03-06 23:10:05,424][81644] Decorrelating experience for 0 frames... [2023-03-06 23:10:05,480][81755] Decorrelating experience for 0 frames... [2023-03-06 23:10:05,590][81640] Decorrelating experience for 0 frames... [2023-03-06 23:10:05,594][81639] Decorrelating experience for 0 frames... [2023-03-06 23:10:05,618][81740] Decorrelating experience for 0 frames... [2023-03-06 23:10:05,648][81565] Decorrelating experience for 0 frames... [2023-03-06 23:10:05,650][81555] Decorrelating experience for 0 frames... [2023-03-06 23:10:05,650][81405] Decorrelating experience for 0 frames... [2023-03-06 23:10:05,660][81641] Decorrelating experience for 0 frames... [2023-03-06 23:10:05,660][81643] Decorrelating experience for 0 frames... [2023-03-06 23:10:05,660][81868] Decorrelating experience for 0 frames... [2023-03-06 23:10:05,662][81403] Decorrelating experience for 0 frames... [2023-03-06 23:10:05,665][81567] Decorrelating experience for 0 frames... [2023-03-06 23:10:05,665][81402] Decorrelating experience for 0 frames... [2023-03-06 23:10:05,675][81846] Decorrelating experience for 0 frames... [2023-03-06 23:10:05,679][81738] Decorrelating experience for 0 frames... [2023-03-06 23:10:05,680][81437] Decorrelating experience for 0 frames... [2023-03-06 23:10:05,680][81602] Decorrelating experience for 0 frames... 
[2023-03-06 23:10:05,682][81566] Decorrelating experience for 0 frames... [2023-03-06 23:10:05,720][81637] Decorrelating experience for 0 frames... [2023-03-06 23:10:05,867][81845] Decorrelating experience for 0 frames... [2023-03-06 23:10:05,897][81553] Decorrelating experience for 0 frames... [2023-03-06 23:10:05,910][81601] Decorrelating experience for 0 frames... [2023-03-06 23:10:06,159][81603] Decorrelating experience for 0 frames... [2023-03-06 23:10:06,237][81074] Fps is (10 sec: nan, 60 sec: nan, 300 sec: nan). Total num frames: 0. Throughput: 0: nan. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0) [2023-03-06 23:10:06,452][81600] Decorrelating experience for 0 frames... [2023-03-06 23:10:06,574][81564] Decorrelating experience for 0 frames... [2023-03-06 23:10:06,710][81604] Decorrelating experience for 0 frames... [2023-03-06 23:10:06,764][81404] Decorrelating experience for 32 frames... [2023-03-06 23:10:06,932][81401] Decorrelating experience for 32 frames... [2023-03-06 23:10:06,960][81739] Decorrelating experience for 0 frames... [2023-03-06 23:10:07,006][81644] Decorrelating experience for 32 frames... [2023-03-06 23:10:07,028][81636] Decorrelating experience for 32 frames... [2023-03-06 23:10:07,033][81599] Decorrelating experience for 0 frames... [2023-03-06 23:10:07,053][81755] Decorrelating experience for 32 frames... [2023-03-06 23:10:07,177][81740] Decorrelating experience for 32 frames... [2023-03-06 23:10:07,186][81640] Decorrelating experience for 32 frames... [2023-03-06 23:10:07,204][81639] Decorrelating experience for 32 frames... [2023-03-06 23:10:07,213][81566] Decorrelating experience for 32 frames... [2023-03-06 23:10:07,252][81738] Decorrelating experience for 32 frames... [2023-03-06 23:10:07,253][81846] Decorrelating experience for 32 frames... [2023-03-06 23:10:07,265][81641] Decorrelating experience for 32 frames... [2023-03-06 23:10:07,280][81555] Decorrelating experience for 32 frames... 
[2023-03-06 23:10:07,280][81405] Decorrelating experience for 32 frames... [2023-03-06 23:10:07,293][81643] Decorrelating experience for 32 frames... [2023-03-06 23:10:07,298][81868] Decorrelating experience for 32 frames... [2023-03-06 23:10:07,301][81637] Decorrelating experience for 32 frames... [2023-03-06 23:10:07,304][81402] Decorrelating experience for 32 frames... [2023-03-06 23:10:07,304][81567] Decorrelating experience for 32 frames... [2023-03-06 23:10:07,305][81601] Decorrelating experience for 32 frames... [2023-03-06 23:10:07,306][81403] Decorrelating experience for 32 frames... [2023-03-06 23:10:07,307][81565] Decorrelating experience for 32 frames... [2023-03-06 23:10:07,311][81602] Decorrelating experience for 32 frames... [2023-03-06 23:10:07,312][81437] Decorrelating experience for 32 frames... [2023-03-06 23:10:07,343][81553] Decorrelating experience for 32 frames... [2023-03-06 23:10:07,384][81845] Decorrelating experience for 32 frames... [2023-03-06 23:10:07,592][81603] Decorrelating experience for 32 frames... [2023-03-06 23:10:07,648][81349] Signal inference workers to stop experience collection... [2023-03-06 23:10:07,652][81400] InferenceWorker_p0-w0: stopping experience collection [2023-03-06 23:10:07,679][81600] Decorrelating experience for 32 frames... [2023-03-06 23:10:07,750][81564] Decorrelating experience for 32 frames... [2023-03-06 23:10:07,895][81604] Decorrelating experience for 32 frames... [2023-03-06 23:10:07,948][81349] Signal inference workers to resume experience collection... [2023-03-06 23:10:07,949][81400] InferenceWorker_p0-w0: resuming experience collection [2023-03-06 23:10:08,016][81739] Decorrelating experience for 32 frames... [2023-03-06 23:10:08,022][81599] Decorrelating experience for 32 frames... 
[2023-03-06 23:10:09,101][81400] Updated weights for policy 0, policy_version 10 (0.0216) [2023-03-06 23:10:09,859][81400] Updated weights for policy 0, policy_version 20 (0.0007) [2023-03-06 23:10:10,648][81400] Updated weights for policy 0, policy_version 30 (0.0006) [2023-03-06 23:10:11,236][81074] Fps is (10 sec: 7577.8, 60 sec: 7577.8, 300 sec: 7577.8). Total num frames: 37888. Throughput: 0: 4787.5. Samples: 23937. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 23:10:11,237][81074] Avg episode reward: [(0, '214.175')] [2023-03-06 23:10:11,392][81400] Updated weights for policy 0, policy_version 40 (0.0007) [2023-03-06 23:10:12,147][81400] Updated weights for policy 0, policy_version 50 (0.0007) [2023-03-06 23:10:12,910][81400] Updated weights for policy 0, policy_version 60 (0.0006) [2023-03-06 23:10:13,679][81400] Updated weights for policy 0, policy_version 70 (0.0005) [2023-03-06 23:10:14,422][81400] Updated weights for policy 0, policy_version 80 (0.0006) [2023-03-06 23:10:15,193][81400] Updated weights for policy 0, policy_version 90 (0.0005) [2023-03-06 23:10:15,948][81400] Updated weights for policy 0, policy_version 100 (0.0006) [2023-03-06 23:10:16,236][81074] Fps is (10 sec: 10547.3, 60 sec: 10547.3, 300 sec: 10547.3). Total num frames: 105472. Throughput: 0: 10464.7. Samples: 104646. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:10:16,237][81074] Avg episode reward: [(0, '537.408')] [2023-03-06 23:10:16,246][81349] Saving new best policy, reward=537.408! 
[2023-03-06 23:10:16,700][81400] Updated weights for policy 0, policy_version 110 (0.0006) [2023-03-06 23:10:17,451][81400] Updated weights for policy 0, policy_version 120 (0.0006) [2023-03-06 23:10:18,226][81400] Updated weights for policy 0, policy_version 130 (0.0005) [2023-03-06 23:10:18,971][81400] Updated weights for policy 0, policy_version 140 (0.0006) [2023-03-06 23:10:19,159][81074] Heartbeat connected on Batcher_0 [2023-03-06 23:10:19,161][81074] Heartbeat connected on LearnerWorker_p0 [2023-03-06 23:10:19,165][81074] Heartbeat connected on InferenceWorker_p0-w0 [2023-03-06 23:10:19,166][81074] Heartbeat connected on RolloutWorker_w0 [2023-03-06 23:10:19,169][81074] Heartbeat connected on RolloutWorker_w1 [2023-03-06 23:10:19,171][81074] Heartbeat connected on RolloutWorker_w2 [2023-03-06 23:10:19,173][81074] Heartbeat connected on RolloutWorker_w3 [2023-03-06 23:10:19,173][81074] Heartbeat connected on RolloutWorker_w4 [2023-03-06 23:10:19,176][81074] Heartbeat connected on RolloutWorker_w5 [2023-03-06 23:10:19,177][81074] Heartbeat connected on RolloutWorker_w6 [2023-03-06 23:10:19,179][81074] Heartbeat connected on RolloutWorker_w7 [2023-03-06 23:10:19,182][81074] Heartbeat connected on RolloutWorker_w8 [2023-03-06 23:10:19,183][81074] Heartbeat connected on RolloutWorker_w9 [2023-03-06 23:10:19,184][81074] Heartbeat connected on RolloutWorker_w10 [2023-03-06 23:10:19,202][81074] Heartbeat connected on RolloutWorker_w11 [2023-03-06 23:10:19,205][81074] Heartbeat connected on RolloutWorker_w13 [2023-03-06 23:10:19,205][81074] Heartbeat connected on RolloutWorker_w12 [2023-03-06 23:10:19,207][81074] Heartbeat connected on RolloutWorker_w14 [2023-03-06 23:10:19,209][81074] Heartbeat connected on RolloutWorker_w15 [2023-03-06 23:10:19,212][81074] Heartbeat connected on RolloutWorker_w16 [2023-03-06 23:10:19,212][81074] Heartbeat connected on RolloutWorker_w17 [2023-03-06 23:10:19,214][81074] Heartbeat connected on RolloutWorker_w18 [2023-03-06 
23:10:19,216][81074] Heartbeat connected on RolloutWorker_w19 [2023-03-06 23:10:19,217][81074] Heartbeat connected on RolloutWorker_w20 [2023-03-06 23:10:19,220][81074] Heartbeat connected on RolloutWorker_w21 [2023-03-06 23:10:19,221][81074] Heartbeat connected on RolloutWorker_w22 [2023-03-06 23:10:19,224][81074] Heartbeat connected on RolloutWorker_w23 [2023-03-06 23:10:19,225][81074] Heartbeat connected on RolloutWorker_w24 [2023-03-06 23:10:19,227][81074] Heartbeat connected on RolloutWorker_w25 [2023-03-06 23:10:19,229][81074] Heartbeat connected on RolloutWorker_w26 [2023-03-06 23:10:19,230][81074] Heartbeat connected on RolloutWorker_w27 [2023-03-06 23:10:19,232][81074] Heartbeat connected on RolloutWorker_w28 [2023-03-06 23:10:19,236][81074] Heartbeat connected on RolloutWorker_w30 [2023-03-06 23:10:19,237][81074] Heartbeat connected on RolloutWorker_w31 [2023-03-06 23:10:19,241][81074] Heartbeat connected on RolloutWorker_w29 [2023-03-06 23:10:19,713][81400] Updated weights for policy 0, policy_version 150 (0.0005) [2023-03-06 23:10:20,498][81400] Updated weights for policy 0, policy_version 160 (0.0006) [2023-03-06 23:10:21,236][81074] Fps is (10 sec: 13516.7, 60 sec: 11537.1, 300 sec: 11537.1). Total num frames: 173056. Throughput: 0: 9670.0. Samples: 145050. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:10:21,237][81074] Avg episode reward: [(0, '478.262')] [2023-03-06 23:10:21,252][81400] Updated weights for policy 0, policy_version 170 (0.0006) [2023-03-06 23:10:21,991][81400] Updated weights for policy 0, policy_version 180 (0.0006) [2023-03-06 23:10:22,778][81400] Updated weights for policy 0, policy_version 190 (0.0006) [2023-03-06 23:10:23,529][81400] Updated weights for policy 0, policy_version 200 (0.0006) [2023-03-06 23:10:24,274][81400] Updated weights for policy 0, policy_version 210 (0.0005) [2023-03-06 23:10:25,050][81400] Updated weights for policy 0, policy_version 220 (0.0006) [2023-03-06 23:10:25,789][81400] Updated weights for policy 0, policy_version 230 (0.0008) [2023-03-06 23:10:26,236][81074] Fps is (10 sec: 13516.8, 60 sec: 12032.0, 300 sec: 12032.0). Total num frames: 240640. Throughput: 0: 11312.3. Samples: 226245. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:10:26,237][81074] Avg episode reward: [(0, '728.770')] [2023-03-06 23:10:26,244][81349] Saving new best policy, reward=728.770! [2023-03-06 23:10:26,552][81400] Updated weights for policy 0, policy_version 240 (0.0006) [2023-03-06 23:10:27,327][81400] Updated weights for policy 0, policy_version 250 (0.0006) [2023-03-06 23:10:28,071][81400] Updated weights for policy 0, policy_version 260 (0.0005) [2023-03-06 23:10:28,803][81400] Updated weights for policy 0, policy_version 270 (0.0007) [2023-03-06 23:10:29,575][81400] Updated weights for policy 0, policy_version 280 (0.0005) [2023-03-06 23:10:30,328][81400] Updated weights for policy 0, policy_version 290 (0.0007) [2023-03-06 23:10:31,081][81400] Updated weights for policy 0, policy_version 300 (0.0006) [2023-03-06 23:10:31,236][81074] Fps is (10 sec: 13619.4, 60 sec: 12370.0, 300 sec: 12370.0). Total num frames: 309248. Throughput: 0: 12301.9. Samples: 307546. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:10:31,237][81074] Avg episode reward: [(0, '944.802')] [2023-03-06 23:10:31,237][81349] Saving new best policy, reward=944.802! [2023-03-06 23:10:31,827][81400] Updated weights for policy 0, policy_version 310 (0.0006) [2023-03-06 23:10:32,588][81400] Updated weights for policy 0, policy_version 320 (0.0007) [2023-03-06 23:10:33,314][81400] Updated weights for policy 0, policy_version 330 (0.0007) [2023-03-06 23:10:34,082][81400] Updated weights for policy 0, policy_version 340 (0.0006) [2023-03-06 23:10:34,841][81400] Updated weights for policy 0, policy_version 350 (0.0006) [2023-03-06 23:10:35,591][81400] Updated weights for policy 0, policy_version 360 (0.0005) [2023-03-06 23:10:36,236][81074] Fps is (10 sec: 13619.3, 60 sec: 12561.1, 300 sec: 12561.1). Total num frames: 376832. Throughput: 0: 11618.1. Samples: 348540. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 23:10:36,237][81074] Avg episode reward: [(0, '1038.145')] [2023-03-06 23:10:36,241][81349] Saving new best policy, reward=1038.145! [2023-03-06 23:10:36,344][81400] Updated weights for policy 0, policy_version 370 (0.0006) [2023-03-06 23:10:37,124][81400] Updated weights for policy 0, policy_version 380 (0.0007) [2023-03-06 23:10:37,848][81400] Updated weights for policy 0, policy_version 390 (0.0006) [2023-03-06 23:10:38,594][81400] Updated weights for policy 0, policy_version 400 (0.0005) [2023-03-06 23:10:39,353][81400] Updated weights for policy 0, policy_version 410 (0.0006) [2023-03-06 23:10:40,100][81400] Updated weights for policy 0, policy_version 420 (0.0006) [2023-03-06 23:10:40,848][81400] Updated weights for policy 0, policy_version 430 (0.0006) [2023-03-06 23:10:41,236][81074] Fps is (10 sec: 13619.0, 60 sec: 12726.9, 300 sec: 12726.9). Total num frames: 445440. Throughput: 0: 12288.6. Samples: 430099. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 23:10:41,237][81074] Avg episode reward: [(0, '1109.618')] [2023-03-06 23:10:41,238][81349] Saving new best policy, reward=1109.618! [2023-03-06 23:10:41,621][81400] Updated weights for policy 0, policy_version 440 (0.0006) [2023-03-06 23:10:42,372][81400] Updated weights for policy 0, policy_version 450 (0.0007) [2023-03-06 23:10:43,107][81400] Updated weights for policy 0, policy_version 460 (0.0006) [2023-03-06 23:10:43,894][81400] Updated weights for policy 0, policy_version 470 (0.0006) [2023-03-06 23:10:44,641][81400] Updated weights for policy 0, policy_version 480 (0.0006) [2023-03-06 23:10:45,372][81400] Updated weights for policy 0, policy_version 490 (0.0007) [2023-03-06 23:10:46,150][81400] Updated weights for policy 0, policy_version 500 (0.0006) [2023-03-06 23:10:46,236][81074] Fps is (10 sec: 13619.2, 60 sec: 12825.6, 300 sec: 12825.6). Total num frames: 513024. Throughput: 0: 12790.3. Samples: 511610. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:10:46,237][81074] Avg episode reward: [(0, '1065.361')] [2023-03-06 23:10:46,903][81400] Updated weights for policy 0, policy_version 510 (0.0007) [2023-03-06 23:10:47,638][81400] Updated weights for policy 0, policy_version 520 (0.0006) [2023-03-06 23:10:48,402][81400] Updated weights for policy 0, policy_version 530 (0.0006) [2023-03-06 23:10:49,154][81400] Updated weights for policy 0, policy_version 540 (0.0006) [2023-03-06 23:10:49,904][81400] Updated weights for policy 0, policy_version 550 (0.0007) [2023-03-06 23:10:50,657][81400] Updated weights for policy 0, policy_version 560 (0.0005) [2023-03-06 23:10:51,236][81074] Fps is (10 sec: 13516.9, 60 sec: 12902.4, 300 sec: 12902.4). Total num frames: 580608. Throughput: 0: 12271.8. Samples: 552231. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:10:51,237][81074] Avg episode reward: [(0, '1000.058')] [2023-03-06 23:10:51,430][81400] Updated weights for policy 0, policy_version 570 (0.0007) [2023-03-06 23:10:52,173][81400] Updated weights for policy 0, policy_version 580 (0.0006) [2023-03-06 23:10:52,923][81400] Updated weights for policy 0, policy_version 590 (0.0006) [2023-03-06 23:10:53,684][81400] Updated weights for policy 0, policy_version 600 (0.0007) [2023-03-06 23:10:54,413][81400] Updated weights for policy 0, policy_version 610 (0.0005) [2023-03-06 23:10:55,143][81400] Updated weights for policy 0, policy_version 620 (0.0006) [2023-03-06 23:10:55,917][81400] Updated weights for policy 0, policy_version 630 (0.0006) [2023-03-06 23:10:56,236][81074] Fps is (10 sec: 13619.2, 60 sec: 12984.4, 300 sec: 12984.4). Total num frames: 649216. Throughput: 0: 13561.3. Samples: 634196. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:10:56,237][81074] Avg episode reward: [(0, '985.732')] [2023-03-06 23:10:56,674][81400] Updated weights for policy 0, policy_version 640 (0.0006) [2023-03-06 23:10:57,429][81400] Updated weights for policy 0, policy_version 650 (0.0006) [2023-03-06 23:10:58,187][81400] Updated weights for policy 0, policy_version 660 (0.0006) [2023-03-06 23:10:58,944][81400] Updated weights for policy 0, policy_version 670 (0.0006) [2023-03-06 23:10:59,694][81400] Updated weights for policy 0, policy_version 680 (0.0005) [2023-03-06 23:11:00,461][81400] Updated weights for policy 0, policy_version 690 (0.0006) [2023-03-06 23:11:01,198][81400] Updated weights for policy 0, policy_version 700 (0.0006) [2023-03-06 23:11:01,236][81074] Fps is (10 sec: 13619.0, 60 sec: 13032.7, 300 sec: 13032.7). Total num frames: 716800. Throughput: 0: 13573.2. Samples: 715440. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:11:01,237][81074] Avg episode reward: [(0, '1067.954')] [2023-03-06 23:11:01,946][81400] Updated weights for policy 0, policy_version 710 (0.0005) [2023-03-06 23:11:02,734][81400] Updated weights for policy 0, policy_version 720 (0.0006) [2023-03-06 23:11:03,473][81400] Updated weights for policy 0, policy_version 730 (0.0006) [2023-03-06 23:11:04,220][81400] Updated weights for policy 0, policy_version 740 (0.0006) [2023-03-06 23:11:04,982][81400] Updated weights for policy 0, policy_version 750 (0.0007) [2023-03-06 23:11:05,738][81400] Updated weights for policy 0, policy_version 760 (0.0005) [2023-03-06 23:11:06,236][81074] Fps is (10 sec: 13516.6, 60 sec: 13073.1, 300 sec: 13073.1). Total num frames: 784384. Throughput: 0: 13583.4. Samples: 756302. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 23:11:06,237][81074] Avg episode reward: [(0, '1077.447')] [2023-03-06 23:11:06,482][81400] Updated weights for policy 0, policy_version 770 (0.0006) [2023-03-06 23:11:07,239][81400] Updated weights for policy 0, policy_version 780 (0.0006) [2023-03-06 23:11:08,007][81400] Updated weights for policy 0, policy_version 790 (0.0006) [2023-03-06 23:11:08,746][81400] Updated weights for policy 0, policy_version 800 (0.0005) [2023-03-06 23:11:09,494][81400] Updated weights for policy 0, policy_version 810 (0.0006) [2023-03-06 23:11:10,261][81400] Updated weights for policy 0, policy_version 820 (0.0005) [2023-03-06 23:11:11,009][81400] Updated weights for policy 0, policy_version 830 (0.0007) [2023-03-06 23:11:11,236][81074] Fps is (10 sec: 13619.4, 60 sec: 13585.1, 300 sec: 13123.0). Total num frames: 852992. Throughput: 0: 13588.1. Samples: 837710. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:11:11,237][81074] Avg episode reward: [(0, '1109.498')] [2023-03-06 23:11:11,766][81400] Updated weights for policy 0, policy_version 840 (0.0006) [2023-03-06 23:11:12,537][81400] Updated weights for policy 0, policy_version 850 (0.0006) [2023-03-06 23:11:13,274][81400] Updated weights for policy 0, policy_version 860 (0.0006) [2023-03-06 23:11:14,018][81400] Updated weights for policy 0, policy_version 870 (0.0006) [2023-03-06 23:11:14,780][81400] Updated weights for policy 0, policy_version 880 (0.0006) [2023-03-06 23:11:15,533][81400] Updated weights for policy 0, policy_version 890 (0.0006) [2023-03-06 23:11:16,236][81074] Fps is (10 sec: 13619.3, 60 sec: 13585.1, 300 sec: 13151.1). Total num frames: 920576. Throughput: 0: 13595.0. Samples: 919321. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:11:16,237][81074] Avg episode reward: [(0, '1116.363')] [2023-03-06 23:11:16,241][81349] Saving new best policy, reward=1116.363! [2023-03-06 23:11:16,291][81400] Updated weights for policy 0, policy_version 900 (0.0006) [2023-03-06 23:11:17,028][81400] Updated weights for policy 0, policy_version 910 (0.0007) [2023-03-06 23:11:17,777][81400] Updated weights for policy 0, policy_version 920 (0.0005) [2023-03-06 23:11:18,543][81400] Updated weights for policy 0, policy_version 930 (0.0005) [2023-03-06 23:11:19,286][81400] Updated weights for policy 0, policy_version 940 (0.0006) [2023-03-06 23:11:20,050][81400] Updated weights for policy 0, policy_version 950 (0.0006) [2023-03-06 23:11:20,805][81400] Updated weights for policy 0, policy_version 960 (0.0006) [2023-03-06 23:11:21,236][81074] Fps is (10 sec: 13516.8, 60 sec: 13585.1, 300 sec: 13175.5). Total num frames: 988160. Throughput: 0: 13587.2. Samples: 959964. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:11:21,237][81074] Avg episode reward: [(0, '1152.684')] [2023-03-06 23:11:21,253][81349] Saving new best policy, reward=1152.684! [2023-03-06 23:11:21,570][81400] Updated weights for policy 0, policy_version 970 (0.0006) [2023-03-06 23:11:22,323][81400] Updated weights for policy 0, policy_version 980 (0.0006) [2023-03-06 23:11:23,054][81400] Updated weights for policy 0, policy_version 990 (0.0006) [2023-03-06 23:11:23,824][81400] Updated weights for policy 0, policy_version 1000 (0.0006) [2023-03-06 23:11:24,572][81400] Updated weights for policy 0, policy_version 1010 (0.0006) [2023-03-06 23:11:25,325][81400] Updated weights for policy 0, policy_version 1020 (0.0006) [2023-03-06 23:11:26,077][81400] Updated weights for policy 0, policy_version 1030 (0.0006) [2023-03-06 23:11:26,236][81074] Fps is (10 sec: 13619.3, 60 sec: 13602.2, 300 sec: 13209.6). Total num frames: 1056768. Throughput: 0: 13588.5. Samples: 1041579. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:11:26,237][81074] Avg episode reward: [(0, '1167.462')] [2023-03-06 23:11:26,242][81349] Saving new best policy, reward=1167.462! [2023-03-06 23:11:26,854][81400] Updated weights for policy 0, policy_version 1040 (0.0006) [2023-03-06 23:11:27,619][81400] Updated weights for policy 0, policy_version 1050 (0.0005) [2023-03-06 23:11:28,367][81400] Updated weights for policy 0, policy_version 1060 (0.0006) [2023-03-06 23:11:29,135][81400] Updated weights for policy 0, policy_version 1070 (0.0006) [2023-03-06 23:11:29,864][81400] Updated weights for policy 0, policy_version 1080 (0.0006) [2023-03-06 23:11:30,638][81400] Updated weights for policy 0, policy_version 1090 (0.0006) [2023-03-06 23:11:31,236][81074] Fps is (10 sec: 13516.7, 60 sec: 13568.0, 300 sec: 13215.6). Total num frames: 1123328. Throughput: 0: 13578.7. Samples: 1122654. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:11:31,237][81074] Avg episode reward: [(0, '1183.766')] [2023-03-06 23:11:31,239][81349] Saving new best policy, reward=1183.766! [2023-03-06 23:11:31,385][81400] Updated weights for policy 0, policy_version 1100 (0.0005) [2023-03-06 23:11:32,125][81400] Updated weights for policy 0, policy_version 1110 (0.0007) [2023-03-06 23:11:32,879][81400] Updated weights for policy 0, policy_version 1120 (0.0006) [2023-03-06 23:11:33,643][81400] Updated weights for policy 0, policy_version 1130 (0.0005) [2023-03-06 23:11:34,371][81400] Updated weights for policy 0, policy_version 1140 (0.0006) [2023-03-06 23:11:35,116][81400] Updated weights for policy 0, policy_version 1150 (0.0006) [2023-03-06 23:11:35,860][81400] Updated weights for policy 0, policy_version 1160 (0.0005) [2023-03-06 23:11:36,236][81074] Fps is (10 sec: 13516.8, 60 sec: 13585.1, 300 sec: 13243.8). Total num frames: 1191936. Throughput: 0: 13584.1. Samples: 1163514. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-06 23:11:36,237][81074] Avg episode reward: [(0, '1209.934')] [2023-03-06 23:11:36,246][81349] Saving new best policy, reward=1209.934! [2023-03-06 23:11:36,613][81400] Updated weights for policy 0, policy_version 1170 (0.0006) [2023-03-06 23:11:37,365][81400] Updated weights for policy 0, policy_version 1180 (0.0006) [2023-03-06 23:11:38,142][81400] Updated weights for policy 0, policy_version 1190 (0.0007) [2023-03-06 23:11:38,886][81400] Updated weights for policy 0, policy_version 1200 (0.0007) [2023-03-06 23:11:39,621][81400] Updated weights for policy 0, policy_version 1210 (0.0006) [2023-03-06 23:11:40,388][81400] Updated weights for policy 0, policy_version 1220 (0.0005) [2023-03-06 23:11:41,155][81400] Updated weights for policy 0, policy_version 1230 (0.0006) [2023-03-06 23:11:41,236][81074] Fps is (10 sec: 13721.6, 60 sec: 13585.1, 300 sec: 13268.9). Total num frames: 1260544. Throughput: 0: 13582.7. Samples: 1245417. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:11:41,247][81074] Avg episode reward: [(0, '1194.807')] [2023-03-06 23:11:41,910][81400] Updated weights for policy 0, policy_version 1240 (0.0006) [2023-03-06 23:11:42,647][81400] Updated weights for policy 0, policy_version 1250 (0.0005) [2023-03-06 23:11:43,434][81400] Updated weights for policy 0, policy_version 1260 (0.0006) [2023-03-06 23:11:44,181][81400] Updated weights for policy 0, policy_version 1270 (0.0006) [2023-03-06 23:11:44,942][81400] Updated weights for policy 0, policy_version 1280 (0.0006) [2023-03-06 23:11:45,692][81400] Updated weights for policy 0, policy_version 1290 (0.0005) [2023-03-06 23:11:46,236][81074] Fps is (10 sec: 13516.7, 60 sec: 13568.0, 300 sec: 13271.1). Total num frames: 1327104. Throughput: 0: 13579.0. Samples: 1326493. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 23:11:46,237][81074] Avg episode reward: [(0, '1228.359')] [2023-03-06 23:11:46,241][81349] Saving new best policy, reward=1228.359! [2023-03-06 23:11:46,444][81400] Updated weights for policy 0, policy_version 1300 (0.0007) [2023-03-06 23:11:47,193][81400] Updated weights for policy 0, policy_version 1310 (0.0006) [2023-03-06 23:11:47,962][81400] Updated weights for policy 0, policy_version 1320 (0.0007) [2023-03-06 23:11:48,696][81400] Updated weights for policy 0, policy_version 1330 (0.0005) [2023-03-06 23:11:49,451][81400] Updated weights for policy 0, policy_version 1340 (0.0006) [2023-03-06 23:11:50,200][81400] Updated weights for policy 0, policy_version 1350 (0.0006) [2023-03-06 23:11:50,972][81400] Updated weights for policy 0, policy_version 1360 (0.0006) [2023-03-06 23:11:51,236][81074] Fps is (10 sec: 13516.8, 60 sec: 13585.1, 300 sec: 13292.5). Total num frames: 1395712. Throughput: 0: 13580.7. Samples: 1367430. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:11:51,237][81074] Avg episode reward: [(0, '1187.814')] [2023-03-06 23:11:51,719][81400] Updated weights for policy 0, policy_version 1370 (0.0006) [2023-03-06 23:11:52,503][81400] Updated weights for policy 0, policy_version 1380 (0.0007) [2023-03-06 23:11:53,233][81400] Updated weights for policy 0, policy_version 1390 (0.0005) [2023-03-06 23:11:53,997][81400] Updated weights for policy 0, policy_version 1400 (0.0006) [2023-03-06 23:11:54,754][81400] Updated weights for policy 0, policy_version 1410 (0.0005) [2023-03-06 23:11:55,517][81400] Updated weights for policy 0, policy_version 1420 (0.0006) [2023-03-06 23:11:56,236][81074] Fps is (10 sec: 13619.3, 60 sec: 13568.0, 300 sec: 13302.7). Total num frames: 1463296. Throughput: 0: 13574.6. Samples: 1448568. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:11:56,237][81074] Avg episode reward: [(0, '1318.343')] [2023-03-06 23:11:56,242][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000001429_1463296.pth... [2023-03-06 23:11:56,272][81349] Saving new best policy, reward=1318.343! [2023-03-06 23:11:56,325][81400] Updated weights for policy 0, policy_version 1430 (0.0006) [2023-03-06 23:11:57,038][81400] Updated weights for policy 0, policy_version 1440 (0.0005) [2023-03-06 23:11:57,786][81400] Updated weights for policy 0, policy_version 1450 (0.0006) [2023-03-06 23:11:58,542][81400] Updated weights for policy 0, policy_version 1460 (0.0005) [2023-03-06 23:11:59,306][81400] Updated weights for policy 0, policy_version 1470 (0.0006) [2023-03-06 23:12:00,067][81400] Updated weights for policy 0, policy_version 1480 (0.0007) [2023-03-06 23:12:00,813][81400] Updated weights for policy 0, policy_version 1490 (0.0007) [2023-03-06 23:12:01,236][81074] Fps is (10 sec: 13516.7, 60 sec: 13568.0, 300 sec: 13312.0). Total num frames: 1530880. Throughput: 0: 13564.1. Samples: 1529705. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-06 23:12:01,237][81074] Avg episode reward: [(0, '1302.087')] [2023-03-06 23:12:01,578][81400] Updated weights for policy 0, policy_version 1500 (0.0006) [2023-03-06 23:12:02,333][81400] Updated weights for policy 0, policy_version 1510 (0.0005) [2023-03-06 23:12:03,091][81400] Updated weights for policy 0, policy_version 1520 (0.0006) [2023-03-06 23:12:03,849][81400] Updated weights for policy 0, policy_version 1530 (0.0006) [2023-03-06 23:12:04,641][81400] Updated weights for policy 0, policy_version 1540 (0.0006) [2023-03-06 23:12:05,389][81400] Updated weights for policy 0, policy_version 1550 (0.0006) [2023-03-06 23:12:06,145][81400] Updated weights for policy 0, policy_version 1560 (0.0006) [2023-03-06 23:12:06,236][81074] Fps is (10 sec: 13516.7, 60 sec: 13568.0, 300 sec: 13320.5). Total num frames: 1598464. Throughput: 0: 13558.2. Samples: 1570085. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:12:06,237][81074] Avg episode reward: [(0, '1358.993')] [2023-03-06 23:12:06,240][81349] Saving new best policy, reward=1358.993! [2023-03-06 23:12:06,902][81400] Updated weights for policy 0, policy_version 1570 (0.0006) [2023-03-06 23:12:07,672][81400] Updated weights for policy 0, policy_version 1580 (0.0006) [2023-03-06 23:12:08,436][81400] Updated weights for policy 0, policy_version 1590 (0.0006) [2023-03-06 23:12:09,197][81400] Updated weights for policy 0, policy_version 1600 (0.0006) [2023-03-06 23:12:09,967][81400] Updated weights for policy 0, policy_version 1610 (0.0005) [2023-03-06 23:12:10,724][81400] Updated weights for policy 0, policy_version 1620 (0.0006) [2023-03-06 23:12:11,236][81074] Fps is (10 sec: 13414.7, 60 sec: 13533.9, 300 sec: 13320.2). Total num frames: 1665024. Throughput: 0: 13534.9. Samples: 1650651. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:12:11,237][81074] Avg episode reward: [(0, '1239.165')] [2023-03-06 23:12:11,492][81400] Updated weights for policy 0, policy_version 1630 (0.0007) [2023-03-06 23:12:12,250][81400] Updated weights for policy 0, policy_version 1640 (0.0007) [2023-03-06 23:12:13,009][81400] Updated weights for policy 0, policy_version 1650 (0.0006) [2023-03-06 23:12:13,767][81400] Updated weights for policy 0, policy_version 1660 (0.0006) [2023-03-06 23:12:14,526][81400] Updated weights for policy 0, policy_version 1670 (0.0006) [2023-03-06 23:12:15,306][81400] Updated weights for policy 0, policy_version 1680 (0.0006) [2023-03-06 23:12:16,087][81400] Updated weights for policy 0, policy_version 1690 (0.0006) [2023-03-06 23:12:16,236][81074] Fps is (10 sec: 13414.5, 60 sec: 13533.9, 300 sec: 13327.8). Total num frames: 1732608. Throughput: 0: 13518.5. Samples: 1730984. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:12:16,247][81074] Avg episode reward: [(0, '1320.212')] [2023-03-06 23:12:16,845][81400] Updated weights for policy 0, policy_version 1700 (0.0006) [2023-03-06 23:12:17,614][81400] Updated weights for policy 0, policy_version 1710 (0.0006) [2023-03-06 23:12:18,387][81400] Updated weights for policy 0, policy_version 1720 (0.0006) [2023-03-06 23:12:19,166][81400] Updated weights for policy 0, policy_version 1730 (0.0006) [2023-03-06 23:12:19,929][81400] Updated weights for policy 0, policy_version 1740 (0.0005) [2023-03-06 23:12:20,699][81400] Updated weights for policy 0, policy_version 1750 (0.0006) [2023-03-06 23:12:21,236][81074] Fps is (10 sec: 13311.7, 60 sec: 13499.7, 300 sec: 13319.6). Total num frames: 1798144. Throughput: 0: 13495.0. Samples: 1770793. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:12:21,237][81074] Avg episode reward: [(0, '1340.592')] [2023-03-06 23:12:21,455][81400] Updated weights for policy 0, policy_version 1760 (0.0006) [2023-03-06 23:12:22,226][81400] Updated weights for policy 0, policy_version 1770 (0.0006) [2023-03-06 23:12:23,009][81400] Updated weights for policy 0, policy_version 1780 (0.0007) [2023-03-06 23:12:23,808][81400] Updated weights for policy 0, policy_version 1790 (0.0006) [2023-03-06 23:12:24,566][81400] Updated weights for policy 0, policy_version 1800 (0.0006) [2023-03-06 23:12:25,330][81400] Updated weights for policy 0, policy_version 1810 (0.0005) [2023-03-06 23:12:26,123][81400] Updated weights for policy 0, policy_version 1820 (0.0006) [2023-03-06 23:12:26,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13465.6, 300 sec: 13319.3). Total num frames: 1864704. Throughput: 0: 13442.8. Samples: 1850341. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 23:12:26,237][81074] Avg episode reward: [(0, '1429.795')] [2023-03-06 23:12:26,241][81349] Saving new best policy, reward=1429.795! [2023-03-06 23:12:26,904][81400] Updated weights for policy 0, policy_version 1830 (0.0006) [2023-03-06 23:12:27,664][81400] Updated weights for policy 0, policy_version 1840 (0.0006) [2023-03-06 23:12:28,424][81400] Updated weights for policy 0, policy_version 1850 (0.0006) [2023-03-06 23:12:29,217][81400] Updated weights for policy 0, policy_version 1860 (0.0006) [2023-03-06 23:12:29,967][81400] Updated weights for policy 0, policy_version 1870 (0.0006) [2023-03-06 23:12:30,733][81400] Updated weights for policy 0, policy_version 1880 (0.0006) [2023-03-06 23:12:31,236][81074] Fps is (10 sec: 13312.2, 60 sec: 13465.6, 300 sec: 13319.1). Total num frames: 1931264. Throughput: 0: 13415.6. Samples: 1930196. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:12:31,237][81074] Avg episode reward: [(0, '1232.450')] [2023-03-06 23:12:31,506][81400] Updated weights for policy 0, policy_version 1890 (0.0006) [2023-03-06 23:12:32,284][81400] Updated weights for policy 0, policy_version 1900 (0.0006) [2023-03-06 23:12:33,051][81400] Updated weights for policy 0, policy_version 1910 (0.0005) [2023-03-06 23:12:33,801][81400] Updated weights for policy 0, policy_version 1920 (0.0006) [2023-03-06 23:12:34,575][81400] Updated weights for policy 0, policy_version 1930 (0.0007) [2023-03-06 23:12:35,357][81400] Updated weights for policy 0, policy_version 1940 (0.0007) [2023-03-06 23:12:36,118][81400] Updated weights for policy 0, policy_version 1950 (0.0006) [2023-03-06 23:12:36,236][81074] Fps is (10 sec: 13311.8, 60 sec: 13431.4, 300 sec: 13318.8). Total num frames: 1997824. Throughput: 0: 13395.2. Samples: 1970218. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:12:36,237][81074] Avg episode reward: [(0, '1378.818')] [2023-03-06 23:12:36,893][81400] Updated weights for policy 0, policy_version 1960 (0.0005) [2023-03-06 23:12:37,655][81400] Updated weights for policy 0, policy_version 1970 (0.0007) [2023-03-06 23:12:38,408][81400] Updated weights for policy 0, policy_version 1980 (0.0006) [2023-03-06 23:12:39,174][81400] Updated weights for policy 0, policy_version 1990 (0.0006) [2023-03-06 23:12:39,965][81400] Updated weights for policy 0, policy_version 2000 (0.0006) [2023-03-06 23:12:40,722][81400] Updated weights for policy 0, policy_version 2010 (0.0006) [2023-03-06 23:12:41,236][81074] Fps is (10 sec: 13311.9, 60 sec: 13397.3, 300 sec: 13318.6). Total num frames: 2064384. Throughput: 0: 13366.2. Samples: 2050049. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:12:41,247][81074] Avg episode reward: [(0, '1516.940')] [2023-03-06 23:12:41,248][81349] Saving new best policy, reward=1516.940! 
[2023-03-06 23:12:41,513][81400] Updated weights for policy 0, policy_version 2020 (0.0006) [2023-03-06 23:12:42,270][81400] Updated weights for policy 0, policy_version 2030 (0.0006) [2023-03-06 23:12:43,033][81400] Updated weights for policy 0, policy_version 2040 (0.0006) [2023-03-06 23:12:43,801][81400] Updated weights for policy 0, policy_version 2050 (0.0005) [2023-03-06 23:12:44,569][81400] Updated weights for policy 0, policy_version 2060 (0.0007) [2023-03-06 23:12:45,353][81400] Updated weights for policy 0, policy_version 2070 (0.0006) [2023-03-06 23:12:46,116][81400] Updated weights for policy 0, policy_version 2080 (0.0006) [2023-03-06 23:12:46,236][81074] Fps is (10 sec: 13312.2, 60 sec: 13397.3, 300 sec: 13318.4). Total num frames: 2130944. Throughput: 0: 13339.0. Samples: 2129957. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:12:46,248][81074] Avg episode reward: [(0, '1342.508')] [2023-03-06 23:12:46,881][81400] Updated weights for policy 0, policy_version 2090 (0.0006) [2023-03-06 23:12:47,652][81400] Updated weights for policy 0, policy_version 2100 (0.0006) [2023-03-06 23:12:48,407][81400] Updated weights for policy 0, policy_version 2110 (0.0006) [2023-03-06 23:12:49,181][81400] Updated weights for policy 0, policy_version 2120 (0.0006) [2023-03-06 23:12:49,955][81400] Updated weights for policy 0, policy_version 2130 (0.0006) [2023-03-06 23:12:50,721][81400] Updated weights for policy 0, policy_version 2140 (0.0006) [2023-03-06 23:12:51,236][81074] Fps is (10 sec: 13312.2, 60 sec: 13363.2, 300 sec: 13318.2). Total num frames: 2197504. Throughput: 0: 13330.9. Samples: 2169976. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:12:51,247][81074] Avg episode reward: [(0, '1437.765')] [2023-03-06 23:12:51,504][81400] Updated weights for policy 0, policy_version 2150 (0.0006) [2023-03-06 23:12:52,274][81400] Updated weights for policy 0, policy_version 2160 (0.0007) [2023-03-06 23:12:53,043][81400] Updated weights for policy 0, policy_version 2170 (0.0006) [2023-03-06 23:12:53,816][81400] Updated weights for policy 0, policy_version 2180 (0.0007) [2023-03-06 23:12:54,597][81400] Updated weights for policy 0, policy_version 2190 (0.0006) [2023-03-06 23:12:55,368][81400] Updated weights for policy 0, policy_version 2200 (0.0006) [2023-03-06 23:12:56,142][81400] Updated weights for policy 0, policy_version 2210 (0.0006) [2023-03-06 23:12:56,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13346.1, 300 sec: 13318.0). Total num frames: 2264064. Throughput: 0: 13308.2. Samples: 2249522. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:12:56,237][81074] Avg episode reward: [(0, '1594.466')] [2023-03-06 23:12:56,244][81349] Saving new best policy, reward=1594.466! [2023-03-06 23:12:56,913][81400] Updated weights for policy 0, policy_version 2220 (0.0006) [2023-03-06 23:12:57,677][81400] Updated weights for policy 0, policy_version 2230 (0.0006) [2023-03-06 23:12:58,459][81400] Updated weights for policy 0, policy_version 2240 (0.0006) [2023-03-06 23:12:59,233][81400] Updated weights for policy 0, policy_version 2250 (0.0006) [2023-03-06 23:13:00,006][81400] Updated weights for policy 0, policy_version 2260 (0.0005) [2023-03-06 23:13:00,798][81400] Updated weights for policy 0, policy_version 2270 (0.0006) [2023-03-06 23:13:01,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13312.0, 300 sec: 13312.0). Total num frames: 2329600. Throughput: 0: 13283.6. Samples: 2328747. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:13:01,247][81074] Avg episode reward: [(0, '1545.785')] [2023-03-06 23:13:01,560][81400] Updated weights for policy 0, policy_version 2280 (0.0007) [2023-03-06 23:13:02,336][81400] Updated weights for policy 0, policy_version 2290 (0.0006) [2023-03-06 23:13:03,104][81400] Updated weights for policy 0, policy_version 2300 (0.0006) [2023-03-06 23:13:03,888][81400] Updated weights for policy 0, policy_version 2310 (0.0006) [2023-03-06 23:13:04,655][81400] Updated weights for policy 0, policy_version 2320 (0.0006) [2023-03-06 23:13:05,450][81400] Updated weights for policy 0, policy_version 2330 (0.0006) [2023-03-06 23:13:06,203][81400] Updated weights for policy 0, policy_version 2340 (0.0007) [2023-03-06 23:13:06,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13294.9, 300 sec: 13312.0). Total num frames: 2396160. Throughput: 0: 13282.5. Samples: 2368503. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:13:06,247][81074] Avg episode reward: [(0, '1596.976')] [2023-03-06 23:13:06,251][81349] Saving new best policy, reward=1596.976! [2023-03-06 23:13:06,984][81400] Updated weights for policy 0, policy_version 2350 (0.0006) [2023-03-06 23:13:07,753][81400] Updated weights for policy 0, policy_version 2360 (0.0006) [2023-03-06 23:13:08,545][81400] Updated weights for policy 0, policy_version 2370 (0.0006) [2023-03-06 23:13:09,295][81400] Updated weights for policy 0, policy_version 2380 (0.0006) [2023-03-06 23:13:10,089][81400] Updated weights for policy 0, policy_version 2390 (0.0006) [2023-03-06 23:13:10,870][81400] Updated weights for policy 0, policy_version 2400 (0.0005) [2023-03-06 23:13:11,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13277.8, 300 sec: 13306.5). Total num frames: 2461696. Throughput: 0: 13271.7. Samples: 2447566. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:13:11,247][81074] Avg episode reward: [(0, '1756.848')] [2023-03-06 23:13:11,257][81349] Saving new best policy, reward=1756.848! [2023-03-06 23:13:11,634][81400] Updated weights for policy 0, policy_version 2410 (0.0006) [2023-03-06 23:13:12,424][81400] Updated weights for policy 0, policy_version 2420 (0.0006) [2023-03-06 23:13:13,196][81400] Updated weights for policy 0, policy_version 2430 (0.0008) [2023-03-06 23:13:13,966][81400] Updated weights for policy 0, policy_version 2440 (0.0006) [2023-03-06 23:13:14,721][81400] Updated weights for policy 0, policy_version 2450 (0.0005) [2023-03-06 23:13:15,471][81400] Updated weights for policy 0, policy_version 2460 (0.0006) [2023-03-06 23:13:16,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13260.8, 300 sec: 13306.6). Total num frames: 2528256. Throughput: 0: 13273.0. Samples: 2527479. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:13:16,240][81400] Updated weights for policy 0, policy_version 2470 (0.0005) [2023-03-06 23:13:16,247][81074] Avg episode reward: [(0, '1721.449')] [2023-03-06 23:13:17,032][81400] Updated weights for policy 0, policy_version 2480 (0.0006) [2023-03-06 23:13:17,797][81400] Updated weights for policy 0, policy_version 2490 (0.0006) [2023-03-06 23:13:18,561][81400] Updated weights for policy 0, policy_version 2500 (0.0007) [2023-03-06 23:13:19,341][81400] Updated weights for policy 0, policy_version 2510 (0.0006) [2023-03-06 23:13:20,118][81400] Updated weights for policy 0, policy_version 2520 (0.0007) [2023-03-06 23:13:20,893][81400] Updated weights for policy 0, policy_version 2530 (0.0007) [2023-03-06 23:13:21,236][81074] Fps is (10 sec: 13312.1, 60 sec: 13277.9, 300 sec: 13306.8). Total num frames: 2594816. Throughput: 0: 13266.2. Samples: 2567193. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:13:21,247][81074] Avg episode reward: [(0, '1768.313')] [2023-03-06 23:13:21,248][81349] Saving new best policy, reward=1768.313! [2023-03-06 23:13:21,669][81400] Updated weights for policy 0, policy_version 2540 (0.0006) [2023-03-06 23:13:22,444][81400] Updated weights for policy 0, policy_version 2550 (0.0006) [2023-03-06 23:13:23,213][81400] Updated weights for policy 0, policy_version 2560 (0.0005) [2023-03-06 23:13:23,983][81400] Updated weights for policy 0, policy_version 2570 (0.0007) [2023-03-06 23:13:24,745][81400] Updated weights for policy 0, policy_version 2580 (0.0006) [2023-03-06 23:13:25,516][81400] Updated weights for policy 0, policy_version 2590 (0.0006) [2023-03-06 23:13:26,236][81074] Fps is (10 sec: 13311.8, 60 sec: 13277.9, 300 sec: 13306.9). Total num frames: 2661376. Throughput: 0: 13258.5. Samples: 2646682. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:13:26,248][81074] Avg episode reward: [(0, '1776.464')] [2023-03-06 23:13:26,252][81349] Saving new best policy, reward=1776.464! [2023-03-06 23:13:26,302][81400] Updated weights for policy 0, policy_version 2600 (0.0007) [2023-03-06 23:13:27,079][81400] Updated weights for policy 0, policy_version 2610 (0.0006) [2023-03-06 23:13:27,857][81400] Updated weights for policy 0, policy_version 2620 (0.0006) [2023-03-06 23:13:28,620][81400] Updated weights for policy 0, policy_version 2630 (0.0007) [2023-03-06 23:13:29,409][81400] Updated weights for policy 0, policy_version 2640 (0.0006) [2023-03-06 23:13:30,178][81400] Updated weights for policy 0, policy_version 2650 (0.0006) [2023-03-06 23:13:30,949][81400] Updated weights for policy 0, policy_version 2660 (0.0005) [2023-03-06 23:13:31,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13302.0). Total num frames: 2726912. Throughput: 0: 13245.7. Samples: 2726014. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:13:31,247][81074] Avg episode reward: [(0, '1890.367')] [2023-03-06 23:13:31,248][81349] Saving new best policy, reward=1890.367! [2023-03-06 23:13:31,710][81400] Updated weights for policy 0, policy_version 2670 (0.0006) [2023-03-06 23:13:32,491][81400] Updated weights for policy 0, policy_version 2680 (0.0005) [2023-03-06 23:13:33,272][81400] Updated weights for policy 0, policy_version 2690 (0.0008) [2023-03-06 23:13:34,032][81400] Updated weights for policy 0, policy_version 2700 (0.0006) [2023-03-06 23:13:34,817][81400] Updated weights for policy 0, policy_version 2710 (0.0006) [2023-03-06 23:13:35,586][81400] Updated weights for policy 0, policy_version 2720 (0.0006) [2023-03-06 23:13:36,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13260.9, 300 sec: 13302.3). Total num frames: 2793472. Throughput: 0: 13237.7. Samples: 2765671. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:13:36,247][81074] Avg episode reward: [(0, '1818.384')] [2023-03-06 23:13:36,345][81400] Updated weights for policy 0, policy_version 2730 (0.0007) [2023-03-06 23:13:37,129][81400] Updated weights for policy 0, policy_version 2740 (0.0006) [2023-03-06 23:13:37,905][81400] Updated weights for policy 0, policy_version 2750 (0.0006) [2023-03-06 23:13:38,677][81400] Updated weights for policy 0, policy_version 2760 (0.0006) [2023-03-06 23:13:39,444][81400] Updated weights for policy 0, policy_version 2770 (0.0006) [2023-03-06 23:13:40,219][81400] Updated weights for policy 0, policy_version 2780 (0.0006) [2023-03-06 23:13:41,001][81400] Updated weights for policy 0, policy_version 2790 (0.0006) [2023-03-06 23:13:41,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13302.5). Total num frames: 2860032. Throughput: 0: 13236.3. Samples: 2845153. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-06 23:13:41,237][81074] Avg episode reward: [(0, '1886.203')] [2023-03-06 23:13:41,759][81400] Updated weights for policy 0, policy_version 2800 (0.0006) [2023-03-06 23:13:42,532][81400] Updated weights for policy 0, policy_version 2810 (0.0005) [2023-03-06 23:13:43,298][81400] Updated weights for policy 0, policy_version 2820 (0.0006) [2023-03-06 23:13:44,081][81400] Updated weights for policy 0, policy_version 2830 (0.0006) [2023-03-06 23:13:44,842][81400] Updated weights for policy 0, policy_version 2840 (0.0006) [2023-03-06 23:13:45,613][81400] Updated weights for policy 0, policy_version 2850 (0.0006) [2023-03-06 23:13:46,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13302.7). Total num frames: 2926592. Throughput: 0: 13252.2. Samples: 2925096. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:13:46,237][81074] Avg episode reward: [(0, '1893.655')] [2023-03-06 23:13:46,241][81349] Saving new best policy, reward=1893.655! [2023-03-06 23:13:46,387][81400] Updated weights for policy 0, policy_version 2860 (0.0006) [2023-03-06 23:13:47,153][81400] Updated weights for policy 0, policy_version 2870 (0.0006) [2023-03-06 23:13:47,931][81400] Updated weights for policy 0, policy_version 2880 (0.0006) [2023-03-06 23:13:48,706][81400] Updated weights for policy 0, policy_version 2890 (0.0006) [2023-03-06 23:13:49,466][81400] Updated weights for policy 0, policy_version 2900 (0.0006) [2023-03-06 23:13:50,223][81400] Updated weights for policy 0, policy_version 2910 (0.0006) [2023-03-06 23:13:50,985][81400] Updated weights for policy 0, policy_version 2920 (0.0006) [2023-03-06 23:13:51,236][81074] Fps is (10 sec: 13311.8, 60 sec: 13260.8, 300 sec: 13302.9). Total num frames: 2993152. Throughput: 0: 13253.0. Samples: 2964889. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:13:51,237][81074] Avg episode reward: [(0, '1937.363')] [2023-03-06 23:13:51,238][81349] Saving new best policy, reward=1937.363! [2023-03-06 23:13:51,772][81400] Updated weights for policy 0, policy_version 2930 (0.0006) [2023-03-06 23:13:52,558][81400] Updated weights for policy 0, policy_version 2940 (0.0006) [2023-03-06 23:13:53,315][81400] Updated weights for policy 0, policy_version 2950 (0.0007) [2023-03-06 23:13:54,081][81400] Updated weights for policy 0, policy_version 2960 (0.0006) [2023-03-06 23:13:54,855][81400] Updated weights for policy 0, policy_version 2970 (0.0006) [2023-03-06 23:13:55,627][81400] Updated weights for policy 0, policy_version 2980 (0.0006) [2023-03-06 23:13:56,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13243.8, 300 sec: 13298.7). Total num frames: 3058688. Throughput: 0: 13268.6. Samples: 3044652. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:13:56,237][81074] Avg episode reward: [(0, '2049.557')] [2023-03-06 23:13:56,248][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000002988_3059712.pth... [2023-03-06 23:13:56,280][81349] Saving new best policy, reward=2049.557! 
[2023-03-06 23:13:56,397][81400] Updated weights for policy 0, policy_version 2990 (0.0007) [2023-03-06 23:13:57,165][81400] Updated weights for policy 0, policy_version 3000 (0.0006) [2023-03-06 23:13:57,948][81400] Updated weights for policy 0, policy_version 3010 (0.0007) [2023-03-06 23:13:58,713][81400] Updated weights for policy 0, policy_version 3020 (0.0006) [2023-03-06 23:13:59,475][81400] Updated weights for policy 0, policy_version 3030 (0.0006) [2023-03-06 23:14:00,230][81400] Updated weights for policy 0, policy_version 3040 (0.0006) [2023-03-06 23:14:01,007][81400] Updated weights for policy 0, policy_version 3050 (0.0007) [2023-03-06 23:14:01,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13260.8, 300 sec: 13298.9). Total num frames: 3125248. Throughput: 0: 13263.5. Samples: 3124339. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:14:01,237][81074] Avg episode reward: [(0, '2173.621')] [2023-03-06 23:14:01,238][81349] Saving new best policy, reward=2173.621! [2023-03-06 23:14:01,783][81400] Updated weights for policy 0, policy_version 3060 (0.0006) [2023-03-06 23:14:02,558][81400] Updated weights for policy 0, policy_version 3070 (0.0006) [2023-03-06 23:14:03,348][81400] Updated weights for policy 0, policy_version 3080 (0.0007) [2023-03-06 23:14:04,112][81400] Updated weights for policy 0, policy_version 3090 (0.0006) [2023-03-06 23:14:04,897][81400] Updated weights for policy 0, policy_version 3100 (0.0006) [2023-03-06 23:14:05,637][81400] Updated weights for policy 0, policy_version 3110 (0.0006) [2023-03-06 23:14:06,236][81074] Fps is (10 sec: 13311.8, 60 sec: 13260.8, 300 sec: 13299.2). Total num frames: 3191808. Throughput: 0: 13265.4. Samples: 3164137. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:14:06,237][81074] Avg episode reward: [(0, '2332.367')] [2023-03-06 23:14:06,242][81349] Saving new best policy, reward=2332.367! 
[2023-03-06 23:14:06,430][81400] Updated weights for policy 0, policy_version 3120 (0.0006) [2023-03-06 23:14:07,189][81400] Updated weights for policy 0, policy_version 3130 (0.0006) [2023-03-06 23:14:07,967][81400] Updated weights for policy 0, policy_version 3140 (0.0007) [2023-03-06 23:14:08,715][81400] Updated weights for policy 0, policy_version 3150 (0.0005) [2023-03-06 23:14:09,478][81400] Updated weights for policy 0, policy_version 3160 (0.0006) [2023-03-06 23:14:10,268][81400] Updated weights for policy 0, policy_version 3170 (0.0006) [2023-03-06 23:14:11,040][81400] Updated weights for policy 0, policy_version 3180 (0.0006) [2023-03-06 23:14:11,236][81074] Fps is (10 sec: 13312.1, 60 sec: 13277.9, 300 sec: 13299.5). Total num frames: 3258368. Throughput: 0: 13275.1. Samples: 3244060. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:14:11,237][81074] Avg episode reward: [(0, '2156.435')] [2023-03-06 23:14:11,798][81400] Updated weights for policy 0, policy_version 3190 (0.0006) [2023-03-06 23:14:12,574][81400] Updated weights for policy 0, policy_version 3200 (0.0007) [2023-03-06 23:14:13,355][81400] Updated weights for policy 0, policy_version 3210 (0.0006) [2023-03-06 23:14:14,126][81400] Updated weights for policy 0, policy_version 3220 (0.0006) [2023-03-06 23:14:14,905][81400] Updated weights for policy 0, policy_version 3230 (0.0006) [2023-03-06 23:14:15,666][81400] Updated weights for policy 0, policy_version 3240 (0.0006) [2023-03-06 23:14:16,236][81074] Fps is (10 sec: 13312.1, 60 sec: 13277.8, 300 sec: 13299.7). Total num frames: 3324928. Throughput: 0: 13282.3. Samples: 3323720. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:14:16,237][81074] Avg episode reward: [(0, '2132.108')] [2023-03-06 23:14:16,436][81400] Updated weights for policy 0, policy_version 3250 (0.0006) [2023-03-06 23:14:17,209][81400] Updated weights for policy 0, policy_version 3260 (0.0006) [2023-03-06 23:14:17,973][81400] Updated weights for policy 0, policy_version 3270 (0.0005) [2023-03-06 23:14:18,744][81400] Updated weights for policy 0, policy_version 3280 (0.0006) [2023-03-06 23:14:19,501][81400] Updated weights for policy 0, policy_version 3290 (0.0005) [2023-03-06 23:14:20,273][81400] Updated weights for policy 0, policy_version 3300 (0.0007) [2023-03-06 23:14:21,036][81400] Updated weights for policy 0, policy_version 3310 (0.0006) [2023-03-06 23:14:21,236][81074] Fps is (10 sec: 13311.9, 60 sec: 13277.9, 300 sec: 13300.0). Total num frames: 3391488. Throughput: 0: 13288.6. Samples: 3363657. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:14:21,237][81074] Avg episode reward: [(0, '1997.208')] [2023-03-06 23:14:21,800][81400] Updated weights for policy 0, policy_version 3320 (0.0006) [2023-03-06 23:14:22,550][81400] Updated weights for policy 0, policy_version 3330 (0.0005) [2023-03-06 23:14:23,317][81400] Updated weights for policy 0, policy_version 3340 (0.0006) [2023-03-06 23:14:24,102][81400] Updated weights for policy 0, policy_version 3350 (0.0006) [2023-03-06 23:14:24,858][81400] Updated weights for policy 0, policy_version 3360 (0.0005) [2023-03-06 23:14:25,616][81400] Updated weights for policy 0, policy_version 3370 (0.0007) [2023-03-06 23:14:26,236][81074] Fps is (10 sec: 13312.2, 60 sec: 13277.9, 300 sec: 13300.2). Total num frames: 3458048. Throughput: 0: 13305.4. Samples: 3443897. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:14:26,237][81074] Avg episode reward: [(0, '1962.496')] [2023-03-06 23:14:26,389][81400] Updated weights for policy 0, policy_version 3380 (0.0006) [2023-03-06 23:14:27,168][81400] Updated weights for policy 0, policy_version 3390 (0.0006) [2023-03-06 23:14:27,932][81400] Updated weights for policy 0, policy_version 3400 (0.0006) [2023-03-06 23:14:28,697][81400] Updated weights for policy 0, policy_version 3410 (0.0006) [2023-03-06 23:14:29,461][81400] Updated weights for policy 0, policy_version 3420 (0.0006) [2023-03-06 23:14:30,220][81400] Updated weights for policy 0, policy_version 3430 (0.0006) [2023-03-06 23:14:31,007][81400] Updated weights for policy 0, policy_version 3440 (0.0006) [2023-03-06 23:14:31,236][81074] Fps is (10 sec: 13312.1, 60 sec: 13295.0, 300 sec: 13300.4). Total num frames: 3524608. Throughput: 0: 13306.9. Samples: 3523907. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-06 23:14:31,237][81074] Avg episode reward: [(0, '2173.038')] [2023-03-06 23:14:31,765][81400] Updated weights for policy 0, policy_version 3450 (0.0007) [2023-03-06 23:14:32,535][81400] Updated weights for policy 0, policy_version 3460 (0.0006) [2023-03-06 23:14:33,314][81400] Updated weights for policy 0, policy_version 3470 (0.0007) [2023-03-06 23:14:34,087][81400] Updated weights for policy 0, policy_version 3480 (0.0008) [2023-03-06 23:14:34,852][81400] Updated weights for policy 0, policy_version 3490 (0.0006) [2023-03-06 23:14:35,617][81400] Updated weights for policy 0, policy_version 3500 (0.0006) [2023-03-06 23:14:36,236][81074] Fps is (10 sec: 13414.3, 60 sec: 13312.0, 300 sec: 13304.4). Total num frames: 3592192. Throughput: 0: 13307.5. Samples: 3563723. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:14:36,237][81074] Avg episode reward: [(0, '2287.631')] [2023-03-06 23:14:36,377][81400] Updated weights for policy 0, policy_version 3510 (0.0006) [2023-03-06 23:14:37,168][81400] Updated weights for policy 0, policy_version 3520 (0.0006) [2023-03-06 23:14:37,917][81400] Updated weights for policy 0, policy_version 3530 (0.0006) [2023-03-06 23:14:38,696][81400] Updated weights for policy 0, policy_version 3540 (0.0007) [2023-03-06 23:14:39,469][81400] Updated weights for policy 0, policy_version 3550 (0.0006) [2023-03-06 23:14:40,234][81400] Updated weights for policy 0, policy_version 3560 (0.0006) [2023-03-06 23:14:41,003][81400] Updated weights for policy 0, policy_version 3570 (0.0006) [2023-03-06 23:14:41,236][81074] Fps is (10 sec: 13414.4, 60 sec: 13312.0, 300 sec: 13304.6). Total num frames: 3658752. Throughput: 0: 13313.9. Samples: 3643778. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:14:41,237][81074] Avg episode reward: [(0, '2243.347')] [2023-03-06 23:14:41,775][81400] Updated weights for policy 0, policy_version 3580 (0.0006) [2023-03-06 23:14:42,533][81400] Updated weights for policy 0, policy_version 3590 (0.0006) [2023-03-06 23:14:43,313][81400] Updated weights for policy 0, policy_version 3600 (0.0006) [2023-03-06 23:14:44,097][81400] Updated weights for policy 0, policy_version 3610 (0.0006) [2023-03-06 23:14:44,858][81400] Updated weights for policy 0, policy_version 3620 (0.0005) [2023-03-06 23:14:45,640][81400] Updated weights for policy 0, policy_version 3630 (0.0006) [2023-03-06 23:14:46,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13294.9, 300 sec: 13301.0). Total num frames: 3724288. Throughput: 0: 13311.6. Samples: 3723360. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:14:46,237][81074] Avg episode reward: [(0, '2510.743')] [2023-03-06 23:14:46,241][81349] Saving new best policy, reward=2510.743! 
[2023-03-06 23:14:46,418][81400] Updated weights for policy 0, policy_version 3640 (0.0006) [2023-03-06 23:14:47,168][81400] Updated weights for policy 0, policy_version 3650 (0.0005) [2023-03-06 23:14:47,952][81400] Updated weights for policy 0, policy_version 3660 (0.0005) [2023-03-06 23:14:48,730][81400] Updated weights for policy 0, policy_version 3670 (0.0006) [2023-03-06 23:14:49,518][81400] Updated weights for policy 0, policy_version 3680 (0.0006) [2023-03-06 23:14:50,283][81400] Updated weights for policy 0, policy_version 3690 (0.0006) [2023-03-06 23:14:51,071][81400] Updated weights for policy 0, policy_version 3700 (0.0006) [2023-03-06 23:14:51,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13295.0, 300 sec: 13301.2). Total num frames: 3790848. Throughput: 0: 13309.1. Samples: 3763045. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:14:51,237][81074] Avg episode reward: [(0, '2925.728')] [2023-03-06 23:14:51,237][81349] Saving new best policy, reward=2925.728! [2023-03-06 23:14:51,831][81400] Updated weights for policy 0, policy_version 3710 (0.0006) [2023-03-06 23:14:52,653][81400] Updated weights for policy 0, policy_version 3720 (0.0006) [2023-03-06 23:14:53,443][81400] Updated weights for policy 0, policy_version 3730 (0.0007) [2023-03-06 23:14:54,201][81400] Updated weights for policy 0, policy_version 3740 (0.0006) [2023-03-06 23:14:54,973][81400] Updated weights for policy 0, policy_version 3750 (0.0006) [2023-03-06 23:14:55,750][81400] Updated weights for policy 0, policy_version 3760 (0.0006) [2023-03-06 23:14:56,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13294.9, 300 sec: 13297.9). Total num frames: 3856384. Throughput: 0: 13281.5. Samples: 3841727. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:14:56,237][81074] Avg episode reward: [(0, '3060.884')] [2023-03-06 23:14:56,242][81349] Saving new best policy, reward=3060.884! 
[2023-03-06 23:14:56,516][81400] Updated weights for policy 0, policy_version 3770 (0.0007) [2023-03-06 23:14:57,283][81400] Updated weights for policy 0, policy_version 3780 (0.0006) [2023-03-06 23:14:58,058][81400] Updated weights for policy 0, policy_version 3790 (0.0006) [2023-03-06 23:14:58,840][81400] Updated weights for policy 0, policy_version 3800 (0.0006) [2023-03-06 23:14:59,628][81400] Updated weights for policy 0, policy_version 3810 (0.0006) [2023-03-06 23:15:00,393][81400] Updated weights for policy 0, policy_version 3820 (0.0006) [2023-03-06 23:15:01,178][81400] Updated weights for policy 0, policy_version 3830 (0.0006) [2023-03-06 23:15:01,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13295.0, 300 sec: 13298.1). Total num frames: 3922944. Throughput: 0: 13277.0. Samples: 3921185. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:15:01,237][81074] Avg episode reward: [(0, '3280.692')] [2023-03-06 23:15:01,237][81349] Saving new best policy, reward=3280.692! [2023-03-06 23:15:01,953][81400] Updated weights for policy 0, policy_version 3840 (0.0007) [2023-03-06 23:15:02,720][81400] Updated weights for policy 0, policy_version 3850 (0.0006) [2023-03-06 23:15:03,507][81400] Updated weights for policy 0, policy_version 3860 (0.0006) [2023-03-06 23:15:04,273][81400] Updated weights for policy 0, policy_version 3870 (0.0006) [2023-03-06 23:15:05,055][81400] Updated weights for policy 0, policy_version 3880 (0.0007) [2023-03-06 23:15:05,830][81400] Updated weights for policy 0, policy_version 3890 (0.0006) [2023-03-06 23:15:06,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13277.9, 300 sec: 13391.8). Total num frames: 3988480. Throughput: 0: 13268.2. Samples: 3960723. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:15:06,237][81074] Avg episode reward: [(0, '3202.286')] [2023-03-06 23:15:06,595][81400] Updated weights for policy 0, policy_version 3900 (0.0007) [2023-03-06 23:15:07,389][81400] Updated weights for policy 0, policy_version 3910 (0.0006) [2023-03-06 23:15:08,157][81400] Updated weights for policy 0, policy_version 3920 (0.0006) [2023-03-06 23:15:08,953][81400] Updated weights for policy 0, policy_version 3930 (0.0006) [2023-03-06 23:15:09,717][81400] Updated weights for policy 0, policy_version 3940 (0.0006) [2023-03-06 23:15:10,490][81400] Updated weights for policy 0, policy_version 3950 (0.0007) [2023-03-06 23:15:11,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13260.8, 300 sec: 13384.9). Total num frames: 4054016. Throughput: 0: 13241.0. Samples: 4039743. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-06 23:15:11,237][81074] Avg episode reward: [(0, '3463.698')] [2023-03-06 23:15:11,237][81349] Saving new best policy, reward=3463.698! [2023-03-06 23:15:11,288][81400] Updated weights for policy 0, policy_version 3960 (0.0006) [2023-03-06 23:15:12,047][81400] Updated weights for policy 0, policy_version 3970 (0.0006) [2023-03-06 23:15:12,829][81400] Updated weights for policy 0, policy_version 3980 (0.0005) [2023-03-06 23:15:13,601][81400] Updated weights for policy 0, policy_version 3990 (0.0006) [2023-03-06 23:15:14,386][81400] Updated weights for policy 0, policy_version 4000 (0.0005) [2023-03-06 23:15:15,166][81400] Updated weights for policy 0, policy_version 4010 (0.0006) [2023-03-06 23:15:15,928][81400] Updated weights for policy 0, policy_version 4020 (0.0006) [2023-03-06 23:15:16,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13243.8, 300 sec: 13378.0). Total num frames: 4119552. Throughput: 0: 13219.7. Samples: 4118793. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:15:16,237][81074] Avg episode reward: [(0, '3120.754')] [2023-03-06 23:15:16,703][81400] Updated weights for policy 0, policy_version 4030 (0.0007) [2023-03-06 23:15:17,486][81400] Updated weights for policy 0, policy_version 4040 (0.0006) [2023-03-06 23:15:18,243][81400] Updated weights for policy 0, policy_version 4050 (0.0006) [2023-03-06 23:15:19,010][81400] Updated weights for policy 0, policy_version 4060 (0.0006) [2023-03-06 23:15:19,792][81400] Updated weights for policy 0, policy_version 4070 (0.0007) [2023-03-06 23:15:20,566][81400] Updated weights for policy 0, policy_version 4080 (0.0006) [2023-03-06 23:15:21,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13374.5). Total num frames: 4186112. Throughput: 0: 13219.1. Samples: 4158585. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:15:21,237][81074] Avg episode reward: [(0, '3471.356')] [2023-03-06 23:15:21,238][81349] Saving new best policy, reward=3471.356! [2023-03-06 23:15:21,348][81400] Updated weights for policy 0, policy_version 4090 (0.0006) [2023-03-06 23:15:22,129][81400] Updated weights for policy 0, policy_version 4100 (0.0007) [2023-03-06 23:15:22,915][81400] Updated weights for policy 0, policy_version 4110 (0.0007) [2023-03-06 23:15:23,691][81400] Updated weights for policy 0, policy_version 4120 (0.0006) [2023-03-06 23:15:24,469][81400] Updated weights for policy 0, policy_version 4130 (0.0006) [2023-03-06 23:15:25,239][81400] Updated weights for policy 0, policy_version 4140 (0.0007) [2023-03-06 23:15:26,029][81400] Updated weights for policy 0, policy_version 4150 (0.0006) [2023-03-06 23:15:26,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13364.1). Total num frames: 4251648. Throughput: 0: 13194.3. Samples: 4237523. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:15:26,237][81074] Avg episode reward: [(0, '3347.118')] [2023-03-06 23:15:26,802][81400] Updated weights for policy 0, policy_version 4160 (0.0006) [2023-03-06 23:15:27,592][81400] Updated weights for policy 0, policy_version 4170 (0.0006) [2023-03-06 23:15:28,365][81400] Updated weights for policy 0, policy_version 4180 (0.0006) [2023-03-06 23:15:29,148][81400] Updated weights for policy 0, policy_version 4190 (0.0007) [2023-03-06 23:15:29,914][81400] Updated weights for policy 0, policy_version 4200 (0.0006) [2023-03-06 23:15:30,697][81400] Updated weights for policy 0, policy_version 4210 (0.0006) [2023-03-06 23:15:31,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13209.6, 300 sec: 13357.1). Total num frames: 4317184. Throughput: 0: 13179.3. Samples: 4316428. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:15:31,237][81074] Avg episode reward: [(0, '3501.222')] [2023-03-06 23:15:31,238][81349] Saving new best policy, reward=3501.222! [2023-03-06 23:15:31,459][81400] Updated weights for policy 0, policy_version 4220 (0.0006) [2023-03-06 23:15:32,244][81400] Updated weights for policy 0, policy_version 4230 (0.0006) [2023-03-06 23:15:33,028][81400] Updated weights for policy 0, policy_version 4240 (0.0006) [2023-03-06 23:15:33,798][81400] Updated weights for policy 0, policy_version 4250 (0.0006) [2023-03-06 23:15:34,582][81400] Updated weights for policy 0, policy_version 4260 (0.0007) [2023-03-06 23:15:35,384][81400] Updated weights for policy 0, policy_version 4270 (0.0006) [2023-03-06 23:15:36,162][81400] Updated weights for policy 0, policy_version 4280 (0.0006) [2023-03-06 23:15:36,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13192.5, 300 sec: 13350.2). Total num frames: 4383744. Throughput: 0: 13178.5. Samples: 4356078. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:15:36,237][81074] Avg episode reward: [(0, '3287.581')] [2023-03-06 23:15:36,948][81400] Updated weights for policy 0, policy_version 4290 (0.0006) [2023-03-06 23:15:37,706][81400] Updated weights for policy 0, policy_version 4300 (0.0006) [2023-03-06 23:15:38,479][81400] Updated weights for policy 0, policy_version 4310 (0.0006) [2023-03-06 23:15:39,254][81400] Updated weights for policy 0, policy_version 4320 (0.0006) [2023-03-06 23:15:40,047][81400] Updated weights for policy 0, policy_version 4330 (0.0006) [2023-03-06 23:15:40,818][81400] Updated weights for policy 0, policy_version 4340 (0.0006) [2023-03-06 23:15:41,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13175.4, 300 sec: 13343.2). Total num frames: 4449280. Throughput: 0: 13180.7. Samples: 4434860. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:15:41,237][81074] Avg episode reward: [(0, '3273.536')] [2023-03-06 23:15:41,598][81400] Updated weights for policy 0, policy_version 4350 (0.0006) [2023-03-06 23:15:42,380][81400] Updated weights for policy 0, policy_version 4360 (0.0006) [2023-03-06 23:15:43,157][81400] Updated weights for policy 0, policy_version 4370 (0.0006) [2023-03-06 23:15:43,932][81400] Updated weights for policy 0, policy_version 4380 (0.0007) [2023-03-06 23:15:44,703][81400] Updated weights for policy 0, policy_version 4390 (0.0006) [2023-03-06 23:15:45,478][81400] Updated weights for policy 0, policy_version 4400 (0.0006) [2023-03-06 23:15:46,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13175.5, 300 sec: 13336.3). Total num frames: 4514816. Throughput: 0: 13169.4. Samples: 4513810. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:15:46,237][81074] Avg episode reward: [(0, '3437.424')] [2023-03-06 23:15:46,274][81400] Updated weights for policy 0, policy_version 4410 (0.0006) [2023-03-06 23:15:47,041][81400] Updated weights for policy 0, policy_version 4420 (0.0006) [2023-03-06 23:15:47,828][81400] Updated weights for policy 0, policy_version 4430 (0.0006) [2023-03-06 23:15:48,607][81400] Updated weights for policy 0, policy_version 4440 (0.0006) [2023-03-06 23:15:49,372][81400] Updated weights for policy 0, policy_version 4450 (0.0006) [2023-03-06 23:15:50,165][81400] Updated weights for policy 0, policy_version 4460 (0.0006) [2023-03-06 23:15:50,941][81400] Updated weights for policy 0, policy_version 4470 (0.0007) [2023-03-06 23:15:51,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13325.9). Total num frames: 4580352. Throughput: 0: 13166.2. Samples: 4553204. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 23:15:51,237][81074] Avg episode reward: [(0, '3463.486')] [2023-03-06 23:15:51,706][81400] Updated weights for policy 0, policy_version 4480 (0.0006) [2023-03-06 23:15:52,517][81400] Updated weights for policy 0, policy_version 4490 (0.0006) [2023-03-06 23:15:53,269][81400] Updated weights for policy 0, policy_version 4500 (0.0006) [2023-03-06 23:15:54,053][81400] Updated weights for policy 0, policy_version 4510 (0.0007) [2023-03-06 23:15:54,829][81400] Updated weights for policy 0, policy_version 4520 (0.0005) [2023-03-06 23:15:55,606][81400] Updated weights for policy 0, policy_version 4530 (0.0006) [2023-03-06 23:15:56,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13322.4). Total num frames: 4646912. Throughput: 0: 13164.8. Samples: 4632162. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:15:56,237][81074] Avg episode reward: [(0, '3435.348')] [2023-03-06 23:15:56,241][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000004538_4646912.pth... [2023-03-06 23:15:56,270][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000001429_1463296.pth [2023-03-06 23:15:56,371][81400] Updated weights for policy 0, policy_version 4540 (0.0006) [2023-03-06 23:15:57,147][81400] Updated weights for policy 0, policy_version 4550 (0.0006) [2023-03-06 23:15:57,936][81400] Updated weights for policy 0, policy_version 4560 (0.0006) [2023-03-06 23:15:58,715][81400] Updated weights for policy 0, policy_version 4570 (0.0006) [2023-03-06 23:15:59,479][81400] Updated weights for policy 0, policy_version 4580 (0.0006) [2023-03-06 23:16:00,252][81400] Updated weights for policy 0, policy_version 4590 (0.0006) [2023-03-06 23:16:01,042][81400] Updated weights for policy 0, policy_version 4600 (0.0006) [2023-03-06 23:16:01,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13315.5). Total num frames: 4712448. Throughput: 0: 13165.4. Samples: 4711237. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:16:01,237][81074] Avg episode reward: [(0, '3550.360')] [2023-03-06 23:16:01,237][81349] Saving new best policy, reward=3550.360! 
[2023-03-06 23:16:01,815][81400] Updated weights for policy 0, policy_version 4610 (0.0006) [2023-03-06 23:16:02,585][81400] Updated weights for policy 0, policy_version 4620 (0.0006) [2023-03-06 23:16:03,360][81400] Updated weights for policy 0, policy_version 4630 (0.0007) [2023-03-06 23:16:04,146][81400] Updated weights for policy 0, policy_version 4640 (0.0006) [2023-03-06 23:16:04,924][81400] Updated weights for policy 0, policy_version 4650 (0.0006) [2023-03-06 23:16:05,725][81400] Updated weights for policy 0, policy_version 4660 (0.0006) [2023-03-06 23:16:06,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13305.1). Total num frames: 4777984. Throughput: 0: 13156.1. Samples: 4750610. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:16:06,237][81074] Avg episode reward: [(0, '3644.104')] [2023-03-06 23:16:06,255][81349] Saving new best policy, reward=3644.104! [2023-03-06 23:16:06,504][81400] Updated weights for policy 0, policy_version 4670 (0.0006) [2023-03-06 23:16:07,304][81400] Updated weights for policy 0, policy_version 4680 (0.0006) [2023-03-06 23:16:08,091][81400] Updated weights for policy 0, policy_version 4690 (0.0006) [2023-03-06 23:16:08,864][81400] Updated weights for policy 0, policy_version 4700 (0.0006) [2023-03-06 23:16:09,651][81400] Updated weights for policy 0, policy_version 4710 (0.0005) [2023-03-06 23:16:10,433][81400] Updated weights for policy 0, policy_version 4720 (0.0006) [2023-03-06 23:16:11,209][81400] Updated weights for policy 0, policy_version 4730 (0.0006) [2023-03-06 23:16:11,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13298.1). Total num frames: 4843520. Throughput: 0: 13142.6. Samples: 4828939. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:16:11,237][81074] Avg episode reward: [(0, '3621.575')] [2023-03-06 23:16:11,985][81400] Updated weights for policy 0, policy_version 4740 (0.0005) [2023-03-06 23:16:12,758][81400] Updated weights for policy 0, policy_version 4750 (0.0007) [2023-03-06 23:16:13,530][81400] Updated weights for policy 0, policy_version 4760 (0.0007) [2023-03-06 23:16:14,324][81400] Updated weights for policy 0, policy_version 4770 (0.0006) [2023-03-06 23:16:15,103][81400] Updated weights for policy 0, policy_version 4780 (0.0006) [2023-03-06 23:16:15,894][81400] Updated weights for policy 0, policy_version 4790 (0.0006) [2023-03-06 23:16:16,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13291.2). Total num frames: 4909056. Throughput: 0: 13137.1. Samples: 4907602. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-06 23:16:16,237][81074] Avg episode reward: [(0, '3551.924')] [2023-03-06 23:16:16,683][81400] Updated weights for policy 0, policy_version 4800 (0.0006) [2023-03-06 23:16:17,452][81400] Updated weights for policy 0, policy_version 4810 (0.0006) [2023-03-06 23:16:18,237][81400] Updated weights for policy 0, policy_version 4820 (0.0006) [2023-03-06 23:16:19,011][81400] Updated weights for policy 0, policy_version 4830 (0.0006) [2023-03-06 23:16:19,791][81400] Updated weights for policy 0, policy_version 4840 (0.0007) [2023-03-06 23:16:20,561][81400] Updated weights for policy 0, policy_version 4850 (0.0007) [2023-03-06 23:16:21,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13280.8). Total num frames: 4974592. Throughput: 0: 13134.1. Samples: 4947109. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:16:21,237][81074] Avg episode reward: [(0, '3412.233')] [2023-03-06 23:16:21,336][81400] Updated weights for policy 0, policy_version 4860 (0.0006) [2023-03-06 23:16:22,115][81400] Updated weights for policy 0, policy_version 4870 (0.0006) [2023-03-06 23:16:22,896][81400] Updated weights for policy 0, policy_version 4880 (0.0005) [2023-03-06 23:16:23,660][81400] Updated weights for policy 0, policy_version 4890 (0.0006) [2023-03-06 23:16:24,455][81400] Updated weights for policy 0, policy_version 4900 (0.0006) [2023-03-06 23:16:25,239][81400] Updated weights for policy 0, policy_version 4910 (0.0006) [2023-03-06 23:16:26,010][81400] Updated weights for policy 0, policy_version 4920 (0.0006) [2023-03-06 23:16:26,236][81074] Fps is (10 sec: 13209.8, 60 sec: 13158.4, 300 sec: 13280.8). Total num frames: 5041152. Throughput: 0: 13140.8. Samples: 5026194. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:16:26,237][81074] Avg episode reward: [(0, '3399.901')] [2023-03-06 23:16:26,778][81400] Updated weights for policy 0, policy_version 4930 (0.0005) [2023-03-06 23:16:27,550][81400] Updated weights for policy 0, policy_version 4940 (0.0006) [2023-03-06 23:16:28,335][81400] Updated weights for policy 0, policy_version 4950 (0.0006) [2023-03-06 23:16:29,107][81400] Updated weights for policy 0, policy_version 4960 (0.0006) [2023-03-06 23:16:29,891][81400] Updated weights for policy 0, policy_version 4970 (0.0006) [2023-03-06 23:16:30,658][81400] Updated weights for policy 0, policy_version 4980 (0.0007) [2023-03-06 23:16:31,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13270.3). Total num frames: 5106688. Throughput: 0: 13146.1. Samples: 5105386. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-06 23:16:31,237][81074] Avg episode reward: [(0, '3355.654')] [2023-03-06 23:16:31,428][81400] Updated weights for policy 0, policy_version 4990 (0.0006) [2023-03-06 23:16:32,225][81400] Updated weights for policy 0, policy_version 5000 (0.0006) [2023-03-06 23:16:33,001][81400] Updated weights for policy 0, policy_version 5010 (0.0006) [2023-03-06 23:16:33,764][81400] Updated weights for policy 0, policy_version 5020 (0.0007) [2023-03-06 23:16:34,542][81400] Updated weights for policy 0, policy_version 5030 (0.0008) [2023-03-06 23:16:35,330][81400] Updated weights for policy 0, policy_version 5040 (0.0006) [2023-03-06 23:16:36,094][81400] Updated weights for policy 0, policy_version 5050 (0.0005) [2023-03-06 23:16:36,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13259.9). Total num frames: 5172224. Throughput: 0: 13147.3. Samples: 5144835. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:16:36,237][81074] Avg episode reward: [(0, '3465.663')] [2023-03-06 23:16:36,890][81400] Updated weights for policy 0, policy_version 5060 (0.0006) [2023-03-06 23:16:37,670][81400] Updated weights for policy 0, policy_version 5070 (0.0006) [2023-03-06 23:16:38,437][81400] Updated weights for policy 0, policy_version 5080 (0.0006) [2023-03-06 23:16:39,200][81400] Updated weights for policy 0, policy_version 5090 (0.0006) [2023-03-06 23:16:39,990][81400] Updated weights for policy 0, policy_version 5100 (0.0006) [2023-03-06 23:16:40,758][81400] Updated weights for policy 0, policy_version 5110 (0.0006) [2023-03-06 23:16:41,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13259.9). Total num frames: 5238784. Throughput: 0: 13150.9. Samples: 5223952. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:16:41,237][81074] Avg episode reward: [(0, '3526.977')] [2023-03-06 23:16:41,543][81400] Updated weights for policy 0, policy_version 5120 (0.0006) [2023-03-06 23:16:42,314][81400] Updated weights for policy 0, policy_version 5130 (0.0006) [2023-03-06 23:16:43,095][81400] Updated weights for policy 0, policy_version 5140 (0.0006) [2023-03-06 23:16:43,862][81400] Updated weights for policy 0, policy_version 5150 (0.0006) [2023-03-06 23:16:44,637][81400] Updated weights for policy 0, policy_version 5160 (0.0006) [2023-03-06 23:16:45,404][81400] Updated weights for policy 0, policy_version 5170 (0.0005) [2023-03-06 23:16:46,195][81400] Updated weights for policy 0, policy_version 5180 (0.0007) [2023-03-06 23:16:46,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13249.5). Total num frames: 5304320. Throughput: 0: 13152.8. Samples: 5303112. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:16:46,237][81074] Avg episode reward: [(0, '3464.730')] [2023-03-06 23:16:46,986][81400] Updated weights for policy 0, policy_version 5190 (0.0006) [2023-03-06 23:16:47,747][81400] Updated weights for policy 0, policy_version 5200 (0.0005) [2023-03-06 23:16:48,521][81400] Updated weights for policy 0, policy_version 5210 (0.0006) [2023-03-06 23:16:49,292][81400] Updated weights for policy 0, policy_version 5220 (0.0006) [2023-03-06 23:16:50,076][81400] Updated weights for policy 0, policy_version 5230 (0.0006) [2023-03-06 23:16:50,851][81400] Updated weights for policy 0, policy_version 5240 (0.0007) [2023-03-06 23:16:51,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13242.6). Total num frames: 5369856. Throughput: 0: 13159.5. Samples: 5342786. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:16:51,237][81074] Avg episode reward: [(0, '3477.967')] [2023-03-06 23:16:51,634][81400] Updated weights for policy 0, policy_version 5250 (0.0007) [2023-03-06 23:16:52,434][81400] Updated weights for policy 0, policy_version 5260 (0.0006) [2023-03-06 23:16:53,193][81400] Updated weights for policy 0, policy_version 5270 (0.0006) [2023-03-06 23:16:53,975][81400] Updated weights for policy 0, policy_version 5280 (0.0006) [2023-03-06 23:16:54,762][81400] Updated weights for policy 0, policy_version 5290 (0.0006) [2023-03-06 23:16:55,547][81400] Updated weights for policy 0, policy_version 5300 (0.0006) [2023-03-06 23:16:56,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13235.6). Total num frames: 5435392. Throughput: 0: 13163.7. Samples: 5421305. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:16:56,237][81074] Avg episode reward: [(0, '3418.122')] [2023-03-06 23:16:56,327][81400] Updated weights for policy 0, policy_version 5310 (0.0005) [2023-03-06 23:16:57,104][81400] Updated weights for policy 0, policy_version 5320 (0.0006) [2023-03-06 23:16:57,878][81400] Updated weights for policy 0, policy_version 5330 (0.0006) [2023-03-06 23:16:58,656][81400] Updated weights for policy 0, policy_version 5340 (0.0006) [2023-03-06 23:16:59,436][81400] Updated weights for policy 0, policy_version 5350 (0.0006) [2023-03-06 23:17:00,205][81400] Updated weights for policy 0, policy_version 5360 (0.0006) [2023-03-06 23:17:00,977][81400] Updated weights for policy 0, policy_version 5370 (0.0006) [2023-03-06 23:17:01,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13232.2). Total num frames: 5501952. Throughput: 0: 13174.8. Samples: 5500468. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:17:01,237][81074] Avg episode reward: [(0, '3431.979')] [2023-03-06 23:17:01,745][81400] Updated weights for policy 0, policy_version 5380 (0.0006) [2023-03-06 23:17:02,518][81400] Updated weights for policy 0, policy_version 5390 (0.0006) [2023-03-06 23:17:03,306][81400] Updated weights for policy 0, policy_version 5400 (0.0006) [2023-03-06 23:17:04,058][81400] Updated weights for policy 0, policy_version 5410 (0.0006) [2023-03-06 23:17:04,850][81400] Updated weights for policy 0, policy_version 5420 (0.0006) [2023-03-06 23:17:05,647][81400] Updated weights for policy 0, policy_version 5430 (0.0006) [2023-03-06 23:17:06,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13228.7). Total num frames: 5567488. Throughput: 0: 13182.5. Samples: 5540324. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:17:06,237][81074] Avg episode reward: [(0, '3331.931')] [2023-03-06 23:17:06,432][81400] Updated weights for policy 0, policy_version 5440 (0.0006) [2023-03-06 23:17:07,214][81400] Updated weights for policy 0, policy_version 5450 (0.0007) [2023-03-06 23:17:07,980][81400] Updated weights for policy 0, policy_version 5460 (0.0007) [2023-03-06 23:17:08,752][81400] Updated weights for policy 0, policy_version 5470 (0.0006) [2023-03-06 23:17:09,522][81400] Updated weights for policy 0, policy_version 5480 (0.0007) [2023-03-06 23:17:10,286][81400] Updated weights for policy 0, policy_version 5490 (0.0007) [2023-03-06 23:17:11,067][81400] Updated weights for policy 0, policy_version 5500 (0.0005) [2023-03-06 23:17:11,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13225.2). Total num frames: 5634048. Throughput: 0: 13181.0. Samples: 5619340. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:17:11,237][81074] Avg episode reward: [(0, '3412.101')] [2023-03-06 23:17:11,843][81400] Updated weights for policy 0, policy_version 5510 (0.0005) [2023-03-06 23:17:12,616][81400] Updated weights for policy 0, policy_version 5520 (0.0006) [2023-03-06 23:17:13,404][81400] Updated weights for policy 0, policy_version 5530 (0.0006) [2023-03-06 23:17:14,182][81400] Updated weights for policy 0, policy_version 5540 (0.0006) [2023-03-06 23:17:14,958][81400] Updated weights for policy 0, policy_version 5550 (0.0006) [2023-03-06 23:17:15,775][81400] Updated weights for policy 0, policy_version 5560 (0.0007) [2023-03-06 23:17:16,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13221.8). Total num frames: 5698560. Throughput: 0: 13167.5. Samples: 5697924. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:17:16,237][81074] Avg episode reward: [(0, '2897.922')] [2023-03-06 23:17:16,562][81400] Updated weights for policy 0, policy_version 5570 (0.0006) [2023-03-06 23:17:17,321][81400] Updated weights for policy 0, policy_version 5580 (0.0006) [2023-03-06 23:17:18,108][81400] Updated weights for policy 0, policy_version 5590 (0.0007) [2023-03-06 23:17:18,877][81400] Updated weights for policy 0, policy_version 5600 (0.0006) [2023-03-06 23:17:19,653][81400] Updated weights for policy 0, policy_version 5610 (0.0006) [2023-03-06 23:17:20,429][81400] Updated weights for policy 0, policy_version 5620 (0.0006) [2023-03-06 23:17:21,197][81400] Updated weights for policy 0, policy_version 5630 (0.0006) [2023-03-06 23:17:21,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13175.5, 300 sec: 13221.8). Total num frames: 5765120. Throughput: 0: 13169.9. Samples: 5737481. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:17:21,237][81074] Avg episode reward: [(0, '3273.300')] [2023-03-06 23:17:21,981][81400] Updated weights for policy 0, policy_version 5640 (0.0006) [2023-03-06 23:17:22,749][81400] Updated weights for policy 0, policy_version 5650 (0.0007) [2023-03-06 23:17:23,518][81400] Updated weights for policy 0, policy_version 5660 (0.0005) [2023-03-06 23:17:24,311][81400] Updated weights for policy 0, policy_version 5670 (0.0006) [2023-03-06 23:17:25,093][81400] Updated weights for policy 0, policy_version 5680 (0.0006) [2023-03-06 23:17:25,849][81400] Updated weights for policy 0, policy_version 5690 (0.0006) [2023-03-06 23:17:26,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13218.3). Total num frames: 5830656. Throughput: 0: 13167.6. Samples: 5816496. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:17:26,237][81074] Avg episode reward: [(0, '3506.027')] [2023-03-06 23:17:26,634][81400] Updated weights for policy 0, policy_version 5700 (0.0006) [2023-03-06 23:17:27,421][81400] Updated weights for policy 0, policy_version 5710 (0.0006) [2023-03-06 23:17:28,195][81400] Updated weights for policy 0, policy_version 5720 (0.0006) [2023-03-06 23:17:28,995][81400] Updated weights for policy 0, policy_version 5730 (0.0007) [2023-03-06 23:17:29,776][81400] Updated weights for policy 0, policy_version 5740 (0.0007) [2023-03-06 23:17:30,563][81400] Updated weights for policy 0, policy_version 5750 (0.0007) [2023-03-06 23:17:31,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13214.8). Total num frames: 5896192. Throughput: 0: 13158.7. Samples: 5895255. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:17:31,237][81074] Avg episode reward: [(0, '3175.119')] [2023-03-06 23:17:31,335][81400] Updated weights for policy 0, policy_version 5760 (0.0006) [2023-03-06 23:17:32,109][81400] Updated weights for policy 0, policy_version 5770 (0.0006) [2023-03-06 23:17:32,906][81400] Updated weights for policy 0, policy_version 5780 (0.0006) [2023-03-06 23:17:33,670][81400] Updated weights for policy 0, policy_version 5790 (0.0006) [2023-03-06 23:17:34,443][81400] Updated weights for policy 0, policy_version 5800 (0.0006) [2023-03-06 23:17:35,228][81400] Updated weights for policy 0, policy_version 5810 (0.0006) [2023-03-06 23:17:35,999][81400] Updated weights for policy 0, policy_version 5820 (0.0007) [2023-03-06 23:17:36,236][81074] Fps is (10 sec: 13107.0, 60 sec: 13158.4, 300 sec: 13211.3). Total num frames: 5961728. Throughput: 0: 13154.0. Samples: 5934718. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:17:36,248][81074] Avg episode reward: [(0, '3344.035')] [2023-03-06 23:17:36,778][81400] Updated weights for policy 0, policy_version 5830 (0.0006) [2023-03-06 23:17:37,563][81400] Updated weights for policy 0, policy_version 5840 (0.0006) [2023-03-06 23:17:38,336][81400] Updated weights for policy 0, policy_version 5850 (0.0006) [2023-03-06 23:17:39,121][81400] Updated weights for policy 0, policy_version 5860 (0.0006) [2023-03-06 23:17:39,887][81400] Updated weights for policy 0, policy_version 5870 (0.0007) [2023-03-06 23:17:40,673][81400] Updated weights for policy 0, policy_version 5880 (0.0006) [2023-03-06 23:17:41,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13211.3). Total num frames: 6028288. Throughput: 0: 13165.4. Samples: 6013748. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:17:41,247][81074] Avg episode reward: [(0, '3344.668')] [2023-03-06 23:17:41,441][81400] Updated weights for policy 0, policy_version 5890 (0.0005) [2023-03-06 23:17:42,222][81400] Updated weights for policy 0, policy_version 5900 (0.0006) [2023-03-06 23:17:42,995][81400] Updated weights for policy 0, policy_version 5910 (0.0005) [2023-03-06 23:17:43,782][81400] Updated weights for policy 0, policy_version 5920 (0.0007) [2023-03-06 23:17:44,557][81400] Updated weights for policy 0, policy_version 5930 (0.0006) [2023-03-06 23:17:45,350][81400] Updated weights for policy 0, policy_version 5940 (0.0006) [2023-03-06 23:17:46,106][81400] Updated weights for policy 0, policy_version 5950 (0.0007) [2023-03-06 23:17:46,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13207.9). Total num frames: 6093824. Throughput: 0: 13162.1. Samples: 6092762. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:17:46,247][81074] Avg episode reward: [(0, '3365.525')] [2023-03-06 23:17:46,897][81400] Updated weights for policy 0, policy_version 5960 (0.0006) [2023-03-06 23:17:47,690][81400] Updated weights for policy 0, policy_version 5970 (0.0006) [2023-03-06 23:17:48,470][81400] Updated weights for policy 0, policy_version 5980 (0.0006) [2023-03-06 23:17:49,245][81400] Updated weights for policy 0, policy_version 5990 (0.0007) [2023-03-06 23:17:50,026][81400] Updated weights for policy 0, policy_version 6000 (0.0006) [2023-03-06 23:17:50,769][81400] Updated weights for policy 0, policy_version 6010 (0.0006) [2023-03-06 23:17:51,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13204.4). Total num frames: 6159360. Throughput: 0: 13147.6. Samples: 6131965. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:17:51,247][81074] Avg episode reward: [(0, '3338.489')] [2023-03-06 23:17:51,543][81400] Updated weights for policy 0, policy_version 6020 (0.0006) [2023-03-06 23:17:52,322][81400] Updated weights for policy 0, policy_version 6030 (0.0006) [2023-03-06 23:17:53,106][81400] Updated weights for policy 0, policy_version 6040 (0.0006) [2023-03-06 23:17:53,874][81400] Updated weights for policy 0, policy_version 6050 (0.0006) [2023-03-06 23:17:54,667][81400] Updated weights for policy 0, policy_version 6060 (0.0006) [2023-03-06 23:17:55,458][81400] Updated weights for policy 0, policy_version 6070 (0.0005) [2023-03-06 23:17:56,236][81074] Fps is (10 sec: 13107.0, 60 sec: 13158.4, 300 sec: 13204.4). Total num frames: 6224896. Throughput: 0: 13150.2. Samples: 6211099. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:17:56,248][81074] Avg episode reward: [(0, '3627.573')] [2023-03-06 23:17:56,249][81400] Updated weights for policy 0, policy_version 6080 (0.0006) [2023-03-06 23:17:56,253][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000006080_6225920.pth... 
[2023-03-06 23:17:56,282][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000002988_3059712.pth [2023-03-06 23:17:57,009][81400] Updated weights for policy 0, policy_version 6090 (0.0006) [2023-03-06 23:17:57,799][81400] Updated weights for policy 0, policy_version 6100 (0.0006) [2023-03-06 23:17:58,598][81400] Updated weights for policy 0, policy_version 6110 (0.0006) [2023-03-06 23:17:59,382][81400] Updated weights for policy 0, policy_version 6120 (0.0007) [2023-03-06 23:18:00,167][81400] Updated weights for policy 0, policy_version 6130 (0.0006) [2023-03-06 23:18:00,930][81400] Updated weights for policy 0, policy_version 6140 (0.0006) [2023-03-06 23:18:01,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13200.9). Total num frames: 6290432. Throughput: 0: 13147.3. Samples: 6289554. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:18:01,247][81074] Avg episode reward: [(0, '3568.633')] [2023-03-06 23:18:01,724][81400] Updated weights for policy 0, policy_version 6150 (0.0006) [2023-03-06 23:18:02,507][81400] Updated weights for policy 0, policy_version 6160 (0.0006) [2023-03-06 23:18:03,294][81400] Updated weights for policy 0, policy_version 6170 (0.0006) [2023-03-06 23:18:04,066][81400] Updated weights for policy 0, policy_version 6180 (0.0006) [2023-03-06 23:18:04,847][81400] Updated weights for policy 0, policy_version 6190 (0.0006) [2023-03-06 23:18:05,631][81400] Updated weights for policy 0, policy_version 6200 (0.0006) [2023-03-06 23:18:06,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13200.9). Total num frames: 6355968. Throughput: 0: 13140.7. Samples: 6328813. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:18:06,247][81074] Avg episode reward: [(0, '3564.102')] [2023-03-06 23:18:06,409][81400] Updated weights for policy 0, policy_version 6210 (0.0006) [2023-03-06 23:18:07,190][81400] Updated weights for policy 0, policy_version 6220 (0.0006) [2023-03-06 23:18:07,977][81400] Updated weights for policy 0, policy_version 6230 (0.0006) [2023-03-06 23:18:08,740][81400] Updated weights for policy 0, policy_version 6240 (0.0006) [2023-03-06 23:18:09,529][81400] Updated weights for policy 0, policy_version 6250 (0.0006) [2023-03-06 23:18:10,288][81400] Updated weights for policy 0, policy_version 6260 (0.0006) [2023-03-06 23:18:11,048][81400] Updated weights for policy 0, policy_version 6270 (0.0006) [2023-03-06 23:18:11,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13141.3, 300 sec: 13200.9). Total num frames: 6422528. Throughput: 0: 13139.4. Samples: 6407768. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:18:11,247][81074] Avg episode reward: [(0, '3660.722')] [2023-03-06 23:18:11,248][81349] Saving new best policy, reward=3660.722! [2023-03-06 23:18:11,810][81400] Updated weights for policy 0, policy_version 6280 (0.0006) [2023-03-06 23:18:12,598][81400] Updated weights for policy 0, policy_version 6290 (0.0007) [2023-03-06 23:18:13,383][81400] Updated weights for policy 0, policy_version 6300 (0.0006) [2023-03-06 23:18:14,152][81400] Updated weights for policy 0, policy_version 6310 (0.0006) [2023-03-06 23:18:14,910][81400] Updated weights for policy 0, policy_version 6320 (0.0006) [2023-03-06 23:18:15,700][81400] Updated weights for policy 0, policy_version 6330 (0.0008) [2023-03-06 23:18:16,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13197.4). Total num frames: 6488064. Throughput: 0: 13153.1. Samples: 6487146. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:18:16,248][81074] Avg episode reward: [(0, '3262.310')] [2023-03-06 23:18:16,478][81400] Updated weights for policy 0, policy_version 6340 (0.0006) [2023-03-06 23:18:17,262][81400] Updated weights for policy 0, policy_version 6350 (0.0007) [2023-03-06 23:18:18,020][81400] Updated weights for policy 0, policy_version 6360 (0.0006) [2023-03-06 23:18:18,790][81400] Updated weights for policy 0, policy_version 6370 (0.0006) [2023-03-06 23:18:19,579][81400] Updated weights for policy 0, policy_version 6380 (0.0005) [2023-03-06 23:18:20,358][81400] Updated weights for policy 0, policy_version 6390 (0.0006) [2023-03-06 23:18:21,130][81400] Updated weights for policy 0, policy_version 6400 (0.0006) [2023-03-06 23:18:21,236][81074] Fps is (10 sec: 13209.8, 60 sec: 13158.4, 300 sec: 13197.5). Total num frames: 6554624. Throughput: 0: 13161.2. Samples: 6526968. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:18:21,237][81074] Avg episode reward: [(0, '3585.885')] [2023-03-06 23:18:21,895][81400] Updated weights for policy 0, policy_version 6410 (0.0006) [2023-03-06 23:18:22,676][81400] Updated weights for policy 0, policy_version 6420 (0.0006) [2023-03-06 23:18:23,442][81400] Updated weights for policy 0, policy_version 6430 (0.0007) [2023-03-06 23:18:24,250][81400] Updated weights for policy 0, policy_version 6440 (0.0005) [2023-03-06 23:18:25,008][81400] Updated weights for policy 0, policy_version 6450 (0.0006) [2023-03-06 23:18:25,778][81400] Updated weights for policy 0, policy_version 6460 (0.0007) [2023-03-06 23:18:26,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13197.4). Total num frames: 6620160. Throughput: 0: 13162.8. Samples: 6606075. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 23:18:26,237][81074] Avg episode reward: [(0, '3654.251')] [2023-03-06 23:18:26,568][81400] Updated weights for policy 0, policy_version 6470 (0.0006) [2023-03-06 23:18:27,354][81400] Updated weights for policy 0, policy_version 6480 (0.0006) [2023-03-06 23:18:28,132][81400] Updated weights for policy 0, policy_version 6490 (0.0006) [2023-03-06 23:18:28,913][81400] Updated weights for policy 0, policy_version 6500 (0.0006) [2023-03-06 23:18:29,701][81400] Updated weights for policy 0, policy_version 6510 (0.0007) [2023-03-06 23:18:30,493][81400] Updated weights for policy 0, policy_version 6520 (0.0005) [2023-03-06 23:18:31,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13194.0). Total num frames: 6685696. Throughput: 0: 13149.8. Samples: 6684504. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:18:31,237][81074] Avg episode reward: [(0, '3515.920')] [2023-03-06 23:18:31,275][81400] Updated weights for policy 0, policy_version 6530 (0.0006) [2023-03-06 23:18:32,050][81400] Updated weights for policy 0, policy_version 6540 (0.0007) [2023-03-06 23:18:32,823][81400] Updated weights for policy 0, policy_version 6550 (0.0006) [2023-03-06 23:18:33,609][81400] Updated weights for policy 0, policy_version 6560 (0.0006) [2023-03-06 23:18:34,381][81400] Updated weights for policy 0, policy_version 6570 (0.0006) [2023-03-06 23:18:35,140][81400] Updated weights for policy 0, policy_version 6580 (0.0006) [2023-03-06 23:18:35,927][81400] Updated weights for policy 0, policy_version 6590 (0.0006) [2023-03-06 23:18:36,237][81074] Fps is (10 sec: 13208.1, 60 sec: 13175.2, 300 sec: 13193.9). Total num frames: 6752256. Throughput: 0: 13153.3. Samples: 6723878. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:18:36,238][81074] Avg episode reward: [(0, '3712.454')] [2023-03-06 23:18:36,241][81349] Saving new best policy, reward=3712.454! 
[2023-03-06 23:18:36,726][81400] Updated weights for policy 0, policy_version 6600 (0.0006) [2023-03-06 23:18:37,509][81400] Updated weights for policy 0, policy_version 6610 (0.0007) [2023-03-06 23:18:38,274][81400] Updated weights for policy 0, policy_version 6620 (0.0006) [2023-03-06 23:18:39,045][81400] Updated weights for policy 0, policy_version 6630 (0.0006) [2023-03-06 23:18:39,805][81400] Updated weights for policy 0, policy_version 6640 (0.0006) [2023-03-06 23:18:40,587][81400] Updated weights for policy 0, policy_version 6650 (0.0006) [2023-03-06 23:18:41,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13190.5). Total num frames: 6817792. Throughput: 0: 13154.2. Samples: 6803036. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:18:41,237][81074] Avg episode reward: [(0, '3670.332')] [2023-03-06 23:18:41,379][81400] Updated weights for policy 0, policy_version 6660 (0.0007) [2023-03-06 23:18:42,157][81400] Updated weights for policy 0, policy_version 6670 (0.0006) [2023-03-06 23:18:42,941][81400] Updated weights for policy 0, policy_version 6680 (0.0006) [2023-03-06 23:18:43,728][81400] Updated weights for policy 0, policy_version 6690 (0.0005) [2023-03-06 23:18:44,499][81400] Updated weights for policy 0, policy_version 6700 (0.0005) [2023-03-06 23:18:45,275][81400] Updated weights for policy 0, policy_version 6710 (0.0006) [2023-03-06 23:18:46,033][81400] Updated weights for policy 0, policy_version 6720 (0.0006) [2023-03-06 23:18:46,236][81074] Fps is (10 sec: 13108.7, 60 sec: 13158.4, 300 sec: 13187.0). Total num frames: 6883328. Throughput: 0: 13171.3. Samples: 6882265. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:18:46,237][81074] Avg episode reward: [(0, '3297.366')] [2023-03-06 23:18:46,814][81400] Updated weights for policy 0, policy_version 6730 (0.0006) [2023-03-06 23:18:47,597][81400] Updated weights for policy 0, policy_version 6740 (0.0007) [2023-03-06 23:18:48,382][81400] Updated weights for policy 0, policy_version 6750 (0.0007) [2023-03-06 23:18:49,164][81400] Updated weights for policy 0, policy_version 6760 (0.0006) [2023-03-06 23:18:49,946][81400] Updated weights for policy 0, policy_version 6770 (0.0007) [2023-03-06 23:18:50,729][81400] Updated weights for policy 0, policy_version 6780 (0.0007) [2023-03-06 23:18:51,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13187.0). Total num frames: 6948864. Throughput: 0: 13170.4. Samples: 6921478. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:18:51,237][81074] Avg episode reward: [(0, '3437.698')] [2023-03-06 23:18:51,513][81400] Updated weights for policy 0, policy_version 6790 (0.0005) [2023-03-06 23:18:52,290][81400] Updated weights for policy 0, policy_version 6800 (0.0006) [2023-03-06 23:18:53,070][81400] Updated weights for policy 0, policy_version 6810 (0.0006) [2023-03-06 23:18:53,862][81400] Updated weights for policy 0, policy_version 6820 (0.0006) [2023-03-06 23:18:54,632][81400] Updated weights for policy 0, policy_version 6830 (0.0007) [2023-03-06 23:18:55,410][81400] Updated weights for policy 0, policy_version 6840 (0.0006) [2023-03-06 23:18:56,190][81400] Updated weights for policy 0, policy_version 6850 (0.0006) [2023-03-06 23:18:56,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13183.6). Total num frames: 7014400. Throughput: 0: 13163.6. Samples: 7000130. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-06 23:18:56,237][81074] Avg episode reward: [(0, '3441.949')] [2023-03-06 23:18:56,955][81400] Updated weights for policy 0, policy_version 6860 (0.0006) [2023-03-06 23:18:57,729][81400] Updated weights for policy 0, policy_version 6870 (0.0006) [2023-03-06 23:18:58,512][81400] Updated weights for policy 0, policy_version 6880 (0.0006) [2023-03-06 23:18:59,297][81400] Updated weights for policy 0, policy_version 6890 (0.0006) [2023-03-06 23:19:00,080][81400] Updated weights for policy 0, policy_version 6900 (0.0007) [2023-03-06 23:19:00,843][81400] Updated weights for policy 0, policy_version 6910 (0.0007) [2023-03-06 23:19:01,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13183.6). Total num frames: 7080960. Throughput: 0: 13155.2. Samples: 7079127. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:19:01,237][81074] Avg episode reward: [(0, '3496.929')] [2023-03-06 23:19:01,636][81400] Updated weights for policy 0, policy_version 6920 (0.0007) [2023-03-06 23:19:02,403][81400] Updated weights for policy 0, policy_version 6930 (0.0007) [2023-03-06 23:19:03,189][81400] Updated weights for policy 0, policy_version 6940 (0.0006) [2023-03-06 23:19:03,957][81400] Updated weights for policy 0, policy_version 6950 (0.0006) [2023-03-06 23:19:04,729][81400] Updated weights for policy 0, policy_version 6960 (0.0007) [2023-03-06 23:19:05,490][81400] Updated weights for policy 0, policy_version 6970 (0.0006) [2023-03-06 23:19:06,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13180.1). Total num frames: 7146496. Throughput: 0: 13150.0. Samples: 7118718. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:19:06,237][81074] Avg episode reward: [(0, '3426.219')] [2023-03-06 23:19:06,275][81400] Updated weights for policy 0, policy_version 6980 (0.0006) [2023-03-06 23:19:07,049][81400] Updated weights for policy 0, policy_version 6990 (0.0006) [2023-03-06 23:19:07,818][81400] Updated weights for policy 0, policy_version 7000 (0.0006) [2023-03-06 23:19:08,590][81400] Updated weights for policy 0, policy_version 7010 (0.0006) [2023-03-06 23:19:09,375][81400] Updated weights for policy 0, policy_version 7020 (0.0007) [2023-03-06 23:19:10,157][81400] Updated weights for policy 0, policy_version 7030 (0.0005) [2023-03-06 23:19:10,938][81400] Updated weights for policy 0, policy_version 7040 (0.0006) [2023-03-06 23:19:11,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13176.6). Total num frames: 7212032. Throughput: 0: 13152.0. Samples: 7197914. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:19:11,237][81074] Avg episode reward: [(0, '3333.585')] [2023-03-06 23:19:11,721][81400] Updated weights for policy 0, policy_version 7050 (0.0007) [2023-03-06 23:19:12,496][81400] Updated weights for policy 0, policy_version 7060 (0.0006) [2023-03-06 23:19:13,270][81400] Updated weights for policy 0, policy_version 7070 (0.0006) [2023-03-06 23:19:14,064][81400] Updated weights for policy 0, policy_version 7080 (0.0006) [2023-03-06 23:19:14,836][81400] Updated weights for policy 0, policy_version 7090 (0.0006) [2023-03-06 23:19:15,618][81400] Updated weights for policy 0, policy_version 7100 (0.0006) [2023-03-06 23:19:16,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13173.2). Total num frames: 7277568. Throughput: 0: 13159.0. Samples: 7276662. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:19:16,237][81074] Avg episode reward: [(0, '3244.715')] [2023-03-06 23:19:16,416][81400] Updated weights for policy 0, policy_version 7110 (0.0005) [2023-03-06 23:19:17,172][81400] Updated weights for policy 0, policy_version 7120 (0.0006) [2023-03-06 23:19:17,952][81400] Updated weights for policy 0, policy_version 7130 (0.0006) [2023-03-06 23:19:18,738][81400] Updated weights for policy 0, policy_version 7140 (0.0006) [2023-03-06 23:19:19,497][81400] Updated weights for policy 0, policy_version 7150 (0.0006) [2023-03-06 23:19:20,273][81400] Updated weights for policy 0, policy_version 7160 (0.0006) [2023-03-06 23:19:21,065][81400] Updated weights for policy 0, policy_version 7170 (0.0006) [2023-03-06 23:19:21,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13173.1). Total num frames: 7344128. Throughput: 0: 13166.3. Samples: 7316346. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-06 23:19:21,237][81074] Avg episode reward: [(0, '3147.750')] [2023-03-06 23:19:21,851][81400] Updated weights for policy 0, policy_version 7180 (0.0007) [2023-03-06 23:19:22,621][81400] Updated weights for policy 0, policy_version 7190 (0.0005) [2023-03-06 23:19:23,406][81400] Updated weights for policy 0, policy_version 7200 (0.0006) [2023-03-06 23:19:24,187][81400] Updated weights for policy 0, policy_version 7210 (0.0006) [2023-03-06 23:19:24,961][81400] Updated weights for policy 0, policy_version 7220 (0.0006) [2023-03-06 23:19:25,735][81400] Updated weights for policy 0, policy_version 7230 (0.0006) [2023-03-06 23:19:26,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13169.7). Total num frames: 7409664. Throughput: 0: 13157.9. Samples: 7395142. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:19:26,237][81074] Avg episode reward: [(0, '3449.170')] [2023-03-06 23:19:26,511][81400] Updated weights for policy 0, policy_version 7240 (0.0005) [2023-03-06 23:19:27,279][81400] Updated weights for policy 0, policy_version 7250 (0.0006) [2023-03-06 23:19:28,070][81400] Updated weights for policy 0, policy_version 7260 (0.0006) [2023-03-06 23:19:28,864][81400] Updated weights for policy 0, policy_version 7270 (0.0006) [2023-03-06 23:19:29,639][81400] Updated weights for policy 0, policy_version 7280 (0.0008) [2023-03-06 23:19:30,417][81400] Updated weights for policy 0, policy_version 7290 (0.0006) [2023-03-06 23:19:31,172][81400] Updated weights for policy 0, policy_version 7300 (0.0006) [2023-03-06 23:19:31,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13162.7). Total num frames: 7475200. Throughput: 0: 13157.1. Samples: 7474333. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:19:31,237][81074] Avg episode reward: [(0, '3529.259')] [2023-03-06 23:19:31,956][81400] Updated weights for policy 0, policy_version 7310 (0.0006) [2023-03-06 23:19:32,734][81400] Updated weights for policy 0, policy_version 7320 (0.0006) [2023-03-06 23:19:33,501][81400] Updated weights for policy 0, policy_version 7330 (0.0005) [2023-03-06 23:19:34,295][81400] Updated weights for policy 0, policy_version 7340 (0.0006) [2023-03-06 23:19:35,073][81400] Updated weights for policy 0, policy_version 7350 (0.0006) [2023-03-06 23:19:35,836][81400] Updated weights for policy 0, policy_version 7360 (0.0006) [2023-03-06 23:19:36,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13158.7, 300 sec: 13162.7). Total num frames: 7541760. Throughput: 0: 13163.7. Samples: 7513845. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:19:36,237][81074] Avg episode reward: [(0, '3291.114')] [2023-03-06 23:19:36,630][81400] Updated weights for policy 0, policy_version 7370 (0.0006) [2023-03-06 23:19:37,407][81400] Updated weights for policy 0, policy_version 7380 (0.0006) [2023-03-06 23:19:38,187][81400] Updated weights for policy 0, policy_version 7390 (0.0005) [2023-03-06 23:19:38,950][81400] Updated weights for policy 0, policy_version 7400 (0.0007) [2023-03-06 23:19:39,746][81400] Updated weights for policy 0, policy_version 7410 (0.0007) [2023-03-06 23:19:40,538][81400] Updated weights for policy 0, policy_version 7420 (0.0005) [2023-03-06 23:19:41,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13162.7). Total num frames: 7607296. Throughput: 0: 13165.5. Samples: 7592577. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:19:41,237][81074] Avg episode reward: [(0, '3258.106')] [2023-03-06 23:19:41,304][81400] Updated weights for policy 0, policy_version 7430 (0.0006) [2023-03-06 23:19:42,076][81400] Updated weights for policy 0, policy_version 7440 (0.0006) [2023-03-06 23:19:42,860][81400] Updated weights for policy 0, policy_version 7450 (0.0006) [2023-03-06 23:19:43,637][81400] Updated weights for policy 0, policy_version 7460 (0.0005) [2023-03-06 23:19:44,407][81400] Updated weights for policy 0, policy_version 7470 (0.0006) [2023-03-06 23:19:45,193][81400] Updated weights for policy 0, policy_version 7480 (0.0006) [2023-03-06 23:19:45,940][81400] Updated weights for policy 0, policy_version 7490 (0.0006) [2023-03-06 23:19:46,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13159.3). Total num frames: 7672832. Throughput: 0: 13169.1. Samples: 7671737. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:19:46,237][81074] Avg episode reward: [(0, '3451.598')] [2023-03-06 23:19:46,740][81400] Updated weights for policy 0, policy_version 7500 (0.0007) [2023-03-06 23:19:47,531][81400] Updated weights for policy 0, policy_version 7510 (0.0007) [2023-03-06 23:19:48,306][81400] Updated weights for policy 0, policy_version 7520 (0.0006) [2023-03-06 23:19:49,084][81400] Updated weights for policy 0, policy_version 7530 (0.0007) [2023-03-06 23:19:49,872][81400] Updated weights for policy 0, policy_version 7540 (0.0007) [2023-03-06 23:19:50,658][81400] Updated weights for policy 0, policy_version 7550 (0.0006) [2023-03-06 23:19:51,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13159.3). Total num frames: 7738368. Throughput: 0: 13157.2. Samples: 7710789. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:19:51,237][81074] Avg episode reward: [(0, '3565.770')] [2023-03-06 23:19:51,443][81400] Updated weights for policy 0, policy_version 7560 (0.0006) [2023-03-06 23:19:52,221][81400] Updated weights for policy 0, policy_version 7570 (0.0006) [2023-03-06 23:19:52,985][81400] Updated weights for policy 0, policy_version 7580 (0.0007) [2023-03-06 23:19:53,787][81400] Updated weights for policy 0, policy_version 7590 (0.0007) [2023-03-06 23:19:54,573][81400] Updated weights for policy 0, policy_version 7600 (0.0005) [2023-03-06 23:19:55,352][81400] Updated weights for policy 0, policy_version 7610 (0.0006) [2023-03-06 23:19:56,119][81400] Updated weights for policy 0, policy_version 7620 (0.0006) [2023-03-06 23:19:56,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13155.8). Total num frames: 7803904. Throughput: 0: 13145.7. Samples: 7789471. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:19:56,237][81074] Avg episode reward: [(0, '3635.285')] [2023-03-06 23:19:56,241][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000007621_7803904.pth... [2023-03-06 23:19:56,271][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000004538_4646912.pth [2023-03-06 23:19:56,921][81400] Updated weights for policy 0, policy_version 7630 (0.0006) [2023-03-06 23:19:57,697][81400] Updated weights for policy 0, policy_version 7640 (0.0006) [2023-03-06 23:19:58,453][81400] Updated weights for policy 0, policy_version 7650 (0.0006) [2023-03-06 23:19:59,252][81400] Updated weights for policy 0, policy_version 7660 (0.0006) [2023-03-06 23:20:00,010][81400] Updated weights for policy 0, policy_version 7670 (0.0006) [2023-03-06 23:20:00,802][81400] Updated weights for policy 0, policy_version 7680 (0.0006) [2023-03-06 23:20:01,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13155.8). Total num frames: 7869440. Throughput: 0: 13149.7. Samples: 7868394. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:20:01,247][81074] Avg episode reward: [(0, '3629.639')] [2023-03-06 23:20:01,586][81400] Updated weights for policy 0, policy_version 7690 (0.0007) [2023-03-06 23:20:02,365][81400] Updated weights for policy 0, policy_version 7700 (0.0006) [2023-03-06 23:20:03,146][81400] Updated weights for policy 0, policy_version 7710 (0.0006) [2023-03-06 23:20:03,935][81400] Updated weights for policy 0, policy_version 7720 (0.0006) [2023-03-06 23:20:04,711][81400] Updated weights for policy 0, policy_version 7730 (0.0006) [2023-03-06 23:20:05,489][81400] Updated weights for policy 0, policy_version 7740 (0.0006) [2023-03-06 23:20:06,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13155.8). Total num frames: 7934976. Throughput: 0: 13138.6. Samples: 7907584. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:20:06,247][81074] Avg episode reward: [(0, '3477.770')] [2023-03-06 23:20:06,260][81400] Updated weights for policy 0, policy_version 7750 (0.0006) [2023-03-06 23:20:07,049][81400] Updated weights for policy 0, policy_version 7760 (0.0007) [2023-03-06 23:20:07,822][81400] Updated weights for policy 0, policy_version 7770 (0.0006) [2023-03-06 23:20:08,615][81400] Updated weights for policy 0, policy_version 7780 (0.0006) [2023-03-06 23:20:09,394][81400] Updated weights for policy 0, policy_version 7790 (0.0006) [2023-03-06 23:20:10,186][81400] Updated weights for policy 0, policy_version 7800 (0.0007) [2023-03-06 23:20:10,974][81400] Updated weights for policy 0, policy_version 7810 (0.0006) [2023-03-06 23:20:11,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13155.8). Total num frames: 8000512. Throughput: 0: 13133.1. Samples: 7986130. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:20:11,237][81074] Avg episode reward: [(0, '3590.308')] [2023-03-06 23:20:11,765][81400] Updated weights for policy 0, policy_version 7820 (0.0006) [2023-03-06 23:20:12,550][81400] Updated weights for policy 0, policy_version 7830 (0.0006) [2023-03-06 23:20:13,335][81400] Updated weights for policy 0, policy_version 7840 (0.0006) [2023-03-06 23:20:14,122][81400] Updated weights for policy 0, policy_version 7850 (0.0006) [2023-03-06 23:20:14,914][81400] Updated weights for policy 0, policy_version 7860 (0.0006) [2023-03-06 23:20:15,685][81400] Updated weights for policy 0, policy_version 7870 (0.0006) [2023-03-06 23:20:16,236][81074] Fps is (10 sec: 13004.8, 60 sec: 13124.3, 300 sec: 13148.9). Total num frames: 8065024. Throughput: 0: 13109.8. Samples: 8064276. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:20:16,237][81074] Avg episode reward: [(0, '3580.981')] [2023-03-06 23:20:16,463][81400] Updated weights for policy 0, policy_version 7880 (0.0006) [2023-03-06 23:20:17,264][81400] Updated weights for policy 0, policy_version 7890 (0.0007) [2023-03-06 23:20:18,037][81400] Updated weights for policy 0, policy_version 7900 (0.0006) [2023-03-06 23:20:18,805][81400] Updated weights for policy 0, policy_version 7910 (0.0006) [2023-03-06 23:20:19,607][81400] Updated weights for policy 0, policy_version 7920 (0.0006) [2023-03-06 23:20:20,381][81400] Updated weights for policy 0, policy_version 7930 (0.0006) [2023-03-06 23:20:21,173][81400] Updated weights for policy 0, policy_version 7940 (0.0006) [2023-03-06 23:20:21,236][81074] Fps is (10 sec: 13004.7, 60 sec: 13107.2, 300 sec: 13148.9). Total num frames: 8130560. Throughput: 0: 13107.0. Samples: 8103658. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:20:21,237][81074] Avg episode reward: [(0, '3581.402')] [2023-03-06 23:20:21,953][81400] Updated weights for policy 0, policy_version 7950 (0.0006) [2023-03-06 23:20:22,731][81400] Updated weights for policy 0, policy_version 7960 (0.0006) [2023-03-06 23:20:23,493][81400] Updated weights for policy 0, policy_version 7970 (0.0007) [2023-03-06 23:20:24,272][81400] Updated weights for policy 0, policy_version 7980 (0.0007) [2023-03-06 23:20:25,034][81400] Updated weights for policy 0, policy_version 7990 (0.0006) [2023-03-06 23:20:25,816][81400] Updated weights for policy 0, policy_version 8000 (0.0006) [2023-03-06 23:20:26,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13124.3, 300 sec: 13152.3). Total num frames: 8197120. Throughput: 0: 13111.8. Samples: 8182609. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:20:26,237][81074] Avg episode reward: [(0, '3634.147')] [2023-03-06 23:20:26,610][81400] Updated weights for policy 0, policy_version 8010 (0.0006) [2023-03-06 23:20:27,402][81400] Updated weights for policy 0, policy_version 8020 (0.0006) [2023-03-06 23:20:28,177][81400] Updated weights for policy 0, policy_version 8030 (0.0006) [2023-03-06 23:20:28,991][81400] Updated weights for policy 0, policy_version 8040 (0.0006) [2023-03-06 23:20:29,763][81400] Updated weights for policy 0, policy_version 8050 (0.0006) [2023-03-06 23:20:30,530][81400] Updated weights for policy 0, policy_version 8060 (0.0006) [2023-03-06 23:20:31,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13124.3, 300 sec: 13148.9). Total num frames: 8262656. Throughput: 0: 13094.5. Samples: 8260989. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:20:31,237][81074] Avg episode reward: [(0, '3556.405')] [2023-03-06 23:20:31,311][81400] Updated weights for policy 0, policy_version 8070 (0.0006) [2023-03-06 23:20:32,092][81400] Updated weights for policy 0, policy_version 8080 (0.0006) [2023-03-06 23:20:32,879][81400] Updated weights for policy 0, policy_version 8090 (0.0006) [2023-03-06 23:20:33,658][81400] Updated weights for policy 0, policy_version 8100 (0.0006) [2023-03-06 23:20:34,429][81400] Updated weights for policy 0, policy_version 8110 (0.0006) [2023-03-06 23:20:35,203][81400] Updated weights for policy 0, policy_version 8120 (0.0006) [2023-03-06 23:20:35,977][81400] Updated weights for policy 0, policy_version 8130 (0.0006) [2023-03-06 23:20:36,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13148.9). Total num frames: 8328192. Throughput: 0: 13104.3. Samples: 8300481. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:20:36,237][81074] Avg episode reward: [(0, '3500.597')] [2023-03-06 23:20:36,750][81400] Updated weights for policy 0, policy_version 8140 (0.0005) [2023-03-06 23:20:37,530][81400] Updated weights for policy 0, policy_version 8150 (0.0006) [2023-03-06 23:20:38,311][81400] Updated weights for policy 0, policy_version 8160 (0.0006) [2023-03-06 23:20:39,089][81400] Updated weights for policy 0, policy_version 8170 (0.0006) [2023-03-06 23:20:39,869][81400] Updated weights for policy 0, policy_version 8180 (0.0006) [2023-03-06 23:20:40,658][81400] Updated weights for policy 0, policy_version 8190 (0.0006) [2023-03-06 23:20:41,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13148.9). Total num frames: 8393728. Throughput: 0: 13109.1. Samples: 8379381. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:20:41,237][81074] Avg episode reward: [(0, '3661.270')] [2023-03-06 23:20:41,425][81400] Updated weights for policy 0, policy_version 8200 (0.0007) [2023-03-06 23:20:42,206][81400] Updated weights for policy 0, policy_version 8210 (0.0006) [2023-03-06 23:20:42,976][81400] Updated weights for policy 0, policy_version 8220 (0.0005) [2023-03-06 23:20:43,746][81400] Updated weights for policy 0, policy_version 8230 (0.0006) [2023-03-06 23:20:44,525][81400] Updated weights for policy 0, policy_version 8240 (0.0006) [2023-03-06 23:20:45,289][81400] Updated weights for policy 0, policy_version 8250 (0.0005) [2023-03-06 23:20:46,075][81400] Updated weights for policy 0, policy_version 8260 (0.0006) [2023-03-06 23:20:46,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13124.3, 300 sec: 13152.3). Total num frames: 8460288. Throughput: 0: 13117.4. Samples: 8458680. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:20:46,237][81074] Avg episode reward: [(0, '3539.629')] [2023-03-06 23:20:46,853][81400] Updated weights for policy 0, policy_version 8270 (0.0006) [2023-03-06 23:20:47,615][81400] Updated weights for policy 0, policy_version 8280 (0.0006) [2023-03-06 23:20:48,395][81400] Updated weights for policy 0, policy_version 8290 (0.0006) [2023-03-06 23:20:49,178][81400] Updated weights for policy 0, policy_version 8300 (0.0005) [2023-03-06 23:20:49,948][81400] Updated weights for policy 0, policy_version 8310 (0.0006) [2023-03-06 23:20:50,733][81400] Updated weights for policy 0, policy_version 8320 (0.0006) [2023-03-06 23:20:51,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13124.3, 300 sec: 13148.9). Total num frames: 8525824. Throughput: 0: 13128.6. Samples: 8498370. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:20:51,237][81074] Avg episode reward: [(0, '3697.961')] [2023-03-06 23:20:51,500][81400] Updated weights for policy 0, policy_version 8330 (0.0006) [2023-03-06 23:20:52,270][81400] Updated weights for policy 0, policy_version 8340 (0.0006) [2023-03-06 23:20:53,065][81400] Updated weights for policy 0, policy_version 8350 (0.0007) [2023-03-06 23:20:53,843][81400] Updated weights for policy 0, policy_version 8360 (0.0006) [2023-03-06 23:20:54,609][81400] Updated weights for policy 0, policy_version 8370 (0.0006) [2023-03-06 23:20:55,388][81400] Updated weights for policy 0, policy_version 8380 (0.0006) [2023-03-06 23:20:56,179][81400] Updated weights for policy 0, policy_version 8390 (0.0006) [2023-03-06 23:20:56,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13148.9). Total num frames: 8591360. Throughput: 0: 13138.2. Samples: 8577350. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:20:56,237][81074] Avg episode reward: [(0, '3661.024')] [2023-03-06 23:20:56,953][81400] Updated weights for policy 0, policy_version 8400 (0.0006) [2023-03-06 23:20:57,716][81400] Updated weights for policy 0, policy_version 8410 (0.0006) [2023-03-06 23:20:58,491][81400] Updated weights for policy 0, policy_version 8420 (0.0006) [2023-03-06 23:20:59,265][81400] Updated weights for policy 0, policy_version 8430 (0.0006) [2023-03-06 23:21:00,043][81400] Updated weights for policy 0, policy_version 8440 (0.0006) [2023-03-06 23:21:00,815][81400] Updated weights for policy 0, policy_version 8450 (0.0006) [2023-03-06 23:21:01,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13141.3, 300 sec: 13152.3). Total num frames: 8657920. Throughput: 0: 13162.3. Samples: 8656580. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:21:01,237][81074] Avg episode reward: [(0, '3538.502')] [2023-03-06 23:21:01,587][81400] Updated weights for policy 0, policy_version 8460 (0.0006) [2023-03-06 23:21:02,369][81400] Updated weights for policy 0, policy_version 8470 (0.0006) [2023-03-06 23:21:03,146][81400] Updated weights for policy 0, policy_version 8480 (0.0006) [2023-03-06 23:21:03,922][81400] Updated weights for policy 0, policy_version 8490 (0.0007) [2023-03-06 23:21:04,695][81400] Updated weights for policy 0, policy_version 8500 (0.0006) [2023-03-06 23:21:05,481][81400] Updated weights for policy 0, policy_version 8510 (0.0006) [2023-03-06 23:21:06,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13141.4, 300 sec: 13152.3). Total num frames: 8723456. Throughput: 0: 13165.0. Samples: 8696080. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-06 23:21:06,237][81074] Avg episode reward: [(0, '3578.368')] [2023-03-06 23:21:06,266][81400] Updated weights for policy 0, policy_version 8520 (0.0006) [2023-03-06 23:21:07,037][81400] Updated weights for policy 0, policy_version 8530 (0.0006) [2023-03-06 23:21:07,807][81400] Updated weights for policy 0, policy_version 8540 (0.0006) [2023-03-06 23:21:08,582][81400] Updated weights for policy 0, policy_version 8550 (0.0005) [2023-03-06 23:21:09,373][81400] Updated weights for policy 0, policy_version 8560 (0.0005) [2023-03-06 23:21:10,162][81400] Updated weights for policy 0, policy_version 8570 (0.0006) [2023-03-06 23:21:10,937][81400] Updated weights for policy 0, policy_version 8580 (0.0006) [2023-03-06 23:21:11,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13152.3). Total num frames: 8788992. Throughput: 0: 13166.1. Samples: 8775082. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:21:11,237][81074] Avg episode reward: [(0, '3555.860')] [2023-03-06 23:21:11,704][81400] Updated weights for policy 0, policy_version 8590 (0.0007) [2023-03-06 23:21:12,498][81400] Updated weights for policy 0, policy_version 8600 (0.0006) [2023-03-06 23:21:13,255][81400] Updated weights for policy 0, policy_version 8610 (0.0006) [2023-03-06 23:21:14,037][81400] Updated weights for policy 0, policy_version 8620 (0.0006) [2023-03-06 23:21:14,841][81400] Updated weights for policy 0, policy_version 8630 (0.0006) [2023-03-06 23:21:15,617][81400] Updated weights for policy 0, policy_version 8640 (0.0006) [2023-03-06 23:21:16,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13155.8). Total num frames: 8855552. Throughput: 0: 13174.0. Samples: 8853820. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:21:16,237][81074] Avg episode reward: [(0, '3535.334')] [2023-03-06 23:21:16,395][81400] Updated weights for policy 0, policy_version 8650 (0.0006) [2023-03-06 23:21:17,181][81400] Updated weights for policy 0, policy_version 8660 (0.0005) [2023-03-06 23:21:17,961][81400] Updated weights for policy 0, policy_version 8670 (0.0006) [2023-03-06 23:21:18,732][81400] Updated weights for policy 0, policy_version 8680 (0.0006) [2023-03-06 23:21:19,510][81400] Updated weights for policy 0, policy_version 8690 (0.0006) [2023-03-06 23:21:20,278][81400] Updated weights for policy 0, policy_version 8700 (0.0006) [2023-03-06 23:21:21,070][81400] Updated weights for policy 0, policy_version 8710 (0.0006) [2023-03-06 23:21:21,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13152.3). Total num frames: 8921088. Throughput: 0: 13175.3. Samples: 8893368. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:21:21,237][81074] Avg episode reward: [(0, '3594.163')] [2023-03-06 23:21:21,841][81400] Updated weights for policy 0, policy_version 8720 (0.0007) [2023-03-06 23:21:22,625][81400] Updated weights for policy 0, policy_version 8730 (0.0006) [2023-03-06 23:21:23,410][81400] Updated weights for policy 0, policy_version 8740 (0.0007) [2023-03-06 23:21:24,181][81400] Updated weights for policy 0, policy_version 8750 (0.0006) [2023-03-06 23:21:24,969][81400] Updated weights for policy 0, policy_version 8760 (0.0006) [2023-03-06 23:21:25,749][81400] Updated weights for policy 0, policy_version 8770 (0.0007) [2023-03-06 23:21:26,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 8986624. Throughput: 0: 13168.1. Samples: 8971947. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:21:26,237][81074] Avg episode reward: [(0, '3466.662')] [2023-03-06 23:21:26,532][81400] Updated weights for policy 0, policy_version 8780 (0.0006) [2023-03-06 23:21:27,298][81400] Updated weights for policy 0, policy_version 8790 (0.0006) [2023-03-06 23:21:28,093][81400] Updated weights for policy 0, policy_version 8800 (0.0006) [2023-03-06 23:21:28,875][81400] Updated weights for policy 0, policy_version 8810 (0.0005) [2023-03-06 23:21:29,653][81400] Updated weights for policy 0, policy_version 8820 (0.0006) [2023-03-06 23:21:30,418][81400] Updated weights for policy 0, policy_version 8830 (0.0006) [2023-03-06 23:21:31,209][81400] Updated weights for policy 0, policy_version 8840 (0.0006) [2023-03-06 23:21:31,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 9052160. Throughput: 0: 13156.0. Samples: 9050700. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:21:31,237][81074] Avg episode reward: [(0, '3591.683')] [2023-03-06 23:21:31,999][81400] Updated weights for policy 0, policy_version 8850 (0.0006) [2023-03-06 23:21:32,801][81400] Updated weights for policy 0, policy_version 8860 (0.0006) [2023-03-06 23:21:33,582][81400] Updated weights for policy 0, policy_version 8870 (0.0007) [2023-03-06 23:21:34,372][81400] Updated weights for policy 0, policy_version 8880 (0.0007) [2023-03-06 23:21:35,158][81400] Updated weights for policy 0, policy_version 8890 (0.0006) [2023-03-06 23:21:35,937][81400] Updated weights for policy 0, policy_version 8900 (0.0007) [2023-03-06 23:21:36,236][81074] Fps is (10 sec: 13004.6, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 9116672. Throughput: 0: 13141.1. Samples: 9089720. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:21:36,237][81074] Avg episode reward: [(0, '3677.660')] [2023-03-06 23:21:36,729][81400] Updated weights for policy 0, policy_version 8910 (0.0006) [2023-03-06 23:21:37,491][81400] Updated weights for policy 0, policy_version 8920 (0.0007) [2023-03-06 23:21:38,277][81400] Updated weights for policy 0, policy_version 8930 (0.0006) [2023-03-06 23:21:39,057][81400] Updated weights for policy 0, policy_version 8940 (0.0006) [2023-03-06 23:21:39,832][81400] Updated weights for policy 0, policy_version 8950 (0.0007) [2023-03-06 23:21:40,624][81400] Updated weights for policy 0, policy_version 8960 (0.0007) [2023-03-06 23:21:41,236][81074] Fps is (10 sec: 13004.7, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 9182208. Throughput: 0: 13130.2. Samples: 9168208. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:21:41,237][81074] Avg episode reward: [(0, '3785.074')] [2023-03-06 23:21:41,249][81349] Saving new best policy, reward=3785.074! [2023-03-06 23:21:41,408][81400] Updated weights for policy 0, policy_version 8970 (0.0006) [2023-03-06 23:21:42,178][81400] Updated weights for policy 0, policy_version 8980 (0.0006) [2023-03-06 23:21:42,953][81400] Updated weights for policy 0, policy_version 8990 (0.0006) [2023-03-06 23:21:43,733][81400] Updated weights for policy 0, policy_version 9000 (0.0006) [2023-03-06 23:21:44,506][81400] Updated weights for policy 0, policy_version 9010 (0.0007) [2023-03-06 23:21:45,268][81400] Updated weights for policy 0, policy_version 9020 (0.0007) [2023-03-06 23:21:46,050][81400] Updated weights for policy 0, policy_version 9030 (0.0006) [2023-03-06 23:21:46,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13141.3, 300 sec: 13148.8). Total num frames: 9248768. Throughput: 0: 13129.2. Samples: 9247398. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:21:46,237][81074] Avg episode reward: [(0, '3533.461')] [2023-03-06 23:21:46,823][81400] Updated weights for policy 0, policy_version 9040 (0.0006) [2023-03-06 23:21:47,618][81400] Updated weights for policy 0, policy_version 9050 (0.0005) [2023-03-06 23:21:48,377][81400] Updated weights for policy 0, policy_version 9060 (0.0006) [2023-03-06 23:21:49,164][81400] Updated weights for policy 0, policy_version 9070 (0.0007) [2023-03-06 23:21:49,959][81400] Updated weights for policy 0, policy_version 9080 (0.0006) [2023-03-06 23:21:50,735][81400] Updated weights for policy 0, policy_version 9090 (0.0007) [2023-03-06 23:21:51,236][81074] Fps is (10 sec: 13209.8, 60 sec: 13141.4, 300 sec: 13148.9). Total num frames: 9314304. Throughput: 0: 13128.4. Samples: 9286856. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:21:51,237][81074] Avg episode reward: [(0, '3490.873')] [2023-03-06 23:21:51,515][81400] Updated weights for policy 0, policy_version 9100 (0.0006) [2023-03-06 23:21:52,296][81400] Updated weights for policy 0, policy_version 9110 (0.0007) [2023-03-06 23:21:53,061][81400] Updated weights for policy 0, policy_version 9120 (0.0006) [2023-03-06 23:21:53,842][81400] Updated weights for policy 0, policy_version 9130 (0.0007) [2023-03-06 23:21:54,629][81400] Updated weights for policy 0, policy_version 9140 (0.0006) [2023-03-06 23:21:55,410][81400] Updated weights for policy 0, policy_version 9150 (0.0007) [2023-03-06 23:21:56,179][81400] Updated weights for policy 0, policy_version 9160 (0.0006) [2023-03-06 23:21:56,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 9379840. Throughput: 0: 13123.8. Samples: 9365657. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:21:56,237][81074] Avg episode reward: [(0, '3497.113')] [2023-03-06 23:21:56,241][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000009160_9379840.pth... [2023-03-06 23:21:56,273][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000006080_6225920.pth [2023-03-06 23:21:56,961][81400] Updated weights for policy 0, policy_version 9170 (0.0006) [2023-03-06 23:21:57,761][81400] Updated weights for policy 0, policy_version 9180 (0.0006) [2023-03-06 23:21:58,540][81400] Updated weights for policy 0, policy_version 9190 (0.0007) [2023-03-06 23:21:59,321][81400] Updated weights for policy 0, policy_version 9200 (0.0006) [2023-03-06 23:22:00,110][81400] Updated weights for policy 0, policy_version 9210 (0.0006) [2023-03-06 23:22:00,867][81400] Updated weights for policy 0, policy_version 9220 (0.0007) [2023-03-06 23:22:01,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13124.3, 300 sec: 13145.4). Total num frames: 9445376. Throughput: 0: 13122.9. Samples: 9444352. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:22:01,237][81074] Avg episode reward: [(0, '3447.929')] [2023-03-06 23:22:01,642][81400] Updated weights for policy 0, policy_version 9230 (0.0006) [2023-03-06 23:22:02,421][81400] Updated weights for policy 0, policy_version 9240 (0.0006) [2023-03-06 23:22:03,197][81400] Updated weights for policy 0, policy_version 9250 (0.0007) [2023-03-06 23:22:03,978][81400] Updated weights for policy 0, policy_version 9260 (0.0007) [2023-03-06 23:22:04,749][81400] Updated weights for policy 0, policy_version 9270 (0.0006) [2023-03-06 23:22:05,522][81400] Updated weights for policy 0, policy_version 9280 (0.0006) [2023-03-06 23:22:06,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 9511936. Throughput: 0: 13124.3. Samples: 9483965. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:22:06,237][81074] Avg episode reward: [(0, '3140.912')] [2023-03-06 23:22:06,287][81400] Updated weights for policy 0, policy_version 9290 (0.0007) [2023-03-06 23:22:07,077][81400] Updated weights for policy 0, policy_version 9300 (0.0007) [2023-03-06 23:22:07,873][81400] Updated weights for policy 0, policy_version 9310 (0.0007) [2023-03-06 23:22:08,644][81400] Updated weights for policy 0, policy_version 9320 (0.0006) [2023-03-06 23:22:09,424][81400] Updated weights for policy 0, policy_version 9330 (0.0006) [2023-03-06 23:22:10,195][81400] Updated weights for policy 0, policy_version 9340 (0.0006) [2023-03-06 23:22:10,955][81400] Updated weights for policy 0, policy_version 9350 (0.0006) [2023-03-06 23:22:11,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 9577472. Throughput: 0: 13137.5. Samples: 9563133. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:22:11,237][81074] Avg episode reward: [(0, '3570.303')] [2023-03-06 23:22:11,754][81400] Updated weights for policy 0, policy_version 9360 (0.0007) [2023-03-06 23:22:12,509][81400] Updated weights for policy 0, policy_version 9370 (0.0006) [2023-03-06 23:22:13,289][81400] Updated weights for policy 0, policy_version 9380 (0.0006) [2023-03-06 23:22:14,062][81400] Updated weights for policy 0, policy_version 9390 (0.0006) [2023-03-06 23:22:14,843][81400] Updated weights for policy 0, policy_version 9400 (0.0006) [2023-03-06 23:22:15,618][81400] Updated weights for policy 0, policy_version 9410 (0.0007) [2023-03-06 23:22:16,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13145.4). Total num frames: 9643008. Throughput: 0: 13144.5. Samples: 9642203. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:22:16,237][81074] Avg episode reward: [(0, '3265.449')] [2023-03-06 23:22:16,400][81400] Updated weights for policy 0, policy_version 9420 (0.0006) [2023-03-06 23:22:17,194][81400] Updated weights for policy 0, policy_version 9430 (0.0006) [2023-03-06 23:22:17,990][81400] Updated weights for policy 0, policy_version 9440 (0.0006) [2023-03-06 23:22:18,748][81400] Updated weights for policy 0, policy_version 9450 (0.0006) [2023-03-06 23:22:19,521][81400] Updated weights for policy 0, policy_version 9460 (0.0006) [2023-03-06 23:22:20,298][81400] Updated weights for policy 0, policy_version 9470 (0.0006) [2023-03-06 23:22:21,067][81400] Updated weights for policy 0, policy_version 9480 (0.0006) [2023-03-06 23:22:21,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13141.3, 300 sec: 13148.8). Total num frames: 9709568. Throughput: 0: 13153.1. Samples: 9681608. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:22:21,237][81074] Avg episode reward: [(0, '3230.691')] [2023-03-06 23:22:21,858][81400] Updated weights for policy 0, policy_version 9490 (0.0006) [2023-03-06 23:22:22,649][81400] Updated weights for policy 0, policy_version 9500 (0.0006) [2023-03-06 23:22:23,407][81400] Updated weights for policy 0, policy_version 9510 (0.0006) [2023-03-06 23:22:24,185][81400] Updated weights for policy 0, policy_version 9520 (0.0006) [2023-03-06 23:22:24,975][81400] Updated weights for policy 0, policy_version 9530 (0.0006) [2023-03-06 23:22:25,756][81400] Updated weights for policy 0, policy_version 9540 (0.0006) [2023-03-06 23:22:26,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 9775104. Throughput: 0: 13157.4. Samples: 9760293. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:22:26,237][81074] Avg episode reward: [(0, '3356.738')] [2023-03-06 23:22:26,539][81400] Updated weights for policy 0, policy_version 9550 (0.0006) [2023-03-06 23:22:27,318][81400] Updated weights for policy 0, policy_version 9560 (0.0006) [2023-03-06 23:22:28,093][81400] Updated weights for policy 0, policy_version 9570 (0.0006) [2023-03-06 23:22:28,873][81400] Updated weights for policy 0, policy_version 9580 (0.0005) [2023-03-06 23:22:29,659][81400] Updated weights for policy 0, policy_version 9590 (0.0006) [2023-03-06 23:22:30,453][81400] Updated weights for policy 0, policy_version 9600 (0.0006) [2023-03-06 23:22:31,236][81074] Fps is (10 sec: 13005.1, 60 sec: 13124.3, 300 sec: 13145.4). Total num frames: 9839616. Throughput: 0: 13146.7. Samples: 9838998. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:22:31,237][81074] Avg episode reward: [(0, '3424.933')] [2023-03-06 23:22:31,238][81400] Updated weights for policy 0, policy_version 9610 (0.0006) [2023-03-06 23:22:32,012][81400] Updated weights for policy 0, policy_version 9620 (0.0006) [2023-03-06 23:22:32,790][81400] Updated weights for policy 0, policy_version 9630 (0.0006) [2023-03-06 23:22:33,564][81400] Updated weights for policy 0, policy_version 9640 (0.0006) [2023-03-06 23:22:34,342][81400] Updated weights for policy 0, policy_version 9650 (0.0007) [2023-03-06 23:22:35,128][81400] Updated weights for policy 0, policy_version 9660 (0.0006) [2023-03-06 23:22:35,895][81400] Updated weights for policy 0, policy_version 9670 (0.0006) [2023-03-06 23:22:36,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 9906176. Throughput: 0: 13151.4. Samples: 9878671. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:22:36,237][81074] Avg episode reward: [(0, '3318.162')] [2023-03-06 23:22:36,675][81400] Updated weights for policy 0, policy_version 9680 (0.0006) [2023-03-06 23:22:37,436][81400] Updated weights for policy 0, policy_version 9690 (0.0006) [2023-03-06 23:22:38,206][81400] Updated weights for policy 0, policy_version 9700 (0.0006) [2023-03-06 23:22:38,978][81400] Updated weights for policy 0, policy_version 9710 (0.0005) [2023-03-06 23:22:39,782][81400] Updated weights for policy 0, policy_version 9720 (0.0006) [2023-03-06 23:22:40,568][81400] Updated weights for policy 0, policy_version 9730 (0.0006) [2023-03-06 23:22:41,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 9971712. Throughput: 0: 13154.1. Samples: 9957589. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:22:41,237][81074] Avg episode reward: [(0, '3316.523')] [2023-03-06 23:22:41,328][81400] Updated weights for policy 0, policy_version 9740 (0.0007) [2023-03-06 23:22:42,124][81400] Updated weights for policy 0, policy_version 9750 (0.0006) [2023-03-06 23:22:42,911][81400] Updated weights for policy 0, policy_version 9760 (0.0006) [2023-03-06 23:22:43,677][81400] Updated weights for policy 0, policy_version 9770 (0.0007) [2023-03-06 23:22:44,455][81400] Updated weights for policy 0, policy_version 9780 (0.0006) [2023-03-06 23:22:45,234][81400] Updated weights for policy 0, policy_version 9790 (0.0006) [2023-03-06 23:22:46,013][81400] Updated weights for policy 0, policy_version 9800 (0.0006) [2023-03-06 23:22:46,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13141.4, 300 sec: 13145.4). Total num frames: 10037248. Throughput: 0: 13155.9. Samples: 10036369. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:22:46,237][81074] Avg episode reward: [(0, '3353.760')] [2023-03-06 23:22:46,804][81400] Updated weights for policy 0, policy_version 9810 (0.0006) [2023-03-06 23:22:47,574][81400] Updated weights for policy 0, policy_version 9820 (0.0006) [2023-03-06 23:22:48,372][81400] Updated weights for policy 0, policy_version 9830 (0.0007) [2023-03-06 23:22:49,149][81400] Updated weights for policy 0, policy_version 9840 (0.0006) [2023-03-06 23:22:49,911][81400] Updated weights for policy 0, policy_version 9850 (0.0007) [2023-03-06 23:22:50,704][81400] Updated weights for policy 0, policy_version 9860 (0.0006) [2023-03-06 23:22:51,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 10102784. Throughput: 0: 13151.1. Samples: 10075761. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:22:51,237][81074] Avg episode reward: [(0, '3074.448')] [2023-03-06 23:22:51,469][81400] Updated weights for policy 0, policy_version 9870 (0.0006) [2023-03-06 23:22:52,247][81400] Updated weights for policy 0, policy_version 9880 (0.0006) [2023-03-06 23:22:53,022][81400] Updated weights for policy 0, policy_version 9890 (0.0006) [2023-03-06 23:22:53,806][81400] Updated weights for policy 0, policy_version 9900 (0.0006) [2023-03-06 23:22:54,578][81400] Updated weights for policy 0, policy_version 9910 (0.0006) [2023-03-06 23:22:55,382][81400] Updated weights for policy 0, policy_version 9920 (0.0006) [2023-03-06 23:22:56,137][81400] Updated weights for policy 0, policy_version 9930 (0.0006) [2023-03-06 23:22:56,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 10169344. Throughput: 0: 13142.8. Samples: 10154561. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:22:56,237][81074] Avg episode reward: [(0, '3218.062')] [2023-03-06 23:22:56,909][81400] Updated weights for policy 0, policy_version 9940 (0.0006) [2023-03-06 23:22:57,697][81400] Updated weights for policy 0, policy_version 9950 (0.0007) [2023-03-06 23:22:58,487][81400] Updated weights for policy 0, policy_version 9960 (0.0006) [2023-03-06 23:22:59,257][81400] Updated weights for policy 0, policy_version 9970 (0.0006) [2023-03-06 23:23:00,034][81400] Updated weights for policy 0, policy_version 9980 (0.0006) [2023-03-06 23:23:00,826][81400] Updated weights for policy 0, policy_version 9990 (0.0006) [2023-03-06 23:23:01,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 10234880. Throughput: 0: 13137.7. Samples: 10233402. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:23:01,237][81074] Avg episode reward: [(0, '3114.852')] [2023-03-06 23:23:01,613][81400] Updated weights for policy 0, policy_version 10000 (0.0007) [2023-03-06 23:23:02,390][81400] Updated weights for policy 0, policy_version 10010 (0.0006) [2023-03-06 23:23:03,156][81400] Updated weights for policy 0, policy_version 10020 (0.0006) [2023-03-06 23:23:03,940][81400] Updated weights for policy 0, policy_version 10030 (0.0005) [2023-03-06 23:23:04,713][81400] Updated weights for policy 0, policy_version 10040 (0.0006) [2023-03-06 23:23:05,510][81400] Updated weights for policy 0, policy_version 10050 (0.0006) [2023-03-06 23:23:06,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13141.4, 300 sec: 13145.4). Total num frames: 10300416. Throughput: 0: 13139.8. Samples: 10272898. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:23:06,237][81074] Avg episode reward: [(0, '3366.569')] [2023-03-06 23:23:06,268][81400] Updated weights for policy 0, policy_version 10060 (0.0007) [2023-03-06 23:23:07,058][81400] Updated weights for policy 0, policy_version 10070 (0.0006) [2023-03-06 23:23:07,840][81400] Updated weights for policy 0, policy_version 10080 (0.0007) [2023-03-06 23:23:08,606][81400] Updated weights for policy 0, policy_version 10090 (0.0007) [2023-03-06 23:23:09,389][81400] Updated weights for policy 0, policy_version 10100 (0.0006) [2023-03-06 23:23:10,159][81400] Updated weights for policy 0, policy_version 10110 (0.0006) [2023-03-06 23:23:10,930][81400] Updated weights for policy 0, policy_version 10120 (0.0006) [2023-03-06 23:23:11,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 10366976. Throughput: 0: 13145.9. Samples: 10351859. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:23:11,237][81074] Avg episode reward: [(0, '3374.751')] [2023-03-06 23:23:11,700][81400] Updated weights for policy 0, policy_version 10130 (0.0006) [2023-03-06 23:23:12,486][81400] Updated weights for policy 0, policy_version 10140 (0.0006) [2023-03-06 23:23:13,243][81400] Updated weights for policy 0, policy_version 10150 (0.0006) [2023-03-06 23:23:14,031][81400] Updated weights for policy 0, policy_version 10160 (0.0006) [2023-03-06 23:23:14,829][81400] Updated weights for policy 0, policy_version 10170 (0.0006) [2023-03-06 23:23:15,613][81400] Updated weights for policy 0, policy_version 10180 (0.0006) [2023-03-06 23:23:16,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 10431488. Throughput: 0: 13149.9. Samples: 10430742. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:23:16,237][81074] Avg episode reward: [(0, '3397.153')] [2023-03-06 23:23:16,402][81400] Updated weights for policy 0, policy_version 10190 (0.0006) [2023-03-06 23:23:17,183][81400] Updated weights for policy 0, policy_version 10200 (0.0007) [2023-03-06 23:23:17,956][81400] Updated weights for policy 0, policy_version 10210 (0.0006) [2023-03-06 23:23:18,733][81400] Updated weights for policy 0, policy_version 10220 (0.0006) [2023-03-06 23:23:19,497][81400] Updated weights for policy 0, policy_version 10230 (0.0006) [2023-03-06 23:23:20,297][81400] Updated weights for policy 0, policy_version 10240 (0.0007) [2023-03-06 23:23:21,068][81400] Updated weights for policy 0, policy_version 10250 (0.0007) [2023-03-06 23:23:21,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13141.4, 300 sec: 13145.4). Total num frames: 10498048. Throughput: 0: 13151.5. Samples: 10470489. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:23:21,237][81074] Avg episode reward: [(0, '3421.505')] [2023-03-06 23:23:21,851][81400] Updated weights for policy 0, policy_version 10260 (0.0006) [2023-03-06 23:23:22,626][81400] Updated weights for policy 0, policy_version 10270 (0.0006) [2023-03-06 23:23:23,395][81400] Updated weights for policy 0, policy_version 10280 (0.0006) [2023-03-06 23:23:24,171][81400] Updated weights for policy 0, policy_version 10290 (0.0006) [2023-03-06 23:23:24,939][81400] Updated weights for policy 0, policy_version 10300 (0.0006) [2023-03-06 23:23:25,734][81400] Updated weights for policy 0, policy_version 10310 (0.0007) [2023-03-06 23:23:26,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 10563584. Throughput: 0: 13149.5. Samples: 10549316. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:23:26,237][81074] Avg episode reward: [(0, '3381.883')] [2023-03-06 23:23:26,513][81400] Updated weights for policy 0, policy_version 10320 (0.0006) [2023-03-06 23:23:27,298][81400] Updated weights for policy 0, policy_version 10330 (0.0006) [2023-03-06 23:23:28,075][81400] Updated weights for policy 0, policy_version 10340 (0.0006) [2023-03-06 23:23:28,853][81400] Updated weights for policy 0, policy_version 10350 (0.0006) [2023-03-06 23:23:29,646][81400] Updated weights for policy 0, policy_version 10360 (0.0006) [2023-03-06 23:23:30,429][81400] Updated weights for policy 0, policy_version 10370 (0.0006) [2023-03-06 23:23:31,204][81400] Updated weights for policy 0, policy_version 10380 (0.0006) [2023-03-06 23:23:31,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13142.0). Total num frames: 10629120. Throughput: 0: 13147.4. Samples: 10628002. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:23:31,237][81074] Avg episode reward: [(0, '3409.217')] [2023-03-06 23:23:31,977][81400] Updated weights for policy 0, policy_version 10390 (0.0006) [2023-03-06 23:23:32,752][81400] Updated weights for policy 0, policy_version 10400 (0.0006) [2023-03-06 23:23:33,516][81400] Updated weights for policy 0, policy_version 10410 (0.0006) [2023-03-06 23:23:34,292][81400] Updated weights for policy 0, policy_version 10420 (0.0006) [2023-03-06 23:23:35,060][81400] Updated weights for policy 0, policy_version 10430 (0.0006) [2023-03-06 23:23:35,820][81400] Updated weights for policy 0, policy_version 10440 (0.0006) [2023-03-06 23:23:36,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 10695680. Throughput: 0: 13153.1. Samples: 10667651. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:23:36,237][81074] Avg episode reward: [(0, '3451.528')] [2023-03-06 23:23:36,601][81400] Updated weights for policy 0, policy_version 10450 (0.0006) [2023-03-06 23:23:37,367][81400] Updated weights for policy 0, policy_version 10460 (0.0006) [2023-03-06 23:23:38,158][81400] Updated weights for policy 0, policy_version 10470 (0.0006) [2023-03-06 23:23:38,944][81400] Updated weights for policy 0, policy_version 10480 (0.0006) [2023-03-06 23:23:39,727][81400] Updated weights for policy 0, policy_version 10490 (0.0005) [2023-03-06 23:23:40,494][81400] Updated weights for policy 0, policy_version 10500 (0.0006) [2023-03-06 23:23:41,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 10761216. Throughput: 0: 13161.0. Samples: 10746806. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:23:41,237][81074] Avg episode reward: [(0, '3308.974')] [2023-03-06 23:23:41,284][81400] Updated weights for policy 0, policy_version 10510 (0.0006) [2023-03-06 23:23:42,056][81400] Updated weights for policy 0, policy_version 10520 (0.0006) [2023-03-06 23:23:42,823][81400] Updated weights for policy 0, policy_version 10530 (0.0006) [2023-03-06 23:23:43,586][81400] Updated weights for policy 0, policy_version 10540 (0.0006) [2023-03-06 23:23:44,360][81400] Updated weights for policy 0, policy_version 10550 (0.0006) [2023-03-06 23:23:45,137][81400] Updated weights for policy 0, policy_version 10560 (0.0007) [2023-03-06 23:23:45,923][81400] Updated weights for policy 0, policy_version 10570 (0.0006) [2023-03-06 23:23:46,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13148.9). Total num frames: 10827776. Throughput: 0: 13170.6. Samples: 10826079. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:23:46,237][81074] Avg episode reward: [(0, '3029.019')] [2023-03-06 23:23:46,685][81400] Updated weights for policy 0, policy_version 10580 (0.0006) [2023-03-06 23:23:47,473][81400] Updated weights for policy 0, policy_version 10590 (0.0005) [2023-03-06 23:23:48,252][81400] Updated weights for policy 0, policy_version 10600 (0.0006) [2023-03-06 23:23:49,035][81400] Updated weights for policy 0, policy_version 10610 (0.0006) [2023-03-06 23:23:49,807][81400] Updated weights for policy 0, policy_version 10620 (0.0006) [2023-03-06 23:23:50,566][81400] Updated weights for policy 0, policy_version 10630 (0.0006) [2023-03-06 23:23:51,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13175.5, 300 sec: 13148.9). Total num frames: 10893312. Throughput: 0: 13173.4. Samples: 10865699. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:23:51,237][81074] Avg episode reward: [(0, '3044.924')] [2023-03-06 23:23:51,347][81400] Updated weights for policy 0, policy_version 10640 (0.0005) [2023-03-06 23:23:52,143][81400] Updated weights for policy 0, policy_version 10650 (0.0006) [2023-03-06 23:23:52,913][81400] Updated weights for policy 0, policy_version 10660 (0.0007) [2023-03-06 23:23:53,688][81400] Updated weights for policy 0, policy_version 10670 (0.0007) [2023-03-06 23:23:54,469][81400] Updated weights for policy 0, policy_version 10680 (0.0006) [2023-03-06 23:23:55,247][81400] Updated weights for policy 0, policy_version 10690 (0.0006) [2023-03-06 23:23:56,031][81400] Updated weights for policy 0, policy_version 10700 (0.0006) [2023-03-06 23:23:56,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 10958848. Throughput: 0: 13171.3. Samples: 10944568. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:23:56,237][81074] Avg episode reward: [(0, '2999.583')] [2023-03-06 23:23:56,240][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000010702_10958848.pth... [2023-03-06 23:23:56,271][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000007621_7803904.pth [2023-03-06 23:23:56,799][81400] Updated weights for policy 0, policy_version 10710 (0.0006) [2023-03-06 23:23:57,582][81400] Updated weights for policy 0, policy_version 10720 (0.0006) [2023-03-06 23:23:58,378][81400] Updated weights for policy 0, policy_version 10730 (0.0005) [2023-03-06 23:23:59,155][81400] Updated weights for policy 0, policy_version 10740 (0.0006) [2023-03-06 23:23:59,919][81400] Updated weights for policy 0, policy_version 10750 (0.0006) [2023-03-06 23:24:00,697][81400] Updated weights for policy 0, policy_version 10760 (0.0006) [2023-03-06 23:24:01,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13148.9). Total num frames: 11025408. Throughput: 0: 13178.7. Samples: 11023781. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:24:01,237][81074] Avg episode reward: [(0, '3072.909')] [2023-03-06 23:24:01,470][81400] Updated weights for policy 0, policy_version 10770 (0.0005) [2023-03-06 23:24:02,237][81400] Updated weights for policy 0, policy_version 10780 (0.0006) [2023-03-06 23:24:03,011][81400] Updated weights for policy 0, policy_version 10790 (0.0007) [2023-03-06 23:24:03,787][81400] Updated weights for policy 0, policy_version 10800 (0.0007) [2023-03-06 23:24:04,558][81400] Updated weights for policy 0, policy_version 10810 (0.0007) [2023-03-06 23:24:05,341][81400] Updated weights for policy 0, policy_version 10820 (0.0006) [2023-03-06 23:24:06,129][81400] Updated weights for policy 0, policy_version 10830 (0.0006) [2023-03-06 23:24:06,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13148.9). Total num frames: 11090944. Throughput: 0: 13174.1. Samples: 11063324. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:24:06,237][81074] Avg episode reward: [(0, '2931.290')] [2023-03-06 23:24:06,905][81400] Updated weights for policy 0, policy_version 10840 (0.0006) [2023-03-06 23:24:07,669][81400] Updated weights for policy 0, policy_version 10850 (0.0006) [2023-03-06 23:24:08,462][81400] Updated weights for policy 0, policy_version 10860 (0.0006) [2023-03-06 23:24:09,226][81400] Updated weights for policy 0, policy_version 10870 (0.0006) [2023-03-06 23:24:09,994][81400] Updated weights for policy 0, policy_version 10880 (0.0006) [2023-03-06 23:24:10,760][81400] Updated weights for policy 0, policy_version 10890 (0.0007) [2023-03-06 23:24:11,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13152.3). Total num frames: 11157504. Throughput: 0: 13185.9. Samples: 11142679. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:24:11,237][81074] Avg episode reward: [(0, '2989.652')] [2023-03-06 23:24:11,546][81400] Updated weights for policy 0, policy_version 10900 (0.0005) [2023-03-06 23:24:12,314][81400] Updated weights for policy 0, policy_version 10910 (0.0007) [2023-03-06 23:24:13,098][81400] Updated weights for policy 0, policy_version 10920 (0.0006) [2023-03-06 23:24:13,874][81400] Updated weights for policy 0, policy_version 10930 (0.0006) [2023-03-06 23:24:14,645][81400] Updated weights for policy 0, policy_version 10940 (0.0006) [2023-03-06 23:24:15,420][81400] Updated weights for policy 0, policy_version 10950 (0.0006) [2023-03-06 23:24:16,198][81400] Updated weights for policy 0, policy_version 10960 (0.0006) [2023-03-06 23:24:16,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13148.9). Total num frames: 11223040. Throughput: 0: 13195.5. Samples: 11221802. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:24:16,237][81074] Avg episode reward: [(0, '2981.889')] [2023-03-06 23:24:16,968][81400] Updated weights for policy 0, policy_version 10970 (0.0007) [2023-03-06 23:24:17,747][81400] Updated weights for policy 0, policy_version 10980 (0.0006) [2023-03-06 23:24:18,524][81400] Updated weights for policy 0, policy_version 10990 (0.0006) [2023-03-06 23:24:19,295][81400] Updated weights for policy 0, policy_version 11000 (0.0007) [2023-03-06 23:24:20,073][81400] Updated weights for policy 0, policy_version 11010 (0.0006) [2023-03-06 23:24:20,851][81400] Updated weights for policy 0, policy_version 11020 (0.0006) [2023-03-06 23:24:21,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13175.5, 300 sec: 13148.9). Total num frames: 11288576. Throughput: 0: 13195.9. Samples: 11261467. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:24:21,237][81074] Avg episode reward: [(0, '3030.114')] [2023-03-06 23:24:21,631][81400] Updated weights for policy 0, policy_version 11030 (0.0006) [2023-03-06 23:24:22,400][81400] Updated weights for policy 0, policy_version 11040 (0.0006) [2023-03-06 23:24:23,166][81400] Updated weights for policy 0, policy_version 11050 (0.0006) [2023-03-06 23:24:23,956][81400] Updated weights for policy 0, policy_version 11060 (0.0007) [2023-03-06 23:24:24,721][81400] Updated weights for policy 0, policy_version 11070 (0.0006) [2023-03-06 23:24:25,513][81400] Updated weights for policy 0, policy_version 11080 (0.0006) [2023-03-06 23:24:26,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13192.6, 300 sec: 13152.3). Total num frames: 11355136. Throughput: 0: 13197.9. Samples: 11340711. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:24:26,237][81074] Avg episode reward: [(0, '3134.576')] [2023-03-06 23:24:26,272][81400] Updated weights for policy 0, policy_version 11090 (0.0007) [2023-03-06 23:24:27,048][81400] Updated weights for policy 0, policy_version 11100 (0.0007) [2023-03-06 23:24:27,832][81400] Updated weights for policy 0, policy_version 11110 (0.0006) [2023-03-06 23:24:28,590][81400] Updated weights for policy 0, policy_version 11120 (0.0006) [2023-03-06 23:24:29,386][81400] Updated weights for policy 0, policy_version 11130 (0.0006) [2023-03-06 23:24:30,177][81400] Updated weights for policy 0, policy_version 11140 (0.0006) [2023-03-06 23:24:30,950][81400] Updated weights for policy 0, policy_version 11150 (0.0006) [2023-03-06 23:24:31,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13192.5, 300 sec: 13148.9). Total num frames: 11420672. Throughput: 0: 13189.7. Samples: 11419614. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:24:31,237][81074] Avg episode reward: [(0, '3338.172')] [2023-03-06 23:24:31,731][81400] Updated weights for policy 0, policy_version 11160 (0.0006) [2023-03-06 23:24:32,509][81400] Updated weights for policy 0, policy_version 11170 (0.0006) [2023-03-06 23:24:33,276][81400] Updated weights for policy 0, policy_version 11180 (0.0006) [2023-03-06 23:24:34,041][81400] Updated weights for policy 0, policy_version 11190 (0.0006) [2023-03-06 23:24:34,828][81400] Updated weights for policy 0, policy_version 11200 (0.0006) [2023-03-06 23:24:35,606][81400] Updated weights for policy 0, policy_version 11210 (0.0006) [2023-03-06 23:24:36,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13192.5, 300 sec: 13152.3). Total num frames: 11487232. Throughput: 0: 13190.1. Samples: 11459255. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 23:24:36,237][81074] Avg episode reward: [(0, '3359.030')] [2023-03-06 23:24:36,390][81400] Updated weights for policy 0, policy_version 11220 (0.0006) [2023-03-06 23:24:37,178][81400] Updated weights for policy 0, policy_version 11230 (0.0006) [2023-03-06 23:24:37,965][81400] Updated weights for policy 0, policy_version 11240 (0.0007) [2023-03-06 23:24:38,732][81400] Updated weights for policy 0, policy_version 11250 (0.0007) [2023-03-06 23:24:39,509][81400] Updated weights for policy 0, policy_version 11260 (0.0006) [2023-03-06 23:24:40,286][81400] Updated weights for policy 0, policy_version 11270 (0.0008) [2023-03-06 23:24:41,070][81400] Updated weights for policy 0, policy_version 11280 (0.0006) [2023-03-06 23:24:41,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13192.5, 300 sec: 13152.3). Total num frames: 11552768. Throughput: 0: 13192.9. Samples: 11538248. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 23:24:41,237][81074] Avg episode reward: [(0, '3203.333')] [2023-03-06 23:24:41,860][81400] Updated weights for policy 0, policy_version 11290 (0.0006) [2023-03-06 23:24:42,639][81400] Updated weights for policy 0, policy_version 11300 (0.0007) [2023-03-06 23:24:43,413][81400] Updated weights for policy 0, policy_version 11310 (0.0006) [2023-03-06 23:24:44,198][81400] Updated weights for policy 0, policy_version 11320 (0.0006) [2023-03-06 23:24:44,969][81400] Updated weights for policy 0, policy_version 11330 (0.0006) [2023-03-06 23:24:45,740][81400] Updated weights for policy 0, policy_version 11340 (0.0006) [2023-03-06 23:24:46,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13175.5, 300 sec: 13152.3). Total num frames: 11618304. Throughput: 0: 13179.1. Samples: 11616843. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:24:46,237][81074] Avg episode reward: [(0, '3206.002')] [2023-03-06 23:24:46,547][81400] Updated weights for policy 0, policy_version 11350 (0.0006) [2023-03-06 23:24:47,333][81400] Updated weights for policy 0, policy_version 11360 (0.0006) [2023-03-06 23:24:48,120][81400] Updated weights for policy 0, policy_version 11370 (0.0007) [2023-03-06 23:24:48,894][81400] Updated weights for policy 0, policy_version 11380 (0.0006) [2023-03-06 23:24:49,676][81400] Updated weights for policy 0, policy_version 11390 (0.0006) [2023-03-06 23:24:50,449][81400] Updated weights for policy 0, policy_version 11400 (0.0005) [2023-03-06 23:24:51,217][81400] Updated weights for policy 0, policy_version 11410 (0.0005) [2023-03-06 23:24:51,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13175.5, 300 sec: 13152.3). Total num frames: 11683840. Throughput: 0: 13171.2. Samples: 11656028. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:24:51,237][81074] Avg episode reward: [(0, '3318.196')] [2023-03-06 23:24:51,997][81400] Updated weights for policy 0, policy_version 11420 (0.0007) [2023-03-06 23:24:52,772][81400] Updated weights for policy 0, policy_version 11430 (0.0007) [2023-03-06 23:24:53,546][81400] Updated weights for policy 0, policy_version 11440 (0.0007) [2023-03-06 23:24:54,310][81400] Updated weights for policy 0, policy_version 11450 (0.0006) [2023-03-06 23:24:55,095][81400] Updated weights for policy 0, policy_version 11460 (0.0006) [2023-03-06 23:24:55,878][81400] Updated weights for policy 0, policy_version 11470 (0.0006) [2023-03-06 23:24:56,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13175.5, 300 sec: 13152.3). Total num frames: 11749376. Throughput: 0: 13167.8. Samples: 11735232. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:24:56,237][81074] Avg episode reward: [(0, '3392.390')] [2023-03-06 23:24:56,645][81400] Updated weights for policy 0, policy_version 11480 (0.0006) [2023-03-06 23:24:57,441][81400] Updated weights for policy 0, policy_version 11490 (0.0006) [2023-03-06 23:24:58,230][81400] Updated weights for policy 0, policy_version 11500 (0.0006) [2023-03-06 23:24:58,980][81400] Updated weights for policy 0, policy_version 11510 (0.0005) [2023-03-06 23:24:59,759][81400] Updated weights for policy 0, policy_version 11520 (0.0006) [2023-03-06 23:25:00,526][81400] Updated weights for policy 0, policy_version 11530 (0.0005) [2023-03-06 23:25:01,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13155.8). Total num frames: 11815936. Throughput: 0: 13165.9. Samples: 11814266. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:25:01,237][81074] Avg episode reward: [(0, '3432.471')] [2023-03-06 23:25:01,291][81400] Updated weights for policy 0, policy_version 11540 (0.0005) [2023-03-06 23:25:02,068][81400] Updated weights for policy 0, policy_version 11550 (0.0005) [2023-03-06 23:25:02,850][81400] Updated weights for policy 0, policy_version 11560 (0.0006) [2023-03-06 23:25:03,611][81400] Updated weights for policy 0, policy_version 11570 (0.0006) [2023-03-06 23:25:04,402][81400] Updated weights for policy 0, policy_version 11580 (0.0006) [2023-03-06 23:25:05,196][81400] Updated weights for policy 0, policy_version 11590 (0.0008) [2023-03-06 23:25:05,969][81400] Updated weights for policy 0, policy_version 11600 (0.0006) [2023-03-06 23:25:06,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13175.5, 300 sec: 13155.8). Total num frames: 11881472. Throughput: 0: 13164.3. Samples: 11853861. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:25:06,237][81074] Avg episode reward: [(0, '3453.797')] [2023-03-06 23:25:06,760][81400] Updated weights for policy 0, policy_version 11610 (0.0006) [2023-03-06 23:25:07,540][81400] Updated weights for policy 0, policy_version 11620 (0.0006) [2023-03-06 23:25:08,329][81400] Updated weights for policy 0, policy_version 11630 (0.0006) [2023-03-06 23:25:09,090][81400] Updated weights for policy 0, policy_version 11640 (0.0006) [2023-03-06 23:25:09,870][81400] Updated weights for policy 0, policy_version 11650 (0.0006) [2023-03-06 23:25:10,661][81400] Updated weights for policy 0, policy_version 11660 (0.0007) [2023-03-06 23:25:11,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13159.3). Total num frames: 11947008. Throughput: 0: 13155.0. Samples: 11932684. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:25:11,237][81074] Avg episode reward: [(0, '3344.180')] [2023-03-06 23:25:11,432][81400] Updated weights for policy 0, policy_version 11670 (0.0006) [2023-03-06 23:25:12,212][81400] Updated weights for policy 0, policy_version 11680 (0.0006) [2023-03-06 23:25:12,998][81400] Updated weights for policy 0, policy_version 11690 (0.0006) [2023-03-06 23:25:13,786][81400] Updated weights for policy 0, policy_version 11700 (0.0006) [2023-03-06 23:25:14,552][81400] Updated weights for policy 0, policy_version 11710 (0.0006) [2023-03-06 23:25:15,332][81400] Updated weights for policy 0, policy_version 11720 (0.0006) [2023-03-06 23:25:16,117][81400] Updated weights for policy 0, policy_version 11730 (0.0005) [2023-03-06 23:25:16,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13159.3). Total num frames: 12012544. Throughput: 0: 13152.4. Samples: 12011470. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:25:16,237][81074] Avg episode reward: [(0, '3529.352')] [2023-03-06 23:25:16,881][81400] Updated weights for policy 0, policy_version 11740 (0.0006) [2023-03-06 23:25:17,654][81400] Updated weights for policy 0, policy_version 11750 (0.0006) [2023-03-06 23:25:18,444][81400] Updated weights for policy 0, policy_version 11760 (0.0006) [2023-03-06 23:25:19,233][81400] Updated weights for policy 0, policy_version 11770 (0.0006) [2023-03-06 23:25:19,997][81400] Updated weights for policy 0, policy_version 11780 (0.0006) [2023-03-06 23:25:20,779][81400] Updated weights for policy 0, policy_version 11790 (0.0006) [2023-03-06 23:25:21,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13175.5, 300 sec: 13159.3). Total num frames: 12079104. Throughput: 0: 13146.9. Samples: 12050863. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:25:21,237][81074] Avg episode reward: [(0, '3277.298')] [2023-03-06 23:25:21,541][81400] Updated weights for policy 0, policy_version 11800 (0.0006) [2023-03-06 23:25:22,311][81400] Updated weights for policy 0, policy_version 11810 (0.0006) [2023-03-06 23:25:23,106][81400] Updated weights for policy 0, policy_version 11820 (0.0006) [2023-03-06 23:25:23,889][81400] Updated weights for policy 0, policy_version 11830 (0.0007) [2023-03-06 23:25:24,664][81400] Updated weights for policy 0, policy_version 11840 (0.0006) [2023-03-06 23:25:25,457][81400] Updated weights for policy 0, policy_version 11850 (0.0006) [2023-03-06 23:25:26,221][81400] Updated weights for policy 0, policy_version 11860 (0.0007) [2023-03-06 23:25:26,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13159.3). Total num frames: 12144640. Throughput: 0: 13149.2. Samples: 12129963. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:25:26,237][81074] Avg episode reward: [(0, '3426.255')] [2023-03-06 23:25:26,996][81400] Updated weights for policy 0, policy_version 11870 (0.0006) [2023-03-06 23:25:27,765][81400] Updated weights for policy 0, policy_version 11880 (0.0006) [2023-03-06 23:25:28,542][81400] Updated weights for policy 0, policy_version 11890 (0.0006) [2023-03-06 23:25:29,318][81400] Updated weights for policy 0, policy_version 11900 (0.0006) [2023-03-06 23:25:30,107][81400] Updated weights for policy 0, policy_version 11910 (0.0007) [2023-03-06 23:25:30,890][81400] Updated weights for policy 0, policy_version 11920 (0.0007) [2023-03-06 23:25:31,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13159.3). Total num frames: 12210176. Throughput: 0: 13160.2. Samples: 12209053. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:25:31,237][81074] Avg episode reward: [(0, '3396.152')] [2023-03-06 23:25:31,665][81400] Updated weights for policy 0, policy_version 11930 (0.0006) [2023-03-06 23:25:32,447][81400] Updated weights for policy 0, policy_version 11940 (0.0006) [2023-03-06 23:25:33,206][81400] Updated weights for policy 0, policy_version 11950 (0.0005) [2023-03-06 23:25:33,980][81400] Updated weights for policy 0, policy_version 11960 (0.0006) [2023-03-06 23:25:34,760][81400] Updated weights for policy 0, policy_version 11970 (0.0007) [2023-03-06 23:25:35,545][81400] Updated weights for policy 0, policy_version 11980 (0.0006) [2023-03-06 23:25:36,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13159.3). Total num frames: 12275712. Throughput: 0: 13172.4. Samples: 12248786. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:25:36,237][81074] Avg episode reward: [(0, '3377.126')] [2023-03-06 23:25:36,318][81400] Updated weights for policy 0, policy_version 11990 (0.0006) [2023-03-06 23:25:37,093][81400] Updated weights for policy 0, policy_version 12000 (0.0006) [2023-03-06 23:25:37,856][81400] Updated weights for policy 0, policy_version 12010 (0.0006) [2023-03-06 23:25:38,640][81400] Updated weights for policy 0, policy_version 12020 (0.0005) [2023-03-06 23:25:39,417][81400] Updated weights for policy 0, policy_version 12030 (0.0006) [2023-03-06 23:25:40,209][81400] Updated weights for policy 0, policy_version 12040 (0.0006) [2023-03-06 23:25:40,996][81400] Updated weights for policy 0, policy_version 12050 (0.0006) [2023-03-06 23:25:41,236][81074] Fps is (10 sec: 13209.8, 60 sec: 13158.4, 300 sec: 13159.3). Total num frames: 12342272. Throughput: 0: 13165.2. Samples: 12327667. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:25:41,237][81074] Avg episode reward: [(0, '3288.498')] [2023-03-06 23:25:41,775][81400] Updated weights for policy 0, policy_version 12060 (0.0006) [2023-03-06 23:25:42,548][81400] Updated weights for policy 0, policy_version 12070 (0.0006) [2023-03-06 23:25:43,339][81400] Updated weights for policy 0, policy_version 12080 (0.0005) [2023-03-06 23:25:44,105][81400] Updated weights for policy 0, policy_version 12090 (0.0007) [2023-03-06 23:25:44,898][81400] Updated weights for policy 0, policy_version 12100 (0.0007) [2023-03-06 23:25:45,662][81400] Updated weights for policy 0, policy_version 12110 (0.0006) [2023-03-06 23:25:46,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13159.3). Total num frames: 12407808. Throughput: 0: 13161.2. Samples: 12406522. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:25:46,237][81074] Avg episode reward: [(0, '3479.714')] [2023-03-06 23:25:46,428][81400] Updated weights for policy 0, policy_version 12120 (0.0006) [2023-03-06 23:25:47,209][81400] Updated weights for policy 0, policy_version 12130 (0.0007) [2023-03-06 23:25:47,966][81400] Updated weights for policy 0, policy_version 12140 (0.0006) [2023-03-06 23:25:48,731][81400] Updated weights for policy 0, policy_version 12150 (0.0007) [2023-03-06 23:25:49,514][81400] Updated weights for policy 0, policy_version 12160 (0.0006) [2023-03-06 23:25:50,271][81400] Updated weights for policy 0, policy_version 12170 (0.0006) [2023-03-06 23:25:51,059][81400] Updated weights for policy 0, policy_version 12180 (0.0006) [2023-03-06 23:25:51,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13162.7). Total num frames: 12474368. Throughput: 0: 13166.1. Samples: 12446337. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:25:51,237][81074] Avg episode reward: [(0, '3478.459')] [2023-03-06 23:25:51,850][81400] Updated weights for policy 0, policy_version 12190 (0.0006) [2023-03-06 23:25:52,617][81400] Updated weights for policy 0, policy_version 12200 (0.0006) [2023-03-06 23:25:53,413][81400] Updated weights for policy 0, policy_version 12210 (0.0006) [2023-03-06 23:25:54,181][81400] Updated weights for policy 0, policy_version 12220 (0.0006) [2023-03-06 23:25:54,953][81400] Updated weights for policy 0, policy_version 12230 (0.0006) [2023-03-06 23:25:55,727][81400] Updated weights for policy 0, policy_version 12240 (0.0006) [2023-03-06 23:25:56,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13159.3). Total num frames: 12539904. Throughput: 0: 13173.2. Samples: 12525479. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:25:56,237][81074] Avg episode reward: [(0, '3463.705')] [2023-03-06 23:25:56,242][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000012246_12539904.pth... 
[2023-03-06 23:25:56,272][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000009160_9379840.pth [2023-03-06 23:25:56,495][81400] Updated weights for policy 0, policy_version 12250 (0.0006) [2023-03-06 23:25:57,273][81400] Updated weights for policy 0, policy_version 12260 (0.0006) [2023-03-06 23:25:58,044][81400] Updated weights for policy 0, policy_version 12270 (0.0006) [2023-03-06 23:25:58,829][81400] Updated weights for policy 0, policy_version 12280 (0.0006) [2023-03-06 23:25:59,600][81400] Updated weights for policy 0, policy_version 12290 (0.0006) [2023-03-06 23:26:00,388][81400] Updated weights for policy 0, policy_version 12300 (0.0007) [2023-03-06 23:26:01,163][81400] Updated weights for policy 0, policy_version 12310 (0.0006) [2023-03-06 23:26:01,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13162.7). Total num frames: 12606464. Throughput: 0: 13184.0. Samples: 12604748. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:26:01,237][81074] Avg episode reward: [(0, '3563.980')] [2023-03-06 23:26:01,938][81400] Updated weights for policy 0, policy_version 12320 (0.0006) [2023-03-06 23:26:02,713][81400] Updated weights for policy 0, policy_version 12330 (0.0006) [2023-03-06 23:26:03,494][81400] Updated weights for policy 0, policy_version 12340 (0.0006) [2023-03-06 23:26:04,262][81400] Updated weights for policy 0, policy_version 12350 (0.0006) [2023-03-06 23:26:05,028][81400] Updated weights for policy 0, policy_version 12360 (0.0006) [2023-03-06 23:26:05,798][81400] Updated weights for policy 0, policy_version 12370 (0.0006) [2023-03-06 23:26:06,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13175.4, 300 sec: 13162.7). Total num frames: 12672000. Throughput: 0: 13187.3. Samples: 12644293. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:26:06,237][81074] Avg episode reward: [(0, '3522.544')] [2023-03-06 23:26:06,581][81400] Updated weights for policy 0, policy_version 12380 (0.0006) [2023-03-06 23:26:07,362][81400] Updated weights for policy 0, policy_version 12390 (0.0006) [2023-03-06 23:26:08,144][81400] Updated weights for policy 0, policy_version 12400 (0.0006) [2023-03-06 23:26:08,921][81400] Updated weights for policy 0, policy_version 12410 (0.0006) [2023-03-06 23:26:09,678][81400] Updated weights for policy 0, policy_version 12420 (0.0006) [2023-03-06 23:26:10,441][81400] Updated weights for policy 0, policy_version 12430 (0.0006) [2023-03-06 23:26:11,223][81400] Updated weights for policy 0, policy_version 12440 (0.0006) [2023-03-06 23:26:11,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13192.5, 300 sec: 13162.7). Total num frames: 12738560. Throughput: 0: 13193.2. Samples: 12723657. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:26:11,237][81074] Avg episode reward: [(0, '3308.422')] [2023-03-06 23:26:12,009][81400] Updated weights for policy 0, policy_version 12450 (0.0006) [2023-03-06 23:26:12,798][81400] Updated weights for policy 0, policy_version 12460 (0.0007) [2023-03-06 23:26:13,570][81400] Updated weights for policy 0, policy_version 12470 (0.0006) [2023-03-06 23:26:14,350][81400] Updated weights for policy 0, policy_version 12480 (0.0005) [2023-03-06 23:26:15,137][81400] Updated weights for policy 0, policy_version 12490 (0.0006) [2023-03-06 23:26:15,899][81400] Updated weights for policy 0, policy_version 12500 (0.0006) [2023-03-06 23:26:16,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13192.5, 300 sec: 13162.7). Total num frames: 12804096. Throughput: 0: 13193.1. Samples: 12802743. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:26:16,237][81074] Avg episode reward: [(0, '3172.163')] [2023-03-06 23:26:16,697][81400] Updated weights for policy 0, policy_version 12510 (0.0006) [2023-03-06 23:26:17,465][81400] Updated weights for policy 0, policy_version 12520 (0.0006) [2023-03-06 23:26:18,242][81400] Updated weights for policy 0, policy_version 12530 (0.0006) [2023-03-06 23:26:19,022][81400] Updated weights for policy 0, policy_version 12540 (0.0006) [2023-03-06 23:26:19,797][81400] Updated weights for policy 0, policy_version 12550 (0.0006) [2023-03-06 23:26:20,573][81400] Updated weights for policy 0, policy_version 12560 (0.0006) [2023-03-06 23:26:21,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13175.5, 300 sec: 13162.7). Total num frames: 12869632. Throughput: 0: 13186.8. Samples: 12842193. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:26:21,237][81074] Avg episode reward: [(0, '3123.580')] [2023-03-06 23:26:21,338][81400] Updated weights for policy 0, policy_version 12570 (0.0006) [2023-03-06 23:26:22,105][81400] Updated weights for policy 0, policy_version 12580 (0.0005) [2023-03-06 23:26:22,872][81400] Updated weights for policy 0, policy_version 12590 (0.0006) [2023-03-06 23:26:23,669][81400] Updated weights for policy 0, policy_version 12600 (0.0006) [2023-03-06 23:26:24,439][81400] Updated weights for policy 0, policy_version 12610 (0.0006) [2023-03-06 23:26:25,211][81400] Updated weights for policy 0, policy_version 12620 (0.0006) [2023-03-06 23:26:25,993][81400] Updated weights for policy 0, policy_version 12630 (0.0006) [2023-03-06 23:26:26,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13166.2). Total num frames: 12936192. Throughput: 0: 13195.5. Samples: 12921466. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:26:26,237][81074] Avg episode reward: [(0, '2779.332')] [2023-03-06 23:26:26,757][81400] Updated weights for policy 0, policy_version 12640 (0.0006) [2023-03-06 23:26:27,547][81400] Updated weights for policy 0, policy_version 12650 (0.0007) [2023-03-06 23:26:28,319][81400] Updated weights for policy 0, policy_version 12660 (0.0005) [2023-03-06 23:26:29,127][81400] Updated weights for policy 0, policy_version 12670 (0.0005) [2023-03-06 23:26:29,900][81400] Updated weights for policy 0, policy_version 12680 (0.0007) [2023-03-06 23:26:30,686][81400] Updated weights for policy 0, policy_version 12690 (0.0005) [2023-03-06 23:26:31,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13169.7). Total num frames: 13001728. Throughput: 0: 13188.6. Samples: 13000009. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-06 23:26:31,237][81074] Avg episode reward: [(0, '3145.681')] [2023-03-06 23:26:31,449][81400] Updated weights for policy 0, policy_version 12700 (0.0006) [2023-03-06 23:26:32,230][81400] Updated weights for policy 0, policy_version 12710 (0.0006) [2023-03-06 23:26:33,007][81400] Updated weights for policy 0, policy_version 12720 (0.0006) [2023-03-06 23:26:33,801][81400] Updated weights for policy 0, policy_version 12730 (0.0005) [2023-03-06 23:26:34,582][81400] Updated weights for policy 0, policy_version 12740 (0.0006) [2023-03-06 23:26:35,345][81400] Updated weights for policy 0, policy_version 12750 (0.0006) [2023-03-06 23:26:36,120][81400] Updated weights for policy 0, policy_version 12760 (0.0006) [2023-03-06 23:26:36,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13192.5, 300 sec: 13169.7). Total num frames: 13067264. Throughput: 0: 13182.5. Samples: 13039549. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-06 23:26:36,237][81074] Avg episode reward: [(0, '3260.138')] [2023-03-06 23:26:36,906][81400] Updated weights for policy 0, policy_version 12770 (0.0006) [2023-03-06 23:26:37,689][81400] Updated weights for policy 0, policy_version 12780 (0.0006) [2023-03-06 23:26:38,465][81400] Updated weights for policy 0, policy_version 12790 (0.0006) [2023-03-06 23:26:39,234][81400] Updated weights for policy 0, policy_version 12800 (0.0006) [2023-03-06 23:26:40,035][81400] Updated weights for policy 0, policy_version 12810 (0.0006) [2023-03-06 23:26:40,799][81400] Updated weights for policy 0, policy_version 12820 (0.0007) [2023-03-06 23:26:41,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13175.4, 300 sec: 13166.2). Total num frames: 13132800. Throughput: 0: 13178.3. Samples: 13118502. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:26:41,237][81074] Avg episode reward: [(0, '3247.981')] [2023-03-06 23:26:41,573][81400] Updated weights for policy 0, policy_version 12830 (0.0006) [2023-03-06 23:26:42,344][81400] Updated weights for policy 0, policy_version 12840 (0.0006) [2023-03-06 23:26:43,117][81400] Updated weights for policy 0, policy_version 12850 (0.0006) [2023-03-06 23:26:43,901][81400] Updated weights for policy 0, policy_version 12860 (0.0005) [2023-03-06 23:26:44,685][81400] Updated weights for policy 0, policy_version 12870 (0.0006) [2023-03-06 23:26:45,461][81400] Updated weights for policy 0, policy_version 12880 (0.0006) [2023-03-06 23:26:46,231][81400] Updated weights for policy 0, policy_version 12890 (0.0006) [2023-03-06 23:26:46,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13169.7). Total num frames: 13199360. Throughput: 0: 13175.9. Samples: 13197664. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:26:46,237][81074] Avg episode reward: [(0, '3410.483')] [2023-03-06 23:26:47,030][81400] Updated weights for policy 0, policy_version 12900 (0.0006) [2023-03-06 23:26:47,789][81400] Updated weights for policy 0, policy_version 12910 (0.0007) [2023-03-06 23:26:48,565][81400] Updated weights for policy 0, policy_version 12920 (0.0006) [2023-03-06 23:26:49,353][81400] Updated weights for policy 0, policy_version 12930 (0.0006) [2023-03-06 23:26:50,131][81400] Updated weights for policy 0, policy_version 12940 (0.0006) [2023-03-06 23:26:50,924][81400] Updated weights for policy 0, policy_version 12950 (0.0006) [2023-03-06 23:26:51,236][81074] Fps is (10 sec: 13209.8, 60 sec: 13175.5, 300 sec: 13169.7). Total num frames: 13264896. Throughput: 0: 13171.4. Samples: 13237005. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:26:51,237][81074] Avg episode reward: [(0, '3450.961')] [2023-03-06 23:26:51,689][81400] Updated weights for policy 0, policy_version 12960 (0.0006) [2023-03-06 23:26:52,484][81400] Updated weights for policy 0, policy_version 12970 (0.0006) [2023-03-06 23:26:53,257][81400] Updated weights for policy 0, policy_version 12980 (0.0006) [2023-03-06 23:26:54,034][81400] Updated weights for policy 0, policy_version 12990 (0.0006) [2023-03-06 23:26:54,826][81400] Updated weights for policy 0, policy_version 13000 (0.0006) [2023-03-06 23:26:55,608][81400] Updated weights for policy 0, policy_version 13010 (0.0006) [2023-03-06 23:26:56,236][81074] Fps is (10 sec: 13004.8, 60 sec: 13158.4, 300 sec: 13166.2). Total num frames: 13329408. Throughput: 0: 13154.5. Samples: 13315611. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:26:56,237][81074] Avg episode reward: [(0, '3477.596')] [2023-03-06 23:26:56,386][81400] Updated weights for policy 0, policy_version 13020 (0.0006) [2023-03-06 23:26:57,167][81400] Updated weights for policy 0, policy_version 13030 (0.0006) [2023-03-06 23:26:57,941][81400] Updated weights for policy 0, policy_version 13040 (0.0006) [2023-03-06 23:26:58,709][81400] Updated weights for policy 0, policy_version 13050 (0.0006) [2023-03-06 23:26:59,481][81400] Updated weights for policy 0, policy_version 13060 (0.0006) [2023-03-06 23:27:00,257][81400] Updated weights for policy 0, policy_version 13070 (0.0006) [2023-03-06 23:27:01,038][81400] Updated weights for policy 0, policy_version 13080 (0.0007) [2023-03-06 23:27:01,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13166.2). Total num frames: 13395968. Throughput: 0: 13155.6. Samples: 13394744. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:27:01,237][81074] Avg episode reward: [(0, '3379.972')] [2023-03-06 23:27:01,828][81400] Updated weights for policy 0, policy_version 13090 (0.0006) [2023-03-06 23:27:02,609][81400] Updated weights for policy 0, policy_version 13100 (0.0006) [2023-03-06 23:27:03,373][81400] Updated weights for policy 0, policy_version 13110 (0.0007) [2023-03-06 23:27:04,156][81400] Updated weights for policy 0, policy_version 13120 (0.0006) [2023-03-06 23:27:04,918][81400] Updated weights for policy 0, policy_version 13130 (0.0006) [2023-03-06 23:27:05,713][81400] Updated weights for policy 0, policy_version 13140 (0.0006) [2023-03-06 23:27:06,236][81074] Fps is (10 sec: 13312.1, 60 sec: 13175.5, 300 sec: 13169.7). Total num frames: 13462528. Throughput: 0: 13154.7. Samples: 13434154. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:27:06,237][81074] Avg episode reward: [(0, '3410.063')] [2023-03-06 23:27:06,490][81400] Updated weights for policy 0, policy_version 13150 (0.0006) [2023-03-06 23:27:07,262][81400] Updated weights for policy 0, policy_version 13160 (0.0006) [2023-03-06 23:27:08,055][81400] Updated weights for policy 0, policy_version 13170 (0.0007) [2023-03-06 23:27:08,829][81400] Updated weights for policy 0, policy_version 13180 (0.0006) [2023-03-06 23:27:09,600][81400] Updated weights for policy 0, policy_version 13190 (0.0006) [2023-03-06 23:27:10,380][81400] Updated weights for policy 0, policy_version 13200 (0.0006) [2023-03-06 23:27:11,150][81400] Updated weights for policy 0, policy_version 13210 (0.0006) [2023-03-06 23:27:11,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13166.2). Total num frames: 13527040. Throughput: 0: 13149.5. Samples: 13513194. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:27:11,237][81074] Avg episode reward: [(0, '3399.610')] [2023-03-06 23:27:11,918][81400] Updated weights for policy 0, policy_version 13220 (0.0006) [2023-03-06 23:27:12,702][81400] Updated weights for policy 0, policy_version 13230 (0.0007) [2023-03-06 23:27:13,476][81400] Updated weights for policy 0, policy_version 13240 (0.0006) [2023-03-06 23:27:14,260][81400] Updated weights for policy 0, policy_version 13250 (0.0006) [2023-03-06 23:27:15,035][81400] Updated weights for policy 0, policy_version 13260 (0.0006) [2023-03-06 23:27:15,829][81400] Updated weights for policy 0, policy_version 13270 (0.0006) [2023-03-06 23:27:16,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13166.2). Total num frames: 13593600. Throughput: 0: 13158.5. Samples: 13592139. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:27:16,237][81074] Avg episode reward: [(0, '3514.928')] [2023-03-06 23:27:16,601][81400] Updated weights for policy 0, policy_version 13280 (0.0007) [2023-03-06 23:27:17,375][81400] Updated weights for policy 0, policy_version 13290 (0.0006) [2023-03-06 23:27:18,143][81400] Updated weights for policy 0, policy_version 13300 (0.0005) [2023-03-06 23:27:18,928][81400] Updated weights for policy 0, policy_version 13310 (0.0006) [2023-03-06 23:27:19,702][81400] Updated weights for policy 0, policy_version 13320 (0.0007) [2023-03-06 23:27:20,489][81400] Updated weights for policy 0, policy_version 13330 (0.0007) [2023-03-06 23:27:21,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13166.2). Total num frames: 13659136. Throughput: 0: 13159.0. Samples: 13631705. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:27:21,237][81074] Avg episode reward: [(0, '3564.447')] [2023-03-06 23:27:21,269][81400] Updated weights for policy 0, policy_version 13340 (0.0006) [2023-03-06 23:27:22,041][81400] Updated weights for policy 0, policy_version 13350 (0.0005) [2023-03-06 23:27:22,823][81400] Updated weights for policy 0, policy_version 13360 (0.0006) [2023-03-06 23:27:23,609][81400] Updated weights for policy 0, policy_version 13370 (0.0007) [2023-03-06 23:27:24,354][81400] Updated weights for policy 0, policy_version 13380 (0.0006) [2023-03-06 23:27:25,145][81400] Updated weights for policy 0, policy_version 13390 (0.0006) [2023-03-06 23:27:25,893][81400] Updated weights for policy 0, policy_version 13400 (0.0006) [2023-03-06 23:27:26,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13173.1). Total num frames: 13725696. Throughput: 0: 13166.8. Samples: 13711006. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:27:26,237][81074] Avg episode reward: [(0, '3376.034')] [2023-03-06 23:27:26,676][81400] Updated weights for policy 0, policy_version 13410 (0.0006) [2023-03-06 23:27:27,434][81400] Updated weights for policy 0, policy_version 13420 (0.0006) [2023-03-06 23:27:28,226][81400] Updated weights for policy 0, policy_version 13430 (0.0007) [2023-03-06 23:27:29,000][81400] Updated weights for policy 0, policy_version 13440 (0.0007) [2023-03-06 23:27:29,764][81400] Updated weights for policy 0, policy_version 13450 (0.0006) [2023-03-06 23:27:30,541][81400] Updated weights for policy 0, policy_version 13460 (0.0005) [2023-03-06 23:27:31,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13169.7). Total num frames: 13791232. Throughput: 0: 13169.9. Samples: 13790311. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:27:31,237][81074] Avg episode reward: [(0, '3554.758')] [2023-03-06 23:27:31,339][81400] Updated weights for policy 0, policy_version 13470 (0.0007) [2023-03-06 23:27:32,118][81400] Updated weights for policy 0, policy_version 13480 (0.0005) [2023-03-06 23:27:32,893][81400] Updated weights for policy 0, policy_version 13490 (0.0006) [2023-03-06 23:27:33,660][81400] Updated weights for policy 0, policy_version 13500 (0.0006) [2023-03-06 23:27:34,434][81400] Updated weights for policy 0, policy_version 13510 (0.0006) [2023-03-06 23:27:35,215][81400] Updated weights for policy 0, policy_version 13520 (0.0006) [2023-03-06 23:27:35,973][81400] Updated weights for policy 0, policy_version 13530 (0.0007) [2023-03-06 23:27:36,236][81074] Fps is (10 sec: 13209.8, 60 sec: 13175.5, 300 sec: 13173.2). Total num frames: 13857792. Throughput: 0: 13170.8. Samples: 13829690. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:27:36,237][81074] Avg episode reward: [(0, '3611.497')] [2023-03-06 23:27:36,773][81400] Updated weights for policy 0, policy_version 13540 (0.0006) [2023-03-06 23:27:37,561][81400] Updated weights for policy 0, policy_version 13550 (0.0006) [2023-03-06 23:27:38,320][81400] Updated weights for policy 0, policy_version 13560 (0.0006) [2023-03-06 23:27:39,106][81400] Updated weights for policy 0, policy_version 13570 (0.0006) [2023-03-06 23:27:39,891][81400] Updated weights for policy 0, policy_version 13580 (0.0006) [2023-03-06 23:27:40,672][81400] Updated weights for policy 0, policy_version 13590 (0.0006) [2023-03-06 23:27:41,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13175.5, 300 sec: 13173.1). Total num frames: 13923328. Throughput: 0: 13178.7. Samples: 13908655. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:27:41,237][81074] Avg episode reward: [(0, '3646.961')] [2023-03-06 23:27:41,462][81400] Updated weights for policy 0, policy_version 13600 (0.0006) [2023-03-06 23:27:42,250][81400] Updated weights for policy 0, policy_version 13610 (0.0007) [2023-03-06 23:27:43,026][81400] Updated weights for policy 0, policy_version 13620 (0.0006) [2023-03-06 23:27:43,790][81400] Updated weights for policy 0, policy_version 13630 (0.0007) [2023-03-06 23:27:44,584][81400] Updated weights for policy 0, policy_version 13640 (0.0007) [2023-03-06 23:27:45,358][81400] Updated weights for policy 0, policy_version 13650 (0.0006) [2023-03-06 23:27:46,106][81400] Updated weights for policy 0, policy_version 13660 (0.0006) [2023-03-06 23:27:46,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13173.2). Total num frames: 13988864. Throughput: 0: 13178.5. Samples: 13987778. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:27:46,237][81074] Avg episode reward: [(0, '3719.203')] [2023-03-06 23:27:46,898][81400] Updated weights for policy 0, policy_version 13670 (0.0006) [2023-03-06 23:27:47,668][81400] Updated weights for policy 0, policy_version 13680 (0.0006) [2023-03-06 23:27:48,446][81400] Updated weights for policy 0, policy_version 13690 (0.0006) [2023-03-06 23:27:49,225][81400] Updated weights for policy 0, policy_version 13700 (0.0007) [2023-03-06 23:27:49,984][81400] Updated weights for policy 0, policy_version 13710 (0.0006) [2023-03-06 23:27:50,774][81400] Updated weights for policy 0, policy_version 13720 (0.0006) [2023-03-06 23:27:51,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13175.4, 300 sec: 13173.2). Total num frames: 14055424. Throughput: 0: 13179.5. Samples: 14027233. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:27:51,237][81074] Avg episode reward: [(0, '3670.751')] [2023-03-06 23:27:51,559][81400] Updated weights for policy 0, policy_version 13730 (0.0006) [2023-03-06 23:27:52,336][81400] Updated weights for policy 0, policy_version 13740 (0.0006) [2023-03-06 23:27:53,130][81400] Updated weights for policy 0, policy_version 13750 (0.0006) [2023-03-06 23:27:53,897][81400] Updated weights for policy 0, policy_version 13760 (0.0007) [2023-03-06 23:27:54,689][81400] Updated weights for policy 0, policy_version 13770 (0.0006) [2023-03-06 23:27:55,462][81400] Updated weights for policy 0, policy_version 13780 (0.0006) [2023-03-06 23:27:56,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13175.5, 300 sec: 13169.7). Total num frames: 14119936. Throughput: 0: 13172.3. Samples: 14105946. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:27:56,237][81074] Avg episode reward: [(0, '3542.597')] [2023-03-06 23:27:56,241][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000013790_14120960.pth... 
[2023-03-06 23:27:56,242][81400] Updated weights for policy 0, policy_version 13790 (0.0006) [2023-03-06 23:27:56,271][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000010702_10958848.pth [2023-03-06 23:27:57,008][81400] Updated weights for policy 0, policy_version 13800 (0.0006) [2023-03-06 23:27:57,797][81400] Updated weights for policy 0, policy_version 13810 (0.0006) [2023-03-06 23:27:58,585][81400] Updated weights for policy 0, policy_version 13820 (0.0006) [2023-03-06 23:27:59,360][81400] Updated weights for policy 0, policy_version 13830 (0.0006) [2023-03-06 23:28:00,156][81400] Updated weights for policy 0, policy_version 13840 (0.0007) [2023-03-06 23:28:00,933][81400] Updated weights for policy 0, policy_version 13850 (0.0006) [2023-03-06 23:28:01,236][81074] Fps is (10 sec: 13004.8, 60 sec: 13158.4, 300 sec: 13169.7). Total num frames: 14185472. Throughput: 0: 13167.0. Samples: 14184654. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:28:01,237][81074] Avg episode reward: [(0, '3601.592')] [2023-03-06 23:28:01,712][81400] Updated weights for policy 0, policy_version 13860 (0.0006) [2023-03-06 23:28:02,490][81400] Updated weights for policy 0, policy_version 13870 (0.0005) [2023-03-06 23:28:03,258][81400] Updated weights for policy 0, policy_version 13880 (0.0006) [2023-03-06 23:28:04,022][81400] Updated weights for policy 0, policy_version 13890 (0.0006) [2023-03-06 23:28:04,806][81400] Updated weights for policy 0, policy_version 13900 (0.0006) [2023-03-06 23:28:05,589][81400] Updated weights for policy 0, policy_version 13910 (0.0006) [2023-03-06 23:28:06,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13169.7). Total num frames: 14252032. Throughput: 0: 13170.7. Samples: 14224385. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:28:06,237][81074] Avg episode reward: [(0, '3431.486')] [2023-03-06 23:28:06,373][81400] Updated weights for policy 0, policy_version 13920 (0.0007) [2023-03-06 23:28:07,130][81400] Updated weights for policy 0, policy_version 13930 (0.0006) [2023-03-06 23:28:07,914][81400] Updated weights for policy 0, policy_version 13940 (0.0007) [2023-03-06 23:28:08,695][81400] Updated weights for policy 0, policy_version 13950 (0.0006) [2023-03-06 23:28:09,461][81400] Updated weights for policy 0, policy_version 13960 (0.0006) [2023-03-06 23:28:10,245][81400] Updated weights for policy 0, policy_version 13970 (0.0006) [2023-03-06 23:28:11,038][81400] Updated weights for policy 0, policy_version 13980 (0.0006) [2023-03-06 23:28:11,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13175.5, 300 sec: 13173.1). Total num frames: 14317568. Throughput: 0: 13165.6. Samples: 14303456. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:28:11,237][81074] Avg episode reward: [(0, '3467.605')] [2023-03-06 23:28:11,804][81400] Updated weights for policy 0, policy_version 13990 (0.0006) [2023-03-06 23:28:12,590][81400] Updated weights for policy 0, policy_version 14000 (0.0006) [2023-03-06 23:28:13,366][81400] Updated weights for policy 0, policy_version 14010 (0.0006) [2023-03-06 23:28:14,148][81400] Updated weights for policy 0, policy_version 14020 (0.0006) [2023-03-06 23:28:14,950][81400] Updated weights for policy 0, policy_version 14030 (0.0006) [2023-03-06 23:28:15,718][81400] Updated weights for policy 0, policy_version 14040 (0.0006) [2023-03-06 23:28:16,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13169.7). Total num frames: 14383104. Throughput: 0: 13150.9. Samples: 14382101. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:28:16,237][81074] Avg episode reward: [(0, '3610.558')] [2023-03-06 23:28:16,499][81400] Updated weights for policy 0, policy_version 14050 (0.0005) [2023-03-06 23:28:17,278][81400] Updated weights for policy 0, policy_version 14060 (0.0006) [2023-03-06 23:28:18,075][81400] Updated weights for policy 0, policy_version 14070 (0.0006) [2023-03-06 23:28:18,861][81400] Updated weights for policy 0, policy_version 14080 (0.0006) [2023-03-06 23:28:19,636][81400] Updated weights for policy 0, policy_version 14090 (0.0007) [2023-03-06 23:28:20,409][81400] Updated weights for policy 0, policy_version 14100 (0.0006) [2023-03-06 23:28:21,191][81400] Updated weights for policy 0, policy_version 14110 (0.0007) [2023-03-06 23:28:21,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13169.7). Total num frames: 14448640. Throughput: 0: 13147.9. Samples: 14421346. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 23:28:21,237][81074] Avg episode reward: [(0, '3690.287')] [2023-03-06 23:28:21,955][81400] Updated weights for policy 0, policy_version 14120 (0.0006) [2023-03-06 23:28:22,729][81400] Updated weights for policy 0, policy_version 14130 (0.0007) [2023-03-06 23:28:23,504][81400] Updated weights for policy 0, policy_version 14140 (0.0006) [2023-03-06 23:28:24,281][81400] Updated weights for policy 0, policy_version 14150 (0.0006) [2023-03-06 23:28:25,077][81400] Updated weights for policy 0, policy_version 14160 (0.0006) [2023-03-06 23:28:25,847][81400] Updated weights for policy 0, policy_version 14170 (0.0006) [2023-03-06 23:28:26,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13169.7). Total num frames: 14514176. Throughput: 0: 13147.0. Samples: 14500271. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 23:28:26,237][81074] Avg episode reward: [(0, '3614.757')] [2023-03-06 23:28:26,630][81400] Updated weights for policy 0, policy_version 14180 (0.0005) [2023-03-06 23:28:27,384][81400] Updated weights for policy 0, policy_version 14190 (0.0006) [2023-03-06 23:28:28,157][81400] Updated weights for policy 0, policy_version 14200 (0.0006) [2023-03-06 23:28:28,946][81400] Updated weights for policy 0, policy_version 14210 (0.0006) [2023-03-06 23:28:29,711][81400] Updated weights for policy 0, policy_version 14220 (0.0006) [2023-03-06 23:28:30,490][81400] Updated weights for policy 0, policy_version 14230 (0.0006) [2023-03-06 23:28:31,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13169.7). Total num frames: 14580736. Throughput: 0: 13148.0. Samples: 14579441. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:28:31,237][81074] Avg episode reward: [(0, '3685.224')] [2023-03-06 23:28:31,265][81400] Updated weights for policy 0, policy_version 14240 (0.0006) [2023-03-06 23:28:32,030][81400] Updated weights for policy 0, policy_version 14250 (0.0006) [2023-03-06 23:28:32,822][81400] Updated weights for policy 0, policy_version 14260 (0.0007) [2023-03-06 23:28:33,590][81400] Updated weights for policy 0, policy_version 14270 (0.0006) [2023-03-06 23:28:34,359][81400] Updated weights for policy 0, policy_version 14280 (0.0006) [2023-03-06 23:28:35,148][81400] Updated weights for policy 0, policy_version 14290 (0.0007) [2023-03-06 23:28:35,919][81400] Updated weights for policy 0, policy_version 14300 (0.0006) [2023-03-06 23:28:36,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13169.7). Total num frames: 14646272. Throughput: 0: 13154.4. Samples: 14619180. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:28:36,237][81074] Avg episode reward: [(0, '3551.579')] [2023-03-06 23:28:36,702][81400] Updated weights for policy 0, policy_version 14310 (0.0006) [2023-03-06 23:28:37,500][81400] Updated weights for policy 0, policy_version 14320 (0.0006) [2023-03-06 23:28:38,273][81400] Updated weights for policy 0, policy_version 14330 (0.0006) [2023-03-06 23:28:39,030][81400] Updated weights for policy 0, policy_version 14340 (0.0006) [2023-03-06 23:28:39,821][81400] Updated weights for policy 0, policy_version 14350 (0.0005) [2023-03-06 23:28:40,579][81400] Updated weights for policy 0, policy_version 14360 (0.0006) [2023-03-06 23:28:41,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13169.7). Total num frames: 14712832. Throughput: 0: 13162.3. Samples: 14698253. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:28:41,237][81074] Avg episode reward: [(0, '3625.428')] [2023-03-06 23:28:41,385][81400] Updated weights for policy 0, policy_version 14370 (0.0006) [2023-03-06 23:28:42,151][81400] Updated weights for policy 0, policy_version 14380 (0.0006) [2023-03-06 23:28:42,948][81400] Updated weights for policy 0, policy_version 14390 (0.0006) [2023-03-06 23:28:43,728][81400] Updated weights for policy 0, policy_version 14400 (0.0007) [2023-03-06 23:28:44,504][81400] Updated weights for policy 0, policy_version 14410 (0.0006) [2023-03-06 23:28:45,282][81400] Updated weights for policy 0, policy_version 14420 (0.0006) [2023-03-06 23:28:46,079][81400] Updated weights for policy 0, policy_version 14430 (0.0006) [2023-03-06 23:28:46,236][81074] Fps is (10 sec: 13107.4, 60 sec: 13141.4, 300 sec: 13166.2). Total num frames: 14777344. Throughput: 0: 13159.0. Samples: 14776806. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:28:46,237][81074] Avg episode reward: [(0, '3442.790')] [2023-03-06 23:28:46,865][81400] Updated weights for policy 0, policy_version 14440 (0.0006) [2023-03-06 23:28:47,648][81400] Updated weights for policy 0, policy_version 14450 (0.0006) [2023-03-06 23:28:48,438][81400] Updated weights for policy 0, policy_version 14460 (0.0006) [2023-03-06 23:28:49,203][81400] Updated weights for policy 0, policy_version 14470 (0.0005) [2023-03-06 23:28:49,995][81400] Updated weights for policy 0, policy_version 14480 (0.0006) [2023-03-06 23:28:50,773][81400] Updated weights for policy 0, policy_version 14490 (0.0006) [2023-03-06 23:28:51,236][81074] Fps is (10 sec: 13005.0, 60 sec: 13124.3, 300 sec: 13166.2). Total num frames: 14842880. Throughput: 0: 13147.7. Samples: 14816030. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:28:51,237][81074] Avg episode reward: [(0, '3389.060')] [2023-03-06 23:28:51,560][81400] Updated weights for policy 0, policy_version 14500 (0.0006) [2023-03-06 23:28:52,347][81400] Updated weights for policy 0, policy_version 14510 (0.0007) [2023-03-06 23:28:53,121][81400] Updated weights for policy 0, policy_version 14520 (0.0006) [2023-03-06 23:28:53,893][81400] Updated weights for policy 0, policy_version 14530 (0.0006) [2023-03-06 23:28:54,681][81400] Updated weights for policy 0, policy_version 14540 (0.0007) [2023-03-06 23:28:55,451][81400] Updated weights for policy 0, policy_version 14550 (0.0006) [2023-03-06 23:28:56,236][81400] Updated weights for policy 0, policy_version 14560 (0.0006) [2023-03-06 23:28:56,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13166.2). Total num frames: 14909440. Throughput: 0: 13137.0. Samples: 14894620. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:28:56,237][81074] Avg episode reward: [(0, '3523.629')] [2023-03-06 23:28:57,008][81400] Updated weights for policy 0, policy_version 14570 (0.0007) [2023-03-06 23:28:57,802][81400] Updated weights for policy 0, policy_version 14580 (0.0006) [2023-03-06 23:28:58,570][81400] Updated weights for policy 0, policy_version 14590 (0.0006) [2023-03-06 23:28:59,343][81400] Updated weights for policy 0, policy_version 14600 (0.0006) [2023-03-06 23:29:00,132][81400] Updated weights for policy 0, policy_version 14610 (0.0007) [2023-03-06 23:29:00,914][81400] Updated weights for policy 0, policy_version 14620 (0.0006) [2023-03-06 23:29:01,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13166.2). Total num frames: 14974976. Throughput: 0: 13139.2. Samples: 14973364. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:29:01,237][81074] Avg episode reward: [(0, '3400.572')] [2023-03-06 23:29:01,689][81400] Updated weights for policy 0, policy_version 14630 (0.0006) [2023-03-06 23:29:02,478][81400] Updated weights for policy 0, policy_version 14640 (0.0006) [2023-03-06 23:29:03,265][81400] Updated weights for policy 0, policy_version 14650 (0.0007) [2023-03-06 23:29:04,046][81400] Updated weights for policy 0, policy_version 14660 (0.0006) [2023-03-06 23:29:04,835][81400] Updated weights for policy 0, policy_version 14670 (0.0006) [2023-03-06 23:29:05,597][81400] Updated weights for policy 0, policy_version 14680 (0.0006) [2023-03-06 23:29:06,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13162.7). Total num frames: 15040512. Throughput: 0: 13141.9. Samples: 15012733. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:29:06,237][81074] Avg episode reward: [(0, '3340.239')] [2023-03-06 23:29:06,385][81400] Updated weights for policy 0, policy_version 14690 (0.0006) [2023-03-06 23:29:07,159][81400] Updated weights for policy 0, policy_version 14700 (0.0006) [2023-03-06 23:29:07,941][81400] Updated weights for policy 0, policy_version 14710 (0.0006) [2023-03-06 23:29:08,710][81400] Updated weights for policy 0, policy_version 14720 (0.0006) [2023-03-06 23:29:09,484][81400] Updated weights for policy 0, policy_version 14730 (0.0007) [2023-03-06 23:29:10,270][81400] Updated weights for policy 0, policy_version 14740 (0.0006) [2023-03-06 23:29:11,061][81400] Updated weights for policy 0, policy_version 14750 (0.0006) [2023-03-06 23:29:11,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13141.4, 300 sec: 13162.7). Total num frames: 15106048. Throughput: 0: 13140.6. Samples: 15091598. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:29:11,237][81074] Avg episode reward: [(0, '3397.087')] [2023-03-06 23:29:11,838][81400] Updated weights for policy 0, policy_version 14760 (0.0007) [2023-03-06 23:29:12,638][81400] Updated weights for policy 0, policy_version 14770 (0.0006) [2023-03-06 23:29:13,405][81400] Updated weights for policy 0, policy_version 14780 (0.0006) [2023-03-06 23:29:14,168][81400] Updated weights for policy 0, policy_version 14790 (0.0006) [2023-03-06 23:29:14,948][81400] Updated weights for policy 0, policy_version 14800 (0.0007) [2023-03-06 23:29:15,720][81400] Updated weights for policy 0, policy_version 14810 (0.0006) [2023-03-06 23:29:16,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13162.7). Total num frames: 15171584. Throughput: 0: 13136.3. Samples: 15170575. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:29:16,237][81074] Avg episode reward: [(0, '3273.891')] [2023-03-06 23:29:16,527][81400] Updated weights for policy 0, policy_version 14820 (0.0006) [2023-03-06 23:29:17,288][81400] Updated weights for policy 0, policy_version 14830 (0.0006) [2023-03-06 23:29:18,061][81400] Updated weights for policy 0, policy_version 14840 (0.0006) [2023-03-06 23:29:18,840][81400] Updated weights for policy 0, policy_version 14850 (0.0006) [2023-03-06 23:29:19,630][81400] Updated weights for policy 0, policy_version 14860 (0.0006) [2023-03-06 23:29:20,409][81400] Updated weights for policy 0, policy_version 14870 (0.0006) [2023-03-06 23:29:21,169][81400] Updated weights for policy 0, policy_version 14880 (0.0006) [2023-03-06 23:29:21,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13141.4, 300 sec: 13159.3). Total num frames: 15237120. Throughput: 0: 13127.6. Samples: 15209920. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:29:21,237][81074] Avg episode reward: [(0, '3256.138')] [2023-03-06 23:29:21,965][81400] Updated weights for policy 0, policy_version 14890 (0.0006) [2023-03-06 23:29:22,733][81400] Updated weights for policy 0, policy_version 14900 (0.0006) [2023-03-06 23:29:23,503][81400] Updated weights for policy 0, policy_version 14910 (0.0007) [2023-03-06 23:29:24,286][81400] Updated weights for policy 0, policy_version 14920 (0.0005) [2023-03-06 23:29:25,076][81400] Updated weights for policy 0, policy_version 14930 (0.0006) [2023-03-06 23:29:25,863][81400] Updated weights for policy 0, policy_version 14940 (0.0006) [2023-03-06 23:29:26,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13159.3). Total num frames: 15302656. Throughput: 0: 13120.3. Samples: 15288664. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 23:29:26,237][81074] Avg episode reward: [(0, '3336.210')] [2023-03-06 23:29:26,647][81400] Updated weights for policy 0, policy_version 14950 (0.0005) [2023-03-06 23:29:27,430][81400] Updated weights for policy 0, policy_version 14960 (0.0006) [2023-03-06 23:29:28,197][81400] Updated weights for policy 0, policy_version 14970 (0.0006) [2023-03-06 23:29:28,974][81400] Updated weights for policy 0, policy_version 14980 (0.0006) [2023-03-06 23:29:29,763][81400] Updated weights for policy 0, policy_version 14990 (0.0006) [2023-03-06 23:29:30,542][81400] Updated weights for policy 0, policy_version 15000 (0.0006) [2023-03-06 23:29:31,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13124.3, 300 sec: 13155.8). Total num frames: 15368192. Throughput: 0: 13125.5. Samples: 15367453. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 23:29:31,237][81074] Avg episode reward: [(0, '3462.728')] [2023-03-06 23:29:31,314][81400] Updated weights for policy 0, policy_version 15010 (0.0007) [2023-03-06 23:29:32,091][81400] Updated weights for policy 0, policy_version 15020 (0.0007) [2023-03-06 23:29:32,875][81400] Updated weights for policy 0, policy_version 15030 (0.0006) [2023-03-06 23:29:33,675][81400] Updated weights for policy 0, policy_version 15040 (0.0006) [2023-03-06 23:29:34,444][81400] Updated weights for policy 0, policy_version 15050 (0.0005) [2023-03-06 23:29:35,213][81400] Updated weights for policy 0, policy_version 15060 (0.0006) [2023-03-06 23:29:35,999][81400] Updated weights for policy 0, policy_version 15070 (0.0006) [2023-03-06 23:29:36,236][81074] Fps is (10 sec: 13209.8, 60 sec: 13141.4, 300 sec: 13159.3). Total num frames: 15434752. Throughput: 0: 13127.6. Samples: 15406772. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:29:36,237][81074] Avg episode reward: [(0, '3364.306')] [2023-03-06 23:29:36,769][81400] Updated weights for policy 0, policy_version 15080 (0.0006) [2023-03-06 23:29:37,546][81400] Updated weights for policy 0, policy_version 15090 (0.0007) [2023-03-06 23:29:38,328][81400] Updated weights for policy 0, policy_version 15100 (0.0006) [2023-03-06 23:29:39,109][81400] Updated weights for policy 0, policy_version 15110 (0.0007) [2023-03-06 23:29:39,900][81400] Updated weights for policy 0, policy_version 15120 (0.0007) [2023-03-06 23:29:40,679][81400] Updated weights for policy 0, policy_version 15130 (0.0007) [2023-03-06 23:29:41,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13124.3, 300 sec: 13159.3). Total num frames: 15500288. Throughput: 0: 13131.9. Samples: 15485554. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:29:41,237][81074] Avg episode reward: [(0, '3469.570')] [2023-03-06 23:29:41,464][81400] Updated weights for policy 0, policy_version 15140 (0.0009) [2023-03-06 23:29:42,250][81400] Updated weights for policy 0, policy_version 15150 (0.0006) [2023-03-06 23:29:43,025][81400] Updated weights for policy 0, policy_version 15160 (0.0006) [2023-03-06 23:29:43,824][81400] Updated weights for policy 0, policy_version 15170 (0.0006) [2023-03-06 23:29:44,590][81400] Updated weights for policy 0, policy_version 15180 (0.0006) [2023-03-06 23:29:45,367][81400] Updated weights for policy 0, policy_version 15190 (0.0006) [2023-03-06 23:29:46,150][81400] Updated weights for policy 0, policy_version 15200 (0.0006) [2023-03-06 23:29:46,236][81074] Fps is (10 sec: 13107.0, 60 sec: 13141.3, 300 sec: 13159.3). Total num frames: 15565824. Throughput: 0: 13130.9. Samples: 15564255. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 23:29:46,237][81074] Avg episode reward: [(0, '3160.568')] [2023-03-06 23:29:46,917][81400] Updated weights for policy 0, policy_version 15210 (0.0006) [2023-03-06 23:29:47,698][81400] Updated weights for policy 0, policy_version 15220 (0.0006) [2023-03-06 23:29:48,475][81400] Updated weights for policy 0, policy_version 15230 (0.0006) [2023-03-06 23:29:49,227][81400] Updated weights for policy 0, policy_version 15240 (0.0007) [2023-03-06 23:29:50,032][81400] Updated weights for policy 0, policy_version 15250 (0.0007) [2023-03-06 23:29:50,814][81400] Updated weights for policy 0, policy_version 15260 (0.0006) [2023-03-06 23:29:51,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13159.3). Total num frames: 15631360. Throughput: 0: 13137.4. Samples: 15603914. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 23:29:51,237][81074] Avg episode reward: [(0, '3242.024')] [2023-03-06 23:29:51,571][81400] Updated weights for policy 0, policy_version 15270 (0.0006) [2023-03-06 23:29:52,343][81400] Updated weights for policy 0, policy_version 15280 (0.0006) [2023-03-06 23:29:53,109][81400] Updated weights for policy 0, policy_version 15290 (0.0005) [2023-03-06 23:29:53,889][81400] Updated weights for policy 0, policy_version 15300 (0.0006) [2023-03-06 23:29:54,663][81400] Updated weights for policy 0, policy_version 15310 (0.0007) [2023-03-06 23:29:55,434][81400] Updated weights for policy 0, policy_version 15320 (0.0006) [2023-03-06 23:29:56,214][81400] Updated weights for policy 0, policy_version 15330 (0.0006) [2023-03-06 23:29:56,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13141.3, 300 sec: 13159.3). Total num frames: 15697920. Throughput: 0: 13149.7. Samples: 15683336. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:29:56,237][81074] Avg episode reward: [(0, '3221.568')] [2023-03-06 23:29:56,242][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000015330_15697920.pth... [2023-03-06 23:29:56,272][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000012246_12539904.pth [2023-03-06 23:29:57,005][81400] Updated weights for policy 0, policy_version 15340 (0.0006) [2023-03-06 23:29:57,793][81400] Updated weights for policy 0, policy_version 15350 (0.0006) [2023-03-06 23:29:58,578][81400] Updated weights for policy 0, policy_version 15360 (0.0006) [2023-03-06 23:29:59,335][81400] Updated weights for policy 0, policy_version 15370 (0.0007) [2023-03-06 23:30:00,124][81400] Updated weights for policy 0, policy_version 15380 (0.0006) [2023-03-06 23:30:00,902][81400] Updated weights for policy 0, policy_version 15390 (0.0007) [2023-03-06 23:30:01,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13141.3, 300 sec: 13159.3). Total num frames: 15763456. Throughput: 0: 13143.4. Samples: 15762028. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:30:01,237][81074] Avg episode reward: [(0, '3131.731')] [2023-03-06 23:30:01,674][81400] Updated weights for policy 0, policy_version 15400 (0.0006) [2023-03-06 23:30:02,478][81400] Updated weights for policy 0, policy_version 15410 (0.0006) [2023-03-06 23:30:03,240][81400] Updated weights for policy 0, policy_version 15420 (0.0007) [2023-03-06 23:30:04,007][81400] Updated weights for policy 0, policy_version 15430 (0.0007) [2023-03-06 23:30:04,814][81400] Updated weights for policy 0, policy_version 15440 (0.0006) [2023-03-06 23:30:05,574][81400] Updated weights for policy 0, policy_version 15450 (0.0006) [2023-03-06 23:30:06,236][81074] Fps is (10 sec: 13107.5, 60 sec: 13141.4, 300 sec: 13159.3). Total num frames: 15828992. Throughput: 0: 13142.3. 
Samples: 15801324. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:30:06,237][81074] Avg episode reward: [(0, '3213.996')] [2023-03-06 23:30:06,363][81400] Updated weights for policy 0, policy_version 15460 (0.0006) [2023-03-06 23:30:07,147][81400] Updated weights for policy 0, policy_version 15470 (0.0006) [2023-03-06 23:30:07,926][81400] Updated weights for policy 0, policy_version 15480 (0.0006) [2023-03-06 23:30:08,704][81400] Updated weights for policy 0, policy_version 15490 (0.0006) [2023-03-06 23:30:09,479][81400] Updated weights for policy 0, policy_version 15500 (0.0007) [2023-03-06 23:30:10,248][81400] Updated weights for policy 0, policy_version 15510 (0.0007) [2023-03-06 23:30:11,039][81400] Updated weights for policy 0, policy_version 15520 (0.0007) [2023-03-06 23:30:11,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13159.3). Total num frames: 15894528. Throughput: 0: 13146.7. Samples: 15880265. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:30:11,237][81074] Avg episode reward: [(0, '3212.672')] [2023-03-06 23:30:11,806][81400] Updated weights for policy 0, policy_version 15530 (0.0006) [2023-03-06 23:30:12,583][81400] Updated weights for policy 0, policy_version 15540 (0.0007) [2023-03-06 23:30:13,366][81400] Updated weights for policy 0, policy_version 15550 (0.0006) [2023-03-06 23:30:14,144][81400] Updated weights for policy 0, policy_version 15560 (0.0007) [2023-03-06 23:30:14,929][81400] Updated weights for policy 0, policy_version 15570 (0.0006) [2023-03-06 23:30:15,711][81400] Updated weights for policy 0, policy_version 15580 (0.0006) [2023-03-06 23:30:16,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13155.8). Total num frames: 15960064. Throughput: 0: 13148.9. Samples: 15959156. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:30:16,237][81074] Avg episode reward: [(0, '3449.976')] [2023-03-06 23:30:16,483][81400] Updated weights for policy 0, policy_version 15590 (0.0006) [2023-03-06 23:30:17,258][81400] Updated weights for policy 0, policy_version 15600 (0.0006) [2023-03-06 23:30:18,043][81400] Updated weights for policy 0, policy_version 15610 (0.0006) [2023-03-06 23:30:18,815][81400] Updated weights for policy 0, policy_version 15620 (0.0006) [2023-03-06 23:30:19,593][81400] Updated weights for policy 0, policy_version 15630 (0.0006) [2023-03-06 23:30:20,383][81400] Updated weights for policy 0, policy_version 15640 (0.0007) [2023-03-06 23:30:21,147][81400] Updated weights for policy 0, policy_version 15650 (0.0006) [2023-03-06 23:30:21,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13155.8). Total num frames: 16025600. Throughput: 0: 13154.0. Samples: 15998701. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:30:21,237][81074] Avg episode reward: [(0, '3492.933')] [2023-03-06 23:30:21,932][81400] Updated weights for policy 0, policy_version 15660 (0.0006) [2023-03-06 23:30:22,717][81400] Updated weights for policy 0, policy_version 15670 (0.0006) [2023-03-06 23:30:23,486][81400] Updated weights for policy 0, policy_version 15680 (0.0007) [2023-03-06 23:30:24,259][81400] Updated weights for policy 0, policy_version 15690 (0.0006) [2023-03-06 23:30:25,049][81400] Updated weights for policy 0, policy_version 15700 (0.0006) [2023-03-06 23:30:25,831][81400] Updated weights for policy 0, policy_version 15710 (0.0006) [2023-03-06 23:30:26,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13159.3). Total num frames: 16092160. Throughput: 0: 13160.0. Samples: 16077755. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:30:26,237][81074] Avg episode reward: [(0, '3404.682')] [2023-03-06 23:30:26,608][81400] Updated weights for policy 0, policy_version 15720 (0.0006) [2023-03-06 23:30:27,392][81400] Updated weights for policy 0, policy_version 15730 (0.0006) [2023-03-06 23:30:28,180][81400] Updated weights for policy 0, policy_version 15740 (0.0006) [2023-03-06 23:30:28,966][81400] Updated weights for policy 0, policy_version 15750 (0.0005) [2023-03-06 23:30:29,738][81400] Updated weights for policy 0, policy_version 15760 (0.0007) [2023-03-06 23:30:30,513][81400] Updated weights for policy 0, policy_version 15770 (0.0006) [2023-03-06 23:30:31,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13158.4, 300 sec: 13159.3). Total num frames: 16157696. Throughput: 0: 13162.4. Samples: 16156564. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:30:31,237][81074] Avg episode reward: [(0, '3439.507')] [2023-03-06 23:30:31,281][81400] Updated weights for policy 0, policy_version 15780 (0.0006) [2023-03-06 23:30:32,048][81400] Updated weights for policy 0, policy_version 15790 (0.0006) [2023-03-06 23:30:32,841][81400] Updated weights for policy 0, policy_version 15800 (0.0006) [2023-03-06 23:30:33,614][81400] Updated weights for policy 0, policy_version 15810 (0.0006) [2023-03-06 23:30:34,401][81400] Updated weights for policy 0, policy_version 15820 (0.0005) [2023-03-06 23:30:35,202][81400] Updated weights for policy 0, policy_version 15830 (0.0006) [2023-03-06 23:30:35,994][81400] Updated weights for policy 0, policy_version 15840 (0.0006) [2023-03-06 23:30:36,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13155.8). Total num frames: 16223232. Throughput: 0: 13153.6. Samples: 16195829. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:30:36,237][81074] Avg episode reward: [(0, '3569.470')] [2023-03-06 23:30:36,758][81400] Updated weights for policy 0, policy_version 15850 (0.0006) [2023-03-06 23:30:37,539][81400] Updated weights for policy 0, policy_version 15860 (0.0007) [2023-03-06 23:30:38,323][81400] Updated weights for policy 0, policy_version 15870 (0.0007) [2023-03-06 23:30:39,100][81400] Updated weights for policy 0, policy_version 15880 (0.0006) [2023-03-06 23:30:39,870][81400] Updated weights for policy 0, policy_version 15890 (0.0006) [2023-03-06 23:30:40,650][81400] Updated weights for policy 0, policy_version 15900 (0.0006) [2023-03-06 23:30:41,236][81074] Fps is (10 sec: 13107.4, 60 sec: 13141.3, 300 sec: 13155.8). Total num frames: 16288768. Throughput: 0: 13136.9. Samples: 16274495. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:30:41,237][81074] Avg episode reward: [(0, '3504.818')] [2023-03-06 23:30:41,413][81400] Updated weights for policy 0, policy_version 15910 (0.0006) [2023-03-06 23:30:42,211][81400] Updated weights for policy 0, policy_version 15920 (0.0007) [2023-03-06 23:30:43,003][81400] Updated weights for policy 0, policy_version 15930 (0.0006) [2023-03-06 23:30:43,781][81400] Updated weights for policy 0, policy_version 15940 (0.0006) [2023-03-06 23:30:44,549][81400] Updated weights for policy 0, policy_version 15950 (0.0007) [2023-03-06 23:30:45,317][81400] Updated weights for policy 0, policy_version 15960 (0.0006) [2023-03-06 23:30:46,103][81400] Updated weights for policy 0, policy_version 15970 (0.0007) [2023-03-06 23:30:46,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13152.3). Total num frames: 16354304. Throughput: 0: 13137.2. Samples: 16353203. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:30:46,237][81074] Avg episode reward: [(0, '3367.489')] [2023-03-06 23:30:46,894][81400] Updated weights for policy 0, policy_version 15980 (0.0006) [2023-03-06 23:30:47,662][81400] Updated weights for policy 0, policy_version 15990 (0.0006) [2023-03-06 23:30:48,445][81400] Updated weights for policy 0, policy_version 16000 (0.0006) [2023-03-06 23:30:49,228][81400] Updated weights for policy 0, policy_version 16010 (0.0006) [2023-03-06 23:30:50,004][81400] Updated weights for policy 0, policy_version 16020 (0.0007) [2023-03-06 23:30:50,782][81400] Updated weights for policy 0, policy_version 16030 (0.0007) [2023-03-06 23:30:51,236][81074] Fps is (10 sec: 13107.0, 60 sec: 13141.3, 300 sec: 13152.3). Total num frames: 16419840. Throughput: 0: 13138.8. Samples: 16392572. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:30:51,237][81074] Avg episode reward: [(0, '3456.953')] [2023-03-06 23:30:51,564][81400] Updated weights for policy 0, policy_version 16040 (0.0006) [2023-03-06 23:30:52,334][81400] Updated weights for policy 0, policy_version 16050 (0.0006) [2023-03-06 23:30:53,102][81400] Updated weights for policy 0, policy_version 16060 (0.0006) [2023-03-06 23:30:53,882][81400] Updated weights for policy 0, policy_version 16070 (0.0006) [2023-03-06 23:30:54,661][81400] Updated weights for policy 0, policy_version 16080 (0.0006) [2023-03-06 23:30:55,469][81400] Updated weights for policy 0, policy_version 16090 (0.0006) [2023-03-06 23:30:56,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13148.9). Total num frames: 16485376. Throughput: 0: 13139.2. Samples: 16471527. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:30:56,237][81074] Avg episode reward: [(0, '3502.337')] [2023-03-06 23:30:56,246][81400] Updated weights for policy 0, policy_version 16100 (0.0006) [2023-03-06 23:30:57,013][81400] Updated weights for policy 0, policy_version 16110 (0.0006) [2023-03-06 23:30:57,801][81400] Updated weights for policy 0, policy_version 16120 (0.0006) [2023-03-06 23:30:58,593][81400] Updated weights for policy 0, policy_version 16130 (0.0006) [2023-03-06 23:30:59,362][81400] Updated weights for policy 0, policy_version 16140 (0.0006) [2023-03-06 23:31:00,155][81400] Updated weights for policy 0, policy_version 16150 (0.0006) [2023-03-06 23:31:00,951][81400] Updated weights for policy 0, policy_version 16160 (0.0006) [2023-03-06 23:31:01,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13148.9). Total num frames: 16550912. Throughput: 0: 13126.9. Samples: 16549864. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:31:01,237][81074] Avg episode reward: [(0, '3530.319')] [2023-03-06 23:31:01,723][81400] Updated weights for policy 0, policy_version 16170 (0.0006) [2023-03-06 23:31:02,493][81400] Updated weights for policy 0, policy_version 16180 (0.0006) [2023-03-06 23:31:03,273][81400] Updated weights for policy 0, policy_version 16190 (0.0006) [2023-03-06 23:31:04,058][81400] Updated weights for policy 0, policy_version 16200 (0.0006) [2023-03-06 23:31:04,846][81400] Updated weights for policy 0, policy_version 16210 (0.0007) [2023-03-06 23:31:05,627][81400] Updated weights for policy 0, policy_version 16220 (0.0007) [2023-03-06 23:31:06,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 16617472. Throughput: 0: 13127.9. Samples: 16589455. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:31:06,237][81074] Avg episode reward: [(0, '3570.627')] [2023-03-06 23:31:06,392][81400] Updated weights for policy 0, policy_version 16230 (0.0006) [2023-03-06 23:31:07,193][81400] Updated weights for policy 0, policy_version 16240 (0.0006) [2023-03-06 23:31:07,967][81400] Updated weights for policy 0, policy_version 16250 (0.0006) [2023-03-06 23:31:08,759][81400] Updated weights for policy 0, policy_version 16260 (0.0006) [2023-03-06 23:31:09,530][81400] Updated weights for policy 0, policy_version 16270 (0.0005) [2023-03-06 23:31:10,308][81400] Updated weights for policy 0, policy_version 16280 (0.0006) [2023-03-06 23:31:11,078][81400] Updated weights for policy 0, policy_version 16290 (0.0006) [2023-03-06 23:31:11,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13145.4). Total num frames: 16681984. Throughput: 0: 13118.6. Samples: 16668088. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:31:11,237][81074] Avg episode reward: [(0, '3692.186')] [2023-03-06 23:31:11,850][81400] Updated weights for policy 0, policy_version 16300 (0.0006) [2023-03-06 23:31:12,629][81400] Updated weights for policy 0, policy_version 16310 (0.0006) [2023-03-06 23:31:13,421][81400] Updated weights for policy 0, policy_version 16320 (0.0005) [2023-03-06 23:31:14,198][81400] Updated weights for policy 0, policy_version 16330 (0.0006) [2023-03-06 23:31:14,969][81400] Updated weights for policy 0, policy_version 16340 (0.0006) [2023-03-06 23:31:15,759][81400] Updated weights for policy 0, policy_version 16350 (0.0006) [2023-03-06 23:31:16,236][81074] Fps is (10 sec: 13004.6, 60 sec: 13124.3, 300 sec: 13145.4). Total num frames: 16747520. Throughput: 0: 13119.6. Samples: 16746944. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:31:16,237][81074] Avg episode reward: [(0, '3772.642')] [2023-03-06 23:31:16,547][81400] Updated weights for policy 0, policy_version 16360 (0.0007) [2023-03-06 23:31:17,334][81400] Updated weights for policy 0, policy_version 16370 (0.0007) [2023-03-06 23:31:18,118][81400] Updated weights for policy 0, policy_version 16380 (0.0006) [2023-03-06 23:31:18,901][81400] Updated weights for policy 0, policy_version 16390 (0.0006) [2023-03-06 23:31:19,699][81400] Updated weights for policy 0, policy_version 16400 (0.0006) [2023-03-06 23:31:20,465][81400] Updated weights for policy 0, policy_version 16410 (0.0006) [2023-03-06 23:31:21,234][81400] Updated weights for policy 0, policy_version 16420 (0.0006) [2023-03-06 23:31:21,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 16814080. Throughput: 0: 13119.4. Samples: 16786202. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:31:21,237][81074] Avg episode reward: [(0, '3686.548')] [2023-03-06 23:31:22,006][81400] Updated weights for policy 0, policy_version 16430 (0.0006) [2023-03-06 23:31:22,773][81400] Updated weights for policy 0, policy_version 16440 (0.0006) [2023-03-06 23:31:23,559][81400] Updated weights for policy 0, policy_version 16450 (0.0006) [2023-03-06 23:31:24,331][81400] Updated weights for policy 0, policy_version 16460 (0.0006) [2023-03-06 23:31:25,106][81400] Updated weights for policy 0, policy_version 16470 (0.0006) [2023-03-06 23:31:25,882][81400] Updated weights for policy 0, policy_version 16480 (0.0006) [2023-03-06 23:31:26,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13124.3, 300 sec: 13145.4). Total num frames: 16879616. Throughput: 0: 13127.1. Samples: 16865218. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:31:26,237][81074] Avg episode reward: [(0, '3543.996')] [2023-03-06 23:31:26,660][81400] Updated weights for policy 0, policy_version 16490 (0.0006) [2023-03-06 23:31:27,431][81400] Updated weights for policy 0, policy_version 16500 (0.0006) [2023-03-06 23:31:28,231][81400] Updated weights for policy 0, policy_version 16510 (0.0006) [2023-03-06 23:31:29,025][81400] Updated weights for policy 0, policy_version 16520 (0.0006) [2023-03-06 23:31:29,802][81400] Updated weights for policy 0, policy_version 16530 (0.0006) [2023-03-06 23:31:30,575][81400] Updated weights for policy 0, policy_version 16540 (0.0006) [2023-03-06 23:31:31,236][81074] Fps is (10 sec: 13107.4, 60 sec: 13124.3, 300 sec: 13145.4). Total num frames: 16945152. Throughput: 0: 13127.4. Samples: 16943935. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:31:31,247][81074] Avg episode reward: [(0, '3507.137')] [2023-03-06 23:31:31,361][81400] Updated weights for policy 0, policy_version 16550 (0.0007) [2023-03-06 23:31:32,130][81400] Updated weights for policy 0, policy_version 16560 (0.0006) [2023-03-06 23:31:32,886][81400] Updated weights for policy 0, policy_version 16570 (0.0006) [2023-03-06 23:31:33,670][81400] Updated weights for policy 0, policy_version 16580 (0.0006) [2023-03-06 23:31:34,460][81400] Updated weights for policy 0, policy_version 16590 (0.0006) [2023-03-06 23:31:35,234][81400] Updated weights for policy 0, policy_version 16600 (0.0006) [2023-03-06 23:31:36,001][81400] Updated weights for policy 0, policy_version 16610 (0.0007) [2023-03-06 23:31:36,236][81074] Fps is (10 sec: 13107.4, 60 sec: 13124.3, 300 sec: 13145.4). Total num frames: 17010688. Throughput: 0: 13130.7. Samples: 16983453. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:31:36,247][81074] Avg episode reward: [(0, '3459.410')] [2023-03-06 23:31:36,776][81400] Updated weights for policy 0, policy_version 16620 (0.0006) [2023-03-06 23:31:37,557][81400] Updated weights for policy 0, policy_version 16630 (0.0006) [2023-03-06 23:31:38,327][81400] Updated weights for policy 0, policy_version 16640 (0.0006) [2023-03-06 23:31:39,109][81400] Updated weights for policy 0, policy_version 16650 (0.0005) [2023-03-06 23:31:39,863][81400] Updated weights for policy 0, policy_version 16660 (0.0006) [2023-03-06 23:31:40,643][81400] Updated weights for policy 0, policy_version 16670 (0.0007) [2023-03-06 23:31:41,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 17077248. Throughput: 0: 13139.4. Samples: 17062800. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:31:41,247][81074] Avg episode reward: [(0, '3597.600')] [2023-03-06 23:31:41,425][81400] Updated weights for policy 0, policy_version 16680 (0.0006) [2023-03-06 23:31:42,202][81400] Updated weights for policy 0, policy_version 16690 (0.0006) [2023-03-06 23:31:42,991][81400] Updated weights for policy 0, policy_version 16700 (0.0006) [2023-03-06 23:31:43,763][81400] Updated weights for policy 0, policy_version 16710 (0.0006) [2023-03-06 23:31:44,558][81400] Updated weights for policy 0, policy_version 16720 (0.0007) [2023-03-06 23:31:45,338][81400] Updated weights for policy 0, policy_version 16730 (0.0006) [2023-03-06 23:31:46,105][81400] Updated weights for policy 0, policy_version 16740 (0.0006) [2023-03-06 23:31:46,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 17142784. Throughput: 0: 13156.2. Samples: 17141893. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:31:46,247][81074] Avg episode reward: [(0, '3521.106')] [2023-03-06 23:31:46,885][81400] Updated weights for policy 0, policy_version 16750 (0.0007) [2023-03-06 23:31:47,662][81400] Updated weights for policy 0, policy_version 16760 (0.0006) [2023-03-06 23:31:48,425][81400] Updated weights for policy 0, policy_version 16770 (0.0006) [2023-03-06 23:31:49,214][81400] Updated weights for policy 0, policy_version 16780 (0.0005) [2023-03-06 23:31:49,991][81400] Updated weights for policy 0, policy_version 16790 (0.0006) [2023-03-06 23:31:50,778][81400] Updated weights for policy 0, policy_version 16800 (0.0007) [2023-03-06 23:31:51,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 17209344. Throughput: 0: 13149.2. Samples: 17181168. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:31:51,247][81074] Avg episode reward: [(0, '3573.631')] [2023-03-06 23:31:51,552][81400] Updated weights for policy 0, policy_version 16810 (0.0007) [2023-03-06 23:31:52,340][81400] Updated weights for policy 0, policy_version 16820 (0.0007) [2023-03-06 23:31:53,110][81400] Updated weights for policy 0, policy_version 16830 (0.0006) [2023-03-06 23:31:53,887][81400] Updated weights for policy 0, policy_version 16840 (0.0007) [2023-03-06 23:31:54,671][81400] Updated weights for policy 0, policy_version 16850 (0.0006) [2023-03-06 23:31:55,439][81400] Updated weights for policy 0, policy_version 16860 (0.0006) [2023-03-06 23:31:56,220][81400] Updated weights for policy 0, policy_version 16870 (0.0006) [2023-03-06 23:31:56,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 17274880. Throughput: 0: 13158.4. Samples: 17260216. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:31:56,247][81074] Avg episode reward: [(0, '3497.620')] [2023-03-06 23:31:56,252][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000016870_17274880.pth... [2023-03-06 23:31:56,282][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000013790_14120960.pth [2023-03-06 23:31:57,014][81400] Updated weights for policy 0, policy_version 16880 (0.0006) [2023-03-06 23:31:57,789][81400] Updated weights for policy 0, policy_version 16890 (0.0006) [2023-03-06 23:31:58,574][81400] Updated weights for policy 0, policy_version 16900 (0.0007) [2023-03-06 23:31:59,366][81400] Updated weights for policy 0, policy_version 16910 (0.0006) [2023-03-06 23:32:00,136][81400] Updated weights for policy 0, policy_version 16920 (0.0006) [2023-03-06 23:32:00,925][81400] Updated weights for policy 0, policy_version 16930 (0.0006) [2023-03-06 23:32:01,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 17340416. Throughput: 0: 13150.7. Samples: 17338726. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:32:01,247][81074] Avg episode reward: [(0, '3575.868')] [2023-03-06 23:32:01,687][81400] Updated weights for policy 0, policy_version 16940 (0.0006) [2023-03-06 23:32:02,473][81400] Updated weights for policy 0, policy_version 16950 (0.0007) [2023-03-06 23:32:03,261][81400] Updated weights for policy 0, policy_version 16960 (0.0006) [2023-03-06 23:32:04,034][81400] Updated weights for policy 0, policy_version 16970 (0.0005) [2023-03-06 23:32:04,807][81400] Updated weights for policy 0, policy_version 16980 (0.0006) [2023-03-06 23:32:05,590][81400] Updated weights for policy 0, policy_version 16990 (0.0006) [2023-03-06 23:32:06,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 17405952. Throughput: 0: 13157.7. 
Samples: 17378297. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:32:06,247][81074] Avg episode reward: [(0, '3388.683')] [2023-03-06 23:32:06,377][81400] Updated weights for policy 0, policy_version 17000 (0.0006) [2023-03-06 23:32:07,158][81400] Updated weights for policy 0, policy_version 17010 (0.0007) [2023-03-06 23:32:07,946][81400] Updated weights for policy 0, policy_version 17020 (0.0007) [2023-03-06 23:32:08,721][81400] Updated weights for policy 0, policy_version 17030 (0.0005) [2023-03-06 23:32:09,492][81400] Updated weights for policy 0, policy_version 17040 (0.0007) [2023-03-06 23:32:10,301][81400] Updated weights for policy 0, policy_version 17050 (0.0006) [2023-03-06 23:32:11,073][81400] Updated weights for policy 0, policy_version 17060 (0.0007) [2023-03-06 23:32:11,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 17471488. Throughput: 0: 13147.2. Samples: 17456841. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:32:11,247][81074] Avg episode reward: [(0, '3107.470')] [2023-03-06 23:32:11,850][81400] Updated weights for policy 0, policy_version 17070 (0.0006) [2023-03-06 23:32:12,620][81400] Updated weights for policy 0, policy_version 17080 (0.0007) [2023-03-06 23:32:13,410][81400] Updated weights for policy 0, policy_version 17090 (0.0006) [2023-03-06 23:32:14,198][81400] Updated weights for policy 0, policy_version 17100 (0.0006) [2023-03-06 23:32:14,970][81400] Updated weights for policy 0, policy_version 17110 (0.0005) [2023-03-06 23:32:15,757][81400] Updated weights for policy 0, policy_version 17120 (0.0006) [2023-03-06 23:32:16,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 17537024. Throughput: 0: 13145.5. Samples: 17535483. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:32:16,247][81074] Avg episode reward: [(0, '2996.546')] [2023-03-06 23:32:16,533][81400] Updated weights for policy 0, policy_version 17130 (0.0005) [2023-03-06 23:32:17,310][81400] Updated weights for policy 0, policy_version 17140 (0.0006) [2023-03-06 23:32:18,093][81400] Updated weights for policy 0, policy_version 17150 (0.0006) [2023-03-06 23:32:18,862][81400] Updated weights for policy 0, policy_version 17160 (0.0006) [2023-03-06 23:32:19,642][81400] Updated weights for policy 0, policy_version 17170 (0.0006) [2023-03-06 23:32:20,422][81400] Updated weights for policy 0, policy_version 17180 (0.0006) [2023-03-06 23:32:21,218][81400] Updated weights for policy 0, policy_version 17190 (0.0006) [2023-03-06 23:32:21,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13141.4, 300 sec: 13141.9). Total num frames: 17602560. Throughput: 0: 13146.6. Samples: 17575048. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:32:21,247][81074] Avg episode reward: [(0, '3053.185')] [2023-03-06 23:32:21,978][81400] Updated weights for policy 0, policy_version 17200 (0.0006) [2023-03-06 23:32:22,749][81400] Updated weights for policy 0, policy_version 17210 (0.0006) [2023-03-06 23:32:23,530][81400] Updated weights for policy 0, policy_version 17220 (0.0006) [2023-03-06 23:32:24,304][81400] Updated weights for policy 0, policy_version 17230 (0.0006) [2023-03-06 23:32:25,082][81400] Updated weights for policy 0, policy_version 17240 (0.0006) [2023-03-06 23:32:25,853][81400] Updated weights for policy 0, policy_version 17250 (0.0006) [2023-03-06 23:32:26,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 17668096. Throughput: 0: 13137.9. Samples: 17654008. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:32:26,237][81074] Avg episode reward: [(0, '3143.095')] [2023-03-06 23:32:26,639][81400] Updated weights for policy 0, policy_version 17260 (0.0006) [2023-03-06 23:32:27,408][81400] Updated weights for policy 0, policy_version 17270 (0.0006) [2023-03-06 23:32:28,188][81400] Updated weights for policy 0, policy_version 17280 (0.0006) [2023-03-06 23:32:28,968][81400] Updated weights for policy 0, policy_version 17290 (0.0006) [2023-03-06 23:32:29,735][81400] Updated weights for policy 0, policy_version 17300 (0.0006) [2023-03-06 23:32:30,521][81400] Updated weights for policy 0, policy_version 17310 (0.0006) [2023-03-06 23:32:31,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13141.9). Total num frames: 17734656. Throughput: 0: 13137.9. Samples: 17733100. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:32:31,237][81074] Avg episode reward: [(0, '3152.757')] [2023-03-06 23:32:31,299][81400] Updated weights for policy 0, policy_version 17320 (0.0006) [2023-03-06 23:32:32,096][81400] Updated weights for policy 0, policy_version 17330 (0.0007) [2023-03-06 23:32:32,865][81400] Updated weights for policy 0, policy_version 17340 (0.0006) [2023-03-06 23:32:33,639][81400] Updated weights for policy 0, policy_version 17350 (0.0006) [2023-03-06 23:32:34,412][81400] Updated weights for policy 0, policy_version 17360 (0.0006) [2023-03-06 23:32:35,195][81400] Updated weights for policy 0, policy_version 17370 (0.0007) [2023-03-06 23:32:35,971][81400] Updated weights for policy 0, policy_version 17380 (0.0006) [2023-03-06 23:32:36,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13141.9). Total num frames: 17800192. Throughput: 0: 13140.3. Samples: 17772480. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:32:36,237][81074] Avg episode reward: [(0, '3215.106')] [2023-03-06 23:32:36,762][81400] Updated weights for policy 0, policy_version 17390 (0.0007) [2023-03-06 23:32:37,552][81400] Updated weights for policy 0, policy_version 17400 (0.0006) [2023-03-06 23:32:38,322][81400] Updated weights for policy 0, policy_version 17410 (0.0006) [2023-03-06 23:32:39,114][81400] Updated weights for policy 0, policy_version 17420 (0.0006) [2023-03-06 23:32:39,897][81400] Updated weights for policy 0, policy_version 17430 (0.0007) [2023-03-06 23:32:40,662][81400] Updated weights for policy 0, policy_version 17440 (0.0006) [2023-03-06 23:32:41,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 17865728. Throughput: 0: 13132.5. Samples: 17851179. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:32:41,237][81074] Avg episode reward: [(0, '3331.603')] [2023-03-06 23:32:41,461][81400] Updated weights for policy 0, policy_version 17450 (0.0006) [2023-03-06 23:32:42,227][81400] Updated weights for policy 0, policy_version 17460 (0.0007) [2023-03-06 23:32:43,006][81400] Updated weights for policy 0, policy_version 17470 (0.0006) [2023-03-06 23:32:43,786][81400] Updated weights for policy 0, policy_version 17480 (0.0006) [2023-03-06 23:32:44,575][81400] Updated weights for policy 0, policy_version 17490 (0.0006) [2023-03-06 23:32:45,352][81400] Updated weights for policy 0, policy_version 17500 (0.0006) [2023-03-06 23:32:46,123][81400] Updated weights for policy 0, policy_version 17510 (0.0006) [2023-03-06 23:32:46,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13138.4). Total num frames: 17931264. Throughput: 0: 13141.6. Samples: 17930096. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:32:46,237][81074] Avg episode reward: [(0, '3301.980')] [2023-03-06 23:32:46,927][81400] Updated weights for policy 0, policy_version 17520 (0.0007) [2023-03-06 23:32:47,693][81400] Updated weights for policy 0, policy_version 17530 (0.0007) [2023-03-06 23:32:48,478][81400] Updated weights for policy 0, policy_version 17540 (0.0007) [2023-03-06 23:32:49,253][81400] Updated weights for policy 0, policy_version 17550 (0.0007) [2023-03-06 23:32:50,013][81400] Updated weights for policy 0, policy_version 17560 (0.0006) [2023-03-06 23:32:50,791][81400] Updated weights for policy 0, policy_version 17570 (0.0006) [2023-03-06 23:32:51,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13124.2, 300 sec: 13141.9). Total num frames: 17996800. Throughput: 0: 13132.1. Samples: 17969243. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:32:51,237][81074] Avg episode reward: [(0, '3406.823')] [2023-03-06 23:32:51,561][81400] Updated weights for policy 0, policy_version 17580 (0.0005) [2023-03-06 23:32:52,344][81400] Updated weights for policy 0, policy_version 17590 (0.0007) [2023-03-06 23:32:53,125][81400] Updated weights for policy 0, policy_version 17600 (0.0006) [2023-03-06 23:32:53,903][81400] Updated weights for policy 0, policy_version 17610 (0.0006) [2023-03-06 23:32:54,675][81400] Updated weights for policy 0, policy_version 17620 (0.0006) [2023-03-06 23:32:55,471][81400] Updated weights for policy 0, policy_version 17630 (0.0006) [2023-03-06 23:32:56,236][81074] Fps is (10 sec: 13107.0, 60 sec: 13124.3, 300 sec: 13141.9). Total num frames: 18062336. Throughput: 0: 13144.4. Samples: 18048340. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:32:56,237][81074] Avg episode reward: [(0, '3269.035')] [2023-03-06 23:32:56,247][81400] Updated weights for policy 0, policy_version 17640 (0.0006) [2023-03-06 23:32:57,023][81400] Updated weights for policy 0, policy_version 17650 (0.0006) [2023-03-06 23:32:57,799][81400] Updated weights for policy 0, policy_version 17660 (0.0006) [2023-03-06 23:32:58,598][81400] Updated weights for policy 0, policy_version 17670 (0.0007) [2023-03-06 23:32:59,364][81400] Updated weights for policy 0, policy_version 17680 (0.0006) [2023-03-06 23:33:00,142][81400] Updated weights for policy 0, policy_version 17690 (0.0006) [2023-03-06 23:33:00,894][81400] Updated weights for policy 0, policy_version 17700 (0.0005) [2023-03-06 23:33:01,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 18128896. Throughput: 0: 13153.6. Samples: 18127396. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:33:01,237][81074] Avg episode reward: [(0, '3142.100')] [2023-03-06 23:33:01,684][81400] Updated weights for policy 0, policy_version 17710 (0.0006) [2023-03-06 23:33:02,480][81400] Updated weights for policy 0, policy_version 17720 (0.0006) [2023-03-06 23:33:03,249][81400] Updated weights for policy 0, policy_version 17730 (0.0006) [2023-03-06 23:33:04,052][81400] Updated weights for policy 0, policy_version 17740 (0.0006) [2023-03-06 23:33:04,806][81400] Updated weights for policy 0, policy_version 17750 (0.0006) [2023-03-06 23:33:05,577][81400] Updated weights for policy 0, policy_version 17760 (0.0006) [2023-03-06 23:33:06,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 18194432. Throughput: 0: 13147.5. Samples: 18166684. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:33:06,237][81074] Avg episode reward: [(0, '3365.008')] [2023-03-06 23:33:06,350][81400] Updated weights for policy 0, policy_version 17770 (0.0006) [2023-03-06 23:33:07,120][81400] Updated weights for policy 0, policy_version 17780 (0.0007) [2023-03-06 23:33:07,896][81400] Updated weights for policy 0, policy_version 17790 (0.0006) [2023-03-06 23:33:08,663][81400] Updated weights for policy 0, policy_version 17800 (0.0006) [2023-03-06 23:33:09,449][81400] Updated weights for policy 0, policy_version 17810 (0.0006) [2023-03-06 23:33:10,249][81400] Updated weights for policy 0, policy_version 17820 (0.0006) [2023-03-06 23:33:11,022][81400] Updated weights for policy 0, policy_version 17830 (0.0007) [2023-03-06 23:33:11,236][81074] Fps is (10 sec: 13107.4, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 18259968. Throughput: 0: 13153.8. Samples: 18245929. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:33:11,237][81074] Avg episode reward: [(0, '3302.535')] [2023-03-06 23:33:11,805][81400] Updated weights for policy 0, policy_version 17840 (0.0006) [2023-03-06 23:33:12,567][81400] Updated weights for policy 0, policy_version 17850 (0.0006) [2023-03-06 23:33:13,351][81400] Updated weights for policy 0, policy_version 17860 (0.0006) [2023-03-06 23:33:14,141][81400] Updated weights for policy 0, policy_version 17870 (0.0006) [2023-03-06 23:33:14,918][81400] Updated weights for policy 0, policy_version 17880 (0.0006) [2023-03-06 23:33:15,698][81400] Updated weights for policy 0, policy_version 17890 (0.0006) [2023-03-06 23:33:16,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 18326528. Throughput: 0: 13150.7. Samples: 18324884. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:33:16,237][81074] Avg episode reward: [(0, '3264.717')] [2023-03-06 23:33:16,478][81400] Updated weights for policy 0, policy_version 17900 (0.0006) [2023-03-06 23:33:17,270][81400] Updated weights for policy 0, policy_version 17910 (0.0005) [2023-03-06 23:33:18,042][81400] Updated weights for policy 0, policy_version 17920 (0.0006) [2023-03-06 23:33:18,815][81400] Updated weights for policy 0, policy_version 17930 (0.0006) [2023-03-06 23:33:19,604][81400] Updated weights for policy 0, policy_version 17940 (0.0006) [2023-03-06 23:33:20,385][81400] Updated weights for policy 0, policy_version 17950 (0.0006) [2023-03-06 23:33:21,164][81400] Updated weights for policy 0, policy_version 17960 (0.0007) [2023-03-06 23:33:21,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 18391040. Throughput: 0: 13148.2. Samples: 18364150. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:33:21,237][81074] Avg episode reward: [(0, '3292.435')] [2023-03-06 23:33:21,945][81400] Updated weights for policy 0, policy_version 17970 (0.0006) [2023-03-06 23:33:22,724][81400] Updated weights for policy 0, policy_version 17980 (0.0006) [2023-03-06 23:33:23,486][81400] Updated weights for policy 0, policy_version 17990 (0.0006) [2023-03-06 23:33:24,264][81400] Updated weights for policy 0, policy_version 18000 (0.0006) [2023-03-06 23:33:25,030][81400] Updated weights for policy 0, policy_version 18010 (0.0006) [2023-03-06 23:33:25,812][81400] Updated weights for policy 0, policy_version 18020 (0.0006) [2023-03-06 23:33:26,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13141.9). Total num frames: 18457600. Throughput: 0: 13156.3. Samples: 18443213. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:33:26,237][81074] Avg episode reward: [(0, '3290.315')] [2023-03-06 23:33:26,587][81400] Updated weights for policy 0, policy_version 18030 (0.0007) [2023-03-06 23:33:27,359][81400] Updated weights for policy 0, policy_version 18040 (0.0006) [2023-03-06 23:33:28,142][81400] Updated weights for policy 0, policy_version 18050 (0.0006) [2023-03-06 23:33:28,939][81400] Updated weights for policy 0, policy_version 18060 (0.0006) [2023-03-06 23:33:29,722][81400] Updated weights for policy 0, policy_version 18070 (0.0006) [2023-03-06 23:33:30,481][81400] Updated weights for policy 0, policy_version 18080 (0.0006) [2023-03-06 23:33:31,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 18523136. Throughput: 0: 13160.1. Samples: 18522301. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:33:31,237][81074] Avg episode reward: [(0, '3255.691')] [2023-03-06 23:33:31,257][81400] Updated weights for policy 0, policy_version 18090 (0.0006) [2023-03-06 23:33:32,037][81400] Updated weights for policy 0, policy_version 18100 (0.0007) [2023-03-06 23:33:32,804][81400] Updated weights for policy 0, policy_version 18110 (0.0007) [2023-03-06 23:33:33,594][81400] Updated weights for policy 0, policy_version 18120 (0.0006) [2023-03-06 23:33:34,379][81400] Updated weights for policy 0, policy_version 18130 (0.0006) [2023-03-06 23:33:35,142][81400] Updated weights for policy 0, policy_version 18140 (0.0006) [2023-03-06 23:33:35,908][81400] Updated weights for policy 0, policy_version 18150 (0.0007) [2023-03-06 23:33:36,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13141.9). Total num frames: 18589696. Throughput: 0: 13163.8. Samples: 18561616. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:33:36,237][81074] Avg episode reward: [(0, '3250.330')] [2023-03-06 23:33:36,681][81400] Updated weights for policy 0, policy_version 18160 (0.0006) [2023-03-06 23:33:37,449][81400] Updated weights for policy 0, policy_version 18170 (0.0006) [2023-03-06 23:33:38,250][81400] Updated weights for policy 0, policy_version 18180 (0.0006) [2023-03-06 23:33:39,025][81400] Updated weights for policy 0, policy_version 18190 (0.0007) [2023-03-06 23:33:39,793][81400] Updated weights for policy 0, policy_version 18200 (0.0006) [2023-03-06 23:33:40,577][81400] Updated weights for policy 0, policy_version 18210 (0.0006) [2023-03-06 23:33:41,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 18655232. Throughput: 0: 13166.8. Samples: 18640847. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:33:41,237][81074] Avg episode reward: [(0, '3265.375')] [2023-03-06 23:33:41,363][81400] Updated weights for policy 0, policy_version 18220 (0.0007) [2023-03-06 23:33:42,141][81400] Updated weights for policy 0, policy_version 18230 (0.0007) [2023-03-06 23:33:42,908][81400] Updated weights for policy 0, policy_version 18240 (0.0006) [2023-03-06 23:33:43,701][81400] Updated weights for policy 0, policy_version 18250 (0.0005) [2023-03-06 23:33:44,476][81400] Updated weights for policy 0, policy_version 18260 (0.0006) [2023-03-06 23:33:45,245][81400] Updated weights for policy 0, policy_version 18270 (0.0006) [2023-03-06 23:33:46,012][81400] Updated weights for policy 0, policy_version 18280 (0.0005) [2023-03-06 23:33:46,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 18720768. Throughput: 0: 13164.1. Samples: 18719780. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:33:46,237][81074] Avg episode reward: [(0, '3354.708')] [2023-03-06 23:33:46,802][81400] Updated weights for policy 0, policy_version 18290 (0.0006) [2023-03-06 23:33:47,605][81400] Updated weights for policy 0, policy_version 18300 (0.0006) [2023-03-06 23:33:48,379][81400] Updated weights for policy 0, policy_version 18310 (0.0006) [2023-03-06 23:33:49,153][81400] Updated weights for policy 0, policy_version 18320 (0.0006) [2023-03-06 23:33:49,924][81400] Updated weights for policy 0, policy_version 18330 (0.0006) [2023-03-06 23:33:50,703][81400] Updated weights for policy 0, policy_version 18340 (0.0006) [2023-03-06 23:33:51,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13145.4). Total num frames: 18787328. Throughput: 0: 13165.6. Samples: 18759134. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:33:51,237][81074] Avg episode reward: [(0, '3330.560')] [2023-03-06 23:33:51,457][81400] Updated weights for policy 0, policy_version 18350 (0.0006) [2023-03-06 23:33:52,239][81400] Updated weights for policy 0, policy_version 18360 (0.0006) [2023-03-06 23:33:53,007][81400] Updated weights for policy 0, policy_version 18370 (0.0006) [2023-03-06 23:33:53,788][81400] Updated weights for policy 0, policy_version 18380 (0.0007) [2023-03-06 23:33:54,568][81400] Updated weights for policy 0, policy_version 18390 (0.0006) [2023-03-06 23:33:55,344][81400] Updated weights for policy 0, policy_version 18400 (0.0006) [2023-03-06 23:33:56,109][81400] Updated weights for policy 0, policy_version 18410 (0.0007) [2023-03-06 23:33:56,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13175.5, 300 sec: 13145.4). Total num frames: 18852864. Throughput: 0: 13172.0. Samples: 18838668. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:33:56,237][81074] Avg episode reward: [(0, '3345.935')] [2023-03-06 23:33:56,241][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000018411_18852864.pth... [2023-03-06 23:33:56,270][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000015330_15697920.pth [2023-03-06 23:33:56,882][81400] Updated weights for policy 0, policy_version 18420 (0.0006) [2023-03-06 23:33:57,666][81400] Updated weights for policy 0, policy_version 18430 (0.0005) [2023-03-06 23:33:58,454][81400] Updated weights for policy 0, policy_version 18440 (0.0007) [2023-03-06 23:33:59,226][81400] Updated weights for policy 0, policy_version 18450 (0.0006) [2023-03-06 23:34:00,009][81400] Updated weights for policy 0, policy_version 18460 (0.0006) [2023-03-06 23:34:00,787][81400] Updated weights for policy 0, policy_version 18470 (0.0006) [2023-03-06 23:34:01,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 18918400. Throughput: 0: 13170.3. Samples: 18917549. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:34:01,237][81074] Avg episode reward: [(0, '3299.438')] [2023-03-06 23:34:01,570][81400] Updated weights for policy 0, policy_version 18480 (0.0006) [2023-03-06 23:34:02,355][81400] Updated weights for policy 0, policy_version 18490 (0.0006) [2023-03-06 23:34:03,129][81400] Updated weights for policy 0, policy_version 18500 (0.0007) [2023-03-06 23:34:03,921][81400] Updated weights for policy 0, policy_version 18510 (0.0006) [2023-03-06 23:34:04,690][81400] Updated weights for policy 0, policy_version 18520 (0.0006) [2023-03-06 23:34:05,473][81400] Updated weights for policy 0, policy_version 18530 (0.0006) [2023-03-06 23:34:06,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 18983936. Throughput: 0: 13169.2. 
Samples: 18956768. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:34:06,237][81074] Avg episode reward: [(0, '3273.743')] [2023-03-06 23:34:06,249][81400] Updated weights for policy 0, policy_version 18540 (0.0006) [2023-03-06 23:34:07,018][81400] Updated weights for policy 0, policy_version 18550 (0.0005) [2023-03-06 23:34:07,804][81400] Updated weights for policy 0, policy_version 18560 (0.0006) [2023-03-06 23:34:08,582][81400] Updated weights for policy 0, policy_version 18570 (0.0007) [2023-03-06 23:34:09,394][81400] Updated weights for policy 0, policy_version 18580 (0.0007) [2023-03-06 23:34:10,171][81400] Updated weights for policy 0, policy_version 18590 (0.0006) [2023-03-06 23:34:10,949][81400] Updated weights for policy 0, policy_version 18600 (0.0006) [2023-03-06 23:34:11,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 19049472. Throughput: 0: 13158.7. Samples: 19035358. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:34:11,237][81074] Avg episode reward: [(0, '3169.166')] [2023-03-06 23:34:11,726][81400] Updated weights for policy 0, policy_version 18610 (0.0006) [2023-03-06 23:34:12,499][81400] Updated weights for policy 0, policy_version 18620 (0.0006) [2023-03-06 23:34:13,300][81400] Updated weights for policy 0, policy_version 18630 (0.0006) [2023-03-06 23:34:14,065][81400] Updated weights for policy 0, policy_version 18640 (0.0006) [2023-03-06 23:34:14,839][81400] Updated weights for policy 0, policy_version 18650 (0.0007) [2023-03-06 23:34:15,621][81400] Updated weights for policy 0, policy_version 18660 (0.0006) [2023-03-06 23:34:16,236][81074] Fps is (10 sec: 13107.4, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 19115008. Throughput: 0: 13151.8. Samples: 19114131. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:34:16,237][81074] Avg episode reward: [(0, '3049.166')] [2023-03-06 23:34:16,409][81400] Updated weights for policy 0, policy_version 18670 (0.0008) [2023-03-06 23:34:17,175][81400] Updated weights for policy 0, policy_version 18680 (0.0006) [2023-03-06 23:34:17,950][81400] Updated weights for policy 0, policy_version 18690 (0.0006) [2023-03-06 23:34:18,725][81400] Updated weights for policy 0, policy_version 18700 (0.0006) [2023-03-06 23:34:19,514][81400] Updated weights for policy 0, policy_version 18710 (0.0006) [2023-03-06 23:34:20,285][81400] Updated weights for policy 0, policy_version 18720 (0.0006) [2023-03-06 23:34:21,042][81400] Updated weights for policy 0, policy_version 18730 (0.0006) [2023-03-06 23:34:21,236][81074] Fps is (10 sec: 13209.8, 60 sec: 13175.5, 300 sec: 13148.9). Total num frames: 19181568. Throughput: 0: 13158.0. Samples: 19153726. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:34:21,237][81074] Avg episode reward: [(0, '3193.576')] [2023-03-06 23:34:21,824][81400] Updated weights for policy 0, policy_version 18740 (0.0006) [2023-03-06 23:34:22,605][81400] Updated weights for policy 0, policy_version 18750 (0.0007) [2023-03-06 23:34:23,370][81400] Updated weights for policy 0, policy_version 18760 (0.0006) [2023-03-06 23:34:24,157][81400] Updated weights for policy 0, policy_version 18770 (0.0006) [2023-03-06 23:34:24,954][81400] Updated weights for policy 0, policy_version 18780 (0.0007) [2023-03-06 23:34:25,709][81400] Updated weights for policy 0, policy_version 18790 (0.0006) [2023-03-06 23:34:26,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13175.5, 300 sec: 13152.3). Total num frames: 19248128. Throughput: 0: 13158.0. Samples: 19232954. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:34:26,237][81074] Avg episode reward: [(0, '3207.845')] [2023-03-06 23:34:26,471][81400] Updated weights for policy 0, policy_version 18800 (0.0006) [2023-03-06 23:34:27,246][81400] Updated weights for policy 0, policy_version 18810 (0.0006) [2023-03-06 23:34:28,033][81400] Updated weights for policy 0, policy_version 18820 (0.0006) [2023-03-06 23:34:28,813][81400] Updated weights for policy 0, policy_version 18830 (0.0006) [2023-03-06 23:34:29,575][81400] Updated weights for policy 0, policy_version 18840 (0.0006) [2023-03-06 23:34:30,359][81400] Updated weights for policy 0, policy_version 18850 (0.0007) [2023-03-06 23:34:31,141][81400] Updated weights for policy 0, policy_version 18860 (0.0006) [2023-03-06 23:34:31,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13175.5, 300 sec: 13148.9). Total num frames: 19313664. Throughput: 0: 13162.4. Samples: 19312088. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:34:31,237][81074] Avg episode reward: [(0, '3154.163')] [2023-03-06 23:34:31,930][81400] Updated weights for policy 0, policy_version 18870 (0.0006) [2023-03-06 23:34:32,707][81400] Updated weights for policy 0, policy_version 18880 (0.0006) [2023-03-06 23:34:33,483][81400] Updated weights for policy 0, policy_version 18890 (0.0006) [2023-03-06 23:34:34,265][81400] Updated weights for policy 0, policy_version 18900 (0.0006) [2023-03-06 23:34:35,038][81400] Updated weights for policy 0, policy_version 18910 (0.0006) [2023-03-06 23:34:35,835][81400] Updated weights for policy 0, policy_version 18920 (0.0007) [2023-03-06 23:34:36,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 19379200. Throughput: 0: 13165.7. Samples: 19351592. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:34:36,237][81074] Avg episode reward: [(0, '3239.729')] [2023-03-06 23:34:36,612][81400] Updated weights for policy 0, policy_version 18930 (0.0007) [2023-03-06 23:34:37,382][81400] Updated weights for policy 0, policy_version 18940 (0.0007) [2023-03-06 23:34:38,150][81400] Updated weights for policy 0, policy_version 18950 (0.0006) [2023-03-06 23:34:38,929][81400] Updated weights for policy 0, policy_version 18960 (0.0007) [2023-03-06 23:34:39,710][81400] Updated weights for policy 0, policy_version 18970 (0.0006) [2023-03-06 23:34:40,491][81400] Updated weights for policy 0, policy_version 18980 (0.0006) [2023-03-06 23:34:41,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 19444736. Throughput: 0: 13148.8. Samples: 19430363. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:34:41,237][81074] Avg episode reward: [(0, '3353.172')] [2023-03-06 23:34:41,270][81400] Updated weights for policy 0, policy_version 18990 (0.0006) [2023-03-06 23:34:42,055][81400] Updated weights for policy 0, policy_version 19000 (0.0006) [2023-03-06 23:34:42,833][81400] Updated weights for policy 0, policy_version 19010 (0.0006) [2023-03-06 23:34:43,602][81400] Updated weights for policy 0, policy_version 19020 (0.0006) [2023-03-06 23:34:44,384][81400] Updated weights for policy 0, policy_version 19030 (0.0006) [2023-03-06 23:34:45,185][81400] Updated weights for policy 0, policy_version 19040 (0.0006) [2023-03-06 23:34:45,954][81400] Updated weights for policy 0, policy_version 19050 (0.0006) [2023-03-06 23:34:46,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 19510272. Throughput: 0: 13149.5. Samples: 19509278. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:34:46,237][81074] Avg episode reward: [(0, '3401.412')] [2023-03-06 23:34:46,742][81400] Updated weights for policy 0, policy_version 19060 (0.0007) [2023-03-06 23:34:47,513][81400] Updated weights for policy 0, policy_version 19070 (0.0006) [2023-03-06 23:34:48,286][81400] Updated weights for policy 0, policy_version 19080 (0.0007) [2023-03-06 23:34:49,051][81400] Updated weights for policy 0, policy_version 19090 (0.0006) [2023-03-06 23:34:49,848][81400] Updated weights for policy 0, policy_version 19100 (0.0006) [2023-03-06 23:34:50,625][81400] Updated weights for policy 0, policy_version 19110 (0.0006) [2023-03-06 23:34:51,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 19575808. Throughput: 0: 13155.3. Samples: 19548753. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:34:51,237][81074] Avg episode reward: [(0, '3184.403')] [2023-03-06 23:34:51,425][81400] Updated weights for policy 0, policy_version 19120 (0.0006) [2023-03-06 23:34:52,179][81400] Updated weights for policy 0, policy_version 19130 (0.0007) [2023-03-06 23:34:52,962][81400] Updated weights for policy 0, policy_version 19140 (0.0006) [2023-03-06 23:34:53,732][81400] Updated weights for policy 0, policy_version 19150 (0.0006) [2023-03-06 23:34:54,520][81400] Updated weights for policy 0, policy_version 19160 (0.0008) [2023-03-06 23:34:55,285][81400] Updated weights for policy 0, policy_version 19170 (0.0007) [2023-03-06 23:34:56,074][81400] Updated weights for policy 0, policy_version 19180 (0.0006) [2023-03-06 23:34:56,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 19642368. Throughput: 0: 13162.4. Samples: 19627665. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:34:56,237][81074] Avg episode reward: [(0, '2889.504')] [2023-03-06 23:34:56,843][81400] Updated weights for policy 0, policy_version 19190 (0.0007) [2023-03-06 23:34:57,613][81400] Updated weights for policy 0, policy_version 19200 (0.0006) [2023-03-06 23:34:58,389][81400] Updated weights for policy 0, policy_version 19210 (0.0007) [2023-03-06 23:34:59,173][81400] Updated weights for policy 0, policy_version 19220 (0.0007) [2023-03-06 23:34:59,947][81400] Updated weights for policy 0, policy_version 19230 (0.0006) [2023-03-06 23:35:00,730][81400] Updated weights for policy 0, policy_version 19240 (0.0006) [2023-03-06 23:35:01,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 19707904. Throughput: 0: 13169.2. Samples: 19706746. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:35:01,237][81074] Avg episode reward: [(0, '3148.445')] [2023-03-06 23:35:01,506][81400] Updated weights for policy 0, policy_version 19250 (0.0006) [2023-03-06 23:35:02,262][81400] Updated weights for policy 0, policy_version 19260 (0.0006) [2023-03-06 23:35:03,058][81400] Updated weights for policy 0, policy_version 19270 (0.0006) [2023-03-06 23:35:03,848][81400] Updated weights for policy 0, policy_version 19280 (0.0006) [2023-03-06 23:35:04,635][81400] Updated weights for policy 0, policy_version 19290 (0.0006) [2023-03-06 23:35:05,424][81400] Updated weights for policy 0, policy_version 19300 (0.0006) [2023-03-06 23:35:06,217][81400] Updated weights for policy 0, policy_version 19310 (0.0007) [2023-03-06 23:35:06,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 19773440. Throughput: 0: 13164.0. Samples: 19746108. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:35:06,237][81074] Avg episode reward: [(0, '3192.634')] [2023-03-06 23:35:07,003][81400] Updated weights for policy 0, policy_version 19320 (0.0006) [2023-03-06 23:35:07,794][81400] Updated weights for policy 0, policy_version 19330 (0.0006) [2023-03-06 23:35:08,568][81400] Updated weights for policy 0, policy_version 19340 (0.0006) [2023-03-06 23:35:09,373][81400] Updated weights for policy 0, policy_version 19350 (0.0006) [2023-03-06 23:35:10,147][81400] Updated weights for policy 0, policy_version 19360 (0.0006) [2023-03-06 23:35:10,927][81400] Updated weights for policy 0, policy_version 19370 (0.0006) [2023-03-06 23:35:11,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 19838976. Throughput: 0: 13139.6. Samples: 19824238. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:35:11,237][81074] Avg episode reward: [(0, '3221.621')] [2023-03-06 23:35:11,706][81400] Updated weights for policy 0, policy_version 19380 (0.0007) [2023-03-06 23:35:12,490][81400] Updated weights for policy 0, policy_version 19390 (0.0007) [2023-03-06 23:35:13,256][81400] Updated weights for policy 0, policy_version 19400 (0.0006) [2023-03-06 23:35:14,027][81400] Updated weights for policy 0, policy_version 19410 (0.0006) [2023-03-06 23:35:14,793][81400] Updated weights for policy 0, policy_version 19420 (0.0006) [2023-03-06 23:35:15,568][81400] Updated weights for policy 0, policy_version 19430 (0.0007) [2023-03-06 23:35:16,236][81074] Fps is (10 sec: 13107.0, 60 sec: 13158.4, 300 sec: 13148.8). Total num frames: 19904512. Throughput: 0: 13137.5. Samples: 19903276. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:35:16,237][81074] Avg episode reward: [(0, '3185.328')] [2023-03-06 23:35:16,347][81400] Updated weights for policy 0, policy_version 19440 (0.0006) [2023-03-06 23:35:17,133][81400] Updated weights for policy 0, policy_version 19450 (0.0006) [2023-03-06 23:35:17,892][81400] Updated weights for policy 0, policy_version 19460 (0.0006) [2023-03-06 23:35:18,665][81400] Updated weights for policy 0, policy_version 19470 (0.0006) [2023-03-06 23:35:19,477][81400] Updated weights for policy 0, policy_version 19480 (0.0006) [2023-03-06 23:35:20,241][81400] Updated weights for policy 0, policy_version 19490 (0.0006) [2023-03-06 23:35:21,019][81400] Updated weights for policy 0, policy_version 19500 (0.0007) [2023-03-06 23:35:21,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 19970048. Throughput: 0: 13136.7. Samples: 19942744. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:35:21,237][81074] Avg episode reward: [(0, '3294.217')] [2023-03-06 23:35:21,794][81400] Updated weights for policy 0, policy_version 19510 (0.0006) [2023-03-06 23:35:22,586][81400] Updated weights for policy 0, policy_version 19520 (0.0005) [2023-03-06 23:35:23,365][81400] Updated weights for policy 0, policy_version 19530 (0.0006) [2023-03-06 23:35:24,137][81400] Updated weights for policy 0, policy_version 19540 (0.0007) [2023-03-06 23:35:24,912][81400] Updated weights for policy 0, policy_version 19550 (0.0007) [2023-03-06 23:35:25,701][81400] Updated weights for policy 0, policy_version 19560 (0.0006) [2023-03-06 23:35:26,236][81074] Fps is (10 sec: 13209.9, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 20036608. Throughput: 0: 13140.9. Samples: 20021700. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:35:26,237][81074] Avg episode reward: [(0, '2938.102')] [2023-03-06 23:35:26,476][81400] Updated weights for policy 0, policy_version 19570 (0.0006) [2023-03-06 23:35:27,259][81400] Updated weights for policy 0, policy_version 19580 (0.0006) [2023-03-06 23:35:28,051][81400] Updated weights for policy 0, policy_version 19590 (0.0006) [2023-03-06 23:35:28,822][81400] Updated weights for policy 0, policy_version 19600 (0.0007) [2023-03-06 23:35:29,601][81400] Updated weights for policy 0, policy_version 19610 (0.0006) [2023-03-06 23:35:30,367][81400] Updated weights for policy 0, policy_version 19620 (0.0006) [2023-03-06 23:35:31,142][81400] Updated weights for policy 0, policy_version 19630 (0.0006) [2023-03-06 23:35:31,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 20102144. Throughput: 0: 13140.9. Samples: 20100618. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:35:31,237][81074] Avg episode reward: [(0, '2977.741')] [2023-03-06 23:35:31,929][81400] Updated weights for policy 0, policy_version 19640 (0.0007) [2023-03-06 23:35:32,697][81400] Updated weights for policy 0, policy_version 19650 (0.0006) [2023-03-06 23:35:33,470][81400] Updated weights for policy 0, policy_version 19660 (0.0006) [2023-03-06 23:35:34,263][81400] Updated weights for policy 0, policy_version 19670 (0.0006) [2023-03-06 23:35:35,025][81400] Updated weights for policy 0, policy_version 19680 (0.0006) [2023-03-06 23:35:35,794][81400] Updated weights for policy 0, policy_version 19690 (0.0006) [2023-03-06 23:35:36,236][81074] Fps is (10 sec: 13106.9, 60 sec: 13141.3, 300 sec: 13148.8). Total num frames: 20167680. Throughput: 0: 13141.7. Samples: 20140132. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:35:36,237][81074] Avg episode reward: [(0, '2814.054')] [2023-03-06 23:35:36,576][81400] Updated weights for policy 0, policy_version 19700 (0.0006) [2023-03-06 23:35:37,350][81400] Updated weights for policy 0, policy_version 19710 (0.0006) [2023-03-06 23:35:38,134][81400] Updated weights for policy 0, policy_version 19720 (0.0007) [2023-03-06 23:35:38,893][81400] Updated weights for policy 0, policy_version 19730 (0.0006) [2023-03-06 23:35:39,681][81400] Updated weights for policy 0, policy_version 19740 (0.0006) [2023-03-06 23:35:40,470][81400] Updated weights for policy 0, policy_version 19750 (0.0006) [2023-03-06 23:35:41,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 20233216. Throughput: 0: 13146.1. Samples: 20219237. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:35:41,237][81074] Avg episode reward: [(0, '2955.353')] [2023-03-06 23:35:41,246][81400] Updated weights for policy 0, policy_version 19760 (0.0006) [2023-03-06 23:35:42,016][81400] Updated weights for policy 0, policy_version 19770 (0.0005) [2023-03-06 23:35:42,798][81400] Updated weights for policy 0, policy_version 19780 (0.0007) [2023-03-06 23:35:43,572][81400] Updated weights for policy 0, policy_version 19790 (0.0006) [2023-03-06 23:35:44,369][81400] Updated weights for policy 0, policy_version 19800 (0.0006) [2023-03-06 23:35:45,146][81400] Updated weights for policy 0, policy_version 19810 (0.0006) [2023-03-06 23:35:45,928][81400] Updated weights for policy 0, policy_version 19820 (0.0006) [2023-03-06 23:35:46,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 20299776. Throughput: 0: 13141.8. Samples: 20298125. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:35:46,237][81074] Avg episode reward: [(0, '2898.811')] [2023-03-06 23:35:46,695][81400] Updated weights for policy 0, policy_version 19830 (0.0006) [2023-03-06 23:35:47,480][81400] Updated weights for policy 0, policy_version 19840 (0.0006) [2023-03-06 23:35:48,246][81400] Updated weights for policy 0, policy_version 19850 (0.0006) [2023-03-06 23:35:49,032][81400] Updated weights for policy 0, policy_version 19860 (0.0006) [2023-03-06 23:35:49,826][81400] Updated weights for policy 0, policy_version 19870 (0.0006) [2023-03-06 23:35:50,605][81400] Updated weights for policy 0, policy_version 19880 (0.0006) [2023-03-06 23:35:51,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 20365312. Throughput: 0: 13144.2. Samples: 20337599. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:35:51,237][81074] Avg episode reward: [(0, '2812.520')] [2023-03-06 23:35:51,371][81400] Updated weights for policy 0, policy_version 19890 (0.0006) [2023-03-06 23:35:52,135][81400] Updated weights for policy 0, policy_version 19900 (0.0005) [2023-03-06 23:35:52,936][81400] Updated weights for policy 0, policy_version 19910 (0.0006) [2023-03-06 23:35:53,718][81400] Updated weights for policy 0, policy_version 19920 (0.0006) [2023-03-06 23:35:54,491][81400] Updated weights for policy 0, policy_version 19930 (0.0006) [2023-03-06 23:35:55,290][81400] Updated weights for policy 0, policy_version 19940 (0.0007) [2023-03-06 23:35:56,075][81400] Updated weights for policy 0, policy_version 19950 (0.0006) [2023-03-06 23:35:56,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13152.3). Total num frames: 20430848. Throughput: 0: 13153.0. Samples: 20416124. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:35:56,237][81074] Avg episode reward: [(0, '2914.920')] [2023-03-06 23:35:56,240][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000019952_20430848.pth... [2023-03-06 23:35:56,270][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000016870_17274880.pth [2023-03-06 23:35:56,842][81400] Updated weights for policy 0, policy_version 19960 (0.0006) [2023-03-06 23:35:57,621][81400] Updated weights for policy 0, policy_version 19970 (0.0006) [2023-03-06 23:35:58,391][81400] Updated weights for policy 0, policy_version 19980 (0.0006) [2023-03-06 23:35:59,163][81400] Updated weights for policy 0, policy_version 19990 (0.0006) [2023-03-06 23:35:59,934][81400] Updated weights for policy 0, policy_version 20000 (0.0006) [2023-03-06 23:36:00,695][81400] Updated weights for policy 0, policy_version 20010 (0.0006) [2023-03-06 23:36:01,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 20496384. Throughput: 0: 13160.7. Samples: 20495507. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:36:01,237][81074] Avg episode reward: [(0, '2788.921')] [2023-03-06 23:36:01,465][81400] Updated weights for policy 0, policy_version 20020 (0.0007) [2023-03-06 23:36:02,262][81400] Updated weights for policy 0, policy_version 20030 (0.0006) [2023-03-06 23:36:03,055][81400] Updated weights for policy 0, policy_version 20040 (0.0006) [2023-03-06 23:36:03,845][81400] Updated weights for policy 0, policy_version 20050 (0.0006) [2023-03-06 23:36:04,597][81400] Updated weights for policy 0, policy_version 20060 (0.0006) [2023-03-06 23:36:05,382][81400] Updated weights for policy 0, policy_version 20070 (0.0007) [2023-03-06 23:36:06,165][81400] Updated weights for policy 0, policy_version 20080 (0.0006) [2023-03-06 23:36:06,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13158.4, 300 sec: 13155.8). Total num frames: 20562944. Throughput: 0: 13159.2. Samples: 20534907. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:36:06,237][81074] Avg episode reward: [(0, '2565.095')] [2023-03-06 23:36:06,938][81400] Updated weights for policy 0, policy_version 20090 (0.0006) [2023-03-06 23:36:07,718][81400] Updated weights for policy 0, policy_version 20100 (0.0007) [2023-03-06 23:36:08,508][81400] Updated weights for policy 0, policy_version 20110 (0.0005) [2023-03-06 23:36:09,278][81400] Updated weights for policy 0, policy_version 20120 (0.0006) [2023-03-06 23:36:10,050][81400] Updated weights for policy 0, policy_version 20130 (0.0006) [2023-03-06 23:36:10,830][81400] Updated weights for policy 0, policy_version 20140 (0.0006) [2023-03-06 23:36:11,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13155.8). Total num frames: 20628480. Throughput: 0: 13161.0. Samples: 20613946. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:36:11,237][81074] Avg episode reward: [(0, '2807.017')] [2023-03-06 23:36:11,596][81400] Updated weights for policy 0, policy_version 20150 (0.0007) [2023-03-06 23:36:12,370][81400] Updated weights for policy 0, policy_version 20160 (0.0006) [2023-03-06 23:36:13,153][81400] Updated weights for policy 0, policy_version 20170 (0.0006) [2023-03-06 23:36:13,935][81400] Updated weights for policy 0, policy_version 20180 (0.0007) [2023-03-06 23:36:14,725][81400] Updated weights for policy 0, policy_version 20190 (0.0006) [2023-03-06 23:36:15,505][81400] Updated weights for policy 0, policy_version 20200 (0.0006) [2023-03-06 23:36:16,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 20694016. Throughput: 0: 13158.8. Samples: 20692765. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:36:16,237][81074] Avg episode reward: [(0, '2595.845')] [2023-03-06 23:36:16,282][81400] Updated weights for policy 0, policy_version 20210 (0.0006) [2023-03-06 23:36:17,063][81400] Updated weights for policy 0, policy_version 20220 (0.0006) [2023-03-06 23:36:17,823][81400] Updated weights for policy 0, policy_version 20230 (0.0006) [2023-03-06 23:36:18,613][81400] Updated weights for policy 0, policy_version 20240 (0.0006) [2023-03-06 23:36:19,394][81400] Updated weights for policy 0, policy_version 20250 (0.0006) [2023-03-06 23:36:20,181][81400] Updated weights for policy 0, policy_version 20260 (0.0006) [2023-03-06 23:36:20,951][81400] Updated weights for policy 0, policy_version 20270 (0.0007) [2023-03-06 23:36:21,236][81074] Fps is (10 sec: 13107.4, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 20759552. Throughput: 0: 13156.0. Samples: 20732150. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:36:21,237][81074] Avg episode reward: [(0, '2701.648')] [2023-03-06 23:36:21,731][81400] Updated weights for policy 0, policy_version 20280 (0.0007) [2023-03-06 23:36:22,497][81400] Updated weights for policy 0, policy_version 20290 (0.0006) [2023-03-06 23:36:23,298][81400] Updated weights for policy 0, policy_version 20300 (0.0006) [2023-03-06 23:36:24,086][81400] Updated weights for policy 0, policy_version 20310 (0.0006) [2023-03-06 23:36:24,854][81400] Updated weights for policy 0, policy_version 20320 (0.0006) [2023-03-06 23:36:25,618][81400] Updated weights for policy 0, policy_version 20330 (0.0006) [2023-03-06 23:36:26,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13152.3). Total num frames: 20825088. Throughput: 0: 13152.8. Samples: 20811114. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:36:26,237][81074] Avg episode reward: [(0, '2781.987')] [2023-03-06 23:36:26,421][81400] Updated weights for policy 0, policy_version 20340 (0.0007) [2023-03-06 23:36:27,185][81400] Updated weights for policy 0, policy_version 20350 (0.0006) [2023-03-06 23:36:27,985][81400] Updated weights for policy 0, policy_version 20360 (0.0006) [2023-03-06 23:36:28,768][81400] Updated weights for policy 0, policy_version 20370 (0.0006) [2023-03-06 23:36:29,536][81400] Updated weights for policy 0, policy_version 20380 (0.0007) [2023-03-06 23:36:30,300][81400] Updated weights for policy 0, policy_version 20390 (0.0007) [2023-03-06 23:36:31,081][81400] Updated weights for policy 0, policy_version 20400 (0.0006) [2023-03-06 23:36:31,236][81074] Fps is (10 sec: 13107.0, 60 sec: 13141.3, 300 sec: 13152.3). Total num frames: 20890624. Throughput: 0: 13149.9. Samples: 20889872. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:36:31,237][81074] Avg episode reward: [(0, '2700.431')] [2023-03-06 23:36:31,860][81400] Updated weights for policy 0, policy_version 20410 (0.0007) [2023-03-06 23:36:32,631][81400] Updated weights for policy 0, policy_version 20420 (0.0006) [2023-03-06 23:36:33,406][81400] Updated weights for policy 0, policy_version 20430 (0.0006) [2023-03-06 23:36:34,185][81400] Updated weights for policy 0, policy_version 20440 (0.0006) [2023-03-06 23:36:34,962][81400] Updated weights for policy 0, policy_version 20450 (0.0007) [2023-03-06 23:36:35,724][81400] Updated weights for policy 0, policy_version 20460 (0.0007) [2023-03-06 23:36:36,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 20957184. Throughput: 0: 13151.2. Samples: 20929401. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:36:36,247][81074] Avg episode reward: [(0, '2674.554')] [2023-03-06 23:36:36,514][81400] Updated weights for policy 0, policy_version 20470 (0.0006) [2023-03-06 23:36:37,291][81400] Updated weights for policy 0, policy_version 20480 (0.0006) [2023-03-06 23:36:38,061][81400] Updated weights for policy 0, policy_version 20490 (0.0006) [2023-03-06 23:36:38,849][81400] Updated weights for policy 0, policy_version 20500 (0.0005) [2023-03-06 23:36:39,632][81400] Updated weights for policy 0, policy_version 20510 (0.0006) [2023-03-06 23:36:40,389][81400] Updated weights for policy 0, policy_version 20520 (0.0006) [2023-03-06 23:36:41,177][81400] Updated weights for policy 0, policy_version 20530 (0.0006) [2023-03-06 23:36:41,236][81074] Fps is (10 sec: 13312.2, 60 sec: 13175.5, 300 sec: 13155.8). Total num frames: 21023744. Throughput: 0: 13167.6. Samples: 21008666. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:36:41,237][81074] Avg episode reward: [(0, '2797.641')] [2023-03-06 23:36:41,928][81400] Updated weights for policy 0, policy_version 20540 (0.0006) [2023-03-06 23:36:42,721][81400] Updated weights for policy 0, policy_version 20550 (0.0006) [2023-03-06 23:36:43,506][81400] Updated weights for policy 0, policy_version 20560 (0.0006) [2023-03-06 23:36:44,281][81400] Updated weights for policy 0, policy_version 20570 (0.0007) [2023-03-06 23:36:45,040][81400] Updated weights for policy 0, policy_version 20580 (0.0006) [2023-03-06 23:36:45,837][81400] Updated weights for policy 0, policy_version 20590 (0.0006) [2023-03-06 23:36:46,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 21089280. Throughput: 0: 13158.6. Samples: 21087645. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:36:46,237][81074] Avg episode reward: [(0, '2714.977')] [2023-03-06 23:36:46,619][81400] Updated weights for policy 0, policy_version 20600 (0.0006) [2023-03-06 23:36:47,377][81400] Updated weights for policy 0, policy_version 20610 (0.0006) [2023-03-06 23:36:48,161][81400] Updated weights for policy 0, policy_version 20620 (0.0006) [2023-03-06 23:36:48,937][81400] Updated weights for policy 0, policy_version 20630 (0.0008) [2023-03-06 23:36:49,714][81400] Updated weights for policy 0, policy_version 20640 (0.0006) [2023-03-06 23:36:50,487][81400] Updated weights for policy 0, policy_version 20650 (0.0006) [2023-03-06 23:36:51,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 21154816. Throughput: 0: 13163.9. Samples: 21127281. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:36:51,237][81074] Avg episode reward: [(0, '2861.894')] [2023-03-06 23:36:51,280][81400] Updated weights for policy 0, policy_version 20660 (0.0006) [2023-03-06 23:36:52,068][81400] Updated weights for policy 0, policy_version 20670 (0.0006) [2023-03-06 23:36:52,843][81400] Updated weights for policy 0, policy_version 20680 (0.0006) [2023-03-06 23:36:53,625][81400] Updated weights for policy 0, policy_version 20690 (0.0006) [2023-03-06 23:36:54,384][81400] Updated weights for policy 0, policy_version 20700 (0.0006) [2023-03-06 23:36:55,154][81400] Updated weights for policy 0, policy_version 20710 (0.0006) [2023-03-06 23:36:55,938][81400] Updated weights for policy 0, policy_version 20720 (0.0006) [2023-03-06 23:36:56,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 21220352. Throughput: 0: 13164.3. Samples: 21206340. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:36:56,237][81074] Avg episode reward: [(0, '2762.100')] [2023-03-06 23:36:56,703][81400] Updated weights for policy 0, policy_version 20730 (0.0006) [2023-03-06 23:36:57,475][81400] Updated weights for policy 0, policy_version 20740 (0.0007) [2023-03-06 23:36:58,253][81400] Updated weights for policy 0, policy_version 20750 (0.0007) [2023-03-06 23:36:59,029][81400] Updated weights for policy 0, policy_version 20760 (0.0006) [2023-03-06 23:36:59,802][81400] Updated weights for policy 0, policy_version 20770 (0.0006) [2023-03-06 23:37:00,577][81400] Updated weights for policy 0, policy_version 20780 (0.0007) [2023-03-06 23:37:01,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13155.8). Total num frames: 21286912. Throughput: 0: 13177.8. Samples: 21285768. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:37:01,237][81074] Avg episode reward: [(0, '2853.771')] [2023-03-06 23:37:01,358][81400] Updated weights for policy 0, policy_version 20790 (0.0006) [2023-03-06 23:37:02,127][81400] Updated weights for policy 0, policy_version 20800 (0.0007) [2023-03-06 23:37:02,902][81400] Updated weights for policy 0, policy_version 20810 (0.0007) [2023-03-06 23:37:03,674][81400] Updated weights for policy 0, policy_version 20820 (0.0005) [2023-03-06 23:37:04,466][81400] Updated weights for policy 0, policy_version 20830 (0.0006) [2023-03-06 23:37:05,230][81400] Updated weights for policy 0, policy_version 20840 (0.0006) [2023-03-06 23:37:06,010][81400] Updated weights for policy 0, policy_version 20850 (0.0005) [2023-03-06 23:37:06,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13175.5, 300 sec: 13159.3). Total num frames: 21353472. Throughput: 0: 13180.9. Samples: 21325292. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:37:06,237][81074] Avg episode reward: [(0, '3017.279')] [2023-03-06 23:37:06,786][81400] Updated weights for policy 0, policy_version 20860 (0.0006) [2023-03-06 23:37:07,554][81400] Updated weights for policy 0, policy_version 20870 (0.0005) [2023-03-06 23:37:08,345][81400] Updated weights for policy 0, policy_version 20880 (0.0006) [2023-03-06 23:37:09,119][81400] Updated weights for policy 0, policy_version 20890 (0.0006) [2023-03-06 23:37:09,889][81400] Updated weights for policy 0, policy_version 20900 (0.0006) [2023-03-06 23:37:10,676][81400] Updated weights for policy 0, policy_version 20910 (0.0008) [2023-03-06 23:37:11,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13159.3). Total num frames: 21419008. Throughput: 0: 13186.4. Samples: 21404502. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:37:11,237][81074] Avg episode reward: [(0, '2787.448')] [2023-03-06 23:37:11,469][81400] Updated weights for policy 0, policy_version 20920 (0.0005) [2023-03-06 23:37:12,224][81400] Updated weights for policy 0, policy_version 20930 (0.0007) [2023-03-06 23:37:13,006][81400] Updated weights for policy 0, policy_version 20940 (0.0006) [2023-03-06 23:37:13,767][81400] Updated weights for policy 0, policy_version 20950 (0.0006) [2023-03-06 23:37:14,562][81400] Updated weights for policy 0, policy_version 20960 (0.0006) [2023-03-06 23:37:15,318][81400] Updated weights for policy 0, policy_version 20970 (0.0007) [2023-03-06 23:37:16,099][81400] Updated weights for policy 0, policy_version 20980 (0.0006) [2023-03-06 23:37:16,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13175.4, 300 sec: 13159.3). Total num frames: 21484544. Throughput: 0: 13191.2. Samples: 21483475. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:37:16,237][81074] Avg episode reward: [(0, '3024.203')] [2023-03-06 23:37:16,872][81400] Updated weights for policy 0, policy_version 20990 (0.0007) [2023-03-06 23:37:17,660][81400] Updated weights for policy 0, policy_version 21000 (0.0007) [2023-03-06 23:37:18,426][81400] Updated weights for policy 0, policy_version 21010 (0.0005) [2023-03-06 23:37:19,212][81400] Updated weights for policy 0, policy_version 21020 (0.0006) [2023-03-06 23:37:19,964][81400] Updated weights for policy 0, policy_version 21030 (0.0006) [2023-03-06 23:37:20,742][81400] Updated weights for policy 0, policy_version 21040 (0.0006) [2023-03-06 23:37:21,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13192.5, 300 sec: 13162.7). Total num frames: 21551104. Throughput: 0: 13191.7. Samples: 21523030. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:37:21,237][81074] Avg episode reward: [(0, '2901.473')] [2023-03-06 23:37:21,528][81400] Updated weights for policy 0, policy_version 21050 (0.0006) [2023-03-06 23:37:22,287][81400] Updated weights for policy 0, policy_version 21060 (0.0006) [2023-03-06 23:37:23,061][81400] Updated weights for policy 0, policy_version 21070 (0.0006) [2023-03-06 23:37:23,852][81400] Updated weights for policy 0, policy_version 21080 (0.0007) [2023-03-06 23:37:24,621][81400] Updated weights for policy 0, policy_version 21090 (0.0006) [2023-03-06 23:37:25,390][81400] Updated weights for policy 0, policy_version 21100 (0.0006) [2023-03-06 23:37:26,190][81400] Updated weights for policy 0, policy_version 21110 (0.0006) [2023-03-06 23:37:26,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13159.3). Total num frames: 21616640. Throughput: 0: 13199.3. Samples: 21602637. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:37:26,237][81074] Avg episode reward: [(0, '2885.561')] [2023-03-06 23:37:26,976][81400] Updated weights for policy 0, policy_version 21120 (0.0006) [2023-03-06 23:37:27,753][81400] Updated weights for policy 0, policy_version 21130 (0.0007) [2023-03-06 23:37:28,537][81400] Updated weights for policy 0, policy_version 21140 (0.0006) [2023-03-06 23:37:29,325][81400] Updated weights for policy 0, policy_version 21150 (0.0006) [2023-03-06 23:37:30,104][81400] Updated weights for policy 0, policy_version 21160 (0.0006) [2023-03-06 23:37:30,877][81400] Updated weights for policy 0, policy_version 21170 (0.0006) [2023-03-06 23:37:31,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13192.5, 300 sec: 13159.3). Total num frames: 21682176. Throughput: 0: 13189.0. Samples: 21681150. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:37:31,237][81074] Avg episode reward: [(0, '2842.024')] [2023-03-06 23:37:31,643][81400] Updated weights for policy 0, policy_version 21180 (0.0006) [2023-03-06 23:37:32,441][81400] Updated weights for policy 0, policy_version 21190 (0.0006) [2023-03-06 23:37:33,206][81400] Updated weights for policy 0, policy_version 21200 (0.0006) [2023-03-06 23:37:33,991][81400] Updated weights for policy 0, policy_version 21210 (0.0006) [2023-03-06 23:37:34,766][81400] Updated weights for policy 0, policy_version 21220 (0.0006) [2023-03-06 23:37:35,533][81400] Updated weights for policy 0, policy_version 21230 (0.0006) [2023-03-06 23:37:36,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13175.5, 300 sec: 13159.3). Total num frames: 21747712. Throughput: 0: 13184.1. Samples: 21720567. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:37:36,237][81074] Avg episode reward: [(0, '2711.011')] [2023-03-06 23:37:36,313][81400] Updated weights for policy 0, policy_version 21240 (0.0006) [2023-03-06 23:37:37,084][81400] Updated weights for policy 0, policy_version 21250 (0.0005) [2023-03-06 23:37:37,856][81400] Updated weights for policy 0, policy_version 21260 (0.0007) [2023-03-06 23:37:38,644][81400] Updated weights for policy 0, policy_version 21270 (0.0005) [2023-03-06 23:37:39,426][81400] Updated weights for policy 0, policy_version 21280 (0.0006) [2023-03-06 23:37:40,182][81400] Updated weights for policy 0, policy_version 21290 (0.0007) [2023-03-06 23:37:40,863][81349] KL-divergence is very high: 14109.5977 [2023-03-06 23:37:40,951][81400] Updated weights for policy 0, policy_version 21300 (0.0006) [2023-03-06 23:37:41,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13175.4, 300 sec: 13162.7). Total num frames: 21814272. Throughput: 0: 13193.8. Samples: 21800064. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:37:41,237][81074] Avg episode reward: [(0, '2665.731')] [2023-03-06 23:37:41,730][81400] Updated weights for policy 0, policy_version 21310 (0.0005) [2023-03-06 23:37:42,498][81400] Updated weights for policy 0, policy_version 21320 (0.0006) [2023-03-06 23:37:43,268][81400] Updated weights for policy 0, policy_version 21330 (0.0006) [2023-03-06 23:37:44,044][81400] Updated weights for policy 0, policy_version 21340 (0.0006) [2023-03-06 23:37:44,822][81400] Updated weights for policy 0, policy_version 21350 (0.0008) [2023-03-06 23:37:45,605][81400] Updated weights for policy 0, policy_version 21360 (0.0006) [2023-03-06 23:37:46,236][81074] Fps is (10 sec: 13311.8, 60 sec: 13192.5, 300 sec: 13166.2). Total num frames: 21880832. Throughput: 0: 13188.1. Samples: 21879236. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:37:46,237][81074] Avg episode reward: [(0, '2439.833')] [2023-03-06 23:37:46,374][81400] Updated weights for policy 0, policy_version 21370 (0.0006) [2023-03-06 23:37:47,153][81400] Updated weights for policy 0, policy_version 21380 (0.0006) [2023-03-06 23:37:47,902][81400] Updated weights for policy 0, policy_version 21390 (0.0006) [2023-03-06 23:37:48,672][81400] Updated weights for policy 0, policy_version 21400 (0.0007) [2023-03-06 23:37:49,469][81400] Updated weights for policy 0, policy_version 21410 (0.0006) [2023-03-06 23:37:50,246][81400] Updated weights for policy 0, policy_version 21420 (0.0007) [2023-03-06 23:37:51,013][81400] Updated weights for policy 0, policy_version 21430 (0.0006) [2023-03-06 23:37:51,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13192.5, 300 sec: 13166.2). Total num frames: 21946368. Throughput: 0: 13195.7. Samples: 21919096. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:37:51,237][81074] Avg episode reward: [(0, '2389.483')] [2023-03-06 23:37:51,790][81400] Updated weights for policy 0, policy_version 21440 (0.0005) [2023-03-06 23:37:52,561][81400] Updated weights for policy 0, policy_version 21450 (0.0006) [2023-03-06 23:37:53,346][81400] Updated weights for policy 0, policy_version 21460 (0.0006) [2023-03-06 23:37:54,120][81400] Updated weights for policy 0, policy_version 21470 (0.0006) [2023-03-06 23:37:54,902][81400] Updated weights for policy 0, policy_version 21480 (0.0005) [2023-03-06 23:37:55,668][81400] Updated weights for policy 0, policy_version 21490 (0.0006) [2023-03-06 23:37:56,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13209.6, 300 sec: 13166.2). Total num frames: 22012928. Throughput: 0: 13195.3. Samples: 21998293. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:37:56,237][81074] Avg episode reward: [(0, '2398.533')] [2023-03-06 23:37:56,241][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000021497_22012928.pth... 
[2023-03-06 23:37:56,272][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000018411_18852864.pth [2023-03-06 23:37:56,451][81400] Updated weights for policy 0, policy_version 21500 (0.0006) [2023-03-06 23:37:57,208][81400] Updated weights for policy 0, policy_version 21510 (0.0005) [2023-03-06 23:37:57,972][81400] Updated weights for policy 0, policy_version 21520 (0.0007) [2023-03-06 23:37:58,761][81400] Updated weights for policy 0, policy_version 21530 (0.0006) [2023-03-06 23:37:59,547][81400] Updated weights for policy 0, policy_version 21540 (0.0006) [2023-03-06 23:38:00,308][81400] Updated weights for policy 0, policy_version 21550 (0.0006) [2023-03-06 23:38:01,086][81400] Updated weights for policy 0, policy_version 21560 (0.0006) [2023-03-06 23:38:01,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13192.5, 300 sec: 13166.2). Total num frames: 22078464. Throughput: 0: 13203.6. Samples: 22077636. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 23:38:01,237][81074] Avg episode reward: [(0, '2496.637')] [2023-03-06 23:38:01,854][81400] Updated weights for policy 0, policy_version 21570 (0.0005) [2023-03-06 23:38:02,638][81400] Updated weights for policy 0, policy_version 21580 (0.0007) [2023-03-06 23:38:03,417][81400] Updated weights for policy 0, policy_version 21590 (0.0007) [2023-03-06 23:38:04,192][81400] Updated weights for policy 0, policy_version 21600 (0.0006) [2023-03-06 23:38:04,966][81400] Updated weights for policy 0, policy_version 21610 (0.0006) [2023-03-06 23:38:05,717][81400] Updated weights for policy 0, policy_version 21620 (0.0005) [2023-03-06 23:38:06,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13169.7). Total num frames: 22145024. Throughput: 0: 13203.2. Samples: 22117174. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 23:38:06,237][81074] Avg episode reward: [(0, '2418.246')] [2023-03-06 23:38:06,519][81400] Updated weights for policy 0, policy_version 21630 (0.0006) [2023-03-06 23:38:07,301][81400] Updated weights for policy 0, policy_version 21640 (0.0006) [2023-03-06 23:38:08,083][81400] Updated weights for policy 0, policy_version 21650 (0.0006) [2023-03-06 23:38:08,880][81400] Updated weights for policy 0, policy_version 21660 (0.0007) [2023-03-06 23:38:09,644][81400] Updated weights for policy 0, policy_version 21670 (0.0006) [2023-03-06 23:38:10,405][81400] Updated weights for policy 0, policy_version 21680 (0.0006) [2023-03-06 23:38:11,177][81400] Updated weights for policy 0, policy_version 21690 (0.0006) [2023-03-06 23:38:11,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13192.5, 300 sec: 13166.2). Total num frames: 22210560. Throughput: 0: 13195.7. Samples: 22196444. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:38:11,237][81074] Avg episode reward: [(0, '2797.590')] [2023-03-06 23:38:11,966][81400] Updated weights for policy 0, policy_version 21700 (0.0007) [2023-03-06 23:38:12,741][81400] Updated weights for policy 0, policy_version 21710 (0.0006) [2023-03-06 23:38:13,502][81400] Updated weights for policy 0, policy_version 21720 (0.0007) [2023-03-06 23:38:14,281][81400] Updated weights for policy 0, policy_version 21730 (0.0006) [2023-03-06 23:38:15,054][81400] Updated weights for policy 0, policy_version 21740 (0.0006) [2023-03-06 23:38:15,833][81400] Updated weights for policy 0, policy_version 21750 (0.0006) [2023-03-06 23:38:16,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13192.5, 300 sec: 13169.7). Total num frames: 22276096. Throughput: 0: 13208.5. Samples: 22275534. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:38:16,237][81074] Avg episode reward: [(0, '2828.123')] [2023-03-06 23:38:16,602][81400] Updated weights for policy 0, policy_version 21760 (0.0006) [2023-03-06 23:38:17,371][81400] Updated weights for policy 0, policy_version 21770 (0.0006) [2023-03-06 23:38:18,161][81400] Updated weights for policy 0, policy_version 21780 (0.0006) [2023-03-06 23:38:18,928][81400] Updated weights for policy 0, policy_version 21790 (0.0006) [2023-03-06 23:38:19,711][81400] Updated weights for policy 0, policy_version 21800 (0.0005) [2023-03-06 23:38:20,486][81400] Updated weights for policy 0, policy_version 21810 (0.0007) [2023-03-06 23:38:21,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13169.7). Total num frames: 22342656. Throughput: 0: 13213.5. Samples: 22315172. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:38:21,247][81074] Avg episode reward: [(0, '3008.514')] [2023-03-06 23:38:21,258][81400] Updated weights for policy 0, policy_version 21820 (0.0006) [2023-03-06 23:38:22,045][81400] Updated weights for policy 0, policy_version 21830 (0.0006) [2023-03-06 23:38:22,823][81400] Updated weights for policy 0, policy_version 21840 (0.0005) [2023-03-06 23:38:23,590][81400] Updated weights for policy 0, policy_version 21850 (0.0006) [2023-03-06 23:38:24,359][81400] Updated weights for policy 0, policy_version 21860 (0.0007) [2023-03-06 23:38:25,130][81400] Updated weights for policy 0, policy_version 21870 (0.0006) [2023-03-06 23:38:25,919][81400] Updated weights for policy 0, policy_version 21880 (0.0006) [2023-03-06 23:38:26,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13209.6, 300 sec: 13173.1). Total num frames: 22409216. Throughput: 0: 13209.4. Samples: 22394489. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:38:26,237][81074] Avg episode reward: [(0, '3152.668')] [2023-03-06 23:38:26,687][81400] Updated weights for policy 0, policy_version 21890 (0.0006) [2023-03-06 23:38:27,482][81400] Updated weights for policy 0, policy_version 21900 (0.0007) [2023-03-06 23:38:28,268][81400] Updated weights for policy 0, policy_version 21910 (0.0006) [2023-03-06 23:38:29,053][81400] Updated weights for policy 0, policy_version 21920 (0.0007) [2023-03-06 23:38:29,794][81400] Updated weights for policy 0, policy_version 21930 (0.0006) [2023-03-06 23:38:30,573][81400] Updated weights for policy 0, policy_version 21940 (0.0005) [2023-03-06 23:38:31,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13209.6, 300 sec: 13169.7). Total num frames: 22474752. Throughput: 0: 13204.3. Samples: 22473427. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:38:31,237][81074] Avg episode reward: [(0, '3012.796')] [2023-03-06 23:38:31,351][81400] Updated weights for policy 0, policy_version 21950 (0.0006) [2023-03-06 23:38:32,139][81400] Updated weights for policy 0, policy_version 21960 (0.0006) [2023-03-06 23:38:32,906][81400] Updated weights for policy 0, policy_version 21970 (0.0005) [2023-03-06 23:38:33,689][81400] Updated weights for policy 0, policy_version 21980 (0.0006) [2023-03-06 23:38:34,461][81400] Updated weights for policy 0, policy_version 21990 (0.0006) [2023-03-06 23:38:35,228][81400] Updated weights for policy 0, policy_version 22000 (0.0006) [2023-03-06 23:38:36,014][81400] Updated weights for policy 0, policy_version 22010 (0.0006) [2023-03-06 23:38:36,236][81074] Fps is (10 sec: 13209.8, 60 sec: 13226.7, 300 sec: 13173.2). Total num frames: 22541312. Throughput: 0: 13198.5. Samples: 22513026. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:38:36,237][81074] Avg episode reward: [(0, '2914.401')] [2023-03-06 23:38:36,812][81400] Updated weights for policy 0, policy_version 22020 (0.0006) [2023-03-06 23:38:37,577][81400] Updated weights for policy 0, policy_version 22030 (0.0006) [2023-03-06 23:38:38,351][81400] Updated weights for policy 0, policy_version 22040 (0.0006) [2023-03-06 23:38:39,105][81400] Updated weights for policy 0, policy_version 22050 (0.0006) [2023-03-06 23:38:39,914][81400] Updated weights for policy 0, policy_version 22060 (0.0006) [2023-03-06 23:38:40,679][81400] Updated weights for policy 0, policy_version 22070 (0.0006) [2023-03-06 23:38:41,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13209.6, 300 sec: 13173.1). Total num frames: 22606848. Throughput: 0: 13198.4. Samples: 22592220. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:38:41,237][81074] Avg episode reward: [(0, '2822.104')] [2023-03-06 23:38:41,444][81400] Updated weights for policy 0, policy_version 22080 (0.0006) [2023-03-06 23:38:42,238][81400] Updated weights for policy 0, policy_version 22090 (0.0005) [2023-03-06 23:38:42,993][81400] Updated weights for policy 0, policy_version 22100 (0.0006) [2023-03-06 23:38:43,748][81400] Updated weights for policy 0, policy_version 22110 (0.0006) [2023-03-06 23:38:44,546][81400] Updated weights for policy 0, policy_version 22120 (0.0006) [2023-03-06 23:38:45,343][81400] Updated weights for policy 0, policy_version 22130 (0.0006) [2023-03-06 23:38:46,125][81400] Updated weights for policy 0, policy_version 22140 (0.0007) [2023-03-06 23:38:46,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13192.6, 300 sec: 13169.7). Total num frames: 22672384. Throughput: 0: 13191.8. Samples: 22671269. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:38:46,237][81074] Avg episode reward: [(0, '2870.558')] [2023-03-06 23:38:46,888][81400] Updated weights for policy 0, policy_version 22150 (0.0006) [2023-03-06 23:38:47,673][81400] Updated weights for policy 0, policy_version 22160 (0.0005) [2023-03-06 23:38:48,443][81400] Updated weights for policy 0, policy_version 22170 (0.0006) [2023-03-06 23:38:49,225][81400] Updated weights for policy 0, policy_version 22180 (0.0006) [2023-03-06 23:38:50,004][81400] Updated weights for policy 0, policy_version 22190 (0.0006) [2023-03-06 23:38:50,782][81400] Updated weights for policy 0, policy_version 22200 (0.0006) [2023-03-06 23:38:51,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13192.5, 300 sec: 13169.7). Total num frames: 22737920. Throughput: 0: 13188.1. Samples: 22710638. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:38:51,237][81074] Avg episode reward: [(0, '2750.340')] [2023-03-06 23:38:51,549][81400] Updated weights for policy 0, policy_version 22210 (0.0006) [2023-03-06 23:38:52,328][81400] Updated weights for policy 0, policy_version 22220 (0.0006) [2023-03-06 23:38:53,111][81400] Updated weights for policy 0, policy_version 22230 (0.0006) [2023-03-06 23:38:53,897][81400] Updated weights for policy 0, policy_version 22240 (0.0007) [2023-03-06 23:38:54,678][81400] Updated weights for policy 0, policy_version 22250 (0.0006) [2023-03-06 23:38:55,469][81400] Updated weights for policy 0, policy_version 22260 (0.0006) [2023-03-06 23:38:56,234][81400] Updated weights for policy 0, policy_version 22270 (0.0007) [2023-03-06 23:38:56,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.6, 300 sec: 13173.2). Total num frames: 22804480. Throughput: 0: 13180.2. Samples: 22789555. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:38:56,237][81074] Avg episode reward: [(0, '2890.495')] [2023-03-06 23:38:57,008][81400] Updated weights for policy 0, policy_version 22280 (0.0006) [2023-03-06 23:38:57,776][81400] Updated weights for policy 0, policy_version 22290 (0.0006) [2023-03-06 23:38:58,562][81400] Updated weights for policy 0, policy_version 22300 (0.0006) [2023-03-06 23:38:59,340][81400] Updated weights for policy 0, policy_version 22310 (0.0006) [2023-03-06 23:39:00,108][81400] Updated weights for policy 0, policy_version 22320 (0.0006) [2023-03-06 23:39:00,881][81400] Updated weights for policy 0, policy_version 22330 (0.0006) [2023-03-06 23:39:01,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13192.5, 300 sec: 13173.2). Total num frames: 22870016. Throughput: 0: 13184.2. Samples: 22868824. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:39:01,237][81074] Avg episode reward: [(0, '2557.813')] [2023-03-06 23:39:01,661][81400] Updated weights for policy 0, policy_version 22340 (0.0006) [2023-03-06 23:39:02,427][81400] Updated weights for policy 0, policy_version 22350 (0.0006) [2023-03-06 23:39:03,206][81400] Updated weights for policy 0, policy_version 22360 (0.0006) [2023-03-06 23:39:03,986][81400] Updated weights for policy 0, policy_version 22370 (0.0006) [2023-03-06 23:39:04,770][81400] Updated weights for policy 0, policy_version 22380 (0.0007) [2023-03-06 23:39:05,545][81400] Updated weights for policy 0, policy_version 22390 (0.0006) [2023-03-06 23:39:06,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13175.5, 300 sec: 13173.2). Total num frames: 22935552. Throughput: 0: 13183.7. Samples: 22908440. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:39:06,237][81074] Avg episode reward: [(0, '2497.324')] [2023-03-06 23:39:06,325][81400] Updated weights for policy 0, policy_version 22400 (0.0006) [2023-03-06 23:39:07,095][81400] Updated weights for policy 0, policy_version 22410 (0.0006) [2023-03-06 23:39:07,861][81400] Updated weights for policy 0, policy_version 22420 (0.0006) [2023-03-06 23:39:08,629][81400] Updated weights for policy 0, policy_version 22430 (0.0006) [2023-03-06 23:39:09,437][81400] Updated weights for policy 0, policy_version 22440 (0.0006) [2023-03-06 23:39:10,220][81400] Updated weights for policy 0, policy_version 22450 (0.0007) [2023-03-06 23:39:10,977][81400] Updated weights for policy 0, policy_version 22460 (0.0006) [2023-03-06 23:39:11,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13176.6). Total num frames: 23002112. Throughput: 0: 13179.4. Samples: 22987559. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:39:11,237][81074] Avg episode reward: [(0, '2337.320')] [2023-03-06 23:39:11,765][81400] Updated weights for policy 0, policy_version 22470 (0.0006) [2023-03-06 23:39:12,534][81400] Updated weights for policy 0, policy_version 22480 (0.0007) [2023-03-06 23:39:13,315][81400] Updated weights for policy 0, policy_version 22490 (0.0006) [2023-03-06 23:39:14,093][81400] Updated weights for policy 0, policy_version 22500 (0.0006) [2023-03-06 23:39:14,885][81400] Updated weights for policy 0, policy_version 22510 (0.0006) [2023-03-06 23:39:15,655][81400] Updated weights for policy 0, policy_version 22520 (0.0006) [2023-03-06 23:39:16,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13173.1). Total num frames: 23067648. Throughput: 0: 13175.6. Samples: 23066331. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:39:16,237][81074] Avg episode reward: [(0, '2456.406')] [2023-03-06 23:39:16,440][81400] Updated weights for policy 0, policy_version 22530 (0.0007) [2023-03-06 23:39:17,218][81400] Updated weights for policy 0, policy_version 22540 (0.0006) [2023-03-06 23:39:17,990][81400] Updated weights for policy 0, policy_version 22550 (0.0006) [2023-03-06 23:39:18,769][81400] Updated weights for policy 0, policy_version 22560 (0.0006) [2023-03-06 23:39:19,543][81400] Updated weights for policy 0, policy_version 22570 (0.0007) [2023-03-06 23:39:20,306][81400] Updated weights for policy 0, policy_version 22580 (0.0006) [2023-03-06 23:39:21,093][81400] Updated weights for policy 0, policy_version 22590 (0.0006) [2023-03-06 23:39:21,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13175.4, 300 sec: 13169.7). Total num frames: 23133184. Throughput: 0: 13175.1. Samples: 23105906. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:39:21,248][81074] Avg episode reward: [(0, '2182.222')] [2023-03-06 23:39:21,867][81400] Updated weights for policy 0, policy_version 22600 (0.0006) [2023-03-06 23:39:22,629][81400] Updated weights for policy 0, policy_version 22610 (0.0006) [2023-03-06 23:39:23,396][81400] Updated weights for policy 0, policy_version 22620 (0.0005) [2023-03-06 23:39:24,189][81400] Updated weights for policy 0, policy_version 22630 (0.0006) [2023-03-06 23:39:24,951][81400] Updated weights for policy 0, policy_version 22640 (0.0006) [2023-03-06 23:39:25,737][81400] Updated weights for policy 0, policy_version 22650 (0.0006) [2023-03-06 23:39:26,236][81074] Fps is (10 sec: 13209.8, 60 sec: 13175.5, 300 sec: 13173.2). Total num frames: 23199744. Throughput: 0: 13178.7. Samples: 23185260. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:39:26,244][81074] Avg episode reward: [(0, '2378.370')] [2023-03-06 23:39:26,506][81400] Updated weights for policy 0, policy_version 22660 (0.0005) [2023-03-06 23:39:27,301][81400] Updated weights for policy 0, policy_version 22670 (0.0006) [2023-03-06 23:39:28,089][81400] Updated weights for policy 0, policy_version 22680 (0.0006) [2023-03-06 23:39:28,866][81400] Updated weights for policy 0, policy_version 22690 (0.0006) [2023-03-06 23:39:29,635][81400] Updated weights for policy 0, policy_version 22700 (0.0006) [2023-03-06 23:39:30,418][81400] Updated weights for policy 0, policy_version 22710 (0.0006) [2023-03-06 23:39:31,181][81400] Updated weights for policy 0, policy_version 22720 (0.0006) [2023-03-06 23:39:31,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13173.2). Total num frames: 23265280. Throughput: 0: 13176.4. Samples: 23264208. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:39:31,237][81074] Avg episode reward: [(0, '2591.179')] [2023-03-06 23:39:31,973][81400] Updated weights for policy 0, policy_version 22730 (0.0006) [2023-03-06 23:39:32,757][81400] Updated weights for policy 0, policy_version 22740 (0.0006) [2023-03-06 23:39:33,527][81400] Updated weights for policy 0, policy_version 22750 (0.0006) [2023-03-06 23:39:34,299][81400] Updated weights for policy 0, policy_version 22760 (0.0006) [2023-03-06 23:39:35,092][81400] Updated weights for policy 0, policy_version 22770 (0.0006) [2023-03-06 23:39:35,881][81400] Updated weights for policy 0, policy_version 22780 (0.0007) [2023-03-06 23:39:36,236][81074] Fps is (10 sec: 13106.9, 60 sec: 13158.3, 300 sec: 13173.1). Total num frames: 23330816. Throughput: 0: 13177.3. Samples: 23303620. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:39:36,237][81074] Avg episode reward: [(0, '2477.639')] [2023-03-06 23:39:36,643][81400] Updated weights for policy 0, policy_version 22790 (0.0005) [2023-03-06 23:39:37,428][81400] Updated weights for policy 0, policy_version 22800 (0.0006) [2023-03-06 23:39:38,189][81400] Updated weights for policy 0, policy_version 22810 (0.0006) [2023-03-06 23:39:38,977][81400] Updated weights for policy 0, policy_version 22820 (0.0006) [2023-03-06 23:39:39,759][81400] Updated weights for policy 0, policy_version 22830 (0.0006) [2023-03-06 23:39:40,545][81400] Updated weights for policy 0, policy_version 22840 (0.0006) [2023-03-06 23:39:41,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13173.2). Total num frames: 23396352. Throughput: 0: 13176.5. Samples: 23382496. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:39:41,237][81074] Avg episode reward: [(0, '2650.621')] [2023-03-06 23:39:41,319][81400] Updated weights for policy 0, policy_version 22850 (0.0006) [2023-03-06 23:39:42,085][81400] Updated weights for policy 0, policy_version 22860 (0.0006) [2023-03-06 23:39:42,880][81400] Updated weights for policy 0, policy_version 22870 (0.0006) [2023-03-06 23:39:43,661][81400] Updated weights for policy 0, policy_version 22880 (0.0006) [2023-03-06 23:39:44,442][81400] Updated weights for policy 0, policy_version 22890 (0.0006) [2023-03-06 23:39:45,215][81400] Updated weights for policy 0, policy_version 22900 (0.0006) [2023-03-06 23:39:45,982][81400] Updated weights for policy 0, policy_version 22910 (0.0006) [2023-03-06 23:39:46,236][81074] Fps is (10 sec: 13209.8, 60 sec: 13175.5, 300 sec: 13176.6). Total num frames: 23462912. Throughput: 0: 13168.6. Samples: 23461413. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:39:46,237][81074] Avg episode reward: [(0, '2450.970')] [2023-03-06 23:39:46,761][81400] Updated weights for policy 0, policy_version 22920 (0.0006) [2023-03-06 23:39:47,539][81400] Updated weights for policy 0, policy_version 22930 (0.0006) [2023-03-06 23:39:48,316][81400] Updated weights for policy 0, policy_version 22940 (0.0006) [2023-03-06 23:39:49,086][81400] Updated weights for policy 0, policy_version 22950 (0.0006) [2023-03-06 23:39:49,842][81400] Updated weights for policy 0, policy_version 22960 (0.0006) [2023-03-06 23:39:50,634][81400] Updated weights for policy 0, policy_version 22970 (0.0006) [2023-03-06 23:39:51,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13175.4, 300 sec: 13173.1). Total num frames: 23528448. Throughput: 0: 13173.8. Samples: 23501263. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:39:51,237][81074] Avg episode reward: [(0, '2295.950')] [2023-03-06 23:39:51,409][81400] Updated weights for policy 0, policy_version 22980 (0.0006) [2023-03-06 23:39:52,178][81400] Updated weights for policy 0, policy_version 22990 (0.0006) [2023-03-06 23:39:52,952][81400] Updated weights for policy 0, policy_version 23000 (0.0006) [2023-03-06 23:39:53,722][81400] Updated weights for policy 0, policy_version 23010 (0.0006) [2023-03-06 23:39:54,507][81400] Updated weights for policy 0, policy_version 23020 (0.0006) [2023-03-06 23:39:55,287][81400] Updated weights for policy 0, policy_version 23030 (0.0006) [2023-03-06 23:39:56,033][81400] Updated weights for policy 0, policy_version 23040 (0.0006) [2023-03-06 23:39:56,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13175.4, 300 sec: 13176.6). Total num frames: 23595008. Throughput: 0: 13178.1. Samples: 23580573. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:39:56,237][81074] Avg episode reward: [(0, '2360.206')] [2023-03-06 23:39:56,241][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000023042_23595008.pth... [2023-03-06 23:39:56,272][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000019952_20430848.pth [2023-03-06 23:39:56,813][81400] Updated weights for policy 0, policy_version 23050 (0.0007) [2023-03-06 23:39:57,577][81400] Updated weights for policy 0, policy_version 23060 (0.0006) [2023-03-06 23:39:58,358][81400] Updated weights for policy 0, policy_version 23070 (0.0007) [2023-03-06 23:39:59,122][81400] Updated weights for policy 0, policy_version 23080 (0.0006) [2023-03-06 23:39:59,918][81400] Updated weights for policy 0, policy_version 23090 (0.0006) [2023-03-06 23:40:00,697][81400] Updated weights for policy 0, policy_version 23100 (0.0006) [2023-03-06 23:40:01,236][81074] Fps is (10 sec: 13312.2, 60 sec: 13192.5, 300 sec: 13180.1). Total num frames: 23661568. Throughput: 0: 13190.4. Samples: 23659898. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:40:01,237][81074] Avg episode reward: [(0, '2468.787')] [2023-03-06 23:40:01,463][81400] Updated weights for policy 0, policy_version 23110 (0.0006) [2023-03-06 23:40:02,236][81400] Updated weights for policy 0, policy_version 23120 (0.0006) [2023-03-06 23:40:03,002][81400] Updated weights for policy 0, policy_version 23130 (0.0007) [2023-03-06 23:40:03,783][81400] Updated weights for policy 0, policy_version 23140 (0.0006) [2023-03-06 23:40:04,560][81400] Updated weights for policy 0, policy_version 23150 (0.0006) [2023-03-06 23:40:05,333][81400] Updated weights for policy 0, policy_version 23160 (0.0006) [2023-03-06 23:40:06,098][81400] Updated weights for policy 0, policy_version 23170 (0.0006) [2023-03-06 23:40:06,236][81074] Fps is (10 sec: 13209.8, 60 sec: 13192.6, 300 sec: 13180.1). Total num frames: 23727104. Throughput: 0: 13194.5. Samples: 23699657. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:40:06,237][81074] Avg episode reward: [(0, '2233.825')] [2023-03-06 23:40:06,877][81400] Updated weights for policy 0, policy_version 23180 (0.0006) [2023-03-06 23:40:07,656][81400] Updated weights for policy 0, policy_version 23190 (0.0007) [2023-03-06 23:40:08,431][81400] Updated weights for policy 0, policy_version 23200 (0.0006) [2023-03-06 23:40:09,194][81400] Updated weights for policy 0, policy_version 23210 (0.0005) [2023-03-06 23:40:09,968][81400] Updated weights for policy 0, policy_version 23220 (0.0006) [2023-03-06 23:40:10,739][81400] Updated weights for policy 0, policy_version 23230 (0.0006) [2023-03-06 23:40:11,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13183.6). Total num frames: 23793664. Throughput: 0: 13192.9. Samples: 23778942. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:40:11,237][81074] Avg episode reward: [(0, '2420.428')] [2023-03-06 23:40:11,537][81400] Updated weights for policy 0, policy_version 23240 (0.0007) [2023-03-06 23:40:12,305][81400] Updated weights for policy 0, policy_version 23250 (0.0005) [2023-03-06 23:40:13,079][81400] Updated weights for policy 0, policy_version 23260 (0.0005) [2023-03-06 23:40:13,848][81400] Updated weights for policy 0, policy_version 23270 (0.0006) [2023-03-06 23:40:14,640][81400] Updated weights for policy 0, policy_version 23280 (0.0006) [2023-03-06 23:40:15,423][81400] Updated weights for policy 0, policy_version 23290 (0.0006) [2023-03-06 23:40:16,198][81400] Updated weights for policy 0, policy_version 23300 (0.0006) [2023-03-06 23:40:16,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13192.5, 300 sec: 13183.6). Total num frames: 23859200. Throughput: 0: 13195.1. Samples: 23857990. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:40:16,237][81074] Avg episode reward: [(0, '2206.271')] [2023-03-06 23:40:16,970][81400] Updated weights for policy 0, policy_version 23310 (0.0006) [2023-03-06 23:40:17,736][81400] Updated weights for policy 0, policy_version 23320 (0.0006) [2023-03-06 23:40:18,495][81400] Updated weights for policy 0, policy_version 23330 (0.0006) [2023-03-06 23:40:19,278][81400] Updated weights for policy 0, policy_version 23340 (0.0006) [2023-03-06 23:40:20,041][81400] Updated weights for policy 0, policy_version 23350 (0.0006) [2023-03-06 23:40:20,821][81400] Updated weights for policy 0, policy_version 23360 (0.0006) [2023-03-06 23:40:21,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13209.6, 300 sec: 13183.6). Total num frames: 23925760. Throughput: 0: 13206.3. Samples: 23897901. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:40:21,237][81074] Avg episode reward: [(0, '1882.101')] [2023-03-06 23:40:21,598][81400] Updated weights for policy 0, policy_version 23370 (0.0006) [2023-03-06 23:40:22,377][81400] Updated weights for policy 0, policy_version 23380 (0.0006) [2023-03-06 23:40:23,137][81400] Updated weights for policy 0, policy_version 23390 (0.0006) [2023-03-06 23:40:23,914][81400] Updated weights for policy 0, policy_version 23400 (0.0006) [2023-03-06 23:40:24,678][81400] Updated weights for policy 0, policy_version 23410 (0.0005) [2023-03-06 23:40:25,454][81400] Updated weights for policy 0, policy_version 23420 (0.0007) [2023-03-06 23:40:26,212][81400] Updated weights for policy 0, policy_version 23430 (0.0006) [2023-03-06 23:40:26,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13209.6, 300 sec: 13187.0). Total num frames: 23992320. Throughput: 0: 13224.5. Samples: 23977598. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:40:26,237][81074] Avg episode reward: [(0, '1870.844')] [2023-03-06 23:40:26,994][81400] Updated weights for policy 0, policy_version 23440 (0.0007) [2023-03-06 23:40:27,761][81400] Updated weights for policy 0, policy_version 23450 (0.0006) [2023-03-06 23:40:28,539][81400] Updated weights for policy 0, policy_version 23460 (0.0006) [2023-03-06 23:40:29,318][81400] Updated weights for policy 0, policy_version 23470 (0.0006) [2023-03-06 23:40:30,078][81400] Updated weights for policy 0, policy_version 23480 (0.0006) [2023-03-06 23:40:30,857][81400] Updated weights for policy 0, policy_version 23490 (0.0007) [2023-03-06 23:40:31,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13209.6, 300 sec: 13187.0). Total num frames: 24057856. Throughput: 0: 13235.6. Samples: 24057015. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:40:31,237][81074] Avg episode reward: [(0, '1900.516')] [2023-03-06 23:40:31,642][81400] Updated weights for policy 0, policy_version 23500 (0.0005) [2023-03-06 23:40:32,416][81400] Updated weights for policy 0, policy_version 23510 (0.0006) [2023-03-06 23:40:33,202][81400] Updated weights for policy 0, policy_version 23520 (0.0006) [2023-03-06 23:40:33,965][81400] Updated weights for policy 0, policy_version 23530 (0.0006) [2023-03-06 23:40:34,754][81400] Updated weights for policy 0, policy_version 23540 (0.0006) [2023-03-06 23:40:35,523][81400] Updated weights for policy 0, policy_version 23550 (0.0007) [2023-03-06 23:40:36,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13190.5). Total num frames: 24124416. Throughput: 0: 13230.4. Samples: 24096629. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:40:36,237][81074] Avg episode reward: [(0, '1841.150')] [2023-03-06 23:40:36,293][81400] Updated weights for policy 0, policy_version 23560 (0.0006) [2023-03-06 23:40:37,077][81400] Updated weights for policy 0, policy_version 23570 (0.0007) [2023-03-06 23:40:37,842][81400] Updated weights for policy 0, policy_version 23580 (0.0005) [2023-03-06 23:40:38,597][81400] Updated weights for policy 0, policy_version 23590 (0.0007) [2023-03-06 23:40:39,383][81400] Updated weights for policy 0, policy_version 23600 (0.0006) [2023-03-06 23:40:40,149][81400] Updated weights for policy 0, policy_version 23610 (0.0006) [2023-03-06 23:40:40,941][81400] Updated weights for policy 0, policy_version 23620 (0.0006) [2023-03-06 23:40:41,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13187.0). Total num frames: 24189952. Throughput: 0: 13232.4. Samples: 24176030. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:40:41,237][81074] Avg episode reward: [(0, '1958.475')] [2023-03-06 23:40:41,691][81400] Updated weights for policy 0, policy_version 23630 (0.0006) [2023-03-06 23:40:42,461][81400] Updated weights for policy 0, policy_version 23640 (0.0006) [2023-03-06 23:40:43,264][81400] Updated weights for policy 0, policy_version 23650 (0.0006) [2023-03-06 23:40:44,035][81400] Updated weights for policy 0, policy_version 23660 (0.0006) [2023-03-06 23:40:44,814][81400] Updated weights for policy 0, policy_version 23670 (0.0006) [2023-03-06 23:40:45,577][81400] Updated weights for policy 0, policy_version 23680 (0.0007) [2023-03-06 23:40:46,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13190.5). Total num frames: 24256512. Throughput: 0: 13231.2. Samples: 24255304. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:40:46,237][81074] Avg episode reward: [(0, '2105.904')] [2023-03-06 23:40:46,350][81400] Updated weights for policy 0, policy_version 23690 (0.0006) [2023-03-06 23:40:47,129][81400] Updated weights for policy 0, policy_version 23700 (0.0007) [2023-03-06 23:40:47,886][81400] Updated weights for policy 0, policy_version 23710 (0.0006) [2023-03-06 23:40:48,661][81400] Updated weights for policy 0, policy_version 23720 (0.0005) [2023-03-06 23:40:49,454][81400] Updated weights for policy 0, policy_version 23730 (0.0006) [2023-03-06 23:40:50,216][81400] Updated weights for policy 0, policy_version 23740 (0.0006) [2023-03-06 23:40:51,004][81400] Updated weights for policy 0, policy_version 23750 (0.0007) [2023-03-06 23:40:51,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13190.5). Total num frames: 24322048. Throughput: 0: 13231.2. Samples: 24295063. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:40:51,237][81074] Avg episode reward: [(0, '1990.806')] [2023-03-06 23:40:51,781][81400] Updated weights for policy 0, policy_version 23760 (0.0006) [2023-03-06 23:40:52,551][81400] Updated weights for policy 0, policy_version 23770 (0.0006) [2023-03-06 23:40:53,322][81400] Updated weights for policy 0, policy_version 23780 (0.0008) [2023-03-06 23:40:54,099][81400] Updated weights for policy 0, policy_version 23790 (0.0005) [2023-03-06 23:40:54,865][81400] Updated weights for policy 0, policy_version 23800 (0.0006) [2023-03-06 23:40:55,649][81400] Updated weights for policy 0, policy_version 23810 (0.0005) [2023-03-06 23:40:56,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13194.0). Total num frames: 24388608. Throughput: 0: 13231.9. Samples: 24374379. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:40:56,237][81074] Avg episode reward: [(0, '1882.194')] [2023-03-06 23:40:56,412][81400] Updated weights for policy 0, policy_version 23820 (0.0006) [2023-03-06 23:40:57,184][81400] Updated weights for policy 0, policy_version 23830 (0.0006) [2023-03-06 23:40:57,949][81400] Updated weights for policy 0, policy_version 23840 (0.0006) [2023-03-06 23:40:58,718][81400] Updated weights for policy 0, policy_version 23850 (0.0005) [2023-03-06 23:40:59,487][81400] Updated weights for policy 0, policy_version 23860 (0.0005) [2023-03-06 23:41:00,250][81400] Updated weights for policy 0, policy_version 23870 (0.0006) [2023-03-06 23:41:01,002][81400] Updated weights for policy 0, policy_version 23880 (0.0006) [2023-03-06 23:41:01,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13226.7, 300 sec: 13194.0). Total num frames: 24455168. Throughput: 0: 13253.6. Samples: 24454403. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:41:01,237][81074] Avg episode reward: [(0, '2026.898')] [2023-03-06 23:41:01,790][81400] Updated weights for policy 0, policy_version 23890 (0.0006) [2023-03-06 23:41:02,567][81400] Updated weights for policy 0, policy_version 23900 (0.0006) [2023-03-06 23:41:03,338][81400] Updated weights for policy 0, policy_version 23910 (0.0006) [2023-03-06 23:41:04,106][81400] Updated weights for policy 0, policy_version 23920 (0.0006) [2023-03-06 23:41:04,876][81400] Updated weights for policy 0, policy_version 23930 (0.0005) [2023-03-06 23:41:05,648][81400] Updated weights for policy 0, policy_version 23940 (0.0007) [2023-03-06 23:41:06,236][81074] Fps is (10 sec: 13311.9, 60 sec: 13243.7, 300 sec: 13197.5). Total num frames: 24521728. Throughput: 0: 13247.6. Samples: 24494043. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:41:06,237][81074] Avg episode reward: [(0, '1790.495')] [2023-03-06 23:41:06,425][81400] Updated weights for policy 0, policy_version 23950 (0.0007) [2023-03-06 23:41:07,220][81400] Updated weights for policy 0, policy_version 23960 (0.0006) [2023-03-06 23:41:07,998][81400] Updated weights for policy 0, policy_version 23970 (0.0006) [2023-03-06 23:41:08,783][81400] Updated weights for policy 0, policy_version 23980 (0.0006) [2023-03-06 23:41:09,566][81400] Updated weights for policy 0, policy_version 23990 (0.0006) [2023-03-06 23:41:10,330][81400] Updated weights for policy 0, policy_version 24000 (0.0008) [2023-03-06 23:41:11,092][81400] Updated weights for policy 0, policy_version 24010 (0.0006) [2023-03-06 23:41:11,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13197.4). Total num frames: 24587264. Throughput: 0: 13235.4. Samples: 24573193. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:41:11,237][81074] Avg episode reward: [(0, '1696.348')] [2023-03-06 23:41:11,862][81400] Updated weights for policy 0, policy_version 24020 (0.0006) [2023-03-06 23:41:12,615][81400] Updated weights for policy 0, policy_version 24030 (0.0005) [2023-03-06 23:41:13,409][81400] Updated weights for policy 0, policy_version 24040 (0.0006) [2023-03-06 23:41:14,185][81400] Updated weights for policy 0, policy_version 24050 (0.0006) [2023-03-06 23:41:14,959][81400] Updated weights for policy 0, policy_version 24060 (0.0007) [2023-03-06 23:41:15,725][81400] Updated weights for policy 0, policy_version 24070 (0.0006) [2023-03-06 23:41:16,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13243.7, 300 sec: 13200.9). Total num frames: 24653824. Throughput: 0: 13241.4. Samples: 24652879. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:41:16,237][81074] Avg episode reward: [(0, '1728.144')] [2023-03-06 23:41:16,475][81400] Updated weights for policy 0, policy_version 24080 (0.0007) [2023-03-06 23:41:17,269][81400] Updated weights for policy 0, policy_version 24090 (0.0006) [2023-03-06 23:41:18,036][81400] Updated weights for policy 0, policy_version 24100 (0.0006) [2023-03-06 23:41:18,803][81400] Updated weights for policy 0, policy_version 24110 (0.0006) [2023-03-06 23:41:19,580][81400] Updated weights for policy 0, policy_version 24120 (0.0006) [2023-03-06 23:41:20,351][81400] Updated weights for policy 0, policy_version 24130 (0.0007) [2023-03-06 23:41:21,114][81400] Updated weights for policy 0, policy_version 24140 (0.0006) [2023-03-06 23:41:21,236][81074] Fps is (10 sec: 13311.9, 60 sec: 13243.7, 300 sec: 13204.4). Total num frames: 24720384. Throughput: 0: 13247.3. Samples: 24692757. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:41:21,237][81074] Avg episode reward: [(0, '1734.593')] [2023-03-06 23:41:21,885][81400] Updated weights for policy 0, policy_version 24150 (0.0007) [2023-03-06 23:41:22,671][81400] Updated weights for policy 0, policy_version 24160 (0.0006) [2023-03-06 23:41:23,447][81400] Updated weights for policy 0, policy_version 24170 (0.0005) [2023-03-06 23:41:24,229][81400] Updated weights for policy 0, policy_version 24180 (0.0006) [2023-03-06 23:41:25,007][81400] Updated weights for policy 0, policy_version 24190 (0.0006) [2023-03-06 23:41:25,781][81400] Updated weights for policy 0, policy_version 24200 (0.0006) [2023-03-06 23:41:26,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13226.6, 300 sec: 13204.4). Total num frames: 24785920. Throughput: 0: 13242.7. Samples: 24771953. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:41:26,237][81074] Avg episode reward: [(0, '1566.282')] [2023-03-06 23:41:26,568][81400] Updated weights for policy 0, policy_version 24210 (0.0007) [2023-03-06 23:41:27,348][81400] Updated weights for policy 0, policy_version 24220 (0.0005) [2023-03-06 23:41:28,128][81400] Updated weights for policy 0, policy_version 24230 (0.0007) [2023-03-06 23:41:28,882][81400] Updated weights for policy 0, policy_version 24240 (0.0006) [2023-03-06 23:41:29,657][81400] Updated weights for policy 0, policy_version 24250 (0.0005) [2023-03-06 23:41:30,420][81400] Updated weights for policy 0, policy_version 24260 (0.0007) [2023-03-06 23:41:31,202][81400] Updated weights for policy 0, policy_version 24270 (0.0006) [2023-03-06 23:41:31,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13204.4). Total num frames: 24852480. Throughput: 0: 13242.7. Samples: 24851229. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:41:31,237][81074] Avg episode reward: [(0, '1940.338')] [2023-03-06 23:41:31,984][81400] Updated weights for policy 0, policy_version 24280 (0.0006) [2023-03-06 23:41:32,772][81400] Updated weights for policy 0, policy_version 24290 (0.0006) [2023-03-06 23:41:33,538][81400] Updated weights for policy 0, policy_version 24300 (0.0006) [2023-03-06 23:41:33,828][81349] KL-divergence is very high: 2600.4502 [2023-03-06 23:41:34,309][81400] Updated weights for policy 0, policy_version 24310 (0.0006) [2023-03-06 23:41:35,078][81400] Updated weights for policy 0, policy_version 24320 (0.0006) [2023-03-06 23:41:35,862][81400] Updated weights for policy 0, policy_version 24330 (0.0006) [2023-03-06 23:41:36,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13226.6, 300 sec: 13200.9). Total num frames: 24918016. Throughput: 0: 13239.7. Samples: 24890850. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:41:36,237][81074] Avg episode reward: [(0, '1922.184')] [2023-03-06 23:41:36,646][81400] Updated weights for policy 0, policy_version 24340 (0.0007) [2023-03-06 23:41:37,436][81400] Updated weights for policy 0, policy_version 24350 (0.0006) [2023-03-06 23:41:38,199][81400] Updated weights for policy 0, policy_version 24360 (0.0006) [2023-03-06 23:41:38,981][81400] Updated weights for policy 0, policy_version 24370 (0.0005) [2023-03-06 23:41:39,765][81400] Updated weights for policy 0, policy_version 24380 (0.0006) [2023-03-06 23:41:40,530][81400] Updated weights for policy 0, policy_version 24390 (0.0006) [2023-03-06 23:41:41,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13204.4). Total num frames: 24984576. Throughput: 0: 13231.0. Samples: 24969777. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:41:41,237][81074] Avg episode reward: [(0, '2018.774')] [2023-03-06 23:41:41,311][81400] Updated weights for policy 0, policy_version 24400 (0.0006) [2023-03-06 23:41:42,078][81400] Updated weights for policy 0, policy_version 24410 (0.0005) [2023-03-06 23:41:42,858][81400] Updated weights for policy 0, policy_version 24420 (0.0006) [2023-03-06 23:41:43,630][81400] Updated weights for policy 0, policy_version 24430 (0.0007) [2023-03-06 23:41:44,390][81400] Updated weights for policy 0, policy_version 24440 (0.0005) [2023-03-06 23:41:45,146][81400] Updated weights for policy 0, policy_version 24450 (0.0006) [2023-03-06 23:41:45,942][81400] Updated weights for policy 0, policy_version 24460 (0.0005) [2023-03-06 23:41:46,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13226.6, 300 sec: 13204.4). Total num frames: 25050112. Throughput: 0: 13220.1. Samples: 25049307. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:41:46,237][81074] Avg episode reward: [(0, '2071.031')] [2023-03-06 23:41:46,719][81400] Updated weights for policy 0, policy_version 24470 (0.0007) [2023-03-06 23:41:47,502][81400] Updated weights for policy 0, policy_version 24480 (0.0006) [2023-03-06 23:41:48,272][81400] Updated weights for policy 0, policy_version 24490 (0.0005) [2023-03-06 23:41:49,049][81400] Updated weights for policy 0, policy_version 24500 (0.0005) [2023-03-06 23:41:49,801][81400] Updated weights for policy 0, policy_version 24510 (0.0005) [2023-03-06 23:41:50,588][81400] Updated weights for policy 0, policy_version 24520 (0.0008) [2023-03-06 23:41:51,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13207.9). Total num frames: 25116672. Throughput: 0: 13220.1. Samples: 25088949. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:41:51,237][81074] Avg episode reward: [(0, '2011.622')] [2023-03-06 23:41:51,359][81400] Updated weights for policy 0, policy_version 24530 (0.0005) [2023-03-06 23:41:52,120][81400] Updated weights for policy 0, policy_version 24540 (0.0006) [2023-03-06 23:41:52,891][81400] Updated weights for policy 0, policy_version 24550 (0.0006) [2023-03-06 23:41:53,649][81400] Updated weights for policy 0, policy_version 24560 (0.0007) [2023-03-06 23:41:54,416][81400] Updated weights for policy 0, policy_version 24570 (0.0006) [2023-03-06 23:41:55,199][81400] Updated weights for policy 0, policy_version 24580 (0.0006) [2023-03-06 23:41:55,975][81400] Updated weights for policy 0, policy_version 24590 (0.0006) [2023-03-06 23:41:56,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13243.7, 300 sec: 13207.9). Total num frames: 25183232. Throughput: 0: 13233.4. Samples: 25168697. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:41:56,237][81074] Avg episode reward: [(0, '1842.559')] [2023-03-06 23:41:56,241][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000024593_25183232.pth... 
[2023-03-06 23:41:56,273][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000021497_22012928.pth [2023-03-06 23:41:56,765][81400] Updated weights for policy 0, policy_version 24600 (0.0006) [2023-03-06 23:41:57,544][81400] Updated weights for policy 0, policy_version 24610 (0.0007) [2023-03-06 23:41:58,311][81400] Updated weights for policy 0, policy_version 24620 (0.0006) [2023-03-06 23:41:59,074][81400] Updated weights for policy 0, policy_version 24630 (0.0006) [2023-03-06 23:41:59,841][81400] Updated weights for policy 0, policy_version 24640 (0.0006) [2023-03-06 23:42:00,619][81400] Updated weights for policy 0, policy_version 24650 (0.0006) [2023-03-06 23:42:01,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13204.4). Total num frames: 25248768. Throughput: 0: 13226.0. Samples: 25248047. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:42:01,247][81074] Avg episode reward: [(0, '2047.000')] [2023-03-06 23:42:01,397][81400] Updated weights for policy 0, policy_version 24660 (0.0006) [2023-03-06 23:42:02,165][81400] Updated weights for policy 0, policy_version 24670 (0.0006) [2023-03-06 23:42:02,937][81400] Updated weights for policy 0, policy_version 24680 (0.0006) [2023-03-06 23:42:03,705][81400] Updated weights for policy 0, policy_version 24690 (0.0006) [2023-03-06 23:42:04,474][81400] Updated weights for policy 0, policy_version 24700 (0.0006) [2023-03-06 23:42:05,235][81400] Updated weights for policy 0, policy_version 24710 (0.0006) [2023-03-06 23:42:06,028][81400] Updated weights for policy 0, policy_version 24720 (0.0007) [2023-03-06 23:42:06,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13207.9). Total num frames: 25315328. Throughput: 0: 13225.7. Samples: 25287913. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:42:06,247][81074] Avg episode reward: [(0, '2158.486')] [2023-03-06 23:42:06,789][81400] Updated weights for policy 0, policy_version 24730 (0.0005) [2023-03-06 23:42:07,565][81400] Updated weights for policy 0, policy_version 24740 (0.0006) [2023-03-06 23:42:08,345][81400] Updated weights for policy 0, policy_version 24750 (0.0006) [2023-03-06 23:42:09,115][81400] Updated weights for policy 0, policy_version 24760 (0.0007) [2023-03-06 23:42:09,887][81400] Updated weights for policy 0, policy_version 24770 (0.0006) [2023-03-06 23:42:10,646][81400] Updated weights for policy 0, policy_version 24780 (0.0006) [2023-03-06 23:42:11,236][81074] Fps is (10 sec: 13312.1, 60 sec: 13243.7, 300 sec: 13211.3). Total num frames: 25381888. Throughput: 0: 13235.9. Samples: 25367566. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:42:11,247][81074] Avg episode reward: [(0, '1847.412')] [2023-03-06 23:42:11,427][81400] Updated weights for policy 0, policy_version 24790 (0.0006) [2023-03-06 23:42:12,201][81400] Updated weights for policy 0, policy_version 24800 (0.0006) [2023-03-06 23:42:12,975][81400] Updated weights for policy 0, policy_version 24810 (0.0007) [2023-03-06 23:42:13,736][81400] Updated weights for policy 0, policy_version 24820 (0.0007) [2023-03-06 23:42:14,518][81400] Updated weights for policy 0, policy_version 24830 (0.0007) [2023-03-06 23:42:15,274][81400] Updated weights for policy 0, policy_version 24840 (0.0006) [2023-03-06 23:42:16,068][81400] Updated weights for policy 0, policy_version 24850 (0.0006) [2023-03-06 23:42:16,236][81074] Fps is (10 sec: 13311.9, 60 sec: 13243.8, 300 sec: 13211.3). Total num frames: 25448448. Throughput: 0: 13240.7. Samples: 25447058. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:42:16,237][81074] Avg episode reward: [(0, '2110.444')] [2023-03-06 23:42:16,837][81400] Updated weights for policy 0, policy_version 24860 (0.0006) [2023-03-06 23:42:17,626][81400] Updated weights for policy 0, policy_version 24870 (0.0005) [2023-03-06 23:42:18,391][81349] KL-divergence is very high: 1200.0110 [2023-03-06 23:42:18,400][81400] Updated weights for policy 0, policy_version 24880 (0.0006) [2023-03-06 23:42:19,183][81400] Updated weights for policy 0, policy_version 24890 (0.0006) [2023-03-06 23:42:19,956][81400] Updated weights for policy 0, policy_version 24900 (0.0006) [2023-03-06 23:42:20,724][81400] Updated weights for policy 0, policy_version 24910 (0.0006) [2023-03-06 23:42:21,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13211.3). Total num frames: 25513984. Throughput: 0: 13237.7. Samples: 25486545. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:42:21,237][81074] Avg episode reward: [(0, '2119.860')] [2023-03-06 23:42:21,515][81400] Updated weights for policy 0, policy_version 24920 (0.0006) [2023-03-06 23:42:22,283][81400] Updated weights for policy 0, policy_version 24930 (0.0006) [2023-03-06 23:42:23,080][81400] Updated weights for policy 0, policy_version 24940 (0.0006) [2023-03-06 23:42:23,852][81400] Updated weights for policy 0, policy_version 24950 (0.0007) [2023-03-06 23:42:24,639][81400] Updated weights for policy 0, policy_version 24960 (0.0006) [2023-03-06 23:42:25,413][81400] Updated weights for policy 0, policy_version 24970 (0.0005) [2023-03-06 23:42:26,165][81400] Updated weights for policy 0, policy_version 24980 (0.0005) [2023-03-06 23:42:26,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13226.7, 300 sec: 13211.3). Total num frames: 25579520. Throughput: 0: 13237.2. Samples: 25565450. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:42:26,237][81074] Avg episode reward: [(0, '1910.567')] [2023-03-06 23:42:26,935][81400] Updated weights for policy 0, policy_version 24990 (0.0006) [2023-03-06 23:42:27,723][81400] Updated weights for policy 0, policy_version 25000 (0.0006) [2023-03-06 23:42:28,491][81400] Updated weights for policy 0, policy_version 25010 (0.0006) [2023-03-06 23:42:29,265][81400] Updated weights for policy 0, policy_version 25020 (0.0006) [2023-03-06 23:42:30,047][81400] Updated weights for policy 0, policy_version 25030 (0.0006) [2023-03-06 23:42:30,810][81400] Updated weights for policy 0, policy_version 25040 (0.0006) [2023-03-06 23:42:31,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13214.8). Total num frames: 25646080. Throughput: 0: 13233.5. Samples: 25644813. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:42:31,237][81074] Avg episode reward: [(0, '1814.603')] [2023-03-06 23:42:31,595][81400] Updated weights for policy 0, policy_version 25050 (0.0006) [2023-03-06 23:42:32,373][81400] Updated weights for policy 0, policy_version 25060 (0.0006) [2023-03-06 23:42:33,141][81400] Updated weights for policy 0, policy_version 25070 (0.0006) [2023-03-06 23:42:33,939][81400] Updated weights for policy 0, policy_version 25080 (0.0006) [2023-03-06 23:42:33,995][81349] KL-divergence is very high: 1682.8862 [2023-03-06 23:42:34,693][81400] Updated weights for policy 0, policy_version 25090 (0.0005) [2023-03-06 23:42:35,462][81400] Updated weights for policy 0, policy_version 25100 (0.0006) [2023-03-06 23:42:36,236][81074] Fps is (10 sec: 13312.1, 60 sec: 13243.8, 300 sec: 13214.8). Total num frames: 25712640. Throughput: 0: 13229.0. Samples: 25684254. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:42:36,237][81074] Avg episode reward: [(0, '1845.916')] [2023-03-06 23:42:36,237][81400] Updated weights for policy 0, policy_version 25110 (0.0006) [2023-03-06 23:42:37,005][81400] Updated weights for policy 0, policy_version 25120 (0.0006) [2023-03-06 23:42:37,784][81400] Updated weights for policy 0, policy_version 25130 (0.0007) [2023-03-06 23:42:38,563][81400] Updated weights for policy 0, policy_version 25140 (0.0005) [2023-03-06 23:42:39,346][81400] Updated weights for policy 0, policy_version 25150 (0.0005) [2023-03-06 23:42:40,112][81400] Updated weights for policy 0, policy_version 25160 (0.0006) [2023-03-06 23:42:40,913][81400] Updated weights for policy 0, policy_version 25170 (0.0006) [2023-03-06 23:42:41,125][81349] KL-divergence is very high: 1681.3293 [2023-03-06 23:42:41,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13226.7, 300 sec: 13211.3). Total num frames: 25778176. Throughput: 0: 13223.3. Samples: 25763745. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:42:41,237][81074] Avg episode reward: [(0, '1630.123')] [2023-03-06 23:42:41,665][81400] Updated weights for policy 0, policy_version 25180 (0.0005) [2023-03-06 23:42:42,438][81400] Updated weights for policy 0, policy_version 25190 (0.0006) [2023-03-06 23:42:43,197][81400] Updated weights for policy 0, policy_version 25200 (0.0006) [2023-03-06 23:42:43,962][81400] Updated weights for policy 0, policy_version 25210 (0.0006) [2023-03-06 23:42:44,706][81400] Updated weights for policy 0, policy_version 25220 (0.0005) [2023-03-06 23:42:45,489][81400] Updated weights for policy 0, policy_version 25230 (0.0006) [2023-03-06 23:42:46,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13243.8, 300 sec: 13214.8). Total num frames: 25844736. Throughput: 0: 13240.6. Samples: 25843875. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:42:46,237][81074] Avg episode reward: [(0, '1440.946')] [2023-03-06 23:42:46,245][81400] Updated weights for policy 0, policy_version 25240 (0.0006) [2023-03-06 23:42:47,033][81400] Updated weights for policy 0, policy_version 25250 (0.0006) [2023-03-06 23:42:47,781][81400] Updated weights for policy 0, policy_version 25260 (0.0006) [2023-03-06 23:42:48,557][81400] Updated weights for policy 0, policy_version 25270 (0.0005) [2023-03-06 23:42:49,325][81400] Updated weights for policy 0, policy_version 25280 (0.0007) [2023-03-06 23:42:50,074][81400] Updated weights for policy 0, policy_version 25290 (0.0006) [2023-03-06 23:42:50,833][81400] Updated weights for policy 0, policy_version 25300 (0.0006) [2023-03-06 23:42:51,236][81074] Fps is (10 sec: 13414.5, 60 sec: 13260.8, 300 sec: 13218.3). Total num frames: 25912320. Throughput: 0: 13245.2. Samples: 25883947. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:42:51,237][81074] Avg episode reward: [(0, '1235.662')] [2023-03-06 23:42:51,597][81400] Updated weights for policy 0, policy_version 25310 (0.0007) [2023-03-06 23:42:52,370][81400] Updated weights for policy 0, policy_version 25320 (0.0007) [2023-03-06 23:42:52,970][81349] KL-divergence is very high: 175.8269 [2023-03-06 23:42:53,134][81400] Updated weights for policy 0, policy_version 25330 (0.0006) [2023-03-06 23:42:53,893][81400] Updated weights for policy 0, policy_version 25340 (0.0006) [2023-03-06 23:42:54,653][81400] Updated weights for policy 0, policy_version 25350 (0.0006) [2023-03-06 23:42:55,425][81400] Updated weights for policy 0, policy_version 25360 (0.0005) [2023-03-06 23:42:56,192][81400] Updated weights for policy 0, policy_version 25370 (0.0006) [2023-03-06 23:42:56,236][81074] Fps is (10 sec: 13414.4, 60 sec: 13260.8, 300 sec: 13221.7). Total num frames: 25978880. Throughput: 0: 13262.1. Samples: 25964362. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:42:56,237][81074] Avg episode reward: [(0, '1380.755')] [2023-03-06 23:42:56,973][81400] Updated weights for policy 0, policy_version 25380 (0.0006) [2023-03-06 23:42:57,745][81400] Updated weights for policy 0, policy_version 25390 (0.0006) [2023-03-06 23:42:58,355][81349] KL-divergence is very high: 691.7407 [2023-03-06 23:42:58,521][81400] Updated weights for policy 0, policy_version 25400 (0.0006) [2023-03-06 23:42:59,291][81400] Updated weights for policy 0, policy_version 25410 (0.0005) [2023-03-06 23:43:00,089][81400] Updated weights for policy 0, policy_version 25420 (0.0006) [2023-03-06 23:43:00,844][81400] Updated weights for policy 0, policy_version 25430 (0.0006) [2023-03-06 23:43:01,236][81074] Fps is (10 sec: 13311.9, 60 sec: 13277.8, 300 sec: 13221.7). Total num frames: 26045440. Throughput: 0: 13264.4. Samples: 26043956. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:43:01,237][81074] Avg episode reward: [(0, '1396.613')] [2023-03-06 23:43:01,600][81400] Updated weights for policy 0, policy_version 25440 (0.0006) [2023-03-06 23:43:02,355][81400] Updated weights for policy 0, policy_version 25450 (0.0006) [2023-03-06 23:43:03,135][81400] Updated weights for policy 0, policy_version 25460 (0.0006) [2023-03-06 23:43:03,912][81400] Updated weights for policy 0, policy_version 25470 (0.0006) [2023-03-06 23:43:04,682][81400] Updated weights for policy 0, policy_version 25480 (0.0006) [2023-03-06 23:43:04,891][81349] KL-divergence is very high: 2289.4211 [2023-03-06 23:43:04,986][81349] KL-divergence is very high: 140.2272 [2023-03-06 23:43:05,460][81400] Updated weights for policy 0, policy_version 25490 (0.0007) [2023-03-06 23:43:06,221][81400] Updated weights for policy 0, policy_version 25500 (0.0006) [2023-03-06 23:43:06,236][81074] Fps is (10 sec: 13311.9, 60 sec: 13277.9, 300 sec: 13225.2). Total num frames: 26112000. Throughput: 0: 13274.6. Samples: 26083904. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:43:06,237][81074] Avg episode reward: [(0, '1401.801')] [2023-03-06 23:43:06,966][81400] Updated weights for policy 0, policy_version 25510 (0.0006) [2023-03-06 23:43:07,734][81400] Updated weights for policy 0, policy_version 25520 (0.0005) [2023-03-06 23:43:08,490][81400] Updated weights for policy 0, policy_version 25530 (0.0006) [2023-03-06 23:43:09,249][81400] Updated weights for policy 0, policy_version 25540 (0.0006) [2023-03-06 23:43:10,024][81400] Updated weights for policy 0, policy_version 25550 (0.0006) [2023-03-06 23:43:10,787][81400] Updated weights for policy 0, policy_version 25560 (0.0006) [2023-03-06 23:43:11,236][81074] Fps is (10 sec: 13414.5, 60 sec: 13294.9, 300 sec: 13232.2). Total num frames: 26179584. Throughput: 0: 13310.3. Samples: 26164414. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:43:11,237][81074] Avg episode reward: [(0, '1152.934')] [2023-03-06 23:43:11,558][81400] Updated weights for policy 0, policy_version 25570 (0.0006) [2023-03-06 23:43:12,316][81400] Updated weights for policy 0, policy_version 25580 (0.0006) [2023-03-06 23:43:13,078][81400] Updated weights for policy 0, policy_version 25590 (0.0006) [2023-03-06 23:43:13,831][81400] Updated weights for policy 0, policy_version 25600 (0.0005) [2023-03-06 23:43:14,593][81400] Updated weights for policy 0, policy_version 25610 (0.0007) [2023-03-06 23:43:15,365][81400] Updated weights for policy 0, policy_version 25620 (0.0007) [2023-03-06 23:43:16,113][81400] Updated weights for policy 0, policy_version 25630 (0.0006) [2023-03-06 23:43:16,236][81074] Fps is (10 sec: 13414.4, 60 sec: 13294.9, 300 sec: 13232.2). Total num frames: 26246144. Throughput: 0: 13336.2. Samples: 26244945. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:43:16,237][81074] Avg episode reward: [(0, '1136.197')] [2023-03-06 23:43:16,869][81400] Updated weights for policy 0, policy_version 25640 (0.0007) [2023-03-06 23:43:17,631][81400] Updated weights for policy 0, policy_version 25650 (0.0006) [2023-03-06 23:43:18,396][81400] Updated weights for policy 0, policy_version 25660 (0.0006) [2023-03-06 23:43:19,156][81400] Updated weights for policy 0, policy_version 25670 (0.0006) [2023-03-06 23:43:19,910][81400] Updated weights for policy 0, policy_version 25680 (0.0006) [2023-03-06 23:43:20,672][81400] Updated weights for policy 0, policy_version 25690 (0.0006) [2023-03-06 23:43:21,236][81074] Fps is (10 sec: 13414.5, 60 sec: 13329.1, 300 sec: 13235.6). Total num frames: 26313728. Throughput: 0: 13357.4. Samples: 26285339. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:43:21,237][81074] Avg episode reward: [(0, '1205.671')] [2023-03-06 23:43:21,435][81400] Updated weights for policy 0, policy_version 25700 (0.0006) [2023-03-06 23:43:22,221][81400] Updated weights for policy 0, policy_version 25710 (0.0006) [2023-03-06 23:43:22,982][81400] Updated weights for policy 0, policy_version 25720 (0.0006) [2023-03-06 23:43:23,755][81400] Updated weights for policy 0, policy_version 25730 (0.0006) [2023-03-06 23:43:24,502][81400] Updated weights for policy 0, policy_version 25740 (0.0006) [2023-03-06 23:43:25,280][81400] Updated weights for policy 0, policy_version 25750 (0.0006) [2023-03-06 23:43:26,033][81400] Updated weights for policy 0, policy_version 25760 (0.0006) [2023-03-06 23:43:26,236][81074] Fps is (10 sec: 13414.5, 60 sec: 13346.1, 300 sec: 13239.1). Total num frames: 26380288. Throughput: 0: 13377.0. Samples: 26365708. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:43:26,237][81074] Avg episode reward: [(0, '1197.025')] [2023-03-06 23:43:26,802][81400] Updated weights for policy 0, policy_version 25770 (0.0005) [2023-03-06 23:43:27,565][81400] Updated weights for policy 0, policy_version 25780 (0.0005) [2023-03-06 23:43:28,332][81400] Updated weights for policy 0, policy_version 25790 (0.0005) [2023-03-06 23:43:29,105][81400] Updated weights for policy 0, policy_version 25800 (0.0006) [2023-03-06 23:43:29,878][81400] Updated weights for policy 0, policy_version 25810 (0.0005) [2023-03-06 23:43:30,646][81400] Updated weights for policy 0, policy_version 25820 (0.0006) [2023-03-06 23:43:31,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13346.1, 300 sec: 13239.1). Total num frames: 26446848. Throughput: 0: 13374.0. Samples: 26445705. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-06 23:43:31,237][81074] Avg episode reward: [(0, '1194.463')] [2023-03-06 23:43:31,414][81400] Updated weights for policy 0, policy_version 25830 (0.0006) [2023-03-06 23:43:32,167][81400] Updated weights for policy 0, policy_version 25840 (0.0006) [2023-03-06 23:43:32,933][81400] Updated weights for policy 0, policy_version 25850 (0.0007) [2023-03-06 23:43:33,720][81400] Updated weights for policy 0, policy_version 25860 (0.0007) [2023-03-06 23:43:34,473][81400] Updated weights for policy 0, policy_version 25870 (0.0006) [2023-03-06 23:43:35,261][81400] Updated weights for policy 0, policy_version 25880 (0.0007) [2023-03-06 23:43:36,013][81400] Updated weights for policy 0, policy_version 25890 (0.0006) [2023-03-06 23:43:36,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13346.1, 300 sec: 13242.6). Total num frames: 26513408. Throughput: 0: 13377.5. Samples: 26485936. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-06 23:43:36,237][81074] Avg episode reward: [(0, '1220.370')] [2023-03-06 23:43:36,785][81400] Updated weights for policy 0, policy_version 25900 (0.0006) [2023-03-06 23:43:37,563][81400] Updated weights for policy 0, policy_version 25910 (0.0005) [2023-03-06 23:43:38,339][81400] Updated weights for policy 0, policy_version 25920 (0.0005) [2023-03-06 23:43:39,106][81400] Updated weights for policy 0, policy_version 25930 (0.0006) [2023-03-06 23:43:39,861][81400] Updated weights for policy 0, policy_version 25940 (0.0006) [2023-03-06 23:43:40,626][81400] Updated weights for policy 0, policy_version 25950 (0.0005) [2023-03-06 23:43:41,236][81074] Fps is (10 sec: 13414.2, 60 sec: 13380.3, 300 sec: 13249.5). Total num frames: 26580992. Throughput: 0: 13363.7. Samples: 26565729. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-06 23:43:41,237][81074] Avg episode reward: [(0, '1194.788')] [2023-03-06 23:43:41,379][81400] Updated weights for policy 0, policy_version 25960 (0.0005) [2023-03-06 23:43:42,141][81400] Updated weights for policy 0, policy_version 25970 (0.0005) [2023-03-06 23:43:42,911][81400] Updated weights for policy 0, policy_version 25980 (0.0006) [2023-03-06 23:43:43,674][81400] Updated weights for policy 0, policy_version 25990 (0.0006) [2023-03-06 23:43:44,433][81400] Updated weights for policy 0, policy_version 26000 (0.0007) [2023-03-06 23:43:45,206][81400] Updated weights for policy 0, policy_version 26010 (0.0006) [2023-03-06 23:43:45,975][81400] Updated weights for policy 0, policy_version 26020 (0.0006) [2023-03-06 23:43:46,236][81074] Fps is (10 sec: 13414.5, 60 sec: 13380.3, 300 sec: 13253.0). Total num frames: 26647552. Throughput: 0: 13386.3. Samples: 26646336. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:43:46,237][81074] Avg episode reward: [(0, '1187.182')] [2023-03-06 23:43:46,748][81400] Updated weights for policy 0, policy_version 26030 (0.0006) [2023-03-06 23:43:47,517][81400] Updated weights for policy 0, policy_version 26040 (0.0007) [2023-03-06 23:43:48,277][81400] Updated weights for policy 0, policy_version 26050 (0.0006) [2023-03-06 23:43:49,028][81400] Updated weights for policy 0, policy_version 26060 (0.0006) [2023-03-06 23:43:49,783][81400] Updated weights for policy 0, policy_version 26070 (0.0005) [2023-03-06 23:43:50,557][81400] Updated weights for policy 0, policy_version 26080 (0.0006) [2023-03-06 23:43:51,236][81074] Fps is (10 sec: 13312.1, 60 sec: 13363.2, 300 sec: 13253.0). Total num frames: 26714112. Throughput: 0: 13392.2. Samples: 26686553. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:43:51,237][81074] Avg episode reward: [(0, '1165.186')] [2023-03-06 23:43:51,320][81400] Updated weights for policy 0, policy_version 26090 (0.0006) [2023-03-06 23:43:52,098][81400] Updated weights for policy 0, policy_version 26100 (0.0006) [2023-03-06 23:43:52,859][81400] Updated weights for policy 0, policy_version 26110 (0.0007) [2023-03-06 23:43:53,625][81400] Updated weights for policy 0, policy_version 26120 (0.0006) [2023-03-06 23:43:54,399][81400] Updated weights for policy 0, policy_version 26130 (0.0005) [2023-03-06 23:43:55,166][81400] Updated weights for policy 0, policy_version 26140 (0.0005) [2023-03-06 23:43:55,920][81400] Updated weights for policy 0, policy_version 26150 (0.0006) [2023-03-06 23:43:56,236][81074] Fps is (10 sec: 13414.4, 60 sec: 13380.3, 300 sec: 13259.9). Total num frames: 26781696. Throughput: 0: 13382.9. Samples: 26766643. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:43:56,237][81074] Avg episode reward: [(0, '1150.314')] [2023-03-06 23:43:56,242][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000026154_26781696.pth... [2023-03-06 23:43:56,272][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000023042_23595008.pth [2023-03-06 23:43:56,693][81400] Updated weights for policy 0, policy_version 26160 (0.0005) [2023-03-06 23:43:57,472][81400] Updated weights for policy 0, policy_version 26170 (0.0006) [2023-03-06 23:43:58,235][81400] Updated weights for policy 0, policy_version 26180 (0.0006) [2023-03-06 23:43:59,007][81400] Updated weights for policy 0, policy_version 26190 (0.0006) [2023-03-06 23:43:59,768][81400] Updated weights for policy 0, policy_version 26200 (0.0006) [2023-03-06 23:44:00,537][81400] Updated weights for policy 0, policy_version 26210 (0.0006) [2023-03-06 23:44:01,236][81074] Fps is (10 sec: 13311.9, 60 sec: 13363.2, 300 sec: 13259.9). Total num frames: 26847232. Throughput: 0: 13367.4. Samples: 26846479. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:44:01,237][81074] Avg episode reward: [(0, '1190.235')] [2023-03-06 23:44:01,305][81400] Updated weights for policy 0, policy_version 26220 (0.0006) [2023-03-06 23:44:02,095][81400] Updated weights for policy 0, policy_version 26230 (0.0006) [2023-03-06 23:44:02,862][81400] Updated weights for policy 0, policy_version 26240 (0.0006) [2023-03-06 23:44:03,632][81400] Updated weights for policy 0, policy_version 26250 (0.0006) [2023-03-06 23:44:04,396][81400] Updated weights for policy 0, policy_version 26260 (0.0006) [2023-03-06 23:44:05,151][81400] Updated weights for policy 0, policy_version 26270 (0.0005) [2023-03-06 23:44:05,941][81400] Updated weights for policy 0, policy_version 26280 (0.0007) [2023-03-06 23:44:06,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13363.2, 300 sec: 13259.9). Total num frames: 26913792. Throughput: 0: 13356.9. Samples: 26886398. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:44:06,237][81074] Avg episode reward: [(0, '1206.526')] [2023-03-06 23:44:06,705][81400] Updated weights for policy 0, policy_version 26290 (0.0006) [2023-03-06 23:44:07,480][81400] Updated weights for policy 0, policy_version 26300 (0.0005) [2023-03-06 23:44:08,259][81400] Updated weights for policy 0, policy_version 26310 (0.0006) [2023-03-06 23:44:09,027][81400] Updated weights for policy 0, policy_version 26320 (0.0006) [2023-03-06 23:44:09,786][81400] Updated weights for policy 0, policy_version 26330 (0.0006) [2023-03-06 23:44:10,548][81400] Updated weights for policy 0, policy_version 26340 (0.0006) [2023-03-06 23:44:11,236][81074] Fps is (10 sec: 13414.4, 60 sec: 13363.2, 300 sec: 13266.9). Total num frames: 26981376. Throughput: 0: 13344.3. Samples: 26966202. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:44:11,237][81074] Avg episode reward: [(0, '1166.557')] [2023-03-06 23:44:11,304][81400] Updated weights for policy 0, policy_version 26350 (0.0006) [2023-03-06 23:44:12,065][81400] Updated weights for policy 0, policy_version 26360 (0.0006) [2023-03-06 23:44:12,838][81400] Updated weights for policy 0, policy_version 26370 (0.0006) [2023-03-06 23:44:13,617][81400] Updated weights for policy 0, policy_version 26380 (0.0006) [2023-03-06 23:44:14,358][81400] Updated weights for policy 0, policy_version 26390 (0.0006) [2023-03-06 23:44:15,125][81400] Updated weights for policy 0, policy_version 26400 (0.0006) [2023-03-06 23:44:15,895][81400] Updated weights for policy 0, policy_version 26410 (0.0006) [2023-03-06 23:44:16,236][81074] Fps is (10 sec: 13414.1, 60 sec: 13363.2, 300 sec: 13270.3). Total num frames: 27047936. Throughput: 0: 13351.3. Samples: 27046517. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:44:16,237][81074] Avg episode reward: [(0, '1248.514')] [2023-03-06 23:44:16,674][81400] Updated weights for policy 0, policy_version 26420 (0.0006) [2023-03-06 23:44:17,434][81400] Updated weights for policy 0, policy_version 26430 (0.0006) [2023-03-06 23:44:18,197][81400] Updated weights for policy 0, policy_version 26440 (0.0006) [2023-03-06 23:44:18,973][81400] Updated weights for policy 0, policy_version 26450 (0.0006) [2023-03-06 23:44:19,758][81400] Updated weights for policy 0, policy_version 26460 (0.0005) [2023-03-06 23:44:20,510][81400] Updated weights for policy 0, policy_version 26470 (0.0007) [2023-03-06 23:44:21,236][81074] Fps is (10 sec: 13312.1, 60 sec: 13346.1, 300 sec: 13270.3). Total num frames: 27114496. Throughput: 0: 13350.1. Samples: 27086692. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:44:21,237][81074] Avg episode reward: [(0, '1215.826')] [2023-03-06 23:44:21,293][81400] Updated weights for policy 0, policy_version 26480 (0.0006) [2023-03-06 23:44:22,045][81400] Updated weights for policy 0, policy_version 26490 (0.0005) [2023-03-06 23:44:22,814][81400] Updated weights for policy 0, policy_version 26500 (0.0006) [2023-03-06 23:44:23,592][81400] Updated weights for policy 0, policy_version 26510 (0.0007) [2023-03-06 23:44:24,349][81400] Updated weights for policy 0, policy_version 26520 (0.0006) [2023-03-06 23:44:25,112][81400] Updated weights for policy 0, policy_version 26530 (0.0006) [2023-03-06 23:44:25,863][81400] Updated weights for policy 0, policy_version 26540 (0.0006) [2023-03-06 23:44:26,236][81074] Fps is (10 sec: 13312.1, 60 sec: 13346.1, 300 sec: 13273.8). Total num frames: 27181056. Throughput: 0: 13354.7. Samples: 27166689. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:44:26,237][81074] Avg episode reward: [(0, '1226.647')] [2023-03-06 23:44:26,640][81400] Updated weights for policy 0, policy_version 26550 (0.0007) [2023-03-06 23:44:27,407][81400] Updated weights for policy 0, policy_version 26560 (0.0005) [2023-03-06 23:44:28,182][81400] Updated weights for policy 0, policy_version 26570 (0.0005) [2023-03-06 23:44:28,939][81400] Updated weights for policy 0, policy_version 26580 (0.0006) [2023-03-06 23:44:29,694][81400] Updated weights for policy 0, policy_version 26590 (0.0006) [2023-03-06 23:44:30,453][81400] Updated weights for policy 0, policy_version 26600 (0.0006) [2023-03-06 23:44:31,229][81400] Updated weights for policy 0, policy_version 26610 (0.0006) [2023-03-06 23:44:31,236][81074] Fps is (10 sec: 13414.5, 60 sec: 13363.2, 300 sec: 13280.8). Total num frames: 27248640. Throughput: 0: 13348.3. Samples: 27247010. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:44:31,237][81074] Avg episode reward: [(0, '1305.532')] [2023-03-06 23:44:32,005][81400] Updated weights for policy 0, policy_version 26620 (0.0006) [2023-03-06 23:44:32,757][81400] Updated weights for policy 0, policy_version 26630 (0.0006) [2023-03-06 23:44:33,517][81400] Updated weights for policy 0, policy_version 26640 (0.0007) [2023-03-06 23:44:34,285][81400] Updated weights for policy 0, policy_version 26650 (0.0006) [2023-03-06 23:44:35,066][81400] Updated weights for policy 0, policy_version 26660 (0.0006) [2023-03-06 23:44:35,823][81400] Updated weights for policy 0, policy_version 26670 (0.0006) [2023-03-06 23:44:36,236][81074] Fps is (10 sec: 13414.3, 60 sec: 13363.2, 300 sec: 13284.2). Total num frames: 27315200. Throughput: 0: 13348.5. Samples: 27287238. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:44:36,237][81074] Avg episode reward: [(0, '1194.241')] [2023-03-06 23:44:36,586][81400] Updated weights for policy 0, policy_version 26680 (0.0006) [2023-03-06 23:44:37,353][81400] Updated weights for policy 0, policy_version 26690 (0.0005) [2023-03-06 23:44:38,113][81400] Updated weights for policy 0, policy_version 26700 (0.0007) [2023-03-06 23:44:38,895][81400] Updated weights for policy 0, policy_version 26710 (0.0006) [2023-03-06 23:44:39,655][81400] Updated weights for policy 0, policy_version 26720 (0.0006) [2023-03-06 23:44:40,428][81400] Updated weights for policy 0, policy_version 26730 (0.0007) [2023-03-06 23:44:41,191][81400] Updated weights for policy 0, policy_version 26740 (0.0005) [2023-03-06 23:44:41,236][81074] Fps is (10 sec: 13311.9, 60 sec: 13346.2, 300 sec: 13284.2). Total num frames: 27381760. Throughput: 0: 13344.0. Samples: 27367125. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:44:41,237][81074] Avg episode reward: [(0, '1148.696')] [2023-03-06 23:44:41,944][81400] Updated weights for policy 0, policy_version 26750 (0.0006) [2023-03-06 23:44:42,733][81400] Updated weights for policy 0, policy_version 26760 (0.0006) [2023-03-06 23:44:43,498][81400] Updated weights for policy 0, policy_version 26770 (0.0006) [2023-03-06 23:44:44,253][81400] Updated weights for policy 0, policy_version 26780 (0.0006) [2023-03-06 23:44:45,045][81400] Updated weights for policy 0, policy_version 26790 (0.0005) [2023-03-06 23:44:45,793][81400] Updated weights for policy 0, policy_version 26800 (0.0005) [2023-03-06 23:44:46,236][81074] Fps is (10 sec: 13312.1, 60 sec: 13346.1, 300 sec: 13287.7). Total num frames: 27448320. Throughput: 0: 13350.0. Samples: 27447228. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:44:46,237][81074] Avg episode reward: [(0, '1107.910')] [2023-03-06 23:44:46,575][81400] Updated weights for policy 0, policy_version 26810 (0.0007) [2023-03-06 23:44:47,344][81400] Updated weights for policy 0, policy_version 26820 (0.0006) [2023-03-06 23:44:48,104][81400] Updated weights for policy 0, policy_version 26830 (0.0006) [2023-03-06 23:44:48,855][81400] Updated weights for policy 0, policy_version 26840 (0.0006) [2023-03-06 23:44:49,628][81400] Updated weights for policy 0, policy_version 26850 (0.0006) [2023-03-06 23:44:50,394][81400] Updated weights for policy 0, policy_version 26860 (0.0006) [2023-03-06 23:44:51,165][81400] Updated weights for policy 0, policy_version 26870 (0.0005) [2023-03-06 23:44:51,236][81074] Fps is (10 sec: 13311.8, 60 sec: 13346.1, 300 sec: 13287.7). Total num frames: 27514880. Throughput: 0: 13352.3. Samples: 27487256. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:44:51,248][81074] Avg episode reward: [(0, '1136.454')] [2023-03-06 23:44:51,957][81400] Updated weights for policy 0, policy_version 26880 (0.0007) [2023-03-06 23:44:52,703][81400] Updated weights for policy 0, policy_version 26890 (0.0005) [2023-03-06 23:44:53,479][81400] Updated weights for policy 0, policy_version 26900 (0.0005) [2023-03-06 23:44:54,234][81400] Updated weights for policy 0, policy_version 26910 (0.0006) [2023-03-06 23:44:54,997][81400] Updated weights for policy 0, policy_version 26920 (0.0007) [2023-03-06 23:44:55,778][81400] Updated weights for policy 0, policy_version 26930 (0.0006) [2023-03-06 23:44:56,236][81074] Fps is (10 sec: 13312.1, 60 sec: 13329.1, 300 sec: 13287.7). Total num frames: 27581440. Throughput: 0: 13362.7. Samples: 27567520. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:44:56,247][81074] Avg episode reward: [(0, '1141.205')] [2023-03-06 23:44:56,546][81400] Updated weights for policy 0, policy_version 26940 (0.0006) [2023-03-06 23:44:57,309][81400] Updated weights for policy 0, policy_version 26950 (0.0006) [2023-03-06 23:44:58,073][81400] Updated weights for policy 0, policy_version 26960 (0.0006) [2023-03-06 23:44:58,845][81400] Updated weights for policy 0, policy_version 26970 (0.0006) [2023-03-06 23:44:59,602][81400] Updated weights for policy 0, policy_version 26980 (0.0006) [2023-03-06 23:45:00,362][81400] Updated weights for policy 0, policy_version 26990 (0.0006) [2023-03-06 23:45:01,118][81400] Updated weights for policy 0, policy_version 27000 (0.0005) [2023-03-06 23:45:01,236][81074] Fps is (10 sec: 13414.5, 60 sec: 13363.2, 300 sec: 13294.6). Total num frames: 27649024. Throughput: 0: 13360.9. Samples: 27647755. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:45:01,248][81074] Avg episode reward: [(0, '1163.649')] [2023-03-06 23:45:01,874][81400] Updated weights for policy 0, policy_version 27010 (0.0005) [2023-03-06 23:45:02,661][81400] Updated weights for policy 0, policy_version 27020 (0.0006) [2023-03-06 23:45:03,406][81400] Updated weights for policy 0, policy_version 27030 (0.0005) [2023-03-06 23:45:04,173][81400] Updated weights for policy 0, policy_version 27040 (0.0006) [2023-03-06 23:45:04,921][81400] Updated weights for policy 0, policy_version 27050 (0.0006) [2023-03-06 23:45:05,689][81400] Updated weights for policy 0, policy_version 27060 (0.0006) [2023-03-06 23:45:06,236][81074] Fps is (10 sec: 13516.8, 60 sec: 13380.3, 300 sec: 13298.1). Total num frames: 27716608. Throughput: 0: 13362.5. Samples: 27688002. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:45:06,247][81074] Avg episode reward: [(0, '1097.151')] [2023-03-06 23:45:06,457][81400] Updated weights for policy 0, policy_version 27070 (0.0005) [2023-03-06 23:45:07,226][81400] Updated weights for policy 0, policy_version 27080 (0.0006) [2023-03-06 23:45:07,993][81400] Updated weights for policy 0, policy_version 27090 (0.0006) [2023-03-06 23:45:08,768][81400] Updated weights for policy 0, policy_version 27100 (0.0007) [2023-03-06 23:45:09,526][81400] Updated weights for policy 0, policy_version 27110 (0.0005) [2023-03-06 23:45:10,313][81400] Updated weights for policy 0, policy_version 27120 (0.0005) [2023-03-06 23:45:11,070][81400] Updated weights for policy 0, policy_version 27130 (0.0006) [2023-03-06 23:45:11,236][81074] Fps is (10 sec: 13414.5, 60 sec: 13363.2, 300 sec: 13301.6). Total num frames: 27783168. Throughput: 0: 13372.1. Samples: 27768431. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:45:11,237][81074] Avg episode reward: [(0, '1130.099')] [2023-03-06 23:45:11,846][81400] Updated weights for policy 0, policy_version 27140 (0.0005) [2023-03-06 23:45:12,613][81400] Updated weights for policy 0, policy_version 27150 (0.0006) [2023-03-06 23:45:13,389][81400] Updated weights for policy 0, policy_version 27160 (0.0006) [2023-03-06 23:45:14,168][81400] Updated weights for policy 0, policy_version 27170 (0.0007) [2023-03-06 23:45:14,926][81400] Updated weights for policy 0, policy_version 27180 (0.0006) [2023-03-06 23:45:15,686][81400] Updated weights for policy 0, policy_version 27190 (0.0006) [2023-03-06 23:45:16,237][81074] Fps is (10 sec: 13311.5, 60 sec: 13363.2, 300 sec: 13301.6). Total num frames: 27849728. Throughput: 0: 13357.7. Samples: 27848110. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:45:16,237][81074] Avg episode reward: [(0, '1102.180')] [2023-03-06 23:45:16,458][81400] Updated weights for policy 0, policy_version 27200 (0.0006) [2023-03-06 23:45:17,245][81400] Updated weights for policy 0, policy_version 27210 (0.0006) [2023-03-06 23:45:18,006][81400] Updated weights for policy 0, policy_version 27220 (0.0006) [2023-03-06 23:45:18,750][81400] Updated weights for policy 0, policy_version 27230 (0.0006) [2023-03-06 23:45:19,542][81400] Updated weights for policy 0, policy_version 27240 (0.0006) [2023-03-06 23:45:20,300][81400] Updated weights for policy 0, policy_version 27250 (0.0006) [2023-03-06 23:45:21,063][81400] Updated weights for policy 0, policy_version 27260 (0.0006) [2023-03-06 23:45:21,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13363.2, 300 sec: 13301.6). Total num frames: 27916288. Throughput: 0: 13352.9. Samples: 27888119. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:45:21,237][81074] Avg episode reward: [(0, '1197.138')] [2023-03-06 23:45:21,831][81400] Updated weights for policy 0, policy_version 27270 (0.0006) [2023-03-06 23:45:22,581][81400] Updated weights for policy 0, policy_version 27280 (0.0006) [2023-03-06 23:45:23,343][81400] Updated weights for policy 0, policy_version 27290 (0.0006) [2023-03-06 23:45:24,121][81400] Updated weights for policy 0, policy_version 27300 (0.0006) [2023-03-06 23:45:24,889][81400] Updated weights for policy 0, policy_version 27310 (0.0006) [2023-03-06 23:45:25,650][81400] Updated weights for policy 0, policy_version 27320 (0.0006) [2023-03-06 23:45:26,236][81074] Fps is (10 sec: 13312.3, 60 sec: 13363.2, 300 sec: 13305.1). Total num frames: 27982848. Throughput: 0: 13361.4. Samples: 27968390. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:45:26,237][81074] Avg episode reward: [(0, '1193.904')] [2023-03-06 23:45:26,425][81400] Updated weights for policy 0, policy_version 27330 (0.0006) [2023-03-06 23:45:27,199][81400] Updated weights for policy 0, policy_version 27340 (0.0006) [2023-03-06 23:45:27,961][81400] Updated weights for policy 0, policy_version 27350 (0.0007) [2023-03-06 23:45:28,726][81400] Updated weights for policy 0, policy_version 27360 (0.0006) [2023-03-06 23:45:29,510][81400] Updated weights for policy 0, policy_version 27370 (0.0006) [2023-03-06 23:45:30,270][81400] Updated weights for policy 0, policy_version 27380 (0.0006) [2023-03-06 23:45:31,024][81400] Updated weights for policy 0, policy_version 27390 (0.0005) [2023-03-06 23:45:31,236][81074] Fps is (10 sec: 13312.1, 60 sec: 13346.1, 300 sec: 13305.1). Total num frames: 28049408. Throughput: 0: 13359.7. Samples: 28048414. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 23:45:31,237][81074] Avg episode reward: [(0, '1141.032')] [2023-03-06 23:45:31,801][81400] Updated weights for policy 0, policy_version 27400 (0.0005) [2023-03-06 23:45:32,569][81400] Updated weights for policy 0, policy_version 27410 (0.0006) [2023-03-06 23:45:33,318][81400] Updated weights for policy 0, policy_version 27420 (0.0006) [2023-03-06 23:45:34,089][81400] Updated weights for policy 0, policy_version 27430 (0.0006) [2023-03-06 23:45:34,841][81400] Updated weights for policy 0, policy_version 27440 (0.0005) [2023-03-06 23:45:35,605][81400] Updated weights for policy 0, policy_version 27450 (0.0005) [2023-03-06 23:45:36,236][81074] Fps is (10 sec: 13414.6, 60 sec: 13363.2, 300 sec: 13312.0). Total num frames: 28116992. Throughput: 0: 13362.7. Samples: 28088575. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 23:45:36,237][81074] Avg episode reward: [(0, '1134.789')] [2023-03-06 23:45:36,354][81400] Updated weights for policy 0, policy_version 27460 (0.0006) [2023-03-06 23:45:37,138][81400] Updated weights for policy 0, policy_version 27470 (0.0006) [2023-03-06 23:45:37,904][81400] Updated weights for policy 0, policy_version 27480 (0.0005) [2023-03-06 23:45:38,681][81400] Updated weights for policy 0, policy_version 27490 (0.0007) [2023-03-06 23:45:39,445][81400] Updated weights for policy 0, policy_version 27500 (0.0006) [2023-03-06 23:45:40,219][81400] Updated weights for policy 0, policy_version 27510 (0.0007) [2023-03-06 23:45:40,977][81400] Updated weights for policy 0, policy_version 27520 (0.0006) [2023-03-06 23:45:41,236][81074] Fps is (10 sec: 13414.4, 60 sec: 13363.2, 300 sec: 13312.0). Total num frames: 28183552. Throughput: 0: 13361.2. Samples: 28168775. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 23:45:41,237][81074] Avg episode reward: [(0, '1193.936')] [2023-03-06 23:45:41,758][81400] Updated weights for policy 0, policy_version 27530 (0.0005) [2023-03-06 23:45:42,518][81400] Updated weights for policy 0, policy_version 27540 (0.0007) [2023-03-06 23:45:43,290][81400] Updated weights for policy 0, policy_version 27550 (0.0005) [2023-03-06 23:45:44,079][81400] Updated weights for policy 0, policy_version 27560 (0.0006) [2023-03-06 23:45:44,842][81400] Updated weights for policy 0, policy_version 27570 (0.0008) [2023-03-06 23:45:45,614][81400] Updated weights for policy 0, policy_version 27580 (0.0006) [2023-03-06 23:45:46,236][81074] Fps is (10 sec: 13311.8, 60 sec: 13363.2, 300 sec: 13315.5). Total num frames: 28250112. Throughput: 0: 13346.6. Samples: 28248353. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 23:45:46,237][81074] Avg episode reward: [(0, '1241.389')] [2023-03-06 23:45:46,401][81400] Updated weights for policy 0, policy_version 27590 (0.0006) [2023-03-06 23:45:47,169][81400] Updated weights for policy 0, policy_version 27600 (0.0005) [2023-03-06 23:45:47,946][81400] Updated weights for policy 0, policy_version 27610 (0.0006) [2023-03-06 23:45:48,714][81400] Updated weights for policy 0, policy_version 27620 (0.0007) [2023-03-06 23:45:49,480][81400] Updated weights for policy 0, policy_version 27630 (0.0006) [2023-03-06 23:45:50,270][81400] Updated weights for policy 0, policy_version 27640 (0.0006) [2023-03-06 23:45:51,038][81400] Updated weights for policy 0, policy_version 27650 (0.0005) [2023-03-06 23:45:51,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13346.1, 300 sec: 13312.0). Total num frames: 28315648. Throughput: 0: 13334.8. Samples: 28288072. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:45:51,237][81074] Avg episode reward: [(0, '1190.230')] [2023-03-06 23:45:51,813][81400] Updated weights for policy 0, policy_version 27660 (0.0006) [2023-03-06 23:45:52,573][81400] Updated weights for policy 0, policy_version 27670 (0.0005) [2023-03-06 23:45:53,356][81400] Updated weights for policy 0, policy_version 27680 (0.0006) [2023-03-06 23:45:54,110][81400] Updated weights for policy 0, policy_version 27690 (0.0006) [2023-03-06 23:45:54,884][81400] Updated weights for policy 0, policy_version 27700 (0.0006) [2023-03-06 23:45:55,680][81400] Updated weights for policy 0, policy_version 27710 (0.0007) [2023-03-06 23:45:56,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13346.1, 300 sec: 13312.0). Total num frames: 28382208. Throughput: 0: 13315.0. Samples: 28367608. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:45:56,237][81074] Avg episode reward: [(0, '1207.136')] [2023-03-06 23:45:56,241][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000027717_28382208.pth... 
[2023-03-06 23:45:56,272][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000024593_25183232.pth [2023-03-06 23:45:56,439][81400] Updated weights for policy 0, policy_version 27720 (0.0006) [2023-03-06 23:45:57,193][81400] Updated weights for policy 0, policy_version 27730 (0.0005) [2023-03-06 23:45:57,971][81400] Updated weights for policy 0, policy_version 27740 (0.0006) [2023-03-06 23:45:58,736][81400] Updated weights for policy 0, policy_version 27750 (0.0006) [2023-03-06 23:45:59,493][81400] Updated weights for policy 0, policy_version 27760 (0.0007) [2023-03-06 23:46:00,265][81400] Updated weights for policy 0, policy_version 27770 (0.0005) [2023-03-06 23:46:01,031][81400] Updated weights for policy 0, policy_version 27780 (0.0006) [2023-03-06 23:46:01,236][81074] Fps is (10 sec: 13312.3, 60 sec: 13329.1, 300 sec: 13312.0). Total num frames: 28448768. Throughput: 0: 13328.8. Samples: 28447903. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:46:01,237][81074] Avg episode reward: [(0, '1288.046')] [2023-03-06 23:46:01,774][81400] Updated weights for policy 0, policy_version 27790 (0.0007) [2023-03-06 23:46:02,552][81400] Updated weights for policy 0, policy_version 27800 (0.0005) [2023-03-06 23:46:03,320][81400] Updated weights for policy 0, policy_version 27810 (0.0006) [2023-03-06 23:46:04,075][81400] Updated weights for policy 0, policy_version 27820 (0.0006) [2023-03-06 23:46:04,841][81400] Updated weights for policy 0, policy_version 27830 (0.0006) [2023-03-06 23:46:05,621][81400] Updated weights for policy 0, policy_version 27840 (0.0007) [2023-03-06 23:46:06,236][81074] Fps is (10 sec: 13414.6, 60 sec: 13329.1, 300 sec: 13318.9). Total num frames: 28516352. Throughput: 0: 13333.0. Samples: 28488104. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:46:06,237][81074] Avg episode reward: [(0, '1289.835')] [2023-03-06 23:46:06,385][81400] Updated weights for policy 0, policy_version 27850 (0.0007) [2023-03-06 23:46:07,161][81400] Updated weights for policy 0, policy_version 27860 (0.0006) [2023-03-06 23:46:07,932][81400] Updated weights for policy 0, policy_version 27870 (0.0005) [2023-03-06 23:46:08,697][81400] Updated weights for policy 0, policy_version 27880 (0.0006) [2023-03-06 23:46:09,473][81400] Updated weights for policy 0, policy_version 27890 (0.0005) [2023-03-06 23:46:10,241][81400] Updated weights for policy 0, policy_version 27900 (0.0006) [2023-03-06 23:46:11,005][81400] Updated weights for policy 0, policy_version 27910 (0.0005) [2023-03-06 23:46:11,236][81074] Fps is (10 sec: 13311.8, 60 sec: 13312.0, 300 sec: 13315.5). Total num frames: 28581888. Throughput: 0: 13321.3. Samples: 28567851. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 23:46:11,237][81074] Avg episode reward: [(0, '1275.747')] [2023-03-06 23:46:11,785][81400] Updated weights for policy 0, policy_version 27920 (0.0006) [2023-03-06 23:46:12,551][81400] Updated weights for policy 0, policy_version 27930 (0.0006) [2023-03-06 23:46:13,331][81400] Updated weights for policy 0, policy_version 27940 (0.0006) [2023-03-06 23:46:14,087][81400] Updated weights for policy 0, policy_version 27950 (0.0007) [2023-03-06 23:46:14,891][81400] Updated weights for policy 0, policy_version 27960 (0.0006) [2023-03-06 23:46:15,641][81400] Updated weights for policy 0, policy_version 27970 (0.0006) [2023-03-06 23:46:16,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13312.1, 300 sec: 13315.5). Total num frames: 28648448. Throughput: 0: 13313.1. Samples: 28647505. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 23:46:16,237][81074] Avg episode reward: [(0, '1230.253')] [2023-03-06 23:46:16,415][81400] Updated weights for policy 0, policy_version 27980 (0.0007) [2023-03-06 23:46:17,189][81400] Updated weights for policy 0, policy_version 27990 (0.0006) [2023-03-06 23:46:17,967][81400] Updated weights for policy 0, policy_version 28000 (0.0006) [2023-03-06 23:46:18,737][81400] Updated weights for policy 0, policy_version 28010 (0.0006) [2023-03-06 23:46:19,504][81400] Updated weights for policy 0, policy_version 28020 (0.0006) [2023-03-06 23:46:20,284][81400] Updated weights for policy 0, policy_version 28030 (0.0007) [2023-03-06 23:46:21,051][81400] Updated weights for policy 0, policy_version 28040 (0.0006) [2023-03-06 23:46:21,236][81074] Fps is (10 sec: 13312.1, 60 sec: 13312.0, 300 sec: 13318.9). Total num frames: 28715008. Throughput: 0: 13303.8. Samples: 28687247. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 23:46:21,237][81074] Avg episode reward: [(0, '1213.371')] [2023-03-06 23:46:21,821][81400] Updated weights for policy 0, policy_version 28050 (0.0006) [2023-03-06 23:46:22,584][81400] Updated weights for policy 0, policy_version 28060 (0.0006) [2023-03-06 23:46:23,368][81400] Updated weights for policy 0, policy_version 28070 (0.0006) [2023-03-06 23:46:24,134][81400] Updated weights for policy 0, policy_version 28080 (0.0006) [2023-03-06 23:46:24,918][81400] Updated weights for policy 0, policy_version 28090 (0.0007) [2023-03-06 23:46:25,688][81400] Updated weights for policy 0, policy_version 28100 (0.0006) [2023-03-06 23:46:26,236][81074] Fps is (10 sec: 13311.8, 60 sec: 13312.0, 300 sec: 13318.9). Total num frames: 28781568. Throughput: 0: 13285.8. Samples: 28766639. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:46:26,237][81074] Avg episode reward: [(0, '1234.703')] [2023-03-06 23:46:26,460][81400] Updated weights for policy 0, policy_version 28110 (0.0006) [2023-03-06 23:46:27,233][81400] Updated weights for policy 0, policy_version 28120 (0.0006) [2023-03-06 23:46:28,011][81400] Updated weights for policy 0, policy_version 28130 (0.0006) [2023-03-06 23:46:28,774][81400] Updated weights for policy 0, policy_version 28140 (0.0007) [2023-03-06 23:46:29,539][81400] Updated weights for policy 0, policy_version 28150 (0.0006) [2023-03-06 23:46:30,316][81400] Updated weights for policy 0, policy_version 28160 (0.0006) [2023-03-06 23:46:31,109][81400] Updated weights for policy 0, policy_version 28170 (0.0006) [2023-03-06 23:46:31,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13294.9, 300 sec: 13318.9). Total num frames: 28847104. Throughput: 0: 13288.4. Samples: 28846328. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:46:31,237][81074] Avg episode reward: [(0, '1303.017')] [2023-03-06 23:46:31,873][81400] Updated weights for policy 0, policy_version 28180 (0.0006) [2023-03-06 23:46:32,643][81400] Updated weights for policy 0, policy_version 28190 (0.0006) [2023-03-06 23:46:33,413][81400] Updated weights for policy 0, policy_version 28200 (0.0006) [2023-03-06 23:46:33,709][81349] KL-divergence is very high: 152.8921 [2023-03-06 23:46:34,181][81400] Updated weights for policy 0, policy_version 28210 (0.0006) [2023-03-06 23:46:34,959][81400] Updated weights for policy 0, policy_version 28220 (0.0006) [2023-03-06 23:46:35,738][81400] Updated weights for policy 0, policy_version 28230 (0.0006) [2023-03-06 23:46:36,236][81074] Fps is (10 sec: 13209.8, 60 sec: 13277.9, 300 sec: 13318.9). Total num frames: 28913664. Throughput: 0: 13289.0. Samples: 28886075. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:46:36,237][81074] Avg episode reward: [(0, '1246.082')] [2023-03-06 23:46:36,515][81400] Updated weights for policy 0, policy_version 28240 (0.0006) [2023-03-06 23:46:37,271][81400] Updated weights for policy 0, policy_version 28250 (0.0006) [2023-03-06 23:46:38,039][81400] Updated weights for policy 0, policy_version 28260 (0.0006) [2023-03-06 23:46:38,820][81400] Updated weights for policy 0, policy_version 28270 (0.0006) [2023-03-06 23:46:39,592][81400] Updated weights for policy 0, policy_version 28280 (0.0005) [2023-03-06 23:46:40,345][81400] Updated weights for policy 0, policy_version 28290 (0.0006) [2023-03-06 23:46:41,121][81400] Updated weights for policy 0, policy_version 28300 (0.0007) [2023-03-06 23:46:41,236][81074] Fps is (10 sec: 13311.9, 60 sec: 13277.9, 300 sec: 13322.4). Total num frames: 28980224. Throughput: 0: 13293.1. Samples: 28965797. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:46:41,237][81074] Avg episode reward: [(0, '1219.459')] [2023-03-06 23:46:41,901][81400] Updated weights for policy 0, policy_version 28310 (0.0006) [2023-03-06 23:46:42,676][81400] Updated weights for policy 0, policy_version 28320 (0.0006) [2023-03-06 23:46:43,426][81400] Updated weights for policy 0, policy_version 28330 (0.0005) [2023-03-06 23:46:44,208][81400] Updated weights for policy 0, policy_version 28340 (0.0006) [2023-03-06 23:46:44,964][81400] Updated weights for policy 0, policy_version 28350 (0.0006) [2023-03-06 23:46:45,733][81400] Updated weights for policy 0, policy_version 28360 (0.0007) [2023-03-06 23:46:46,236][81074] Fps is (10 sec: 13311.9, 60 sec: 13277.9, 300 sec: 13322.4). Total num frames: 29046784. Throughput: 0: 13284.6. Samples: 29045710. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:46:46,237][81074] Avg episode reward: [(0, '1256.601')] [2023-03-06 23:46:46,509][81400] Updated weights for policy 0, policy_version 28370 (0.0006) [2023-03-06 23:46:47,281][81400] Updated weights for policy 0, policy_version 28380 (0.0006) [2023-03-06 23:46:48,071][81400] Updated weights for policy 0, policy_version 28390 (0.0006) [2023-03-06 23:46:48,833][81400] Updated weights for policy 0, policy_version 28400 (0.0006) [2023-03-06 23:46:49,609][81400] Updated weights for policy 0, policy_version 28410 (0.0006) [2023-03-06 23:46:50,378][81400] Updated weights for policy 0, policy_version 28420 (0.0005) [2023-03-06 23:46:51,142][81400] Updated weights for policy 0, policy_version 28430 (0.0007) [2023-03-06 23:46:51,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13295.0, 300 sec: 13322.4). Total num frames: 29113344. Throughput: 0: 13272.4. Samples: 29085364. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:46:51,237][81074] Avg episode reward: [(0, '1245.088')] [2023-03-06 23:46:51,924][81400] Updated weights for policy 0, policy_version 28440 (0.0006) [2023-03-06 23:46:52,694][81400] Updated weights for policy 0, policy_version 28450 (0.0007) [2023-03-06 23:46:53,471][81400] Updated weights for policy 0, policy_version 28460 (0.0006) [2023-03-06 23:46:54,271][81400] Updated weights for policy 0, policy_version 28470 (0.0006) [2023-03-06 23:46:55,032][81400] Updated weights for policy 0, policy_version 28480 (0.0007) [2023-03-06 23:46:55,805][81400] Updated weights for policy 0, policy_version 28490 (0.0007) [2023-03-06 23:46:56,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13277.9, 300 sec: 13322.4). Total num frames: 29178880. Throughput: 0: 13261.8. Samples: 29164633. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:46:56,237][81074] Avg episode reward: [(0, '1282.738')] [2023-03-06 23:46:56,590][81400] Updated weights for policy 0, policy_version 28500 (0.0006) [2023-03-06 23:46:57,351][81400] Updated weights for policy 0, policy_version 28510 (0.0006) [2023-03-06 23:46:58,131][81400] Updated weights for policy 0, policy_version 28520 (0.0006) [2023-03-06 23:46:58,901][81400] Updated weights for policy 0, policy_version 28530 (0.0006) [2023-03-06 23:46:59,661][81400] Updated weights for policy 0, policy_version 28540 (0.0006) [2023-03-06 23:47:00,457][81400] Updated weights for policy 0, policy_version 28550 (0.0006) [2023-03-06 23:47:01,218][81400] Updated weights for policy 0, policy_version 28560 (0.0006) [2023-03-06 23:47:01,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13277.9, 300 sec: 13322.4). Total num frames: 29245440. Throughput: 0: 13257.7. Samples: 29244100. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:47:01,237][81074] Avg episode reward: [(0, '1259.115')] [2023-03-06 23:47:01,986][81400] Updated weights for policy 0, policy_version 28570 (0.0007) [2023-03-06 23:47:02,759][81400] Updated weights for policy 0, policy_version 28580 (0.0006) [2023-03-06 23:47:03,537][81400] Updated weights for policy 0, policy_version 28590 (0.0005) [2023-03-06 23:47:04,305][81400] Updated weights for policy 0, policy_version 28600 (0.0006) [2023-03-06 23:47:05,063][81400] Updated weights for policy 0, policy_version 28610 (0.0006) [2023-03-06 23:47:05,826][81400] Updated weights for policy 0, policy_version 28620 (0.0006) [2023-03-06 23:47:06,236][81074] Fps is (10 sec: 13311.9, 60 sec: 13260.8, 300 sec: 13322.4). Total num frames: 29312000. Throughput: 0: 13254.7. Samples: 29283709. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:47:06,237][81074] Avg episode reward: [(0, '1254.480')] [2023-03-06 23:47:06,619][81400] Updated weights for policy 0, policy_version 28630 (0.0006) [2023-03-06 23:47:07,383][81400] Updated weights for policy 0, policy_version 28640 (0.0005) [2023-03-06 23:47:08,142][81400] Updated weights for policy 0, policy_version 28650 (0.0006) [2023-03-06 23:47:08,929][81400] Updated weights for policy 0, policy_version 28660 (0.0007) [2023-03-06 23:47:09,702][81400] Updated weights for policy 0, policy_version 28670 (0.0006) [2023-03-06 23:47:10,479][81400] Updated weights for policy 0, policy_version 28680 (0.0006) [2023-03-06 23:47:11,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13318.9). Total num frames: 29377536. Throughput: 0: 13264.5. Samples: 29363538. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:47:11,237][81074] Avg episode reward: [(0, '1304.265')] [2023-03-06 23:47:11,250][81400] Updated weights for policy 0, policy_version 28690 (0.0006) [2023-03-06 23:47:12,015][81400] Updated weights for policy 0, policy_version 28700 (0.0006) [2023-03-06 23:47:12,777][81400] Updated weights for policy 0, policy_version 28710 (0.0006) [2023-03-06 23:47:13,563][81400] Updated weights for policy 0, policy_version 28720 (0.0006) [2023-03-06 23:47:14,330][81400] Updated weights for policy 0, policy_version 28730 (0.0006) [2023-03-06 23:47:15,090][81400] Updated weights for policy 0, policy_version 28740 (0.0006) [2023-03-06 23:47:15,876][81400] Updated weights for policy 0, policy_version 28750 (0.0006) [2023-03-06 23:47:16,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13260.7, 300 sec: 13322.4). Total num frames: 29444096. Throughput: 0: 13264.3. Samples: 29443222. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:47:16,237][81074] Avg episode reward: [(0, '1285.381')] [2023-03-06 23:47:16,624][81400] Updated weights for policy 0, policy_version 28760 (0.0006) [2023-03-06 23:47:17,406][81400] Updated weights for policy 0, policy_version 28770 (0.0006) [2023-03-06 23:47:18,169][81400] Updated weights for policy 0, policy_version 28780 (0.0005) [2023-03-06 23:47:18,941][81400] Updated weights for policy 0, policy_version 28790 (0.0006) [2023-03-06 23:47:19,722][81400] Updated weights for policy 0, policy_version 28800 (0.0006) [2023-03-06 23:47:20,497][81400] Updated weights for policy 0, policy_version 28810 (0.0006) [2023-03-06 23:47:21,236][81074] Fps is (10 sec: 13311.8, 60 sec: 13260.8, 300 sec: 13325.9). Total num frames: 29510656. Throughput: 0: 13267.9. Samples: 29483135. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:47:21,237][81074] Avg episode reward: [(0, '1250.233')] [2023-03-06 23:47:21,268][81400] Updated weights for policy 0, policy_version 28820 (0.0006) [2023-03-06 23:47:22,034][81400] Updated weights for policy 0, policy_version 28830 (0.0005) [2023-03-06 23:47:22,821][81400] Updated weights for policy 0, policy_version 28840 (0.0006) [2023-03-06 23:47:23,586][81400] Updated weights for policy 0, policy_version 28850 (0.0006) [2023-03-06 23:47:24,334][81400] Updated weights for policy 0, policy_version 28860 (0.0006) [2023-03-06 23:47:25,115][81400] Updated weights for policy 0, policy_version 28870 (0.0006) [2023-03-06 23:47:25,882][81400] Updated weights for policy 0, policy_version 28880 (0.0007) [2023-03-06 23:47:26,236][81074] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13325.9). Total num frames: 29577216. Throughput: 0: 13265.7. Samples: 29562756. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:47:26,237][81074] Avg episode reward: [(0, '1210.158')] [2023-03-06 23:47:26,653][81400] Updated weights for policy 0, policy_version 28890 (0.0006) [2023-03-06 23:47:27,441][81400] Updated weights for policy 0, policy_version 28900 (0.0007) [2023-03-06 23:47:28,225][81400] Updated weights for policy 0, policy_version 28910 (0.0006) [2023-03-06 23:47:28,989][81400] Updated weights for policy 0, policy_version 28920 (0.0005) [2023-03-06 23:47:29,749][81400] Updated weights for policy 0, policy_version 28930 (0.0007) [2023-03-06 23:47:30,522][81400] Updated weights for policy 0, policy_version 28940 (0.0006) [2023-03-06 23:47:31,236][81074] Fps is (10 sec: 13312.2, 60 sec: 13277.9, 300 sec: 13325.9). Total num frames: 29643776. Throughput: 0: 13258.0. Samples: 29642321. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:47:31,237][81074] Avg episode reward: [(0, '1209.174')] [2023-03-06 23:47:31,289][81400] Updated weights for policy 0, policy_version 28950 (0.0007) [2023-03-06 23:47:32,050][81400] Updated weights for policy 0, policy_version 28960 (0.0005) [2023-03-06 23:47:32,824][81400] Updated weights for policy 0, policy_version 28970 (0.0007) [2023-03-06 23:47:33,614][81400] Updated weights for policy 0, policy_version 28980 (0.0005) [2023-03-06 23:47:34,048][81349] KL-divergence is very high: 400.5302 [2023-03-06 23:47:34,373][81400] Updated weights for policy 0, policy_version 28990 (0.0006) [2023-03-06 23:47:35,142][81400] Updated weights for policy 0, policy_version 29000 (0.0006) [2023-03-06 23:47:35,909][81400] Updated weights for policy 0, policy_version 29010 (0.0007) [2023-03-06 23:47:36,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13277.8, 300 sec: 13329.4). Total num frames: 29710336. Throughput: 0: 13267.2. Samples: 29682389. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:47:36,237][81074] Avg episode reward: [(0, '1194.184')] [2023-03-06 23:47:36,667][81400] Updated weights for policy 0, policy_version 29020 (0.0006) [2023-03-06 23:47:37,459][81400] Updated weights for policy 0, policy_version 29030 (0.0006) [2023-03-06 23:47:38,222][81400] Updated weights for policy 0, policy_version 29040 (0.0006) [2023-03-06 23:47:38,983][81400] Updated weights for policy 0, policy_version 29050 (0.0006) [2023-03-06 23:47:39,761][81400] Updated weights for policy 0, policy_version 29060 (0.0006) [2023-03-06 23:47:40,524][81400] Updated weights for policy 0, policy_version 29070 (0.0006) [2023-03-06 23:47:41,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13277.9, 300 sec: 13329.4). Total num frames: 29776896. Throughput: 0: 13280.0. Samples: 29762231. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:47:41,237][81074] Avg episode reward: [(0, '1154.519')] [2023-03-06 23:47:41,297][81400] Updated weights for policy 0, policy_version 29080 (0.0007) [2023-03-06 23:47:42,072][81400] Updated weights for policy 0, policy_version 29090 (0.0007) [2023-03-06 23:47:42,837][81400] Updated weights for policy 0, policy_version 29100 (0.0006) [2023-03-06 23:47:43,601][81400] Updated weights for policy 0, policy_version 29110 (0.0006) [2023-03-06 23:47:44,365][81400] Updated weights for policy 0, policy_version 29120 (0.0006) [2023-03-06 23:47:45,145][81400] Updated weights for policy 0, policy_version 29130 (0.0006) [2023-03-06 23:47:45,915][81400] Updated weights for policy 0, policy_version 29140 (0.0006) [2023-03-06 23:47:46,236][81074] Fps is (10 sec: 13312.1, 60 sec: 13277.9, 300 sec: 13325.9). Total num frames: 29843456. Throughput: 0: 13284.9. Samples: 29841921. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:47:46,237][81074] Avg episode reward: [(0, '1180.683')] [2023-03-06 23:47:46,693][81400] Updated weights for policy 0, policy_version 29150 (0.0006) [2023-03-06 23:47:47,461][81400] Updated weights for policy 0, policy_version 29160 (0.0006) [2023-03-06 23:47:48,240][81400] Updated weights for policy 0, policy_version 29170 (0.0006) [2023-03-06 23:47:49,005][81400] Updated weights for policy 0, policy_version 29180 (0.0006) [2023-03-06 23:47:49,776][81400] Updated weights for policy 0, policy_version 29190 (0.0006) [2023-03-06 23:47:50,547][81400] Updated weights for policy 0, policy_version 29200 (0.0006) [2023-03-06 23:47:51,236][81074] Fps is (10 sec: 13311.9, 60 sec: 13277.9, 300 sec: 13325.9). Total num frames: 29910016. Throughput: 0: 13288.3. Samples: 29881682. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:47:51,237][81074] Avg episode reward: [(0, '1109.263')] [2023-03-06 23:47:51,291][81400] Updated weights for policy 0, policy_version 29210 (0.0005) [2023-03-06 23:47:52,072][81400] Updated weights for policy 0, policy_version 29220 (0.0007) [2023-03-06 23:47:52,839][81400] Updated weights for policy 0, policy_version 29230 (0.0006) [2023-03-06 23:47:53,598][81400] Updated weights for policy 0, policy_version 29240 (0.0006) [2023-03-06 23:47:54,347][81400] Updated weights for policy 0, policy_version 29250 (0.0006) [2023-03-06 23:47:55,124][81400] Updated weights for policy 0, policy_version 29260 (0.0006) [2023-03-06 23:47:55,899][81400] Updated weights for policy 0, policy_version 29270 (0.0006) [2023-03-06 23:47:56,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13295.0, 300 sec: 13325.9). Total num frames: 29976576. Throughput: 0: 13296.7. Samples: 29961887. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:47:56,237][81074] Avg episode reward: [(0, '1095.512')] [2023-03-06 23:47:56,241][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000029274_29976576.pth... [2023-03-06 23:47:56,269][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000026154_26781696.pth [2023-03-06 23:47:56,669][81400] Updated weights for policy 0, policy_version 29280 (0.0006) [2023-03-06 23:47:57,439][81400] Updated weights for policy 0, policy_version 29290 (0.0006) [2023-03-06 23:47:58,209][81400] Updated weights for policy 0, policy_version 29300 (0.0007) [2023-03-06 23:47:58,969][81400] Updated weights for policy 0, policy_version 29310 (0.0006) [2023-03-06 23:47:59,754][81400] Updated weights for policy 0, policy_version 29320 (0.0006) [2023-03-06 23:48:00,517][81400] Updated weights for policy 0, policy_version 29330 (0.0006) [2023-03-06 23:48:01,236][81074] Fps is (10 sec: 13311.9, 60 sec: 13294.9, 300 sec: 13325.9). Total num frames: 30043136. Throughput: 0: 13296.1. Samples: 30041544. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:48:01,237][81074] Avg episode reward: [(0, '1138.511')] [2023-03-06 23:48:01,305][81400] Updated weights for policy 0, policy_version 29340 (0.0007) [2023-03-06 23:48:02,054][81400] Updated weights for policy 0, policy_version 29350 (0.0006) [2023-03-06 23:48:02,841][81400] Updated weights for policy 0, policy_version 29360 (0.0006) [2023-03-06 23:48:03,617][81400] Updated weights for policy 0, policy_version 29370 (0.0006) [2023-03-06 23:48:04,387][81400] Updated weights for policy 0, policy_version 29380 (0.0007) [2023-03-06 23:48:05,154][81400] Updated weights for policy 0, policy_version 29390 (0.0006) [2023-03-06 23:48:05,947][81400] Updated weights for policy 0, policy_version 29400 (0.0006) [2023-03-06 23:48:06,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13277.9, 300 sec: 13318.9). Total num frames: 30108672. Throughput: 0: 13293.8. Samples: 30081354. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:48:06,237][81074] Avg episode reward: [(0, '1207.355')] [2023-03-06 23:48:06,716][81400] Updated weights for policy 0, policy_version 29410 (0.0006) [2023-03-06 23:48:07,487][81400] Updated weights for policy 0, policy_version 29420 (0.0006) [2023-03-06 23:48:08,277][81400] Updated weights for policy 0, policy_version 29430 (0.0006) [2023-03-06 23:48:09,018][81400] Updated weights for policy 0, policy_version 29440 (0.0006) [2023-03-06 23:48:09,786][81400] Updated weights for policy 0, policy_version 29450 (0.0006) [2023-03-06 23:48:10,571][81400] Updated weights for policy 0, policy_version 29460 (0.0006) [2023-03-06 23:48:11,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13294.9, 300 sec: 13318.9). Total num frames: 30175232. Throughput: 0: 13296.8. Samples: 30161112. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:48:11,237][81074] Avg episode reward: [(0, '1179.643')] [2023-03-06 23:48:11,340][81400] Updated weights for policy 0, policy_version 29470 (0.0006) [2023-03-06 23:48:12,121][81400] Updated weights for policy 0, policy_version 29480 (0.0006) [2023-03-06 23:48:12,873][81400] Updated weights for policy 0, policy_version 29490 (0.0006) [2023-03-06 23:48:13,649][81400] Updated weights for policy 0, policy_version 29500 (0.0005) [2023-03-06 23:48:14,428][81400] Updated weights for policy 0, policy_version 29510 (0.0005) [2023-03-06 23:48:15,187][81400] Updated weights for policy 0, policy_version 29520 (0.0006) [2023-03-06 23:48:15,962][81400] Updated weights for policy 0, policy_version 29530 (0.0005) [2023-03-06 23:48:16,236][81074] Fps is (10 sec: 13311.8, 60 sec: 13294.9, 300 sec: 13315.5). Total num frames: 30241792. Throughput: 0: 13296.7. Samples: 30240677. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:48:16,237][81074] Avg episode reward: [(0, '1134.894')] [2023-03-06 23:48:16,714][81400] Updated weights for policy 0, policy_version 29540 (0.0005) [2023-03-06 23:48:17,490][81400] Updated weights for policy 0, policy_version 29550 (0.0006) [2023-03-06 23:48:18,247][81400] Updated weights for policy 0, policy_version 29560 (0.0006) [2023-03-06 23:48:19,033][81400] Updated weights for policy 0, policy_version 29570 (0.0006) [2023-03-06 23:48:19,788][81400] Updated weights for policy 0, policy_version 29580 (0.0006) [2023-03-06 23:48:20,386][81349] KL-divergence is very high: 219916832.0000 [2023-03-06 23:48:20,548][81400] Updated weights for policy 0, policy_version 29590 (0.0006) [2023-03-06 23:48:21,156][81349] KL-divergence is very high: 190.1172 [2023-03-06 23:48:21,236][81074] Fps is (10 sec: 13414.5, 60 sec: 13312.0, 300 sec: 13318.9). Total num frames: 30309376. Throughput: 0: 13298.8. Samples: 30280833. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:48:21,237][81074] Avg episode reward: [(0, '1230.891')] [2023-03-06 23:48:21,312][81400] Updated weights for policy 0, policy_version 29600 (0.0006) [2023-03-06 23:48:22,084][81400] Updated weights for policy 0, policy_version 29610 (0.0006) [2023-03-06 23:48:22,862][81400] Updated weights for policy 0, policy_version 29620 (0.0007) [2023-03-06 23:48:23,626][81400] Updated weights for policy 0, policy_version 29630 (0.0006) [2023-03-06 23:48:24,390][81400] Updated weights for policy 0, policy_version 29640 (0.0006) [2023-03-06 23:48:24,461][81349] KL-divergence is very high: 5546.7515 [2023-03-06 23:48:25,170][81400] Updated weights for policy 0, policy_version 29650 (0.0007) [2023-03-06 23:48:25,933][81400] Updated weights for policy 0, policy_version 29660 (0.0006) [2023-03-06 23:48:26,236][81074] Fps is (10 sec: 13414.4, 60 sec: 13312.0, 300 sec: 13318.9). Total num frames: 30375936. Throughput: 0: 13302.0. Samples: 30360824. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:48:26,237][81074] Avg episode reward: [(0, '1069.331')] [2023-03-06 23:48:26,697][81400] Updated weights for policy 0, policy_version 29670 (0.0007) [2023-03-06 23:48:27,460][81400] Updated weights for policy 0, policy_version 29680 (0.0005) [2023-03-06 23:48:28,219][81400] Updated weights for policy 0, policy_version 29690 (0.0005) [2023-03-06 23:48:28,984][81400] Updated weights for policy 0, policy_version 29700 (0.0006) [2023-03-06 23:48:29,740][81400] Updated weights for policy 0, policy_version 29710 (0.0006) [2023-03-06 23:48:30,516][81400] Updated weights for policy 0, policy_version 29720 (0.0006) [2023-03-06 23:48:31,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13312.0, 300 sec: 13318.9). Total num frames: 30442496. Throughput: 0: 13317.1. Samples: 30441192. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:48:31,237][81074] Avg episode reward: [(0, '1020.746')] [2023-03-06 23:48:31,272][81400] Updated weights for policy 0, policy_version 29730 (0.0005) [2023-03-06 23:48:32,047][81400] Updated weights for policy 0, policy_version 29740 (0.0006) [2023-03-06 23:48:32,801][81400] Updated weights for policy 0, policy_version 29750 (0.0006) [2023-03-06 23:48:33,560][81400] Updated weights for policy 0, policy_version 29760 (0.0006) [2023-03-06 23:48:34,328][81400] Updated weights for policy 0, policy_version 29770 (0.0006) [2023-03-06 23:48:35,096][81400] Updated weights for policy 0, policy_version 29780 (0.0005) [2023-03-06 23:48:35,861][81400] Updated weights for policy 0, policy_version 29790 (0.0006) [2023-03-06 23:48:36,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13312.0, 300 sec: 13315.5). Total num frames: 30509056. Throughput: 0: 13328.6. Samples: 30481470. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:48:36,237][81074] Avg episode reward: [(0, '1085.350')] [2023-03-06 23:48:36,634][81400] Updated weights for policy 0, policy_version 29800 (0.0007) [2023-03-06 23:48:37,391][81400] Updated weights for policy 0, policy_version 29810 (0.0006) [2023-03-06 23:48:38,165][81400] Updated weights for policy 0, policy_version 29820 (0.0006) [2023-03-06 23:48:38,917][81400] Updated weights for policy 0, policy_version 29830 (0.0006) [2023-03-06 23:48:39,685][81400] Updated weights for policy 0, policy_version 29840 (0.0006) [2023-03-06 23:48:40,453][81400] Updated weights for policy 0, policy_version 29850 (0.0006) [2023-03-06 23:48:41,225][81400] Updated weights for policy 0, policy_version 29860 (0.0006) [2023-03-06 23:48:41,236][81074] Fps is (10 sec: 13414.4, 60 sec: 13329.1, 300 sec: 13318.9). Total num frames: 30576640. Throughput: 0: 13332.0. Samples: 30561827. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 23:48:41,237][81074] Avg episode reward: [(0, '1124.209')] [2023-03-06 23:48:41,984][81400] Updated weights for policy 0, policy_version 29870 (0.0006) [2023-03-06 23:48:42,770][81400] Updated weights for policy 0, policy_version 29880 (0.0006) [2023-03-06 23:48:43,521][81400] Updated weights for policy 0, policy_version 29890 (0.0006) [2023-03-06 23:48:44,297][81400] Updated weights for policy 0, policy_version 29900 (0.0006) [2023-03-06 23:48:45,069][81400] Updated weights for policy 0, policy_version 29910 (0.0006) [2023-03-06 23:48:45,837][81400] Updated weights for policy 0, policy_version 29920 (0.0006) [2023-03-06 23:48:46,236][81074] Fps is (10 sec: 13414.7, 60 sec: 13329.1, 300 sec: 13318.9). Total num frames: 30643200. Throughput: 0: 13336.9. Samples: 30641702. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 23:48:46,237][81074] Avg episode reward: [(0, '1141.315')] [2023-03-06 23:48:46,601][81400] Updated weights for policy 0, policy_version 29930 (0.0006) [2023-03-06 23:48:47,379][81400] Updated weights for policy 0, policy_version 29940 (0.0006) [2023-03-06 23:48:48,142][81400] Updated weights for policy 0, policy_version 29950 (0.0006) [2023-03-06 23:48:48,913][81400] Updated weights for policy 0, policy_version 29960 (0.0006) [2023-03-06 23:48:49,685][81400] Updated weights for policy 0, policy_version 29970 (0.0006) [2023-03-06 23:48:50,461][81400] Updated weights for policy 0, policy_version 29980 (0.0006) [2023-03-06 23:48:51,235][81400] Updated weights for policy 0, policy_version 29990 (0.0006) [2023-03-06 23:48:51,236][81074] Fps is (10 sec: 13312.1, 60 sec: 13329.1, 300 sec: 13315.5). Total num frames: 30709760. Throughput: 0: 13340.5. Samples: 30681675. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 23:48:51,237][81074] Avg episode reward: [(0, '1108.475')] [2023-03-06 23:48:52,011][81400] Updated weights for policy 0, policy_version 30000 (0.0007) [2023-03-06 23:48:52,773][81400] Updated weights for policy 0, policy_version 30010 (0.0006) [2023-03-06 23:48:53,550][81400] Updated weights for policy 0, policy_version 30020 (0.0006) [2023-03-06 23:48:54,318][81400] Updated weights for policy 0, policy_version 30030 (0.0007) [2023-03-06 23:48:55,090][81400] Updated weights for policy 0, policy_version 30040 (0.0005) [2023-03-06 23:48:55,850][81400] Updated weights for policy 0, policy_version 30050 (0.0008) [2023-03-06 23:48:56,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13312.0, 300 sec: 13315.5). Total num frames: 30775296. Throughput: 0: 13336.6. Samples: 30761258. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-06 23:48:56,237][81074] Avg episode reward: [(0, '1120.004')] [2023-03-06 23:48:56,638][81400] Updated weights for policy 0, policy_version 30060 (0.0007) [2023-03-06 23:48:57,402][81400] Updated weights for policy 0, policy_version 30070 (0.0006) [2023-03-06 23:48:58,166][81400] Updated weights for policy 0, policy_version 30080 (0.0006) [2023-03-06 23:48:58,942][81400] Updated weights for policy 0, policy_version 30090 (0.0006) [2023-03-06 23:48:59,708][81400] Updated weights for policy 0, policy_version 30100 (0.0005) [2023-03-06 23:49:00,495][81400] Updated weights for policy 0, policy_version 30110 (0.0007) [2023-03-06 23:49:01,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13312.0, 300 sec: 13315.5). Total num frames: 30841856. Throughput: 0: 13337.2. Samples: 30840847. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:49:01,237][81074] Avg episode reward: [(0, '1158.033')] [2023-03-06 23:49:01,268][81400] Updated weights for policy 0, policy_version 30120 (0.0007) [2023-03-06 23:49:02,040][81400] Updated weights for policy 0, policy_version 30130 (0.0006) [2023-03-06 23:49:02,822][81400] Updated weights for policy 0, policy_version 30140 (0.0006) [2023-03-06 23:49:03,593][81400] Updated weights for policy 0, policy_version 30150 (0.0006) [2023-03-06 23:49:04,370][81400] Updated weights for policy 0, policy_version 30160 (0.0006) [2023-03-06 23:49:05,132][81400] Updated weights for policy 0, policy_version 30170 (0.0007) [2023-03-06 23:49:05,907][81400] Updated weights for policy 0, policy_version 30180 (0.0005) [2023-03-06 23:49:06,236][81074] Fps is (10 sec: 13311.9, 60 sec: 13329.0, 300 sec: 13312.0). Total num frames: 30908416. Throughput: 0: 13324.2. Samples: 30880424. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:49:06,237][81074] Avg episode reward: [(0, '1187.406')] [2023-03-06 23:49:06,682][81400] Updated weights for policy 0, policy_version 30190 (0.0005) [2023-03-06 23:49:07,453][81400] Updated weights for policy 0, policy_version 30200 (0.0006) [2023-03-06 23:49:08,229][81400] Updated weights for policy 0, policy_version 30210 (0.0006) [2023-03-06 23:49:08,754][81349] KL-divergence is very high: 1331.3818 [2023-03-06 23:49:08,990][81400] Updated weights for policy 0, policy_version 30220 (0.0006) [2023-03-06 23:49:09,769][81400] Updated weights for policy 0, policy_version 30230 (0.0007) [2023-03-06 23:49:10,217][81349] KL-divergence is very high: 190.4808 [2023-03-06 23:49:10,542][81400] Updated weights for policy 0, policy_version 30240 (0.0006) [2023-03-06 23:49:11,236][81074] Fps is (10 sec: 13311.9, 60 sec: 13329.1, 300 sec: 13312.0). Total num frames: 30974976. Throughput: 0: 13315.6. Samples: 30960025. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:49:11,237][81074] Avg episode reward: [(0, '1176.778')] [2023-03-06 23:49:11,285][81400] Updated weights for policy 0, policy_version 30250 (0.0005) [2023-03-06 23:49:12,067][81400] Updated weights for policy 0, policy_version 30260 (0.0006) [2023-03-06 23:49:12,833][81400] Updated weights for policy 0, policy_version 30270 (0.0006) [2023-03-06 23:49:13,601][81400] Updated weights for policy 0, policy_version 30280 (0.0005) [2023-03-06 23:49:14,375][81400] Updated weights for policy 0, policy_version 30290 (0.0006) [2023-03-06 23:49:15,154][81400] Updated weights for policy 0, policy_version 30300 (0.0006) [2023-03-06 23:49:15,898][81400] Updated weights for policy 0, policy_version 30310 (0.0006) [2023-03-06 23:49:16,236][81074] Fps is (10 sec: 13312.1, 60 sec: 13329.1, 300 sec: 13312.0). Total num frames: 31041536. Throughput: 0: 13307.8. Samples: 31040044. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:49:16,247][81074] Avg episode reward: [(0, '940.124')] [2023-03-06 23:49:16,678][81400] Updated weights for policy 0, policy_version 30320 (0.0006) [2023-03-06 23:49:17,429][81400] Updated weights for policy 0, policy_version 30330 (0.0006) [2023-03-06 23:49:18,200][81400] Updated weights for policy 0, policy_version 30340 (0.0006) [2023-03-06 23:49:18,974][81400] Updated weights for policy 0, policy_version 30350 (0.0006) [2023-03-06 23:49:19,746][81400] Updated weights for policy 0, policy_version 30360 (0.0006) [2023-03-06 23:49:20,501][81400] Updated weights for policy 0, policy_version 30370 (0.0006) [2023-03-06 23:49:21,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13312.0, 300 sec: 13312.0). Total num frames: 31108096. Throughput: 0: 13307.7. Samples: 31080317. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:49:21,247][81074] Avg episode reward: [(0, '859.652')] [2023-03-06 23:49:21,272][81400] Updated weights for policy 0, policy_version 30380 (0.0007) [2023-03-06 23:49:22,022][81400] Updated weights for policy 0, policy_version 30390 (0.0007) [2023-03-06 23:49:22,798][81400] Updated weights for policy 0, policy_version 30400 (0.0007) [2023-03-06 23:49:23,554][81400] Updated weights for policy 0, policy_version 30410 (0.0006) [2023-03-06 23:49:24,318][81400] Updated weights for policy 0, policy_version 30420 (0.0006) [2023-03-06 23:49:25,087][81400] Updated weights for policy 0, policy_version 30430 (0.0005) [2023-03-06 23:49:25,851][81400] Updated weights for policy 0, policy_version 30440 (0.0006) [2023-03-06 23:49:26,236][81074] Fps is (10 sec: 13312.1, 60 sec: 13312.0, 300 sec: 13308.5). Total num frames: 31174656. Throughput: 0: 13307.3. Samples: 31160655. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:49:26,247][81074] Avg episode reward: [(0, '899.969')] [2023-03-06 23:49:26,613][81400] Updated weights for policy 0, policy_version 30450 (0.0006) [2023-03-06 23:49:27,226][81349] KL-divergence is very high: 134.4930 [2023-03-06 23:49:27,385][81400] Updated weights for policy 0, policy_version 30460 (0.0005) [2023-03-06 23:49:28,151][81400] Updated weights for policy 0, policy_version 30470 (0.0006) [2023-03-06 23:49:28,914][81400] Updated weights for policy 0, policy_version 30480 (0.0006) [2023-03-06 23:49:29,653][81400] Updated weights for policy 0, policy_version 30490 (0.0006) [2023-03-06 23:49:30,426][81400] Updated weights for policy 0, policy_version 30500 (0.0005) [2023-03-06 23:49:31,168][81400] Updated weights for policy 0, policy_version 30510 (0.0005) [2023-03-06 23:49:31,236][81074] Fps is (10 sec: 13414.5, 60 sec: 13329.0, 300 sec: 13312.0). Total num frames: 31242240. Throughput: 0: 13322.5. Samples: 31241216. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:49:31,248][81074] Avg episode reward: [(0, '769.644')] [2023-03-06 23:49:31,938][81400] Updated weights for policy 0, policy_version 30520 (0.0006) [2023-03-06 23:49:32,701][81400] Updated weights for policy 0, policy_version 30530 (0.0006) [2023-03-06 23:49:33,456][81400] Updated weights for policy 0, policy_version 30540 (0.0007) [2023-03-06 23:49:34,227][81400] Updated weights for policy 0, policy_version 30550 (0.0006) [2023-03-06 23:49:35,007][81400] Updated weights for policy 0, policy_version 30560 (0.0006) [2023-03-06 23:49:35,752][81400] Updated weights for policy 0, policy_version 30570 (0.0007) [2023-03-06 23:49:36,236][81074] Fps is (10 sec: 13516.8, 60 sec: 13346.2, 300 sec: 13315.5). Total num frames: 31309824. Throughput: 0: 13328.4. Samples: 31281456. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:49:36,247][81074] Avg episode reward: [(0, '648.249')] [2023-03-06 23:49:36,514][81400] Updated weights for policy 0, policy_version 30580 (0.0006) [2023-03-06 23:49:37,287][81400] Updated weights for policy 0, policy_version 30590 (0.0006) [2023-03-06 23:49:38,056][81400] Updated weights for policy 0, policy_version 30600 (0.0006) [2023-03-06 23:49:38,808][81400] Updated weights for policy 0, policy_version 30610 (0.0006) [2023-03-06 23:49:39,568][81400] Updated weights for policy 0, policy_version 30620 (0.0005) [2023-03-06 23:49:40,359][81400] Updated weights for policy 0, policy_version 30630 (0.0006) [2023-03-06 23:49:41,105][81400] Updated weights for policy 0, policy_version 30640 (0.0006) [2023-03-06 23:49:41,236][81074] Fps is (10 sec: 13414.4, 60 sec: 13329.0, 300 sec: 13315.5). Total num frames: 31376384. Throughput: 0: 13350.1. Samples: 31362013. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:49:41,237][81074] Avg episode reward: [(0, '527.736')] [2023-03-06 23:49:41,871][81400] Updated weights for policy 0, policy_version 30650 (0.0006) [2023-03-06 23:49:42,630][81400] Updated weights for policy 0, policy_version 30660 (0.0006) [2023-03-06 23:49:43,390][81400] Updated weights for policy 0, policy_version 30670 (0.0006) [2023-03-06 23:49:44,156][81400] Updated weights for policy 0, policy_version 30680 (0.0005) [2023-03-06 23:49:44,915][81400] Updated weights for policy 0, policy_version 30690 (0.0007) [2023-03-06 23:49:45,680][81400] Updated weights for policy 0, policy_version 30700 (0.0006) [2023-03-06 23:49:46,236][81074] Fps is (10 sec: 13414.5, 60 sec: 13346.1, 300 sec: 13319.0). Total num frames: 31443968. Throughput: 0: 13369.3. Samples: 31442464. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:49:46,237][81074] Avg episode reward: [(0, '630.595')] [2023-03-06 23:49:46,441][81400] Updated weights for policy 0, policy_version 30710 (0.0008) [2023-03-06 23:49:47,185][81400] Updated weights for policy 0, policy_version 30720 (0.0006) [2023-03-06 23:49:47,953][81400] Updated weights for policy 0, policy_version 30730 (0.0006) [2023-03-06 23:49:48,708][81400] Updated weights for policy 0, policy_version 30740 (0.0007) [2023-03-06 23:49:49,480][81400] Updated weights for policy 0, policy_version 30750 (0.0005) [2023-03-06 23:49:50,229][81400] Updated weights for policy 0, policy_version 30760 (0.0006) [2023-03-06 23:49:51,005][81400] Updated weights for policy 0, policy_version 30770 (0.0005) [2023-03-06 23:49:51,236][81074] Fps is (10 sec: 13516.9, 60 sec: 13363.2, 300 sec: 13322.4). Total num frames: 31511552. Throughput: 0: 13389.1. Samples: 31482933. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:49:51,237][81074] Avg episode reward: [(0, '644.118')] [2023-03-06 23:49:51,753][81400] Updated weights for policy 0, policy_version 30780 (0.0006) [2023-03-06 23:49:52,509][81400] Updated weights for policy 0, policy_version 30790 (0.0006) [2023-03-06 23:49:53,286][81400] Updated weights for policy 0, policy_version 30800 (0.0006) [2023-03-06 23:49:54,043][81400] Updated weights for policy 0, policy_version 30810 (0.0006) [2023-03-06 23:49:54,785][81400] Updated weights for policy 0, policy_version 30820 (0.0005) [2023-03-06 23:49:55,547][81400] Updated weights for policy 0, policy_version 30830 (0.0007) [2023-03-06 23:49:56,236][81074] Fps is (10 sec: 13516.8, 60 sec: 13397.4, 300 sec: 13322.4). Total num frames: 31579136. Throughput: 0: 13421.2. Samples: 31563978. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:49:56,237][81074] Avg episode reward: [(0, '694.403')] [2023-03-06 23:49:56,241][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000030839_31579136.pth... 
[2023-03-06 23:49:56,269][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000027717_28382208.pth [2023-03-06 23:49:56,317][81400] Updated weights for policy 0, policy_version 30840 (0.0006) [2023-03-06 23:49:57,074][81400] Updated weights for policy 0, policy_version 30850 (0.0006) [2023-03-06 23:49:57,833][81400] Updated weights for policy 0, policy_version 30860 (0.0006) [2023-03-06 23:49:58,590][81400] Updated weights for policy 0, policy_version 30870 (0.0005) [2023-03-06 23:49:59,334][81400] Updated weights for policy 0, policy_version 30880 (0.0005) [2023-03-06 23:50:00,092][81400] Updated weights for policy 0, policy_version 30890 (0.0006) [2023-03-06 23:50:00,848][81400] Updated weights for policy 0, policy_version 30900 (0.0006) [2023-03-06 23:50:01,236][81074] Fps is (10 sec: 13516.9, 60 sec: 13414.4, 300 sec: 13322.4). Total num frames: 31646720. Throughput: 0: 13445.7. Samples: 31645097. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:50:01,237][81074] Avg episode reward: [(0, '579.204')] [2023-03-06 23:50:01,601][81400] Updated weights for policy 0, policy_version 30910 (0.0006) [2023-03-06 23:50:02,376][81400] Updated weights for policy 0, policy_version 30920 (0.0006) [2023-03-06 23:50:03,145][81400] Updated weights for policy 0, policy_version 30930 (0.0006) [2023-03-06 23:50:03,906][81349] KL-divergence is very high: 383.4777 [2023-03-06 23:50:03,914][81400] Updated weights for policy 0, policy_version 30940 (0.0007) [2023-03-06 23:50:04,686][81400] Updated weights for policy 0, policy_version 30950 (0.0008) [2023-03-06 23:50:05,455][81400] Updated weights for policy 0, policy_version 30960 (0.0008) [2023-03-06 23:50:06,217][81400] Updated weights for policy 0, policy_version 30970 (0.0007) [2023-03-06 23:50:06,236][81074] Fps is (10 sec: 13414.2, 60 sec: 13414.4, 300 sec: 13322.4). Total num frames: 31713280. Throughput: 0: 13438.6. Samples: 31685053. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:50:06,237][81074] Avg episode reward: [(0, '765.542')] [2023-03-06 23:50:06,988][81400] Updated weights for policy 0, policy_version 30980 (0.0006) [2023-03-06 23:50:07,725][81400] Updated weights for policy 0, policy_version 30990 (0.0005) [2023-03-06 23:50:08,498][81400] Updated weights for policy 0, policy_version 31000 (0.0006) [2023-03-06 23:50:09,275][81400] Updated weights for policy 0, policy_version 31010 (0.0007) [2023-03-06 23:50:10,045][81400] Updated weights for policy 0, policy_version 31020 (0.0005) [2023-03-06 23:50:10,820][81400] Updated weights for policy 0, policy_version 31030 (0.0006) [2023-03-06 23:50:11,236][81074] Fps is (10 sec: 13311.9, 60 sec: 13414.4, 300 sec: 13322.4). Total num frames: 31779840. Throughput: 0: 13437.1. Samples: 31765324. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:50:11,237][81074] Avg episode reward: [(0, '742.660')] [2023-03-06 23:50:11,569][81400] Updated weights for policy 0, policy_version 31040 (0.0006) [2023-03-06 23:50:12,329][81400] Updated weights for policy 0, policy_version 31050 (0.0006) [2023-03-06 23:50:13,075][81400] Updated weights for policy 0, policy_version 31060 (0.0006) [2023-03-06 23:50:13,851][81400] Updated weights for policy 0, policy_version 31070 (0.0005) [2023-03-06 23:50:14,613][81400] Updated weights for policy 0, policy_version 31080 (0.0006) [2023-03-06 23:50:15,383][81400] Updated weights for policy 0, policy_version 31090 (0.0006) [2023-03-06 23:50:16,143][81400] Updated weights for policy 0, policy_version 31100 (0.0006) [2023-03-06 23:50:16,236][81074] Fps is (10 sec: 13414.5, 60 sec: 13431.5, 300 sec: 13325.9). Total num frames: 31847424. Throughput: 0: 13436.6. Samples: 31845865. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:50:16,237][81074] Avg episode reward: [(0, '839.605')] [2023-03-06 23:50:16,935][81400] Updated weights for policy 0, policy_version 31110 (0.0006) [2023-03-06 23:50:17,706][81400] Updated weights for policy 0, policy_version 31120 (0.0006) [2023-03-06 23:50:18,461][81400] Updated weights for policy 0, policy_version 31130 (0.0006) [2023-03-06 23:50:19,221][81400] Updated weights for policy 0, policy_version 31140 (0.0007) [2023-03-06 23:50:20,003][81400] Updated weights for policy 0, policy_version 31150 (0.0006) [2023-03-06 23:50:20,785][81400] Updated weights for policy 0, policy_version 31160 (0.0006) [2023-03-06 23:50:21,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13414.4, 300 sec: 13322.4). Total num frames: 31912960. Throughput: 0: 13427.7. Samples: 31885703. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-06 23:50:21,237][81074] Avg episode reward: [(0, '1060.935')] [2023-03-06 23:50:21,562][81400] Updated weights for policy 0, policy_version 31170 (0.0007) [2023-03-06 23:50:22,325][81400] Updated weights for policy 0, policy_version 31180 (0.0006) [2023-03-06 23:50:23,096][81400] Updated weights for policy 0, policy_version 31190 (0.0006) [2023-03-06 23:50:23,868][81400] Updated weights for policy 0, policy_version 31200 (0.0006) [2023-03-06 23:50:24,627][81400] Updated weights for policy 0, policy_version 31210 (0.0006) [2023-03-06 23:50:25,395][81400] Updated weights for policy 0, policy_version 31220 (0.0006) [2023-03-06 23:50:26,156][81400] Updated weights for policy 0, policy_version 31230 (0.0006) [2023-03-06 23:50:26,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13414.4, 300 sec: 13322.4). Total num frames: 31979520. Throughput: 0: 13409.8. Samples: 31965455. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-06 23:50:26,237][81074] Avg episode reward: [(0, '1050.884')] [2023-03-06 23:50:26,918][81400] Updated weights for policy 0, policy_version 31240 (0.0006) [2023-03-06 23:50:27,702][81400] Updated weights for policy 0, policy_version 31250 (0.0005) [2023-03-06 23:50:28,445][81400] Updated weights for policy 0, policy_version 31260 (0.0005) [2023-03-06 23:50:29,234][81400] Updated weights for policy 0, policy_version 31270 (0.0005) [2023-03-06 23:50:30,011][81400] Updated weights for policy 0, policy_version 31280 (0.0006) [2023-03-06 23:50:30,768][81400] Updated weights for policy 0, policy_version 31290 (0.0005) [2023-03-06 23:50:31,236][81074] Fps is (10 sec: 13414.4, 60 sec: 13414.4, 300 sec: 13322.4). Total num frames: 32047104. Throughput: 0: 13401.3. Samples: 32045522. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-06 23:50:31,237][81074] Avg episode reward: [(0, '1051.526')] [2023-03-06 23:50:31,528][81400] Updated weights for policy 0, policy_version 31300 (0.0006) [2023-03-06 23:50:32,303][81400] Updated weights for policy 0, policy_version 31310 (0.0006) [2023-03-06 23:50:33,071][81400] Updated weights for policy 0, policy_version 31320 (0.0006) [2023-03-06 23:50:33,845][81400] Updated weights for policy 0, policy_version 31330 (0.0006) [2023-03-06 23:50:34,617][81400] Updated weights for policy 0, policy_version 31340 (0.0006) [2023-03-06 23:50:35,398][81400] Updated weights for policy 0, policy_version 31350 (0.0006) [2023-03-06 23:50:36,171][81400] Updated weights for policy 0, policy_version 31360 (0.0006) [2023-03-06 23:50:36,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13380.2, 300 sec: 13318.9). Total num frames: 32112640. Throughput: 0: 13388.2. Samples: 32085405. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-06 23:50:36,237][81074] Avg episode reward: [(0, '1066.353')] [2023-03-06 23:50:36,930][81400] Updated weights for policy 0, policy_version 31370 (0.0007) [2023-03-06 23:50:37,712][81400] Updated weights for policy 0, policy_version 31380 (0.0006) [2023-03-06 23:50:38,489][81400] Updated weights for policy 0, policy_version 31390 (0.0006) [2023-03-06 23:50:39,249][81400] Updated weights for policy 0, policy_version 31400 (0.0006) [2023-03-06 23:50:40,028][81400] Updated weights for policy 0, policy_version 31410 (0.0006) [2023-03-06 23:50:40,805][81400] Updated weights for policy 0, policy_version 31420 (0.0006) [2023-03-06 23:50:41,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13380.3, 300 sec: 13318.9). Total num frames: 32179200. Throughput: 0: 13355.3. Samples: 32164968. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:50:41,237][81074] Avg episode reward: [(0, '1000.791')] [2023-03-06 23:50:41,565][81400] Updated weights for policy 0, policy_version 31430 (0.0006) [2023-03-06 23:50:42,330][81400] Updated weights for policy 0, policy_version 31440 (0.0007) [2023-03-06 23:50:43,096][81400] Updated weights for policy 0, policy_version 31450 (0.0006) [2023-03-06 23:50:43,876][81400] Updated weights for policy 0, policy_version 31460 (0.0006) [2023-03-06 23:50:44,658][81400] Updated weights for policy 0, policy_version 31470 (0.0006) [2023-03-06 23:50:45,421][81400] Updated weights for policy 0, policy_version 31480 (0.0005) [2023-03-06 23:50:46,174][81400] Updated weights for policy 0, policy_version 31490 (0.0005) [2023-03-06 23:50:46,236][81074] Fps is (10 sec: 13311.9, 60 sec: 13363.1, 300 sec: 13322.4). Total num frames: 32245760. Throughput: 0: 13324.7. Samples: 32244711. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:50:46,237][81074] Avg episode reward: [(0, '1095.641')] [2023-03-06 23:50:46,965][81400] Updated weights for policy 0, policy_version 31500 (0.0006) [2023-03-06 23:50:47,744][81400] Updated weights for policy 0, policy_version 31510 (0.0007) [2023-03-06 23:50:48,433][81349] KL-divergence is very high: 4895506.0000 [2023-03-06 23:50:48,509][81400] Updated weights for policy 0, policy_version 31520 (0.0005) [2023-03-06 23:50:49,284][81400] Updated weights for policy 0, policy_version 31530 (0.0006) [2023-03-06 23:50:50,057][81400] Updated weights for policy 0, policy_version 31540 (0.0006) [2023-03-06 23:50:50,837][81400] Updated weights for policy 0, policy_version 31550 (0.0006) [2023-03-06 23:50:51,236][81074] Fps is (10 sec: 13312.1, 60 sec: 13346.1, 300 sec: 13322.4). Total num frames: 32312320. Throughput: 0: 13318.4. Samples: 32284379. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:50:51,237][81074] Avg episode reward: [(0, '1169.034')] [2023-03-06 23:50:51,598][81400] Updated weights for policy 0, policy_version 31560 (0.0006) [2023-03-06 23:50:52,379][81400] Updated weights for policy 0, policy_version 31570 (0.0006) [2023-03-06 23:50:53,151][81400] Updated weights for policy 0, policy_version 31580 (0.0006) [2023-03-06 23:50:53,941][81400] Updated weights for policy 0, policy_version 31590 (0.0007) [2023-03-06 23:50:54,704][81400] Updated weights for policy 0, policy_version 31600 (0.0006) [2023-03-06 23:50:55,473][81400] Updated weights for policy 0, policy_version 31610 (0.0006) [2023-03-06 23:50:56,236][81074] Fps is (10 sec: 13209.8, 60 sec: 13312.0, 300 sec: 13318.9). Total num frames: 32377856. Throughput: 0: 13299.1. Samples: 32363782. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:50:56,237][81074] Avg episode reward: [(0, '1109.195')] [2023-03-06 23:50:56,242][81400] Updated weights for policy 0, policy_version 31620 (0.0006) [2023-03-06 23:50:56,995][81400] Updated weights for policy 0, policy_version 31630 (0.0006) [2023-03-06 23:50:57,785][81400] Updated weights for policy 0, policy_version 31640 (0.0005) [2023-03-06 23:50:58,545][81400] Updated weights for policy 0, policy_version 31650 (0.0006) [2023-03-06 23:50:59,338][81400] Updated weights for policy 0, policy_version 31660 (0.0006) [2023-03-06 23:51:00,087][81400] Updated weights for policy 0, policy_version 31670 (0.0006) [2023-03-06 23:51:00,863][81400] Updated weights for policy 0, policy_version 31680 (0.0007) [2023-03-06 23:51:01,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13294.9, 300 sec: 13315.5). Total num frames: 32444416. Throughput: 0: 13281.1. Samples: 32443514. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:51:01,237][81074] Avg episode reward: [(0, '1217.393')] [2023-03-06 23:51:01,630][81400] Updated weights for policy 0, policy_version 31690 (0.0005) [2023-03-06 23:51:02,364][81400] Updated weights for policy 0, policy_version 31700 (0.0006) [2023-03-06 23:51:03,134][81400] Updated weights for policy 0, policy_version 31710 (0.0006) [2023-03-06 23:51:03,913][81400] Updated weights for policy 0, policy_version 31720 (0.0005) [2023-03-06 23:51:04,686][81400] Updated weights for policy 0, policy_version 31730 (0.0005) [2023-03-06 23:51:05,443][81400] Updated weights for policy 0, policy_version 31740 (0.0006) [2023-03-06 23:51:06,221][81400] Updated weights for policy 0, policy_version 31750 (0.0006) [2023-03-06 23:51:06,236][81074] Fps is (10 sec: 13414.3, 60 sec: 13312.0, 300 sec: 13322.4). Total num frames: 32512000. Throughput: 0: 13291.5. Samples: 32483822. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:51:06,237][81074] Avg episode reward: [(0, '1248.780')] [2023-03-06 23:51:07,004][81400] Updated weights for policy 0, policy_version 31760 (0.0006) [2023-03-06 23:51:07,778][81400] Updated weights for policy 0, policy_version 31770 (0.0006) [2023-03-06 23:51:08,547][81400] Updated weights for policy 0, policy_version 31780 (0.0007) [2023-03-06 23:51:09,326][81400] Updated weights for policy 0, policy_version 31790 (0.0006) [2023-03-06 23:51:10,077][81400] Updated weights for policy 0, policy_version 31800 (0.0006) [2023-03-06 23:51:10,860][81400] Updated weights for policy 0, policy_version 31810 (0.0005) [2023-03-06 23:51:11,236][81074] Fps is (10 sec: 13311.8, 60 sec: 13294.9, 300 sec: 13318.9). Total num frames: 32577536. Throughput: 0: 13288.5. Samples: 32563437. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:51:11,237][81074] Avg episode reward: [(0, '1251.426')] [2023-03-06 23:51:11,626][81400] Updated weights for policy 0, policy_version 31820 (0.0006) [2023-03-06 23:51:12,399][81400] Updated weights for policy 0, policy_version 31830 (0.0006) [2023-03-06 23:51:13,178][81400] Updated weights for policy 0, policy_version 31840 (0.0006) [2023-03-06 23:51:13,929][81400] Updated weights for policy 0, policy_version 31850 (0.0007) [2023-03-06 23:51:14,698][81400] Updated weights for policy 0, policy_version 31860 (0.0006) [2023-03-06 23:51:15,463][81400] Updated weights for policy 0, policy_version 31870 (0.0006) [2023-03-06 23:51:16,232][81400] Updated weights for policy 0, policy_version 31880 (0.0006) [2023-03-06 23:51:16,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13294.9, 300 sec: 13322.4). Total num frames: 32645120. Throughput: 0: 13283.5. Samples: 32643280. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:51:16,237][81074] Avg episode reward: [(0, '1235.541')] [2023-03-06 23:51:16,997][81400] Updated weights for policy 0, policy_version 31890 (0.0006) [2023-03-06 23:51:17,778][81400] Updated weights for policy 0, policy_version 31900 (0.0006) [2023-03-06 23:51:18,542][81400] Updated weights for policy 0, policy_version 31910 (0.0006) [2023-03-06 23:51:19,328][81400] Updated weights for policy 0, policy_version 31920 (0.0006) [2023-03-06 23:51:20,109][81400] Updated weights for policy 0, policy_version 31930 (0.0006) [2023-03-06 23:51:20,878][81400] Updated weights for policy 0, policy_version 31940 (0.0006) [2023-03-06 23:51:21,236][81074] Fps is (10 sec: 13312.1, 60 sec: 13294.9, 300 sec: 13318.9). Total num frames: 32710656. Throughput: 0: 13284.7. Samples: 32683214. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:51:21,237][81074] Avg episode reward: [(0, '1257.231')] [2023-03-06 23:51:21,655][81400] Updated weights for policy 0, policy_version 31950 (0.0006) [2023-03-06 23:51:22,425][81400] Updated weights for policy 0, policy_version 31960 (0.0005) [2023-03-06 23:51:23,195][81400] Updated weights for policy 0, policy_version 31970 (0.0006) [2023-03-06 23:51:23,957][81400] Updated weights for policy 0, policy_version 31980 (0.0006) [2023-03-06 23:51:24,731][81400] Updated weights for policy 0, policy_version 31990 (0.0006) [2023-03-06 23:51:25,497][81400] Updated weights for policy 0, policy_version 32000 (0.0006) [2023-03-06 23:51:26,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13295.0, 300 sec: 13322.4). Total num frames: 32777216. Throughput: 0: 13285.8. Samples: 32762828. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:51:26,237][81074] Avg episode reward: [(0, '1252.548')] [2023-03-06 23:51:26,251][81400] Updated weights for policy 0, policy_version 32010 (0.0006) [2023-03-06 23:51:27,034][81400] Updated weights for policy 0, policy_version 32020 (0.0005) [2023-03-06 23:51:27,812][81400] Updated weights for policy 0, policy_version 32030 (0.0008) [2023-03-06 23:51:28,583][81400] Updated weights for policy 0, policy_version 32040 (0.0007) [2023-03-06 23:51:29,361][81400] Updated weights for policy 0, policy_version 32050 (0.0006) [2023-03-06 23:51:30,125][81400] Updated weights for policy 0, policy_version 32060 (0.0005) [2023-03-06 23:51:30,883][81400] Updated weights for policy 0, policy_version 32070 (0.0006) [2023-03-06 23:51:31,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13277.9, 300 sec: 13322.4). Total num frames: 32843776. Throughput: 0: 13288.8. Samples: 32842705. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:51:31,237][81074] Avg episode reward: [(0, '1244.853')] [2023-03-06 23:51:31,640][81400] Updated weights for policy 0, policy_version 32080 (0.0005) [2023-03-06 23:51:32,425][81400] Updated weights for policy 0, policy_version 32090 (0.0005) [2023-03-06 23:51:33,196][81400] Updated weights for policy 0, policy_version 32100 (0.0005) [2023-03-06 23:51:33,965][81400] Updated weights for policy 0, policy_version 32110 (0.0006) [2023-03-06 23:51:34,765][81400] Updated weights for policy 0, policy_version 32120 (0.0005) [2023-03-06 23:51:35,531][81400] Updated weights for policy 0, policy_version 32130 (0.0006) [2023-03-06 23:51:36,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13295.0, 300 sec: 13322.4). Total num frames: 32910336. Throughput: 0: 13290.0. Samples: 32882429. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:51:36,237][81074] Avg episode reward: [(0, '1325.025')] [2023-03-06 23:51:36,300][81400] Updated weights for policy 0, policy_version 32140 (0.0006) [2023-03-06 23:51:37,075][81400] Updated weights for policy 0, policy_version 32150 (0.0006) [2023-03-06 23:51:37,856][81400] Updated weights for policy 0, policy_version 32160 (0.0006) [2023-03-06 23:51:38,614][81400] Updated weights for policy 0, policy_version 32170 (0.0006) [2023-03-06 23:51:39,385][81400] Updated weights for policy 0, policy_version 32180 (0.0006) [2023-03-06 23:51:40,145][81400] Updated weights for policy 0, policy_version 32190 (0.0006) [2023-03-06 23:51:40,916][81400] Updated weights for policy 0, policy_version 32200 (0.0005) [2023-03-06 23:51:41,236][81074] Fps is (10 sec: 13311.8, 60 sec: 13294.9, 300 sec: 13322.4). Total num frames: 32976896. Throughput: 0: 13294.6. Samples: 32962040. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:51:41,237][81074] Avg episode reward: [(0, '1218.541')] [2023-03-06 23:51:41,704][81400] Updated weights for policy 0, policy_version 32210 (0.0008) [2023-03-06 23:51:42,458][81400] Updated weights for policy 0, policy_version 32220 (0.0006) [2023-03-06 23:51:43,236][81400] Updated weights for policy 0, policy_version 32230 (0.0005) [2023-03-06 23:51:43,981][81400] Updated weights for policy 0, policy_version 32240 (0.0005) [2023-03-06 23:51:44,774][81400] Updated weights for policy 0, policy_version 32250 (0.0006) [2023-03-06 23:51:45,536][81400] Updated weights for policy 0, policy_version 32260 (0.0006) [2023-03-06 23:51:46,236][81074] Fps is (10 sec: 13311.8, 60 sec: 13294.9, 300 sec: 13322.4). Total num frames: 33043456. Throughput: 0: 13296.7. Samples: 33041869. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:51:46,237][81074] Avg episode reward: [(0, '1335.625')] [2023-03-06 23:51:46,298][81400] Updated weights for policy 0, policy_version 32270 (0.0006) [2023-03-06 23:51:47,089][81400] Updated weights for policy 0, policy_version 32280 (0.0006) [2023-03-06 23:51:47,854][81400] Updated weights for policy 0, policy_version 32290 (0.0005) [2023-03-06 23:51:48,629][81400] Updated weights for policy 0, policy_version 32300 (0.0006) [2023-03-06 23:51:49,419][81400] Updated weights for policy 0, policy_version 32310 (0.0005) [2023-03-06 23:51:50,204][81400] Updated weights for policy 0, policy_version 32320 (0.0006) [2023-03-06 23:51:50,965][81400] Updated weights for policy 0, policy_version 32330 (0.0006) [2023-03-06 23:51:51,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13277.8, 300 sec: 13322.4). Total num frames: 33108992. Throughput: 0: 13282.1. Samples: 33081516. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:51:51,237][81074] Avg episode reward: [(0, '1357.724')] [2023-03-06 23:51:51,729][81400] Updated weights for policy 0, policy_version 32340 (0.0006) [2023-03-06 23:51:52,498][81400] Updated weights for policy 0, policy_version 32350 (0.0007) [2023-03-06 23:51:53,281][81400] Updated weights for policy 0, policy_version 32360 (0.0006) [2023-03-06 23:51:54,049][81400] Updated weights for policy 0, policy_version 32370 (0.0006) [2023-03-06 23:51:54,827][81400] Updated weights for policy 0, policy_version 32380 (0.0006) [2023-03-06 23:51:55,585][81400] Updated weights for policy 0, policy_version 32390 (0.0007) [2023-03-06 23:51:56,236][81074] Fps is (10 sec: 13209.8, 60 sec: 13294.9, 300 sec: 13322.4). Total num frames: 33175552. Throughput: 0: 13279.5. Samples: 33161014. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:51:56,237][81074] Avg episode reward: [(0, '1533.793')] [2023-03-06 23:51:56,241][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000032398_33175552.pth... [2023-03-06 23:51:56,272][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000029274_29976576.pth [2023-03-06 23:51:56,377][81400] Updated weights for policy 0, policy_version 32400 (0.0005) [2023-03-06 23:51:57,162][81400] Updated weights for policy 0, policy_version 32410 (0.0006) [2023-03-06 23:51:57,917][81400] Updated weights for policy 0, policy_version 32420 (0.0006) [2023-03-06 23:51:58,683][81400] Updated weights for policy 0, policy_version 32430 (0.0006) [2023-03-06 23:51:59,467][81400] Updated weights for policy 0, policy_version 32440 (0.0006) [2023-03-06 23:52:00,251][81400] Updated weights for policy 0, policy_version 32450 (0.0006) [2023-03-06 23:52:01,018][81400] Updated weights for policy 0, policy_version 32460 (0.0006) [2023-03-06 23:52:01,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13277.9, 300 sec: 13318.9). Total num frames: 33241088. Throughput: 0: 13263.9. Samples: 33240155. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:52:01,237][81074] Avg episode reward: [(0, '1589.263')] [2023-03-06 23:52:01,792][81400] Updated weights for policy 0, policy_version 32470 (0.0006) [2023-03-06 23:52:02,575][81400] Updated weights for policy 0, policy_version 32480 (0.0007) [2023-03-06 23:52:03,353][81400] Updated weights for policy 0, policy_version 32490 (0.0006) [2023-03-06 23:52:04,138][81400] Updated weights for policy 0, policy_version 32500 (0.0006) [2023-03-06 23:52:04,897][81400] Updated weights for policy 0, policy_version 32510 (0.0006) [2023-03-06 23:52:05,674][81400] Updated weights for policy 0, policy_version 32520 (0.0006) [2023-03-06 23:52:06,236][81074] Fps is (10 sec: 13209.3, 60 sec: 13260.8, 300 sec: 13322.4). Total num frames: 33307648. Throughput: 0: 13252.9. Samples: 33279598. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:52:06,248][81074] Avg episode reward: [(0, '1567.866')] [2023-03-06 23:52:06,446][81400] Updated weights for policy 0, policy_version 32530 (0.0007) [2023-03-06 23:52:07,211][81400] Updated weights for policy 0, policy_version 32540 (0.0006) [2023-03-06 23:52:07,991][81400] Updated weights for policy 0, policy_version 32550 (0.0006) [2023-03-06 23:52:08,755][81400] Updated weights for policy 0, policy_version 32560 (0.0006) [2023-03-06 23:52:09,540][81400] Updated weights for policy 0, policy_version 32570 (0.0006) [2023-03-06 23:52:10,300][81400] Updated weights for policy 0, policy_version 32580 (0.0005) [2023-03-06 23:52:11,081][81400] Updated weights for policy 0, policy_version 32590 (0.0006) [2023-03-06 23:52:11,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13260.8, 300 sec: 13319.0). Total num frames: 33373184. Throughput: 0: 13254.5. Samples: 33359281. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:52:11,247][81074] Avg episode reward: [(0, '1554.955')] [2023-03-06 23:52:11,837][81400] Updated weights for policy 0, policy_version 32600 (0.0005) [2023-03-06 23:52:12,597][81400] Updated weights for policy 0, policy_version 32610 (0.0007) [2023-03-06 23:52:13,385][81400] Updated weights for policy 0, policy_version 32620 (0.0006) [2023-03-06 23:52:14,155][81400] Updated weights for policy 0, policy_version 32630 (0.0006) [2023-03-06 23:52:14,937][81400] Updated weights for policy 0, policy_version 32640 (0.0006) [2023-03-06 23:52:15,714][81400] Updated weights for policy 0, policy_version 32650 (0.0006) [2023-03-06 23:52:16,236][81074] Fps is (10 sec: 13209.9, 60 sec: 13243.8, 300 sec: 13319.0). Total num frames: 33439744. Throughput: 0: 13248.8. Samples: 33438901. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:52:16,247][81074] Avg episode reward: [(0, '1589.433')] [2023-03-06 23:52:16,490][81400] Updated weights for policy 0, policy_version 32660 (0.0006) [2023-03-06 23:52:17,278][81400] Updated weights for policy 0, policy_version 32670 (0.0006) [2023-03-06 23:52:18,049][81400] Updated weights for policy 0, policy_version 32680 (0.0007) [2023-03-06 23:52:18,830][81400] Updated weights for policy 0, policy_version 32690 (0.0007) [2023-03-06 23:52:19,600][81400] Updated weights for policy 0, policy_version 32700 (0.0006) [2023-03-06 23:52:20,364][81400] Updated weights for policy 0, policy_version 32710 (0.0006) [2023-03-06 23:52:21,152][81400] Updated weights for policy 0, policy_version 32720 (0.0006) [2023-03-06 23:52:21,236][81074] Fps is (10 sec: 13311.8, 60 sec: 13260.8, 300 sec: 13318.9). Total num frames: 33506304. Throughput: 0: 13238.2. Samples: 33478149. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:52:21,248][81074] Avg episode reward: [(0, '1539.073')] [2023-03-06 23:52:21,910][81400] Updated weights for policy 0, policy_version 32730 (0.0006) [2023-03-06 23:52:22,697][81400] Updated weights for policy 0, policy_version 32740 (0.0006) [2023-03-06 23:52:23,482][81400] Updated weights for policy 0, policy_version 32750 (0.0006) [2023-03-06 23:52:24,241][81400] Updated weights for policy 0, policy_version 32760 (0.0006) [2023-03-06 23:52:25,009][81400] Updated weights for policy 0, policy_version 32770 (0.0006) [2023-03-06 23:52:25,805][81400] Updated weights for policy 0, policy_version 32780 (0.0006) [2023-03-06 23:52:26,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13315.5). Total num frames: 33571840. Throughput: 0: 13234.8. Samples: 33557602. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:52:26,247][81074] Avg episode reward: [(0, '1738.443')] [2023-03-06 23:52:26,492][81349] KL-divergence is very high: 3407.3586 [2023-03-06 23:52:26,569][81400] Updated weights for policy 0, policy_version 32790 (0.0006) [2023-03-06 23:52:27,345][81400] Updated weights for policy 0, policy_version 32800 (0.0006) [2023-03-06 23:52:28,127][81400] Updated weights for policy 0, policy_version 32810 (0.0006) [2023-03-06 23:52:28,903][81400] Updated weights for policy 0, policy_version 32820 (0.0006) [2023-03-06 23:52:29,673][81400] Updated weights for policy 0, policy_version 32830 (0.0006) [2023-03-06 23:52:30,453][81400] Updated weights for policy 0, policy_version 32840 (0.0005) [2023-03-06 23:52:31,214][81400] Updated weights for policy 0, policy_version 32850 (0.0006) [2023-03-06 23:52:31,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13315.5). Total num frames: 33638400. Throughput: 0: 13226.0. Samples: 33637038. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:52:31,247][81074] Avg episode reward: [(0, '1953.244')] [2023-03-06 23:52:31,977][81400] Updated weights for policy 0, policy_version 32860 (0.0007) [2023-03-06 23:52:32,763][81400] Updated weights for policy 0, policy_version 32870 (0.0005) [2023-03-06 23:52:33,538][81400] Updated weights for policy 0, policy_version 32880 (0.0006) [2023-03-06 23:52:34,328][81400] Updated weights for policy 0, policy_version 32890 (0.0006) [2023-03-06 23:52:35,098][81400] Updated weights for policy 0, policy_version 32900 (0.0006) [2023-03-06 23:52:35,870][81400] Updated weights for policy 0, policy_version 32910 (0.0006) [2023-03-06 23:52:36,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13312.0). Total num frames: 33703936. Throughput: 0: 13222.7. Samples: 33676535. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:52:36,247][81074] Avg episode reward: [(0, '1905.369')] [2023-03-06 23:52:36,634][81400] Updated weights for policy 0, policy_version 32920 (0.0006) [2023-03-06 23:52:37,406][81400] Updated weights for policy 0, policy_version 32930 (0.0006) [2023-03-06 23:52:38,193][81400] Updated weights for policy 0, policy_version 32940 (0.0006) [2023-03-06 23:52:38,945][81400] Updated weights for policy 0, policy_version 32950 (0.0006) [2023-03-06 23:52:39,727][81400] Updated weights for policy 0, policy_version 32960 (0.0006) [2023-03-06 23:52:40,509][81400] Updated weights for policy 0, policy_version 32970 (0.0006) [2023-03-06 23:52:41,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13312.0). Total num frames: 33770496. Throughput: 0: 13222.1. Samples: 33756011. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:52:41,237][81074] Avg episode reward: [(0, '1824.249')] [2023-03-06 23:52:41,272][81400] Updated weights for policy 0, policy_version 32980 (0.0006) [2023-03-06 23:52:41,420][81349] KL-divergence is very high: 36482188.0000 [2023-03-06 23:52:42,048][81400] Updated weights for policy 0, policy_version 32990 (0.0006) [2023-03-06 23:52:42,857][81400] Updated weights for policy 0, policy_version 33000 (0.0006) [2023-03-06 23:52:43,624][81400] Updated weights for policy 0, policy_version 33010 (0.0006) [2023-03-06 23:52:44,408][81400] Updated weights for policy 0, policy_version 33020 (0.0007) [2023-03-06 23:52:45,175][81400] Updated weights for policy 0, policy_version 33030 (0.0006) [2023-03-06 23:52:45,944][81349] KL-divergence is very high: 497.4876 [2023-03-06 23:52:45,952][81400] Updated weights for policy 0, policy_version 33040 (0.0006) [2023-03-06 23:52:46,236][81074] Fps is (10 sec: 13209.3, 60 sec: 13209.6, 300 sec: 13308.5). Total num frames: 33836032. Throughput: 0: 13220.8. Samples: 33835093. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:52:46,237][81074] Avg episode reward: [(0, '1804.520')] [2023-03-06 23:52:46,743][81400] Updated weights for policy 0, policy_version 33050 (0.0006) [2023-03-06 23:52:47,524][81400] Updated weights for policy 0, policy_version 33060 (0.0005) [2023-03-06 23:52:48,126][81349] KL-divergence is very high: 2203.2529 [2023-03-06 23:52:48,282][81400] Updated weights for policy 0, policy_version 33070 (0.0006) [2023-03-06 23:52:49,071][81400] Updated weights for policy 0, policy_version 33080 (0.0006) [2023-03-06 23:52:49,837][81400] Updated weights for policy 0, policy_version 33090 (0.0005) [2023-03-06 23:52:50,515][81349] KL-divergence is very high: 490.0833 [2023-03-06 23:52:50,623][81400] Updated weights for policy 0, policy_version 33100 (0.0007) [2023-03-06 23:52:51,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13308.5). 
Total num frames: 33902592. Throughput: 0: 13223.8. Samples: 33874667. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:52:51,237][81074] Avg episode reward: [(0, '1726.829')] [2023-03-06 23:52:51,384][81400] Updated weights for policy 0, policy_version 33110 (0.0006) [2023-03-06 23:52:52,144][81400] Updated weights for policy 0, policy_version 33120 (0.0006) [2023-03-06 23:52:52,896][81400] Updated weights for policy 0, policy_version 33130 (0.0006) [2023-03-06 23:52:53,681][81400] Updated weights for policy 0, policy_version 33140 (0.0005) [2023-03-06 23:52:54,470][81400] Updated weights for policy 0, policy_version 33150 (0.0006) [2023-03-06 23:52:55,246][81400] Updated weights for policy 0, policy_version 33160 (0.0006) [2023-03-06 23:52:56,035][81400] Updated weights for policy 0, policy_version 33170 (0.0006) [2023-03-06 23:52:56,236][81074] Fps is (10 sec: 13209.9, 60 sec: 13209.6, 300 sec: 13305.1). Total num frames: 33968128. Throughput: 0: 13217.6. Samples: 33954075. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:52:56,237][81074] Avg episode reward: [(0, '1587.931')] [2023-03-06 23:52:56,795][81400] Updated weights for policy 0, policy_version 33180 (0.0006) [2023-03-06 23:52:57,550][81400] Updated weights for policy 0, policy_version 33190 (0.0006) [2023-03-06 23:52:58,339][81400] Updated weights for policy 0, policy_version 33200 (0.0006) [2023-03-06 23:52:59,117][81400] Updated weights for policy 0, policy_version 33210 (0.0006) [2023-03-06 23:52:59,881][81400] Updated weights for policy 0, policy_version 33220 (0.0006) [2023-03-06 23:53:00,654][81400] Updated weights for policy 0, policy_version 33230 (0.0006) [2023-03-06 23:53:01,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13308.5). Total num frames: 34034688. Throughput: 0: 13216.8. Samples: 34033656. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:53:01,237][81074] Avg episode reward: [(0, '1557.035')] [2023-03-06 23:53:01,435][81400] Updated weights for policy 0, policy_version 33240 (0.0006) [2023-03-06 23:53:02,193][81400] Updated weights for policy 0, policy_version 33250 (0.0006) [2023-03-06 23:53:02,985][81400] Updated weights for policy 0, policy_version 33260 (0.0006) [2023-03-06 23:53:03,742][81400] Updated weights for policy 0, policy_version 33270 (0.0005) [2023-03-06 23:53:04,516][81400] Updated weights for policy 0, policy_version 33280 (0.0005) [2023-03-06 23:53:05,300][81400] Updated weights for policy 0, policy_version 33290 (0.0006) [2023-03-06 23:53:06,058][81400] Updated weights for policy 0, policy_version 33300 (0.0005) [2023-03-06 23:53:06,236][81074] Fps is (10 sec: 13311.9, 60 sec: 13226.7, 300 sec: 13308.5). Total num frames: 34101248. Throughput: 0: 13225.8. Samples: 34073310. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:53:06,237][81074] Avg episode reward: [(0, '1509.086')] [2023-03-06 23:53:06,838][81400] Updated weights for policy 0, policy_version 33310 (0.0005) [2023-03-06 23:53:07,612][81400] Updated weights for policy 0, policy_version 33320 (0.0006) [2023-03-06 23:53:08,379][81400] Updated weights for policy 0, policy_version 33330 (0.0007) [2023-03-06 23:53:09,168][81400] Updated weights for policy 0, policy_version 33340 (0.0007) [2023-03-06 23:53:09,933][81400] Updated weights for policy 0, policy_version 33350 (0.0006) [2023-03-06 23:53:10,709][81400] Updated weights for policy 0, policy_version 33360 (0.0006) [2023-03-06 23:53:11,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13305.1). Total num frames: 34166784. Throughput: 0: 13222.6. Samples: 34152618. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:53:11,237][81074] Avg episode reward: [(0, '1469.972')] [2023-03-06 23:53:11,487][81400] Updated weights for policy 0, policy_version 33370 (0.0006) [2023-03-06 23:53:12,254][81400] Updated weights for policy 0, policy_version 33380 (0.0006) [2023-03-06 23:53:13,013][81400] Updated weights for policy 0, policy_version 33390 (0.0006) [2023-03-06 23:53:13,793][81400] Updated weights for policy 0, policy_version 33400 (0.0006) [2023-03-06 23:53:14,560][81400] Updated weights for policy 0, policy_version 33410 (0.0007) [2023-03-06 23:53:15,340][81400] Updated weights for policy 0, policy_version 33420 (0.0006) [2023-03-06 23:53:16,103][81400] Updated weights for policy 0, policy_version 33430 (0.0006) [2023-03-06 23:53:16,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13301.6). Total num frames: 34233344. Throughput: 0: 13225.6. Samples: 34232189. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:53:16,237][81074] Avg episode reward: [(0, '1657.526')] [2023-03-06 23:53:16,884][81400] Updated weights for policy 0, policy_version 33440 (0.0006) [2023-03-06 23:53:17,643][81400] Updated weights for policy 0, policy_version 33450 (0.0006) [2023-03-06 23:53:18,416][81400] Updated weights for policy 0, policy_version 33460 (0.0006) [2023-03-06 23:53:19,195][81400] Updated weights for policy 0, policy_version 33470 (0.0005) [2023-03-06 23:53:19,947][81400] Updated weights for policy 0, policy_version 33480 (0.0006) [2023-03-06 23:53:20,705][81349] KL-divergence is very high: 34245.0117 [2023-03-06 23:53:20,713][81400] Updated weights for policy 0, policy_version 33490 (0.0007) [2023-03-06 23:53:21,236][81074] Fps is (10 sec: 13311.7, 60 sec: 13226.7, 300 sec: 13301.6). Total num frames: 34299904. Throughput: 0: 13232.5. Samples: 34271998. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:53:21,237][81074] Avg episode reward: [(0, '1499.185')] [2023-03-06 23:53:21,337][81349] KL-divergence is very high: 133.5966 [2023-03-06 23:53:21,497][81400] Updated weights for policy 0, policy_version 33500 (0.0006) [2023-03-06 23:53:22,274][81400] Updated weights for policy 0, policy_version 33510 (0.0006) [2023-03-06 23:53:23,041][81400] Updated weights for policy 0, policy_version 33520 (0.0006) [2023-03-06 23:53:23,810][81400] Updated weights for policy 0, policy_version 33530 (0.0006) [2023-03-06 23:53:24,016][81349] KL-divergence is very high: 1376.9470 [2023-03-06 23:53:24,567][81400] Updated weights for policy 0, policy_version 33540 (0.0007) [2023-03-06 23:53:25,332][81400] Updated weights for policy 0, policy_version 33550 (0.0005) [2023-03-06 23:53:26,110][81400] Updated weights for policy 0, policy_version 33560 (0.0005) [2023-03-06 23:53:26,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13243.7, 300 sec: 13301.6). Total num frames: 34366464. Throughput: 0: 13250.1. Samples: 34352265. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:53:26,237][81074] Avg episode reward: [(0, '1859.961')] [2023-03-06 23:53:26,882][81400] Updated weights for policy 0, policy_version 33570 (0.0008) [2023-03-06 23:53:27,637][81400] Updated weights for policy 0, policy_version 33580 (0.0007) [2023-03-06 23:53:28,422][81400] Updated weights for policy 0, policy_version 33590 (0.0007) [2023-03-06 23:53:29,188][81400] Updated weights for policy 0, policy_version 33600 (0.0007) [2023-03-06 23:53:29,973][81400] Updated weights for policy 0, policy_version 33610 (0.0006) [2023-03-06 23:53:30,746][81400] Updated weights for policy 0, policy_version 33620 (0.0006) [2023-03-06 23:53:31,236][81074] Fps is (10 sec: 13312.1, 60 sec: 13243.7, 300 sec: 13301.6). Total num frames: 34433024. Throughput: 0: 13260.4. Samples: 34431810. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:53:31,237][81074] Avg episode reward: [(0, '1566.553')] [2023-03-06 23:53:31,517][81400] Updated weights for policy 0, policy_version 33630 (0.0006) [2023-03-06 23:53:32,287][81400] Updated weights for policy 0, policy_version 33640 (0.0006) [2023-03-06 23:53:33,058][81400] Updated weights for policy 0, policy_version 33650 (0.0006) [2023-03-06 23:53:33,840][81400] Updated weights for policy 0, policy_version 33660 (0.0006) [2023-03-06 23:53:34,606][81400] Updated weights for policy 0, policy_version 33670 (0.0006) [2023-03-06 23:53:35,385][81400] Updated weights for policy 0, policy_version 33680 (0.0006) [2023-03-06 23:53:36,163][81400] Updated weights for policy 0, policy_version 33690 (0.0005) [2023-03-06 23:53:36,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13294.6). Total num frames: 34498560. Throughput: 0: 13258.8. Samples: 34471311. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:53:36,237][81074] Avg episode reward: [(0, '1499.878')] [2023-03-06 23:53:36,925][81400] Updated weights for policy 0, policy_version 33700 (0.0006) [2023-03-06 23:53:37,715][81400] Updated weights for policy 0, policy_version 33710 (0.0006) [2023-03-06 23:53:38,494][81400] Updated weights for policy 0, policy_version 33720 (0.0006) [2023-03-06 23:53:39,258][81400] Updated weights for policy 0, policy_version 33730 (0.0006) [2023-03-06 23:53:40,050][81400] Updated weights for policy 0, policy_version 33740 (0.0006) [2023-03-06 23:53:40,816][81400] Updated weights for policy 0, policy_version 33750 (0.0005) [2023-03-06 23:53:41,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13294.6). Total num frames: 34565120. Throughput: 0: 13254.3. Samples: 34550519. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:53:41,237][81074] Avg episode reward: [(0, '1387.550')] [2023-03-06 23:53:41,594][81400] Updated weights for policy 0, policy_version 33760 (0.0006) [2023-03-06 23:53:42,345][81400] Updated weights for policy 0, policy_version 33770 (0.0006) [2023-03-06 23:53:43,127][81400] Updated weights for policy 0, policy_version 33780 (0.0006) [2023-03-06 23:53:43,892][81400] Updated weights for policy 0, policy_version 33790 (0.0006) [2023-03-06 23:53:44,673][81400] Updated weights for policy 0, policy_version 33800 (0.0006) [2023-03-06 23:53:45,441][81400] Updated weights for policy 0, policy_version 33810 (0.0006) [2023-03-06 23:53:46,203][81400] Updated weights for policy 0, policy_version 33820 (0.0006) [2023-03-06 23:53:46,236][81074] Fps is (10 sec: 13311.9, 60 sec: 13260.8, 300 sec: 13294.6). Total num frames: 34631680. Throughput: 0: 13258.6. Samples: 34630295. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:53:46,237][81074] Avg episode reward: [(0, '1348.766')] [2023-03-06 23:53:47,015][81400] Updated weights for policy 0, policy_version 33830 (0.0007) [2023-03-06 23:53:47,784][81400] Updated weights for policy 0, policy_version 33840 (0.0007) [2023-03-06 23:53:48,544][81400] Updated weights for policy 0, policy_version 33850 (0.0006) [2023-03-06 23:53:49,317][81400] Updated weights for policy 0, policy_version 33860 (0.0006) [2023-03-06 23:53:50,097][81400] Updated weights for policy 0, policy_version 33870 (0.0006) [2023-03-06 23:53:50,877][81400] Updated weights for policy 0, policy_version 33880 (0.0005) [2023-03-06 23:53:51,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13294.6). Total num frames: 34697216. Throughput: 0: 13253.7. Samples: 34669726. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:53:51,237][81074] Avg episode reward: [(0, '1555.124')] [2023-03-06 23:53:51,633][81400] Updated weights for policy 0, policy_version 33890 (0.0006) [2023-03-06 23:53:52,414][81400] Updated weights for policy 0, policy_version 33900 (0.0006) [2023-03-06 23:53:53,172][81400] Updated weights for policy 0, policy_version 33910 (0.0006) [2023-03-06 23:53:53,942][81400] Updated weights for policy 0, policy_version 33920 (0.0005) [2023-03-06 23:53:54,718][81400] Updated weights for policy 0, policy_version 33930 (0.0007) [2023-03-06 23:53:55,492][81400] Updated weights for policy 0, policy_version 33940 (0.0006) [2023-03-06 23:53:56,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13294.6). Total num frames: 34763776. Throughput: 0: 13263.5. Samples: 34749476. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:53:56,237][81074] Avg episode reward: [(0, '1287.610')] [2023-03-06 23:53:56,242][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000033949_34763776.pth... 
[2023-03-06 23:53:56,278][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000030839_31579136.pth [2023-03-06 23:53:56,297][81400] Updated weights for policy 0, policy_version 33950 (0.0006) [2023-03-06 23:53:57,050][81400] Updated weights for policy 0, policy_version 33960 (0.0006) [2023-03-06 23:53:57,827][81400] Updated weights for policy 0, policy_version 33970 (0.0005) [2023-03-06 23:53:58,599][81400] Updated weights for policy 0, policy_version 33980 (0.0006) [2023-03-06 23:53:59,372][81400] Updated weights for policy 0, policy_version 33990 (0.0005) [2023-03-06 23:54:00,155][81400] Updated weights for policy 0, policy_version 34000 (0.0006) [2023-03-06 23:54:00,901][81400] Updated weights for policy 0, policy_version 34010 (0.0006) [2023-03-06 23:54:01,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13294.6). Total num frames: 34830336. Throughput: 0: 13260.8. Samples: 34828925. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:54:01,237][81074] Avg episode reward: [(0, '1275.642')] [2023-03-06 23:54:01,674][81400] Updated weights for policy 0, policy_version 34020 (0.0006) [2023-03-06 23:54:02,442][81400] Updated weights for policy 0, policy_version 34030 (0.0007) [2023-03-06 23:54:03,211][81400] Updated weights for policy 0, policy_version 34040 (0.0005) [2023-03-06 23:54:03,979][81400] Updated weights for policy 0, policy_version 34050 (0.0006) [2023-03-06 23:54:04,760][81400] Updated weights for policy 0, policy_version 34060 (0.0006) [2023-03-06 23:54:05,540][81400] Updated weights for policy 0, policy_version 34070 (0.0006) [2023-03-06 23:54:06,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13243.8, 300 sec: 13291.2). Total num frames: 34895872. Throughput: 0: 13262.9. Samples: 34868827. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:54:06,237][81074] Avg episode reward: [(0, '1339.117')] [2023-03-06 23:54:06,306][81400] Updated weights for policy 0, policy_version 34080 (0.0006) [2023-03-06 23:54:07,104][81400] Updated weights for policy 0, policy_version 34090 (0.0006) [2023-03-06 23:54:07,867][81400] Updated weights for policy 0, policy_version 34100 (0.0006) [2023-03-06 23:54:08,633][81400] Updated weights for policy 0, policy_version 34110 (0.0006) [2023-03-06 23:54:09,396][81400] Updated weights for policy 0, policy_version 34120 (0.0006) [2023-03-06 23:54:10,153][81400] Updated weights for policy 0, policy_version 34130 (0.0006) [2023-03-06 23:54:10,928][81400] Updated weights for policy 0, policy_version 34140 (0.0006) [2023-03-06 23:54:11,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13277.9, 300 sec: 13294.6). Total num frames: 34963456. Throughput: 0: 13249.2. Samples: 34948477. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:54:11,237][81074] Avg episode reward: [(0, '1180.442')] [2023-03-06 23:54:11,705][81400] Updated weights for policy 0, policy_version 34150 (0.0006) [2023-03-06 23:54:12,463][81400] Updated weights for policy 0, policy_version 34160 (0.0006) [2023-03-06 23:54:13,239][81400] Updated weights for policy 0, policy_version 34170 (0.0006) [2023-03-06 23:54:14,015][81400] Updated weights for policy 0, policy_version 34180 (0.0006) [2023-03-06 23:54:14,232][81349] KL-divergence is very high: 201.5846 [2023-03-06 23:54:14,780][81400] Updated weights for policy 0, policy_version 34190 (0.0007) [2023-03-06 23:54:15,553][81400] Updated weights for policy 0, policy_version 34200 (0.0007) [2023-03-06 23:54:16,236][81074] Fps is (10 sec: 13311.7, 60 sec: 13260.8, 300 sec: 13291.2). Total num frames: 35028992. Throughput: 0: 13250.7. Samples: 35028093. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:54:16,237][81074] Avg episode reward: [(0, '1498.541')] [2023-03-06 23:54:16,335][81400] Updated weights for policy 0, policy_version 34210 (0.0006) [2023-03-06 23:54:17,109][81400] Updated weights for policy 0, policy_version 34220 (0.0007) [2023-03-06 23:54:17,899][81400] Updated weights for policy 0, policy_version 34230 (0.0007) [2023-03-06 23:54:18,668][81400] Updated weights for policy 0, policy_version 34240 (0.0006) [2023-03-06 23:54:19,431][81400] Updated weights for policy 0, policy_version 34250 (0.0006) [2023-03-06 23:54:20,205][81400] Updated weights for policy 0, policy_version 34260 (0.0006) [2023-03-06 23:54:20,977][81400] Updated weights for policy 0, policy_version 34270 (0.0006) [2023-03-06 23:54:21,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13291.2). Total num frames: 35095552. Throughput: 0: 13254.0. Samples: 35067744. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:54:21,237][81074] Avg episode reward: [(0, '1375.544')] [2023-03-06 23:54:21,740][81400] Updated weights for policy 0, policy_version 34280 (0.0006) [2023-03-06 23:54:22,506][81400] Updated weights for policy 0, policy_version 34290 (0.0006) [2023-03-06 23:54:22,660][81349] KL-divergence is very high: 11098.5586 [2023-03-06 23:54:23,287][81400] Updated weights for policy 0, policy_version 34300 (0.0006) [2023-03-06 23:54:24,059][81400] Updated weights for policy 0, policy_version 34310 (0.0006) [2023-03-06 23:54:24,812][81400] Updated weights for policy 0, policy_version 34320 (0.0006) [2023-03-06 23:54:25,581][81400] Updated weights for policy 0, policy_version 34330 (0.0006) [2023-03-06 23:54:26,236][81074] Fps is (10 sec: 13312.2, 60 sec: 13260.8, 300 sec: 13287.7). Total num frames: 35162112. Throughput: 0: 13266.4. Samples: 35147506. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:54:26,237][81074] Avg episode reward: [(0, '1178.825')] [2023-03-06 23:54:26,337][81400] Updated weights for policy 0, policy_version 34340 (0.0005) [2023-03-06 23:54:27,117][81400] Updated weights for policy 0, policy_version 34350 (0.0006) [2023-03-06 23:54:27,886][81400] Updated weights for policy 0, policy_version 34360 (0.0006) [2023-03-06 23:54:28,664][81400] Updated weights for policy 0, policy_version 34370 (0.0005) [2023-03-06 23:54:29,442][81400] Updated weights for policy 0, policy_version 34380 (0.0006) [2023-03-06 23:54:29,904][81349] KL-divergence is very high: 805.5814 [2023-03-06 23:54:30,223][81400] Updated weights for policy 0, policy_version 34390 (0.0005) [2023-03-06 23:54:30,985][81400] Updated weights for policy 0, policy_version 34400 (0.0006) [2023-03-06 23:54:31,236][81074] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13284.2). Total num frames: 35228672. Throughput: 0: 13263.6. Samples: 35227156. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:54:31,237][81074] Avg episode reward: [(0, '1459.927')] [2023-03-06 23:54:31,766][81400] Updated weights for policy 0, policy_version 34410 (0.0006) [2023-03-06 23:54:32,532][81400] Updated weights for policy 0, policy_version 34420 (0.0006) [2023-03-06 23:54:33,294][81400] Updated weights for policy 0, policy_version 34430 (0.0006) [2023-03-06 23:54:34,067][81400] Updated weights for policy 0, policy_version 34440 (0.0006) [2023-03-06 23:54:34,846][81400] Updated weights for policy 0, policy_version 34450 (0.0006) [2023-03-06 23:54:35,609][81400] Updated weights for policy 0, policy_version 34460 (0.0006) [2023-03-06 23:54:36,236][81074] Fps is (10 sec: 13312.1, 60 sec: 13277.9, 300 sec: 13284.2). Total num frames: 35295232. Throughput: 0: 13277.0. Samples: 35267191. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:54:36,237][81074] Avg episode reward: [(0, '1370.381')] [2023-03-06 23:54:36,389][81400] Updated weights for policy 0, policy_version 34470 (0.0008) [2023-03-06 23:54:37,161][81400] Updated weights for policy 0, policy_version 34480 (0.0006) [2023-03-06 23:54:37,933][81400] Updated weights for policy 0, policy_version 34490 (0.0006) [2023-03-06 23:54:38,716][81400] Updated weights for policy 0, policy_version 34500 (0.0007) [2023-03-06 23:54:39,484][81400] Updated weights for policy 0, policy_version 34510 (0.0006) [2023-03-06 23:54:40,270][81400] Updated weights for policy 0, policy_version 34520 (0.0006) [2023-03-06 23:54:41,061][81400] Updated weights for policy 0, policy_version 34530 (0.0006) [2023-03-06 23:54:41,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13277.3). Total num frames: 35360768. Throughput: 0: 13265.6. Samples: 35346429. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:54:41,237][81074] Avg episode reward: [(0, '1536.603')] [2023-03-06 23:54:41,823][81400] Updated weights for policy 0, policy_version 34540 (0.0006) [2023-03-06 23:54:42,608][81400] Updated weights for policy 0, policy_version 34550 (0.0008) [2023-03-06 23:54:43,379][81400] Updated weights for policy 0, policy_version 34560 (0.0007) [2023-03-06 23:54:44,165][81400] Updated weights for policy 0, policy_version 34570 (0.0006) [2023-03-06 23:54:44,949][81400] Updated weights for policy 0, policy_version 34580 (0.0006) [2023-03-06 23:54:45,717][81400] Updated weights for policy 0, policy_version 34590 (0.0006) [2023-03-06 23:54:46,236][81074] Fps is (10 sec: 13106.9, 60 sec: 13243.7, 300 sec: 13270.3). Total num frames: 35426304. Throughput: 0: 13256.6. Samples: 35425474. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:54:46,237][81074] Avg episode reward: [(0, '1521.072')] [2023-03-06 23:54:46,468][81400] Updated weights for policy 0, policy_version 34600 (0.0006) [2023-03-06 23:54:47,276][81400] Updated weights for policy 0, policy_version 34610 (0.0005) [2023-03-06 23:54:48,038][81400] Updated weights for policy 0, policy_version 34620 (0.0006) [2023-03-06 23:54:48,807][81400] Updated weights for policy 0, policy_version 34630 (0.0007) [2023-03-06 23:54:49,581][81400] Updated weights for policy 0, policy_version 34640 (0.0006) [2023-03-06 23:54:50,382][81400] Updated weights for policy 0, policy_version 34650 (0.0006) [2023-03-06 23:54:51,128][81400] Updated weights for policy 0, policy_version 34660 (0.0006) [2023-03-06 23:54:51,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13266.9). Total num frames: 35492864. Throughput: 0: 13248.2. Samples: 35464995. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:54:51,237][81074] Avg episode reward: [(0, '1716.990')] [2023-03-06 23:54:51,898][81400] Updated weights for policy 0, policy_version 34670 (0.0006) [2023-03-06 23:54:52,681][81400] Updated weights for policy 0, policy_version 34680 (0.0006) [2023-03-06 23:54:53,458][81400] Updated weights for policy 0, policy_version 34690 (0.0006) [2023-03-06 23:54:54,241][81400] Updated weights for policy 0, policy_version 34700 (0.0006) [2023-03-06 23:54:55,011][81400] Updated weights for policy 0, policy_version 34710 (0.0005) [2023-03-06 23:54:55,787][81400] Updated weights for policy 0, policy_version 34720 (0.0006) [2023-03-06 23:54:56,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13259.9). Total num frames: 35558400. Throughput: 0: 13237.3. Samples: 35544155. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:54:56,237][81074] Avg episode reward: [(0, '1588.253')] [2023-03-06 23:54:56,556][81400] Updated weights for policy 0, policy_version 34730 (0.0006) [2023-03-06 23:54:57,329][81400] Updated weights for policy 0, policy_version 34740 (0.0006) [2023-03-06 23:54:58,107][81400] Updated weights for policy 0, policy_version 34750 (0.0006) [2023-03-06 23:54:58,880][81400] Updated weights for policy 0, policy_version 34760 (0.0005) [2023-03-06 23:54:59,651][81400] Updated weights for policy 0, policy_version 34770 (0.0007) [2023-03-06 23:55:00,417][81400] Updated weights for policy 0, policy_version 34780 (0.0006) [2023-03-06 23:55:01,196][81400] Updated weights for policy 0, policy_version 34790 (0.0006) [2023-03-06 23:55:01,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13259.9). Total num frames: 35624960. Throughput: 0: 13236.2. Samples: 35623721. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:55:01,237][81074] Avg episode reward: [(0, '1594.074')] [2023-03-06 23:55:01,968][81400] Updated weights for policy 0, policy_version 34800 (0.0006) [2023-03-06 23:55:02,752][81400] Updated weights for policy 0, policy_version 34810 (0.0006) [2023-03-06 23:55:03,527][81400] Updated weights for policy 0, policy_version 34820 (0.0005) [2023-03-06 23:55:04,298][81400] Updated weights for policy 0, policy_version 34830 (0.0006) [2023-03-06 23:55:05,066][81400] Updated weights for policy 0, policy_version 34840 (0.0005) [2023-03-06 23:55:05,834][81400] Updated weights for policy 0, policy_version 34850 (0.0006) [2023-03-06 23:55:06,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13259.9). Total num frames: 35691520. Throughput: 0: 13237.1. Samples: 35663411. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:55:06,237][81074] Avg episode reward: [(0, '1632.310')] [2023-03-06 23:55:06,600][81400] Updated weights for policy 0, policy_version 34860 (0.0006) [2023-03-06 23:55:07,372][81400] Updated weights for policy 0, policy_version 34870 (0.0006) [2023-03-06 23:55:08,145][81400] Updated weights for policy 0, policy_version 34880 (0.0006) [2023-03-06 23:55:08,924][81400] Updated weights for policy 0, policy_version 34890 (0.0006) [2023-03-06 23:55:09,694][81400] Updated weights for policy 0, policy_version 34900 (0.0006) [2023-03-06 23:55:10,471][81400] Updated weights for policy 0, policy_version 34910 (0.0006) [2023-03-06 23:55:11,231][81400] Updated weights for policy 0, policy_version 34920 (0.0006) [2023-03-06 23:55:11,236][81074] Fps is (10 sec: 13312.1, 60 sec: 13243.7, 300 sec: 13256.5). Total num frames: 35758080. Throughput: 0: 13233.4. Samples: 35743009. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:55:11,237][81074] Avg episode reward: [(0, '1582.068')] [2023-03-06 23:55:11,989][81400] Updated weights for policy 0, policy_version 34930 (0.0006) [2023-03-06 23:55:12,742][81400] Updated weights for policy 0, policy_version 34940 (0.0006) [2023-03-06 23:55:13,514][81400] Updated weights for policy 0, policy_version 34950 (0.0006) [2023-03-06 23:55:14,279][81400] Updated weights for policy 0, policy_version 34960 (0.0007) [2023-03-06 23:55:15,055][81400] Updated weights for policy 0, policy_version 34970 (0.0006) [2023-03-06 23:55:15,821][81400] Updated weights for policy 0, policy_version 34980 (0.0006) [2023-03-06 23:55:16,236][81074] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13259.9). Total num frames: 35824640. Throughput: 0: 13247.2. Samples: 35823278. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:55:16,237][81074] Avg episode reward: [(0, '1655.720')] [2023-03-06 23:55:16,597][81400] Updated weights for policy 0, policy_version 34990 (0.0006) [2023-03-06 23:55:17,396][81400] Updated weights for policy 0, policy_version 35000 (0.0006) [2023-03-06 23:55:18,174][81400] Updated weights for policy 0, policy_version 35010 (0.0006) [2023-03-06 23:55:18,950][81400] Updated weights for policy 0, policy_version 35020 (0.0005) [2023-03-06 23:55:19,715][81400] Updated weights for policy 0, policy_version 35030 (0.0006) [2023-03-06 23:55:20,501][81400] Updated weights for policy 0, policy_version 35040 (0.0006) [2023-03-06 23:55:21,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13256.5). Total num frames: 35890176. Throughput: 0: 13232.0. Samples: 35862631. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:55:21,237][81074] Avg episode reward: [(0, '1428.487')] [2023-03-06 23:55:21,245][81400] Updated weights for policy 0, policy_version 35050 (0.0006) [2023-03-06 23:55:22,008][81400] Updated weights for policy 0, policy_version 35060 (0.0006) [2023-03-06 23:55:22,784][81400] Updated weights for policy 0, policy_version 35070 (0.0007) [2023-03-06 23:55:23,565][81400] Updated weights for policy 0, policy_version 35080 (0.0006) [2023-03-06 23:55:24,341][81400] Updated weights for policy 0, policy_version 35090 (0.0006) [2023-03-06 23:55:25,110][81400] Updated weights for policy 0, policy_version 35100 (0.0006) [2023-03-06 23:55:25,886][81400] Updated weights for policy 0, policy_version 35110 (0.0006) [2023-03-06 23:55:26,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13253.0). Total num frames: 35956736. Throughput: 0: 13242.9. Samples: 35942357. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:55:26,237][81074] Avg episode reward: [(0, '1521.904')] [2023-03-06 23:55:26,658][81400] Updated weights for policy 0, policy_version 35120 (0.0007) [2023-03-06 23:55:27,433][81400] Updated weights for policy 0, policy_version 35130 (0.0005) [2023-03-06 23:55:28,197][81400] Updated weights for policy 0, policy_version 35140 (0.0006) [2023-03-06 23:55:28,987][81400] Updated weights for policy 0, policy_version 35150 (0.0006) [2023-03-06 23:55:29,767][81400] Updated weights for policy 0, policy_version 35160 (0.0006) [2023-03-06 23:55:30,530][81400] Updated weights for policy 0, policy_version 35170 (0.0007) [2023-03-06 23:55:31,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13243.7, 300 sec: 13256.5). Total num frames: 36023296. Throughput: 0: 13250.6. Samples: 36021751. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:55:31,237][81074] Avg episode reward: [(0, '1692.538')] [2023-03-06 23:55:31,302][81400] Updated weights for policy 0, policy_version 35180 (0.0006) [2023-03-06 23:55:32,075][81400] Updated weights for policy 0, policy_version 35190 (0.0005) [2023-03-06 23:55:32,842][81400] Updated weights for policy 0, policy_version 35200 (0.0006) [2023-03-06 23:55:33,607][81400] Updated weights for policy 0, policy_version 35210 (0.0006) [2023-03-06 23:55:34,373][81400] Updated weights for policy 0, policy_version 35220 (0.0006) [2023-03-06 23:55:35,137][81400] Updated weights for policy 0, policy_version 35230 (0.0006) [2023-03-06 23:55:35,910][81400] Updated weights for policy 0, policy_version 35240 (0.0006) [2023-03-06 23:55:36,236][81074] Fps is (10 sec: 13312.1, 60 sec: 13243.7, 300 sec: 13256.5). Total num frames: 36089856. Throughput: 0: 13257.6. Samples: 36061585. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:55:36,247][81074] Avg episode reward: [(0, '1550.580')] [2023-03-06 23:55:36,716][81400] Updated weights for policy 0, policy_version 35250 (0.0007) [2023-03-06 23:55:37,483][81400] Updated weights for policy 0, policy_version 35260 (0.0006) [2023-03-06 23:55:38,246][81400] Updated weights for policy 0, policy_version 35270 (0.0006) [2023-03-06 23:55:39,042][81400] Updated weights for policy 0, policy_version 35280 (0.0006) [2023-03-06 23:55:39,805][81400] Updated weights for policy 0, policy_version 35290 (0.0006) [2023-03-06 23:55:40,575][81400] Updated weights for policy 0, policy_version 35300 (0.0006) [2023-03-06 23:55:41,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13253.0). Total num frames: 36155392. Throughput: 0: 13263.1. Samples: 36140994. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:55:41,247][81074] Avg episode reward: [(0, '1418.180')] [2023-03-06 23:55:41,366][81400] Updated weights for policy 0, policy_version 35310 (0.0006) [2023-03-06 23:55:42,140][81400] Updated weights for policy 0, policy_version 35320 (0.0007) [2023-03-06 23:55:42,908][81400] Updated weights for policy 0, policy_version 35330 (0.0006) [2023-03-06 23:55:43,668][81400] Updated weights for policy 0, policy_version 35340 (0.0006) [2023-03-06 23:55:44,453][81400] Updated weights for policy 0, policy_version 35350 (0.0006) [2023-03-06 23:55:45,226][81400] Updated weights for policy 0, policy_version 35360 (0.0006) [2023-03-06 23:55:46,004][81400] Updated weights for policy 0, policy_version 35370 (0.0006) [2023-03-06 23:55:46,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13253.0). Total num frames: 36221952. Throughput: 0: 13255.8. Samples: 36220233. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:55:46,247][81074] Avg episode reward: [(0, '1495.100')] [2023-03-06 23:55:46,793][81400] Updated weights for policy 0, policy_version 35380 (0.0007) [2023-03-06 23:55:47,562][81400] Updated weights for policy 0, policy_version 35390 (0.0007) [2023-03-06 23:55:48,338][81400] Updated weights for policy 0, policy_version 35400 (0.0006) [2023-03-06 23:55:49,113][81400] Updated weights for policy 0, policy_version 35410 (0.0006) [2023-03-06 23:55:49,883][81400] Updated weights for policy 0, policy_version 35420 (0.0006) [2023-03-06 23:55:50,683][81400] Updated weights for policy 0, policy_version 35430 (0.0006) [2023-03-06 23:55:51,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13253.0). Total num frames: 36287488. Throughput: 0: 13249.2. Samples: 36259624. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:55:51,237][81074] Avg episode reward: [(0, '1326.754')] [2023-03-06 23:55:51,442][81400] Updated weights for policy 0, policy_version 35440 (0.0007) [2023-03-06 23:55:52,209][81400] Updated weights for policy 0, policy_version 35450 (0.0005) [2023-03-06 23:55:52,990][81400] Updated weights for policy 0, policy_version 35460 (0.0006) [2023-03-06 23:55:53,772][81400] Updated weights for policy 0, policy_version 35470 (0.0008) [2023-03-06 23:55:54,543][81400] Updated weights for policy 0, policy_version 35480 (0.0006) [2023-03-06 23:55:55,338][81400] Updated weights for policy 0, policy_version 35490 (0.0006) [2023-03-06 23:55:56,104][81400] Updated weights for policy 0, policy_version 35500 (0.0006) [2023-03-06 23:55:56,236][81074] Fps is (10 sec: 13107.0, 60 sec: 13243.7, 300 sec: 13249.5). Total num frames: 36353024. Throughput: 0: 13241.2. Samples: 36338867. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:55:56,237][81074] Avg episode reward: [(0, '1432.008')] [2023-03-06 23:55:56,242][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000035501_36353024.pth... [2023-03-06 23:55:56,273][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000032398_33175552.pth [2023-03-06 23:55:56,881][81400] Updated weights for policy 0, policy_version 35510 (0.0006) [2023-03-06 23:55:57,658][81400] Updated weights for policy 0, policy_version 35520 (0.0006) [2023-03-06 23:55:58,438][81400] Updated weights for policy 0, policy_version 35530 (0.0006) [2023-03-06 23:55:59,206][81400] Updated weights for policy 0, policy_version 35540 (0.0006) [2023-03-06 23:55:59,993][81400] Updated weights for policy 0, policy_version 35550 (0.0006) [2023-03-06 23:56:00,755][81400] Updated weights for policy 0, policy_version 35560 (0.0007) [2023-03-06 23:56:01,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13246.1). Total num frames: 36419584. Throughput: 0: 13217.0. Samples: 36418045. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:56:01,237][81074] Avg episode reward: [(0, '1513.895')] [2023-03-06 23:56:01,528][81400] Updated weights for policy 0, policy_version 35570 (0.0006) [2023-03-06 23:56:02,301][81400] Updated weights for policy 0, policy_version 35580 (0.0006) [2023-03-06 23:56:03,077][81400] Updated weights for policy 0, policy_version 35590 (0.0006) [2023-03-06 23:56:03,837][81400] Updated weights for policy 0, policy_version 35600 (0.0006) [2023-03-06 23:56:04,610][81400] Updated weights for policy 0, policy_version 35610 (0.0006) [2023-03-06 23:56:05,387][81400] Updated weights for policy 0, policy_version 35620 (0.0006) [2023-03-06 23:56:06,154][81400] Updated weights for policy 0, policy_version 35630 (0.0006) [2023-03-06 23:56:06,236][81074] Fps is (10 sec: 13209.8, 60 sec: 13226.7, 300 sec: 13246.1). Total num frames: 36485120. Throughput: 0: 13225.8. Samples: 36457789. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:56:06,237][81074] Avg episode reward: [(0, '1402.414')] [2023-03-06 23:56:06,927][81400] Updated weights for policy 0, policy_version 35640 (0.0006) [2023-03-06 23:56:07,687][81400] Updated weights for policy 0, policy_version 35650 (0.0005) [2023-03-06 23:56:08,473][81400] Updated weights for policy 0, policy_version 35660 (0.0006) [2023-03-06 23:56:09,229][81400] Updated weights for policy 0, policy_version 35670 (0.0006) [2023-03-06 23:56:10,007][81400] Updated weights for policy 0, policy_version 35680 (0.0006) [2023-03-06 23:56:10,785][81400] Updated weights for policy 0, policy_version 35690 (0.0007) [2023-03-06 23:56:11,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13226.6, 300 sec: 13242.6). Total num frames: 36551680. Throughput: 0: 13225.7. Samples: 36537515. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:56:11,237][81074] Avg episode reward: [(0, '1312.550')] [2023-03-06 23:56:11,573][81400] Updated weights for policy 0, policy_version 35700 (0.0006) [2023-03-06 23:56:12,337][81400] Updated weights for policy 0, policy_version 35710 (0.0007) [2023-03-06 23:56:13,114][81400] Updated weights for policy 0, policy_version 35720 (0.0007) [2023-03-06 23:56:13,900][81400] Updated weights for policy 0, policy_version 35730 (0.0007) [2023-03-06 23:56:14,683][81400] Updated weights for policy 0, policy_version 35740 (0.0006) [2023-03-06 23:56:15,449][81400] Updated weights for policy 0, policy_version 35750 (0.0007) [2023-03-06 23:56:16,231][81400] Updated weights for policy 0, policy_version 35760 (0.0006) [2023-03-06 23:56:16,236][81074] Fps is (10 sec: 13311.7, 60 sec: 13226.6, 300 sec: 13246.0). Total num frames: 36618240. Throughput: 0: 13217.2. Samples: 36616526. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:56:16,237][81074] Avg episode reward: [(0, '1354.427')] [2023-03-06 23:56:17,009][81400] Updated weights for policy 0, policy_version 35770 (0.0006) [2023-03-06 23:56:17,782][81400] Updated weights for policy 0, policy_version 35780 (0.0006) [2023-03-06 23:56:18,566][81400] Updated weights for policy 0, policy_version 35790 (0.0007) [2023-03-06 23:56:19,332][81400] Updated weights for policy 0, policy_version 35800 (0.0007) [2023-03-06 23:56:20,100][81400] Updated weights for policy 0, policy_version 35810 (0.0006) [2023-03-06 23:56:20,864][81400] Updated weights for policy 0, policy_version 35820 (0.0006) [2023-03-06 23:56:21,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13226.6, 300 sec: 13242.6). Total num frames: 36683776. Throughput: 0: 13213.6. Samples: 36656199. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-06 23:56:21,237][81074] Avg episode reward: [(0, '1318.806')] [2023-03-06 23:56:21,652][81400] Updated weights for policy 0, policy_version 35830 (0.0005) [2023-03-06 23:56:22,422][81400] Updated weights for policy 0, policy_version 35840 (0.0006) [2023-03-06 23:56:23,195][81400] Updated weights for policy 0, policy_version 35850 (0.0006) [2023-03-06 23:56:23,979][81400] Updated weights for policy 0, policy_version 35860 (0.0006) [2023-03-06 23:56:24,734][81400] Updated weights for policy 0, policy_version 35870 (0.0006) [2023-03-06 23:56:25,527][81400] Updated weights for policy 0, policy_version 35880 (0.0006) [2023-03-06 23:56:26,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13226.6, 300 sec: 13242.6). Total num frames: 36750336. Throughput: 0: 13213.3. Samples: 36735595. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:56:26,237][81074] Avg episode reward: [(0, '1362.462')] [2023-03-06 23:56:26,289][81400] Updated weights for policy 0, policy_version 35890 (0.0005) [2023-03-06 23:56:27,083][81400] Updated weights for policy 0, policy_version 35900 (0.0006) [2023-03-06 23:56:27,870][81400] Updated weights for policy 0, policy_version 35910 (0.0006) [2023-03-06 23:56:28,644][81400] Updated weights for policy 0, policy_version 35920 (0.0006) [2023-03-06 23:56:29,412][81400] Updated weights for policy 0, policy_version 35930 (0.0006) [2023-03-06 23:56:30,210][81400] Updated weights for policy 0, policy_version 35940 (0.0006) [2023-03-06 23:56:30,990][81400] Updated weights for policy 0, policy_version 35950 (0.0008) [2023-03-06 23:56:31,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13209.6, 300 sec: 13239.1). Total num frames: 36815872. Throughput: 0: 13202.9. Samples: 36814366. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:56:31,237][81074] Avg episode reward: [(0, '1474.019')] [2023-03-06 23:56:31,767][81400] Updated weights for policy 0, policy_version 35960 (0.0007) [2023-03-06 23:56:32,554][81400] Updated weights for policy 0, policy_version 35970 (0.0006) [2023-03-06 23:56:33,302][81400] Updated weights for policy 0, policy_version 35980 (0.0006) [2023-03-06 23:56:34,081][81400] Updated weights for policy 0, policy_version 35990 (0.0006) [2023-03-06 23:56:34,843][81400] Updated weights for policy 0, policy_version 36000 (0.0006) [2023-03-06 23:56:35,621][81400] Updated weights for policy 0, policy_version 36010 (0.0006) [2023-03-06 23:56:36,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13192.5, 300 sec: 13235.6). Total num frames: 36881408. Throughput: 0: 13209.0. Samples: 36854029. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:56:36,237][81074] Avg episode reward: [(0, '1522.913')] [2023-03-06 23:56:36,396][81400] Updated weights for policy 0, policy_version 36020 (0.0005) [2023-03-06 23:56:37,163][81400] Updated weights for policy 0, policy_version 36030 (0.0006) [2023-03-06 23:56:37,957][81400] Updated weights for policy 0, policy_version 36040 (0.0007) [2023-03-06 23:56:38,730][81400] Updated weights for policy 0, policy_version 36050 (0.0006) [2023-03-06 23:56:39,497][81400] Updated weights for policy 0, policy_version 36060 (0.0007) [2023-03-06 23:56:40,269][81400] Updated weights for policy 0, policy_version 36070 (0.0006) [2023-03-06 23:56:41,032][81400] Updated weights for policy 0, policy_version 36080 (0.0006) [2023-03-06 23:56:41,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13209.6, 300 sec: 13235.6). Total num frames: 36947968. Throughput: 0: 13215.7. Samples: 36933572. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:56:41,237][81074] Avg episode reward: [(0, '1334.880')] [2023-03-06 23:56:41,813][81400] Updated weights for policy 0, policy_version 36090 (0.0006) [2023-03-06 23:56:42,588][81400] Updated weights for policy 0, policy_version 36100 (0.0006) [2023-03-06 23:56:43,346][81400] Updated weights for policy 0, policy_version 36110 (0.0007) [2023-03-06 23:56:44,128][81400] Updated weights for policy 0, policy_version 36120 (0.0005) [2023-03-06 23:56:44,889][81400] Updated weights for policy 0, policy_version 36130 (0.0006) [2023-03-06 23:56:45,656][81400] Updated weights for policy 0, policy_version 36140 (0.0006) [2023-03-06 23:56:46,236][81074] Fps is (10 sec: 13312.1, 60 sec: 13209.6, 300 sec: 13239.1). Total num frames: 37014528. Throughput: 0: 13228.2. Samples: 37013313. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:56:46,237][81074] Avg episode reward: [(0, '1490.135')] [2023-03-06 23:56:46,427][81400] Updated weights for policy 0, policy_version 36150 (0.0006) [2023-03-06 23:56:47,207][81400] Updated weights for policy 0, policy_version 36160 (0.0006) [2023-03-06 23:56:47,992][81400] Updated weights for policy 0, policy_version 36170 (0.0006) [2023-03-06 23:56:48,772][81400] Updated weights for policy 0, policy_version 36180 (0.0006) [2023-03-06 23:56:49,548][81400] Updated weights for policy 0, policy_version 36190 (0.0005) [2023-03-06 23:56:50,333][81400] Updated weights for policy 0, policy_version 36200 (0.0007) [2023-03-06 23:56:51,089][81400] Updated weights for policy 0, policy_version 36210 (0.0006) [2023-03-06 23:56:51,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13209.6, 300 sec: 13235.6). Total num frames: 37080064. Throughput: 0: 13222.4. Samples: 37052799. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:56:51,237][81074] Avg episode reward: [(0, '1512.235')] [2023-03-06 23:56:51,862][81400] Updated weights for policy 0, policy_version 36220 (0.0006) [2023-03-06 23:56:52,654][81400] Updated weights for policy 0, policy_version 36230 (0.0006) [2023-03-06 23:56:53,406][81400] Updated weights for policy 0, policy_version 36240 (0.0006) [2023-03-06 23:56:54,193][81400] Updated weights for policy 0, policy_version 36250 (0.0006) [2023-03-06 23:56:54,968][81400] Updated weights for policy 0, policy_version 36260 (0.0006) [2023-03-06 23:56:55,745][81400] Updated weights for policy 0, policy_version 36270 (0.0007) [2023-03-06 23:56:56,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13239.1). Total num frames: 37146624. Throughput: 0: 13213.5. Samples: 37132122. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:56:56,247][81074] Avg episode reward: [(0, '1445.552')] [2023-03-06 23:56:56,513][81400] Updated weights for policy 0, policy_version 36280 (0.0006) [2023-03-06 23:56:57,299][81400] Updated weights for policy 0, policy_version 36290 (0.0006) [2023-03-06 23:56:58,071][81400] Updated weights for policy 0, policy_version 36300 (0.0006) [2023-03-06 23:56:58,853][81400] Updated weights for policy 0, policy_version 36310 (0.0006) [2023-03-06 23:56:59,628][81400] Updated weights for policy 0, policy_version 36320 (0.0006) [2023-03-06 23:57:00,394][81400] Updated weights for policy 0, policy_version 36330 (0.0006) [2023-03-06 23:57:01,181][81400] Updated weights for policy 0, policy_version 36340 (0.0006) [2023-03-06 23:57:01,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13209.6, 300 sec: 13235.6). Total num frames: 37212160. Throughput: 0: 13219.5. Samples: 37211403. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:57:01,247][81074] Avg episode reward: [(0, '1659.620')] [2023-03-06 23:57:01,940][81400] Updated weights for policy 0, policy_version 36350 (0.0006) [2023-03-06 23:57:02,697][81400] Updated weights for policy 0, policy_version 36360 (0.0006) [2023-03-06 23:57:03,495][81400] Updated weights for policy 0, policy_version 36370 (0.0005) [2023-03-06 23:57:04,276][81400] Updated weights for policy 0, policy_version 36380 (0.0006) [2023-03-06 23:57:05,048][81400] Updated weights for policy 0, policy_version 36390 (0.0006) [2023-03-06 23:57:05,825][81400] Updated weights for policy 0, policy_version 36400 (0.0006) [2023-03-06 23:57:06,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13226.6, 300 sec: 13239.1). Total num frames: 37278720. Throughput: 0: 13216.7. Samples: 37250949. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:57:06,248][81074] Avg episode reward: [(0, '1486.840')] [2023-03-06 23:57:06,602][81400] Updated weights for policy 0, policy_version 36410 (0.0006) [2023-03-06 23:57:07,367][81400] Updated weights for policy 0, policy_version 36420 (0.0006) [2023-03-06 23:57:08,155][81400] Updated weights for policy 0, policy_version 36430 (0.0005) [2023-03-06 23:57:08,945][81400] Updated weights for policy 0, policy_version 36440 (0.0006) [2023-03-06 23:57:09,716][81400] Updated weights for policy 0, policy_version 36450 (0.0006) [2023-03-06 23:57:10,497][81400] Updated weights for policy 0, policy_version 36460 (0.0006) [2023-03-06 23:57:11,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13209.6, 300 sec: 13235.6). Total num frames: 37344256. Throughput: 0: 13210.5. Samples: 37330066. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:57:11,248][81074] Avg episode reward: [(0, '1491.540')] [2023-03-06 23:57:11,258][81400] Updated weights for policy 0, policy_version 36470 (0.0006) [2023-03-06 23:57:12,034][81400] Updated weights for policy 0, policy_version 36480 (0.0006) [2023-03-06 23:57:12,807][81349] KL-divergence is very high: 103.7943 [2023-03-06 23:57:12,815][81400] Updated weights for policy 0, policy_version 36490 (0.0006) [2023-03-06 23:57:13,593][81400] Updated weights for policy 0, policy_version 36500 (0.0006) [2023-03-06 23:57:14,357][81400] Updated weights for policy 0, policy_version 36510 (0.0006) [2023-03-06 23:57:15,128][81400] Updated weights for policy 0, policy_version 36520 (0.0006) [2023-03-06 23:57:15,894][81400] Updated weights for policy 0, policy_version 36530 (0.0006) [2023-03-06 23:57:16,236][81074] Fps is (10 sec: 13209.8, 60 sec: 13209.6, 300 sec: 13235.6). Total num frames: 37410816. Throughput: 0: 13224.4. Samples: 37409466. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:57:16,247][81074] Avg episode reward: [(0, '1358.845')] [2023-03-06 23:57:16,664][81400] Updated weights for policy 0, policy_version 36540 (0.0006) [2023-03-06 23:57:17,429][81400] Updated weights for policy 0, policy_version 36550 (0.0006) [2023-03-06 23:57:18,215][81400] Updated weights for policy 0, policy_version 36560 (0.0006) [2023-03-06 23:57:18,984][81400] Updated weights for policy 0, policy_version 36570 (0.0006) [2023-03-06 23:57:19,750][81400] Updated weights for policy 0, policy_version 36580 (0.0006) [2023-03-06 23:57:20,527][81400] Updated weights for policy 0, policy_version 36590 (0.0008) [2023-03-06 23:57:21,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13209.6, 300 sec: 13235.6). Total num frames: 37476352. Throughput: 0: 13227.2. Samples: 37449252. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:57:21,237][81074] Avg episode reward: [(0, '1384.907')] [2023-03-06 23:57:21,309][81400] Updated weights for policy 0, policy_version 36600 (0.0006) [2023-03-06 23:57:22,077][81400] Updated weights for policy 0, policy_version 36610 (0.0006) [2023-03-06 23:57:22,864][81400] Updated weights for policy 0, policy_version 36620 (0.0006) [2023-03-06 23:57:23,655][81400] Updated weights for policy 0, policy_version 36630 (0.0006) [2023-03-06 23:57:24,417][81400] Updated weights for policy 0, policy_version 36640 (0.0006) [2023-03-06 23:57:25,196][81400] Updated weights for policy 0, policy_version 36650 (0.0006) [2023-03-06 23:57:25,975][81400] Updated weights for policy 0, policy_version 36660 (0.0006) [2023-03-06 23:57:26,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13209.6, 300 sec: 13235.6). Total num frames: 37542912. Throughput: 0: 13223.8. Samples: 37528643. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:57:26,237][81074] Avg episode reward: [(0, '1423.938')] [2023-03-06 23:57:26,755][81400] Updated weights for policy 0, policy_version 36670 (0.0006) [2023-03-06 23:57:27,518][81400] Updated weights for policy 0, policy_version 36680 (0.0006) [2023-03-06 23:57:28,270][81400] Updated weights for policy 0, policy_version 36690 (0.0006) [2023-03-06 23:57:29,042][81400] Updated weights for policy 0, policy_version 36700 (0.0007) [2023-03-06 23:57:29,813][81400] Updated weights for policy 0, policy_version 36710 (0.0006) [2023-03-06 23:57:30,591][81400] Updated weights for policy 0, policy_version 36720 (0.0005) [2023-03-06 23:57:31,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13226.7, 300 sec: 13239.1). Total num frames: 37609472. Throughput: 0: 13218.5. Samples: 37608146. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:57:31,237][81074] Avg episode reward: [(0, '1382.806')] [2023-03-06 23:57:31,371][81400] Updated weights for policy 0, policy_version 36730 (0.0006) [2023-03-06 23:57:32,151][81400] Updated weights for policy 0, policy_version 36740 (0.0005) [2023-03-06 23:57:32,937][81400] Updated weights for policy 0, policy_version 36750 (0.0006) [2023-03-06 23:57:33,707][81400] Updated weights for policy 0, policy_version 36760 (0.0006) [2023-03-06 23:57:34,488][81400] Updated weights for policy 0, policy_version 36770 (0.0006) [2023-03-06 23:57:35,262][81400] Updated weights for policy 0, policy_version 36780 (0.0006) [2023-03-06 23:57:36,038][81400] Updated weights for policy 0, policy_version 36790 (0.0006) [2023-03-06 23:57:36,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13235.6). Total num frames: 37675008. Throughput: 0: 13217.9. Samples: 37647606. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:57:36,237][81074] Avg episode reward: [(0, '1322.796')] [2023-03-06 23:57:36,793][81400] Updated weights for policy 0, policy_version 36800 (0.0006) [2023-03-06 23:57:37,573][81400] Updated weights for policy 0, policy_version 36810 (0.0006) [2023-03-06 23:57:38,346][81400] Updated weights for policy 0, policy_version 36820 (0.0006) [2023-03-06 23:57:39,126][81400] Updated weights for policy 0, policy_version 36830 (0.0007) [2023-03-06 23:57:39,915][81400] Updated weights for policy 0, policy_version 36840 (0.0007) [2023-03-06 23:57:40,671][81400] Updated weights for policy 0, policy_version 36850 (0.0006) [2023-03-06 23:57:41,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13226.7, 300 sec: 13239.1). Total num frames: 37741568. Throughput: 0: 13213.0. Samples: 37726709. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:57:41,237][81074] Avg episode reward: [(0, '1283.168')] [2023-03-06 23:57:41,452][81400] Updated weights for policy 0, policy_version 36860 (0.0006) [2023-03-06 23:57:42,233][81400] Updated weights for policy 0, policy_version 36870 (0.0006) [2023-03-06 23:57:43,017][81400] Updated weights for policy 0, policy_version 36880 (0.0007) [2023-03-06 23:57:43,803][81400] Updated weights for policy 0, policy_version 36890 (0.0006) [2023-03-06 23:57:44,577][81400] Updated weights for policy 0, policy_version 36900 (0.0007) [2023-03-06 23:57:45,350][81400] Updated weights for policy 0, policy_version 36910 (0.0006) [2023-03-06 23:57:46,130][81400] Updated weights for policy 0, policy_version 36920 (0.0006) [2023-03-06 23:57:46,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13209.6, 300 sec: 13235.6). Total num frames: 37807104. Throughput: 0: 13210.3. Samples: 37805866. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:57:46,237][81074] Avg episode reward: [(0, '1274.481')] [2023-03-06 23:57:46,902][81400] Updated weights for policy 0, policy_version 36930 (0.0005) [2023-03-06 23:57:47,658][81400] Updated weights for policy 0, policy_version 36940 (0.0006) [2023-03-06 23:57:48,434][81400] Updated weights for policy 0, policy_version 36950 (0.0007) [2023-03-06 23:57:49,192][81400] Updated weights for policy 0, policy_version 36960 (0.0006) [2023-03-06 23:57:49,965][81400] Updated weights for policy 0, policy_version 36970 (0.0007) [2023-03-06 23:57:50,740][81400] Updated weights for policy 0, policy_version 36980 (0.0006) [2023-03-06 23:57:51,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13226.6, 300 sec: 13239.1). Total num frames: 37873664. Throughput: 0: 13220.9. Samples: 37845887. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:57:51,237][81074] Avg episode reward: [(0, '1305.845')] [2023-03-06 23:57:51,510][81400] Updated weights for policy 0, policy_version 36990 (0.0006) [2023-03-06 23:57:52,278][81400] Updated weights for policy 0, policy_version 37000 (0.0006) [2023-03-06 23:57:53,039][81400] Updated weights for policy 0, policy_version 37010 (0.0006) [2023-03-06 23:57:53,823][81400] Updated weights for policy 0, policy_version 37020 (0.0007) [2023-03-06 23:57:54,581][81400] Updated weights for policy 0, policy_version 37030 (0.0006) [2023-03-06 23:57:55,355][81400] Updated weights for policy 0, policy_version 37040 (0.0006) [2023-03-06 23:57:56,142][81400] Updated weights for policy 0, policy_version 37050 (0.0006) [2023-03-06 23:57:56,236][81074] Fps is (10 sec: 13311.8, 60 sec: 13226.6, 300 sec: 13239.1). Total num frames: 37940224. Throughput: 0: 13236.4. Samples: 37925705. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:57:56,237][81074] Avg episode reward: [(0, '1325.174')] [2023-03-06 23:57:56,242][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000037051_37940224.pth... 
[2023-03-06 23:57:56,272][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000033949_34763776.pth [2023-03-06 23:57:56,922][81400] Updated weights for policy 0, policy_version 37060 (0.0006) [2023-03-06 23:57:57,686][81400] Updated weights for policy 0, policy_version 37070 (0.0006) [2023-03-06 23:57:58,461][81400] Updated weights for policy 0, policy_version 37080 (0.0006) [2023-03-06 23:57:59,237][81400] Updated weights for policy 0, policy_version 37090 (0.0006) [2023-03-06 23:58:00,000][81400] Updated weights for policy 0, policy_version 37100 (0.0006) [2023-03-06 23:58:00,791][81400] Updated weights for policy 0, policy_version 37110 (0.0006) [2023-03-06 23:58:01,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13235.6). Total num frames: 38005760. Throughput: 0: 13234.4. Samples: 38005016. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:58:01,237][81074] Avg episode reward: [(0, '1402.124')] [2023-03-06 23:58:01,549][81400] Updated weights for policy 0, policy_version 37120 (0.0006) [2023-03-06 23:58:02,316][81400] Updated weights for policy 0, policy_version 37130 (0.0006) [2023-03-06 23:58:03,103][81400] Updated weights for policy 0, policy_version 37140 (0.0006) [2023-03-06 23:58:03,887][81400] Updated weights for policy 0, policy_version 37150 (0.0006) [2023-03-06 23:58:04,670][81400] Updated weights for policy 0, policy_version 37160 (0.0006) [2023-03-06 23:58:05,453][81400] Updated weights for policy 0, policy_version 37170 (0.0006) [2023-03-06 23:58:06,228][81400] Updated weights for policy 0, policy_version 37180 (0.0006) [2023-03-06 23:58:06,236][81074] Fps is (10 sec: 13209.9, 60 sec: 13226.7, 300 sec: 13239.1). Total num frames: 38072320. Throughput: 0: 13229.5. Samples: 38044579. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:58:06,237][81074] Avg episode reward: [(0, '1360.291')] [2023-03-06 23:58:06,996][81400] Updated weights for policy 0, policy_version 37190 (0.0005) [2023-03-06 23:58:07,763][81400] Updated weights for policy 0, policy_version 37200 (0.0007) [2023-03-06 23:58:08,551][81400] Updated weights for policy 0, policy_version 37210 (0.0006) [2023-03-06 23:58:09,338][81400] Updated weights for policy 0, policy_version 37220 (0.0005) [2023-03-06 23:58:10,110][81400] Updated weights for policy 0, policy_version 37230 (0.0007) [2023-03-06 23:58:10,890][81400] Updated weights for policy 0, policy_version 37240 (0.0006) [2023-03-06 23:58:11,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13235.6). Total num frames: 38137856. Throughput: 0: 13218.9. Samples: 38123494. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:58:11,237][81074] Avg episode reward: [(0, '1360.215')] [2023-03-06 23:58:11,670][81400] Updated weights for policy 0, policy_version 37250 (0.0006) [2023-03-06 23:58:12,437][81400] Updated weights for policy 0, policy_version 37260 (0.0006) [2023-03-06 23:58:13,213][81400] Updated weights for policy 0, policy_version 37270 (0.0005) [2023-03-06 23:58:13,998][81400] Updated weights for policy 0, policy_version 37280 (0.0006) [2023-03-06 23:58:14,767][81400] Updated weights for policy 0, policy_version 37290 (0.0006) [2023-03-06 23:58:15,537][81400] Updated weights for policy 0, policy_version 37300 (0.0006) [2023-03-06 23:58:16,236][81074] Fps is (10 sec: 13107.0, 60 sec: 13209.6, 300 sec: 13232.2). Total num frames: 38203392. Throughput: 0: 13209.7. Samples: 38202585. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:58:16,237][81074] Avg episode reward: [(0, '1298.447')] [2023-03-06 23:58:16,310][81400] Updated weights for policy 0, policy_version 37310 (0.0006) [2023-03-06 23:58:17,086][81400] Updated weights for policy 0, policy_version 37320 (0.0006) [2023-03-06 23:58:17,859][81400] Updated weights for policy 0, policy_version 37330 (0.0006) [2023-03-06 23:58:18,645][81400] Updated weights for policy 0, policy_version 37340 (0.0006) [2023-03-06 23:58:19,417][81400] Updated weights for policy 0, policy_version 37350 (0.0006) [2023-03-06 23:58:20,201][81400] Updated weights for policy 0, policy_version 37360 (0.0006) [2023-03-06 23:58:20,965][81400] Updated weights for policy 0, policy_version 37370 (0.0006) [2023-03-06 23:58:21,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13226.6, 300 sec: 13232.2). Total num frames: 38269952. Throughput: 0: 13213.6. Samples: 38242219. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:58:21,237][81074] Avg episode reward: [(0, '1324.784')] [2023-03-06 23:58:21,731][81400] Updated weights for policy 0, policy_version 37380 (0.0006) [2023-03-06 23:58:22,506][81400] Updated weights for policy 0, policy_version 37390 (0.0006) [2023-03-06 23:58:23,291][81400] Updated weights for policy 0, policy_version 37400 (0.0006) [2023-03-06 23:58:24,076][81400] Updated weights for policy 0, policy_version 37410 (0.0006) [2023-03-06 23:58:24,835][81400] Updated weights for policy 0, policy_version 37420 (0.0007) [2023-03-06 23:58:25,624][81400] Updated weights for policy 0, policy_version 37430 (0.0006) [2023-03-06 23:58:26,236][81074] Fps is (10 sec: 13312.1, 60 sec: 13226.7, 300 sec: 13232.2). Total num frames: 38336512. Throughput: 0: 13220.0. Samples: 38321611. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:58:26,237][81074] Avg episode reward: [(0, '1432.362')] [2023-03-06 23:58:26,387][81400] Updated weights for policy 0, policy_version 37440 (0.0007) [2023-03-06 23:58:27,167][81400] Updated weights for policy 0, policy_version 37450 (0.0006) [2023-03-06 23:58:27,925][81400] Updated weights for policy 0, policy_version 37460 (0.0006) [2023-03-06 23:58:28,694][81400] Updated weights for policy 0, policy_version 37470 (0.0006) [2023-03-06 23:58:29,471][81400] Updated weights for policy 0, policy_version 37480 (0.0006) [2023-03-06 23:58:30,246][81400] Updated weights for policy 0, policy_version 37490 (0.0006) [2023-03-06 23:58:31,026][81400] Updated weights for policy 0, policy_version 37500 (0.0006) [2023-03-06 23:58:31,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13226.6, 300 sec: 13235.6). Total num frames: 38403072. Throughput: 0: 13231.9. Samples: 38401301. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:58:31,237][81074] Avg episode reward: [(0, '1356.193')] [2023-03-06 23:58:31,789][81400] Updated weights for policy 0, policy_version 37510 (0.0007) [2023-03-06 23:58:32,564][81400] Updated weights for policy 0, policy_version 37520 (0.0006) [2023-03-06 23:58:33,336][81400] Updated weights for policy 0, policy_version 37530 (0.0007) [2023-03-06 23:58:34,117][81400] Updated weights for policy 0, policy_version 37540 (0.0007) [2023-03-06 23:58:34,888][81400] Updated weights for policy 0, policy_version 37550 (0.0007) [2023-03-06 23:58:35,646][81400] Updated weights for policy 0, policy_version 37560 (0.0006) [2023-03-06 23:58:36,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13232.2). Total num frames: 38468608. Throughput: 0: 13224.8. Samples: 38441002. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:58:36,237][81074] Avg episode reward: [(0, '1547.130')] [2023-03-06 23:58:36,425][81400] Updated weights for policy 0, policy_version 37570 (0.0007) [2023-03-06 23:58:37,187][81400] Updated weights for policy 0, policy_version 37580 (0.0006) [2023-03-06 23:58:37,968][81400] Updated weights for policy 0, policy_version 37590 (0.0006) [2023-03-06 23:58:38,748][81400] Updated weights for policy 0, policy_version 37600 (0.0006) [2023-03-06 23:58:39,529][81400] Updated weights for policy 0, policy_version 37610 (0.0006) [2023-03-06 23:58:40,297][81400] Updated weights for policy 0, policy_version 37620 (0.0007) [2023-03-06 23:58:41,074][81400] Updated weights for policy 0, policy_version 37630 (0.0007) [2023-03-06 23:58:41,237][81074] Fps is (10 sec: 13209.4, 60 sec: 13226.6, 300 sec: 13232.2). Total num frames: 38535168. Throughput: 0: 13213.9. Samples: 38520330. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:58:41,237][81074] Avg episode reward: [(0, '1476.023')] [2023-03-06 23:58:41,845][81400] Updated weights for policy 0, policy_version 37640 (0.0006) [2023-03-06 23:58:42,624][81400] Updated weights for policy 0, policy_version 37650 (0.0006) [2023-03-06 23:58:43,397][81400] Updated weights for policy 0, policy_version 37660 (0.0006) [2023-03-06 23:58:44,177][81400] Updated weights for policy 0, policy_version 37670 (0.0006) [2023-03-06 23:58:44,944][81400] Updated weights for policy 0, policy_version 37680 (0.0006) [2023-03-06 23:58:45,717][81400] Updated weights for policy 0, policy_version 37690 (0.0006) [2023-03-06 23:58:46,236][81074] Fps is (10 sec: 13209.3, 60 sec: 13226.6, 300 sec: 13232.2). Total num frames: 38600704. Throughput: 0: 13214.4. Samples: 38599667. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:58:46,237][81074] Avg episode reward: [(0, '1326.009')] [2023-03-06 23:58:46,501][81400] Updated weights for policy 0, policy_version 37700 (0.0006) [2023-03-06 23:58:47,272][81400] Updated weights for policy 0, policy_version 37710 (0.0007) [2023-03-06 23:58:48,049][81400] Updated weights for policy 0, policy_version 37720 (0.0006) [2023-03-06 23:58:48,851][81400] Updated weights for policy 0, policy_version 37730 (0.0006) [2023-03-06 23:58:49,605][81400] Updated weights for policy 0, policy_version 37740 (0.0006) [2023-03-06 23:58:50,403][81400] Updated weights for policy 0, policy_version 37750 (0.0006) [2023-03-06 23:58:51,189][81400] Updated weights for policy 0, policy_version 37760 (0.0006) [2023-03-06 23:58:51,236][81074] Fps is (10 sec: 13107.6, 60 sec: 13209.6, 300 sec: 13228.7). Total num frames: 38666240. Throughput: 0: 13209.8. Samples: 38639021. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:58:51,237][81074] Avg episode reward: [(0, '1415.630')] [2023-03-06 23:58:51,947][81400] Updated weights for policy 0, policy_version 37770 (0.0006) [2023-03-06 23:58:52,720][81400] Updated weights for policy 0, policy_version 37780 (0.0005) [2023-03-06 23:58:53,497][81400] Updated weights for policy 0, policy_version 37790 (0.0007) [2023-03-06 23:58:54,269][81400] Updated weights for policy 0, policy_version 37800 (0.0007) [2023-03-06 23:58:55,027][81400] Updated weights for policy 0, policy_version 37810 (0.0005) [2023-03-06 23:58:55,788][81400] Updated weights for policy 0, policy_version 37820 (0.0006) [2023-03-06 23:58:56,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13209.6, 300 sec: 13228.7). Total num frames: 38732800. Throughput: 0: 13219.1. Samples: 38718355. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:58:56,237][81074] Avg episode reward: [(0, '1430.091')] [2023-03-06 23:58:56,587][81400] Updated weights for policy 0, policy_version 37830 (0.0006) [2023-03-06 23:58:57,352][81400] Updated weights for policy 0, policy_version 37840 (0.0005) [2023-03-06 23:58:58,135][81400] Updated weights for policy 0, policy_version 37850 (0.0006) [2023-03-06 23:58:58,910][81400] Updated weights for policy 0, policy_version 37860 (0.0005) [2023-03-06 23:58:59,682][81400] Updated weights for policy 0, policy_version 37870 (0.0006) [2023-03-06 23:59:00,458][81400] Updated weights for policy 0, policy_version 37880 (0.0006) [2023-03-06 23:59:01,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13209.6, 300 sec: 13228.7). Total num frames: 38798336. Throughput: 0: 13222.7. Samples: 38797607. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:59:01,237][81074] Avg episode reward: [(0, '1425.575')] [2023-03-06 23:59:01,249][81400] Updated weights for policy 0, policy_version 37890 (0.0006) [2023-03-06 23:59:02,018][81400] Updated weights for policy 0, policy_version 37900 (0.0007) [2023-03-06 23:59:02,777][81400] Updated weights for policy 0, policy_version 37910 (0.0006) [2023-03-06 23:59:03,562][81400] Updated weights for policy 0, policy_version 37920 (0.0006) [2023-03-06 23:59:04,334][81400] Updated weights for policy 0, policy_version 37930 (0.0006) [2023-03-06 23:59:05,105][81400] Updated weights for policy 0, policy_version 37940 (0.0006) [2023-03-06 23:59:05,884][81400] Updated weights for policy 0, policy_version 37950 (0.0006) [2023-03-06 23:59:06,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13209.5, 300 sec: 13225.2). Total num frames: 38864896. Throughput: 0: 13224.4. Samples: 38837318. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:59:06,237][81074] Avg episode reward: [(0, '1430.561')] [2023-03-06 23:59:06,659][81400] Updated weights for policy 0, policy_version 37960 (0.0005) [2023-03-06 23:59:07,438][81400] Updated weights for policy 0, policy_version 37970 (0.0005) [2023-03-06 23:59:08,197][81400] Updated weights for policy 0, policy_version 37980 (0.0006) [2023-03-06 23:59:08,970][81400] Updated weights for policy 0, policy_version 37990 (0.0006) [2023-03-06 23:59:09,729][81400] Updated weights for policy 0, policy_version 38000 (0.0006) [2023-03-06 23:59:10,514][81400] Updated weights for policy 0, policy_version 38010 (0.0006) [2023-03-06 23:59:11,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13226.7, 300 sec: 13228.7). Total num frames: 38931456. Throughput: 0: 13228.8. Samples: 38916908. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:59:11,237][81074] Avg episode reward: [(0, '1391.131')] [2023-03-06 23:59:11,290][81400] Updated weights for policy 0, policy_version 38020 (0.0006) [2023-03-06 23:59:12,052][81400] Updated weights for policy 0, policy_version 38030 (0.0006) [2023-03-06 23:59:12,844][81400] Updated weights for policy 0, policy_version 38040 (0.0006) [2023-03-06 23:59:13,611][81400] Updated weights for policy 0, policy_version 38050 (0.0006) [2023-03-06 23:59:14,394][81400] Updated weights for policy 0, policy_version 38060 (0.0006) [2023-03-06 23:59:15,171][81400] Updated weights for policy 0, policy_version 38070 (0.0005) [2023-03-06 23:59:15,940][81400] Updated weights for policy 0, policy_version 38080 (0.0006) [2023-03-06 23:59:16,236][81074] Fps is (10 sec: 13209.8, 60 sec: 13226.7, 300 sec: 13225.2). Total num frames: 38996992. Throughput: 0: 13219.8. Samples: 38996190. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:59:16,237][81074] Avg episode reward: [(0, '1394.112')] [2023-03-06 23:59:16,703][81400] Updated weights for policy 0, policy_version 38090 (0.0007) [2023-03-06 23:59:17,491][81400] Updated weights for policy 0, policy_version 38100 (0.0006) [2023-03-06 23:59:18,267][81400] Updated weights for policy 0, policy_version 38110 (0.0006) [2023-03-06 23:59:19,049][81400] Updated weights for policy 0, policy_version 38120 (0.0006) [2023-03-06 23:59:19,826][81400] Updated weights for policy 0, policy_version 38130 (0.0006) [2023-03-06 23:59:20,612][81400] Updated weights for policy 0, policy_version 38140 (0.0005) [2023-03-06 23:59:21,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13225.2). Total num frames: 39063552. Throughput: 0: 13213.9. Samples: 39035628. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:59:21,237][81074] Avg episode reward: [(0, '1395.964')] [2023-03-06 23:59:21,389][81400] Updated weights for policy 0, policy_version 38150 (0.0007) [2023-03-06 23:59:22,162][81400] Updated weights for policy 0, policy_version 38160 (0.0007) [2023-03-06 23:59:22,939][81400] Updated weights for policy 0, policy_version 38170 (0.0006) [2023-03-06 23:59:23,703][81400] Updated weights for policy 0, policy_version 38180 (0.0006) [2023-03-06 23:59:24,489][81400] Updated weights for policy 0, policy_version 38190 (0.0006) [2023-03-06 23:59:25,258][81400] Updated weights for policy 0, policy_version 38200 (0.0006) [2023-03-06 23:59:26,037][81400] Updated weights for policy 0, policy_version 38210 (0.0006) [2023-03-06 23:59:26,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13209.6, 300 sec: 13221.7). Total num frames: 39129088. Throughput: 0: 13211.5. Samples: 39114843. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:59:26,237][81074] Avg episode reward: [(0, '1448.362')] [2023-03-06 23:59:26,807][81400] Updated weights for policy 0, policy_version 38220 (0.0006) [2023-03-06 23:59:27,602][81400] Updated weights for policy 0, policy_version 38230 (0.0007) [2023-03-06 23:59:28,365][81400] Updated weights for policy 0, policy_version 38240 (0.0006) [2023-03-06 23:59:29,159][81400] Updated weights for policy 0, policy_version 38250 (0.0006) [2023-03-06 23:59:29,923][81400] Updated weights for policy 0, policy_version 38260 (0.0006) [2023-03-06 23:59:30,712][81400] Updated weights for policy 0, policy_version 38270 (0.0006) [2023-03-06 23:59:31,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13192.5, 300 sec: 13218.3). Total num frames: 39194624. Throughput: 0: 13202.6. Samples: 39193782. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:59:31,237][81074] Avg episode reward: [(0, '1338.735')] [2023-03-06 23:59:31,486][81400] Updated weights for policy 0, policy_version 38280 (0.0006) [2023-03-06 23:59:32,255][81400] Updated weights for policy 0, policy_version 38290 (0.0006) [2023-03-06 23:59:32,631][81349] KL-divergence is very high: 168.6589 [2023-03-06 23:59:33,008][81400] Updated weights for policy 0, policy_version 38300 (0.0006) [2023-03-06 23:59:33,798][81400] Updated weights for policy 0, policy_version 38310 (0.0007) [2023-03-06 23:59:34,578][81400] Updated weights for policy 0, policy_version 38320 (0.0006) [2023-03-06 23:59:35,336][81400] Updated weights for policy 0, policy_version 38330 (0.0006) [2023-03-06 23:59:36,102][81400] Updated weights for policy 0, policy_version 38340 (0.0006) [2023-03-06 23:59:36,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13209.6, 300 sec: 13221.7). Total num frames: 39261184. Throughput: 0: 13214.3. Samples: 39233666. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:59:36,237][81074] Avg episode reward: [(0, '1341.314')] [2023-03-06 23:59:36,889][81400] Updated weights for policy 0, policy_version 38350 (0.0008) [2023-03-06 23:59:37,653][81400] Updated weights for policy 0, policy_version 38360 (0.0006) [2023-03-06 23:59:38,433][81400] Updated weights for policy 0, policy_version 38370 (0.0006) [2023-03-06 23:59:39,211][81400] Updated weights for policy 0, policy_version 38380 (0.0006) [2023-03-06 23:59:39,992][81400] Updated weights for policy 0, policy_version 38390 (0.0005) [2023-03-06 23:59:40,751][81400] Updated weights for policy 0, policy_version 38400 (0.0007) [2023-03-06 23:59:41,236][81074] Fps is (10 sec: 13312.1, 60 sec: 13209.7, 300 sec: 13225.2). Total num frames: 39327744. Throughput: 0: 13213.5. Samples: 39312959. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:59:41,237][81074] Avg episode reward: [(0, '1309.617')] [2023-03-06 23:59:41,531][81400] Updated weights for policy 0, policy_version 38410 (0.0007) [2023-03-06 23:59:42,309][81400] Updated weights for policy 0, policy_version 38420 (0.0006) [2023-03-06 23:59:43,074][81400] Updated weights for policy 0, policy_version 38430 (0.0006) [2023-03-06 23:59:43,846][81400] Updated weights for policy 0, policy_version 38440 (0.0005) [2023-03-06 23:59:44,632][81400] Updated weights for policy 0, policy_version 38450 (0.0006) [2023-03-06 23:59:45,393][81400] Updated weights for policy 0, policy_version 38460 (0.0006) [2023-03-06 23:59:46,180][81400] Updated weights for policy 0, policy_version 38470 (0.0006) [2023-03-06 23:59:46,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13209.7, 300 sec: 13221.8). Total num frames: 39393280. Throughput: 0: 13217.3. Samples: 39392387. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:59:46,237][81074] Avg episode reward: [(0, '1345.024')] [2023-03-06 23:59:46,963][81400] Updated weights for policy 0, policy_version 38480 (0.0006) [2023-03-06 23:59:47,725][81400] Updated weights for policy 0, policy_version 38490 (0.0006) [2023-03-06 23:59:48,493][81400] Updated weights for policy 0, policy_version 38500 (0.0006) [2023-03-06 23:59:49,273][81400] Updated weights for policy 0, policy_version 38510 (0.0006) [2023-03-06 23:59:50,039][81400] Updated weights for policy 0, policy_version 38520 (0.0006) [2023-03-06 23:59:50,809][81400] Updated weights for policy 0, policy_version 38530 (0.0006) [2023-03-06 23:59:51,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13225.2). Total num frames: 39459840. Throughput: 0: 13218.8. Samples: 39432162. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-06 23:59:51,237][81074] Avg episode reward: [(0, '1382.023')] [2023-03-06 23:59:51,576][81400] Updated weights for policy 0, policy_version 38540 (0.0007) [2023-03-06 23:59:52,346][81400] Updated weights for policy 0, policy_version 38550 (0.0007) [2023-03-06 23:59:53,142][81400] Updated weights for policy 0, policy_version 38560 (0.0006) [2023-03-06 23:59:53,910][81400] Updated weights for policy 0, policy_version 38570 (0.0007) [2023-03-06 23:59:54,683][81400] Updated weights for policy 0, policy_version 38580 (0.0006) [2023-03-06 23:59:55,474][81400] Updated weights for policy 0, policy_version 38590 (0.0007) [2023-03-06 23:59:56,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13209.6, 300 sec: 13221.7). Total num frames: 39525376. Throughput: 0: 13211.4. Samples: 39511424. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-06 23:59:56,237][81074] Avg episode reward: [(0, '1446.919')] [2023-03-06 23:59:56,238][81400] Updated weights for policy 0, policy_version 38600 (0.0006) [2023-03-06 23:59:56,241][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000038600_39526400.pth... [2023-03-06 23:59:56,270][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000035501_36353024.pth [2023-03-06 23:59:57,029][81400] Updated weights for policy 0, policy_version 38610 (0.0007) [2023-03-06 23:59:57,824][81400] Updated weights for policy 0, policy_version 38620 (0.0006) [2023-03-06 23:59:58,594][81400] Updated weights for policy 0, policy_version 38630 (0.0006) [2023-03-06 23:59:59,361][81400] Updated weights for policy 0, policy_version 38640 (0.0006) [2023-03-07 00:00:00,147][81400] Updated weights for policy 0, policy_version 38650 (0.0005) [2023-03-07 00:00:00,916][81400] Updated weights for policy 0, policy_version 38660 (0.0006) [2023-03-07 00:00:01,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13226.6, 300 sec: 13221.7). Total num frames: 39591936. Throughput: 0: 13204.4. Samples: 39590388. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:00:01,237][81074] Avg episode reward: [(0, '1445.534')] [2023-03-07 00:00:01,675][81400] Updated weights for policy 0, policy_version 38670 (0.0006) [2023-03-07 00:00:02,453][81400] Updated weights for policy 0, policy_version 38680 (0.0005) [2023-03-07 00:00:03,236][81400] Updated weights for policy 0, policy_version 38690 (0.0006) [2023-03-07 00:00:04,013][81400] Updated weights for policy 0, policy_version 38700 (0.0006) [2023-03-07 00:00:04,797][81400] Updated weights for policy 0, policy_version 38710 (0.0006) [2023-03-07 00:00:05,550][81400] Updated weights for policy 0, policy_version 38720 (0.0006) [2023-03-07 00:00:06,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13209.6, 300 sec: 13218.3). Total num frames: 39657472. Throughput: 0: 13209.3. Samples: 39630047. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:00:06,237][81074] Avg episode reward: [(0, '1379.838')] [2023-03-07 00:00:06,338][81400] Updated weights for policy 0, policy_version 38730 (0.0006) [2023-03-07 00:00:07,099][81400] Updated weights for policy 0, policy_version 38740 (0.0006) [2023-03-07 00:00:07,884][81400] Updated weights for policy 0, policy_version 38750 (0.0006) [2023-03-07 00:00:08,649][81400] Updated weights for policy 0, policy_version 38760 (0.0006) [2023-03-07 00:00:09,414][81400] Updated weights for policy 0, policy_version 38770 (0.0005) [2023-03-07 00:00:10,190][81400] Updated weights for policy 0, policy_version 38780 (0.0006) [2023-03-07 00:00:10,957][81400] Updated weights for policy 0, policy_version 38790 (0.0005) [2023-03-07 00:00:11,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13209.6, 300 sec: 13218.3). Total num frames: 39724032. Throughput: 0: 13217.8. Samples: 39709644. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:00:11,237][81074] Avg episode reward: [(0, '1353.992')] [2023-03-07 00:00:11,722][81400] Updated weights for policy 0, policy_version 38800 (0.0007) [2023-03-07 00:00:12,500][81400] Updated weights for policy 0, policy_version 38810 (0.0006) [2023-03-07 00:00:13,289][81400] Updated weights for policy 0, policy_version 38820 (0.0006) [2023-03-07 00:00:13,659][81349] KL-divergence is very high: 100.8707 [2023-03-07 00:00:14,064][81400] Updated weights for policy 0, policy_version 38830 (0.0006) [2023-03-07 00:00:14,843][81400] Updated weights for policy 0, policy_version 38840 (0.0006) [2023-03-07 00:00:15,634][81400] Updated weights for policy 0, policy_version 38850 (0.0007) [2023-03-07 00:00:16,236][81074] Fps is (10 sec: 13312.1, 60 sec: 13226.7, 300 sec: 13221.8). Total num frames: 39790592. Throughput: 0: 13222.3. Samples: 39788785. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:00:16,237][81074] Avg episode reward: [(0, '1508.449')] [2023-03-07 00:00:16,393][81400] Updated weights for policy 0, policy_version 38860 (0.0006) [2023-03-07 00:00:17,170][81400] Updated weights for policy 0, policy_version 38870 (0.0006) [2023-03-07 00:00:17,955][81400] Updated weights for policy 0, policy_version 38880 (0.0006) [2023-03-07 00:00:18,720][81400] Updated weights for policy 0, policy_version 38890 (0.0007) [2023-03-07 00:00:19,507][81400] Updated weights for policy 0, policy_version 38900 (0.0005) [2023-03-07 00:00:20,282][81400] Updated weights for policy 0, policy_version 38910 (0.0006) [2023-03-07 00:00:21,062][81400] Updated weights for policy 0, policy_version 38920 (0.0007) [2023-03-07 00:00:21,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13209.6, 300 sec: 13218.3). Total num frames: 39856128. Throughput: 0: 13216.6. Samples: 39828411. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:00:21,237][81074] Avg episode reward: [(0, '1502.698')] [2023-03-07 00:00:21,815][81400] Updated weights for policy 0, policy_version 38930 (0.0007) [2023-03-07 00:00:22,587][81400] Updated weights for policy 0, policy_version 38940 (0.0007) [2023-03-07 00:00:23,359][81400] Updated weights for policy 0, policy_version 38950 (0.0006) [2023-03-07 00:00:24,147][81400] Updated weights for policy 0, policy_version 38960 (0.0006) [2023-03-07 00:00:24,917][81400] Updated weights for policy 0, policy_version 38970 (0.0006) [2023-03-07 00:00:25,694][81400] Updated weights for policy 0, policy_version 38980 (0.0006) [2023-03-07 00:00:26,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13209.6, 300 sec: 13214.8). Total num frames: 39921664. Throughput: 0: 13218.6. Samples: 39907797. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:00:26,237][81074] Avg episode reward: [(0, '1504.585')] [2023-03-07 00:00:26,458][81400] Updated weights for policy 0, policy_version 38990 (0.0006) [2023-03-07 00:00:27,217][81400] Updated weights for policy 0, policy_version 39000 (0.0006) [2023-03-07 00:00:28,021][81400] Updated weights for policy 0, policy_version 39010 (0.0006) [2023-03-07 00:00:28,776][81400] Updated weights for policy 0, policy_version 39020 (0.0005) [2023-03-07 00:00:29,544][81400] Updated weights for policy 0, policy_version 39030 (0.0005) [2023-03-07 00:00:30,337][81400] Updated weights for policy 0, policy_version 39040 (0.0006) [2023-03-07 00:00:31,105][81400] Updated weights for policy 0, policy_version 39050 (0.0006) [2023-03-07 00:00:31,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13214.8). Total num frames: 39988224. Throughput: 0: 13218.2. Samples: 39987205. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:00:31,237][81074] Avg episode reward: [(0, '1495.224')] [2023-03-07 00:00:31,888][81400] Updated weights for policy 0, policy_version 39060 (0.0006) [2023-03-07 00:00:32,666][81400] Updated weights for policy 0, policy_version 39070 (0.0006) [2023-03-07 00:00:33,436][81400] Updated weights for policy 0, policy_version 39080 (0.0006) [2023-03-07 00:00:34,208][81400] Updated weights for policy 0, policy_version 39090 (0.0006) [2023-03-07 00:00:34,989][81400] Updated weights for policy 0, policy_version 39100 (0.0008) [2023-03-07 00:00:35,781][81400] Updated weights for policy 0, policy_version 39110 (0.0006) [2023-03-07 00:00:36,236][81074] Fps is (10 sec: 13311.9, 60 sec: 13226.7, 300 sec: 13218.3). Total num frames: 40054784. Throughput: 0: 13212.6. Samples: 40026730. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:00:36,237][81074] Avg episode reward: [(0, '1446.526')] [2023-03-07 00:00:36,539][81400] Updated weights for policy 0, policy_version 39120 (0.0006) [2023-03-07 00:00:37,325][81400] Updated weights for policy 0, policy_version 39130 (0.0007) [2023-03-07 00:00:38,106][81400] Updated weights for policy 0, policy_version 39140 (0.0006) [2023-03-07 00:00:38,865][81400] Updated weights for policy 0, policy_version 39150 (0.0006) [2023-03-07 00:00:39,645][81400] Updated weights for policy 0, policy_version 39160 (0.0006) [2023-03-07 00:00:40,425][81400] Updated weights for policy 0, policy_version 39170 (0.0006) [2023-03-07 00:00:41,201][81400] Updated weights for policy 0, policy_version 39180 (0.0006) [2023-03-07 00:00:41,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13209.6, 300 sec: 13214.8). Total num frames: 40120320. Throughput: 0: 13210.6. Samples: 40105900. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:00:41,237][81074] Avg episode reward: [(0, '1565.111')] [2023-03-07 00:00:41,970][81400] Updated weights for policy 0, policy_version 39190 (0.0006) [2023-03-07 00:00:42,753][81400] Updated weights for policy 0, policy_version 39200 (0.0006) [2023-03-07 00:00:43,523][81400] Updated weights for policy 0, policy_version 39210 (0.0007) [2023-03-07 00:00:44,294][81400] Updated weights for policy 0, policy_version 39220 (0.0006) [2023-03-07 00:00:45,061][81400] Updated weights for policy 0, policy_version 39230 (0.0006) [2023-03-07 00:00:45,840][81400] Updated weights for policy 0, policy_version 39240 (0.0006) [2023-03-07 00:00:46,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13226.6, 300 sec: 13218.3). Total num frames: 40186880. Throughput: 0: 13221.2. Samples: 40185344. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:00:46,237][81074] Avg episode reward: [(0, '1612.362')] [2023-03-07 00:00:46,601][81400] Updated weights for policy 0, policy_version 39250 (0.0006) [2023-03-07 00:00:47,369][81400] Updated weights for policy 0, policy_version 39260 (0.0006) [2023-03-07 00:00:48,152][81400] Updated weights for policy 0, policy_version 39270 (0.0006) [2023-03-07 00:00:48,919][81400] Updated weights for policy 0, policy_version 39280 (0.0006) [2023-03-07 00:00:49,698][81400] Updated weights for policy 0, policy_version 39290 (0.0006) [2023-03-07 00:00:50,482][81400] Updated weights for policy 0, policy_version 39300 (0.0005) [2023-03-07 00:00:51,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13209.6, 300 sec: 13218.3). Total num frames: 40252416. Throughput: 0: 13226.8. Samples: 40225250. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:00:51,237][81074] Avg episode reward: [(0, '1474.232')] [2023-03-07 00:00:51,262][81400] Updated weights for policy 0, policy_version 39310 (0.0006) [2023-03-07 00:00:52,021][81400] Updated weights for policy 0, policy_version 39320 (0.0006) [2023-03-07 00:00:52,792][81400] Updated weights for policy 0, policy_version 39330 (0.0006) [2023-03-07 00:00:53,549][81400] Updated weights for policy 0, policy_version 39340 (0.0006) [2023-03-07 00:00:54,329][81400] Updated weights for policy 0, policy_version 39350 (0.0006) [2023-03-07 00:00:55,099][81400] Updated weights for policy 0, policy_version 39360 (0.0007) [2023-03-07 00:00:55,873][81400] Updated weights for policy 0, policy_version 39370 (0.0006) [2023-03-07 00:00:56,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13226.6, 300 sec: 13218.3). Total num frames: 40318976. Throughput: 0: 13225.7. Samples: 40304801. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:00:56,237][81074] Avg episode reward: [(0, '1481.872')] [2023-03-07 00:00:56,630][81400] Updated weights for policy 0, policy_version 39380 (0.0005) [2023-03-07 00:00:57,417][81400] Updated weights for policy 0, policy_version 39390 (0.0006) [2023-03-07 00:00:58,186][81400] Updated weights for policy 0, policy_version 39400 (0.0006) [2023-03-07 00:00:58,950][81400] Updated weights for policy 0, policy_version 39410 (0.0007) [2023-03-07 00:00:59,717][81400] Updated weights for policy 0, policy_version 39420 (0.0005) [2023-03-07 00:01:00,493][81400] Updated weights for policy 0, policy_version 39430 (0.0007) [2023-03-07 00:01:01,236][81074] Fps is (10 sec: 13311.9, 60 sec: 13226.7, 300 sec: 13221.7). Total num frames: 40385536. Throughput: 0: 13241.5. Samples: 40384653. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:01:01,237][81074] Avg episode reward: [(0, '1284.878')] [2023-03-07 00:01:01,245][81400] Updated weights for policy 0, policy_version 39440 (0.0006) [2023-03-07 00:01:02,029][81400] Updated weights for policy 0, policy_version 39450 (0.0006) [2023-03-07 00:01:02,810][81400] Updated weights for policy 0, policy_version 39460 (0.0006) [2023-03-07 00:01:03,600][81400] Updated weights for policy 0, policy_version 39470 (0.0006) [2023-03-07 00:01:04,362][81400] Updated weights for policy 0, policy_version 39480 (0.0006) [2023-03-07 00:01:05,128][81400] Updated weights for policy 0, policy_version 39490 (0.0006) [2023-03-07 00:01:05,902][81400] Updated weights for policy 0, policy_version 39500 (0.0006) [2023-03-07 00:01:06,236][81074] Fps is (10 sec: 13312.1, 60 sec: 13243.7, 300 sec: 13221.7). Total num frames: 40452096. Throughput: 0: 13241.9. Samples: 40424296. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:01:06,237][81074] Avg episode reward: [(0, '1317.563')] [2023-03-07 00:01:06,668][81400] Updated weights for policy 0, policy_version 39510 (0.0005) [2023-03-07 00:01:07,442][81400] Updated weights for policy 0, policy_version 39520 (0.0007) [2023-03-07 00:01:08,223][81400] Updated weights for policy 0, policy_version 39530 (0.0006) [2023-03-07 00:01:08,991][81400] Updated weights for policy 0, policy_version 39540 (0.0005) [2023-03-07 00:01:09,761][81400] Updated weights for policy 0, policy_version 39550 (0.0006) [2023-03-07 00:01:10,532][81400] Updated weights for policy 0, policy_version 39560 (0.0006) [2023-03-07 00:01:11,236][81074] Fps is (10 sec: 13312.1, 60 sec: 13243.7, 300 sec: 13221.8). Total num frames: 40518656. Throughput: 0: 13241.3. Samples: 40503655. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:01:11,237][81074] Avg episode reward: [(0, '1260.136')] [2023-03-07 00:01:11,290][81400] Updated weights for policy 0, policy_version 39570 (0.0006) [2023-03-07 00:01:12,067][81400] Updated weights for policy 0, policy_version 39580 (0.0006) [2023-03-07 00:01:12,845][81400] Updated weights for policy 0, policy_version 39590 (0.0005) [2023-03-07 00:01:13,156][81349] KL-divergence is very high: 114.8920 [2023-03-07 00:01:13,626][81400] Updated weights for policy 0, policy_version 39600 (0.0006) [2023-03-07 00:01:14,405][81400] Updated weights for policy 0, policy_version 39610 (0.0006) [2023-03-07 00:01:14,632][81349] KL-divergence is very high: 103.6620 [2023-03-07 00:01:15,197][81400] Updated weights for policy 0, policy_version 39620 (0.0006) [2023-03-07 00:01:15,954][81400] Updated weights for policy 0, policy_version 39630 (0.0005) [2023-03-07 00:01:16,236][81074] Fps is (10 sec: 13209.9, 60 sec: 13226.7, 300 sec: 13221.8). Total num frames: 40584192. Throughput: 0: 13241.7. Samples: 40583082. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:01:16,237][81074] Avg episode reward: [(0, '1244.010')] [2023-03-07 00:01:16,729][81400] Updated weights for policy 0, policy_version 39640 (0.0006) [2023-03-07 00:01:17,505][81400] Updated weights for policy 0, policy_version 39650 (0.0006) [2023-03-07 00:01:18,275][81400] Updated weights for policy 0, policy_version 39660 (0.0006) [2023-03-07 00:01:19,049][81400] Updated weights for policy 0, policy_version 39670 (0.0006) [2023-03-07 00:01:19,828][81400] Updated weights for policy 0, policy_version 39680 (0.0006) [2023-03-07 00:01:20,599][81400] Updated weights for policy 0, policy_version 39690 (0.0005) [2023-03-07 00:01:21,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13243.8, 300 sec: 13221.8). Total num frames: 40650752. Throughput: 0: 13243.5. Samples: 40622688. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:01:21,237][81074] Avg episode reward: [(0, '1236.259')] [2023-03-07 00:01:21,360][81400] Updated weights for policy 0, policy_version 39700 (0.0006) [2023-03-07 00:01:22,125][81400] Updated weights for policy 0, policy_version 39710 (0.0005) [2023-03-07 00:01:22,898][81400] Updated weights for policy 0, policy_version 39720 (0.0006) [2023-03-07 00:01:23,668][81400] Updated weights for policy 0, policy_version 39730 (0.0005) [2023-03-07 00:01:24,433][81400] Updated weights for policy 0, policy_version 39740 (0.0005) [2023-03-07 00:01:25,211][81400] Updated weights for policy 0, policy_version 39750 (0.0006) [2023-03-07 00:01:25,976][81400] Updated weights for policy 0, policy_version 39760 (0.0007) [2023-03-07 00:01:26,236][81074] Fps is (10 sec: 13311.9, 60 sec: 13260.8, 300 sec: 13225.2). Total num frames: 40717312. Throughput: 0: 13263.0. Samples: 40702737. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:01:26,237][81074] Avg episode reward: [(0, '1255.714')] [2023-03-07 00:01:26,749][81400] Updated weights for policy 0, policy_version 39770 (0.0006) [2023-03-07 00:01:27,526][81400] Updated weights for policy 0, policy_version 39780 (0.0005) [2023-03-07 00:01:28,283][81400] Updated weights for policy 0, policy_version 39790 (0.0006) [2023-03-07 00:01:29,051][81400] Updated weights for policy 0, policy_version 39800 (0.0006) [2023-03-07 00:01:29,821][81400] Updated weights for policy 0, policy_version 39810 (0.0006) [2023-03-07 00:01:30,597][81400] Updated weights for policy 0, policy_version 39820 (0.0005) [2023-03-07 00:01:31,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13228.7). Total num frames: 40783872. Throughput: 0: 13268.2. Samples: 40782411. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:01:31,237][81074] Avg episode reward: [(0, '1306.470')] [2023-03-07 00:01:31,374][81400] Updated weights for policy 0, policy_version 39830 (0.0006) [2023-03-07 00:01:32,146][81400] Updated weights for policy 0, policy_version 39840 (0.0006) [2023-03-07 00:01:32,927][81400] Updated weights for policy 0, policy_version 39850 (0.0006) [2023-03-07 00:01:33,714][81400] Updated weights for policy 0, policy_version 39860 (0.0006) [2023-03-07 00:01:34,478][81400] Updated weights for policy 0, policy_version 39870 (0.0007) [2023-03-07 00:01:35,252][81400] Updated weights for policy 0, policy_version 39880 (0.0006) [2023-03-07 00:01:36,023][81400] Updated weights for policy 0, policy_version 39890 (0.0006) [2023-03-07 00:01:36,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13225.2). Total num frames: 40849408. Throughput: 0: 13261.4. Samples: 40822015. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:01:36,237][81074] Avg episode reward: [(0, '1343.444')] [2023-03-07 00:01:36,798][81400] Updated weights for policy 0, policy_version 39900 (0.0006) [2023-03-07 00:01:37,578][81400] Updated weights for policy 0, policy_version 39910 (0.0006) [2023-03-07 00:01:38,340][81400] Updated weights for policy 0, policy_version 39920 (0.0006) [2023-03-07 00:01:39,101][81400] Updated weights for policy 0, policy_version 39930 (0.0006) [2023-03-07 00:01:39,870][81400] Updated weights for policy 0, policy_version 39940 (0.0005) [2023-03-07 00:01:40,649][81400] Updated weights for policy 0, policy_version 39950 (0.0005) [2023-03-07 00:01:41,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13225.2). Total num frames: 40915968. Throughput: 0: 13265.3. Samples: 40901738. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:01:41,237][81074] Avg episode reward: [(0, '1212.455')] [2023-03-07 00:01:41,403][81400] Updated weights for policy 0, policy_version 39960 (0.0006) [2023-03-07 00:01:42,201][81400] Updated weights for policy 0, policy_version 39970 (0.0006) [2023-03-07 00:01:42,990][81400] Updated weights for policy 0, policy_version 39980 (0.0006) [2023-03-07 00:01:43,768][81400] Updated weights for policy 0, policy_version 39990 (0.0006) [2023-03-07 00:01:44,561][81400] Updated weights for policy 0, policy_version 40000 (0.0006) [2023-03-07 00:01:45,327][81400] Updated weights for policy 0, policy_version 40010 (0.0006) [2023-03-07 00:01:46,104][81400] Updated weights for policy 0, policy_version 40020 (0.0006) [2023-03-07 00:01:46,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13243.7, 300 sec: 13225.2). Total num frames: 40981504. Throughput: 0: 13243.1. Samples: 40980593. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:01:46,237][81074] Avg episode reward: [(0, '1339.884')] [2023-03-07 00:01:46,878][81400] Updated weights for policy 0, policy_version 40030 (0.0005) [2023-03-07 00:01:47,656][81400] Updated weights for policy 0, policy_version 40040 (0.0006) [2023-03-07 00:01:48,430][81400] Updated weights for policy 0, policy_version 40050 (0.0006) [2023-03-07 00:01:49,195][81400] Updated weights for policy 0, policy_version 40060 (0.0006) [2023-03-07 00:01:49,957][81400] Updated weights for policy 0, policy_version 40070 (0.0006) [2023-03-07 00:01:50,741][81400] Updated weights for policy 0, policy_version 40080 (0.0006) [2023-03-07 00:01:51,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13260.8, 300 sec: 13225.2). Total num frames: 41048064. Throughput: 0: 13247.7. Samples: 41020439. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:01:51,237][81074] Avg episode reward: [(0, '1359.206')] [2023-03-07 00:01:51,530][81400] Updated weights for policy 0, policy_version 40090 (0.0007) [2023-03-07 00:01:52,293][81400] Updated weights for policy 0, policy_version 40100 (0.0006) [2023-03-07 00:01:53,065][81400] Updated weights for policy 0, policy_version 40110 (0.0006) [2023-03-07 00:01:53,842][81400] Updated weights for policy 0, policy_version 40120 (0.0007) [2023-03-07 00:01:54,611][81400] Updated weights for policy 0, policy_version 40130 (0.0007) [2023-03-07 00:01:55,387][81400] Updated weights for policy 0, policy_version 40140 (0.0006) [2023-03-07 00:01:56,141][81400] Updated weights for policy 0, policy_version 40150 (0.0006) [2023-03-07 00:01:56,236][81074] Fps is (10 sec: 13312.1, 60 sec: 13260.8, 300 sec: 13228.7). Total num frames: 41114624. Throughput: 0: 13245.9. Samples: 41099724. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:01:56,237][81074] Avg episode reward: [(0, '1384.788')] [2023-03-07 00:01:56,242][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000040151_41114624.pth... 
[2023-03-07 00:01:56,274][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000037051_37940224.pth [2023-03-07 00:01:56,904][81400] Updated weights for policy 0, policy_version 40160 (0.0005) [2023-03-07 00:01:57,693][81400] Updated weights for policy 0, policy_version 40170 (0.0006) [2023-03-07 00:01:58,456][81400] Updated weights for policy 0, policy_version 40180 (0.0006) [2023-03-07 00:01:59,225][81400] Updated weights for policy 0, policy_version 40190 (0.0006) [2023-03-07 00:02:00,007][81400] Updated weights for policy 0, policy_version 40200 (0.0005) [2023-03-07 00:02:00,786][81400] Updated weights for policy 0, policy_version 40210 (0.0006) [2023-03-07 00:02:01,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13225.2). Total num frames: 41180160. Throughput: 0: 13247.5. Samples: 41179222. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:02:01,247][81074] Avg episode reward: [(0, '1206.807')] [2023-03-07 00:02:01,547][81400] Updated weights for policy 0, policy_version 40220 (0.0006) [2023-03-07 00:02:02,315][81400] Updated weights for policy 0, policy_version 40230 (0.0006) [2023-03-07 00:02:03,092][81400] Updated weights for policy 0, policy_version 40240 (0.0007) [2023-03-07 00:02:03,890][81400] Updated weights for policy 0, policy_version 40250 (0.0006) [2023-03-07 00:02:04,565][81349] KL-divergence is very high: 133.4643 [2023-03-07 00:02:04,649][81400] Updated weights for policy 0, policy_version 40260 (0.0006) [2023-03-07 00:02:05,434][81400] Updated weights for policy 0, policy_version 40270 (0.0006) [2023-03-07 00:02:06,207][81400] Updated weights for policy 0, policy_version 40280 (0.0005) [2023-03-07 00:02:06,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13243.8, 300 sec: 13228.7). Total num frames: 41246720. Throughput: 0: 13251.3. Samples: 41218997. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:02:06,247][81074] Avg episode reward: [(0, '1219.753')] [2023-03-07 00:02:06,971][81400] Updated weights for policy 0, policy_version 40290 (0.0006) [2023-03-07 00:02:07,747][81400] Updated weights for policy 0, policy_version 40300 (0.0006) [2023-03-07 00:02:08,549][81400] Updated weights for policy 0, policy_version 40310 (0.0006) [2023-03-07 00:02:09,310][81400] Updated weights for policy 0, policy_version 40320 (0.0007) [2023-03-07 00:02:10,105][81400] Updated weights for policy 0, policy_version 40330 (0.0006) [2023-03-07 00:02:10,877][81400] Updated weights for policy 0, policy_version 40340 (0.0007) [2023-03-07 00:02:11,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13226.6, 300 sec: 13225.2). Total num frames: 41312256. Throughput: 0: 13229.9. Samples: 41298085. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:02:11,237][81074] Avg episode reward: [(0, '1363.492')] [2023-03-07 00:02:11,647][81400] Updated weights for policy 0, policy_version 40350 (0.0007) [2023-03-07 00:02:12,421][81400] Updated weights for policy 0, policy_version 40360 (0.0007) [2023-03-07 00:02:13,192][81400] Updated weights for policy 0, policy_version 40370 (0.0006) [2023-03-07 00:02:13,957][81400] Updated weights for policy 0, policy_version 40380 (0.0006) [2023-03-07 00:02:14,757][81400] Updated weights for policy 0, policy_version 40390 (0.0006) [2023-03-07 00:02:15,526][81400] Updated weights for policy 0, policy_version 40400 (0.0005) [2023-03-07 00:02:16,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13228.7). Total num frames: 41378816. Throughput: 0: 13220.1. Samples: 41377317. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:02:16,237][81074] Avg episode reward: [(0, '1366.353')] [2023-03-07 00:02:16,286][81400] Updated weights for policy 0, policy_version 40410 (0.0006) [2023-03-07 00:02:17,069][81400] Updated weights for policy 0, policy_version 40420 (0.0006) [2023-03-07 00:02:17,839][81400] Updated weights for policy 0, policy_version 40430 (0.0006) [2023-03-07 00:02:18,625][81400] Updated weights for policy 0, policy_version 40440 (0.0005) [2023-03-07 00:02:19,407][81400] Updated weights for policy 0, policy_version 40450 (0.0006) [2023-03-07 00:02:20,177][81400] Updated weights for policy 0, policy_version 40460 (0.0006) [2023-03-07 00:02:20,959][81400] Updated weights for policy 0, policy_version 40470 (0.0006) [2023-03-07 00:02:21,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13225.2). Total num frames: 41444352. Throughput: 0: 13220.2. Samples: 41416922. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:02:21,237][81074] Avg episode reward: [(0, '1401.946')] [2023-03-07 00:02:21,726][81400] Updated weights for policy 0, policy_version 40480 (0.0006) [2023-03-07 00:02:22,494][81400] Updated weights for policy 0, policy_version 40490 (0.0007) [2023-03-07 00:02:23,275][81400] Updated weights for policy 0, policy_version 40500 (0.0006) [2023-03-07 00:02:24,055][81400] Updated weights for policy 0, policy_version 40510 (0.0006) [2023-03-07 00:02:24,813][81400] Updated weights for policy 0, policy_version 40520 (0.0006) [2023-03-07 00:02:25,610][81400] Updated weights for policy 0, policy_version 40530 (0.0005) [2023-03-07 00:02:26,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13226.6, 300 sec: 13225.2). Total num frames: 41510912. Throughput: 0: 13209.8. Samples: 41496181. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:02:26,237][81074] Avg episode reward: [(0, '1260.674')] [2023-03-07 00:02:26,387][81400] Updated weights for policy 0, policy_version 40540 (0.0006) [2023-03-07 00:02:27,145][81400] Updated weights for policy 0, policy_version 40550 (0.0008) [2023-03-07 00:02:27,929][81400] Updated weights for policy 0, policy_version 40560 (0.0006) [2023-03-07 00:02:28,696][81400] Updated weights for policy 0, policy_version 40570 (0.0006) [2023-03-07 00:02:29,479][81400] Updated weights for policy 0, policy_version 40580 (0.0006) [2023-03-07 00:02:30,240][81400] Updated weights for policy 0, policy_version 40590 (0.0006) [2023-03-07 00:02:31,017][81400] Updated weights for policy 0, policy_version 40600 (0.0006) [2023-03-07 00:02:31,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13209.6, 300 sec: 13225.2). Total num frames: 41576448. Throughput: 0: 13223.2. Samples: 41575635. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:02:31,237][81074] Avg episode reward: [(0, '1389.781')] [2023-03-07 00:02:31,787][81400] Updated weights for policy 0, policy_version 40610 (0.0006) [2023-03-07 00:02:32,556][81400] Updated weights for policy 0, policy_version 40620 (0.0006) [2023-03-07 00:02:33,324][81400] Updated weights for policy 0, policy_version 40630 (0.0006) [2023-03-07 00:02:34,099][81400] Updated weights for policy 0, policy_version 40640 (0.0005) [2023-03-07 00:02:34,875][81400] Updated weights for policy 0, policy_version 40650 (0.0006) [2023-03-07 00:02:35,654][81400] Updated weights for policy 0, policy_version 40660 (0.0007) [2023-03-07 00:02:36,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13226.6, 300 sec: 13225.2). Total num frames: 41643008. Throughput: 0: 13225.9. Samples: 41615609. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:02:36,237][81074] Avg episode reward: [(0, '1294.995')] [2023-03-07 00:02:36,434][81400] Updated weights for policy 0, policy_version 40670 (0.0006) [2023-03-07 00:02:37,211][81400] Updated weights for policy 0, policy_version 40680 (0.0007) [2023-03-07 00:02:37,971][81400] Updated weights for policy 0, policy_version 40690 (0.0006) [2023-03-07 00:02:38,755][81400] Updated weights for policy 0, policy_version 40700 (0.0006) [2023-03-07 00:02:39,529][81400] Updated weights for policy 0, policy_version 40710 (0.0006) [2023-03-07 00:02:40,316][81400] Updated weights for policy 0, policy_version 40720 (0.0006) [2023-03-07 00:02:41,086][81400] Updated weights for policy 0, policy_version 40730 (0.0006) [2023-03-07 00:02:41,236][81074] Fps is (10 sec: 13209.8, 60 sec: 13209.6, 300 sec: 13225.2). Total num frames: 41708544. Throughput: 0: 13217.8. Samples: 41694523. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:02:41,237][81074] Avg episode reward: [(0, '1276.398')] [2023-03-07 00:02:41,865][81400] Updated weights for policy 0, policy_version 40740 (0.0006) [2023-03-07 00:02:42,653][81400] Updated weights for policy 0, policy_version 40750 (0.0007) [2023-03-07 00:02:43,424][81400] Updated weights for policy 0, policy_version 40760 (0.0007) [2023-03-07 00:02:44,199][81400] Updated weights for policy 0, policy_version 40770 (0.0006) [2023-03-07 00:02:44,990][81400] Updated weights for policy 0, policy_version 40780 (0.0006) [2023-03-07 00:02:45,769][81400] Updated weights for policy 0, policy_version 40790 (0.0006) [2023-03-07 00:02:46,236][81074] Fps is (10 sec: 13209.9, 60 sec: 13226.7, 300 sec: 13225.2). Total num frames: 41775104. Throughput: 0: 13209.0. Samples: 41773628. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:02:46,237][81074] Avg episode reward: [(0, '1475.419')] [2023-03-07 00:02:46,536][81400] Updated weights for policy 0, policy_version 40800 (0.0006) [2023-03-07 00:02:47,302][81400] Updated weights for policy 0, policy_version 40810 (0.0006) [2023-03-07 00:02:48,074][81400] Updated weights for policy 0, policy_version 40820 (0.0006) [2023-03-07 00:02:48,855][81400] Updated weights for policy 0, policy_version 40830 (0.0006) [2023-03-07 00:02:49,615][81400] Updated weights for policy 0, policy_version 40840 (0.0006) [2023-03-07 00:02:50,401][81400] Updated weights for policy 0, policy_version 40850 (0.0006) [2023-03-07 00:02:51,169][81400] Updated weights for policy 0, policy_version 40860 (0.0006) [2023-03-07 00:02:51,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13209.6, 300 sec: 13221.8). Total num frames: 41840640. Throughput: 0: 13206.3. Samples: 41813281. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:02:51,237][81074] Avg episode reward: [(0, '1453.743')] [2023-03-07 00:02:51,947][81400] Updated weights for policy 0, policy_version 40870 (0.0005) [2023-03-07 00:02:52,732][81400] Updated weights for policy 0, policy_version 40880 (0.0007) [2023-03-07 00:02:53,499][81400] Updated weights for policy 0, policy_version 40890 (0.0006) [2023-03-07 00:02:54,301][81400] Updated weights for policy 0, policy_version 40900 (0.0006) [2023-03-07 00:02:55,087][81400] Updated weights for policy 0, policy_version 40910 (0.0007) [2023-03-07 00:02:55,866][81400] Updated weights for policy 0, policy_version 40920 (0.0007) [2023-03-07 00:02:56,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13192.5, 300 sec: 13221.7). Total num frames: 41906176. Throughput: 0: 13200.3. Samples: 41892098. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:02:56,237][81074] Avg episode reward: [(0, '1483.258')] [2023-03-07 00:02:56,651][81400] Updated weights for policy 0, policy_version 40930 (0.0006) [2023-03-07 00:02:57,422][81400] Updated weights for policy 0, policy_version 40940 (0.0006) [2023-03-07 00:02:58,208][81400] Updated weights for policy 0, policy_version 40950 (0.0006) [2023-03-07 00:02:58,985][81400] Updated weights for policy 0, policy_version 40960 (0.0007) [2023-03-07 00:02:59,753][81400] Updated weights for policy 0, policy_version 40970 (0.0006) [2023-03-07 00:03:00,514][81400] Updated weights for policy 0, policy_version 40980 (0.0005) [2023-03-07 00:03:01,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13209.6, 300 sec: 13221.7). Total num frames: 41972736. Throughput: 0: 13199.6. Samples: 41971299. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:03:01,237][81074] Avg episode reward: [(0, '1602.250')] [2023-03-07 00:03:01,297][81400] Updated weights for policy 0, policy_version 40990 (0.0006) [2023-03-07 00:03:02,063][81400] Updated weights for policy 0, policy_version 41000 (0.0006) [2023-03-07 00:03:02,840][81400] Updated weights for policy 0, policy_version 41010 (0.0006) [2023-03-07 00:03:03,634][81400] Updated weights for policy 0, policy_version 41020 (0.0006) [2023-03-07 00:03:04,407][81400] Updated weights for policy 0, policy_version 41030 (0.0006) [2023-03-07 00:03:05,182][81400] Updated weights for policy 0, policy_version 41040 (0.0006) [2023-03-07 00:03:05,957][81400] Updated weights for policy 0, policy_version 41050 (0.0007) [2023-03-07 00:03:06,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13192.5, 300 sec: 13221.8). Total num frames: 42038272. Throughput: 0: 13197.9. Samples: 42010828. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:03:06,237][81074] Avg episode reward: [(0, '1468.000')] [2023-03-07 00:03:06,733][81400] Updated weights for policy 0, policy_version 41060 (0.0006) [2023-03-07 00:03:07,497][81400] Updated weights for policy 0, policy_version 41070 (0.0005) [2023-03-07 00:03:08,274][81400] Updated weights for policy 0, policy_version 41080 (0.0005) [2023-03-07 00:03:09,048][81400] Updated weights for policy 0, policy_version 41090 (0.0006) [2023-03-07 00:03:09,428][81349] KL-divergence is very high: 168.8180 [2023-03-07 00:03:09,810][81400] Updated weights for policy 0, policy_version 41100 (0.0006) [2023-03-07 00:03:10,583][81400] Updated weights for policy 0, policy_version 41110 (0.0005) [2023-03-07 00:03:11,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13209.6, 300 sec: 13225.2). Total num frames: 42104832. Throughput: 0: 13202.3. Samples: 42090286. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:03:11,237][81074] Avg episode reward: [(0, '1421.389')] [2023-03-07 00:03:11,364][81400] Updated weights for policy 0, policy_version 41120 (0.0006) [2023-03-07 00:03:12,155][81400] Updated weights for policy 0, policy_version 41130 (0.0005) [2023-03-07 00:03:12,917][81400] Updated weights for policy 0, policy_version 41140 (0.0006) [2023-03-07 00:03:13,682][81400] Updated weights for policy 0, policy_version 41150 (0.0006) [2023-03-07 00:03:14,449][81400] Updated weights for policy 0, policy_version 41160 (0.0006) [2023-03-07 00:03:15,237][81400] Updated weights for policy 0, policy_version 41170 (0.0006) [2023-03-07 00:03:16,017][81400] Updated weights for policy 0, policy_version 41180 (0.0006) [2023-03-07 00:03:16,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13221.8). Total num frames: 42170368. Throughput: 0: 13196.1. Samples: 42169456. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:03:16,237][81074] Avg episode reward: [(0, '1470.877')] [2023-03-07 00:03:16,792][81400] Updated weights for policy 0, policy_version 41190 (0.0006) [2023-03-07 00:03:17,565][81400] Updated weights for policy 0, policy_version 41200 (0.0006) [2023-03-07 00:03:18,340][81400] Updated weights for policy 0, policy_version 41210 (0.0006) [2023-03-07 00:03:19,124][81400] Updated weights for policy 0, policy_version 41220 (0.0005) [2023-03-07 00:03:19,874][81400] Updated weights for policy 0, policy_version 41230 (0.0005) [2023-03-07 00:03:20,670][81400] Updated weights for policy 0, policy_version 41240 (0.0007) [2023-03-07 00:03:21,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13209.6, 300 sec: 13221.7). Total num frames: 42236928. Throughput: 0: 13191.5. Samples: 42209223. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 00:03:21,247][81074] Avg episode reward: [(0, '1491.965')] [2023-03-07 00:03:21,434][81400] Updated weights for policy 0, policy_version 41250 (0.0006) [2023-03-07 00:03:22,221][81400] Updated weights for policy 0, policy_version 41260 (0.0006) [2023-03-07 00:03:23,007][81400] Updated weights for policy 0, policy_version 41270 (0.0006) [2023-03-07 00:03:23,783][81400] Updated weights for policy 0, policy_version 41280 (0.0007) [2023-03-07 00:03:24,566][81400] Updated weights for policy 0, policy_version 41290 (0.0007) [2023-03-07 00:03:25,348][81400] Updated weights for policy 0, policy_version 41300 (0.0007) [2023-03-07 00:03:26,109][81400] Updated weights for policy 0, policy_version 41310 (0.0005) [2023-03-07 00:03:26,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13192.5, 300 sec: 13218.3). Total num frames: 42302464. Throughput: 0: 13192.3. Samples: 42288177. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 00:03:26,247][81074] Avg episode reward: [(0, '1534.007')] [2023-03-07 00:03:26,894][81400] Updated weights for policy 0, policy_version 41320 (0.0005) [2023-03-07 00:03:27,678][81400] Updated weights for policy 0, policy_version 41330 (0.0005) [2023-03-07 00:03:28,449][81400] Updated weights for policy 0, policy_version 41340 (0.0007) [2023-03-07 00:03:29,237][81400] Updated weights for policy 0, policy_version 41350 (0.0006) [2023-03-07 00:03:30,000][81400] Updated weights for policy 0, policy_version 41360 (0.0006) [2023-03-07 00:03:30,781][81400] Updated weights for policy 0, policy_version 41370 (0.0005) [2023-03-07 00:03:31,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13192.6, 300 sec: 13218.3). Total num frames: 42368000. Throughput: 0: 13192.9. Samples: 42367311. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 00:03:31,247][81074] Avg episode reward: [(0, '1526.180')] [2023-03-07 00:03:31,565][81400] Updated weights for policy 0, policy_version 41380 (0.0006) [2023-03-07 00:03:32,330][81400] Updated weights for policy 0, policy_version 41390 (0.0006) [2023-03-07 00:03:33,095][81400] Updated weights for policy 0, policy_version 41400 (0.0006) [2023-03-07 00:03:33,870][81400] Updated weights for policy 0, policy_version 41410 (0.0006) [2023-03-07 00:03:34,627][81400] Updated weights for policy 0, policy_version 41420 (0.0006) [2023-03-07 00:03:35,385][81400] Updated weights for policy 0, policy_version 41430 (0.0006) [2023-03-07 00:03:36,152][81400] Updated weights for policy 0, policy_version 41440 (0.0006) [2023-03-07 00:03:36,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13218.3). Total num frames: 42434560. Throughput: 0: 13198.9. Samples: 42407232. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 00:03:36,248][81074] Avg episode reward: [(0, '1551.659')] [2023-03-07 00:03:36,950][81400] Updated weights for policy 0, policy_version 41450 (0.0005) [2023-03-07 00:03:37,719][81400] Updated weights for policy 0, policy_version 41460 (0.0007) [2023-03-07 00:03:38,480][81400] Updated weights for policy 0, policy_version 41470 (0.0006) [2023-03-07 00:03:39,269][81400] Updated weights for policy 0, policy_version 41480 (0.0006) [2023-03-07 00:03:40,009][81400] Updated weights for policy 0, policy_version 41490 (0.0007) [2023-03-07 00:03:40,793][81400] Updated weights for policy 0, policy_version 41500 (0.0007) [2023-03-07 00:03:41,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13209.6, 300 sec: 13221.8). Total num frames: 42501120. Throughput: 0: 13212.7. Samples: 42486669. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 00:03:41,247][81074] Avg episode reward: [(0, '1449.200')] [2023-03-07 00:03:41,579][81400] Updated weights for policy 0, policy_version 41510 (0.0006) [2023-03-07 00:03:42,357][81400] Updated weights for policy 0, policy_version 41520 (0.0006) [2023-03-07 00:03:43,096][81400] Updated weights for policy 0, policy_version 41530 (0.0006) [2023-03-07 00:03:43,878][81400] Updated weights for policy 0, policy_version 41540 (0.0006) [2023-03-07 00:03:44,659][81400] Updated weights for policy 0, policy_version 41550 (0.0006) [2023-03-07 00:03:45,438][81400] Updated weights for policy 0, policy_version 41560 (0.0006) [2023-03-07 00:03:46,205][81400] Updated weights for policy 0, policy_version 41570 (0.0006) [2023-03-07 00:03:46,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13209.6, 300 sec: 13225.2). Total num frames: 42567680. Throughput: 0: 13221.4. Samples: 42566263. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 00:03:46,248][81074] Avg episode reward: [(0, '1518.974')] [2023-03-07 00:03:47,007][81400] Updated weights for policy 0, policy_version 41580 (0.0005) [2023-03-07 00:03:47,761][81400] Updated weights for policy 0, policy_version 41590 (0.0006) [2023-03-07 00:03:48,564][81400] Updated weights for policy 0, policy_version 41600 (0.0006) [2023-03-07 00:03:49,346][81400] Updated weights for policy 0, policy_version 41610 (0.0006) [2023-03-07 00:03:50,110][81400] Updated weights for policy 0, policy_version 41620 (0.0006) [2023-03-07 00:03:50,888][81400] Updated weights for policy 0, policy_version 41630 (0.0006) [2023-03-07 00:03:51,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13209.6, 300 sec: 13221.8). Total num frames: 42633216. Throughput: 0: 13220.3. Samples: 42605742. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 00:03:51,237][81074] Avg episode reward: [(0, '1411.206')] [2023-03-07 00:03:51,666][81400] Updated weights for policy 0, policy_version 41640 (0.0007) [2023-03-07 00:03:52,440][81400] Updated weights for policy 0, policy_version 41650 (0.0006) [2023-03-07 00:03:53,217][81400] Updated weights for policy 0, policy_version 41660 (0.0006) [2023-03-07 00:03:54,019][81400] Updated weights for policy 0, policy_version 41670 (0.0006) [2023-03-07 00:03:54,788][81400] Updated weights for policy 0, policy_version 41680 (0.0007) [2023-03-07 00:03:55,574][81400] Updated weights for policy 0, policy_version 41690 (0.0006) [2023-03-07 00:03:56,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13209.6, 300 sec: 13221.7). Total num frames: 42698752. Throughput: 0: 13205.3. Samples: 42684525. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 00:03:56,237][81074] Avg episode reward: [(0, '1332.924')] [2023-03-07 00:03:56,241][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000041698_42698752.pth... 
[2023-03-07 00:03:56,271][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000038600_39526400.pth [2023-03-07 00:03:56,355][81400] Updated weights for policy 0, policy_version 41700 (0.0006) [2023-03-07 00:03:57,121][81400] Updated weights for policy 0, policy_version 41710 (0.0006) [2023-03-07 00:03:57,898][81400] Updated weights for policy 0, policy_version 41720 (0.0006) [2023-03-07 00:03:58,681][81400] Updated weights for policy 0, policy_version 41730 (0.0006) [2023-03-07 00:03:59,457][81400] Updated weights for policy 0, policy_version 41740 (0.0007) [2023-03-07 00:04:00,231][81400] Updated weights for policy 0, policy_version 41750 (0.0007) [2023-03-07 00:04:01,019][81400] Updated weights for policy 0, policy_version 41760 (0.0005) [2023-03-07 00:04:01,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13192.5, 300 sec: 13218.3). Total num frames: 42764288. Throughput: 0: 13197.1. Samples: 42763325. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 00:04:01,237][81074] Avg episode reward: [(0, '1427.277')] [2023-03-07 00:04:01,821][81400] Updated weights for policy 0, policy_version 41770 (0.0007) [2023-03-07 00:04:02,585][81400] Updated weights for policy 0, policy_version 41780 (0.0006) [2023-03-07 00:04:03,373][81400] Updated weights for policy 0, policy_version 41790 (0.0008) [2023-03-07 00:04:04,141][81400] Updated weights for policy 0, policy_version 41800 (0.0006) [2023-03-07 00:04:04,925][81400] Updated weights for policy 0, policy_version 41810 (0.0006) [2023-03-07 00:04:05,702][81400] Updated weights for policy 0, policy_version 41820 (0.0006) [2023-03-07 00:04:06,236][81074] Fps is (10 sec: 13209.8, 60 sec: 13209.6, 300 sec: 13218.3). Total num frames: 42830848. Throughput: 0: 13189.4. Samples: 42802745. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 00:04:06,237][81074] Avg episode reward: [(0, '1485.610')] [2023-03-07 00:04:06,451][81400] Updated weights for policy 0, policy_version 41830 (0.0006) [2023-03-07 00:04:07,241][81400] Updated weights for policy 0, policy_version 41840 (0.0006) [2023-03-07 00:04:08,023][81400] Updated weights for policy 0, policy_version 41850 (0.0006) [2023-03-07 00:04:08,797][81400] Updated weights for policy 0, policy_version 41860 (0.0007) [2023-03-07 00:04:09,581][81400] Updated weights for policy 0, policy_version 41870 (0.0005) [2023-03-07 00:04:10,348][81400] Updated weights for policy 0, policy_version 41880 (0.0007) [2023-03-07 00:04:11,119][81400] Updated weights for policy 0, policy_version 41890 (0.0006) [2023-03-07 00:04:11,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13192.5, 300 sec: 13218.3). Total num frames: 42896384. Throughput: 0: 13192.8. Samples: 42881853. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 00:04:11,237][81074] Avg episode reward: [(0, '1443.874')] [2023-03-07 00:04:11,900][81400] Updated weights for policy 0, policy_version 41900 (0.0007) [2023-03-07 00:04:12,672][81400] Updated weights for policy 0, policy_version 41910 (0.0006) [2023-03-07 00:04:13,442][81400] Updated weights for policy 0, policy_version 41920 (0.0006) [2023-03-07 00:04:14,219][81400] Updated weights for policy 0, policy_version 41930 (0.0006) [2023-03-07 00:04:14,994][81400] Updated weights for policy 0, policy_version 41940 (0.0006) [2023-03-07 00:04:15,769][81400] Updated weights for policy 0, policy_version 41950 (0.0006) [2023-03-07 00:04:16,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13192.5, 300 sec: 13214.8). Total num frames: 42961920. Throughput: 0: 13198.2. Samples: 42961229. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 00:04:16,237][81074] Avg episode reward: [(0, '1444.878')] [2023-03-07 00:04:16,549][81400] Updated weights for policy 0, policy_version 41960 (0.0006) [2023-03-07 00:04:17,323][81400] Updated weights for policy 0, policy_version 41970 (0.0006) [2023-03-07 00:04:18,105][81400] Updated weights for policy 0, policy_version 41980 (0.0006) [2023-03-07 00:04:18,885][81400] Updated weights for policy 0, policy_version 41990 (0.0007) [2023-03-07 00:04:19,667][81400] Updated weights for policy 0, policy_version 42000 (0.0006) [2023-03-07 00:04:20,446][81400] Updated weights for policy 0, policy_version 42010 (0.0006) [2023-03-07 00:04:21,229][81400] Updated weights for policy 0, policy_version 42020 (0.0006) [2023-03-07 00:04:21,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13218.3). Total num frames: 43028480. Throughput: 0: 13189.7. Samples: 43000767. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 00:04:21,237][81074] Avg episode reward: [(0, '1507.514')] [2023-03-07 00:04:21,990][81400] Updated weights for policy 0, policy_version 42030 (0.0006) [2023-03-07 00:04:22,777][81400] Updated weights for policy 0, policy_version 42040 (0.0006) [2023-03-07 00:04:23,553][81400] Updated weights for policy 0, policy_version 42050 (0.0006) [2023-03-07 00:04:24,321][81400] Updated weights for policy 0, policy_version 42060 (0.0006) [2023-03-07 00:04:25,106][81400] Updated weights for policy 0, policy_version 42070 (0.0007) [2023-03-07 00:04:25,912][81400] Updated weights for policy 0, policy_version 42080 (0.0006) [2023-03-07 00:04:26,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13192.6, 300 sec: 13218.3). Total num frames: 43094016. Throughput: 0: 13178.8. Samples: 43079713. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 00:04:26,237][81074] Avg episode reward: [(0, '1555.407')] [2023-03-07 00:04:26,672][81400] Updated weights for policy 0, policy_version 42090 (0.0006) [2023-03-07 00:04:27,430][81400] Updated weights for policy 0, policy_version 42100 (0.0006) [2023-03-07 00:04:28,246][81400] Updated weights for policy 0, policy_version 42110 (0.0006) [2023-03-07 00:04:28,995][81400] Updated weights for policy 0, policy_version 42120 (0.0006) [2023-03-07 00:04:29,763][81400] Updated weights for policy 0, policy_version 42130 (0.0006) [2023-03-07 00:04:30,557][81400] Updated weights for policy 0, policy_version 42140 (0.0006) [2023-03-07 00:04:31,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13192.5, 300 sec: 13214.8). Total num frames: 43159552. Throughput: 0: 13163.9. Samples: 43158637. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 00:04:31,237][81074] Avg episode reward: [(0, '1696.444')] [2023-03-07 00:04:31,340][81400] Updated weights for policy 0, policy_version 42150 (0.0005) [2023-03-07 00:04:32,115][81400] Updated weights for policy 0, policy_version 42160 (0.0006) [2023-03-07 00:04:32,898][81400] Updated weights for policy 0, policy_version 42170 (0.0005) [2023-03-07 00:04:33,669][81400] Updated weights for policy 0, policy_version 42180 (0.0006) [2023-03-07 00:04:34,437][81400] Updated weights for policy 0, policy_version 42190 (0.0006) [2023-03-07 00:04:35,208][81400] Updated weights for policy 0, policy_version 42200 (0.0005) [2023-03-07 00:04:35,987][81400] Updated weights for policy 0, policy_version 42210 (0.0006) [2023-03-07 00:04:36,236][81074] Fps is (10 sec: 13107.0, 60 sec: 13175.5, 300 sec: 13211.3). Total num frames: 43225088. Throughput: 0: 13163.2. Samples: 43198088. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 00:04:36,237][81074] Avg episode reward: [(0, '1649.447')] [2023-03-07 00:04:36,761][81400] Updated weights for policy 0, policy_version 42220 (0.0006) [2023-03-07 00:04:37,536][81400] Updated weights for policy 0, policy_version 42230 (0.0005) [2023-03-07 00:04:38,327][81400] Updated weights for policy 0, policy_version 42240 (0.0006) [2023-03-07 00:04:39,109][81400] Updated weights for policy 0, policy_version 42250 (0.0006) [2023-03-07 00:04:39,880][81400] Updated weights for policy 0, policy_version 42260 (0.0007) [2023-03-07 00:04:40,655][81400] Updated weights for policy 0, policy_version 42270 (0.0006) [2023-03-07 00:04:41,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13214.8). Total num frames: 43291648. Throughput: 0: 13168.5. Samples: 43277108. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:04:41,237][81074] Avg episode reward: [(0, '1490.322')] [2023-03-07 00:04:41,434][81400] Updated weights for policy 0, policy_version 42280 (0.0005) [2023-03-07 00:04:42,208][81400] Updated weights for policy 0, policy_version 42290 (0.0007) [2023-03-07 00:04:42,987][81400] Updated weights for policy 0, policy_version 42300 (0.0007) [2023-03-07 00:04:43,775][81400] Updated weights for policy 0, policy_version 42310 (0.0007) [2023-03-07 00:04:44,528][81400] Updated weights for policy 0, policy_version 42320 (0.0006) [2023-03-07 00:04:45,313][81400] Updated weights for policy 0, policy_version 42330 (0.0006) [2023-03-07 00:04:46,082][81400] Updated weights for policy 0, policy_version 42340 (0.0006) [2023-03-07 00:04:46,236][81074] Fps is (10 sec: 13209.8, 60 sec: 13158.4, 300 sec: 13211.3). Total num frames: 43357184. Throughput: 0: 13182.5. Samples: 43356536. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:04:46,237][81074] Avg episode reward: [(0, '1634.922')] [2023-03-07 00:04:46,834][81400] Updated weights for policy 0, policy_version 42350 (0.0006) [2023-03-07 00:04:47,617][81400] Updated weights for policy 0, policy_version 42360 (0.0007) [2023-03-07 00:04:48,414][81400] Updated weights for policy 0, policy_version 42370 (0.0006) [2023-03-07 00:04:49,189][81400] Updated weights for policy 0, policy_version 42380 (0.0006) [2023-03-07 00:04:49,964][81400] Updated weights for policy 0, policy_version 42390 (0.0006) [2023-03-07 00:04:50,738][81400] Updated weights for policy 0, policy_version 42400 (0.0006) [2023-03-07 00:04:51,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13175.4, 300 sec: 13214.8). Total num frames: 43423744. Throughput: 0: 13187.6. Samples: 43396191. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:04:51,237][81074] Avg episode reward: [(0, '1691.778')] [2023-03-07 00:04:51,515][81400] Updated weights for policy 0, policy_version 42410 (0.0006) [2023-03-07 00:04:52,302][81400] Updated weights for policy 0, policy_version 42420 (0.0006) [2023-03-07 00:04:53,071][81400] Updated weights for policy 0, policy_version 42430 (0.0006) [2023-03-07 00:04:53,857][81400] Updated weights for policy 0, policy_version 42440 (0.0006) [2023-03-07 00:04:54,616][81400] Updated weights for policy 0, policy_version 42450 (0.0005) [2023-03-07 00:04:55,398][81400] Updated weights for policy 0, policy_version 42460 (0.0006) [2023-03-07 00:04:56,160][81400] Updated weights for policy 0, policy_version 42470 (0.0005) [2023-03-07 00:04:56,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13192.6, 300 sec: 13214.8). Total num frames: 43490304. Throughput: 0: 13185.1. Samples: 43475183. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:04:56,237][81074] Avg episode reward: [(0, '1755.584')] [2023-03-07 00:04:56,919][81400] Updated weights for policy 0, policy_version 42480 (0.0006) [2023-03-07 00:04:57,685][81400] Updated weights for policy 0, policy_version 42490 (0.0007) [2023-03-07 00:04:58,472][81400] Updated weights for policy 0, policy_version 42500 (0.0006) [2023-03-07 00:04:59,238][81400] Updated weights for policy 0, policy_version 42510 (0.0006) [2023-03-07 00:04:59,999][81400] Updated weights for policy 0, policy_version 42520 (0.0006) [2023-03-07 00:05:00,769][81400] Updated weights for policy 0, policy_version 42530 (0.0005) [2023-03-07 00:05:01,236][81074] Fps is (10 sec: 13312.1, 60 sec: 13209.6, 300 sec: 13218.3). Total num frames: 43556864. Throughput: 0: 13197.2. Samples: 43555102. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:05:01,237][81074] Avg episode reward: [(0, '1716.732')] [2023-03-07 00:05:01,546][81400] Updated weights for policy 0, policy_version 42540 (0.0006) [2023-03-07 00:05:02,329][81400] Updated weights for policy 0, policy_version 42550 (0.0007) [2023-03-07 00:05:03,106][81400] Updated weights for policy 0, policy_version 42560 (0.0006) [2023-03-07 00:05:03,895][81400] Updated weights for policy 0, policy_version 42570 (0.0006) [2023-03-07 00:05:04,678][81400] Updated weights for policy 0, policy_version 42580 (0.0006) [2023-03-07 00:05:05,462][81400] Updated weights for policy 0, policy_version 42590 (0.0006) [2023-03-07 00:05:06,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13175.4, 300 sec: 13211.3). Total num frames: 43621376. Throughput: 0: 13192.8. Samples: 43594445. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:05:06,237][81074] Avg episode reward: [(0, '1710.705')] [2023-03-07 00:05:06,244][81400] Updated weights for policy 0, policy_version 42600 (0.0006) [2023-03-07 00:05:07,024][81400] Updated weights for policy 0, policy_version 42610 (0.0007) [2023-03-07 00:05:07,782][81400] Updated weights for policy 0, policy_version 42620 (0.0006) [2023-03-07 00:05:08,556][81400] Updated weights for policy 0, policy_version 42630 (0.0006) [2023-03-07 00:05:09,335][81400] Updated weights for policy 0, policy_version 42640 (0.0007) [2023-03-07 00:05:10,120][81400] Updated weights for policy 0, policy_version 42650 (0.0007) [2023-03-07 00:05:10,884][81400] Updated weights for policy 0, policy_version 42660 (0.0006) [2023-03-07 00:05:11,236][81074] Fps is (10 sec: 13107.0, 60 sec: 13192.5, 300 sec: 13211.3). Total num frames: 43687936. Throughput: 0: 13195.7. Samples: 43673523. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:05:11,237][81074] Avg episode reward: [(0, '1816.432')] [2023-03-07 00:05:11,654][81400] Updated weights for policy 0, policy_version 42670 (0.0005) [2023-03-07 00:05:12,426][81400] Updated weights for policy 0, policy_version 42680 (0.0006) [2023-03-07 00:05:13,202][81400] Updated weights for policy 0, policy_version 42690 (0.0006) [2023-03-07 00:05:13,989][81400] Updated weights for policy 0, policy_version 42700 (0.0007) [2023-03-07 00:05:14,747][81400] Updated weights for policy 0, policy_version 42710 (0.0006) [2023-03-07 00:05:15,522][81400] Updated weights for policy 0, policy_version 42720 (0.0006) [2023-03-07 00:05:16,236][81074] Fps is (10 sec: 13312.1, 60 sec: 13209.6, 300 sec: 13214.8). Total num frames: 43754496. Throughput: 0: 13207.3. Samples: 43752964. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:05:16,237][81074] Avg episode reward: [(0, '1755.095')] [2023-03-07 00:05:16,306][81400] Updated weights for policy 0, policy_version 42730 (0.0006) [2023-03-07 00:05:17,085][81400] Updated weights for policy 0, policy_version 42740 (0.0006) [2023-03-07 00:05:17,873][81400] Updated weights for policy 0, policy_version 42750 (0.0006) [2023-03-07 00:05:18,643][81400] Updated weights for policy 0, policy_version 42760 (0.0005) [2023-03-07 00:05:19,408][81400] Updated weights for policy 0, policy_version 42770 (0.0005) [2023-03-07 00:05:20,182][81400] Updated weights for policy 0, policy_version 42780 (0.0006) [2023-03-07 00:05:20,937][81400] Updated weights for policy 0, policy_version 42790 (0.0007) [2023-03-07 00:05:21,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13192.5, 300 sec: 13214.8). Total num frames: 43820032. Throughput: 0: 13206.9. Samples: 43792397. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:05:21,237][81074] Avg episode reward: [(0, '1817.435')] [2023-03-07 00:05:21,737][81400] Updated weights for policy 0, policy_version 42800 (0.0007) [2023-03-07 00:05:22,502][81400] Updated weights for policy 0, policy_version 42810 (0.0006) [2023-03-07 00:05:23,284][81400] Updated weights for policy 0, policy_version 42820 (0.0006) [2023-03-07 00:05:24,050][81400] Updated weights for policy 0, policy_version 42830 (0.0006) [2023-03-07 00:05:24,819][81400] Updated weights for policy 0, policy_version 42840 (0.0006) [2023-03-07 00:05:25,592][81400] Updated weights for policy 0, policy_version 42850 (0.0006) [2023-03-07 00:05:26,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13209.6, 300 sec: 13214.8). Total num frames: 43886592. Throughput: 0: 13221.3. Samples: 43872067. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:05:26,237][81074] Avg episode reward: [(0, '1592.331')] [2023-03-07 00:05:26,378][81400] Updated weights for policy 0, policy_version 42860 (0.0006) [2023-03-07 00:05:27,146][81400] Updated weights for policy 0, policy_version 42870 (0.0006) [2023-03-07 00:05:27,925][81400] Updated weights for policy 0, policy_version 42880 (0.0006) [2023-03-07 00:05:28,716][81400] Updated weights for policy 0, policy_version 42890 (0.0006) [2023-03-07 00:05:29,493][81400] Updated weights for policy 0, policy_version 42900 (0.0006) [2023-03-07 00:05:30,285][81400] Updated weights for policy 0, policy_version 42910 (0.0006) [2023-03-07 00:05:31,067][81400] Updated weights for policy 0, policy_version 42920 (0.0007) [2023-03-07 00:05:31,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13209.6, 300 sec: 13211.3). Total num frames: 43952128. Throughput: 0: 13204.7. Samples: 43950750. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:05:31,237][81074] Avg episode reward: [(0, '1543.508')] [2023-03-07 00:05:31,828][81400] Updated weights for policy 0, policy_version 42930 (0.0006) [2023-03-07 00:05:32,614][81400] Updated weights for policy 0, policy_version 42940 (0.0006) [2023-03-07 00:05:33,390][81400] Updated weights for policy 0, policy_version 42950 (0.0006) [2023-03-07 00:05:34,149][81400] Updated weights for policy 0, policy_version 42960 (0.0006) [2023-03-07 00:05:34,933][81400] Updated weights for policy 0, policy_version 42970 (0.0006) [2023-03-07 00:05:35,709][81400] Updated weights for policy 0, policy_version 42980 (0.0006) [2023-03-07 00:05:36,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13214.8). Total num frames: 44018688. Throughput: 0: 13205.1. Samples: 43990420. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:05:36,237][81074] Avg episode reward: [(0, '1683.439')] [2023-03-07 00:05:36,475][81400] Updated weights for policy 0, policy_version 42990 (0.0006) [2023-03-07 00:05:37,250][81400] Updated weights for policy 0, policy_version 43000 (0.0006) [2023-03-07 00:05:38,030][81400] Updated weights for policy 0, policy_version 43010 (0.0006) [2023-03-07 00:05:38,809][81400] Updated weights for policy 0, policy_version 43020 (0.0006) [2023-03-07 00:05:39,587][81400] Updated weights for policy 0, policy_version 43030 (0.0006) [2023-03-07 00:05:40,361][81400] Updated weights for policy 0, policy_version 43040 (0.0006) [2023-03-07 00:05:41,120][81400] Updated weights for policy 0, policy_version 43050 (0.0006) [2023-03-07 00:05:41,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13209.6, 300 sec: 13211.3). Total num frames: 44084224. Throughput: 0: 13211.1. Samples: 44069682. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:05:41,237][81074] Avg episode reward: [(0, '1900.696')] [2023-03-07 00:05:41,878][81400] Updated weights for policy 0, policy_version 43060 (0.0007) [2023-03-07 00:05:42,657][81400] Updated weights for policy 0, policy_version 43070 (0.0005) [2023-03-07 00:05:43,452][81400] Updated weights for policy 0, policy_version 43080 (0.0006) [2023-03-07 00:05:44,218][81400] Updated weights for policy 0, policy_version 43090 (0.0007) [2023-03-07 00:05:44,988][81400] Updated weights for policy 0, policy_version 43100 (0.0006) [2023-03-07 00:05:45,784][81400] Updated weights for policy 0, policy_version 43110 (0.0005) [2023-03-07 00:05:46,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13209.6, 300 sec: 13211.3). Total num frames: 44149760. Throughput: 0: 13195.9. Samples: 44148918. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:05:46,237][81074] Avg episode reward: [(0, '1901.782')] [2023-03-07 00:05:46,570][81400] Updated weights for policy 0, policy_version 43120 (0.0006) [2023-03-07 00:05:47,342][81400] Updated weights for policy 0, policy_version 43130 (0.0006) [2023-03-07 00:05:48,123][81400] Updated weights for policy 0, policy_version 43140 (0.0006) [2023-03-07 00:05:48,897][81400] Updated weights for policy 0, policy_version 43150 (0.0006) [2023-03-07 00:05:49,687][81400] Updated weights for policy 0, policy_version 43160 (0.0006) [2023-03-07 00:05:50,451][81400] Updated weights for policy 0, policy_version 43170 (0.0006) [2023-03-07 00:05:51,232][81400] Updated weights for policy 0, policy_version 43180 (0.0006) [2023-03-07 00:05:51,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13209.6, 300 sec: 13211.3). Total num frames: 44216320. Throughput: 0: 13201.4. Samples: 44188508. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:05:51,237][81074] Avg episode reward: [(0, '1746.470')] [2023-03-07 00:05:51,985][81400] Updated weights for policy 0, policy_version 43190 (0.0006) [2023-03-07 00:05:52,769][81400] Updated weights for policy 0, policy_version 43200 (0.0005) [2023-03-07 00:05:53,556][81400] Updated weights for policy 0, policy_version 43210 (0.0006) [2023-03-07 00:05:54,341][81400] Updated weights for policy 0, policy_version 43220 (0.0006) [2023-03-07 00:05:55,112][81400] Updated weights for policy 0, policy_version 43230 (0.0006) [2023-03-07 00:05:55,888][81400] Updated weights for policy 0, policy_version 43240 (0.0006) [2023-03-07 00:05:56,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13192.5, 300 sec: 13207.9). Total num frames: 44281856. Throughput: 0: 13196.3. Samples: 44267355. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:05:56,237][81074] Avg episode reward: [(0, '1740.767')] [2023-03-07 00:05:56,241][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000043244_44281856.pth... [2023-03-07 00:05:56,270][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000040151_41114624.pth [2023-03-07 00:05:56,666][81400] Updated weights for policy 0, policy_version 43250 (0.0006) [2023-03-07 00:05:57,436][81400] Updated weights for policy 0, policy_version 43260 (0.0006) [2023-03-07 00:05:58,222][81400] Updated weights for policy 0, policy_version 43270 (0.0006) [2023-03-07 00:05:58,996][81400] Updated weights for policy 0, policy_version 43280 (0.0006) [2023-03-07 00:05:59,777][81400] Updated weights for policy 0, policy_version 43290 (0.0006) [2023-03-07 00:06:00,545][81400] Updated weights for policy 0, policy_version 43300 (0.0006) [2023-03-07 00:06:01,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13175.5, 300 sec: 13204.4). Total num frames: 44347392. Throughput: 0: 13195.8. Samples: 44346775. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:06:01,237][81074] Avg episode reward: [(0, '1757.982')] [2023-03-07 00:06:01,307][81400] Updated weights for policy 0, policy_version 43310 (0.0007) [2023-03-07 00:06:02,095][81400] Updated weights for policy 0, policy_version 43320 (0.0006) [2023-03-07 00:06:02,878][81400] Updated weights for policy 0, policy_version 43330 (0.0006) [2023-03-07 00:06:03,650][81400] Updated weights for policy 0, policy_version 43340 (0.0006) [2023-03-07 00:06:04,425][81400] Updated weights for policy 0, policy_version 43350 (0.0006) [2023-03-07 00:06:05,212][81400] Updated weights for policy 0, policy_version 43360 (0.0007) [2023-03-07 00:06:05,973][81400] Updated weights for policy 0, policy_version 43370 (0.0005) [2023-03-07 00:06:06,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13209.6, 300 sec: 13204.4). Total num frames: 44413952. Throughput: 0: 13195.7. Samples: 44386202. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:06:06,237][81074] Avg episode reward: [(0, '1808.452')] [2023-03-07 00:06:06,766][81400] Updated weights for policy 0, policy_version 43380 (0.0006) [2023-03-07 00:06:07,540][81400] Updated weights for policy 0, policy_version 43390 (0.0006) [2023-03-07 00:06:08,309][81400] Updated weights for policy 0, policy_version 43400 (0.0006) [2023-03-07 00:06:09,077][81400] Updated weights for policy 0, policy_version 43410 (0.0006) [2023-03-07 00:06:09,853][81400] Updated weights for policy 0, policy_version 43420 (0.0005) [2023-03-07 00:06:10,637][81400] Updated weights for policy 0, policy_version 43430 (0.0006) [2023-03-07 00:06:11,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.6, 300 sec: 13204.4). Total num frames: 44479488. Throughput: 0: 13182.7. Samples: 44465289. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:06:11,237][81074] Avg episode reward: [(0, '1714.295')] [2023-03-07 00:06:11,430][81400] Updated weights for policy 0, policy_version 43440 (0.0006) [2023-03-07 00:06:12,195][81400] Updated weights for policy 0, policy_version 43450 (0.0007) [2023-03-07 00:06:12,979][81400] Updated weights for policy 0, policy_version 43460 (0.0006) [2023-03-07 00:06:13,763][81400] Updated weights for policy 0, policy_version 43470 (0.0006) [2023-03-07 00:06:14,537][81400] Updated weights for policy 0, policy_version 43480 (0.0005) [2023-03-07 00:06:15,331][81400] Updated weights for policy 0, policy_version 43490 (0.0007) [2023-03-07 00:06:16,118][81400] Updated weights for policy 0, policy_version 43500 (0.0006) [2023-03-07 00:06:16,236][81074] Fps is (10 sec: 13107.0, 60 sec: 13175.4, 300 sec: 13200.9). Total num frames: 44545024. Throughput: 0: 13182.2. Samples: 44543951. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:06:16,237][81074] Avg episode reward: [(0, '1820.645')] [2023-03-07 00:06:16,886][81400] Updated weights for policy 0, policy_version 43510 (0.0005) [2023-03-07 00:06:17,657][81400] Updated weights for policy 0, policy_version 43520 (0.0006) [2023-03-07 00:06:18,430][81400] Updated weights for policy 0, policy_version 43530 (0.0006) [2023-03-07 00:06:19,211][81400] Updated weights for policy 0, policy_version 43540 (0.0006) [2023-03-07 00:06:19,986][81400] Updated weights for policy 0, policy_version 43550 (0.0006) [2023-03-07 00:06:20,784][81400] Updated weights for policy 0, policy_version 43560 (0.0006) [2023-03-07 00:06:21,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13192.6, 300 sec: 13200.9). Total num frames: 44611584. Throughput: 0: 13182.3. Samples: 44583621. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:06:21,237][81074] Avg episode reward: [(0, '1941.545')] [2023-03-07 00:06:21,551][81400] Updated weights for policy 0, policy_version 43570 (0.0006) [2023-03-07 00:06:22,342][81400] Updated weights for policy 0, policy_version 43580 (0.0007) [2023-03-07 00:06:23,093][81400] Updated weights for policy 0, policy_version 43590 (0.0006) [2023-03-07 00:06:23,877][81400] Updated weights for policy 0, policy_version 43600 (0.0006) [2023-03-07 00:06:24,622][81400] Updated weights for policy 0, policy_version 43610 (0.0005) [2023-03-07 00:06:25,400][81400] Updated weights for policy 0, policy_version 43620 (0.0006) [2023-03-07 00:06:26,175][81400] Updated weights for policy 0, policy_version 43630 (0.0006) [2023-03-07 00:06:26,236][81074] Fps is (10 sec: 13209.9, 60 sec: 13175.5, 300 sec: 13197.4). Total num frames: 44677120. Throughput: 0: 13183.2. Samples: 44662926. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:06:26,237][81074] Avg episode reward: [(0, '1814.027')] [2023-03-07 00:06:26,945][81400] Updated weights for policy 0, policy_version 43640 (0.0007) [2023-03-07 00:06:27,728][81400] Updated weights for policy 0, policy_version 43650 (0.0007) [2023-03-07 00:06:28,501][81400] Updated weights for policy 0, policy_version 43660 (0.0005) [2023-03-07 00:06:29,283][81400] Updated weights for policy 0, policy_version 43670 (0.0006) [2023-03-07 00:06:30,051][81400] Updated weights for policy 0, policy_version 43680 (0.0007) [2023-03-07 00:06:30,839][81400] Updated weights for policy 0, policy_version 43690 (0.0006) [2023-03-07 00:06:31,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13200.9). Total num frames: 44743680. Throughput: 0: 13179.9. Samples: 44742011. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:06:31,237][81074] Avg episode reward: [(0, '1937.596')] [2023-03-07 00:06:31,600][81400] Updated weights for policy 0, policy_version 43700 (0.0006) [2023-03-07 00:06:32,384][81400] Updated weights for policy 0, policy_version 43710 (0.0006) [2023-03-07 00:06:33,166][81400] Updated weights for policy 0, policy_version 43720 (0.0006) [2023-03-07 00:06:33,955][81400] Updated weights for policy 0, policy_version 43730 (0.0006) [2023-03-07 00:06:34,700][81400] Updated weights for policy 0, policy_version 43740 (0.0006) [2023-03-07 00:06:35,463][81400] Updated weights for policy 0, policy_version 43750 (0.0006) [2023-03-07 00:06:36,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13175.5, 300 sec: 13197.4). Total num frames: 44809216. Throughput: 0: 13182.0. Samples: 44781699. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:06:36,237][81074] Avg episode reward: [(0, '2019.456')] [2023-03-07 00:06:36,247][81400] Updated weights for policy 0, policy_version 43760 (0.0005) [2023-03-07 00:06:37,030][81400] Updated weights for policy 0, policy_version 43770 (0.0006) [2023-03-07 00:06:37,790][81400] Updated weights for policy 0, policy_version 43780 (0.0006) [2023-03-07 00:06:38,571][81400] Updated weights for policy 0, policy_version 43790 (0.0007) [2023-03-07 00:06:39,356][81400] Updated weights for policy 0, policy_version 43800 (0.0005) [2023-03-07 00:06:40,139][81400] Updated weights for policy 0, policy_version 43810 (0.0007) [2023-03-07 00:06:40,909][81400] Updated weights for policy 0, policy_version 43820 (0.0006) [2023-03-07 00:06:41,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13200.9). Total num frames: 44875776. Throughput: 0: 13195.2. Samples: 44861138. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:06:41,237][81074] Avg episode reward: [(0, '2066.823')] [2023-03-07 00:06:41,674][81400] Updated weights for policy 0, policy_version 43830 (0.0006) [2023-03-07 00:06:42,462][81400] Updated weights for policy 0, policy_version 43840 (0.0006) [2023-03-07 00:06:43,246][81400] Updated weights for policy 0, policy_version 43850 (0.0007) [2023-03-07 00:06:44,007][81400] Updated weights for policy 0, policy_version 43860 (0.0006) [2023-03-07 00:06:44,806][81400] Updated weights for policy 0, policy_version 43870 (0.0006) [2023-03-07 00:06:45,571][81400] Updated weights for policy 0, policy_version 43880 (0.0006) [2023-03-07 00:06:46,236][81074] Fps is (10 sec: 13209.8, 60 sec: 13192.6, 300 sec: 13197.5). Total num frames: 44941312. Throughput: 0: 13190.6. Samples: 44940352. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:06:46,237][81074] Avg episode reward: [(0, '2094.387')] [2023-03-07 00:06:46,341][81400] Updated weights for policy 0, policy_version 43890 (0.0006) [2023-03-07 00:06:47,126][81400] Updated weights for policy 0, policy_version 43900 (0.0006) [2023-03-07 00:06:47,905][81400] Updated weights for policy 0, policy_version 43910 (0.0006) [2023-03-07 00:06:48,654][81400] Updated weights for policy 0, policy_version 43920 (0.0006) [2023-03-07 00:06:49,439][81400] Updated weights for policy 0, policy_version 43930 (0.0007) [2023-03-07 00:06:50,215][81400] Updated weights for policy 0, policy_version 43940 (0.0007) [2023-03-07 00:06:50,993][81400] Updated weights for policy 0, policy_version 43950 (0.0006) [2023-03-07 00:06:51,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13192.5, 300 sec: 13197.5). Total num frames: 45007872. Throughput: 0: 13195.8. Samples: 44980011. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:06:51,237][81074] Avg episode reward: [(0, '1934.261')] [2023-03-07 00:06:51,767][81400] Updated weights for policy 0, policy_version 43960 (0.0006) [2023-03-07 00:06:52,535][81400] Updated weights for policy 0, policy_version 43970 (0.0006) [2023-03-07 00:06:53,307][81400] Updated weights for policy 0, policy_version 43980 (0.0005) [2023-03-07 00:06:54,080][81400] Updated weights for policy 0, policy_version 43990 (0.0006) [2023-03-07 00:06:54,873][81400] Updated weights for policy 0, policy_version 44000 (0.0006) [2023-03-07 00:06:55,643][81400] Updated weights for policy 0, policy_version 44010 (0.0006) [2023-03-07 00:06:56,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13192.5, 300 sec: 13197.4). Total num frames: 45073408. Throughput: 0: 13195.4. Samples: 45059083. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:06:56,237][81074] Avg episode reward: [(0, '2072.523')] [2023-03-07 00:06:56,416][81400] Updated weights for policy 0, policy_version 44020 (0.0007) [2023-03-07 00:06:57,187][81400] Updated weights for policy 0, policy_version 44030 (0.0006) [2023-03-07 00:06:57,958][81400] Updated weights for policy 0, policy_version 44040 (0.0006) [2023-03-07 00:06:58,727][81400] Updated weights for policy 0, policy_version 44050 (0.0007) [2023-03-07 00:06:59,527][81400] Updated weights for policy 0, policy_version 44060 (0.0006) [2023-03-07 00:07:00,321][81400] Updated weights for policy 0, policy_version 44070 (0.0006) [2023-03-07 00:07:01,077][81400] Updated weights for policy 0, policy_version 44080 (0.0008) [2023-03-07 00:07:01,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13192.6, 300 sec: 13194.0). Total num frames: 45138944. Throughput: 0: 13207.8. Samples: 45138300. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:07:01,237][81074] Avg episode reward: [(0, '1908.336')] [2023-03-07 00:07:01,850][81400] Updated weights for policy 0, policy_version 44090 (0.0006) [2023-03-07 00:07:02,633][81400] Updated weights for policy 0, policy_version 44100 (0.0007) [2023-03-07 00:07:03,392][81400] Updated weights for policy 0, policy_version 44110 (0.0006) [2023-03-07 00:07:04,176][81400] Updated weights for policy 0, policy_version 44120 (0.0006) [2023-03-07 00:07:04,939][81400] Updated weights for policy 0, policy_version 44130 (0.0005) [2023-03-07 00:07:05,729][81400] Updated weights for policy 0, policy_version 44140 (0.0006) [2023-03-07 00:07:06,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13192.6, 300 sec: 13197.5). Total num frames: 45205504. Throughput: 0: 13208.9. Samples: 45178022. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:07:06,237][81074] Avg episode reward: [(0, '1880.349')] [2023-03-07 00:07:06,487][81400] Updated weights for policy 0, policy_version 44150 (0.0006) [2023-03-07 00:07:07,269][81400] Updated weights for policy 0, policy_version 44160 (0.0006) [2023-03-07 00:07:08,050][81400] Updated weights for policy 0, policy_version 44170 (0.0007) [2023-03-07 00:07:08,804][81400] Updated weights for policy 0, policy_version 44180 (0.0007) [2023-03-07 00:07:09,586][81400] Updated weights for policy 0, policy_version 44190 (0.0007) [2023-03-07 00:07:10,367][81400] Updated weights for policy 0, policy_version 44200 (0.0007) [2023-03-07 00:07:11,159][81400] Updated weights for policy 0, policy_version 44210 (0.0006) [2023-03-07 00:07:11,236][81074] Fps is (10 sec: 13311.8, 60 sec: 13209.6, 300 sec: 13197.4). Total num frames: 45272064. Throughput: 0: 13206.0. Samples: 45257199. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:07:11,237][81074] Avg episode reward: [(0, '2070.110')] [2023-03-07 00:07:11,916][81400] Updated weights for policy 0, policy_version 44220 (0.0007) [2023-03-07 00:07:12,725][81400] Updated weights for policy 0, policy_version 44230 (0.0006) [2023-03-07 00:07:13,498][81400] Updated weights for policy 0, policy_version 44240 (0.0007) [2023-03-07 00:07:14,276][81400] Updated weights for policy 0, policy_version 44250 (0.0007) [2023-03-07 00:07:15,042][81400] Updated weights for policy 0, policy_version 44260 (0.0006) [2023-03-07 00:07:15,814][81400] Updated weights for policy 0, policy_version 44270 (0.0006) [2023-03-07 00:07:16,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13209.7, 300 sec: 13197.5). Total num frames: 45337600. Throughput: 0: 13211.4. Samples: 45336521. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:07:16,237][81074] Avg episode reward: [(0, '1740.763')] [2023-03-07 00:07:16,581][81400] Updated weights for policy 0, policy_version 44280 (0.0005) [2023-03-07 00:07:17,360][81400] Updated weights for policy 0, policy_version 44290 (0.0006) [2023-03-07 00:07:18,139][81400] Updated weights for policy 0, policy_version 44300 (0.0007) [2023-03-07 00:07:18,917][81400] Updated weights for policy 0, policy_version 44310 (0.0007) [2023-03-07 00:07:19,683][81400] Updated weights for policy 0, policy_version 44320 (0.0006) [2023-03-07 00:07:20,451][81400] Updated weights for policy 0, policy_version 44330 (0.0006) [2023-03-07 00:07:21,229][81400] Updated weights for policy 0, policy_version 44340 (0.0006) [2023-03-07 00:07:21,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13209.6, 300 sec: 13197.5). Total num frames: 45404160. Throughput: 0: 13207.4. Samples: 45376030. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:07:21,237][81074] Avg episode reward: [(0, '1882.513')] [2023-03-07 00:07:21,998][81400] Updated weights for policy 0, policy_version 44350 (0.0006) [2023-03-07 00:07:22,777][81400] Updated weights for policy 0, policy_version 44360 (0.0006) [2023-03-07 00:07:23,565][81400] Updated weights for policy 0, policy_version 44370 (0.0006) [2023-03-07 00:07:24,336][81400] Updated weights for policy 0, policy_version 44380 (0.0006) [2023-03-07 00:07:25,119][81400] Updated weights for policy 0, policy_version 44390 (0.0006) [2023-03-07 00:07:25,886][81400] Updated weights for policy 0, policy_version 44400 (0.0006) [2023-03-07 00:07:26,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13209.6, 300 sec: 13197.5). Total num frames: 45469696. Throughput: 0: 13202.7. Samples: 45455260. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:07:26,237][81074] Avg episode reward: [(0, '1999.273')] [2023-03-07 00:07:26,665][81400] Updated weights for policy 0, policy_version 44410 (0.0006) [2023-03-07 00:07:27,450][81400] Updated weights for policy 0, policy_version 44420 (0.0006) [2023-03-07 00:07:28,218][81400] Updated weights for policy 0, policy_version 44430 (0.0007) [2023-03-07 00:07:28,998][81400] Updated weights for policy 0, policy_version 44440 (0.0006) [2023-03-07 00:07:29,757][81400] Updated weights for policy 0, policy_version 44450 (0.0006) [2023-03-07 00:07:30,530][81400] Updated weights for policy 0, policy_version 44460 (0.0006) [2023-03-07 00:07:31,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13209.6, 300 sec: 13197.5). Total num frames: 45536256. Throughput: 0: 13205.1. Samples: 45534582. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:07:31,237][81074] Avg episode reward: [(0, '1810.576')] [2023-03-07 00:07:31,313][81400] Updated weights for policy 0, policy_version 44470 (0.0006) [2023-03-07 00:07:32,073][81400] Updated weights for policy 0, policy_version 44480 (0.0005) [2023-03-07 00:07:32,853][81400] Updated weights for policy 0, policy_version 44490 (0.0006) [2023-03-07 00:07:33,634][81400] Updated weights for policy 0, policy_version 44500 (0.0005) [2023-03-07 00:07:34,397][81400] Updated weights for policy 0, policy_version 44510 (0.0006) [2023-03-07 00:07:35,170][81400] Updated weights for policy 0, policy_version 44520 (0.0006) [2023-03-07 00:07:35,959][81400] Updated weights for policy 0, policy_version 44530 (0.0006) [2023-03-07 00:07:36,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13209.6, 300 sec: 13197.5). Total num frames: 45601792. Throughput: 0: 13208.3. Samples: 45574384. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:07:36,237][81074] Avg episode reward: [(0, '1978.859')] [2023-03-07 00:07:36,738][81400] Updated weights for policy 0, policy_version 44540 (0.0006) [2023-03-07 00:07:37,512][81400] Updated weights for policy 0, policy_version 44550 (0.0006) [2023-03-07 00:07:38,308][81400] Updated weights for policy 0, policy_version 44560 (0.0006) [2023-03-07 00:07:39,067][81400] Updated weights for policy 0, policy_version 44570 (0.0006) [2023-03-07 00:07:39,857][81400] Updated weights for policy 0, policy_version 44580 (0.0007) [2023-03-07 00:07:40,624][81400] Updated weights for policy 0, policy_version 44590 (0.0010) [2023-03-07 00:07:41,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13192.5, 300 sec: 13194.0). Total num frames: 45667328. Throughput: 0: 13206.5. Samples: 45653374. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:07:41,237][81074] Avg episode reward: [(0, '1816.067')] [2023-03-07 00:07:41,415][81400] Updated weights for policy 0, policy_version 44600 (0.0006) [2023-03-07 00:07:42,189][81400] Updated weights for policy 0, policy_version 44610 (0.0007) [2023-03-07 00:07:42,960][81400] Updated weights for policy 0, policy_version 44620 (0.0005) [2023-03-07 00:07:43,725][81400] Updated weights for policy 0, policy_version 44630 (0.0007) [2023-03-07 00:07:44,498][81400] Updated weights for policy 0, policy_version 44640 (0.0006) [2023-03-07 00:07:45,261][81400] Updated weights for policy 0, policy_version 44650 (0.0006) [2023-03-07 00:07:46,044][81400] Updated weights for policy 0, policy_version 44660 (0.0007) [2023-03-07 00:07:46,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13209.6, 300 sec: 13197.5). Total num frames: 45733888. Throughput: 0: 13205.2. Samples: 45732533. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:07:46,237][81074] Avg episode reward: [(0, '2121.326')] [2023-03-07 00:07:46,822][81400] Updated weights for policy 0, policy_version 44670 (0.0006) [2023-03-07 00:07:47,605][81400] Updated weights for policy 0, policy_version 44680 (0.0006) [2023-03-07 00:07:48,358][81400] Updated weights for policy 0, policy_version 44690 (0.0006) [2023-03-07 00:07:49,131][81400] Updated weights for policy 0, policy_version 44700 (0.0006) [2023-03-07 00:07:49,911][81400] Updated weights for policy 0, policy_version 44710 (0.0005) [2023-03-07 00:07:50,682][81400] Updated weights for policy 0, policy_version 44720 (0.0006) [2023-03-07 00:07:51,236][81074] Fps is (10 sec: 13312.1, 60 sec: 13209.6, 300 sec: 13200.9). Total num frames: 45800448. Throughput: 0: 13208.9. Samples: 45772421. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:07:51,237][81074] Avg episode reward: [(0, '2004.176')] [2023-03-07 00:07:51,480][81400] Updated weights for policy 0, policy_version 44730 (0.0006) [2023-03-07 00:07:52,249][81400] Updated weights for policy 0, policy_version 44740 (0.0007) [2023-03-07 00:07:53,037][81400] Updated weights for policy 0, policy_version 44750 (0.0005) [2023-03-07 00:07:53,819][81400] Updated weights for policy 0, policy_version 44760 (0.0006) [2023-03-07 00:07:54,609][81400] Updated weights for policy 0, policy_version 44770 (0.0006) [2023-03-07 00:07:55,384][81400] Updated weights for policy 0, policy_version 44780 (0.0006) [2023-03-07 00:07:56,161][81400] Updated weights for policy 0, policy_version 44790 (0.0006) [2023-03-07 00:07:56,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13209.6, 300 sec: 13197.5). Total num frames: 45865984. Throughput: 0: 13199.8. Samples: 45851190. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:07:56,237][81074] Avg episode reward: [(0, '1923.387')] [2023-03-07 00:07:56,242][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000044791_45865984.pth... 
[2023-03-07 00:07:56,273][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000041698_42698752.pth [2023-03-07 00:07:56,935][81400] Updated weights for policy 0, policy_version 44800 (0.0006) [2023-03-07 00:07:57,710][81400] Updated weights for policy 0, policy_version 44810 (0.0006) [2023-03-07 00:07:58,312][81349] KL-divergence is very high: 272.4395 [2023-03-07 00:07:58,473][81400] Updated weights for policy 0, policy_version 44820 (0.0007) [2023-03-07 00:07:59,246][81400] Updated weights for policy 0, policy_version 44830 (0.0006) [2023-03-07 00:08:00,013][81400] Updated weights for policy 0, policy_version 44840 (0.0006) [2023-03-07 00:08:00,787][81400] Updated weights for policy 0, policy_version 44850 (0.0006) [2023-03-07 00:08:01,236][81074] Fps is (10 sec: 13106.9, 60 sec: 13209.6, 300 sec: 13197.4). Total num frames: 45931520. Throughput: 0: 13203.1. Samples: 45930662. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:08:01,237][81074] Avg episode reward: [(0, '2048.803')] [2023-03-07 00:08:01,546][81400] Updated weights for policy 0, policy_version 44860 (0.0006) [2023-03-07 00:08:02,343][81400] Updated weights for policy 0, policy_version 44870 (0.0006) [2023-03-07 00:08:03,136][81400] Updated weights for policy 0, policy_version 44880 (0.0006) [2023-03-07 00:08:03,916][81400] Updated weights for policy 0, policy_version 44890 (0.0006) [2023-03-07 00:08:04,702][81400] Updated weights for policy 0, policy_version 44900 (0.0007) [2023-03-07 00:08:05,490][81400] Updated weights for policy 0, policy_version 44910 (0.0006) [2023-03-07 00:08:06,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13192.6, 300 sec: 13194.0). Total num frames: 45997056. Throughput: 0: 13197.8. Samples: 45969931. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:08:06,237][81074] Avg episode reward: [(0, '2119.081')] [2023-03-07 00:08:06,268][81400] Updated weights for policy 0, policy_version 44920 (0.0006) [2023-03-07 00:08:07,041][81400] Updated weights for policy 0, policy_version 44930 (0.0006) [2023-03-07 00:08:07,824][81400] Updated weights for policy 0, policy_version 44940 (0.0007) [2023-03-07 00:08:08,602][81400] Updated weights for policy 0, policy_version 44950 (0.0006) [2023-03-07 00:08:09,365][81400] Updated weights for policy 0, policy_version 44960 (0.0006) [2023-03-07 00:08:10,145][81400] Updated weights for policy 0, policy_version 44970 (0.0005) [2023-03-07 00:08:10,929][81400] Updated weights for policy 0, policy_version 44980 (0.0006) [2023-03-07 00:08:11,236][81074] Fps is (10 sec: 13107.4, 60 sec: 13175.5, 300 sec: 13194.0). Total num frames: 46062592. Throughput: 0: 13188.0. Samples: 46048719. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:08:11,237][81074] Avg episode reward: [(0, '2117.369')] [2023-03-07 00:08:11,712][81400] Updated weights for policy 0, policy_version 44990 (0.0007) [2023-03-07 00:08:12,497][81400] Updated weights for policy 0, policy_version 45000 (0.0007) [2023-03-07 00:08:13,277][81400] Updated weights for policy 0, policy_version 45010 (0.0007) [2023-03-07 00:08:14,025][81400] Updated weights for policy 0, policy_version 45020 (0.0006) [2023-03-07 00:08:14,807][81400] Updated weights for policy 0, policy_version 45030 (0.0006) [2023-03-07 00:08:15,574][81400] Updated weights for policy 0, policy_version 45040 (0.0006) [2023-03-07 00:08:16,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13192.5, 300 sec: 13194.0). Total num frames: 46129152. Throughput: 0: 13189.1. Samples: 46128092. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:08:16,237][81074] Avg episode reward: [(0, '2066.795')] [2023-03-07 00:08:16,346][81400] Updated weights for policy 0, policy_version 45050 (0.0007) [2023-03-07 00:08:17,134][81400] Updated weights for policy 0, policy_version 45060 (0.0007) [2023-03-07 00:08:17,896][81400] Updated weights for policy 0, policy_version 45070 (0.0007) [2023-03-07 00:08:18,681][81400] Updated weights for policy 0, policy_version 45080 (0.0006) [2023-03-07 00:08:19,466][81400] Updated weights for policy 0, policy_version 45090 (0.0006) [2023-03-07 00:08:20,242][81400] Updated weights for policy 0, policy_version 45100 (0.0006) [2023-03-07 00:08:21,033][81400] Updated weights for policy 0, policy_version 45110 (0.0006) [2023-03-07 00:08:21,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13194.0). Total num frames: 46194688. Throughput: 0: 13179.3. Samples: 46167452. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:08:21,247][81074] Avg episode reward: [(0, '2033.894')] [2023-03-07 00:08:21,798][81400] Updated weights for policy 0, policy_version 45120 (0.0006) [2023-03-07 00:08:22,572][81400] Updated weights for policy 0, policy_version 45130 (0.0007) [2023-03-07 00:08:23,349][81400] Updated weights for policy 0, policy_version 45140 (0.0006) [2023-03-07 00:08:24,125][81400] Updated weights for policy 0, policy_version 45150 (0.0006) [2023-03-07 00:08:24,911][81400] Updated weights for policy 0, policy_version 45160 (0.0006) [2023-03-07 00:08:25,678][81400] Updated weights for policy 0, policy_version 45170 (0.0006) [2023-03-07 00:08:26,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13192.5, 300 sec: 13197.5). Total num frames: 46261248. Throughput: 0: 13178.6. Samples: 46246412. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:08:26,247][81074] Avg episode reward: [(0, '1862.063')] [2023-03-07 00:08:26,468][81400] Updated weights for policy 0, policy_version 45180 (0.0007) [2023-03-07 00:08:27,247][81400] Updated weights for policy 0, policy_version 45190 (0.0006) [2023-03-07 00:08:28,014][81400] Updated weights for policy 0, policy_version 45200 (0.0006) [2023-03-07 00:08:28,783][81400] Updated weights for policy 0, policy_version 45210 (0.0007) [2023-03-07 00:08:29,569][81400] Updated weights for policy 0, policy_version 45220 (0.0007) [2023-03-07 00:08:30,345][81400] Updated weights for policy 0, policy_version 45230 (0.0006) [2023-03-07 00:08:31,112][81400] Updated weights for policy 0, policy_version 45240 (0.0006) [2023-03-07 00:08:31,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13194.0). Total num frames: 46326784. Throughput: 0: 13177.4. Samples: 46325516. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:08:31,246][81074] Avg episode reward: [(0, '2087.486')] [2023-03-07 00:08:31,902][81400] Updated weights for policy 0, policy_version 45250 (0.0007) [2023-03-07 00:08:32,669][81400] Updated weights for policy 0, policy_version 45260 (0.0006) [2023-03-07 00:08:33,438][81400] Updated weights for policy 0, policy_version 45270 (0.0006) [2023-03-07 00:08:34,225][81400] Updated weights for policy 0, policy_version 45280 (0.0008) [2023-03-07 00:08:34,993][81400] Updated weights for policy 0, policy_version 45290 (0.0006) [2023-03-07 00:08:35,766][81400] Updated weights for policy 0, policy_version 45300 (0.0006) [2023-03-07 00:08:36,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13175.5, 300 sec: 13190.5). Total num frames: 46392320. Throughput: 0: 13170.2. Samples: 46365080. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:08:36,247][81074] Avg episode reward: [(0, '2143.498')] [2023-03-07 00:08:36,552][81400] Updated weights for policy 0, policy_version 45310 (0.0006) [2023-03-07 00:08:37,315][81400] Updated weights for policy 0, policy_version 45320 (0.0006) [2023-03-07 00:08:38,107][81400] Updated weights for policy 0, policy_version 45330 (0.0005) [2023-03-07 00:08:38,888][81400] Updated weights for policy 0, policy_version 45340 (0.0006) [2023-03-07 00:08:39,669][81400] Updated weights for policy 0, policy_version 45350 (0.0006) [2023-03-07 00:08:40,450][81400] Updated weights for policy 0, policy_version 45360 (0.0006) [2023-03-07 00:08:41,217][81400] Updated weights for policy 0, policy_version 45370 (0.0006) [2023-03-07 00:08:41,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13190.5). Total num frames: 46458880. Throughput: 0: 13175.8. Samples: 46444102. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:08:41,248][81074] Avg episode reward: [(0, '2344.687')] [2023-03-07 00:08:42,005][81400] Updated weights for policy 0, policy_version 45380 (0.0007) [2023-03-07 00:08:42,777][81400] Updated weights for policy 0, policy_version 45390 (0.0006) [2023-03-07 00:08:43,570][81400] Updated weights for policy 0, policy_version 45400 (0.0006) [2023-03-07 00:08:44,325][81400] Updated weights for policy 0, policy_version 45410 (0.0007) [2023-03-07 00:08:45,113][81400] Updated weights for policy 0, policy_version 45420 (0.0006) [2023-03-07 00:08:45,894][81400] Updated weights for policy 0, policy_version 45430 (0.0006) [2023-03-07 00:08:46,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13175.5, 300 sec: 13190.5). Total num frames: 46524416. Throughput: 0: 13168.2. Samples: 46523230. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:08:46,237][81074] Avg episode reward: [(0, '2254.088')] [2023-03-07 00:08:46,683][81400] Updated weights for policy 0, policy_version 45440 (0.0006) [2023-03-07 00:08:47,450][81400] Updated weights for policy 0, policy_version 45450 (0.0006) [2023-03-07 00:08:48,235][81400] Updated weights for policy 0, policy_version 45460 (0.0006) [2023-03-07 00:08:49,009][81400] Updated weights for policy 0, policy_version 45470 (0.0006) [2023-03-07 00:08:49,784][81400] Updated weights for policy 0, policy_version 45480 (0.0006) [2023-03-07 00:08:50,549][81400] Updated weights for policy 0, policy_version 45490 (0.0005) [2023-03-07 00:08:51,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13190.5). Total num frames: 46589952. Throughput: 0: 13169.2. Samples: 46562546. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:08:51,237][81074] Avg episode reward: [(0, '2313.770')] [2023-03-07 00:08:51,328][81400] Updated weights for policy 0, policy_version 45500 (0.0006) [2023-03-07 00:08:52,093][81400] Updated weights for policy 0, policy_version 45510 (0.0006) [2023-03-07 00:08:52,870][81400] Updated weights for policy 0, policy_version 45520 (0.0006) [2023-03-07 00:08:53,653][81400] Updated weights for policy 0, policy_version 45530 (0.0006) [2023-03-07 00:08:54,438][81400] Updated weights for policy 0, policy_version 45540 (0.0006) [2023-03-07 00:08:55,212][81400] Updated weights for policy 0, policy_version 45550 (0.0006) [2023-03-07 00:08:55,991][81400] Updated weights for policy 0, policy_version 45560 (0.0006) [2023-03-07 00:08:56,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13194.0). Total num frames: 46656512. Throughput: 0: 13177.0. Samples: 46641682. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:08:56,237][81074] Avg episode reward: [(0, '2311.860')] [2023-03-07 00:08:56,770][81400] Updated weights for policy 0, policy_version 45570 (0.0006) [2023-03-07 00:08:57,560][81400] Updated weights for policy 0, policy_version 45580 (0.0007) [2023-03-07 00:08:58,326][81400] Updated weights for policy 0, policy_version 45590 (0.0005) [2023-03-07 00:08:59,110][81400] Updated weights for policy 0, policy_version 45600 (0.0006) [2023-03-07 00:08:59,887][81400] Updated weights for policy 0, policy_version 45610 (0.0006) [2023-03-07 00:09:00,653][81400] Updated weights for policy 0, policy_version 45620 (0.0005) [2023-03-07 00:09:01,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13190.5). Total num frames: 46722048. Throughput: 0: 13171.6. Samples: 46720813. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:09:01,237][81074] Avg episode reward: [(0, '2202.807')] [2023-03-07 00:09:01,436][81400] Updated weights for policy 0, policy_version 45630 (0.0006) [2023-03-07 00:09:02,209][81400] Updated weights for policy 0, policy_version 45640 (0.0006) [2023-03-07 00:09:03,001][81400] Updated weights for policy 0, policy_version 45650 (0.0006) [2023-03-07 00:09:03,785][81400] Updated weights for policy 0, policy_version 45660 (0.0006) [2023-03-07 00:09:04,553][81400] Updated weights for policy 0, policy_version 45670 (0.0006) [2023-03-07 00:09:05,311][81400] Updated weights for policy 0, policy_version 45680 (0.0006) [2023-03-07 00:09:06,105][81400] Updated weights for policy 0, policy_version 45690 (0.0006) [2023-03-07 00:09:06,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13175.4, 300 sec: 13190.5). Total num frames: 46787584. Throughput: 0: 13170.8. Samples: 46760137. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:09:06,237][81074] Avg episode reward: [(0, '2116.044')] [2023-03-07 00:09:06,874][81400] Updated weights for policy 0, policy_version 45700 (0.0006) [2023-03-07 00:09:07,665][81400] Updated weights for policy 0, policy_version 45710 (0.0006) [2023-03-07 00:09:08,445][81400] Updated weights for policy 0, policy_version 45720 (0.0007) [2023-03-07 00:09:09,222][81400] Updated weights for policy 0, policy_version 45730 (0.0006) [2023-03-07 00:09:10,000][81400] Updated weights for policy 0, policy_version 45740 (0.0005) [2023-03-07 00:09:10,770][81400] Updated weights for policy 0, policy_version 45750 (0.0006) [2023-03-07 00:09:11,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13175.5, 300 sec: 13190.5). Total num frames: 46853120. Throughput: 0: 13171.2. Samples: 46839117. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:09:11,237][81074] Avg episode reward: [(0, '2285.785')] [2023-03-07 00:09:11,564][81400] Updated weights for policy 0, policy_version 45760 (0.0006) [2023-03-07 00:09:12,340][81400] Updated weights for policy 0, policy_version 45770 (0.0005) [2023-03-07 00:09:13,105][81400] Updated weights for policy 0, policy_version 45780 (0.0005) [2023-03-07 00:09:13,885][81400] Updated weights for policy 0, policy_version 45790 (0.0005) [2023-03-07 00:09:14,678][81400] Updated weights for policy 0, policy_version 45800 (0.0005) [2023-03-07 00:09:15,444][81400] Updated weights for policy 0, policy_version 45810 (0.0005) [2023-03-07 00:09:16,236][81074] Fps is (10 sec: 13209.8, 60 sec: 13175.5, 300 sec: 13190.5). Total num frames: 46919680. Throughput: 0: 13164.9. Samples: 46917936. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:09:16,237][81074] Avg episode reward: [(0, '2254.339')] [2023-03-07 00:09:16,237][81400] Updated weights for policy 0, policy_version 45820 (0.0005) [2023-03-07 00:09:17,001][81400] Updated weights for policy 0, policy_version 45830 (0.0006) [2023-03-07 00:09:17,757][81400] Updated weights for policy 0, policy_version 45840 (0.0006) [2023-03-07 00:09:18,545][81400] Updated weights for policy 0, policy_version 45850 (0.0007) [2023-03-07 00:09:19,331][81400] Updated weights for policy 0, policy_version 45860 (0.0006) [2023-03-07 00:09:20,101][81400] Updated weights for policy 0, policy_version 45870 (0.0006) [2023-03-07 00:09:20,873][81400] Updated weights for policy 0, policy_version 45880 (0.0007) [2023-03-07 00:09:21,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13190.5). Total num frames: 46985216. Throughput: 0: 13171.9. Samples: 46957818. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:09:21,237][81074] Avg episode reward: [(0, '2319.896')] [2023-03-07 00:09:21,637][81400] Updated weights for policy 0, policy_version 45890 (0.0006) [2023-03-07 00:09:22,423][81400] Updated weights for policy 0, policy_version 45900 (0.0006) [2023-03-07 00:09:23,185][81400] Updated weights for policy 0, policy_version 45910 (0.0006) [2023-03-07 00:09:23,974][81400] Updated weights for policy 0, policy_version 45920 (0.0006) [2023-03-07 00:09:24,751][81400] Updated weights for policy 0, policy_version 45930 (0.0006) [2023-03-07 00:09:25,524][81400] Updated weights for policy 0, policy_version 45940 (0.0006) [2023-03-07 00:09:26,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13175.4, 300 sec: 13194.0). Total num frames: 47051776. Throughput: 0: 13175.9. Samples: 47037019. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:09:26,237][81074] Avg episode reward: [(0, '2157.964')] [2023-03-07 00:09:26,278][81400] Updated weights for policy 0, policy_version 45950 (0.0006) [2023-03-07 00:09:27,077][81400] Updated weights for policy 0, policy_version 45960 (0.0007) [2023-03-07 00:09:27,842][81400] Updated weights for policy 0, policy_version 45970 (0.0005) [2023-03-07 00:09:28,627][81400] Updated weights for policy 0, policy_version 45980 (0.0006) [2023-03-07 00:09:29,403][81400] Updated weights for policy 0, policy_version 45990 (0.0006) [2023-03-07 00:09:30,185][81400] Updated weights for policy 0, policy_version 46000 (0.0006) [2023-03-07 00:09:30,955][81400] Updated weights for policy 0, policy_version 46010 (0.0006) [2023-03-07 00:09:31,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13175.5, 300 sec: 13194.0). Total num frames: 47117312. Throughput: 0: 13178.4. Samples: 47116260. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:09:31,237][81074] Avg episode reward: [(0, '2295.714')] [2023-03-07 00:09:31,734][81400] Updated weights for policy 0, policy_version 46020 (0.0006) [2023-03-07 00:09:32,522][81400] Updated weights for policy 0, policy_version 46030 (0.0006) [2023-03-07 00:09:33,297][81400] Updated weights for policy 0, policy_version 46040 (0.0005) [2023-03-07 00:09:34,083][81400] Updated weights for policy 0, policy_version 46050 (0.0006) [2023-03-07 00:09:34,846][81400] Updated weights for policy 0, policy_version 46060 (0.0006) [2023-03-07 00:09:35,632][81400] Updated weights for policy 0, policy_version 46070 (0.0007) [2023-03-07 00:09:36,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13175.5, 300 sec: 13190.5). Total num frames: 47182848. Throughput: 0: 13176.7. Samples: 47155499. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:09:36,237][81074] Avg episode reward: [(0, '2190.220')] [2023-03-07 00:09:36,397][81400] Updated weights for policy 0, policy_version 46080 (0.0006) [2023-03-07 00:09:37,179][81400] Updated weights for policy 0, policy_version 46090 (0.0006) [2023-03-07 00:09:37,945][81400] Updated weights for policy 0, policy_version 46100 (0.0006) [2023-03-07 00:09:38,732][81400] Updated weights for policy 0, policy_version 46110 (0.0005) [2023-03-07 00:09:39,518][81400] Updated weights for policy 0, policy_version 46120 (0.0006) [2023-03-07 00:09:40,290][81400] Updated weights for policy 0, policy_version 46130 (0.0007) [2023-03-07 00:09:41,056][81400] Updated weights for policy 0, policy_version 46140 (0.0006) [2023-03-07 00:09:41,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13194.0). Total num frames: 47249408. Throughput: 0: 13180.0. Samples: 47234782. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:09:41,237][81074] Avg episode reward: [(0, '2210.907')] [2023-03-07 00:09:41,829][81400] Updated weights for policy 0, policy_version 46150 (0.0005) [2023-03-07 00:09:42,610][81400] Updated weights for policy 0, policy_version 46160 (0.0007) [2023-03-07 00:09:43,382][81400] Updated weights for policy 0, policy_version 46170 (0.0006) [2023-03-07 00:09:44,154][81400] Updated weights for policy 0, policy_version 46180 (0.0006) [2023-03-07 00:09:44,932][81400] Updated weights for policy 0, policy_version 46190 (0.0006) [2023-03-07 00:09:45,695][81400] Updated weights for policy 0, policy_version 46200 (0.0005) [2023-03-07 00:09:46,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13175.5, 300 sec: 13190.5). Total num frames: 47314944. Throughput: 0: 13182.1. Samples: 47314011. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:09:46,237][81074] Avg episode reward: [(0, '2329.046')] [2023-03-07 00:09:46,494][81400] Updated weights for policy 0, policy_version 46210 (0.0006) [2023-03-07 00:09:47,258][81400] Updated weights for policy 0, policy_version 46220 (0.0005) [2023-03-07 00:09:48,066][81400] Updated weights for policy 0, policy_version 46230 (0.0005) [2023-03-07 00:09:48,837][81400] Updated weights for policy 0, policy_version 46240 (0.0006) [2023-03-07 00:09:49,609][81400] Updated weights for policy 0, policy_version 46250 (0.0006) [2023-03-07 00:09:50,371][81400] Updated weights for policy 0, policy_version 46260 (0.0006) [2023-03-07 00:09:51,175][81400] Updated weights for policy 0, policy_version 46270 (0.0006) [2023-03-07 00:09:51,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13192.5, 300 sec: 13190.5). Total num frames: 47381504. Throughput: 0: 13186.7. Samples: 47353539. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:09:51,237][81074] Avg episode reward: [(0, '2314.454')] [2023-03-07 00:09:51,953][81400] Updated weights for policy 0, policy_version 46280 (0.0007) [2023-03-07 00:09:52,730][81400] Updated weights for policy 0, policy_version 46290 (0.0007) [2023-03-07 00:09:53,513][81400] Updated weights for policy 0, policy_version 46300 (0.0006) [2023-03-07 00:09:54,291][81400] Updated weights for policy 0, policy_version 46310 (0.0006) [2023-03-07 00:09:55,053][81400] Updated weights for policy 0, policy_version 46320 (0.0006) [2023-03-07 00:09:55,829][81400] Updated weights for policy 0, policy_version 46330 (0.0006) [2023-03-07 00:09:56,236][81074] Fps is (10 sec: 13209.8, 60 sec: 13175.5, 300 sec: 13187.0). Total num frames: 47447040. Throughput: 0: 13181.9. Samples: 47432304. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:09:56,247][81074] Avg episode reward: [(0, '2344.176')] [2023-03-07 00:09:56,253][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000046335_47447040.pth... [2023-03-07 00:09:56,283][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000043244_44281856.pth [2023-03-07 00:09:56,612][81400] Updated weights for policy 0, policy_version 46340 (0.0005) [2023-03-07 00:09:57,402][81400] Updated weights for policy 0, policy_version 46350 (0.0006) [2023-03-07 00:09:58,174][81400] Updated weights for policy 0, policy_version 46360 (0.0006) [2023-03-07 00:09:58,950][81400] Updated weights for policy 0, policy_version 46370 (0.0005) [2023-03-07 00:09:59,731][81400] Updated weights for policy 0, policy_version 46380 (0.0006) [2023-03-07 00:10:00,525][81400] Updated weights for policy 0, policy_version 46390 (0.0007) [2023-03-07 00:10:01,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13175.4, 300 sec: 13190.5). Total num frames: 47512576. Throughput: 0: 13180.7. Samples: 47511070. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:10:01,247][81074] Avg episode reward: [(0, '2150.541')] [2023-03-07 00:10:01,294][81400] Updated weights for policy 0, policy_version 46400 (0.0005) [2023-03-07 00:10:02,080][81400] Updated weights for policy 0, policy_version 46410 (0.0006) [2023-03-07 00:10:02,869][81400] Updated weights for policy 0, policy_version 46420 (0.0006) [2023-03-07 00:10:03,642][81400] Updated weights for policy 0, policy_version 46430 (0.0005) [2023-03-07 00:10:04,421][81400] Updated weights for policy 0, policy_version 46440 (0.0006) [2023-03-07 00:10:05,190][81400] Updated weights for policy 0, policy_version 46450 (0.0006) [2023-03-07 00:10:05,966][81400] Updated weights for policy 0, policy_version 46460 (0.0006) [2023-03-07 00:10:06,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13175.5, 300 sec: 13187.0). Total num frames: 47578112. Throughput: 0: 13171.5. Samples: 47550537. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:10:06,247][81074] Avg episode reward: [(0, '2436.512')] [2023-03-07 00:10:06,744][81400] Updated weights for policy 0, policy_version 46470 (0.0006) [2023-03-07 00:10:07,519][81400] Updated weights for policy 0, policy_version 46480 (0.0006) [2023-03-07 00:10:08,306][81400] Updated weights for policy 0, policy_version 46490 (0.0006) [2023-03-07 00:10:09,077][81400] Updated weights for policy 0, policy_version 46500 (0.0007) [2023-03-07 00:10:09,873][81400] Updated weights for policy 0, policy_version 46510 (0.0006) [2023-03-07 00:10:10,636][81400] Updated weights for policy 0, policy_version 46520 (0.0006) [2023-03-07 00:10:11,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13175.5, 300 sec: 13183.6). Total num frames: 47643648. Throughput: 0: 13165.5. Samples: 47629465. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:10:11,247][81074] Avg episode reward: [(0, '2173.153')] [2023-03-07 00:10:11,402][81400] Updated weights for policy 0, policy_version 46530 (0.0007) [2023-03-07 00:10:12,192][81400] Updated weights for policy 0, policy_version 46540 (0.0006) [2023-03-07 00:10:12,965][81400] Updated weights for policy 0, policy_version 46550 (0.0006) [2023-03-07 00:10:13,751][81400] Updated weights for policy 0, policy_version 46560 (0.0006) [2023-03-07 00:10:14,534][81400] Updated weights for policy 0, policy_version 46570 (0.0006) [2023-03-07 00:10:15,310][81400] Updated weights for policy 0, policy_version 46580 (0.0006) [2023-03-07 00:10:16,088][81400] Updated weights for policy 0, policy_version 46590 (0.0006) [2023-03-07 00:10:16,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13183.6). Total num frames: 47709184. Throughput: 0: 13160.8. Samples: 47708494. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:10:16,247][81074] Avg episode reward: [(0, '2131.314')] [2023-03-07 00:10:16,850][81400] Updated weights for policy 0, policy_version 46600 (0.0006) [2023-03-07 00:10:17,623][81400] Updated weights for policy 0, policy_version 46610 (0.0006) [2023-03-07 00:10:18,394][81400] Updated weights for policy 0, policy_version 46620 (0.0005) [2023-03-07 00:10:19,191][81400] Updated weights for policy 0, policy_version 46630 (0.0006) [2023-03-07 00:10:19,967][81400] Updated weights for policy 0, policy_version 46640 (0.0006) [2023-03-07 00:10:20,743][81400] Updated weights for policy 0, policy_version 46650 (0.0007) [2023-03-07 00:10:21,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13183.6). Total num frames: 47775744. Throughput: 0: 13170.6. Samples: 47748174. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:10:21,237][81074] Avg episode reward: [(0, '2417.074')] [2023-03-07 00:10:21,516][81400] Updated weights for policy 0, policy_version 46660 (0.0006) [2023-03-07 00:10:22,315][81400] Updated weights for policy 0, policy_version 46670 (0.0007) [2023-03-07 00:10:23,090][81400] Updated weights for policy 0, policy_version 46680 (0.0006) [2023-03-07 00:10:23,854][81400] Updated weights for policy 0, policy_version 46690 (0.0006) [2023-03-07 00:10:24,637][81400] Updated weights for policy 0, policy_version 46700 (0.0006) [2023-03-07 00:10:25,434][81400] Updated weights for policy 0, policy_version 46710 (0.0006) [2023-03-07 00:10:26,208][81400] Updated weights for policy 0, policy_version 46720 (0.0007) [2023-03-07 00:10:26,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13183.6). Total num frames: 47841280. Throughput: 0: 13157.7. Samples: 47826879. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:10:26,237][81074] Avg episode reward: [(0, '2282.972')] [2023-03-07 00:10:26,994][81400] Updated weights for policy 0, policy_version 46730 (0.0007) [2023-03-07 00:10:27,787][81400] Updated weights for policy 0, policy_version 46740 (0.0007) [2023-03-07 00:10:28,582][81400] Updated weights for policy 0, policy_version 46750 (0.0006) [2023-03-07 00:10:29,354][81400] Updated weights for policy 0, policy_version 46760 (0.0006) [2023-03-07 00:10:30,129][81400] Updated weights for policy 0, policy_version 46770 (0.0006) [2023-03-07 00:10:30,911][81400] Updated weights for policy 0, policy_version 46780 (0.0006) [2023-03-07 00:10:31,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13180.1). Total num frames: 47906816. Throughput: 0: 13138.2. Samples: 47905230. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:10:31,237][81074] Avg episode reward: [(0, '2264.976')] [2023-03-07 00:10:31,689][81400] Updated weights for policy 0, policy_version 46790 (0.0006) [2023-03-07 00:10:32,452][81400] Updated weights for policy 0, policy_version 46800 (0.0006) [2023-03-07 00:10:33,243][81400] Updated weights for policy 0, policy_version 46810 (0.0006) [2023-03-07 00:10:34,008][81400] Updated weights for policy 0, policy_version 46820 (0.0006) [2023-03-07 00:10:34,781][81400] Updated weights for policy 0, policy_version 46830 (0.0006) [2023-03-07 00:10:35,543][81400] Updated weights for policy 0, policy_version 46840 (0.0006) [2023-03-07 00:10:36,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13180.1). Total num frames: 47972352. Throughput: 0: 13140.1. Samples: 47944845. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:10:36,237][81074] Avg episode reward: [(0, '2338.104')] [2023-03-07 00:10:36,320][81400] Updated weights for policy 0, policy_version 46850 (0.0006) [2023-03-07 00:10:37,107][81400] Updated weights for policy 0, policy_version 46860 (0.0006) [2023-03-07 00:10:37,870][81400] Updated weights for policy 0, policy_version 46870 (0.0006) [2023-03-07 00:10:38,643][81400] Updated weights for policy 0, policy_version 46880 (0.0006) [2023-03-07 00:10:39,431][81400] Updated weights for policy 0, policy_version 46890 (0.0006) [2023-03-07 00:10:40,236][81400] Updated weights for policy 0, policy_version 46900 (0.0006) [2023-03-07 00:10:40,997][81400] Updated weights for policy 0, policy_version 46910 (0.0006) [2023-03-07 00:10:41,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13183.6). Total num frames: 48038912. Throughput: 0: 13151.6. Samples: 48024126. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:10:41,237][81074] Avg episode reward: [(0, '2422.906')] [2023-03-07 00:10:41,790][81400] Updated weights for policy 0, policy_version 46920 (0.0006) [2023-03-07 00:10:42,556][81400] Updated weights for policy 0, policy_version 46930 (0.0007) [2023-03-07 00:10:43,328][81400] Updated weights for policy 0, policy_version 46940 (0.0006) [2023-03-07 00:10:44,109][81400] Updated weights for policy 0, policy_version 46950 (0.0006) [2023-03-07 00:10:44,884][81400] Updated weights for policy 0, policy_version 46960 (0.0006) [2023-03-07 00:10:45,661][81400] Updated weights for policy 0, policy_version 46970 (0.0008) [2023-03-07 00:10:46,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13180.1). Total num frames: 48104448. Throughput: 0: 13158.6. Samples: 48103205. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:10:46,237][81074] Avg episode reward: [(0, '2526.526')] [2023-03-07 00:10:46,425][81400] Updated weights for policy 0, policy_version 46980 (0.0006) [2023-03-07 00:10:47,206][81400] Updated weights for policy 0, policy_version 46990 (0.0006) [2023-03-07 00:10:47,967][81400] Updated weights for policy 0, policy_version 47000 (0.0006) [2023-03-07 00:10:48,738][81400] Updated weights for policy 0, policy_version 47010 (0.0007) [2023-03-07 00:10:49,522][81400] Updated weights for policy 0, policy_version 47020 (0.0006) [2023-03-07 00:10:50,306][81400] Updated weights for policy 0, policy_version 47030 (0.0005) [2023-03-07 00:10:51,082][81400] Updated weights for policy 0, policy_version 47040 (0.0006) [2023-03-07 00:10:51,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13180.1). Total num frames: 48169984. Throughput: 0: 13168.0. Samples: 48143097. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:10:51,237][81074] Avg episode reward: [(0, '2283.121')] [2023-03-07 00:10:51,869][81400] Updated weights for policy 0, policy_version 47050 (0.0006) [2023-03-07 00:10:52,622][81400] Updated weights for policy 0, policy_version 47060 (0.0006) [2023-03-07 00:10:53,396][81400] Updated weights for policy 0, policy_version 47070 (0.0006) [2023-03-07 00:10:54,183][81400] Updated weights for policy 0, policy_version 47080 (0.0006) [2023-03-07 00:10:54,958][81400] Updated weights for policy 0, policy_version 47090 (0.0007) [2023-03-07 00:10:55,732][81400] Updated weights for policy 0, policy_version 47100 (0.0006) [2023-03-07 00:10:56,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13183.6). Total num frames: 48236544. Throughput: 0: 13167.9. Samples: 48222019. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:10:56,237][81074] Avg episode reward: [(0, '2629.706')] [2023-03-07 00:10:56,519][81400] Updated weights for policy 0, policy_version 47110 (0.0007) [2023-03-07 00:10:57,266][81400] Updated weights for policy 0, policy_version 47120 (0.0006) [2023-03-07 00:10:58,050][81400] Updated weights for policy 0, policy_version 47130 (0.0006) [2023-03-07 00:10:58,823][81400] Updated weights for policy 0, policy_version 47140 (0.0006) [2023-03-07 00:10:59,607][81400] Updated weights for policy 0, policy_version 47150 (0.0006) [2023-03-07 00:11:00,389][81400] Updated weights for policy 0, policy_version 47160 (0.0006) [2023-03-07 00:11:01,167][81400] Updated weights for policy 0, policy_version 47170 (0.0006) [2023-03-07 00:11:01,236][81074] Fps is (10 sec: 13312.2, 60 sec: 13175.5, 300 sec: 13183.6). Total num frames: 48303104. Throughput: 0: 13173.9. Samples: 48301317. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:11:01,237][81074] Avg episode reward: [(0, '2497.371')] [2023-03-07 00:11:01,951][81400] Updated weights for policy 0, policy_version 47180 (0.0006) [2023-03-07 00:11:02,718][81400] Updated weights for policy 0, policy_version 47190 (0.0006) [2023-03-07 00:11:03,483][81400] Updated weights for policy 0, policy_version 47200 (0.0006) [2023-03-07 00:11:04,259][81400] Updated weights for policy 0, policy_version 47210 (0.0006) [2023-03-07 00:11:05,042][81400] Updated weights for policy 0, policy_version 47220 (0.0006) [2023-03-07 00:11:05,825][81400] Updated weights for policy 0, policy_version 47230 (0.0007) [2023-03-07 00:11:06,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13183.6). Total num frames: 48368640. Throughput: 0: 13170.4. Samples: 48340843. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:11:06,237][81074] Avg episode reward: [(0, '2328.795')] [2023-03-07 00:11:06,606][81400] Updated weights for policy 0, policy_version 47240 (0.0006) [2023-03-07 00:11:07,390][81400] Updated weights for policy 0, policy_version 47250 (0.0006) [2023-03-07 00:11:08,163][81400] Updated weights for policy 0, policy_version 47260 (0.0006) [2023-03-07 00:11:08,935][81400] Updated weights for policy 0, policy_version 47270 (0.0006) [2023-03-07 00:11:09,711][81400] Updated weights for policy 0, policy_version 47280 (0.0006) [2023-03-07 00:11:10,489][81400] Updated weights for policy 0, policy_version 47290 (0.0006) [2023-03-07 00:11:11,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13175.5, 300 sec: 13183.6). Total num frames: 48434176. Throughput: 0: 13183.7. Samples: 48420144. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:11:11,237][81074] Avg episode reward: [(0, '2481.857')] [2023-03-07 00:11:11,258][81400] Updated weights for policy 0, policy_version 47300 (0.0005) [2023-03-07 00:11:12,044][81400] Updated weights for policy 0, policy_version 47310 (0.0006) [2023-03-07 00:11:12,813][81400] Updated weights for policy 0, policy_version 47320 (0.0006) [2023-03-07 00:11:13,585][81400] Updated weights for policy 0, policy_version 47330 (0.0007) [2023-03-07 00:11:14,360][81400] Updated weights for policy 0, policy_version 47340 (0.0006) [2023-03-07 00:11:15,158][81400] Updated weights for policy 0, policy_version 47350 (0.0006) [2023-03-07 00:11:15,926][81400] Updated weights for policy 0, policy_version 47360 (0.0006) [2023-03-07 00:11:16,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13192.5, 300 sec: 13183.6). Total num frames: 48500736. Throughput: 0: 13197.2. Samples: 48499104. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:11:16,237][81074] Avg episode reward: [(0, '2273.408')] [2023-03-07 00:11:16,707][81400] Updated weights for policy 0, policy_version 47370 (0.0007) [2023-03-07 00:11:17,501][81400] Updated weights for policy 0, policy_version 47380 (0.0006) [2023-03-07 00:11:18,267][81400] Updated weights for policy 0, policy_version 47390 (0.0006) [2023-03-07 00:11:19,023][81400] Updated weights for policy 0, policy_version 47400 (0.0006) [2023-03-07 00:11:19,818][81400] Updated weights for policy 0, policy_version 47410 (0.0005) [2023-03-07 00:11:20,607][81400] Updated weights for policy 0, policy_version 47420 (0.0006) [2023-03-07 00:11:21,236][81074] Fps is (10 sec: 13209.8, 60 sec: 13175.5, 300 sec: 13183.6). Total num frames: 48566272. Throughput: 0: 13197.9. Samples: 48538751. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:11:21,237][81074] Avg episode reward: [(0, '2330.239')] [2023-03-07 00:11:21,377][81400] Updated weights for policy 0, policy_version 47430 (0.0007) [2023-03-07 00:11:22,136][81400] Updated weights for policy 0, policy_version 47440 (0.0005) [2023-03-07 00:11:22,909][81400] Updated weights for policy 0, policy_version 47450 (0.0006) [2023-03-07 00:11:23,689][81400] Updated weights for policy 0, policy_version 47460 (0.0006) [2023-03-07 00:11:24,452][81400] Updated weights for policy 0, policy_version 47470 (0.0006) [2023-03-07 00:11:25,233][81400] Updated weights for policy 0, policy_version 47480 (0.0005) [2023-03-07 00:11:26,003][81400] Updated weights for policy 0, policy_version 47490 (0.0006) [2023-03-07 00:11:26,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13175.5, 300 sec: 13180.1). Total num frames: 48631808. Throughput: 0: 13190.7. Samples: 48617707. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:11:26,237][81074] Avg episode reward: [(0, '2239.559')] [2023-03-07 00:11:26,808][81400] Updated weights for policy 0, policy_version 47500 (0.0006) [2023-03-07 00:11:27,579][81400] Updated weights for policy 0, policy_version 47510 (0.0006) [2023-03-07 00:11:28,353][81400] Updated weights for policy 0, policy_version 47520 (0.0006) [2023-03-07 00:11:29,138][81400] Updated weights for policy 0, policy_version 47530 (0.0007) [2023-03-07 00:11:29,917][81400] Updated weights for policy 0, policy_version 47540 (0.0006) [2023-03-07 00:11:30,698][81400] Updated weights for policy 0, policy_version 47550 (0.0007) [2023-03-07 00:11:31,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13183.6). Total num frames: 48698368. Throughput: 0: 13187.2. Samples: 48696628. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:11:31,237][81074] Avg episode reward: [(0, '2497.530')] [2023-03-07 00:11:31,466][81400] Updated weights for policy 0, policy_version 47560 (0.0006) [2023-03-07 00:11:32,239][81400] Updated weights for policy 0, policy_version 47570 (0.0006) [2023-03-07 00:11:33,033][81400] Updated weights for policy 0, policy_version 47580 (0.0006) [2023-03-07 00:11:33,801][81400] Updated weights for policy 0, policy_version 47590 (0.0006) [2023-03-07 00:11:34,576][81400] Updated weights for policy 0, policy_version 47600 (0.0006) [2023-03-07 00:11:35,338][81400] Updated weights for policy 0, policy_version 47610 (0.0007) [2023-03-07 00:11:36,105][81400] Updated weights for policy 0, policy_version 47620 (0.0006) [2023-03-07 00:11:36,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13180.1). Total num frames: 48763904. Throughput: 0: 13182.1. Samples: 48736291. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:11:36,237][81074] Avg episode reward: [(0, '2463.948')] [2023-03-07 00:11:36,895][81400] Updated weights for policy 0, policy_version 47630 (0.0006) [2023-03-07 00:11:37,679][81400] Updated weights for policy 0, policy_version 47640 (0.0006) [2023-03-07 00:11:38,437][81400] Updated weights for policy 0, policy_version 47650 (0.0006) [2023-03-07 00:11:39,238][81400] Updated weights for policy 0, policy_version 47660 (0.0006) [2023-03-07 00:11:40,026][81400] Updated weights for policy 0, policy_version 47670 (0.0006) [2023-03-07 00:11:40,806][81400] Updated weights for policy 0, policy_version 47680 (0.0006) [2023-03-07 00:11:41,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13175.5, 300 sec: 13180.1). Total num frames: 48829440. Throughput: 0: 13180.1. Samples: 48815122. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:11:41,237][81074] Avg episode reward: [(0, '2440.816')] [2023-03-07 00:11:41,577][81400] Updated weights for policy 0, policy_version 47690 (0.0006) [2023-03-07 00:11:42,364][81400] Updated weights for policy 0, policy_version 47700 (0.0006) [2023-03-07 00:11:43,145][81400] Updated weights for policy 0, policy_version 47710 (0.0006) [2023-03-07 00:11:43,926][81400] Updated weights for policy 0, policy_version 47720 (0.0006) [2023-03-07 00:11:44,703][81400] Updated weights for policy 0, policy_version 47730 (0.0005) [2023-03-07 00:11:45,482][81400] Updated weights for policy 0, policy_version 47740 (0.0006) [2023-03-07 00:11:46,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13175.5, 300 sec: 13176.6). Total num frames: 48894976. Throughput: 0: 13169.8. Samples: 48893959. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:11:46,237][81074] Avg episode reward: [(0, '2400.692')] [2023-03-07 00:11:46,258][81400] Updated weights for policy 0, policy_version 47750 (0.0006) [2023-03-07 00:11:47,054][81400] Updated weights for policy 0, policy_version 47760 (0.0005) [2023-03-07 00:11:47,837][81400] Updated weights for policy 0, policy_version 47770 (0.0006) [2023-03-07 00:11:48,623][81400] Updated weights for policy 0, policy_version 47780 (0.0007) [2023-03-07 00:11:49,391][81400] Updated weights for policy 0, policy_version 47790 (0.0006) [2023-03-07 00:11:50,161][81400] Updated weights for policy 0, policy_version 47800 (0.0006) [2023-03-07 00:11:50,943][81400] Updated weights for policy 0, policy_version 47810 (0.0006) [2023-03-07 00:11:51,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13175.5, 300 sec: 13176.6). Total num frames: 48960512. Throughput: 0: 13164.1. Samples: 48933227. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:11:51,237][81074] Avg episode reward: [(0, '2387.756')] [2023-03-07 00:11:51,714][81400] Updated weights for policy 0, policy_version 47820 (0.0006) [2023-03-07 00:11:52,504][81400] Updated weights for policy 0, policy_version 47830 (0.0006) [2023-03-07 00:11:53,258][81400] Updated weights for policy 0, policy_version 47840 (0.0006) [2023-03-07 00:11:54,044][81400] Updated weights for policy 0, policy_version 47850 (0.0007) [2023-03-07 00:11:54,805][81400] Updated weights for policy 0, policy_version 47860 (0.0006) [2023-03-07 00:11:55,588][81400] Updated weights for policy 0, policy_version 47870 (0.0007) [2023-03-07 00:11:56,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13175.5, 300 sec: 13180.1). Total num frames: 49027072. Throughput: 0: 13167.4. Samples: 49012678. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:11:56,237][81074] Avg episode reward: [(0, '2392.836')] [2023-03-07 00:11:56,242][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000047878_49027072.pth... 
[2023-03-07 00:11:56,271][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000044791_45865984.pth [2023-03-07 00:11:56,365][81400] Updated weights for policy 0, policy_version 47880 (0.0006) [2023-03-07 00:11:57,155][81400] Updated weights for policy 0, policy_version 47890 (0.0006) [2023-03-07 00:11:57,931][81400] Updated weights for policy 0, policy_version 47900 (0.0007) [2023-03-07 00:11:58,704][81400] Updated weights for policy 0, policy_version 47910 (0.0006) [2023-03-07 00:11:59,493][81400] Updated weights for policy 0, policy_version 47920 (0.0006) [2023-03-07 00:12:00,260][81400] Updated weights for policy 0, policy_version 47930 (0.0006) [2023-03-07 00:12:01,041][81400] Updated weights for policy 0, policy_version 47940 (0.0006) [2023-03-07 00:12:01,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13176.6). Total num frames: 49092608. Throughput: 0: 13165.0. Samples: 49091527. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:12:01,237][81074] Avg episode reward: [(0, '2427.394')] [2023-03-07 00:12:01,819][81400] Updated weights for policy 0, policy_version 47950 (0.0006) [2023-03-07 00:12:02,595][81400] Updated weights for policy 0, policy_version 47960 (0.0006) [2023-03-07 00:12:03,377][81400] Updated weights for policy 0, policy_version 47970 (0.0005) [2023-03-07 00:12:04,162][81400] Updated weights for policy 0, policy_version 47980 (0.0006) [2023-03-07 00:12:04,916][81400] Updated weights for policy 0, policy_version 47990 (0.0006) [2023-03-07 00:12:05,710][81400] Updated weights for policy 0, policy_version 48000 (0.0006) [2023-03-07 00:12:06,236][81074] Fps is (10 sec: 13209.8, 60 sec: 13175.5, 300 sec: 13176.6). Total num frames: 49159168. Throughput: 0: 13158.3. Samples: 49130875. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:12:06,237][81074] Avg episode reward: [(0, '2423.258')] [2023-03-07 00:12:06,488][81400] Updated weights for policy 0, policy_version 48010 (0.0006) [2023-03-07 00:12:07,258][81400] Updated weights for policy 0, policy_version 48020 (0.0008) [2023-03-07 00:12:08,039][81400] Updated weights for policy 0, policy_version 48030 (0.0006) [2023-03-07 00:12:08,814][81400] Updated weights for policy 0, policy_version 48040 (0.0006) [2023-03-07 00:12:09,581][81400] Updated weights for policy 0, policy_version 48050 (0.0007) [2023-03-07 00:12:10,356][81400] Updated weights for policy 0, policy_version 48060 (0.0006) [2023-03-07 00:12:11,130][81400] Updated weights for policy 0, policy_version 48070 (0.0006) [2023-03-07 00:12:11,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13176.6). Total num frames: 49224704. Throughput: 0: 13165.4. Samples: 49210147. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:12:11,237][81074] Avg episode reward: [(0, '2321.876')] [2023-03-07 00:12:11,877][81400] Updated weights for policy 0, policy_version 48080 (0.0006) [2023-03-07 00:12:12,664][81400] Updated weights for policy 0, policy_version 48090 (0.0007) [2023-03-07 00:12:13,456][81400] Updated weights for policy 0, policy_version 48100 (0.0006) [2023-03-07 00:12:14,227][81400] Updated weights for policy 0, policy_version 48110 (0.0006) [2023-03-07 00:12:14,990][81400] Updated weights for policy 0, policy_version 48120 (0.0006) [2023-03-07 00:12:15,774][81400] Updated weights for policy 0, policy_version 48130 (0.0006) [2023-03-07 00:12:16,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13175.5, 300 sec: 13176.6). Total num frames: 49291264. Throughput: 0: 13174.0. Samples: 49289458. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:12:16,237][81074] Avg episode reward: [(0, '2532.075')] [2023-03-07 00:12:16,551][81400] Updated weights for policy 0, policy_version 48140 (0.0006) [2023-03-07 00:12:17,316][81400] Updated weights for policy 0, policy_version 48150 (0.0006) [2023-03-07 00:12:18,082][81400] Updated weights for policy 0, policy_version 48160 (0.0006) [2023-03-07 00:12:18,877][81400] Updated weights for policy 0, policy_version 48170 (0.0006) [2023-03-07 00:12:19,652][81400] Updated weights for policy 0, policy_version 48180 (0.0006) [2023-03-07 00:12:20,424][81400] Updated weights for policy 0, policy_version 48190 (0.0007) [2023-03-07 00:12:21,202][81400] Updated weights for policy 0, policy_version 48200 (0.0006) [2023-03-07 00:12:21,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13175.4, 300 sec: 13176.6). Total num frames: 49356800. Throughput: 0: 13175.0. Samples: 49329166. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:12:21,237][81074] Avg episode reward: [(0, '2374.003')] [2023-03-07 00:12:21,974][81400] Updated weights for policy 0, policy_version 48210 (0.0006) [2023-03-07 00:12:22,748][81400] Updated weights for policy 0, policy_version 48220 (0.0006) [2023-03-07 00:12:23,548][81400] Updated weights for policy 0, policy_version 48230 (0.0006) [2023-03-07 00:12:24,315][81400] Updated weights for policy 0, policy_version 48240 (0.0007) [2023-03-07 00:12:25,086][81400] Updated weights for policy 0, policy_version 48250 (0.0006) [2023-03-07 00:12:25,872][81400] Updated weights for policy 0, policy_version 48260 (0.0006) [2023-03-07 00:12:26,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13175.5, 300 sec: 13173.1). Total num frames: 49422336. Throughput: 0: 13176.7. Samples: 49408073. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:12:26,237][81074] Avg episode reward: [(0, '2312.447')] [2023-03-07 00:12:26,645][81400] Updated weights for policy 0, policy_version 48270 (0.0006) [2023-03-07 00:12:27,394][81400] Updated weights for policy 0, policy_version 48280 (0.0006) [2023-03-07 00:12:28,186][81400] Updated weights for policy 0, policy_version 48290 (0.0006) [2023-03-07 00:12:28,978][81400] Updated weights for policy 0, policy_version 48300 (0.0006) [2023-03-07 00:12:29,754][81400] Updated weights for policy 0, policy_version 48310 (0.0007) [2023-03-07 00:12:30,539][81400] Updated weights for policy 0, policy_version 48320 (0.0006) [2023-03-07 00:12:31,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13176.6). Total num frames: 49488896. Throughput: 0: 13182.1. Samples: 49487153. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:12:31,237][81074] Avg episode reward: [(0, '2262.909')] [2023-03-07 00:12:31,310][81400] Updated weights for policy 0, policy_version 48330 (0.0006) [2023-03-07 00:12:32,085][81400] Updated weights for policy 0, policy_version 48340 (0.0006) [2023-03-07 00:12:32,865][81400] Updated weights for policy 0, policy_version 48350 (0.0006) [2023-03-07 00:12:33,649][81400] Updated weights for policy 0, policy_version 48360 (0.0007) [2023-03-07 00:12:34,418][81400] Updated weights for policy 0, policy_version 48370 (0.0006) [2023-03-07 00:12:35,180][81400] Updated weights for policy 0, policy_version 48380 (0.0006) [2023-03-07 00:12:35,991][81400] Updated weights for policy 0, policy_version 48390 (0.0006) [2023-03-07 00:12:36,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13176.6). Total num frames: 49554432. Throughput: 0: 13187.7. Samples: 49526673. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:12:36,237][81074] Avg episode reward: [(0, '2342.624')] [2023-03-07 00:12:36,748][81400] Updated weights for policy 0, policy_version 48400 (0.0006) [2023-03-07 00:12:37,513][81400] Updated weights for policy 0, policy_version 48410 (0.0006) [2023-03-07 00:12:38,315][81400] Updated weights for policy 0, policy_version 48420 (0.0006) [2023-03-07 00:12:39,094][81400] Updated weights for policy 0, policy_version 48430 (0.0007) [2023-03-07 00:12:39,885][81400] Updated weights for policy 0, policy_version 48440 (0.0006) [2023-03-07 00:12:40,655][81400] Updated weights for policy 0, policy_version 48450 (0.0006) [2023-03-07 00:12:41,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13175.5, 300 sec: 13173.2). Total num frames: 49619968. Throughput: 0: 13179.7. Samples: 49605765. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 00:12:41,237][81074] Avg episode reward: [(0, '2230.619')] [2023-03-07 00:12:41,431][81400] Updated weights for policy 0, policy_version 48460 (0.0006) [2023-03-07 00:12:42,196][81400] Updated weights for policy 0, policy_version 48470 (0.0006) [2023-03-07 00:12:42,981][81400] Updated weights for policy 0, policy_version 48480 (0.0006) [2023-03-07 00:12:43,766][81400] Updated weights for policy 0, policy_version 48490 (0.0006) [2023-03-07 00:12:44,540][81400] Updated weights for policy 0, policy_version 48500 (0.0006) [2023-03-07 00:12:45,320][81400] Updated weights for policy 0, policy_version 48510 (0.0006) [2023-03-07 00:12:46,092][81400] Updated weights for policy 0, policy_version 48520 (0.0007) [2023-03-07 00:12:46,236][81074] Fps is (10 sec: 13107.0, 60 sec: 13175.4, 300 sec: 13169.7). Total num frames: 49685504. Throughput: 0: 13178.9. Samples: 49684581. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 00:12:46,237][81074] Avg episode reward: [(0, '2449.139')] [2023-03-07 00:12:46,868][81400] Updated weights for policy 0, policy_version 48530 (0.0008) [2023-03-07 00:12:47,629][81400] Updated weights for policy 0, policy_version 48540 (0.0006) [2023-03-07 00:12:48,404][81400] Updated weights for policy 0, policy_version 48550 (0.0006) [2023-03-07 00:12:49,191][81400] Updated weights for policy 0, policy_version 48560 (0.0006) [2023-03-07 00:12:49,976][81400] Updated weights for policy 0, policy_version 48570 (0.0006) [2023-03-07 00:12:50,757][81400] Updated weights for policy 0, policy_version 48580 (0.0006) [2023-03-07 00:12:51,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13192.5, 300 sec: 13173.2). Total num frames: 49752064. Throughput: 0: 13188.0. Samples: 49724337. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 00:12:51,237][81074] Avg episode reward: [(0, '2212.493')] [2023-03-07 00:12:51,532][81400] Updated weights for policy 0, policy_version 48590 (0.0007) [2023-03-07 00:12:52,301][81400] Updated weights for policy 0, policy_version 48600 (0.0005) [2023-03-07 00:12:53,078][81400] Updated weights for policy 0, policy_version 48610 (0.0006) [2023-03-07 00:12:53,840][81400] Updated weights for policy 0, policy_version 48620 (0.0006) [2023-03-07 00:12:54,609][81400] Updated weights for policy 0, policy_version 48630 (0.0007) [2023-03-07 00:12:55,388][81400] Updated weights for policy 0, policy_version 48640 (0.0006) [2023-03-07 00:12:56,184][81400] Updated weights for policy 0, policy_version 48650 (0.0006) [2023-03-07 00:12:56,236][81074] Fps is (10 sec: 13209.9, 60 sec: 13175.5, 300 sec: 13173.2). Total num frames: 49817600. Throughput: 0: 13188.8. Samples: 49803645. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 00:12:56,237][81074] Avg episode reward: [(0, '2091.560')] [2023-03-07 00:12:56,949][81400] Updated weights for policy 0, policy_version 48660 (0.0006) [2023-03-07 00:12:57,721][81400] Updated weights for policy 0, policy_version 48670 (0.0006) [2023-03-07 00:12:58,504][81400] Updated weights for policy 0, policy_version 48680 (0.0006) [2023-03-07 00:12:59,289][81400] Updated weights for policy 0, policy_version 48690 (0.0006) [2023-03-07 00:13:00,090][81400] Updated weights for policy 0, policy_version 48700 (0.0006) [2023-03-07 00:13:00,872][81400] Updated weights for policy 0, policy_version 48710 (0.0006) [2023-03-07 00:13:01,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13175.5, 300 sec: 13173.2). Total num frames: 49883136. Throughput: 0: 13174.4. Samples: 49882307. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 00:13:01,237][81074] Avg episode reward: [(0, '2096.209')] [2023-03-07 00:13:01,643][81400] Updated weights for policy 0, policy_version 48720 (0.0006) [2023-03-07 00:13:02,416][81400] Updated weights for policy 0, policy_version 48730 (0.0006) [2023-03-07 00:13:03,186][81400] Updated weights for policy 0, policy_version 48740 (0.0005) [2023-03-07 00:13:03,957][81400] Updated weights for policy 0, policy_version 48750 (0.0006) [2023-03-07 00:13:04,723][81400] Updated weights for policy 0, policy_version 48760 (0.0006) [2023-03-07 00:13:05,507][81400] Updated weights for policy 0, policy_version 48770 (0.0006) [2023-03-07 00:13:06,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13175.4, 300 sec: 13176.6). Total num frames: 49949696. Throughput: 0: 13173.8. Samples: 49921985. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 00:13:06,237][81074] Avg episode reward: [(0, '2035.951')] [2023-03-07 00:13:06,290][81400] Updated weights for policy 0, policy_version 48780 (0.0007) [2023-03-07 00:13:07,077][81400] Updated weights for policy 0, policy_version 48790 (0.0007) [2023-03-07 00:13:07,849][81400] Updated weights for policy 0, policy_version 48800 (0.0006) [2023-03-07 00:13:08,629][81400] Updated weights for policy 0, policy_version 48810 (0.0006) [2023-03-07 00:13:09,418][81400] Updated weights for policy 0, policy_version 48820 (0.0006) [2023-03-07 00:13:10,197][81400] Updated weights for policy 0, policy_version 48830 (0.0006) [2023-03-07 00:13:10,954][81400] Updated weights for policy 0, policy_version 48840 (0.0006) [2023-03-07 00:13:11,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13175.5, 300 sec: 13173.2). Total num frames: 50015232. Throughput: 0: 13173.6. Samples: 50000883. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 00:13:11,237][81074] Avg episode reward: [(0, '2318.157')] [2023-03-07 00:13:11,747][81400] Updated weights for policy 0, policy_version 48850 (0.0006) [2023-03-07 00:13:12,493][81400] Updated weights for policy 0, policy_version 48860 (0.0005) [2023-03-07 00:13:13,293][81400] Updated weights for policy 0, policy_version 48870 (0.0006) [2023-03-07 00:13:14,045][81400] Updated weights for policy 0, policy_version 48880 (0.0006) [2023-03-07 00:13:14,822][81400] Updated weights for policy 0, policy_version 48890 (0.0006) [2023-03-07 00:13:15,592][81400] Updated weights for policy 0, policy_version 48900 (0.0007) [2023-03-07 00:13:16,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13176.6). Total num frames: 50081792. Throughput: 0: 13178.2. Samples: 50080173. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 00:13:16,237][81074] Avg episode reward: [(0, '2101.197')] [2023-03-07 00:13:16,382][81400] Updated weights for policy 0, policy_version 48910 (0.0007) [2023-03-07 00:13:17,153][81400] Updated weights for policy 0, policy_version 48920 (0.0007) [2023-03-07 00:13:17,945][81400] Updated weights for policy 0, policy_version 48930 (0.0006) [2023-03-07 00:13:18,713][81400] Updated weights for policy 0, policy_version 48940 (0.0006) [2023-03-07 00:13:19,506][81400] Updated weights for policy 0, policy_version 48950 (0.0006) [2023-03-07 00:13:20,298][81400] Updated weights for policy 0, policy_version 48960 (0.0006) [2023-03-07 00:13:21,073][81400] Updated weights for policy 0, policy_version 48970 (0.0007) [2023-03-07 00:13:21,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13173.2). Total num frames: 50147328. Throughput: 0: 13174.9. Samples: 50119543. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 00:13:21,237][81074] Avg episode reward: [(0, '2102.137')] [2023-03-07 00:13:21,857][81400] Updated weights for policy 0, policy_version 48980 (0.0006) [2023-03-07 00:13:22,636][81400] Updated weights for policy 0, policy_version 48990 (0.0006) [2023-03-07 00:13:23,423][81400] Updated weights for policy 0, policy_version 49000 (0.0005) [2023-03-07 00:13:24,196][81400] Updated weights for policy 0, policy_version 49010 (0.0006) [2023-03-07 00:13:24,971][81400] Updated weights for policy 0, policy_version 49020 (0.0006) [2023-03-07 00:13:25,752][81400] Updated weights for policy 0, policy_version 49030 (0.0006) [2023-03-07 00:13:26,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13175.5, 300 sec: 13173.1). Total num frames: 50212864. Throughput: 0: 13166.1. Samples: 50198242. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 00:13:26,237][81074] Avg episode reward: [(0, '2072.098')] [2023-03-07 00:13:26,547][81400] Updated weights for policy 0, policy_version 49040 (0.0007) [2023-03-07 00:13:27,340][81400] Updated weights for policy 0, policy_version 49050 (0.0007) [2023-03-07 00:13:28,122][81400] Updated weights for policy 0, policy_version 49060 (0.0006) [2023-03-07 00:13:28,898][81400] Updated weights for policy 0, policy_version 49070 (0.0006) [2023-03-07 00:13:29,679][81400] Updated weights for policy 0, policy_version 49080 (0.0007) [2023-03-07 00:13:30,478][81400] Updated weights for policy 0, policy_version 49090 (0.0005) [2023-03-07 00:13:31,236][81074] Fps is (10 sec: 13004.8, 60 sec: 13141.4, 300 sec: 13169.7). Total num frames: 50277376. Throughput: 0: 13156.6. Samples: 50276625. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 00:13:31,237][81074] Avg episode reward: [(0, '2145.064')] [2023-03-07 00:13:31,246][81400] Updated weights for policy 0, policy_version 49100 (0.0005) [2023-03-07 00:13:32,032][81400] Updated weights for policy 0, policy_version 49110 (0.0007) [2023-03-07 00:13:32,802][81400] Updated weights for policy 0, policy_version 49120 (0.0006) [2023-03-07 00:13:33,577][81400] Updated weights for policy 0, policy_version 49130 (0.0007) [2023-03-07 00:13:34,360][81400] Updated weights for policy 0, policy_version 49140 (0.0006) [2023-03-07 00:13:35,125][81400] Updated weights for policy 0, policy_version 49150 (0.0006) [2023-03-07 00:13:35,912][81400] Updated weights for policy 0, policy_version 49160 (0.0006) [2023-03-07 00:13:36,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13169.7). Total num frames: 50343936. Throughput: 0: 13149.8. Samples: 50316079. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 00:13:36,237][81074] Avg episode reward: [(0, '2151.212')] [2023-03-07 00:13:36,690][81400] Updated weights for policy 0, policy_version 49170 (0.0006) [2023-03-07 00:13:37,465][81400] Updated weights for policy 0, policy_version 49180 (0.0007) [2023-03-07 00:13:38,253][81400] Updated weights for policy 0, policy_version 49190 (0.0005) [2023-03-07 00:13:39,021][81400] Updated weights for policy 0, policy_version 49200 (0.0007) [2023-03-07 00:13:39,794][81400] Updated weights for policy 0, policy_version 49210 (0.0007) [2023-03-07 00:13:40,567][81400] Updated weights for policy 0, policy_version 49220 (0.0006) [2023-03-07 00:13:41,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13169.7). Total num frames: 50409472. Throughput: 0: 13143.4. Samples: 50395098. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:13:41,247][81074] Avg episode reward: [(0, '2190.787')] [2023-03-07 00:13:41,350][81400] Updated weights for policy 0, policy_version 49230 (0.0005) [2023-03-07 00:13:42,122][81400] Updated weights for policy 0, policy_version 49240 (0.0006) [2023-03-07 00:13:42,890][81400] Updated weights for policy 0, policy_version 49250 (0.0006) [2023-03-07 00:13:43,667][81400] Updated weights for policy 0, policy_version 49260 (0.0006) [2023-03-07 00:13:44,462][81400] Updated weights for policy 0, policy_version 49270 (0.0005) [2023-03-07 00:13:45,230][81400] Updated weights for policy 0, policy_version 49280 (0.0007) [2023-03-07 00:13:46,013][81400] Updated weights for policy 0, policy_version 49290 (0.0006) [2023-03-07 00:13:46,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13173.2). Total num frames: 50476032. Throughput: 0: 13154.7. Samples: 50474270. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:13:46,237][81074] Avg episode reward: [(0, '2148.273')] [2023-03-07 00:13:46,792][81400] Updated weights for policy 0, policy_version 49300 (0.0006) [2023-03-07 00:13:47,562][81400] Updated weights for policy 0, policy_version 49310 (0.0007) [2023-03-07 00:13:48,348][81400] Updated weights for policy 0, policy_version 49320 (0.0007) [2023-03-07 00:13:49,123][81400] Updated weights for policy 0, policy_version 49330 (0.0006) [2023-03-07 00:13:49,885][81400] Updated weights for policy 0, policy_version 49340 (0.0006) [2023-03-07 00:13:50,661][81400] Updated weights for policy 0, policy_version 49350 (0.0007) [2023-03-07 00:13:51,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13169.7). Total num frames: 50541568. Throughput: 0: 13150.6. Samples: 50513759. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:13:51,237][81074] Avg episode reward: [(0, '2241.935')] [2023-03-07 00:13:51,434][81400] Updated weights for policy 0, policy_version 49360 (0.0006) [2023-03-07 00:13:52,208][81400] Updated weights for policy 0, policy_version 49370 (0.0006) [2023-03-07 00:13:52,984][81400] Updated weights for policy 0, policy_version 49380 (0.0006) [2023-03-07 00:13:53,765][81400] Updated weights for policy 0, policy_version 49390 (0.0006) [2023-03-07 00:13:54,549][81400] Updated weights for policy 0, policy_version 49400 (0.0006) [2023-03-07 00:13:55,331][81400] Updated weights for policy 0, policy_version 49410 (0.0006) [2023-03-07 00:13:56,133][81400] Updated weights for policy 0, policy_version 49420 (0.0007) [2023-03-07 00:13:56,236][81074] Fps is (10 sec: 13107.0, 60 sec: 13158.4, 300 sec: 13169.7). Total num frames: 50607104. Throughput: 0: 13156.1. Samples: 50592907. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:13:56,237][81074] Avg episode reward: [(0, '2115.763')] [2023-03-07 00:13:56,241][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000049421_50607104.pth... [2023-03-07 00:13:56,271][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000046335_47447040.pth [2023-03-07 00:13:56,889][81400] Updated weights for policy 0, policy_version 49430 (0.0007) [2023-03-07 00:13:57,659][81400] Updated weights for policy 0, policy_version 49440 (0.0006) [2023-03-07 00:13:58,442][81400] Updated weights for policy 0, policy_version 49450 (0.0006) [2023-03-07 00:13:59,223][81400] Updated weights for policy 0, policy_version 49460 (0.0006) [2023-03-07 00:13:59,997][81400] Updated weights for policy 0, policy_version 49470 (0.0005) [2023-03-07 00:14:00,778][81400] Updated weights for policy 0, policy_version 49480 (0.0007) [2023-03-07 00:14:01,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13173.2). Total num frames: 50673664. Throughput: 0: 13150.8. Samples: 50671959. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:14:01,237][81074] Avg episode reward: [(0, '2200.336')] [2023-03-07 00:14:01,533][81400] Updated weights for policy 0, policy_version 49490 (0.0006) [2023-03-07 00:14:02,335][81400] Updated weights for policy 0, policy_version 49500 (0.0006) [2023-03-07 00:14:03,108][81400] Updated weights for policy 0, policy_version 49510 (0.0006) [2023-03-07 00:14:03,874][81400] Updated weights for policy 0, policy_version 49520 (0.0006) [2023-03-07 00:14:04,657][81400] Updated weights for policy 0, policy_version 49530 (0.0006) [2023-03-07 00:14:05,429][81400] Updated weights for policy 0, policy_version 49540 (0.0006) [2023-03-07 00:14:06,186][81400] Updated weights for policy 0, policy_version 49550 (0.0005) [2023-03-07 00:14:06,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13173.2). Total num frames: 50739200. Throughput: 0: 13155.8. Samples: 50711556. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:14:06,237][81074] Avg episode reward: [(0, '2203.525')] [2023-03-07 00:14:06,969][81400] Updated weights for policy 0, policy_version 49560 (0.0006) [2023-03-07 00:14:07,733][81400] Updated weights for policy 0, policy_version 49570 (0.0006) [2023-03-07 00:14:08,524][81400] Updated weights for policy 0, policy_version 49580 (0.0006) [2023-03-07 00:14:09,316][81400] Updated weights for policy 0, policy_version 49590 (0.0006) [2023-03-07 00:14:10,084][81400] Updated weights for policy 0, policy_version 49600 (0.0006) [2023-03-07 00:14:10,862][81400] Updated weights for policy 0, policy_version 49610 (0.0006) [2023-03-07 00:14:11,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13169.7). Total num frames: 50804736. Throughput: 0: 13163.9. Samples: 50790615. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:14:11,237][81074] Avg episode reward: [(0, '1995.805')] [2023-03-07 00:14:11,650][81400] Updated weights for policy 0, policy_version 49620 (0.0006) [2023-03-07 00:14:12,420][81400] Updated weights for policy 0, policy_version 49630 (0.0006) [2023-03-07 00:14:13,186][81400] Updated weights for policy 0, policy_version 49640 (0.0006) [2023-03-07 00:14:13,950][81400] Updated weights for policy 0, policy_version 49650 (0.0006) [2023-03-07 00:14:14,736][81400] Updated weights for policy 0, policy_version 49660 (0.0006) [2023-03-07 00:14:15,504][81400] Updated weights for policy 0, policy_version 49670 (0.0006) [2023-03-07 00:14:16,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13173.1). Total num frames: 50871296. Throughput: 0: 13190.6. Samples: 50870203. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:14:16,237][81074] Avg episode reward: [(0, '2040.050')] [2023-03-07 00:14:16,276][81400] Updated weights for policy 0, policy_version 49680 (0.0006) [2023-03-07 00:14:17,061][81400] Updated weights for policy 0, policy_version 49690 (0.0007) [2023-03-07 00:14:17,833][81400] Updated weights for policy 0, policy_version 49700 (0.0006) [2023-03-07 00:14:18,609][81400] Updated weights for policy 0, policy_version 49710 (0.0006) [2023-03-07 00:14:19,382][81400] Updated weights for policy 0, policy_version 49720 (0.0006) [2023-03-07 00:14:20,155][81400] Updated weights for policy 0, policy_version 49730 (0.0007) [2023-03-07 00:14:20,931][81400] Updated weights for policy 0, policy_version 49740 (0.0006) [2023-03-07 00:14:21,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13169.7). Total num frames: 50936832. Throughput: 0: 13191.5. Samples: 50909699. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:14:21,237][81074] Avg episode reward: [(0, '2069.654')] [2023-03-07 00:14:21,726][81400] Updated weights for policy 0, policy_version 49750 (0.0005) [2023-03-07 00:14:22,498][81400] Updated weights for policy 0, policy_version 49760 (0.0006) [2023-03-07 00:14:23,272][81400] Updated weights for policy 0, policy_version 49770 (0.0006) [2023-03-07 00:14:24,044][81400] Updated weights for policy 0, policy_version 49780 (0.0007) [2023-03-07 00:14:24,819][81400] Updated weights for policy 0, policy_version 49790 (0.0007) [2023-03-07 00:14:25,607][81400] Updated weights for policy 0, policy_version 49800 (0.0006) [2023-03-07 00:14:26,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13173.2). Total num frames: 51003392. Throughput: 0: 13187.7. Samples: 50988544. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:14:26,237][81074] Avg episode reward: [(0, '2009.827')] [2023-03-07 00:14:26,385][81400] Updated weights for policy 0, policy_version 49810 (0.0006) [2023-03-07 00:14:27,147][81400] Updated weights for policy 0, policy_version 49820 (0.0006) [2023-03-07 00:14:27,921][81400] Updated weights for policy 0, policy_version 49830 (0.0006) [2023-03-07 00:14:28,702][81400] Updated weights for policy 0, policy_version 49840 (0.0006) [2023-03-07 00:14:29,482][81400] Updated weights for policy 0, policy_version 49850 (0.0006) [2023-03-07 00:14:30,257][81400] Updated weights for policy 0, policy_version 49860 (0.0006) [2023-03-07 00:14:31,010][81400] Updated weights for policy 0, policy_version 49870 (0.0007) [2023-03-07 00:14:31,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13173.2). Total num frames: 51068928. Throughput: 0: 13193.7. Samples: 51067988. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:14:31,237][81074] Avg episode reward: [(0, '2270.938')] [2023-03-07 00:14:31,809][81400] Updated weights for policy 0, policy_version 49880 (0.0006) [2023-03-07 00:14:32,593][81400] Updated weights for policy 0, policy_version 49890 (0.0006) [2023-03-07 00:14:33,375][81400] Updated weights for policy 0, policy_version 49900 (0.0006) [2023-03-07 00:14:34,141][81400] Updated weights for policy 0, policy_version 49910 (0.0006) [2023-03-07 00:14:34,906][81400] Updated weights for policy 0, policy_version 49920 (0.0006) [2023-03-07 00:14:35,702][81400] Updated weights for policy 0, policy_version 49930 (0.0006) [2023-03-07 00:14:36,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13173.2). Total num frames: 51135488. Throughput: 0: 13191.2. Samples: 51107364. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:14:36,237][81074] Avg episode reward: [(0, '2201.101')] [2023-03-07 00:14:36,477][81400] Updated weights for policy 0, policy_version 49940 (0.0006) [2023-03-07 00:14:37,238][81400] Updated weights for policy 0, policy_version 49950 (0.0006) [2023-03-07 00:14:38,024][81400] Updated weights for policy 0, policy_version 49960 (0.0006) [2023-03-07 00:14:38,781][81400] Updated weights for policy 0, policy_version 49970 (0.0006) [2023-03-07 00:14:39,569][81400] Updated weights for policy 0, policy_version 49980 (0.0006) [2023-03-07 00:14:40,350][81400] Updated weights for policy 0, policy_version 49990 (0.0005) [2023-03-07 00:14:41,132][81400] Updated weights for policy 0, policy_version 50000 (0.0007) [2023-03-07 00:14:41,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13192.5, 300 sec: 13173.2). Total num frames: 51201024. Throughput: 0: 13190.7. Samples: 51186485. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:14:41,237][81074] Avg episode reward: [(0, '2116.366')] [2023-03-07 00:14:41,915][81400] Updated weights for policy 0, policy_version 50010 (0.0007) [2023-03-07 00:14:42,690][81400] Updated weights for policy 0, policy_version 50020 (0.0006) [2023-03-07 00:14:43,472][81400] Updated weights for policy 0, policy_version 50030 (0.0006) [2023-03-07 00:14:44,253][81400] Updated weights for policy 0, policy_version 50040 (0.0006) [2023-03-07 00:14:45,014][81400] Updated weights for policy 0, policy_version 50050 (0.0006) [2023-03-07 00:14:45,764][81400] Updated weights for policy 0, policy_version 50060 (0.0006) [2023-03-07 00:14:46,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13175.5, 300 sec: 13169.7). Total num frames: 51266560. Throughput: 0: 13192.4. Samples: 51265618. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:14:46,237][81074] Avg episode reward: [(0, '2184.734')] [2023-03-07 00:14:46,566][81400] Updated weights for policy 0, policy_version 50070 (0.0006) [2023-03-07 00:14:47,322][81400] Updated weights for policy 0, policy_version 50080 (0.0005) [2023-03-07 00:14:48,111][81400] Updated weights for policy 0, policy_version 50090 (0.0007) [2023-03-07 00:14:48,926][81400] Updated weights for policy 0, policy_version 50100 (0.0006) [2023-03-07 00:14:49,698][81400] Updated weights for policy 0, policy_version 50110 (0.0007) [2023-03-07 00:14:50,497][81400] Updated weights for policy 0, policy_version 50120 (0.0006) [2023-03-07 00:14:51,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13175.5, 300 sec: 13169.7). Total num frames: 51332096. Throughput: 0: 13185.9. Samples: 51304921. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:14:51,237][81074] Avg episode reward: [(0, '2006.690')] [2023-03-07 00:14:51,277][81400] Updated weights for policy 0, policy_version 50130 (0.0005) [2023-03-07 00:14:52,046][81400] Updated weights for policy 0, policy_version 50140 (0.0006) [2023-03-07 00:14:52,817][81400] Updated weights for policy 0, policy_version 50150 (0.0005) [2023-03-07 00:14:53,578][81400] Updated weights for policy 0, policy_version 50160 (0.0006) [2023-03-07 00:14:54,354][81400] Updated weights for policy 0, policy_version 50170 (0.0006) [2023-03-07 00:14:55,133][81400] Updated weights for policy 0, policy_version 50180 (0.0006) [2023-03-07 00:14:55,910][81400] Updated weights for policy 0, policy_version 50190 (0.0005) [2023-03-07 00:14:56,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.6, 300 sec: 13173.2). Total num frames: 51398656. Throughput: 0: 13186.2. Samples: 51383996. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:14:56,237][81074] Avg episode reward: [(0, '2171.221')] [2023-03-07 00:14:56,684][81400] Updated weights for policy 0, policy_version 50200 (0.0007) [2023-03-07 00:14:57,461][81400] Updated weights for policy 0, policy_version 50210 (0.0005) [2023-03-07 00:14:58,227][81400] Updated weights for policy 0, policy_version 50220 (0.0006) [2023-03-07 00:14:59,010][81400] Updated weights for policy 0, policy_version 50230 (0.0006) [2023-03-07 00:14:59,782][81400] Updated weights for policy 0, policy_version 50240 (0.0006) [2023-03-07 00:15:00,552][81400] Updated weights for policy 0, policy_version 50250 (0.0006) [2023-03-07 00:15:01,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13173.2). Total num frames: 51464192. Throughput: 0: 13180.3. Samples: 51463316. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:15:01,237][81074] Avg episode reward: [(0, '2170.429')] [2023-03-07 00:15:01,326][81400] Updated weights for policy 0, policy_version 50260 (0.0006) [2023-03-07 00:15:02,104][81400] Updated weights for policy 0, policy_version 50270 (0.0006) [2023-03-07 00:15:02,896][81400] Updated weights for policy 0, policy_version 50280 (0.0007) [2023-03-07 00:15:03,654][81400] Updated weights for policy 0, policy_version 50290 (0.0006) [2023-03-07 00:15:04,430][81400] Updated weights for policy 0, policy_version 50300 (0.0006) [2023-03-07 00:15:05,198][81400] Updated weights for policy 0, policy_version 50310 (0.0006) [2023-03-07 00:15:05,988][81400] Updated weights for policy 0, policy_version 50320 (0.0006) [2023-03-07 00:15:06,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13192.5, 300 sec: 13176.6). Total num frames: 51530752. Throughput: 0: 13180.1. Samples: 51502804. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:15:06,237][81074] Avg episode reward: [(0, '2204.164')] [2023-03-07 00:15:06,741][81400] Updated weights for policy 0, policy_version 50330 (0.0006) [2023-03-07 00:15:07,534][81400] Updated weights for policy 0, policy_version 50340 (0.0006) [2023-03-07 00:15:08,305][81400] Updated weights for policy 0, policy_version 50350 (0.0006) [2023-03-07 00:15:09,096][81400] Updated weights for policy 0, policy_version 50360 (0.0006) [2023-03-07 00:15:09,783][81349] KL-divergence is very high: 295.4039 [2023-03-07 00:15:09,872][81400] Updated weights for policy 0, policy_version 50370 (0.0006) [2023-03-07 00:15:10,641][81400] Updated weights for policy 0, policy_version 50380 (0.0006) [2023-03-07 00:15:11,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13176.6). Total num frames: 51596288. Throughput: 0: 13189.3. Samples: 51582060. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:15:11,237][81074] Avg episode reward: [(0, '2107.518')] [2023-03-07 00:15:11,415][81400] Updated weights for policy 0, policy_version 50390 (0.0006) [2023-03-07 00:15:12,205][81400] Updated weights for policy 0, policy_version 50400 (0.0007) [2023-03-07 00:15:12,984][81400] Updated weights for policy 0, policy_version 50410 (0.0006) [2023-03-07 00:15:13,743][81400] Updated weights for policy 0, policy_version 50420 (0.0006) [2023-03-07 00:15:14,539][81400] Updated weights for policy 0, policy_version 50430 (0.0006) [2023-03-07 00:15:15,324][81400] Updated weights for policy 0, policy_version 50440 (0.0006) [2023-03-07 00:15:16,095][81400] Updated weights for policy 0, policy_version 50450 (0.0006) [2023-03-07 00:15:16,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13175.5, 300 sec: 13173.2). Total num frames: 51661824. Throughput: 0: 13176.2. Samples: 51660918. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:15:16,237][81074] Avg episode reward: [(0, '2137.362')] [2023-03-07 00:15:16,864][81400] Updated weights for policy 0, policy_version 50460 (0.0006) [2023-03-07 00:15:17,642][81400] Updated weights for policy 0, policy_version 50470 (0.0006) [2023-03-07 00:15:18,418][81400] Updated weights for policy 0, policy_version 50480 (0.0006) [2023-03-07 00:15:19,192][81400] Updated weights for policy 0, policy_version 50490 (0.0006) [2023-03-07 00:15:19,972][81400] Updated weights for policy 0, policy_version 50500 (0.0006) [2023-03-07 00:15:20,747][81400] Updated weights for policy 0, policy_version 50510 (0.0007) [2023-03-07 00:15:21,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13192.6, 300 sec: 13176.6). Total num frames: 51728384. Throughput: 0: 13186.8. Samples: 51700769. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:15:21,237][81074] Avg episode reward: [(0, '2120.889')] [2023-03-07 00:15:21,533][81400] Updated weights for policy 0, policy_version 50520 (0.0006) [2023-03-07 00:15:22,317][81400] Updated weights for policy 0, policy_version 50530 (0.0008) [2023-03-07 00:15:23,105][81400] Updated weights for policy 0, policy_version 50540 (0.0006) [2023-03-07 00:15:23,885][81400] Updated weights for policy 0, policy_version 50550 (0.0005) [2023-03-07 00:15:24,666][81400] Updated weights for policy 0, policy_version 50560 (0.0006) [2023-03-07 00:15:25,064][81349] KL-divergence is very high: 113.0907 [2023-03-07 00:15:25,444][81400] Updated weights for policy 0, policy_version 50570 (0.0006) [2023-03-07 00:15:26,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13175.5, 300 sec: 13176.6). Total num frames: 51793920. Throughput: 0: 13169.9. Samples: 51779133. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:15:26,237][81074] Avg episode reward: [(0, '2198.163')] [2023-03-07 00:15:26,237][81400] Updated weights for policy 0, policy_version 50580 (0.0005) [2023-03-07 00:15:27,011][81400] Updated weights for policy 0, policy_version 50590 (0.0007) [2023-03-07 00:15:27,802][81400] Updated weights for policy 0, policy_version 50600 (0.0007) [2023-03-07 00:15:28,580][81400] Updated weights for policy 0, policy_version 50610 (0.0007) [2023-03-07 00:15:29,369][81400] Updated weights for policy 0, policy_version 50620 (0.0006) [2023-03-07 00:15:30,148][81400] Updated weights for policy 0, policy_version 50630 (0.0007) [2023-03-07 00:15:30,913][81400] Updated weights for policy 0, policy_version 50640 (0.0006) [2023-03-07 00:15:31,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13175.5, 300 sec: 13176.6). Total num frames: 51859456. Throughput: 0: 13160.2. Samples: 51857825. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:15:31,237][81074] Avg episode reward: [(0, '2282.981')] [2023-03-07 00:15:31,710][81400] Updated weights for policy 0, policy_version 50650 (0.0007) [2023-03-07 00:15:32,498][81400] Updated weights for policy 0, policy_version 50660 (0.0007) [2023-03-07 00:15:33,257][81400] Updated weights for policy 0, policy_version 50670 (0.0006) [2023-03-07 00:15:34,053][81400] Updated weights for policy 0, policy_version 50680 (0.0006) [2023-03-07 00:15:34,842][81400] Updated weights for policy 0, policy_version 50690 (0.0007) [2023-03-07 00:15:35,614][81400] Updated weights for policy 0, policy_version 50700 (0.0006) [2023-03-07 00:15:36,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13173.2). Total num frames: 51924992. Throughput: 0: 13162.2. Samples: 51897221. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:15:36,237][81074] Avg episode reward: [(0, '1960.670')] [2023-03-07 00:15:36,376][81400] Updated weights for policy 0, policy_version 50710 (0.0006) [2023-03-07 00:15:37,157][81400] Updated weights for policy 0, policy_version 50720 (0.0006) [2023-03-07 00:15:37,936][81400] Updated weights for policy 0, policy_version 50730 (0.0007) [2023-03-07 00:15:38,691][81400] Updated weights for policy 0, policy_version 50740 (0.0006) [2023-03-07 00:15:39,461][81400] Updated weights for policy 0, policy_version 50750 (0.0006) [2023-03-07 00:15:40,236][81400] Updated weights for policy 0, policy_version 50760 (0.0006) [2023-03-07 00:15:41,008][81400] Updated weights for policy 0, policy_version 50770 (0.0005) [2023-03-07 00:15:41,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13173.2). Total num frames: 51990528. Throughput: 0: 13166.4. Samples: 51976483. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:15:41,237][81074] Avg episode reward: [(0, '2102.347')] [2023-03-07 00:15:41,805][81400] Updated weights for policy 0, policy_version 50780 (0.0006) [2023-03-07 00:15:42,566][81400] Updated weights for policy 0, policy_version 50790 (0.0006) [2023-03-07 00:15:43,331][81400] Updated weights for policy 0, policy_version 50800 (0.0006) [2023-03-07 00:15:44,122][81400] Updated weights for policy 0, policy_version 50810 (0.0007) [2023-03-07 00:15:44,897][81400] Updated weights for policy 0, policy_version 50820 (0.0006) [2023-03-07 00:15:45,659][81400] Updated weights for policy 0, policy_version 50830 (0.0006) [2023-03-07 00:15:46,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13176.6). Total num frames: 52057088. Throughput: 0: 13164.2. Samples: 52055706. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:15:46,237][81074] Avg episode reward: [(0, '2110.020')] [2023-03-07 00:15:46,439][81400] Updated weights for policy 0, policy_version 50840 (0.0006) [2023-03-07 00:15:47,209][81400] Updated weights for policy 0, policy_version 50850 (0.0006) [2023-03-07 00:15:47,991][81400] Updated weights for policy 0, policy_version 50860 (0.0006) [2023-03-07 00:15:48,772][81400] Updated weights for policy 0, policy_version 50870 (0.0007) [2023-03-07 00:15:49,519][81400] Updated weights for policy 0, policy_version 50880 (0.0006) [2023-03-07 00:15:50,305][81400] Updated weights for policy 0, policy_version 50890 (0.0006) [2023-03-07 00:15:51,084][81400] Updated weights for policy 0, policy_version 50900 (0.0006) [2023-03-07 00:15:51,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13173.2). Total num frames: 52122624. Throughput: 0: 13168.8. Samples: 52095398. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:15:51,237][81074] Avg episode reward: [(0, '1961.151')] [2023-03-07 00:15:51,848][81400] Updated weights for policy 0, policy_version 50910 (0.0006) [2023-03-07 00:15:52,628][81400] Updated weights for policy 0, policy_version 50920 (0.0006) [2023-03-07 00:15:53,396][81400] Updated weights for policy 0, policy_version 50930 (0.0006) [2023-03-07 00:15:54,185][81400] Updated weights for policy 0, policy_version 50940 (0.0006) [2023-03-07 00:15:54,963][81400] Updated weights for policy 0, policy_version 50950 (0.0005) [2023-03-07 00:15:55,739][81400] Updated weights for policy 0, policy_version 50960 (0.0006) [2023-03-07 00:15:56,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13173.2). Total num frames: 52189184. Throughput: 0: 13172.9. Samples: 52174840. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:15:56,237][81074] Avg episode reward: [(0, '1844.320')] [2023-03-07 00:15:56,254][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000050967_52190208.pth... 
[2023-03-07 00:15:56,283][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000047878_49027072.pth [2023-03-07 00:15:56,511][81400] Updated weights for policy 0, policy_version 50970 (0.0006) [2023-03-07 00:15:57,281][81400] Updated weights for policy 0, policy_version 50980 (0.0006) [2023-03-07 00:15:58,055][81400] Updated weights for policy 0, policy_version 50990 (0.0006) [2023-03-07 00:15:58,818][81400] Updated weights for policy 0, policy_version 51000 (0.0006) [2023-03-07 00:15:59,584][81400] Updated weights for policy 0, policy_version 51010 (0.0005) [2023-03-07 00:16:00,360][81400] Updated weights for policy 0, policy_version 51020 (0.0006) [2023-03-07 00:16:01,157][81400] Updated weights for policy 0, policy_version 51030 (0.0006) [2023-03-07 00:16:01,236][81074] Fps is (10 sec: 13311.9, 60 sec: 13192.5, 300 sec: 13176.6). Total num frames: 52255744. Throughput: 0: 13179.7. Samples: 52254005. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:16:01,237][81074] Avg episode reward: [(0, '1999.305')] [2023-03-07 00:16:01,913][81400] Updated weights for policy 0, policy_version 51040 (0.0006) [2023-03-07 00:16:02,701][81400] Updated weights for policy 0, policy_version 51050 (0.0006) [2023-03-07 00:16:03,463][81400] Updated weights for policy 0, policy_version 51060 (0.0006) [2023-03-07 00:16:04,250][81400] Updated weights for policy 0, policy_version 51070 (0.0006) [2023-03-07 00:16:05,029][81400] Updated weights for policy 0, policy_version 51080 (0.0006) [2023-03-07 00:16:05,802][81400] Updated weights for policy 0, policy_version 51090 (0.0006) [2023-03-07 00:16:06,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13175.5, 300 sec: 13176.6). Total num frames: 52321280. Throughput: 0: 13178.1. Samples: 52293782. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:16:06,237][81074] Avg episode reward: [(0, '1878.975')] [2023-03-07 00:16:06,577][81400] Updated weights for policy 0, policy_version 51100 (0.0006) [2023-03-07 00:16:07,357][81400] Updated weights for policy 0, policy_version 51110 (0.0007) [2023-03-07 00:16:08,127][81400] Updated weights for policy 0, policy_version 51120 (0.0006) [2023-03-07 00:16:08,897][81400] Updated weights for policy 0, policy_version 51130 (0.0006) [2023-03-07 00:16:09,667][81400] Updated weights for policy 0, policy_version 51140 (0.0006) [2023-03-07 00:16:10,442][81400] Updated weights for policy 0, policy_version 51150 (0.0006) [2023-03-07 00:16:11,213][81400] Updated weights for policy 0, policy_version 51160 (0.0006) [2023-03-07 00:16:11,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13192.5, 300 sec: 13176.6). Total num frames: 52387840. Throughput: 0: 13197.4. Samples: 52373014. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:16:11,237][81074] Avg episode reward: [(0, '1783.016')] [2023-03-07 00:16:11,994][81400] Updated weights for policy 0, policy_version 51170 (0.0006) [2023-03-07 00:16:12,790][81400] Updated weights for policy 0, policy_version 51180 (0.0006) [2023-03-07 00:16:13,557][81400] Updated weights for policy 0, policy_version 51190 (0.0006) [2023-03-07 00:16:14,343][81400] Updated weights for policy 0, policy_version 51200 (0.0006) [2023-03-07 00:16:15,145][81400] Updated weights for policy 0, policy_version 51210 (0.0006) [2023-03-07 00:16:15,909][81400] Updated weights for policy 0, policy_version 51220 (0.0006) [2023-03-07 00:16:16,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13192.5, 300 sec: 13176.6). Total num frames: 52453376. Throughput: 0: 13205.1. Samples: 52452053. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:16:16,237][81074] Avg episode reward: [(0, '1802.913')] [2023-03-07 00:16:16,669][81400] Updated weights for policy 0, policy_version 51230 (0.0006) [2023-03-07 00:16:17,453][81400] Updated weights for policy 0, policy_version 51240 (0.0006) [2023-03-07 00:16:18,237][81400] Updated weights for policy 0, policy_version 51250 (0.0006) [2023-03-07 00:16:18,994][81400] Updated weights for policy 0, policy_version 51260 (0.0006) [2023-03-07 00:16:19,774][81400] Updated weights for policy 0, policy_version 51270 (0.0006) [2023-03-07 00:16:20,546][81400] Updated weights for policy 0, policy_version 51280 (0.0006) [2023-03-07 00:16:21,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13175.5, 300 sec: 13176.6). Total num frames: 52518912. Throughput: 0: 13213.0. Samples: 52491804. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:16:21,237][81074] Avg episode reward: [(0, '1957.456')] [2023-03-07 00:16:21,304][81400] Updated weights for policy 0, policy_version 51290 (0.0005) [2023-03-07 00:16:22,090][81400] Updated weights for policy 0, policy_version 51300 (0.0007) [2023-03-07 00:16:22,853][81400] Updated weights for policy 0, policy_version 51310 (0.0006) [2023-03-07 00:16:23,641][81400] Updated weights for policy 0, policy_version 51320 (0.0006) [2023-03-07 00:16:24,428][81400] Updated weights for policy 0, policy_version 51330 (0.0006) [2023-03-07 00:16:25,212][81400] Updated weights for policy 0, policy_version 51340 (0.0006) [2023-03-07 00:16:25,993][81400] Updated weights for policy 0, policy_version 51350 (0.0006) [2023-03-07 00:16:26,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13192.5, 300 sec: 13176.6). Total num frames: 52585472. Throughput: 0: 13207.4. Samples: 52570816. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:16:26,237][81074] Avg episode reward: [(0, '1955.485')] [2023-03-07 00:16:26,759][81400] Updated weights for policy 0, policy_version 51360 (0.0006) [2023-03-07 00:16:27,554][81400] Updated weights for policy 0, policy_version 51370 (0.0006) [2023-03-07 00:16:28,299][81400] Updated weights for policy 0, policy_version 51380 (0.0007) [2023-03-07 00:16:29,073][81400] Updated weights for policy 0, policy_version 51390 (0.0005) [2023-03-07 00:16:29,842][81400] Updated weights for policy 0, policy_version 51400 (0.0006) [2023-03-07 00:16:30,608][81400] Updated weights for policy 0, policy_version 51410 (0.0007) [2023-03-07 00:16:31,236][81074] Fps is (10 sec: 13311.9, 60 sec: 13209.6, 300 sec: 13180.1). Total num frames: 52652032. Throughput: 0: 13214.0. Samples: 52650338. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:16:31,237][81074] Avg episode reward: [(0, '1983.893')] [2023-03-07 00:16:31,388][81400] Updated weights for policy 0, policy_version 51420 (0.0007) [2023-03-07 00:16:32,168][81400] Updated weights for policy 0, policy_version 51430 (0.0007) [2023-03-07 00:16:32,949][81400] Updated weights for policy 0, policy_version 51440 (0.0006) [2023-03-07 00:16:33,729][81400] Updated weights for policy 0, policy_version 51450 (0.0006) [2023-03-07 00:16:34,511][81400] Updated weights for policy 0, policy_version 51460 (0.0007) [2023-03-07 00:16:35,270][81400] Updated weights for policy 0, policy_version 51470 (0.0006) [2023-03-07 00:16:36,057][81400] Updated weights for policy 0, policy_version 51480 (0.0006) [2023-03-07 00:16:36,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13209.6, 300 sec: 13180.1). Total num frames: 52717568. Throughput: 0: 13209.6. Samples: 52689830. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:16:36,237][81074] Avg episode reward: [(0, '1718.907')] [2023-03-07 00:16:36,816][81400] Updated weights for policy 0, policy_version 51490 (0.0006) [2023-03-07 00:16:37,617][81400] Updated weights for policy 0, policy_version 51500 (0.0006) [2023-03-07 00:16:38,393][81400] Updated weights for policy 0, policy_version 51510 (0.0007) [2023-03-07 00:16:39,187][81400] Updated weights for policy 0, policy_version 51520 (0.0007) [2023-03-07 00:16:39,945][81400] Updated weights for policy 0, policy_version 51530 (0.0005) [2023-03-07 00:16:40,726][81400] Updated weights for policy 0, policy_version 51540 (0.0007) [2023-03-07 00:16:41,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13209.6, 300 sec: 13180.1). Total num frames: 52783104. Throughput: 0: 13202.3. Samples: 52768943. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:16:41,237][81074] Avg episode reward: [(0, '1924.929')] [2023-03-07 00:16:41,511][81400] Updated weights for policy 0, policy_version 51550 (0.0006) [2023-03-07 00:16:42,304][81400] Updated weights for policy 0, policy_version 51560 (0.0006) [2023-03-07 00:16:43,050][81400] Updated weights for policy 0, policy_version 51570 (0.0006) [2023-03-07 00:16:43,841][81400] Updated weights for policy 0, policy_version 51580 (0.0006) [2023-03-07 00:16:44,615][81400] Updated weights for policy 0, policy_version 51590 (0.0007) [2023-03-07 00:16:45,384][81400] Updated weights for policy 0, policy_version 51600 (0.0006) [2023-03-07 00:16:46,151][81400] Updated weights for policy 0, policy_version 51610 (0.0005) [2023-03-07 00:16:46,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13192.5, 300 sec: 13180.1). Total num frames: 52848640. Throughput: 0: 13195.8. Samples: 52847815. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:16:46,237][81074] Avg episode reward: [(0, '1840.338')] [2023-03-07 00:16:46,946][81400] Updated weights for policy 0, policy_version 51620 (0.0005) [2023-03-07 00:16:47,714][81400] Updated weights for policy 0, policy_version 51630 (0.0006) [2023-03-07 00:16:48,488][81400] Updated weights for policy 0, policy_version 51640 (0.0007) [2023-03-07 00:16:49,266][81400] Updated weights for policy 0, policy_version 51650 (0.0007) [2023-03-07 00:16:50,057][81400] Updated weights for policy 0, policy_version 51660 (0.0006) [2023-03-07 00:16:50,834][81400] Updated weights for policy 0, policy_version 51670 (0.0006) [2023-03-07 00:16:51,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13209.6, 300 sec: 13180.1). Total num frames: 52915200. Throughput: 0: 13193.1. Samples: 52887472. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:16:51,237][81074] Avg episode reward: [(0, '1839.329')] [2023-03-07 00:16:51,600][81400] Updated weights for policy 0, policy_version 51680 (0.0006) [2023-03-07 00:16:52,387][81400] Updated weights for policy 0, policy_version 51690 (0.0006) [2023-03-07 00:16:53,146][81400] Updated weights for policy 0, policy_version 51700 (0.0006) [2023-03-07 00:16:53,923][81400] Updated weights for policy 0, policy_version 51710 (0.0006) [2023-03-07 00:16:54,704][81400] Updated weights for policy 0, policy_version 51720 (0.0006) [2023-03-07 00:16:55,483][81400] Updated weights for policy 0, policy_version 51730 (0.0006) [2023-03-07 00:16:56,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13192.5, 300 sec: 13180.1). Total num frames: 52980736. Throughput: 0: 13189.4. Samples: 52966540. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:16:56,237][81074] Avg episode reward: [(0, '1982.182')] [2023-03-07 00:16:56,261][81400] Updated weights for policy 0, policy_version 51740 (0.0007) [2023-03-07 00:16:57,040][81400] Updated weights for policy 0, policy_version 51750 (0.0006) [2023-03-07 00:16:57,799][81400] Updated weights for policy 0, policy_version 51760 (0.0005) [2023-03-07 00:16:58,575][81400] Updated weights for policy 0, policy_version 51770 (0.0007) [2023-03-07 00:16:59,350][81400] Updated weights for policy 0, policy_version 51780 (0.0005) [2023-03-07 00:17:00,122][81400] Updated weights for policy 0, policy_version 51790 (0.0007) [2023-03-07 00:17:00,906][81400] Updated weights for policy 0, policy_version 51800 (0.0006) [2023-03-07 00:17:01,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13180.1). Total num frames: 53047296. Throughput: 0: 13198.3. Samples: 53045978. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:17:01,237][81074] Avg episode reward: [(0, '2072.231')] [2023-03-07 00:17:01,655][81400] Updated weights for policy 0, policy_version 51810 (0.0006) [2023-03-07 00:17:02,431][81400] Updated weights for policy 0, policy_version 51820 (0.0006) [2023-03-07 00:17:03,209][81400] Updated weights for policy 0, policy_version 51830 (0.0006) [2023-03-07 00:17:03,981][81400] Updated weights for policy 0, policy_version 51840 (0.0006) [2023-03-07 00:17:04,765][81400] Updated weights for policy 0, policy_version 51850 (0.0006) [2023-03-07 00:17:05,545][81400] Updated weights for policy 0, policy_version 51860 (0.0006) [2023-03-07 00:17:06,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13192.5, 300 sec: 13180.1). Total num frames: 53112832. Throughput: 0: 13197.3. Samples: 53085681. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:17:06,237][81074] Avg episode reward: [(0, '2245.857')] [2023-03-07 00:17:06,318][81400] Updated weights for policy 0, policy_version 51870 (0.0006) [2023-03-07 00:17:07,107][81400] Updated weights for policy 0, policy_version 51880 (0.0007) [2023-03-07 00:17:07,868][81400] Updated weights for policy 0, policy_version 51890 (0.0006) [2023-03-07 00:17:08,622][81400] Updated weights for policy 0, policy_version 51900 (0.0007) [2023-03-07 00:17:09,392][81400] Updated weights for policy 0, policy_version 51910 (0.0006) [2023-03-07 00:17:10,171][81400] Updated weights for policy 0, policy_version 51920 (0.0006) [2023-03-07 00:17:10,961][81400] Updated weights for policy 0, policy_version 51930 (0.0006) [2023-03-07 00:17:11,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13192.5, 300 sec: 13180.1). Total num frames: 53179392. Throughput: 0: 13207.2. Samples: 53165140. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:17:11,237][81074] Avg episode reward: [(0, '2087.431')] [2023-03-07 00:17:11,728][81400] Updated weights for policy 0, policy_version 51940 (0.0006) [2023-03-07 00:17:12,496][81400] Updated weights for policy 0, policy_version 51950 (0.0006) [2023-03-07 00:17:13,287][81400] Updated weights for policy 0, policy_version 51960 (0.0006) [2023-03-07 00:17:14,062][81400] Updated weights for policy 0, policy_version 51970 (0.0006) [2023-03-07 00:17:14,841][81400] Updated weights for policy 0, policy_version 51980 (0.0006) [2023-03-07 00:17:15,616][81400] Updated weights for policy 0, policy_version 51990 (0.0006) [2023-03-07 00:17:16,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13180.1). Total num frames: 53244928. Throughput: 0: 13195.2. Samples: 53244124. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:17:16,237][81074] Avg episode reward: [(0, '2187.000')] [2023-03-07 00:17:16,409][81400] Updated weights for policy 0, policy_version 52000 (0.0006) [2023-03-07 00:17:17,187][81400] Updated weights for policy 0, policy_version 52010 (0.0007) [2023-03-07 00:17:17,992][81400] Updated weights for policy 0, policy_version 52020 (0.0006) [2023-03-07 00:17:18,744][81400] Updated weights for policy 0, policy_version 52030 (0.0007) [2023-03-07 00:17:19,551][81400] Updated weights for policy 0, policy_version 52040 (0.0005) [2023-03-07 00:17:20,306][81400] Updated weights for policy 0, policy_version 52050 (0.0006) [2023-03-07 00:17:21,079][81400] Updated weights for policy 0, policy_version 52060 (0.0006) [2023-03-07 00:17:21,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13192.5, 300 sec: 13180.1). Total num frames: 53310464. Throughput: 0: 13189.1. Samples: 53283340. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:17:21,237][81074] Avg episode reward: [(0, '2309.155')] [2023-03-07 00:17:21,882][81400] Updated weights for policy 0, policy_version 52070 (0.0006) [2023-03-07 00:17:22,643][81400] Updated weights for policy 0, policy_version 52080 (0.0007) [2023-03-07 00:17:23,440][81400] Updated weights for policy 0, policy_version 52090 (0.0007) [2023-03-07 00:17:24,231][81400] Updated weights for policy 0, policy_version 52100 (0.0006) [2023-03-07 00:17:25,012][81400] Updated weights for policy 0, policy_version 52110 (0.0007) [2023-03-07 00:17:25,784][81400] Updated weights for policy 0, policy_version 52120 (0.0006) [2023-03-07 00:17:26,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13175.5, 300 sec: 13176.6). Total num frames: 53376000. Throughput: 0: 13178.3. Samples: 53361968. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:17:26,237][81074] Avg episode reward: [(0, '2221.308')] [2023-03-07 00:17:26,562][81400] Updated weights for policy 0, policy_version 52130 (0.0006) [2023-03-07 00:17:27,346][81400] Updated weights for policy 0, policy_version 52140 (0.0007) [2023-03-07 00:17:28,114][81400] Updated weights for policy 0, policy_version 52150 (0.0006) [2023-03-07 00:17:28,888][81400] Updated weights for policy 0, policy_version 52160 (0.0006) [2023-03-07 00:17:29,659][81400] Updated weights for policy 0, policy_version 52170 (0.0006) [2023-03-07 00:17:30,433][81400] Updated weights for policy 0, policy_version 52180 (0.0006) [2023-03-07 00:17:31,226][81400] Updated weights for policy 0, policy_version 52190 (0.0006) [2023-03-07 00:17:31,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13175.5, 300 sec: 13180.1). Total num frames: 53442560. Throughput: 0: 13179.7. Samples: 53440903. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:17:31,237][81074] Avg episode reward: [(0, '2155.231')] [2023-03-07 00:17:32,003][81400] Updated weights for policy 0, policy_version 52200 (0.0006) [2023-03-07 00:17:32,784][81400] Updated weights for policy 0, policy_version 52210 (0.0006) [2023-03-07 00:17:33,580][81400] Updated weights for policy 0, policy_version 52220 (0.0007) [2023-03-07 00:17:34,353][81400] Updated weights for policy 0, policy_version 52230 (0.0006) [2023-03-07 00:17:35,123][81400] Updated weights for policy 0, policy_version 52240 (0.0006) [2023-03-07 00:17:35,892][81400] Updated weights for policy 0, policy_version 52250 (0.0006) [2023-03-07 00:17:36,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13175.4, 300 sec: 13180.1). Total num frames: 53508096. Throughput: 0: 13170.2. Samples: 53480132. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:17:36,237][81074] Avg episode reward: [(0, '2110.363')] [2023-03-07 00:17:36,675][81400] Updated weights for policy 0, policy_version 52260 (0.0007) [2023-03-07 00:17:37,431][81400] Updated weights for policy 0, policy_version 52270 (0.0006) [2023-03-07 00:17:38,219][81400] Updated weights for policy 0, policy_version 52280 (0.0006) [2023-03-07 00:17:39,012][81400] Updated weights for policy 0, policy_version 52290 (0.0006) [2023-03-07 00:17:39,770][81400] Updated weights for policy 0, policy_version 52300 (0.0005) [2023-03-07 00:17:40,545][81400] Updated weights for policy 0, policy_version 52310 (0.0006) [2023-03-07 00:17:41,236][81074] Fps is (10 sec: 13209.8, 60 sec: 13192.5, 300 sec: 13183.6). Total num frames: 53574656. Throughput: 0: 13180.5. Samples: 53559661. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:17:41,237][81074] Avg episode reward: [(0, '2182.687')] [2023-03-07 00:17:41,318][81400] Updated weights for policy 0, policy_version 52320 (0.0006) [2023-03-07 00:17:42,108][81400] Updated weights for policy 0, policy_version 52330 (0.0006) [2023-03-07 00:17:42,863][81400] Updated weights for policy 0, policy_version 52340 (0.0006) [2023-03-07 00:17:43,638][81400] Updated weights for policy 0, policy_version 52350 (0.0007) [2023-03-07 00:17:44,432][81400] Updated weights for policy 0, policy_version 52360 (0.0006) [2023-03-07 00:17:45,206][81400] Updated weights for policy 0, policy_version 52370 (0.0006) [2023-03-07 00:17:45,983][81400] Updated weights for policy 0, policy_version 52380 (0.0006) [2023-03-07 00:17:46,236][81074] Fps is (10 sec: 13209.8, 60 sec: 13192.5, 300 sec: 13180.1). Total num frames: 53640192. Throughput: 0: 13176.2. Samples: 53638905. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:17:46,237][81074] Avg episode reward: [(0, '2222.022')] [2023-03-07 00:17:46,751][81400] Updated weights for policy 0, policy_version 52390 (0.0006) [2023-03-07 00:17:47,538][81400] Updated weights for policy 0, policy_version 52400 (0.0006) [2023-03-07 00:17:48,314][81400] Updated weights for policy 0, policy_version 52410 (0.0006) [2023-03-07 00:17:49,077][81400] Updated weights for policy 0, policy_version 52420 (0.0007) [2023-03-07 00:17:49,883][81400] Updated weights for policy 0, policy_version 52430 (0.0006) [2023-03-07 00:17:50,625][81400] Updated weights for policy 0, policy_version 52440 (0.0005) [2023-03-07 00:17:51,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13175.5, 300 sec: 13180.1). Total num frames: 53705728. Throughput: 0: 13174.3. Samples: 53678522. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:17:51,237][81074] Avg episode reward: [(0, '2038.589')] [2023-03-07 00:17:51,418][81400] Updated weights for policy 0, policy_version 52450 (0.0006) [2023-03-07 00:17:52,195][81400] Updated weights for policy 0, policy_version 52460 (0.0007) [2023-03-07 00:17:52,951][81400] Updated weights for policy 0, policy_version 52470 (0.0007) [2023-03-07 00:17:53,719][81400] Updated weights for policy 0, policy_version 52480 (0.0006) [2023-03-07 00:17:54,503][81400] Updated weights for policy 0, policy_version 52490 (0.0006) [2023-03-07 00:17:55,277][81400] Updated weights for policy 0, policy_version 52500 (0.0006) [2023-03-07 00:17:56,061][81400] Updated weights for policy 0, policy_version 52510 (0.0006) [2023-03-07 00:17:56,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13183.6). Total num frames: 53772288. Throughput: 0: 13169.9. Samples: 53757788. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:17:56,237][81074] Avg episode reward: [(0, '2105.027')] [2023-03-07 00:17:56,241][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000052512_53772288.pth... [2023-03-07 00:17:56,273][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000049421_50607104.pth [2023-03-07 00:17:56,825][81400] Updated weights for policy 0, policy_version 52520 (0.0007) [2023-03-07 00:17:57,597][81400] Updated weights for policy 0, policy_version 52530 (0.0006) [2023-03-07 00:17:58,366][81400] Updated weights for policy 0, policy_version 52540 (0.0006) [2023-03-07 00:17:59,137][81400] Updated weights for policy 0, policy_version 52550 (0.0006) [2023-03-07 00:17:59,894][81400] Updated weights for policy 0, policy_version 52560 (0.0006) [2023-03-07 00:18:00,670][81400] Updated weights for policy 0, policy_version 52570 (0.0006) [2023-03-07 00:18:01,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13192.5, 300 sec: 13183.6). Total num frames: 53838848. Throughput: 0: 13184.9. Samples: 53837444. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:18:01,237][81074] Avg episode reward: [(0, '2240.788')] [2023-03-07 00:18:01,441][81400] Updated weights for policy 0, policy_version 52580 (0.0006) [2023-03-07 00:18:02,225][81400] Updated weights for policy 0, policy_version 52590 (0.0006) [2023-03-07 00:18:02,993][81400] Updated weights for policy 0, policy_version 52600 (0.0006) [2023-03-07 00:18:03,779][81400] Updated weights for policy 0, policy_version 52610 (0.0006) [2023-03-07 00:18:04,553][81400] Updated weights for policy 0, policy_version 52620 (0.0007) [2023-03-07 00:18:05,333][81400] Updated weights for policy 0, policy_version 52630 (0.0006) [2023-03-07 00:18:06,097][81400] Updated weights for policy 0, policy_version 52640 (0.0006) [2023-03-07 00:18:06,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13183.6). Total num frames: 53904384. Throughput: 0: 13192.8. Samples: 53877016. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:18:06,237][81074] Avg episode reward: [(0, '2262.667')] [2023-03-07 00:18:06,891][81400] Updated weights for policy 0, policy_version 52650 (0.0006) [2023-03-07 00:18:07,664][81400] Updated weights for policy 0, policy_version 52660 (0.0006) [2023-03-07 00:18:08,437][81400] Updated weights for policy 0, policy_version 52670 (0.0005) [2023-03-07 00:18:09,214][81400] Updated weights for policy 0, policy_version 52680 (0.0006) [2023-03-07 00:18:10,003][81400] Updated weights for policy 0, policy_version 52690 (0.0006) [2023-03-07 00:18:10,798][81400] Updated weights for policy 0, policy_version 52700 (0.0006) [2023-03-07 00:18:11,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13175.4, 300 sec: 13180.1). Total num frames: 53969920. Throughput: 0: 13198.5. Samples: 53955900. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:18:11,237][81074] Avg episode reward: [(0, '2121.121')] [2023-03-07 00:18:11,592][81400] Updated weights for policy 0, policy_version 52710 (0.0006) [2023-03-07 00:18:12,369][81400] Updated weights for policy 0, policy_version 52720 (0.0007) [2023-03-07 00:18:13,139][81400] Updated weights for policy 0, policy_version 52730 (0.0007) [2023-03-07 00:18:13,927][81400] Updated weights for policy 0, policy_version 52740 (0.0006) [2023-03-07 00:18:14,680][81400] Updated weights for policy 0, policy_version 52750 (0.0006) [2023-03-07 00:18:15,470][81400] Updated weights for policy 0, policy_version 52760 (0.0006) [2023-03-07 00:18:16,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13175.5, 300 sec: 13180.1). Total num frames: 54035456. Throughput: 0: 13195.1. Samples: 54034682. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:18:16,237][81074] Avg episode reward: [(0, '2018.024')] [2023-03-07 00:18:16,250][81400] Updated weights for policy 0, policy_version 52770 (0.0006) [2023-03-07 00:18:17,010][81400] Updated weights for policy 0, policy_version 52780 (0.0006) [2023-03-07 00:18:17,807][81400] Updated weights for policy 0, policy_version 52790 (0.0005) [2023-03-07 00:18:18,569][81400] Updated weights for policy 0, policy_version 52800 (0.0005) [2023-03-07 00:18:19,342][81400] Updated weights for policy 0, policy_version 52810 (0.0006) [2023-03-07 00:18:20,130][81400] Updated weights for policy 0, policy_version 52820 (0.0005) [2023-03-07 00:18:20,893][81400] Updated weights for policy 0, policy_version 52830 (0.0007) [2023-03-07 00:18:21,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13183.6). Total num frames: 54102016. Throughput: 0: 13202.9. Samples: 54074261. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:18:21,247][81074] Avg episode reward: [(0, '2035.198')] [2023-03-07 00:18:21,662][81400] Updated weights for policy 0, policy_version 52840 (0.0006) [2023-03-07 00:18:22,440][81400] Updated weights for policy 0, policy_version 52850 (0.0006) [2023-03-07 00:18:23,217][81400] Updated weights for policy 0, policy_version 52860 (0.0005) [2023-03-07 00:18:23,978][81400] Updated weights for policy 0, policy_version 52870 (0.0007) [2023-03-07 00:18:24,752][81400] Updated weights for policy 0, policy_version 52880 (0.0006) [2023-03-07 00:18:25,519][81400] Updated weights for policy 0, policy_version 52890 (0.0006) [2023-03-07 00:18:26,236][81074] Fps is (10 sec: 13312.1, 60 sec: 13209.6, 300 sec: 13190.5). Total num frames: 54168576. Throughput: 0: 13202.8. Samples: 54153789. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:18:26,247][81074] Avg episode reward: [(0, '2147.702')] [2023-03-07 00:18:26,292][81400] Updated weights for policy 0, policy_version 52900 (0.0006) [2023-03-07 00:18:27,069][81400] Updated weights for policy 0, policy_version 52910 (0.0006) [2023-03-07 00:18:27,860][81400] Updated weights for policy 0, policy_version 52920 (0.0006) [2023-03-07 00:18:28,625][81400] Updated weights for policy 0, policy_version 52930 (0.0006) [2023-03-07 00:18:29,406][81400] Updated weights for policy 0, policy_version 52940 (0.0006) [2023-03-07 00:18:30,182][81400] Updated weights for policy 0, policy_version 52950 (0.0006) [2023-03-07 00:18:30,969][81400] Updated weights for policy 0, policy_version 52960 (0.0006) [2023-03-07 00:18:31,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13187.0). Total num frames: 54234112. Throughput: 0: 13193.4. Samples: 54232608. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:18:31,247][81074] Avg episode reward: [(0, '2433.887')] [2023-03-07 00:18:31,755][81400] Updated weights for policy 0, policy_version 52970 (0.0006) [2023-03-07 00:18:32,547][81400] Updated weights for policy 0, policy_version 52980 (0.0006) [2023-03-07 00:18:33,317][81400] Updated weights for policy 0, policy_version 52990 (0.0006) [2023-03-07 00:18:34,085][81400] Updated weights for policy 0, policy_version 53000 (0.0007) [2023-03-07 00:18:34,868][81400] Updated weights for policy 0, policy_version 53010 (0.0006) [2023-03-07 00:18:35,642][81400] Updated weights for policy 0, policy_version 53020 (0.0006) [2023-03-07 00:18:36,236][81074] Fps is (10 sec: 13106.9, 60 sec: 13192.5, 300 sec: 13187.0). Total num frames: 54299648. Throughput: 0: 13190.0. Samples: 54272075. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:18:36,248][81074] Avg episode reward: [(0, '2219.236')] [2023-03-07 00:18:36,422][81400] Updated weights for policy 0, policy_version 53030 (0.0006) [2023-03-07 00:18:37,210][81400] Updated weights for policy 0, policy_version 53040 (0.0006) [2023-03-07 00:18:37,989][81400] Updated weights for policy 0, policy_version 53050 (0.0007) [2023-03-07 00:18:38,779][81400] Updated weights for policy 0, policy_version 53060 (0.0007) [2023-03-07 00:18:39,542][81400] Updated weights for policy 0, policy_version 53070 (0.0007) [2023-03-07 00:18:40,327][81400] Updated weights for policy 0, policy_version 53080 (0.0006) [2023-03-07 00:18:41,086][81400] Updated weights for policy 0, policy_version 53090 (0.0005) [2023-03-07 00:18:41,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13192.5, 300 sec: 13187.0). Total num frames: 54366208. Throughput: 0: 13181.6. Samples: 54350959. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:18:41,247][81074] Avg episode reward: [(0, '2099.704')] [2023-03-07 00:18:41,873][81400] Updated weights for policy 0, policy_version 53100 (0.0006) [2023-03-07 00:18:42,657][81400] Updated weights for policy 0, policy_version 53110 (0.0006) [2023-03-07 00:18:43,436][81400] Updated weights for policy 0, policy_version 53120 (0.0006) [2023-03-07 00:18:44,185][81400] Updated weights for policy 0, policy_version 53130 (0.0006) [2023-03-07 00:18:44,974][81400] Updated weights for policy 0, policy_version 53140 (0.0006) [2023-03-07 00:18:45,757][81400] Updated weights for policy 0, policy_version 53150 (0.0007) [2023-03-07 00:18:46,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13192.5, 300 sec: 13187.0). Total num frames: 54431744. Throughput: 0: 13175.8. Samples: 54430358. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:18:46,247][81074] Avg episode reward: [(0, '2118.624')] [2023-03-07 00:18:46,510][81400] Updated weights for policy 0, policy_version 53160 (0.0007) [2023-03-07 00:18:47,314][81400] Updated weights for policy 0, policy_version 53170 (0.0007) [2023-03-07 00:18:48,098][81400] Updated weights for policy 0, policy_version 53180 (0.0006) [2023-03-07 00:18:48,870][81400] Updated weights for policy 0, policy_version 53190 (0.0006) [2023-03-07 00:18:49,647][81400] Updated weights for policy 0, policy_version 53200 (0.0006) [2023-03-07 00:18:50,409][81400] Updated weights for policy 0, policy_version 53210 (0.0007) [2023-03-07 00:18:51,198][81400] Updated weights for policy 0, policy_version 53220 (0.0006) [2023-03-07 00:18:51,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13192.5, 300 sec: 13187.0). Total num frames: 54497280. Throughput: 0: 13174.5. Samples: 54469866. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:18:51,237][81074] Avg episode reward: [(0, '2130.211')] [2023-03-07 00:18:51,961][81400] Updated weights for policy 0, policy_version 53230 (0.0006) [2023-03-07 00:18:52,747][81400] Updated weights for policy 0, policy_version 53240 (0.0007) [2023-03-07 00:18:53,509][81400] Updated weights for policy 0, policy_version 53250 (0.0007) [2023-03-07 00:18:54,274][81400] Updated weights for policy 0, policy_version 53260 (0.0006) [2023-03-07 00:18:55,050][81400] Updated weights for policy 0, policy_version 53270 (0.0006) [2023-03-07 00:18:55,838][81400] Updated weights for policy 0, policy_version 53280 (0.0006) [2023-03-07 00:18:56,236][81074] Fps is (10 sec: 13209.8, 60 sec: 13192.5, 300 sec: 13187.0). Total num frames: 54563840. Throughput: 0: 13185.4. Samples: 54549241. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:18:56,237][81074] Avg episode reward: [(0, '2297.101')] [2023-03-07 00:18:56,611][81400] Updated weights for policy 0, policy_version 53290 (0.0007) [2023-03-07 00:18:57,403][81400] Updated weights for policy 0, policy_version 53300 (0.0006) [2023-03-07 00:18:58,156][81400] Updated weights for policy 0, policy_version 53310 (0.0006) [2023-03-07 00:18:58,933][81400] Updated weights for policy 0, policy_version 53320 (0.0006) [2023-03-07 00:18:59,698][81400] Updated weights for policy 0, policy_version 53330 (0.0006) [2023-03-07 00:19:00,473][81400] Updated weights for policy 0, policy_version 53340 (0.0006) [2023-03-07 00:19:01,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13187.0). Total num frames: 54629376. Throughput: 0: 13196.5. Samples: 54628524. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:19:01,237][81074] Avg episode reward: [(0, '2028.311')] [2023-03-07 00:19:01,262][81400] Updated weights for policy 0, policy_version 53350 (0.0007) [2023-03-07 00:19:02,032][81400] Updated weights for policy 0, policy_version 53360 (0.0007) [2023-03-07 00:19:02,811][81400] Updated weights for policy 0, policy_version 53370 (0.0006) [2023-03-07 00:19:03,577][81400] Updated weights for policy 0, policy_version 53380 (0.0005) [2023-03-07 00:19:04,360][81400] Updated weights for policy 0, policy_version 53390 (0.0007) [2023-03-07 00:19:05,135][81400] Updated weights for policy 0, policy_version 53400 (0.0007) [2023-03-07 00:19:05,894][81400] Updated weights for policy 0, policy_version 53410 (0.0006) [2023-03-07 00:19:06,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.6, 300 sec: 13190.5). Total num frames: 54695936. Throughput: 0: 13199.6. Samples: 54668244. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:19:06,237][81074] Avg episode reward: [(0, '2484.165')] [2023-03-07 00:19:06,690][81400] Updated weights for policy 0, policy_version 53420 (0.0006) [2023-03-07 00:19:07,465][81400] Updated weights for policy 0, policy_version 53430 (0.0005) [2023-03-07 00:19:08,262][81400] Updated weights for policy 0, policy_version 53440 (0.0006) [2023-03-07 00:19:09,031][81400] Updated weights for policy 0, policy_version 53450 (0.0006) [2023-03-07 00:19:09,789][81400] Updated weights for policy 0, policy_version 53460 (0.0006) [2023-03-07 00:19:10,577][81400] Updated weights for policy 0, policy_version 53470 (0.0007) [2023-03-07 00:19:11,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13192.5, 300 sec: 13187.0). Total num frames: 54761472. Throughput: 0: 13186.9. Samples: 54747202. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:19:11,237][81074] Avg episode reward: [(0, '2506.097')] [2023-03-07 00:19:11,354][81400] Updated weights for policy 0, policy_version 53480 (0.0006) [2023-03-07 00:19:12,133][81400] Updated weights for policy 0, policy_version 53490 (0.0008) [2023-03-07 00:19:12,896][81400] Updated weights for policy 0, policy_version 53500 (0.0006) [2023-03-07 00:19:13,669][81400] Updated weights for policy 0, policy_version 53510 (0.0005) [2023-03-07 00:19:14,440][81400] Updated weights for policy 0, policy_version 53520 (0.0006) [2023-03-07 00:19:15,210][81400] Updated weights for policy 0, policy_version 53530 (0.0006) [2023-03-07 00:19:15,995][81400] Updated weights for policy 0, policy_version 53540 (0.0006) [2023-03-07 00:19:16,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13209.6, 300 sec: 13190.5). Total num frames: 54828032. Throughput: 0: 13195.1. Samples: 54826386. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:19:16,237][81074] Avg episode reward: [(0, '2388.085')] [2023-03-07 00:19:16,763][81400] Updated weights for policy 0, policy_version 53550 (0.0006) [2023-03-07 00:19:17,549][81400] Updated weights for policy 0, policy_version 53560 (0.0005) [2023-03-07 00:19:18,310][81400] Updated weights for policy 0, policy_version 53570 (0.0006) [2023-03-07 00:19:19,090][81400] Updated weights for policy 0, policy_version 53580 (0.0007) [2023-03-07 00:19:19,870][81400] Updated weights for policy 0, policy_version 53590 (0.0006) [2023-03-07 00:19:20,645][81400] Updated weights for policy 0, policy_version 53600 (0.0006) [2023-03-07 00:19:21,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13187.0). Total num frames: 54893568. Throughput: 0: 13200.1. Samples: 54866077. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:19:21,237][81074] Avg episode reward: [(0, '2511.117')] [2023-03-07 00:19:21,437][81400] Updated weights for policy 0, policy_version 53610 (0.0005) [2023-03-07 00:19:22,208][81400] Updated weights for policy 0, policy_version 53620 (0.0005) [2023-03-07 00:19:22,981][81400] Updated weights for policy 0, policy_version 53630 (0.0006) [2023-03-07 00:19:23,747][81400] Updated weights for policy 0, policy_version 53640 (0.0006) [2023-03-07 00:19:24,525][81400] Updated weights for policy 0, policy_version 53650 (0.0007) [2023-03-07 00:19:25,298][81400] Updated weights for policy 0, policy_version 53660 (0.0006) [2023-03-07 00:19:26,070][81400] Updated weights for policy 0, policy_version 53670 (0.0006) [2023-03-07 00:19:26,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13192.5, 300 sec: 13190.5). Total num frames: 54960128. Throughput: 0: 13209.9. Samples: 54945406. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:19:26,237][81074] Avg episode reward: [(0, '2610.074')] [2023-03-07 00:19:26,850][81400] Updated weights for policy 0, policy_version 53680 (0.0007) [2023-03-07 00:19:27,642][81400] Updated weights for policy 0, policy_version 53690 (0.0006) [2023-03-07 00:19:28,405][81400] Updated weights for policy 0, policy_version 53700 (0.0006) [2023-03-07 00:19:29,173][81400] Updated weights for policy 0, policy_version 53710 (0.0006) [2023-03-07 00:19:29,956][81400] Updated weights for policy 0, policy_version 53720 (0.0006) [2023-03-07 00:19:30,736][81400] Updated weights for policy 0, policy_version 53730 (0.0006) [2023-03-07 00:19:31,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13192.5, 300 sec: 13187.0). Total num frames: 55025664. Throughput: 0: 13204.2. Samples: 55024545. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:19:31,237][81074] Avg episode reward: [(0, '2610.923')] [2023-03-07 00:19:31,503][81400] Updated weights for policy 0, policy_version 53740 (0.0006) [2023-03-07 00:19:32,280][81400] Updated weights for policy 0, policy_version 53750 (0.0006) [2023-03-07 00:19:33,053][81400] Updated weights for policy 0, policy_version 53760 (0.0006) [2023-03-07 00:19:33,803][81400] Updated weights for policy 0, policy_version 53770 (0.0006) [2023-03-07 00:19:34,604][81400] Updated weights for policy 0, policy_version 53780 (0.0006) [2023-03-07 00:19:35,377][81400] Updated weights for policy 0, policy_version 53790 (0.0006) [2023-03-07 00:19:36,146][81400] Updated weights for policy 0, policy_version 53800 (0.0006) [2023-03-07 00:19:36,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13209.6, 300 sec: 13190.5). Total num frames: 55092224. Throughput: 0: 13209.5. Samples: 55064293. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:19:36,237][81074] Avg episode reward: [(0, '2602.692')] [2023-03-07 00:19:36,921][81400] Updated weights for policy 0, policy_version 53810 (0.0008) [2023-03-07 00:19:37,702][81400] Updated weights for policy 0, policy_version 53820 (0.0006) [2023-03-07 00:19:38,466][81400] Updated weights for policy 0, policy_version 53830 (0.0006) [2023-03-07 00:19:39,229][81400] Updated weights for policy 0, policy_version 53840 (0.0006) [2023-03-07 00:19:39,982][81400] Updated weights for policy 0, policy_version 53850 (0.0006) [2023-03-07 00:19:40,781][81400] Updated weights for policy 0, policy_version 53860 (0.0006) [2023-03-07 00:19:41,236][81074] Fps is (10 sec: 13312.2, 60 sec: 13209.6, 300 sec: 13194.0). Total num frames: 55158784. Throughput: 0: 13211.4. Samples: 55143753. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:19:41,237][81074] Avg episode reward: [(0, '2617.399')] [2023-03-07 00:19:41,543][81400] Updated weights for policy 0, policy_version 53870 (0.0005) [2023-03-07 00:19:42,325][81400] Updated weights for policy 0, policy_version 53880 (0.0006) [2023-03-07 00:19:43,100][81400] Updated weights for policy 0, policy_version 53890 (0.0006) [2023-03-07 00:19:43,870][81400] Updated weights for policy 0, policy_version 53900 (0.0006) [2023-03-07 00:19:44,645][81400] Updated weights for policy 0, policy_version 53910 (0.0006) [2023-03-07 00:19:45,435][81400] Updated weights for policy 0, policy_version 53920 (0.0007) [2023-03-07 00:19:46,211][81400] Updated weights for policy 0, policy_version 53930 (0.0007) [2023-03-07 00:19:46,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13209.6, 300 sec: 13194.0). Total num frames: 55224320. Throughput: 0: 13208.6. Samples: 55222914. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:19:46,237][81074] Avg episode reward: [(0, '2802.084')] [2023-03-07 00:19:46,995][81400] Updated weights for policy 0, policy_version 53940 (0.0006) [2023-03-07 00:19:47,761][81400] Updated weights for policy 0, policy_version 53950 (0.0007) [2023-03-07 00:19:48,541][81400] Updated weights for policy 0, policy_version 53960 (0.0006) [2023-03-07 00:19:49,325][81400] Updated weights for policy 0, policy_version 53970 (0.0006) [2023-03-07 00:19:50,092][81400] Updated weights for policy 0, policy_version 53980 (0.0007) [2023-03-07 00:19:50,875][81400] Updated weights for policy 0, policy_version 53990 (0.0006) [2023-03-07 00:19:51,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13209.6, 300 sec: 13190.5). Total num frames: 55289856. Throughput: 0: 13204.1. Samples: 55262429. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:19:51,237][81074] Avg episode reward: [(0, '2769.593')] [2023-03-07 00:19:51,659][81400] Updated weights for policy 0, policy_version 54000 (0.0007) [2023-03-07 00:19:52,437][81400] Updated weights for policy 0, policy_version 54010 (0.0006) [2023-03-07 00:19:53,214][81400] Updated weights for policy 0, policy_version 54020 (0.0007) [2023-03-07 00:19:53,988][81400] Updated weights for policy 0, policy_version 54030 (0.0006) [2023-03-07 00:19:54,772][81400] Updated weights for policy 0, policy_version 54040 (0.0006) [2023-03-07 00:19:55,561][81400] Updated weights for policy 0, policy_version 54050 (0.0006) [2023-03-07 00:19:56,236][81074] Fps is (10 sec: 13107.4, 60 sec: 13192.5, 300 sec: 13190.5). Total num frames: 55355392. Throughput: 0: 13203.0. Samples: 55341334. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:19:56,237][81074] Avg episode reward: [(0, '2606.644')] [2023-03-07 00:19:56,240][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000054058_55355392.pth... 
[2023-03-07 00:19:56,272][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000050967_52190208.pth [2023-03-07 00:19:56,340][81400] Updated weights for policy 0, policy_version 54060 (0.0007) [2023-03-07 00:19:57,125][81400] Updated weights for policy 0, policy_version 54070 (0.0007) [2023-03-07 00:19:57,893][81400] Updated weights for policy 0, policy_version 54080 (0.0006) [2023-03-07 00:19:58,678][81400] Updated weights for policy 0, policy_version 54090 (0.0006) [2023-03-07 00:19:59,458][81400] Updated weights for policy 0, policy_version 54100 (0.0006) [2023-03-07 00:20:00,250][81400] Updated weights for policy 0, policy_version 54110 (0.0006) [2023-03-07 00:20:01,016][81400] Updated weights for policy 0, policy_version 54120 (0.0006) [2023-03-07 00:20:01,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13192.5, 300 sec: 13187.0). Total num frames: 55420928. Throughput: 0: 13193.8. Samples: 55420106. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:20:01,237][81074] Avg episode reward: [(0, '2417.356')] [2023-03-07 00:20:01,797][81400] Updated weights for policy 0, policy_version 54130 (0.0006) [2023-03-07 00:20:02,590][81400] Updated weights for policy 0, policy_version 54140 (0.0006) [2023-03-07 00:20:03,361][81400] Updated weights for policy 0, policy_version 54150 (0.0006) [2023-03-07 00:20:04,143][81400] Updated weights for policy 0, policy_version 54160 (0.0006) [2023-03-07 00:20:04,919][81400] Updated weights for policy 0, policy_version 54170 (0.0006) [2023-03-07 00:20:05,691][81400] Updated weights for policy 0, policy_version 54180 (0.0006) [2023-03-07 00:20:06,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13175.5, 300 sec: 13187.0). Total num frames: 55486464. Throughput: 0: 13183.8. Samples: 55459346. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:20:06,237][81074] Avg episode reward: [(0, '2494.044')] [2023-03-07 00:20:06,463][81400] Updated weights for policy 0, policy_version 54190 (0.0006) [2023-03-07 00:20:07,246][81400] Updated weights for policy 0, policy_version 54200 (0.0006) [2023-03-07 00:20:08,033][81400] Updated weights for policy 0, policy_version 54210 (0.0005) [2023-03-07 00:20:08,802][81400] Updated weights for policy 0, policy_version 54220 (0.0006) [2023-03-07 00:20:09,581][81400] Updated weights for policy 0, policy_version 54230 (0.0006) [2023-03-07 00:20:10,363][81400] Updated weights for policy 0, policy_version 54240 (0.0007) [2023-03-07 00:20:11,146][81400] Updated weights for policy 0, policy_version 54250 (0.0006) [2023-03-07 00:20:11,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13192.6, 300 sec: 13190.5). Total num frames: 55553024. Throughput: 0: 13180.2. Samples: 55538516. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:20:11,237][81074] Avg episode reward: [(0, '2613.026')] [2023-03-07 00:20:11,927][81400] Updated weights for policy 0, policy_version 54260 (0.0006) [2023-03-07 00:20:12,694][81400] Updated weights for policy 0, policy_version 54270 (0.0006) [2023-03-07 00:20:13,467][81400] Updated weights for policy 0, policy_version 54280 (0.0006) [2023-03-07 00:20:14,253][81400] Updated weights for policy 0, policy_version 54290 (0.0006) [2023-03-07 00:20:15,041][81400] Updated weights for policy 0, policy_version 54300 (0.0006) [2023-03-07 00:20:15,800][81400] Updated weights for policy 0, policy_version 54310 (0.0006) [2023-03-07 00:20:16,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13187.0). Total num frames: 55618560. Throughput: 0: 13176.3. Samples: 55617477. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:20:16,237][81074] Avg episode reward: [(0, '2671.908')] [2023-03-07 00:20:16,583][81400] Updated weights for policy 0, policy_version 54320 (0.0006) [2023-03-07 00:20:17,361][81400] Updated weights for policy 0, policy_version 54330 (0.0007) [2023-03-07 00:20:18,134][81400] Updated weights for policy 0, policy_version 54340 (0.0006) [2023-03-07 00:20:18,905][81400] Updated weights for policy 0, policy_version 54350 (0.0007) [2023-03-07 00:20:19,670][81400] Updated weights for policy 0, policy_version 54360 (0.0008) [2023-03-07 00:20:20,449][81400] Updated weights for policy 0, policy_version 54370 (0.0006) [2023-03-07 00:20:21,222][81400] Updated weights for policy 0, policy_version 54380 (0.0005) [2023-03-07 00:20:21,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13190.5). Total num frames: 55685120. Throughput: 0: 13171.1. Samples: 55656993. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:20:21,247][81074] Avg episode reward: [(0, '2560.826')] [2023-03-07 00:20:22,006][81400] Updated weights for policy 0, policy_version 54390 (0.0007) [2023-03-07 00:20:22,786][81400] Updated weights for policy 0, policy_version 54400 (0.0007) [2023-03-07 00:20:23,552][81400] Updated weights for policy 0, policy_version 54410 (0.0007) [2023-03-07 00:20:24,323][81400] Updated weights for policy 0, policy_version 54420 (0.0006) [2023-03-07 00:20:25,125][81400] Updated weights for policy 0, policy_version 54430 (0.0006) [2023-03-07 00:20:25,894][81400] Updated weights for policy 0, policy_version 54440 (0.0006) [2023-03-07 00:20:26,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13175.5, 300 sec: 13190.5). Total num frames: 55750656. Throughput: 0: 13163.0. Samples: 55736090. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:20:26,247][81074] Avg episode reward: [(0, '2801.646')] [2023-03-07 00:20:26,667][81400] Updated weights for policy 0, policy_version 54450 (0.0006) [2023-03-07 00:20:27,458][81400] Updated weights for policy 0, policy_version 54460 (0.0006) [2023-03-07 00:20:28,230][81400] Updated weights for policy 0, policy_version 54470 (0.0006) [2023-03-07 00:20:29,001][81400] Updated weights for policy 0, policy_version 54480 (0.0006) [2023-03-07 00:20:29,770][81400] Updated weights for policy 0, policy_version 54490 (0.0006) [2023-03-07 00:20:30,538][81400] Updated weights for policy 0, policy_version 54500 (0.0006) [2023-03-07 00:20:31,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.6, 300 sec: 13194.0). Total num frames: 55817216. Throughput: 0: 13170.0. Samples: 55815561. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:20:31,237][81074] Avg episode reward: [(0, '2804.710')] [2023-03-07 00:20:31,322][81400] Updated weights for policy 0, policy_version 54510 (0.0006) [2023-03-07 00:20:32,089][81400] Updated weights for policy 0, policy_version 54520 (0.0007) [2023-03-07 00:20:32,859][81400] Updated weights for policy 0, policy_version 54530 (0.0006) [2023-03-07 00:20:33,640][81400] Updated weights for policy 0, policy_version 54540 (0.0006) [2023-03-07 00:20:34,403][81400] Updated weights for policy 0, policy_version 54550 (0.0006) [2023-03-07 00:20:35,189][81400] Updated weights for policy 0, policy_version 54560 (0.0006) [2023-03-07 00:20:35,985][81400] Updated weights for policy 0, policy_version 54570 (0.0007) [2023-03-07 00:20:36,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13194.0). Total num frames: 55882752. Throughput: 0: 13171.2. Samples: 55855131. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:20:36,237][81074] Avg episode reward: [(0, '2732.768')] [2023-03-07 00:20:36,755][81400] Updated weights for policy 0, policy_version 54580 (0.0005) [2023-03-07 00:20:37,530][81400] Updated weights for policy 0, policy_version 54590 (0.0006) [2023-03-07 00:20:38,306][81400] Updated weights for policy 0, policy_version 54600 (0.0007) [2023-03-07 00:20:39,078][81400] Updated weights for policy 0, policy_version 54610 (0.0006) [2023-03-07 00:20:39,846][81400] Updated weights for policy 0, policy_version 54620 (0.0006) [2023-03-07 00:20:40,627][81400] Updated weights for policy 0, policy_version 54630 (0.0006) [2023-03-07 00:20:41,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13190.5). Total num frames: 55948288. Throughput: 0: 13176.0. Samples: 55934255. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:20:41,237][81074] Avg episode reward: [(0, '2697.091')] [2023-03-07 00:20:41,401][81400] Updated weights for policy 0, policy_version 54640 (0.0006) [2023-03-07 00:20:42,156][81400] Updated weights for policy 0, policy_version 54650 (0.0006) [2023-03-07 00:20:42,951][81400] Updated weights for policy 0, policy_version 54660 (0.0006) [2023-03-07 00:20:43,716][81400] Updated weights for policy 0, policy_version 54670 (0.0007) [2023-03-07 00:20:44,518][81400] Updated weights for policy 0, policy_version 54680 (0.0007) [2023-03-07 00:20:45,297][81400] Updated weights for policy 0, policy_version 54690 (0.0006) [2023-03-07 00:20:46,064][81400] Updated weights for policy 0, policy_version 54700 (0.0005) [2023-03-07 00:20:46,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13194.0). Total num frames: 56014848. Throughput: 0: 13187.1. Samples: 56013526. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:20:46,237][81074] Avg episode reward: [(0, '2600.136')] [2023-03-07 00:20:46,831][81400] Updated weights for policy 0, policy_version 54710 (0.0006) [2023-03-07 00:20:47,611][81400] Updated weights for policy 0, policy_version 54720 (0.0006) [2023-03-07 00:20:48,394][81400] Updated weights for policy 0, policy_version 54730 (0.0006) [2023-03-07 00:20:49,186][81400] Updated weights for policy 0, policy_version 54740 (0.0006) [2023-03-07 00:20:49,965][81400] Updated weights for policy 0, policy_version 54750 (0.0006) [2023-03-07 00:20:50,734][81400] Updated weights for policy 0, policy_version 54760 (0.0005) [2023-03-07 00:20:51,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13175.5, 300 sec: 13190.5). Total num frames: 56080384. Throughput: 0: 13190.0. Samples: 56052895. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:20:51,237][81074] Avg episode reward: [(0, '2600.511')] [2023-03-07 00:20:51,498][81400] Updated weights for policy 0, policy_version 54770 (0.0006) [2023-03-07 00:20:52,268][81400] Updated weights for policy 0, policy_version 54780 (0.0006) [2023-03-07 00:20:53,058][81400] Updated weights for policy 0, policy_version 54790 (0.0006) [2023-03-07 00:20:53,829][81400] Updated weights for policy 0, policy_version 54800 (0.0006) [2023-03-07 00:20:54,595][81400] Updated weights for policy 0, policy_version 54810 (0.0005) [2023-03-07 00:20:55,372][81400] Updated weights for policy 0, policy_version 54820 (0.0006) [2023-03-07 00:20:56,136][81400] Updated weights for policy 0, policy_version 54830 (0.0006) [2023-03-07 00:20:56,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13192.5, 300 sec: 13190.5). Total num frames: 56146944. Throughput: 0: 13194.0. Samples: 56132246. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:20:56,237][81074] Avg episode reward: [(0, '2557.083')] [2023-03-07 00:20:56,915][81400] Updated weights for policy 0, policy_version 54840 (0.0006) [2023-03-07 00:20:57,690][81400] Updated weights for policy 0, policy_version 54850 (0.0007) [2023-03-07 00:20:58,482][81400] Updated weights for policy 0, policy_version 54860 (0.0006) [2023-03-07 00:20:59,245][81400] Updated weights for policy 0, policy_version 54870 (0.0006) [2023-03-07 00:21:00,031][81400] Updated weights for policy 0, policy_version 54880 (0.0006) [2023-03-07 00:21:00,818][81400] Updated weights for policy 0, policy_version 54890 (0.0006) [2023-03-07 00:21:01,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13192.6, 300 sec: 13190.5). Total num frames: 56212480. Throughput: 0: 13197.4. Samples: 56211362. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:21:01,237][81074] Avg episode reward: [(0, '2573.093')] [2023-03-07 00:21:01,591][81400] Updated weights for policy 0, policy_version 54900 (0.0006) [2023-03-07 00:21:02,365][81400] Updated weights for policy 0, policy_version 54910 (0.0006) [2023-03-07 00:21:03,135][81400] Updated weights for policy 0, policy_version 54920 (0.0006) [2023-03-07 00:21:03,913][81400] Updated weights for policy 0, policy_version 54930 (0.0005) [2023-03-07 00:21:04,692][81400] Updated weights for policy 0, policy_version 54940 (0.0006) [2023-03-07 00:21:05,477][81400] Updated weights for policy 0, policy_version 54950 (0.0005) [2023-03-07 00:21:06,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13192.5, 300 sec: 13187.0). Total num frames: 56278016. Throughput: 0: 13195.3. Samples: 56250783. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:21:06,237][81074] Avg episode reward: [(0, '2781.650')] [2023-03-07 00:21:06,255][81400] Updated weights for policy 0, policy_version 54960 (0.0006) [2023-03-07 00:21:07,022][81400] Updated weights for policy 0, policy_version 54970 (0.0006) [2023-03-07 00:21:07,820][81400] Updated weights for policy 0, policy_version 54980 (0.0006) [2023-03-07 00:21:08,593][81400] Updated weights for policy 0, policy_version 54990 (0.0006) [2023-03-07 00:21:09,369][81400] Updated weights for policy 0, policy_version 55000 (0.0007) [2023-03-07 00:21:10,151][81400] Updated weights for policy 0, policy_version 55010 (0.0006) [2023-03-07 00:21:10,926][81400] Updated weights for policy 0, policy_version 55020 (0.0006) [2023-03-07 00:21:11,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13190.5). Total num frames: 56344576. Throughput: 0: 13192.2. Samples: 56329736. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:21:11,237][81074] Avg episode reward: [(0, '2692.119')] [2023-03-07 00:21:11,697][81400] Updated weights for policy 0, policy_version 55030 (0.0006) [2023-03-07 00:21:12,475][81400] Updated weights for policy 0, policy_version 55040 (0.0006) [2023-03-07 00:21:13,255][81400] Updated weights for policy 0, policy_version 55050 (0.0007) [2023-03-07 00:21:14,021][81400] Updated weights for policy 0, policy_version 55060 (0.0006) [2023-03-07 00:21:14,809][81400] Updated weights for policy 0, policy_version 55070 (0.0006) [2023-03-07 00:21:15,590][81400] Updated weights for policy 0, policy_version 55080 (0.0006) [2023-03-07 00:21:16,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13190.5). Total num frames: 56410112. Throughput: 0: 13178.1. Samples: 56408577. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:21:16,237][81074] Avg episode reward: [(0, '2632.385')] [2023-03-07 00:21:16,366][81400] Updated weights for policy 0, policy_version 55090 (0.0006) [2023-03-07 00:21:17,141][81400] Updated weights for policy 0, policy_version 55100 (0.0006) [2023-03-07 00:21:17,928][81400] Updated weights for policy 0, policy_version 55110 (0.0005) [2023-03-07 00:21:18,693][81400] Updated weights for policy 0, policy_version 55120 (0.0006) [2023-03-07 00:21:19,481][81400] Updated weights for policy 0, policy_version 55130 (0.0006) [2023-03-07 00:21:20,256][81400] Updated weights for policy 0, policy_version 55140 (0.0007) [2023-03-07 00:21:21,009][81400] Updated weights for policy 0, policy_version 55150 (0.0006) [2023-03-07 00:21:21,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13175.5, 300 sec: 13187.0). Total num frames: 56475648. Throughput: 0: 13181.1. Samples: 56448282. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:21:21,237][81074] Avg episode reward: [(0, '2419.398')] [2023-03-07 00:21:21,819][81400] Updated weights for policy 0, policy_version 55160 (0.0006) [2023-03-07 00:21:22,603][81400] Updated weights for policy 0, policy_version 55170 (0.0006) [2023-03-07 00:21:23,356][81400] Updated weights for policy 0, policy_version 55180 (0.0006) [2023-03-07 00:21:24,131][81400] Updated weights for policy 0, policy_version 55190 (0.0006) [2023-03-07 00:21:24,909][81400] Updated weights for policy 0, policy_version 55200 (0.0006) [2023-03-07 00:21:25,694][81400] Updated weights for policy 0, policy_version 55210 (0.0006) [2023-03-07 00:21:26,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13175.5, 300 sec: 13183.6). Total num frames: 56541184. Throughput: 0: 13180.8. Samples: 56527391. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:21:26,237][81074] Avg episode reward: [(0, '2524.987')] [2023-03-07 00:21:26,453][81400] Updated weights for policy 0, policy_version 55220 (0.0006) [2023-03-07 00:21:27,244][81400] Updated weights for policy 0, policy_version 55230 (0.0007) [2023-03-07 00:21:28,011][81400] Updated weights for policy 0, policy_version 55240 (0.0005) [2023-03-07 00:21:28,790][81400] Updated weights for policy 0, policy_version 55250 (0.0006) [2023-03-07 00:21:29,590][81400] Updated weights for policy 0, policy_version 55260 (0.0006) [2023-03-07 00:21:30,359][81400] Updated weights for policy 0, policy_version 55270 (0.0006) [2023-03-07 00:21:31,138][81400] Updated weights for policy 0, policy_version 55280 (0.0005) [2023-03-07 00:21:31,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13175.5, 300 sec: 13187.0). Total num frames: 56607744. Throughput: 0: 13171.7. Samples: 56606254. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:21:31,237][81074] Avg episode reward: [(0, '2330.122')] [2023-03-07 00:21:31,922][81400] Updated weights for policy 0, policy_version 55290 (0.0007) [2023-03-07 00:21:32,714][81400] Updated weights for policy 0, policy_version 55300 (0.0006) [2023-03-07 00:21:33,502][81400] Updated weights for policy 0, policy_version 55310 (0.0006) [2023-03-07 00:21:34,263][81400] Updated weights for policy 0, policy_version 55320 (0.0007) [2023-03-07 00:21:35,040][81400] Updated weights for policy 0, policy_version 55330 (0.0005) [2023-03-07 00:21:35,831][81400] Updated weights for policy 0, policy_version 55340 (0.0006) [2023-03-07 00:21:36,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13175.5, 300 sec: 13187.0). Total num frames: 56673280. Throughput: 0: 13172.7. Samples: 56645666. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:21:36,237][81074] Avg episode reward: [(0, '2353.205')] [2023-03-07 00:21:36,605][81400] Updated weights for policy 0, policy_version 55350 (0.0007) [2023-03-07 00:21:37,388][81400] Updated weights for policy 0, policy_version 55360 (0.0007) [2023-03-07 00:21:38,161][81400] Updated weights for policy 0, policy_version 55370 (0.0006) [2023-03-07 00:21:38,935][81400] Updated weights for policy 0, policy_version 55380 (0.0006) [2023-03-07 00:21:39,733][81400] Updated weights for policy 0, policy_version 55390 (0.0006) [2023-03-07 00:21:40,497][81400] Updated weights for policy 0, policy_version 55400 (0.0006) [2023-03-07 00:21:41,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13175.5, 300 sec: 13187.0). Total num frames: 56738816. Throughput: 0: 13157.1. Samples: 56724314. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:21:41,237][81074] Avg episode reward: [(0, '2546.332')] [2023-03-07 00:21:41,270][81400] Updated weights for policy 0, policy_version 55410 (0.0007) [2023-03-07 00:21:42,050][81400] Updated weights for policy 0, policy_version 55420 (0.0006) [2023-03-07 00:21:42,833][81400] Updated weights for policy 0, policy_version 55430 (0.0006) [2023-03-07 00:21:43,600][81400] Updated weights for policy 0, policy_version 55440 (0.0007) [2023-03-07 00:21:44,360][81400] Updated weights for policy 0, policy_version 55450 (0.0007) [2023-03-07 00:21:45,155][81400] Updated weights for policy 0, policy_version 55460 (0.0006) [2023-03-07 00:21:45,955][81400] Updated weights for policy 0, policy_version 55470 (0.0006) [2023-03-07 00:21:46,236][81074] Fps is (10 sec: 13107.4, 60 sec: 13158.4, 300 sec: 13183.6). Total num frames: 56804352. Throughput: 0: 13157.5. Samples: 56803451. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:21:46,237][81074] Avg episode reward: [(0, '2453.657')] [2023-03-07 00:21:46,715][81400] Updated weights for policy 0, policy_version 55480 (0.0006) [2023-03-07 00:21:47,513][81400] Updated weights for policy 0, policy_version 55490 (0.0006) [2023-03-07 00:21:48,270][81400] Updated weights for policy 0, policy_version 55500 (0.0006) [2023-03-07 00:21:49,058][81400] Updated weights for policy 0, policy_version 55510 (0.0006) [2023-03-07 00:21:49,846][81400] Updated weights for policy 0, policy_version 55520 (0.0006) [2023-03-07 00:21:50,620][81400] Updated weights for policy 0, policy_version 55530 (0.0006) [2023-03-07 00:21:51,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13175.5, 300 sec: 13187.0). Total num frames: 56870912. Throughput: 0: 13161.3. Samples: 56843040. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:21:51,237][81074] Avg episode reward: [(0, '2291.713')] [2023-03-07 00:21:51,379][81400] Updated weights for policy 0, policy_version 55540 (0.0005) [2023-03-07 00:21:52,150][81400] Updated weights for policy 0, policy_version 55550 (0.0006) [2023-03-07 00:21:52,920][81400] Updated weights for policy 0, policy_version 55560 (0.0007) [2023-03-07 00:21:53,698][81400] Updated weights for policy 0, policy_version 55570 (0.0005) [2023-03-07 00:21:54,481][81400] Updated weights for policy 0, policy_version 55580 (0.0006) [2023-03-07 00:21:55,256][81400] Updated weights for policy 0, policy_version 55590 (0.0006) [2023-03-07 00:21:56,033][81400] Updated weights for policy 0, policy_version 55600 (0.0005) [2023-03-07 00:21:56,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13158.4, 300 sec: 13183.6). Total num frames: 56936448. Throughput: 0: 13165.2. Samples: 56922173. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:21:56,237][81074] Avg episode reward: [(0, '2507.122')] [2023-03-07 00:21:56,243][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000055602_56936448.pth... [2023-03-07 00:21:56,273][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000052512_53772288.pth [2023-03-07 00:21:56,812][81400] Updated weights for policy 0, policy_version 55610 (0.0006) [2023-03-07 00:21:57,598][81400] Updated weights for policy 0, policy_version 55620 (0.0006) [2023-03-07 00:21:58,371][81400] Updated weights for policy 0, policy_version 55630 (0.0006) [2023-03-07 00:21:59,147][81400] Updated weights for policy 0, policy_version 55640 (0.0005) [2023-03-07 00:21:59,923][81400] Updated weights for policy 0, policy_version 55650 (0.0005) [2023-03-07 00:22:00,698][81400] Updated weights for policy 0, policy_version 55660 (0.0006) [2023-03-07 00:22:01,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13183.6). Total num frames: 57001984. Throughput: 0: 13166.1. Samples: 57001053. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:22:01,237][81074] Avg episode reward: [(0, '2448.579')] [2023-03-07 00:22:01,498][81400] Updated weights for policy 0, policy_version 55670 (0.0005) [2023-03-07 00:22:02,298][81400] Updated weights for policy 0, policy_version 55680 (0.0006) [2023-03-07 00:22:03,056][81400] Updated weights for policy 0, policy_version 55690 (0.0006) [2023-03-07 00:22:03,829][81400] Updated weights for policy 0, policy_version 55700 (0.0006) [2023-03-07 00:22:04,602][81400] Updated weights for policy 0, policy_version 55710 (0.0006) [2023-03-07 00:22:05,383][81400] Updated weights for policy 0, policy_version 55720 (0.0006) [2023-03-07 00:22:06,174][81400] Updated weights for policy 0, policy_version 55730 (0.0006) [2023-03-07 00:22:06,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.4, 300 sec: 13183.6). Total num frames: 57068544. Throughput: 0: 13159.5. Samples: 57040461. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:22:06,237][81074] Avg episode reward: [(0, '2562.807')] [2023-03-07 00:22:06,954][81400] Updated weights for policy 0, policy_version 55740 (0.0006) [2023-03-07 00:22:07,736][81400] Updated weights for policy 0, policy_version 55750 (0.0005) [2023-03-07 00:22:08,530][81400] Updated weights for policy 0, policy_version 55760 (0.0006) [2023-03-07 00:22:09,304][81400] Updated weights for policy 0, policy_version 55770 (0.0006) [2023-03-07 00:22:10,100][81400] Updated weights for policy 0, policy_version 55780 (0.0008) [2023-03-07 00:22:10,861][81400] Updated weights for policy 0, policy_version 55790 (0.0006) [2023-03-07 00:22:11,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13180.1). Total num frames: 57133056. Throughput: 0: 13146.1. Samples: 57118966. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:22:11,237][81074] Avg episode reward: [(0, '2590.559')] [2023-03-07 00:22:11,619][81400] Updated weights for policy 0, policy_version 55800 (0.0006) [2023-03-07 00:22:12,408][81400] Updated weights for policy 0, policy_version 55810 (0.0006) [2023-03-07 00:22:13,169][81400] Updated weights for policy 0, policy_version 55820 (0.0006) [2023-03-07 00:22:13,942][81349] KL-divergence is very high: 126613.3516 [2023-03-07 00:22:13,949][81400] Updated weights for policy 0, policy_version 55830 (0.0006) [2023-03-07 00:22:14,729][81400] Updated weights for policy 0, policy_version 55840 (0.0007) [2023-03-07 00:22:15,499][81400] Updated weights for policy 0, policy_version 55850 (0.0005) [2023-03-07 00:22:16,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13183.6). Total num frames: 57199616. Throughput: 0: 13161.4. Samples: 57198517. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:22:16,237][81074] Avg episode reward: [(0, '2588.504')] [2023-03-07 00:22:16,254][81400] Updated weights for policy 0, policy_version 55860 (0.0006) [2023-03-07 00:22:17,040][81400] Updated weights for policy 0, policy_version 55870 (0.0007) [2023-03-07 00:22:17,823][81400] Updated weights for policy 0, policy_version 55880 (0.0006) [2023-03-07 00:22:18,596][81400] Updated weights for policy 0, policy_version 55890 (0.0006) [2023-03-07 00:22:19,377][81400] Updated weights for policy 0, policy_version 55900 (0.0006) [2023-03-07 00:22:20,143][81400] Updated weights for policy 0, policy_version 55910 (0.0007) [2023-03-07 00:22:20,925][81400] Updated weights for policy 0, policy_version 55920 (0.0006) [2023-03-07 00:22:21,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13175.5, 300 sec: 13187.0). Total num frames: 57266176. Throughput: 0: 13163.4. Samples: 57238016. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:22:21,237][81074] Avg episode reward: [(0, '2398.327')] [2023-03-07 00:22:21,691][81400] Updated weights for policy 0, policy_version 55930 (0.0006) [2023-03-07 00:22:22,459][81400] Updated weights for policy 0, policy_version 55940 (0.0006) [2023-03-07 00:22:23,244][81400] Updated weights for policy 0, policy_version 55950 (0.0006) [2023-03-07 00:22:24,025][81400] Updated weights for policy 0, policy_version 55960 (0.0006) [2023-03-07 00:22:24,782][81400] Updated weights for policy 0, policy_version 55970 (0.0007) [2023-03-07 00:22:25,555][81400] Updated weights for policy 0, policy_version 55980 (0.0006) [2023-03-07 00:22:26,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13183.6). Total num frames: 57331712. Throughput: 0: 13182.7. Samples: 57317536. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:22:26,237][81074] Avg episode reward: [(0, '2266.277')] [2023-03-07 00:22:26,303][81400] Updated weights for policy 0, policy_version 55990 (0.0006) [2023-03-07 00:22:27,084][81400] Updated weights for policy 0, policy_version 56000 (0.0006) [2023-03-07 00:22:27,866][81400] Updated weights for policy 0, policy_version 56010 (0.0007) [2023-03-07 00:22:28,657][81400] Updated weights for policy 0, policy_version 56020 (0.0006) [2023-03-07 00:22:29,430][81400] Updated weights for policy 0, policy_version 56030 (0.0006) [2023-03-07 00:22:30,189][81400] Updated weights for policy 0, policy_version 56040 (0.0005) [2023-03-07 00:22:30,959][81400] Updated weights for policy 0, policy_version 56050 (0.0006) [2023-03-07 00:22:31,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13175.5, 300 sec: 13187.0). Total num frames: 57398272. Throughput: 0: 13187.1. Samples: 57396871. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:22:31,237][81074] Avg episode reward: [(0, '2301.235')] [2023-03-07 00:22:31,743][81400] Updated weights for policy 0, policy_version 56060 (0.0006) [2023-03-07 00:22:32,529][81400] Updated weights for policy 0, policy_version 56070 (0.0006) [2023-03-07 00:22:33,313][81400] Updated weights for policy 0, policy_version 56080 (0.0006) [2023-03-07 00:22:34,094][81400] Updated weights for policy 0, policy_version 56090 (0.0005) [2023-03-07 00:22:34,875][81400] Updated weights for policy 0, policy_version 56100 (0.0006) [2023-03-07 00:22:35,646][81400] Updated weights for policy 0, policy_version 56110 (0.0006) [2023-03-07 00:22:36,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13183.6). Total num frames: 57463808. Throughput: 0: 13181.3. Samples: 57436198. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:22:36,237][81074] Avg episode reward: [(0, '2401.267')] [2023-03-07 00:22:36,442][81400] Updated weights for policy 0, policy_version 56120 (0.0006) [2023-03-07 00:22:37,227][81400] Updated weights for policy 0, policy_version 56130 (0.0007) [2023-03-07 00:22:37,997][81400] Updated weights for policy 0, policy_version 56140 (0.0005) [2023-03-07 00:22:38,781][81400] Updated weights for policy 0, policy_version 56150 (0.0006) [2023-03-07 00:22:39,541][81400] Updated weights for policy 0, policy_version 56160 (0.0006) [2023-03-07 00:22:40,312][81400] Updated weights for policy 0, policy_version 56170 (0.0007) [2023-03-07 00:22:41,100][81400] Updated weights for policy 0, policy_version 56180 (0.0006) [2023-03-07 00:22:41,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13175.5, 300 sec: 13183.6). Total num frames: 57529344. Throughput: 0: 13180.7. Samples: 57515305. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:22:41,237][81074] Avg episode reward: [(0, '2400.026')] [2023-03-07 00:22:41,872][81400] Updated weights for policy 0, policy_version 56190 (0.0006) [2023-03-07 00:22:42,643][81400] Updated weights for policy 0, policy_version 56200 (0.0006) [2023-03-07 00:22:43,424][81400] Updated weights for policy 0, policy_version 56210 (0.0006) [2023-03-07 00:22:44,221][81400] Updated weights for policy 0, policy_version 56220 (0.0006) [2023-03-07 00:22:44,990][81400] Updated weights for policy 0, policy_version 56230 (0.0006) [2023-03-07 00:22:45,767][81400] Updated weights for policy 0, policy_version 56240 (0.0006) [2023-03-07 00:22:46,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13175.5, 300 sec: 13183.6). Total num frames: 57594880. Throughput: 0: 13179.7. Samples: 57594138. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:22:46,237][81074] Avg episode reward: [(0, '2464.027')] [2023-03-07 00:22:46,553][81400] Updated weights for policy 0, policy_version 56250 (0.0006) [2023-03-07 00:22:47,324][81400] Updated weights for policy 0, policy_version 56260 (0.0006) [2023-03-07 00:22:48,102][81400] Updated weights for policy 0, policy_version 56270 (0.0006) [2023-03-07 00:22:48,875][81400] Updated weights for policy 0, policy_version 56280 (0.0006) [2023-03-07 00:22:49,656][81400] Updated weights for policy 0, policy_version 56290 (0.0006) [2023-03-07 00:22:50,420][81400] Updated weights for policy 0, policy_version 56300 (0.0006) [2023-03-07 00:22:51,182][81400] Updated weights for policy 0, policy_version 56310 (0.0006) [2023-03-07 00:22:51,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13183.6). Total num frames: 57661440. Throughput: 0: 13188.3. Samples: 57633934. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:22:51,237][81074] Avg episode reward: [(0, '2555.419')] [2023-03-07 00:22:51,962][81400] Updated weights for policy 0, policy_version 56320 (0.0006) [2023-03-07 00:22:52,741][81400] Updated weights for policy 0, policy_version 56330 (0.0006) [2023-03-07 00:22:53,518][81400] Updated weights for policy 0, policy_version 56340 (0.0006) [2023-03-07 00:22:54,294][81400] Updated weights for policy 0, policy_version 56350 (0.0007) [2023-03-07 00:22:55,085][81400] Updated weights for policy 0, policy_version 56360 (0.0005) [2023-03-07 00:22:55,839][81400] Updated weights for policy 0, policy_version 56370 (0.0007) [2023-03-07 00:22:56,236][81074] Fps is (10 sec: 13312.1, 60 sec: 13192.6, 300 sec: 13183.6). Total num frames: 57728000. Throughput: 0: 13199.8. Samples: 57712958. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:22:56,237][81074] Avg episode reward: [(0, '2586.156')] [2023-03-07 00:22:56,598][81400] Updated weights for policy 0, policy_version 56380 (0.0005) [2023-03-07 00:22:57,354][81400] Updated weights for policy 0, policy_version 56390 (0.0006) [2023-03-07 00:22:58,145][81400] Updated weights for policy 0, policy_version 56400 (0.0007) [2023-03-07 00:22:58,917][81400] Updated weights for policy 0, policy_version 56410 (0.0006) [2023-03-07 00:22:59,686][81400] Updated weights for policy 0, policy_version 56420 (0.0007) [2023-03-07 00:23:00,471][81400] Updated weights for policy 0, policy_version 56430 (0.0006) [2023-03-07 00:23:01,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13192.5, 300 sec: 13183.6). Total num frames: 57793536. Throughput: 0: 13201.0. Samples: 57792563. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:23:01,237][81074] Avg episode reward: [(0, '2410.980')] [2023-03-07 00:23:01,259][81400] Updated weights for policy 0, policy_version 56440 (0.0006) [2023-03-07 00:23:02,031][81400] Updated weights for policy 0, policy_version 56450 (0.0006) [2023-03-07 00:23:02,829][81400] Updated weights for policy 0, policy_version 56460 (0.0007) [2023-03-07 00:23:03,590][81400] Updated weights for policy 0, policy_version 56470 (0.0006) [2023-03-07 00:23:04,380][81400] Updated weights for policy 0, policy_version 56480 (0.0006) [2023-03-07 00:23:05,158][81400] Updated weights for policy 0, policy_version 56490 (0.0006) [2023-03-07 00:23:05,931][81400] Updated weights for policy 0, policy_version 56500 (0.0007) [2023-03-07 00:23:06,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13175.5, 300 sec: 13183.6). Total num frames: 57859072. Throughput: 0: 13198.5. Samples: 57831950. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:23:06,237][81074] Avg episode reward: [(0, '2644.495')] [2023-03-07 00:23:06,709][81400] Updated weights for policy 0, policy_version 56510 (0.0006) [2023-03-07 00:23:07,496][81400] Updated weights for policy 0, policy_version 56520 (0.0006) [2023-03-07 00:23:08,284][81400] Updated weights for policy 0, policy_version 56530 (0.0006) [2023-03-07 00:23:09,063][81400] Updated weights for policy 0, policy_version 56540 (0.0006) [2023-03-07 00:23:09,835][81400] Updated weights for policy 0, policy_version 56550 (0.0006) [2023-03-07 00:23:10,611][81400] Updated weights for policy 0, policy_version 56560 (0.0007) [2023-03-07 00:23:11,236][81074] Fps is (10 sec: 13107.4, 60 sec: 13192.5, 300 sec: 13183.6). Total num frames: 57924608. Throughput: 0: 13184.6. Samples: 57910844. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:23:11,237][81074] Avg episode reward: [(0, '2700.574')] [2023-03-07 00:23:11,394][81400] Updated weights for policy 0, policy_version 56570 (0.0007) [2023-03-07 00:23:12,164][81400] Updated weights for policy 0, policy_version 56580 (0.0006) [2023-03-07 00:23:12,937][81400] Updated weights for policy 0, policy_version 56590 (0.0006) [2023-03-07 00:23:13,717][81400] Updated weights for policy 0, policy_version 56600 (0.0007) [2023-03-07 00:23:14,505][81400] Updated weights for policy 0, policy_version 56610 (0.0006) [2023-03-07 00:23:15,277][81400] Updated weights for policy 0, policy_version 56620 (0.0007) [2023-03-07 00:23:16,063][81400] Updated weights for policy 0, policy_version 56630 (0.0007) [2023-03-07 00:23:16,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13192.5, 300 sec: 13183.6). Total num frames: 57991168. Throughput: 0: 13175.8. Samples: 57989782. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:23:16,237][81074] Avg episode reward: [(0, '2609.654')] [2023-03-07 00:23:16,821][81400] Updated weights for policy 0, policy_version 56640 (0.0006) [2023-03-07 00:23:17,608][81400] Updated weights for policy 0, policy_version 56650 (0.0007) [2023-03-07 00:23:18,368][81400] Updated weights for policy 0, policy_version 56660 (0.0007) [2023-03-07 00:23:19,144][81400] Updated weights for policy 0, policy_version 56670 (0.0006) [2023-03-07 00:23:19,911][81400] Updated weights for policy 0, policy_version 56680 (0.0006) [2023-03-07 00:23:20,700][81400] Updated weights for policy 0, policy_version 56690 (0.0006) [2023-03-07 00:23:21,236][81074] Fps is (10 sec: 13311.9, 60 sec: 13192.5, 300 sec: 13183.6). Total num frames: 58057728. Throughput: 0: 13187.6. Samples: 58029637. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:23:21,247][81074] Avg episode reward: [(0, '2863.460')] [2023-03-07 00:23:21,473][81400] Updated weights for policy 0, policy_version 56700 (0.0007) [2023-03-07 00:23:22,252][81400] Updated weights for policy 0, policy_version 56710 (0.0007) [2023-03-07 00:23:23,021][81400] Updated weights for policy 0, policy_version 56720 (0.0006) [2023-03-07 00:23:23,792][81400] Updated weights for policy 0, policy_version 56730 (0.0006) [2023-03-07 00:23:24,559][81400] Updated weights for policy 0, policy_version 56740 (0.0006) [2023-03-07 00:23:25,322][81400] Updated weights for policy 0, policy_version 56750 (0.0006) [2023-03-07 00:23:26,086][81400] Updated weights for policy 0, policy_version 56760 (0.0007) [2023-03-07 00:23:26,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13192.5, 300 sec: 13183.6). Total num frames: 58123264. Throughput: 0: 13193.1. Samples: 58108993. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:23:26,247][81074] Avg episode reward: [(0, '2861.032')] [2023-03-07 00:23:26,877][81400] Updated weights for policy 0, policy_version 56770 (0.0006) [2023-03-07 00:23:27,664][81400] Updated weights for policy 0, policy_version 56780 (0.0006) [2023-03-07 00:23:28,447][81400] Updated weights for policy 0, policy_version 56790 (0.0006) [2023-03-07 00:23:29,225][81400] Updated weights for policy 0, policy_version 56800 (0.0007) [2023-03-07 00:23:30,001][81400] Updated weights for policy 0, policy_version 56810 (0.0006) [2023-03-07 00:23:30,782][81400] Updated weights for policy 0, policy_version 56820 (0.0007) [2023-03-07 00:23:31,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13187.0). Total num frames: 58189824. Throughput: 0: 13197.1. Samples: 58188006. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:23:31,248][81074] Avg episode reward: [(0, '2694.111')] [2023-03-07 00:23:31,557][81400] Updated weights for policy 0, policy_version 56830 (0.0006) [2023-03-07 00:23:32,332][81400] Updated weights for policy 0, policy_version 56840 (0.0006) [2023-03-07 00:23:33,124][81400] Updated weights for policy 0, policy_version 56850 (0.0006) [2023-03-07 00:23:33,884][81400] Updated weights for policy 0, policy_version 56860 (0.0006) [2023-03-07 00:23:34,668][81400] Updated weights for policy 0, policy_version 56870 (0.0006) [2023-03-07 00:23:35,440][81400] Updated weights for policy 0, policy_version 56880 (0.0006) [2023-03-07 00:23:36,219][81400] Updated weights for policy 0, policy_version 56890 (0.0007) [2023-03-07 00:23:36,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13192.5, 300 sec: 13183.6). Total num frames: 58255360. Throughput: 0: 13189.7. Samples: 58227473. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:23:36,247][81074] Avg episode reward: [(0, '2762.047')] [2023-03-07 00:23:36,995][81400] Updated weights for policy 0, policy_version 56900 (0.0006) [2023-03-07 00:23:37,764][81400] Updated weights for policy 0, policy_version 56910 (0.0007) [2023-03-07 00:23:38,545][81400] Updated weights for policy 0, policy_version 56920 (0.0006) [2023-03-07 00:23:39,334][81400] Updated weights for policy 0, policy_version 56930 (0.0007) [2023-03-07 00:23:40,097][81400] Updated weights for policy 0, policy_version 56940 (0.0006) [2023-03-07 00:23:40,876][81400] Updated weights for policy 0, policy_version 56950 (0.0006) [2023-03-07 00:23:41,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13192.5, 300 sec: 13183.6). Total num frames: 58320896. Throughput: 0: 13194.4. Samples: 58306704. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:23:41,247][81074] Avg episode reward: [(0, '2620.332')] [2023-03-07 00:23:41,646][81400] Updated weights for policy 0, policy_version 56960 (0.0006) [2023-03-07 00:23:42,427][81400] Updated weights for policy 0, policy_version 56970 (0.0007) [2023-03-07 00:23:43,198][81400] Updated weights for policy 0, policy_version 56980 (0.0006) [2023-03-07 00:23:43,990][81400] Updated weights for policy 0, policy_version 56990 (0.0006) [2023-03-07 00:23:44,767][81400] Updated weights for policy 0, policy_version 57000 (0.0006) [2023-03-07 00:23:45,549][81400] Updated weights for policy 0, policy_version 57010 (0.0006) [2023-03-07 00:23:46,236][81074] Fps is (10 sec: 13107.4, 60 sec: 13192.5, 300 sec: 13183.6). Total num frames: 58386432. Throughput: 0: 13177.3. Samples: 58385540. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:23:46,247][81074] Avg episode reward: [(0, '2927.666')] [2023-03-07 00:23:46,338][81400] Updated weights for policy 0, policy_version 57020 (0.0006) [2023-03-07 00:23:47,109][81400] Updated weights for policy 0, policy_version 57030 (0.0007) [2023-03-07 00:23:47,881][81400] Updated weights for policy 0, policy_version 57040 (0.0006) [2023-03-07 00:23:48,661][81400] Updated weights for policy 0, policy_version 57050 (0.0006) [2023-03-07 00:23:49,430][81400] Updated weights for policy 0, policy_version 57060 (0.0006) [2023-03-07 00:23:50,219][81400] Updated weights for policy 0, policy_version 57070 (0.0007) [2023-03-07 00:23:51,013][81400] Updated weights for policy 0, policy_version 57080 (0.0006) [2023-03-07 00:23:51,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13192.6, 300 sec: 13183.6). Total num frames: 58452992. Throughput: 0: 13178.0. Samples: 58424960. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:23:51,247][81074] Avg episode reward: [(0, '2773.781')] [2023-03-07 00:23:51,783][81400] Updated weights for policy 0, policy_version 57090 (0.0007) [2023-03-07 00:23:52,582][81400] Updated weights for policy 0, policy_version 57100 (0.0006) [2023-03-07 00:23:53,354][81400] Updated weights for policy 0, policy_version 57110 (0.0008) [2023-03-07 00:23:54,146][81400] Updated weights for policy 0, policy_version 57120 (0.0006) [2023-03-07 00:23:54,908][81400] Updated weights for policy 0, policy_version 57130 (0.0005) [2023-03-07 00:23:55,694][81400] Updated weights for policy 0, policy_version 57140 (0.0005) [2023-03-07 00:23:56,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13180.1). Total num frames: 58517504. Throughput: 0: 13174.8. Samples: 58503710. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:23:56,247][81074] Avg episode reward: [(0, '2763.749')] [2023-03-07 00:23:56,252][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000057147_58518528.pth... 
[2023-03-07 00:23:56,283][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000054058_55355392.pth [2023-03-07 00:23:56,465][81400] Updated weights for policy 0, policy_version 57150 (0.0006) [2023-03-07 00:23:57,229][81400] Updated weights for policy 0, policy_version 57160 (0.0007) [2023-03-07 00:23:57,998][81400] Updated weights for policy 0, policy_version 57170 (0.0007) [2023-03-07 00:23:58,783][81400] Updated weights for policy 0, policy_version 57180 (0.0006) [2023-03-07 00:23:59,563][81400] Updated weights for policy 0, policy_version 57190 (0.0006) [2023-03-07 00:24:00,340][81400] Updated weights for policy 0, policy_version 57200 (0.0007) [2023-03-07 00:24:01,126][81400] Updated weights for policy 0, policy_version 57210 (0.0006) [2023-03-07 00:24:01,236][81074] Fps is (10 sec: 13107.0, 60 sec: 13175.5, 300 sec: 13180.1). Total num frames: 58584064. Throughput: 0: 13176.8. Samples: 58582739. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:24:01,237][81074] Avg episode reward: [(0, '2592.145')] [2023-03-07 00:24:01,900][81400] Updated weights for policy 0, policy_version 57220 (0.0006) [2023-03-07 00:24:02,688][81400] Updated weights for policy 0, policy_version 57230 (0.0006) [2023-03-07 00:24:03,471][81400] Updated weights for policy 0, policy_version 57240 (0.0006) [2023-03-07 00:24:04,246][81400] Updated weights for policy 0, policy_version 57250 (0.0005) [2023-03-07 00:24:05,039][81400] Updated weights for policy 0, policy_version 57260 (0.0006) [2023-03-07 00:24:05,820][81400] Updated weights for policy 0, policy_version 57270 (0.0007) [2023-03-07 00:24:06,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13180.1). Total num frames: 58649600. Throughput: 0: 13165.8. Samples: 58622098. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:24:06,237][81074] Avg episode reward: [(0, '2549.438')] [2023-03-07 00:24:06,589][81400] Updated weights for policy 0, policy_version 57280 (0.0006) [2023-03-07 00:24:07,363][81400] Updated weights for policy 0, policy_version 57290 (0.0006) [2023-03-07 00:24:08,140][81400] Updated weights for policy 0, policy_version 57300 (0.0007) [2023-03-07 00:24:08,921][81400] Updated weights for policy 0, policy_version 57310 (0.0006) [2023-03-07 00:24:09,680][81400] Updated weights for policy 0, policy_version 57320 (0.0006) [2023-03-07 00:24:10,460][81400] Updated weights for policy 0, policy_version 57330 (0.0005) [2023-03-07 00:24:11,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13175.5, 300 sec: 13176.6). Total num frames: 58715136. Throughput: 0: 13160.2. Samples: 58701201. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:24:11,237][81074] Avg episode reward: [(0, '2686.751')] [2023-03-07 00:24:11,243][81400] Updated weights for policy 0, policy_version 57340 (0.0006) [2023-03-07 00:24:12,018][81400] Updated weights for policy 0, policy_version 57350 (0.0005) [2023-03-07 00:24:12,797][81400] Updated weights for policy 0, policy_version 57360 (0.0007) [2023-03-07 00:24:13,569][81400] Updated weights for policy 0, policy_version 57370 (0.0006) [2023-03-07 00:24:14,345][81400] Updated weights for policy 0, policy_version 57380 (0.0006) [2023-03-07 00:24:15,112][81400] Updated weights for policy 0, policy_version 57390 (0.0006) [2023-03-07 00:24:15,891][81400] Updated weights for policy 0, policy_version 57400 (0.0006) [2023-03-07 00:24:16,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13180.1). Total num frames: 58781696. Throughput: 0: 13164.5. Samples: 58780406. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:24:16,237][81074] Avg episode reward: [(0, '2633.927')] [2023-03-07 00:24:16,663][81400] Updated weights for policy 0, policy_version 57410 (0.0006) [2023-03-07 00:24:17,436][81400] Updated weights for policy 0, policy_version 57420 (0.0006) [2023-03-07 00:24:18,206][81400] Updated weights for policy 0, policy_version 57430 (0.0007) [2023-03-07 00:24:19,019][81400] Updated weights for policy 0, policy_version 57440 (0.0006) [2023-03-07 00:24:19,795][81400] Updated weights for policy 0, policy_version 57450 (0.0006) [2023-03-07 00:24:20,552][81400] Updated weights for policy 0, policy_version 57460 (0.0006) [2023-03-07 00:24:21,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13158.4, 300 sec: 13176.6). Total num frames: 58847232. Throughput: 0: 13167.5. Samples: 58820012. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:24:21,237][81074] Avg episode reward: [(0, '2857.468')] [2023-03-07 00:24:21,331][81400] Updated weights for policy 0, policy_version 57470 (0.0006) [2023-03-07 00:24:22,113][81400] Updated weights for policy 0, policy_version 57480 (0.0006) [2023-03-07 00:24:22,890][81400] Updated weights for policy 0, policy_version 57490 (0.0007) [2023-03-07 00:24:23,658][81400] Updated weights for policy 0, policy_version 57500 (0.0006) [2023-03-07 00:24:24,427][81400] Updated weights for policy 0, policy_version 57510 (0.0006) [2023-03-07 00:24:25,208][81400] Updated weights for policy 0, policy_version 57520 (0.0007) [2023-03-07 00:24:25,994][81400] Updated weights for policy 0, policy_version 57530 (0.0006) [2023-03-07 00:24:26,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13175.5, 300 sec: 13180.1). Total num frames: 58913792. Throughput: 0: 13165.5. Samples: 58899150. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:24:26,237][81074] Avg episode reward: [(0, '2609.791')] [2023-03-07 00:24:26,763][81400] Updated weights for policy 0, policy_version 57540 (0.0006) [2023-03-07 00:24:27,557][81400] Updated weights for policy 0, policy_version 57550 (0.0006) [2023-03-07 00:24:28,315][81400] Updated weights for policy 0, policy_version 57560 (0.0006) [2023-03-07 00:24:29,105][81400] Updated weights for policy 0, policy_version 57570 (0.0006) [2023-03-07 00:24:29,886][81400] Updated weights for policy 0, policy_version 57580 (0.0007) [2023-03-07 00:24:30,658][81400] Updated weights for policy 0, policy_version 57590 (0.0006) [2023-03-07 00:24:31,236][81074] Fps is (10 sec: 13209.8, 60 sec: 13158.4, 300 sec: 13176.6). Total num frames: 58979328. Throughput: 0: 13168.4. Samples: 58978119. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:24:31,237][81074] Avg episode reward: [(0, '2771.683')] [2023-03-07 00:24:31,434][81400] Updated weights for policy 0, policy_version 57600 (0.0005) [2023-03-07 00:24:32,219][81400] Updated weights for policy 0, policy_version 57610 (0.0007) [2023-03-07 00:24:32,977][81400] Updated weights for policy 0, policy_version 57620 (0.0006) [2023-03-07 00:24:33,765][81400] Updated weights for policy 0, policy_version 57630 (0.0005) [2023-03-07 00:24:34,533][81400] Updated weights for policy 0, policy_version 57640 (0.0007) [2023-03-07 00:24:35,329][81400] Updated weights for policy 0, policy_version 57650 (0.0007) [2023-03-07 00:24:36,109][81400] Updated weights for policy 0, policy_version 57660 (0.0006) [2023-03-07 00:24:36,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13173.1). Total num frames: 59044864. Throughput: 0: 13171.5. Samples: 59017678. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:24:36,237][81074] Avg episode reward: [(0, '2588.279')] [2023-03-07 00:24:36,909][81400] Updated weights for policy 0, policy_version 57670 (0.0007) [2023-03-07 00:24:37,668][81400] Updated weights for policy 0, policy_version 57680 (0.0007) [2023-03-07 00:24:38,451][81400] Updated weights for policy 0, policy_version 57690 (0.0006) [2023-03-07 00:24:39,208][81400] Updated weights for policy 0, policy_version 57700 (0.0007) [2023-03-07 00:24:39,989][81400] Updated weights for policy 0, policy_version 57710 (0.0006) [2023-03-07 00:24:40,748][81400] Updated weights for policy 0, policy_version 57720 (0.0006) [2023-03-07 00:24:41,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13176.6). Total num frames: 59111424. Throughput: 0: 13177.9. Samples: 59096715. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:24:41,237][81074] Avg episode reward: [(0, '2810.095')] [2023-03-07 00:24:41,539][81400] Updated weights for policy 0, policy_version 57730 (0.0006) [2023-03-07 00:24:42,303][81400] Updated weights for policy 0, policy_version 57740 (0.0006) [2023-03-07 00:24:43,068][81400] Updated weights for policy 0, policy_version 57750 (0.0006) [2023-03-07 00:24:43,831][81400] Updated weights for policy 0, policy_version 57760 (0.0006) [2023-03-07 00:24:44,622][81400] Updated weights for policy 0, policy_version 57770 (0.0007) [2023-03-07 00:24:45,390][81400] Updated weights for policy 0, policy_version 57780 (0.0006) [2023-03-07 00:24:46,180][81400] Updated weights for policy 0, policy_version 57790 (0.0006) [2023-03-07 00:24:46,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.4, 300 sec: 13176.6). Total num frames: 59176960. Throughput: 0: 13183.8. Samples: 59176012. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:24:46,237][81074] Avg episode reward: [(0, '2670.603')] [2023-03-07 00:24:46,973][81400] Updated weights for policy 0, policy_version 57800 (0.0006) [2023-03-07 00:24:47,733][81400] Updated weights for policy 0, policy_version 57810 (0.0006) [2023-03-07 00:24:48,501][81400] Updated weights for policy 0, policy_version 57820 (0.0006) [2023-03-07 00:24:49,271][81400] Updated weights for policy 0, policy_version 57830 (0.0006) [2023-03-07 00:24:50,029][81400] Updated weights for policy 0, policy_version 57840 (0.0006) [2023-03-07 00:24:50,810][81400] Updated weights for policy 0, policy_version 57850 (0.0006) [2023-03-07 00:24:51,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13175.4, 300 sec: 13180.1). Total num frames: 59243520. Throughput: 0: 13192.0. Samples: 59215738. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:24:51,237][81074] Avg episode reward: [(0, '2723.271')] [2023-03-07 00:24:51,586][81400] Updated weights for policy 0, policy_version 57860 (0.0006) [2023-03-07 00:24:52,372][81400] Updated weights for policy 0, policy_version 57870 (0.0006) [2023-03-07 00:24:53,129][81400] Updated weights for policy 0, policy_version 57880 (0.0005) [2023-03-07 00:24:53,924][81400] Updated weights for policy 0, policy_version 57890 (0.0007) [2023-03-07 00:24:54,683][81400] Updated weights for policy 0, policy_version 57900 (0.0007) [2023-03-07 00:24:55,466][81400] Updated weights for policy 0, policy_version 57910 (0.0006) [2023-03-07 00:24:56,231][81400] Updated weights for policy 0, policy_version 57920 (0.0006) [2023-03-07 00:24:56,236][81074] Fps is (10 sec: 13312.1, 60 sec: 13209.6, 300 sec: 13183.6). Total num frames: 59310080. Throughput: 0: 13198.2. Samples: 59295120. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:24:56,237][81074] Avg episode reward: [(0, '2998.587')] [2023-03-07 00:24:57,006][81400] Updated weights for policy 0, policy_version 57930 (0.0006) [2023-03-07 00:24:57,784][81400] Updated weights for policy 0, policy_version 57940 (0.0006) [2023-03-07 00:24:58,561][81400] Updated weights for policy 0, policy_version 57950 (0.0006) [2023-03-07 00:24:59,354][81400] Updated weights for policy 0, policy_version 57960 (0.0006) [2023-03-07 00:25:00,126][81400] Updated weights for policy 0, policy_version 57970 (0.0006) [2023-03-07 00:25:00,881][81400] Updated weights for policy 0, policy_version 57980 (0.0006) [2023-03-07 00:25:01,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13192.6, 300 sec: 13183.6). Total num frames: 59375616. Throughput: 0: 13200.7. Samples: 59374438. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:25:01,237][81074] Avg episode reward: [(0, '2969.724')] [2023-03-07 00:25:01,652][81400] Updated weights for policy 0, policy_version 57990 (0.0006) [2023-03-07 00:25:02,424][81400] Updated weights for policy 0, policy_version 58000 (0.0006) [2023-03-07 00:25:03,195][81400] Updated weights for policy 0, policy_version 58010 (0.0006) [2023-03-07 00:25:03,982][81400] Updated weights for policy 0, policy_version 58020 (0.0006) [2023-03-07 00:25:04,765][81400] Updated weights for policy 0, policy_version 58030 (0.0006) [2023-03-07 00:25:05,529][81400] Updated weights for policy 0, policy_version 58040 (0.0006) [2023-03-07 00:25:06,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13192.5, 300 sec: 13180.1). Total num frames: 59441152. Throughput: 0: 13203.0. Samples: 59414145. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:25:06,237][81074] Avg episode reward: [(0, '2776.680')] [2023-03-07 00:25:06,323][81400] Updated weights for policy 0, policy_version 58050 (0.0006) [2023-03-07 00:25:07,109][81400] Updated weights for policy 0, policy_version 58060 (0.0007) [2023-03-07 00:25:07,882][81400] Updated weights for policy 0, policy_version 58070 (0.0007) [2023-03-07 00:25:08,670][81400] Updated weights for policy 0, policy_version 58080 (0.0005) [2023-03-07 00:25:09,459][81400] Updated weights for policy 0, policy_version 58090 (0.0006) [2023-03-07 00:25:10,214][81400] Updated weights for policy 0, policy_version 58100 (0.0006) [2023-03-07 00:25:10,998][81400] Updated weights for policy 0, policy_version 58110 (0.0006) [2023-03-07 00:25:11,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13209.6, 300 sec: 13183.6). Total num frames: 59507712. Throughput: 0: 13195.7. Samples: 59492958. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:25:11,237][81074] Avg episode reward: [(0, '2871.430')] [2023-03-07 00:25:11,766][81400] Updated weights for policy 0, policy_version 58120 (0.0006) [2023-03-07 00:25:12,550][81400] Updated weights for policy 0, policy_version 58130 (0.0006) [2023-03-07 00:25:13,337][81400] Updated weights for policy 0, policy_version 58140 (0.0006) [2023-03-07 00:25:14,101][81400] Updated weights for policy 0, policy_version 58150 (0.0006) [2023-03-07 00:25:14,882][81400] Updated weights for policy 0, policy_version 58160 (0.0007) [2023-03-07 00:25:15,661][81400] Updated weights for policy 0, policy_version 58170 (0.0007) [2023-03-07 00:25:16,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13180.1). Total num frames: 59573248. Throughput: 0: 13199.0. Samples: 59572075. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:25:16,237][81074] Avg episode reward: [(0, '3005.087')] [2023-03-07 00:25:16,427][81400] Updated weights for policy 0, policy_version 58180 (0.0006) [2023-03-07 00:25:17,213][81400] Updated weights for policy 0, policy_version 58190 (0.0006) [2023-03-07 00:25:17,984][81400] Updated weights for policy 0, policy_version 58200 (0.0006) [2023-03-07 00:25:18,755][81400] Updated weights for policy 0, policy_version 58210 (0.0006) [2023-03-07 00:25:19,549][81400] Updated weights for policy 0, policy_version 58220 (0.0006) [2023-03-07 00:25:20,311][81400] Updated weights for policy 0, policy_version 58230 (0.0006) [2023-03-07 00:25:21,082][81400] Updated weights for policy 0, policy_version 58240 (0.0005) [2023-03-07 00:25:21,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13192.6, 300 sec: 13180.1). Total num frames: 59638784. Throughput: 0: 13196.6. Samples: 59611523. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:25:21,237][81074] Avg episode reward: [(0, '2870.779')] [2023-03-07 00:25:21,866][81400] Updated weights for policy 0, policy_version 58250 (0.0006) [2023-03-07 00:25:22,637][81400] Updated weights for policy 0, policy_version 58260 (0.0007) [2023-03-07 00:25:23,394][81400] Updated weights for policy 0, policy_version 58270 (0.0006) [2023-03-07 00:25:24,185][81400] Updated weights for policy 0, policy_version 58280 (0.0006) [2023-03-07 00:25:24,954][81400] Updated weights for policy 0, policy_version 58290 (0.0006) [2023-03-07 00:25:25,742][81400] Updated weights for policy 0, policy_version 58300 (0.0007) [2023-03-07 00:25:26,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13192.5, 300 sec: 13180.1). Total num frames: 59705344. Throughput: 0: 13199.5. Samples: 59690694. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:25:26,247][81074] Avg episode reward: [(0, '3003.962')] [2023-03-07 00:25:26,548][81400] Updated weights for policy 0, policy_version 58310 (0.0006) [2023-03-07 00:25:27,309][81400] Updated weights for policy 0, policy_version 58320 (0.0006) [2023-03-07 00:25:28,097][81400] Updated weights for policy 0, policy_version 58330 (0.0006) [2023-03-07 00:25:28,865][81400] Updated weights for policy 0, policy_version 58340 (0.0006) [2023-03-07 00:25:29,640][81400] Updated weights for policy 0, policy_version 58350 (0.0006) [2023-03-07 00:25:30,414][81400] Updated weights for policy 0, policy_version 58360 (0.0006) [2023-03-07 00:25:31,203][81400] Updated weights for policy 0, policy_version 58370 (0.0006) [2023-03-07 00:25:31,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13180.1). Total num frames: 59770880. Throughput: 0: 13188.2. Samples: 59769480. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:25:31,247][81074] Avg episode reward: [(0, '2929.566')] [2023-03-07 00:25:31,982][81400] Updated weights for policy 0, policy_version 58380 (0.0006) [2023-03-07 00:25:32,769][81400] Updated weights for policy 0, policy_version 58390 (0.0006) [2023-03-07 00:25:33,531][81400] Updated weights for policy 0, policy_version 58400 (0.0007) [2023-03-07 00:25:34,293][81400] Updated weights for policy 0, policy_version 58410 (0.0006) [2023-03-07 00:25:35,088][81400] Updated weights for policy 0, policy_version 58420 (0.0006) [2023-03-07 00:25:35,849][81400] Updated weights for policy 0, policy_version 58430 (0.0006) [2023-03-07 00:25:36,236][81074] Fps is (10 sec: 13107.4, 60 sec: 13192.6, 300 sec: 13180.1). Total num frames: 59836416. Throughput: 0: 13188.9. Samples: 59809239. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:25:36,247][81074] Avg episode reward: [(0, '3057.595')] [2023-03-07 00:25:36,629][81400] Updated weights for policy 0, policy_version 58440 (0.0005) [2023-03-07 00:25:37,425][81400] Updated weights for policy 0, policy_version 58450 (0.0006) [2023-03-07 00:25:38,194][81400] Updated weights for policy 0, policy_version 58460 (0.0006) [2023-03-07 00:25:38,951][81400] Updated weights for policy 0, policy_version 58470 (0.0005) [2023-03-07 00:25:39,727][81400] Updated weights for policy 0, policy_version 58480 (0.0006) [2023-03-07 00:25:40,497][81400] Updated weights for policy 0, policy_version 58490 (0.0006) [2023-03-07 00:25:41,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13180.1). Total num frames: 59902976. Throughput: 0: 13186.5. Samples: 59888513. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:25:41,237][81074] Avg episode reward: [(0, '3014.824')] [2023-03-07 00:25:41,281][81400] Updated weights for policy 0, policy_version 58500 (0.0007) [2023-03-07 00:25:42,061][81400] Updated weights for policy 0, policy_version 58510 (0.0006) [2023-03-07 00:25:42,850][81400] Updated weights for policy 0, policy_version 58520 (0.0006) [2023-03-07 00:25:43,629][81400] Updated weights for policy 0, policy_version 58530 (0.0006) [2023-03-07 00:25:44,408][81400] Updated weights for policy 0, policy_version 58540 (0.0006) [2023-03-07 00:25:45,173][81400] Updated weights for policy 0, policy_version 58550 (0.0007) [2023-03-07 00:25:45,953][81400] Updated weights for policy 0, policy_version 58560 (0.0006) [2023-03-07 00:25:46,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.6, 300 sec: 13180.1). Total num frames: 59968512. Throughput: 0: 13177.0. Samples: 59967405. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:25:46,237][81074] Avg episode reward: [(0, '3141.670')] [2023-03-07 00:25:46,742][81400] Updated weights for policy 0, policy_version 58570 (0.0007) [2023-03-07 00:25:47,515][81400] Updated weights for policy 0, policy_version 58580 (0.0006) [2023-03-07 00:25:48,296][81400] Updated weights for policy 0, policy_version 58590 (0.0007) [2023-03-07 00:25:49,049][81400] Updated weights for policy 0, policy_version 58600 (0.0006) [2023-03-07 00:25:49,844][81400] Updated weights for policy 0, policy_version 58610 (0.0006) [2023-03-07 00:25:50,616][81400] Updated weights for policy 0, policy_version 58620 (0.0006) [2023-03-07 00:25:51,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13180.1). Total num frames: 60035072. Throughput: 0: 13173.1. Samples: 60006934. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:25:51,237][81074] Avg episode reward: [(0, '3192.820')] [2023-03-07 00:25:51,390][81400] Updated weights for policy 0, policy_version 58630 (0.0006) [2023-03-07 00:25:52,182][81400] Updated weights for policy 0, policy_version 58640 (0.0006) [2023-03-07 00:25:52,953][81400] Updated weights for policy 0, policy_version 58650 (0.0006) [2023-03-07 00:25:53,754][81400] Updated weights for policy 0, policy_version 58660 (0.0006) [2023-03-07 00:25:54,529][81400] Updated weights for policy 0, policy_version 58670 (0.0006) [2023-03-07 00:25:55,330][81400] Updated weights for policy 0, policy_version 58680 (0.0006) [2023-03-07 00:25:56,100][81400] Updated weights for policy 0, policy_version 58690 (0.0005) [2023-03-07 00:25:56,236][81074] Fps is (10 sec: 13107.0, 60 sec: 13158.4, 300 sec: 13176.6). Total num frames: 60099584. Throughput: 0: 13164.4. Samples: 60085355. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:25:56,237][81074] Avg episode reward: [(0, '2848.961')] [2023-03-07 00:25:56,252][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000058692_60100608.pth... [2023-03-07 00:25:56,336][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000055602_56936448.pth [2023-03-07 00:25:56,871][81400] Updated weights for policy 0, policy_version 58700 (0.0006) [2023-03-07 00:25:57,677][81400] Updated weights for policy 0, policy_version 58710 (0.0006) [2023-03-07 00:25:58,454][81400] Updated weights for policy 0, policy_version 58720 (0.0006) [2023-03-07 00:25:59,244][81400] Updated weights for policy 0, policy_version 58730 (0.0006) [2023-03-07 00:26:00,017][81400] Updated weights for policy 0, policy_version 58740 (0.0006) [2023-03-07 00:26:00,793][81400] Updated weights for policy 0, policy_version 58750 (0.0006) [2023-03-07 00:26:01,236][81074] Fps is (10 sec: 13004.8, 60 sec: 13158.4, 300 sec: 13176.6). Total num frames: 60165120. Throughput: 0: 13155.7. Samples: 60164080. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:26:01,237][81074] Avg episode reward: [(0, '3142.396')] [2023-03-07 00:26:01,582][81400] Updated weights for policy 0, policy_version 58760 (0.0006) [2023-03-07 00:26:02,361][81400] Updated weights for policy 0, policy_version 58770 (0.0006) [2023-03-07 00:26:03,132][81400] Updated weights for policy 0, policy_version 58780 (0.0006) [2023-03-07 00:26:03,914][81400] Updated weights for policy 0, policy_version 58790 (0.0006) [2023-03-07 00:26:04,717][81400] Updated weights for policy 0, policy_version 58800 (0.0006) [2023-03-07 00:26:05,486][81400] Updated weights for policy 0, policy_version 58810 (0.0006) [2023-03-07 00:26:06,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13173.1). Total num frames: 60230656. Throughput: 0: 13151.9. 
Samples: 60203359. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:26:06,237][81074] Avg episode reward: [(0, '2888.948')] [2023-03-07 00:26:06,278][81400] Updated weights for policy 0, policy_version 58820 (0.0006) [2023-03-07 00:26:07,061][81400] Updated weights for policy 0, policy_version 58830 (0.0006) [2023-03-07 00:26:07,845][81400] Updated weights for policy 0, policy_version 58840 (0.0007) [2023-03-07 00:26:08,627][81400] Updated weights for policy 0, policy_version 58850 (0.0006) [2023-03-07 00:26:09,407][81400] Updated weights for policy 0, policy_version 58860 (0.0007) [2023-03-07 00:26:10,196][81400] Updated weights for policy 0, policy_version 58870 (0.0006) [2023-03-07 00:26:10,970][81400] Updated weights for policy 0, policy_version 58880 (0.0005) [2023-03-07 00:26:11,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13173.1). Total num frames: 60296192. Throughput: 0: 13137.3. Samples: 60281873. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:26:11,237][81074] Avg episode reward: [(0, '3236.362')] [2023-03-07 00:26:11,758][81400] Updated weights for policy 0, policy_version 58890 (0.0007) [2023-03-07 00:26:12,537][81400] Updated weights for policy 0, policy_version 58900 (0.0006) [2023-03-07 00:26:13,303][81400] Updated weights for policy 0, policy_version 58910 (0.0006) [2023-03-07 00:26:14,070][81400] Updated weights for policy 0, policy_version 58920 (0.0006) [2023-03-07 00:26:14,864][81400] Updated weights for policy 0, policy_version 58930 (0.0005) [2023-03-07 00:26:15,626][81400] Updated weights for policy 0, policy_version 58940 (0.0006) [2023-03-07 00:26:16,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13173.2). Total num frames: 60361728. Throughput: 0: 13143.1. Samples: 60360918. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:26:16,237][81074] Avg episode reward: [(0, '3009.259')] [2023-03-07 00:26:16,413][81400] Updated weights for policy 0, policy_version 58950 (0.0005) [2023-03-07 00:26:17,177][81400] Updated weights for policy 0, policy_version 58960 (0.0006) [2023-03-07 00:26:17,953][81400] Updated weights for policy 0, policy_version 58970 (0.0006) [2023-03-07 00:26:18,746][81400] Updated weights for policy 0, policy_version 58980 (0.0007) [2023-03-07 00:26:19,521][81400] Updated weights for policy 0, policy_version 58990 (0.0006) [2023-03-07 00:26:20,305][81400] Updated weights for policy 0, policy_version 59000 (0.0006) [2023-03-07 00:26:21,073][81400] Updated weights for policy 0, policy_version 59010 (0.0005) [2023-03-07 00:26:21,236][81074] Fps is (10 sec: 13209.8, 60 sec: 13158.4, 300 sec: 13176.6). Total num frames: 60428288. Throughput: 0: 13134.3. Samples: 60400281. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:26:21,237][81074] Avg episode reward: [(0, '2920.489')] [2023-03-07 00:26:21,839][81400] Updated weights for policy 0, policy_version 59020 (0.0005) [2023-03-07 00:26:22,598][81400] Updated weights for policy 0, policy_version 59030 (0.0006) [2023-03-07 00:26:23,389][81400] Updated weights for policy 0, policy_version 59040 (0.0006) [2023-03-07 00:26:24,151][81400] Updated weights for policy 0, policy_version 59050 (0.0006) [2023-03-07 00:26:24,917][81400] Updated weights for policy 0, policy_version 59060 (0.0006) [2023-03-07 00:26:25,697][81400] Updated weights for policy 0, policy_version 59070 (0.0007) [2023-03-07 00:26:26,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13141.4, 300 sec: 13173.2). Total num frames: 60493824. Throughput: 0: 13139.2. Samples: 60479776. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:26:26,237][81074] Avg episode reward: [(0, '2923.763')] [2023-03-07 00:26:26,471][81400] Updated weights for policy 0, policy_version 59080 (0.0007) [2023-03-07 00:26:27,258][81400] Updated weights for policy 0, policy_version 59090 (0.0006) [2023-03-07 00:26:28,034][81400] Updated weights for policy 0, policy_version 59100 (0.0007) [2023-03-07 00:26:28,809][81400] Updated weights for policy 0, policy_version 59110 (0.0005) [2023-03-07 00:26:29,582][81400] Updated weights for policy 0, policy_version 59120 (0.0006) [2023-03-07 00:26:30,345][81400] Updated weights for policy 0, policy_version 59130 (0.0006) [2023-03-07 00:26:31,147][81400] Updated weights for policy 0, policy_version 59140 (0.0005) [2023-03-07 00:26:31,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13176.6). Total num frames: 60560384. Throughput: 0: 13143.5. Samples: 60558864. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:26:31,237][81074] Avg episode reward: [(0, '2926.886')] [2023-03-07 00:26:31,919][81400] Updated weights for policy 0, policy_version 59150 (0.0006) [2023-03-07 00:26:32,712][81400] Updated weights for policy 0, policy_version 59160 (0.0006) [2023-03-07 00:26:33,478][81400] Updated weights for policy 0, policy_version 59170 (0.0006) [2023-03-07 00:26:34,256][81400] Updated weights for policy 0, policy_version 59180 (0.0006) [2023-03-07 00:26:35,033][81400] Updated weights for policy 0, policy_version 59190 (0.0006) [2023-03-07 00:26:35,814][81400] Updated weights for policy 0, policy_version 59200 (0.0006) [2023-03-07 00:26:36,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13158.4, 300 sec: 13176.6). Total num frames: 60625920. Throughput: 0: 13146.1. Samples: 60598508. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:26:36,247][81074] Avg episode reward: [(0, '3087.497')] [2023-03-07 00:26:36,590][81400] Updated weights for policy 0, policy_version 59210 (0.0006) [2023-03-07 00:26:37,362][81400] Updated weights for policy 0, policy_version 59220 (0.0006) [2023-03-07 00:26:38,124][81400] Updated weights for policy 0, policy_version 59230 (0.0007) [2023-03-07 00:26:38,905][81400] Updated weights for policy 0, policy_version 59240 (0.0007) [2023-03-07 00:26:39,674][81400] Updated weights for policy 0, policy_version 59250 (0.0006) [2023-03-07 00:26:40,454][81400] Updated weights for policy 0, policy_version 59260 (0.0007) [2023-03-07 00:26:41,219][81400] Updated weights for policy 0, policy_version 59270 (0.0006) [2023-03-07 00:26:41,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13180.1). Total num frames: 60692480. Throughput: 0: 13163.7. Samples: 60677720. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:26:41,247][81074] Avg episode reward: [(0, '3297.640')] [2023-03-07 00:26:41,987][81400] Updated weights for policy 0, policy_version 59280 (0.0007) [2023-03-07 00:26:42,777][81400] Updated weights for policy 0, policy_version 59290 (0.0005) [2023-03-07 00:26:43,553][81400] Updated weights for policy 0, policy_version 59300 (0.0006) [2023-03-07 00:26:44,331][81400] Updated weights for policy 0, policy_version 59310 (0.0006) [2023-03-07 00:26:45,100][81400] Updated weights for policy 0, policy_version 59320 (0.0006) [2023-03-07 00:26:45,898][81400] Updated weights for policy 0, policy_version 59330 (0.0006) [2023-03-07 00:26:46,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13176.6). Total num frames: 60758016. Throughput: 0: 13173.8. Samples: 60756900. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:26:46,247][81074] Avg episode reward: [(0, '3196.864')] [2023-03-07 00:26:46,668][81400] Updated weights for policy 0, policy_version 59340 (0.0006) [2023-03-07 00:26:47,431][81400] Updated weights for policy 0, policy_version 59350 (0.0006) [2023-03-07 00:26:48,196][81400] Updated weights for policy 0, policy_version 59360 (0.0006) [2023-03-07 00:26:48,985][81400] Updated weights for policy 0, policy_version 59370 (0.0007) [2023-03-07 00:26:49,759][81400] Updated weights for policy 0, policy_version 59380 (0.0006) [2023-03-07 00:26:50,537][81400] Updated weights for policy 0, policy_version 59390 (0.0007) [2023-03-07 00:26:51,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13176.6). Total num frames: 60823552. Throughput: 0: 13178.5. Samples: 60796392. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:26:51,247][81074] Avg episode reward: [(0, '2968.936')] [2023-03-07 00:26:51,323][81400] Updated weights for policy 0, policy_version 59400 (0.0006) [2023-03-07 00:26:52,099][81400] Updated weights for policy 0, policy_version 59410 (0.0006) [2023-03-07 00:26:52,873][81400] Updated weights for policy 0, policy_version 59420 (0.0006) [2023-03-07 00:26:53,654][81400] Updated weights for policy 0, policy_version 59430 (0.0006) [2023-03-07 00:26:54,414][81400] Updated weights for policy 0, policy_version 59440 (0.0006) [2023-03-07 00:26:55,185][81400] Updated weights for policy 0, policy_version 59450 (0.0007) [2023-03-07 00:26:55,954][81400] Updated weights for policy 0, policy_version 59460 (0.0006) [2023-03-07 00:26:56,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13180.1). Total num frames: 60890112. Throughput: 0: 13196.0. Samples: 60875690. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:26:56,247][81074] Avg episode reward: [(0, '2816.305')] [2023-03-07 00:26:56,731][81400] Updated weights for policy 0, policy_version 59470 (0.0006) [2023-03-07 00:26:57,533][81400] Updated weights for policy 0, policy_version 59480 (0.0007) [2023-03-07 00:26:58,301][81400] Updated weights for policy 0, policy_version 59490 (0.0006) [2023-03-07 00:26:59,093][81400] Updated weights for policy 0, policy_version 59500 (0.0006) [2023-03-07 00:26:59,859][81400] Updated weights for policy 0, policy_version 59510 (0.0006) [2023-03-07 00:27:00,627][81400] Updated weights for policy 0, policy_version 59520 (0.0007) [2023-03-07 00:27:01,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13176.6). Total num frames: 60955648. Throughput: 0: 13194.6. Samples: 60954675. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:27:01,247][81074] Avg episode reward: [(0, '2922.397')] [2023-03-07 00:27:01,419][81400] Updated weights for policy 0, policy_version 59530 (0.0006) [2023-03-07 00:27:02,211][81400] Updated weights for policy 0, policy_version 59540 (0.0007) [2023-03-07 00:27:02,990][81400] Updated weights for policy 0, policy_version 59550 (0.0006) [2023-03-07 00:27:03,765][81400] Updated weights for policy 0, policy_version 59560 (0.0006) [2023-03-07 00:27:04,538][81400] Updated weights for policy 0, policy_version 59570 (0.0007) [2023-03-07 00:27:05,310][81400] Updated weights for policy 0, policy_version 59580 (0.0006) [2023-03-07 00:27:06,071][81400] Updated weights for policy 0, policy_version 59590 (0.0005) [2023-03-07 00:27:06,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13192.5, 300 sec: 13183.6). Total num frames: 61022208. Throughput: 0: 13190.1. Samples: 60993839. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:27:06,247][81074] Avg episode reward: [(0, '3057.578')] [2023-03-07 00:27:06,848][81400] Updated weights for policy 0, policy_version 59600 (0.0006) [2023-03-07 00:27:07,639][81400] Updated weights for policy 0, policy_version 59610 (0.0006) [2023-03-07 00:27:08,428][81400] Updated weights for policy 0, policy_version 59620 (0.0006) [2023-03-07 00:27:09,196][81400] Updated weights for policy 0, policy_version 59630 (0.0006) [2023-03-07 00:27:09,950][81400] Updated weights for policy 0, policy_version 59640 (0.0006) [2023-03-07 00:27:10,741][81400] Updated weights for policy 0, policy_version 59650 (0.0007) [2023-03-07 00:27:11,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13192.5, 300 sec: 13180.1). Total num frames: 61087744. Throughput: 0: 13190.7. Samples: 61073361. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:27:11,247][81074] Avg episode reward: [(0, '3139.397')] [2023-03-07 00:27:11,525][81400] Updated weights for policy 0, policy_version 59660 (0.0007) [2023-03-07 00:27:12,302][81400] Updated weights for policy 0, policy_version 59670 (0.0006) [2023-03-07 00:27:13,081][81400] Updated weights for policy 0, policy_version 59680 (0.0006) [2023-03-07 00:27:13,873][81400] Updated weights for policy 0, policy_version 59690 (0.0006) [2023-03-07 00:27:14,656][81400] Updated weights for policy 0, policy_version 59700 (0.0006) [2023-03-07 00:27:15,458][81400] Updated weights for policy 0, policy_version 59710 (0.0007) [2023-03-07 00:27:16,210][81400] Updated weights for policy 0, policy_version 59720 (0.0007) [2023-03-07 00:27:16,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13192.5, 300 sec: 13176.6). Total num frames: 61153280. Throughput: 0: 13178.4. Samples: 61151891. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:27:16,247][81074] Avg episode reward: [(0, '3101.801')] [2023-03-07 00:27:16,980][81400] Updated weights for policy 0, policy_version 59730 (0.0006) [2023-03-07 00:27:17,757][81400] Updated weights for policy 0, policy_version 59740 (0.0006) [2023-03-07 00:27:18,526][81400] Updated weights for policy 0, policy_version 59750 (0.0006) [2023-03-07 00:27:19,302][81400] Updated weights for policy 0, policy_version 59760 (0.0008) [2023-03-07 00:27:20,077][81400] Updated weights for policy 0, policy_version 59770 (0.0006) [2023-03-07 00:27:20,855][81400] Updated weights for policy 0, policy_version 59780 (0.0008) [2023-03-07 00:27:21,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13192.5, 300 sec: 13180.1). Total num frames: 61219840. Throughput: 0: 13181.0. Samples: 61191650. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:27:21,237][81074] Avg episode reward: [(0, '3070.476')] [2023-03-07 00:27:21,636][81400] Updated weights for policy 0, policy_version 59790 (0.0006) [2023-03-07 00:27:22,390][81400] Updated weights for policy 0, policy_version 59800 (0.0006) [2023-03-07 00:27:23,169][81400] Updated weights for policy 0, policy_version 59810 (0.0006) [2023-03-07 00:27:23,935][81400] Updated weights for policy 0, policy_version 59820 (0.0007) [2023-03-07 00:27:24,707][81400] Updated weights for policy 0, policy_version 59830 (0.0006) [2023-03-07 00:27:25,489][81400] Updated weights for policy 0, policy_version 59840 (0.0006) [2023-03-07 00:27:26,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13176.6). Total num frames: 61285376. Throughput: 0: 13187.4. Samples: 61271153. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:27:26,237][81074] Avg episode reward: [(0, '3203.643')] [2023-03-07 00:27:26,253][81400] Updated weights for policy 0, policy_version 59850 (0.0006) [2023-03-07 00:27:27,033][81400] Updated weights for policy 0, policy_version 59860 (0.0006) [2023-03-07 00:27:27,804][81400] Updated weights for policy 0, policy_version 59870 (0.0006) [2023-03-07 00:27:28,595][81400] Updated weights for policy 0, policy_version 59880 (0.0006) [2023-03-07 00:27:29,357][81400] Updated weights for policy 0, policy_version 59890 (0.0006) [2023-03-07 00:27:30,140][81400] Updated weights for policy 0, policy_version 59900 (0.0006) [2023-03-07 00:27:30,902][81400] Updated weights for policy 0, policy_version 59910 (0.0006) [2023-03-07 00:27:31,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13192.5, 300 sec: 13180.1). Total num frames: 61351936. Throughput: 0: 13187.6. Samples: 61350342. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:27:31,237][81074] Avg episode reward: [(0, '3092.288')] [2023-03-07 00:27:31,682][81400] Updated weights for policy 0, policy_version 59920 (0.0006) [2023-03-07 00:27:32,451][81400] Updated weights for policy 0, policy_version 59930 (0.0006) [2023-03-07 00:27:33,241][81400] Updated weights for policy 0, policy_version 59940 (0.0006) [2023-03-07 00:27:34,013][81400] Updated weights for policy 0, policy_version 59950 (0.0006) [2023-03-07 00:27:34,785][81400] Updated weights for policy 0, policy_version 59960 (0.0006) [2023-03-07 00:27:35,550][81400] Updated weights for policy 0, policy_version 59970 (0.0006) [2023-03-07 00:27:36,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13192.5, 300 sec: 13180.1). Total num frames: 61417472. Throughput: 0: 13189.4. Samples: 61389916. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:27:36,237][81074] Avg episode reward: [(0, '3142.069')] [2023-03-07 00:27:36,333][81400] Updated weights for policy 0, policy_version 59980 (0.0006) [2023-03-07 00:27:37,108][81400] Updated weights for policy 0, policy_version 59990 (0.0007) [2023-03-07 00:27:37,894][81400] Updated weights for policy 0, policy_version 60000 (0.0006) [2023-03-07 00:27:38,656][81400] Updated weights for policy 0, policy_version 60010 (0.0006) [2023-03-07 00:27:39,448][81400] Updated weights for policy 0, policy_version 60020 (0.0006) [2023-03-07 00:27:40,221][81400] Updated weights for policy 0, policy_version 60030 (0.0006) [2023-03-07 00:27:40,997][81400] Updated weights for policy 0, policy_version 60040 (0.0007) [2023-03-07 00:27:41,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13183.6). Total num frames: 61484032. Throughput: 0: 13190.2. Samples: 61469249. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:27:41,237][81074] Avg episode reward: [(0, '3178.530')] [2023-03-07 00:27:41,785][81400] Updated weights for policy 0, policy_version 60050 (0.0007) [2023-03-07 00:27:42,569][81400] Updated weights for policy 0, policy_version 60060 (0.0006) [2023-03-07 00:27:43,326][81400] Updated weights for policy 0, policy_version 60070 (0.0006) [2023-03-07 00:27:44,138][81400] Updated weights for policy 0, policy_version 60080 (0.0007) [2023-03-07 00:27:44,901][81400] Updated weights for policy 0, policy_version 60090 (0.0006) [2023-03-07 00:27:45,672][81400] Updated weights for policy 0, policy_version 60100 (0.0006) [2023-03-07 00:27:46,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13192.6, 300 sec: 13180.1). Total num frames: 61549568. Throughput: 0: 13185.8. Samples: 61548037. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 00:27:46,237][81074] Avg episode reward: [(0, '3213.710')] [2023-03-07 00:27:46,449][81400] Updated weights for policy 0, policy_version 60110 (0.0006) [2023-03-07 00:27:47,253][81400] Updated weights for policy 0, policy_version 60120 (0.0007) [2023-03-07 00:27:48,021][81400] Updated weights for policy 0, policy_version 60130 (0.0006) [2023-03-07 00:27:48,809][81400] Updated weights for policy 0, policy_version 60140 (0.0006) [2023-03-07 00:27:49,594][81400] Updated weights for policy 0, policy_version 60150 (0.0006) [2023-03-07 00:27:50,357][81400] Updated weights for policy 0, policy_version 60160 (0.0006) [2023-03-07 00:27:51,129][81400] Updated weights for policy 0, policy_version 60170 (0.0006) [2023-03-07 00:27:51,236][81074] Fps is (10 sec: 13107.4, 60 sec: 13192.5, 300 sec: 13176.6). Total num frames: 61615104. Throughput: 0: 13187.3. Samples: 61587267. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 00:27:51,237][81074] Avg episode reward: [(0, '3086.330')] [2023-03-07 00:27:51,910][81400] Updated weights for policy 0, policy_version 60180 (0.0006) [2023-03-07 00:27:52,673][81400] Updated weights for policy 0, policy_version 60190 (0.0006) [2023-03-07 00:27:53,441][81400] Updated weights for policy 0, policy_version 60200 (0.0006) [2023-03-07 00:27:54,212][81400] Updated weights for policy 0, policy_version 60210 (0.0006) [2023-03-07 00:27:55,006][81400] Updated weights for policy 0, policy_version 60220 (0.0006) [2023-03-07 00:27:55,769][81400] Updated weights for policy 0, policy_version 60230 (0.0006) [2023-03-07 00:27:56,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13175.5, 300 sec: 13176.6). Total num frames: 61680640. Throughput: 0: 13184.6. Samples: 61666669. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 00:27:56,237][81074] Avg episode reward: [(0, '3183.899')] [2023-03-07 00:27:56,241][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000060236_61681664.pth... [2023-03-07 00:27:56,271][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000057147_58518528.pth [2023-03-07 00:27:56,552][81400] Updated weights for policy 0, policy_version 60240 (0.0007) [2023-03-07 00:27:57,329][81400] Updated weights for policy 0, policy_version 60250 (0.0006) [2023-03-07 00:27:58,105][81400] Updated weights for policy 0, policy_version 60260 (0.0006) [2023-03-07 00:27:58,874][81400] Updated weights for policy 0, policy_version 60270 (0.0006) [2023-03-07 00:27:59,647][81400] Updated weights for policy 0, policy_version 60280 (0.0006) [2023-03-07 00:28:00,418][81400] Updated weights for policy 0, policy_version 60290 (0.0006) [2023-03-07 00:28:01,197][81400] Updated weights for policy 0, policy_version 60300 (0.0006) [2023-03-07 00:28:01,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13192.5, 300 sec: 13180.1). Total num frames: 61747200. Throughput: 0: 13198.6. Samples: 61745827. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 00:28:01,237][81074] Avg episode reward: [(0, '3099.410')] [2023-03-07 00:28:01,981][81400] Updated weights for policy 0, policy_version 60310 (0.0005) [2023-03-07 00:28:02,757][81400] Updated weights for policy 0, policy_version 60320 (0.0006) [2023-03-07 00:28:03,535][81400] Updated weights for policy 0, policy_version 60330 (0.0005) [2023-03-07 00:28:04,311][81400] Updated weights for policy 0, policy_version 60340 (0.0006) [2023-03-07 00:28:05,086][81400] Updated weights for policy 0, policy_version 60350 (0.0005) [2023-03-07 00:28:05,858][81400] Updated weights for policy 0, policy_version 60360 (0.0006) [2023-03-07 00:28:06,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13180.1). Total num frames: 61812736. Throughput: 0: 13194.2. Samples: 61785391. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 00:28:06,237][81074] Avg episode reward: [(0, '3258.374')] [2023-03-07 00:28:06,646][81400] Updated weights for policy 0, policy_version 60370 (0.0006) [2023-03-07 00:28:07,423][81400] Updated weights for policy 0, policy_version 60380 (0.0006) [2023-03-07 00:28:08,205][81400] Updated weights for policy 0, policy_version 60390 (0.0006) [2023-03-07 00:28:08,981][81400] Updated weights for policy 0, policy_version 60400 (0.0006) [2023-03-07 00:28:09,757][81400] Updated weights for policy 0, policy_version 60410 (0.0006) [2023-03-07 00:28:10,533][81400] Updated weights for policy 0, policy_version 60420 (0.0006) [2023-03-07 00:28:11,236][81074] Fps is (10 sec: 13107.0, 60 sec: 13175.4, 300 sec: 13176.6). Total num frames: 61878272. Throughput: 0: 13184.6. Samples: 61864461. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 00:28:11,237][81074] Avg episode reward: [(0, '3265.655')] [2023-03-07 00:28:11,306][81400] Updated weights for policy 0, policy_version 60430 (0.0005) [2023-03-07 00:28:12,078][81400] Updated weights for policy 0, policy_version 60440 (0.0006) [2023-03-07 00:28:12,880][81400] Updated weights for policy 0, policy_version 60450 (0.0007) [2023-03-07 00:28:13,659][81400] Updated weights for policy 0, policy_version 60460 (0.0006) [2023-03-07 00:28:14,437][81400] Updated weights for policy 0, policy_version 60470 (0.0007) [2023-03-07 00:28:15,240][81400] Updated weights for policy 0, policy_version 60480 (0.0007) [2023-03-07 00:28:16,018][81400] Updated weights for policy 0, policy_version 60490 (0.0006) [2023-03-07 00:28:16,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13176.6). Total num frames: 61944832. Throughput: 0: 13173.8. Samples: 61943162. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 00:28:16,237][81074] Avg episode reward: [(0, '3004.479')] [2023-03-07 00:28:16,796][81400] Updated weights for policy 0, policy_version 60500 (0.0006) [2023-03-07 00:28:17,595][81400] Updated weights for policy 0, policy_version 60510 (0.0006) [2023-03-07 00:28:18,357][81400] Updated weights for policy 0, policy_version 60520 (0.0006) [2023-03-07 00:28:19,134][81400] Updated weights for policy 0, policy_version 60530 (0.0006) [2023-03-07 00:28:19,911][81400] Updated weights for policy 0, policy_version 60540 (0.0006) [2023-03-07 00:28:20,669][81400] Updated weights for policy 0, policy_version 60550 (0.0006) [2023-03-07 00:28:21,236][81074] Fps is (10 sec: 13209.9, 60 sec: 13175.5, 300 sec: 13176.6). Total num frames: 62010368. Throughput: 0: 13167.1. Samples: 61982434. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 00:28:21,237][81074] Avg episode reward: [(0, '3130.794')] [2023-03-07 00:28:21,450][81400] Updated weights for policy 0, policy_version 60560 (0.0008) [2023-03-07 00:28:22,238][81400] Updated weights for policy 0, policy_version 60570 (0.0006) [2023-03-07 00:28:22,999][81400] Updated weights for policy 0, policy_version 60580 (0.0006) [2023-03-07 00:28:23,775][81400] Updated weights for policy 0, policy_version 60590 (0.0005) [2023-03-07 00:28:24,569][81400] Updated weights for policy 0, policy_version 60600 (0.0007) [2023-03-07 00:28:25,344][81400] Updated weights for policy 0, policy_version 60610 (0.0006) [2023-03-07 00:28:26,101][81400] Updated weights for policy 0, policy_version 60620 (0.0006) [2023-03-07 00:28:26,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13175.4, 300 sec: 13173.1). Total num frames: 62075904. Throughput: 0: 13163.9. Samples: 62061625. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 00:28:26,237][81074] Avg episode reward: [(0, '3025.978')] [2023-03-07 00:28:26,892][81400] Updated weights for policy 0, policy_version 60630 (0.0007) [2023-03-07 00:28:27,671][81400] Updated weights for policy 0, policy_version 60640 (0.0006) [2023-03-07 00:28:28,441][81400] Updated weights for policy 0, policy_version 60650 (0.0007) [2023-03-07 00:28:29,232][81400] Updated weights for policy 0, policy_version 60660 (0.0006) [2023-03-07 00:28:30,005][81400] Updated weights for policy 0, policy_version 60670 (0.0006) [2023-03-07 00:28:30,797][81400] Updated weights for policy 0, policy_version 60680 (0.0006) [2023-03-07 00:28:31,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13173.2). Total num frames: 62141440. Throughput: 0: 13165.8. Samples: 62140500. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 00:28:31,237][81074] Avg episode reward: [(0, '2875.873')] [2023-03-07 00:28:31,577][81400] Updated weights for policy 0, policy_version 60690 (0.0006) [2023-03-07 00:28:32,352][81400] Updated weights for policy 0, policy_version 60700 (0.0007) [2023-03-07 00:28:33,130][81400] Updated weights for policy 0, policy_version 60710 (0.0006) [2023-03-07 00:28:33,909][81400] Updated weights for policy 0, policy_version 60720 (0.0006) [2023-03-07 00:28:34,694][81400] Updated weights for policy 0, policy_version 60730 (0.0006) [2023-03-07 00:28:35,444][81400] Updated weights for policy 0, policy_version 60740 (0.0006) [2023-03-07 00:28:36,225][81400] Updated weights for policy 0, policy_version 60750 (0.0006) [2023-03-07 00:28:36,236][81074] Fps is (10 sec: 13209.8, 60 sec: 13175.5, 300 sec: 13176.6). Total num frames: 62208000. Throughput: 0: 13169.7. Samples: 62179902. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 00:28:36,237][81074] Avg episode reward: [(0, '2827.132')] [2023-03-07 00:28:37,006][81400] Updated weights for policy 0, policy_version 60760 (0.0005) [2023-03-07 00:28:37,794][81400] Updated weights for policy 0, policy_version 60770 (0.0006) [2023-03-07 00:28:38,562][81400] Updated weights for policy 0, policy_version 60780 (0.0006) [2023-03-07 00:28:39,345][81400] Updated weights for policy 0, policy_version 60790 (0.0006) [2023-03-07 00:28:40,132][81400] Updated weights for policy 0, policy_version 60800 (0.0006) [2023-03-07 00:28:40,919][81400] Updated weights for policy 0, policy_version 60810 (0.0006) [2023-03-07 00:28:41,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13173.1). Total num frames: 62272512. Throughput: 0: 13159.0. Samples: 62258825. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 00:28:41,247][81074] Avg episode reward: [(0, '2800.440')] [2023-03-07 00:28:41,686][81400] Updated weights for policy 0, policy_version 60820 (0.0006) [2023-03-07 00:28:42,474][81400] Updated weights for policy 0, policy_version 60830 (0.0006) [2023-03-07 00:28:43,256][81400] Updated weights for policy 0, policy_version 60840 (0.0006) [2023-03-07 00:28:44,023][81400] Updated weights for policy 0, policy_version 60850 (0.0007) [2023-03-07 00:28:44,803][81400] Updated weights for policy 0, policy_version 60860 (0.0006) [2023-03-07 00:28:45,581][81400] Updated weights for policy 0, policy_version 60870 (0.0006) [2023-03-07 00:28:46,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13173.1). Total num frames: 62339072. Throughput: 0: 13152.5. Samples: 62337687. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 00:28:46,247][81074] Avg episode reward: [(0, '2971.554')] [2023-03-07 00:28:46,365][81400] Updated weights for policy 0, policy_version 60880 (0.0006) [2023-03-07 00:28:47,152][81400] Updated weights for policy 0, policy_version 60890 (0.0006) [2023-03-07 00:28:47,924][81400] Updated weights for policy 0, policy_version 60900 (0.0007) [2023-03-07 00:28:48,690][81400] Updated weights for policy 0, policy_version 60910 (0.0006) [2023-03-07 00:28:49,469][81400] Updated weights for policy 0, policy_version 60920 (0.0007) [2023-03-07 00:28:50,255][81400] Updated weights for policy 0, policy_version 60930 (0.0006) [2023-03-07 00:28:51,052][81400] Updated weights for policy 0, policy_version 60940 (0.0006) [2023-03-07 00:28:51,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13176.6). Total num frames: 62404608. Throughput: 0: 13153.0. Samples: 62377276. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 00:28:51,247][81074] Avg episode reward: [(0, '3093.262')] [2023-03-07 00:28:51,820][81400] Updated weights for policy 0, policy_version 60950 (0.0007) [2023-03-07 00:28:52,600][81400] Updated weights for policy 0, policy_version 60960 (0.0006) [2023-03-07 00:28:53,368][81400] Updated weights for policy 0, policy_version 60970 (0.0006) [2023-03-07 00:28:54,148][81400] Updated weights for policy 0, policy_version 60980 (0.0006) [2023-03-07 00:28:54,932][81400] Updated weights for policy 0, policy_version 60990 (0.0006) [2023-03-07 00:28:55,695][81400] Updated weights for policy 0, policy_version 61000 (0.0006) [2023-03-07 00:28:56,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13176.6). Total num frames: 62471168. Throughput: 0: 13148.9. Samples: 62456158. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 00:28:56,247][81074] Avg episode reward: [(0, '2964.104')] [2023-03-07 00:28:56,482][81400] Updated weights for policy 0, policy_version 61010 (0.0006) [2023-03-07 00:28:57,265][81400] Updated weights for policy 0, policy_version 61020 (0.0006) [2023-03-07 00:28:58,049][81400] Updated weights for policy 0, policy_version 61030 (0.0006) [2023-03-07 00:28:58,844][81400] Updated weights for policy 0, policy_version 61040 (0.0006) [2023-03-07 00:28:59,591][81400] Updated weights for policy 0, policy_version 61050 (0.0006) [2023-03-07 00:29:00,394][81400] Updated weights for policy 0, policy_version 61060 (0.0007) [2023-03-07 00:29:01,162][81400] Updated weights for policy 0, policy_version 61070 (0.0006) [2023-03-07 00:29:01,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13176.6). Total num frames: 62536704. Throughput: 0: 13150.6. Samples: 62534941. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) [2023-03-07 00:29:01,247][81074] Avg episode reward: [(0, '3016.950')] [2023-03-07 00:29:01,947][81400] Updated weights for policy 0, policy_version 61080 (0.0007) [2023-03-07 00:29:02,715][81400] Updated weights for policy 0, policy_version 61090 (0.0006) [2023-03-07 00:29:03,501][81400] Updated weights for policy 0, policy_version 61100 (0.0006) [2023-03-07 00:29:04,276][81400] Updated weights for policy 0, policy_version 61110 (0.0007) [2023-03-07 00:29:05,043][81400] Updated weights for policy 0, policy_version 61120 (0.0006) [2023-03-07 00:29:05,815][81400] Updated weights for policy 0, policy_version 61130 (0.0006) [2023-03-07 00:29:06,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13176.6). Total num frames: 62602240. Throughput: 0: 13159.1. Samples: 62574593. Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) [2023-03-07 00:29:06,247][81074] Avg episode reward: [(0, '2835.819')] [2023-03-07 00:29:06,598][81400] Updated weights for policy 0, policy_version 61140 (0.0006) [2023-03-07 00:29:07,370][81400] Updated weights for policy 0, policy_version 61150 (0.0006) [2023-03-07 00:29:08,152][81400] Updated weights for policy 0, policy_version 61160 (0.0006) [2023-03-07 00:29:08,937][81400] Updated weights for policy 0, policy_version 61170 (0.0007) [2023-03-07 00:29:09,701][81400] Updated weights for policy 0, policy_version 61180 (0.0007) [2023-03-07 00:29:10,490][81400] Updated weights for policy 0, policy_version 61190 (0.0006) [2023-03-07 00:29:11,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13173.2). Total num frames: 62667776. Throughput: 0: 13154.1. Samples: 62653559. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) [2023-03-07 00:29:11,237][81074] Avg episode reward: [(0, '2864.525')] [2023-03-07 00:29:11,282][81400] Updated weights for policy 0, policy_version 61200 (0.0006) [2023-03-07 00:29:12,047][81400] Updated weights for policy 0, policy_version 61210 (0.0007) [2023-03-07 00:29:12,830][81400] Updated weights for policy 0, policy_version 61220 (0.0007) [2023-03-07 00:29:13,614][81400] Updated weights for policy 0, policy_version 61230 (0.0005) [2023-03-07 00:29:14,404][81400] Updated weights for policy 0, policy_version 61240 (0.0006) [2023-03-07 00:29:15,169][81400] Updated weights for policy 0, policy_version 61250 (0.0006) [2023-03-07 00:29:15,976][81400] Updated weights for policy 0, policy_version 61260 (0.0006) [2023-03-07 00:29:16,236][81074] Fps is (10 sec: 13107.4, 60 sec: 13141.3, 300 sec: 13173.2). Total num frames: 62733312. Throughput: 0: 13149.0. Samples: 62732204. Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) [2023-03-07 00:29:16,237][81074] Avg episode reward: [(0, '3124.434')] [2023-03-07 00:29:16,733][81400] Updated weights for policy 0, policy_version 61270 (0.0007) [2023-03-07 00:29:17,513][81400] Updated weights for policy 0, policy_version 61280 (0.0006) [2023-03-07 00:29:18,304][81400] Updated weights for policy 0, policy_version 61290 (0.0008) [2023-03-07 00:29:19,085][81400] Updated weights for policy 0, policy_version 61300 (0.0006) [2023-03-07 00:29:19,863][81400] Updated weights for policy 0, policy_version 61310 (0.0007) [2023-03-07 00:29:20,658][81400] Updated weights for policy 0, policy_version 61320 (0.0006) [2023-03-07 00:29:21,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13169.7). Total num frames: 62798848. Throughput: 0: 13146.2. Samples: 62771480. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) [2023-03-07 00:29:21,237][81074] Avg episode reward: [(0, '2947.086')] [2023-03-07 00:29:21,417][81400] Updated weights for policy 0, policy_version 61330 (0.0006) [2023-03-07 00:29:22,189][81400] Updated weights for policy 0, policy_version 61340 (0.0006) [2023-03-07 00:29:22,959][81400] Updated weights for policy 0, policy_version 61350 (0.0006) [2023-03-07 00:29:23,745][81400] Updated weights for policy 0, policy_version 61360 (0.0006) [2023-03-07 00:29:24,520][81400] Updated weights for policy 0, policy_version 61370 (0.0006) [2023-03-07 00:29:25,304][81400] Updated weights for policy 0, policy_version 61380 (0.0006) [2023-03-07 00:29:26,080][81400] Updated weights for policy 0, policy_version 61390 (0.0006) [2023-03-07 00:29:26,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13173.2). Total num frames: 62865408. Throughput: 0: 13150.1. Samples: 62850580. Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) [2023-03-07 00:29:26,237][81074] Avg episode reward: [(0, '2904.128')] [2023-03-07 00:29:26,845][81400] Updated weights for policy 0, policy_version 61400 (0.0006) [2023-03-07 00:29:27,645][81400] Updated weights for policy 0, policy_version 61410 (0.0006) [2023-03-07 00:29:28,415][81400] Updated weights for policy 0, policy_version 61420 (0.0006) [2023-03-07 00:29:29,185][81400] Updated weights for policy 0, policy_version 61430 (0.0006) [2023-03-07 00:29:29,966][81400] Updated weights for policy 0, policy_version 61440 (0.0006) [2023-03-07 00:29:30,745][81400] Updated weights for policy 0, policy_version 61450 (0.0007) [2023-03-07 00:29:31,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13173.2). Total num frames: 62930944. Throughput: 0: 13153.3. Samples: 62929584. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) [2023-03-07 00:29:31,237][81074] Avg episode reward: [(0, '2816.247')] [2023-03-07 00:29:31,514][81400] Updated weights for policy 0, policy_version 61460 (0.0006) [2023-03-07 00:29:32,285][81400] Updated weights for policy 0, policy_version 61470 (0.0006) [2023-03-07 00:29:33,080][81400] Updated weights for policy 0, policy_version 61480 (0.0006) [2023-03-07 00:29:33,839][81400] Updated weights for policy 0, policy_version 61490 (0.0006) [2023-03-07 00:29:34,617][81400] Updated weights for policy 0, policy_version 61500 (0.0006) [2023-03-07 00:29:35,394][81400] Updated weights for policy 0, policy_version 61510 (0.0006) [2023-03-07 00:29:36,183][81400] Updated weights for policy 0, policy_version 61520 (0.0006) [2023-03-07 00:29:36,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13169.7). Total num frames: 62996480. Throughput: 0: 13152.8. Samples: 62969152. Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) [2023-03-07 00:29:36,237][81074] Avg episode reward: [(0, '2862.324')] [2023-03-07 00:29:36,978][81400] Updated weights for policy 0, policy_version 61530 (0.0007) [2023-03-07 00:29:37,742][81400] Updated weights for policy 0, policy_version 61540 (0.0006) [2023-03-07 00:29:38,520][81400] Updated weights for policy 0, policy_version 61550 (0.0006) [2023-03-07 00:29:39,290][81400] Updated weights for policy 0, policy_version 61560 (0.0006) [2023-03-07 00:29:40,064][81400] Updated weights for policy 0, policy_version 61570 (0.0006) [2023-03-07 00:29:40,847][81400] Updated weights for policy 0, policy_version 61580 (0.0006) [2023-03-07 00:29:41,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13173.2). Total num frames: 63063040. Throughput: 0: 13153.8. Samples: 63048076. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:29:41,237][81074] Avg episode reward: [(0, '2942.276')] [2023-03-07 00:29:41,625][81400] Updated weights for policy 0, policy_version 61590 (0.0006) [2023-03-07 00:29:42,399][81400] Updated weights for policy 0, policy_version 61600 (0.0006) [2023-03-07 00:29:43,192][81400] Updated weights for policy 0, policy_version 61610 (0.0007) [2023-03-07 00:29:43,972][81400] Updated weights for policy 0, policy_version 61620 (0.0006) [2023-03-07 00:29:44,738][81400] Updated weights for policy 0, policy_version 61630 (0.0005) [2023-03-07 00:29:45,524][81400] Updated weights for policy 0, policy_version 61640 (0.0007) [2023-03-07 00:29:46,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13169.7). Total num frames: 63128576. Throughput: 0: 13159.5. Samples: 63127119. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:29:46,237][81074] Avg episode reward: [(0, '2881.617')] [2023-03-07 00:29:46,296][81400] Updated weights for policy 0, policy_version 61650 (0.0006) [2023-03-07 00:29:47,078][81400] Updated weights for policy 0, policy_version 61660 (0.0006) [2023-03-07 00:29:47,835][81400] Updated weights for policy 0, policy_version 61670 (0.0007) [2023-03-07 00:29:48,623][81400] Updated weights for policy 0, policy_version 61680 (0.0006) [2023-03-07 00:29:49,393][81400] Updated weights for policy 0, policy_version 61690 (0.0006) [2023-03-07 00:29:50,158][81400] Updated weights for policy 0, policy_version 61700 (0.0006) [2023-03-07 00:29:50,939][81400] Updated weights for policy 0, policy_version 61710 (0.0006) [2023-03-07 00:29:51,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13166.2). Total num frames: 63194112. Throughput: 0: 13158.5. Samples: 63166724. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:29:51,237][81074] Avg episode reward: [(0, '2869.009')] [2023-03-07 00:29:51,718][81400] Updated weights for policy 0, policy_version 61720 (0.0006) [2023-03-07 00:29:52,474][81400] Updated weights for policy 0, policy_version 61730 (0.0006) [2023-03-07 00:29:53,272][81400] Updated weights for policy 0, policy_version 61740 (0.0006) [2023-03-07 00:29:54,038][81400] Updated weights for policy 0, policy_version 61750 (0.0006) [2023-03-07 00:29:54,816][81400] Updated weights for policy 0, policy_version 61760 (0.0006) [2023-03-07 00:29:55,590][81400] Updated weights for policy 0, policy_version 61770 (0.0006) [2023-03-07 00:29:56,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13169.7). Total num frames: 63260672. Throughput: 0: 13167.5. Samples: 63246096. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:29:56,237][81074] Avg episode reward: [(0, '2668.075')] [2023-03-07 00:29:56,241][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000061778_63260672.pth... 
[2023-03-07 00:29:56,274][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000058692_60100608.pth [2023-03-07 00:29:56,371][81400] Updated weights for policy 0, policy_version 61780 (0.0006) [2023-03-07 00:29:57,154][81400] Updated weights for policy 0, policy_version 61790 (0.0006) [2023-03-07 00:29:57,928][81400] Updated weights for policy 0, policy_version 61800 (0.0006) [2023-03-07 00:29:58,691][81400] Updated weights for policy 0, policy_version 61810 (0.0006) [2023-03-07 00:29:59,477][81400] Updated weights for policy 0, policy_version 61820 (0.0006) [2023-03-07 00:30:00,246][81400] Updated weights for policy 0, policy_version 61830 (0.0006) [2023-03-07 00:30:01,042][81400] Updated weights for policy 0, policy_version 61840 (0.0006) [2023-03-07 00:30:01,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13169.7). Total num frames: 63326208. Throughput: 0: 13175.3. Samples: 63325092. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:30:01,237][81074] Avg episode reward: [(0, '2672.761')] [2023-03-07 00:30:01,799][81400] Updated weights for policy 0, policy_version 61850 (0.0006) [2023-03-07 00:30:02,572][81400] Updated weights for policy 0, policy_version 61860 (0.0007) [2023-03-07 00:30:03,369][81400] Updated weights for policy 0, policy_version 61870 (0.0006) [2023-03-07 00:30:04,126][81400] Updated weights for policy 0, policy_version 61880 (0.0006) [2023-03-07 00:30:04,916][81400] Updated weights for policy 0, policy_version 61890 (0.0006) [2023-03-07 00:30:05,695][81400] Updated weights for policy 0, policy_version 61900 (0.0007) [2023-03-07 00:30:06,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13166.2). Total num frames: 63391744. Throughput: 0: 13181.2. Samples: 63364632. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:30:06,237][81074] Avg episode reward: [(0, '2971.362')] [2023-03-07 00:30:06,470][81400] Updated weights for policy 0, policy_version 61910 (0.0006) [2023-03-07 00:30:07,265][81400] Updated weights for policy 0, policy_version 61920 (0.0007) [2023-03-07 00:30:08,028][81400] Updated weights for policy 0, policy_version 61930 (0.0006) [2023-03-07 00:30:08,810][81400] Updated weights for policy 0, policy_version 61940 (0.0006) [2023-03-07 00:30:09,593][81400] Updated weights for policy 0, policy_version 61950 (0.0006) [2023-03-07 00:30:10,346][81400] Updated weights for policy 0, policy_version 61960 (0.0007) [2023-03-07 00:30:11,125][81400] Updated weights for policy 0, policy_version 61970 (0.0006) [2023-03-07 00:30:11,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13169.7). Total num frames: 63458304. Throughput: 0: 13177.2. Samples: 63443553. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:30:11,237][81074] Avg episode reward: [(0, '3110.374')] [2023-03-07 00:30:11,909][81400] Updated weights for policy 0, policy_version 61980 (0.0006) [2023-03-07 00:30:12,683][81400] Updated weights for policy 0, policy_version 61990 (0.0005) [2023-03-07 00:30:13,457][81400] Updated weights for policy 0, policy_version 62000 (0.0006) [2023-03-07 00:30:14,219][81400] Updated weights for policy 0, policy_version 62010 (0.0007) [2023-03-07 00:30:15,002][81400] Updated weights for policy 0, policy_version 62020 (0.0005) [2023-03-07 00:30:15,773][81400] Updated weights for policy 0, policy_version 62030 (0.0006) [2023-03-07 00:30:16,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13169.7). Total num frames: 63523840. Throughput: 0: 13189.3. Samples: 63523102. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:30:16,247][81074] Avg episode reward: [(0, '3031.314')] [2023-03-07 00:30:16,565][81400] Updated weights for policy 0, policy_version 62040 (0.0008) [2023-03-07 00:30:17,349][81400] Updated weights for policy 0, policy_version 62050 (0.0006) [2023-03-07 00:30:18,116][81400] Updated weights for policy 0, policy_version 62060 (0.0006) [2023-03-07 00:30:18,891][81400] Updated weights for policy 0, policy_version 62070 (0.0006) [2023-03-07 00:30:19,686][81400] Updated weights for policy 0, policy_version 62080 (0.0006) [2023-03-07 00:30:20,437][81400] Updated weights for policy 0, policy_version 62090 (0.0006) [2023-03-07 00:30:21,212][81400] Updated weights for policy 0, policy_version 62100 (0.0006) [2023-03-07 00:30:21,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13192.5, 300 sec: 13169.7). Total num frames: 63590400. Throughput: 0: 13184.1. Samples: 63562436. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:30:21,247][81074] Avg episode reward: [(0, '2917.241')] [2023-03-07 00:30:22,012][81400] Updated weights for policy 0, policy_version 62110 (0.0006) [2023-03-07 00:30:22,778][81400] Updated weights for policy 0, policy_version 62120 (0.0006) [2023-03-07 00:30:23,556][81400] Updated weights for policy 0, policy_version 62130 (0.0006) [2023-03-07 00:30:24,327][81400] Updated weights for policy 0, policy_version 62140 (0.0006) [2023-03-07 00:30:25,114][81400] Updated weights for policy 0, policy_version 62150 (0.0007) [2023-03-07 00:30:25,880][81400] Updated weights for policy 0, policy_version 62160 (0.0006) [2023-03-07 00:30:26,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13175.4, 300 sec: 13169.7). Total num frames: 63655936. Throughput: 0: 13190.2. Samples: 63641636. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:30:26,248][81074] Avg episode reward: [(0, '3095.684')] [2023-03-07 00:30:26,661][81400] Updated weights for policy 0, policy_version 62170 (0.0006) [2023-03-07 00:30:27,430][81400] Updated weights for policy 0, policy_version 62180 (0.0007) [2023-03-07 00:30:28,212][81400] Updated weights for policy 0, policy_version 62190 (0.0006) [2023-03-07 00:30:28,987][81400] Updated weights for policy 0, policy_version 62200 (0.0006) [2023-03-07 00:30:29,770][81400] Updated weights for policy 0, policy_version 62210 (0.0005) [2023-03-07 00:30:30,570][81400] Updated weights for policy 0, policy_version 62220 (0.0006) [2023-03-07 00:30:31,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13175.4, 300 sec: 13169.7). Total num frames: 63721472. Throughput: 0: 13183.3. Samples: 63720366. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:30:31,247][81074] Avg episode reward: [(0, '3208.261')] [2023-03-07 00:30:31,343][81400] Updated weights for policy 0, policy_version 62230 (0.0006) [2023-03-07 00:30:32,129][81400] Updated weights for policy 0, policy_version 62240 (0.0006) [2023-03-07 00:30:32,913][81400] Updated weights for policy 0, policy_version 62250 (0.0007) [2023-03-07 00:30:33,698][81400] Updated weights for policy 0, policy_version 62260 (0.0006) [2023-03-07 00:30:34,466][81400] Updated weights for policy 0, policy_version 62270 (0.0006) [2023-03-07 00:30:35,229][81400] Updated weights for policy 0, policy_version 62280 (0.0006) [2023-03-07 00:30:36,022][81400] Updated weights for policy 0, policy_version 62290 (0.0006) [2023-03-07 00:30:36,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13175.5, 300 sec: 13166.2). Total num frames: 63787008. Throughput: 0: 13175.9. Samples: 63759638. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:30:36,247][81074] Avg episode reward: [(0, '2879.059')] [2023-03-07 00:30:36,794][81400] Updated weights for policy 0, policy_version 62300 (0.0006) [2023-03-07 00:30:37,579][81400] Updated weights for policy 0, policy_version 62310 (0.0006) [2023-03-07 00:30:38,347][81400] Updated weights for policy 0, policy_version 62320 (0.0006) [2023-03-07 00:30:39,129][81400] Updated weights for policy 0, policy_version 62330 (0.0006) [2023-03-07 00:30:39,913][81400] Updated weights for policy 0, policy_version 62340 (0.0006) [2023-03-07 00:30:40,674][81400] Updated weights for policy 0, policy_version 62350 (0.0007) [2023-03-07 00:30:41,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13169.7). Total num frames: 63853568. Throughput: 0: 13171.1. Samples: 63838795. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:30:41,247][81074] Avg episode reward: [(0, '3033.797')] [2023-03-07 00:30:41,465][81400] Updated weights for policy 0, policy_version 62360 (0.0006) [2023-03-07 00:30:42,241][81400] Updated weights for policy 0, policy_version 62370 (0.0006) [2023-03-07 00:30:43,002][81400] Updated weights for policy 0, policy_version 62380 (0.0007) [2023-03-07 00:30:43,794][81400] Updated weights for policy 0, policy_version 62390 (0.0006) [2023-03-07 00:30:44,577][81400] Updated weights for policy 0, policy_version 62400 (0.0005) [2023-03-07 00:30:45,351][81400] Updated weights for policy 0, policy_version 62410 (0.0006) [2023-03-07 00:30:46,126][81400] Updated weights for policy 0, policy_version 62420 (0.0006) [2023-03-07 00:30:46,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13166.2). Total num frames: 63919104. Throughput: 0: 13170.9. Samples: 63917783. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:30:46,247][81074] Avg episode reward: [(0, '2942.603')] [2023-03-07 00:30:46,890][81400] Updated weights for policy 0, policy_version 62430 (0.0006) [2023-03-07 00:30:47,692][81400] Updated weights for policy 0, policy_version 62440 (0.0008) [2023-03-07 00:30:48,476][81400] Updated weights for policy 0, policy_version 62450 (0.0006) [2023-03-07 00:30:49,243][81400] Updated weights for policy 0, policy_version 62460 (0.0007) [2023-03-07 00:30:50,028][81400] Updated weights for policy 0, policy_version 62470 (0.0007) [2023-03-07 00:30:50,815][81400] Updated weights for policy 0, policy_version 62480 (0.0006) [2023-03-07 00:30:51,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13175.5, 300 sec: 13169.7). Total num frames: 63984640. Throughput: 0: 13165.7. Samples: 63957088. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:30:51,237][81074] Avg episode reward: [(0, '3107.985')] [2023-03-07 00:30:51,590][81400] Updated weights for policy 0, policy_version 62490 (0.0006) [2023-03-07 00:30:52,362][81400] Updated weights for policy 0, policy_version 62500 (0.0006) [2023-03-07 00:30:53,129][81400] Updated weights for policy 0, policy_version 62510 (0.0006) [2023-03-07 00:30:53,911][81400] Updated weights for policy 0, policy_version 62520 (0.0006) [2023-03-07 00:30:54,696][81400] Updated weights for policy 0, policy_version 62530 (0.0006) [2023-03-07 00:30:55,454][81400] Updated weights for policy 0, policy_version 62540 (0.0006) [2023-03-07 00:30:56,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13169.7). Total num frames: 64050176. Throughput: 0: 13167.3. Samples: 64036081. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:30:56,237][81074] Avg episode reward: [(0, '2868.138')] [2023-03-07 00:30:56,243][81400] Updated weights for policy 0, policy_version 62550 (0.0006) [2023-03-07 00:30:57,015][81400] Updated weights for policy 0, policy_version 62560 (0.0006) [2023-03-07 00:30:57,810][81400] Updated weights for policy 0, policy_version 62570 (0.0006) [2023-03-07 00:30:58,590][81400] Updated weights for policy 0, policy_version 62580 (0.0007) [2023-03-07 00:30:59,357][81400] Updated weights for policy 0, policy_version 62590 (0.0006) [2023-03-07 00:31:00,137][81400] Updated weights for policy 0, policy_version 62600 (0.0006) [2023-03-07 00:31:00,932][81400] Updated weights for policy 0, policy_version 62610 (0.0007) [2023-03-07 00:31:01,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13169.7). Total num frames: 64115712. Throughput: 0: 13155.1. Samples: 64115082. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:31:01,237][81074] Avg episode reward: [(0, '2961.308')] [2023-03-07 00:31:01,697][81400] Updated weights for policy 0, policy_version 62620 (0.0006) [2023-03-07 00:31:02,465][81400] Updated weights for policy 0, policy_version 62630 (0.0006) [2023-03-07 00:31:03,238][81400] Updated weights for policy 0, policy_version 62640 (0.0007) [2023-03-07 00:31:04,009][81400] Updated weights for policy 0, policy_version 62650 (0.0006) [2023-03-07 00:31:04,782][81400] Updated weights for policy 0, policy_version 62660 (0.0006) [2023-03-07 00:31:05,546][81400] Updated weights for policy 0, policy_version 62670 (0.0006) [2023-03-07 00:31:06,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13173.2). Total num frames: 64182272. Throughput: 0: 13164.3. Samples: 64154827. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:31:06,237][81074] Avg episode reward: [(0, '3020.273')] [2023-03-07 00:31:06,336][81400] Updated weights for policy 0, policy_version 62680 (0.0006) [2023-03-07 00:31:07,102][81400] Updated weights for policy 0, policy_version 62690 (0.0006) [2023-03-07 00:31:07,891][81400] Updated weights for policy 0, policy_version 62700 (0.0006) [2023-03-07 00:31:08,678][81400] Updated weights for policy 0, policy_version 62710 (0.0005) [2023-03-07 00:31:09,445][81400] Updated weights for policy 0, policy_version 62720 (0.0006) [2023-03-07 00:31:10,209][81400] Updated weights for policy 0, policy_version 62730 (0.0007) [2023-03-07 00:31:10,984][81400] Updated weights for policy 0, policy_version 62740 (0.0005) [2023-03-07 00:31:11,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13175.5, 300 sec: 13176.6). Total num frames: 64248832. Throughput: 0: 13166.1. Samples: 64234111. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:31:11,237][81074] Avg episode reward: [(0, '3121.223')] [2023-03-07 00:31:11,774][81400] Updated weights for policy 0, policy_version 62750 (0.0005) [2023-03-07 00:31:12,538][81400] Updated weights for policy 0, policy_version 62760 (0.0006) [2023-03-07 00:31:13,331][81400] Updated weights for policy 0, policy_version 62770 (0.0007) [2023-03-07 00:31:14,103][81400] Updated weights for policy 0, policy_version 62780 (0.0006) [2023-03-07 00:31:14,885][81400] Updated weights for policy 0, policy_version 62790 (0.0007) [2023-03-07 00:31:15,653][81400] Updated weights for policy 0, policy_version 62800 (0.0006) [2023-03-07 00:31:16,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13175.4, 300 sec: 13173.1). Total num frames: 64314368. Throughput: 0: 13176.6. Samples: 64313313. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:31:16,237][81074] Avg episode reward: [(0, '3076.126')] [2023-03-07 00:31:16,408][81400] Updated weights for policy 0, policy_version 62810 (0.0006) [2023-03-07 00:31:17,206][81400] Updated weights for policy 0, policy_version 62820 (0.0007) [2023-03-07 00:31:17,972][81400] Updated weights for policy 0, policy_version 62830 (0.0006) [2023-03-07 00:31:18,738][81400] Updated weights for policy 0, policy_version 62840 (0.0006) [2023-03-07 00:31:19,517][81400] Updated weights for policy 0, policy_version 62850 (0.0006) [2023-03-07 00:31:20,292][81400] Updated weights for policy 0, policy_version 62860 (0.0006) [2023-03-07 00:31:21,072][81400] Updated weights for policy 0, policy_version 62870 (0.0006) [2023-03-07 00:31:21,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13176.6). Total num frames: 64380928. Throughput: 0: 13184.3. Samples: 64352933. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:31:21,237][81074] Avg episode reward: [(0, '2992.300')] [2023-03-07 00:31:21,836][81400] Updated weights for policy 0, policy_version 62880 (0.0006) [2023-03-07 00:31:22,613][81400] Updated weights for policy 0, policy_version 62890 (0.0006) [2023-03-07 00:31:23,372][81400] Updated weights for policy 0, policy_version 62900 (0.0007) [2023-03-07 00:31:24,157][81400] Updated weights for policy 0, policy_version 62910 (0.0007) [2023-03-07 00:31:24,934][81400] Updated weights for policy 0, policy_version 62920 (0.0006) [2023-03-07 00:31:25,720][81400] Updated weights for policy 0, policy_version 62930 (0.0006) [2023-03-07 00:31:26,236][81074] Fps is (10 sec: 13209.8, 60 sec: 13175.5, 300 sec: 13173.2). Total num frames: 64446464. Throughput: 0: 13188.1. Samples: 64432260. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:31:26,237][81074] Avg episode reward: [(0, '3166.974')] [2023-03-07 00:31:26,478][81400] Updated weights for policy 0, policy_version 62940 (0.0007) [2023-03-07 00:31:27,264][81400] Updated weights for policy 0, policy_version 62950 (0.0006) [2023-03-07 00:31:28,040][81400] Updated weights for policy 0, policy_version 62960 (0.0006) [2023-03-07 00:31:28,810][81400] Updated weights for policy 0, policy_version 62970 (0.0005) [2023-03-07 00:31:29,579][81400] Updated weights for policy 0, policy_version 62980 (0.0007) [2023-03-07 00:31:30,369][81400] Updated weights for policy 0, policy_version 62990 (0.0006) [2023-03-07 00:31:31,134][81400] Updated weights for policy 0, policy_version 63000 (0.0007) [2023-03-07 00:31:31,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13192.5, 300 sec: 13176.6). Total num frames: 64513024. Throughput: 0: 13196.4. Samples: 64511621. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:31:31,237][81074] Avg episode reward: [(0, '3176.927')] [2023-03-07 00:31:31,897][81400] Updated weights for policy 0, policy_version 63010 (0.0006) [2023-03-07 00:31:32,677][81400] Updated weights for policy 0, policy_version 63020 (0.0006) [2023-03-07 00:31:33,467][81400] Updated weights for policy 0, policy_version 63030 (0.0006) [2023-03-07 00:31:34,272][81400] Updated weights for policy 0, policy_version 63040 (0.0007) [2023-03-07 00:31:35,035][81400] Updated weights for policy 0, policy_version 63050 (0.0006) [2023-03-07 00:31:35,813][81400] Updated weights for policy 0, policy_version 63060 (0.0006) [2023-03-07 00:31:36,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13173.2). Total num frames: 64578560. Throughput: 0: 13197.0. Samples: 64550954. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:31:36,237][81074] Avg episode reward: [(0, '3252.343')] [2023-03-07 00:31:36,578][81400] Updated weights for policy 0, policy_version 63070 (0.0006) [2023-03-07 00:31:37,357][81400] Updated weights for policy 0, policy_version 63080 (0.0007) [2023-03-07 00:31:38,134][81400] Updated weights for policy 0, policy_version 63090 (0.0007) [2023-03-07 00:31:38,907][81400] Updated weights for policy 0, policy_version 63100 (0.0006) [2023-03-07 00:31:39,690][81400] Updated weights for policy 0, policy_version 63110 (0.0006) [2023-03-07 00:31:40,471][81400] Updated weights for policy 0, policy_version 63120 (0.0006) [2023-03-07 00:31:41,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13175.4, 300 sec: 13173.2). Total num frames: 64644096. Throughput: 0: 13199.7. Samples: 64630067. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:31:41,237][81074] Avg episode reward: [(0, '3352.099')] [2023-03-07 00:31:41,248][81400] Updated weights for policy 0, policy_version 63130 (0.0006) [2023-03-07 00:31:42,027][81400] Updated weights for policy 0, policy_version 63140 (0.0005) [2023-03-07 00:31:42,809][81400] Updated weights for policy 0, policy_version 63150 (0.0007) [2023-03-07 00:31:43,590][81400] Updated weights for policy 0, policy_version 63160 (0.0007) [2023-03-07 00:31:44,371][81400] Updated weights for policy 0, policy_version 63170 (0.0006) [2023-03-07 00:31:45,133][81400] Updated weights for policy 0, policy_version 63180 (0.0006) [2023-03-07 00:31:45,914][81400] Updated weights for policy 0, policy_version 63190 (0.0006) [2023-03-07 00:31:46,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13192.5, 300 sec: 13176.6). Total num frames: 64710656. Throughput: 0: 13195.9. Samples: 64708900. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:31:46,237][81074] Avg episode reward: [(0, '3318.297')] [2023-03-07 00:31:46,706][81400] Updated weights for policy 0, policy_version 63200 (0.0006) [2023-03-07 00:31:47,481][81400] Updated weights for policy 0, policy_version 63210 (0.0007) [2023-03-07 00:31:48,256][81400] Updated weights for policy 0, policy_version 63220 (0.0006) [2023-03-07 00:31:49,037][81400] Updated weights for policy 0, policy_version 63230 (0.0006) [2023-03-07 00:31:49,806][81400] Updated weights for policy 0, policy_version 63240 (0.0006) [2023-03-07 00:31:50,577][81400] Updated weights for policy 0, policy_version 63250 (0.0007) [2023-03-07 00:31:51,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13192.5, 300 sec: 13173.1). Total num frames: 64776192. Throughput: 0: 13191.8. Samples: 64748458. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:31:51,237][81074] Avg episode reward: [(0, '3438.756')] [2023-03-07 00:31:51,356][81400] Updated weights for policy 0, policy_version 63260 (0.0006) [2023-03-07 00:31:52,122][81400] Updated weights for policy 0, policy_version 63270 (0.0006) [2023-03-07 00:31:52,907][81400] Updated weights for policy 0, policy_version 63280 (0.0006) [2023-03-07 00:31:53,655][81400] Updated weights for policy 0, policy_version 63290 (0.0006) [2023-03-07 00:31:54,436][81400] Updated weights for policy 0, policy_version 63300 (0.0006) [2023-03-07 00:31:55,228][81400] Updated weights for policy 0, policy_version 63310 (0.0007) [2023-03-07 00:31:55,979][81400] Updated weights for policy 0, policy_version 63320 (0.0006) [2023-03-07 00:31:56,236][81074] Fps is (10 sec: 13107.4, 60 sec: 13192.5, 300 sec: 13173.2). Total num frames: 64841728. Throughput: 0: 13197.2. Samples: 64827986. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:31:56,237][81074] Avg episode reward: [(0, '3335.975')] [2023-03-07 00:31:56,240][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000063323_64842752.pth... [2023-03-07 00:31:56,270][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000060236_61681664.pth [2023-03-07 00:31:56,750][81400] Updated weights for policy 0, policy_version 63330 (0.0006) [2023-03-07 00:31:57,534][81400] Updated weights for policy 0, policy_version 63340 (0.0007) [2023-03-07 00:31:58,341][81400] Updated weights for policy 0, policy_version 63350 (0.0006) [2023-03-07 00:31:59,124][81400] Updated weights for policy 0, policy_version 63360 (0.0007) [2023-03-07 00:31:59,893][81400] Updated weights for policy 0, policy_version 63370 (0.0006) [2023-03-07 00:32:00,661][81400] Updated weights for policy 0, policy_version 63380 (0.0006) [2023-03-07 00:32:01,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13209.6, 300 sec: 13173.2). Total num frames: 64908288. Throughput: 0: 13195.0. Samples: 64907085. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:32:01,237][81074] Avg episode reward: [(0, '3343.579')] [2023-03-07 00:32:01,446][81400] Updated weights for policy 0, policy_version 63390 (0.0006) [2023-03-07 00:32:02,209][81400] Updated weights for policy 0, policy_version 63400 (0.0006) [2023-03-07 00:32:02,990][81400] Updated weights for policy 0, policy_version 63410 (0.0006) [2023-03-07 00:32:03,770][81400] Updated weights for policy 0, policy_version 63420 (0.0006) [2023-03-07 00:32:04,547][81400] Updated weights for policy 0, policy_version 63430 (0.0006) [2023-03-07 00:32:05,348][81400] Updated weights for policy 0, policy_version 63440 (0.0006) [2023-03-07 00:32:06,110][81400] Updated weights for policy 0, policy_version 63450 (0.0006) [2023-03-07 00:32:06,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13192.5, 300 sec: 13173.2). Total num frames: 64973824. Throughput: 0: 13188.8. Samples: 64946430. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:32:06,237][81074] Avg episode reward: [(0, '3130.344')] [2023-03-07 00:32:06,870][81400] Updated weights for policy 0, policy_version 63460 (0.0006) [2023-03-07 00:32:07,649][81400] Updated weights for policy 0, policy_version 63470 (0.0006) [2023-03-07 00:32:08,437][81400] Updated weights for policy 0, policy_version 63480 (0.0008) [2023-03-07 00:32:09,230][81400] Updated weights for policy 0, policy_version 63490 (0.0006) [2023-03-07 00:32:09,999][81400] Updated weights for policy 0, policy_version 63500 (0.0006) [2023-03-07 00:32:10,781][81400] Updated weights for policy 0, policy_version 63510 (0.0006) [2023-03-07 00:32:11,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13175.5, 300 sec: 13173.2). Total num frames: 65039360. Throughput: 0: 13182.7. Samples: 65025481. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:32:11,237][81074] Avg episode reward: [(0, '3223.221')] [2023-03-07 00:32:11,549][81400] Updated weights for policy 0, policy_version 63520 (0.0007) [2023-03-07 00:32:12,327][81400] Updated weights for policy 0, policy_version 63530 (0.0007) [2023-03-07 00:32:13,126][81400] Updated weights for policy 0, policy_version 63540 (0.0007) [2023-03-07 00:32:13,902][81400] Updated weights for policy 0, policy_version 63550 (0.0006) [2023-03-07 00:32:14,677][81400] Updated weights for policy 0, policy_version 63560 (0.0006) [2023-03-07 00:32:15,444][81400] Updated weights for policy 0, policy_version 63570 (0.0006) [2023-03-07 00:32:16,223][81400] Updated weights for policy 0, policy_version 63580 (0.0006) [2023-03-07 00:32:16,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.6, 300 sec: 13173.2). Total num frames: 65105920. Throughput: 0: 13171.0. Samples: 65104315. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:32:16,237][81074] Avg episode reward: [(0, '3447.454')] [2023-03-07 00:32:17,007][81400] Updated weights for policy 0, policy_version 63590 (0.0007) [2023-03-07 00:32:17,769][81400] Updated weights for policy 0, policy_version 63600 (0.0007) [2023-03-07 00:32:18,555][81400] Updated weights for policy 0, policy_version 63610 (0.0006) [2023-03-07 00:32:19,344][81400] Updated weights for policy 0, policy_version 63620 (0.0006) [2023-03-07 00:32:20,119][81400] Updated weights for policy 0, policy_version 63630 (0.0006) [2023-03-07 00:32:20,893][81400] Updated weights for policy 0, policy_version 63640 (0.0006) [2023-03-07 00:32:21,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13173.2). Total num frames: 65171456. Throughput: 0: 13178.6. Samples: 65143991. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:32:21,237][81074] Avg episode reward: [(0, '3495.695')] [2023-03-07 00:32:21,677][81400] Updated weights for policy 0, policy_version 63650 (0.0006) [2023-03-07 00:32:22,440][81400] Updated weights for policy 0, policy_version 63660 (0.0006) [2023-03-07 00:32:23,217][81400] Updated weights for policy 0, policy_version 63670 (0.0006) [2023-03-07 00:32:23,988][81400] Updated weights for policy 0, policy_version 63680 (0.0006) [2023-03-07 00:32:24,762][81400] Updated weights for policy 0, policy_version 63690 (0.0006) [2023-03-07 00:32:25,545][81400] Updated weights for policy 0, policy_version 63700 (0.0007) [2023-03-07 00:32:26,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13173.2). Total num frames: 65238016. Throughput: 0: 13176.7. Samples: 65223018. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:32:26,237][81074] Avg episode reward: [(0, '3402.549')] [2023-03-07 00:32:26,306][81400] Updated weights for policy 0, policy_version 63710 (0.0006) [2023-03-07 00:32:27,079][81400] Updated weights for policy 0, policy_version 63720 (0.0005) [2023-03-07 00:32:27,859][81400] Updated weights for policy 0, policy_version 63730 (0.0006) [2023-03-07 00:32:28,633][81400] Updated weights for policy 0, policy_version 63740 (0.0006) [2023-03-07 00:32:29,410][81400] Updated weights for policy 0, policy_version 63750 (0.0007) [2023-03-07 00:32:30,203][81400] Updated weights for policy 0, policy_version 63760 (0.0006) [2023-03-07 00:32:30,969][81400] Updated weights for policy 0, policy_version 63770 (0.0007) [2023-03-07 00:32:31,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13175.5, 300 sec: 13173.2). Total num frames: 65303552. Throughput: 0: 13187.4. Samples: 65302332. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:32:31,237][81074] Avg episode reward: [(0, '3278.033')] [2023-03-07 00:32:31,741][81400] Updated weights for policy 0, policy_version 63780 (0.0006) [2023-03-07 00:32:32,531][81400] Updated weights for policy 0, policy_version 63790 (0.0006) [2023-03-07 00:32:33,310][81400] Updated weights for policy 0, policy_version 63800 (0.0005) [2023-03-07 00:32:34,071][81400] Updated weights for policy 0, policy_version 63810 (0.0006) [2023-03-07 00:32:34,844][81400] Updated weights for policy 0, policy_version 63820 (0.0006) [2023-03-07 00:32:35,624][81400] Updated weights for policy 0, policy_version 63830 (0.0006) [2023-03-07 00:32:36,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13175.4, 300 sec: 13169.7). Total num frames: 65369088. Throughput: 0: 13184.8. Samples: 65341776. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:32:36,237][81074] Avg episode reward: [(0, '3180.441')] [2023-03-07 00:32:36,402][81400] Updated weights for policy 0, policy_version 63840 (0.0006) [2023-03-07 00:32:37,172][81400] Updated weights for policy 0, policy_version 63850 (0.0006) [2023-03-07 00:32:37,957][81400] Updated weights for policy 0, policy_version 63860 (0.0007) [2023-03-07 00:32:38,729][81400] Updated weights for policy 0, policy_version 63870 (0.0006) [2023-03-07 00:32:39,505][81400] Updated weights for policy 0, policy_version 63880 (0.0007) [2023-03-07 00:32:40,287][81400] Updated weights for policy 0, policy_version 63890 (0.0006) [2023-03-07 00:32:41,070][81400] Updated weights for policy 0, policy_version 63900 (0.0006) [2023-03-07 00:32:41,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13173.2). Total num frames: 65435648. Throughput: 0: 13177.3. Samples: 65420964. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:32:41,237][81074] Avg episode reward: [(0, '3117.219')] [2023-03-07 00:32:41,845][81400] Updated weights for policy 0, policy_version 63910 (0.0007) [2023-03-07 00:32:42,660][81400] Updated weights for policy 0, policy_version 63920 (0.0006) [2023-03-07 00:32:43,424][81400] Updated weights for policy 0, policy_version 63930 (0.0006) [2023-03-07 00:32:44,200][81400] Updated weights for policy 0, policy_version 63940 (0.0006) [2023-03-07 00:32:44,961][81400] Updated weights for policy 0, policy_version 63950 (0.0007) [2023-03-07 00:32:45,717][81400] Updated weights for policy 0, policy_version 63960 (0.0006) [2023-03-07 00:32:46,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13173.2). Total num frames: 65501184. Throughput: 0: 13178.0. Samples: 65500093. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:32:46,237][81074] Avg episode reward: [(0, '3220.193')] [2023-03-07 00:32:46,502][81400] Updated weights for policy 0, policy_version 63970 (0.0006) [2023-03-07 00:32:47,286][81400] Updated weights for policy 0, policy_version 63980 (0.0006) [2023-03-07 00:32:48,057][81400] Updated weights for policy 0, policy_version 63990 (0.0005) [2023-03-07 00:32:48,848][81400] Updated weights for policy 0, policy_version 64000 (0.0006) [2023-03-07 00:32:49,629][81400] Updated weights for policy 0, policy_version 64010 (0.0006) [2023-03-07 00:32:50,393][81400] Updated weights for policy 0, policy_version 64020 (0.0006) [2023-03-07 00:32:51,171][81400] Updated weights for policy 0, policy_version 64030 (0.0006) [2023-03-07 00:32:51,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13175.5, 300 sec: 13173.2). Total num frames: 65566720. Throughput: 0: 13177.5. Samples: 65539420. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:32:51,237][81074] Avg episode reward: [(0, '3173.068')] [2023-03-07 00:32:51,942][81400] Updated weights for policy 0, policy_version 64040 (0.0005) [2023-03-07 00:32:52,730][81400] Updated weights for policy 0, policy_version 64050 (0.0007) [2023-03-07 00:32:53,501][81400] Updated weights for policy 0, policy_version 64060 (0.0006) [2023-03-07 00:32:54,290][81400] Updated weights for policy 0, policy_version 64070 (0.0007) [2023-03-07 00:32:55,058][81400] Updated weights for policy 0, policy_version 64080 (0.0007) [2023-03-07 00:32:55,834][81400] Updated weights for policy 0, policy_version 64090 (0.0006) [2023-03-07 00:32:56,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13192.5, 300 sec: 13173.2). Total num frames: 65633280. Throughput: 0: 13178.9. Samples: 65618534. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:32:56,237][81074] Avg episode reward: [(0, '2931.833')] [2023-03-07 00:32:56,588][81400] Updated weights for policy 0, policy_version 64100 (0.0007) [2023-03-07 00:32:57,376][81400] Updated weights for policy 0, policy_version 64110 (0.0006) [2023-03-07 00:32:58,137][81400] Updated weights for policy 0, policy_version 64120 (0.0006) [2023-03-07 00:32:58,912][81400] Updated weights for policy 0, policy_version 64130 (0.0005) [2023-03-07 00:32:59,684][81400] Updated weights for policy 0, policy_version 64140 (0.0007) [2023-03-07 00:33:00,461][81400] Updated weights for policy 0, policy_version 64150 (0.0006) [2023-03-07 00:33:01,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13173.2). Total num frames: 65698816. Throughput: 0: 13190.4. Samples: 65697884. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:33:01,237][81074] Avg episode reward: [(0, '2731.200')] [2023-03-07 00:33:01,238][81400] Updated weights for policy 0, policy_version 64160 (0.0006) [2023-03-07 00:33:02,013][81400] Updated weights for policy 0, policy_version 64170 (0.0006) [2023-03-07 00:33:02,780][81400] Updated weights for policy 0, policy_version 64180 (0.0006) [2023-03-07 00:33:03,574][81400] Updated weights for policy 0, policy_version 64190 (0.0006) [2023-03-07 00:33:04,346][81400] Updated weights for policy 0, policy_version 64200 (0.0006) [2023-03-07 00:33:05,124][81400] Updated weights for policy 0, policy_version 64210 (0.0006) [2023-03-07 00:33:05,913][81400] Updated weights for policy 0, policy_version 64220 (0.0007) [2023-03-07 00:33:06,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13192.5, 300 sec: 13176.6). Total num frames: 65765376. Throughput: 0: 13189.4. Samples: 65737518. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:33:06,237][81074] Avg episode reward: [(0, '2994.630')] [2023-03-07 00:33:06,688][81400] Updated weights for policy 0, policy_version 64230 (0.0006) [2023-03-07 00:33:07,466][81400] Updated weights for policy 0, policy_version 64240 (0.0007) [2023-03-07 00:33:08,256][81400] Updated weights for policy 0, policy_version 64250 (0.0007) [2023-03-07 00:33:09,027][81400] Updated weights for policy 0, policy_version 64260 (0.0006) [2023-03-07 00:33:09,789][81400] Updated weights for policy 0, policy_version 64270 (0.0006) [2023-03-07 00:33:10,576][81400] Updated weights for policy 0, policy_version 64280 (0.0007) [2023-03-07 00:33:11,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13192.5, 300 sec: 13173.2). Total num frames: 65830912. Throughput: 0: 13187.8. Samples: 65816470. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:33:11,237][81074] Avg episode reward: [(0, '3155.802')] [2023-03-07 00:33:11,357][81400] Updated weights for policy 0, policy_version 64290 (0.0006) [2023-03-07 00:33:12,136][81400] Updated weights for policy 0, policy_version 64300 (0.0006) [2023-03-07 00:33:12,915][81400] Updated weights for policy 0, policy_version 64310 (0.0006) [2023-03-07 00:33:13,670][81400] Updated weights for policy 0, policy_version 64320 (0.0006) [2023-03-07 00:33:14,452][81400] Updated weights for policy 0, policy_version 64330 (0.0006) [2023-03-07 00:33:15,245][81400] Updated weights for policy 0, policy_version 64340 (0.0007) [2023-03-07 00:33:16,010][81400] Updated weights for policy 0, policy_version 64350 (0.0006) [2023-03-07 00:33:16,236][81074] Fps is (10 sec: 13209.8, 60 sec: 13192.5, 300 sec: 13176.6). Total num frames: 65897472. Throughput: 0: 13185.5. Samples: 65895678. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:33:16,237][81074] Avg episode reward: [(0, '3261.154')] [2023-03-07 00:33:16,790][81400] Updated weights for policy 0, policy_version 64360 (0.0006) [2023-03-07 00:33:17,564][81400] Updated weights for policy 0, policy_version 64370 (0.0006) [2023-03-07 00:33:18,338][81400] Updated weights for policy 0, policy_version 64380 (0.0006) [2023-03-07 00:33:19,113][81400] Updated weights for policy 0, policy_version 64390 (0.0006) [2023-03-07 00:33:19,876][81400] Updated weights for policy 0, policy_version 64400 (0.0006) [2023-03-07 00:33:20,664][81400] Updated weights for policy 0, policy_version 64410 (0.0006) [2023-03-07 00:33:21,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13176.6). Total num frames: 65963008. Throughput: 0: 13193.0. Samples: 65935460. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:33:21,237][81074] Avg episode reward: [(0, '3294.691')] [2023-03-07 00:33:21,454][81400] Updated weights for policy 0, policy_version 64420 (0.0006) [2023-03-07 00:33:22,226][81400] Updated weights for policy 0, policy_version 64430 (0.0006) [2023-03-07 00:33:23,011][81400] Updated weights for policy 0, policy_version 64440 (0.0007) [2023-03-07 00:33:23,795][81400] Updated weights for policy 0, policy_version 64450 (0.0006) [2023-03-07 00:33:24,572][81400] Updated weights for policy 0, policy_version 64460 (0.0006) [2023-03-07 00:33:25,357][81400] Updated weights for policy 0, policy_version 64470 (0.0006) [2023-03-07 00:33:26,129][81400] Updated weights for policy 0, policy_version 64480 (0.0006) [2023-03-07 00:33:26,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13175.5, 300 sec: 13176.6). Total num frames: 66028544. Throughput: 0: 13181.8. Samples: 66014145. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:33:26,237][81074] Avg episode reward: [(0, '3349.203')] [2023-03-07 00:33:26,891][81400] Updated weights for policy 0, policy_version 64490 (0.0006) [2023-03-07 00:33:27,668][81400] Updated weights for policy 0, policy_version 64500 (0.0006) [2023-03-07 00:33:28,455][81400] Updated weights for policy 0, policy_version 64510 (0.0005) [2023-03-07 00:33:29,233][81400] Updated weights for policy 0, policy_version 64520 (0.0006) [2023-03-07 00:33:30,018][81400] Updated weights for policy 0, policy_version 64530 (0.0006) [2023-03-07 00:33:30,786][81400] Updated weights for policy 0, policy_version 64540 (0.0006) [2023-03-07 00:33:31,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13175.5, 300 sec: 13173.1). Total num frames: 66094080. Throughput: 0: 13178.5. Samples: 66093127. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:33:31,237][81074] Avg episode reward: [(0, '3342.678')] [2023-03-07 00:33:31,573][81400] Updated weights for policy 0, policy_version 64550 (0.0006) [2023-03-07 00:33:32,348][81400] Updated weights for policy 0, policy_version 64560 (0.0006) [2023-03-07 00:33:33,134][81400] Updated weights for policy 0, policy_version 64570 (0.0007) [2023-03-07 00:33:33,911][81400] Updated weights for policy 0, policy_version 64580 (0.0006) [2023-03-07 00:33:34,521][81349] KL-divergence is very high: 102.1779 [2023-03-07 00:33:34,689][81400] Updated weights for policy 0, policy_version 64590 (0.0006) [2023-03-07 00:33:35,458][81400] Updated weights for policy 0, policy_version 64600 (0.0006) [2023-03-07 00:33:36,235][81400] Updated weights for policy 0, policy_version 64610 (0.0006) [2023-03-07 00:33:36,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13180.1). Total num frames: 66160640. Throughput: 0: 13181.5. Samples: 66132586. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:33:36,237][81074] Avg episode reward: [(0, '3205.237')] [2023-03-07 00:33:37,004][81400] Updated weights for policy 0, policy_version 64620 (0.0006) [2023-03-07 00:33:37,789][81400] Updated weights for policy 0, policy_version 64630 (0.0006) [2023-03-07 00:33:38,553][81400] Updated weights for policy 0, policy_version 64640 (0.0006) [2023-03-07 00:33:39,314][81400] Updated weights for policy 0, policy_version 64650 (0.0006) [2023-03-07 00:33:40,097][81400] Updated weights for policy 0, policy_version 64660 (0.0006) [2023-03-07 00:33:40,888][81400] Updated weights for policy 0, policy_version 64670 (0.0006) [2023-03-07 00:33:41,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13176.6). Total num frames: 66226176. Throughput: 0: 13184.5. Samples: 66211833. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:33:41,237][81074] Avg episode reward: [(0, '3379.241')] [2023-03-07 00:33:41,327][81349] KL-divergence is very high: 121.8101 [2023-03-07 00:33:41,646][81400] Updated weights for policy 0, policy_version 64680 (0.0007) [2023-03-07 00:33:42,422][81400] Updated weights for policy 0, policy_version 64690 (0.0006) [2023-03-07 00:33:43,214][81400] Updated weights for policy 0, policy_version 64700 (0.0006) [2023-03-07 00:33:43,984][81400] Updated weights for policy 0, policy_version 64710 (0.0007) [2023-03-07 00:33:44,755][81400] Updated weights for policy 0, policy_version 64720 (0.0006) [2023-03-07 00:33:45,547][81400] Updated weights for policy 0, policy_version 64730 (0.0007) [2023-03-07 00:33:46,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13175.4, 300 sec: 13176.6). Total num frames: 66291712. Throughput: 0: 13178.5. Samples: 66290916. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:33:46,237][81074] Avg episode reward: [(0, '3275.615')] [2023-03-07 00:33:46,315][81400] Updated weights for policy 0, policy_version 64740 (0.0006) [2023-03-07 00:33:47,093][81400] Updated weights for policy 0, policy_version 64750 (0.0006) [2023-03-07 00:33:47,855][81400] Updated weights for policy 0, policy_version 64760 (0.0006) [2023-03-07 00:33:48,637][81400] Updated weights for policy 0, policy_version 64770 (0.0006) [2023-03-07 00:33:49,412][81400] Updated weights for policy 0, policy_version 64780 (0.0007) [2023-03-07 00:33:50,192][81400] Updated weights for policy 0, policy_version 64790 (0.0006) [2023-03-07 00:33:50,976][81400] Updated weights for policy 0, policy_version 64800 (0.0007) [2023-03-07 00:33:51,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13192.5, 300 sec: 13176.6). Total num frames: 66358272. Throughput: 0: 13176.9. Samples: 66330478. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:33:51,237][81074] Avg episode reward: [(0, '3123.158')] [2023-03-07 00:33:51,761][81400] Updated weights for policy 0, policy_version 64810 (0.0007) [2023-03-07 00:33:52,544][81400] Updated weights for policy 0, policy_version 64820 (0.0006) [2023-03-07 00:33:53,318][81400] Updated weights for policy 0, policy_version 64830 (0.0007) [2023-03-07 00:33:54,099][81400] Updated weights for policy 0, policy_version 64840 (0.0005) [2023-03-07 00:33:54,880][81400] Updated weights for policy 0, policy_version 64850 (0.0007) [2023-03-07 00:33:55,664][81400] Updated weights for policy 0, policy_version 64860 (0.0006) [2023-03-07 00:33:56,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13176.6). Total num frames: 66423808. Throughput: 0: 13176.7. Samples: 66409421. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:33:56,237][81074] Avg episode reward: [(0, '2959.031')] [2023-03-07 00:33:56,239][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000064867_66423808.pth... 
[2023-03-07 00:33:56,271][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000061778_63260672.pth [2023-03-07 00:33:56,439][81400] Updated weights for policy 0, policy_version 64870 (0.0007) [2023-03-07 00:33:57,212][81400] Updated weights for policy 0, policy_version 64880 (0.0007) [2023-03-07 00:33:57,994][81400] Updated weights for policy 0, policy_version 64890 (0.0006) [2023-03-07 00:33:58,792][81400] Updated weights for policy 0, policy_version 64900 (0.0007) [2023-03-07 00:33:59,576][81400] Updated weights for policy 0, policy_version 64910 (0.0007) [2023-03-07 00:34:00,355][81400] Updated weights for policy 0, policy_version 64920 (0.0006) [2023-03-07 00:34:01,146][81400] Updated weights for policy 0, policy_version 64930 (0.0006) [2023-03-07 00:34:01,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13175.5, 300 sec: 13176.6). Total num frames: 66489344. Throughput: 0: 13166.7. Samples: 66488180. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:34:01,237][81074] Avg episode reward: [(0, '3282.799')] [2023-03-07 00:34:01,910][81400] Updated weights for policy 0, policy_version 64940 (0.0006) [2023-03-07 00:34:02,691][81400] Updated weights for policy 0, policy_version 64950 (0.0006) [2023-03-07 00:34:03,477][81400] Updated weights for policy 0, policy_version 64960 (0.0006) [2023-03-07 00:34:04,257][81400] Updated weights for policy 0, policy_version 64970 (0.0006) [2023-03-07 00:34:05,033][81400] Updated weights for policy 0, policy_version 64980 (0.0006) [2023-03-07 00:34:05,803][81400] Updated weights for policy 0, policy_version 64990 (0.0006) [2023-03-07 00:34:06,236][81074] Fps is (10 sec: 13107.0, 60 sec: 13158.4, 300 sec: 13176.6). Total num frames: 66554880. Throughput: 0: 13155.5. Samples: 66527457. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:34:06,237][81074] Avg episode reward: [(0, '3092.117')] [2023-03-07 00:34:06,563][81400] Updated weights for policy 0, policy_version 65000 (0.0006) [2023-03-07 00:34:07,351][81400] Updated weights for policy 0, policy_version 65010 (0.0007) [2023-03-07 00:34:08,135][81400] Updated weights for policy 0, policy_version 65020 (0.0006) [2023-03-07 00:34:08,898][81400] Updated weights for policy 0, policy_version 65030 (0.0006) [2023-03-07 00:34:09,666][81400] Updated weights for policy 0, policy_version 65040 (0.0006) [2023-03-07 00:34:10,440][81400] Updated weights for policy 0, policy_version 65050 (0.0006) [2023-03-07 00:34:11,222][81400] Updated weights for policy 0, policy_version 65060 (0.0007) [2023-03-07 00:34:11,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13180.1). Total num frames: 66621440. Throughput: 0: 13164.5. Samples: 66606546. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:34:11,237][81074] Avg episode reward: [(0, '3097.237')] [2023-03-07 00:34:12,005][81400] Updated weights for policy 0, policy_version 65070 (0.0006) [2023-03-07 00:34:12,783][81400] Updated weights for policy 0, policy_version 65080 (0.0007) [2023-03-07 00:34:13,572][81400] Updated weights for policy 0, policy_version 65090 (0.0006) [2023-03-07 00:34:14,349][81400] Updated weights for policy 0, policy_version 65100 (0.0006) [2023-03-07 00:34:15,122][81400] Updated weights for policy 0, policy_version 65110 (0.0007) [2023-03-07 00:34:15,885][81400] Updated weights for policy 0, policy_version 65120 (0.0006) [2023-03-07 00:34:16,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13180.1). Total num frames: 66686976. Throughput: 0: 13165.5. Samples: 66685573. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:34:16,237][81074] Avg episode reward: [(0, '3163.809')] [2023-03-07 00:34:16,681][81400] Updated weights for policy 0, policy_version 65130 (0.0006) [2023-03-07 00:34:17,475][81400] Updated weights for policy 0, policy_version 65140 (0.0006) [2023-03-07 00:34:18,257][81400] Updated weights for policy 0, policy_version 65150 (0.0007) [2023-03-07 00:34:19,018][81400] Updated weights for policy 0, policy_version 65160 (0.0006) [2023-03-07 00:34:19,775][81400] Updated weights for policy 0, policy_version 65170 (0.0006) [2023-03-07 00:34:20,550][81400] Updated weights for policy 0, policy_version 65180 (0.0006) [2023-03-07 00:34:21,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13176.6). Total num frames: 66752512. Throughput: 0: 13166.2. Samples: 66725063. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:34:21,237][81074] Avg episode reward: [(0, '3130.526')] [2023-03-07 00:34:21,319][81400] Updated weights for policy 0, policy_version 65190 (0.0006) [2023-03-07 00:34:22,092][81400] Updated weights for policy 0, policy_version 65200 (0.0006) [2023-03-07 00:34:22,869][81400] Updated weights for policy 0, policy_version 65210 (0.0006) [2023-03-07 00:34:23,654][81400] Updated weights for policy 0, policy_version 65220 (0.0006) [2023-03-07 00:34:24,412][81400] Updated weights for policy 0, policy_version 65230 (0.0006) [2023-03-07 00:34:25,209][81400] Updated weights for policy 0, policy_version 65240 (0.0006) [2023-03-07 00:34:25,984][81400] Updated weights for policy 0, policy_version 65250 (0.0006) [2023-03-07 00:34:26,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13180.1). Total num frames: 66819072. Throughput: 0: 13169.9. Samples: 66804481. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:34:26,237][81074] Avg episode reward: [(0, '3172.159')] [2023-03-07 00:34:26,766][81400] Updated weights for policy 0, policy_version 65260 (0.0005) [2023-03-07 00:34:27,538][81400] Updated weights for policy 0, policy_version 65270 (0.0006) [2023-03-07 00:34:28,318][81400] Updated weights for policy 0, policy_version 65280 (0.0007) [2023-03-07 00:34:29,090][81400] Updated weights for policy 0, policy_version 65290 (0.0006) [2023-03-07 00:34:29,867][81400] Updated weights for policy 0, policy_version 65300 (0.0006) [2023-03-07 00:34:30,667][81400] Updated weights for policy 0, policy_version 65310 (0.0006) [2023-03-07 00:34:31,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13175.5, 300 sec: 13180.1). Total num frames: 66884608. Throughput: 0: 13161.6. Samples: 66883188. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:34:31,237][81074] Avg episode reward: [(0, '3281.336')] [2023-03-07 00:34:31,451][81400] Updated weights for policy 0, policy_version 65320 (0.0006) [2023-03-07 00:34:32,213][81400] Updated weights for policy 0, policy_version 65330 (0.0006) [2023-03-07 00:34:32,974][81400] Updated weights for policy 0, policy_version 65340 (0.0006) [2023-03-07 00:34:33,757][81400] Updated weights for policy 0, policy_version 65350 (0.0006) [2023-03-07 00:34:34,535][81400] Updated weights for policy 0, policy_version 65360 (0.0006) [2023-03-07 00:34:35,307][81400] Updated weights for policy 0, policy_version 65370 (0.0006) [2023-03-07 00:34:36,094][81400] Updated weights for policy 0, policy_version 65380 (0.0006) [2023-03-07 00:34:36,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13176.6). Total num frames: 66950144. Throughput: 0: 13164.1. Samples: 66922863. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:34:36,237][81074] Avg episode reward: [(0, '3336.954')] [2023-03-07 00:34:36,866][81400] Updated weights for policy 0, policy_version 65390 (0.0006) [2023-03-07 00:34:37,647][81400] Updated weights for policy 0, policy_version 65400 (0.0006) [2023-03-07 00:34:38,430][81400] Updated weights for policy 0, policy_version 65410 (0.0006) [2023-03-07 00:34:39,209][81400] Updated weights for policy 0, policy_version 65420 (0.0006) [2023-03-07 00:34:39,986][81400] Updated weights for policy 0, policy_version 65430 (0.0006) [2023-03-07 00:34:40,762][81400] Updated weights for policy 0, policy_version 65440 (0.0006) [2023-03-07 00:34:41,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13180.1). Total num frames: 67016704. Throughput: 0: 13164.7. Samples: 67001833. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:34:41,237][81074] Avg episode reward: [(0, '3418.828')] [2023-03-07 00:34:41,528][81400] Updated weights for policy 0, policy_version 65450 (0.0006) [2023-03-07 00:34:42,304][81400] Updated weights for policy 0, policy_version 65460 (0.0007) [2023-03-07 00:34:43,089][81400] Updated weights for policy 0, policy_version 65470 (0.0006) [2023-03-07 00:34:43,889][81400] Updated weights for policy 0, policy_version 65480 (0.0006) [2023-03-07 00:34:44,645][81400] Updated weights for policy 0, policy_version 65490 (0.0006) [2023-03-07 00:34:45,427][81400] Updated weights for policy 0, policy_version 65500 (0.0006) [2023-03-07 00:34:46,207][81400] Updated weights for policy 0, policy_version 65510 (0.0006) [2023-03-07 00:34:46,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13180.1). Total num frames: 67082240. Throughput: 0: 13171.6. Samples: 67080903. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:34:46,237][81074] Avg episode reward: [(0, '3552.621')] [2023-03-07 00:34:46,980][81400] Updated weights for policy 0, policy_version 65520 (0.0006) [2023-03-07 00:34:47,745][81400] Updated weights for policy 0, policy_version 65530 (0.0006) [2023-03-07 00:34:48,534][81349] KL-divergence is very high: 6776.8350 [2023-03-07 00:34:48,542][81400] Updated weights for policy 0, policy_version 65540 (0.0006) [2023-03-07 00:34:48,927][81349] KL-divergence is very high: 103.4533 [2023-03-07 00:34:49,322][81400] Updated weights for policy 0, policy_version 65550 (0.0006) [2023-03-07 00:34:50,112][81400] Updated weights for policy 0, policy_version 65560 (0.0007) [2023-03-07 00:34:50,889][81400] Updated weights for policy 0, policy_version 65570 (0.0006) [2023-03-07 00:34:51,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13176.6). Total num frames: 67147776. Throughput: 0: 13175.6. Samples: 67120356. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:34:51,237][81074] Avg episode reward: [(0, '3365.837')] [2023-03-07 00:34:51,669][81400] Updated weights for policy 0, policy_version 65580 (0.0006) [2023-03-07 00:34:52,458][81400] Updated weights for policy 0, policy_version 65590 (0.0006) [2023-03-07 00:34:53,241][81400] Updated weights for policy 0, policy_version 65600 (0.0006) [2023-03-07 00:34:54,019][81400] Updated weights for policy 0, policy_version 65610 (0.0006) [2023-03-07 00:34:54,801][81400] Updated weights for policy 0, policy_version 65620 (0.0006) [2023-03-07 00:34:55,565][81400] Updated weights for policy 0, policy_version 65630 (0.0006) [2023-03-07 00:34:56,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13176.6). Total num frames: 67213312. Throughput: 0: 13165.7. Samples: 67199003. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:34:56,237][81074] Avg episode reward: [(0, '3436.067')] [2023-03-07 00:34:56,349][81400] Updated weights for policy 0, policy_version 65640 (0.0006) [2023-03-07 00:34:57,132][81400] Updated weights for policy 0, policy_version 65650 (0.0007) [2023-03-07 00:34:57,913][81400] Updated weights for policy 0, policy_version 65660 (0.0006) [2023-03-07 00:34:58,685][81400] Updated weights for policy 0, policy_version 65670 (0.0006) [2023-03-07 00:34:59,465][81400] Updated weights for policy 0, policy_version 65680 (0.0005) [2023-03-07 00:35:00,237][81400] Updated weights for policy 0, policy_version 65690 (0.0006) [2023-03-07 00:35:01,023][81400] Updated weights for policy 0, policy_version 65700 (0.0006) [2023-03-07 00:35:01,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13176.6). Total num frames: 67278848. Throughput: 0: 13164.2. Samples: 67277962. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:35:01,237][81074] Avg episode reward: [(0, '3564.338')] [2023-03-07 00:35:01,795][81400] Updated weights for policy 0, policy_version 65710 (0.0006) [2023-03-07 00:35:02,565][81400] Updated weights for policy 0, policy_version 65720 (0.0006) [2023-03-07 00:35:03,351][81400] Updated weights for policy 0, policy_version 65730 (0.0007) [2023-03-07 00:35:04,125][81400] Updated weights for policy 0, policy_version 65740 (0.0006) [2023-03-07 00:35:04,908][81400] Updated weights for policy 0, policy_version 65750 (0.0006) [2023-03-07 00:35:05,689][81400] Updated weights for policy 0, policy_version 65760 (0.0006) [2023-03-07 00:35:06,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13173.2). Total num frames: 67344384. Throughput: 0: 13160.0. Samples: 67317262. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:35:06,237][81074] Avg episode reward: [(0, '3570.195')] [2023-03-07 00:35:06,465][81400] Updated weights for policy 0, policy_version 65770 (0.0006) [2023-03-07 00:35:07,244][81400] Updated weights for policy 0, policy_version 65780 (0.0006) [2023-03-07 00:35:08,021][81400] Updated weights for policy 0, policy_version 65790 (0.0007) [2023-03-07 00:35:08,808][81400] Updated weights for policy 0, policy_version 65800 (0.0008) [2023-03-07 00:35:09,566][81400] Updated weights for policy 0, policy_version 65810 (0.0006) [2023-03-07 00:35:10,362][81400] Updated weights for policy 0, policy_version 65820 (0.0006) [2023-03-07 00:35:11,125][81400] Updated weights for policy 0, policy_version 65830 (0.0006) [2023-03-07 00:35:11,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13176.6). Total num frames: 67410944. Throughput: 0: 13153.7. Samples: 67396398. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:35:11,237][81074] Avg episode reward: [(0, '3459.496')] [2023-03-07 00:35:11,901][81400] Updated weights for policy 0, policy_version 65840 (0.0006) [2023-03-07 00:35:12,674][81400] Updated weights for policy 0, policy_version 65850 (0.0006) [2023-03-07 00:35:13,481][81400] Updated weights for policy 0, policy_version 65860 (0.0006) [2023-03-07 00:35:14,253][81400] Updated weights for policy 0, policy_version 65870 (0.0006) [2023-03-07 00:35:15,027][81400] Updated weights for policy 0, policy_version 65880 (0.0006) [2023-03-07 00:35:15,821][81400] Updated weights for policy 0, policy_version 65890 (0.0007) [2023-03-07 00:35:16,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13173.2). Total num frames: 67476480. Throughput: 0: 13157.5. Samples: 67475277. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:35:16,237][81074] Avg episode reward: [(0, '3396.980')] [2023-03-07 00:35:16,598][81400] Updated weights for policy 0, policy_version 65900 (0.0007) [2023-03-07 00:35:17,381][81400] Updated weights for policy 0, policy_version 65910 (0.0006) [2023-03-07 00:35:18,161][81400] Updated weights for policy 0, policy_version 65920 (0.0006) [2023-03-07 00:35:18,940][81400] Updated weights for policy 0, policy_version 65930 (0.0006) [2023-03-07 00:35:19,707][81400] Updated weights for policy 0, policy_version 65940 (0.0006) [2023-03-07 00:35:20,487][81400] Updated weights for policy 0, policy_version 65950 (0.0007) [2023-03-07 00:35:21,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13173.2). Total num frames: 67542016. Throughput: 0: 13150.5. Samples: 67514634. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:35:21,237][81074] Avg episode reward: [(0, '3511.674')] [2023-03-07 00:35:21,254][81400] Updated weights for policy 0, policy_version 65960 (0.0006) [2023-03-07 00:35:22,062][81400] Updated weights for policy 0, policy_version 65970 (0.0006) [2023-03-07 00:35:22,842][81400] Updated weights for policy 0, policy_version 65980 (0.0005) [2023-03-07 00:35:23,601][81400] Updated weights for policy 0, policy_version 65990 (0.0006) [2023-03-07 00:35:24,397][81400] Updated weights for policy 0, policy_version 66000 (0.0006) [2023-03-07 00:35:25,163][81400] Updated weights for policy 0, policy_version 66010 (0.0006) [2023-03-07 00:35:25,933][81400] Updated weights for policy 0, policy_version 66020 (0.0006) [2023-03-07 00:35:26,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13173.2). Total num frames: 67607552. Throughput: 0: 13148.9. Samples: 67593533. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:35:26,237][81074] Avg episode reward: [(0, '3284.754')] [2023-03-07 00:35:26,717][81400] Updated weights for policy 0, policy_version 66030 (0.0006) [2023-03-07 00:35:27,481][81400] Updated weights for policy 0, policy_version 66040 (0.0007) [2023-03-07 00:35:28,267][81400] Updated weights for policy 0, policy_version 66050 (0.0006) [2023-03-07 00:35:29,049][81400] Updated weights for policy 0, policy_version 66060 (0.0006) [2023-03-07 00:35:29,807][81400] Updated weights for policy 0, policy_version 66070 (0.0006) [2023-03-07 00:35:30,597][81400] Updated weights for policy 0, policy_version 66080 (0.0006) [2023-03-07 00:35:31,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13176.6). Total num frames: 67674112. Throughput: 0: 13148.7. Samples: 67672593. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:35:31,237][81074] Avg episode reward: [(0, '3268.618')] [2023-03-07 00:35:31,380][81400] Updated weights for policy 0, policy_version 66090 (0.0005) [2023-03-07 00:35:32,149][81400] Updated weights for policy 0, policy_version 66100 (0.0006) [2023-03-07 00:35:32,918][81400] Updated weights for policy 0, policy_version 66110 (0.0006) [2023-03-07 00:35:33,685][81400] Updated weights for policy 0, policy_version 66120 (0.0006) [2023-03-07 00:35:34,461][81400] Updated weights for policy 0, policy_version 66130 (0.0006) [2023-03-07 00:35:35,237][81400] Updated weights for policy 0, policy_version 66140 (0.0006) [2023-03-07 00:35:36,025][81400] Updated weights for policy 0, policy_version 66150 (0.0006) [2023-03-07 00:35:36,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13173.1). Total num frames: 67739648. Throughput: 0: 13157.8. Samples: 67712458. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:35:36,237][81074] Avg episode reward: [(0, '3106.792')] [2023-03-07 00:35:36,797][81400] Updated weights for policy 0, policy_version 66160 (0.0006) [2023-03-07 00:35:37,569][81400] Updated weights for policy 0, policy_version 66170 (0.0006) [2023-03-07 00:35:38,346][81400] Updated weights for policy 0, policy_version 66180 (0.0006) [2023-03-07 00:35:39,162][81400] Updated weights for policy 0, policy_version 66190 (0.0006) [2023-03-07 00:35:39,949][81400] Updated weights for policy 0, policy_version 66200 (0.0006) [2023-03-07 00:35:40,722][81400] Updated weights for policy 0, policy_version 66210 (0.0006) [2023-03-07 00:35:41,236][81074] Fps is (10 sec: 13107.0, 60 sec: 13141.3, 300 sec: 13173.1). Total num frames: 67805184. Throughput: 0: 13151.9. Samples: 67790842. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:35:41,237][81074] Avg episode reward: [(0, '3061.126')] [2023-03-07 00:35:41,501][81400] Updated weights for policy 0, policy_version 66220 (0.0006) [2023-03-07 00:35:42,278][81400] Updated weights for policy 0, policy_version 66230 (0.0006) [2023-03-07 00:35:43,081][81400] Updated weights for policy 0, policy_version 66240 (0.0006) [2023-03-07 00:35:43,867][81400] Updated weights for policy 0, policy_version 66250 (0.0006) [2023-03-07 00:35:44,634][81400] Updated weights for policy 0, policy_version 66260 (0.0006) [2023-03-07 00:35:45,411][81400] Updated weights for policy 0, policy_version 66270 (0.0007) [2023-03-07 00:35:46,193][81400] Updated weights for policy 0, policy_version 66280 (0.0007) [2023-03-07 00:35:46,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13173.2). Total num frames: 67870720. Throughput: 0: 13149.2. Samples: 67869674. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:35:46,237][81074] Avg episode reward: [(0, '3150.601')] [2023-03-07 00:35:46,981][81400] Updated weights for policy 0, policy_version 66290 (0.0006) [2023-03-07 00:35:47,756][81400] Updated weights for policy 0, policy_version 66300 (0.0006) [2023-03-07 00:35:48,525][81400] Updated weights for policy 0, policy_version 66310 (0.0007) [2023-03-07 00:35:49,322][81400] Updated weights for policy 0, policy_version 66320 (0.0006) [2023-03-07 00:35:50,094][81400] Updated weights for policy 0, policy_version 66330 (0.0006) [2023-03-07 00:35:50,871][81400] Updated weights for policy 0, policy_version 66340 (0.0006) [2023-03-07 00:35:51,236][81074] Fps is (10 sec: 13107.4, 60 sec: 13141.3, 300 sec: 13173.2). Total num frames: 67936256. Throughput: 0: 13150.5. Samples: 67909033. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:35:51,237][81074] Avg episode reward: [(0, '3176.457')] [2023-03-07 00:35:51,657][81400] Updated weights for policy 0, policy_version 66350 (0.0006) [2023-03-07 00:35:52,424][81400] Updated weights for policy 0, policy_version 66360 (0.0006) [2023-03-07 00:35:53,215][81400] Updated weights for policy 0, policy_version 66370 (0.0007) [2023-03-07 00:35:53,995][81400] Updated weights for policy 0, policy_version 66380 (0.0005) [2023-03-07 00:35:54,770][81400] Updated weights for policy 0, policy_version 66390 (0.0005) [2023-03-07 00:35:55,546][81400] Updated weights for policy 0, policy_version 66400 (0.0006) [2023-03-07 00:35:56,236][81074] Fps is (10 sec: 13107.0, 60 sec: 13141.3, 300 sec: 13173.1). Total num frames: 68001792. Throughput: 0: 13143.0. Samples: 67987833. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:35:56,237][81074] Avg episode reward: [(0, '3387.784')] [2023-03-07 00:35:56,250][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000066409_68002816.pth... 
[2023-03-07 00:35:56,280][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000063323_64842752.pth [2023-03-07 00:35:56,328][81400] Updated weights for policy 0, policy_version 66410 (0.0006) [2023-03-07 00:35:57,103][81400] Updated weights for policy 0, policy_version 66420 (0.0006) [2023-03-07 00:35:57,885][81400] Updated weights for policy 0, policy_version 66430 (0.0006) [2023-03-07 00:35:58,651][81400] Updated weights for policy 0, policy_version 66440 (0.0007) [2023-03-07 00:35:59,443][81400] Updated weights for policy 0, policy_version 66450 (0.0005) [2023-03-07 00:36:00,225][81400] Updated weights for policy 0, policy_version 66460 (0.0006) [2023-03-07 00:36:00,993][81400] Updated weights for policy 0, policy_version 66470 (0.0006) [2023-03-07 00:36:01,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13173.2). Total num frames: 68068352. Throughput: 0: 13138.4. Samples: 68066504. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:36:01,237][81074] Avg episode reward: [(0, '3354.745')] [2023-03-07 00:36:01,789][81400] Updated weights for policy 0, policy_version 66480 (0.0006) [2023-03-07 00:36:02,563][81400] Updated weights for policy 0, policy_version 66490 (0.0005) [2023-03-07 00:36:03,355][81400] Updated weights for policy 0, policy_version 66500 (0.0006) [2023-03-07 00:36:04,114][81400] Updated weights for policy 0, policy_version 66510 (0.0006) [2023-03-07 00:36:04,904][81400] Updated weights for policy 0, policy_version 66520 (0.0006) [2023-03-07 00:36:05,682][81400] Updated weights for policy 0, policy_version 66530 (0.0006) [2023-03-07 00:36:06,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13169.7). Total num frames: 68133888. Throughput: 0: 13144.0. Samples: 68106117. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:36:06,237][81074] Avg episode reward: [(0, '3434.155')] [2023-03-07 00:36:06,462][81400] Updated weights for policy 0, policy_version 66540 (0.0008) [2023-03-07 00:36:07,221][81400] Updated weights for policy 0, policy_version 66550 (0.0007) [2023-03-07 00:36:07,997][81400] Updated weights for policy 0, policy_version 66560 (0.0007) [2023-03-07 00:36:08,775][81400] Updated weights for policy 0, policy_version 66570 (0.0006) [2023-03-07 00:36:09,577][81400] Updated weights for policy 0, policy_version 66580 (0.0006) [2023-03-07 00:36:10,343][81400] Updated weights for policy 0, policy_version 66590 (0.0006) [2023-03-07 00:36:11,125][81400] Updated weights for policy 0, policy_version 66600 (0.0007) [2023-03-07 00:36:11,236][81074] Fps is (10 sec: 13004.7, 60 sec: 13124.3, 300 sec: 13166.2). Total num frames: 68198400. Throughput: 0: 13139.1. Samples: 68184793. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:36:11,237][81074] Avg episode reward: [(0, '3430.419')] [2023-03-07 00:36:11,910][81400] Updated weights for policy 0, policy_version 66610 (0.0006) [2023-03-07 00:36:12,689][81400] Updated weights for policy 0, policy_version 66620 (0.0006) [2023-03-07 00:36:13,487][81400] Updated weights for policy 0, policy_version 66630 (0.0006) [2023-03-07 00:36:14,262][81400] Updated weights for policy 0, policy_version 66640 (0.0005) [2023-03-07 00:36:15,033][81400] Updated weights for policy 0, policy_version 66650 (0.0006) [2023-03-07 00:36:15,821][81400] Updated weights for policy 0, policy_version 66660 (0.0006) [2023-03-07 00:36:16,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13166.2). Total num frames: 68264960. Throughput: 0: 13135.6. Samples: 68263698. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:36:16,237][81074] Avg episode reward: [(0, '3119.029')] [2023-03-07 00:36:16,600][81400] Updated weights for policy 0, policy_version 66670 (0.0007) [2023-03-07 00:36:17,379][81400] Updated weights for policy 0, policy_version 66680 (0.0006) [2023-03-07 00:36:18,140][81400] Updated weights for policy 0, policy_version 66690 (0.0006) [2023-03-07 00:36:18,920][81400] Updated weights for policy 0, policy_version 66700 (0.0006) [2023-03-07 00:36:19,703][81400] Updated weights for policy 0, policy_version 66710 (0.0006) [2023-03-07 00:36:20,485][81400] Updated weights for policy 0, policy_version 66720 (0.0006) [2023-03-07 00:36:21,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13141.3, 300 sec: 13166.2). Total num frames: 68330496. Throughput: 0: 13127.1. Samples: 68303179. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:36:21,237][81074] Avg episode reward: [(0, '2929.255')] [2023-03-07 00:36:21,259][81400] Updated weights for policy 0, policy_version 66730 (0.0007) [2023-03-07 00:36:22,044][81400] Updated weights for policy 0, policy_version 66740 (0.0006) [2023-03-07 00:36:22,825][81400] Updated weights for policy 0, policy_version 66750 (0.0006) [2023-03-07 00:36:23,597][81400] Updated weights for policy 0, policy_version 66760 (0.0006) [2023-03-07 00:36:24,380][81400] Updated weights for policy 0, policy_version 66770 (0.0007) [2023-03-07 00:36:25,161][81400] Updated weights for policy 0, policy_version 66780 (0.0006) [2023-03-07 00:36:25,950][81400] Updated weights for policy 0, policy_version 66790 (0.0006) [2023-03-07 00:36:26,236][81074] Fps is (10 sec: 13107.4, 60 sec: 13141.4, 300 sec: 13162.7). Total num frames: 68396032. Throughput: 0: 13138.5. Samples: 68382074. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:36:26,237][81074] Avg episode reward: [(0, '3028.167')] [2023-03-07 00:36:26,736][81400] Updated weights for policy 0, policy_version 66800 (0.0006) [2023-03-07 00:36:27,489][81400] Updated weights for policy 0, policy_version 66810 (0.0006) [2023-03-07 00:36:28,288][81400] Updated weights for policy 0, policy_version 66820 (0.0007) [2023-03-07 00:36:29,057][81400] Updated weights for policy 0, policy_version 66830 (0.0006) [2023-03-07 00:36:29,837][81400] Updated weights for policy 0, policy_version 66840 (0.0006) [2023-03-07 00:36:30,627][81400] Updated weights for policy 0, policy_version 66850 (0.0006) [2023-03-07 00:36:31,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13162.7). Total num frames: 68461568. Throughput: 0: 13135.8. Samples: 68460787. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:36:31,237][81074] Avg episode reward: [(0, '2823.036')] [2023-03-07 00:36:31,400][81400] Updated weights for policy 0, policy_version 66860 (0.0005) [2023-03-07 00:36:32,167][81400] Updated weights for policy 0, policy_version 66870 (0.0006) [2023-03-07 00:36:32,952][81400] Updated weights for policy 0, policy_version 66880 (0.0005) [2023-03-07 00:36:33,723][81400] Updated weights for policy 0, policy_version 66890 (0.0005) [2023-03-07 00:36:34,495][81400] Updated weights for policy 0, policy_version 66900 (0.0007) [2023-03-07 00:36:35,271][81400] Updated weights for policy 0, policy_version 66910 (0.0006) [2023-03-07 00:36:36,050][81400] Updated weights for policy 0, policy_version 66920 (0.0006) [2023-03-07 00:36:36,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13141.3, 300 sec: 13166.2). Total num frames: 68528128. Throughput: 0: 13144.7. Samples: 68500545. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:36:36,237][81074] Avg episode reward: [(0, '2512.639')] [2023-03-07 00:36:36,808][81400] Updated weights for policy 0, policy_version 66930 (0.0006) [2023-03-07 00:36:37,572][81400] Updated weights for policy 0, policy_version 66940 (0.0007) [2023-03-07 00:36:38,366][81400] Updated weights for policy 0, policy_version 66950 (0.0006) [2023-03-07 00:36:39,146][81400] Updated weights for policy 0, policy_version 66960 (0.0006) [2023-03-07 00:36:39,924][81400] Updated weights for policy 0, policy_version 66970 (0.0007) [2023-03-07 00:36:40,703][81400] Updated weights for policy 0, policy_version 66980 (0.0007) [2023-03-07 00:36:41,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13141.4, 300 sec: 13162.7). Total num frames: 68593664. Throughput: 0: 13150.4. Samples: 68579599. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:36:41,237][81074] Avg episode reward: [(0, '2442.116')] [2023-03-07 00:36:41,489][81400] Updated weights for policy 0, policy_version 66990 (0.0006) [2023-03-07 00:36:42,275][81400] Updated weights for policy 0, policy_version 67000 (0.0006) [2023-03-07 00:36:43,061][81400] Updated weights for policy 0, policy_version 67010 (0.0006) [2023-03-07 00:36:43,842][81400] Updated weights for policy 0, policy_version 67020 (0.0008) [2023-03-07 00:36:44,598][81400] Updated weights for policy 0, policy_version 67030 (0.0007) [2023-03-07 00:36:45,389][81400] Updated weights for policy 0, policy_version 67040 (0.0006) [2023-03-07 00:36:46,166][81400] Updated weights for policy 0, policy_version 67050 (0.0006) [2023-03-07 00:36:46,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13162.7). Total num frames: 68659200. Throughput: 0: 13151.0. Samples: 68658298. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:36:46,237][81074] Avg episode reward: [(0, '2421.571')] [2023-03-07 00:36:46,942][81400] Updated weights for policy 0, policy_version 67060 (0.0006) [2023-03-07 00:36:47,707][81400] Updated weights for policy 0, policy_version 67070 (0.0005) [2023-03-07 00:36:48,492][81400] Updated weights for policy 0, policy_version 67080 (0.0007) [2023-03-07 00:36:49,266][81400] Updated weights for policy 0, policy_version 67090 (0.0007) [2023-03-07 00:36:50,041][81400] Updated weights for policy 0, policy_version 67100 (0.0006) [2023-03-07 00:36:50,844][81400] Updated weights for policy 0, policy_version 67110 (0.0006) [2023-03-07 00:36:51,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13166.2). Total num frames: 68725760. Throughput: 0: 13151.7. Samples: 68697944. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:36:51,237][81074] Avg episode reward: [(0, '2867.571')] [2023-03-07 00:36:51,626][81400] Updated weights for policy 0, policy_version 67120 (0.0006) [2023-03-07 00:36:52,401][81400] Updated weights for policy 0, policy_version 67130 (0.0007) [2023-03-07 00:36:53,189][81400] Updated weights for policy 0, policy_version 67140 (0.0006) [2023-03-07 00:36:53,974][81400] Updated weights for policy 0, policy_version 67150 (0.0006) [2023-03-07 00:36:54,757][81400] Updated weights for policy 0, policy_version 67160 (0.0007) [2023-03-07 00:36:55,540][81400] Updated weights for policy 0, policy_version 67170 (0.0007) [2023-03-07 00:36:56,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13158.4, 300 sec: 13162.7). Total num frames: 68791296. Throughput: 0: 13146.5. Samples: 68776386. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:36:56,237][81074] Avg episode reward: [(0, '2974.491')] [2023-03-07 00:36:56,300][81400] Updated weights for policy 0, policy_version 67180 (0.0006) [2023-03-07 00:36:57,076][81400] Updated weights for policy 0, policy_version 67190 (0.0006) [2023-03-07 00:36:57,854][81400] Updated weights for policy 0, policy_version 67200 (0.0006) [2023-03-07 00:36:58,651][81400] Updated weights for policy 0, policy_version 67210 (0.0007) [2023-03-07 00:36:59,430][81400] Updated weights for policy 0, policy_version 67220 (0.0006) [2023-03-07 00:37:00,216][81400] Updated weights for policy 0, policy_version 67230 (0.0006) [2023-03-07 00:37:00,987][81400] Updated weights for policy 0, policy_version 67240 (0.0006) [2023-03-07 00:37:01,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13162.7). Total num frames: 68856832. Throughput: 0: 13145.9. Samples: 68855264. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:37:01,237][81074] Avg episode reward: [(0, '2819.962')] [2023-03-07 00:37:01,751][81400] Updated weights for policy 0, policy_version 67250 (0.0006) [2023-03-07 00:37:02,527][81400] Updated weights for policy 0, policy_version 67260 (0.0005) [2023-03-07 00:37:03,322][81400] Updated weights for policy 0, policy_version 67270 (0.0006) [2023-03-07 00:37:04,106][81400] Updated weights for policy 0, policy_version 67280 (0.0007) [2023-03-07 00:37:04,901][81400] Updated weights for policy 0, policy_version 67290 (0.0006) [2023-03-07 00:37:05,677][81400] Updated weights for policy 0, policy_version 67300 (0.0006) [2023-03-07 00:37:06,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13141.4, 300 sec: 13162.7). Total num frames: 68922368. Throughput: 0: 13146.2. Samples: 68894757. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:37:06,237][81074] Avg episode reward: [(0, '2834.497')] [2023-03-07 00:37:06,455][81400] Updated weights for policy 0, policy_version 67310 (0.0005) [2023-03-07 00:37:07,225][81400] Updated weights for policy 0, policy_version 67320 (0.0005) [2023-03-07 00:37:07,994][81400] Updated weights for policy 0, policy_version 67330 (0.0006) [2023-03-07 00:37:08,778][81400] Updated weights for policy 0, policy_version 67340 (0.0006) [2023-03-07 00:37:09,541][81400] Updated weights for policy 0, policy_version 67350 (0.0006) [2023-03-07 00:37:10,340][81400] Updated weights for policy 0, policy_version 67360 (0.0007) [2023-03-07 00:37:11,105][81400] Updated weights for policy 0, policy_version 67370 (0.0006) [2023-03-07 00:37:11,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13159.3). Total num frames: 68987904. Throughput: 0: 13148.9. Samples: 68973775. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:37:11,237][81074] Avg episode reward: [(0, '2565.871')] [2023-03-07 00:37:11,880][81400] Updated weights for policy 0, policy_version 67380 (0.0006) [2023-03-07 00:37:12,653][81400] Updated weights for policy 0, policy_version 67390 (0.0006) [2023-03-07 00:37:13,430][81400] Updated weights for policy 0, policy_version 67400 (0.0006) [2023-03-07 00:37:14,189][81400] Updated weights for policy 0, policy_version 67410 (0.0006) [2023-03-07 00:37:14,965][81400] Updated weights for policy 0, policy_version 67420 (0.0006) [2023-03-07 00:37:15,753][81400] Updated weights for policy 0, policy_version 67430 (0.0005) [2023-03-07 00:37:16,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13162.7). Total num frames: 69054464. Throughput: 0: 13163.7. Samples: 69053153. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:37:16,237][81074] Avg episode reward: [(0, '2737.131')] [2023-03-07 00:37:16,510][81400] Updated weights for policy 0, policy_version 67440 (0.0006) [2023-03-07 00:37:17,301][81400] Updated weights for policy 0, policy_version 67450 (0.0006) [2023-03-07 00:37:18,059][81400] Updated weights for policy 0, policy_version 67460 (0.0006) [2023-03-07 00:37:18,847][81400] Updated weights for policy 0, policy_version 67470 (0.0007) [2023-03-07 00:37:19,633][81400] Updated weights for policy 0, policy_version 67480 (0.0007) [2023-03-07 00:37:20,430][81400] Updated weights for policy 0, policy_version 67490 (0.0006) [2023-03-07 00:37:21,212][81400] Updated weights for policy 0, policy_version 67500 (0.0006) [2023-03-07 00:37:21,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13159.3). Total num frames: 69120000. Throughput: 0: 13159.7. Samples: 69092732. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:37:21,237][81074] Avg episode reward: [(0, '3036.295')] [2023-03-07 00:37:21,991][81400] Updated weights for policy 0, policy_version 67510 (0.0007) [2023-03-07 00:37:22,757][81400] Updated weights for policy 0, policy_version 67520 (0.0006) [2023-03-07 00:37:23,534][81400] Updated weights for policy 0, policy_version 67530 (0.0006) [2023-03-07 00:37:24,311][81400] Updated weights for policy 0, policy_version 67540 (0.0006) [2023-03-07 00:37:25,081][81400] Updated weights for policy 0, policy_version 67550 (0.0006) [2023-03-07 00:37:25,873][81400] Updated weights for policy 0, policy_version 67560 (0.0006) [2023-03-07 00:37:26,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13159.3). Total num frames: 69185536. Throughput: 0: 13151.3. Samples: 69171408. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:37:26,237][81074] Avg episode reward: [(0, '2955.609')] [2023-03-07 00:37:26,645][81400] Updated weights for policy 0, policy_version 67570 (0.0006) [2023-03-07 00:37:27,442][81400] Updated weights for policy 0, policy_version 67580 (0.0006) [2023-03-07 00:37:28,201][81400] Updated weights for policy 0, policy_version 67590 (0.0006) [2023-03-07 00:37:28,968][81400] Updated weights for policy 0, policy_version 67600 (0.0006) [2023-03-07 00:37:29,755][81400] Updated weights for policy 0, policy_version 67610 (0.0007) [2023-03-07 00:37:30,529][81400] Updated weights for policy 0, policy_version 67620 (0.0006) [2023-03-07 00:37:31,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13162.7). Total num frames: 69252096. Throughput: 0: 13159.7. Samples: 69250484. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:37:31,237][81074] Avg episode reward: [(0, '2975.657')] [2023-03-07 00:37:31,298][81400] Updated weights for policy 0, policy_version 67630 (0.0006) [2023-03-07 00:37:32,087][81400] Updated weights for policy 0, policy_version 67640 (0.0006) [2023-03-07 00:37:32,864][81400] Updated weights for policy 0, policy_version 67650 (0.0006) [2023-03-07 00:37:33,638][81400] Updated weights for policy 0, policy_version 67660 (0.0006) [2023-03-07 00:37:34,437][81400] Updated weights for policy 0, policy_version 67670 (0.0007) [2023-03-07 00:37:35,213][81400] Updated weights for policy 0, policy_version 67680 (0.0007) [2023-03-07 00:37:35,996][81400] Updated weights for policy 0, policy_version 67690 (0.0006) [2023-03-07 00:37:36,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13159.3). Total num frames: 69317632. Throughput: 0: 13155.8. Samples: 69289957. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:37:36,237][81074] Avg episode reward: [(0, '2800.627')] [2023-03-07 00:37:36,790][81400] Updated weights for policy 0, policy_version 67700 (0.0006) [2023-03-07 00:37:37,551][81400] Updated weights for policy 0, policy_version 67710 (0.0007) [2023-03-07 00:37:38,329][81400] Updated weights for policy 0, policy_version 67720 (0.0006) [2023-03-07 00:37:39,125][81400] Updated weights for policy 0, policy_version 67730 (0.0006) [2023-03-07 00:37:39,885][81400] Updated weights for policy 0, policy_version 67740 (0.0006) [2023-03-07 00:37:40,661][81400] Updated weights for policy 0, policy_version 67750 (0.0006) [2023-03-07 00:37:41,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13159.3). Total num frames: 69383168. Throughput: 0: 13161.2. Samples: 69368640. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:37:41,237][81074] Avg episode reward: [(0, '2762.589')] [2023-03-07 00:37:41,444][81400] Updated weights for policy 0, policy_version 67760 (0.0006) [2023-03-07 00:37:42,212][81400] Updated weights for policy 0, policy_version 67770 (0.0007) [2023-03-07 00:37:43,007][81400] Updated weights for policy 0, policy_version 67780 (0.0006) [2023-03-07 00:37:43,779][81400] Updated weights for policy 0, policy_version 67790 (0.0006) [2023-03-07 00:37:44,558][81400] Updated weights for policy 0, policy_version 67800 (0.0007) [2023-03-07 00:37:45,326][81400] Updated weights for policy 0, policy_version 67810 (0.0006) [2023-03-07 00:37:46,091][81400] Updated weights for policy 0, policy_version 67820 (0.0006) [2023-03-07 00:37:46,236][81074] Fps is (10 sec: 13107.4, 60 sec: 13158.4, 300 sec: 13159.3). Total num frames: 69448704. Throughput: 0: 13171.0. Samples: 69447960. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:37:46,237][81074] Avg episode reward: [(0, '2843.224')] [2023-03-07 00:37:46,864][81400] Updated weights for policy 0, policy_version 67830 (0.0006) [2023-03-07 00:37:47,639][81400] Updated weights for policy 0, policy_version 67840 (0.0006) [2023-03-07 00:37:48,412][81400] Updated weights for policy 0, policy_version 67850 (0.0006) [2023-03-07 00:37:49,176][81400] Updated weights for policy 0, policy_version 67860 (0.0006) [2023-03-07 00:37:49,939][81400] Updated weights for policy 0, policy_version 67870 (0.0006) [2023-03-07 00:37:50,726][81400] Updated weights for policy 0, policy_version 67880 (0.0006) [2023-03-07 00:37:51,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13158.4, 300 sec: 13159.3). Total num frames: 69515264. Throughput: 0: 13181.2. Samples: 69487915. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:37:51,237][81074] Avg episode reward: [(0, '2721.613')] [2023-03-07 00:37:51,505][81400] Updated weights for policy 0, policy_version 67890 (0.0006) [2023-03-07 00:37:52,277][81400] Updated weights for policy 0, policy_version 67900 (0.0006) [2023-03-07 00:37:53,050][81400] Updated weights for policy 0, policy_version 67910 (0.0006) [2023-03-07 00:37:53,822][81400] Updated weights for policy 0, policy_version 67920 (0.0006) [2023-03-07 00:37:54,602][81400] Updated weights for policy 0, policy_version 67930 (0.0006) [2023-03-07 00:37:55,370][81400] Updated weights for policy 0, policy_version 67940 (0.0006) [2023-03-07 00:37:56,146][81400] Updated weights for policy 0, policy_version 67950 (0.0007) [2023-03-07 00:37:56,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13175.5, 300 sec: 13162.7). Total num frames: 69581824. Throughput: 0: 13185.9. Samples: 69567138. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:37:56,247][81074] Avg episode reward: [(0, '3021.254')] [2023-03-07 00:37:56,252][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000067951_69581824.pth... [2023-03-07 00:37:56,285][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000064867_66423808.pth [2023-03-07 00:37:56,922][81400] Updated weights for policy 0, policy_version 67960 (0.0007) [2023-03-07 00:37:57,691][81400] Updated weights for policy 0, policy_version 67970 (0.0006) [2023-03-07 00:37:58,481][81400] Updated weights for policy 0, policy_version 67980 (0.0006) [2023-03-07 00:37:59,260][81400] Updated weights for policy 0, policy_version 67990 (0.0007) [2023-03-07 00:38:00,041][81400] Updated weights for policy 0, policy_version 68000 (0.0006) [2023-03-07 00:38:00,819][81400] Updated weights for policy 0, policy_version 68010 (0.0007) [2023-03-07 00:38:01,236][81074] Fps is (10 sec: 13209.9, 60 sec: 13175.5, 300 sec: 13159.3). Total num frames: 69647360. Throughput: 0: 13172.1. Samples: 69645898. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:38:01,237][81074] Avg episode reward: [(0, '3100.532')] [2023-03-07 00:38:01,615][81400] Updated weights for policy 0, policy_version 68020 (0.0006) [2023-03-07 00:38:02,355][81400] Updated weights for policy 0, policy_version 68030 (0.0007) [2023-03-07 00:38:03,144][81400] Updated weights for policy 0, policy_version 68040 (0.0006) [2023-03-07 00:38:03,935][81400] Updated weights for policy 0, policy_version 68050 (0.0006) [2023-03-07 00:38:04,713][81400] Updated weights for policy 0, policy_version 68060 (0.0006) [2023-03-07 00:38:05,493][81400] Updated weights for policy 0, policy_version 68070 (0.0006) [2023-03-07 00:38:06,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13175.5, 300 sec: 13159.3). Total num frames: 69712896. Throughput: 0: 13171.8. 
Samples: 69685464. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:38:06,237][81074] Avg episode reward: [(0, '3077.054')] [2023-03-07 00:38:06,271][81400] Updated weights for policy 0, policy_version 68080 (0.0007) [2023-03-07 00:38:07,043][81400] Updated weights for policy 0, policy_version 68090 (0.0006) [2023-03-07 00:38:07,828][81400] Updated weights for policy 0, policy_version 68100 (0.0007) [2023-03-07 00:38:08,604][81400] Updated weights for policy 0, policy_version 68110 (0.0006) [2023-03-07 00:38:09,377][81400] Updated weights for policy 0, policy_version 68120 (0.0006) [2023-03-07 00:38:10,152][81400] Updated weights for policy 0, policy_version 68130 (0.0006) [2023-03-07 00:38:10,932][81400] Updated weights for policy 0, policy_version 68140 (0.0006) [2023-03-07 00:38:11,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13175.5, 300 sec: 13155.8). Total num frames: 69778432. Throughput: 0: 13176.6. Samples: 69764355. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:38:11,237][81074] Avg episode reward: [(0, '3347.029')] [2023-03-07 00:38:11,725][81400] Updated weights for policy 0, policy_version 68150 (0.0006) [2023-03-07 00:38:12,511][81400] Updated weights for policy 0, policy_version 68160 (0.0006) [2023-03-07 00:38:13,303][81400] Updated weights for policy 0, policy_version 68170 (0.0006) [2023-03-07 00:38:14,082][81400] Updated weights for policy 0, policy_version 68180 (0.0006) [2023-03-07 00:38:14,855][81400] Updated weights for policy 0, policy_version 68190 (0.0005) [2023-03-07 00:38:15,644][81400] Updated weights for policy 0, policy_version 68200 (0.0006) [2023-03-07 00:38:16,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13155.8). Total num frames: 69843968. Throughput: 0: 13164.2. Samples: 69842874. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 00:38:16,237][81074] Avg episode reward: [(0, '3120.760')] [2023-03-07 00:38:16,425][81400] Updated weights for policy 0, policy_version 68210 (0.0006) [2023-03-07 00:38:17,210][81400] Updated weights for policy 0, policy_version 68220 (0.0006) [2023-03-07 00:38:17,987][81400] Updated weights for policy 0, policy_version 68230 (0.0006) [2023-03-07 00:38:18,780][81400] Updated weights for policy 0, policy_version 68240 (0.0006) [2023-03-07 00:38:19,566][81400] Updated weights for policy 0, policy_version 68250 (0.0006) [2023-03-07 00:38:20,334][81400] Updated weights for policy 0, policy_version 68260 (0.0006) [2023-03-07 00:38:21,129][81400] Updated weights for policy 0, policy_version 68270 (0.0006) [2023-03-07 00:38:21,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13155.8). Total num frames: 69909504. Throughput: 0: 13156.0. Samples: 69881975. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 00:38:21,237][81074] Avg episode reward: [(0, '3178.807')] [2023-03-07 00:38:21,885][81400] Updated weights for policy 0, policy_version 68280 (0.0007) [2023-03-07 00:38:22,662][81400] Updated weights for policy 0, policy_version 68290 (0.0006) [2023-03-07 00:38:23,447][81400] Updated weights for policy 0, policy_version 68300 (0.0006) [2023-03-07 00:38:24,205][81400] Updated weights for policy 0, policy_version 68310 (0.0006) [2023-03-07 00:38:24,990][81400] Updated weights for policy 0, policy_version 68320 (0.0005) [2023-03-07 00:38:25,768][81400] Updated weights for policy 0, policy_version 68330 (0.0006) [2023-03-07 00:38:26,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13155.8). Total num frames: 69975040. Throughput: 0: 13169.0. Samples: 69961244. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 00:38:26,237][81074] Avg episode reward: [(0, '2986.370')] [2023-03-07 00:38:26,548][81400] Updated weights for policy 0, policy_version 68340 (0.0006) [2023-03-07 00:38:27,309][81400] Updated weights for policy 0, policy_version 68350 (0.0006) [2023-03-07 00:38:28,110][81400] Updated weights for policy 0, policy_version 68360 (0.0007) [2023-03-07 00:38:28,866][81400] Updated weights for policy 0, policy_version 68370 (0.0006) [2023-03-07 00:38:29,677][81400] Updated weights for policy 0, policy_version 68380 (0.0006) [2023-03-07 00:38:30,446][81400] Updated weights for policy 0, policy_version 68390 (0.0005) [2023-03-07 00:38:31,216][81400] Updated weights for policy 0, policy_version 68400 (0.0006) [2023-03-07 00:38:31,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13155.8). Total num frames: 70041600. Throughput: 0: 13163.0. Samples: 70040295. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 00:38:31,237][81074] Avg episode reward: [(0, '2865.244')] [2023-03-07 00:38:31,993][81400] Updated weights for policy 0, policy_version 68410 (0.0006) [2023-03-07 00:38:32,783][81400] Updated weights for policy 0, policy_version 68420 (0.0006) [2023-03-07 00:38:33,527][81400] Updated weights for policy 0, policy_version 68430 (0.0006) [2023-03-07 00:38:34,309][81400] Updated weights for policy 0, policy_version 68440 (0.0006) [2023-03-07 00:38:35,081][81400] Updated weights for policy 0, policy_version 68450 (0.0006) [2023-03-07 00:38:35,835][81400] Updated weights for policy 0, policy_version 68460 (0.0006) [2023-03-07 00:38:36,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13155.8). Total num frames: 70107136. Throughput: 0: 13157.9. Samples: 70080017. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 00:38:36,237][81074] Avg episode reward: [(0, '2750.833')] [2023-03-07 00:38:36,618][81400] Updated weights for policy 0, policy_version 68470 (0.0007) [2023-03-07 00:38:37,423][81400] Updated weights for policy 0, policy_version 68480 (0.0006) [2023-03-07 00:38:38,197][81400] Updated weights for policy 0, policy_version 68490 (0.0006) [2023-03-07 00:38:38,963][81400] Updated weights for policy 0, policy_version 68500 (0.0007) [2023-03-07 00:38:39,737][81400] Updated weights for policy 0, policy_version 68510 (0.0005) [2023-03-07 00:38:40,510][81400] Updated weights for policy 0, policy_version 68520 (0.0007) [2023-03-07 00:38:41,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13159.3). Total num frames: 70173696. Throughput: 0: 13153.9. Samples: 70159064. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 00:38:41,237][81074] Avg episode reward: [(0, '2859.535')] [2023-03-07 00:38:41,273][81400] Updated weights for policy 0, policy_version 68530 (0.0006) [2023-03-07 00:38:42,047][81400] Updated weights for policy 0, policy_version 68540 (0.0006) [2023-03-07 00:38:42,829][81400] Updated weights for policy 0, policy_version 68550 (0.0006) [2023-03-07 00:38:43,613][81400] Updated weights for policy 0, policy_version 68560 (0.0006) [2023-03-07 00:38:44,395][81400] Updated weights for policy 0, policy_version 68570 (0.0006) [2023-03-07 00:38:45,174][81400] Updated weights for policy 0, policy_version 68580 (0.0006) [2023-03-07 00:38:45,962][81400] Updated weights for policy 0, policy_version 68590 (0.0007) [2023-03-07 00:38:46,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13155.8). Total num frames: 70239232. Throughput: 0: 13162.1. Samples: 70238194. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 00:38:46,237][81074] Avg episode reward: [(0, '2231.012')] [2023-03-07 00:38:46,749][81400] Updated weights for policy 0, policy_version 68600 (0.0008) [2023-03-07 00:38:47,526][81400] Updated weights for policy 0, policy_version 68610 (0.0005) [2023-03-07 00:38:48,304][81400] Updated weights for policy 0, policy_version 68620 (0.0006) [2023-03-07 00:38:49,102][81400] Updated weights for policy 0, policy_version 68630 (0.0006) [2023-03-07 00:38:49,858][81400] Updated weights for policy 0, policy_version 68640 (0.0006) [2023-03-07 00:38:50,632][81400] Updated weights for policy 0, policy_version 68650 (0.0006) [2023-03-07 00:38:51,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13155.8). Total num frames: 70304768. Throughput: 0: 13151.0. Samples: 70277258. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 00:38:51,237][81074] Avg episode reward: [(0, '2692.955')] [2023-03-07 00:38:51,412][81400] Updated weights for policy 0, policy_version 68660 (0.0006) [2023-03-07 00:38:52,202][81400] Updated weights for policy 0, policy_version 68670 (0.0006) [2023-03-07 00:38:52,983][81400] Updated weights for policy 0, policy_version 68680 (0.0006) [2023-03-07 00:38:53,773][81400] Updated weights for policy 0, policy_version 68690 (0.0007) [2023-03-07 00:38:54,553][81400] Updated weights for policy 0, policy_version 68700 (0.0007) [2023-03-07 00:38:55,322][81400] Updated weights for policy 0, policy_version 68710 (0.0006) [2023-03-07 00:38:56,117][81400] Updated weights for policy 0, policy_version 68720 (0.0007) [2023-03-07 00:38:56,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13155.8). Total num frames: 70370304. Throughput: 0: 13151.6. Samples: 70356178. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 00:38:56,237][81074] Avg episode reward: [(0, '3178.348')] [2023-03-07 00:38:56,892][81400] Updated weights for policy 0, policy_version 68730 (0.0007) [2023-03-07 00:38:57,657][81400] Updated weights for policy 0, policy_version 68740 (0.0006) [2023-03-07 00:38:58,431][81400] Updated weights for policy 0, policy_version 68750 (0.0006) [2023-03-07 00:38:59,217][81400] Updated weights for policy 0, policy_version 68760 (0.0005) [2023-03-07 00:38:59,998][81400] Updated weights for policy 0, policy_version 68770 (0.0005) [2023-03-07 00:39:00,775][81400] Updated weights for policy 0, policy_version 68780 (0.0006) [2023-03-07 00:39:01,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13155.8). Total num frames: 70435840. Throughput: 0: 13163.2. Samples: 70435219. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 00:39:01,237][81074] Avg episode reward: [(0, '3077.731')] [2023-03-07 00:39:01,552][81400] Updated weights for policy 0, policy_version 68790 (0.0006) [2023-03-07 00:39:02,326][81400] Updated weights for policy 0, policy_version 68800 (0.0006) [2023-03-07 00:39:03,106][81400] Updated weights for policy 0, policy_version 68810 (0.0006) [2023-03-07 00:39:03,905][81400] Updated weights for policy 0, policy_version 68820 (0.0006) [2023-03-07 00:39:04,687][81400] Updated weights for policy 0, policy_version 68830 (0.0006) [2023-03-07 00:39:05,465][81400] Updated weights for policy 0, policy_version 68840 (0.0006) [2023-03-07 00:39:06,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13152.3). Total num frames: 70501376. Throughput: 0: 13165.9. Samples: 70474440. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 00:39:06,237][81074] Avg episode reward: [(0, '2975.558')] [2023-03-07 00:39:06,253][81400] Updated weights for policy 0, policy_version 68850 (0.0006) [2023-03-07 00:39:07,026][81400] Updated weights for policy 0, policy_version 68860 (0.0007) [2023-03-07 00:39:07,822][81400] Updated weights for policy 0, policy_version 68870 (0.0006) [2023-03-07 00:39:08,590][81400] Updated weights for policy 0, policy_version 68880 (0.0005) [2023-03-07 00:39:09,374][81400] Updated weights for policy 0, policy_version 68890 (0.0006) [2023-03-07 00:39:10,154][81400] Updated weights for policy 0, policy_version 68900 (0.0006) [2023-03-07 00:39:10,914][81400] Updated weights for policy 0, policy_version 68910 (0.0006) [2023-03-07 00:39:11,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13141.4, 300 sec: 13152.3). Total num frames: 70566912. Throughput: 0: 13154.3. Samples: 70553184. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 00:39:11,237][81074] Avg episode reward: [(0, '2894.353')] [2023-03-07 00:39:11,712][81400] Updated weights for policy 0, policy_version 68920 (0.0006) [2023-03-07 00:39:12,492][81400] Updated weights for policy 0, policy_version 68930 (0.0005) [2023-03-07 00:39:13,257][81400] Updated weights for policy 0, policy_version 68940 (0.0006) [2023-03-07 00:39:14,044][81400] Updated weights for policy 0, policy_version 68950 (0.0006) [2023-03-07 00:39:14,835][81400] Updated weights for policy 0, policy_version 68960 (0.0006) [2023-03-07 00:39:15,608][81400] Updated weights for policy 0, policy_version 68970 (0.0006) [2023-03-07 00:39:16,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13152.3). Total num frames: 70632448. Throughput: 0: 13142.2. Samples: 70631692. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 00:39:16,237][81074] Avg episode reward: [(0, '2954.908')] [2023-03-07 00:39:16,403][81400] Updated weights for policy 0, policy_version 68980 (0.0006) [2023-03-07 00:39:17,188][81400] Updated weights for policy 0, policy_version 68990 (0.0006) [2023-03-07 00:39:17,969][81400] Updated weights for policy 0, policy_version 69000 (0.0007) [2023-03-07 00:39:18,752][81400] Updated weights for policy 0, policy_version 69010 (0.0006) [2023-03-07 00:39:19,540][81400] Updated weights for policy 0, policy_version 69020 (0.0007) [2023-03-07 00:39:20,329][81400] Updated weights for policy 0, policy_version 69030 (0.0006) [2023-03-07 00:39:21,115][81400] Updated weights for policy 0, policy_version 69040 (0.0006) [2023-03-07 00:39:21,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 70697984. Throughput: 0: 13131.0. Samples: 70670911. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 00:39:21,237][81074] Avg episode reward: [(0, '2889.932')] [2023-03-07 00:39:21,882][81400] Updated weights for policy 0, policy_version 69050 (0.0006) [2023-03-07 00:39:22,658][81400] Updated weights for policy 0, policy_version 69060 (0.0006) [2023-03-07 00:39:23,441][81400] Updated weights for policy 0, policy_version 69070 (0.0007) [2023-03-07 00:39:24,214][81400] Updated weights for policy 0, policy_version 69080 (0.0007) [2023-03-07 00:39:24,998][81400] Updated weights for policy 0, policy_version 69090 (0.0006) [2023-03-07 00:39:25,778][81400] Updated weights for policy 0, policy_version 69100 (0.0006) [2023-03-07 00:39:26,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 70763520. Throughput: 0: 13124.4. Samples: 70749664. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 00:39:26,237][81074] Avg episode reward: [(0, '2775.227')] [2023-03-07 00:39:26,549][81400] Updated weights for policy 0, policy_version 69110 (0.0006) [2023-03-07 00:39:27,318][81400] Updated weights for policy 0, policy_version 69120 (0.0006) [2023-03-07 00:39:28,104][81400] Updated weights for policy 0, policy_version 69130 (0.0006) [2023-03-07 00:39:28,888][81400] Updated weights for policy 0, policy_version 69140 (0.0006) [2023-03-07 00:39:29,665][81400] Updated weights for policy 0, policy_version 69150 (0.0006) [2023-03-07 00:39:30,449][81400] Updated weights for policy 0, policy_version 69160 (0.0006) [2023-03-07 00:39:31,220][81400] Updated weights for policy 0, policy_version 69170 (0.0006) [2023-03-07 00:39:31,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13152.3). Total num frames: 70830080. Throughput: 0: 13120.4. Samples: 70828611. Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 00:39:31,237][81074] Avg episode reward: [(0, '2797.392')] [2023-03-07 00:39:31,994][81400] Updated weights for policy 0, policy_version 69180 (0.0006) [2023-03-07 00:39:32,757][81400] Updated weights for policy 0, policy_version 69190 (0.0006) [2023-03-07 00:39:33,549][81400] Updated weights for policy 0, policy_version 69200 (0.0007) [2023-03-07 00:39:34,322][81400] Updated weights for policy 0, policy_version 69210 (0.0007) [2023-03-07 00:39:35,103][81400] Updated weights for policy 0, policy_version 69220 (0.0006) [2023-03-07 00:39:35,892][81400] Updated weights for policy 0, policy_version 69230 (0.0006) [2023-03-07 00:39:36,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 70895616. Throughput: 0: 13130.9. Samples: 70868151. 
Policy #0 lag: (min: 0.0, avg: 1.4, max: 3.0) [2023-03-07 00:39:36,248][81074] Avg episode reward: [(0, '3140.854')] [2023-03-07 00:39:36,662][81400] Updated weights for policy 0, policy_version 69240 (0.0005) [2023-03-07 00:39:37,448][81400] Updated weights for policy 0, policy_version 69250 (0.0007) [2023-03-07 00:39:38,223][81400] Updated weights for policy 0, policy_version 69260 (0.0006) [2023-03-07 00:39:38,989][81400] Updated weights for policy 0, policy_version 69270 (0.0007) [2023-03-07 00:39:39,765][81400] Updated weights for policy 0, policy_version 69280 (0.0006) [2023-03-07 00:39:40,546][81400] Updated weights for policy 0, policy_version 69290 (0.0007) [2023-03-07 00:39:41,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13152.3). Total num frames: 70962176. Throughput: 0: 13132.9. Samples: 70947155. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:39:41,247][81074] Avg episode reward: [(0, '3161.244')] [2023-03-07 00:39:41,299][81400] Updated weights for policy 0, policy_version 69300 (0.0006) [2023-03-07 00:39:42,082][81400] Updated weights for policy 0, policy_version 69310 (0.0006) [2023-03-07 00:39:42,859][81400] Updated weights for policy 0, policy_version 69320 (0.0006) [2023-03-07 00:39:43,625][81400] Updated weights for policy 0, policy_version 69330 (0.0007) [2023-03-07 00:39:44,390][81400] Updated weights for policy 0, policy_version 69340 (0.0006) [2023-03-07 00:39:45,165][81400] Updated weights for policy 0, policy_version 69350 (0.0008) [2023-03-07 00:39:45,960][81400] Updated weights for policy 0, policy_version 69360 (0.0006) [2023-03-07 00:39:46,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13152.3). Total num frames: 71027712. Throughput: 0: 13139.1. Samples: 71026481. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:39:46,247][81074] Avg episode reward: [(0, '3164.629')] [2023-03-07 00:39:46,751][81400] Updated weights for policy 0, policy_version 69370 (0.0006) [2023-03-07 00:39:47,517][81400] Updated weights for policy 0, policy_version 69380 (0.0006) [2023-03-07 00:39:48,298][81400] Updated weights for policy 0, policy_version 69390 (0.0007) [2023-03-07 00:39:49,103][81400] Updated weights for policy 0, policy_version 69400 (0.0006) [2023-03-07 00:39:49,863][81400] Updated weights for policy 0, policy_version 69410 (0.0006) [2023-03-07 00:39:50,638][81400] Updated weights for policy 0, policy_version 69420 (0.0006) [2023-03-07 00:39:51,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13152.3). Total num frames: 71093248. Throughput: 0: 13141.0. Samples: 71065786. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:39:51,237][81074] Avg episode reward: [(0, '3073.463')] [2023-03-07 00:39:51,427][81400] Updated weights for policy 0, policy_version 69430 (0.0006) [2023-03-07 00:39:52,220][81400] Updated weights for policy 0, policy_version 69440 (0.0006) [2023-03-07 00:39:52,998][81400] Updated weights for policy 0, policy_version 69450 (0.0006) [2023-03-07 00:39:53,790][81400] Updated weights for policy 0, policy_version 69460 (0.0006) [2023-03-07 00:39:54,569][81400] Updated weights for policy 0, policy_version 69470 (0.0006) [2023-03-07 00:39:55,340][81400] Updated weights for policy 0, policy_version 69480 (0.0006) [2023-03-07 00:39:56,103][81400] Updated weights for policy 0, policy_version 69490 (0.0006) [2023-03-07 00:39:56,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13152.3). Total num frames: 71158784. Throughput: 0: 13141.1. Samples: 71144536. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:39:56,237][81074] Avg episode reward: [(0, '3263.884')] [2023-03-07 00:39:56,241][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000069491_71158784.pth... [2023-03-07 00:39:56,271][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000066409_68002816.pth [2023-03-07 00:39:56,880][81400] Updated weights for policy 0, policy_version 69500 (0.0007) [2023-03-07 00:39:57,661][81400] Updated weights for policy 0, policy_version 69510 (0.0006) [2023-03-07 00:39:58,432][81400] Updated weights for policy 0, policy_version 69520 (0.0006) [2023-03-07 00:39:59,217][81400] Updated weights for policy 0, policy_version 69530 (0.0006) [2023-03-07 00:39:59,983][81400] Updated weights for policy 0, policy_version 69540 (0.0006) [2023-03-07 00:40:00,762][81400] Updated weights for policy 0, policy_version 69550 (0.0006) [2023-03-07 00:40:01,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13155.8). Total num frames: 71225344. Throughput: 0: 13152.6. Samples: 71223557. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:40:01,237][81074] Avg episode reward: [(0, '3148.933')] [2023-03-07 00:40:01,547][81400] Updated weights for policy 0, policy_version 69560 (0.0006) [2023-03-07 00:40:02,324][81400] Updated weights for policy 0, policy_version 69570 (0.0006) [2023-03-07 00:40:03,081][81400] Updated weights for policy 0, policy_version 69580 (0.0006) [2023-03-07 00:40:03,867][81400] Updated weights for policy 0, policy_version 69590 (0.0006) [2023-03-07 00:40:04,651][81400] Updated weights for policy 0, policy_version 69600 (0.0006) [2023-03-07 00:40:05,421][81400] Updated weights for policy 0, policy_version 69610 (0.0006) [2023-03-07 00:40:06,204][81400] Updated weights for policy 0, policy_version 69620 (0.0006) [2023-03-07 00:40:06,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 71290880. Throughput: 0: 13168.1. Samples: 71263480. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:40:06,237][81074] Avg episode reward: [(0, '3252.457')] [2023-03-07 00:40:06,976][81400] Updated weights for policy 0, policy_version 69630 (0.0006) [2023-03-07 00:40:07,758][81400] Updated weights for policy 0, policy_version 69640 (0.0007) [2023-03-07 00:40:08,539][81400] Updated weights for policy 0, policy_version 69650 (0.0006) [2023-03-07 00:40:09,323][81400] Updated weights for policy 0, policy_version 69660 (0.0007) [2023-03-07 00:40:10,106][81400] Updated weights for policy 0, policy_version 69670 (0.0006) [2023-03-07 00:40:10,892][81400] Updated weights for policy 0, policy_version 69680 (0.0007) [2023-03-07 00:40:11,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 71356416. Throughput: 0: 13168.1. Samples: 71342231. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:40:11,237][81074] Avg episode reward: [(0, '2970.986')] [2023-03-07 00:40:11,655][81400] Updated weights for policy 0, policy_version 69690 (0.0006) [2023-03-07 00:40:12,443][81400] Updated weights for policy 0, policy_version 69700 (0.0006) [2023-03-07 00:40:13,215][81400] Updated weights for policy 0, policy_version 69710 (0.0006) [2023-03-07 00:40:13,979][81400] Updated weights for policy 0, policy_version 69720 (0.0007) [2023-03-07 00:40:14,748][81400] Updated weights for policy 0, policy_version 69730 (0.0006) [2023-03-07 00:40:15,528][81400] Updated weights for policy 0, policy_version 69740 (0.0006) [2023-03-07 00:40:16,236][81074] Fps is (10 sec: 13209.9, 60 sec: 13175.5, 300 sec: 13155.8). Total num frames: 71422976. Throughput: 0: 13170.5. Samples: 71421283. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:40:16,237][81074] Avg episode reward: [(0, '3133.255')] [2023-03-07 00:40:16,316][81400] Updated weights for policy 0, policy_version 69750 (0.0007) [2023-03-07 00:40:17,086][81400] Updated weights for policy 0, policy_version 69760 (0.0006) [2023-03-07 00:40:17,869][81400] Updated weights for policy 0, policy_version 69770 (0.0006) [2023-03-07 00:40:18,646][81400] Updated weights for policy 0, policy_version 69780 (0.0006) [2023-03-07 00:40:19,425][81400] Updated weights for policy 0, policy_version 69790 (0.0006) [2023-03-07 00:40:20,188][81400] Updated weights for policy 0, policy_version 69800 (0.0006) [2023-03-07 00:40:20,975][81400] Updated weights for policy 0, policy_version 69810 (0.0007) [2023-03-07 00:40:21,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13155.8). Total num frames: 71488512. Throughput: 0: 13170.1. Samples: 71460807. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:40:21,237][81074] Avg episode reward: [(0, '3082.806')] [2023-03-07 00:40:21,741][81400] Updated weights for policy 0, policy_version 69820 (0.0005) [2023-03-07 00:40:22,507][81400] Updated weights for policy 0, policy_version 69830 (0.0006) [2023-03-07 00:40:23,290][81400] Updated weights for policy 0, policy_version 69840 (0.0006) [2023-03-07 00:40:24,078][81400] Updated weights for policy 0, policy_version 69850 (0.0006) [2023-03-07 00:40:24,854][81400] Updated weights for policy 0, policy_version 69860 (0.0006) [2023-03-07 00:40:25,638][81400] Updated weights for policy 0, policy_version 69870 (0.0006) [2023-03-07 00:40:26,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13175.5, 300 sec: 13152.3). Total num frames: 71554048. Throughput: 0: 13167.9. Samples: 71539712. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:40:26,237][81074] Avg episode reward: [(0, '3339.041')] [2023-03-07 00:40:26,417][81400] Updated weights for policy 0, policy_version 69880 (0.0006) [2023-03-07 00:40:27,202][81400] Updated weights for policy 0, policy_version 69890 (0.0006) [2023-03-07 00:40:27,995][81400] Updated weights for policy 0, policy_version 69900 (0.0006) [2023-03-07 00:40:28,783][81400] Updated weights for policy 0, policy_version 69910 (0.0006) [2023-03-07 00:40:29,553][81400] Updated weights for policy 0, policy_version 69920 (0.0006) [2023-03-07 00:40:30,329][81400] Updated weights for policy 0, policy_version 69930 (0.0006) [2023-03-07 00:40:31,109][81400] Updated weights for policy 0, policy_version 69940 (0.0007) [2023-03-07 00:40:31,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 71619584. Throughput: 0: 13156.3. Samples: 71618512. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:40:31,237][81074] Avg episode reward: [(0, '3116.608')] [2023-03-07 00:40:31,893][81400] Updated weights for policy 0, policy_version 69950 (0.0005) [2023-03-07 00:40:32,676][81400] Updated weights for policy 0, policy_version 69960 (0.0006) [2023-03-07 00:40:33,462][81400] Updated weights for policy 0, policy_version 69970 (0.0006) [2023-03-07 00:40:34,228][81400] Updated weights for policy 0, policy_version 69980 (0.0006) [2023-03-07 00:40:35,014][81400] Updated weights for policy 0, policy_version 69990 (0.0006) [2023-03-07 00:40:35,800][81400] Updated weights for policy 0, policy_version 70000 (0.0005) [2023-03-07 00:40:36,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 71685120. Throughput: 0: 13157.8. Samples: 71657885. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:40:36,237][81074] Avg episode reward: [(0, '3353.020')] [2023-03-07 00:40:36,572][81400] Updated weights for policy 0, policy_version 70010 (0.0006) [2023-03-07 00:40:37,359][81400] Updated weights for policy 0, policy_version 70020 (0.0006) [2023-03-07 00:40:38,126][81400] Updated weights for policy 0, policy_version 70030 (0.0006) [2023-03-07 00:40:38,890][81400] Updated weights for policy 0, policy_version 70040 (0.0006) [2023-03-07 00:40:39,691][81400] Updated weights for policy 0, policy_version 70050 (0.0005) [2023-03-07 00:40:40,461][81400] Updated weights for policy 0, policy_version 70060 (0.0006) [2023-03-07 00:40:41,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13152.3). Total num frames: 71750656. Throughput: 0: 13154.6. Samples: 71736493. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:40:41,237][81074] Avg episode reward: [(0, '3465.386')] [2023-03-07 00:40:41,243][81400] Updated weights for policy 0, policy_version 70070 (0.0006) [2023-03-07 00:40:42,034][81400] Updated weights for policy 0, policy_version 70080 (0.0007) [2023-03-07 00:40:42,812][81400] Updated weights for policy 0, policy_version 70090 (0.0007) [2023-03-07 00:40:43,592][81400] Updated weights for policy 0, policy_version 70100 (0.0007) [2023-03-07 00:40:44,378][81400] Updated weights for policy 0, policy_version 70110 (0.0006) [2023-03-07 00:40:45,149][81400] Updated weights for policy 0, policy_version 70120 (0.0006) [2023-03-07 00:40:45,910][81400] Updated weights for policy 0, policy_version 70130 (0.0007) [2023-03-07 00:40:46,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13158.4, 300 sec: 13155.8). Total num frames: 71817216. Throughput: 0: 13152.1. Samples: 71815404. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:40:46,237][81074] Avg episode reward: [(0, '3449.282')] [2023-03-07 00:40:46,685][81400] Updated weights for policy 0, policy_version 70140 (0.0006) [2023-03-07 00:40:47,471][81400] Updated weights for policy 0, policy_version 70150 (0.0006) [2023-03-07 00:40:48,244][81400] Updated weights for policy 0, policy_version 70160 (0.0006) [2023-03-07 00:40:49,022][81400] Updated weights for policy 0, policy_version 70170 (0.0005) [2023-03-07 00:40:49,793][81400] Updated weights for policy 0, policy_version 70180 (0.0006) [2023-03-07 00:40:50,574][81400] Updated weights for policy 0, policy_version 70190 (0.0006) [2023-03-07 00:40:51,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13155.8). Total num frames: 71882752. Throughput: 0: 13147.3. Samples: 71855105. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:40:51,237][81074] Avg episode reward: [(0, '3305.299')] [2023-03-07 00:40:51,352][81400] Updated weights for policy 0, policy_version 70200 (0.0007) [2023-03-07 00:40:52,132][81400] Updated weights for policy 0, policy_version 70210 (0.0007) [2023-03-07 00:40:52,919][81400] Updated weights for policy 0, policy_version 70220 (0.0006) [2023-03-07 00:40:53,696][81400] Updated weights for policy 0, policy_version 70230 (0.0006) [2023-03-07 00:40:54,472][81400] Updated weights for policy 0, policy_version 70240 (0.0006) [2023-03-07 00:40:55,272][81400] Updated weights for policy 0, policy_version 70250 (0.0006) [2023-03-07 00:40:56,047][81400] Updated weights for policy 0, policy_version 70260 (0.0006) [2023-03-07 00:40:56,236][81074] Fps is (10 sec: 13107.0, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 71948288. Throughput: 0: 13145.9. Samples: 71933800. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:40:56,237][81074] Avg episode reward: [(0, '3275.468')] [2023-03-07 00:40:56,843][81400] Updated weights for policy 0, policy_version 70270 (0.0006) [2023-03-07 00:40:57,630][81400] Updated weights for policy 0, policy_version 70280 (0.0006) [2023-03-07 00:40:58,397][81400] Updated weights for policy 0, policy_version 70290 (0.0006) [2023-03-07 00:40:59,195][81400] Updated weights for policy 0, policy_version 70300 (0.0006) [2023-03-07 00:40:59,976][81400] Updated weights for policy 0, policy_version 70310 (0.0007) [2023-03-07 00:41:00,743][81400] Updated weights for policy 0, policy_version 70320 (0.0006) [2023-03-07 00:41:01,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13152.3). Total num frames: 72013824. Throughput: 0: 13129.2. Samples: 72012099. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:41:01,237][81074] Avg episode reward: [(0, '3286.726')] [2023-03-07 00:41:01,542][81400] Updated weights for policy 0, policy_version 70330 (0.0007) [2023-03-07 00:41:02,314][81400] Updated weights for policy 0, policy_version 70340 (0.0007) [2023-03-07 00:41:03,101][81400] Updated weights for policy 0, policy_version 70350 (0.0006) [2023-03-07 00:41:03,874][81400] Updated weights for policy 0, policy_version 70360 (0.0007) [2023-03-07 00:41:04,661][81400] Updated weights for policy 0, policy_version 70370 (0.0006) [2023-03-07 00:41:05,426][81400] Updated weights for policy 0, policy_version 70380 (0.0005) [2023-03-07 00:41:06,207][81400] Updated weights for policy 0, policy_version 70390 (0.0005) [2023-03-07 00:41:06,236][81074] Fps is (10 sec: 13107.6, 60 sec: 13141.4, 300 sec: 13155.8). Total num frames: 72079360. Throughput: 0: 13128.8. Samples: 72051600. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:41:06,237][81074] Avg episode reward: [(0, '3426.129')] [2023-03-07 00:41:07,014][81400] Updated weights for policy 0, policy_version 70400 (0.0006) [2023-03-07 00:41:07,781][81400] Updated weights for policy 0, policy_version 70410 (0.0006) [2023-03-07 00:41:08,564][81400] Updated weights for policy 0, policy_version 70420 (0.0006) [2023-03-07 00:41:09,340][81400] Updated weights for policy 0, policy_version 70430 (0.0005) [2023-03-07 00:41:10,125][81400] Updated weights for policy 0, policy_version 70440 (0.0006) [2023-03-07 00:41:10,897][81400] Updated weights for policy 0, policy_version 70450 (0.0006) [2023-03-07 00:41:11,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13141.4, 300 sec: 13152.3). Total num frames: 72144896. Throughput: 0: 13126.1. Samples: 72130388. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:41:11,237][81074] Avg episode reward: [(0, '3311.681')] [2023-03-07 00:41:11,686][81400] Updated weights for policy 0, policy_version 70460 (0.0006) [2023-03-07 00:41:12,465][81400] Updated weights for policy 0, policy_version 70470 (0.0005) [2023-03-07 00:41:13,238][81400] Updated weights for policy 0, policy_version 70480 (0.0006) [2023-03-07 00:41:14,013][81400] Updated weights for policy 0, policy_version 70490 (0.0006) [2023-03-07 00:41:14,809][81400] Updated weights for policy 0, policy_version 70500 (0.0007) [2023-03-07 00:41:15,582][81400] Updated weights for policy 0, policy_version 70510 (0.0006) [2023-03-07 00:41:16,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13152.3). Total num frames: 72210432. Throughput: 0: 13124.3. Samples: 72209104. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:41:16,237][81074] Avg episode reward: [(0, '2949.650')] [2023-03-07 00:41:16,366][81400] Updated weights for policy 0, policy_version 70520 (0.0006) [2023-03-07 00:41:17,153][81400] Updated weights for policy 0, policy_version 70530 (0.0006) [2023-03-07 00:41:17,932][81400] Updated weights for policy 0, policy_version 70540 (0.0007) [2023-03-07 00:41:18,709][81400] Updated weights for policy 0, policy_version 70550 (0.0007) [2023-03-07 00:41:19,487][81400] Updated weights for policy 0, policy_version 70560 (0.0006) [2023-03-07 00:41:20,289][81400] Updated weights for policy 0, policy_version 70570 (0.0007) [2023-03-07 00:41:21,072][81400] Updated weights for policy 0, policy_version 70580 (0.0008) [2023-03-07 00:41:21,236][81074] Fps is (10 sec: 13004.8, 60 sec: 13107.2, 300 sec: 13148.9). Total num frames: 72274944. Throughput: 0: 13120.5. Samples: 72248309. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:41:21,237][81074] Avg episode reward: [(0, '3234.742')] [2023-03-07 00:41:21,854][81400] Updated weights for policy 0, policy_version 70590 (0.0006) [2023-03-07 00:41:22,637][81400] Updated weights for policy 0, policy_version 70600 (0.0006) [2023-03-07 00:41:23,406][81400] Updated weights for policy 0, policy_version 70610 (0.0005) [2023-03-07 00:41:24,186][81400] Updated weights for policy 0, policy_version 70620 (0.0006) [2023-03-07 00:41:24,956][81400] Updated weights for policy 0, policy_version 70630 (0.0006) [2023-03-07 00:41:25,757][81400] Updated weights for policy 0, policy_version 70640 (0.0006) [2023-03-07 00:41:26,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13152.3). Total num frames: 72341504. Throughput: 0: 13119.1. Samples: 72326851. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:41:26,237][81074] Avg episode reward: [(0, '3042.532')] [2023-03-07 00:41:26,554][81400] Updated weights for policy 0, policy_version 70650 (0.0006) [2023-03-07 00:41:27,311][81400] Updated weights for policy 0, policy_version 70660 (0.0006) [2023-03-07 00:41:28,098][81400] Updated weights for policy 0, policy_version 70670 (0.0006) [2023-03-07 00:41:28,893][81400] Updated weights for policy 0, policy_version 70680 (0.0006) [2023-03-07 00:41:29,674][81400] Updated weights for policy 0, policy_version 70690 (0.0006) [2023-03-07 00:41:30,462][81400] Updated weights for policy 0, policy_version 70700 (0.0006) [2023-03-07 00:41:31,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13145.4). Total num frames: 72406016. Throughput: 0: 13106.8. Samples: 72405209. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:41:31,237][81074] Avg episode reward: [(0, '3323.718')] [2023-03-07 00:41:31,248][81400] Updated weights for policy 0, policy_version 70710 (0.0006) [2023-03-07 00:41:32,026][81400] Updated weights for policy 0, policy_version 70720 (0.0006) [2023-03-07 00:41:32,807][81400] Updated weights for policy 0, policy_version 70730 (0.0006) [2023-03-07 00:41:33,568][81400] Updated weights for policy 0, policy_version 70740 (0.0006) [2023-03-07 00:41:34,369][81400] Updated weights for policy 0, policy_version 70750 (0.0006) [2023-03-07 00:41:35,147][81400] Updated weights for policy 0, policy_version 70760 (0.0006) [2023-03-07 00:41:35,915][81400] Updated weights for policy 0, policy_version 70770 (0.0006) [2023-03-07 00:41:36,236][81074] Fps is (10 sec: 13004.8, 60 sec: 13107.2, 300 sec: 13145.4). Total num frames: 72471552. Throughput: 0: 13099.5. Samples: 72444580. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:41:36,237][81074] Avg episode reward: [(0, '3124.731')] [2023-03-07 00:41:36,691][81400] Updated weights for policy 0, policy_version 70780 (0.0006) [2023-03-07 00:41:37,481][81400] Updated weights for policy 0, policy_version 70790 (0.0006) [2023-03-07 00:41:38,244][81400] Updated weights for policy 0, policy_version 70800 (0.0006) [2023-03-07 00:41:39,039][81400] Updated weights for policy 0, policy_version 70810 (0.0006) [2023-03-07 00:41:39,835][81400] Updated weights for policy 0, policy_version 70820 (0.0006) [2023-03-07 00:41:40,612][81400] Updated weights for policy 0, policy_version 70830 (0.0006) [2023-03-07 00:41:41,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13124.3, 300 sec: 13148.9). Total num frames: 72538112. Throughput: 0: 13100.1. Samples: 72523301. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:41:41,237][81074] Avg episode reward: [(0, '3180.984')] [2023-03-07 00:41:41,370][81400] Updated weights for policy 0, policy_version 70840 (0.0006) [2023-03-07 00:41:42,171][81400] Updated weights for policy 0, policy_version 70850 (0.0006) [2023-03-07 00:41:42,933][81400] Updated weights for policy 0, policy_version 70860 (0.0006) [2023-03-07 00:41:43,729][81400] Updated weights for policy 0, policy_version 70870 (0.0006) [2023-03-07 00:41:44,497][81400] Updated weights for policy 0, policy_version 70880 (0.0006) [2023-03-07 00:41:45,277][81400] Updated weights for policy 0, policy_version 70890 (0.0006) [2023-03-07 00:41:46,064][81400] Updated weights for policy 0, policy_version 70900 (0.0006) [2023-03-07 00:41:46,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13107.2, 300 sec: 13145.4). Total num frames: 72603648. Throughput: 0: 13110.6. Samples: 72602077. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:41:46,237][81074] Avg episode reward: [(0, '3197.478')] [2023-03-07 00:41:46,847][81400] Updated weights for policy 0, policy_version 70910 (0.0006) [2023-03-07 00:41:47,613][81400] Updated weights for policy 0, policy_version 70920 (0.0006) [2023-03-07 00:41:48,413][81400] Updated weights for policy 0, policy_version 70930 (0.0006) [2023-03-07 00:41:49,196][81400] Updated weights for policy 0, policy_version 70940 (0.0006) [2023-03-07 00:41:49,989][81400] Updated weights for policy 0, policy_version 70950 (0.0006) [2023-03-07 00:41:50,755][81400] Updated weights for policy 0, policy_version 70960 (0.0006) [2023-03-07 00:41:51,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13145.4). Total num frames: 72669184. Throughput: 0: 13107.7. Samples: 72641446. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:41:51,237][81074] Avg episode reward: [(0, '3104.865')] [2023-03-07 00:41:51,512][81400] Updated weights for policy 0, policy_version 70970 (0.0006) [2023-03-07 00:41:52,301][81400] Updated weights for policy 0, policy_version 70980 (0.0007) [2023-03-07 00:41:53,089][81400] Updated weights for policy 0, policy_version 70990 (0.0006) [2023-03-07 00:41:53,842][81400] Updated weights for policy 0, policy_version 71000 (0.0006) [2023-03-07 00:41:54,616][81400] Updated weights for policy 0, policy_version 71010 (0.0006) [2023-03-07 00:41:55,407][81400] Updated weights for policy 0, policy_version 71020 (0.0006) [2023-03-07 00:41:56,173][81400] Updated weights for policy 0, policy_version 71030 (0.0007) [2023-03-07 00:41:56,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13145.4). Total num frames: 72734720. Throughput: 0: 13115.7. Samples: 72720596. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:41:56,237][81074] Avg episode reward: [(0, '3217.006')] [2023-03-07 00:41:56,241][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000071030_72734720.pth... 
[2023-03-07 00:41:56,269][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000067951_69581824.pth [2023-03-07 00:41:56,961][81400] Updated weights for policy 0, policy_version 71040 (0.0006) [2023-03-07 00:41:57,754][81400] Updated weights for policy 0, policy_version 71050 (0.0007) [2023-03-07 00:41:58,543][81400] Updated weights for policy 0, policy_version 71060 (0.0005) [2023-03-07 00:41:59,303][81400] Updated weights for policy 0, policy_version 71070 (0.0006) [2023-03-07 00:42:00,069][81400] Updated weights for policy 0, policy_version 71080 (0.0006) [2023-03-07 00:42:00,845][81400] Updated weights for policy 0, policy_version 71090 (0.0006) [2023-03-07 00:42:01,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13124.3, 300 sec: 13148.9). Total num frames: 72801280. Throughput: 0: 13121.9. Samples: 72799587. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:42:01,237][81074] Avg episode reward: [(0, '3313.731')] [2023-03-07 00:42:01,616][81400] Updated weights for policy 0, policy_version 71100 (0.0006) [2023-03-07 00:42:02,401][81400] Updated weights for policy 0, policy_version 71110 (0.0005) [2023-03-07 00:42:03,184][81400] Updated weights for policy 0, policy_version 71120 (0.0006) [2023-03-07 00:42:03,953][81400] Updated weights for policy 0, policy_version 71130 (0.0006) [2023-03-07 00:42:04,710][81400] Updated weights for policy 0, policy_version 71140 (0.0006) [2023-03-07 00:42:05,519][81400] Updated weights for policy 0, policy_version 71150 (0.0006) [2023-03-07 00:42:06,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13124.3, 300 sec: 13148.9). Total num frames: 72866816. Throughput: 0: 13129.7. Samples: 72839146. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:42:06,237][81074] Avg episode reward: [(0, '3262.759')] [2023-03-07 00:42:06,297][81400] Updated weights for policy 0, policy_version 71160 (0.0006) [2023-03-07 00:42:07,079][81400] Updated weights for policy 0, policy_version 71170 (0.0006) [2023-03-07 00:42:07,855][81400] Updated weights for policy 0, policy_version 71180 (0.0006) [2023-03-07 00:42:08,617][81400] Updated weights for policy 0, policy_version 71190 (0.0006) [2023-03-07 00:42:09,399][81400] Updated weights for policy 0, policy_version 71200 (0.0006) [2023-03-07 00:42:10,174][81400] Updated weights for policy 0, policy_version 71210 (0.0006) [2023-03-07 00:42:10,981][81400] Updated weights for policy 0, policy_version 71220 (0.0006) [2023-03-07 00:42:11,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13145.4). Total num frames: 72932352. Throughput: 0: 13136.3. Samples: 72917986. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:42:11,237][81074] Avg episode reward: [(0, '3391.494')] [2023-03-07 00:42:11,746][81400] Updated weights for policy 0, policy_version 71230 (0.0006) [2023-03-07 00:42:12,530][81400] Updated weights for policy 0, policy_version 71240 (0.0007) [2023-03-07 00:42:13,307][81400] Updated weights for policy 0, policy_version 71250 (0.0006) [2023-03-07 00:42:14,093][81400] Updated weights for policy 0, policy_version 71260 (0.0006) [2023-03-07 00:42:14,856][81400] Updated weights for policy 0, policy_version 71270 (0.0006) [2023-03-07 00:42:15,615][81400] Updated weights for policy 0, policy_version 71280 (0.0006) [2023-03-07 00:42:16,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13145.4). Total num frames: 72997888. Throughput: 0: 13155.5. Samples: 72997208. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:42:16,237][81074] Avg episode reward: [(0, '3236.770')] [2023-03-07 00:42:16,406][81400] Updated weights for policy 0, policy_version 71290 (0.0006) [2023-03-07 00:42:17,177][81400] Updated weights for policy 0, policy_version 71300 (0.0007) [2023-03-07 00:42:17,953][81400] Updated weights for policy 0, policy_version 71310 (0.0006) [2023-03-07 00:42:18,744][81400] Updated weights for policy 0, policy_version 71320 (0.0006) [2023-03-07 00:42:19,541][81400] Updated weights for policy 0, policy_version 71330 (0.0006) [2023-03-07 00:42:20,325][81400] Updated weights for policy 0, policy_version 71340 (0.0007) [2023-03-07 00:42:21,109][81400] Updated weights for policy 0, policy_version 71350 (0.0006) [2023-03-07 00:42:21,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 73063424. Throughput: 0: 13150.9. Samples: 73036373. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:42:21,237][81074] Avg episode reward: [(0, '3202.093')] [2023-03-07 00:42:21,897][81400] Updated weights for policy 0, policy_version 71360 (0.0006) [2023-03-07 00:42:22,673][81400] Updated weights for policy 0, policy_version 71370 (0.0006) [2023-03-07 00:42:23,450][81400] Updated weights for policy 0, policy_version 71380 (0.0006) [2023-03-07 00:42:24,217][81400] Updated weights for policy 0, policy_version 71390 (0.0007) [2023-03-07 00:42:25,011][81400] Updated weights for policy 0, policy_version 71400 (0.0006) [2023-03-07 00:42:25,764][81400] Updated weights for policy 0, policy_version 71410 (0.0006) [2023-03-07 00:42:26,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13141.9). Total num frames: 73128960. Throughput: 0: 13149.5. Samples: 73115025. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:42:26,237][81074] Avg episode reward: [(0, '3266.322')] [2023-03-07 00:42:26,542][81400] Updated weights for policy 0, policy_version 71420 (0.0006) [2023-03-07 00:42:27,341][81400] Updated weights for policy 0, policy_version 71430 (0.0006) [2023-03-07 00:42:28,121][81400] Updated weights for policy 0, policy_version 71440 (0.0006) [2023-03-07 00:42:28,909][81400] Updated weights for policy 0, policy_version 71450 (0.0006) [2023-03-07 00:42:29,681][81400] Updated weights for policy 0, policy_version 71460 (0.0006) [2023-03-07 00:42:30,465][81400] Updated weights for policy 0, policy_version 71470 (0.0006) [2023-03-07 00:42:31,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 73194496. Throughput: 0: 13147.9. Samples: 73193732. Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) [2023-03-07 00:42:31,237][81074] Avg episode reward: [(0, '3393.831')] [2023-03-07 00:42:31,249][81400] Updated weights for policy 0, policy_version 71480 (0.0006) [2023-03-07 00:42:32,015][81400] Updated weights for policy 0, policy_version 71490 (0.0005) [2023-03-07 00:42:32,780][81400] Updated weights for policy 0, policy_version 71500 (0.0006) [2023-03-07 00:42:33,588][81400] Updated weights for policy 0, policy_version 71510 (0.0006) [2023-03-07 00:42:34,353][81400] Updated weights for policy 0, policy_version 71520 (0.0006) [2023-03-07 00:42:35,112][81400] Updated weights for policy 0, policy_version 71530 (0.0005) [2023-03-07 00:42:35,909][81400] Updated weights for policy 0, policy_version 71540 (0.0006) [2023-03-07 00:42:36,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 73261056. Throughput: 0: 13150.6. Samples: 73233222. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) [2023-03-07 00:42:36,237][81074] Avg episode reward: [(0, '3404.057')] [2023-03-07 00:42:36,675][81400] Updated weights for policy 0, policy_version 71550 (0.0006) [2023-03-07 00:42:37,461][81400] Updated weights for policy 0, policy_version 71560 (0.0007) [2023-03-07 00:42:38,247][81400] Updated weights for policy 0, policy_version 71570 (0.0006) [2023-03-07 00:42:39,017][81400] Updated weights for policy 0, policy_version 71580 (0.0006) [2023-03-07 00:42:39,798][81400] Updated weights for policy 0, policy_version 71590 (0.0006) [2023-03-07 00:42:40,571][81400] Updated weights for policy 0, policy_version 71600 (0.0006) [2023-03-07 00:42:41,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13141.4, 300 sec: 13145.4). Total num frames: 73326592. Throughput: 0: 13150.2. Samples: 73312352. Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) [2023-03-07 00:42:41,237][81074] Avg episode reward: [(0, '3437.898')] [2023-03-07 00:42:41,359][81400] Updated weights for policy 0, policy_version 71610 (0.0007) [2023-03-07 00:42:42,122][81400] Updated weights for policy 0, policy_version 71620 (0.0007) [2023-03-07 00:42:42,913][81400] Updated weights for policy 0, policy_version 71630 (0.0006) [2023-03-07 00:42:43,685][81400] Updated weights for policy 0, policy_version 71640 (0.0006) [2023-03-07 00:42:44,487][81400] Updated weights for policy 0, policy_version 71650 (0.0006) [2023-03-07 00:42:45,256][81400] Updated weights for policy 0, policy_version 71660 (0.0007) [2023-03-07 00:42:46,025][81400] Updated weights for policy 0, policy_version 71670 (0.0006) [2023-03-07 00:42:46,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 73392128. Throughput: 0: 13148.1. Samples: 73391254. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) [2023-03-07 00:42:46,237][81074] Avg episode reward: [(0, '3302.584')] [2023-03-07 00:42:46,810][81400] Updated weights for policy 0, policy_version 71680 (0.0006) [2023-03-07 00:42:47,586][81400] Updated weights for policy 0, policy_version 71690 (0.0006) [2023-03-07 00:42:48,366][81400] Updated weights for policy 0, policy_version 71700 (0.0006) [2023-03-07 00:42:49,142][81400] Updated weights for policy 0, policy_version 71710 (0.0007) [2023-03-07 00:42:49,941][81400] Updated weights for policy 0, policy_version 71720 (0.0006) [2023-03-07 00:42:50,723][81400] Updated weights for policy 0, policy_version 71730 (0.0006) [2023-03-07 00:42:51,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13141.4, 300 sec: 13138.4). Total num frames: 73457664. Throughput: 0: 13141.6. Samples: 73430516. Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) [2023-03-07 00:42:51,237][81074] Avg episode reward: [(0, '3434.613')] [2023-03-07 00:42:51,515][81400] Updated weights for policy 0, policy_version 71740 (0.0006) [2023-03-07 00:42:52,304][81400] Updated weights for policy 0, policy_version 71750 (0.0006) [2023-03-07 00:42:53,065][81400] Updated weights for policy 0, policy_version 71760 (0.0006) [2023-03-07 00:42:53,833][81400] Updated weights for policy 0, policy_version 71770 (0.0006) [2023-03-07 00:42:54,604][81400] Updated weights for policy 0, policy_version 71780 (0.0006) [2023-03-07 00:42:55,371][81400] Updated weights for policy 0, policy_version 71790 (0.0006) [2023-03-07 00:42:56,160][81400] Updated weights for policy 0, policy_version 71800 (0.0007) [2023-03-07 00:42:56,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13138.4). Total num frames: 73523200. Throughput: 0: 13137.9. Samples: 73509191. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) [2023-03-07 00:42:56,237][81074] Avg episode reward: [(0, '3233.290')] [2023-03-07 00:42:56,949][81400] Updated weights for policy 0, policy_version 71810 (0.0006) [2023-03-07 00:42:57,728][81400] Updated weights for policy 0, policy_version 71820 (0.0006) [2023-03-07 00:42:58,495][81400] Updated weights for policy 0, policy_version 71830 (0.0006) [2023-03-07 00:42:59,279][81400] Updated weights for policy 0, policy_version 71840 (0.0006) [2023-03-07 00:43:00,051][81400] Updated weights for policy 0, policy_version 71850 (0.0006) [2023-03-07 00:43:00,849][81400] Updated weights for policy 0, policy_version 71860 (0.0006) [2023-03-07 00:43:01,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13124.2, 300 sec: 13138.4). Total num frames: 73588736. Throughput: 0: 13123.8. Samples: 73587781. Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) [2023-03-07 00:43:01,237][81074] Avg episode reward: [(0, '3363.009')] [2023-03-07 00:43:01,651][81400] Updated weights for policy 0, policy_version 71870 (0.0006) [2023-03-07 00:43:02,425][81400] Updated weights for policy 0, policy_version 71880 (0.0006) [2023-03-07 00:43:03,201][81400] Updated weights for policy 0, policy_version 71890 (0.0006) [2023-03-07 00:43:03,971][81400] Updated weights for policy 0, policy_version 71900 (0.0006) [2023-03-07 00:43:04,759][81400] Updated weights for policy 0, policy_version 71910 (0.0006) [2023-03-07 00:43:05,536][81400] Updated weights for policy 0, policy_version 71920 (0.0006) [2023-03-07 00:43:06,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 73655296. Throughput: 0: 13131.9. Samples: 73627310. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) [2023-03-07 00:43:06,237][81074] Avg episode reward: [(0, '3078.200')] [2023-03-07 00:43:06,312][81400] Updated weights for policy 0, policy_version 71930 (0.0006) [2023-03-07 00:43:07,105][81400] Updated weights for policy 0, policy_version 71940 (0.0006) [2023-03-07 00:43:07,873][81400] Updated weights for policy 0, policy_version 71950 (0.0006) [2023-03-07 00:43:08,653][81400] Updated weights for policy 0, policy_version 71960 (0.0006) [2023-03-07 00:43:09,436][81400] Updated weights for policy 0, policy_version 71970 (0.0006) [2023-03-07 00:43:10,220][81400] Updated weights for policy 0, policy_version 71980 (0.0006) [2023-03-07 00:43:10,985][81400] Updated weights for policy 0, policy_version 71990 (0.0006) [2023-03-07 00:43:11,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 73720832. Throughput: 0: 13133.8. Samples: 73706047. Policy #0 lag: (min: 0.0, avg: 1.0, max: 2.0) [2023-03-07 00:43:11,237][81074] Avg episode reward: [(0, '3312.333')] [2023-03-07 00:43:11,771][81400] Updated weights for policy 0, policy_version 72000 (0.0006) [2023-03-07 00:43:12,542][81400] Updated weights for policy 0, policy_version 72010 (0.0005) [2023-03-07 00:43:13,327][81400] Updated weights for policy 0, policy_version 72020 (0.0005) [2023-03-07 00:43:14,102][81400] Updated weights for policy 0, policy_version 72030 (0.0006) [2023-03-07 00:43:14,894][81400] Updated weights for policy 0, policy_version 72040 (0.0007) [2023-03-07 00:43:15,674][81400] Updated weights for policy 0, policy_version 72050 (0.0006) [2023-03-07 00:43:16,236][81074] Fps is (10 sec: 13004.8, 60 sec: 13124.2, 300 sec: 13138.4). Total num frames: 73785344. Throughput: 0: 13133.7. Samples: 73784750. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:43:16,237][81074] Avg episode reward: [(0, '3341.974')] [2023-03-07 00:43:16,452][81400] Updated weights for policy 0, policy_version 72060 (0.0006) [2023-03-07 00:43:17,243][81400] Updated weights for policy 0, policy_version 72070 (0.0006) [2023-03-07 00:43:18,005][81400] Updated weights for policy 0, policy_version 72080 (0.0006) [2023-03-07 00:43:18,777][81400] Updated weights for policy 0, policy_version 72090 (0.0007) [2023-03-07 00:43:19,573][81400] Updated weights for policy 0, policy_version 72100 (0.0006) [2023-03-07 00:43:20,341][81400] Updated weights for policy 0, policy_version 72110 (0.0007) [2023-03-07 00:43:21,134][81400] Updated weights for policy 0, policy_version 72120 (0.0006) [2023-03-07 00:43:21,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 73851904. Throughput: 0: 13135.9. Samples: 73824339. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:43:21,237][81074] Avg episode reward: [(0, '3350.603')] [2023-03-07 00:43:21,906][81400] Updated weights for policy 0, policy_version 72130 (0.0006) [2023-03-07 00:43:22,678][81400] Updated weights for policy 0, policy_version 72140 (0.0007) [2023-03-07 00:43:23,456][81400] Updated weights for policy 0, policy_version 72150 (0.0006) [2023-03-07 00:43:24,247][81400] Updated weights for policy 0, policy_version 72160 (0.0006) [2023-03-07 00:43:25,032][81400] Updated weights for policy 0, policy_version 72170 (0.0006) [2023-03-07 00:43:25,794][81400] Updated weights for policy 0, policy_version 72180 (0.0006) [2023-03-07 00:43:26,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13138.4). Total num frames: 73917440. Throughput: 0: 13127.0. Samples: 73903068. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:43:26,237][81074] Avg episode reward: [(0, '3262.106')] [2023-03-07 00:43:26,596][81400] Updated weights for policy 0, policy_version 72190 (0.0007) [2023-03-07 00:43:27,376][81400] Updated weights for policy 0, policy_version 72200 (0.0006) [2023-03-07 00:43:28,143][81400] Updated weights for policy 0, policy_version 72210 (0.0006) [2023-03-07 00:43:28,938][81400] Updated weights for policy 0, policy_version 72220 (0.0007) [2023-03-07 00:43:29,719][81400] Updated weights for policy 0, policy_version 72230 (0.0006) [2023-03-07 00:43:30,476][81400] Updated weights for policy 0, policy_version 72240 (0.0006) [2023-03-07 00:43:31,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13138.4). Total num frames: 73982976. Throughput: 0: 13125.3. Samples: 73981893. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:43:31,237][81074] Avg episode reward: [(0, '3384.955')] [2023-03-07 00:43:31,262][81400] Updated weights for policy 0, policy_version 72250 (0.0006) [2023-03-07 00:43:32,029][81400] Updated weights for policy 0, policy_version 72260 (0.0007) [2023-03-07 00:43:32,822][81400] Updated weights for policy 0, policy_version 72270 (0.0006) [2023-03-07 00:43:33,588][81400] Updated weights for policy 0, policy_version 72280 (0.0006) [2023-03-07 00:43:34,365][81400] Updated weights for policy 0, policy_version 72290 (0.0006) [2023-03-07 00:43:35,137][81400] Updated weights for policy 0, policy_version 72300 (0.0006) [2023-03-07 00:43:35,926][81400] Updated weights for policy 0, policy_version 72310 (0.0006) [2023-03-07 00:43:36,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13141.3, 300 sec: 13138.4). Total num frames: 74049536. Throughput: 0: 13134.3. Samples: 74021560. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:43:36,237][81074] Avg episode reward: [(0, '3385.949')] [2023-03-07 00:43:36,697][81400] Updated weights for policy 0, policy_version 72320 (0.0006) [2023-03-07 00:43:37,464][81400] Updated weights for policy 0, policy_version 72330 (0.0006) [2023-03-07 00:43:38,253][81400] Updated weights for policy 0, policy_version 72340 (0.0006) [2023-03-07 00:43:39,023][81400] Updated weights for policy 0, policy_version 72350 (0.0005) [2023-03-07 00:43:39,802][81400] Updated weights for policy 0, policy_version 72360 (0.0006) [2023-03-07 00:43:40,582][81400] Updated weights for policy 0, policy_version 72370 (0.0006) [2023-03-07 00:43:41,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13138.4). Total num frames: 74115072. Throughput: 0: 13140.3. Samples: 74100505. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:43:41,237][81074] Avg episode reward: [(0, '3424.748')] [2023-03-07 00:43:41,363][81400] Updated weights for policy 0, policy_version 72380 (0.0006) [2023-03-07 00:43:42,126][81400] Updated weights for policy 0, policy_version 72390 (0.0006) [2023-03-07 00:43:42,923][81400] Updated weights for policy 0, policy_version 72400 (0.0006) [2023-03-07 00:43:43,690][81400] Updated weights for policy 0, policy_version 72410 (0.0006) [2023-03-07 00:43:44,480][81400] Updated weights for policy 0, policy_version 72420 (0.0006) [2023-03-07 00:43:45,250][81400] Updated weights for policy 0, policy_version 72430 (0.0006) [2023-03-07 00:43:46,036][81400] Updated weights for policy 0, policy_version 72440 (0.0008) [2023-03-07 00:43:46,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13138.4). Total num frames: 74180608. Throughput: 0: 13150.5. Samples: 74179552. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:43:46,237][81074] Avg episode reward: [(0, '3517.872')] [2023-03-07 00:43:46,812][81400] Updated weights for policy 0, policy_version 72450 (0.0006) [2023-03-07 00:43:47,596][81400] Updated weights for policy 0, policy_version 72460 (0.0006) [2023-03-07 00:43:48,366][81400] Updated weights for policy 0, policy_version 72470 (0.0006) [2023-03-07 00:43:49,143][81400] Updated weights for policy 0, policy_version 72480 (0.0007) [2023-03-07 00:43:49,931][81400] Updated weights for policy 0, policy_version 72490 (0.0006) [2023-03-07 00:43:50,719][81400] Updated weights for policy 0, policy_version 72500 (0.0006) [2023-03-07 00:43:51,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13138.4). Total num frames: 74246144. Throughput: 0: 13146.3. Samples: 74218892. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:43:51,237][81074] Avg episode reward: [(0, '3518.227')] [2023-03-07 00:43:51,500][81400] Updated weights for policy 0, policy_version 72510 (0.0006) [2023-03-07 00:43:52,261][81400] Updated weights for policy 0, policy_version 72520 (0.0006) [2023-03-07 00:43:53,045][81400] Updated weights for policy 0, policy_version 72530 (0.0006) [2023-03-07 00:43:53,827][81400] Updated weights for policy 0, policy_version 72540 (0.0006) [2023-03-07 00:43:54,590][81400] Updated weights for policy 0, policy_version 72550 (0.0006) [2023-03-07 00:43:55,381][81400] Updated weights for policy 0, policy_version 72560 (0.0006) [2023-03-07 00:43:56,173][81400] Updated weights for policy 0, policy_version 72570 (0.0006) [2023-03-07 00:43:56,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13138.4). Total num frames: 74311680. Throughput: 0: 13146.1. Samples: 74297620. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:43:56,237][81074] Avg episode reward: [(0, '3344.732')] [2023-03-07 00:43:56,249][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000072571_74312704.pth... [2023-03-07 00:43:56,281][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000069491_71158784.pth [2023-03-07 00:43:56,947][81400] Updated weights for policy 0, policy_version 72580 (0.0006) [2023-03-07 00:43:57,726][81400] Updated weights for policy 0, policy_version 72590 (0.0007) [2023-03-07 00:43:58,510][81400] Updated weights for policy 0, policy_version 72600 (0.0006) [2023-03-07 00:43:59,270][81400] Updated weights for policy 0, policy_version 72610 (0.0006) [2023-03-07 00:44:00,062][81400] Updated weights for policy 0, policy_version 72620 (0.0006) [2023-03-07 00:44:00,842][81400] Updated weights for policy 0, policy_version 72630 (0.0007) [2023-03-07 00:44:01,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13141.9). Total num frames: 74378240. Throughput: 0: 13153.4. Samples: 74376650. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:44:01,237][81074] Avg episode reward: [(0, '3296.630')] [2023-03-07 00:44:01,642][81400] Updated weights for policy 0, policy_version 72640 (0.0005) [2023-03-07 00:44:02,401][81400] Updated weights for policy 0, policy_version 72650 (0.0007) [2023-03-07 00:44:03,185][81400] Updated weights for policy 0, policy_version 72660 (0.0006) [2023-03-07 00:44:03,983][81400] Updated weights for policy 0, policy_version 72670 (0.0006) [2023-03-07 00:44:04,761][81400] Updated weights for policy 0, policy_version 72680 (0.0006) [2023-03-07 00:44:05,541][81400] Updated weights for policy 0, policy_version 72690 (0.0006) [2023-03-07 00:44:06,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13138.4). Total num frames: 74442752. Throughput: 0: 13146.4. 
Samples: 74415927. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:44:06,237][81074] Avg episode reward: [(0, '3173.331')] [2023-03-07 00:44:06,308][81400] Updated weights for policy 0, policy_version 72700 (0.0006) [2023-03-07 00:44:07,104][81400] Updated weights for policy 0, policy_version 72710 (0.0006) [2023-03-07 00:44:07,872][81400] Updated weights for policy 0, policy_version 72720 (0.0006) [2023-03-07 00:44:08,636][81400] Updated weights for policy 0, policy_version 72730 (0.0007) [2023-03-07 00:44:09,415][81400] Updated weights for policy 0, policy_version 72740 (0.0006) [2023-03-07 00:44:10,195][81400] Updated weights for policy 0, policy_version 72750 (0.0006) [2023-03-07 00:44:10,980][81400] Updated weights for policy 0, policy_version 72760 (0.0006) [2023-03-07 00:44:11,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 74509312. Throughput: 0: 13149.8. Samples: 74494808. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:44:11,237][81074] Avg episode reward: [(0, '3173.686')] [2023-03-07 00:44:11,753][81400] Updated weights for policy 0, policy_version 72770 (0.0007) [2023-03-07 00:44:12,534][81400] Updated weights for policy 0, policy_version 72780 (0.0006) [2023-03-07 00:44:13,318][81400] Updated weights for policy 0, policy_version 72790 (0.0006) [2023-03-07 00:44:14,088][81400] Updated weights for policy 0, policy_version 72800 (0.0006) [2023-03-07 00:44:14,879][81400] Updated weights for policy 0, policy_version 72810 (0.0006) [2023-03-07 00:44:15,626][81400] Updated weights for policy 0, policy_version 72820 (0.0006) [2023-03-07 00:44:16,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13141.9). Total num frames: 74574848. Throughput: 0: 13152.0. Samples: 74573732. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:44:16,237][81074] Avg episode reward: [(0, '3298.352')] [2023-03-07 00:44:16,430][81400] Updated weights for policy 0, policy_version 72830 (0.0005) [2023-03-07 00:44:17,204][81400] Updated weights for policy 0, policy_version 72840 (0.0006) [2023-03-07 00:44:17,971][81400] Updated weights for policy 0, policy_version 72850 (0.0007) [2023-03-07 00:44:18,757][81400] Updated weights for policy 0, policy_version 72860 (0.0007) [2023-03-07 00:44:19,546][81400] Updated weights for policy 0, policy_version 72870 (0.0007) [2023-03-07 00:44:20,317][81400] Updated weights for policy 0, policy_version 72880 (0.0006) [2023-03-07 00:44:21,090][81400] Updated weights for policy 0, policy_version 72890 (0.0006) [2023-03-07 00:44:21,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 74640384. Throughput: 0: 13146.1. Samples: 74613135. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:44:21,237][81074] Avg episode reward: [(0, '3274.951')] [2023-03-07 00:44:21,877][81400] Updated weights for policy 0, policy_version 72900 (0.0006) [2023-03-07 00:44:22,640][81400] Updated weights for policy 0, policy_version 72910 (0.0006) [2023-03-07 00:44:23,425][81400] Updated weights for policy 0, policy_version 72920 (0.0006) [2023-03-07 00:44:24,206][81400] Updated weights for policy 0, policy_version 72930 (0.0006) [2023-03-07 00:44:24,969][81400] Updated weights for policy 0, policy_version 72940 (0.0006) [2023-03-07 00:44:25,758][81400] Updated weights for policy 0, policy_version 72950 (0.0006) [2023-03-07 00:44:26,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13141.9). Total num frames: 74706944. Throughput: 0: 13146.1. Samples: 74692080. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:44:26,237][81074] Avg episode reward: [(0, '3247.093')] [2023-03-07 00:44:26,520][81400] Updated weights for policy 0, policy_version 72960 (0.0006) [2023-03-07 00:44:27,299][81400] Updated weights for policy 0, policy_version 72970 (0.0006) [2023-03-07 00:44:28,098][81400] Updated weights for policy 0, policy_version 72980 (0.0006) [2023-03-07 00:44:28,869][81400] Updated weights for policy 0, policy_version 72990 (0.0006) [2023-03-07 00:44:29,645][81400] Updated weights for policy 0, policy_version 73000 (0.0006) [2023-03-07 00:44:30,432][81400] Updated weights for policy 0, policy_version 73010 (0.0007) [2023-03-07 00:44:31,206][81400] Updated weights for policy 0, policy_version 73020 (0.0006) [2023-03-07 00:44:31,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13141.9). Total num frames: 74772480. Throughput: 0: 13148.0. Samples: 74771211. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:44:31,237][81074] Avg episode reward: [(0, '3251.110')] [2023-03-07 00:44:31,984][81400] Updated weights for policy 0, policy_version 73030 (0.0006) [2023-03-07 00:44:32,773][81400] Updated weights for policy 0, policy_version 73040 (0.0006) [2023-03-07 00:44:33,543][81400] Updated weights for policy 0, policy_version 73050 (0.0006) [2023-03-07 00:44:34,332][81400] Updated weights for policy 0, policy_version 73060 (0.0007) [2023-03-07 00:44:35,109][81400] Updated weights for policy 0, policy_version 73070 (0.0006) [2023-03-07 00:44:35,878][81400] Updated weights for policy 0, policy_version 73080 (0.0006) [2023-03-07 00:44:36,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13138.4). Total num frames: 74838016. Throughput: 0: 13147.6. Samples: 74810534. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:44:36,237][81074] Avg episode reward: [(0, '3332.287')] [2023-03-07 00:44:36,656][81400] Updated weights for policy 0, policy_version 73090 (0.0006) [2023-03-07 00:44:37,435][81400] Updated weights for policy 0, policy_version 73100 (0.0007) [2023-03-07 00:44:38,212][81400] Updated weights for policy 0, policy_version 73110 (0.0006) [2023-03-07 00:44:38,994][81400] Updated weights for policy 0, policy_version 73120 (0.0006) [2023-03-07 00:44:39,769][81400] Updated weights for policy 0, policy_version 73130 (0.0007) [2023-03-07 00:44:40,557][81400] Updated weights for policy 0, policy_version 73140 (0.0006) [2023-03-07 00:44:41,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13141.4, 300 sec: 13138.4). Total num frames: 74903552. Throughput: 0: 13152.7. Samples: 74889492. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:44:41,247][81074] Avg episode reward: [(0, '3268.016')] [2023-03-07 00:44:41,326][81400] Updated weights for policy 0, policy_version 73150 (0.0006) [2023-03-07 00:44:42,105][81400] Updated weights for policy 0, policy_version 73160 (0.0006) [2023-03-07 00:44:42,891][81400] Updated weights for policy 0, policy_version 73170 (0.0006) [2023-03-07 00:44:43,673][81400] Updated weights for policy 0, policy_version 73180 (0.0006) [2023-03-07 00:44:44,452][81400] Updated weights for policy 0, policy_version 73190 (0.0006) [2023-03-07 00:44:45,237][81400] Updated weights for policy 0, policy_version 73200 (0.0006) [2023-03-07 00:44:45,992][81400] Updated weights for policy 0, policy_version 73210 (0.0006) [2023-03-07 00:44:46,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13141.9). Total num frames: 74970112. Throughput: 0: 13149.5. Samples: 74968377. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:44:46,247][81074] Avg episode reward: [(0, '3386.514')] [2023-03-07 00:44:46,789][81400] Updated weights for policy 0, policy_version 73220 (0.0006) [2023-03-07 00:44:47,568][81400] Updated weights for policy 0, policy_version 73230 (0.0006) [2023-03-07 00:44:48,351][81400] Updated weights for policy 0, policy_version 73240 (0.0006) [2023-03-07 00:44:49,123][81400] Updated weights for policy 0, policy_version 73250 (0.0006) [2023-03-07 00:44:49,897][81400] Updated weights for policy 0, policy_version 73260 (0.0006) [2023-03-07 00:44:50,665][81400] Updated weights for policy 0, policy_version 73270 (0.0006) [2023-03-07 00:44:51,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13138.4). Total num frames: 75034624. Throughput: 0: 13149.7. Samples: 75007661. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:44:51,247][81074] Avg episode reward: [(0, '3390.178')] [2023-03-07 00:44:51,465][81400] Updated weights for policy 0, policy_version 73280 (0.0006) [2023-03-07 00:44:52,230][81400] Updated weights for policy 0, policy_version 73290 (0.0006) [2023-03-07 00:44:53,015][81400] Updated weights for policy 0, policy_version 73300 (0.0007) [2023-03-07 00:44:53,810][81400] Updated weights for policy 0, policy_version 73310 (0.0007) [2023-03-07 00:44:54,586][81400] Updated weights for policy 0, policy_version 73320 (0.0005) [2023-03-07 00:44:55,366][81400] Updated weights for policy 0, policy_version 73330 (0.0007) [2023-03-07 00:44:56,149][81400] Updated weights for policy 0, policy_version 73340 (0.0006) [2023-03-07 00:44:56,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13138.4). Total num frames: 75101184. Throughput: 0: 13144.7. Samples: 75086320. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:44:56,247][81074] Avg episode reward: [(0, '3548.109')] [2023-03-07 00:44:56,919][81400] Updated weights for policy 0, policy_version 73350 (0.0007) [2023-03-07 00:44:57,695][81400] Updated weights for policy 0, policy_version 73360 (0.0005) [2023-03-07 00:44:58,478][81400] Updated weights for policy 0, policy_version 73370 (0.0008) [2023-03-07 00:44:59,286][81400] Updated weights for policy 0, policy_version 73380 (0.0006) [2023-03-07 00:45:00,055][81400] Updated weights for policy 0, policy_version 73390 (0.0006) [2023-03-07 00:45:00,841][81400] Updated weights for policy 0, policy_version 73400 (0.0006) [2023-03-07 00:45:01,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13141.4, 300 sec: 13138.5). Total num frames: 75166720. Throughput: 0: 13142.0. Samples: 75165118. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:45:01,237][81074] Avg episode reward: [(0, '3607.629')] [2023-03-07 00:45:01,630][81400] Updated weights for policy 0, policy_version 73410 (0.0006) [2023-03-07 00:45:02,429][81400] Updated weights for policy 0, policy_version 73420 (0.0006) [2023-03-07 00:45:03,194][81400] Updated weights for policy 0, policy_version 73430 (0.0007) [2023-03-07 00:45:03,973][81400] Updated weights for policy 0, policy_version 73440 (0.0006) [2023-03-07 00:45:04,753][81400] Updated weights for policy 0, policy_version 73450 (0.0006) [2023-03-07 00:45:05,560][81400] Updated weights for policy 0, policy_version 73460 (0.0007) [2023-03-07 00:45:06,236][81074] Fps is (10 sec: 13004.5, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 75231232. Throughput: 0: 13138.1. Samples: 75204350. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:45:06,237][81074] Avg episode reward: [(0, '3558.284')] [2023-03-07 00:45:06,335][81400] Updated weights for policy 0, policy_version 73470 (0.0006) [2023-03-07 00:45:07,101][81400] Updated weights for policy 0, policy_version 73480 (0.0006) [2023-03-07 00:45:07,898][81400] Updated weights for policy 0, policy_version 73490 (0.0006) [2023-03-07 00:45:08,685][81400] Updated weights for policy 0, policy_version 73500 (0.0006) [2023-03-07 00:45:09,454][81400] Updated weights for policy 0, policy_version 73510 (0.0005) [2023-03-07 00:45:10,235][81400] Updated weights for policy 0, policy_version 73520 (0.0007) [2023-03-07 00:45:11,009][81400] Updated weights for policy 0, policy_version 73530 (0.0006) [2023-03-07 00:45:11,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 75297792. Throughput: 0: 13127.4. Samples: 75282812. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:45:11,237][81074] Avg episode reward: [(0, '3398.556')] [2023-03-07 00:45:11,769][81400] Updated weights for policy 0, policy_version 73540 (0.0006) [2023-03-07 00:45:12,564][81400] Updated weights for policy 0, policy_version 73550 (0.0006) [2023-03-07 00:45:13,339][81400] Updated weights for policy 0, policy_version 73560 (0.0006) [2023-03-07 00:45:14,108][81400] Updated weights for policy 0, policy_version 73570 (0.0007) [2023-03-07 00:45:14,885][81400] Updated weights for policy 0, policy_version 73580 (0.0006) [2023-03-07 00:45:15,661][81400] Updated weights for policy 0, policy_version 73590 (0.0006) [2023-03-07 00:45:16,236][81074] Fps is (10 sec: 13209.8, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 75363328. Throughput: 0: 13131.0. Samples: 75362107. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:45:16,237][81074] Avg episode reward: [(0, '3383.437')] [2023-03-07 00:45:16,429][81400] Updated weights for policy 0, policy_version 73600 (0.0006) [2023-03-07 00:45:17,218][81400] Updated weights for policy 0, policy_version 73610 (0.0006) [2023-03-07 00:45:17,991][81400] Updated weights for policy 0, policy_version 73620 (0.0006) [2023-03-07 00:45:18,766][81400] Updated weights for policy 0, policy_version 73630 (0.0006) [2023-03-07 00:45:19,543][81400] Updated weights for policy 0, policy_version 73640 (0.0006) [2023-03-07 00:45:20,320][81400] Updated weights for policy 0, policy_version 73650 (0.0006) [2023-03-07 00:45:21,109][81400] Updated weights for policy 0, policy_version 73660 (0.0008) [2023-03-07 00:45:21,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 75428864. Throughput: 0: 13136.3. Samples: 75401667. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:45:21,237][81074] Avg episode reward: [(0, '3560.316')] [2023-03-07 00:45:21,880][81400] Updated weights for policy 0, policy_version 73670 (0.0007) [2023-03-07 00:45:22,645][81400] Updated weights for policy 0, policy_version 73680 (0.0006) [2023-03-07 00:45:23,450][81400] Updated weights for policy 0, policy_version 73690 (0.0006) [2023-03-07 00:45:24,210][81400] Updated weights for policy 0, policy_version 73700 (0.0006) [2023-03-07 00:45:24,982][81400] Updated weights for policy 0, policy_version 73710 (0.0006) [2023-03-07 00:45:25,776][81400] Updated weights for policy 0, policy_version 73720 (0.0006) [2023-03-07 00:45:26,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13135.0). Total num frames: 75494400. Throughput: 0: 13136.3. Samples: 75480627. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:45:26,237][81074] Avg episode reward: [(0, '3313.703')] [2023-03-07 00:45:26,557][81400] Updated weights for policy 0, policy_version 73730 (0.0006) [2023-03-07 00:45:27,357][81400] Updated weights for policy 0, policy_version 73740 (0.0006) [2023-03-07 00:45:28,108][81400] Updated weights for policy 0, policy_version 73750 (0.0006) [2023-03-07 00:45:28,903][81400] Updated weights for policy 0, policy_version 73760 (0.0006) [2023-03-07 00:45:29,672][81400] Updated weights for policy 0, policy_version 73770 (0.0006) [2023-03-07 00:45:30,453][81400] Updated weights for policy 0, policy_version 73780 (0.0006) [2023-03-07 00:45:31,228][81400] Updated weights for policy 0, policy_version 73790 (0.0006) [2023-03-07 00:45:31,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13141.3, 300 sec: 13138.4). Total num frames: 75560960. Throughput: 0: 13131.1. Samples: 75559279. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 00:45:31,237][81074] Avg episode reward: [(0, '3483.042')] [2023-03-07 00:45:32,010][81400] Updated weights for policy 0, policy_version 73800 (0.0005) [2023-03-07 00:45:32,788][81400] Updated weights for policy 0, policy_version 73810 (0.0006) [2023-03-07 00:45:33,567][81400] Updated weights for policy 0, policy_version 73820 (0.0007) [2023-03-07 00:45:34,353][81400] Updated weights for policy 0, policy_version 73830 (0.0006) [2023-03-07 00:45:35,128][81400] Updated weights for policy 0, policy_version 73840 (0.0007) [2023-03-07 00:45:35,913][81400] Updated weights for policy 0, policy_version 73850 (0.0006) [2023-03-07 00:45:36,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13141.3, 300 sec: 13138.4). Total num frames: 75626496. Throughput: 0: 13134.1. Samples: 75598694. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 00:45:36,237][81074] Avg episode reward: [(0, '3457.236')] [2023-03-07 00:45:36,686][81400] Updated weights for policy 0, policy_version 73860 (0.0006) [2023-03-07 00:45:37,452][81400] Updated weights for policy 0, policy_version 73870 (0.0006) [2023-03-07 00:45:38,255][81400] Updated weights for policy 0, policy_version 73880 (0.0006) [2023-03-07 00:45:39,038][81400] Updated weights for policy 0, policy_version 73890 (0.0006) [2023-03-07 00:45:39,791][81400] Updated weights for policy 0, policy_version 73900 (0.0006) [2023-03-07 00:45:40,575][81400] Updated weights for policy 0, policy_version 73910 (0.0006) [2023-03-07 00:45:41,236][81074] Fps is (10 sec: 13107.4, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 75692032. Throughput: 0: 13141.2. Samples: 75677676. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 00:45:41,237][81074] Avg episode reward: [(0, '3473.461')] [2023-03-07 00:45:41,358][81400] Updated weights for policy 0, policy_version 73920 (0.0006) [2023-03-07 00:45:42,134][81400] Updated weights for policy 0, policy_version 73930 (0.0006) [2023-03-07 00:45:42,930][81400] Updated weights for policy 0, policy_version 73940 (0.0007) [2023-03-07 00:45:43,715][81400] Updated weights for policy 0, policy_version 73950 (0.0005) [2023-03-07 00:45:44,497][81400] Updated weights for policy 0, policy_version 73960 (0.0007) [2023-03-07 00:45:45,285][81400] Updated weights for policy 0, policy_version 73970 (0.0006) [2023-03-07 00:45:46,059][81400] Updated weights for policy 0, policy_version 73980 (0.0006) [2023-03-07 00:45:46,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13135.0). Total num frames: 75757568. Throughput: 0: 13135.3. Samples: 75756209. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 00:45:46,237][81074] Avg episode reward: [(0, '3433.860')] [2023-03-07 00:45:46,842][81400] Updated weights for policy 0, policy_version 73990 (0.0006) [2023-03-07 00:45:47,629][81400] Updated weights for policy 0, policy_version 74000 (0.0007) [2023-03-07 00:45:48,419][81400] Updated weights for policy 0, policy_version 74010 (0.0006) [2023-03-07 00:45:49,210][81400] Updated weights for policy 0, policy_version 74020 (0.0006) [2023-03-07 00:45:49,979][81400] Updated weights for policy 0, policy_version 74030 (0.0006) [2023-03-07 00:45:50,761][81400] Updated weights for policy 0, policy_version 74040 (0.0007) [2023-03-07 00:45:51,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13135.0). Total num frames: 75823104. Throughput: 0: 13132.6. Samples: 75795315. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 00:45:51,237][81074] Avg episode reward: [(0, '3310.905')] [2023-03-07 00:45:51,540][81400] Updated weights for policy 0, policy_version 74050 (0.0006) [2023-03-07 00:45:52,313][81400] Updated weights for policy 0, policy_version 74060 (0.0006) [2023-03-07 00:45:53,090][81400] Updated weights for policy 0, policy_version 74070 (0.0006) [2023-03-07 00:45:53,870][81400] Updated weights for policy 0, policy_version 74080 (0.0006) [2023-03-07 00:45:54,113][81349] KL-divergence is very high: 188.3771 [2023-03-07 00:45:54,646][81400] Updated weights for policy 0, policy_version 74090 (0.0006) [2023-03-07 00:45:55,436][81400] Updated weights for policy 0, policy_version 74100 (0.0007) [2023-03-07 00:45:56,232][81400] Updated weights for policy 0, policy_version 74110 (0.0007) [2023-03-07 00:45:56,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13135.0). Total num frames: 75888640. Throughput: 0: 13139.9. Samples: 75874107. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 00:45:56,237][81074] Avg episode reward: [(0, '2850.125')] [2023-03-07 00:45:56,241][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000074110_75888640.pth... [2023-03-07 00:45:56,272][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000071030_72734720.pth [2023-03-07 00:45:57,013][81400] Updated weights for policy 0, policy_version 74120 (0.0006) [2023-03-07 00:45:57,795][81400] Updated weights for policy 0, policy_version 74130 (0.0006) [2023-03-07 00:45:58,574][81400] Updated weights for policy 0, policy_version 74140 (0.0006) [2023-03-07 00:45:59,343][81400] Updated weights for policy 0, policy_version 74150 (0.0006) [2023-03-07 00:46:00,119][81400] Updated weights for policy 0, policy_version 74160 (0.0006) [2023-03-07 00:46:00,916][81400] Updated weights for policy 0, policy_version 74170 (0.0006) [2023-03-07 00:46:01,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13135.0). Total num frames: 75954176. Throughput: 0: 13123.2. Samples: 75952650. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 00:46:01,237][81074] Avg episode reward: [(0, '3231.656')] [2023-03-07 00:46:01,708][81400] Updated weights for policy 0, policy_version 74180 (0.0006) [2023-03-07 00:46:02,469][81400] Updated weights for policy 0, policy_version 74190 (0.0006) [2023-03-07 00:46:03,245][81400] Updated weights for policy 0, policy_version 74200 (0.0006) [2023-03-07 00:46:04,044][81400] Updated weights for policy 0, policy_version 74210 (0.0006) [2023-03-07 00:46:04,809][81400] Updated weights for policy 0, policy_version 74220 (0.0006) [2023-03-07 00:46:05,585][81400] Updated weights for policy 0, policy_version 74230 (0.0006) [2023-03-07 00:46:06,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13141.4, 300 sec: 13135.0). Total num frames: 76019712. Throughput: 0: 13116.8. 
Samples: 75991922. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 00:46:06,237][81074] Avg episode reward: [(0, '3075.138')] [2023-03-07 00:46:06,367][81400] Updated weights for policy 0, policy_version 74240 (0.0007) [2023-03-07 00:46:07,136][81400] Updated weights for policy 0, policy_version 74250 (0.0006) [2023-03-07 00:46:07,911][81400] Updated weights for policy 0, policy_version 74260 (0.0006) [2023-03-07 00:46:08,692][81400] Updated weights for policy 0, policy_version 74270 (0.0006) [2023-03-07 00:46:09,473][81400] Updated weights for policy 0, policy_version 74280 (0.0006) [2023-03-07 00:46:10,246][81400] Updated weights for policy 0, policy_version 74290 (0.0006) [2023-03-07 00:46:11,024][81400] Updated weights for policy 0, policy_version 74300 (0.0007) [2023-03-07 00:46:11,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13135.0). Total num frames: 76085248. Throughput: 0: 13123.1. Samples: 76071164. Policy #0 lag: (min: 0.0, avg: 1.2, max: 2.0) [2023-03-07 00:46:11,237][81074] Avg episode reward: [(0, '2979.752')] [2023-03-07 00:46:11,785][81400] Updated weights for policy 0, policy_version 74310 (0.0006) [2023-03-07 00:46:12,338][81349] KL-divergence is very high: 11037.8887 [2023-03-07 00:46:12,568][81400] Updated weights for policy 0, policy_version 74320 (0.0006) [2023-03-07 00:46:13,353][81400] Updated weights for policy 0, policy_version 74330 (0.0008) [2023-03-07 00:46:13,979][81349] KL-divergence is very high: 110.5093 [2023-03-07 00:46:14,138][81400] Updated weights for policy 0, policy_version 74340 (0.0006) [2023-03-07 00:46:14,943][81400] Updated weights for policy 0, policy_version 74350 (0.0006) [2023-03-07 00:46:15,716][81400] Updated weights for policy 0, policy_version 74360 (0.0006) [2023-03-07 00:46:16,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13124.3, 300 sec: 13138.4). Total num frames: 76150784. Throughput: 0: 13124.3. Samples: 76149873. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:46:16,237][81074] Avg episode reward: [(0, '3108.132')] [2023-03-07 00:46:16,497][81400] Updated weights for policy 0, policy_version 74370 (0.0006) [2023-03-07 00:46:17,270][81400] Updated weights for policy 0, policy_version 74380 (0.0006) [2023-03-07 00:46:18,054][81400] Updated weights for policy 0, policy_version 74390 (0.0007) [2023-03-07 00:46:18,829][81400] Updated weights for policy 0, policy_version 74400 (0.0006) [2023-03-07 00:46:19,592][81400] Updated weights for policy 0, policy_version 74410 (0.0006) [2023-03-07 00:46:20,394][81400] Updated weights for policy 0, policy_version 74420 (0.0006) [2023-03-07 00:46:21,142][81400] Updated weights for policy 0, policy_version 74430 (0.0006) [2023-03-07 00:46:21,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13138.4). Total num frames: 76217344. Throughput: 0: 13124.6. Samples: 76189302. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:46:21,237][81074] Avg episode reward: [(0, '3031.338')] [2023-03-07 00:46:21,935][81400] Updated weights for policy 0, policy_version 74440 (0.0007) [2023-03-07 00:46:22,725][81400] Updated weights for policy 0, policy_version 74450 (0.0006) [2023-03-07 00:46:23,508][81400] Updated weights for policy 0, policy_version 74460 (0.0006) [2023-03-07 00:46:24,297][81400] Updated weights for policy 0, policy_version 74470 (0.0006) [2023-03-07 00:46:25,084][81400] Updated weights for policy 0, policy_version 74480 (0.0006) [2023-03-07 00:46:25,878][81400] Updated weights for policy 0, policy_version 74490 (0.0006) [2023-03-07 00:46:26,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13138.4). Total num frames: 76281856. Throughput: 0: 13113.9. Samples: 76267799. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:46:26,237][81074] Avg episode reward: [(0, '3159.580')] [2023-03-07 00:46:26,647][81400] Updated weights for policy 0, policy_version 74500 (0.0006) [2023-03-07 00:46:27,413][81400] Updated weights for policy 0, policy_version 74510 (0.0006) [2023-03-07 00:46:28,194][81400] Updated weights for policy 0, policy_version 74520 (0.0006) [2023-03-07 00:46:28,981][81400] Updated weights for policy 0, policy_version 74530 (0.0006) [2023-03-07 00:46:29,745][81400] Updated weights for policy 0, policy_version 74540 (0.0006) [2023-03-07 00:46:30,514][81400] Updated weights for policy 0, policy_version 74550 (0.0006) [2023-03-07 00:46:31,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13141.9). Total num frames: 76348416. Throughput: 0: 13123.6. Samples: 76346770. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:46:31,237][81074] Avg episode reward: [(0, '3088.240')] [2023-03-07 00:46:31,305][81400] Updated weights for policy 0, policy_version 74560 (0.0007) [2023-03-07 00:46:32,086][81400] Updated weights for policy 0, policy_version 74570 (0.0006) [2023-03-07 00:46:32,860][81400] Updated weights for policy 0, policy_version 74580 (0.0006) [2023-03-07 00:46:33,243][81349] KL-divergence is very high: 164.0176 [2023-03-07 00:46:33,651][81400] Updated weights for policy 0, policy_version 74590 (0.0006) [2023-03-07 00:46:34,434][81400] Updated weights for policy 0, policy_version 74600 (0.0006) [2023-03-07 00:46:35,206][81400] Updated weights for policy 0, policy_version 74610 (0.0006) [2023-03-07 00:46:35,994][81400] Updated weights for policy 0, policy_version 74620 (0.0007) [2023-03-07 00:46:36,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13124.3, 300 sec: 13138.4). Total num frames: 76413952. Throughput: 0: 13130.5. Samples: 76386188. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:46:36,237][81074] Avg episode reward: [(0, '2992.783')] [2023-03-07 00:46:36,760][81400] Updated weights for policy 0, policy_version 74630 (0.0006) [2023-03-07 00:46:37,546][81400] Updated weights for policy 0, policy_version 74640 (0.0006) [2023-03-07 00:46:38,309][81400] Updated weights for policy 0, policy_version 74650 (0.0005) [2023-03-07 00:46:39,075][81400] Updated weights for policy 0, policy_version 74660 (0.0006) [2023-03-07 00:46:39,855][81400] Updated weights for policy 0, policy_version 74670 (0.0006) [2023-03-07 00:46:40,633][81400] Updated weights for policy 0, policy_version 74680 (0.0006) [2023-03-07 00:46:41,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13138.4). Total num frames: 76479488. Throughput: 0: 13140.7. Samples: 76465440. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:46:41,237][81074] Avg episode reward: [(0, '2851.116')] [2023-03-07 00:46:41,403][81400] Updated weights for policy 0, policy_version 74690 (0.0005) [2023-03-07 00:46:42,177][81400] Updated weights for policy 0, policy_version 74700 (0.0006) [2023-03-07 00:46:42,967][81400] Updated weights for policy 0, policy_version 74710 (0.0006) [2023-03-07 00:46:43,734][81400] Updated weights for policy 0, policy_version 74720 (0.0006) [2023-03-07 00:46:44,522][81400] Updated weights for policy 0, policy_version 74730 (0.0006) [2023-03-07 00:46:45,308][81400] Updated weights for policy 0, policy_version 74740 (0.0006) [2023-03-07 00:46:46,102][81400] Updated weights for policy 0, policy_version 74750 (0.0006) [2023-03-07 00:46:46,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13124.3, 300 sec: 13138.4). Total num frames: 76545024. Throughput: 0: 13144.1. Samples: 76544133. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:46:46,237][81074] Avg episode reward: [(0, '2431.938')] [2023-03-07 00:46:46,878][81400] Updated weights for policy 0, policy_version 74760 (0.0007) [2023-03-07 00:46:47,646][81400] Updated weights for policy 0, policy_version 74770 (0.0006) [2023-03-07 00:46:48,420][81400] Updated weights for policy 0, policy_version 74780 (0.0007) [2023-03-07 00:46:49,210][81400] Updated weights for policy 0, policy_version 74790 (0.0006) [2023-03-07 00:46:49,986][81400] Updated weights for policy 0, policy_version 74800 (0.0006) [2023-03-07 00:46:50,762][81400] Updated weights for policy 0, policy_version 74810 (0.0006) [2023-03-07 00:46:51,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 76611584. Throughput: 0: 13146.4. Samples: 76583511. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:46:51,237][81074] Avg episode reward: [(0, '2789.591')] [2023-03-07 00:46:51,539][81400] Updated weights for policy 0, policy_version 74820 (0.0006) [2023-03-07 00:46:52,322][81400] Updated weights for policy 0, policy_version 74830 (0.0006) [2023-03-07 00:46:53,112][81400] Updated weights for policy 0, policy_version 74840 (0.0007) [2023-03-07 00:46:53,895][81400] Updated weights for policy 0, policy_version 74850 (0.0007) [2023-03-07 00:46:54,684][81400] Updated weights for policy 0, policy_version 74860 (0.0006) [2023-03-07 00:46:55,460][81400] Updated weights for policy 0, policy_version 74870 (0.0006) [2023-03-07 00:46:56,214][81400] Updated weights for policy 0, policy_version 74880 (0.0006) [2023-03-07 00:46:56,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13141.3, 300 sec: 13138.4). Total num frames: 76677120. Throughput: 0: 13137.9. Samples: 76662371. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:46:56,237][81074] Avg episode reward: [(0, '2668.086')] [2023-03-07 00:46:56,994][81400] Updated weights for policy 0, policy_version 74890 (0.0006) [2023-03-07 00:46:57,776][81400] Updated weights for policy 0, policy_version 74900 (0.0006) [2023-03-07 00:46:58,558][81400] Updated weights for policy 0, policy_version 74910 (0.0005) [2023-03-07 00:46:59,341][81400] Updated weights for policy 0, policy_version 74920 (0.0006) [2023-03-07 00:47:00,115][81400] Updated weights for policy 0, policy_version 74930 (0.0006) [2023-03-07 00:47:00,906][81400] Updated weights for policy 0, policy_version 74940 (0.0006) [2023-03-07 00:47:01,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13138.4). Total num frames: 76742656. Throughput: 0: 13140.8. Samples: 76741208. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:47:01,237][81074] Avg episode reward: [(0, '2585.395')] [2023-03-07 00:47:01,687][81400] Updated weights for policy 0, policy_version 74950 (0.0006) [2023-03-07 00:47:02,456][81400] Updated weights for policy 0, policy_version 74960 (0.0006) [2023-03-07 00:47:03,226][81400] Updated weights for policy 0, policy_version 74970 (0.0006) [2023-03-07 00:47:04,003][81400] Updated weights for policy 0, policy_version 74980 (0.0006) [2023-03-07 00:47:04,774][81400] Updated weights for policy 0, policy_version 74990 (0.0006) [2023-03-07 00:47:05,565][81400] Updated weights for policy 0, policy_version 75000 (0.0007) [2023-03-07 00:47:06,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13138.4). Total num frames: 76808192. Throughput: 0: 13142.1. Samples: 76780699. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:47:06,237][81074] Avg episode reward: [(0, '2440.922')] [2023-03-07 00:47:06,324][81400] Updated weights for policy 0, policy_version 75010 (0.0007) [2023-03-07 00:47:07,093][81400] Updated weights for policy 0, policy_version 75020 (0.0006) [2023-03-07 00:47:07,864][81400] Updated weights for policy 0, policy_version 75030 (0.0007) [2023-03-07 00:47:08,646][81400] Updated weights for policy 0, policy_version 75040 (0.0006) [2023-03-07 00:47:09,420][81400] Updated weights for policy 0, policy_version 75050 (0.0006) [2023-03-07 00:47:10,199][81400] Updated weights for policy 0, policy_version 75060 (0.0006) [2023-03-07 00:47:10,971][81400] Updated weights for policy 0, policy_version 75070 (0.0006) [2023-03-07 00:47:11,236][81074] Fps is (10 sec: 13209.8, 60 sec: 13158.4, 300 sec: 13141.9). Total num frames: 76874752. Throughput: 0: 13165.4. Samples: 76860242. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:47:11,237][81074] Avg episode reward: [(0, '2453.286')] [2023-03-07 00:47:11,752][81400] Updated weights for policy 0, policy_version 75080 (0.0005) [2023-03-07 00:47:12,542][81400] Updated weights for policy 0, policy_version 75090 (0.0006) [2023-03-07 00:47:13,321][81400] Updated weights for policy 0, policy_version 75100 (0.0007) [2023-03-07 00:47:14,100][81400] Updated weights for policy 0, policy_version 75110 (0.0005) [2023-03-07 00:47:14,877][81400] Updated weights for policy 0, policy_version 75120 (0.0006) [2023-03-07 00:47:15,657][81400] Updated weights for policy 0, policy_version 75130 (0.0005) [2023-03-07 00:47:16,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13141.9). Total num frames: 76940288. Throughput: 0: 13159.9. Samples: 76938967. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:47:16,237][81074] Avg episode reward: [(0, '2438.970')] [2023-03-07 00:47:16,464][81400] Updated weights for policy 0, policy_version 75140 (0.0006) [2023-03-07 00:47:17,241][81400] Updated weights for policy 0, policy_version 75150 (0.0007) [2023-03-07 00:47:18,015][81400] Updated weights for policy 0, policy_version 75160 (0.0006) [2023-03-07 00:47:18,812][81400] Updated weights for policy 0, policy_version 75170 (0.0006) [2023-03-07 00:47:19,588][81400] Updated weights for policy 0, policy_version 75180 (0.0006) [2023-03-07 00:47:20,366][81400] Updated weights for policy 0, policy_version 75190 (0.0006) [2023-03-07 00:47:21,149][81400] Updated weights for policy 0, policy_version 75200 (0.0005) [2023-03-07 00:47:21,236][81074] Fps is (10 sec: 13107.0, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 77005824. Throughput: 0: 13153.2. Samples: 76978082. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:47:21,237][81074] Avg episode reward: [(0, '2590.178')] [2023-03-07 00:47:21,927][81400] Updated weights for policy 0, policy_version 75210 (0.0007) [2023-03-07 00:47:22,699][81400] Updated weights for policy 0, policy_version 75220 (0.0005) [2023-03-07 00:47:23,487][81400] Updated weights for policy 0, policy_version 75230 (0.0006) [2023-03-07 00:47:24,265][81400] Updated weights for policy 0, policy_version 75240 (0.0006) [2023-03-07 00:47:25,024][81400] Updated weights for policy 0, policy_version 75250 (0.0007) [2023-03-07 00:47:25,811][81400] Updated weights for policy 0, policy_version 75260 (0.0005) [2023-03-07 00:47:26,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13141.9). Total num frames: 77071360. Throughput: 0: 13148.8. Samples: 77057137. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:47:26,237][81074] Avg episode reward: [(0, '2254.887')] [2023-03-07 00:47:26,602][81400] Updated weights for policy 0, policy_version 75270 (0.0006) [2023-03-07 00:47:27,390][81400] Updated weights for policy 0, policy_version 75280 (0.0006) [2023-03-07 00:47:28,163][81400] Updated weights for policy 0, policy_version 75290 (0.0006) [2023-03-07 00:47:28,935][81400] Updated weights for policy 0, policy_version 75300 (0.0006) [2023-03-07 00:47:29,728][81400] Updated weights for policy 0, policy_version 75310 (0.0007) [2023-03-07 00:47:30,504][81400] Updated weights for policy 0, policy_version 75320 (0.0006) [2023-03-07 00:47:31,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13138.4). Total num frames: 77136896. Throughput: 0: 13145.1. Samples: 77135664. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:47:31,237][81074] Avg episode reward: [(0, '2767.742')] [2023-03-07 00:47:31,295][81400] Updated weights for policy 0, policy_version 75330 (0.0007) [2023-03-07 00:47:32,058][81400] Updated weights for policy 0, policy_version 75340 (0.0006) [2023-03-07 00:47:32,842][81400] Updated weights for policy 0, policy_version 75350 (0.0006) [2023-03-07 00:47:33,629][81400] Updated weights for policy 0, policy_version 75360 (0.0006) [2023-03-07 00:47:34,388][81400] Updated weights for policy 0, policy_version 75370 (0.0006) [2023-03-07 00:47:35,173][81400] Updated weights for policy 0, policy_version 75380 (0.0006) [2023-03-07 00:47:35,945][81400] Updated weights for policy 0, policy_version 75390 (0.0006) [2023-03-07 00:47:36,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13138.4). Total num frames: 77202432. Throughput: 0: 13144.2. Samples: 77175001. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:47:36,237][81074] Avg episode reward: [(0, '2158.563')] [2023-03-07 00:47:36,721][81400] Updated weights for policy 0, policy_version 75400 (0.0006) [2023-03-07 00:47:37,497][81400] Updated weights for policy 0, policy_version 75410 (0.0005) [2023-03-07 00:47:38,248][81400] Updated weights for policy 0, policy_version 75420 (0.0006) [2023-03-07 00:47:39,049][81400] Updated weights for policy 0, policy_version 75430 (0.0007) [2023-03-07 00:47:39,837][81400] Updated weights for policy 0, policy_version 75440 (0.0006) [2023-03-07 00:47:40,595][81400] Updated weights for policy 0, policy_version 75450 (0.0005) [2023-03-07 00:47:41,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13141.9). Total num frames: 77268992. Throughput: 0: 13152.1. Samples: 77254217. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:47:41,237][81074] Avg episode reward: [(0, '2222.701')] [2023-03-07 00:47:41,369][81400] Updated weights for policy 0, policy_version 75460 (0.0006) [2023-03-07 00:47:42,145][81400] Updated weights for policy 0, policy_version 75470 (0.0006) [2023-03-07 00:47:42,914][81400] Updated weights for policy 0, policy_version 75480 (0.0006) [2023-03-07 00:47:43,697][81400] Updated weights for policy 0, policy_version 75490 (0.0006) [2023-03-07 00:47:44,473][81400] Updated weights for policy 0, policy_version 75500 (0.0005) [2023-03-07 00:47:45,257][81400] Updated weights for policy 0, policy_version 75510 (0.0007) [2023-03-07 00:47:46,045][81400] Updated weights for policy 0, policy_version 75520 (0.0006) [2023-03-07 00:47:46,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13141.9). Total num frames: 77334528. Throughput: 0: 13157.7. Samples: 77333305. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:47:46,237][81074] Avg episode reward: [(0, '2241.273')] [2023-03-07 00:47:46,821][81400] Updated weights for policy 0, policy_version 75530 (0.0007) [2023-03-07 00:47:47,603][81400] Updated weights for policy 0, policy_version 75540 (0.0006) [2023-03-07 00:47:48,366][81400] Updated weights for policy 0, policy_version 75550 (0.0006) [2023-03-07 00:47:49,134][81400] Updated weights for policy 0, policy_version 75560 (0.0007) [2023-03-07 00:47:49,901][81400] Updated weights for policy 0, policy_version 75570 (0.0005) [2023-03-07 00:47:50,677][81400] Updated weights for policy 0, policy_version 75580 (0.0006) [2023-03-07 00:47:51,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 77401088. Throughput: 0: 13162.3. Samples: 77373000. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:47:51,237][81074] Avg episode reward: [(0, '2411.641')] [2023-03-07 00:47:51,467][81400] Updated weights for policy 0, policy_version 75590 (0.0006) [2023-03-07 00:47:52,229][81400] Updated weights for policy 0, policy_version 75600 (0.0005) [2023-03-07 00:47:53,016][81400] Updated weights for policy 0, policy_version 75610 (0.0006) [2023-03-07 00:47:53,787][81400] Updated weights for policy 0, policy_version 75620 (0.0006) [2023-03-07 00:47:54,562][81400] Updated weights for policy 0, policy_version 75630 (0.0006) [2023-03-07 00:47:55,332][81400] Updated weights for policy 0, policy_version 75640 (0.0007) [2023-03-07 00:47:56,132][81400] Updated weights for policy 0, policy_version 75650 (0.0006) [2023-03-07 00:47:56,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 77466624. Throughput: 0: 13155.2. Samples: 77452231. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:47:56,237][81074] Avg episode reward: [(0, '2583.076')] [2023-03-07 00:47:56,242][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000075651_77466624.pth... [2023-03-07 00:47:56,277][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000072571_74312704.pth [2023-03-07 00:47:56,903][81400] Updated weights for policy 0, policy_version 75660 (0.0006) [2023-03-07 00:47:57,682][81400] Updated weights for policy 0, policy_version 75670 (0.0006) [2023-03-07 00:47:58,441][81400] Updated weights for policy 0, policy_version 75680 (0.0005) [2023-03-07 00:47:59,222][81400] Updated weights for policy 0, policy_version 75690 (0.0007) [2023-03-07 00:47:59,989][81400] Updated weights for policy 0, policy_version 75700 (0.0005) [2023-03-07 00:48:00,773][81400] Updated weights for policy 0, policy_version 75710 (0.0006) [2023-03-07 00:48:01,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13175.5, 300 sec: 13145.4). Total num frames: 77533184. Throughput: 0: 13169.0. Samples: 77531573. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:48:01,237][81074] Avg episode reward: [(0, '2338.476')] [2023-03-07 00:48:01,547][81400] Updated weights for policy 0, policy_version 75720 (0.0007) [2023-03-07 00:48:02,319][81400] Updated weights for policy 0, policy_version 75730 (0.0008) [2023-03-07 00:48:03,109][81400] Updated weights for policy 0, policy_version 75740 (0.0006) [2023-03-07 00:48:03,888][81400] Updated weights for policy 0, policy_version 75750 (0.0006) [2023-03-07 00:48:04,672][81400] Updated weights for policy 0, policy_version 75760 (0.0006) [2023-03-07 00:48:05,438][81400] Updated weights for policy 0, policy_version 75770 (0.0006) [2023-03-07 00:48:06,209][81400] Updated weights for policy 0, policy_version 75780 (0.0007) [2023-03-07 00:48:06,236][81074] Fps is (10 sec: 13209.8, 60 sec: 13175.5, 300 sec: 13145.4). Total num frames: 77598720. Throughput: 0: 13177.8. Samples: 77571082. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:48:06,237][81074] Avg episode reward: [(0, '2310.998')] [2023-03-07 00:48:06,986][81400] Updated weights for policy 0, policy_version 75790 (0.0006) [2023-03-07 00:48:07,771][81400] Updated weights for policy 0, policy_version 75800 (0.0006) [2023-03-07 00:48:08,529][81400] Updated weights for policy 0, policy_version 75810 (0.0006) [2023-03-07 00:48:09,301][81400] Updated weights for policy 0, policy_version 75820 (0.0006) [2023-03-07 00:48:10,041][81400] Updated weights for policy 0, policy_version 75830 (0.0006) [2023-03-07 00:48:10,825][81400] Updated weights for policy 0, policy_version 75840 (0.0006) [2023-03-07 00:48:11,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.4, 300 sec: 13152.3). Total num frames: 77665280. Throughput: 0: 13189.8. Samples: 77650677. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:48:11,237][81074] Avg episode reward: [(0, '2585.665')] [2023-03-07 00:48:11,601][81400] Updated weights for policy 0, policy_version 75850 (0.0007) [2023-03-07 00:48:12,361][81400] Updated weights for policy 0, policy_version 75860 (0.0006) [2023-03-07 00:48:13,134][81400] Updated weights for policy 0, policy_version 75870 (0.0006) [2023-03-07 00:48:13,913][81400] Updated weights for policy 0, policy_version 75880 (0.0006) [2023-03-07 00:48:14,688][81400] Updated weights for policy 0, policy_version 75890 (0.0006) [2023-03-07 00:48:15,464][81400] Updated weights for policy 0, policy_version 75900 (0.0008) [2023-03-07 00:48:16,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13175.5, 300 sec: 13148.9). Total num frames: 77730816. Throughput: 0: 13206.1. Samples: 77729941. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:48:16,237][81074] Avg episode reward: [(0, '2355.289')] [2023-03-07 00:48:16,245][81400] Updated weights for policy 0, policy_version 75910 (0.0006) [2023-03-07 00:48:17,011][81400] Updated weights for policy 0, policy_version 75920 (0.0006) [2023-03-07 00:48:17,797][81400] Updated weights for policy 0, policy_version 75930 (0.0005) [2023-03-07 00:48:18,590][81400] Updated weights for policy 0, policy_version 75940 (0.0006) [2023-03-07 00:48:19,373][81400] Updated weights for policy 0, policy_version 75950 (0.0006) [2023-03-07 00:48:20,151][81400] Updated weights for policy 0, policy_version 75960 (0.0006) [2023-03-07 00:48:20,932][81400] Updated weights for policy 0, policy_version 75970 (0.0006) [2023-03-07 00:48:21,236][81074] Fps is (10 sec: 13209.8, 60 sec: 13192.6, 300 sec: 13152.3). Total num frames: 77797376. Throughput: 0: 13207.2. Samples: 77769325. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:48:21,237][81074] Avg episode reward: [(0, '2513.804')] [2023-03-07 00:48:21,710][81400] Updated weights for policy 0, policy_version 75980 (0.0006) [2023-03-07 00:48:22,471][81400] Updated weights for policy 0, policy_version 75990 (0.0006) [2023-03-07 00:48:23,254][81400] Updated weights for policy 0, policy_version 76000 (0.0006) [2023-03-07 00:48:24,016][81400] Updated weights for policy 0, policy_version 76010 (0.0006) [2023-03-07 00:48:24,780][81400] Updated weights for policy 0, policy_version 76020 (0.0006) [2023-03-07 00:48:25,567][81400] Updated weights for policy 0, policy_version 76030 (0.0006) [2023-03-07 00:48:26,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13192.5, 300 sec: 13152.3). Total num frames: 77862912. Throughput: 0: 13211.4. Samples: 77848730. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:48:26,237][81074] Avg episode reward: [(0, '2355.539')] [2023-03-07 00:48:26,331][81400] Updated weights for policy 0, policy_version 76040 (0.0006) [2023-03-07 00:48:27,099][81400] Updated weights for policy 0, policy_version 76050 (0.0006) [2023-03-07 00:48:27,901][81400] Updated weights for policy 0, policy_version 76060 (0.0006) [2023-03-07 00:48:28,675][81400] Updated weights for policy 0, policy_version 76070 (0.0005) [2023-03-07 00:48:29,482][81400] Updated weights for policy 0, policy_version 76080 (0.0006) [2023-03-07 00:48:30,244][81400] Updated weights for policy 0, policy_version 76090 (0.0006) [2023-03-07 00:48:31,018][81400] Updated weights for policy 0, policy_version 76100 (0.0007) [2023-03-07 00:48:31,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13192.5, 300 sec: 13148.9). Total num frames: 77928448. Throughput: 0: 13202.7. Samples: 77927425. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:48:31,237][81074] Avg episode reward: [(0, '1968.609')] [2023-03-07 00:48:31,816][81400] Updated weights for policy 0, policy_version 76110 (0.0006) [2023-03-07 00:48:32,594][81400] Updated weights for policy 0, policy_version 76120 (0.0006) [2023-03-07 00:48:33,362][81400] Updated weights for policy 0, policy_version 76130 (0.0006) [2023-03-07 00:48:34,149][81400] Updated weights for policy 0, policy_version 76140 (0.0006) [2023-03-07 00:48:34,936][81400] Updated weights for policy 0, policy_version 76150 (0.0007) [2023-03-07 00:48:35,722][81400] Updated weights for policy 0, policy_version 76160 (0.0006) [2023-03-07 00:48:36,236][81074] Fps is (10 sec: 13107.0, 60 sec: 13192.5, 300 sec: 13148.8). Total num frames: 77993984. Throughput: 0: 13197.1. Samples: 77966873. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:48:36,237][81074] Avg episode reward: [(0, '2088.156')] [2023-03-07 00:48:36,477][81400] Updated weights for policy 0, policy_version 76170 (0.0005) [2023-03-07 00:48:37,264][81400] Updated weights for policy 0, policy_version 76180 (0.0005) [2023-03-07 00:48:38,030][81400] Updated weights for policy 0, policy_version 76190 (0.0006) [2023-03-07 00:48:38,817][81400] Updated weights for policy 0, policy_version 76200 (0.0006) [2023-03-07 00:48:39,590][81400] Updated weights for policy 0, policy_version 76210 (0.0006) [2023-03-07 00:48:40,354][81400] Updated weights for policy 0, policy_version 76220 (0.0006) [2023-03-07 00:48:41,130][81400] Updated weights for policy 0, policy_version 76230 (0.0006) [2023-03-07 00:48:41,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13192.6, 300 sec: 13152.3). Total num frames: 78060544. Throughput: 0: 13191.2. Samples: 78045834. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:48:41,237][81074] Avg episode reward: [(0, '2117.614')] [2023-03-07 00:48:41,918][81400] Updated weights for policy 0, policy_version 76240 (0.0006) [2023-03-07 00:48:42,689][81400] Updated weights for policy 0, policy_version 76250 (0.0006) [2023-03-07 00:48:43,464][81400] Updated weights for policy 0, policy_version 76260 (0.0007) [2023-03-07 00:48:44,236][81400] Updated weights for policy 0, policy_version 76270 (0.0006) [2023-03-07 00:48:45,005][81400] Updated weights for policy 0, policy_version 76280 (0.0007) [2023-03-07 00:48:45,781][81400] Updated weights for policy 0, policy_version 76290 (0.0006) [2023-03-07 00:48:45,837][81349] KL-divergence is very high: 40591.9766 [2023-03-07 00:48:46,236][81074] Fps is (10 sec: 13209.8, 60 sec: 13192.5, 300 sec: 13152.3). Total num frames: 78126080. Throughput: 0: 13195.0. Samples: 78125347. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:48:46,237][81074] Avg episode reward: [(0, '2383.267')] [2023-03-07 00:48:46,545][81349] KL-divergence is very high: 442.2243 [2023-03-07 00:48:46,552][81400] Updated weights for policy 0, policy_version 76300 (0.0005) [2023-03-07 00:48:47,319][81400] Updated weights for policy 0, policy_version 76310 (0.0006) [2023-03-07 00:48:48,112][81400] Updated weights for policy 0, policy_version 76320 (0.0006) [2023-03-07 00:48:48,870][81400] Updated weights for policy 0, policy_version 76330 (0.0005) [2023-03-07 00:48:49,655][81400] Updated weights for policy 0, policy_version 76340 (0.0006) [2023-03-07 00:48:50,421][81400] Updated weights for policy 0, policy_version 76350 (0.0006) [2023-03-07 00:48:51,193][81400] Updated weights for policy 0, policy_version 76360 (0.0006) [2023-03-07 00:48:51,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13192.5, 300 sec: 13155.8). Total num frames: 78192640. Throughput: 0: 13198.0. Samples: 78164994. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:48:51,237][81074] Avg episode reward: [(0, '1739.708')] [2023-03-07 00:48:51,966][81400] Updated weights for policy 0, policy_version 76370 (0.0006) [2023-03-07 00:48:52,738][81400] Updated weights for policy 0, policy_version 76380 (0.0006) [2023-03-07 00:48:53,500][81400] Updated weights for policy 0, policy_version 76390 (0.0006) [2023-03-07 00:48:54,276][81400] Updated weights for policy 0, policy_version 76400 (0.0006) [2023-03-07 00:48:55,042][81400] Updated weights for policy 0, policy_version 76410 (0.0006) [2023-03-07 00:48:55,814][81400] Updated weights for policy 0, policy_version 76420 (0.0006) [2023-03-07 00:48:56,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13209.6, 300 sec: 13155.8). Total num frames: 78259200. Throughput: 0: 13199.5. Samples: 78244655. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:48:56,237][81074] Avg episode reward: [(0, '1777.916')] [2023-03-07 00:48:56,576][81400] Updated weights for policy 0, policy_version 76430 (0.0006) [2023-03-07 00:48:57,365][81400] Updated weights for policy 0, policy_version 76440 (0.0006) [2023-03-07 00:48:58,121][81400] Updated weights for policy 0, policy_version 76450 (0.0006) [2023-03-07 00:48:58,912][81400] Updated weights for policy 0, policy_version 76460 (0.0007) [2023-03-07 00:48:59,678][81400] Updated weights for policy 0, policy_version 76470 (0.0006) [2023-03-07 00:49:00,446][81400] Updated weights for policy 0, policy_version 76480 (0.0006) [2023-03-07 00:49:01,208][81400] Updated weights for policy 0, policy_version 76490 (0.0006) [2023-03-07 00:49:01,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13209.6, 300 sec: 13162.7). Total num frames: 78325760. Throughput: 0: 13208.9. Samples: 78324339. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:49:01,237][81074] Avg episode reward: [(0, '1492.215')] [2023-03-07 00:49:01,985][81400] Updated weights for policy 0, policy_version 76500 (0.0005) [2023-03-07 00:49:02,777][81400] Updated weights for policy 0, policy_version 76510 (0.0005) [2023-03-07 00:49:03,544][81400] Updated weights for policy 0, policy_version 76520 (0.0007) [2023-03-07 00:49:04,320][81400] Updated weights for policy 0, policy_version 76530 (0.0006) [2023-03-07 00:49:05,108][81400] Updated weights for policy 0, policy_version 76540 (0.0007) [2023-03-07 00:49:05,881][81400] Updated weights for policy 0, policy_version 76550 (0.0005) [2023-03-07 00:49:06,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13209.6, 300 sec: 13159.3). Total num frames: 78391296. Throughput: 0: 13210.9. Samples: 78363818. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:49:06,237][81074] Avg episode reward: [(0, '1573.124')] [2023-03-07 00:49:06,662][81400] Updated weights for policy 0, policy_version 76560 (0.0006) [2023-03-07 00:49:07,436][81400] Updated weights for policy 0, policy_version 76570 (0.0006) [2023-03-07 00:49:08,213][81400] Updated weights for policy 0, policy_version 76580 (0.0007) [2023-03-07 00:49:08,985][81400] Updated weights for policy 0, policy_version 76590 (0.0006) [2023-03-07 00:49:09,750][81400] Updated weights for policy 0, policy_version 76600 (0.0006) [2023-03-07 00:49:10,549][81400] Updated weights for policy 0, policy_version 76610 (0.0006) [2023-03-07 00:49:11,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13209.6, 300 sec: 13162.7). Total num frames: 78457856. Throughput: 0: 13203.7. Samples: 78442895. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:49:11,237][81074] Avg episode reward: [(0, '1542.329')] [2023-03-07 00:49:11,323][81400] Updated weights for policy 0, policy_version 76620 (0.0006) [2023-03-07 00:49:12,094][81400] Updated weights for policy 0, policy_version 76630 (0.0005) [2023-03-07 00:49:12,842][81400] Updated weights for policy 0, policy_version 76640 (0.0007) [2023-03-07 00:49:13,629][81400] Updated weights for policy 0, policy_version 76650 (0.0006) [2023-03-07 00:49:14,402][81400] Updated weights for policy 0, policy_version 76660 (0.0006) [2023-03-07 00:49:15,189][81400] Updated weights for policy 0, policy_version 76670 (0.0005) [2023-03-07 00:49:15,961][81400] Updated weights for policy 0, policy_version 76680 (0.0006) [2023-03-07 00:49:16,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13209.6, 300 sec: 13162.7). Total num frames: 78523392. Throughput: 0: 13223.9. Samples: 78522503. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 00:49:16,237][81074] Avg episode reward: [(0, '1638.947')] [2023-03-07 00:49:16,724][81400] Updated weights for policy 0, policy_version 76690 (0.0006) [2023-03-07 00:49:17,477][81400] Updated weights for policy 0, policy_version 76700 (0.0006) [2023-03-07 00:49:18,239][81400] Updated weights for policy 0, policy_version 76710 (0.0005) [2023-03-07 00:49:19,021][81400] Updated weights for policy 0, policy_version 76720 (0.0007) [2023-03-07 00:49:19,795][81400] Updated weights for policy 0, policy_version 76730 (0.0007) [2023-03-07 00:49:20,538][81400] Updated weights for policy 0, policy_version 76740 (0.0006) [2023-03-07 00:49:21,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13226.7, 300 sec: 13166.2). Total num frames: 78590976. Throughput: 0: 13234.6. Samples: 78562429. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:49:21,237][81074] Avg episode reward: [(0, '1503.861')] [2023-03-07 00:49:21,308][81400] Updated weights for policy 0, policy_version 76750 (0.0005) [2023-03-07 00:49:22,056][81400] Updated weights for policy 0, policy_version 76760 (0.0005) [2023-03-07 00:49:22,831][81400] Updated weights for policy 0, policy_version 76770 (0.0006) [2023-03-07 00:49:23,581][81400] Updated weights for policy 0, policy_version 76780 (0.0006) [2023-03-07 00:49:24,342][81400] Updated weights for policy 0, policy_version 76790 (0.0007) [2023-03-07 00:49:25,117][81400] Updated weights for policy 0, policy_version 76800 (0.0007) [2023-03-07 00:49:25,878][81400] Updated weights for policy 0, policy_version 76810 (0.0005) [2023-03-07 00:49:26,236][81074] Fps is (10 sec: 13414.4, 60 sec: 13243.7, 300 sec: 13169.7). Total num frames: 78657536. Throughput: 0: 13272.6. Samples: 78643102. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:49:26,237][81074] Avg episode reward: [(0, '1199.915')] [2023-03-07 00:49:26,658][81400] Updated weights for policy 0, policy_version 76820 (0.0006) [2023-03-07 00:49:27,432][81400] Updated weights for policy 0, policy_version 76830 (0.0005) [2023-03-07 00:49:28,201][81400] Updated weights for policy 0, policy_version 76840 (0.0006) [2023-03-07 00:49:28,948][81400] Updated weights for policy 0, policy_version 76850 (0.0006) [2023-03-07 00:49:29,729][81400] Updated weights for policy 0, policy_version 76860 (0.0005) [2023-03-07 00:49:30,491][81400] Updated weights for policy 0, policy_version 76870 (0.0006) [2023-03-07 00:49:31,236][81074] Fps is (10 sec: 13311.8, 60 sec: 13260.8, 300 sec: 13173.1). Total num frames: 78724096. Throughput: 0: 13280.4. Samples: 78722965. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:49:31,237][81074] Avg episode reward: [(0, '1211.978')] [2023-03-07 00:49:31,271][81400] Updated weights for policy 0, policy_version 76880 (0.0006) [2023-03-07 00:49:32,024][81400] Updated weights for policy 0, policy_version 76890 (0.0007) [2023-03-07 00:49:32,805][81400] Updated weights for policy 0, policy_version 76900 (0.0006) [2023-03-07 00:49:33,573][81400] Updated weights for policy 0, policy_version 76910 (0.0006) [2023-03-07 00:49:34,337][81400] Updated weights for policy 0, policy_version 76920 (0.0006) [2023-03-07 00:49:35,135][81400] Updated weights for policy 0, policy_version 76930 (0.0006) [2023-03-07 00:49:35,895][81400] Updated weights for policy 0, policy_version 76940 (0.0005) [2023-03-07 00:49:36,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13277.9, 300 sec: 13176.6). Total num frames: 78790656. Throughput: 0: 13286.5. Samples: 78762885. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:49:36,237][81074] Avg episode reward: [(0, '1293.343')] [2023-03-07 00:49:36,661][81400] Updated weights for policy 0, policy_version 76950 (0.0005) [2023-03-07 00:49:37,436][81400] Updated weights for policy 0, policy_version 76960 (0.0005) [2023-03-07 00:49:38,197][81400] Updated weights for policy 0, policy_version 76970 (0.0006) [2023-03-07 00:49:38,962][81400] Updated weights for policy 0, policy_version 76980 (0.0006) [2023-03-07 00:49:39,726][81400] Updated weights for policy 0, policy_version 76990 (0.0005) [2023-03-07 00:49:40,500][81400] Updated weights for policy 0, policy_version 77000 (0.0006) [2023-03-07 00:49:41,236][81074] Fps is (10 sec: 13312.2, 60 sec: 13277.9, 300 sec: 13176.6). Total num frames: 78857216. Throughput: 0: 13294.2. Samples: 78842892. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:49:41,237][81074] Avg episode reward: [(0, '1295.157')] [2023-03-07 00:49:41,264][81400] Updated weights for policy 0, policy_version 77010 (0.0006) [2023-03-07 00:49:42,037][81400] Updated weights for policy 0, policy_version 77020 (0.0006) [2023-03-07 00:49:42,818][81400] Updated weights for policy 0, policy_version 77030 (0.0006) [2023-03-07 00:49:43,590][81400] Updated weights for policy 0, policy_version 77040 (0.0006) [2023-03-07 00:49:44,370][81400] Updated weights for policy 0, policy_version 77050 (0.0006) [2023-03-07 00:49:45,139][81400] Updated weights for policy 0, policy_version 77060 (0.0005) [2023-03-07 00:49:45,899][81400] Updated weights for policy 0, policy_version 77070 (0.0005) [2023-03-07 00:49:46,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13294.9, 300 sec: 13183.6). Total num frames: 78923776. Throughput: 0: 13288.7. Samples: 78922333. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:49:46,237][81074] Avg episode reward: [(0, '1280.769')] [2023-03-07 00:49:46,681][81400] Updated weights for policy 0, policy_version 77080 (0.0006) [2023-03-07 00:49:47,446][81400] Updated weights for policy 0, policy_version 77090 (0.0006) [2023-03-07 00:49:48,226][81400] Updated weights for policy 0, policy_version 77100 (0.0006) [2023-03-07 00:49:48,994][81400] Updated weights for policy 0, policy_version 77110 (0.0007) [2023-03-07 00:49:49,775][81400] Updated weights for policy 0, policy_version 77120 (0.0007) [2023-03-07 00:49:50,553][81400] Updated weights for policy 0, policy_version 77130 (0.0007) [2023-03-07 00:49:51,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13277.9, 300 sec: 13180.1). Total num frames: 78989312. Throughput: 0: 13296.3. Samples: 78962153. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:49:51,237][81074] Avg episode reward: [(0, '1446.964')] [2023-03-07 00:49:51,331][81400] Updated weights for policy 0, policy_version 77140 (0.0006) [2023-03-07 00:49:52,114][81400] Updated weights for policy 0, policy_version 77150 (0.0006) [2023-03-07 00:49:52,887][81400] Updated weights for policy 0, policy_version 77160 (0.0006) [2023-03-07 00:49:53,674][81400] Updated weights for policy 0, policy_version 77170 (0.0006) [2023-03-07 00:49:54,446][81400] Updated weights for policy 0, policy_version 77180 (0.0007) [2023-03-07 00:49:55,231][81400] Updated weights for policy 0, policy_version 77190 (0.0006) [2023-03-07 00:49:55,997][81400] Updated weights for policy 0, policy_version 77200 (0.0006) [2023-03-07 00:49:56,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13277.9, 300 sec: 13183.6). Total num frames: 79055872. Throughput: 0: 13290.1. Samples: 79040951. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:49:56,237][81074] Avg episode reward: [(0, '1632.898')] [2023-03-07 00:49:56,242][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000077203_79055872.pth... 
[2023-03-07 00:49:56,272][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000074110_75888640.pth [2023-03-07 00:49:56,780][81400] Updated weights for policy 0, policy_version 77210 (0.0006) [2023-03-07 00:49:57,550][81400] Updated weights for policy 0, policy_version 77220 (0.0006) [2023-03-07 00:49:58,338][81400] Updated weights for policy 0, policy_version 77230 (0.0006) [2023-03-07 00:49:59,115][81400] Updated weights for policy 0, policy_version 77240 (0.0007) [2023-03-07 00:49:59,880][81400] Updated weights for policy 0, policy_version 77250 (0.0005) [2023-03-07 00:50:00,652][81400] Updated weights for policy 0, policy_version 77260 (0.0006) [2023-03-07 00:50:01,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13260.8, 300 sec: 13187.0). Total num frames: 79121408. Throughput: 0: 13284.3. Samples: 79120295. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:50:01,237][81074] Avg episode reward: [(0, '1417.701')] [2023-03-07 00:50:01,439][81400] Updated weights for policy 0, policy_version 77270 (0.0006) [2023-03-07 00:50:02,213][81400] Updated weights for policy 0, policy_version 77280 (0.0006) [2023-03-07 00:50:02,985][81400] Updated weights for policy 0, policy_version 77290 (0.0006) [2023-03-07 00:50:03,746][81400] Updated weights for policy 0, policy_version 77300 (0.0006) [2023-03-07 00:50:04,523][81400] Updated weights for policy 0, policy_version 77310 (0.0005) [2023-03-07 00:50:05,294][81400] Updated weights for policy 0, policy_version 77320 (0.0006) [2023-03-07 00:50:06,067][81400] Updated weights for policy 0, policy_version 77330 (0.0005) [2023-03-07 00:50:06,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13277.9, 300 sec: 13187.0). Total num frames: 79187968. Throughput: 0: 13275.4. Samples: 79159823. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:50:06,237][81074] Avg episode reward: [(0, '2019.554')] [2023-03-07 00:50:06,841][81400] Updated weights for policy 0, policy_version 77340 (0.0007) [2023-03-07 00:50:07,607][81400] Updated weights for policy 0, policy_version 77350 (0.0007) [2023-03-07 00:50:08,397][81400] Updated weights for policy 0, policy_version 77360 (0.0006) [2023-03-07 00:50:09,164][81400] Updated weights for policy 0, policy_version 77370 (0.0006) [2023-03-07 00:50:09,932][81400] Updated weights for policy 0, policy_version 77380 (0.0007) [2023-03-07 00:50:10,720][81400] Updated weights for policy 0, policy_version 77390 (0.0007) [2023-03-07 00:50:11,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13260.8, 300 sec: 13187.0). Total num frames: 79253504. Throughput: 0: 13251.5. Samples: 79239419. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:50:11,237][81074] Avg episode reward: [(0, '1743.832')] [2023-03-07 00:50:11,496][81400] Updated weights for policy 0, policy_version 77400 (0.0007) [2023-03-07 00:50:12,263][81400] Updated weights for policy 0, policy_version 77410 (0.0006) [2023-03-07 00:50:13,051][81400] Updated weights for policy 0, policy_version 77420 (0.0006) [2023-03-07 00:50:13,826][81400] Updated weights for policy 0, policy_version 77430 (0.0007) [2023-03-07 00:50:14,601][81400] Updated weights for policy 0, policy_version 77440 (0.0006) [2023-03-07 00:50:15,350][81400] Updated weights for policy 0, policy_version 77450 (0.0007) [2023-03-07 00:50:16,122][81400] Updated weights for policy 0, policy_version 77460 (0.0006) [2023-03-07 00:50:16,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13277.9, 300 sec: 13190.5). Total num frames: 79320064. Throughput: 0: 13239.5. Samples: 79318742. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:50:16,237][81074] Avg episode reward: [(0, '1695.482')] [2023-03-07 00:50:16,905][81400] Updated weights for policy 0, policy_version 77470 (0.0006) [2023-03-07 00:50:17,676][81400] Updated weights for policy 0, policy_version 77480 (0.0006) [2023-03-07 00:50:18,453][81400] Updated weights for policy 0, policy_version 77490 (0.0007) [2023-03-07 00:50:19,220][81400] Updated weights for policy 0, policy_version 77500 (0.0006) [2023-03-07 00:50:19,988][81400] Updated weights for policy 0, policy_version 77510 (0.0006) [2023-03-07 00:50:20,774][81400] Updated weights for policy 0, policy_version 77520 (0.0005) [2023-03-07 00:50:21,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13194.0). Total num frames: 79386624. Throughput: 0: 13233.7. Samples: 79358402. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:50:21,237][81074] Avg episode reward: [(0, '1995.452')] [2023-03-07 00:50:21,545][81400] Updated weights for policy 0, policy_version 77530 (0.0007) [2023-03-07 00:50:22,330][81400] Updated weights for policy 0, policy_version 77540 (0.0006) [2023-03-07 00:50:23,093][81400] Updated weights for policy 0, policy_version 77550 (0.0006) [2023-03-07 00:50:23,851][81400] Updated weights for policy 0, policy_version 77560 (0.0006) [2023-03-07 00:50:24,620][81400] Updated weights for policy 0, policy_version 77570 (0.0007) [2023-03-07 00:50:25,380][81400] Updated weights for policy 0, policy_version 77580 (0.0007) [2023-03-07 00:50:26,178][81400] Updated weights for policy 0, policy_version 77590 (0.0006) [2023-03-07 00:50:26,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13243.7, 300 sec: 13190.5). Total num frames: 79452160. Throughput: 0: 13228.7. Samples: 79438186. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:50:26,248][81074] Avg episode reward: [(0, '1595.926')] [2023-03-07 00:50:26,938][81400] Updated weights for policy 0, policy_version 77600 (0.0006) [2023-03-07 00:50:27,410][81349] KL-divergence is very high: 115.6830 [2023-03-07 00:50:27,468][81349] KL-divergence is very high: 447.8007 [2023-03-07 00:50:27,722][81400] Updated weights for policy 0, policy_version 77610 (0.0005) [2023-03-07 00:50:28,490][81400] Updated weights for policy 0, policy_version 77620 (0.0006) [2023-03-07 00:50:29,253][81400] Updated weights for policy 0, policy_version 77630 (0.0006) [2023-03-07 00:50:30,025][81400] Updated weights for policy 0, policy_version 77640 (0.0006) [2023-03-07 00:50:30,786][81400] Updated weights for policy 0, policy_version 77650 (0.0006) [2023-03-07 00:50:31,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13243.8, 300 sec: 13194.0). Total num frames: 79518720. Throughput: 0: 13236.7. Samples: 79517983. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:50:31,237][81074] Avg episode reward: [(0, '1395.633')] [2023-03-07 00:50:31,553][81400] Updated weights for policy 0, policy_version 77660 (0.0007) [2023-03-07 00:50:32,328][81400] Updated weights for policy 0, policy_version 77670 (0.0006) [2023-03-07 00:50:33,090][81400] Updated weights for policy 0, policy_version 77680 (0.0005) [2023-03-07 00:50:33,869][81400] Updated weights for policy 0, policy_version 77690 (0.0006) [2023-03-07 00:50:34,647][81400] Updated weights for policy 0, policy_version 77700 (0.0006) [2023-03-07 00:50:34,998][81349] KL-divergence is very high: 691.6822 [2023-03-07 00:50:35,402][81400] Updated weights for policy 0, policy_version 77710 (0.0006) [2023-03-07 00:50:36,157][81400] Updated weights for policy 0, policy_version 77720 (0.0005) [2023-03-07 00:50:36,236][81074] Fps is (10 sec: 13414.6, 60 sec: 13260.8, 300 sec: 13200.9). Total num frames: 79586304. Throughput: 0: 13235.4. Samples: 79557743. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:50:36,237][81074] Avg episode reward: [(0, '1544.343')] [2023-03-07 00:50:36,937][81400] Updated weights for policy 0, policy_version 77730 (0.0006) [2023-03-07 00:50:37,713][81400] Updated weights for policy 0, policy_version 77740 (0.0005) [2023-03-07 00:50:38,485][81400] Updated weights for policy 0, policy_version 77750 (0.0005) [2023-03-07 00:50:39,275][81400] Updated weights for policy 0, policy_version 77760 (0.0006) [2023-03-07 00:50:40,060][81400] Updated weights for policy 0, policy_version 77770 (0.0006) [2023-03-07 00:50:40,814][81400] Updated weights for policy 0, policy_version 77780 (0.0006) [2023-03-07 00:50:41,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13243.7, 300 sec: 13200.9). Total num frames: 79651840. Throughput: 0: 13251.4. Samples: 79637266. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:50:41,237][81074] Avg episode reward: [(0, '1635.495')] [2023-03-07 00:50:41,593][81400] Updated weights for policy 0, policy_version 77790 (0.0006) [2023-03-07 00:50:42,362][81400] Updated weights for policy 0, policy_version 77800 (0.0006) [2023-03-07 00:50:43,139][81400] Updated weights for policy 0, policy_version 77810 (0.0006) [2023-03-07 00:50:43,901][81400] Updated weights for policy 0, policy_version 77820 (0.0006) [2023-03-07 00:50:44,671][81400] Updated weights for policy 0, policy_version 77830 (0.0005) [2023-03-07 00:50:45,444][81400] Updated weights for policy 0, policy_version 77840 (0.0006) [2023-03-07 00:50:45,744][81349] KL-divergence is very high: 62110944.0000 [2023-03-07 00:50:46,215][81400] Updated weights for policy 0, policy_version 77850 (0.0006) [2023-03-07 00:50:46,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13243.7, 300 sec: 13204.4). Total num frames: 79718400. Throughput: 0: 13259.1. Samples: 79716956. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:50:46,237][81074] Avg episode reward: [(0, '1348.536')] [2023-03-07 00:50:46,988][81400] Updated weights for policy 0, policy_version 77860 (0.0007) [2023-03-07 00:50:47,755][81400] Updated weights for policy 0, policy_version 77870 (0.0006) [2023-03-07 00:50:48,527][81400] Updated weights for policy 0, policy_version 77880 (0.0006) [2023-03-07 00:50:49,282][81400] Updated weights for policy 0, policy_version 77890 (0.0005) [2023-03-07 00:50:50,054][81400] Updated weights for policy 0, policy_version 77900 (0.0006) [2023-03-07 00:50:50,838][81400] Updated weights for policy 0, policy_version 77910 (0.0005) [2023-03-07 00:50:51,236][81074] Fps is (10 sec: 13312.2, 60 sec: 13260.8, 300 sec: 13207.9). Total num frames: 79784960. Throughput: 0: 13265.9. Samples: 79756785. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:50:51,237][81074] Avg episode reward: [(0, '1525.595')] [2023-03-07 00:50:51,607][81400] Updated weights for policy 0, policy_version 77920 (0.0007) [2023-03-07 00:50:52,381][81400] Updated weights for policy 0, policy_version 77930 (0.0006) [2023-03-07 00:50:53,142][81400] Updated weights for policy 0, policy_version 77940 (0.0006) [2023-03-07 00:50:53,906][81400] Updated weights for policy 0, policy_version 77950 (0.0006) [2023-03-07 00:50:54,666][81400] Updated weights for policy 0, policy_version 77960 (0.0005) [2023-03-07 00:50:55,446][81400] Updated weights for policy 0, policy_version 77970 (0.0006) [2023-03-07 00:50:56,214][81400] Updated weights for policy 0, policy_version 77980 (0.0007) [2023-03-07 00:50:56,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13260.8, 300 sec: 13211.3). Total num frames: 79851520. Throughput: 0: 13277.4. Samples: 79836903. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:50:56,237][81074] Avg episode reward: [(0, '1505.005')] [2023-03-07 00:50:57,004][81400] Updated weights for policy 0, policy_version 77990 (0.0007) [2023-03-07 00:50:57,771][81400] Updated weights for policy 0, policy_version 78000 (0.0006) [2023-03-07 00:50:58,546][81400] Updated weights for policy 0, policy_version 78010 (0.0006) [2023-03-07 00:50:59,317][81400] Updated weights for policy 0, policy_version 78020 (0.0006) [2023-03-07 00:51:00,077][81400] Updated weights for policy 0, policy_version 78030 (0.0006) [2023-03-07 00:51:00,862][81400] Updated weights for policy 0, policy_version 78040 (0.0006) [2023-03-07 00:51:01,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13260.8, 300 sec: 13211.3). Total num frames: 79917056. Throughput: 0: 13279.6. Samples: 79916322. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:51:01,237][81074] Avg episode reward: [(0, '1307.836')] [2023-03-07 00:51:01,614][81400] Updated weights for policy 0, policy_version 78050 (0.0007) [2023-03-07 00:51:02,396][81400] Updated weights for policy 0, policy_version 78060 (0.0006) [2023-03-07 00:51:03,163][81400] Updated weights for policy 0, policy_version 78070 (0.0006) [2023-03-07 00:51:03,938][81400] Updated weights for policy 0, policy_version 78080 (0.0006) [2023-03-07 00:51:04,708][81400] Updated weights for policy 0, policy_version 78090 (0.0006) [2023-03-07 00:51:05,485][81400] Updated weights for policy 0, policy_version 78100 (0.0006) [2023-03-07 00:51:06,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13214.8). Total num frames: 79983616. Throughput: 0: 13282.4. Samples: 79956111. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:51:06,237][81074] Avg episode reward: [(0, '1324.824')] [2023-03-07 00:51:06,240][81400] Updated weights for policy 0, policy_version 78110 (0.0006) [2023-03-07 00:51:06,783][81349] KL-divergence is very high: 101.0010 [2023-03-07 00:51:07,022][81400] Updated weights for policy 0, policy_version 78120 (0.0006) [2023-03-07 00:51:07,798][81400] Updated weights for policy 0, policy_version 78130 (0.0006) [2023-03-07 00:51:08,546][81400] Updated weights for policy 0, policy_version 78140 (0.0006) [2023-03-07 00:51:09,323][81400] Updated weights for policy 0, policy_version 78150 (0.0005) [2023-03-07 00:51:10,072][81400] Updated weights for policy 0, policy_version 78160 (0.0006) [2023-03-07 00:51:10,813][81400] Updated weights for policy 0, policy_version 78170 (0.0006) [2023-03-07 00:51:11,236][81074] Fps is (10 sec: 13414.5, 60 sec: 13294.9, 300 sec: 13221.8). Total num frames: 80051200. Throughput: 0: 13289.9. Samples: 80036230. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:51:11,237][81074] Avg episode reward: [(0, '1255.791')] [2023-03-07 00:51:11,589][81400] Updated weights for policy 0, policy_version 78180 (0.0006) [2023-03-07 00:51:12,383][81400] Updated weights for policy 0, policy_version 78190 (0.0006) [2023-03-07 00:51:13,161][81400] Updated weights for policy 0, policy_version 78200 (0.0006) [2023-03-07 00:51:13,934][81400] Updated weights for policy 0, policy_version 78210 (0.0006) [2023-03-07 00:51:14,690][81400] Updated weights for policy 0, policy_version 78220 (0.0005) [2023-03-07 00:51:15,453][81400] Updated weights for policy 0, policy_version 78230 (0.0006) [2023-03-07 00:51:16,213][81400] Updated weights for policy 0, policy_version 78240 (0.0005) [2023-03-07 00:51:16,236][81074] Fps is (10 sec: 13414.4, 60 sec: 13294.9, 300 sec: 13221.7). Total num frames: 80117760. Throughput: 0: 13293.0. Samples: 80116171. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:51:16,237][81074] Avg episode reward: [(0, '1451.851')] [2023-03-07 00:51:17,002][81400] Updated weights for policy 0, policy_version 78250 (0.0006) [2023-03-07 00:51:17,769][81400] Updated weights for policy 0, policy_version 78260 (0.0006) [2023-03-07 00:51:18,513][81400] Updated weights for policy 0, policy_version 78270 (0.0005) [2023-03-07 00:51:19,305][81400] Updated weights for policy 0, policy_version 78280 (0.0006) [2023-03-07 00:51:20,083][81400] Updated weights for policy 0, policy_version 78290 (0.0006) [2023-03-07 00:51:20,852][81400] Updated weights for policy 0, policy_version 78300 (0.0005) [2023-03-07 00:51:21,236][81074] Fps is (10 sec: 13312.1, 60 sec: 13294.9, 300 sec: 13228.7). Total num frames: 80184320. Throughput: 0: 13297.7. Samples: 80156138. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:51:21,237][81074] Avg episode reward: [(0, '1461.400')] [2023-03-07 00:51:21,599][81400] Updated weights for policy 0, policy_version 78310 (0.0006) [2023-03-07 00:51:22,379][81400] Updated weights for policy 0, policy_version 78320 (0.0006) [2023-03-07 00:51:23,149][81400] Updated weights for policy 0, policy_version 78330 (0.0006) [2023-03-07 00:51:23,908][81400] Updated weights for policy 0, policy_version 78340 (0.0006) [2023-03-07 00:51:24,682][81400] Updated weights for policy 0, policy_version 78350 (0.0006) [2023-03-07 00:51:25,453][81400] Updated weights for policy 0, policy_version 78360 (0.0006) [2023-03-07 00:51:26,221][81400] Updated weights for policy 0, policy_version 78370 (0.0005) [2023-03-07 00:51:26,236][81074] Fps is (10 sec: 13312.1, 60 sec: 13312.0, 300 sec: 13228.7). Total num frames: 80250880. Throughput: 0: 13304.1. Samples: 80235952. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:51:26,237][81074] Avg episode reward: [(0, '1510.235')] [2023-03-07 00:51:26,978][81349] KL-divergence is very high: 5627916.5000 [2023-03-07 00:51:26,986][81400] Updated weights for policy 0, policy_version 78380 (0.0006) [2023-03-07 00:51:27,760][81400] Updated weights for policy 0, policy_version 78390 (0.0006) [2023-03-07 00:51:28,524][81400] Updated weights for policy 0, policy_version 78400 (0.0006) [2023-03-07 00:51:29,304][81400] Updated weights for policy 0, policy_version 78410 (0.0006) [2023-03-07 00:51:30,050][81400] Updated weights for policy 0, policy_version 78420 (0.0006) [2023-03-07 00:51:30,832][81400] Updated weights for policy 0, policy_version 78430 (0.0006) [2023-03-07 00:51:31,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13312.0, 300 sec: 13232.2). Total num frames: 80317440. Throughput: 0: 13310.6. Samples: 80315932. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:51:31,237][81074] Avg episode reward: [(0, '1358.744')] [2023-03-07 00:51:31,607][81400] Updated weights for policy 0, policy_version 78440 (0.0006) [2023-03-07 00:51:32,375][81400] Updated weights for policy 0, policy_version 78450 (0.0005) [2023-03-07 00:51:33,145][81400] Updated weights for policy 0, policy_version 78460 (0.0006) [2023-03-07 00:51:33,917][81400] Updated weights for policy 0, policy_version 78470 (0.0005) [2023-03-07 00:51:34,689][81400] Updated weights for policy 0, policy_version 78480 (0.0007) [2023-03-07 00:51:35,464][81400] Updated weights for policy 0, policy_version 78490 (0.0006) [2023-03-07 00:51:36,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13277.9, 300 sec: 13232.2). Total num frames: 80382976. Throughput: 0: 13310.1. Samples: 80355739. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:51:36,237][81074] Avg episode reward: [(0, '1420.209')] [2023-03-07 00:51:36,244][81400] Updated weights for policy 0, policy_version 78500 (0.0008) [2023-03-07 00:51:37,005][81400] Updated weights for policy 0, policy_version 78510 (0.0006) [2023-03-07 00:51:37,792][81400] Updated weights for policy 0, policy_version 78520 (0.0008) [2023-03-07 00:51:38,557][81400] Updated weights for policy 0, policy_version 78530 (0.0007) [2023-03-07 00:51:39,335][81400] Updated weights for policy 0, policy_version 78540 (0.0005) [2023-03-07 00:51:40,083][81400] Updated weights for policy 0, policy_version 78550 (0.0006) [2023-03-07 00:51:40,860][81400] Updated weights for policy 0, policy_version 78560 (0.0007) [2023-03-07 00:51:41,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13295.0, 300 sec: 13235.6). Total num frames: 80449536. Throughput: 0: 13300.1. Samples: 80435408. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:51:41,237][81074] Avg episode reward: [(0, '1451.236')] [2023-03-07 00:51:41,622][81400] Updated weights for policy 0, policy_version 78570 (0.0005) [2023-03-07 00:51:42,394][81400] Updated weights for policy 0, policy_version 78580 (0.0006) [2023-03-07 00:51:43,195][81400] Updated weights for policy 0, policy_version 78590 (0.0006) [2023-03-07 00:51:43,954][81400] Updated weights for policy 0, policy_version 78600 (0.0005) [2023-03-07 00:51:44,730][81400] Updated weights for policy 0, policy_version 78610 (0.0005) [2023-03-07 00:51:45,501][81400] Updated weights for policy 0, policy_version 78620 (0.0006) [2023-03-07 00:51:46,236][81074] Fps is (10 sec: 13311.7, 60 sec: 13294.9, 300 sec: 13235.6). Total num frames: 80516096. Throughput: 0: 13301.0. Samples: 80514867. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:51:46,237][81074] Avg episode reward: [(0, '1306.751')] [2023-03-07 00:51:46,282][81400] Updated weights for policy 0, policy_version 78630 (0.0005) [2023-03-07 00:51:47,046][81400] Updated weights for policy 0, policy_version 78640 (0.0006) [2023-03-07 00:51:47,834][81400] Updated weights for policy 0, policy_version 78650 (0.0006) [2023-03-07 00:51:48,595][81400] Updated weights for policy 0, policy_version 78660 (0.0006) [2023-03-07 00:51:49,340][81400] Updated weights for policy 0, policy_version 78670 (0.0006) [2023-03-07 00:51:50,121][81400] Updated weights for policy 0, policy_version 78680 (0.0006) [2023-03-07 00:51:50,884][81400] Updated weights for policy 0, policy_version 78690 (0.0005) [2023-03-07 00:51:51,236][81074] Fps is (10 sec: 13311.7, 60 sec: 13294.9, 300 sec: 13239.1). Total num frames: 80582656. Throughput: 0: 13308.3. Samples: 80554985. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:51:51,237][81074] Avg episode reward: [(0, '1058.318')] [2023-03-07 00:51:51,646][81400] Updated weights for policy 0, policy_version 78700 (0.0006) [2023-03-07 00:51:52,419][81400] Updated weights for policy 0, policy_version 78710 (0.0005) [2023-03-07 00:51:53,174][81400] Updated weights for policy 0, policy_version 78720 (0.0006) [2023-03-07 00:51:53,946][81400] Updated weights for policy 0, policy_version 78730 (0.0006) [2023-03-07 00:51:54,718][81400] Updated weights for policy 0, policy_version 78740 (0.0006) [2023-03-07 00:51:55,490][81400] Updated weights for policy 0, policy_version 78750 (0.0005) [2023-03-07 00:51:56,236][81074] Fps is (10 sec: 13312.2, 60 sec: 13294.9, 300 sec: 13242.6). Total num frames: 80649216. Throughput: 0: 13305.2. Samples: 80634963. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:51:56,237][81074] Avg episode reward: [(0, '1201.818')] [2023-03-07 00:51:56,250][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000078760_80650240.pth... [2023-03-07 00:51:56,252][81400] Updated weights for policy 0, policy_version 78760 (0.0006) [2023-03-07 00:51:56,279][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000075651_77466624.pth [2023-03-07 00:51:57,019][81400] Updated weights for policy 0, policy_version 78770 (0.0006) [2023-03-07 00:51:57,797][81400] Updated weights for policy 0, policy_version 78780 (0.0006) [2023-03-07 00:51:58,555][81400] Updated weights for policy 0, policy_version 78790 (0.0007) [2023-03-07 00:51:59,327][81400] Updated weights for policy 0, policy_version 78800 (0.0005) [2023-03-07 00:52:00,090][81400] Updated weights for policy 0, policy_version 78810 (0.0006) [2023-03-07 00:52:00,857][81400] Updated weights for policy 0, policy_version 78820 (0.0006) [2023-03-07 00:52:01,236][81074] Fps is (10 sec: 13312.2, 60 sec: 13312.0, 300 sec: 13246.0). Total num frames: 80715776. Throughput: 0: 13305.4. Samples: 80714912. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:52:01,237][81074] Avg episode reward: [(0, '1108.070')] [2023-03-07 00:52:01,613][81400] Updated weights for policy 0, policy_version 78830 (0.0006) [2023-03-07 00:52:02,377][81400] Updated weights for policy 0, policy_version 78840 (0.0007) [2023-03-07 00:52:03,147][81400] Updated weights for policy 0, policy_version 78850 (0.0006) [2023-03-07 00:52:03,922][81400] Updated weights for policy 0, policy_version 78860 (0.0007) [2023-03-07 00:52:04,673][81400] Updated weights for policy 0, policy_version 78870 (0.0005) [2023-03-07 00:52:05,433][81400] Updated weights for policy 0, policy_version 78880 (0.0006) [2023-03-07 00:52:06,211][81400] Updated weights for policy 0, policy_version 78890 (0.0006) [2023-03-07 00:52:06,236][81074] Fps is (10 sec: 13414.4, 60 sec: 13329.1, 300 sec: 13249.5). Total num frames: 80783360. Throughput: 0: 13313.2. Samples: 80755234. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:52:06,237][81074] Avg episode reward: [(0, '1186.819')] [2023-03-07 00:52:06,989][81400] Updated weights for policy 0, policy_version 78900 (0.0006) [2023-03-07 00:52:07,751][81400] Updated weights for policy 0, policy_version 78910 (0.0006) [2023-03-07 00:52:08,528][81400] Updated weights for policy 0, policy_version 78920 (0.0006) [2023-03-07 00:52:09,302][81400] Updated weights for policy 0, policy_version 78930 (0.0006) [2023-03-07 00:52:10,056][81400] Updated weights for policy 0, policy_version 78940 (0.0007) [2023-03-07 00:52:10,831][81400] Updated weights for policy 0, policy_version 78950 (0.0006) [2023-03-07 00:52:11,236][81074] Fps is (10 sec: 13414.4, 60 sec: 13312.0, 300 sec: 13253.0). Total num frames: 80849920. Throughput: 0: 13314.0. Samples: 80835082. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:52:11,247][81074] Avg episode reward: [(0, '1168.967')] [2023-03-07 00:52:11,605][81400] Updated weights for policy 0, policy_version 78960 (0.0006) [2023-03-07 00:52:12,365][81400] Updated weights for policy 0, policy_version 78970 (0.0006) [2023-03-07 00:52:13,149][81400] Updated weights for policy 0, policy_version 78980 (0.0006) [2023-03-07 00:52:13,916][81400] Updated weights for policy 0, policy_version 78990 (0.0006) [2023-03-07 00:52:14,690][81400] Updated weights for policy 0, policy_version 79000 (0.0006) [2023-03-07 00:52:15,444][81400] Updated weights for policy 0, policy_version 79010 (0.0006) [2023-03-07 00:52:16,201][81400] Updated weights for policy 0, policy_version 79020 (0.0006) [2023-03-07 00:52:16,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13312.0, 300 sec: 13256.5). Total num frames: 80916480. Throughput: 0: 13314.5. Samples: 80915087. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:52:16,247][81074] Avg episode reward: [(0, '1197.557')] [2023-03-07 00:52:16,978][81400] Updated weights for policy 0, policy_version 79030 (0.0006) [2023-03-07 00:52:17,737][81400] Updated weights for policy 0, policy_version 79040 (0.0005) [2023-03-07 00:52:18,506][81400] Updated weights for policy 0, policy_version 79050 (0.0006) [2023-03-07 00:52:19,282][81400] Updated weights for policy 0, policy_version 79060 (0.0006) [2023-03-07 00:52:20,049][81400] Updated weights for policy 0, policy_version 79070 (0.0006) [2023-03-07 00:52:20,830][81400] Updated weights for policy 0, policy_version 79080 (0.0006) [2023-03-07 00:52:21,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13312.0, 300 sec: 13259.9). Total num frames: 80983040. Throughput: 0: 13320.4. Samples: 80955157. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:52:21,237][81074] Avg episode reward: [(0, '1104.206')] [2023-03-07 00:52:21,573][81400] Updated weights for policy 0, policy_version 79090 (0.0007) [2023-03-07 00:52:22,334][81400] Updated weights for policy 0, policy_version 79100 (0.0006) [2023-03-07 00:52:23,106][81400] Updated weights for policy 0, policy_version 79110 (0.0007) [2023-03-07 00:52:23,863][81400] Updated weights for policy 0, policy_version 79120 (0.0006) [2023-03-07 00:52:24,634][81400] Updated weights for policy 0, policy_version 79130 (0.0006) [2023-03-07 00:52:25,389][81400] Updated weights for policy 0, policy_version 79140 (0.0007) [2023-03-07 00:52:26,152][81400] Updated weights for policy 0, policy_version 79150 (0.0006) [2023-03-07 00:52:26,236][81074] Fps is (10 sec: 13414.4, 60 sec: 13329.1, 300 sec: 13266.9). Total num frames: 81050624. Throughput: 0: 13338.5. Samples: 81035644. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:52:26,237][81074] Avg episode reward: [(0, '1127.900')] [2023-03-07 00:52:26,922][81400] Updated weights for policy 0, policy_version 79160 (0.0007) [2023-03-07 00:52:27,682][81400] Updated weights for policy 0, policy_version 79170 (0.0007) [2023-03-07 00:52:28,466][81400] Updated weights for policy 0, policy_version 79180 (0.0005) [2023-03-07 00:52:29,229][81400] Updated weights for policy 0, policy_version 79190 (0.0007) [2023-03-07 00:52:29,990][81400] Updated weights for policy 0, policy_version 79200 (0.0006) [2023-03-07 00:52:30,766][81400] Updated weights for policy 0, policy_version 79210 (0.0006) [2023-03-07 00:52:31,236][81074] Fps is (10 sec: 13414.3, 60 sec: 13329.0, 300 sec: 13270.3). Total num frames: 81117184. Throughput: 0: 13353.9. Samples: 81115792. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:52:31,237][81074] Avg episode reward: [(0, '1149.171')] [2023-03-07 00:52:31,528][81400] Updated weights for policy 0, policy_version 79220 (0.0007) [2023-03-07 00:52:32,289][81400] Updated weights for policy 0, policy_version 79230 (0.0005) [2023-03-07 00:52:33,062][81400] Updated weights for policy 0, policy_version 79240 (0.0006) [2023-03-07 00:52:33,838][81400] Updated weights for policy 0, policy_version 79250 (0.0007) [2023-03-07 00:52:34,607][81400] Updated weights for policy 0, policy_version 79260 (0.0006) [2023-03-07 00:52:34,910][81349] KL-divergence is very high: 101.1116 [2023-03-07 00:52:35,380][81400] Updated weights for policy 0, policy_version 79270 (0.0006) [2023-03-07 00:52:36,142][81400] Updated weights for policy 0, policy_version 79280 (0.0006) [2023-03-07 00:52:36,236][81074] Fps is (10 sec: 13311.9, 60 sec: 13346.1, 300 sec: 13270.3). Total num frames: 81183744. Throughput: 0: 13348.4. Samples: 81155662. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:52:36,237][81074] Avg episode reward: [(0, '1164.621')] [2023-03-07 00:52:36,921][81400] Updated weights for policy 0, policy_version 79290 (0.0006) [2023-03-07 00:52:37,701][81400] Updated weights for policy 0, policy_version 79300 (0.0005) [2023-03-07 00:52:38,479][81400] Updated weights for policy 0, policy_version 79310 (0.0006) [2023-03-07 00:52:39,249][81400] Updated weights for policy 0, policy_version 79320 (0.0006) [2023-03-07 00:52:40,005][81400] Updated weights for policy 0, policy_version 79330 (0.0006) [2023-03-07 00:52:40,764][81400] Updated weights for policy 0, policy_version 79340 (0.0005) [2023-03-07 00:52:41,236][81074] Fps is (10 sec: 13312.3, 60 sec: 13346.1, 300 sec: 13273.8). Total num frames: 81250304. Throughput: 0: 13337.9. Samples: 81235167. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:52:41,237][81074] Avg episode reward: [(0, '1200.384')] [2023-03-07 00:52:41,539][81400] Updated weights for policy 0, policy_version 79350 (0.0006) [2023-03-07 00:52:42,326][81400] Updated weights for policy 0, policy_version 79360 (0.0006) [2023-03-07 00:52:43,080][81400] Updated weights for policy 0, policy_version 79370 (0.0006) [2023-03-07 00:52:43,850][81400] Updated weights for policy 0, policy_version 79380 (0.0006) [2023-03-07 00:52:44,620][81400] Updated weights for policy 0, policy_version 79390 (0.0006) [2023-03-07 00:52:45,371][81400] Updated weights for policy 0, policy_version 79400 (0.0007) [2023-03-07 00:52:46,149][81400] Updated weights for policy 0, policy_version 79410 (0.0006) [2023-03-07 00:52:46,236][81074] Fps is (10 sec: 13209.8, 60 sec: 13329.1, 300 sec: 13270.3). Total num frames: 81315840. Throughput: 0: 13338.8. Samples: 81315159. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:52:46,237][81074] Avg episode reward: [(0, '1097.712')] [2023-03-07 00:52:46,928][81400] Updated weights for policy 0, policy_version 79420 (0.0006) [2023-03-07 00:52:47,671][81400] Updated weights for policy 0, policy_version 79430 (0.0006) [2023-03-07 00:52:48,443][81400] Updated weights for policy 0, policy_version 79440 (0.0006) [2023-03-07 00:52:49,220][81400] Updated weights for policy 0, policy_version 79450 (0.0006) [2023-03-07 00:52:49,967][81400] Updated weights for policy 0, policy_version 79460 (0.0007) [2023-03-07 00:52:50,730][81400] Updated weights for policy 0, policy_version 79470 (0.0006) [2023-03-07 00:52:51,236][81074] Fps is (10 sec: 13311.9, 60 sec: 13346.2, 300 sec: 13277.3). Total num frames: 81383424. Throughput: 0: 13334.5. Samples: 81355284. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:52:51,237][81074] Avg episode reward: [(0, '1111.841')] [2023-03-07 00:52:51,494][81400] Updated weights for policy 0, policy_version 79480 (0.0005) [2023-03-07 00:52:52,256][81400] Updated weights for policy 0, policy_version 79490 (0.0006) [2023-03-07 00:52:53,035][81400] Updated weights for policy 0, policy_version 79500 (0.0006) [2023-03-07 00:52:53,806][81400] Updated weights for policy 0, policy_version 79510 (0.0006) [2023-03-07 00:52:54,567][81400] Updated weights for policy 0, policy_version 79520 (0.0006) [2023-03-07 00:52:55,350][81400] Updated weights for policy 0, policy_version 79530 (0.0006) [2023-03-07 00:52:56,111][81400] Updated weights for policy 0, policy_version 79540 (0.0006) [2023-03-07 00:52:56,236][81074] Fps is (10 sec: 13414.4, 60 sec: 13346.1, 300 sec: 13277.3). Total num frames: 81449984. Throughput: 0: 13347.3. Samples: 81435711. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:52:56,237][81074] Avg episode reward: [(0, '1196.013')] [2023-03-07 00:52:56,872][81400] Updated weights for policy 0, policy_version 79550 (0.0006) [2023-03-07 00:52:57,654][81400] Updated weights for policy 0, policy_version 79560 (0.0006) [2023-03-07 00:52:58,424][81400] Updated weights for policy 0, policy_version 79570 (0.0005) [2023-03-07 00:52:59,181][81400] Updated weights for policy 0, policy_version 79580 (0.0006) [2023-03-07 00:52:59,946][81400] Updated weights for policy 0, policy_version 79590 (0.0006) [2023-03-07 00:53:00,717][81400] Updated weights for policy 0, policy_version 79600 (0.0006) [2023-03-07 00:53:01,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13346.1, 300 sec: 13280.8). Total num frames: 81516544. Throughput: 0: 13340.3. Samples: 81515400. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:53:01,237][81074] Avg episode reward: [(0, '1115.766')] [2023-03-07 00:53:01,502][81400] Updated weights for policy 0, policy_version 79610 (0.0006) [2023-03-07 00:53:02,248][81400] Updated weights for policy 0, policy_version 79620 (0.0006) [2023-03-07 00:53:03,014][81400] Updated weights for policy 0, policy_version 79630 (0.0007) [2023-03-07 00:53:03,780][81400] Updated weights for policy 0, policy_version 79640 (0.0006) [2023-03-07 00:53:04,547][81400] Updated weights for policy 0, policy_version 79650 (0.0006) [2023-03-07 00:53:05,316][81400] Updated weights for policy 0, policy_version 79660 (0.0007) [2023-03-07 00:53:06,072][81400] Updated weights for policy 0, policy_version 79670 (0.0006) [2023-03-07 00:53:06,236][81074] Fps is (10 sec: 13414.3, 60 sec: 13346.1, 300 sec: 13284.2). Total num frames: 81584128. Throughput: 0: 13342.9. Samples: 81555587. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:53:06,237][81074] Avg episode reward: [(0, '1053.023')] [2023-03-07 00:53:06,853][81400] Updated weights for policy 0, policy_version 79680 (0.0006) [2023-03-07 00:53:07,613][81400] Updated weights for policy 0, policy_version 79690 (0.0006) [2023-03-07 00:53:08,381][81400] Updated weights for policy 0, policy_version 79700 (0.0007) [2023-03-07 00:53:09,126][81400] Updated weights for policy 0, policy_version 79710 (0.0006) [2023-03-07 00:53:09,909][81400] Updated weights for policy 0, policy_version 79720 (0.0006) [2023-03-07 00:53:10,677][81400] Updated weights for policy 0, policy_version 79730 (0.0005) [2023-03-07 00:53:11,236][81074] Fps is (10 sec: 13414.4, 60 sec: 13346.1, 300 sec: 13287.7). Total num frames: 81650688. Throughput: 0: 13338.1. Samples: 81635858. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:53:11,237][81074] Avg episode reward: [(0, '880.538')] [2023-03-07 00:53:11,434][81400] Updated weights for policy 0, policy_version 79740 (0.0006) [2023-03-07 00:53:12,188][81400] Updated weights for policy 0, policy_version 79750 (0.0006) [2023-03-07 00:53:12,972][81400] Updated weights for policy 0, policy_version 79760 (0.0006) [2023-03-07 00:53:13,746][81400] Updated weights for policy 0, policy_version 79770 (0.0007) [2023-03-07 00:53:14,499][81400] Updated weights for policy 0, policy_version 79780 (0.0005) [2023-03-07 00:53:15,263][81400] Updated weights for policy 0, policy_version 79790 (0.0005) [2023-03-07 00:53:16,011][81400] Updated weights for policy 0, policy_version 79800 (0.0005) [2023-03-07 00:53:16,236][81074] Fps is (10 sec: 13312.1, 60 sec: 13346.1, 300 sec: 13287.7). Total num frames: 81717248. Throughput: 0: 13345.2. Samples: 81716326. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:53:16,237][81074] Avg episode reward: [(0, '941.820')] [2023-03-07 00:53:16,778][81400] Updated weights for policy 0, policy_version 79810 (0.0007) [2023-03-07 00:53:17,544][81400] Updated weights for policy 0, policy_version 79820 (0.0007) [2023-03-07 00:53:18,326][81400] Updated weights for policy 0, policy_version 79830 (0.0005) [2023-03-07 00:53:19,074][81400] Updated weights for policy 0, policy_version 79840 (0.0005) [2023-03-07 00:53:19,841][81400] Updated weights for policy 0, policy_version 79850 (0.0006) [2023-03-07 00:53:20,618][81400] Updated weights for policy 0, policy_version 79860 (0.0006) [2023-03-07 00:53:21,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13346.1, 300 sec: 13291.2). Total num frames: 81783808. Throughput: 0: 13352.9. Samples: 81756539. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:53:21,237][81074] Avg episode reward: [(0, '945.935')] [2023-03-07 00:53:21,388][81400] Updated weights for policy 0, policy_version 79870 (0.0007) [2023-03-07 00:53:22,163][81400] Updated weights for policy 0, policy_version 79880 (0.0007) [2023-03-07 00:53:22,908][81400] Updated weights for policy 0, policy_version 79890 (0.0006) [2023-03-07 00:53:23,665][81400] Updated weights for policy 0, policy_version 79900 (0.0005) [2023-03-07 00:53:24,417][81400] Updated weights for policy 0, policy_version 79910 (0.0006) [2023-03-07 00:53:25,190][81400] Updated weights for policy 0, policy_version 79920 (0.0007) [2023-03-07 00:53:25,957][81400] Updated weights for policy 0, policy_version 79930 (0.0006) [2023-03-07 00:53:26,236][81074] Fps is (10 sec: 13414.4, 60 sec: 13346.2, 300 sec: 13298.1). Total num frames: 81851392. Throughput: 0: 13373.7. Samples: 81836985. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:53:26,237][81074] Avg episode reward: [(0, '840.538')] [2023-03-07 00:53:26,730][81400] Updated weights for policy 0, policy_version 79940 (0.0006) [2023-03-07 00:53:27,499][81400] Updated weights for policy 0, policy_version 79950 (0.0006) [2023-03-07 00:53:28,254][81400] Updated weights for policy 0, policy_version 79960 (0.0006) [2023-03-07 00:53:29,009][81400] Updated weights for policy 0, policy_version 79970 (0.0006) [2023-03-07 00:53:29,778][81400] Updated weights for policy 0, policy_version 79980 (0.0006) [2023-03-07 00:53:30,532][81400] Updated weights for policy 0, policy_version 79990 (0.0005) [2023-03-07 00:53:31,236][81074] Fps is (10 sec: 13516.9, 60 sec: 13363.2, 300 sec: 13305.1). Total num frames: 81918976. Throughput: 0: 13384.4. Samples: 81917458. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:53:31,237][81074] Avg episode reward: [(0, '860.743')] [2023-03-07 00:53:31,289][81400] Updated weights for policy 0, policy_version 80000 (0.0005) [2023-03-07 00:53:32,063][81400] Updated weights for policy 0, policy_version 80010 (0.0005) [2023-03-07 00:53:32,812][81400] Updated weights for policy 0, policy_version 80020 (0.0005) [2023-03-07 00:53:33,581][81400] Updated weights for policy 0, policy_version 80030 (0.0006) [2023-03-07 00:53:34,350][81400] Updated weights for policy 0, policy_version 80040 (0.0006) [2023-03-07 00:53:35,124][81400] Updated weights for policy 0, policy_version 80050 (0.0006) [2023-03-07 00:53:35,902][81400] Updated weights for policy 0, policy_version 80060 (0.0005) [2023-03-07 00:53:36,236][81074] Fps is (10 sec: 13414.4, 60 sec: 13363.2, 300 sec: 13305.1). Total num frames: 81985536. Throughput: 0: 13387.6. Samples: 81957725. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:53:36,237][81074] Avg episode reward: [(0, '972.120')] [2023-03-07 00:53:36,682][81400] Updated weights for policy 0, policy_version 80070 (0.0005) [2023-03-07 00:53:37,447][81400] Updated weights for policy 0, policy_version 80080 (0.0005) [2023-03-07 00:53:38,223][81400] Updated weights for policy 0, policy_version 80090 (0.0007) [2023-03-07 00:53:38,986][81400] Updated weights for policy 0, policy_version 80100 (0.0006) [2023-03-07 00:53:39,771][81400] Updated weights for policy 0, policy_version 80110 (0.0006) [2023-03-07 00:53:40,533][81400] Updated weights for policy 0, policy_version 80120 (0.0005) [2023-03-07 00:53:41,236][81074] Fps is (10 sec: 13311.9, 60 sec: 13363.2, 300 sec: 13308.5). Total num frames: 82052096. Throughput: 0: 13366.3. Samples: 82037193. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:53:41,237][81074] Avg episode reward: [(0, '1051.433')] [2023-03-07 00:53:41,305][81400] Updated weights for policy 0, policy_version 80130 (0.0007) [2023-03-07 00:53:42,076][81400] Updated weights for policy 0, policy_version 80140 (0.0006) [2023-03-07 00:53:42,830][81400] Updated weights for policy 0, policy_version 80150 (0.0006) [2023-03-07 00:53:43,606][81400] Updated weights for policy 0, policy_version 80160 (0.0007) [2023-03-07 00:53:44,373][81400] Updated weights for policy 0, policy_version 80170 (0.0006) [2023-03-07 00:53:45,147][81400] Updated weights for policy 0, policy_version 80180 (0.0006) [2023-03-07 00:53:45,930][81400] Updated weights for policy 0, policy_version 80190 (0.0007) [2023-03-07 00:53:46,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13363.2, 300 sec: 13305.1). Total num frames: 82117632. Throughput: 0: 13366.0. Samples: 82116868. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:53:46,247][81074] Avg episode reward: [(0, '1047.211')] [2023-03-07 00:53:46,703][81400] Updated weights for policy 0, policy_version 80200 (0.0007) [2023-03-07 00:53:47,498][81400] Updated weights for policy 0, policy_version 80210 (0.0006) [2023-03-07 00:53:48,251][81400] Updated weights for policy 0, policy_version 80220 (0.0006) [2023-03-07 00:53:49,023][81400] Updated weights for policy 0, policy_version 80230 (0.0006) [2023-03-07 00:53:49,805][81400] Updated weights for policy 0, policy_version 80240 (0.0006) [2023-03-07 00:53:50,578][81400] Updated weights for policy 0, policy_version 80250 (0.0005) [2023-03-07 00:53:51,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13346.1, 300 sec: 13305.1). Total num frames: 82184192. Throughput: 0: 13353.8. Samples: 82156508. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:53:51,247][81074] Avg episode reward: [(0, '1179.496')] [2023-03-07 00:53:51,325][81400] Updated weights for policy 0, policy_version 80260 (0.0006) [2023-03-07 00:53:52,090][81400] Updated weights for policy 0, policy_version 80270 (0.0006) [2023-03-07 00:53:52,858][81400] Updated weights for policy 0, policy_version 80280 (0.0006) [2023-03-07 00:53:53,615][81400] Updated weights for policy 0, policy_version 80290 (0.0006) [2023-03-07 00:53:54,403][81400] Updated weights for policy 0, policy_version 80300 (0.0005) [2023-03-07 00:53:55,170][81400] Updated weights for policy 0, policy_version 80310 (0.0006) [2023-03-07 00:53:55,950][81400] Updated weights for policy 0, policy_version 80320 (0.0006) [2023-03-07 00:53:56,236][81074] Fps is (10 sec: 13311.9, 60 sec: 13346.1, 300 sec: 13305.1). Total num frames: 82250752. Throughput: 0: 13351.1. Samples: 82236658. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:53:56,247][81074] Avg episode reward: [(0, '1254.320')] [2023-03-07 00:53:56,254][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000080323_82250752.pth... 
[2023-03-07 00:53:56,294][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000077203_79055872.pth [2023-03-07 00:53:56,716][81400] Updated weights for policy 0, policy_version 80330 (0.0006) [2023-03-07 00:53:57,486][81400] Updated weights for policy 0, policy_version 80340 (0.0005) [2023-03-07 00:53:58,276][81400] Updated weights for policy 0, policy_version 80350 (0.0007) [2023-03-07 00:53:59,046][81400] Updated weights for policy 0, policy_version 80360 (0.0006) [2023-03-07 00:53:59,814][81400] Updated weights for policy 0, policy_version 80370 (0.0006) [2023-03-07 00:54:00,588][81400] Updated weights for policy 0, policy_version 80380 (0.0006) [2023-03-07 00:54:01,236][81074] Fps is (10 sec: 13311.9, 60 sec: 13346.1, 300 sec: 13308.5). Total num frames: 82317312. Throughput: 0: 13329.0. Samples: 82316130. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:54:01,237][81074] Avg episode reward: [(0, '1113.277')] [2023-03-07 00:54:01,360][81400] Updated weights for policy 0, policy_version 80390 (0.0006) [2023-03-07 00:54:02,099][81400] Updated weights for policy 0, policy_version 80400 (0.0006) [2023-03-07 00:54:02,877][81400] Updated weights for policy 0, policy_version 80410 (0.0005) [2023-03-07 00:54:03,080][81349] KL-divergence is very high: 223.3106 [2023-03-07 00:54:03,640][81400] Updated weights for policy 0, policy_version 80420 (0.0005) [2023-03-07 00:54:04,401][81400] Updated weights for policy 0, policy_version 80430 (0.0006) [2023-03-07 00:54:05,180][81400] Updated weights for policy 0, policy_version 80440 (0.0006) [2023-03-07 00:54:05,929][81400] Updated weights for policy 0, policy_version 80450 (0.0005) [2023-03-07 00:54:06,236][81074] Fps is (10 sec: 13414.5, 60 sec: 13346.1, 300 sec: 13312.0). Total num frames: 82384896. Throughput: 0: 13331.1. Samples: 82356438. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:54:06,247][81074] Avg episode reward: [(0, '1102.890')] [2023-03-07 00:54:06,704][81400] Updated weights for policy 0, policy_version 80460 (0.0006) [2023-03-07 00:54:07,465][81400] Updated weights for policy 0, policy_version 80470 (0.0006) [2023-03-07 00:54:08,232][81400] Updated weights for policy 0, policy_version 80480 (0.0006) [2023-03-07 00:54:09,005][81400] Updated weights for policy 0, policy_version 80490 (0.0006) [2023-03-07 00:54:09,762][81400] Updated weights for policy 0, policy_version 80500 (0.0006) [2023-03-07 00:54:10,549][81400] Updated weights for policy 0, policy_version 80510 (0.0005) [2023-03-07 00:54:11,236][81074] Fps is (10 sec: 13414.5, 60 sec: 13346.2, 300 sec: 13315.5). Total num frames: 82451456. Throughput: 0: 13324.0. Samples: 82436566. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:54:11,237][81074] Avg episode reward: [(0, '1050.257')] [2023-03-07 00:54:11,322][81400] Updated weights for policy 0, policy_version 80520 (0.0006) [2023-03-07 00:54:12,073][81400] Updated weights for policy 0, policy_version 80530 (0.0007) [2023-03-07 00:54:12,850][81400] Updated weights for policy 0, policy_version 80540 (0.0006) [2023-03-07 00:54:13,625][81400] Updated weights for policy 0, policy_version 80550 (0.0006) [2023-03-07 00:54:14,390][81400] Updated weights for policy 0, policy_version 80560 (0.0006) [2023-03-07 00:54:14,698][81349] KL-divergence is very high: 1503.3031 [2023-03-07 00:54:15,158][81400] Updated weights for policy 0, policy_version 80570 (0.0005) [2023-03-07 00:54:15,914][81400] Updated weights for policy 0, policy_version 80580 (0.0006) [2023-03-07 00:54:16,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13346.1, 300 sec: 13312.0). Total num frames: 82518016. Throughput: 0: 13310.9. Samples: 82516449. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:54:16,237][81074] Avg episode reward: [(0, '1149.119')] [2023-03-07 00:54:16,681][81400] Updated weights for policy 0, policy_version 80590 (0.0006) [2023-03-07 00:54:17,454][81400] Updated weights for policy 0, policy_version 80600 (0.0006) [2023-03-07 00:54:18,213][81400] Updated weights for policy 0, policy_version 80610 (0.0006) [2023-03-07 00:54:18,981][81400] Updated weights for policy 0, policy_version 80620 (0.0007) [2023-03-07 00:54:19,752][81400] Updated weights for policy 0, policy_version 80630 (0.0006) [2023-03-07 00:54:20,514][81400] Updated weights for policy 0, policy_version 80640 (0.0006) [2023-03-07 00:54:21,236][81074] Fps is (10 sec: 13311.7, 60 sec: 13346.1, 300 sec: 13312.0). Total num frames: 82584576. Throughput: 0: 13308.7. Samples: 82556619. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:54:21,237][81074] Avg episode reward: [(0, '936.001')] [2023-03-07 00:54:21,272][81400] Updated weights for policy 0, policy_version 80650 (0.0005) [2023-03-07 00:54:22,059][81400] Updated weights for policy 0, policy_version 80660 (0.0006) [2023-03-07 00:54:22,824][81400] Updated weights for policy 0, policy_version 80670 (0.0006) [2023-03-07 00:54:23,576][81400] Updated weights for policy 0, policy_version 80680 (0.0006) [2023-03-07 00:54:24,338][81400] Updated weights for policy 0, policy_version 80690 (0.0007) [2023-03-07 00:54:25,116][81400] Updated weights for policy 0, policy_version 80700 (0.0005) [2023-03-07 00:54:25,872][81400] Updated weights for policy 0, policy_version 80710 (0.0005) [2023-03-07 00:54:26,236][81074] Fps is (10 sec: 13311.9, 60 sec: 13329.0, 300 sec: 13312.0). Total num frames: 82651136. Throughput: 0: 13320.6. Samples: 82636623. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:54:26,237][81074] Avg episode reward: [(0, '952.262')] [2023-03-07 00:54:26,648][81400] Updated weights for policy 0, policy_version 80720 (0.0005) [2023-03-07 00:54:27,409][81400] Updated weights for policy 0, policy_version 80730 (0.0006) [2023-03-07 00:54:28,180][81400] Updated weights for policy 0, policy_version 80740 (0.0006) [2023-03-07 00:54:28,950][81400] Updated weights for policy 0, policy_version 80750 (0.0006) [2023-03-07 00:54:29,734][81400] Updated weights for policy 0, policy_version 80760 (0.0007) [2023-03-07 00:54:30,503][81400] Updated weights for policy 0, policy_version 80770 (0.0006) [2023-03-07 00:54:31,236][81074] Fps is (10 sec: 13312.1, 60 sec: 13312.0, 300 sec: 13312.0). Total num frames: 82717696. Throughput: 0: 13328.2. Samples: 82716636. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:54:31,237][81074] Avg episode reward: [(0, '1122.046')] [2023-03-07 00:54:31,262][81400] Updated weights for policy 0, policy_version 80780 (0.0006) [2023-03-07 00:54:32,021][81400] Updated weights for policy 0, policy_version 80790 (0.0006) [2023-03-07 00:54:32,788][81400] Updated weights for policy 0, policy_version 80800 (0.0007) [2023-03-07 00:54:33,558][81400] Updated weights for policy 0, policy_version 80810 (0.0005) [2023-03-07 00:54:34,324][81400] Updated weights for policy 0, policy_version 80820 (0.0006) [2023-03-07 00:54:35,090][81400] Updated weights for policy 0, policy_version 80830 (0.0006) [2023-03-07 00:54:35,865][81400] Updated weights for policy 0, policy_version 80840 (0.0007) [2023-03-07 00:54:36,236][81074] Fps is (10 sec: 13312.1, 60 sec: 13312.0, 300 sec: 13312.0). Total num frames: 82784256. Throughput: 0: 13340.7. Samples: 82756839. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:54:36,237][81074] Avg episode reward: [(0, '1088.861')] [2023-03-07 00:54:36,639][81400] Updated weights for policy 0, policy_version 80850 (0.0007) [2023-03-07 00:54:37,423][81400] Updated weights for policy 0, policy_version 80860 (0.0007) [2023-03-07 00:54:38,170][81400] Updated weights for policy 0, policy_version 80870 (0.0005) [2023-03-07 00:54:38,914][81400] Updated weights for policy 0, policy_version 80880 (0.0006) [2023-03-07 00:54:39,715][81400] Updated weights for policy 0, policy_version 80890 (0.0006) [2023-03-07 00:54:40,488][81400] Updated weights for policy 0, policy_version 80900 (0.0006) [2023-03-07 00:54:41,236][81074] Fps is (10 sec: 13312.2, 60 sec: 13312.0, 300 sec: 13312.0). Total num frames: 82850816. Throughput: 0: 13332.2. Samples: 82836603. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:54:41,237][81074] Avg episode reward: [(0, '1196.121')] [2023-03-07 00:54:41,241][81400] Updated weights for policy 0, policy_version 80910 (0.0006) [2023-03-07 00:54:42,023][81400] Updated weights for policy 0, policy_version 80920 (0.0006) [2023-03-07 00:54:42,770][81400] Updated weights for policy 0, policy_version 80930 (0.0006) [2023-03-07 00:54:43,562][81400] Updated weights for policy 0, policy_version 80940 (0.0006) [2023-03-07 00:54:44,291][81400] Updated weights for policy 0, policy_version 80950 (0.0005) [2023-03-07 00:54:45,048][81400] Updated weights for policy 0, policy_version 80960 (0.0005) [2023-03-07 00:54:45,824][81400] Updated weights for policy 0, policy_version 80970 (0.0007) [2023-03-07 00:54:46,236][81074] Fps is (10 sec: 13414.4, 60 sec: 13346.1, 300 sec: 13318.9). Total num frames: 82918400. Throughput: 0: 13353.7. Samples: 82917044. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:54:46,237][81074] Avg episode reward: [(0, '1063.113')] [2023-03-07 00:54:46,597][81400] Updated weights for policy 0, policy_version 80980 (0.0006) [2023-03-07 00:54:47,358][81400] Updated weights for policy 0, policy_version 80990 (0.0005) [2023-03-07 00:54:48,130][81400] Updated weights for policy 0, policy_version 81000 (0.0005) [2023-03-07 00:54:48,897][81400] Updated weights for policy 0, policy_version 81010 (0.0006) [2023-03-07 00:54:49,668][81400] Updated weights for policy 0, policy_version 81020 (0.0006) [2023-03-07 00:54:50,436][81400] Updated weights for policy 0, policy_version 81030 (0.0006) [2023-03-07 00:54:51,213][81400] Updated weights for policy 0, policy_version 81040 (0.0005) [2023-03-07 00:54:51,236][81074] Fps is (10 sec: 13414.2, 60 sec: 13346.1, 300 sec: 13318.9). Total num frames: 82984960. Throughput: 0: 13346.8. Samples: 82957042. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:54:51,237][81074] Avg episode reward: [(0, '1160.576')] [2023-03-07 00:54:51,985][81400] Updated weights for policy 0, policy_version 81050 (0.0006) [2023-03-07 00:54:52,761][81400] Updated weights for policy 0, policy_version 81060 (0.0005) [2023-03-07 00:54:53,545][81400] Updated weights for policy 0, policy_version 81070 (0.0005) [2023-03-07 00:54:54,307][81400] Updated weights for policy 0, policy_version 81080 (0.0006) [2023-03-07 00:54:55,072][81400] Updated weights for policy 0, policy_version 81090 (0.0005) [2023-03-07 00:54:55,813][81400] Updated weights for policy 0, policy_version 81100 (0.0005) [2023-03-07 00:54:56,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13346.2, 300 sec: 13322.4). Total num frames: 83051520. Throughput: 0: 13336.8. Samples: 83036722. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:54:56,237][81074] Avg episode reward: [(0, '1220.722')] [2023-03-07 00:54:56,603][81400] Updated weights for policy 0, policy_version 81110 (0.0006) [2023-03-07 00:54:57,361][81400] Updated weights for policy 0, policy_version 81120 (0.0006) [2023-03-07 00:54:58,129][81400] Updated weights for policy 0, policy_version 81130 (0.0006) [2023-03-07 00:54:58,910][81400] Updated weights for policy 0, policy_version 81140 (0.0006) [2023-03-07 00:54:59,693][81400] Updated weights for policy 0, policy_version 81150 (0.0007) [2023-03-07 00:55:00,470][81400] Updated weights for policy 0, policy_version 81160 (0.0005) [2023-03-07 00:55:01,232][81400] Updated weights for policy 0, policy_version 81170 (0.0006) [2023-03-07 00:55:01,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13346.1, 300 sec: 13322.4). Total num frames: 83118080. Throughput: 0: 13330.8. Samples: 83116334. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:55:01,237][81074] Avg episode reward: [(0, '1058.285')] [2023-03-07 00:55:01,997][81400] Updated weights for policy 0, policy_version 81180 (0.0006) [2023-03-07 00:55:02,777][81400] Updated weights for policy 0, policy_version 81190 (0.0006) [2023-03-07 00:55:03,540][81400] Updated weights for policy 0, policy_version 81200 (0.0006) [2023-03-07 00:55:04,309][81400] Updated weights for policy 0, policy_version 81210 (0.0006) [2023-03-07 00:55:05,076][81400] Updated weights for policy 0, policy_version 81220 (0.0007) [2023-03-07 00:55:05,856][81400] Updated weights for policy 0, policy_version 81230 (0.0006) [2023-03-07 00:55:06,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13312.0, 300 sec: 13322.4). Total num frames: 83183616. Throughput: 0: 13327.5. Samples: 83156358. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:55:06,237][81074] Avg episode reward: [(0, '1206.013')] [2023-03-07 00:55:06,618][81400] Updated weights for policy 0, policy_version 81240 (0.0005) [2023-03-07 00:55:07,417][81400] Updated weights for policy 0, policy_version 81250 (0.0007) [2023-03-07 00:55:08,182][81400] Updated weights for policy 0, policy_version 81260 (0.0005) [2023-03-07 00:55:08,490][81349] KL-divergence is very high: 111.6272 [2023-03-07 00:55:08,968][81400] Updated weights for policy 0, policy_version 81270 (0.0006) [2023-03-07 00:55:09,728][81400] Updated weights for policy 0, policy_version 81280 (0.0006) [2023-03-07 00:55:10,489][81400] Updated weights for policy 0, policy_version 81290 (0.0007) [2023-03-07 00:55:11,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13312.0, 300 sec: 13322.4). Total num frames: 83250176. Throughput: 0: 13314.3. Samples: 83235765. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:55:11,237][81074] Avg episode reward: [(0, '1254.810')] [2023-03-07 00:55:11,250][81400] Updated weights for policy 0, policy_version 81300 (0.0006) [2023-03-07 00:55:12,010][81400] Updated weights for policy 0, policy_version 81310 (0.0007) [2023-03-07 00:55:12,778][81400] Updated weights for policy 0, policy_version 81320 (0.0006) [2023-03-07 00:55:13,533][81400] Updated weights for policy 0, policy_version 81330 (0.0007) [2023-03-07 00:55:14,310][81400] Updated weights for policy 0, policy_version 81340 (0.0006) [2023-03-07 00:55:15,080][81400] Updated weights for policy 0, policy_version 81350 (0.0007) [2023-03-07 00:55:15,867][81400] Updated weights for policy 0, policy_version 81360 (0.0006) [2023-03-07 00:55:16,236][81074] Fps is (10 sec: 13414.5, 60 sec: 13329.1, 300 sec: 13325.9). Total num frames: 83317760. Throughput: 0: 13319.4. Samples: 83316010. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:55:16,237][81074] Avg episode reward: [(0, '1098.362')] [2023-03-07 00:55:16,633][81400] Updated weights for policy 0, policy_version 81370 (0.0006) [2023-03-07 00:55:17,384][81400] Updated weights for policy 0, policy_version 81380 (0.0007) [2023-03-07 00:55:18,157][81400] Updated weights for policy 0, policy_version 81390 (0.0006) [2023-03-07 00:55:18,920][81400] Updated weights for policy 0, policy_version 81400 (0.0006) [2023-03-07 00:55:19,697][81400] Updated weights for policy 0, policy_version 81410 (0.0007) [2023-03-07 00:55:20,476][81400] Updated weights for policy 0, policy_version 81420 (0.0006) [2023-03-07 00:55:21,224][81400] Updated weights for policy 0, policy_version 81430 (0.0006) [2023-03-07 00:55:21,236][81074] Fps is (10 sec: 13414.6, 60 sec: 13329.1, 300 sec: 13329.4). Total num frames: 83384320. Throughput: 0: 13317.5. Samples: 83356128. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:55:21,237][81074] Avg episode reward: [(0, '1162.422')] [2023-03-07 00:55:21,993][81400] Updated weights for policy 0, policy_version 81440 (0.0007) [2023-03-07 00:55:22,750][81400] Updated weights for policy 0, policy_version 81450 (0.0005) [2023-03-07 00:55:23,515][81400] Updated weights for policy 0, policy_version 81460 (0.0006) [2023-03-07 00:55:24,289][81400] Updated weights for policy 0, policy_version 81470 (0.0007) [2023-03-07 00:55:25,076][81400] Updated weights for policy 0, policy_version 81480 (0.0006) [2023-03-07 00:55:25,849][81400] Updated weights for policy 0, policy_version 81490 (0.0006) [2023-03-07 00:55:26,236][81074] Fps is (10 sec: 13311.8, 60 sec: 13329.1, 300 sec: 13329.4). Total num frames: 83450880. Throughput: 0: 13320.5. Samples: 83436028. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:55:26,237][81074] Avg episode reward: [(0, '1197.382')] [2023-03-07 00:55:26,609][81400] Updated weights for policy 0, policy_version 81500 (0.0006) [2023-03-07 00:55:27,378][81400] Updated weights for policy 0, policy_version 81510 (0.0007) [2023-03-07 00:55:28,160][81400] Updated weights for policy 0, policy_version 81520 (0.0006) [2023-03-07 00:55:28,922][81400] Updated weights for policy 0, policy_version 81530 (0.0005) [2023-03-07 00:55:29,694][81400] Updated weights for policy 0, policy_version 81540 (0.0005) [2023-03-07 00:55:30,456][81400] Updated weights for policy 0, policy_version 81550 (0.0006) [2023-03-07 00:55:31,233][81400] Updated weights for policy 0, policy_version 81560 (0.0006) [2023-03-07 00:55:31,236][81074] Fps is (10 sec: 13311.9, 60 sec: 13329.1, 300 sec: 13325.9). Total num frames: 83517440. Throughput: 0: 13306.7. Samples: 83515849. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:55:31,237][81074] Avg episode reward: [(0, '1224.046')] [2023-03-07 00:55:32,001][81400] Updated weights for policy 0, policy_version 81570 (0.0006) [2023-03-07 00:55:32,778][81400] Updated weights for policy 0, policy_version 81580 (0.0006) [2023-03-07 00:55:33,541][81400] Updated weights for policy 0, policy_version 81590 (0.0006) [2023-03-07 00:55:34,314][81400] Updated weights for policy 0, policy_version 81600 (0.0008) [2023-03-07 00:55:35,086][81400] Updated weights for policy 0, policy_version 81610 (0.0006) [2023-03-07 00:55:35,861][81400] Updated weights for policy 0, policy_version 81620 (0.0006) [2023-03-07 00:55:36,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13312.0, 300 sec: 13325.9). Total num frames: 83582976. Throughput: 0: 13301.6. Samples: 83555615. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:55:36,237][81074] Avg episode reward: [(0, '1179.228')] [2023-03-07 00:55:36,642][81400] Updated weights for policy 0, policy_version 81630 (0.0005) [2023-03-07 00:55:37,410][81400] Updated weights for policy 0, policy_version 81640 (0.0007) [2023-03-07 00:55:38,184][81400] Updated weights for policy 0, policy_version 81650 (0.0006) [2023-03-07 00:55:38,959][81400] Updated weights for policy 0, policy_version 81660 (0.0006) [2023-03-07 00:55:39,743][81400] Updated weights for policy 0, policy_version 81670 (0.0008) [2023-03-07 00:55:40,509][81400] Updated weights for policy 0, policy_version 81680 (0.0007) [2023-03-07 00:55:41,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13312.0, 300 sec: 13325.9). Total num frames: 83649536. Throughput: 0: 13295.8. Samples: 83635033. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:55:41,237][81074] Avg episode reward: [(0, '1174.890')] [2023-03-07 00:55:41,285][81400] Updated weights for policy 0, policy_version 81690 (0.0006) [2023-03-07 00:55:42,056][81400] Updated weights for policy 0, policy_version 81700 (0.0006) [2023-03-07 00:55:42,838][81400] Updated weights for policy 0, policy_version 81710 (0.0006) [2023-03-07 00:55:43,609][81400] Updated weights for policy 0, policy_version 81720 (0.0006) [2023-03-07 00:55:44,359][81400] Updated weights for policy 0, policy_version 81730 (0.0005) [2023-03-07 00:55:45,144][81400] Updated weights for policy 0, policy_version 81740 (0.0006) [2023-03-07 00:55:45,892][81400] Updated weights for policy 0, policy_version 81750 (0.0006) [2023-03-07 00:55:46,236][81074] Fps is (10 sec: 13312.1, 60 sec: 13294.9, 300 sec: 13325.9). Total num frames: 83716096. Throughput: 0: 13296.7. Samples: 83714684. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:55:46,237][81074] Avg episode reward: [(0, '1190.275')] [2023-03-07 00:55:46,670][81400] Updated weights for policy 0, policy_version 81760 (0.0006) [2023-03-07 00:55:47,449][81400] Updated weights for policy 0, policy_version 81770 (0.0005) [2023-03-07 00:55:48,214][81400] Updated weights for policy 0, policy_version 81780 (0.0005) [2023-03-07 00:55:48,986][81400] Updated weights for policy 0, policy_version 81790 (0.0005) [2023-03-07 00:55:49,756][81400] Updated weights for policy 0, policy_version 81800 (0.0005) [2023-03-07 00:55:50,522][81400] Updated weights for policy 0, policy_version 81810 (0.0006) [2023-03-07 00:55:51,236][81074] Fps is (10 sec: 13312.1, 60 sec: 13295.0, 300 sec: 13325.9). Total num frames: 83782656. Throughput: 0: 13297.3. Samples: 83754736. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:55:51,237][81074] Avg episode reward: [(0, '1415.823')] [2023-03-07 00:55:51,304][81400] Updated weights for policy 0, policy_version 81820 (0.0005) [2023-03-07 00:55:52,061][81400] Updated weights for policy 0, policy_version 81830 (0.0006) [2023-03-07 00:55:52,837][81400] Updated weights for policy 0, policy_version 81840 (0.0005) [2023-03-07 00:55:53,586][81400] Updated weights for policy 0, policy_version 81850 (0.0006) [2023-03-07 00:55:54,358][81400] Updated weights for policy 0, policy_version 81860 (0.0006) [2023-03-07 00:55:55,124][81400] Updated weights for policy 0, policy_version 81870 (0.0006) [2023-03-07 00:55:55,901][81400] Updated weights for policy 0, policy_version 81880 (0.0006) [2023-03-07 00:55:56,236][81074] Fps is (10 sec: 13311.7, 60 sec: 13294.9, 300 sec: 13329.4). Total num frames: 83849216. Throughput: 0: 13307.9. Samples: 83834621. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:55:56,237][81074] Avg episode reward: [(0, '1296.903')] [2023-03-07 00:55:56,243][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000081884_83849216.pth... [2023-03-07 00:55:56,275][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000078760_80650240.pth [2023-03-07 00:55:56,674][81400] Updated weights for policy 0, policy_version 81890 (0.0006) [2023-03-07 00:55:57,422][81400] Updated weights for policy 0, policy_version 81900 (0.0005) [2023-03-07 00:55:58,199][81400] Updated weights for policy 0, policy_version 81910 (0.0006) [2023-03-07 00:55:58,962][81400] Updated weights for policy 0, policy_version 81920 (0.0005) [2023-03-07 00:55:59,727][81400] Updated weights for policy 0, policy_version 81930 (0.0005) [2023-03-07 00:56:00,482][81400] Updated weights for policy 0, policy_version 81940 (0.0006) [2023-03-07 00:56:01,236][81074] Fps is (10 sec: 13311.8, 60 sec: 13294.9, 300 sec: 13329.4). Total num frames: 83915776. Throughput: 0: 13308.0. Samples: 83914870. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:56:01,237][81074] Avg episode reward: [(0, '1199.769')] [2023-03-07 00:56:01,257][81400] Updated weights for policy 0, policy_version 81950 (0.0006) [2023-03-07 00:56:02,035][81400] Updated weights for policy 0, policy_version 81960 (0.0006) [2023-03-07 00:56:02,815][81400] Updated weights for policy 0, policy_version 81970 (0.0006) [2023-03-07 00:56:03,580][81400] Updated weights for policy 0, policy_version 81980 (0.0006) [2023-03-07 00:56:04,367][81400] Updated weights for policy 0, policy_version 81990 (0.0006) [2023-03-07 00:56:05,138][81400] Updated weights for policy 0, policy_version 82000 (0.0006) [2023-03-07 00:56:05,894][81400] Updated weights for policy 0, policy_version 82010 (0.0006) [2023-03-07 00:56:06,236][81074] Fps is (10 sec: 13312.1, 60 sec: 13312.0, 300 sec: 13325.9). Total num frames: 83982336. Throughput: 0: 13296.5. Samples: 83954473. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:56:06,237][81074] Avg episode reward: [(0, '1509.052')] [2023-03-07 00:56:06,674][81400] Updated weights for policy 0, policy_version 82020 (0.0006) [2023-03-07 00:56:07,434][81400] Updated weights for policy 0, policy_version 82030 (0.0006) [2023-03-07 00:56:08,221][81400] Updated weights for policy 0, policy_version 82040 (0.0006) [2023-03-07 00:56:08,981][81400] Updated weights for policy 0, policy_version 82050 (0.0007) [2023-03-07 00:56:09,741][81400] Updated weights for policy 0, policy_version 82060 (0.0007) [2023-03-07 00:56:10,499][81400] Updated weights for policy 0, policy_version 82070 (0.0007) [2023-03-07 00:56:11,236][81074] Fps is (10 sec: 13311.9, 60 sec: 13312.0, 300 sec: 13325.9). Total num frames: 84048896. Throughput: 0: 13292.9. Samples: 84034206. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:56:11,237][81074] Avg episode reward: [(0, '1370.934')] [2023-03-07 00:56:11,270][81400] Updated weights for policy 0, policy_version 82080 (0.0006) [2023-03-07 00:56:12,051][81400] Updated weights for policy 0, policy_version 82090 (0.0007) [2023-03-07 00:56:12,823][81400] Updated weights for policy 0, policy_version 82100 (0.0006) [2023-03-07 00:56:13,589][81400] Updated weights for policy 0, policy_version 82110 (0.0006) [2023-03-07 00:56:14,346][81400] Updated weights for policy 0, policy_version 82120 (0.0006) [2023-03-07 00:56:15,145][81400] Updated weights for policy 0, policy_version 82130 (0.0006) [2023-03-07 00:56:15,902][81400] Updated weights for policy 0, policy_version 82140 (0.0005) [2023-03-07 00:56:16,236][81074] Fps is (10 sec: 13312.1, 60 sec: 13294.9, 300 sec: 13325.9). Total num frames: 84115456. Throughput: 0: 13294.5. Samples: 84114102. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:56:16,237][81074] Avg episode reward: [(0, '1237.357')] [2023-03-07 00:56:16,681][81400] Updated weights for policy 0, policy_version 82150 (0.0006) [2023-03-07 00:56:17,465][81400] Updated weights for policy 0, policy_version 82160 (0.0006) [2023-03-07 00:56:18,229][81400] Updated weights for policy 0, policy_version 82170 (0.0006) [2023-03-07 00:56:18,998][81400] Updated weights for policy 0, policy_version 82180 (0.0005) [2023-03-07 00:56:19,752][81400] Updated weights for policy 0, policy_version 82190 (0.0007) [2023-03-07 00:56:20,552][81400] Updated weights for policy 0, policy_version 82200 (0.0006) [2023-03-07 00:56:21,236][81074] Fps is (10 sec: 13312.2, 60 sec: 13294.9, 300 sec: 13325.9). Total num frames: 84182016. Throughput: 0: 13292.8. Samples: 84153789. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:56:21,237][81074] Avg episode reward: [(0, '1431.072')] [2023-03-07 00:56:21,307][81400] Updated weights for policy 0, policy_version 82210 (0.0006) [2023-03-07 00:56:22,076][81400] Updated weights for policy 0, policy_version 82220 (0.0007) [2023-03-07 00:56:22,841][81400] Updated weights for policy 0, policy_version 82230 (0.0007) [2023-03-07 00:56:23,613][81400] Updated weights for policy 0, policy_version 82240 (0.0006) [2023-03-07 00:56:24,383][81400] Updated weights for policy 0, policy_version 82250 (0.0006) [2023-03-07 00:56:25,138][81400] Updated weights for policy 0, policy_version 82260 (0.0006) [2023-03-07 00:56:25,915][81400] Updated weights for policy 0, policy_version 82270 (0.0006) [2023-03-07 00:56:26,236][81074] Fps is (10 sec: 13312.1, 60 sec: 13295.0, 300 sec: 13325.9). Total num frames: 84248576. Throughput: 0: 13301.8. Samples: 84233613. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:56:26,237][81074] Avg episode reward: [(0, '1176.275')] [2023-03-07 00:56:26,680][81400] Updated weights for policy 0, policy_version 82280 (0.0006) [2023-03-07 00:56:27,446][81400] Updated weights for policy 0, policy_version 82290 (0.0006) [2023-03-07 00:56:28,215][81400] Updated weights for policy 0, policy_version 82300 (0.0006) [2023-03-07 00:56:28,981][81400] Updated weights for policy 0, policy_version 82310 (0.0006) [2023-03-07 00:56:29,759][81400] Updated weights for policy 0, policy_version 82320 (0.0006) [2023-03-07 00:56:30,517][81400] Updated weights for policy 0, policy_version 82330 (0.0006) [2023-03-07 00:56:31,236][81074] Fps is (10 sec: 13311.8, 60 sec: 13294.9, 300 sec: 13329.3). Total num frames: 84315136. Throughput: 0: 13314.8. Samples: 84313854. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:56:31,237][81074] Avg episode reward: [(0, '1199.136')] [2023-03-07 00:56:31,285][81400] Updated weights for policy 0, policy_version 82340 (0.0006) [2023-03-07 00:56:32,061][81400] Updated weights for policy 0, policy_version 82350 (0.0006) [2023-03-07 00:56:32,832][81400] Updated weights for policy 0, policy_version 82360 (0.0006) [2023-03-07 00:56:33,589][81400] Updated weights for policy 0, policy_version 82370 (0.0006) [2023-03-07 00:56:34,385][81400] Updated weights for policy 0, policy_version 82380 (0.0006) [2023-03-07 00:56:35,144][81400] Updated weights for policy 0, policy_version 82390 (0.0006) [2023-03-07 00:56:35,923][81400] Updated weights for policy 0, policy_version 82400 (0.0006) [2023-03-07 00:56:36,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13312.0, 300 sec: 13329.4). Total num frames: 84381696. Throughput: 0: 13310.8. Samples: 84353721. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:56:36,237][81074] Avg episode reward: [(0, '1291.090')] [2023-03-07 00:56:36,700][81400] Updated weights for policy 0, policy_version 82410 (0.0007) [2023-03-07 00:56:37,477][81400] Updated weights for policy 0, policy_version 82420 (0.0007) [2023-03-07 00:56:38,248][81400] Updated weights for policy 0, policy_version 82430 (0.0006) [2023-03-07 00:56:39,022][81400] Updated weights for policy 0, policy_version 82440 (0.0005) [2023-03-07 00:56:39,815][81400] Updated weights for policy 0, policy_version 82450 (0.0007) [2023-03-07 00:56:40,577][81400] Updated weights for policy 0, policy_version 82460 (0.0005) [2023-03-07 00:56:41,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13294.9, 300 sec: 13325.9). Total num frames: 84447232. Throughput: 0: 13294.7. Samples: 84432880. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:56:41,237][81074] Avg episode reward: [(0, '1440.236')] [2023-03-07 00:56:41,343][81400] Updated weights for policy 0, policy_version 82470 (0.0006) [2023-03-07 00:56:42,119][81400] Updated weights for policy 0, policy_version 82480 (0.0007) [2023-03-07 00:56:42,889][81400] Updated weights for policy 0, policy_version 82490 (0.0006) [2023-03-07 00:56:43,668][81400] Updated weights for policy 0, policy_version 82500 (0.0006) [2023-03-07 00:56:44,410][81400] Updated weights for policy 0, policy_version 82510 (0.0005) [2023-03-07 00:56:45,202][81400] Updated weights for policy 0, policy_version 82520 (0.0006) [2023-03-07 00:56:45,993][81400] Updated weights for policy 0, policy_version 82530 (0.0006) [2023-03-07 00:56:46,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13294.9, 300 sec: 13325.9). Total num frames: 84513792. Throughput: 0: 13276.3. Samples: 84512302. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:56:46,237][81074] Avg episode reward: [(0, '1385.927')] [2023-03-07 00:56:46,743][81400] Updated weights for policy 0, policy_version 82540 (0.0006) [2023-03-07 00:56:47,529][81400] Updated weights for policy 0, policy_version 82550 (0.0006) [2023-03-07 00:56:48,297][81400] Updated weights for policy 0, policy_version 82560 (0.0006) [2023-03-07 00:56:49,069][81400] Updated weights for policy 0, policy_version 82570 (0.0006) [2023-03-07 00:56:49,828][81400] Updated weights for policy 0, policy_version 82580 (0.0006) [2023-03-07 00:56:50,591][81400] Updated weights for policy 0, policy_version 82590 (0.0006) [2023-03-07 00:56:51,236][81074] Fps is (10 sec: 13311.9, 60 sec: 13294.9, 300 sec: 13325.9). Total num frames: 84580352. Throughput: 0: 13283.2. Samples: 84552217. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:56:51,237][81074] Avg episode reward: [(0, '1259.251')] [2023-03-07 00:56:51,376][81400] Updated weights for policy 0, policy_version 82600 (0.0006) [2023-03-07 00:56:52,141][81400] Updated weights for policy 0, policy_version 82610 (0.0006) [2023-03-07 00:56:52,906][81400] Updated weights for policy 0, policy_version 82620 (0.0006) [2023-03-07 00:56:53,695][81400] Updated weights for policy 0, policy_version 82630 (0.0006) [2023-03-07 00:56:54,462][81400] Updated weights for policy 0, policy_version 82640 (0.0006) [2023-03-07 00:56:55,246][81400] Updated weights for policy 0, policy_version 82650 (0.0006) [2023-03-07 00:56:56,017][81400] Updated weights for policy 0, policy_version 82660 (0.0007) [2023-03-07 00:56:56,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13277.9, 300 sec: 13322.4). Total num frames: 84645888. Throughput: 0: 13278.6. Samples: 84631743. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:56:56,237][81074] Avg episode reward: [(0, '1558.986')] [2023-03-07 00:56:56,822][81400] Updated weights for policy 0, policy_version 82670 (0.0006) [2023-03-07 00:56:57,578][81400] Updated weights for policy 0, policy_version 82680 (0.0005) [2023-03-07 00:56:58,369][81400] Updated weights for policy 0, policy_version 82690 (0.0007) [2023-03-07 00:56:59,128][81400] Updated weights for policy 0, policy_version 82700 (0.0006) [2023-03-07 00:56:59,892][81400] Updated weights for policy 0, policy_version 82710 (0.0005) [2023-03-07 00:57:00,686][81400] Updated weights for policy 0, policy_version 82720 (0.0006) [2023-03-07 00:57:01,236][81074] Fps is (10 sec: 13209.9, 60 sec: 13277.9, 300 sec: 13318.9). Total num frames: 84712448. Throughput: 0: 13257.5. Samples: 84710687. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:57:01,237][81074] Avg episode reward: [(0, '1588.443')] [2023-03-07 00:57:01,464][81400] Updated weights for policy 0, policy_version 82730 (0.0006) [2023-03-07 00:57:02,233][81400] Updated weights for policy 0, policy_version 82740 (0.0006) [2023-03-07 00:57:03,008][81400] Updated weights for policy 0, policy_version 82750 (0.0006) [2023-03-07 00:57:03,768][81400] Updated weights for policy 0, policy_version 82760 (0.0007) [2023-03-07 00:57:04,543][81400] Updated weights for policy 0, policy_version 82770 (0.0006) [2023-03-07 00:57:05,347][81400] Updated weights for policy 0, policy_version 82780 (0.0006) [2023-03-07 00:57:06,109][81400] Updated weights for policy 0, policy_version 82790 (0.0006) [2023-03-07 00:57:06,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13260.8, 300 sec: 13315.5). Total num frames: 84777984. Throughput: 0: 13260.3. Samples: 84750505. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:57:06,237][81074] Avg episode reward: [(0, '1450.746')] [2023-03-07 00:57:06,892][81400] Updated weights for policy 0, policy_version 82800 (0.0006) [2023-03-07 00:57:07,679][81400] Updated weights for policy 0, policy_version 82810 (0.0005) [2023-03-07 00:57:08,462][81400] Updated weights for policy 0, policy_version 82820 (0.0006) [2023-03-07 00:57:09,225][81400] Updated weights for policy 0, policy_version 82830 (0.0006) [2023-03-07 00:57:09,990][81400] Updated weights for policy 0, policy_version 82840 (0.0006) [2023-03-07 00:57:10,765][81400] Updated weights for policy 0, policy_version 82850 (0.0005) [2023-03-07 00:57:11,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13260.8, 300 sec: 13315.5). Total num frames: 84844544. Throughput: 0: 13244.6. Samples: 84829621. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:57:11,247][81074] Avg episode reward: [(0, '1470.300')] [2023-03-07 00:57:11,532][81400] Updated weights for policy 0, policy_version 82860 (0.0005) [2023-03-07 00:57:12,314][81400] Updated weights for policy 0, policy_version 82870 (0.0006) [2023-03-07 00:57:13,083][81400] Updated weights for policy 0, policy_version 82880 (0.0006) [2023-03-07 00:57:13,852][81400] Updated weights for policy 0, policy_version 82890 (0.0005) [2023-03-07 00:57:14,636][81400] Updated weights for policy 0, policy_version 82900 (0.0006) [2023-03-07 00:57:15,397][81400] Updated weights for policy 0, policy_version 82910 (0.0005) [2023-03-07 00:57:16,178][81400] Updated weights for policy 0, policy_version 82920 (0.0006) [2023-03-07 00:57:16,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13312.0). Total num frames: 84910080. Throughput: 0: 13229.7. Samples: 84909188. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:57:16,248][81074] Avg episode reward: [(0, '1547.744')] [2023-03-07 00:57:16,944][81400] Updated weights for policy 0, policy_version 82930 (0.0005) [2023-03-07 00:57:17,729][81400] Updated weights for policy 0, policy_version 82940 (0.0006) [2023-03-07 00:57:18,504][81400] Updated weights for policy 0, policy_version 82950 (0.0006) [2023-03-07 00:57:19,269][81400] Updated weights for policy 0, policy_version 82960 (0.0006) [2023-03-07 00:57:20,033][81400] Updated weights for policy 0, policy_version 82970 (0.0006) [2023-03-07 00:57:20,808][81400] Updated weights for policy 0, policy_version 82980 (0.0006) [2023-03-07 00:57:21,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13243.7, 300 sec: 13308.5). Total num frames: 84976640. Throughput: 0: 13227.1. Samples: 84948941. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:57:21,247][81074] Avg episode reward: [(0, '1693.299')] [2023-03-07 00:57:21,575][81400] Updated weights for policy 0, policy_version 82990 (0.0006) [2023-03-07 00:57:22,360][81400] Updated weights for policy 0, policy_version 83000 (0.0007) [2023-03-07 00:57:23,124][81400] Updated weights for policy 0, policy_version 83010 (0.0006) [2023-03-07 00:57:23,905][81400] Updated weights for policy 0, policy_version 83020 (0.0006) [2023-03-07 00:57:24,678][81400] Updated weights for policy 0, policy_version 83030 (0.0006) [2023-03-07 00:57:25,472][81400] Updated weights for policy 0, policy_version 83040 (0.0006) [2023-03-07 00:57:26,233][81400] Updated weights for policy 0, policy_version 83050 (0.0006) [2023-03-07 00:57:26,236][81074] Fps is (10 sec: 13312.2, 60 sec: 13243.7, 300 sec: 13308.5). Total num frames: 85043200. Throughput: 0: 13232.2. Samples: 85028328. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:57:26,247][81074] Avg episode reward: [(0, '1433.221')] [2023-03-07 00:57:26,996][81400] Updated weights for policy 0, policy_version 83060 (0.0007) [2023-03-07 00:57:27,753][81400] Updated weights for policy 0, policy_version 83070 (0.0006) [2023-03-07 00:57:28,526][81400] Updated weights for policy 0, policy_version 83080 (0.0006) [2023-03-07 00:57:29,306][81400] Updated weights for policy 0, policy_version 83090 (0.0006) [2023-03-07 00:57:30,101][81400] Updated weights for policy 0, policy_version 83100 (0.0007) [2023-03-07 00:57:30,859][81400] Updated weights for policy 0, policy_version 83110 (0.0006) [2023-03-07 00:57:31,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13243.8, 300 sec: 13308.5). Total num frames: 85109760. Throughput: 0: 13238.4. Samples: 85108029. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:57:31,247][81074] Avg episode reward: [(0, '1655.591')] [2023-03-07 00:57:31,638][81400] Updated weights for policy 0, policy_version 83120 (0.0006) [2023-03-07 00:57:32,420][81400] Updated weights for policy 0, policy_version 83130 (0.0005) [2023-03-07 00:57:33,168][81400] Updated weights for policy 0, policy_version 83140 (0.0006) [2023-03-07 00:57:33,924][81400] Updated weights for policy 0, policy_version 83150 (0.0007) [2023-03-07 00:57:34,700][81400] Updated weights for policy 0, policy_version 83160 (0.0007) [2023-03-07 00:57:35,463][81400] Updated weights for policy 0, policy_version 83170 (0.0006) [2023-03-07 00:57:36,235][81400] Updated weights for policy 0, policy_version 83180 (0.0006) [2023-03-07 00:57:36,236][81074] Fps is (10 sec: 13312.1, 60 sec: 13243.7, 300 sec: 13308.5). Total num frames: 85176320. Throughput: 0: 13237.8. Samples: 85147915. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:57:36,246][81074] Avg episode reward: [(0, '1770.691')] [2023-03-07 00:57:37,025][81400] Updated weights for policy 0, policy_version 83190 (0.0006) [2023-03-07 00:57:37,800][81400] Updated weights for policy 0, policy_version 83200 (0.0006) [2023-03-07 00:57:38,554][81400] Updated weights for policy 0, policy_version 83210 (0.0006) [2023-03-07 00:57:39,338][81400] Updated weights for policy 0, policy_version 83220 (0.0006) [2023-03-07 00:57:40,096][81400] Updated weights for policy 0, policy_version 83230 (0.0006) [2023-03-07 00:57:40,869][81400] Updated weights for policy 0, policy_version 83240 (0.0006) [2023-03-07 00:57:41,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13243.7, 300 sec: 13308.5). Total num frames: 85241856. Throughput: 0: 13241.6. Samples: 85227616. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:57:41,237][81074] Avg episode reward: [(0, '1605.579')] [2023-03-07 00:57:41,654][81400] Updated weights for policy 0, policy_version 83250 (0.0006) [2023-03-07 00:57:42,433][81400] Updated weights for policy 0, policy_version 83260 (0.0005) [2023-03-07 00:57:43,222][81400] Updated weights for policy 0, policy_version 83270 (0.0006) [2023-03-07 00:57:43,994][81400] Updated weights for policy 0, policy_version 83280 (0.0005) [2023-03-07 00:57:44,747][81400] Updated weights for policy 0, policy_version 83290 (0.0006) [2023-03-07 00:57:45,531][81400] Updated weights for policy 0, policy_version 83300 (0.0006) [2023-03-07 00:57:46,236][81074] Fps is (10 sec: 13106.9, 60 sec: 13226.7, 300 sec: 13301.6). Total num frames: 85307392. Throughput: 0: 13245.8. Samples: 85306749. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:57:46,237][81074] Avg episode reward: [(0, '1771.635')] [2023-03-07 00:57:46,301][81400] Updated weights for policy 0, policy_version 83310 (0.0006) [2023-03-07 00:57:47,078][81400] Updated weights for policy 0, policy_version 83320 (0.0006) [2023-03-07 00:57:47,867][81400] Updated weights for policy 0, policy_version 83330 (0.0006) [2023-03-07 00:57:48,629][81400] Updated weights for policy 0, policy_version 83340 (0.0006) [2023-03-07 00:57:49,411][81400] Updated weights for policy 0, policy_version 83350 (0.0006) [2023-03-07 00:57:50,162][81400] Updated weights for policy 0, policy_version 83360 (0.0006) [2023-03-07 00:57:50,929][81400] Updated weights for policy 0, policy_version 83370 (0.0006) [2023-03-07 00:57:51,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13226.7, 300 sec: 13301.6). Total num frames: 85373952. Throughput: 0: 13243.1. Samples: 85346443. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:57:51,237][81074] Avg episode reward: [(0, '1712.914')] [2023-03-07 00:57:51,712][81400] Updated weights for policy 0, policy_version 83380 (0.0006) [2023-03-07 00:57:52,484][81400] Updated weights for policy 0, policy_version 83390 (0.0006) [2023-03-07 00:57:53,246][81400] Updated weights for policy 0, policy_version 83400 (0.0006) [2023-03-07 00:57:54,031][81400] Updated weights for policy 0, policy_version 83410 (0.0007) [2023-03-07 00:57:54,796][81400] Updated weights for policy 0, policy_version 83420 (0.0007) [2023-03-07 00:57:55,574][81400] Updated weights for policy 0, policy_version 83430 (0.0007) [2023-03-07 00:57:56,236][81074] Fps is (10 sec: 13312.0, 60 sec: 13243.7, 300 sec: 13301.6). Total num frames: 85440512. Throughput: 0: 13256.4. Samples: 85426161. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:57:56,237][81074] Avg episode reward: [(0, '1797.014')] [2023-03-07 00:57:56,242][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000083438_85440512.pth... 
[2023-03-07 00:57:56,272][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000080323_82250752.pth [2023-03-07 00:57:56,358][81400] Updated weights for policy 0, policy_version 83440 (0.0006) [2023-03-07 00:57:57,148][81400] Updated weights for policy 0, policy_version 83450 (0.0006) [2023-03-07 00:57:57,926][81400] Updated weights for policy 0, policy_version 83460 (0.0005) [2023-03-07 00:57:58,691][81400] Updated weights for policy 0, policy_version 83470 (0.0006) [2023-03-07 00:57:59,471][81400] Updated weights for policy 0, policy_version 83480 (0.0005) [2023-03-07 00:58:00,257][81400] Updated weights for policy 0, policy_version 83490 (0.0006) [2023-03-07 00:58:01,026][81400] Updated weights for policy 0, policy_version 83500 (0.0006) [2023-03-07 00:58:01,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13294.6). Total num frames: 85506048. Throughput: 0: 13239.1. Samples: 85504944. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:58:01,237][81074] Avg episode reward: [(0, '1715.069')] [2023-03-07 00:58:01,795][81400] Updated weights for policy 0, policy_version 83510 (0.0006) [2023-03-07 00:58:02,568][81400] Updated weights for policy 0, policy_version 83520 (0.0006) [2023-03-07 00:58:03,339][81400] Updated weights for policy 0, policy_version 83530 (0.0007) [2023-03-07 00:58:04,115][81400] Updated weights for policy 0, policy_version 83540 (0.0006) [2023-03-07 00:58:04,896][81400] Updated weights for policy 0, policy_version 83550 (0.0006) [2023-03-07 00:58:05,662][81400] Updated weights for policy 0, policy_version 83560 (0.0006) [2023-03-07 00:58:06,236][81074] Fps is (10 sec: 13209.8, 60 sec: 13243.7, 300 sec: 13294.6). Total num frames: 85572608. Throughput: 0: 13240.1. Samples: 85544747. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:58:06,237][81074] Avg episode reward: [(0, '2042.167')] [2023-03-07 00:58:06,452][81400] Updated weights for policy 0, policy_version 83570 (0.0007) [2023-03-07 00:58:07,226][81400] Updated weights for policy 0, policy_version 83580 (0.0006) [2023-03-07 00:58:08,009][81400] Updated weights for policy 0, policy_version 83590 (0.0007) [2023-03-07 00:58:08,779][81400] Updated weights for policy 0, policy_version 83600 (0.0005) [2023-03-07 00:58:09,552][81400] Updated weights for policy 0, policy_version 83610 (0.0006) [2023-03-07 00:58:10,325][81400] Updated weights for policy 0, policy_version 83620 (0.0006) [2023-03-07 00:58:11,108][81400] Updated weights for policy 0, policy_version 83630 (0.0006) [2023-03-07 00:58:11,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13226.7, 300 sec: 13291.2). Total num frames: 85638144. Throughput: 0: 13235.8. Samples: 85623940. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:58:11,237][81074] Avg episode reward: [(0, '2098.900')] [2023-03-07 00:58:11,862][81400] Updated weights for policy 0, policy_version 83640 (0.0006) [2023-03-07 00:58:12,632][81400] Updated weights for policy 0, policy_version 83650 (0.0006) [2023-03-07 00:58:13,417][81400] Updated weights for policy 0, policy_version 83660 (0.0006) [2023-03-07 00:58:14,185][81400] Updated weights for policy 0, policy_version 83670 (0.0005) [2023-03-07 00:58:14,953][81400] Updated weights for policy 0, policy_version 83680 (0.0006) [2023-03-07 00:58:15,739][81400] Updated weights for policy 0, policy_version 83690 (0.0007) [2023-03-07 00:58:16,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13243.7, 300 sec: 13291.2). Total num frames: 85704704. Throughput: 0: 13231.9. Samples: 85703464. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:58:16,237][81074] Avg episode reward: [(0, '2040.842')] [2023-03-07 00:58:16,518][81400] Updated weights for policy 0, policy_version 83700 (0.0006) [2023-03-07 00:58:17,283][81400] Updated weights for policy 0, policy_version 83710 (0.0005) [2023-03-07 00:58:18,069][81400] Updated weights for policy 0, policy_version 83720 (0.0006) [2023-03-07 00:58:18,841][81400] Updated weights for policy 0, policy_version 83730 (0.0006) [2023-03-07 00:58:19,621][81400] Updated weights for policy 0, policy_version 83740 (0.0005) [2023-03-07 00:58:20,407][81400] Updated weights for policy 0, policy_version 83750 (0.0006) [2023-03-07 00:58:21,162][81400] Updated weights for policy 0, policy_version 83760 (0.0005) [2023-03-07 00:58:21,236][81074] Fps is (10 sec: 13312.1, 60 sec: 13243.7, 300 sec: 13287.7). Total num frames: 85771264. Throughput: 0: 13224.3. Samples: 85743010. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:58:21,237][81074] Avg episode reward: [(0, '1972.219')] [2023-03-07 00:58:21,936][81400] Updated weights for policy 0, policy_version 83770 (0.0007) [2023-03-07 00:58:22,701][81400] Updated weights for policy 0, policy_version 83780 (0.0006) [2023-03-07 00:58:23,470][81400] Updated weights for policy 0, policy_version 83790 (0.0005) [2023-03-07 00:58:24,261][81400] Updated weights for policy 0, policy_version 83800 (0.0006) [2023-03-07 00:58:25,026][81400] Updated weights for policy 0, policy_version 83810 (0.0006) [2023-03-07 00:58:25,800][81400] Updated weights for policy 0, policy_version 83820 (0.0006) [2023-03-07 00:58:26,236][81074] Fps is (10 sec: 13209.8, 60 sec: 13226.7, 300 sec: 13280.8). Total num frames: 85836800. Throughput: 0: 13215.7. Samples: 85822321. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:58:26,237][81074] Avg episode reward: [(0, '2184.069')] [2023-03-07 00:58:26,580][81400] Updated weights for policy 0, policy_version 83830 (0.0006) [2023-03-07 00:58:27,361][81400] Updated weights for policy 0, policy_version 83840 (0.0006) [2023-03-07 00:58:28,138][81400] Updated weights for policy 0, policy_version 83850 (0.0007) [2023-03-07 00:58:28,909][81400] Updated weights for policy 0, policy_version 83860 (0.0006) [2023-03-07 00:58:29,690][81400] Updated weights for policy 0, policy_version 83870 (0.0007) [2023-03-07 00:58:30,478][81400] Updated weights for policy 0, policy_version 83880 (0.0006) [2023-03-07 00:58:31,236][81074] Fps is (10 sec: 13107.0, 60 sec: 13209.6, 300 sec: 13277.3). Total num frames: 85902336. Throughput: 0: 13212.0. Samples: 85901287. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:58:31,237][81074] Avg episode reward: [(0, '2045.309')] [2023-03-07 00:58:31,272][81400] Updated weights for policy 0, policy_version 83890 (0.0006) [2023-03-07 00:58:32,044][81400] Updated weights for policy 0, policy_version 83900 (0.0007) [2023-03-07 00:58:32,796][81400] Updated weights for policy 0, policy_version 83910 (0.0005) [2023-03-07 00:58:33,586][81400] Updated weights for policy 0, policy_version 83920 (0.0006) [2023-03-07 00:58:34,367][81400] Updated weights for policy 0, policy_version 83930 (0.0006) [2023-03-07 00:58:35,149][81400] Updated weights for policy 0, policy_version 83940 (0.0006) [2023-03-07 00:58:35,901][81400] Updated weights for policy 0, policy_version 83950 (0.0006) [2023-03-07 00:58:36,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13209.6, 300 sec: 13277.3). Total num frames: 85968896. Throughput: 0: 13208.4. Samples: 85940821. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:58:36,237][81074] Avg episode reward: [(0, '2009.134')] [2023-03-07 00:58:36,682][81400] Updated weights for policy 0, policy_version 83960 (0.0006) [2023-03-07 00:58:37,454][81400] Updated weights for policy 0, policy_version 83970 (0.0006) [2023-03-07 00:58:38,232][81400] Updated weights for policy 0, policy_version 83980 (0.0006) [2023-03-07 00:58:38,990][81400] Updated weights for policy 0, policy_version 83990 (0.0006) [2023-03-07 00:58:39,767][81400] Updated weights for policy 0, policy_version 84000 (0.0006) [2023-03-07 00:58:40,532][81400] Updated weights for policy 0, policy_version 84010 (0.0006) [2023-03-07 00:58:41,236][81074] Fps is (10 sec: 13209.8, 60 sec: 13209.6, 300 sec: 13277.3). Total num frames: 86034432. Throughput: 0: 13207.0. Samples: 86020474. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:58:41,237][81074] Avg episode reward: [(0, '1902.323')] [2023-03-07 00:58:41,321][81400] Updated weights for policy 0, policy_version 84020 (0.0006) [2023-03-07 00:58:42,099][81400] Updated weights for policy 0, policy_version 84030 (0.0006) [2023-03-07 00:58:42,871][81400] Updated weights for policy 0, policy_version 84040 (0.0006) [2023-03-07 00:58:43,653][81400] Updated weights for policy 0, policy_version 84050 (0.0006) [2023-03-07 00:58:44,412][81400] Updated weights for policy 0, policy_version 84060 (0.0006) [2023-03-07 00:58:45,187][81400] Updated weights for policy 0, policy_version 84070 (0.0007) [2023-03-07 00:58:45,949][81400] Updated weights for policy 0, policy_version 84080 (0.0006) [2023-03-07 00:58:46,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13226.7, 300 sec: 13277.3). Total num frames: 86100992. Throughput: 0: 13222.7. Samples: 86099963. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:58:46,237][81074] Avg episode reward: [(0, '2075.371')] [2023-03-07 00:58:46,723][81400] Updated weights for policy 0, policy_version 84090 (0.0006) [2023-03-07 00:58:47,505][81400] Updated weights for policy 0, policy_version 84100 (0.0006) [2023-03-07 00:58:48,282][81400] Updated weights for policy 0, policy_version 84110 (0.0005) [2023-03-07 00:58:49,082][81400] Updated weights for policy 0, policy_version 84120 (0.0006) [2023-03-07 00:58:49,853][81400] Updated weights for policy 0, policy_version 84130 (0.0006) [2023-03-07 00:58:50,626][81400] Updated weights for policy 0, policy_version 84140 (0.0005) [2023-03-07 00:58:51,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13209.6, 300 sec: 13273.8). Total num frames: 86166528. Throughput: 0: 13211.1. Samples: 86139247. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:58:51,237][81074] Avg episode reward: [(0, '2100.047')] [2023-03-07 00:58:51,396][81400] Updated weights for policy 0, policy_version 84150 (0.0006) [2023-03-07 00:58:52,182][81400] Updated weights for policy 0, policy_version 84160 (0.0006) [2023-03-07 00:58:52,944][81400] Updated weights for policy 0, policy_version 84170 (0.0006) [2023-03-07 00:58:53,733][81400] Updated weights for policy 0, policy_version 84180 (0.0006) [2023-03-07 00:58:54,509][81400] Updated weights for policy 0, policy_version 84190 (0.0006) [2023-03-07 00:58:55,279][81400] Updated weights for policy 0, policy_version 84200 (0.0006) [2023-03-07 00:58:56,056][81400] Updated weights for policy 0, policy_version 84210 (0.0006) [2023-03-07 00:58:56,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13209.6, 300 sec: 13273.8). Total num frames: 86233088. Throughput: 0: 13213.7. Samples: 86218559. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:58:56,237][81074] Avg episode reward: [(0, '2366.380')] [2023-03-07 00:58:56,838][81400] Updated weights for policy 0, policy_version 84220 (0.0006) [2023-03-07 00:58:57,605][81400] Updated weights for policy 0, policy_version 84230 (0.0007) [2023-03-07 00:58:58,386][81400] Updated weights for policy 0, policy_version 84240 (0.0006) [2023-03-07 00:58:59,166][81400] Updated weights for policy 0, policy_version 84250 (0.0007) [2023-03-07 00:58:59,955][81400] Updated weights for policy 0, policy_version 84260 (0.0006) [2023-03-07 00:59:00,730][81400] Updated weights for policy 0, policy_version 84270 (0.0006) [2023-03-07 00:59:01,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13209.6, 300 sec: 13266.9). Total num frames: 86298624. Throughput: 0: 13197.1. Samples: 86297330. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:59:01,237][81074] Avg episode reward: [(0, '2638.837')] [2023-03-07 00:59:01,499][81400] Updated weights for policy 0, policy_version 84280 (0.0007) [2023-03-07 00:59:02,281][81400] Updated weights for policy 0, policy_version 84290 (0.0006) [2023-03-07 00:59:03,063][81400] Updated weights for policy 0, policy_version 84300 (0.0006) [2023-03-07 00:59:03,848][81400] Updated weights for policy 0, policy_version 84310 (0.0006) [2023-03-07 00:59:04,622][81400] Updated weights for policy 0, policy_version 84320 (0.0005) [2023-03-07 00:59:05,389][81400] Updated weights for policy 0, policy_version 84330 (0.0006) [2023-03-07 00:59:06,149][81400] Updated weights for policy 0, policy_version 84340 (0.0006) [2023-03-07 00:59:06,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13209.6, 300 sec: 13266.9). Total num frames: 86365184. Throughput: 0: 13198.1. Samples: 86336923. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:59:06,237][81074] Avg episode reward: [(0, '2715.579')] [2023-03-07 00:59:06,923][81400] Updated weights for policy 0, policy_version 84350 (0.0006) [2023-03-07 00:59:07,699][81400] Updated weights for policy 0, policy_version 84360 (0.0005) [2023-03-07 00:59:08,470][81400] Updated weights for policy 0, policy_version 84370 (0.0006) [2023-03-07 00:59:09,252][81400] Updated weights for policy 0, policy_version 84380 (0.0006) [2023-03-07 00:59:10,013][81400] Updated weights for policy 0, policy_version 84390 (0.0006) [2023-03-07 00:59:10,778][81400] Updated weights for policy 0, policy_version 84400 (0.0006) [2023-03-07 00:59:11,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13209.6, 300 sec: 13263.4). Total num frames: 86430720. Throughput: 0: 13205.6. Samples: 86416574. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:59:11,237][81074] Avg episode reward: [(0, '2753.266')] [2023-03-07 00:59:11,553][81400] Updated weights for policy 0, policy_version 84410 (0.0006) [2023-03-07 00:59:12,341][81400] Updated weights for policy 0, policy_version 84420 (0.0007) [2023-03-07 00:59:13,094][81400] Updated weights for policy 0, policy_version 84430 (0.0006) [2023-03-07 00:59:13,878][81400] Updated weights for policy 0, policy_version 84440 (0.0006) [2023-03-07 00:59:14,655][81400] Updated weights for policy 0, policy_version 84450 (0.0006) [2023-03-07 00:59:15,427][81400] Updated weights for policy 0, policy_version 84460 (0.0005) [2023-03-07 00:59:16,203][81400] Updated weights for policy 0, policy_version 84470 (0.0006) [2023-03-07 00:59:16,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13209.6, 300 sec: 13263.4). Total num frames: 86497280. Throughput: 0: 13215.7. Samples: 86495995. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:59:16,237][81074] Avg episode reward: [(0, '2527.477')] [2023-03-07 00:59:16,968][81400] Updated weights for policy 0, policy_version 84480 (0.0006) [2023-03-07 00:59:17,745][81400] Updated weights for policy 0, policy_version 84490 (0.0007) [2023-03-07 00:59:18,525][81400] Updated weights for policy 0, policy_version 84500 (0.0007) [2023-03-07 00:59:19,297][81400] Updated weights for policy 0, policy_version 84510 (0.0006) [2023-03-07 00:59:20,078][81400] Updated weights for policy 0, policy_version 84520 (0.0006) [2023-03-07 00:59:20,859][81400] Updated weights for policy 0, policy_version 84530 (0.0007) [2023-03-07 00:59:21,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13192.5, 300 sec: 13259.9). Total num frames: 86562816. Throughput: 0: 13215.1. Samples: 86535502. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:59:21,237][81074] Avg episode reward: [(0, '2748.380')] [2023-03-07 00:59:21,642][81400] Updated weights for policy 0, policy_version 84540 (0.0005) [2023-03-07 00:59:22,426][81400] Updated weights for policy 0, policy_version 84550 (0.0007) [2023-03-07 00:59:23,225][81400] Updated weights for policy 0, policy_version 84560 (0.0007) [2023-03-07 00:59:23,994][81400] Updated weights for policy 0, policy_version 84570 (0.0006) [2023-03-07 00:59:24,774][81400] Updated weights for policy 0, policy_version 84580 (0.0005) [2023-03-07 00:59:25,551][81400] Updated weights for policy 0, policy_version 84590 (0.0006) [2023-03-07 00:59:26,236][81074] Fps is (10 sec: 13107.4, 60 sec: 13192.6, 300 sec: 13256.5). Total num frames: 86628352. Throughput: 0: 13194.4. Samples: 86614223. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:59:26,247][81074] Avg episode reward: [(0, '2309.562')] [2023-03-07 00:59:26,324][81400] Updated weights for policy 0, policy_version 84600 (0.0006) [2023-03-07 00:59:27,111][81400] Updated weights for policy 0, policy_version 84610 (0.0006) [2023-03-07 00:59:27,872][81400] Updated weights for policy 0, policy_version 84620 (0.0006) [2023-03-07 00:59:28,661][81400] Updated weights for policy 0, policy_version 84630 (0.0006) [2023-03-07 00:59:29,429][81400] Updated weights for policy 0, policy_version 84640 (0.0006) [2023-03-07 00:59:30,208][81400] Updated weights for policy 0, policy_version 84650 (0.0006) [2023-03-07 00:59:30,991][81400] Updated weights for policy 0, policy_version 84660 (0.0006) [2023-03-07 00:59:31,236][81074] Fps is (10 sec: 13209.8, 60 sec: 13209.6, 300 sec: 13256.5). Total num frames: 86694912. Throughput: 0: 13185.5. Samples: 86693311. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:59:31,237][81074] Avg episode reward: [(0, '2293.073')] [2023-03-07 00:59:31,761][81400] Updated weights for policy 0, policy_version 84670 (0.0005) [2023-03-07 00:59:32,550][81400] Updated weights for policy 0, policy_version 84680 (0.0006) [2023-03-07 00:59:33,325][81400] Updated weights for policy 0, policy_version 84690 (0.0006) [2023-03-07 00:59:34,098][81400] Updated weights for policy 0, policy_version 84700 (0.0005) [2023-03-07 00:59:34,884][81400] Updated weights for policy 0, policy_version 84710 (0.0006) [2023-03-07 00:59:35,655][81400] Updated weights for policy 0, policy_version 84720 (0.0006) [2023-03-07 00:59:36,236][81074] Fps is (10 sec: 13209.3, 60 sec: 13192.5, 300 sec: 13253.0). Total num frames: 86760448. Throughput: 0: 13192.1. Samples: 86732891. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:59:36,237][81074] Avg episode reward: [(0, '2607.832')] [2023-03-07 00:59:36,445][81400] Updated weights for policy 0, policy_version 84730 (0.0006) [2023-03-07 00:59:37,231][81400] Updated weights for policy 0, policy_version 84740 (0.0006) [2023-03-07 00:59:37,999][81400] Updated weights for policy 0, policy_version 84750 (0.0006) [2023-03-07 00:59:38,786][81400] Updated weights for policy 0, policy_version 84760 (0.0006) [2023-03-07 00:59:39,568][81400] Updated weights for policy 0, policy_version 84770 (0.0006) [2023-03-07 00:59:40,345][81400] Updated weights for policy 0, policy_version 84780 (0.0006) [2023-03-07 00:59:41,123][81400] Updated weights for policy 0, policy_version 84790 (0.0005) [2023-03-07 00:59:41,236][81074] Fps is (10 sec: 13107.0, 60 sec: 13192.5, 300 sec: 13246.0). Total num frames: 86825984. Throughput: 0: 13177.9. Samples: 86811565. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:59:41,237][81074] Avg episode reward: [(0, '2537.469')] [2023-03-07 00:59:41,904][81400] Updated weights for policy 0, policy_version 84800 (0.0006) [2023-03-07 00:59:42,686][81400] Updated weights for policy 0, policy_version 84810 (0.0006) [2023-03-07 00:59:43,462][81400] Updated weights for policy 0, policy_version 84820 (0.0007) [2023-03-07 00:59:44,231][81400] Updated weights for policy 0, policy_version 84830 (0.0006) [2023-03-07 00:59:45,025][81400] Updated weights for policy 0, policy_version 84840 (0.0006) [2023-03-07 00:59:45,799][81400] Updated weights for policy 0, policy_version 84850 (0.0006) [2023-03-07 00:59:46,236][81074] Fps is (10 sec: 13107.4, 60 sec: 13175.5, 300 sec: 13242.6). Total num frames: 86891520. Throughput: 0: 13180.6. Samples: 86890456. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:59:46,237][81074] Avg episode reward: [(0, '2471.159')] [2023-03-07 00:59:46,577][81400] Updated weights for policy 0, policy_version 84860 (0.0006) [2023-03-07 00:59:47,351][81400] Updated weights for policy 0, policy_version 84870 (0.0006) [2023-03-07 00:59:48,134][81400] Updated weights for policy 0, policy_version 84880 (0.0006) [2023-03-07 00:59:48,904][81400] Updated weights for policy 0, policy_version 84890 (0.0005) [2023-03-07 00:59:49,692][81400] Updated weights for policy 0, policy_version 84900 (0.0005) [2023-03-07 00:59:50,478][81400] Updated weights for policy 0, policy_version 84910 (0.0006) [2023-03-07 00:59:51,226][81400] Updated weights for policy 0, policy_version 84920 (0.0006) [2023-03-07 00:59:51,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13192.5, 300 sec: 13242.6). Total num frames: 86958080. Throughput: 0: 13175.6. Samples: 86929827. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 00:59:51,237][81074] Avg episode reward: [(0, '2638.389')] [2023-03-07 00:59:52,031][81400] Updated weights for policy 0, policy_version 84930 (0.0006) [2023-03-07 00:59:52,809][81400] Updated weights for policy 0, policy_version 84940 (0.0006) [2023-03-07 00:59:53,578][81400] Updated weights for policy 0, policy_version 84950 (0.0006) [2023-03-07 00:59:54,370][81400] Updated weights for policy 0, policy_version 84960 (0.0006) [2023-03-07 00:59:55,159][81400] Updated weights for policy 0, policy_version 84970 (0.0006) [2023-03-07 00:59:55,937][81400] Updated weights for policy 0, policy_version 84980 (0.0007) [2023-03-07 00:59:56,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13235.6). Total num frames: 87022592. Throughput: 0: 13159.0. Samples: 87008728. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 00:59:56,237][81074] Avg episode reward: [(0, '2465.390')] [2023-03-07 00:59:56,240][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000084984_87023616.pth... [2023-03-07 00:59:56,270][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000081884_83849216.pth [2023-03-07 00:59:56,724][81400] Updated weights for policy 0, policy_version 84990 (0.0006) [2023-03-07 00:59:57,505][81400] Updated weights for policy 0, policy_version 85000 (0.0007) [2023-03-07 00:59:58,272][81400] Updated weights for policy 0, policy_version 85010 (0.0007) [2023-03-07 00:59:59,044][81400] Updated weights for policy 0, policy_version 85020 (0.0007) [2023-03-07 00:59:59,825][81400] Updated weights for policy 0, policy_version 85030 (0.0006) [2023-03-07 01:00:00,592][81400] Updated weights for policy 0, policy_version 85040 (0.0006) [2023-03-07 01:00:01,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13175.4, 300 sec: 13239.1). Total num frames: 87089152. Throughput: 0: 13147.1. Samples: 87087616. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 01:00:01,237][81074] Avg episode reward: [(0, '2335.796')] [2023-03-07 01:00:01,374][81400] Updated weights for policy 0, policy_version 85050 (0.0006) [2023-03-07 01:00:02,146][81400] Updated weights for policy 0, policy_version 85060 (0.0005) [2023-03-07 01:00:02,933][81400] Updated weights for policy 0, policy_version 85070 (0.0006) [2023-03-07 01:00:03,717][81400] Updated weights for policy 0, policy_version 85080 (0.0006) [2023-03-07 01:00:04,496][81400] Updated weights for policy 0, policy_version 85090 (0.0006) [2023-03-07 01:00:05,271][81400] Updated weights for policy 0, policy_version 85100 (0.0006) [2023-03-07 01:00:06,046][81400] Updated weights for policy 0, policy_version 85110 (0.0006) [2023-03-07 01:00:06,236][81074] Fps is (10 sec: 13209.3, 60 sec: 13158.4, 300 sec: 13235.6). Total num frames: 87154688. Throughput: 0: 13146.4. Samples: 87127090. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 01:00:06,237][81074] Avg episode reward: [(0, '2532.851')] [2023-03-07 01:00:06,835][81400] Updated weights for policy 0, policy_version 85120 (0.0006) [2023-03-07 01:00:07,603][81400] Updated weights for policy 0, policy_version 85130 (0.0006) [2023-03-07 01:00:08,392][81400] Updated weights for policy 0, policy_version 85140 (0.0007) [2023-03-07 01:00:09,184][81400] Updated weights for policy 0, policy_version 85150 (0.0006) [2023-03-07 01:00:09,927][81400] Updated weights for policy 0, policy_version 85160 (0.0006) [2023-03-07 01:00:10,709][81400] Updated weights for policy 0, policy_version 85170 (0.0006) [2023-03-07 01:00:11,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13228.7). Total num frames: 87220224. Throughput: 0: 13153.7. Samples: 87206140. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 01:00:11,237][81074] Avg episode reward: [(0, '2431.080')] [2023-03-07 01:00:11,485][81400] Updated weights for policy 0, policy_version 85180 (0.0006) [2023-03-07 01:00:12,258][81400] Updated weights for policy 0, policy_version 85190 (0.0006) [2023-03-07 01:00:13,025][81400] Updated weights for policy 0, policy_version 85200 (0.0006) [2023-03-07 01:00:13,805][81400] Updated weights for policy 0, policy_version 85210 (0.0005) [2023-03-07 01:00:14,587][81400] Updated weights for policy 0, policy_version 85220 (0.0006) [2023-03-07 01:00:15,367][81400] Updated weights for policy 0, policy_version 85230 (0.0006) [2023-03-07 01:00:16,148][81400] Updated weights for policy 0, policy_version 85240 (0.0006) [2023-03-07 01:00:16,236][81074] Fps is (10 sec: 13209.9, 60 sec: 13158.4, 300 sec: 13228.7). Total num frames: 87286784. Throughput: 0: 13150.3. Samples: 87285073. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 01:00:16,237][81074] Avg episode reward: [(0, '2423.048')] [2023-03-07 01:00:16,932][81400] Updated weights for policy 0, policy_version 85250 (0.0007) [2023-03-07 01:00:17,716][81400] Updated weights for policy 0, policy_version 85260 (0.0005) [2023-03-07 01:00:18,487][81400] Updated weights for policy 0, policy_version 85270 (0.0006) [2023-03-07 01:00:19,268][81400] Updated weights for policy 0, policy_version 85280 (0.0006) [2023-03-07 01:00:20,044][81400] Updated weights for policy 0, policy_version 85290 (0.0006) [2023-03-07 01:00:20,808][81400] Updated weights for policy 0, policy_version 85300 (0.0006) [2023-03-07 01:00:21,236][81074] Fps is (10 sec: 13209.8, 60 sec: 13158.4, 300 sec: 13225.2). Total num frames: 87352320. Throughput: 0: 13148.8. Samples: 87324583. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 01:00:21,237][81074] Avg episode reward: [(0, '2470.409')] [2023-03-07 01:00:21,586][81400] Updated weights for policy 0, policy_version 85310 (0.0006) [2023-03-07 01:00:22,358][81400] Updated weights for policy 0, policy_version 85320 (0.0005) [2023-03-07 01:00:23,114][81400] Updated weights for policy 0, policy_version 85330 (0.0005) [2023-03-07 01:00:23,942][81400] Updated weights for policy 0, policy_version 85340 (0.0006) [2023-03-07 01:00:24,695][81400] Updated weights for policy 0, policy_version 85350 (0.0006) [2023-03-07 01:00:25,474][81400] Updated weights for policy 0, policy_version 85360 (0.0005) [2023-03-07 01:00:26,236][81400] Updated weights for policy 0, policy_version 85370 (0.0006) [2023-03-07 01:00:26,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13175.4, 300 sec: 13225.2). Total num frames: 87418880. Throughput: 0: 13161.9. Samples: 87403849. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 01:00:26,237][81074] Avg episode reward: [(0, '2864.754')] [2023-03-07 01:00:27,025][81400] Updated weights for policy 0, policy_version 85380 (0.0006) [2023-03-07 01:00:27,798][81400] Updated weights for policy 0, policy_version 85390 (0.0007) [2023-03-07 01:00:28,571][81400] Updated weights for policy 0, policy_version 85400 (0.0006) [2023-03-07 01:00:29,363][81400] Updated weights for policy 0, policy_version 85410 (0.0005) [2023-03-07 01:00:30,136][81400] Updated weights for policy 0, policy_version 85420 (0.0006) [2023-03-07 01:00:30,931][81400] Updated weights for policy 0, policy_version 85430 (0.0007) [2023-03-07 01:00:31,236][81074] Fps is (10 sec: 13107.0, 60 sec: 13141.3, 300 sec: 13221.7). Total num frames: 87483392. Throughput: 0: 13158.5. Samples: 87482589. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 01:00:31,237][81074] Avg episode reward: [(0, '2942.982')] [2023-03-07 01:00:31,713][81400] Updated weights for policy 0, policy_version 85440 (0.0006) [2023-03-07 01:00:32,485][81400] Updated weights for policy 0, policy_version 85450 (0.0007) [2023-03-07 01:00:33,255][81400] Updated weights for policy 0, policy_version 85460 (0.0006) [2023-03-07 01:00:34,029][81400] Updated weights for policy 0, policy_version 85470 (0.0006) [2023-03-07 01:00:34,794][81400] Updated weights for policy 0, policy_version 85480 (0.0006) [2023-03-07 01:00:35,574][81400] Updated weights for policy 0, policy_version 85490 (0.0007) [2023-03-07 01:00:36,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13221.7). Total num frames: 87549952. Throughput: 0: 13166.7. Samples: 87522329. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 01:00:36,237][81074] Avg episode reward: [(0, '2895.970')] [2023-03-07 01:00:36,345][81400] Updated weights for policy 0, policy_version 85500 (0.0006) [2023-03-07 01:00:37,133][81400] Updated weights for policy 0, policy_version 85510 (0.0006) [2023-03-07 01:00:37,906][81400] Updated weights for policy 0, policy_version 85520 (0.0006) [2023-03-07 01:00:38,688][81400] Updated weights for policy 0, policy_version 85530 (0.0007) [2023-03-07 01:00:39,468][81400] Updated weights for policy 0, policy_version 85540 (0.0007) [2023-03-07 01:00:40,238][81400] Updated weights for policy 0, policy_version 85550 (0.0007) [2023-03-07 01:00:41,014][81400] Updated weights for policy 0, policy_version 85560 (0.0006) [2023-03-07 01:00:41,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13218.3). Total num frames: 87615488. Throughput: 0: 13167.8. Samples: 87601281. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 01:00:41,237][81074] Avg episode reward: [(0, '3207.475')] [2023-03-07 01:00:41,789][81400] Updated weights for policy 0, policy_version 85570 (0.0006) [2023-03-07 01:00:42,550][81400] Updated weights for policy 0, policy_version 85580 (0.0006) [2023-03-07 01:00:43,353][81400] Updated weights for policy 0, policy_version 85590 (0.0006) [2023-03-07 01:00:44,111][81400] Updated weights for policy 0, policy_version 85600 (0.0006) [2023-03-07 01:00:44,903][81400] Updated weights for policy 0, policy_version 85610 (0.0006) [2023-03-07 01:00:45,668][81400] Updated weights for policy 0, policy_version 85620 (0.0006) [2023-03-07 01:00:46,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13218.3). Total num frames: 87682048. Throughput: 0: 13174.4. Samples: 87680464. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 01:00:46,247][81074] Avg episode reward: [(0, '2987.210')] [2023-03-07 01:00:46,453][81400] Updated weights for policy 0, policy_version 85630 (0.0006) [2023-03-07 01:00:47,215][81400] Updated weights for policy 0, policy_version 85640 (0.0006) [2023-03-07 01:00:47,990][81400] Updated weights for policy 0, policy_version 85650 (0.0006) [2023-03-07 01:00:48,770][81400] Updated weights for policy 0, policy_version 85660 (0.0007) [2023-03-07 01:00:49,525][81400] Updated weights for policy 0, policy_version 85670 (0.0006) [2023-03-07 01:00:50,301][81400] Updated weights for policy 0, policy_version 85680 (0.0006) [2023-03-07 01:00:51,096][81400] Updated weights for policy 0, policy_version 85690 (0.0006) [2023-03-07 01:00:51,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13214.8). Total num frames: 87747584. Throughput: 0: 13182.3. Samples: 87720292. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:00:51,237][81074] Avg episode reward: [(0, '2868.407')] [2023-03-07 01:00:51,886][81400] Updated weights for policy 0, policy_version 85700 (0.0006) [2023-03-07 01:00:52,666][81400] Updated weights for policy 0, policy_version 85710 (0.0007) [2023-03-07 01:00:53,446][81400] Updated weights for policy 0, policy_version 85720 (0.0006) [2023-03-07 01:00:54,231][81400] Updated weights for policy 0, policy_version 85730 (0.0007) [2023-03-07 01:00:55,014][81400] Updated weights for policy 0, policy_version 85740 (0.0006) [2023-03-07 01:00:55,789][81400] Updated weights for policy 0, policy_version 85750 (0.0007) [2023-03-07 01:00:56,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13175.4, 300 sec: 13211.3). Total num frames: 87813120. Throughput: 0: 13177.7. Samples: 87799136. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:00:56,237][81074] Avg episode reward: [(0, '2761.341')] [2023-03-07 01:00:56,571][81400] Updated weights for policy 0, policy_version 85760 (0.0006) [2023-03-07 01:00:57,342][81400] Updated weights for policy 0, policy_version 85770 (0.0006) [2023-03-07 01:00:58,120][81400] Updated weights for policy 0, policy_version 85780 (0.0007) [2023-03-07 01:00:58,901][81400] Updated weights for policy 0, policy_version 85790 (0.0006) [2023-03-07 01:00:59,677][81400] Updated weights for policy 0, policy_version 85800 (0.0007) [2023-03-07 01:01:00,460][81400] Updated weights for policy 0, policy_version 85810 (0.0006) [2023-03-07 01:01:01,231][81400] Updated weights for policy 0, policy_version 85820 (0.0005) [2023-03-07 01:01:01,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13211.3). Total num frames: 87879680. Throughput: 0: 13175.4. Samples: 87877966. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:01:01,237][81074] Avg episode reward: [(0, '2547.604')] [2023-03-07 01:01:02,006][81400] Updated weights for policy 0, policy_version 85830 (0.0006) [2023-03-07 01:01:02,773][81400] Updated weights for policy 0, policy_version 85840 (0.0006) [2023-03-07 01:01:03,558][81400] Updated weights for policy 0, policy_version 85850 (0.0006) [2023-03-07 01:01:04,356][81400] Updated weights for policy 0, policy_version 85860 (0.0006) [2023-03-07 01:01:05,130][81400] Updated weights for policy 0, policy_version 85870 (0.0006) [2023-03-07 01:01:05,926][81400] Updated weights for policy 0, policy_version 85880 (0.0006) [2023-03-07 01:01:06,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13175.5, 300 sec: 13207.9). Total num frames: 87945216. Throughput: 0: 13172.2. Samples: 87917334. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:01:06,237][81074] Avg episode reward: [(0, '2645.572')] [2023-03-07 01:01:06,696][81400] Updated weights for policy 0, policy_version 85890 (0.0005) [2023-03-07 01:01:07,473][81400] Updated weights for policy 0, policy_version 85900 (0.0006) [2023-03-07 01:01:08,250][81400] Updated weights for policy 0, policy_version 85910 (0.0007) [2023-03-07 01:01:09,016][81400] Updated weights for policy 0, policy_version 85920 (0.0006) [2023-03-07 01:01:09,775][81400] Updated weights for policy 0, policy_version 85930 (0.0006) [2023-03-07 01:01:10,557][81400] Updated weights for policy 0, policy_version 85940 (0.0006) [2023-03-07 01:01:11,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13175.5, 300 sec: 13204.4). Total num frames: 88010752. Throughput: 0: 13172.1. Samples: 87996593. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:01:11,237][81074] Avg episode reward: [(0, '2919.306')] [2023-03-07 01:01:11,321][81400] Updated weights for policy 0, policy_version 85950 (0.0007) [2023-03-07 01:01:12,086][81400] Updated weights for policy 0, policy_version 85960 (0.0006) [2023-03-07 01:01:12,875][81400] Updated weights for policy 0, policy_version 85970 (0.0006) [2023-03-07 01:01:13,656][81400] Updated weights for policy 0, policy_version 85980 (0.0007) [2023-03-07 01:01:14,436][81400] Updated weights for policy 0, policy_version 85990 (0.0007) [2023-03-07 01:01:15,215][81400] Updated weights for policy 0, policy_version 86000 (0.0006) [2023-03-07 01:01:16,008][81400] Updated weights for policy 0, policy_version 86010 (0.0007) [2023-03-07 01:01:16,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13200.9). Total num frames: 88076288. Throughput: 0: 13175.8. Samples: 88075501. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:01:16,237][81074] Avg episode reward: [(0, '2852.483')] [2023-03-07 01:01:16,790][81400] Updated weights for policy 0, policy_version 86020 (0.0006) [2023-03-07 01:01:17,558][81400] Updated weights for policy 0, policy_version 86030 (0.0006) [2023-03-07 01:01:18,354][81400] Updated weights for policy 0, policy_version 86040 (0.0006) [2023-03-07 01:01:19,121][81400] Updated weights for policy 0, policy_version 86050 (0.0006) [2023-03-07 01:01:19,901][81400] Updated weights for policy 0, policy_version 86060 (0.0006) [2023-03-07 01:01:20,677][81400] Updated weights for policy 0, policy_version 86070 (0.0006) [2023-03-07 01:01:21,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.4, 300 sec: 13200.9). Total num frames: 88142848. Throughput: 0: 13172.2. Samples: 88115079. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:01:21,237][81074] Avg episode reward: [(0, '2900.081')] [2023-03-07 01:01:21,432][81400] Updated weights for policy 0, policy_version 86080 (0.0006) [2023-03-07 01:01:22,207][81400] Updated weights for policy 0, policy_version 86090 (0.0006) [2023-03-07 01:01:22,995][81400] Updated weights for policy 0, policy_version 86100 (0.0005) [2023-03-07 01:01:23,778][81400] Updated weights for policy 0, policy_version 86110 (0.0006) [2023-03-07 01:01:24,568][81400] Updated weights for policy 0, policy_version 86120 (0.0006) [2023-03-07 01:01:25,374][81400] Updated weights for policy 0, policy_version 86130 (0.0006) [2023-03-07 01:01:26,146][81400] Updated weights for policy 0, policy_version 86140 (0.0006) [2023-03-07 01:01:26,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13197.5). Total num frames: 88208384. Throughput: 0: 13168.6. Samples: 88193869. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:01:26,237][81074] Avg episode reward: [(0, '3251.141')] [2023-03-07 01:01:26,927][81400] Updated weights for policy 0, policy_version 86150 (0.0005) [2023-03-07 01:01:27,718][81400] Updated weights for policy 0, policy_version 86160 (0.0006) [2023-03-07 01:01:28,485][81400] Updated weights for policy 0, policy_version 86170 (0.0006) [2023-03-07 01:01:29,259][81400] Updated weights for policy 0, policy_version 86180 (0.0006) [2023-03-07 01:01:30,046][81400] Updated weights for policy 0, policy_version 86190 (0.0006) [2023-03-07 01:01:30,803][81400] Updated weights for policy 0, policy_version 86200 (0.0006) [2023-03-07 01:01:31,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13175.5, 300 sec: 13194.0). Total num frames: 88273920. Throughput: 0: 13161.2. Samples: 88272720. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:01:31,237][81074] Avg episode reward: [(0, '2814.513')] [2023-03-07 01:01:31,579][81400] Updated weights for policy 0, policy_version 86210 (0.0006) [2023-03-07 01:01:32,384][81400] Updated weights for policy 0, policy_version 86220 (0.0006) [2023-03-07 01:01:33,151][81400] Updated weights for policy 0, policy_version 86230 (0.0006) [2023-03-07 01:01:33,921][81400] Updated weights for policy 0, policy_version 86240 (0.0006) [2023-03-07 01:01:34,706][81400] Updated weights for policy 0, policy_version 86250 (0.0005) [2023-03-07 01:01:35,483][81400] Updated weights for policy 0, policy_version 86260 (0.0006) [2023-03-07 01:01:36,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13194.0). Total num frames: 88339456. Throughput: 0: 13152.2. Samples: 88312141. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:01:36,237][81074] Avg episode reward: [(0, '2998.515')] [2023-03-07 01:01:36,258][81400] Updated weights for policy 0, policy_version 86270 (0.0006) [2023-03-07 01:01:37,026][81400] Updated weights for policy 0, policy_version 86280 (0.0006) [2023-03-07 01:01:37,816][81400] Updated weights for policy 0, policy_version 86290 (0.0006) [2023-03-07 01:01:38,599][81400] Updated weights for policy 0, policy_version 86300 (0.0006) [2023-03-07 01:01:39,365][81400] Updated weights for policy 0, policy_version 86310 (0.0006) [2023-03-07 01:01:40,162][81400] Updated weights for policy 0, policy_version 86320 (0.0006) [2023-03-07 01:01:40,925][81400] Updated weights for policy 0, policy_version 86330 (0.0006) [2023-03-07 01:01:41,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13175.5, 300 sec: 13194.0). Total num frames: 88406016. Throughput: 0: 13155.4. Samples: 88391129. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 01:01:41,237][81074] Avg episode reward: [(0, '3137.740')] [2023-03-07 01:01:41,706][81400] Updated weights for policy 0, policy_version 86340 (0.0005) [2023-03-07 01:01:42,477][81400] Updated weights for policy 0, policy_version 86350 (0.0007) [2023-03-07 01:01:43,246][81400] Updated weights for policy 0, policy_version 86360 (0.0006) [2023-03-07 01:01:44,013][81400] Updated weights for policy 0, policy_version 86370 (0.0006) [2023-03-07 01:01:44,798][81400] Updated weights for policy 0, policy_version 86380 (0.0006) [2023-03-07 01:01:45,583][81400] Updated weights for policy 0, policy_version 86390 (0.0006) [2023-03-07 01:01:46,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13190.5). Total num frames: 88471552. Throughput: 0: 13164.8. Samples: 88470382. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 01:01:46,237][81074] Avg episode reward: [(0, '3023.383')] [2023-03-07 01:01:46,356][81400] Updated weights for policy 0, policy_version 86400 (0.0006) [2023-03-07 01:01:47,122][81400] Updated weights for policy 0, policy_version 86410 (0.0006) [2023-03-07 01:01:47,906][81400] Updated weights for policy 0, policy_version 86420 (0.0006) [2023-03-07 01:01:48,673][81400] Updated weights for policy 0, policy_version 86430 (0.0006) [2023-03-07 01:01:49,438][81400] Updated weights for policy 0, policy_version 86440 (0.0006) [2023-03-07 01:01:50,226][81400] Updated weights for policy 0, policy_version 86450 (0.0006) [2023-03-07 01:01:50,999][81400] Updated weights for policy 0, policy_version 86460 (0.0006) [2023-03-07 01:01:51,236][81074] Fps is (10 sec: 13209.8, 60 sec: 13175.5, 300 sec: 13194.0). Total num frames: 88538112. Throughput: 0: 13172.0. Samples: 88510072. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 01:01:51,237][81074] Avg episode reward: [(0, '3024.860')] [2023-03-07 01:01:51,777][81400] Updated weights for policy 0, policy_version 86470 (0.0007) [2023-03-07 01:01:52,560][81400] Updated weights for policy 0, policy_version 86480 (0.0005) [2023-03-07 01:01:53,324][81400] Updated weights for policy 0, policy_version 86490 (0.0006) [2023-03-07 01:01:54,101][81400] Updated weights for policy 0, policy_version 86500 (0.0006) [2023-03-07 01:01:54,874][81400] Updated weights for policy 0, policy_version 86510 (0.0007) [2023-03-07 01:01:55,648][81400] Updated weights for policy 0, policy_version 86520 (0.0006) [2023-03-07 01:01:56,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13190.5). Total num frames: 88603648. Throughput: 0: 13169.5. Samples: 88589220. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 01:01:56,237][81074] Avg episode reward: [(0, '3033.812')] [2023-03-07 01:01:56,241][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000086527_88603648.pth... 
[2023-03-07 01:01:56,272][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000083438_85440512.pth [2023-03-07 01:01:56,442][81400] Updated weights for policy 0, policy_version 86530 (0.0006) [2023-03-07 01:01:57,216][81400] Updated weights for policy 0, policy_version 86540 (0.0006) [2023-03-07 01:01:58,001][81400] Updated weights for policy 0, policy_version 86550 (0.0006) [2023-03-07 01:01:58,777][81400] Updated weights for policy 0, policy_version 86560 (0.0006) [2023-03-07 01:01:59,553][81400] Updated weights for policy 0, policy_version 86570 (0.0006) [2023-03-07 01:02:00,338][81400] Updated weights for policy 0, policy_version 86580 (0.0006) [2023-03-07 01:02:01,104][81400] Updated weights for policy 0, policy_version 86590 (0.0007) [2023-03-07 01:02:01,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13190.5). Total num frames: 88669184. Throughput: 0: 13169.0. Samples: 88668106. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 01:02:01,237][81074] Avg episode reward: [(0, '2813.370')] [2023-03-07 01:02:01,872][81400] Updated weights for policy 0, policy_version 86600 (0.0006) [2023-03-07 01:02:02,663][81400] Updated weights for policy 0, policy_version 86610 (0.0007) [2023-03-07 01:02:03,435][81400] Updated weights for policy 0, policy_version 86620 (0.0006) [2023-03-07 01:02:04,205][81400] Updated weights for policy 0, policy_version 86630 (0.0006) [2023-03-07 01:02:04,985][81400] Updated weights for policy 0, policy_version 86640 (0.0006) [2023-03-07 01:02:05,753][81400] Updated weights for policy 0, policy_version 86650 (0.0006) [2023-03-07 01:02:06,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13190.5). Total num frames: 88735744. Throughput: 0: 13169.6. Samples: 88707710. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 01:02:06,237][81074] Avg episode reward: [(0, '3052.303')] [2023-03-07 01:02:06,528][81400] Updated weights for policy 0, policy_version 86660 (0.0006) [2023-03-07 01:02:07,315][81400] Updated weights for policy 0, policy_version 86670 (0.0006) [2023-03-07 01:02:08,082][81400] Updated weights for policy 0, policy_version 86680 (0.0007) [2023-03-07 01:02:08,885][81400] Updated weights for policy 0, policy_version 86690 (0.0006) [2023-03-07 01:02:09,655][81400] Updated weights for policy 0, policy_version 86700 (0.0006) [2023-03-07 01:02:10,419][81400] Updated weights for policy 0, policy_version 86710 (0.0006) [2023-03-07 01:02:11,207][81400] Updated weights for policy 0, policy_version 86720 (0.0006) [2023-03-07 01:02:11,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13175.5, 300 sec: 13190.5). Total num frames: 88801280. Throughput: 0: 13177.5. Samples: 88786855. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 01:02:11,237][81074] Avg episode reward: [(0, '3023.976')] [2023-03-07 01:02:11,965][81400] Updated weights for policy 0, policy_version 86730 (0.0005) [2023-03-07 01:02:12,745][81400] Updated weights for policy 0, policy_version 86740 (0.0007) [2023-03-07 01:02:13,539][81400] Updated weights for policy 0, policy_version 86750 (0.0006) [2023-03-07 01:02:14,294][81400] Updated weights for policy 0, policy_version 86760 (0.0006) [2023-03-07 01:02:15,082][81400] Updated weights for policy 0, policy_version 86770 (0.0006) [2023-03-07 01:02:15,865][81400] Updated weights for policy 0, policy_version 86780 (0.0006) [2023-03-07 01:02:16,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13175.5, 300 sec: 13187.0). Total num frames: 88866816. Throughput: 0: 13183.6. Samples: 88865981. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 01:02:16,237][81074] Avg episode reward: [(0, '3079.726')] [2023-03-07 01:02:16,633][81400] Updated weights for policy 0, policy_version 86790 (0.0007) [2023-03-07 01:02:17,406][81400] Updated weights for policy 0, policy_version 86800 (0.0006) [2023-03-07 01:02:18,178][81400] Updated weights for policy 0, policy_version 86810 (0.0006) [2023-03-07 01:02:18,954][81400] Updated weights for policy 0, policy_version 86820 (0.0006) [2023-03-07 01:02:19,749][81400] Updated weights for policy 0, policy_version 86830 (0.0006) [2023-03-07 01:02:20,516][81400] Updated weights for policy 0, policy_version 86840 (0.0006) [2023-03-07 01:02:21,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13187.0). Total num frames: 88933376. Throughput: 0: 13187.7. Samples: 88905589. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 01:02:21,237][81074] Avg episode reward: [(0, '3077.480')] [2023-03-07 01:02:21,281][81400] Updated weights for policy 0, policy_version 86850 (0.0006) [2023-03-07 01:02:22,074][81400] Updated weights for policy 0, policy_version 86860 (0.0006) [2023-03-07 01:02:22,848][81400] Updated weights for policy 0, policy_version 86870 (0.0007) [2023-03-07 01:02:23,610][81400] Updated weights for policy 0, policy_version 86880 (0.0006) [2023-03-07 01:02:24,390][81400] Updated weights for policy 0, policy_version 86890 (0.0006) [2023-03-07 01:02:25,153][81400] Updated weights for policy 0, policy_version 86900 (0.0006) [2023-03-07 01:02:25,933][81400] Updated weights for policy 0, policy_version 86910 (0.0006) [2023-03-07 01:02:26,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13183.6). Total num frames: 88998912. Throughput: 0: 13192.1. Samples: 88984773. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 01:02:26,237][81074] Avg episode reward: [(0, '3111.952')] [2023-03-07 01:02:26,718][81400] Updated weights for policy 0, policy_version 86920 (0.0007) [2023-03-07 01:02:27,510][81400] Updated weights for policy 0, policy_version 86930 (0.0006) [2023-03-07 01:02:28,285][81400] Updated weights for policy 0, policy_version 86940 (0.0006) [2023-03-07 01:02:29,074][81400] Updated weights for policy 0, policy_version 86950 (0.0006) [2023-03-07 01:02:29,845][81400] Updated weights for policy 0, policy_version 86960 (0.0006) [2023-03-07 01:02:30,612][81400] Updated weights for policy 0, policy_version 86970 (0.0006) [2023-03-07 01:02:31,236][81074] Fps is (10 sec: 13209.8, 60 sec: 13192.5, 300 sec: 13183.6). Total num frames: 89065472. Throughput: 0: 13187.0. Samples: 89063794. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 01:02:31,237][81074] Avg episode reward: [(0, '3155.816')] [2023-03-07 01:02:31,388][81400] Updated weights for policy 0, policy_version 86980 (0.0006) [2023-03-07 01:02:32,159][81400] Updated weights for policy 0, policy_version 86990 (0.0007) [2023-03-07 01:02:32,930][81400] Updated weights for policy 0, policy_version 87000 (0.0006) [2023-03-07 01:02:33,710][81400] Updated weights for policy 0, policy_version 87010 (0.0006) [2023-03-07 01:02:34,500][81400] Updated weights for policy 0, policy_version 87020 (0.0006) [2023-03-07 01:02:35,274][81400] Updated weights for policy 0, policy_version 87030 (0.0006) [2023-03-07 01:02:36,040][81400] Updated weights for policy 0, policy_version 87040 (0.0006) [2023-03-07 01:02:36,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13192.5, 300 sec: 13183.6). Total num frames: 89131008. Throughput: 0: 13185.9. Samples: 89103439. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 01:02:36,237][81074] Avg episode reward: [(0, '2949.857')] [2023-03-07 01:02:36,814][81400] Updated weights for policy 0, policy_version 87050 (0.0005) [2023-03-07 01:02:37,580][81400] Updated weights for policy 0, policy_version 87060 (0.0006) [2023-03-07 01:02:38,351][81400] Updated weights for policy 0, policy_version 87070 (0.0006) [2023-03-07 01:02:39,111][81400] Updated weights for policy 0, policy_version 87080 (0.0006) [2023-03-07 01:02:39,883][81400] Updated weights for policy 0, policy_version 87090 (0.0006) [2023-03-07 01:02:40,669][81400] Updated weights for policy 0, policy_version 87100 (0.0006) [2023-03-07 01:02:41,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13192.5, 300 sec: 13187.0). Total num frames: 89197568. Throughput: 0: 13193.6. Samples: 89182931. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 01:02:41,237][81074] Avg episode reward: [(0, '3183.728')] [2023-03-07 01:02:41,459][81400] Updated weights for policy 0, policy_version 87110 (0.0006) [2023-03-07 01:02:42,229][81400] Updated weights for policy 0, policy_version 87120 (0.0008) [2023-03-07 01:02:43,015][81400] Updated weights for policy 0, policy_version 87130 (0.0006) [2023-03-07 01:02:43,800][81400] Updated weights for policy 0, policy_version 87140 (0.0005) [2023-03-07 01:02:44,562][81400] Updated weights for policy 0, policy_version 87150 (0.0006) [2023-03-07 01:02:45,338][81400] Updated weights for policy 0, policy_version 87160 (0.0006) [2023-03-07 01:02:46,115][81400] Updated weights for policy 0, policy_version 87170 (0.0006) [2023-03-07 01:02:46,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.6, 300 sec: 13183.6). Total num frames: 89263104. Throughput: 0: 13196.6. Samples: 89261951. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 01:02:46,237][81074] Avg episode reward: [(0, '3103.633')] [2023-03-07 01:02:46,882][81400] Updated weights for policy 0, policy_version 87180 (0.0006) [2023-03-07 01:02:47,659][81400] Updated weights for policy 0, policy_version 87190 (0.0006) [2023-03-07 01:02:48,425][81400] Updated weights for policy 0, policy_version 87200 (0.0006) [2023-03-07 01:02:49,213][81400] Updated weights for policy 0, policy_version 87210 (0.0006) [2023-03-07 01:02:49,976][81400] Updated weights for policy 0, policy_version 87220 (0.0006) [2023-03-07 01:02:50,759][81400] Updated weights for policy 0, policy_version 87230 (0.0006) [2023-03-07 01:02:51,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13192.5, 300 sec: 13183.6). Total num frames: 89329664. Throughput: 0: 13200.7. Samples: 89301741. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 01:02:51,237][81074] Avg episode reward: [(0, '2955.345')] [2023-03-07 01:02:51,546][81400] Updated weights for policy 0, policy_version 87240 (0.0006) [2023-03-07 01:02:52,315][81400] Updated weights for policy 0, policy_version 87250 (0.0006) [2023-03-07 01:02:53,093][81400] Updated weights for policy 0, policy_version 87260 (0.0006) [2023-03-07 01:02:53,869][81400] Updated weights for policy 0, policy_version 87270 (0.0006) [2023-03-07 01:02:54,658][81400] Updated weights for policy 0, policy_version 87280 (0.0006) [2023-03-07 01:02:55,431][81400] Updated weights for policy 0, policy_version 87290 (0.0006) [2023-03-07 01:02:56,202][81400] Updated weights for policy 0, policy_version 87300 (0.0006) [2023-03-07 01:02:56,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13192.5, 300 sec: 13183.6). Total num frames: 89395200. Throughput: 0: 13197.2. Samples: 89380728. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 01:02:56,237][81074] Avg episode reward: [(0, '2938.128')] [2023-03-07 01:02:57,012][81400] Updated weights for policy 0, policy_version 87310 (0.0006) [2023-03-07 01:02:57,764][81400] Updated weights for policy 0, policy_version 87320 (0.0006) [2023-03-07 01:02:58,551][81400] Updated weights for policy 0, policy_version 87330 (0.0006) [2023-03-07 01:02:59,332][81400] Updated weights for policy 0, policy_version 87340 (0.0006) [2023-03-07 01:03:00,105][81400] Updated weights for policy 0, policy_version 87350 (0.0006) [2023-03-07 01:03:00,894][81400] Updated weights for policy 0, policy_version 87360 (0.0006) [2023-03-07 01:03:01,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13192.5, 300 sec: 13180.1). Total num frames: 89460736. Throughput: 0: 13187.3. Samples: 89459408. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 01:03:01,237][81074] Avg episode reward: [(0, '2948.291')] [2023-03-07 01:03:01,678][81400] Updated weights for policy 0, policy_version 87370 (0.0006) [2023-03-07 01:03:02,443][81400] Updated weights for policy 0, policy_version 87380 (0.0006) [2023-03-07 01:03:03,231][81400] Updated weights for policy 0, policy_version 87390 (0.0006) [2023-03-07 01:03:04,025][81400] Updated weights for policy 0, policy_version 87400 (0.0006) [2023-03-07 01:03:04,822][81400] Updated weights for policy 0, policy_version 87410 (0.0006) [2023-03-07 01:03:05,577][81400] Updated weights for policy 0, policy_version 87420 (0.0006) [2023-03-07 01:03:06,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13175.5, 300 sec: 13180.1). Total num frames: 89526272. Throughput: 0: 13180.7. Samples: 89498721. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 01:03:06,237][81074] Avg episode reward: [(0, '3106.996')] [2023-03-07 01:03:06,361][81400] Updated weights for policy 0, policy_version 87430 (0.0006) [2023-03-07 01:03:07,140][81400] Updated weights for policy 0, policy_version 87440 (0.0006) [2023-03-07 01:03:07,910][81400] Updated weights for policy 0, policy_version 87450 (0.0006) [2023-03-07 01:03:08,687][81400] Updated weights for policy 0, policy_version 87460 (0.0006) [2023-03-07 01:03:09,472][81400] Updated weights for policy 0, policy_version 87470 (0.0006) [2023-03-07 01:03:10,253][81400] Updated weights for policy 0, policy_version 87480 (0.0006) [2023-03-07 01:03:11,030][81400] Updated weights for policy 0, policy_version 87490 (0.0006) [2023-03-07 01:03:11,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13175.5, 300 sec: 13176.6). Total num frames: 89591808. Throughput: 0: 13173.5. Samples: 89577582. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 01:03:11,237][81074] Avg episode reward: [(0, '3081.530')] [2023-03-07 01:03:11,812][81400] Updated weights for policy 0, policy_version 87500 (0.0006) [2023-03-07 01:03:12,583][81400] Updated weights for policy 0, policy_version 87510 (0.0006) [2023-03-07 01:03:13,367][81400] Updated weights for policy 0, policy_version 87520 (0.0007) [2023-03-07 01:03:14,160][81400] Updated weights for policy 0, policy_version 87530 (0.0006) [2023-03-07 01:03:14,934][81400] Updated weights for policy 0, policy_version 87540 (0.0006) [2023-03-07 01:03:15,735][81400] Updated weights for policy 0, policy_version 87550 (0.0006) [2023-03-07 01:03:16,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13175.5, 300 sec: 13173.2). Total num frames: 89657344. Throughput: 0: 13168.8. Samples: 89656391. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 01:03:16,237][81074] Avg episode reward: [(0, '3145.170')] [2023-03-07 01:03:16,513][81400] Updated weights for policy 0, policy_version 87560 (0.0007) [2023-03-07 01:03:17,282][81400] Updated weights for policy 0, policy_version 87570 (0.0006) [2023-03-07 01:03:18,065][81400] Updated weights for policy 0, policy_version 87580 (0.0006) [2023-03-07 01:03:18,843][81400] Updated weights for policy 0, policy_version 87590 (0.0006) [2023-03-07 01:03:19,624][81400] Updated weights for policy 0, policy_version 87600 (0.0006) [2023-03-07 01:03:20,407][81400] Updated weights for policy 0, policy_version 87610 (0.0006) [2023-03-07 01:03:21,197][81400] Updated weights for policy 0, policy_version 87620 (0.0006) [2023-03-07 01:03:21,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13173.2). Total num frames: 89722880. Throughput: 0: 13160.5. Samples: 89695662. Policy #0 lag: (min: 0.0, avg: 1.1, max: 2.0) [2023-03-07 01:03:21,237][81074] Avg episode reward: [(0, '2961.473')] [2023-03-07 01:03:21,967][81400] Updated weights for policy 0, policy_version 87630 (0.0006) [2023-03-07 01:03:22,738][81400] Updated weights for policy 0, policy_version 87640 (0.0006) [2023-03-07 01:03:23,519][81400] Updated weights for policy 0, policy_version 87650 (0.0005) [2023-03-07 01:03:24,305][81400] Updated weights for policy 0, policy_version 87660 (0.0007) [2023-03-07 01:03:25,071][81400] Updated weights for policy 0, policy_version 87670 (0.0006) [2023-03-07 01:03:25,850][81400] Updated weights for policy 0, policy_version 87680 (0.0006) [2023-03-07 01:03:26,236][81074] Fps is (10 sec: 13107.0, 60 sec: 13158.4, 300 sec: 13173.2). Total num frames: 89788416. Throughput: 0: 13145.4. Samples: 89774476. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:03:26,237][81074] Avg episode reward: [(0, '3066.840')] [2023-03-07 01:03:26,624][81400] Updated weights for policy 0, policy_version 87690 (0.0005) [2023-03-07 01:03:27,413][81400] Updated weights for policy 0, policy_version 87700 (0.0006) [2023-03-07 01:03:28,179][81400] Updated weights for policy 0, policy_version 87710 (0.0006) [2023-03-07 01:03:28,962][81400] Updated weights for policy 0, policy_version 87720 (0.0005) [2023-03-07 01:03:29,734][81400] Updated weights for policy 0, policy_version 87730 (0.0006) [2023-03-07 01:03:30,505][81400] Updated weights for policy 0, policy_version 87740 (0.0006) [2023-03-07 01:03:31,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13158.4, 300 sec: 13173.2). Total num frames: 89854976. Throughput: 0: 13145.9. Samples: 89853520. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:03:31,237][81074] Avg episode reward: [(0, '2724.765')] [2023-03-07 01:03:31,293][81400] Updated weights for policy 0, policy_version 87750 (0.0006) [2023-03-07 01:03:32,057][81400] Updated weights for policy 0, policy_version 87760 (0.0006) [2023-03-07 01:03:32,862][81400] Updated weights for policy 0, policy_version 87770 (0.0006) [2023-03-07 01:03:33,631][81400] Updated weights for policy 0, policy_version 87780 (0.0007) [2023-03-07 01:03:34,413][81400] Updated weights for policy 0, policy_version 87790 (0.0006) [2023-03-07 01:03:35,191][81400] Updated weights for policy 0, policy_version 87800 (0.0006) [2023-03-07 01:03:35,970][81400] Updated weights for policy 0, policy_version 87810 (0.0006) [2023-03-07 01:03:36,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13173.2). Total num frames: 89920512. Throughput: 0: 13135.0. Samples: 89892817. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:03:36,237][81074] Avg episode reward: [(0, '3010.832')] [2023-03-07 01:03:36,762][81400] Updated weights for policy 0, policy_version 87820 (0.0006) [2023-03-07 01:03:37,545][81400] Updated weights for policy 0, policy_version 87830 (0.0007) [2023-03-07 01:03:38,318][81400] Updated weights for policy 0, policy_version 87840 (0.0006) [2023-03-07 01:03:39,103][81400] Updated weights for policy 0, policy_version 87850 (0.0006) [2023-03-07 01:03:39,884][81400] Updated weights for policy 0, policy_version 87860 (0.0006) [2023-03-07 01:03:40,669][81400] Updated weights for policy 0, policy_version 87870 (0.0007) [2023-03-07 01:03:41,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13169.7). Total num frames: 89986048. Throughput: 0: 13130.2. Samples: 89971587. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:03:41,237][81074] Avg episode reward: [(0, '3013.473')] [2023-03-07 01:03:41,446][81400] Updated weights for policy 0, policy_version 87880 (0.0006) [2023-03-07 01:03:42,227][81400] Updated weights for policy 0, policy_version 87890 (0.0007) [2023-03-07 01:03:43,016][81400] Updated weights for policy 0, policy_version 87900 (0.0006) [2023-03-07 01:03:43,790][81400] Updated weights for policy 0, policy_version 87910 (0.0006) [2023-03-07 01:03:44,565][81400] Updated weights for policy 0, policy_version 87920 (0.0005) [2023-03-07 01:03:45,335][81400] Updated weights for policy 0, policy_version 87930 (0.0007) [2023-03-07 01:03:46,104][81400] Updated weights for policy 0, policy_version 87940 (0.0006) [2023-03-07 01:03:46,236][81074] Fps is (10 sec: 13107.0, 60 sec: 13141.3, 300 sec: 13169.7). Total num frames: 90051584. Throughput: 0: 13138.9. Samples: 90050658. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:03:46,237][81074] Avg episode reward: [(0, '2796.090')] [2023-03-07 01:03:46,874][81400] Updated weights for policy 0, policy_version 87950 (0.0006) [2023-03-07 01:03:47,680][81400] Updated weights for policy 0, policy_version 87960 (0.0006) [2023-03-07 01:03:48,447][81400] Updated weights for policy 0, policy_version 87970 (0.0006) [2023-03-07 01:03:49,237][81400] Updated weights for policy 0, policy_version 87980 (0.0006) [2023-03-07 01:03:50,011][81400] Updated weights for policy 0, policy_version 87990 (0.0006) [2023-03-07 01:03:50,786][81400] Updated weights for policy 0, policy_version 88000 (0.0006) [2023-03-07 01:03:51,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13166.2). Total num frames: 90117120. Throughput: 0: 13135.9. Samples: 90089834. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:03:51,237][81074] Avg episode reward: [(0, '2964.874')] [2023-03-07 01:03:51,577][81400] Updated weights for policy 0, policy_version 88010 (0.0006) [2023-03-07 01:03:52,344][81400] Updated weights for policy 0, policy_version 88020 (0.0006) [2023-03-07 01:03:53,098][81400] Updated weights for policy 0, policy_version 88030 (0.0006) [2023-03-07 01:03:53,886][81400] Updated weights for policy 0, policy_version 88040 (0.0006) [2023-03-07 01:03:54,656][81400] Updated weights for policy 0, policy_version 88050 (0.0007) [2023-03-07 01:03:55,438][81400] Updated weights for policy 0, policy_version 88060 (0.0006) [2023-03-07 01:03:56,220][81400] Updated weights for policy 0, policy_version 88070 (0.0006) [2023-03-07 01:03:56,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13141.3, 300 sec: 13169.7). Total num frames: 90183680. Throughput: 0: 13146.4. Samples: 90169170. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:03:56,237][81074] Avg episode reward: [(0, '3282.637')] [2023-03-07 01:03:56,242][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000088070_90183680.pth... [2023-03-07 01:03:56,272][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000084984_87023616.pth [2023-03-07 01:03:56,989][81400] Updated weights for policy 0, policy_version 88080 (0.0006) [2023-03-07 01:03:57,770][81400] Updated weights for policy 0, policy_version 88090 (0.0006) [2023-03-07 01:03:58,533][81400] Updated weights for policy 0, policy_version 88100 (0.0006) [2023-03-07 01:03:59,317][81400] Updated weights for policy 0, policy_version 88110 (0.0006) [2023-03-07 01:04:00,092][81400] Updated weights for policy 0, policy_version 88120 (0.0006) [2023-03-07 01:04:00,862][81400] Updated weights for policy 0, policy_version 88130 (0.0007) [2023-03-07 01:04:01,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13141.3, 300 sec: 13166.2). Total num frames: 90249216. Throughput: 0: 13153.7. Samples: 90248308. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:04:01,237][81074] Avg episode reward: [(0, '3265.932')] [2023-03-07 01:04:01,655][81400] Updated weights for policy 0, policy_version 88140 (0.0009) [2023-03-07 01:04:02,445][81400] Updated weights for policy 0, policy_version 88150 (0.0006) [2023-03-07 01:04:03,222][81400] Updated weights for policy 0, policy_version 88160 (0.0007) [2023-03-07 01:04:04,012][81400] Updated weights for policy 0, policy_version 88170 (0.0005) [2023-03-07 01:04:04,777][81400] Updated weights for policy 0, policy_version 88180 (0.0006) [2023-03-07 01:04:05,545][81400] Updated weights for policy 0, policy_version 88190 (0.0006) [2023-03-07 01:04:06,236][81074] Fps is (10 sec: 13107.4, 60 sec: 13141.3, 300 sec: 13166.2). Total num frames: 90314752. Throughput: 0: 13149.3. 
Samples: 90287379. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:04:06,237][81074] Avg episode reward: [(0, '3059.597')] [2023-03-07 01:04:06,335][81400] Updated weights for policy 0, policy_version 88200 (0.0006) [2023-03-07 01:04:07,102][81400] Updated weights for policy 0, policy_version 88210 (0.0005) [2023-03-07 01:04:07,900][81400] Updated weights for policy 0, policy_version 88220 (0.0006) [2023-03-07 01:04:08,664][81400] Updated weights for policy 0, policy_version 88230 (0.0005) [2023-03-07 01:04:09,446][81400] Updated weights for policy 0, policy_version 88240 (0.0006) [2023-03-07 01:04:10,241][81400] Updated weights for policy 0, policy_version 88250 (0.0006) [2023-03-07 01:04:11,023][81400] Updated weights for policy 0, policy_version 88260 (0.0007) [2023-03-07 01:04:11,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13162.7). Total num frames: 90380288. Throughput: 0: 13151.6. Samples: 90366296. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:04:11,237][81074] Avg episode reward: [(0, '3081.782')] [2023-03-07 01:04:11,798][81400] Updated weights for policy 0, policy_version 88270 (0.0006) [2023-03-07 01:04:12,582][81400] Updated weights for policy 0, policy_version 88280 (0.0006) [2023-03-07 01:04:13,364][81400] Updated weights for policy 0, policy_version 88290 (0.0007) [2023-03-07 01:04:14,141][81400] Updated weights for policy 0, policy_version 88300 (0.0006) [2023-03-07 01:04:14,921][81400] Updated weights for policy 0, policy_version 88310 (0.0006) [2023-03-07 01:04:15,686][81400] Updated weights for policy 0, policy_version 88320 (0.0006) [2023-03-07 01:04:16,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13162.7). Total num frames: 90445824. Throughput: 0: 13146.7. Samples: 90445120. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:04:16,237][81074] Avg episode reward: [(0, '3002.025')] [2023-03-07 01:04:16,480][81400] Updated weights for policy 0, policy_version 88330 (0.0006) [2023-03-07 01:04:17,266][81400] Updated weights for policy 0, policy_version 88340 (0.0006) [2023-03-07 01:04:18,054][81400] Updated weights for policy 0, policy_version 88350 (0.0006) [2023-03-07 01:04:18,830][81400] Updated weights for policy 0, policy_version 88360 (0.0006) [2023-03-07 01:04:19,626][81400] Updated weights for policy 0, policy_version 88370 (0.0006) [2023-03-07 01:04:20,384][81400] Updated weights for policy 0, policy_version 88380 (0.0006) [2023-03-07 01:04:21,159][81400] Updated weights for policy 0, policy_version 88390 (0.0006) [2023-03-07 01:04:21,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13162.7). Total num frames: 90511360. Throughput: 0: 13141.0. Samples: 90484162. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:04:21,237][81074] Avg episode reward: [(0, '2943.134')] [2023-03-07 01:04:21,936][81400] Updated weights for policy 0, policy_version 88400 (0.0005) [2023-03-07 01:04:22,698][81400] Updated weights for policy 0, policy_version 88410 (0.0007) [2023-03-07 01:04:23,474][81400] Updated weights for policy 0, policy_version 88420 (0.0006) [2023-03-07 01:04:24,245][81400] Updated weights for policy 0, policy_version 88430 (0.0006) [2023-03-07 01:04:25,042][81400] Updated weights for policy 0, policy_version 88440 (0.0006) [2023-03-07 01:04:25,818][81400] Updated weights for policy 0, policy_version 88450 (0.0005) [2023-03-07 01:04:26,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13162.7). Total num frames: 90577920. Throughput: 0: 13151.2. Samples: 90563392. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:04:26,237][81074] Avg episode reward: [(0, '3016.491')] [2023-03-07 01:04:26,590][81400] Updated weights for policy 0, policy_version 88460 (0.0005) [2023-03-07 01:04:27,383][81400] Updated weights for policy 0, policy_version 88470 (0.0006) [2023-03-07 01:04:28,153][81400] Updated weights for policy 0, policy_version 88480 (0.0005) [2023-03-07 01:04:28,950][81400] Updated weights for policy 0, policy_version 88490 (0.0008) [2023-03-07 01:04:29,739][81400] Updated weights for policy 0, policy_version 88500 (0.0006) [2023-03-07 01:04:30,506][81400] Updated weights for policy 0, policy_version 88510 (0.0006) [2023-03-07 01:04:31,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13141.3, 300 sec: 13162.7). Total num frames: 90643456. Throughput: 0: 13142.3. Samples: 90642062. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:04:31,237][81074] Avg episode reward: [(0, '3004.580')] [2023-03-07 01:04:31,301][81400] Updated weights for policy 0, policy_version 88520 (0.0006) [2023-03-07 01:04:32,058][81400] Updated weights for policy 0, policy_version 88530 (0.0007) [2023-03-07 01:04:32,830][81400] Updated weights for policy 0, policy_version 88540 (0.0006) [2023-03-07 01:04:33,621][81400] Updated weights for policy 0, policy_version 88550 (0.0008) [2023-03-07 01:04:34,401][81400] Updated weights for policy 0, policy_version 88560 (0.0006) [2023-03-07 01:04:35,180][81400] Updated weights for policy 0, policy_version 88570 (0.0006) [2023-03-07 01:04:35,981][81400] Updated weights for policy 0, policy_version 88580 (0.0007) [2023-03-07 01:04:36,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13162.7). Total num frames: 90708992. Throughput: 0: 13150.0. Samples: 90681583. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:04:36,237][81074] Avg episode reward: [(0, '2906.733')] [2023-03-07 01:04:36,756][81400] Updated weights for policy 0, policy_version 88590 (0.0007) [2023-03-07 01:04:37,523][81400] Updated weights for policy 0, policy_version 88600 (0.0006) [2023-03-07 01:04:38,298][81400] Updated weights for policy 0, policy_version 88610 (0.0006) [2023-03-07 01:04:39,082][81400] Updated weights for policy 0, policy_version 88620 (0.0006) [2023-03-07 01:04:39,879][81400] Updated weights for policy 0, policy_version 88630 (0.0006) [2023-03-07 01:04:40,650][81400] Updated weights for policy 0, policy_version 88640 (0.0006) [2023-03-07 01:04:41,236][81074] Fps is (10 sec: 13107.4, 60 sec: 13141.3, 300 sec: 13162.7). Total num frames: 90774528. Throughput: 0: 13133.7. Samples: 90760183. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:04:41,237][81074] Avg episode reward: [(0, '3100.758')] [2023-03-07 01:04:41,426][81400] Updated weights for policy 0, policy_version 88650 (0.0005) [2023-03-07 01:04:42,210][81400] Updated weights for policy 0, policy_version 88660 (0.0006) [2023-03-07 01:04:42,983][81400] Updated weights for policy 0, policy_version 88670 (0.0006) [2023-03-07 01:04:43,786][81400] Updated weights for policy 0, policy_version 88680 (0.0007) [2023-03-07 01:04:44,557][81400] Updated weights for policy 0, policy_version 88690 (0.0006) [2023-03-07 01:04:45,345][81400] Updated weights for policy 0, policy_version 88700 (0.0006) [2023-03-07 01:04:46,118][81400] Updated weights for policy 0, policy_version 88710 (0.0006) [2023-03-07 01:04:46,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13141.4, 300 sec: 13159.3). Total num frames: 90840064. Throughput: 0: 13122.0. Samples: 90838799. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:04:46,237][81074] Avg episode reward: [(0, '3265.760')] [2023-03-07 01:04:46,909][81400] Updated weights for policy 0, policy_version 88720 (0.0006) [2023-03-07 01:04:47,697][81400] Updated weights for policy 0, policy_version 88730 (0.0007) [2023-03-07 01:04:48,469][81400] Updated weights for policy 0, policy_version 88740 (0.0006) [2023-03-07 01:04:49,235][81400] Updated weights for policy 0, policy_version 88750 (0.0005) [2023-03-07 01:04:50,008][81400] Updated weights for policy 0, policy_version 88760 (0.0006) [2023-03-07 01:04:50,785][81400] Updated weights for policy 0, policy_version 88770 (0.0006) [2023-03-07 01:04:51,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13166.2). Total num frames: 90906624. Throughput: 0: 13128.7. Samples: 90878170. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:04:51,237][81074] Avg episode reward: [(0, '2952.432')] [2023-03-07 01:04:51,557][81400] Updated weights for policy 0, policy_version 88780 (0.0006) [2023-03-07 01:04:52,318][81400] Updated weights for policy 0, policy_version 88790 (0.0006) [2023-03-07 01:04:53,109][81400] Updated weights for policy 0, policy_version 88800 (0.0006) [2023-03-07 01:04:53,890][81400] Updated weights for policy 0, policy_version 88810 (0.0007) [2023-03-07 01:04:54,658][81400] Updated weights for policy 0, policy_version 88820 (0.0006) [2023-03-07 01:04:55,459][81400] Updated weights for policy 0, policy_version 88830 (0.0006) [2023-03-07 01:04:56,215][81400] Updated weights for policy 0, policy_version 88840 (0.0006) [2023-03-07 01:04:56,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13141.4, 300 sec: 13162.7). Total num frames: 90972160. Throughput: 0: 13135.2. Samples: 90957380. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:04:56,237][81074] Avg episode reward: [(0, '3145.952')] [2023-03-07 01:04:56,999][81400] Updated weights for policy 0, policy_version 88850 (0.0006) [2023-03-07 01:04:57,765][81400] Updated weights for policy 0, policy_version 88860 (0.0005) [2023-03-07 01:04:58,546][81400] Updated weights for policy 0, policy_version 88870 (0.0006) [2023-03-07 01:04:59,338][81400] Updated weights for policy 0, policy_version 88880 (0.0006) [2023-03-07 01:05:00,110][81400] Updated weights for policy 0, policy_version 88890 (0.0006) [2023-03-07 01:05:00,894][81400] Updated weights for policy 0, policy_version 88900 (0.0007) [2023-03-07 01:05:01,236][81074] Fps is (10 sec: 13107.0, 60 sec: 13141.3, 300 sec: 13162.7). Total num frames: 91037696. Throughput: 0: 13138.3. Samples: 91036343. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:05:01,237][81074] Avg episode reward: [(0, '2851.354')] [2023-03-07 01:05:01,679][81400] Updated weights for policy 0, policy_version 88910 (0.0007) [2023-03-07 01:05:02,466][81400] Updated weights for policy 0, policy_version 88920 (0.0006) [2023-03-07 01:05:03,255][81400] Updated weights for policy 0, policy_version 88930 (0.0007) [2023-03-07 01:05:04,017][81400] Updated weights for policy 0, policy_version 88940 (0.0006) [2023-03-07 01:05:04,778][81400] Updated weights for policy 0, policy_version 88950 (0.0006) [2023-03-07 01:05:05,589][81400] Updated weights for policy 0, policy_version 88960 (0.0006) [2023-03-07 01:05:06,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13162.7). Total num frames: 91103232. Throughput: 0: 13142.8. Samples: 91075591. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:05:06,237][81074] Avg episode reward: [(0, '2587.626')] [2023-03-07 01:05:06,363][81400] Updated weights for policy 0, policy_version 88970 (0.0006) [2023-03-07 01:05:07,126][81400] Updated weights for policy 0, policy_version 88980 (0.0006) [2023-03-07 01:05:07,901][81400] Updated weights for policy 0, policy_version 88990 (0.0006) [2023-03-07 01:05:08,697][81400] Updated weights for policy 0, policy_version 89000 (0.0006) [2023-03-07 01:05:09,462][81400] Updated weights for policy 0, policy_version 89010 (0.0006) [2023-03-07 01:05:10,235][81400] Updated weights for policy 0, policy_version 89020 (0.0006) [2023-03-07 01:05:11,014][81400] Updated weights for policy 0, policy_version 89030 (0.0006) [2023-03-07 01:05:11,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13141.4, 300 sec: 13159.3). Total num frames: 91168768. Throughput: 0: 13142.7. Samples: 91154813. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:05:11,237][81074] Avg episode reward: [(0, '2643.737')] [2023-03-07 01:05:11,764][81400] Updated weights for policy 0, policy_version 89040 (0.0007) [2023-03-07 01:05:12,546][81400] Updated weights for policy 0, policy_version 89050 (0.0006) [2023-03-07 01:05:13,344][81400] Updated weights for policy 0, policy_version 89060 (0.0006) [2023-03-07 01:05:14,110][81400] Updated weights for policy 0, policy_version 89070 (0.0006) [2023-03-07 01:05:14,884][81400] Updated weights for policy 0, policy_version 89080 (0.0006) [2023-03-07 01:05:15,655][81400] Updated weights for policy 0, policy_version 89090 (0.0006) [2023-03-07 01:05:16,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13162.7). Total num frames: 91235328. Throughput: 0: 13152.9. Samples: 91233942. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:05:16,237][81074] Avg episode reward: [(0, '2904.929')] [2023-03-07 01:05:16,431][81400] Updated weights for policy 0, policy_version 89100 (0.0006) [2023-03-07 01:05:17,215][81400] Updated weights for policy 0, policy_version 89110 (0.0006) [2023-03-07 01:05:17,998][81400] Updated weights for policy 0, policy_version 89120 (0.0006) [2023-03-07 01:05:18,770][81400] Updated weights for policy 0, policy_version 89130 (0.0007) [2023-03-07 01:05:19,538][81400] Updated weights for policy 0, policy_version 89140 (0.0006) [2023-03-07 01:05:20,321][81400] Updated weights for policy 0, policy_version 89150 (0.0007) [2023-03-07 01:05:21,109][81400] Updated weights for policy 0, policy_version 89160 (0.0006) [2023-03-07 01:05:21,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13159.3). Total num frames: 91300864. Throughput: 0: 13151.9. Samples: 91273418. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:05:21,237][81074] Avg episode reward: [(0, '2867.709')] [2023-03-07 01:05:21,894][81400] Updated weights for policy 0, policy_version 89170 (0.0006) [2023-03-07 01:05:22,657][81400] Updated weights for policy 0, policy_version 89180 (0.0006) [2023-03-07 01:05:23,426][81400] Updated weights for policy 0, policy_version 89190 (0.0006) [2023-03-07 01:05:24,205][81400] Updated weights for policy 0, policy_version 89200 (0.0006) [2023-03-07 01:05:24,975][81400] Updated weights for policy 0, policy_version 89210 (0.0007) [2023-03-07 01:05:25,754][81400] Updated weights for policy 0, policy_version 89220 (0.0006) [2023-03-07 01:05:26,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13166.2). Total num frames: 91367424. Throughput: 0: 13168.6. Samples: 91352772. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:05:26,237][81074] Avg episode reward: [(0, '2743.348')] [2023-03-07 01:05:26,529][81400] Updated weights for policy 0, policy_version 89230 (0.0006) [2023-03-07 01:05:27,315][81400] Updated weights for policy 0, policy_version 89240 (0.0006) [2023-03-07 01:05:28,069][81400] Updated weights for policy 0, policy_version 89250 (0.0006) [2023-03-07 01:05:28,847][81400] Updated weights for policy 0, policy_version 89260 (0.0005) [2023-03-07 01:05:29,635][81400] Updated weights for policy 0, policy_version 89270 (0.0006) [2023-03-07 01:05:30,420][81400] Updated weights for policy 0, policy_version 89280 (0.0006) [2023-03-07 01:05:31,207][81400] Updated weights for policy 0, policy_version 89290 (0.0006) [2023-03-07 01:05:31,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13162.7). Total num frames: 91432960. Throughput: 0: 13176.0. Samples: 91431721. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:05:31,237][81074] Avg episode reward: [(0, '2881.679')] [2023-03-07 01:05:31,989][81400] Updated weights for policy 0, policy_version 89300 (0.0006) [2023-03-07 01:05:32,759][81400] Updated weights for policy 0, policy_version 89310 (0.0006) [2023-03-07 01:05:33,508][81400] Updated weights for policy 0, policy_version 89320 (0.0006) [2023-03-07 01:05:34,305][81400] Updated weights for policy 0, policy_version 89330 (0.0006) [2023-03-07 01:05:35,068][81400] Updated weights for policy 0, policy_version 89340 (0.0006) [2023-03-07 01:05:35,853][81400] Updated weights for policy 0, policy_version 89350 (0.0006) [2023-03-07 01:05:36,236][81074] Fps is (10 sec: 13107.0, 60 sec: 13158.4, 300 sec: 13162.7). Total num frames: 91498496. Throughput: 0: 13182.8. Samples: 91471399. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:05:36,237][81074] Avg episode reward: [(0, '2953.392')] [2023-03-07 01:05:36,623][81400] Updated weights for policy 0, policy_version 89360 (0.0006) [2023-03-07 01:05:37,409][81400] Updated weights for policy 0, policy_version 89370 (0.0006) [2023-03-07 01:05:38,181][81400] Updated weights for policy 0, policy_version 89380 (0.0006) [2023-03-07 01:05:38,961][81400] Updated weights for policy 0, policy_version 89390 (0.0006) [2023-03-07 01:05:39,745][81400] Updated weights for policy 0, policy_version 89400 (0.0006) [2023-03-07 01:05:40,520][81400] Updated weights for policy 0, policy_version 89410 (0.0006) [2023-03-07 01:05:41,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13162.7). Total num frames: 91565056. Throughput: 0: 13178.8. Samples: 91550425. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:05:41,237][81074] Avg episode reward: [(0, '2864.248')] [2023-03-07 01:05:41,316][81400] Updated weights for policy 0, policy_version 89420 (0.0006) [2023-03-07 01:05:42,074][81400] Updated weights for policy 0, policy_version 89430 (0.0005) [2023-03-07 01:05:42,849][81400] Updated weights for policy 0, policy_version 89440 (0.0006) [2023-03-07 01:05:43,626][81400] Updated weights for policy 0, policy_version 89450 (0.0006) [2023-03-07 01:05:44,422][81400] Updated weights for policy 0, policy_version 89460 (0.0006) [2023-03-07 01:05:45,193][81400] Updated weights for policy 0, policy_version 89470 (0.0006) [2023-03-07 01:05:45,960][81400] Updated weights for policy 0, policy_version 89480 (0.0007) [2023-03-07 01:05:46,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.4, 300 sec: 13162.7). Total num frames: 91630592. Throughput: 0: 13177.3. Samples: 91629324. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:05:46,237][81074] Avg episode reward: [(0, '2757.029')] [2023-03-07 01:05:46,745][81400] Updated weights for policy 0, policy_version 89490 (0.0006) [2023-03-07 01:05:47,520][81400] Updated weights for policy 0, policy_version 89500 (0.0006) [2023-03-07 01:05:48,297][81400] Updated weights for policy 0, policy_version 89510 (0.0006) [2023-03-07 01:05:49,082][81400] Updated weights for policy 0, policy_version 89520 (0.0006) [2023-03-07 01:05:49,881][81400] Updated weights for policy 0, policy_version 89530 (0.0006) [2023-03-07 01:05:50,645][81400] Updated weights for policy 0, policy_version 89540 (0.0006) [2023-03-07 01:05:51,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13162.7). Total num frames: 91696128. Throughput: 0: 13181.1. Samples: 91668742. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:05:51,237][81074] Avg episode reward: [(0, '2714.689')] [2023-03-07 01:05:51,425][81400] Updated weights for policy 0, policy_version 89550 (0.0006) [2023-03-07 01:05:52,203][81400] Updated weights for policy 0, policy_version 89560 (0.0006) [2023-03-07 01:05:52,988][81400] Updated weights for policy 0, policy_version 89570 (0.0006) [2023-03-07 01:05:53,770][81400] Updated weights for policy 0, policy_version 89580 (0.0006) [2023-03-07 01:05:54,550][81400] Updated weights for policy 0, policy_version 89590 (0.0006) [2023-03-07 01:05:55,325][81400] Updated weights for policy 0, policy_version 89600 (0.0006) [2023-03-07 01:05:56,105][81400] Updated weights for policy 0, policy_version 89610 (0.0005) [2023-03-07 01:05:56,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13159.3). Total num frames: 91761664. Throughput: 0: 13172.7. Samples: 91747586. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:05:56,237][81074] Avg episode reward: [(0, '2857.719')] [2023-03-07 01:05:56,256][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000089612_91762688.pth... [2023-03-07 01:05:56,285][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000086527_88603648.pth [2023-03-07 01:05:56,871][81400] Updated weights for policy 0, policy_version 89620 (0.0006) [2023-03-07 01:05:57,665][81400] Updated weights for policy 0, policy_version 89630 (0.0006) [2023-03-07 01:05:58,447][81400] Updated weights for policy 0, policy_version 89640 (0.0006) [2023-03-07 01:05:59,215][81400] Updated weights for policy 0, policy_version 89650 (0.0007) [2023-03-07 01:05:59,990][81400] Updated weights for policy 0, policy_version 89660 (0.0005) [2023-03-07 01:06:00,762][81400] Updated weights for policy 0, policy_version 89670 (0.0006) [2023-03-07 01:06:01,236][81074] Fps is (10 sec: 13209.8, 60 sec: 13175.5, 300 sec: 13162.7). Total num frames: 91828224. Throughput: 0: 13169.3. Samples: 91826560. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:06:01,237][81074] Avg episode reward: [(0, '2709.943')] [2023-03-07 01:06:01,533][81400] Updated weights for policy 0, policy_version 89680 (0.0006) [2023-03-07 01:06:02,311][81400] Updated weights for policy 0, policy_version 89690 (0.0006) [2023-03-07 01:06:03,091][81400] Updated weights for policy 0, policy_version 89700 (0.0006) [2023-03-07 01:06:03,874][81400] Updated weights for policy 0, policy_version 89710 (0.0006) [2023-03-07 01:06:04,640][81400] Updated weights for policy 0, policy_version 89720 (0.0006) [2023-03-07 01:06:05,416][81400] Updated weights for policy 0, policy_version 89730 (0.0006) [2023-03-07 01:06:06,165][81400] Updated weights for policy 0, policy_version 89740 (0.0006) [2023-03-07 01:06:06,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13162.7). Total num frames: 91893760. Throughput: 0: 13174.0. Samples: 91866250. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 01:06:06,237][81074] Avg episode reward: [(0, '2963.646')] [2023-03-07 01:06:06,930][81400] Updated weights for policy 0, policy_version 89750 (0.0007) [2023-03-07 01:06:07,728][81400] Updated weights for policy 0, policy_version 89760 (0.0007) [2023-03-07 01:06:08,502][81400] Updated weights for policy 0, policy_version 89770 (0.0007) [2023-03-07 01:06:09,294][81400] Updated weights for policy 0, policy_version 89780 (0.0006) [2023-03-07 01:06:10,063][81400] Updated weights for policy 0, policy_version 89790 (0.0007) [2023-03-07 01:06:10,852][81400] Updated weights for policy 0, policy_version 89800 (0.0006) [2023-03-07 01:06:11,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13166.2). Total num frames: 91960320. Throughput: 0: 13171.3. Samples: 91945481. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 01:06:11,237][81074] Avg episode reward: [(0, '2698.647')] [2023-03-07 01:06:11,625][81400] Updated weights for policy 0, policy_version 89810 (0.0007) [2023-03-07 01:06:12,401][81400] Updated weights for policy 0, policy_version 89820 (0.0006) [2023-03-07 01:06:13,189][81400] Updated weights for policy 0, policy_version 89830 (0.0007) [2023-03-07 01:06:13,954][81400] Updated weights for policy 0, policy_version 89840 (0.0006) [2023-03-07 01:06:14,725][81400] Updated weights for policy 0, policy_version 89850 (0.0006) [2023-03-07 01:06:15,494][81400] Updated weights for policy 0, policy_version 89860 (0.0006) [2023-03-07 01:06:16,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.4, 300 sec: 13162.7). Total num frames: 92025856. Throughput: 0: 13173.6. Samples: 92024534. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 01:06:16,237][81074] Avg episode reward: [(0, '2821.609')] [2023-03-07 01:06:16,292][81400] Updated weights for policy 0, policy_version 89870 (0.0007) [2023-03-07 01:06:17,063][81400] Updated weights for policy 0, policy_version 89880 (0.0007) [2023-03-07 01:06:17,831][81400] Updated weights for policy 0, policy_version 89890 (0.0006) [2023-03-07 01:06:18,608][81400] Updated weights for policy 0, policy_version 89900 (0.0007) [2023-03-07 01:06:19,382][81400] Updated weights for policy 0, policy_version 89910 (0.0006) [2023-03-07 01:06:20,156][81400] Updated weights for policy 0, policy_version 89920 (0.0007) [2023-03-07 01:06:20,940][81400] Updated weights for policy 0, policy_version 89930 (0.0006) [2023-03-07 01:06:21,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13175.5, 300 sec: 13162.7). Total num frames: 92091392. Throughput: 0: 13171.5. Samples: 92064114. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 01:06:21,237][81074] Avg episode reward: [(0, '2990.294')] [2023-03-07 01:06:21,701][81400] Updated weights for policy 0, policy_version 89940 (0.0007) [2023-03-07 01:06:22,488][81400] Updated weights for policy 0, policy_version 89950 (0.0007) [2023-03-07 01:06:23,262][81400] Updated weights for policy 0, policy_version 89960 (0.0006) [2023-03-07 01:06:24,033][81400] Updated weights for policy 0, policy_version 89970 (0.0006) [2023-03-07 01:06:24,795][81400] Updated weights for policy 0, policy_version 89980 (0.0006) [2023-03-07 01:06:25,580][81400] Updated weights for policy 0, policy_version 89990 (0.0006) [2023-03-07 01:06:26,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13166.2). Total num frames: 92157952. Throughput: 0: 13180.3. Samples: 92143539. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 01:06:26,237][81074] Avg episode reward: [(0, '2749.036')] [2023-03-07 01:06:26,338][81400] Updated weights for policy 0, policy_version 90000 (0.0006) [2023-03-07 01:06:27,109][81400] Updated weights for policy 0, policy_version 90010 (0.0005) [2023-03-07 01:06:27,899][81400] Updated weights for policy 0, policy_version 90020 (0.0006) [2023-03-07 01:06:28,681][81400] Updated weights for policy 0, policy_version 90030 (0.0006) [2023-03-07 01:06:29,466][81400] Updated weights for policy 0, policy_version 90040 (0.0006) [2023-03-07 01:06:30,238][81400] Updated weights for policy 0, policy_version 90050 (0.0007) [2023-03-07 01:06:31,013][81400] Updated weights for policy 0, policy_version 90060 (0.0006) [2023-03-07 01:06:31,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13175.5, 300 sec: 13166.2). Total num frames: 92223488. Throughput: 0: 13188.5. Samples: 92222805. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 01:06:31,237][81074] Avg episode reward: [(0, '3037.970')] [2023-03-07 01:06:31,763][81400] Updated weights for policy 0, policy_version 90070 (0.0005) [2023-03-07 01:06:32,537][81400] Updated weights for policy 0, policy_version 90080 (0.0006) [2023-03-07 01:06:33,323][81400] Updated weights for policy 0, policy_version 90090 (0.0006) [2023-03-07 01:06:34,097][81400] Updated weights for policy 0, policy_version 90100 (0.0006) [2023-03-07 01:06:34,864][81400] Updated weights for policy 0, policy_version 90110 (0.0006) [2023-03-07 01:06:35,645][81400] Updated weights for policy 0, policy_version 90120 (0.0006) [2023-03-07 01:06:36,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13192.6, 300 sec: 13166.2). Total num frames: 92290048. Throughput: 0: 13194.3. Samples: 92262484. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 01:06:36,237][81074] Avg episode reward: [(0, '2754.474')] [2023-03-07 01:06:36,430][81400] Updated weights for policy 0, policy_version 90130 (0.0006) [2023-03-07 01:06:37,193][81400] Updated weights for policy 0, policy_version 90140 (0.0005) [2023-03-07 01:06:37,964][81400] Updated weights for policy 0, policy_version 90150 (0.0006) [2023-03-07 01:06:38,734][81400] Updated weights for policy 0, policy_version 90160 (0.0007) [2023-03-07 01:06:39,515][81400] Updated weights for policy 0, policy_version 90170 (0.0007) [2023-03-07 01:06:40,298][81400] Updated weights for policy 0, policy_version 90180 (0.0006) [2023-03-07 01:06:41,075][81400] Updated weights for policy 0, policy_version 90190 (0.0006) [2023-03-07 01:06:41,236][81074] Fps is (10 sec: 13312.1, 60 sec: 13192.5, 300 sec: 13169.7). Total num frames: 92356608. Throughput: 0: 13204.9. Samples: 92341804. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 01:06:41,237][81074] Avg episode reward: [(0, '3069.780')] [2023-03-07 01:06:41,870][81400] Updated weights for policy 0, policy_version 90200 (0.0005) [2023-03-07 01:06:42,648][81400] Updated weights for policy 0, policy_version 90210 (0.0006) [2023-03-07 01:06:43,409][81400] Updated weights for policy 0, policy_version 90220 (0.0006) [2023-03-07 01:06:44,190][81400] Updated weights for policy 0, policy_version 90230 (0.0006) [2023-03-07 01:06:44,965][81400] Updated weights for policy 0, policy_version 90240 (0.0006) [2023-03-07 01:06:45,737][81400] Updated weights for policy 0, policy_version 90250 (0.0006) [2023-03-07 01:06:46,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13192.5, 300 sec: 13166.2). Total num frames: 92422144. Throughput: 0: 13204.1. Samples: 92420744. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 01:06:46,237][81074] Avg episode reward: [(0, '2970.655')] [2023-03-07 01:06:46,529][81400] Updated weights for policy 0, policy_version 90260 (0.0006) [2023-03-07 01:06:47,307][81400] Updated weights for policy 0, policy_version 90270 (0.0007) [2023-03-07 01:06:48,082][81400] Updated weights for policy 0, policy_version 90280 (0.0006) [2023-03-07 01:06:48,855][81400] Updated weights for policy 0, policy_version 90290 (0.0006) [2023-03-07 01:06:49,643][81400] Updated weights for policy 0, policy_version 90300 (0.0006) [2023-03-07 01:06:50,410][81400] Updated weights for policy 0, policy_version 90310 (0.0006) [2023-03-07 01:06:51,187][81400] Updated weights for policy 0, policy_version 90320 (0.0006) [2023-03-07 01:06:51,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13192.6, 300 sec: 13166.2). Total num frames: 92487680. Throughput: 0: 13200.0. Samples: 92460247. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 01:06:51,237][81074] Avg episode reward: [(0, '2929.723')] [2023-03-07 01:06:51,970][81400] Updated weights for policy 0, policy_version 90330 (0.0005) [2023-03-07 01:06:52,741][81400] Updated weights for policy 0, policy_version 90340 (0.0006) [2023-03-07 01:06:53,517][81400] Updated weights for policy 0, policy_version 90350 (0.0006) [2023-03-07 01:06:54,303][81400] Updated weights for policy 0, policy_version 90360 (0.0006) [2023-03-07 01:06:55,072][81400] Updated weights for policy 0, policy_version 90370 (0.0006) [2023-03-07 01:06:55,848][81400] Updated weights for policy 0, policy_version 90380 (0.0006) [2023-03-07 01:06:56,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13192.6, 300 sec: 13166.2). Total num frames: 92553216. Throughput: 0: 13193.7. Samples: 92539199. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 01:06:56,237][81074] Avg episode reward: [(0, '2933.431')] [2023-03-07 01:06:56,638][81400] Updated weights for policy 0, policy_version 90390 (0.0006) [2023-03-07 01:06:57,402][81400] Updated weights for policy 0, policy_version 90400 (0.0007) [2023-03-07 01:06:58,182][81400] Updated weights for policy 0, policy_version 90410 (0.0006) [2023-03-07 01:06:58,963][81400] Updated weights for policy 0, policy_version 90420 (0.0006) [2023-03-07 01:06:59,739][81400] Updated weights for policy 0, policy_version 90430 (0.0006) [2023-03-07 01:07:00,499][81400] Updated weights for policy 0, policy_version 90440 (0.0005) [2023-03-07 01:07:01,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13166.2). Total num frames: 92619776. Throughput: 0: 13196.8. Samples: 92618388. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:07:01,237][81074] Avg episode reward: [(0, '2799.667')] [2023-03-07 01:07:01,276][81400] Updated weights for policy 0, policy_version 90450 (0.0006) [2023-03-07 01:07:02,053][81400] Updated weights for policy 0, policy_version 90460 (0.0006) [2023-03-07 01:07:02,835][81400] Updated weights for policy 0, policy_version 90470 (0.0006) [2023-03-07 01:07:03,627][81400] Updated weights for policy 0, policy_version 90480 (0.0006) [2023-03-07 01:07:04,389][81400] Updated weights for policy 0, policy_version 90490 (0.0006) [2023-03-07 01:07:05,173][81400] Updated weights for policy 0, policy_version 90500 (0.0006) [2023-03-07 01:07:05,952][81400] Updated weights for policy 0, policy_version 90510 (0.0006) [2023-03-07 01:07:06,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13192.5, 300 sec: 13166.2). Total num frames: 92685312. Throughput: 0: 13195.5. Samples: 92657911. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:07:06,237][81074] Avg episode reward: [(0, '3085.596')] [2023-03-07 01:07:06,720][81400] Updated weights for policy 0, policy_version 90520 (0.0007) [2023-03-07 01:07:07,506][81400] Updated weights for policy 0, policy_version 90530 (0.0006) [2023-03-07 01:07:08,266][81400] Updated weights for policy 0, policy_version 90540 (0.0006) [2023-03-07 01:07:09,059][81400] Updated weights for policy 0, policy_version 90550 (0.0006) [2023-03-07 01:07:09,825][81400] Updated weights for policy 0, policy_version 90560 (0.0006) [2023-03-07 01:07:10,607][81400] Updated weights for policy 0, policy_version 90570 (0.0006) [2023-03-07 01:07:11,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13169.7). Total num frames: 92751872. Throughput: 0: 13190.0. Samples: 92737089. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:07:11,237][81074] Avg episode reward: [(0, '3079.691')] [2023-03-07 01:07:11,379][81400] Updated weights for policy 0, policy_version 90580 (0.0006) [2023-03-07 01:07:12,167][81400] Updated weights for policy 0, policy_version 90590 (0.0006) [2023-03-07 01:07:12,933][81400] Updated weights for policy 0, policy_version 90600 (0.0007) [2023-03-07 01:07:13,706][81400] Updated weights for policy 0, policy_version 90610 (0.0005) [2023-03-07 01:07:14,484][81400] Updated weights for policy 0, policy_version 90620 (0.0006) [2023-03-07 01:07:15,251][81400] Updated weights for policy 0, policy_version 90630 (0.0006) [2023-03-07 01:07:16,019][81400] Updated weights for policy 0, policy_version 90640 (0.0005) [2023-03-07 01:07:16,236][81074] Fps is (10 sec: 13209.8, 60 sec: 13192.6, 300 sec: 13166.2). Total num frames: 92817408. Throughput: 0: 13192.3. Samples: 92816456. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:07:16,237][81074] Avg episode reward: [(0, '3056.932')] [2023-03-07 01:07:16,817][81400] Updated weights for policy 0, policy_version 90650 (0.0007) [2023-03-07 01:07:17,627][81400] Updated weights for policy 0, policy_version 90660 (0.0006) [2023-03-07 01:07:18,389][81400] Updated weights for policy 0, policy_version 90670 (0.0006) [2023-03-07 01:07:19,158][81400] Updated weights for policy 0, policy_version 90680 (0.0006) [2023-03-07 01:07:19,950][81400] Updated weights for policy 0, policy_version 90690 (0.0006) [2023-03-07 01:07:20,714][81400] Updated weights for policy 0, policy_version 90700 (0.0006) [2023-03-07 01:07:21,236][81074] Fps is (10 sec: 13107.0, 60 sec: 13192.5, 300 sec: 13166.2). Total num frames: 92882944. Throughput: 0: 13179.5. Samples: 92855564. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:07:21,237][81074] Avg episode reward: [(0, '3012.372')] [2023-03-07 01:07:21,503][81400] Updated weights for policy 0, policy_version 90710 (0.0006) [2023-03-07 01:07:22,287][81400] Updated weights for policy 0, policy_version 90720 (0.0006) [2023-03-07 01:07:23,056][81400] Updated weights for policy 0, policy_version 90730 (0.0007) [2023-03-07 01:07:23,857][81400] Updated weights for policy 0, policy_version 90740 (0.0006) [2023-03-07 01:07:24,624][81400] Updated weights for policy 0, policy_version 90750 (0.0006) [2023-03-07 01:07:25,405][81400] Updated weights for policy 0, policy_version 90760 (0.0006) [2023-03-07 01:07:26,187][81400] Updated weights for policy 0, policy_version 90770 (0.0006) [2023-03-07 01:07:26,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13175.5, 300 sec: 13162.7). Total num frames: 92948480. Throughput: 0: 13166.2. Samples: 92934284. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:07:26,237][81074] Avg episode reward: [(0, '3125.108')] [2023-03-07 01:07:26,954][81400] Updated weights for policy 0, policy_version 90780 (0.0006) [2023-03-07 01:07:27,727][81400] Updated weights for policy 0, policy_version 90790 (0.0006) [2023-03-07 01:07:28,516][81400] Updated weights for policy 0, policy_version 90800 (0.0006) [2023-03-07 01:07:29,293][81400] Updated weights for policy 0, policy_version 90810 (0.0005) [2023-03-07 01:07:30,053][81400] Updated weights for policy 0, policy_version 90820 (0.0006) [2023-03-07 01:07:30,855][81400] Updated weights for policy 0, policy_version 90830 (0.0007) [2023-03-07 01:07:31,236][81074] Fps is (10 sec: 13209.8, 60 sec: 13192.5, 300 sec: 13166.2). Total num frames: 93015040. Throughput: 0: 13167.9. Samples: 93013299. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:07:31,237][81074] Avg episode reward: [(0, '2983.393')] [2023-03-07 01:07:31,635][81400] Updated weights for policy 0, policy_version 90840 (0.0006) [2023-03-07 01:07:32,441][81400] Updated weights for policy 0, policy_version 90850 (0.0006) [2023-03-07 01:07:33,205][81400] Updated weights for policy 0, policy_version 90860 (0.0006) [2023-03-07 01:07:33,954][81400] Updated weights for policy 0, policy_version 90870 (0.0007) [2023-03-07 01:07:34,735][81400] Updated weights for policy 0, policy_version 90880 (0.0005) [2023-03-07 01:07:35,515][81400] Updated weights for policy 0, policy_version 90890 (0.0007) [2023-03-07 01:07:36,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13162.7). Total num frames: 93080576. Throughput: 0: 13165.8. Samples: 93052709. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:07:36,237][81074] Avg episode reward: [(0, '3125.270')] [2023-03-07 01:07:36,304][81400] Updated weights for policy 0, policy_version 90900 (0.0006) [2023-03-07 01:07:37,089][81400] Updated weights for policy 0, policy_version 90910 (0.0007) [2023-03-07 01:07:37,855][81400] Updated weights for policy 0, policy_version 90920 (0.0006) [2023-03-07 01:07:38,632][81400] Updated weights for policy 0, policy_version 90930 (0.0006) [2023-03-07 01:07:39,408][81400] Updated weights for policy 0, policy_version 90940 (0.0006) [2023-03-07 01:07:40,195][81400] Updated weights for policy 0, policy_version 90950 (0.0006) [2023-03-07 01:07:40,959][81400] Updated weights for policy 0, policy_version 90960 (0.0007) [2023-03-07 01:07:41,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13162.7). Total num frames: 93146112. Throughput: 0: 13166.3. Samples: 93131686. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:07:41,237][81074] Avg episode reward: [(0, '2927.740')] [2023-03-07 01:07:41,753][81400] Updated weights for policy 0, policy_version 90970 (0.0006) [2023-03-07 01:07:42,537][81400] Updated weights for policy 0, policy_version 90980 (0.0007) [2023-03-07 01:07:43,319][81400] Updated weights for policy 0, policy_version 90990 (0.0007) [2023-03-07 01:07:44,117][81400] Updated weights for policy 0, policy_version 91000 (0.0006) [2023-03-07 01:07:44,886][81400] Updated weights for policy 0, policy_version 91010 (0.0006) [2023-03-07 01:07:45,673][81400] Updated weights for policy 0, policy_version 91020 (0.0007) [2023-03-07 01:07:46,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13159.3). Total num frames: 93211648. Throughput: 0: 13150.1. Samples: 93210140. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:07:46,237][81074] Avg episode reward: [(0, '2755.954')] [2023-03-07 01:07:46,471][81400] Updated weights for policy 0, policy_version 91030 (0.0006) [2023-03-07 01:07:47,238][81400] Updated weights for policy 0, policy_version 91040 (0.0006) [2023-03-07 01:07:48,015][81400] Updated weights for policy 0, policy_version 91050 (0.0006) [2023-03-07 01:07:48,797][81400] Updated weights for policy 0, policy_version 91060 (0.0006) [2023-03-07 01:07:49,571][81400] Updated weights for policy 0, policy_version 91070 (0.0006) [2023-03-07 01:07:50,354][81400] Updated weights for policy 0, policy_version 91080 (0.0006) [2023-03-07 01:07:51,116][81400] Updated weights for policy 0, policy_version 91090 (0.0006) [2023-03-07 01:07:51,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13159.3). Total num frames: 93277184. Throughput: 0: 13149.5. Samples: 93249639. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:07:51,237][81074] Avg episode reward: [(0, '2655.117')] [2023-03-07 01:07:51,893][81400] Updated weights for policy 0, policy_version 91100 (0.0006) [2023-03-07 01:07:52,660][81400] Updated weights for policy 0, policy_version 91110 (0.0007) [2023-03-07 01:07:53,450][81400] Updated weights for policy 0, policy_version 91120 (0.0006) [2023-03-07 01:07:54,246][81400] Updated weights for policy 0, policy_version 91130 (0.0007) [2023-03-07 01:07:55,025][81400] Updated weights for policy 0, policy_version 91140 (0.0007) [2023-03-07 01:07:55,794][81400] Updated weights for policy 0, policy_version 91150 (0.0006) [2023-03-07 01:07:56,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13159.3). Total num frames: 93342720. Throughput: 0: 13143.7. Samples: 93328557. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 01:07:56,237][81074] Avg episode reward: [(0, '2506.884')] [2023-03-07 01:07:56,241][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000091155_93342720.pth... 
[2023-03-07 01:07:56,272][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000088070_90183680.pth [2023-03-07 01:07:56,581][81400] Updated weights for policy 0, policy_version 91160 (0.0006) [2023-03-07 01:07:57,350][81400] Updated weights for policy 0, policy_version 91170 (0.0005) [2023-03-07 01:07:58,132][81400] Updated weights for policy 0, policy_version 91180 (0.0006) [2023-03-07 01:07:58,903][81400] Updated weights for policy 0, policy_version 91190 (0.0007) [2023-03-07 01:07:59,677][81400] Updated weights for policy 0, policy_version 91200 (0.0005) [2023-03-07 01:08:00,449][81400] Updated weights for policy 0, policy_version 91210 (0.0006) [2023-03-07 01:08:01,228][81400] Updated weights for policy 0, policy_version 91220 (0.0006) [2023-03-07 01:08:01,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13162.7). Total num frames: 93409280. Throughput: 0: 13139.2. Samples: 93407721. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 01:08:01,237][81074] Avg episode reward: [(0, '2634.198')] [2023-03-07 01:08:01,999][81400] Updated weights for policy 0, policy_version 91230 (0.0006) [2023-03-07 01:08:02,797][81400] Updated weights for policy 0, policy_version 91240 (0.0007) [2023-03-07 01:08:03,585][81400] Updated weights for policy 0, policy_version 91250 (0.0006) [2023-03-07 01:08:04,353][81400] Updated weights for policy 0, policy_version 91260 (0.0006) [2023-03-07 01:08:05,148][81400] Updated weights for policy 0, policy_version 91270 (0.0007) [2023-03-07 01:08:05,929][81400] Updated weights for policy 0, policy_version 91280 (0.0007) [2023-03-07 01:08:06,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13141.4, 300 sec: 13159.3). Total num frames: 93473792. Throughput: 0: 13142.7. Samples: 93446983. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 01:08:06,237][81074] Avg episode reward: [(0, '2460.601')] [2023-03-07 01:08:06,694][81400] Updated weights for policy 0, policy_version 91290 (0.0005) [2023-03-07 01:08:07,473][81400] Updated weights for policy 0, policy_version 91300 (0.0006) [2023-03-07 01:08:08,254][81400] Updated weights for policy 0, policy_version 91310 (0.0006) [2023-03-07 01:08:09,059][81400] Updated weights for policy 0, policy_version 91320 (0.0006) [2023-03-07 01:08:09,831][81400] Updated weights for policy 0, policy_version 91330 (0.0006) [2023-03-07 01:08:10,622][81400] Updated weights for policy 0, policy_version 91340 (0.0006) [2023-03-07 01:08:11,236][81074] Fps is (10 sec: 13005.0, 60 sec: 13124.3, 300 sec: 13159.3). Total num frames: 93539328. Throughput: 0: 13137.8. Samples: 93525484. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 01:08:11,237][81074] Avg episode reward: [(0, '2823.114')] [2023-03-07 01:08:11,394][81400] Updated weights for policy 0, policy_version 91350 (0.0006) [2023-03-07 01:08:12,170][81400] Updated weights for policy 0, policy_version 91360 (0.0006) [2023-03-07 01:08:12,964][81400] Updated weights for policy 0, policy_version 91370 (0.0006) [2023-03-07 01:08:13,733][81400] Updated weights for policy 0, policy_version 91380 (0.0006) [2023-03-07 01:08:14,532][81400] Updated weights for policy 0, policy_version 91390 (0.0007) [2023-03-07 01:08:15,318][81400] Updated weights for policy 0, policy_version 91400 (0.0006) [2023-03-07 01:08:16,105][81400] Updated weights for policy 0, policy_version 91410 (0.0007) [2023-03-07 01:08:16,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13124.3, 300 sec: 13159.3). Total num frames: 93604864. Throughput: 0: 13125.9. Samples: 93603964. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 01:08:16,237][81074] Avg episode reward: [(0, '2743.100')] [2023-03-07 01:08:16,875][81400] Updated weights for policy 0, policy_version 91420 (0.0006) [2023-03-07 01:08:17,669][81400] Updated weights for policy 0, policy_version 91430 (0.0006) [2023-03-07 01:08:18,421][81400] Updated weights for policy 0, policy_version 91440 (0.0007) [2023-03-07 01:08:19,196][81400] Updated weights for policy 0, policy_version 91450 (0.0006) [2023-03-07 01:08:19,968][81400] Updated weights for policy 0, policy_version 91460 (0.0006) [2023-03-07 01:08:20,755][81400] Updated weights for policy 0, policy_version 91470 (0.0006) [2023-03-07 01:08:21,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13141.4, 300 sec: 13162.7). Total num frames: 93671424. Throughput: 0: 13130.7. Samples: 93643593. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 01:08:21,237][81074] Avg episode reward: [(0, '2594.922')] [2023-03-07 01:08:21,541][81400] Updated weights for policy 0, policy_version 91480 (0.0007) [2023-03-07 01:08:22,313][81400] Updated weights for policy 0, policy_version 91490 (0.0006) [2023-03-07 01:08:23,082][81400] Updated weights for policy 0, policy_version 91500 (0.0006) [2023-03-07 01:08:23,869][81400] Updated weights for policy 0, policy_version 91510 (0.0006) [2023-03-07 01:08:24,636][81400] Updated weights for policy 0, policy_version 91520 (0.0005) [2023-03-07 01:08:25,438][81400] Updated weights for policy 0, policy_version 91530 (0.0006) [2023-03-07 01:08:26,202][81400] Updated weights for policy 0, policy_version 91540 (0.0006) [2023-03-07 01:08:26,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13141.3, 300 sec: 13159.3). Total num frames: 93736960. Throughput: 0: 13130.7. Samples: 93722566. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 01:08:26,248][81074] Avg episode reward: [(0, '2728.145')] [2023-03-07 01:08:26,974][81400] Updated weights for policy 0, policy_version 91550 (0.0006) [2023-03-07 01:08:27,768][81400] Updated weights for policy 0, policy_version 91560 (0.0006) [2023-03-07 01:08:28,530][81400] Updated weights for policy 0, policy_version 91570 (0.0006) [2023-03-07 01:08:29,297][81400] Updated weights for policy 0, policy_version 91580 (0.0007) [2023-03-07 01:08:30,089][81400] Updated weights for policy 0, policy_version 91590 (0.0006) [2023-03-07 01:08:30,859][81400] Updated weights for policy 0, policy_version 91600 (0.0006) [2023-03-07 01:08:31,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13159.3). Total num frames: 93802496. Throughput: 0: 13145.4. Samples: 93801685. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 01:08:31,247][81074] Avg episode reward: [(0, '2906.371')] [2023-03-07 01:08:31,631][81400] Updated weights for policy 0, policy_version 91610 (0.0006) [2023-03-07 01:08:32,404][81400] Updated weights for policy 0, policy_version 91620 (0.0005) [2023-03-07 01:08:33,184][81400] Updated weights for policy 0, policy_version 91630 (0.0005) [2023-03-07 01:08:33,949][81400] Updated weights for policy 0, policy_version 91640 (0.0006) [2023-03-07 01:08:34,729][81400] Updated weights for policy 0, policy_version 91650 (0.0008) [2023-03-07 01:08:35,508][81400] Updated weights for policy 0, policy_version 91660 (0.0006) [2023-03-07 01:08:36,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13141.3, 300 sec: 13162.7). Total num frames: 93869056. Throughput: 0: 13149.3. Samples: 93841359. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 01:08:36,247][81074] Avg episode reward: [(0, '2819.266')] [2023-03-07 01:08:36,278][81400] Updated weights for policy 0, policy_version 91670 (0.0006) [2023-03-07 01:08:37,040][81400] Updated weights for policy 0, policy_version 91680 (0.0006) [2023-03-07 01:08:37,835][81400] Updated weights for policy 0, policy_version 91690 (0.0006) [2023-03-07 01:08:38,605][81400] Updated weights for policy 0, policy_version 91700 (0.0006) [2023-03-07 01:08:39,369][81400] Updated weights for policy 0, policy_version 91710 (0.0007) [2023-03-07 01:08:40,144][81400] Updated weights for policy 0, policy_version 91720 (0.0006) [2023-03-07 01:08:40,937][81400] Updated weights for policy 0, policy_version 91730 (0.0006) [2023-03-07 01:08:41,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13141.4, 300 sec: 13162.7). Total num frames: 93934592. Throughput: 0: 13157.3. Samples: 93920634. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 01:08:41,247][81074] Avg episode reward: [(0, '3088.224')] [2023-03-07 01:08:41,711][81400] Updated weights for policy 0, policy_version 91740 (0.0006) [2023-03-07 01:08:42,495][81400] Updated weights for policy 0, policy_version 91750 (0.0006) [2023-03-07 01:08:43,264][81400] Updated weights for policy 0, policy_version 91760 (0.0006) [2023-03-07 01:08:44,046][81400] Updated weights for policy 0, policy_version 91770 (0.0006) [2023-03-07 01:08:44,840][81400] Updated weights for policy 0, policy_version 91780 (0.0006) [2023-03-07 01:08:45,601][81400] Updated weights for policy 0, policy_version 91790 (0.0006) [2023-03-07 01:08:46,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13166.2). Total num frames: 94001152. Throughput: 0: 13151.7. Samples: 93999548. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 01:08:46,247][81074] Avg episode reward: [(0, '2739.283')] [2023-03-07 01:08:46,383][81400] Updated weights for policy 0, policy_version 91800 (0.0006) [2023-03-07 01:08:47,155][81400] Updated weights for policy 0, policy_version 91810 (0.0006) [2023-03-07 01:08:47,945][81400] Updated weights for policy 0, policy_version 91820 (0.0006) [2023-03-07 01:08:48,720][81400] Updated weights for policy 0, policy_version 91830 (0.0006) [2023-03-07 01:08:49,480][81400] Updated weights for policy 0, policy_version 91840 (0.0007) [2023-03-07 01:08:50,260][81400] Updated weights for policy 0, policy_version 91850 (0.0006) [2023-03-07 01:08:51,038][81400] Updated weights for policy 0, policy_version 91860 (0.0006) [2023-03-07 01:08:51,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13162.7). Total num frames: 94066688. Throughput: 0: 13156.3. Samples: 94039020. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 01:08:51,237][81074] Avg episode reward: [(0, '2750.973')] [2023-03-07 01:08:51,841][81400] Updated weights for policy 0, policy_version 91870 (0.0006) [2023-03-07 01:08:52,616][81400] Updated weights for policy 0, policy_version 91880 (0.0006) [2023-03-07 01:08:53,408][81400] Updated weights for policy 0, policy_version 91890 (0.0006) [2023-03-07 01:08:54,182][81400] Updated weights for policy 0, policy_version 91900 (0.0006) [2023-03-07 01:08:54,965][81400] Updated weights for policy 0, policy_version 91910 (0.0006) [2023-03-07 01:08:55,743][81400] Updated weights for policy 0, policy_version 91920 (0.0007) [2023-03-07 01:08:56,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13162.7). Total num frames: 94132224. Throughput: 0: 13159.7. Samples: 94117673. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 01:08:56,237][81074] Avg episode reward: [(0, '3049.612')] [2023-03-07 01:08:56,535][81400] Updated weights for policy 0, policy_version 91930 (0.0005) [2023-03-07 01:08:57,314][81400] Updated weights for policy 0, policy_version 91940 (0.0005) [2023-03-07 01:08:58,104][81400] Updated weights for policy 0, policy_version 91950 (0.0006) [2023-03-07 01:08:58,894][81400] Updated weights for policy 0, policy_version 91960 (0.0007) [2023-03-07 01:08:59,667][81400] Updated weights for policy 0, policy_version 91970 (0.0005) [2023-03-07 01:09:00,446][81400] Updated weights for policy 0, policy_version 91980 (0.0006) [2023-03-07 01:09:01,217][81400] Updated weights for policy 0, policy_version 91990 (0.0005) [2023-03-07 01:09:01,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13162.7). Total num frames: 94197760. Throughput: 0: 13161.8. Samples: 94196247. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 01:09:01,237][81074] Avg episode reward: [(0, '2816.296')] [2023-03-07 01:09:02,014][81400] Updated weights for policy 0, policy_version 92000 (0.0006) [2023-03-07 01:09:02,785][81400] Updated weights for policy 0, policy_version 92010 (0.0006) [2023-03-07 01:09:03,558][81400] Updated weights for policy 0, policy_version 92020 (0.0006) [2023-03-07 01:09:04,352][81400] Updated weights for policy 0, policy_version 92030 (0.0006) [2023-03-07 01:09:05,125][81400] Updated weights for policy 0, policy_version 92040 (0.0005) [2023-03-07 01:09:05,889][81400] Updated weights for policy 0, policy_version 92050 (0.0006) [2023-03-07 01:09:06,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13162.7). Total num frames: 94263296. Throughput: 0: 13155.3. Samples: 94235582. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 01:09:06,237][81074] Avg episode reward: [(0, '3151.551')] [2023-03-07 01:09:06,687][81400] Updated weights for policy 0, policy_version 92060 (0.0006) [2023-03-07 01:09:07,458][81400] Updated weights for policy 0, policy_version 92070 (0.0006) [2023-03-07 01:09:08,254][81400] Updated weights for policy 0, policy_version 92080 (0.0006) [2023-03-07 01:09:09,032][81400] Updated weights for policy 0, policy_version 92090 (0.0006) [2023-03-07 01:09:09,825][81400] Updated weights for policy 0, policy_version 92100 (0.0006) [2023-03-07 01:09:10,607][81400] Updated weights for policy 0, policy_version 92110 (0.0006) [2023-03-07 01:09:11,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13162.7). Total num frames: 94328832. Throughput: 0: 13145.8. Samples: 94314126. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 01:09:11,237][81074] Avg episode reward: [(0, '2950.611')] [2023-03-07 01:09:11,391][81400] Updated weights for policy 0, policy_version 92120 (0.0006) [2023-03-07 01:09:12,167][81400] Updated weights for policy 0, policy_version 92130 (0.0006) [2023-03-07 01:09:12,981][81400] Updated weights for policy 0, policy_version 92140 (0.0007) [2023-03-07 01:09:13,750][81400] Updated weights for policy 0, policy_version 92150 (0.0006) [2023-03-07 01:09:14,531][81400] Updated weights for policy 0, policy_version 92160 (0.0006) [2023-03-07 01:09:15,304][81400] Updated weights for policy 0, policy_version 92170 (0.0007) [2023-03-07 01:09:16,082][81400] Updated weights for policy 0, policy_version 92180 (0.0006) [2023-03-07 01:09:16,236][81074] Fps is (10 sec: 13004.7, 60 sec: 13141.3, 300 sec: 13159.3). Total num frames: 94393344. Throughput: 0: 13129.4. Samples: 94392509. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 01:09:16,237][81074] Avg episode reward: [(0, '2870.818')] [2023-03-07 01:09:16,883][81400] Updated weights for policy 0, policy_version 92190 (0.0006) [2023-03-07 01:09:17,658][81400] Updated weights for policy 0, policy_version 92200 (0.0006) [2023-03-07 01:09:18,426][81400] Updated weights for policy 0, policy_version 92210 (0.0006) [2023-03-07 01:09:19,224][81400] Updated weights for policy 0, policy_version 92220 (0.0006) [2023-03-07 01:09:19,982][81400] Updated weights for policy 0, policy_version 92230 (0.0006) [2023-03-07 01:09:20,757][81400] Updated weights for policy 0, policy_version 92240 (0.0007) [2023-03-07 01:09:21,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13159.3). Total num frames: 94459904. Throughput: 0: 13119.1. Samples: 94431717. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 01:09:21,237][81074] Avg episode reward: [(0, '2784.702')] [2023-03-07 01:09:21,536][81400] Updated weights for policy 0, policy_version 92250 (0.0007) [2023-03-07 01:09:22,315][81400] Updated weights for policy 0, policy_version 92260 (0.0006) [2023-03-07 01:09:23,095][81400] Updated weights for policy 0, policy_version 92270 (0.0006) [2023-03-07 01:09:23,898][81400] Updated weights for policy 0, policy_version 92280 (0.0006) [2023-03-07 01:09:24,684][81400] Updated weights for policy 0, policy_version 92290 (0.0006) [2023-03-07 01:09:25,456][81400] Updated weights for policy 0, policy_version 92300 (0.0006) [2023-03-07 01:09:26,229][81400] Updated weights for policy 0, policy_version 92310 (0.0007) [2023-03-07 01:09:26,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13141.4, 300 sec: 13159.3). Total num frames: 94525440. Throughput: 0: 13108.3. Samples: 94510507. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 01:09:26,237][81074] Avg episode reward: [(0, '2587.961')] [2023-03-07 01:09:27,010][81400] Updated weights for policy 0, policy_version 92320 (0.0006) [2023-03-07 01:09:27,786][81400] Updated weights for policy 0, policy_version 92330 (0.0006) [2023-03-07 01:09:28,558][81400] Updated weights for policy 0, policy_version 92340 (0.0006) [2023-03-07 01:09:29,355][81400] Updated weights for policy 0, policy_version 92350 (0.0006) [2023-03-07 01:09:30,140][81400] Updated weights for policy 0, policy_version 92360 (0.0006) [2023-03-07 01:09:30,911][81400] Updated weights for policy 0, policy_version 92370 (0.0006) [2023-03-07 01:09:31,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13159.3). Total num frames: 94590976. Throughput: 0: 13109.1. Samples: 94589458. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 01:09:31,237][81074] Avg episode reward: [(0, '2609.804')] [2023-03-07 01:09:31,689][81400] Updated weights for policy 0, policy_version 92380 (0.0006) [2023-03-07 01:09:32,476][81400] Updated weights for policy 0, policy_version 92390 (0.0007) [2023-03-07 01:09:33,244][81400] Updated weights for policy 0, policy_version 92400 (0.0006) [2023-03-07 01:09:34,025][81400] Updated weights for policy 0, policy_version 92410 (0.0006) [2023-03-07 01:09:34,819][81400] Updated weights for policy 0, policy_version 92420 (0.0006) [2023-03-07 01:09:35,593][81400] Updated weights for policy 0, policy_version 92430 (0.0006) [2023-03-07 01:09:36,236][81074] Fps is (10 sec: 13107.0, 60 sec: 13124.2, 300 sec: 13159.3). Total num frames: 94656512. Throughput: 0: 13109.6. Samples: 94628951. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 01:09:36,237][81074] Avg episode reward: [(0, '2659.035')] [2023-03-07 01:09:36,374][81400] Updated weights for policy 0, policy_version 92440 (0.0006) [2023-03-07 01:09:37,155][81400] Updated weights for policy 0, policy_version 92450 (0.0006) [2023-03-07 01:09:37,918][81400] Updated weights for policy 0, policy_version 92460 (0.0007) [2023-03-07 01:09:38,695][81400] Updated weights for policy 0, policy_version 92470 (0.0006) [2023-03-07 01:09:39,485][81400] Updated weights for policy 0, policy_version 92480 (0.0006) [2023-03-07 01:09:40,251][81400] Updated weights for policy 0, policy_version 92490 (0.0006) [2023-03-07 01:09:41,033][81400] Updated weights for policy 0, policy_version 92500 (0.0006) [2023-03-07 01:09:41,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13124.3, 300 sec: 13159.3). Total num frames: 94722048. Throughput: 0: 13117.9. Samples: 94707976. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 01:09:41,237][81074] Avg episode reward: [(0, '2691.360')] [2023-03-07 01:09:41,786][81400] Updated weights for policy 0, policy_version 92510 (0.0006) [2023-03-07 01:09:42,563][81400] Updated weights for policy 0, policy_version 92520 (0.0006) [2023-03-07 01:09:43,348][81400] Updated weights for policy 0, policy_version 92530 (0.0005) [2023-03-07 01:09:44,119][81400] Updated weights for policy 0, policy_version 92540 (0.0006) [2023-03-07 01:09:44,909][81400] Updated weights for policy 0, policy_version 92550 (0.0006) [2023-03-07 01:09:45,678][81400] Updated weights for policy 0, policy_version 92560 (0.0006) [2023-03-07 01:09:46,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13155.8). Total num frames: 94787584. Throughput: 0: 13123.7. Samples: 94786813. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 01:09:46,237][81074] Avg episode reward: [(0, '2923.970')] [2023-03-07 01:09:46,477][81400] Updated weights for policy 0, policy_version 92570 (0.0006) [2023-03-07 01:09:47,266][81400] Updated weights for policy 0, policy_version 92580 (0.0007) [2023-03-07 01:09:48,051][81400] Updated weights for policy 0, policy_version 92590 (0.0006) [2023-03-07 01:09:48,846][81400] Updated weights for policy 0, policy_version 92600 (0.0006) [2023-03-07 01:09:49,630][81400] Updated weights for policy 0, policy_version 92610 (0.0007) [2023-03-07 01:09:50,402][81400] Updated weights for policy 0, policy_version 92620 (0.0007) [2023-03-07 01:09:51,185][81400] Updated weights for policy 0, policy_version 92630 (0.0006) [2023-03-07 01:09:51,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13155.8). Total num frames: 94853120. Throughput: 0: 13114.9. Samples: 94825754. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:09:51,237][81074] Avg episode reward: [(0, '2733.521')] [2023-03-07 01:09:51,982][81400] Updated weights for policy 0, policy_version 92640 (0.0006) [2023-03-07 01:09:52,762][81400] Updated weights for policy 0, policy_version 92650 (0.0006) [2023-03-07 01:09:53,549][81400] Updated weights for policy 0, policy_version 92660 (0.0006) [2023-03-07 01:09:54,332][81400] Updated weights for policy 0, policy_version 92670 (0.0006) [2023-03-07 01:09:55,127][81400] Updated weights for policy 0, policy_version 92680 (0.0006) [2023-03-07 01:09:55,901][81400] Updated weights for policy 0, policy_version 92690 (0.0006) [2023-03-07 01:09:56,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13155.8). Total num frames: 94918656. Throughput: 0: 13111.6. Samples: 94904149. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:09:56,237][81074] Avg episode reward: [(0, '2823.023')] [2023-03-07 01:09:56,241][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000092694_94918656.pth... [2023-03-07 01:09:56,271][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000089612_91762688.pth [2023-03-07 01:09:56,681][81400] Updated weights for policy 0, policy_version 92700 (0.0006) [2023-03-07 01:09:57,478][81400] Updated weights for policy 0, policy_version 92710 (0.0007) [2023-03-07 01:09:58,247][81400] Updated weights for policy 0, policy_version 92720 (0.0007) [2023-03-07 01:09:59,026][81400] Updated weights for policy 0, policy_version 92730 (0.0006) [2023-03-07 01:09:59,833][81400] Updated weights for policy 0, policy_version 92740 (0.0006) [2023-03-07 01:10:00,609][81400] Updated weights for policy 0, policy_version 92750 (0.0006) [2023-03-07 01:10:01,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13107.2, 300 sec: 13155.8). Total num frames: 94984192. Throughput: 0: 13114.9. Samples: 94982678. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:10:01,237][81074] Avg episode reward: [(0, '2683.433')] [2023-03-07 01:10:01,369][81400] Updated weights for policy 0, policy_version 92760 (0.0006) [2023-03-07 01:10:02,162][81400] Updated weights for policy 0, policy_version 92770 (0.0006) [2023-03-07 01:10:02,934][81400] Updated weights for policy 0, policy_version 92780 (0.0006) [2023-03-07 01:10:03,724][81400] Updated weights for policy 0, policy_version 92790 (0.0006) [2023-03-07 01:10:04,491][81400] Updated weights for policy 0, policy_version 92800 (0.0006) [2023-03-07 01:10:05,250][81400] Updated weights for policy 0, policy_version 92810 (0.0007) [2023-03-07 01:10:06,048][81400] Updated weights for policy 0, policy_version 92820 (0.0006) [2023-03-07 01:10:06,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13107.2, 300 sec: 13155.8). Total num frames: 95049728. Throughput: 0: 13119.3. Samples: 95022089. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:10:06,237][81074] Avg episode reward: [(0, '3053.075')] [2023-03-07 01:10:06,818][81400] Updated weights for policy 0, policy_version 92830 (0.0006) [2023-03-07 01:10:07,580][81400] Updated weights for policy 0, policy_version 92840 (0.0006) [2023-03-07 01:10:08,368][81400] Updated weights for policy 0, policy_version 92850 (0.0006) [2023-03-07 01:10:09,155][81400] Updated weights for policy 0, policy_version 92860 (0.0006) [2023-03-07 01:10:09,922][81400] Updated weights for policy 0, policy_version 92870 (0.0006) [2023-03-07 01:10:10,698][81400] Updated weights for policy 0, policy_version 92880 (0.0006) [2023-03-07 01:10:11,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13152.3). Total num frames: 95115264. Throughput: 0: 13128.8. Samples: 95101304. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:10:11,237][81074] Avg episode reward: [(0, '2933.133')] [2023-03-07 01:10:11,484][81400] Updated weights for policy 0, policy_version 92890 (0.0006) [2023-03-07 01:10:12,266][81400] Updated weights for policy 0, policy_version 92900 (0.0006) [2023-03-07 01:10:13,042][81400] Updated weights for policy 0, policy_version 92910 (0.0005) [2023-03-07 01:10:13,803][81400] Updated weights for policy 0, policy_version 92920 (0.0006) [2023-03-07 01:10:14,586][81400] Updated weights for policy 0, policy_version 92930 (0.0006) [2023-03-07 01:10:15,366][81400] Updated weights for policy 0, policy_version 92940 (0.0007) [2023-03-07 01:10:16,152][81400] Updated weights for policy 0, policy_version 92950 (0.0006) [2023-03-07 01:10:16,236][81074] Fps is (10 sec: 13209.8, 60 sec: 13141.3, 300 sec: 13155.8). Total num frames: 95181824. Throughput: 0: 13130.0. Samples: 95180310. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:10:16,237][81074] Avg episode reward: [(0, '2922.828')] [2023-03-07 01:10:16,939][81400] Updated weights for policy 0, policy_version 92960 (0.0006) [2023-03-07 01:10:17,700][81400] Updated weights for policy 0, policy_version 92970 (0.0006) [2023-03-07 01:10:18,465][81400] Updated weights for policy 0, policy_version 92980 (0.0006) [2023-03-07 01:10:19,237][81400] Updated weights for policy 0, policy_version 92990 (0.0006) [2023-03-07 01:10:20,009][81400] Updated weights for policy 0, policy_version 93000 (0.0007) [2023-03-07 01:10:20,793][81400] Updated weights for policy 0, policy_version 93010 (0.0006) [2023-03-07 01:10:21,236][81074] Fps is (10 sec: 13209.8, 60 sec: 13124.3, 300 sec: 13152.3). Total num frames: 95247360. Throughput: 0: 13135.2. Samples: 95220031. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:10:21,237][81074] Avg episode reward: [(0, '2789.522')] [2023-03-07 01:10:21,558][81400] Updated weights for policy 0, policy_version 93020 (0.0006) [2023-03-07 01:10:22,327][81400] Updated weights for policy 0, policy_version 93030 (0.0007) [2023-03-07 01:10:23,135][81400] Updated weights for policy 0, policy_version 93040 (0.0006) [2023-03-07 01:10:23,913][81400] Updated weights for policy 0, policy_version 93050 (0.0006) [2023-03-07 01:10:24,691][81400] Updated weights for policy 0, policy_version 93060 (0.0006) [2023-03-07 01:10:25,462][81400] Updated weights for policy 0, policy_version 93070 (0.0006) [2023-03-07 01:10:26,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13124.2, 300 sec: 13152.3). Total num frames: 95312896. Throughput: 0: 13132.2. Samples: 95298924. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:10:26,237][81074] Avg episode reward: [(0, '2775.048')] [2023-03-07 01:10:26,245][81400] Updated weights for policy 0, policy_version 93080 (0.0007) [2023-03-07 01:10:27,016][81400] Updated weights for policy 0, policy_version 93090 (0.0006) [2023-03-07 01:10:27,805][81400] Updated weights for policy 0, policy_version 93100 (0.0007) [2023-03-07 01:10:28,582][81400] Updated weights for policy 0, policy_version 93110 (0.0006) [2023-03-07 01:10:29,357][81400] Updated weights for policy 0, policy_version 93120 (0.0006) [2023-03-07 01:10:30,153][81400] Updated weights for policy 0, policy_version 93130 (0.0006) [2023-03-07 01:10:30,914][81400] Updated weights for policy 0, policy_version 93140 (0.0006) [2023-03-07 01:10:31,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13141.3, 300 sec: 13155.8). Total num frames: 95379456. Throughput: 0: 13130.7. Samples: 95377692. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:10:31,247][81074] Avg episode reward: [(0, '2666.789')] [2023-03-07 01:10:31,690][81400] Updated weights for policy 0, policy_version 93150 (0.0005) [2023-03-07 01:10:32,466][81400] Updated weights for policy 0, policy_version 93160 (0.0006) [2023-03-07 01:10:33,249][81400] Updated weights for policy 0, policy_version 93170 (0.0006) [2023-03-07 01:10:34,024][81400] Updated weights for policy 0, policy_version 93180 (0.0006) [2023-03-07 01:10:34,793][81400] Updated weights for policy 0, policy_version 93190 (0.0006) [2023-03-07 01:10:35,593][81400] Updated weights for policy 0, policy_version 93200 (0.0006) [2023-03-07 01:10:36,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13141.3, 300 sec: 13152.3). Total num frames: 95444992. Throughput: 0: 13146.6. Samples: 95417352. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:10:36,247][81074] Avg episode reward: [(0, '2933.274')] [2023-03-07 01:10:36,361][81400] Updated weights for policy 0, policy_version 93210 (0.0006) [2023-03-07 01:10:37,126][81400] Updated weights for policy 0, policy_version 93220 (0.0006) [2023-03-07 01:10:37,924][81400] Updated weights for policy 0, policy_version 93230 (0.0006) [2023-03-07 01:10:38,675][81400] Updated weights for policy 0, policy_version 93240 (0.0007) [2023-03-07 01:10:39,465][81400] Updated weights for policy 0, policy_version 93250 (0.0006) [2023-03-07 01:10:40,229][81400] Updated weights for policy 0, policy_version 93260 (0.0005) [2023-03-07 01:10:41,005][81400] Updated weights for policy 0, policy_version 93270 (0.0006) [2023-03-07 01:10:41,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13155.8). Total num frames: 95511552. Throughput: 0: 13163.6. Samples: 95496511. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:10:41,237][81074] Avg episode reward: [(0, '2656.540')] [2023-03-07 01:10:41,784][81400] Updated weights for policy 0, policy_version 93280 (0.0007) [2023-03-07 01:10:42,573][81400] Updated weights for policy 0, policy_version 93290 (0.0006) [2023-03-07 01:10:43,349][81400] Updated weights for policy 0, policy_version 93300 (0.0006) [2023-03-07 01:10:44,123][81400] Updated weights for policy 0, policy_version 93310 (0.0006) [2023-03-07 01:10:44,917][81400] Updated weights for policy 0, policy_version 93320 (0.0006) [2023-03-07 01:10:45,677][81400] Updated weights for policy 0, policy_version 93330 (0.0006) [2023-03-07 01:10:46,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13158.4, 300 sec: 13155.8). Total num frames: 95577088. Throughput: 0: 13172.9. Samples: 95575458. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:10:46,237][81074] Avg episode reward: [(0, '2771.917')] [2023-03-07 01:10:46,465][81400] Updated weights for policy 0, policy_version 93340 (0.0006) [2023-03-07 01:10:47,230][81400] Updated weights for policy 0, policy_version 93350 (0.0007) [2023-03-07 01:10:48,021][81400] Updated weights for policy 0, policy_version 93360 (0.0006) [2023-03-07 01:10:48,821][81400] Updated weights for policy 0, policy_version 93370 (0.0007) [2023-03-07 01:10:49,586][81400] Updated weights for policy 0, policy_version 93380 (0.0006) [2023-03-07 01:10:50,347][81400] Updated weights for policy 0, policy_version 93390 (0.0006) [2023-03-07 01:10:51,139][81400] Updated weights for policy 0, policy_version 93400 (0.0007) [2023-03-07 01:10:51,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13155.8). Total num frames: 95642624. Throughput: 0: 13170.7. Samples: 95614768. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:10:51,237][81074] Avg episode reward: [(0, '2782.295')] [2023-03-07 01:10:51,913][81400] Updated weights for policy 0, policy_version 93410 (0.0006) [2023-03-07 01:10:52,681][81400] Updated weights for policy 0, policy_version 93420 (0.0007) [2023-03-07 01:10:53,466][81400] Updated weights for policy 0, policy_version 93430 (0.0007) [2023-03-07 01:10:54,230][81400] Updated weights for policy 0, policy_version 93440 (0.0006) [2023-03-07 01:10:55,002][81400] Updated weights for policy 0, policy_version 93450 (0.0006) [2023-03-07 01:10:55,777][81400] Updated weights for policy 0, policy_version 93460 (0.0006) [2023-03-07 01:10:56,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13152.3). Total num frames: 95708160. Throughput: 0: 13171.3. Samples: 95694013. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:10:56,237][81074] Avg episode reward: [(0, '2642.952')] [2023-03-07 01:10:56,564][81400] Updated weights for policy 0, policy_version 93470 (0.0007) [2023-03-07 01:10:57,345][81400] Updated weights for policy 0, policy_version 93480 (0.0007) [2023-03-07 01:10:58,135][81400] Updated weights for policy 0, policy_version 93490 (0.0007) [2023-03-07 01:10:58,913][81400] Updated weights for policy 0, policy_version 93500 (0.0006) [2023-03-07 01:10:59,679][81400] Updated weights for policy 0, policy_version 93510 (0.0006) [2023-03-07 01:11:00,444][81400] Updated weights for policy 0, policy_version 93520 (0.0006) [2023-03-07 01:11:01,209][81400] Updated weights for policy 0, policy_version 93530 (0.0006) [2023-03-07 01:11:01,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13175.5, 300 sec: 13155.8). Total num frames: 95774720. Throughput: 0: 13173.0. Samples: 95773097. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:11:01,237][81074] Avg episode reward: [(0, '2610.765')] [2023-03-07 01:11:01,998][81400] Updated weights for policy 0, policy_version 93540 (0.0006) [2023-03-07 01:11:02,772][81400] Updated weights for policy 0, policy_version 93550 (0.0006) [2023-03-07 01:11:03,548][81400] Updated weights for policy 0, policy_version 93560 (0.0006) [2023-03-07 01:11:04,325][81400] Updated weights for policy 0, policy_version 93570 (0.0007) [2023-03-07 01:11:05,112][81400] Updated weights for policy 0, policy_version 93580 (0.0006) [2023-03-07 01:11:05,891][81400] Updated weights for policy 0, policy_version 93590 (0.0007) [2023-03-07 01:11:06,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13152.3). Total num frames: 95840256. Throughput: 0: 13170.0. Samples: 95812682. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:11:06,237][81074] Avg episode reward: [(0, '2808.817')] [2023-03-07 01:11:06,661][81400] Updated weights for policy 0, policy_version 93600 (0.0007) [2023-03-07 01:11:07,448][81400] Updated weights for policy 0, policy_version 93610 (0.0006) [2023-03-07 01:11:08,215][81400] Updated weights for policy 0, policy_version 93620 (0.0006) [2023-03-07 01:11:08,996][81400] Updated weights for policy 0, policy_version 93630 (0.0006) [2023-03-07 01:11:09,763][81400] Updated weights for policy 0, policy_version 93640 (0.0006) [2023-03-07 01:11:10,528][81400] Updated weights for policy 0, policy_version 93650 (0.0006) [2023-03-07 01:11:11,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13175.5, 300 sec: 13152.3). Total num frames: 95905792. Throughput: 0: 13174.8. Samples: 95891790. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:11:11,237][81074] Avg episode reward: [(0, '2625.263')] [2023-03-07 01:11:11,285][81400] Updated weights for policy 0, policy_version 93660 (0.0006) [2023-03-07 01:11:12,083][81400] Updated weights for policy 0, policy_version 93670 (0.0006) [2023-03-07 01:11:12,845][81400] Updated weights for policy 0, policy_version 93680 (0.0007) [2023-03-07 01:11:13,612][81400] Updated weights for policy 0, policy_version 93690 (0.0006) [2023-03-07 01:11:14,394][81400] Updated weights for policy 0, policy_version 93700 (0.0006) [2023-03-07 01:11:15,180][81400] Updated weights for policy 0, policy_version 93710 (0.0006) [2023-03-07 01:11:15,935][81400] Updated weights for policy 0, policy_version 93720 (0.0006) [2023-03-07 01:11:16,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13155.8). Total num frames: 95972352. Throughput: 0: 13195.6. Samples: 95971494. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:11:16,237][81074] Avg episode reward: [(0, '2746.274')] [2023-03-07 01:11:16,716][81400] Updated weights for policy 0, policy_version 93730 (0.0006) [2023-03-07 01:11:17,492][81400] Updated weights for policy 0, policy_version 93740 (0.0006) [2023-03-07 01:11:18,293][81400] Updated weights for policy 0, policy_version 93750 (0.0007) [2023-03-07 01:11:19,065][81400] Updated weights for policy 0, policy_version 93760 (0.0006) [2023-03-07 01:11:19,857][81400] Updated weights for policy 0, policy_version 93770 (0.0006) [2023-03-07 01:11:20,630][81400] Updated weights for policy 0, policy_version 93780 (0.0006) [2023-03-07 01:11:21,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13175.4, 300 sec: 13152.3). Total num frames: 96037888. Throughput: 0: 13190.0. Samples: 96010901. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:11:21,237][81074] Avg episode reward: [(0, '2651.942')] [2023-03-07 01:11:21,413][81400] Updated weights for policy 0, policy_version 93790 (0.0006) [2023-03-07 01:11:22,197][81400] Updated weights for policy 0, policy_version 93800 (0.0006) [2023-03-07 01:11:22,986][81400] Updated weights for policy 0, policy_version 93810 (0.0006) [2023-03-07 01:11:23,736][81400] Updated weights for policy 0, policy_version 93820 (0.0006) [2023-03-07 01:11:24,513][81400] Updated weights for policy 0, policy_version 93830 (0.0007) [2023-03-07 01:11:25,296][81400] Updated weights for policy 0, policy_version 93840 (0.0005) [2023-03-07 01:11:26,057][81400] Updated weights for policy 0, policy_version 93850 (0.0006) [2023-03-07 01:11:26,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13192.5, 300 sec: 13155.8). Total num frames: 96104448. Throughput: 0: 13180.5. Samples: 96089636. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:11:26,237][81074] Avg episode reward: [(0, '2913.840')] [2023-03-07 01:11:26,835][81400] Updated weights for policy 0, policy_version 93860 (0.0006) [2023-03-07 01:11:27,610][81400] Updated weights for policy 0, policy_version 93870 (0.0006) [2023-03-07 01:11:28,385][81400] Updated weights for policy 0, policy_version 93880 (0.0006) [2023-03-07 01:11:29,159][81400] Updated weights for policy 0, policy_version 93890 (0.0006) [2023-03-07 01:11:29,949][81400] Updated weights for policy 0, policy_version 93900 (0.0006) [2023-03-07 01:11:30,721][81400] Updated weights for policy 0, policy_version 93910 (0.0005) [2023-03-07 01:11:31,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13152.3). Total num frames: 96169984. Throughput: 0: 13186.4. Samples: 96168849. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:11:31,237][81074] Avg episode reward: [(0, '3046.092')] [2023-03-07 01:11:31,502][81400] Updated weights for policy 0, policy_version 93920 (0.0006) [2023-03-07 01:11:32,286][81400] Updated weights for policy 0, policy_version 93930 (0.0006) [2023-03-07 01:11:33,062][81400] Updated weights for policy 0, policy_version 93940 (0.0006) [2023-03-07 01:11:33,847][81400] Updated weights for policy 0, policy_version 93950 (0.0006) [2023-03-07 01:11:34,616][81400] Updated weights for policy 0, policy_version 93960 (0.0006) [2023-03-07 01:11:35,386][81400] Updated weights for policy 0, policy_version 93970 (0.0006) [2023-03-07 01:11:36,181][81400] Updated weights for policy 0, policy_version 93980 (0.0005) [2023-03-07 01:11:36,236][81074] Fps is (10 sec: 13107.4, 60 sec: 13175.5, 300 sec: 13148.9). Total num frames: 96235520. Throughput: 0: 13191.2. Samples: 96208371. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:11:36,237][81074] Avg episode reward: [(0, '2896.734')] [2023-03-07 01:11:36,949][81400] Updated weights for policy 0, policy_version 93990 (0.0006) [2023-03-07 01:11:37,719][81400] Updated weights for policy 0, policy_version 94000 (0.0007) [2023-03-07 01:11:38,493][81400] Updated weights for policy 0, policy_version 94010 (0.0006) [2023-03-07 01:11:39,285][81400] Updated weights for policy 0, policy_version 94020 (0.0005) [2023-03-07 01:11:40,073][81400] Updated weights for policy 0, policy_version 94030 (0.0006) [2023-03-07 01:11:40,861][81400] Updated weights for policy 0, policy_version 94040 (0.0006) [2023-03-07 01:11:41,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 96301056. Throughput: 0: 13179.6. Samples: 96287092. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 01:11:41,237][81074] Avg episode reward: [(0, '2825.920')] [2023-03-07 01:11:41,626][81400] Updated weights for policy 0, policy_version 94050 (0.0005) [2023-03-07 01:11:42,398][81400] Updated weights for policy 0, policy_version 94060 (0.0006) [2023-03-07 01:11:43,179][81400] Updated weights for policy 0, policy_version 94070 (0.0006) [2023-03-07 01:11:43,954][81400] Updated weights for policy 0, policy_version 94080 (0.0006) [2023-03-07 01:11:44,738][81400] Updated weights for policy 0, policy_version 94090 (0.0006) [2023-03-07 01:11:45,510][81400] Updated weights for policy 0, policy_version 94100 (0.0007) [2023-03-07 01:11:46,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13175.5, 300 sec: 13152.3). Total num frames: 96367616. Throughput: 0: 13181.3. Samples: 96366255. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 01:11:46,237][81074] Avg episode reward: [(0, '2964.023')] [2023-03-07 01:11:46,278][81400] Updated weights for policy 0, policy_version 94110 (0.0006) [2023-03-07 01:11:47,045][81400] Updated weights for policy 0, policy_version 94120 (0.0007) [2023-03-07 01:11:47,839][81400] Updated weights for policy 0, policy_version 94130 (0.0006) [2023-03-07 01:11:48,605][81400] Updated weights for policy 0, policy_version 94140 (0.0007) [2023-03-07 01:11:49,375][81400] Updated weights for policy 0, policy_version 94150 (0.0006) [2023-03-07 01:11:50,163][81400] Updated weights for policy 0, policy_version 94160 (0.0006) [2023-03-07 01:11:50,947][81400] Updated weights for policy 0, policy_version 94170 (0.0006) [2023-03-07 01:11:51,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.4, 300 sec: 13152.3). Total num frames: 96433152. Throughput: 0: 13183.9. Samples: 96405956. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 01:11:51,237][81074] Avg episode reward: [(0, '3012.029')] [2023-03-07 01:11:51,741][81400] Updated weights for policy 0, policy_version 94180 (0.0006) [2023-03-07 01:11:52,522][81400] Updated weights for policy 0, policy_version 94190 (0.0006) [2023-03-07 01:11:53,290][81400] Updated weights for policy 0, policy_version 94200 (0.0006) [2023-03-07 01:11:54,074][81400] Updated weights for policy 0, policy_version 94210 (0.0006) [2023-03-07 01:11:54,857][81400] Updated weights for policy 0, policy_version 94220 (0.0006) [2023-03-07 01:11:55,638][81400] Updated weights for policy 0, policy_version 94230 (0.0006) [2023-03-07 01:11:56,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13175.5, 300 sec: 13148.9). Total num frames: 96498688. Throughput: 0: 13170.5. Samples: 96484465. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 01:11:56,237][81074] Avg episode reward: [(0, '2658.431')] [2023-03-07 01:11:56,242][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000094237_96498688.pth... 
[2023-03-07 01:11:56,274][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000091155_93342720.pth [2023-03-07 01:11:56,422][81400] Updated weights for policy 0, policy_version 94240 (0.0006) [2023-03-07 01:11:57,203][81400] Updated weights for policy 0, policy_version 94250 (0.0006) [2023-03-07 01:11:57,978][81400] Updated weights for policy 0, policy_version 94260 (0.0005) [2023-03-07 01:11:58,758][81400] Updated weights for policy 0, policy_version 94270 (0.0006) [2023-03-07 01:11:59,528][81400] Updated weights for policy 0, policy_version 94280 (0.0006) [2023-03-07 01:12:00,303][81400] Updated weights for policy 0, policy_version 94290 (0.0007) [2023-03-07 01:12:01,097][81400] Updated weights for policy 0, policy_version 94300 (0.0007) [2023-03-07 01:12:01,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 96564224. Throughput: 0: 13153.5. Samples: 96563401. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 01:12:01,237][81074] Avg episode reward: [(0, '2848.657')] [2023-03-07 01:12:01,870][81400] Updated weights for policy 0, policy_version 94310 (0.0006) [2023-03-07 01:12:02,677][81400] Updated weights for policy 0, policy_version 94320 (0.0006) [2023-03-07 01:12:03,452][81400] Updated weights for policy 0, policy_version 94330 (0.0006) [2023-03-07 01:12:04,227][81400] Updated weights for policy 0, policy_version 94340 (0.0006) [2023-03-07 01:12:05,011][81400] Updated weights for policy 0, policy_version 94350 (0.0006) [2023-03-07 01:12:05,785][81400] Updated weights for policy 0, policy_version 94360 (0.0008) [2023-03-07 01:12:06,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 96629760. Throughput: 0: 13150.7. Samples: 96602683. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 01:12:06,237][81074] Avg episode reward: [(0, '2925.218')] [2023-03-07 01:12:06,589][81400] Updated weights for policy 0, policy_version 94370 (0.0006) [2023-03-07 01:12:07,358][81400] Updated weights for policy 0, policy_version 94380 (0.0006) [2023-03-07 01:12:08,122][81400] Updated weights for policy 0, policy_version 94390 (0.0007) [2023-03-07 01:12:08,908][81400] Updated weights for policy 0, policy_version 94400 (0.0006) [2023-03-07 01:12:09,705][81400] Updated weights for policy 0, policy_version 94410 (0.0006) [2023-03-07 01:12:10,490][81400] Updated weights for policy 0, policy_version 94420 (0.0006) [2023-03-07 01:12:11,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 96695296. Throughput: 0: 13145.4. Samples: 96681177. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 01:12:11,237][81074] Avg episode reward: [(0, '3143.625')] [2023-03-07 01:12:11,266][81400] Updated weights for policy 0, policy_version 94430 (0.0006) [2023-03-07 01:12:12,045][81400] Updated weights for policy 0, policy_version 94440 (0.0006) [2023-03-07 01:12:12,834][81400] Updated weights for policy 0, policy_version 94450 (0.0005) [2023-03-07 01:12:13,611][81400] Updated weights for policy 0, policy_version 94460 (0.0006) [2023-03-07 01:12:14,398][81400] Updated weights for policy 0, policy_version 94470 (0.0006) [2023-03-07 01:12:15,193][81400] Updated weights for policy 0, policy_version 94480 (0.0006) [2023-03-07 01:12:15,967][81400] Updated weights for policy 0, policy_version 94490 (0.0006) [2023-03-07 01:12:16,236][81074] Fps is (10 sec: 13107.0, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 96760832. Throughput: 0: 13131.1. Samples: 96759749. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 01:12:16,247][81074] Avg episode reward: [(0, '2826.999')] [2023-03-07 01:12:16,733][81400] Updated weights for policy 0, policy_version 94500 (0.0006) [2023-03-07 01:12:17,518][81400] Updated weights for policy 0, policy_version 94510 (0.0006) [2023-03-07 01:12:18,293][81400] Updated weights for policy 0, policy_version 94520 (0.0005) [2023-03-07 01:12:19,066][81400] Updated weights for policy 0, policy_version 94530 (0.0007) [2023-03-07 01:12:19,854][81400] Updated weights for policy 0, policy_version 94540 (0.0006) [2023-03-07 01:12:20,646][81400] Updated weights for policy 0, policy_version 94550 (0.0006) [2023-03-07 01:12:21,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 96826368. Throughput: 0: 13128.8. Samples: 96799169. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 01:12:21,247][81074] Avg episode reward: [(0, '2837.332')] [2023-03-07 01:12:21,420][81400] Updated weights for policy 0, policy_version 94560 (0.0006) [2023-03-07 01:12:22,206][81400] Updated weights for policy 0, policy_version 94570 (0.0006) [2023-03-07 01:12:22,983][81400] Updated weights for policy 0, policy_version 94580 (0.0005) [2023-03-07 01:12:23,772][81400] Updated weights for policy 0, policy_version 94590 (0.0006) [2023-03-07 01:12:24,568][81400] Updated weights for policy 0, policy_version 94600 (0.0006) [2023-03-07 01:12:25,325][81400] Updated weights for policy 0, policy_version 94610 (0.0006) [2023-03-07 01:12:26,110][81400] Updated weights for policy 0, policy_version 94620 (0.0007) [2023-03-07 01:12:26,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13141.9). Total num frames: 96891904. Throughput: 0: 13122.7. Samples: 96877614. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 01:12:26,247][81074] Avg episode reward: [(0, '2769.137')] [2023-03-07 01:12:26,889][81400] Updated weights for policy 0, policy_version 94630 (0.0006) [2023-03-07 01:12:27,673][81400] Updated weights for policy 0, policy_version 94640 (0.0006) [2023-03-07 01:12:28,448][81400] Updated weights for policy 0, policy_version 94650 (0.0005) [2023-03-07 01:12:29,224][81400] Updated weights for policy 0, policy_version 94660 (0.0007) [2023-03-07 01:12:30,004][81400] Updated weights for policy 0, policy_version 94670 (0.0006) [2023-03-07 01:12:30,785][81400] Updated weights for policy 0, policy_version 94680 (0.0006) [2023-03-07 01:12:31,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13141.9). Total num frames: 96957440. Throughput: 0: 13120.5. Samples: 96956677. Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 01:12:31,237][81074] Avg episode reward: [(0, '2973.149')] [2023-03-07 01:12:31,553][81400] Updated weights for policy 0, policy_version 94690 (0.0006) [2023-03-07 01:12:32,345][81400] Updated weights for policy 0, policy_version 94700 (0.0006) [2023-03-07 01:12:33,106][81400] Updated weights for policy 0, policy_version 94710 (0.0006) [2023-03-07 01:12:33,868][81400] Updated weights for policy 0, policy_version 94720 (0.0008) [2023-03-07 01:12:34,654][81400] Updated weights for policy 0, policy_version 94730 (0.0006) [2023-03-07 01:12:35,416][81400] Updated weights for policy 0, policy_version 94740 (0.0006) [2023-03-07 01:12:36,207][81400] Updated weights for policy 0, policy_version 94750 (0.0006) [2023-03-07 01:12:36,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 97024000. Throughput: 0: 13120.4. Samples: 96996374. 
Policy #0 lag: (min: 0.0, avg: 1.0, max: 3.0) [2023-03-07 01:12:36,237][81074] Avg episode reward: [(0, '3082.820')] [2023-03-07 01:12:36,975][81400] Updated weights for policy 0, policy_version 94760 (0.0006) [2023-03-07 01:12:37,764][81400] Updated weights for policy 0, policy_version 94770 (0.0006) [2023-03-07 01:12:38,539][81400] Updated weights for policy 0, policy_version 94780 (0.0007) [2023-03-07 01:12:39,337][81400] Updated weights for policy 0, policy_version 94790 (0.0006) [2023-03-07 01:12:40,091][81400] Updated weights for policy 0, policy_version 94800 (0.0006) [2023-03-07 01:12:40,866][81400] Updated weights for policy 0, policy_version 94810 (0.0006) [2023-03-07 01:12:41,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 97089536. Throughput: 0: 13131.1. Samples: 97075363. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 01:12:41,237][81074] Avg episode reward: [(0, '3221.071')] [2023-03-07 01:12:41,657][81400] Updated weights for policy 0, policy_version 94820 (0.0006) [2023-03-07 01:12:42,446][81400] Updated weights for policy 0, policy_version 94830 (0.0006) [2023-03-07 01:12:43,218][81400] Updated weights for policy 0, policy_version 94840 (0.0006) [2023-03-07 01:12:43,998][81400] Updated weights for policy 0, policy_version 94850 (0.0006) [2023-03-07 01:12:44,784][81400] Updated weights for policy 0, policy_version 94860 (0.0007) [2023-03-07 01:12:45,573][81400] Updated weights for policy 0, policy_version 94870 (0.0007) [2023-03-07 01:12:46,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13145.4). Total num frames: 97155072. Throughput: 0: 13126.4. Samples: 97154087. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 01:12:46,237][81074] Avg episode reward: [(0, '3023.672')] [2023-03-07 01:12:46,352][81400] Updated weights for policy 0, policy_version 94880 (0.0006) [2023-03-07 01:12:47,128][81400] Updated weights for policy 0, policy_version 94890 (0.0006) [2023-03-07 01:12:47,906][81400] Updated weights for policy 0, policy_version 94900 (0.0007) [2023-03-07 01:12:48,685][81400] Updated weights for policy 0, policy_version 94910 (0.0006) [2023-03-07 01:12:49,462][81400] Updated weights for policy 0, policy_version 94920 (0.0006) [2023-03-07 01:12:50,242][81400] Updated weights for policy 0, policy_version 94930 (0.0006) [2023-03-07 01:12:51,018][81400] Updated weights for policy 0, policy_version 94940 (0.0006) [2023-03-07 01:12:51,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13124.3, 300 sec: 13145.4). Total num frames: 97220608. Throughput: 0: 13128.2. Samples: 97193453. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 01:12:51,237][81074] Avg episode reward: [(0, '2911.692')] [2023-03-07 01:12:51,805][81400] Updated weights for policy 0, policy_version 94950 (0.0006) [2023-03-07 01:12:52,594][81400] Updated weights for policy 0, policy_version 94960 (0.0006) [2023-03-07 01:12:53,371][81400] Updated weights for policy 0, policy_version 94970 (0.0006) [2023-03-07 01:12:54,154][81400] Updated weights for policy 0, policy_version 94980 (0.0005) [2023-03-07 01:12:54,927][81400] Updated weights for policy 0, policy_version 94990 (0.0007) [2023-03-07 01:12:55,686][81400] Updated weights for policy 0, policy_version 95000 (0.0006) [2023-03-07 01:12:56,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13124.3, 300 sec: 13141.9). Total num frames: 97286144. Throughput: 0: 13133.2. Samples: 97272171. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 01:12:56,237][81074] Avg episode reward: [(0, '2902.278')] [2023-03-07 01:12:56,465][81400] Updated weights for policy 0, policy_version 95010 (0.0006) [2023-03-07 01:12:57,243][81400] Updated weights for policy 0, policy_version 95020 (0.0006) [2023-03-07 01:12:58,026][81400] Updated weights for policy 0, policy_version 95030 (0.0006) [2023-03-07 01:12:58,821][81400] Updated weights for policy 0, policy_version 95040 (0.0006) [2023-03-07 01:12:59,589][81400] Updated weights for policy 0, policy_version 95050 (0.0006) [2023-03-07 01:13:00,348][81400] Updated weights for policy 0, policy_version 95060 (0.0006) [2023-03-07 01:13:01,142][81400] Updated weights for policy 0, policy_version 95070 (0.0006) [2023-03-07 01:13:01,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13141.4, 300 sec: 13148.9). Total num frames: 97352704. Throughput: 0: 13143.0. Samples: 97351183. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 01:13:01,237][81074] Avg episode reward: [(0, '2581.430')] [2023-03-07 01:13:01,912][81400] Updated weights for policy 0, policy_version 95080 (0.0007) [2023-03-07 01:13:02,702][81400] Updated weights for policy 0, policy_version 95090 (0.0006) [2023-03-07 01:13:03,489][81400] Updated weights for policy 0, policy_version 95100 (0.0005) [2023-03-07 01:13:04,267][81400] Updated weights for policy 0, policy_version 95110 (0.0006) [2023-03-07 01:13:05,054][81400] Updated weights for policy 0, policy_version 95120 (0.0007) [2023-03-07 01:13:05,853][81400] Updated weights for policy 0, policy_version 95130 (0.0006) [2023-03-07 01:13:06,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13148.9). Total num frames: 97418240. Throughput: 0: 13138.8. Samples: 97390413. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 01:13:06,237][81074] Avg episode reward: [(0, '2566.704')] [2023-03-07 01:13:06,613][81400] Updated weights for policy 0, policy_version 95140 (0.0006) [2023-03-07 01:13:07,393][81400] Updated weights for policy 0, policy_version 95150 (0.0005) [2023-03-07 01:13:08,157][81400] Updated weights for policy 0, policy_version 95160 (0.0007) [2023-03-07 01:13:08,938][81400] Updated weights for policy 0, policy_version 95170 (0.0007) [2023-03-07 01:13:09,707][81400] Updated weights for policy 0, policy_version 95180 (0.0007) [2023-03-07 01:13:10,491][81400] Updated weights for policy 0, policy_version 95190 (0.0005) [2023-03-07 01:13:11,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13141.4, 300 sec: 13148.9). Total num frames: 97483776. Throughput: 0: 13156.0. Samples: 97469633. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 01:13:11,237][81074] Avg episode reward: [(0, '2887.897')] [2023-03-07 01:13:11,282][81400] Updated weights for policy 0, policy_version 95200 (0.0006) [2023-03-07 01:13:12,055][81400] Updated weights for policy 0, policy_version 95210 (0.0006) [2023-03-07 01:13:12,830][81400] Updated weights for policy 0, policy_version 95220 (0.0006) [2023-03-07 01:13:13,623][81400] Updated weights for policy 0, policy_version 95230 (0.0006) [2023-03-07 01:13:14,384][81400] Updated weights for policy 0, policy_version 95240 (0.0006) [2023-03-07 01:13:15,149][81400] Updated weights for policy 0, policy_version 95250 (0.0006) [2023-03-07 01:13:15,934][81400] Updated weights for policy 0, policy_version 95260 (0.0005) [2023-03-07 01:13:16,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13141.4, 300 sec: 13145.4). Total num frames: 97549312. Throughput: 0: 13156.1. Samples: 97548699. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 01:13:16,237][81074] Avg episode reward: [(0, '2510.231')] [2023-03-07 01:13:16,710][81400] Updated weights for policy 0, policy_version 95270 (0.0006) [2023-03-07 01:13:17,483][81400] Updated weights for policy 0, policy_version 95280 (0.0006) [2023-03-07 01:13:18,294][81400] Updated weights for policy 0, policy_version 95290 (0.0006) [2023-03-07 01:13:19,055][81400] Updated weights for policy 0, policy_version 95300 (0.0007) [2023-03-07 01:13:19,838][81400] Updated weights for policy 0, policy_version 95310 (0.0006) [2023-03-07 01:13:20,610][81400] Updated weights for policy 0, policy_version 95320 (0.0006) [2023-03-07 01:13:21,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 97614848. Throughput: 0: 13144.0. Samples: 97587853. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 01:13:21,237][81074] Avg episode reward: [(0, '2850.014')] [2023-03-07 01:13:21,399][81400] Updated weights for policy 0, policy_version 95330 (0.0006) [2023-03-07 01:13:22,175][81400] Updated weights for policy 0, policy_version 95340 (0.0006) [2023-03-07 01:13:22,941][81400] Updated weights for policy 0, policy_version 95350 (0.0006) [2023-03-07 01:13:23,724][81400] Updated weights for policy 0, policy_version 95360 (0.0006) [2023-03-07 01:13:24,502][81400] Updated weights for policy 0, policy_version 95370 (0.0006) [2023-03-07 01:13:25,290][81400] Updated weights for policy 0, policy_version 95380 (0.0006) [2023-03-07 01:13:26,059][81400] Updated weights for policy 0, policy_version 95390 (0.0006) [2023-03-07 01:13:26,236][81074] Fps is (10 sec: 13209.4, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 97681408. Throughput: 0: 13141.7. Samples: 97666740. 
Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 01:13:26,237][81074] Avg episode reward: [(0, '2672.548')] [2023-03-07 01:13:26,821][81400] Updated weights for policy 0, policy_version 95400 (0.0006) [2023-03-07 01:13:27,605][81400] Updated weights for policy 0, policy_version 95410 (0.0006) [2023-03-07 01:13:28,394][81400] Updated weights for policy 0, policy_version 95420 (0.0006) [2023-03-07 01:13:29,172][81400] Updated weights for policy 0, policy_version 95430 (0.0006) [2023-03-07 01:13:29,950][81400] Updated weights for policy 0, policy_version 95440 (0.0006) [2023-03-07 01:13:30,730][81400] Updated weights for policy 0, policy_version 95450 (0.0006) [2023-03-07 01:13:31,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 97746944. Throughput: 0: 13147.5. Samples: 97745721. Policy #0 lag: (min: 0.0, avg: 1.3, max: 3.0) [2023-03-07 01:13:31,237][81074] Avg episode reward: [(0, '2675.357')] [2023-03-07 01:13:31,502][81400] Updated weights for policy 0, policy_version 95460 (0.0007) [2023-03-07 01:13:32,284][81400] Updated weights for policy 0, policy_version 95470 (0.0006) [2023-03-07 01:13:33,078][81400] Updated weights for policy 0, policy_version 95480 (0.0005) [2023-03-07 01:13:33,849][81400] Updated weights for policy 0, policy_version 95490 (0.0006) [2023-03-07 01:13:34,623][81400] Updated weights for policy 0, policy_version 95500 (0.0006) [2023-03-07 01:13:35,404][81400] Updated weights for policy 0, policy_version 95510 (0.0006) [2023-03-07 01:13:36,185][81400] Updated weights for policy 0, policy_version 95520 (0.0006) [2023-03-07 01:13:36,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13145.4). Total num frames: 97812480. Throughput: 0: 13149.9. Samples: 97785201. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:13:36,237][81074] Avg episode reward: [(0, '3005.267')] [2023-03-07 01:13:36,962][81400] Updated weights for policy 0, policy_version 95530 (0.0007) [2023-03-07 01:13:37,741][81400] Updated weights for policy 0, policy_version 95540 (0.0006) [2023-03-07 01:13:38,514][81400] Updated weights for policy 0, policy_version 95550 (0.0006) [2023-03-07 01:13:39,306][81400] Updated weights for policy 0, policy_version 95560 (0.0006) [2023-03-07 01:13:40,077][81400] Updated weights for policy 0, policy_version 95570 (0.0006) [2023-03-07 01:13:40,852][81400] Updated weights for policy 0, policy_version 95580 (0.0006) [2023-03-07 01:13:41,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 97878016. Throughput: 0: 13150.3. Samples: 97863937. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:13:41,237][81074] Avg episode reward: [(0, '2847.337')] [2023-03-07 01:13:41,636][81400] Updated weights for policy 0, policy_version 95590 (0.0006) [2023-03-07 01:13:42,427][81400] Updated weights for policy 0, policy_version 95600 (0.0007) [2023-03-07 01:13:43,197][81400] Updated weights for policy 0, policy_version 95610 (0.0006) [2023-03-07 01:13:43,976][81400] Updated weights for policy 0, policy_version 95620 (0.0006) [2023-03-07 01:13:44,755][81400] Updated weights for policy 0, policy_version 95630 (0.0005) [2023-03-07 01:13:45,560][81400] Updated weights for policy 0, policy_version 95640 (0.0008) [2023-03-07 01:13:46,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13141.3, 300 sec: 13141.9). Total num frames: 97943552. Throughput: 0: 13142.0. Samples: 97942576. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:13:46,237][81074] Avg episode reward: [(0, '2704.100')] [2023-03-07 01:13:46,341][81400] Updated weights for policy 0, policy_version 95650 (0.0006) [2023-03-07 01:13:47,095][81400] Updated weights for policy 0, policy_version 95660 (0.0006) [2023-03-07 01:13:47,886][81400] Updated weights for policy 0, policy_version 95670 (0.0006) [2023-03-07 01:13:48,661][81400] Updated weights for policy 0, policy_version 95680 (0.0006) [2023-03-07 01:13:49,433][81400] Updated weights for policy 0, policy_version 95690 (0.0006) [2023-03-07 01:13:50,222][81400] Updated weights for policy 0, policy_version 95700 (0.0006) [2023-03-07 01:13:51,002][81400] Updated weights for policy 0, policy_version 95710 (0.0006) [2023-03-07 01:13:51,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 98010112. Throughput: 0: 13154.0. Samples: 97982346. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:13:51,237][81074] Avg episode reward: [(0, '2804.313')] [2023-03-07 01:13:51,774][81400] Updated weights for policy 0, policy_version 95720 (0.0006) [2023-03-07 01:13:52,545][81400] Updated weights for policy 0, policy_version 95730 (0.0007) [2023-03-07 01:13:53,331][81400] Updated weights for policy 0, policy_version 95740 (0.0007) [2023-03-07 01:13:54,094][81400] Updated weights for policy 0, policy_version 95750 (0.0006) [2023-03-07 01:13:54,880][81400] Updated weights for policy 0, policy_version 95760 (0.0007) [2023-03-07 01:13:55,654][81400] Updated weights for policy 0, policy_version 95770 (0.0007) [2023-03-07 01:13:56,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13145.4). Total num frames: 98075648. Throughput: 0: 13149.5. Samples: 98061360. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:13:56,248][81074] Avg episode reward: [(0, '2934.851')] [2023-03-07 01:13:56,252][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000095777_98075648.pth... [2023-03-07 01:13:56,282][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000092694_94918656.pth [2023-03-07 01:13:56,427][81400] Updated weights for policy 0, policy_version 95780 (0.0006) [2023-03-07 01:13:57,201][81400] Updated weights for policy 0, policy_version 95790 (0.0006) [2023-03-07 01:13:57,984][81400] Updated weights for policy 0, policy_version 95800 (0.0006) [2023-03-07 01:13:58,733][81400] Updated weights for policy 0, policy_version 95810 (0.0006) [2023-03-07 01:13:59,505][81400] Updated weights for policy 0, policy_version 95820 (0.0007) [2023-03-07 01:14:00,289][81400] Updated weights for policy 0, policy_version 95830 (0.0007) [2023-03-07 01:14:01,060][81400] Updated weights for policy 0, policy_version 95840 (0.0007) [2023-03-07 01:14:01,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13158.4, 300 sec: 13148.8). Total num frames: 98142208. Throughput: 0: 13156.8. Samples: 98140760. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:14:01,248][81074] Avg episode reward: [(0, '2963.993')] [2023-03-07 01:14:01,825][81400] Updated weights for policy 0, policy_version 95850 (0.0007) [2023-03-07 01:14:02,603][81400] Updated weights for policy 0, policy_version 95860 (0.0006) [2023-03-07 01:14:03,371][81400] Updated weights for policy 0, policy_version 95870 (0.0007) [2023-03-07 01:14:04,133][81400] Updated weights for policy 0, policy_version 95880 (0.0006) [2023-03-07 01:14:04,931][81400] Updated weights for policy 0, policy_version 95890 (0.0006) [2023-03-07 01:14:05,702][81400] Updated weights for policy 0, policy_version 95900 (0.0008) [2023-03-07 01:14:06,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13158.4, 300 sec: 13148.9). Total num frames: 98207744. Throughput: 0: 13168.2. Samples: 98180423. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:14:06,237][81074] Avg episode reward: [(0, '2925.902')] [2023-03-07 01:14:06,467][81400] Updated weights for policy 0, policy_version 95910 (0.0006) [2023-03-07 01:14:07,254][81400] Updated weights for policy 0, policy_version 95920 (0.0006) [2023-03-07 01:14:08,029][81400] Updated weights for policy 0, policy_version 95930 (0.0005) [2023-03-07 01:14:08,798][81400] Updated weights for policy 0, policy_version 95940 (0.0005) [2023-03-07 01:14:09,589][81400] Updated weights for policy 0, policy_version 95950 (0.0006) [2023-03-07 01:14:10,376][81400] Updated weights for policy 0, policy_version 95960 (0.0007) [2023-03-07 01:14:11,149][81400] Updated weights for policy 0, policy_version 95970 (0.0007) [2023-03-07 01:14:11,236][81074] Fps is (10 sec: 13209.9, 60 sec: 13175.5, 300 sec: 13155.8). Total num frames: 98274304. Throughput: 0: 13173.9. Samples: 98259563. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:14:11,237][81074] Avg episode reward: [(0, '2659.706')] [2023-03-07 01:14:11,942][81400] Updated weights for policy 0, policy_version 95980 (0.0005) [2023-03-07 01:14:12,723][81400] Updated weights for policy 0, policy_version 95990 (0.0005) [2023-03-07 01:14:13,494][81400] Updated weights for policy 0, policy_version 96000 (0.0006) [2023-03-07 01:14:14,262][81400] Updated weights for policy 0, policy_version 96010 (0.0006) [2023-03-07 01:14:15,038][81400] Updated weights for policy 0, policy_version 96020 (0.0006) [2023-03-07 01:14:15,816][81400] Updated weights for policy 0, policy_version 96030 (0.0006) [2023-03-07 01:14:16,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.4, 300 sec: 13152.3). Total num frames: 98339840. Throughput: 0: 13175.0. Samples: 98338597. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:14:16,237][81074] Avg episode reward: [(0, '2600.321')] [2023-03-07 01:14:16,591][81400] Updated weights for policy 0, policy_version 96040 (0.0006) [2023-03-07 01:14:17,374][81400] Updated weights for policy 0, policy_version 96050 (0.0006) [2023-03-07 01:14:18,149][81400] Updated weights for policy 0, policy_version 96060 (0.0006) [2023-03-07 01:14:18,917][81400] Updated weights for policy 0, policy_version 96070 (0.0006) [2023-03-07 01:14:19,702][81400] Updated weights for policy 0, policy_version 96080 (0.0006) [2023-03-07 01:14:20,468][81400] Updated weights for policy 0, policy_version 96090 (0.0006) [2023-03-07 01:14:21,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13175.5, 300 sec: 13152.3). Total num frames: 98405376. Throughput: 0: 13177.4. Samples: 98378186. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:14:21,237][81074] Avg episode reward: [(0, '2644.045')] [2023-03-07 01:14:21,249][81400] Updated weights for policy 0, policy_version 96100 (0.0007) [2023-03-07 01:14:22,036][81400] Updated weights for policy 0, policy_version 96110 (0.0006) [2023-03-07 01:14:22,804][81400] Updated weights for policy 0, policy_version 96120 (0.0007) [2023-03-07 01:14:23,586][81400] Updated weights for policy 0, policy_version 96130 (0.0007) [2023-03-07 01:14:24,356][81400] Updated weights for policy 0, policy_version 96140 (0.0006) [2023-03-07 01:14:25,127][81400] Updated weights for policy 0, policy_version 96150 (0.0006) [2023-03-07 01:14:25,883][81400] Updated weights for policy 0, policy_version 96160 (0.0006) [2023-03-07 01:14:26,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13175.5, 300 sec: 13155.8). Total num frames: 98471936. Throughput: 0: 13187.5. Samples: 98457373. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:14:26,237][81074] Avg episode reward: [(0, '2621.711')] [2023-03-07 01:14:26,662][81400] Updated weights for policy 0, policy_version 96170 (0.0005) [2023-03-07 01:14:27,434][81400] Updated weights for policy 0, policy_version 96180 (0.0007) [2023-03-07 01:14:28,216][81400] Updated weights for policy 0, policy_version 96190 (0.0007) [2023-03-07 01:14:28,989][81400] Updated weights for policy 0, policy_version 96200 (0.0005) [2023-03-07 01:14:29,754][81400] Updated weights for policy 0, policy_version 96210 (0.0006) [2023-03-07 01:14:30,528][81400] Updated weights for policy 0, policy_version 96220 (0.0007) [2023-03-07 01:14:31,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13155.8). Total num frames: 98537472. Throughput: 0: 13199.4. Samples: 98536547. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:14:31,237][81074] Avg episode reward: [(0, '2412.600')] [2023-03-07 01:14:31,327][81400] Updated weights for policy 0, policy_version 96230 (0.0006) [2023-03-07 01:14:32,123][81400] Updated weights for policy 0, policy_version 96240 (0.0006) [2023-03-07 01:14:32,887][81400] Updated weights for policy 0, policy_version 96250 (0.0007) [2023-03-07 01:14:33,682][81400] Updated weights for policy 0, policy_version 96260 (0.0006) [2023-03-07 01:14:34,453][81400] Updated weights for policy 0, policy_version 96270 (0.0008) [2023-03-07 01:14:35,233][81400] Updated weights for policy 0, policy_version 96280 (0.0006) [2023-03-07 01:14:36,010][81400] Updated weights for policy 0, policy_version 96290 (0.0005) [2023-03-07 01:14:36,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13159.3). Total num frames: 98604032. Throughput: 0: 13187.4. Samples: 98575780. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:14:36,237][81074] Avg episode reward: [(0, '2637.596')] [2023-03-07 01:14:36,775][81400] Updated weights for policy 0, policy_version 96300 (0.0006) [2023-03-07 01:14:37,545][81400] Updated weights for policy 0, policy_version 96310 (0.0006) [2023-03-07 01:14:38,326][81400] Updated weights for policy 0, policy_version 96320 (0.0006) [2023-03-07 01:14:39,092][81400] Updated weights for policy 0, policy_version 96330 (0.0007) [2023-03-07 01:14:39,864][81400] Updated weights for policy 0, policy_version 96340 (0.0007) [2023-03-07 01:14:40,631][81400] Updated weights for policy 0, policy_version 96350 (0.0006) [2023-03-07 01:14:41,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13192.5, 300 sec: 13159.3). Total num frames: 98669568. Throughput: 0: 13198.2. Samples: 98655278. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:14:41,237][81074] Avg episode reward: [(0, '2533.052')] [2023-03-07 01:14:41,398][81349] KL-divergence is very high: 1139.8796 [2023-03-07 01:14:41,406][81400] Updated weights for policy 0, policy_version 96360 (0.0006) [2023-03-07 01:14:42,181][81400] Updated weights for policy 0, policy_version 96370 (0.0007) [2023-03-07 01:14:42,965][81400] Updated weights for policy 0, policy_version 96380 (0.0006) [2023-03-07 01:14:43,741][81400] Updated weights for policy 0, policy_version 96390 (0.0006) [2023-03-07 01:14:44,504][81400] Updated weights for policy 0, policy_version 96400 (0.0006) [2023-03-07 01:14:45,290][81400] Updated weights for policy 0, policy_version 96410 (0.0006) [2023-03-07 01:14:46,062][81400] Updated weights for policy 0, policy_version 96420 (0.0007) [2023-03-07 01:14:46,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13209.6, 300 sec: 13162.7). Total num frames: 98736128. Throughput: 0: 13193.2. Samples: 98734450. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:14:46,237][81074] Avg episode reward: [(0, '2165.407')] [2023-03-07 01:14:46,855][81400] Updated weights for policy 0, policy_version 96430 (0.0006) [2023-03-07 01:14:47,632][81400] Updated weights for policy 0, policy_version 96440 (0.0007) [2023-03-07 01:14:48,408][81400] Updated weights for policy 0, policy_version 96450 (0.0006) [2023-03-07 01:14:49,188][81400] Updated weights for policy 0, policy_version 96460 (0.0006) [2023-03-07 01:14:49,952][81400] Updated weights for policy 0, policy_version 96470 (0.0006) [2023-03-07 01:14:50,752][81400] Updated weights for policy 0, policy_version 96480 (0.0006) [2023-03-07 01:14:51,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13192.5, 300 sec: 13162.7). Total num frames: 98801664. Throughput: 0: 13191.4. Samples: 98774034. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:14:51,237][81074] Avg episode reward: [(0, '2512.520')] [2023-03-07 01:14:51,531][81400] Updated weights for policy 0, policy_version 96490 (0.0006) [2023-03-07 01:14:52,304][81400] Updated weights for policy 0, policy_version 96500 (0.0005) [2023-03-07 01:14:53,083][81400] Updated weights for policy 0, policy_version 96510 (0.0006) [2023-03-07 01:14:53,877][81400] Updated weights for policy 0, policy_version 96520 (0.0006) [2023-03-07 01:14:54,660][81400] Updated weights for policy 0, policy_version 96530 (0.0007) [2023-03-07 01:14:55,433][81400] Updated weights for policy 0, policy_version 96540 (0.0006) [2023-03-07 01:14:56,207][81400] Updated weights for policy 0, policy_version 96550 (0.0006) [2023-03-07 01:14:56,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13192.5, 300 sec: 13162.7). Total num frames: 98867200. Throughput: 0: 13178.9. Samples: 98852614. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:14:56,237][81074] Avg episode reward: [(0, '2704.268')] [2023-03-07 01:14:56,975][81400] Updated weights for policy 0, policy_version 96560 (0.0006) [2023-03-07 01:14:57,743][81400] Updated weights for policy 0, policy_version 96570 (0.0006) [2023-03-07 01:14:58,546][81400] Updated weights for policy 0, policy_version 96580 (0.0006) [2023-03-07 01:14:59,313][81400] Updated weights for policy 0, policy_version 96590 (0.0006) [2023-03-07 01:15:00,109][81400] Updated weights for policy 0, policy_version 96600 (0.0006) [2023-03-07 01:15:00,872][81400] Updated weights for policy 0, policy_version 96610 (0.0006) [2023-03-07 01:15:01,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13175.5, 300 sec: 13162.7). Total num frames: 98932736. Throughput: 0: 13179.9. Samples: 98931692. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:15:01,237][81074] Avg episode reward: [(0, '2837.631')] [2023-03-07 01:15:01,653][81400] Updated weights for policy 0, policy_version 96620 (0.0006) [2023-03-07 01:15:02,437][81400] Updated weights for policy 0, policy_version 96630 (0.0006) [2023-03-07 01:15:03,202][81400] Updated weights for policy 0, policy_version 96640 (0.0006) [2023-03-07 01:15:04,002][81400] Updated weights for policy 0, policy_version 96650 (0.0006) [2023-03-07 01:15:04,789][81400] Updated weights for policy 0, policy_version 96660 (0.0006) [2023-03-07 01:15:05,563][81400] Updated weights for policy 0, policy_version 96670 (0.0005) [2023-03-07 01:15:06,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13175.5, 300 sec: 13162.7). Total num frames: 98998272. Throughput: 0: 13171.7. Samples: 98970910. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:15:06,237][81074] Avg episode reward: [(0, '2926.928')] [2023-03-07 01:15:06,349][81400] Updated weights for policy 0, policy_version 96680 (0.0007) [2023-03-07 01:15:07,117][81400] Updated weights for policy 0, policy_version 96690 (0.0006) [2023-03-07 01:15:07,886][81400] Updated weights for policy 0, policy_version 96700 (0.0006) [2023-03-07 01:15:08,675][81400] Updated weights for policy 0, policy_version 96710 (0.0006) [2023-03-07 01:15:09,442][81400] Updated weights for policy 0, policy_version 96720 (0.0007) [2023-03-07 01:15:10,226][81400] Updated weights for policy 0, policy_version 96730 (0.0006) [2023-03-07 01:15:11,010][81400] Updated weights for policy 0, policy_version 96740 (0.0006) [2023-03-07 01:15:11,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13159.3). Total num frames: 99063808. Throughput: 0: 13167.5. Samples: 99049913. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:15:11,237][81074] Avg episode reward: [(0, '2516.305')] [2023-03-07 01:15:11,783][81400] Updated weights for policy 0, policy_version 96750 (0.0006) [2023-03-07 01:15:12,563][81400] Updated weights for policy 0, policy_version 96760 (0.0006) [2023-03-07 01:15:13,354][81400] Updated weights for policy 0, policy_version 96770 (0.0007) [2023-03-07 01:15:14,131][81400] Updated weights for policy 0, policy_version 96780 (0.0006) [2023-03-07 01:15:14,909][81400] Updated weights for policy 0, policy_version 96790 (0.0006) [2023-03-07 01:15:15,684][81400] Updated weights for policy 0, policy_version 96800 (0.0006) [2023-03-07 01:15:16,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13175.5, 300 sec: 13162.7). Total num frames: 99130368. Throughput: 0: 13162.8. Samples: 99128876. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:15:16,237][81074] Avg episode reward: [(0, '2912.393')] [2023-03-07 01:15:16,446][81400] Updated weights for policy 0, policy_version 96810 (0.0006) [2023-03-07 01:15:17,229][81400] Updated weights for policy 0, policy_version 96820 (0.0006) [2023-03-07 01:15:18,025][81400] Updated weights for policy 0, policy_version 96830 (0.0006) [2023-03-07 01:15:18,794][81400] Updated weights for policy 0, policy_version 96840 (0.0006) [2023-03-07 01:15:19,595][81400] Updated weights for policy 0, policy_version 96850 (0.0006) [2023-03-07 01:15:20,383][81400] Updated weights for policy 0, policy_version 96860 (0.0006) [2023-03-07 01:15:21,158][81400] Updated weights for policy 0, policy_version 96870 (0.0005) [2023-03-07 01:15:21,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13175.5, 300 sec: 13162.7). Total num frames: 99195904. Throughput: 0: 13164.1. Samples: 99168165. 
Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:15:21,237][81074] Avg episode reward: [(0, '3056.375')] [2023-03-07 01:15:21,955][81400] Updated weights for policy 0, policy_version 96880 (0.0006) [2023-03-07 01:15:22,728][81400] Updated weights for policy 0, policy_version 96890 (0.0006) [2023-03-07 01:15:23,520][81400] Updated weights for policy 0, policy_version 96900 (0.0006) [2023-03-07 01:15:24,293][81400] Updated weights for policy 0, policy_version 96910 (0.0006) [2023-03-07 01:15:25,081][81400] Updated weights for policy 0, policy_version 96920 (0.0006) [2023-03-07 01:15:25,859][81400] Updated weights for policy 0, policy_version 96930 (0.0006) [2023-03-07 01:15:26,236][81074] Fps is (10 sec: 13107.4, 60 sec: 13158.4, 300 sec: 13159.3). Total num frames: 99261440. Throughput: 0: 13140.5. Samples: 99246601. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2023-03-07 01:15:26,237][81074] Avg episode reward: [(0, '3118.698')] [2023-03-07 01:15:26,614][81400] Updated weights for policy 0, policy_version 96940 (0.0006) [2023-03-07 01:15:27,378][81400] Updated weights for policy 0, policy_version 96950 (0.0006) [2023-03-07 01:15:28,165][81400] Updated weights for policy 0, policy_version 96960 (0.0007) [2023-03-07 01:15:28,948][81400] Updated weights for policy 0, policy_version 96970 (0.0006) [2023-03-07 01:15:29,733][81400] Updated weights for policy 0, policy_version 96980 (0.0006) [2023-03-07 01:15:30,529][81400] Updated weights for policy 0, policy_version 96990 (0.0007) [2023-03-07 01:15:31,236][81074] Fps is (10 sec: 13107.2, 60 sec: 13158.4, 300 sec: 13159.3). Total num frames: 99326976. Throughput: 0: 13129.8. Samples: 99325292. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 01:15:31,237][81074] Avg episode reward: [(0, '2998.938')] [2023-03-07 01:15:31,322][81400] Updated weights for policy 0, policy_version 97000 (0.0006) [2023-03-07 01:15:32,107][81400] Updated weights for policy 0, policy_version 97010 (0.0006) [2023-03-07 01:15:32,891][81400] Updated weights for policy 0, policy_version 97020 (0.0005) [2023-03-07 01:15:33,670][81400] Updated weights for policy 0, policy_version 97030 (0.0006) [2023-03-07 01:15:34,465][81400] Updated weights for policy 0, policy_version 97040 (0.0007) [2023-03-07 01:15:35,244][81400] Updated weights for policy 0, policy_version 97050 (0.0007) [2023-03-07 01:15:36,021][81400] Updated weights for policy 0, policy_version 97060 (0.0007) [2023-03-07 01:15:36,236][81074] Fps is (10 sec: 13004.6, 60 sec: 13124.3, 300 sec: 13152.3). Total num frames: 99391488. Throughput: 0: 13117.3. Samples: 99364311. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 01:15:36,237][81074] Avg episode reward: [(0, '2696.327')] [2023-03-07 01:15:36,797][81400] Updated weights for policy 0, policy_version 97070 (0.0006) [2023-03-07 01:15:37,579][81400] Updated weights for policy 0, policy_version 97080 (0.0006) [2023-03-07 01:15:38,361][81400] Updated weights for policy 0, policy_version 97090 (0.0006) [2023-03-07 01:15:39,146][81400] Updated weights for policy 0, policy_version 97100 (0.0006) [2023-03-07 01:15:39,934][81400] Updated weights for policy 0, policy_version 97110 (0.0007) [2023-03-07 01:15:40,708][81400] Updated weights for policy 0, policy_version 97120 (0.0006) [2023-03-07 01:15:41,236][81074] Fps is (10 sec: 13004.7, 60 sec: 13124.3, 300 sec: 13152.3). Total num frames: 99457024. Throughput: 0: 13118.2. Samples: 99442933. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 01:15:41,237][81074] Avg episode reward: [(0, '2957.488')] [2023-03-07 01:15:41,492][81400] Updated weights for policy 0, policy_version 97130 (0.0006) [2023-03-07 01:15:42,254][81400] Updated weights for policy 0, policy_version 97140 (0.0006) [2023-03-07 01:15:43,037][81400] Updated weights for policy 0, policy_version 97150 (0.0006) [2023-03-07 01:15:43,821][81400] Updated weights for policy 0, policy_version 97160 (0.0006) [2023-03-07 01:15:44,585][81400] Updated weights for policy 0, policy_version 97170 (0.0006) [2023-03-07 01:15:45,369][81400] Updated weights for policy 0, policy_version 97180 (0.0006) [2023-03-07 01:15:46,114][81400] Updated weights for policy 0, policy_version 97190 (0.0006) [2023-03-07 01:15:46,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13124.2, 300 sec: 13155.8). Total num frames: 99523584. Throughput: 0: 13124.0. Samples: 99522271. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 01:15:46,237][81074] Avg episode reward: [(0, '2978.438')] [2023-03-07 01:15:46,902][81400] Updated weights for policy 0, policy_version 97200 (0.0006) [2023-03-07 01:15:47,674][81400] Updated weights for policy 0, policy_version 97210 (0.0006) [2023-03-07 01:15:48,444][81400] Updated weights for policy 0, policy_version 97220 (0.0007) [2023-03-07 01:15:49,226][81400] Updated weights for policy 0, policy_version 97230 (0.0006) [2023-03-07 01:15:49,996][81400] Updated weights for policy 0, policy_version 97240 (0.0006) [2023-03-07 01:15:50,790][81400] Updated weights for policy 0, policy_version 97250 (0.0006) [2023-03-07 01:15:51,236][81074] Fps is (10 sec: 13209.7, 60 sec: 13124.3, 300 sec: 13155.8). Total num frames: 99589120. Throughput: 0: 13132.5. Samples: 99561873. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 01:15:51,237][81074] Avg episode reward: [(0, '2926.840')] [2023-03-07 01:15:51,581][81400] Updated weights for policy 0, policy_version 97260 (0.0006) [2023-03-07 01:15:52,360][81400] Updated weights for policy 0, policy_version 97270 (0.0006) [2023-03-07 01:15:53,111][81400] Updated weights for policy 0, policy_version 97280 (0.0006) [2023-03-07 01:15:53,905][81400] Updated weights for policy 0, policy_version 97290 (0.0008) [2023-03-07 01:15:54,685][81400] Updated weights for policy 0, policy_version 97300 (0.0006) [2023-03-07 01:15:55,459][81400] Updated weights for policy 0, policy_version 97310 (0.0006) [2023-03-07 01:15:56,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13141.3, 300 sec: 13155.8). Total num frames: 99655680. Throughput: 0: 13126.5. Samples: 99640605. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 01:15:56,237][81400] Updated weights for policy 0, policy_version 97320 (0.0006) [2023-03-07 01:15:56,237][81074] Avg episode reward: [(0, '2899.634')] [2023-03-07 01:15:56,243][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000097320_99655680.pth... 
[2023-03-07 01:15:56,281][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000094237_96498688.pth [2023-03-07 01:15:57,026][81400] Updated weights for policy 0, policy_version 97330 (0.0006) [2023-03-07 01:15:57,816][81400] Updated weights for policy 0, policy_version 97340 (0.0006) [2023-03-07 01:15:58,592][81400] Updated weights for policy 0, policy_version 97350 (0.0006) [2023-03-07 01:15:59,382][81400] Updated weights for policy 0, policy_version 97360 (0.0006) [2023-03-07 01:16:00,145][81400] Updated weights for policy 0, policy_version 97370 (0.0007) [2023-03-07 01:16:00,922][81400] Updated weights for policy 0, policy_version 97380 (0.0006) [2023-03-07 01:16:01,236][81074] Fps is (10 sec: 13209.6, 60 sec: 13141.4, 300 sec: 13155.8). Total num frames: 99721216. Throughput: 0: 13126.2. Samples: 99719553. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 01:16:01,237][81074] Avg episode reward: [(0, '3001.268')] [2023-03-07 01:16:01,694][81400] Updated weights for policy 0, policy_version 97390 (0.0007) [2023-03-07 01:16:02,481][81400] Updated weights for policy 0, policy_version 97400 (0.0006) [2023-03-07 01:16:03,238][81400] Updated weights for policy 0, policy_version 97410 (0.0006) [2023-03-07 01:16:04,034][81400] Updated weights for policy 0, policy_version 97420 (0.0006) [2023-03-07 01:16:04,831][81400] Updated weights for policy 0, policy_version 97430 (0.0006) [2023-03-07 01:16:05,618][81400] Updated weights for policy 0, policy_version 97440 (0.0006) [2023-03-07 01:16:06,236][81074] Fps is (10 sec: 13004.8, 60 sec: 13124.3, 300 sec: 13152.3). Total num frames: 99785728. Throughput: 0: 13129.5. Samples: 99758991. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 01:16:06,237][81074] Avg episode reward: [(0, '2858.729')] [2023-03-07 01:16:06,401][81400] Updated weights for policy 0, policy_version 97450 (0.0006) [2023-03-07 01:16:07,187][81400] Updated weights for policy 0, policy_version 97460 (0.0006) [2023-03-07 01:16:07,966][81400] Updated weights for policy 0, policy_version 97470 (0.0006) [2023-03-07 01:16:08,742][81400] Updated weights for policy 0, policy_version 97480 (0.0006) [2023-03-07 01:16:09,521][81400] Updated weights for policy 0, policy_version 97490 (0.0006) [2023-03-07 01:16:10,303][81400] Updated weights for policy 0, policy_version 97500 (0.0006) [2023-03-07 01:16:11,080][81400] Updated weights for policy 0, policy_version 97510 (0.0006) [2023-03-07 01:16:11,236][81074] Fps is (10 sec: 13107.3, 60 sec: 13141.4, 300 sec: 13152.3). Total num frames: 99852288. Throughput: 0: 13128.3. Samples: 99837376. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 01:16:11,237][81074] Avg episode reward: [(0, '3057.397')] [2023-03-07 01:16:11,849][81400] Updated weights for policy 0, policy_version 97520 (0.0006) [2023-03-07 01:16:12,630][81400] Updated weights for policy 0, policy_version 97530 (0.0005) [2023-03-07 01:16:13,413][81400] Updated weights for policy 0, policy_version 97540 (0.0006) [2023-03-07 01:16:14,185][81400] Updated weights for policy 0, policy_version 97550 (0.0006) [2023-03-07 01:16:14,970][81400] Updated weights for policy 0, policy_version 97560 (0.0006) [2023-03-07 01:16:15,743][81400] Updated weights for policy 0, policy_version 97570 (0.0006) [2023-03-07 01:16:16,236][81074] Fps is (10 sec: 13209.5, 60 sec: 13124.3, 300 sec: 13152.3). Total num frames: 99917824. Throughput: 0: 13133.5. Samples: 99916299. 
Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 01:16:16,237][81074] Avg episode reward: [(0, '2712.901')] [2023-03-07 01:16:16,529][81400] Updated weights for policy 0, policy_version 97580 (0.0006) [2023-03-07 01:16:17,310][81400] Updated weights for policy 0, policy_version 97590 (0.0007) [2023-03-07 01:16:18,092][81400] Updated weights for policy 0, policy_version 97600 (0.0006) [2023-03-07 01:16:18,858][81400] Updated weights for policy 0, policy_version 97610 (0.0006) [2023-03-07 01:16:19,644][81400] Updated weights for policy 0, policy_version 97620 (0.0006) [2023-03-07 01:16:20,425][81400] Updated weights for policy 0, policy_version 97630 (0.0006) [2023-03-07 01:16:21,206][81400] Updated weights for policy 0, policy_version 97640 (0.0006) [2023-03-07 01:16:21,236][81074] Fps is (10 sec: 13107.1, 60 sec: 13124.3, 300 sec: 13148.9). Total num frames: 99983360. Throughput: 0: 13145.7. Samples: 99955868. Policy #0 lag: (min: 0.0, avg: 1.1, max: 3.0) [2023-03-07 01:16:21,237][81074] Avg episode reward: [(0, '3025.322')] [2023-03-07 01:16:21,966][81400] Updated weights for policy 0, policy_version 97650 (0.0006) [2023-03-07 01:16:22,597][81641] Stopping RolloutWorker_w23... [2023-03-07 01:16:22,597][81604] Stopping RolloutWorker_w17... [2023-03-07 01:16:22,597][81566] Stopping RolloutWorker_w11... [2023-03-07 01:16:22,597][81755] Stopping RolloutWorker_w29... [2023-03-07 01:16:22,597][81437] Stopping RolloutWorker_w5... [2023-03-07 01:16:22,597][81636] Stopping RolloutWorker_w22... [2023-03-07 01:16:22,597][81639] Stopping RolloutWorker_w10... [2023-03-07 01:16:22,597][81641] Loop rollout_proc23_evt_loop terminating... [2023-03-07 01:16:22,597][81604] Loop rollout_proc17_evt_loop terminating... [2023-03-07 01:16:22,597][81601] Stopping RolloutWorker_w8... [2023-03-07 01:16:22,597][81603] Stopping RolloutWorker_w14... [2023-03-07 01:16:22,597][81566] Loop rollout_proc11_evt_loop terminating... 
[2023-03-07 01:16:22,597][81640] Stopping RolloutWorker_w15... [2023-03-07 01:16:22,597][81740] Stopping RolloutWorker_w28... [2023-03-07 01:16:22,597][81644] Stopping RolloutWorker_w24... [2023-03-07 01:16:22,597][81401] Stopping RolloutWorker_w1... [2023-03-07 01:16:22,597][81404] Stopping RolloutWorker_w3... [2023-03-07 01:16:22,597][81555] Stopping RolloutWorker_w20... [2023-03-07 01:16:22,597][81637] Stopping RolloutWorker_w7... [2023-03-07 01:16:22,597][81755] Loop rollout_proc29_evt_loop terminating... [2023-03-07 01:16:22,597][81602] Stopping RolloutWorker_w21... [2023-03-07 01:16:22,597][81868] Stopping RolloutWorker_w25... [2023-03-07 01:16:22,597][81636] Loop rollout_proc22_evt_loop terminating... [2023-03-07 01:16:22,597][81349] Stopping Batcher_0... [2023-03-07 01:16:22,597][81437] Loop rollout_proc5_evt_loop terminating... [2023-03-07 01:16:22,597][81643] Stopping RolloutWorker_w9... [2023-03-07 01:16:22,597][81845] Stopping RolloutWorker_w30... [2023-03-07 01:16:22,597][81603] Loop rollout_proc14_evt_loop terminating... [2023-03-07 01:16:22,597][81601] Loop rollout_proc8_evt_loop terminating... [2023-03-07 01:16:22,597][81567] Stopping RolloutWorker_w18... [2023-03-07 01:16:22,597][81553] Stopping RolloutWorker_w12... [2023-03-07 01:16:22,597][81639] Loop rollout_proc10_evt_loop terminating... [2023-03-07 01:16:22,597][81600] Stopping RolloutWorker_w13... [2023-03-07 01:16:22,597][81401] Loop rollout_proc1_evt_loop terminating... [2023-03-07 01:16:22,597][81644] Loop rollout_proc24_evt_loop terminating... [2023-03-07 01:16:22,597][81846] Stopping RolloutWorker_w31... [2023-03-07 01:16:22,597][81640] Loop rollout_proc15_evt_loop terminating... [2023-03-07 01:16:22,598][81637] Loop rollout_proc7_evt_loop terminating... [2023-03-07 01:16:22,598][81740] Loop rollout_proc28_evt_loop terminating... [2023-03-07 01:16:22,598][81349] Loop batcher_evt_loop terminating... [2023-03-07 01:16:22,598][81404] Loop rollout_proc3_evt_loop terminating... 
[2023-03-07 01:16:22,598][81555] Loop rollout_proc20_evt_loop terminating... [2023-03-07 01:16:22,597][81738] Stopping RolloutWorker_w26... [2023-03-07 01:16:22,598][81600] Loop rollout_proc13_evt_loop terminating... [2023-03-07 01:16:22,598][81846] Loop rollout_proc31_evt_loop terminating... [2023-03-07 01:16:22,598][81599] Stopping RolloutWorker_w19... [2023-03-07 01:16:22,598][81602] Loop rollout_proc21_evt_loop terminating... [2023-03-07 01:16:22,597][81402] Stopping RolloutWorker_w2... [2023-03-07 01:16:22,597][81074] Component RolloutWorker_w23 stopped! [2023-03-07 01:16:22,598][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000097658_100001792.pth... [2023-03-07 01:16:22,598][81868] Loop rollout_proc25_evt_loop terminating... [2023-03-07 01:16:22,598][81845] Loop rollout_proc30_evt_loop terminating... [2023-03-07 01:16:22,598][81738] Loop rollout_proc26_evt_loop terminating... [2023-03-07 01:16:22,598][81599] Loop rollout_proc19_evt_loop terminating... [2023-03-07 01:16:22,598][81402] Loop rollout_proc2_evt_loop terminating... [2023-03-07 01:16:22,598][81403] Stopping RolloutWorker_w0... [2023-03-07 01:16:22,598][81074] Component RolloutWorker_w17 stopped! [2023-03-07 01:16:22,598][81403] Loop rollout_proc0_evt_loop terminating... [2023-03-07 01:16:22,599][81074] Component RolloutWorker_w29 stopped! [2023-03-07 01:16:22,599][81074] Component RolloutWorker_w11 stopped! [2023-03-07 01:16:22,599][81074] Component RolloutWorker_w5 stopped! [2023-03-07 01:16:22,599][81074] Component RolloutWorker_w10 stopped! [2023-03-07 01:16:22,599][81074] Component RolloutWorker_w22 stopped! [2023-03-07 01:16:22,600][81074] Component RolloutWorker_w15 stopped! [2023-03-07 01:16:22,600][81074] Component RolloutWorker_w14 stopped! [2023-03-07 01:16:22,600][81565] Stopping RolloutWorker_w16... [2023-03-07 01:16:22,600][81074] Component RolloutWorker_w8 stopped! 
[2023-03-07 01:16:22,600][81565] Loop rollout_proc16_evt_loop terminating... [2023-03-07 01:16:22,600][81074] Component RolloutWorker_w28 stopped! [2023-03-07 01:16:22,600][81074] Component RolloutWorker_w3 stopped! [2023-03-07 01:16:22,601][81074] Component RolloutWorker_w20 stopped! [2023-03-07 01:16:22,601][81074] Component RolloutWorker_w24 stopped! [2023-03-07 01:16:22,601][81074] Component RolloutWorker_w21 stopped! [2023-03-07 01:16:22,602][81074] Component RolloutWorker_w1 stopped! [2023-03-07 01:16:22,602][81074] Component Batcher_0 stopped! [2023-03-07 01:16:22,602][81074] Component RolloutWorker_w25 stopped! [2023-03-07 01:16:22,602][81074] Component RolloutWorker_w7 stopped! [2023-03-07 01:16:22,603][81074] Component RolloutWorker_w9 stopped! [2023-03-07 01:16:22,603][81074] Component RolloutWorker_w30 stopped! [2023-03-07 01:16:22,603][81074] Component RolloutWorker_w18 stopped! [2023-03-07 01:16:22,603][81074] Component RolloutWorker_w12 stopped! [2023-03-07 01:16:22,603][81074] Component RolloutWorker_w13 stopped! [2023-03-07 01:16:22,604][81074] Component RolloutWorker_w31 stopped! [2023-03-07 01:16:22,604][81074] Component RolloutWorker_w2 stopped! [2023-03-07 01:16:22,604][81074] Component RolloutWorker_w26 stopped! [2023-03-07 01:16:22,604][81074] Component RolloutWorker_w19 stopped! [2023-03-07 01:16:22,604][81074] Component RolloutWorker_w0 stopped! [2023-03-07 01:16:22,605][81074] Component RolloutWorker_w16 stopped! [2023-03-07 01:16:22,606][81074] Component RolloutWorker_w6 stopped! [2023-03-07 01:16:22,606][81564] Stopping RolloutWorker_w6... [2023-03-07 01:16:22,607][81074] Component RolloutWorker_w4 stopped! [2023-03-07 01:16:22,607][81564] Loop rollout_proc6_evt_loop terminating... [2023-03-07 01:16:22,607][81405] Stopping RolloutWorker_w4... [2023-03-07 01:16:22,608][81405] Loop rollout_proc4_evt_loop terminating... [2023-03-07 01:16:22,609][81074] Component RolloutWorker_w27 stopped! 
[2023-03-07 01:16:22,609][81739] Stopping RolloutWorker_w27... [2023-03-07 01:16:22,610][81739] Loop rollout_proc27_evt_loop terminating... [2023-03-07 01:16:22,598][81553] Loop rollout_proc12_evt_loop terminating... [2023-03-07 01:16:22,598][81643] Loop rollout_proc9_evt_loop terminating... [2023-03-07 01:16:22,598][81567] Loop rollout_proc18_evt_loop terminating... [2023-03-07 01:16:22,660][81400] Weights refcount: 2 0 [2023-03-07 01:16:22,663][81400] Stopping InferenceWorker_p0-w0... [2023-03-07 01:16:22,664][81400] Loop inference_proc0-0_evt_loop terminating... [2023-03-07 01:16:22,664][81074] Component InferenceWorker_p0-w0 stopped! [2023-03-07 01:16:22,709][81349] Removing /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000095777_98075648.pth [2023-03-07 01:16:22,718][81349] Saving /home/qgallouedec/gia/data/envs/metaworld/train_dir/button-press-topdown-v2/checkpoint_p0/checkpoint_000097658_100001792.pth... [2023-03-07 01:16:22,812][81349] Stopping LearnerWorker_p0... [2023-03-07 01:16:22,813][81349] Loop learner_proc0_evt_loop terminating... [2023-03-07 01:16:22,813][81074] Component LearnerWorker_p0 stopped! [2023-03-07 01:16:22,814][81074] Waiting for process learner_proc0 to stop... [2023-03-07 01:16:23,986][81074] Waiting for process inference_proc0-0 to join... [2023-03-07 01:16:23,987][81074] Waiting for process rollout_proc0 to join... [2023-03-07 01:16:23,987][81074] Waiting for process rollout_proc1 to join... [2023-03-07 01:16:23,987][81074] Waiting for process rollout_proc2 to join... [2023-03-07 01:16:23,988][81074] Waiting for process rollout_proc3 to join... [2023-03-07 01:16:23,988][81074] Waiting for process rollout_proc4 to join... [2023-03-07 01:16:23,988][81074] Waiting for process rollout_proc5 to join... [2023-03-07 01:16:23,988][81074] Waiting for process rollout_proc6 to join... [2023-03-07 01:16:23,989][81074] Waiting for process rollout_proc7 to join... 
[2023-03-07 01:16:23,989][81074] Waiting for process rollout_proc8 to join... [2023-03-07 01:16:23,989][81074] Waiting for process rollout_proc9 to join... [2023-03-07 01:16:23,989][81074] Waiting for process rollout_proc10 to join... [2023-03-07 01:16:23,990][81074] Waiting for process rollout_proc11 to join... [2023-03-07 01:16:23,990][81074] Waiting for process rollout_proc12 to join... [2023-03-07 01:16:23,990][81074] Waiting for process rollout_proc13 to join... [2023-03-07 01:16:23,990][81074] Waiting for process rollout_proc14 to join... [2023-03-07 01:16:23,990][81074] Waiting for process rollout_proc15 to join... [2023-03-07 01:16:23,991][81074] Waiting for process rollout_proc16 to join... [2023-03-07 01:16:23,991][81074] Waiting for process rollout_proc17 to join... [2023-03-07 01:16:23,991][81074] Waiting for process rollout_proc18 to join... [2023-03-07 01:16:23,991][81074] Waiting for process rollout_proc19 to join... [2023-03-07 01:16:23,992][81074] Waiting for process rollout_proc20 to join... [2023-03-07 01:16:23,992][81074] Waiting for process rollout_proc21 to join... [2023-03-07 01:16:23,992][81074] Waiting for process rollout_proc22 to join... [2023-03-07 01:16:23,992][81074] Waiting for process rollout_proc23 to join... [2023-03-07 01:16:23,993][81074] Waiting for process rollout_proc24 to join... [2023-03-07 01:16:23,993][81074] Waiting for process rollout_proc25 to join... [2023-03-07 01:16:23,993][81074] Waiting for process rollout_proc26 to join... [2023-03-07 01:16:23,993][81074] Waiting for process rollout_proc27 to join... [2023-03-07 01:16:23,993][81074] Waiting for process rollout_proc28 to join... [2023-03-07 01:16:23,994][81074] Waiting for process rollout_proc29 to join... [2023-03-07 01:16:23,994][81074] Waiting for process rollout_proc30 to join... [2023-03-07 01:16:23,994][81074] Waiting for process rollout_proc31 to join... 
[2023-03-07 01:16:23,994][81074] Batcher 0 profile tree view:
batching: 843.4597, releasing_batches: 1.6274
[2023-03-07 01:16:23,995][81074] InferenceWorker_p0-w0 profile tree view:
wait_policy: 0.0001
wait_policy_total: 234.5129
update_model: 133.6116
weight_update: 0.0006
one_step: 0.0035
handle_policy_step: 6824.4978
deserialize: 213.5287, stack: 35.3720, obs_to_device_normalize: 1205.1038, forward: 3054.6701, send_messages: 1333.2688
prepare_outputs: 713.7977
to_cpu: 360.3018
[2023-03-07 01:16:23,995][81074] Learner 0 profile tree view:
misc: 0.5940, prepare_batch: 418.7650
train: 915.6318
epoch_init: 0.3959, minibatch_init: 0.4347, losses_postprocess: 29.2699, kl_divergence: 35.6180, after_optimizer: 96.9650
calculate_losses: 305.1939
losses_init: 0.2234, forward_head: 16.7321, bptt_initial: 110.8280, tail: 61.1461, advantages_returns: 7.5089, losses: 28.8224
bptt: 70.8784
bptt_forward_core: 68.4072
update: 424.9864
clip: 56.3943
[2023-03-07 01:16:23,995][81074] RolloutWorker_w0 profile tree view:
wait_for_trajectories: 3.8387, enqueue_policy_requests: 178.8343, env_step: 2932.1673, overhead: 157.3375, complete_rollouts: 9.2108
save_policy_outputs: 222.3344
split_output_tensors: 109.5623
[2023-03-07 01:16:23,995][81074] RolloutWorker_w31 profile tree view:
wait_for_trajectories: 4.0105, enqueue_policy_requests: 181.0969, env_step: 2972.6546, overhead: 160.3734, complete_rollouts: 9.6616
save_policy_outputs: 224.1497
split_output_tensors: 109.2601
[2023-03-07 01:16:23,995][81074] Loop Runner_EvtLoop terminating...
[2023-03-07 01:16:23,996][81074] Runner profile tree view:
main_loop: 7584.7565
[2023-03-07 01:16:23,996][81074] Collected {0: 100001792}, FPS: 13184.6